BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030722
         (172 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
 gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
          Length = 280

 Score =  211 bits (538), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 112/165 (67%), Positives = 127/165 (76%), Gaps = 2/165 (1%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA +SISPLSIKS+N   SSS+ PY L + SKP  + CQ++  TE + +  DCS  +   
Sbjct: 1   MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLA--TEREDRILDCSTTRYKV 58

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
            ++K KNWR  VSTALAAA   +    + A ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 59  HHSKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRK 118

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTG
Sbjct: 119 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTG 163


>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
 gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
          Length = 275

 Score =  204 bits (519), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 107/165 (64%), Positives = 124/165 (75%), Gaps = 7/165 (4%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA +SIS +SIKS N  +     P+++ +LSKP  +A Q+   TE   QF DCS N    
Sbjct: 1   MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQL--DTERGNQFADCSKNGYEV 53

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             AK KNW   VST L AA ++  S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54  ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVH+ ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANFTG
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTG 158


>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
          Length = 261

 Score =  191 bits (484), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 109/165 (66%), Positives = 119/165 (72%), Gaps = 21/165 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L + SKP  V C+I  +    G +  C  N    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 43  --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTG
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTG 144


>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus]
 gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus]
          Length = 279

 Score =  190 bits (483), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 102/165 (61%), Positives = 120/165 (72%), Gaps = 4/165 (2%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSSIS LS+K L   SS S+ P  L    K + +  QI+ + +   Q  DCS  +  G
Sbjct: 1   MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKD---QTQDCSERKHIG 56

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
              + K W+  VSTALAAA V   SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRK
Sbjct: 57  KITEPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRK 116

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVH+ ENFRRANFTSADMRESDFSG  FNGAYLEKAVAYK NF+G
Sbjct: 117 AVHINENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSG 161


>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
           vinifera]
          Length = 596

 Score =  189 bits (481), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 109/165 (66%), Positives = 119/165 (72%), Gaps = 21/165 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L + SKP  V C+I  +    G +  C  N    
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 377

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 378 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTG
Sbjct: 435 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTG 479



 Score =  188 bits (477), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 106/165 (64%), Positives = 116/165 (70%), Gaps = 19/165 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L +LSKP  V C+I  + E         NN    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43  ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ TG
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTG 146


>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
          Length = 201

 Score =  187 bits (474), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 105/164 (64%), Positives = 115/164 (70%), Gaps = 19/164 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L +LSKP  V C+I  + E         NN    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43  ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLT 145


>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
          Length = 273

 Score =  177 bits (450), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 106/171 (61%), Positives = 121/171 (70%), Gaps = 24/171 (14%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPDCS 54
           MAL+S+SPLSI ++N    SS+   +L      H  S P+ V CQ++S  +     P  S
Sbjct: 2   MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRD----HPQES 55

Query: 55  NNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
                      K W   VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFG
Sbjct: 56  -----------KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFG 103

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           SADLRKAVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 104 SADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSG 154


>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
          Length = 199

 Score =  175 bits (443), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 105/165 (63%), Positives = 123/165 (74%), Gaps = 17/165 (10%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S+SPLSI SL+  SSS+      H+ S P+ V CQI+S  +         + Q + 
Sbjct: 2   MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRD---------HRQEST 51

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
            + K+      VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 52  KWGKV------VSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 104

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 105 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSG 149


>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Glycine max]
          Length = 260

 Score =  167 bits (424), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 100/165 (60%), Positives = 116/165 (70%), Gaps = 22/165 (13%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S+SPLSI SL+  SSS+      H+ S P+ V    ++++                
Sbjct: 1   MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES---------------- 44

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                  W   VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45  -----TKWGKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 98

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 99  AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSG 143


>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 147

 Score =  166 bits (420), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 96/165 (58%), Positives = 111/165 (67%), Gaps = 20/165 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S +PLSI S +            +  S  +  + Q+  K   +   P  SN     
Sbjct: 1   MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47  -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
            VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 145


>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 262

 Score =  166 bits (419), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 96/165 (58%), Positives = 111/165 (67%), Gaps = 20/165 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S +PLSI S +            +  S  +  + Q+  K   +   P  SN     
Sbjct: 1   MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47  -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
            VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 145


>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 232

 Score =  163 bits (413), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 81/100 (81%), Positives = 88/100 (88%), Gaps = 1/100 (1%)

Query: 66  KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV 
Sbjct: 17  KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 76  ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 115


>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
           At1g12250, chloroplastic-like [Glycine max]
          Length = 222

 Score =  160 bits (406), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 98/166 (59%), Positives = 113/166 (68%), Gaps = 23/166 (13%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S SPLS+ SL+  S SS    +  + S P  V CQ +S  +               
Sbjct: 1   MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-------------- 44

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                +   V VS  LAAA++A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45  -----RQGNV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 97

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           AVHV ENFR +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF G 
Sbjct: 98  AVHVNENFRXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANFPGV 143


>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
          Length = 291

 Score =  154 bits (389), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 81/126 (64%), Positives = 93/126 (73%), Gaps = 6/126 (4%)

Query: 40  ISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA 99
           I+ K  +D    D    Q A    + KNW+  ++ ALA  V+ +    ++A ADLNKYEA
Sbjct: 52  ITGKISTDQHKKDA---QPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEA 105

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
           ETRGEFGIGSAAQFGSA+LRK VH  ENFRRANFTSAD+RESDFSGS FNGAYLEKAVAY
Sbjct: 106 ETRGEFGIGSAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAY 165

Query: 160 KANFTG 165
           K NFTG
Sbjct: 166 KTNFTG 171


>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
           Flags: Precursor
 gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
 gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 280

 Score =  151 bits (381), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/168 (58%), Positives = 122/168 (72%), Gaps = 8/168 (4%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
           MA SS+SPL +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN +   
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58

Query: 58  CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           C+   A+   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59  CSS--AESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           L K VH  ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSG 163


>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
          Length = 280

 Score =  150 bits (380), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/168 (58%), Positives = 122/168 (72%), Gaps = 8/168 (4%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
           MA SS+SPL +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN +   
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58

Query: 58  CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           C+   A+   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59  CSS--AESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           L K VH  ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSG 163


>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 280

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 100/171 (58%), Positives = 125/171 (73%), Gaps = 14/171 (8%)

Query: 1   MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ 57
           MA SS+SPL +KSL+   SSS     PY  H    PL    Q+SS++ S  +  D SN +
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNS--EIKDSSNAR 55

Query: 58  ---CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
              C+   ++   W+  +S A+AAAV+AS SS++ A+A+LN++EA+TRGEFGIGSAAQ+G
Sbjct: 56  EGCCS--RSESNTWKRILSAAMAAAVIAS-SSSVPAMAELNRFEADTRGEFGIGSAAQYG 112

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           SADL K +H  ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 113 SADLSKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSG 163


>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
 gi|194694816|gb|ACF81492.1| unknown [Zea mays]
 gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
 gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
          Length = 268

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 70/78 (89%), Positives = 73/78 (93%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           + A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS 
Sbjct: 74  MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133

Query: 148 FNGAYLEKAVAYKANFTG 165
           FNGAYLEKAVAYKANFTG
Sbjct: 134 FNGAYLEKAVAYKANFTG 151


>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
 gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
          Length = 270

 Score =  147 bits (370), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 70/78 (89%), Positives = 73/78 (93%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           + A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS 
Sbjct: 76  MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135

Query: 148 FNGAYLEKAVAYKANFTG 165
           FNGAYLEKAVAYKANFTG
Sbjct: 136 FNGAYLEKAVAYKANFTG 153


>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 206

 Score =  147 bits (370), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 72/90 (80%), Positives = 82/90 (91%), Gaps = 1/90 (1%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           +AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFRRANFTS
Sbjct: 1   MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           ADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 60  ADMRESDFSGSTFNGAYLEKAVAYKANFSG 89


>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Brachypodium distachyon]
          Length = 268

 Score =  143 bits (361), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 67/78 (85%), Positives = 72/78 (92%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           + A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV ENFRRANFTSADMRESDFSGS 
Sbjct: 74  MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNENFRRANFTSADMRESDFSGST 133

Query: 148 FNGAYLEKAVAYKANFTG 165
           FNGAY+EKAVAYKANFTG
Sbjct: 134 FNGAYMEKAVAYKANFTG 151


>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
          Length = 276

 Score =  142 bits (358), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 92/165 (55%), Positives = 111/165 (67%), Gaps = 6/165 (3%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL + SPL+  +   C+  +    +   L +   V+CQ +     DG     S +  A 
Sbjct: 1   MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGN--SLSTSAAAA 58

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             +    WR  VS ALAAA+V++      A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 59  AASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 114

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AVHV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTG
Sbjct: 115 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTG 159


>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
 gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
 gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
 gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
 gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 277

 Score =  140 bits (352), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 66/74 (89%), Positives = 71/74 (95%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFT+ADMRES+FSGS FNGA
Sbjct: 87  ADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTAADMRESNFSGSTFNGA 146

Query: 152 YLEKAVAYKANFTG 165
           YLEKAVAY+ANFTG
Sbjct: 147 YLEKAVAYRANFTG 160


>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
          Length = 293

 Score =  138 bits (348), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 87/155 (56%), Positives = 107/155 (69%), Gaps = 8/155 (5%)

Query: 11  IKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV 70
           +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN       A+   W+ 
Sbjct: 1   MKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTS-----AESNTWKR 53

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
            +S A  AA V + SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFRR
Sbjct: 54  ILSAA-MAAAVIASSSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRR 112

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+G
Sbjct: 113 ANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSG 147


>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 267

 Score =  137 bits (345), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/167 (56%), Positives = 112/167 (67%), Gaps = 19/167 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPDCSNNQC 58
           MAL+S SPL+        +  K P  L    S+ L  ++CQ ++     G   + SN   
Sbjct: 1   MALASTSPLAATV-----ARPKAPASLTRCRSRRLQRISCQATTDRSGGG---NASNTSP 52

Query: 59  AGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
           A P      WRV VS ALAAAVV +    + A ADLNKYEA+ RGEFGIGSAAQFG+ADL
Sbjct: 53  APPR-----WRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADL 103

Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +  VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA++ANFTG
Sbjct: 104 KNTVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTG 150


>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
 gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
          Length = 196

 Score =  131 bits (330), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 61/78 (78%), Positives = 70/78 (89%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+  H  ENFRRANFTSADMRE+DFSGS 
Sbjct: 1   MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60

Query: 148 FNGAYLEKAVAYKANFTG 165
           FNG YLEKAVAY+ NF+G
Sbjct: 61  FNGGYLEKAVAYRTNFSG 78


>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
 gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
          Length = 219

 Score =  130 bits (326), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/91 (72%), Positives = 77/91 (84%), Gaps = 4/91 (4%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RRANFT 134
           LAA V+A+    ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+  H  ENF RRANFT
Sbjct: 14  LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70

Query: 135 SADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           SADMRE+DFSGS FNG YLEKAVAY+ NF+G
Sbjct: 71  SADMREADFSGSTFNGGYLEKAVAYRTNFSG 101


>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 225

 Score =  123 bits (308), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 57/76 (75%), Positives = 64/76 (84%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           +LADLN  EA TRGEFGIGSA QFGSADL+K  H  ENFRR NFTSADM+E++FS S FN
Sbjct: 28  SLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSNSTFN 87

Query: 150 GAYLEKAVAYKANFTG 165
           GAYLEKAVAY+ NF+G
Sbjct: 88  GAYLEKAVAYRTNFSG 103


>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
 gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
          Length = 239

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 43/74 (58%), Positives = 51/74 (68%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           ALADLN YEA T GEFGIGSA Q+G AD++      ++ RR+NFTSAD R + F GS   
Sbjct: 51  ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110

Query: 150 GAYLEKAVAYKANF 163
           GAY  KAV Y+ NF
Sbjct: 111 GAYFIKAVTYRTNF 124


>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
           nagariensis]
 gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
           nagariensis]
          Length = 214

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/74 (56%), Positives = 51/74 (68%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           A ADLN YEAE  GEFGIGSA Q+G AD++      ++ RR+NFTSAD R ++F GS   
Sbjct: 26  AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85

Query: 150 GAYLEKAVAYKANF 163
           GAY  KAV Y+ NF
Sbjct: 86  GAYFIKAVTYRTNF 99


>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
          Length = 217

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 40/74 (54%), Positives = 49/74 (66%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           A+ADLNKYEA   GEFG G+A Q+G ADL+      E+ RR+NFT+AD R  +F  S   
Sbjct: 29  AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88

Query: 150 GAYLEKAVAYKANF 163
           GAY  K+V  KANF
Sbjct: 89  GAYFIKSVVPKANF 102


>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
          Length = 231

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 58/97 (59%), Gaps = 5/97 (5%)

Query: 72  VSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF--- 128
           +S A A  V     S   A+A+LN  EA   GEF  GSA QFG  DLR A +V E +   
Sbjct: 21  LSVATAMIVSGIIPSPPFAVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTD 79

Query: 129 -RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
            R +NFT A+MR+S   G+K NGAYL KAVA  A+FT
Sbjct: 80  LRLSNFTGAEMRDSKLVGAKLNGAYLMKAVAANADFT 116


>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
 gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
          Length = 247

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/103 (44%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 66  KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           K   V  S ALA A   S +    A A+LN+ EA   GEF  GSA QFG  DL K    K
Sbjct: 34  KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90

Query: 126 E---NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           E   + R +NFT ADMR +   G+   GAY+ K VA + +FTG
Sbjct: 91  EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTG 133


>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
          Length = 259

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 48/74 (64%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           A A+LNKYE    GEF +G+A Q+G AD++      ++ +R+NFT+AD R+++F  SK  
Sbjct: 71  ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130

Query: 150 GAYLEKAVAYKANF 163
            AY  K+V  +AN 
Sbjct: 131 AAYFMKSVLARANL 144


>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 277

 Score = 66.6 bits (161), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 39/83 (46%), Positives = 51/83 (61%), Gaps = 3/83 (3%)

Query: 86  SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFRRANFTSADMRESD 142
           S+ +A A+LN  EA   GEF  GSA QFG  DLR    V +   + R +NFT A+MR + 
Sbjct: 81  SSPAAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAK 140

Query: 143 FSGSKFNGAYLEKAVAYKANFTG 165
             G+   GAYL KAVA++A+F G
Sbjct: 141 LRGANLTGAYLMKAVAFEADFEG 163


>gi|407005745|gb|EKE21794.1| pentapeptide repeat protein [uncultured bacterium]
          Length = 189

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/127 (23%), Positives = 53/127 (41%), Gaps = 9/127 (7%)

Query: 44  TESD---GQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAE 100
           TE+D    +F DC  N+C    +K+ N       +    +   C  +  +  ++NK+   
Sbjct: 36  TETDFVGTKFIDCVFNECNFSNSKILN------CSFCNVIFKECKMSGVSFNEINKFLLV 89

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
              +  +     F   D++K+  ++      +F  AD+ ESDFS S   G   +     K
Sbjct: 90  WEFDNCVIKLCNFSKLDIKKSKFIQCVIHETDFVDADLSESDFSNSDLRGCKFQNTNLSK 149

Query: 161 ANFTGTL 167
            NF G +
Sbjct: 150 VNFIGAV 156


>gi|427725361|ref|YP_007072638.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427357081|gb|AFY39804.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 919

 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 27/62 (43%), Positives = 38/62 (61%), Gaps = 4/62 (6%)

Query: 109 SAAQFGSADLRK----AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A   SA+LR+    A+ ++ NF  AN T AD+ E++ +GS F+ A L+ AV   ANFT
Sbjct: 748 SDANLSSANLRRSHLRAICLEANFTGANLTQADLCEANVTGSNFSDANLQGAVLKDANFT 807

Query: 165 GT 166
            T
Sbjct: 808 MT 809


>gi|78033474|emb|CAJ30090.1| hypothetical acidic protein, pentapeptide repeat [Magnetospirillum
           gryphiswaldense MSR-1]
 gi|144901135|emb|CAM77999.1| pentapeptide repeat containing protein [Magnetospirillum
           gryphiswaldense MSR-1]
          Length = 503

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 30/50 (60%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A+LRKAV    N R +N   A + ++D SG+K  GA L  A   +ANF+G
Sbjct: 28  ANLRKAVLSGANLRDSNLPRASLEDADLSGAKLQGANLAGATLLRANFSG 77


>gi|254414225|ref|ZP_05027992.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196178900|gb|EDX73897.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 963

 Score = 45.1 bits (105), Expect = 0.009,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 34/59 (57%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
            A    A L+ A   + N +RAN   A++ E++F G+ F GA LE A  ++AN  GT++
Sbjct: 890 GANLEGAHLKGANLKRANLKRANLKRANLFEANFEGANFEGATLEWANLFEANLKGTIL 948


>gi|145219796|ref|YP_001130505.1| pentapeptide repeat-containing protein [Chlorobium phaeovibrioides
           DSM 265]
 gi|145205960|gb|ABP37003.1| pentapeptide repeat protein [Chlorobium phaeovibrioides DSM 265]
          Length = 412

 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 35/66 (53%), Gaps = 5/66 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 163
           S A F  ADLR+A   K  FR AN  +A  RE+     DFSG+   GAYL +A+   A  
Sbjct: 135 SGADFSGADLRRAECSKAGFRGANLQNAHFREASLRSVDFSGADLRGAYLWRAILDGAVL 194

Query: 164 TGTLIA 169
            G  ++
Sbjct: 195 MGVKVS 200


>gi|334137987|ref|ZP_08511411.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
 gi|333604520|gb|EGL15910.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
          Length = 242

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 56/120 (46%), Gaps = 4/120 (3%)

Query: 49  QFPDCSNNQCAGPYAKLKNWRVFVSTALAAAV--VASCSSNISALADLNKYEAETRGEFG 106
              DC  ++     A++K+  + +ST +      V  C+ N+S  + L K         G
Sbjct: 89  DIADCVLSEATLRNAQMKDAEIKISTCIETCFDEVELCNGNLSG-STLIKATFRQANLHG 147

Query: 107 I-GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           I  S A F  +DLR A  V  +F  ++F SA++ E D S + F G  L  A+    NFTG
Sbjct: 148 ISASKAYFDESDLRGANLVNGDFEESDFISANLSEVDASYANFTGGNLTGAILCNGNFTG 207


>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 215

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 34/62 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A   SADL +A   + N R A+ +SAD+R +D  G+K  GA L  A    AN TGT  
Sbjct: 68  SGADLRSADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIANVTGTQF 127

Query: 169 AT 170
            T
Sbjct: 128 GT 129


>gi|328541950|ref|YP_004302059.1| Pentapeptide repeat protein [Polymorphum gilvum SL003B-26A1]
 gi|326411700|gb|ADZ68763.1| Pentapeptide repeat protein [Polymorphum gilvum SL003B-26A1]
          Length = 276

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
           +F  ADLR+A   + + RR++F+ A MR  D   +  +GA L+ A    A+  GT +A
Sbjct: 83  KFAKADLRRAELERADLRRSDFSGASMRAVDMEKADLSGAVLDGADLRDADLNGTSLA 140


>gi|421082377|ref|ZP_15543263.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
 gi|401702907|gb|EJS93144.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
          Length = 846

 Score = 43.9 bits (102), Expect = 0.024,   Method: Composition-based stats.
 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 8/118 (6%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQ 112
           + C+    +    R   +T L +AV +  S N +        ++  R    IG+    A+
Sbjct: 691 DSCSWVETQANEARFVGATWLTSAVASGSSMNGADFTQATLRQSNLRQASLIGAVFARAK 750

Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
             ++DL +A   + NF+RAN     F   D RE++F+ +   GA L+K+    ANF G
Sbjct: 751 LENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLMGALLQKSQLSGANFRG 808


>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
           17734]
 gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
           17734]
          Length = 367

 Score = 43.9 bits (102), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 33/57 (57%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL +A     N RRAN + A++ E+D SG+  +GA L +A   +A+ +G
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSG 209



 Score = 40.0 bits (92), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL +A     N RRA+ + A++R +D SG+    A L +A   +AN +G
Sbjct: 233 SGANLSEADLSRADLSGANLRRADLSGANLRRADLSGANLRRADLSEANLSEANLSG 289



 Score = 38.9 bits (89), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 32/57 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL +A     N  RA+ + A++ E+D SG+  +GA L +A   +A+ +G
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSG 249


>gi|167907368|ref|ZP_02494573.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
          Length = 269

 Score = 43.5 bits (101), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
           C +N+S  ADL+  +A+ RG       A    ADLR A     N   AN + AD+ ++D 
Sbjct: 67  CGANLSG-ADLS--DADLRG-------ADLSDADLRGADLSVANLSGANLSGADLSDADL 116

Query: 144 SGSKFNGAYLEKAVAYKANFT 164
           SG+  +GAYL  A    AN +
Sbjct: 117 SGANLSGAYLSYANLSGANLS 137


>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 418

 Score = 43.5 bits (101), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 5/63 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A F  +DL  A+ ++ + RRAN        AD+  +D SG  F+G+ L +A   +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202

Query: 164 TGT 166
            GT
Sbjct: 203 LGT 205



 Score = 39.3 bits (90), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 95  NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           N  E +  G   IG   S A F  ADLR+A     N   ANF +A+++E+D SG+   GA
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRA-----NLVGANFNNANLKEADLSGAYLIGA 275

Query: 152 YLEKAVAYKANF 163
            L  A   +A+F
Sbjct: 276 TLVNANIVRADF 287



 Score = 35.0 bits (79), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 24/48 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           S A    A LR+A   + NF   N + AD+R  + SG+   GA L  A
Sbjct: 58  SGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTA 105


>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 471

 Score = 43.5 bits (101), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 5/44 (11%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTG 165
           + R+AN + A M  +D SG+   GA LE      AVA+KANFTG
Sbjct: 135 DLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFTG 178


>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 182

 Score = 43.5 bits (101), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 5/66 (7%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKAN 162
           G  A+F  ADL  A+  +  F  A+F  AD     M + D SG+   GA L  A+A  +N
Sbjct: 77  GRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRGAIASGSN 136

Query: 163 FTGTLI 168
           FTG  +
Sbjct: 137 FTGATV 142


>gi|431802241|ref|YP_007229144.1| pentapeptide repeat-containing protein [Pseudomonas putida HB3267]
 gi|430793006|gb|AGA73201.1| pentapeptide repeat-containing protein [Pseudomonas putida HB3267]
          Length = 219

 Score = 43.5 bits (101), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93


>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
 gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
          Length = 567

 Score = 43.5 bits (101), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    A+L KAV V  N RR N + A++  ++   + F+GAYL +A   +AN  G
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLEG 472


>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 332

 Score = 43.1 bits (100), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 2/69 (2%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           Y A+ RG   I        ADLR A  +K N R AN    ++RE+D  G+  +GA L  A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201

Query: 157 VAYKANFTG 165
              + N  G
Sbjct: 202 FLTEVNLMG 210


>gi|339487133|ref|YP_004701661.1| pentapeptide repeat-containing protein [Pseudomonas putida S16]
 gi|338837976|gb|AEJ12781.1| pentapeptide repeat-containing protein [Pseudomonas putida S16]
          Length = 219

 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93


>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
 gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
          Length = 203

 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 5/63 (7%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  ADL  ++  +  F R++F+ AD     M  +DFSG+  +GA L   +A  ++F+G
Sbjct: 101 ADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSGALLRGVIAAGSSFSG 160

Query: 166 TLI 168
            +I
Sbjct: 161 AVI 163


>gi|37521689|ref|NP_925066.1| hypothetical protein glr2120 [Gloeobacter violaceus PCC 7421]
 gi|35212687|dbj|BAC90061.1| glr2120 [Gloeobacter violaceus PCC 7421]
          Length = 278

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 31/60 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIAT 170
           A    ADLR+A  V  N RRAN   AD RESD   +    A L +A  +KAN    L+ +
Sbjct: 179 ANLEGADLREASFVSANLRRANLRRADCRESDLFDANLCEADLREAKLHKANLRQALLVS 238



 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 5/49 (10%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLE 154
           A    ADLR+A   K N R+A   S     AD+RE+D SG+   GA+LE
Sbjct: 214 ANLCEADLREAKLHKANLRQALLVSADLRGADLREADLSGANLQGAHLE 262


>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 351

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A   +AN TG
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLTG 246



 Score = 39.3 bits (90), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 4/106 (3%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
           R F   +L AA+    + N   L+  N  EA       IG   S +Q   ADL  AV + 
Sbjct: 21  RNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-TLIAT 170
            N   A  T   + ++D SG+  +GA L +      N TG +LI T
Sbjct: 81  ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGT 126


>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 447

 Score = 43.1 bits (100), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 3/77 (3%)

Query: 91  LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           LAD N   ++ RG   IG++        ADLR+A   + + R AN   AD+RE+D +G+ 
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS 386

Query: 148 FNGAYLEKAVAYKANFT 164
            N   L +A     + T
Sbjct: 387 LNQVNLAEADLRGVDLT 403



 Score = 39.3 bits (90), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 29/55 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    +DLR A  +  +  + N T AD+RE+D + +   GA L  A   +A+ TG
Sbjct: 330 ANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTG 384


>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
 gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
          Length = 439

 Score = 43.1 bits (100), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 15/70 (21%)

Query: 111 AQFGSADLRKAVHVKENFRRA---------------NFTSADMRESDFSGSKFNGAYLEK 155
           A+ G  DLRKA   K +F RA               NF  ADM+E++  G+   GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344

Query: 156 AVAYKANFTG 165
           A    A+ +G
Sbjct: 345 AFLKGADLSG 354


>gi|421528695|ref|ZP_15975254.1| pentapeptide repeat-containing protein [Pseudomonas putida S11]
 gi|402213838|gb|EJT85176.1| pentapeptide repeat-containing protein [Pseudomonas putida S11]
          Length = 200

 Score = 43.1 bits (100), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93


>gi|443315235|ref|ZP_21044737.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442785176|gb|ELR95014.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 402

 Score = 42.7 bits (99), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 7/72 (9%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           N + A+ RG       A   S DLR+A+  + N R+ANF  A+MR +  + +   GA L 
Sbjct: 318 NLHRADLRG-------ANLESTDLREAILRQANLRQANFRYANMRMAHLAEADLRGADLR 370

Query: 155 KAVAYKANFTGT 166
            A    AN  GT
Sbjct: 371 GADLTHANLWGT 382


>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
 gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
           WPP163]
          Length = 846

 Score = 42.7 bits (99), Expect = 0.045,   Method: Composition-based stats.
 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 8/118 (6%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQ 112
           + C+    +    R   +T L +AV +  S N +        ++  R    IG+    A+
Sbjct: 691 DSCSWVETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAVFALAK 750

Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
             ++DL +A   + NF+RAN     F   D RE++F+ +   GA L+K+    ANF G
Sbjct: 751 LENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLGGANFRG 808


>gi|325272495|ref|ZP_08138874.1| pentapeptide repeat-containing protein [Pseudomonas sp. TJI-51]
 gi|324102372|gb|EGB99839.1| pentapeptide repeat-containing protein [Pseudomonas sp. TJI-51]
          Length = 219

 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
 gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
          Length = 436

 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 30/53 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           S A    A+LR+A  +K N RRAN   A + E+D SG+    A L KA+  +A
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEA 376


>gi|46202237|ref|ZP_00053526.2| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
           magnetotacticum MS-1]
          Length = 542

 Score = 42.7 bits (99), Expect = 0.049,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S +    A+LRKAV    N R  N   A + ++D SG+K  GA L  A   +ANF+G
Sbjct: 47  SGSLLSLANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFSG 103


>gi|83310097|ref|YP_420361.1| hypothetical protein amb0998 [Magnetospirillum magneticum AMB-1]
 gi|82944938|dbj|BAE49802.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
           AMB-1]
          Length = 542

 Score = 42.7 bits (99), Expect = 0.049,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S +    A+LRKAV    N R  N   A + ++D SG+K  GA L  A   +ANF+G
Sbjct: 47  SGSLLSLANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFSG 103


>gi|452962545|gb|EME67671.1| hypothetical protein H261_22313 [Magnetospirillum sp. SO-1]
          Length = 542

 Score = 42.7 bits (99), Expect = 0.054,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S +    A+LRKAV    N R  N   A + ++D SG+K  GA L  A   +ANF+G
Sbjct: 47  SGSLLSLANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFSG 103


>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 164

 Score = 42.4 bits (98), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 10/100 (10%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           +K+WRVF    LA  V+      +  L+      + +R         Q  +AD      +
Sbjct: 1   MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           +E F +     A+   +D  G+ FN AYLEKA  + A+FT
Sbjct: 51  REEFTKVKLDKANFSNADLRGAVFNNAYLEKANLHGADFT 90


>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 231

 Score = 42.4 bits (98), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 33/61 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A F  AD R +   K NF  A F  AD+ E+   G+ F GA LEKA+  +   +G ++
Sbjct: 33  SGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGANLEKAILREVELSGAIL 92

Query: 169 A 169
           +
Sbjct: 93  S 93


>gi|428314300|ref|YP_007125277.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255912|gb|AFZ21871.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 355

 Score = 42.4 bits (98), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A    ADL+ A     N   AN + AD+RE++ S +  +GA L+ A   +AN TG L+
Sbjct: 196 ADLSEADLKGANLSGANLSGANLSGADLREANLSHADLSGADLQGANLTRANLTGVLL 253


>gi|427415571|ref|ZP_18905754.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425758284|gb|EKU99136.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 184

 Score = 42.4 bits (98), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 5/62 (8%)

Query: 109 SAAQFGSADLRKA-VHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A  G ADLRKA +H  +    + R A+ T A+++E+D S +  +GAYL +A    A  
Sbjct: 103 SGANLGGADLRKADLHKADLSDSDLRCADLTGANLQETDLSDANLDGAYLGEADLTGATI 162

Query: 164 TG 165
            G
Sbjct: 163 LG 164


>gi|406706438|ref|YP_006756791.1| pentapeptide repeat-containing protein [alpha proteobacterium
           HIMB5]
 gi|406652214|gb|AFS47614.1| pentapeptide repeat protein [alpha proteobacterium HIMB5]
          Length = 174

 Score = 42.4 bits (98), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 34/60 (56%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           FG    + F  A+L ++V +  NF + NFT A++ ++DF GS    A  + A   +ANFT
Sbjct: 80  FGTFPESTFYRANLYESVMIGANFEKTNFTGANLTKADFMGSTLIEANFQNANLMEANFT 139


>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 179

 Score = 42.4 bits (98), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 29/57 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + A    ADL K      N + AN  +AD+ E++   +   GA L++A   KAN TG
Sbjct: 97  AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASLQRADLEKANLTG 153


>gi|381204405|ref|ZP_09911476.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 135

 Score = 42.4 bits (98), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 37/76 (48%)

Query: 93  DLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
           DL K +       G    A     DLR+A       ++AN ++ADM E++ S +   GA 
Sbjct: 5   DLQKLKDTNSCPTGDFEGANLSGMDLRRANLSGAALKKANLSNADMTEANLSVADLTGAK 64

Query: 153 LEKAVAYKANFTGTLI 168
           LE A   +AN  G+L+
Sbjct: 65  LENAKLRQANLEGSLL 80


>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
          Length = 351

 Score = 42.0 bits (97), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L +A   +AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLT 245



 Score = 38.9 bits (89), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 4/106 (3%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
           R F   +L AA+    + N   L+  N  EA       IG   S +Q   ADL  AV + 
Sbjct: 21  RNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-TLIAT 170
            N   A  T   + ++D SG+  +GA L +      N TG +LI T
Sbjct: 81  ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGT 126


>gi|90019736|ref|YP_525563.1| hypothetical protein Sde_0087 [Saccharophagus degradans 2-40]
 gi|89949336|gb|ABD79351.1| pentapeptide repeat [Saccharophagus degradans 2-40]
          Length = 600

 Score = 42.0 bits (97), Expect = 0.082,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 32/68 (47%), Gaps = 10/68 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE----------KAVA 158
           S A    ADLR A     NF+ A     D+R++D SG +F+GA L            A  
Sbjct: 197 SGANLRRADLRDAKFCSTNFKNAELNGVDLRKADLSGLEFDGADLTGCDLREAKLVGASL 256

Query: 159 YKANFTGT 166
            KAN TGT
Sbjct: 257 EKANITGT 264


>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 331

 Score = 42.0 bits (97), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 43/97 (44%), Gaps = 9/97 (9%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
           F  T L AA +   +  ++ L D N  +A+ RG       A    ADLR A     N R 
Sbjct: 87  FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139

Query: 130 -RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
            R  + S ++R +D  G+   G  L  A   +AN TG
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTG 176


>gi|157372424|ref|YP_001480413.1| pentapeptide repeat-containing protein [Serratia proteamaculans
           568]
 gi|157324188|gb|ABV43285.1| pentapeptide repeat protein [Serratia proteamaculans 568]
          Length = 844

 Score = 42.0 bits (97), Expect = 0.089,   Method: Composition-based stats.
 Identities = 33/142 (23%), Positives = 66/142 (46%), Gaps = 9/142 (6%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           L K ++  C++ +   +      C+  +   PYA+ K   +    A+  + ++    + +
Sbjct: 668 LRKTVFQQCELQAAVFNGAWLESCNWVESKLPYAQFKAASLLTCAAVMESDLSGADFSEA 727

Query: 90  ALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDF 143
            L + N  +A  T+  F +   A+  ++DL +A   + NF RAN   +     D R+ +F
Sbjct: 728 TLKESNLRQALLTQANFTL---AKVENSDLSEADCQRANFTRANLVGSLLIRTDFRQVNF 784

Query: 144 SGSKFNGAYLEKAVAYKANFTG 165
           +G+   GA ++K     A+FT 
Sbjct: 785 TGANLMGALMQKTQLGGADFTA 806


>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 377

 Score = 42.0 bits (97), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A    A+L  A+ VK + +     RAN T AD+RE+D SG++   A L KA   KAN +
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLS 198


>gi|78211810|ref|YP_380589.1| hypothetical protein Syncc9605_0258 [Synechococcus sp. CC9605]
 gi|78196269|gb|ABB34034.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 147

 Score = 42.0 bits (97), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 28/49 (57%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AD R+A  +  +FR ++   AD+RE++  G+   GA LE A    AN T
Sbjct: 49  ADFRQAHLIGADFRGSDLRGADLREANLEGADLTGALLEGADLRGANLT 97


>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 479

 Score = 42.0 bits (97), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 32/57 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL ++     N  RA+ T A +RE++  G++F GA L++A   KAN  G
Sbjct: 60  SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANLVG 116



 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 5/53 (9%)

Query: 110 AAQFGSADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
            A+F  A+L++A  +K      N   AN T A++  +D  GS+ +GA L+KAV
Sbjct: 96  GAEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAV 148


>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 204

 Score = 42.0 bits (97), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
            F  A+L+KA  ++ N R A+FT AD+R +DF  +   GA L  A   +A+F G  +
Sbjct: 53  NFAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFL 108


>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 351

 Score = 41.6 bits (96), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A    AN TG
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLTG 246



 Score = 39.7 bits (91), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 4/106 (3%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
           R F   +L AA+    + N   L+  N  EA       IG   S +Q   ADL  AV + 
Sbjct: 21  RNFSDISLVAAIFNEVTLNRINLSGANLSEALMVHTRLIGANLSRSQLSYADLSMAVLID 80

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-TLIAT 170
            N   A  T   + ++D SG+  +GA L +      N TG +LI T
Sbjct: 81  ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGT 126


>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
 gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
          Length = 167

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKA 161
           +G  A F  ADL  A+  +  F  A+F+ AD+ +S     DFSG+    A L   +A  +
Sbjct: 62  VGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGVIASGS 121

Query: 162 NFTGTLI 168
           +F G  I
Sbjct: 122 SFAGASI 128


>gi|313204014|ref|YP_004042671.1| pentapeptide repeat-containing protein [Paludibacter
           propionicigenes WB4]
 gi|312443330|gb|ADQ79686.1| pentapeptide repeat protein [Paludibacter propionicigenes WB4]
          Length = 186

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 52/145 (35%), Gaps = 8/145 (5%)

Query: 19  SSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAA 78
           S  K P +++   +  ++ C   S    D  F DC+   C    A LKN      TAL+ 
Sbjct: 13  SQKKFPCEVYDNCR--FLNCNFYSSNLVDVSFRDCTFESCDFSLASLKN------TALSD 64

Query: 79  AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 138
                C        + N +    R E  +   A F    L+K   +  N    +FT ADM
Sbjct: 65  IQFIGCKLVGVQFDECNPFLFSVRFENCVLKLAVFQKVKLKKTRFINCNLEETDFTEADM 124

Query: 139 RESDFSGSKFNGAYLEKAVAYKANF 163
                     N A   K    KA+F
Sbjct: 125 SSGVLDNCNLNRAIFHKTNLEKADF 149


>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 371

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
           + A+ + N++ L  L  +   T    G   AA+  + +L  A   + NFR AN T A++ 
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277

Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANF 163
           E+      FSG+  +GAYL  A   KA+F
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADF 306


>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 521

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
           A  G ADL  A     N  RANF  A ++E+D + +  +GA+L  A    AN +G L++
Sbjct: 88  AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLSGALLS 146


>gi|257061367|ref|YP_003139255.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
 gi|256591533|gb|ACV02420.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
          Length = 371

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
           + A+ + N++ L  L  +   T    G   AA+  + +L  A   + NFR AN T A++ 
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277

Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANF 163
           E+      FSG+  +GAYL  A   KA+F
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADF 306


>gi|150014700|gb|ABR57221.1| PedD [Pseudomonas putida]
          Length = 219

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGAKLANQDLRKMNLAGADLRDADLRHARLDLANLEKARLQGANLT 93


>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 256

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 29/55 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    ADLR A     N  RAN T AD+R ++ +G+   G  L +A   +AN TG
Sbjct: 51  ADLSGADLRGANLEGANLSRANLTGADLRSANLAGASLFGVNLSRAKLNEANLTG 105


>gi|302556667|ref|ZP_07309009.1| pentapeptide repeats-containing protein [Streptomyces griseoflavus
           Tu4000]
 gi|302474285|gb|EFL37378.1| pentapeptide repeats-containing protein [Streptomyces griseoflavus
           Tu4000]
          Length = 355

 Score = 41.2 bits (95), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 33/55 (60%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIAT 170
           ADLR A  V+ + R A+FT  D+RE++   +  +GA  ++A    A+  GT ++T
Sbjct: 241 ADLRAAKLVETDLRDADFTGTDLREANLRKAGAHGAVFQRADLRMADLRGTDLST 295


>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
 gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
          Length = 180

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 34/66 (51%), Gaps = 5/66 (7%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKAN 162
           G  A F  A+L  A+  +  F  A+F  AD     M + DFSG+ F GA L   +A  +N
Sbjct: 76  GRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTGADLSDVIASGSN 135

Query: 163 FTGTLI 168
           F+G  +
Sbjct: 136 FSGATV 141



 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 5/58 (8%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A F  ADL   +  K +F  A+FT AD+ +   SGS F+GA +       A+FTG LI
Sbjct: 99  ASFAGADLSGVLMDKVDFSGADFTGADLSDVIASGSNFSGATVT-----NADFTGALI 151


>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
 gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
          Length = 427

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 94  LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           LN Y    R +   G  + AQ    DLR+A+    +FR A F  A++ E+  +GS+   A
Sbjct: 23  LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82

Query: 152 YLEKAVAYKANFTG 165
            L  A   K +F G
Sbjct: 83  DLSGAKLVKTDFRG 96


>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 331

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 14/101 (13%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
           F  T L AA +   +  ++ L D N  +A+ RG       A    ADLR A     N R 
Sbjct: 87  FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139

Query: 130 ------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
                   N   AD+R +D  G    GA L +A    AN T
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLT 180


>gi|162450958|ref|YP_001613325.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
 gi|161161540|emb|CAN92845.1| Hypothetical WD-repeat protein [Sorangium cellulosum So ce56]
          Length = 2305

 Score = 41.2 bits (95), Expect = 0.14,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 13/81 (16%)

Query: 97   YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDF---------- 143
            +  ET G    G+     Q    DLR A     N R AN + AD+  +D           
Sbjct: 1111 WAEETAGWISEGADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAML 1170

Query: 144  SGSKFNGAYLEKAVAYKANFT 164
            SG+K +G  L +A+A++A+FT
Sbjct: 1171 SGAKLHGTILRRAIAHRADFT 1191


>gi|260436217|ref|ZP_05790187.1| pentapeptide repeat protein [Synechococcus sp. WH 8109]
 gi|260414091|gb|EEX07387.1| pentapeptide repeat protein [Synechococcus sp. WH 8109]
          Length = 147

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AD R+A  +  +FR  +   AD+RE++  G+   GA LE A    AN T
Sbjct: 49  ADFRQAHLIGADFRGTDLRGADLREANLEGADLTGALLEGADLRGANLT 97


>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 351

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A    AN TG
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLTG 246



 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 4/106 (3%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
           R F   +L AA+    + N   L+  N  EA       IG   S +Q   ADL  AV + 
Sbjct: 21  RNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-TLIAT 170
            N   A+ T   + ++D SG+  +GA L +      N TG +LI T
Sbjct: 81  ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGT 126


>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
 gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
          Length = 319

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 29/54 (53%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           Q   ADLR       +FR  +F+ A++RE DF+G+    AYL +A     N TG
Sbjct: 25  QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLTG 78


>gi|119485597|ref|ZP_01619872.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
 gi|119456922|gb|EAW38049.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
          Length = 253

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 11/83 (13%)

Query: 92  ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRE 140
           ADL K + +    F +       S A F +ADLR+    K N   ANFT A     D+R 
Sbjct: 126 ADLRKADLQDANLFKVNFSEAYLSEANFENADLRQVTFFKANLADANFTDANLFGSDLRL 185

Query: 141 SDFSGSKFNGAYLEKAVAYKANF 163
           ++  G+ F+ A L+ A+    N 
Sbjct: 186 ANLKGADFSNANLQAAILVNTNI 208



 Score = 35.0 bits (79), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 8/82 (9%)

Query: 91  LADLNKYEAETRGEFGIG--------SAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
           LAD N YEA  R     G        S A    ADLRKA     N  + NF+ A + E++
Sbjct: 93  LADANLYEANLRYANLQGADLRQADLSRASLTRADLRKADLQDANLFKVNFSEAYLSEAN 152

Query: 143 FSGSKFNGAYLEKAVAYKANFT 164
           F  +        KA    ANFT
Sbjct: 153 FENADLRQVTFFKANLADANFT 174


>gi|330809494|ref|YP_004353956.1| hypothetical protein PSEBR_a2659 [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|423697147|ref|ZP_17671637.1| pentapeptide repeat protein PedD [Pseudomonas fluorescens Q8r1-96]
 gi|327377602|gb|AEA68952.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|388004031|gb|EIK65358.1| pentapeptide repeat protein PedD [Pseudomonas fluorescens Q8r1-96]
          Length = 214

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 33/58 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  ++Q   A+LR A    ++ R+ N + AD+R++D   ++ + A LEKA    AN T
Sbjct: 31  IAESSQCPGANLRGAKLANQDLRKMNLSGADLRDADLRHARLDLANLEKAQLQGANLT 88


>gi|403382392|ref|ZP_10924449.1| pentapeptide repeat-containing protein [Paenibacillus sp. JC66]
          Length = 292

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 44/108 (40%), Gaps = 5/108 (4%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           AK+  W +  S      +V   +    A      + ++    + IG  A    A L KA 
Sbjct: 164 AKVNEWLLETSE-----LVRKTAREARAAEQRKPFRSKQNSRYRIGRGADLAGARLSKAD 218

Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIAT 170
               N R A   +AD+RE+D  G+   GA L  A    A+  G+L  T
Sbjct: 219 LRGANLRGAFLIAADLREADLRGADLIGADLRDADLRCADLRGSLFLT 266


>gi|434394476|ref|YP_007129423.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428266317|gb|AFZ32263.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 183

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A   SADL +A     N ++AN   AD+ E+D  G+  +GA L+ A   +AN 
Sbjct: 89  ANLQSADLDQANLRDANLQQANLRDADLEEADLQGANLSGANLQSADLEEANL 141



 Score = 38.9 bits (89), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 29/55 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A   SADL +A     NF+ AN  +AD+ ++   G+ F+GA L+ A     N 
Sbjct: 127 SGANLQSADLEEANLQNANFQNANLQNADLEDARVQGANFDGANLQGADLEGTNL 181


>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 479

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL ++     N  RA+ T A +RE++  G +F GA L++A   KAN  G
Sbjct: 60  SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVG 116



 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 27/49 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           + A    A L KA  V  N   AN T A++  +D  GS+ +GA L+KAV
Sbjct: 100 TGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAV 148


>gi|403357343|gb|EJY78297.1| hypothetical protein OXYTRI_24550 [Oxytricha trifallax]
          Length = 290

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 28/54 (51%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            A F   D  KAV    NF +A  ++ADMRE DF  S FN A L  A   +AN 
Sbjct: 199 GANFMHVDFVKAVGKDCNFLKAKLSNADMREGDFENSNFNEASLHGANLERANL 252


>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           BS1]
 gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
          Length = 300

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-FNGAYLEKAVAYKANFTGT 166
           AA F  ADLR A   + N R A+ T AD+R +  S S+   G+ L  A+ + AN  GT
Sbjct: 200 AANFSGADLRDADLSEVNLRNADLTGADLRGARLSFSQNMTGSTLNNAILHSANLIGT 257


>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
          Length = 175

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 32/64 (50%), Gaps = 5/64 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKA 161
           +G AA F  ADL  A+  +  F  ANF  AD+ +     +D SG+    A L   +A  +
Sbjct: 70  VGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRNAVLVGVIASGS 129

Query: 162 NFTG 165
            FTG
Sbjct: 130 TFTG 133


>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
          Length = 831

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 32/53 (60%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           F +A+L  A  V  + R  NFT+AD+R+++F G+   GA L  A    A+FTG
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTG 318



 Score = 39.7 bits (91), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 9/119 (7%)

Query: 52  DCSNNQCAGPYAKLKNWRV----FVSTALAAAVVASCSSNISALADLNKYEAE---TRGE 104
           D SN + AG  A+L N  +    F    L+ A  +      ++ AD+   +A     R  
Sbjct: 522 DLSNEKLAG--ARLNNLDLRGAKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAA 579

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             +   A   +A+LR A  ++ NF+  + T AD   SD  G+ F GA L+ A   +A F
Sbjct: 580 DAVFRGAILANANLRAATFLRTNFQNVDLTGADFAFSDLRGADFTGATLKNASFSQAKF 638


>gi|407684714|ref|YP_006799888.1| pentapeptide repeat-containing protein [Alteromonas macleodii str.
           'English Channel 673']
 gi|407246325|gb|AFT75511.1| pentapeptide repeat-containing protein [Alteromonas macleodii str.
           'English Channel 673']
          Length = 451

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           GIG  + F SADLRKA     N RRA+   A M +S+ + ++ +    ++A    A F G
Sbjct: 267 GIGQLSLFDSADLRKA-----NLRRADIRQAQMNQSNLNDAELDYTIFDRAQLQSAQFIG 321


>gi|378950893|ref|YP_005208381.1| protein PedD [Pseudomonas fluorescens F113]
 gi|359760907|gb|AEV62986.1| PedD [Pseudomonas fluorescens F113]
          Length = 214

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 33/58 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  ++Q   A+LR A    ++ R+ N + AD+R++D   ++ + A LEKA    AN T
Sbjct: 31  IAESSQCPGANLRGAKLANQDLRKMNLSGADLRDADLRHARLDLANLEKAQLQGANLT 88


>gi|126656956|ref|ZP_01728134.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
 gi|126621794|gb|EAZ92503.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
          Length = 1084

 Score = 40.8 bits (94), Expect = 0.18,   Method: Composition-based stats.
 Identities = 26/70 (37%), Positives = 36/70 (51%), Gaps = 7/70 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A+ RG +  G  A  G ADL  A     +   A+ T AD+R +D +G+   GAYLE A  
Sbjct: 931 ADLRGAYLEG--ADLGGADLTGA-----DLEGADLTGADLRGADLTGAYLEGAYLEGADL 983

Query: 159 YKANFTGTLI 168
             A+ TG  +
Sbjct: 984 TGADLTGAYL 993


>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 405

 Score = 40.8 bits (94), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A+ RG F   S A    ADLR+A         AN + AD+ E++ SG+   GA L  A+ 
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 297

Query: 159 YKANFTG 165
           + AN  G
Sbjct: 298 WGANLKG 304


>gi|434399306|ref|YP_007133310.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428270403|gb|AFZ36344.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 298

 Score = 40.8 bits (94), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 29/50 (58%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A+LR A  +  N R AN + AD++ ++  G+ F GA L KA    ANF G
Sbjct: 224 ANLRDANLIGANLRGANLSQADLKGANLEGANFKGANLTKADLRGANFKG 273



 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 38/84 (45%), Gaps = 14/84 (16%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L D N   A  RG       A    ADL+ A     NF+ AN T AD+R     G+ F G
Sbjct: 226 LRDANLIGANLRG-------ANLSQADLKGANLEGANFKGANLTKADLR-----GANFKG 273

Query: 151 AYLEKAVAYKANFTGTLI--ATEH 172
           A L+ A+       GT++   T+H
Sbjct: 274 ANLQDAIFKNTKLQGTIMPDGTKH 297


>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 403

 Score = 40.8 bits (94), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A+ RG F   S A    ADLR+A         AN + AD+ E++ SG+   GA L  A+ 
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 295

Query: 159 YKANFTG 165
           + AN  G
Sbjct: 296 WGANLKG 302


>gi|440804190|gb|ELR25067.1| pentapeptide repeatcontaining protein [Acanthamoeba castellanii
           str. Neff]
          Length = 293

 Score = 40.8 bits (94), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AQ   ADLR+A        +AN   AD+RE++ SG+    A L  A+  +A+ +G
Sbjct: 162 AQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAILRRADLSG 216


>gi|332707710|ref|ZP_08427737.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332353413|gb|EGJ32926.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 285

 Score = 40.8 bits (94), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 31/56 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    A+L +A+  + N R A+   AD+  +D +G+   GAY+ +A   KAN T
Sbjct: 58  SQATLTGANLSQAILREANLRGADLRGADLTGADLTGADLEGAYVNRADLRKANLT 113


>gi|332711030|ref|ZP_08430965.1| hypothetical cyclic nucleotide-binding domain protein [Moorea
           producens 3L]
 gi|332350156|gb|EGJ29761.1| hypothetical cyclic nucleotide-binding domain protein [Moorea
           producens 3L]
          Length = 328

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 7/73 (9%)

Query: 86  SNISALADLNKYEAETRGEFG--IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
           S ++ +A+LN YE   R +            +ADLR       NFR AN T AD R ++ 
Sbjct: 34  SQLAEIAELNLYEDLARVDLSGVNLENVNLNNADLRGT-----NFRNANLTGADFRNANL 88

Query: 144 SGSKFNGAYLEKA 156
           +G+ FN A L+ A
Sbjct: 89  TGADFNDAILDNA 101


>gi|425440351|ref|ZP_18820656.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389719234|emb|CCH96913.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 333

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|166364712|ref|YP_001656985.1| hypothetical protein MAE_19710 [Microcystis aeruginosa NIES-843]
 gi|425466893|ref|ZP_18846187.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|166087085|dbj|BAG01793.1| hypothetical protein MAE_19710 [Microcystis aeruginosa NIES-843]
 gi|389830484|emb|CCI27530.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 333

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|332711043|ref|ZP_08430978.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332350169|gb|EGJ29774.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 343

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 49/102 (48%), Gaps = 9/102 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR 129
            +   LA A++   S N + L   N   A+ T+      + A   +A L KA+ ++ N  
Sbjct: 170 LIDIDLANAILHQASLNDAELTGANLTGADLTKANL---ARANLNTAKLSKALLIRANLS 226

Query: 130 RANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           + N +     +AD+R +D SG+ F GA L  A    AN TG+
Sbjct: 227 KTNLSITELRNADLRNADLSGANFMGADLTGADLTSANLTGS 268


>gi|16330983|ref|NP_441711.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
 gi|383322725|ref|YP_005383578.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr. GT-I]
 gi|383325894|ref|YP_005386747.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-P]
 gi|383491778|ref|YP_005409454.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-N]
 gi|384437045|ref|YP_005651769.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803]
 gi|451815141|ref|YP_007451593.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
 gi|15214308|sp|P74297.1|SPKB_SYNY3 RecName: Full=Serine/threonine-protein kinase B
 gi|1653478|dbj|BAA18391.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
 gi|11022717|dbj|BAB17034.1| Ser/Thr protein kinase SpkB [Synechocystis sp. PCC 6803]
 gi|339274077|dbj|BAK50564.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803]
 gi|359272044|dbj|BAL29563.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr. GT-I]
 gi|359275214|dbj|BAL32732.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-N]
 gi|359278384|dbj|BAL35901.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-P]
 gi|407961651|dbj|BAM54891.1| eukariotic protein kinase [Bacillus subtilis BEST7613]
 gi|451781110|gb|AGF52079.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
          Length = 574

 Score = 40.8 bits (94), Expect = 0.21,   Method: Composition-based stats.
 Identities = 29/95 (30%), Positives = 41/95 (43%), Gaps = 7/95 (7%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
            V   LA A V   +   + L + N  +AE        + A FG A L+  +    N   
Sbjct: 456 LVGIVLAKAFVPGINCYQANLTNANFEQAEL-------TRADFGKARLKNVIFKGANLSD 508

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  AD+R +D  G+  NG   + A    ANF+G
Sbjct: 509 AYFGYADLRGADLRGANLNGVNFKYANLQGANFSG 543


>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 184

 Score = 40.8 bits (94), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 30/53 (56%), Gaps = 5/53 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKA 156
           S A  G ADLRKA   K +      R A+ + A++RE+D S +  +GAYL  A
Sbjct: 103 SGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDGAYLGHA 155


>gi|254409695|ref|ZP_05023476.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183692|gb|EDX78675.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 350

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A    ADL+ A+ ++    +A+ T+A +RE+D SG+   GA L  A    A   GT
Sbjct: 66  ADLSKADLKNALLIEATLSQADLTAAILREADLSGAILTGATLLDADLRHATLIGT 121



 Score = 39.3 bits (90), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 32/60 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL+ A     N  RAN + A++  ++ +G+   GA+L+ A    AN +G L+
Sbjct: 194 TGADLSDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSGLLL 253


>gi|298245346|ref|ZP_06969152.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297552827|gb|EFH86692.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 186

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIATEH 172
           +ADLR A   + + R AN  +A +  ++F  +   GA+L +A  + ANFTG  +A  H
Sbjct: 53  AADLRDADVSEADLRCANLKAAQLMRTNFQNADLRGAFLSRAECHNANFTGANLAGAH 110


>gi|144900552|emb|CAM77416.1| low-complexity proteins [Magnetospirillum gryphiswaldense MSR-1]
          Length = 433

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 8/96 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRAN 132
           L+ A++A+ S   + L+D    E+   G    + +  AAQ G A+L  A     + R AN
Sbjct: 300 LSGAILANASFREADLSDAFMAESRLDGADFRYAVLGAAQLGGANLGVAQLRHADMRLAN 359

Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
              A +R +D SG++ +GA L       A+FTG  +
Sbjct: 360 LEGAQLRGADLSGARLSGAKLS-----GADFTGATL 390


>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
           platensis str. Paraca]
 gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
           platensis str. Paraca]
          Length = 220

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 53/119 (44%), Gaps = 6/119 (5%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
           N+    YA+    R F   +L AA+    + N   L+  N  EA       IG   S +Q
Sbjct: 10  NKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-TLIAT 170
              ADL  AV +  N   A+ T   + ++D SG+  +GA L +      N TG +LI T
Sbjct: 68  LSYADLSMAVLIDANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGT 126


>gi|148548300|ref|YP_001268402.1| pentapeptide repeat-containing protein [Pseudomonas putida F1]
 gi|395448857|ref|YP_006389110.1| pentapeptide repeat-containing protein [Pseudomonas putida ND6]
 gi|148512358|gb|ABQ79218.1| pentapeptide repeat protein [Pseudomonas putida F1]
 gi|388562854|gb|AFK71995.1| pentapeptide repeat-containing protein [Pseudomonas putida ND6]
          Length = 219

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|186684326|ref|YP_001867522.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186466778|gb|ACC82579.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 413

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 5/65 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 163
           S A    A+L KA+ V  N +  NFT A++ E+D S     GS F  A L KA   +AN 
Sbjct: 216 SNADLTEANLSKAIFVGANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKATLPEANL 275

Query: 164 TGTLI 168
            G ++
Sbjct: 276 QGVIL 280


>gi|425444319|ref|ZP_18824373.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|425455654|ref|ZP_18835369.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389730303|emb|CCI05384.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389803421|emb|CCI17652.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|397695427|ref|YP_006533310.1| pentapeptide repeat-containing protein [Pseudomonas putida DOT-T1E]
 gi|421520705|ref|ZP_15967367.1| pentapeptide repeat-containing protein [Pseudomonas putida LS46]
 gi|298682200|gb|ADI95267.1| PedD [Pseudomonas putida DOT-T1E]
 gi|397332157|gb|AFO48516.1| pentapeptide repeat-containing protein [Pseudomonas putida DOT-T1E]
 gi|402755315|gb|EJX15787.1| pentapeptide repeat-containing protein [Pseudomonas putida LS46]
          Length = 219

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|425435715|ref|ZP_18816162.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|425462172|ref|ZP_18841646.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|440755045|ref|ZP_20934247.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|389679721|emb|CCH91528.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389824858|emb|CCI25881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|440175251|gb|ELP54620.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|422302321|ref|ZP_16389684.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389788496|emb|CCI15816.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 517

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ LA ++  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RE+    + FNGA L  A
Sbjct: 139 ADLRETKLQQTNFNGANLSGA 159



 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F +A+LR+A     N   A+F+ A+MR  D  G+  +GA L +A    AN +G
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSG 243


>gi|390441606|ref|ZP_10229649.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389835072|emb|CCI33775.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|354564859|ref|ZP_08984035.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353549985|gb|EHC19424.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 166

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 30/56 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A   +ADL KA     N   AN T+AD+ E++ +G+   GA  ++A    AN T
Sbjct: 89  SNANLTNADLEKANLSNANLSGANLTNADLEEANLTGANLRGANFQRADLEDANLT 144


>gi|386012542|ref|YP_005930819.1| Pentapeptide repeat-containing protein [Pseudomonas putida BIRD-1]
 gi|313499248|gb|ADR60614.1| Pentapeptide repeat-containing protein [Pseudomonas putida BIRD-1]
          Length = 219

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|428300458|ref|YP_007138764.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428237002|gb|AFZ02792.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 202

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 5/60 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A  G   LR A     N +  NFT A++R +D +G+   GA L  A  Y A+ TG  +
Sbjct: 49  SGANLGGVILRDA-----NLKGVNFTGANLRGADLTGANLEGAVLNNANLYGASLTGATL 103


>gi|300864976|ref|ZP_07109808.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300337032|emb|CBN54958.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 279

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 5/65 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NF 163
           + A    ADLR+A+ +  N   AN T A++R ++ S S   GA L  A  Y+A     N 
Sbjct: 183 NGANLSGADLRQAIAIGSNLSDANLTQANLRVANVSWSTLRGANLTGANLYRAKLNWSNL 242

Query: 164 TGTLI 168
           +G ++
Sbjct: 243 SGAIL 247


>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 1011

 Score = 40.4 bits (93), Expect = 0.26,   Method: Composition-based stats.
 Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 5/55 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A   +ADLR A     N  RAN + A++R ++ SG+  +G YL  A   +AN 
Sbjct: 850 SGADLRTADLRSA-----NLIRANLSDANLRSANLSGANLSGVYLNSADLRRANL 899


>gi|26989392|ref|NP_744817.1| pentapeptide repeat-containing protein [Pseudomonas putida KT2440]
 gi|24984254|gb|AAN68281.1|AE016462_7 pentapeptide repeat family protein [Pseudomonas putida KT2440]
          Length = 219

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 517

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ LA ++  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RE+    + FNGA L  A
Sbjct: 139 ADLRETKLQQTNFNGANLSGA 159



 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F +A+LR+A     N   A+F+ A+MR  D  G+  +GA L +A    AN +G
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSG 243


>gi|443662162|ref|ZP_21132897.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159030702|emb|CAO88375.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443332138|gb|ELS46762.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
 gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
          Length = 221

 Score = 40.4 bits (93), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A    ADLR  +    + R AN T AD+R +D  G+   GA L +A   +AN 
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANL 163


>gi|255528664|ref|ZP_05395416.1| pentapeptide repeat protein [Clostridium carboxidivorans P7]
 gi|255507642|gb|EET84130.1| pentapeptide repeat protein [Clostridium carboxidivorans P7]
          Length = 281

 Score = 40.4 bits (93), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 38/77 (49%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           NK  A  + +  I  +  F  ADLRK   +  + R A   +A++R ++ SG+   GA L 
Sbjct: 179 NKRNANLKNKKTISRSLDFFGADLRKTNLIGADLRGACLIAANLRGTNLSGADLIGADLR 238

Query: 155 KAVAYKANFTGTLIATE 171
            A    AN T ++  T+
Sbjct: 239 DADLSGANLTNSIFLTQ 255


>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 331

 Score = 40.4 bits (93), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    ++L KA  ++ NF RAN T A + ++D S     GA L  A+  K N T  ++
Sbjct: 65  SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLS-----GAILSSAIGTKTNLTAAIL 119


>gi|425453004|ref|ZP_18832819.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|389764929|emb|CCI09042.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|359460720|ref|ZP_09249283.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 294

 Score = 40.0 bits (92), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 12/65 (18%)

Query: 111 AQFGSADLRKAVHVKE------NFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAY 159
           A+F  ADLR+ V++++      NF RAN      T AD+RE+DF+ +    A L +A   
Sbjct: 173 ARFQDADLRR-VNLQQAFVKSANFARANLVGADLTKADLRETDFTRANLTQAVLTQAKLR 231

Query: 160 KANFT 164
            ANF+
Sbjct: 232 DANFS 236


>gi|338740277|ref|YP_004677239.1| hypothetical protein HYPMC_3462 [Hyphomicrobium sp. MC1]
 gi|337760840|emb|CCB66673.1| protein of unknown function [Hyphomicrobium sp. MC1]
          Length = 1588

 Score = 40.0 bits (92), Expect = 0.29,   Method: Composition-based stats.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
           +CSS   A A +  +       F + S   F  ADL+ A   +E  R A F++AD+R+ D
Sbjct: 876 NCSSGDCANAKMKGWN------FSVISQTDFSGADLKGAEFPRET-RGAKFSNADLRDVD 928

Query: 143 FSGSKFNGAYLEKAVAYKANFTGTLIA 169
            SG +F       A   +ANF  + +A
Sbjct: 929 ISGKQFQSCSFIGANLREANFGSSEVA 955



 Score = 40.0 bits (92), Expect = 0.30,   Method: Composition-based stats.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 27/109 (24%)

Query: 52  DCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLN--KYEAETRGEFGIGS 109
           +CS+  CA   AK+K W   V +           ++ S  ADL   ++  ETRG      
Sbjct: 876 NCSSGDCAN--AKMKGWNFSVIS----------QTDFSG-ADLKGAEFPRETRG------ 916

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYL 153
            A+F +ADLR      + F+  +F  A++RE++F     +G  F+G++L
Sbjct: 917 -AKFSNADLRDVDISGKQFQSCSFIGANLREANFGSSEVAGPNFSGSFL 964


>gi|434394477|ref|YP_007129424.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428266318|gb|AFZ32264.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 132

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 4/55 (7%)

Query: 115 SADLRKAVHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S++L++ ++ K+    N R AN  +A++ E++ SG+   GA L+ A   KAN  G
Sbjct: 40  SSELQRLLNTKQCPGCNLRGANLRNANLEEANLSGANLQGANLQNADLEKANLQG 94


>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 402

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A    ADL KA   K NF  ANFT A + E+   G+ F  AYL +A    AN TGT
Sbjct: 281 AILAGADLTKA---KANFTGANFTGAILTEAILIGANFEKAYLIRADLTGANLTGT 333



 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 47/108 (43%), Gaps = 24/108 (22%)

Query: 71  FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
           F    L  A++  A+    I A ADL K +A   G       A F  A L +A+ +  NF
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIGANF 315

Query: 129 RRA---------------NFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
            +A               N T AD+ E+D +G+    AYL KA+  +A
Sbjct: 316 EKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEA 363



 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR--ESDFSGSKFNGAYLEKAVAYKANF 163
           A F  A L +A+ +  NF  A    AD+   +++F+G+ F GA L +A+   ANF
Sbjct: 261 ANFTRAILTEAILIGANFEEAILAGADLTKAKANFTGANFTGAILTEAILIGANF 315


>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
 gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
          Length = 830

 Score = 40.0 bits (92), Expect = 0.31,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 90  ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
           ALADL   +      +R  F   S A+   ADLR+ +  + +FR A+   AD RE+    
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281

Query: 146 SKFNGAYLEKAVAYKANFTG 165
           + F+GA L +A+    + TG
Sbjct: 282 ANFSGADLSRAIFSGTDLTG 301



 Score = 36.2 bits (82), Expect = 5.1,   Method: Composition-based stats.
 Identities = 17/48 (35%), Positives = 27/48 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           S + F  AD R+A   +   R+ANF+ AD+  + FSG+   G   ++A
Sbjct: 260 SESDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQA 307


>gi|425470595|ref|ZP_18849461.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389883733|emb|CCI35905.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 333

 Score = 40.0 bits (92), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRCANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|378719423|ref|YP_005284312.1| pentapeptide repeat-containing protein [Gordonia polyisoprenivorans
           VH2]
 gi|375754126|gb|AFA74946.1| pentapeptide repeat family protein [Gordonia polyisoprenivorans
           VH2]
          Length = 481

 Score = 40.0 bits (92), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 31/63 (49%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
            A F  AD R A     + R AN T A++ +  F+G+   GA L  A   +ANF G  + 
Sbjct: 394 GASFVGADGRLASFTGADLRGANLTGANLSQGSFTGANLTGANLSGANLTEANFLGADLT 453

Query: 170 TEH 172
           T +
Sbjct: 454 TAN 456


>gi|354564871|ref|ZP_08984047.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353549997|gb|EHC19436.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 105

 Score = 40.0 bits (92), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 29/56 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL KA     N   AN T+AD+ E++ +G+   GA  ++A    AN T
Sbjct: 28  SNANLTGADLEKANLSNANLSGANLTNADLEEANLTGANLKGANFQRADLEDANLT 83


>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
 gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
          Length = 309

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 40/91 (43%), Gaps = 13/91 (14%)

Query: 89  SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFR-----RANFTSADMRE 140
           +AL   N   A+ RG    G   S A    ADLR  + V  + R     +AN T AD+  
Sbjct: 45  AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVSLRKANLTGADLTR 104

Query: 141 -----SDFSGSKFNGAYLEKAVAYKANFTGT 166
                +D S +   GA L +A+   AN T T
Sbjct: 105 ANLANADLSEANLTGAQLSEAIVRDANLTLT 135



 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 30/61 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           SA     A+L +A+ +  N RRA    A++RE  F  +    A L+KA    A+  G  +
Sbjct: 183 SATNLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSL 242

Query: 169 A 169
           A
Sbjct: 243 A 243


>gi|428301869|ref|YP_007140175.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238413|gb|AFZ04203.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 256

 Score = 40.0 bits (92), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 15/82 (18%)

Query: 92  ADL---NKYEAETRGEFGIGSAAQFGSADLRKA-VH----VKENFRRANFTSADMRESDF 143
           ADL   N YEAE +G +     A    A+LRKA +H    ++ NF  A+ + AD+R +  
Sbjct: 164 ADLSVANLYEAEMQGSYLY--QANLCRANLRKAHLHHGYLLRVNFAEADLSDADLRWTVL 221

Query: 144 SGSKFNGAYLEKAVAYKANFTG 165
           SG+ F G  L     + ANFTG
Sbjct: 222 SGANFAGTNL-----HGANFTG 238


>gi|428307960|ref|YP_007144785.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428249495|gb|AFZ15275.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 201

 Score = 40.0 bits (92), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 84  CSSNISALADL---NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
           C +N+   ADL   + +EA   G   IG  A+   ADL  A     NFR AN   AD+ E
Sbjct: 93  CEANLGG-ADLIEADLFEANLTGANLIG--AKLIGADLTGA-----NFREANLMGADLFE 144

Query: 141 SDFSGSKFNGAYLEKAVAYKANFTG 165
           ++ SG+  +GA L  A    AN +G
Sbjct: 145 ANLSGANLSGANLSGANLTLANLSG 169



 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 165
            F  ADL KA+ +  N   AN   AD+ E++FS      +   GA L +A  ++AN TG
Sbjct: 56  NFSKADLSKAILMGANLMGANLCEADIMEANFSKANLCEANLGGADLIEADLFEANLTG 114


>gi|53721218|ref|YP_110203.1| hypothetical protein BPSS0182 [Burkholderia pseudomallei K96243]
 gi|167818308|ref|ZP_02449988.1| hypothetical protein Bpse9_24431 [Burkholderia pseudomallei 91]
 gi|418395056|ref|ZP_12969100.1| type VI secretion system [Burkholderia pseudomallei 354a]
 gi|418554994|ref|ZP_13119746.1| type VI secretion system [Burkholderia pseudomallei 354e]
 gi|52211632|emb|CAH37627.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
 gi|385369399|gb|EIF74730.1| type VI secretion system [Burkholderia pseudomallei 354e]
 gi|385374364|gb|EIF79254.1| type VI secretion system [Burkholderia pseudomallei 354a]
          Length = 825

 Score = 40.0 bits (92), Expect = 0.35,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 5/59 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTLI 168
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  LI
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILI 801


>gi|290962440|ref|YP_003493622.1| hypothetical protein SCAB_81351 [Streptomyces scabiei 87.22]
 gi|260651966|emb|CBG75096.1| putative membrane protein [Streptomyces scabiei 87.22]
          Length = 387

 Score = 40.0 bits (92), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIAT 170
           ADLR A  V  + R A+FT  D+RE++   +  +GA   +A    A+  GT ++T
Sbjct: 273 ADLRAAKLVGTDLRDADFTETDLREANLRKADAHGAVFHRADLRMADLRGTDLST 327


>gi|209964001|ref|YP_002296916.1| pentapeptide repeat-containing protein [Rhodospirillum centenum SW]
 gi|209957467|gb|ACI98103.1| pentapeptide repeat family protein [Rhodospirillum centenum SW]
          Length = 433

 Score = 40.0 bits (92), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 29/55 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    A L KA  V+ N R AN + AD+R +D +G+    A L  A+  +A  TG
Sbjct: 367 ANLSGAKLVKASLVRANLRNANLSGADLRGADLTGANLIDANLRGALLDEAVLTG 421


>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
 gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
          Length = 229

 Score = 40.0 bits (92), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 29/54 (53%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A    A+LR A     NF RA+  SAD+R +D  G+ F GA LE AV    + T
Sbjct: 65  ANLKGANLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQHTDLT 118


>gi|258612055|ref|ZP_05243959.2| phage protein [Listeria monocytogenes FSL R2-503]
 gi|258608006|gb|EEW20614.1| phage protein [Listeria monocytogenes FSL R2-503]
          Length = 187

 Score = 40.0 bits (92), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 35/70 (50%)

Query: 96  KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
           K+  +  GE      A    ADLR A     N RRA+ + AD+  +D +G+  NGA L +
Sbjct: 15  KWLRDGYGERANLRGANLRGADLRGADLSYANLRRADLSRADLNGADLNGADLNGADLSR 74

Query: 156 AVAYKANFTG 165
           A    A+ +G
Sbjct: 75  ADLNGADLSG 84


>gi|218442262|ref|YP_002380590.1| hypothetical protein PCC7424_5569 [Cyanothece sp. PCC 7424]
 gi|218175403|gb|ACK74133.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 184

 Score = 40.0 bits (92), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 5/61 (8%)

Query: 111 AQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A  G ADLR A   + N  R     A+ T A++++++  G+  +GAYL+++    A+  G
Sbjct: 105 ANLGGADLRGANLSQTNLSRADLRGADLTGANLKQANLEGANLDGAYLDQSDLTGASLEG 164

Query: 166 T 166
           T
Sbjct: 165 T 165


>gi|334120837|ref|ZP_08494914.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333455836|gb|EGK84476.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 197

 Score = 39.7 bits (91), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 1/60 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    A+L++AV ++ N R A+ + AD+R +DF  +   GA    A+   A+F G  +
Sbjct: 42  SGANLAGANLQRAV-LRANLRGADLSGADLRGADFRNADLRGASFANALVRDASFGGAFL 100


>gi|193213002|ref|YP_001998955.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193086479|gb|ACF11755.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 193

 Score = 39.7 bits (91), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 31/128 (24%), Positives = 49/128 (38%), Gaps = 11/128 (8%)

Query: 35  WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
           +V C ++    S   F +CS  QC    AKL      + T         C       +D 
Sbjct: 34  FVQCNLAQADLSGFMFRECSFEQCDMGLAKL------IDTGFQEVKFIDCKLLGVQFSDC 87

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
            K   E   +  I   + F + DL+  V     F   +   AD  E++ +GS+F+   L 
Sbjct: 88  RKLLLEINFKRCILKLSVFTNLDLKNTV-----FDDCDMQEADFTEANLTGSRFDNCDLR 142

Query: 155 KAVAYKAN 162
            A+ +  N
Sbjct: 143 LAIFFHTN 150


>gi|167034127|ref|YP_001669358.1| pentapeptide repeat-containing protein [Pseudomonas putida GB-1]
 gi|166860615|gb|ABY99022.1| pentapeptide repeat protein [Pseudomonas putida GB-1]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LE+A    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLERARLQGANLT 93


>gi|300865105|ref|ZP_07109930.1| serine/threonine protein kinase [Oscillatoria sp. PCC 6506]
 gi|300336876|emb|CBN55080.1| serine/threonine protein kinase [Oscillatoria sp. PCC 6506]
          Length = 540

 Score = 39.7 bits (91), Expect = 0.38,   Method: Composition-based stats.
 Identities = 30/81 (37%), Positives = 38/81 (46%), Gaps = 9/81 (11%)

Query: 91  LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFS 144
           LA  N YEA  TR        A    A+L  A  V+ N R AN T     +A+++ +D  
Sbjct: 430 LAGANFYEARLTRANL---QGADLSEANLGHARLVEANLRDANLTQAYCSTANLQSADLR 486

Query: 145 GSKFNGAYLEKAVAYKANFTG 165
           G+   GAYL KA    AN  G
Sbjct: 487 GANLAGAYLSKANLRGANLCG 507


>gi|348176753|ref|ZP_08883647.1| pentapeptide repeat-containing protein [Saccharopolyspora spinosa
           NRRL 18395]
          Length = 198

 Score = 39.7 bits (91), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F  T L  + +  CS   S+ AD  +  A T  E  + +    G ADLR        FR 
Sbjct: 71  FERTVLGKSTLDGCSLLGSSFADC-RLRAWTLRETDL-TLVGMGKADLRGLDLRGIRFRE 128

Query: 131 ANFTSADMR-----ESDFSGSKFNGAYLEKA 156
           AN T  D+R     E+DF+G++  GA LE+A
Sbjct: 129 ANLTECDLRRCDLREADFTGARLLGARLEEA 159


>gi|167838610|ref|ZP_02465469.1| pentapeptide repeat family protein [Burkholderia thailandensis
           MSMB43]
          Length = 825

 Score = 39.7 bits (91), Expect = 0.39,   Method: Composition-based stats.
 Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 10/83 (12%)

Query: 98  EAETRGEFGIGSA----AQFGSADLRKAVHVKENFRRANFTSADMRESD-----FSGSKF 148
           EA  RG   IGS     A   +AD R A     +F RA+FT AD+R++D       G+  
Sbjct: 723 EASFRGA-RIGSCDFTDACLRAADFRGAKAQGSHFVRADFTRADLRDTDLIAAYLRGATL 781

Query: 149 NGAYLEKAVAYKANFTGTLIATE 171
           +GA L +A  ++AN +  L   E
Sbjct: 782 DGADLRRANLFRANLSQILADAE 804


>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 266

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 28/54 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A    ADLR+A  ++ N    + + AD+R ++  G    GA L KA   +AN +
Sbjct: 153 ADLNDADLREAQLIRANLSEVDLSGADLRAANLKGVNLRGADLNKADLSRANLS 206


>gi|441147419|ref|ZP_20964505.1| OxyO [Streptomyces rimosus subsp. rimosus ATCC 10970]
 gi|440620240|gb|ELQ83273.1| OxyO [Streptomyces rimosus subsp. rimosus ATCC 10970]
          Length = 345

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 5/67 (7%)

Query: 111 AQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    ADLR+A   + N R A     N   AD+R +D  G    G  L  AV Y+A   G
Sbjct: 223 ADLREADLREATPARANLRDADLSDANVRKADLRFADLRGVDLWGTDLRGAVLYRAKLAG 282

Query: 166 TLIATEH 172
             ++  H
Sbjct: 283 LELSEAH 289


>gi|406910529|gb|EKD50527.1| Pentapeptide repeat protein, partial [uncultured bacterium]
          Length = 529

 Score = 39.7 bits (91), Expect = 0.40,   Method: Composition-based stats.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 7/100 (7%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
           VF    L  A +     ++++L D N  +A+ RG       A F  A+L+ A  V     
Sbjct: 398 VFKGADLMGASLDGAKFDLASLEDANMTDAKLRG-------ASFTYANLKSAKLVSAQLS 450

Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
            +    AD+R +D S +  +GA L +A    A   GT +A
Sbjct: 451 GSKLMHADLRRADLSYADLSGADLTEAKLSYAVLEGTRLA 490


>gi|427717281|ref|YP_007065275.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427349717|gb|AFY32441.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 483

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           DLRK    + +   AN   AD+RESD SG+   G+ L +A    A  +G+++
Sbjct: 216 DLRKTDIRRADLLGANLEQADLRESDLSGADLRGSNLRRADLEGAKLSGSIL 267


>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 328

 Score = 39.7 bits (91), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
            G A L+KA  V  N   AN + AD+ E+D   ++ +GA L+ A  + AN T  L+
Sbjct: 52  LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTLALL 107


>gi|73621284|gb|AAZ78338.1| OxyO [Streptomyces rimosus]
          Length = 353

 Score = 39.7 bits (91), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 5/67 (7%)

Query: 111 AQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    ADLR+A   + N R A     N   AD+R +D  G    G  L  AV Y+A   G
Sbjct: 231 ADLREADLREATPARANLRDADLSDANVRKADLRFADLRGVDLWGTDLRGAVLYRAKLAG 290

Query: 166 TLIATEH 172
             ++  H
Sbjct: 291 LELSEAH 297


>gi|86606920|ref|YP_475683.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86555462|gb|ABD00420.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 154

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S AQ   A+LR  V      R A+ + AD+RE D SG+  +GA L  A   + N  G
Sbjct: 32  SGAQLSGANLRGIV-----LRDADLSGADLREGDLSGADLSGADLRGAKLRRVNLIG 83


>gi|443326649|ref|ZP_21055296.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442793770|gb|ELS03210.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 920

 Score = 39.7 bits (91), Expect = 0.42,   Method: Composition-based stats.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    A+L  A  V+ N  RAN   A++  ++ +G+   GA LEKA+   ANF G
Sbjct: 801 ANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAILEGANFRG 855


>gi|434388230|ref|YP_007098841.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428019220|gb|AFY95314.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 193

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 26/57 (45%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           G  A    ADLR A     N  + N   AD+R +D +G    GA L +A    AN T
Sbjct: 97  GDRASLHKADLRLASLQGANLSQVNLVGADLRYADLTGVNLTGANLSRANLTGANLT 153


>gi|300864770|ref|ZP_07109621.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
 gi|300337239|emb|CBN54769.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
          Length = 334

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 27/55 (49%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A     DLR    ++ N  + N T AD+RE+D S +  N A L+ A    AN  G
Sbjct: 230 ADLHDTDLRGGNLIQANLMKTNLTEADLREADLSHTNLNLANLKGADLSGANLQG 284


>gi|87311950|ref|ZP_01094060.1| hypothetical protein DSM3645_13340 [Blastopirellula marina DSM
           3645]
 gi|87285312|gb|EAQ77236.1| hypothetical protein DSM3645_13340 [Blastopirellula marina DSM
           3645]
          Length = 586

 Score = 39.7 bits (91), Expect = 0.43,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 32/64 (50%), Gaps = 5/64 (7%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTG 165
           AQ    DLR+    + NF+  NF  AD+  SDF+G++   A L    A       ANF+G
Sbjct: 128 AQLPGCDLREVSGKQANFQDVNFARADLSRSDFTGAQLAEADLSGVTAVAAQWKLANFSG 187

Query: 166 TLIA 169
             +A
Sbjct: 188 AQLA 191


>gi|119486205|ref|ZP_01620265.1| hypothetical protein L8106_17717 [Lyngbya sp. PCC 8106]
 gi|119456696|gb|EAW37825.1| hypothetical protein L8106_17717 [Lyngbya sp. PCC 8106]
          Length = 160

 Score = 39.7 bits (91), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 29/54 (53%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
            ADLR+A     +   AN   AD+ ++D  G+   GA L +   + ANF GT++
Sbjct: 53  QADLRQADLKGADLWEANLKGADLTDADLRGANLWGADLSQTQTFGANFQGTIL 106


>gi|344339023|ref|ZP_08769953.1| pentapeptide repeat protein [Thiocapsa marina 5811]
 gi|343800943|gb|EGV18887.1| pentapeptide repeat protein [Thiocapsa marina 5811]
          Length = 284

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A   + + R AN   ADMR++DF GS        KA+  +AN 
Sbjct: 103 SKANLERADLRHADVRRADLRGANLAHADMRDTDFQGSDLCHVVAPKALFIRANL 157



 Score = 38.9 bits (89), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 14/104 (13%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETR----GEFGIGSAAQFGSADLR----KAVHVKE- 126
           L  A +  C  N + LA  + +EA+      G F + + A F  ADLR    ++V  +E 
Sbjct: 162 LCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADLRGSKLRSVSAQET 221

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIAT 170
           NFR AN T  D+     + +   GA L +A    A+F+G  +A+
Sbjct: 222 NFRNANLTDVDL-----TNAVLGGAILRRADVTNADFSGVELAS 260


>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 189

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 29/55 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    A+L++A     N  RAN + AD+  +D SG+   GA L  A   +AN TG
Sbjct: 40  ADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMRANLTG 94


>gi|411119960|ref|ZP_11392336.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410710116|gb|EKQ67627.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 262

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    A+L +A  V+ N  RAN   AD+R ++  G   NGA L  A    AN TG
Sbjct: 51  ADLAGANLMQANLVQANLSRANLQGADLRGANLVGVSLNGAILVGARLDGANLTG 105


>gi|425452313|ref|ZP_18832131.1| Genome sequencing data, contig C306 [Microcystis aeruginosa PCC
           7941]
 gi|389765978|emb|CCI08285.1| Genome sequencing data, contig C306 [Microcystis aeruginosa PCC
           7941]
          Length = 188

 Score = 39.7 bits (91), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 27/42 (64%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
           +Q    +LR A  +  N RRANFT AD+ +++F+G++ +  Y
Sbjct: 128 SQLMDVNLRGASLINANIRRANFTGADVTDTNFTGAQCSDGY 169


>gi|425434358|ref|ZP_18814827.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9432]
 gi|389676157|emb|CCH94764.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9432]
          Length = 179

 Score = 39.7 bits (91), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+FT A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADFTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
 gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
          Length = 275

 Score = 39.7 bits (91), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 14/84 (16%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD--------- 142
           +D  + EAE R +F   S + F  A++R     K N  +ANF  AD+R+ D         
Sbjct: 85  SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140

Query: 143 -FSGSKFNGAYLEKAVAYKANFTG 165
            F G+        ++VA KA+F G
Sbjct: 141 IFDGADMRNVLFTRSVANKASFKG 164



 Score = 39.3 bits (90), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 8/73 (10%)

Query: 99  AETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY--- 152
           AE RG   E G  +       DL++A+    NF+ ++F   +   +DFSGS F+GA    
Sbjct: 50  AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109

Query: 153 --LEKAVAYKANF 163
             LEKA   KANF
Sbjct: 110 VDLEKANLNKANF 122


>gi|88808683|ref|ZP_01124193.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
 gi|88787671|gb|EAR18828.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
          Length = 176

 Score = 39.7 bits (91), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 49/113 (43%), Gaps = 14/113 (12%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA----ETRGEFGIGS-AAQFGSAD 117
           A L N R  ++TAL AA+V         L D    EA    E RG+  +    +     D
Sbjct: 5   ALLCNLRRHLTTALLAALVVFTG----VLIDGPSVEAITAPELRGQRAVQDITSDMHGRD 60

Query: 118 LRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 165
           L++   +K + R  +   AD+R      S   G+   GA LE  VA+ + F G
Sbjct: 61  LKEKEFLKADLREVDLGEADLRGAVINTSQLQGADLRGADLEDVVAFSSRFDG 113


>gi|428303610|ref|YP_007113059.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238815|gb|AFZ04603.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 490

 Score = 39.7 bits (91), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 28/56 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A F  ADLR A+    + R AN   AD+R +D  G+   GA L      +AN  GT
Sbjct: 187 ASFKKADLRNAILEGADLREANLEGADLRGADLRGANLWGADLTGVDLCEANLEGT 242



 Score = 37.0 bits (84), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 27/51 (52%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           ADL +A   K + R A    AD+RE++  G+   GA L  A  + A+ TG 
Sbjct: 182 ADLERASFKKADLRNAILEGADLREANLEGADLRGADLRGANLWGADLTGV 232


>gi|320353524|ref|YP_004194863.1| pentapeptide repeat-containing protein [Desulfobulbus propionicus
           DSM 2032]
 gi|320122026|gb|ADW17572.1| pentapeptide repeat protein [Desulfobulbus propionicus DSM 2032]
          Length = 342

 Score = 39.7 bits (91), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 28/60 (46%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + AQ   ADL  A   K N R AN   AD+  +D  G+   GA LE A+       G ++
Sbjct: 85  TGAQLSLADLSGANLKKANLRNANLHGADLAYADLEGANLTGASLEGAIFKATKMKGRIV 144


>gi|158338817|ref|YP_001519994.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158309058|gb|ABW30675.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 294

 Score = 39.7 bits (91), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 12/65 (18%)

Query: 111 AQFGSADLRKAVHVKE------NFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAY 159
           A+F  ADLR+ V++++      NF RAN      T AD+RE+DF+ +    A L +A   
Sbjct: 173 ARFQDADLRR-VNLQQAFVKSANFARANLVGADLTKADLRETDFTRANLTQAALTQAKLR 231

Query: 160 KANFT 164
            ANF+
Sbjct: 232 DANFS 236


>gi|297170923|gb|ADI21940.1| uncharacterized low-complexity proteins [uncultured nuHF2 cluster
           bacterium HF0130_29D04]
          Length = 695

 Score = 39.7 bits (91), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 32/62 (51%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
            A F  ADLR+A  V  + R ANF  A+++ +    +   GA LE+A  Y A+  G  + 
Sbjct: 139 GANFRGADLREAKLVGADLREANFRGANLQTAYLIKADLKGANLEEASLYGADLEGAKLD 198

Query: 170 TE 171
            E
Sbjct: 199 PE 200



 Score = 39.3 bits (90), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 33/68 (48%), Gaps = 5/68 (7%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFT 164
            A   +ADLR A     + R  N  SAD+R SD  G+ F GA L +A        +ANF 
Sbjct: 104 GADLRNADLRGADLWGADLRGVNLWSADLRNSDLRGANFRGADLREAKLVGADLREANFR 163

Query: 165 GTLIATEH 172
           G  + T +
Sbjct: 164 GANLQTAY 171


>gi|345872411|ref|ZP_08824346.1| pentapeptide repeat protein [Thiorhodococcus drewsii AZ1]
 gi|343918959|gb|EGV29716.1| pentapeptide repeat protein [Thiorhodococcus drewsii AZ1]
          Length = 284

 Score = 39.3 bits (90), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 30/58 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S AQ   ADLR A     N + AN + AD+R +DF GS  +     KA+  +AN   T
Sbjct: 103 SDAQLTGADLRCAEVRYANLKHANLSHADLRGTDFHGSDLSHMVAIKALLIRANLRET 160


>gi|332711272|ref|ZP_08431204.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332349821|gb|EGJ29429.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 153

 Score = 39.3 bits (90), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 25/49 (51%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           DL  A  +K N    N   AD+RE+D SG+   GA L  A    A+ +G
Sbjct: 79  DLTAATLIKANLTETNLHKADLREADLSGANLQGADLSGANLQGADLSG 127


>gi|150389232|ref|YP_001319281.1| pentapeptide repeat-containing protein [Alkaliphilus
           metalliredigens QYMF]
 gi|149949094|gb|ABR47622.1| pentapeptide repeat protein [Alkaliphilus metalliredigens QYMF]
          Length = 298

 Score = 39.3 bits (90), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 31/65 (47%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           +G    F  ADLRK     EN R A   +AD+R  D SG+   GA    A    AN + +
Sbjct: 207 LGRGIDFIGADLRKNDLRGENLRGAYLIAADLRGVDLSGADVIGADFRDADLRGANLSRS 266

Query: 167 LIATE 171
           +  T+
Sbjct: 267 IFLTQ 271


>gi|427719675|ref|YP_007067669.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427352111|gb|AFY34835.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 291

 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 8/103 (7%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           AKL    +  +  +AA ++A+       L++ N YEAE  G +     A    A+L KA 
Sbjct: 172 AKLMRANLSFANLIAANLIATD------LSEANLYEAEVMGAYLY--QADLYKANLSKAH 223

Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
                  RAN T AD+R ++ S S  +GA L  A   +AN TG
Sbjct: 224 LSSAYLFRANLTKADLRGANLSWSNLSGANLAGADLCRANLTG 266


>gi|389874428|ref|YP_006373784.1| pentapeptide repeat-containing protein [Tistrella mobilis
           KA081020-065]
 gi|388531608|gb|AFK56802.1| pentapeptide repeat-containing protein [Tistrella mobilis
           KA081020-065]
          Length = 178

 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 33/67 (49%), Gaps = 5/67 (7%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYK 160
           G   AA F  ADL+ A   +    RA+FT AD+R +     D  G+ F GA L  A  Y 
Sbjct: 95  GKAEAAIFAEADLQSADFTRSKAARADFTGADLRRARFYRADLRGADFTGANLTGADLYD 154

Query: 161 ANFTGTL 167
           A+  G +
Sbjct: 155 ADLEGAV 161



 Score = 35.4 bits (80), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 31/62 (50%), Gaps = 5/62 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANF 163
           +   F  ADLR A  V      A F  AD++ +DF+ SK     F GA L +A  Y+A+ 
Sbjct: 78  TGTDFSGADLRGAKFVSGKAEAAIFAEADLQSADFTRSKAARADFTGADLRRARFYRADL 137

Query: 164 TG 165
            G
Sbjct: 138 RG 139


>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 267

 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A   +A+ +KA  +      +N T AD+ ++D +G   + A L +A   + NFTG
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTG 188


>gi|381204293|ref|ZP_09911364.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 156

 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 71  FVSTALAAAVVASCSSNISALAD-LNKYEAETRG----EFGIGSAAQFGS----ADLRKA 121
            V+T L A   A    ++  L D  N  + + RG    EF +     + S    ADLRKA
Sbjct: 20  IVATLLTADASAYKQEDLDKLQDTYNCVKCDLRGAILREFNLTGTNLYKSDLRKADLRKA 79

Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
                N    N T A +RE++ +G+  +GA+L +A    AN  G ++
Sbjct: 80  DLRDTNLGDTNLTGAVLREANLTGANMSGAHLWEANLTGANLEGAVL 126


>gi|428774386|ref|YP_007166174.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanobacterium stanieri PCC 7202]
 gi|428688665|gb|AFZ48525.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanobacterium stanieri PCC 7202]
          Length = 506

 Score = 39.3 bits (90), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  AD  +A  V+ N  +A+   A+++ +DF  +   GA LE A  YKAN  G
Sbjct: 420 ANFYHADFSRARLVRANLTKAHLFKAELQYADFRNANLTGANLEGANLYKANLCG 474


>gi|336250332|ref|YP_004594042.1| hypothetical protein EAE_19280 [Enterobacter aerogenes KCTC 2190]
 gi|334736388|gb|AEG98763.1| hypothetical protein EAE_19280 [Enterobacter aerogenes KCTC 2190]
          Length = 846

 Score = 39.3 bits (90), Expect = 0.55,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +    AD R A  ++ N   + F   D RE+DF+ +   GA L+K+    ANF G
Sbjct: 754 SDLSEADCRDASFIRANLVGSLFVRTDFREADFTDANLMGALLQKSQLAGANFEG 808


>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
            2638]
 gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
          Length = 1277

 Score = 39.3 bits (90), Expect = 0.56,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%)

Query: 106  GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             IG +A F  A LR+A   +  F +A F  +D+ E++ + + F GA   KAV    NF
Sbjct: 1004 AIGMSADFSKASLRRADLSRGLFNKALFVESDLSEANGAQAIFKGAQFPKAVLRDTNF 1061


>gi|444351422|ref|YP_007387566.1| pentapeptide repeat [Enterobacter aerogenes EA1509E]
 gi|443902252|emb|CCG30026.1| pentapeptide repeat [Enterobacter aerogenes EA1509E]
          Length = 846

 Score = 39.3 bits (90), Expect = 0.57,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +    AD R A  ++ N   + F   D RE+DF+ +   GA L+K+    ANF G
Sbjct: 754 SDLSEADCRDASFIRANLVGSLFVRTDFREADFTDANLMGALLQKSQLAGANFEG 808


>gi|156081718|ref|XP_001608352.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148800923|gb|EDL42328.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1301

 Score = 39.3 bits (90), Expect = 0.57,   Method: Composition-based stats.
 Identities = 19/66 (28%), Positives = 34/66 (51%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           Y     G   +G+ A    AD+ +A  ++ +F RA+F  AD   +D + + FN A + +A
Sbjct: 33  YREALPGRAALGTEADLSRADVSRADAIRADFNRADFNRADFNRADVNRADFNRADVSRA 92

Query: 157 VAYKAN 162
              +A+
Sbjct: 93  NFNRAD 98


>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
           [Acaryochloris marina MBIC11017]
 gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
           marina MBIC11017]
          Length = 532

 Score = 39.3 bits (90), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 10/65 (15%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
            +F + DLR A+ +  NF RANFT A++R           +D + +   GA L  A    
Sbjct: 428 GKFQNTDLRDAILINANFGRANFTGANLRNANLMQAYMSHADLANADLRGANLSDAYLSH 487

Query: 161 ANFTG 165
           AN  G
Sbjct: 488 ANLRG 492


>gi|418939072|ref|ZP_13492497.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
 gi|375054219|gb|EHS50602.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
          Length = 202

 Score = 39.3 bits (90), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADLR A     NF  AN  SAD++ +D + +   GA L  A   +AN TG ++
Sbjct: 63  TGANLTGADLRWADCDGANFTGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGAIL 122


>gi|428297376|ref|YP_007135682.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428233920|gb|AFY99709.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 179

 Score = 39.3 bits (90), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A  G A+L +A     N   AN + AD+R ++  G+   GA L  A+   AN TG ++
Sbjct: 122 SGANLGGANLTQA-----NLVNANLSGADLRGANLGGANLKGANLSGALLDGANTTGAIM 176


>gi|428213676|ref|YP_007086820.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428002057|gb|AFY82900.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 471

 Score = 39.3 bits (90), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 8/69 (11%)

Query: 100 ETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           + RG   IG   + A F   DL++A     N  +A+   AD+RE++ +G+  +GA L +A
Sbjct: 294 DLRGAILIGCSLTQANFAGGDLQEA-----NLSQADLRGADLREANLAGANLSGANLNEA 348

Query: 157 VAYKANFTG 165
               AN  G
Sbjct: 349 DLDGANLAG 357


>gi|209967175|ref|YP_002300090.1| pentapeptide repeat-containing protein [Rhodospirillum centenum SW]
 gi|209960641|gb|ACJ01278.1| pentapeptide repeat family protein [Rhodospirillum centenum SW]
          Length = 429

 Score = 39.3 bits (90), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKA 161
           I + A F    +  AV ++ +F  AN    D+R++D  G+ F GA LE      A   +A
Sbjct: 152 IAAKADFSEVRMNGAVVLRADFTDANLARVDLRDADLRGANFRGANLEGANLQGAQVQEA 211

Query: 162 NFTGTLI 168
           +F G ++
Sbjct: 212 DFAGAVL 218



 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
            ADLR A     N   AN   A ++E+DF+G+    A L    A  ANF G ++
Sbjct: 185 DADLRGANFRGANLEGANLQGAQVQEADFAGAVLTDAQLRDVQAAGANFRGAIL 238


>gi|443326309|ref|ZP_21054967.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794049|gb|ELS03478.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 366

 Score = 39.3 bits (90), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 5/93 (5%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-----R 130
           L   +  + +  IS LA+L K + +T    G   AA     DL  A   K N R      
Sbjct: 211 LIEQIYIAKTEQISELAELAKLDLKTDLAGGNLLAANLAGIDLNGANLQKTNLRGVILND 270

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A+ +  ++R ++  G+  +GAYLE A    AN 
Sbjct: 271 ADLSETNLRHANLGGADLSGAYLENADLTHANL 303


>gi|162455067|ref|YP_001617434.1| hypothetical protein sce6785 [Sorangium cellulosum So ce56]
 gi|161165649|emb|CAN96954.1| hypothetical protein sce6785 [Sorangium cellulosum So ce56]
          Length = 973

 Score = 39.3 bits (90), Expect = 0.64,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKAVHVKEN 127
           F     + A +A  +   ++LA  +  +A+ RG        + A+   A+L +A+  + N
Sbjct: 854 FAGADFSGATLAGANLMGTSLAGTDLSDADLRGALLNEADLTEARLDRANLAEAMLTRAN 913

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
             RA+  +AD+R+S  + ++  GA  EKA  + A
Sbjct: 914 LTRASLYAADLRQSILNSARVEGASFEKASLFSA 947


>gi|428317459|ref|YP_007115341.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428241139|gb|AFZ06925.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 197

 Score = 38.9 bits (89), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 8/80 (10%)

Query: 89  SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
           S LAD N  +A   G       A    A+L++AV ++ N R A+ + AD+R +DF  +  
Sbjct: 29  SDLADANLSQANLSG-------ANLVGANLQRAV-LRANLRGADLSGADLRGADFRNADL 80

Query: 149 NGAYLEKAVAYKANFTGTLI 168
            GA    A+   A+F G  +
Sbjct: 81  RGASFANALVRDASFGGAFL 100


>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
           [Acaryochloris sp. CCMEE 5410]
          Length = 514

 Score = 38.9 bits (89), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 10/65 (15%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
            +F + DLR A+ +  NF RANFT A++R           +D + +   GA L  A    
Sbjct: 410 GKFQNTDLRDAILINANFGRANFTGANLRNANLMQAYMSHADLANADLRGANLSDAYLSH 469

Query: 161 ANFTG 165
           AN  G
Sbjct: 470 ANLRG 474


>gi|427707611|ref|YP_007049988.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427360116|gb|AFY42838.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 521

 Score = 38.9 bits (89), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A   +ADLR+A   K N RRAN + A ++ S  +G+    A L  A  ++ + +G
Sbjct: 120 ANLSNADLREATLRKANLRRANLSEASLKGSSLAGTNLEMANLNAADLHRTDLSG 174


>gi|434398137|ref|YP_007132141.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428269234|gb|AFZ35175.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 223

 Score = 38.9 bits (89), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 4/72 (5%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADL + + +     G    A    ADL K      N  RAN   AD+ +++ +G+   GA
Sbjct: 129 ADLERADLQKTNLIG----ANLQGADLGKTNIAGANLERANLFDADLEKANLAGTNLAGA 184

Query: 152 YLEKAVAYKANF 163
            L+KA   K N 
Sbjct: 185 NLQKADLEKTNL 196


>gi|153871558|ref|ZP_02000700.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
 gi|152071976|gb|EDN69300.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
          Length = 179

 Score = 38.9 bits (89), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 29/57 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL + +    N   AN  SAD+ E+D SG+  +GA L +    +AN  G
Sbjct: 104 SGADLRWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLSRVDLSEANLEG 160


>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
 gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
          Length = 251

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 65/159 (40%), Gaps = 29/159 (18%)

Query: 33  PLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV--------------FVSTALAA 78
           P W  CQ       DG  P    + C+     L N  +              F S+ +A 
Sbjct: 22  PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74

Query: 79  AVVASCSSNISAL--ADLNKYEAET----RGEFGIG--SAAQFGSADLRKAVHVKENFRR 130
           A ++    + +    ADL+K         R  FG    + A FGSAD+ ++   +     
Sbjct: 75  ANLSETEVSRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNFAQVKAAG 134

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
           ANF+ +++  SDFSG+  +GA + KA   +  F    IA
Sbjct: 135 ANFSKSELNRSDFSGADLSGANISKAELARVLFQSAKIA 173


>gi|359464028|ref|ZP_09252591.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 238

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 31/54 (57%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           F  ADL +A+    +   A+ +SA +RE+D SG+K     L KA  YKA+  GT
Sbjct: 58  FREADLSEALLWGADLSGADLSSAILREADLSGAKLVQVNLAKANLYKASLCGT 111


>gi|288957355|ref|YP_003447696.1| hypothetical protein AZL_005140 [Azospirillum sp. B510]
 gi|288909663|dbj|BAI71152.1| hypothetical protein AZL_005140 [Azospirillum sp. B510]
          Length = 450

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           ADLRKA  V  N   A+ T AD+ E+D +G+   GA L  A
Sbjct: 395 ADLRKANLVGANLAGADLTGADLSEADLTGADLTGAMLTGA 435


>gi|440681606|ref|YP_007156401.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428678725|gb|AFZ57491.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 943

 Score = 38.9 bits (89), Expect = 0.70,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 34/61 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A F  A+L +A  ++ N   A+ +SAD+  +D S +  +GA L  A    ANF+G  +
Sbjct: 845 SEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSGANLSGANLEDANFSGAKL 904

Query: 169 A 169
           +
Sbjct: 905 S 905



 Score = 37.0 bits (84), Expect = 3.0,   Method: Composition-based stats.
 Identities = 22/61 (36%), Positives = 32/61 (52%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
            A    ADL +A     NF R N + A M  ++FS + FN A L +A   +AN TG  ++
Sbjct: 811 GANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANLTGADLS 870

Query: 170 T 170
           +
Sbjct: 871 S 871


>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 305

 Score = 38.9 bits (89), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 44/85 (51%), Gaps = 10/85 (11%)

Query: 90  ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK- 147
           A+ DL NKY+A  R      S  +    DLR     + NF+ A+F+ A++RE DFSG+  
Sbjct: 6   AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61

Query: 148 ----FNGAYLEKAVAYKANFTGTLI 168
               FN A L  A   +AN  G+ +
Sbjct: 62  SEAFFNEADLTGANLQEANLQGSYL 86


>gi|217977179|ref|YP_002361326.1| pentapeptide repeat-containing protein [Methylocella silvestris
           BL2]
 gi|217502555|gb|ACK49964.1| pentapeptide repeat protein [Methylocella silvestris BL2]
          Length = 260

 Score = 38.9 bits (89), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           NF +A M  ++FSG K +GA L++A A KANF G
Sbjct: 79  NFRAARMNNTNFSGGKLDGAVLDQAWALKANFAG 112


>gi|298242229|ref|ZP_06966036.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297555283|gb|EFH89147.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 270

 Score = 38.9 bits (89), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 5/55 (9%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  ADL  A     + R AN   A++RE+D S +   GA LE+A    ANF G
Sbjct: 188 ATFEPADLSGA-----DLRGANLHQANLREADLSNANLRGANLEQAQVEGANFQG 237


>gi|119486617|ref|ZP_01620667.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119456234|gb|EAW37366.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 710

 Score = 38.9 bits (89), Expect = 0.72,   Method: Composition-based stats.
 Identities = 20/61 (32%), Positives = 35/61 (57%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A     ++R+A  ++    +A+ + AD+  ++FS +K  GA LE+A    A FTGT +
Sbjct: 494 TGAFLSHINMRRANLLRATLNKADLSQADLTGANFSSAKLIGANLEQAKLNNAKFTGTDL 553

Query: 169 A 169
           A
Sbjct: 554 A 554


>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 346

 Score = 38.9 bits (89), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 2/99 (2%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
           NW       L+ A +A+   + + L+  N   A+    + IG+     S DLR+A     
Sbjct: 95  NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +  +A+ T A++R++D +G+K   + L  A    AN TG
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTG 191


>gi|167645176|ref|YP_001682839.1| pentapeptide repeat-containing protein [Caulobacter sp. K31]
 gi|167347606|gb|ABZ70341.1| pentapeptide repeat protein [Caulobacter sp. K31]
          Length = 419

 Score = 38.9 bits (89), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           I + A F  A L+ A  V+ N ++ANF  A++  +D SG+   GA L  AV   A
Sbjct: 166 IATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLTGADLRNAVLVGA 220


>gi|297172608|gb|ADI23577.1| uncharacterized low-complexity proteins [uncultured nuHF2 cluster
           bacterium HF0770_42C12]
          Length = 134

 Score = 38.9 bits (89), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 10/63 (15%)

Query: 116 ADLRKAVHVKENFRRA-----NFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 165
           ADL KAV ++ N  +A     N   A++RE+       SG+    A L+ AV + AN +G
Sbjct: 57  ADLSKAVLIRANLEKADLHAANLKGANLREARLYTASISGANLQKANLQGAVLWGANLSG 116

Query: 166 TLI 168
           T++
Sbjct: 117 TIL 119


>gi|118593941|ref|ZP_01551297.1| PipB-like protein [Stappia aggregata IAM 12614]
 gi|118433481|gb|EAV40152.1| PipB-like protein [Stappia aggregata IAM 12614]
          Length = 162

 Score = 38.9 bits (89), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            AA    A  + ++ V   F  AN TS +M  SDF+G+ F  A +   +A + NF
Sbjct: 7   DAANLTGASFKNSIGVNATFIEANLTSVEMNNSDFTGADFTKADMRHVIASETNF 61


>gi|399075150|ref|ZP_10751398.1| putative low-complexity protein [Caulobacter sp. AP07]
 gi|398039446|gb|EJL32581.1| putative low-complexity protein [Caulobacter sp. AP07]
          Length = 380

 Score = 38.9 bits (89), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           I + A F  A L+ A  V+ N ++ANF  A++  +D SG+   GA L  AV   A
Sbjct: 127 IATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLTGADLRNAVLVGA 181


>gi|428320140|ref|YP_007118022.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428243820|gb|AFZ09606.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 353

 Score = 38.9 bits (89), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A    A L +A+    +  RAN T A + E D +G+  +GA L +A   + N +G 
Sbjct: 137 ANLSGATLSRAIMSGVDLSRANLTRAILSEVDLTGANLSGATLTRAYLNRGNLSGV 192



 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 31/58 (53%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A    A LR+A     N   A+ + AD++ ++ +GS  +GA L+ A   KAN  G ++
Sbjct: 197 ASLSEASLREASICAANLSAADLSGADLQSANLNGSNLSGADLQGANLSKANLNGLIL 254


>gi|329850490|ref|ZP_08265335.1| pentapeptide repeat 8 copies family protein [Asticcacaulis
           biprosthecum C19]
 gi|328840805|gb|EGF90376.1| pentapeptide repeat 8 copies family protein [Asticcacaulis
           biprosthecum C19]
          Length = 163

 Score = 38.9 bits (89), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 31/65 (47%), Gaps = 7/65 (10%)

Query: 104 EFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           + GIG  S   F  ADLR        F RA+F  ADM  ++F      GAYLE A    A
Sbjct: 64  DLGIGIFSGTNFSGADLRDVNGSAALFGRASFAGADMTNANFV-----GAYLEHANFRGA 118

Query: 162 NFTGT 166
           N TG 
Sbjct: 119 NLTGV 123


>gi|158333662|ref|YP_001514834.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158303903|gb|ABW25520.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 238

 Score = 38.9 bits (89), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 31/54 (57%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           F  ADL +A+    +   A+ +SA +RE+D SG+K     L KA  YKA+  GT
Sbjct: 58  FREADLSEALLWGADLSGADLSSAVLREADLSGAKLVQVNLAKANLYKASLCGT 111


>gi|167741122|ref|ZP_02413896.1| pentapeptide repeat family protein [Burkholderia pseudomallei 14]
          Length = 328

 Score = 38.9 bits (89), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 246 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 303


>gi|443475902|ref|ZP_21065833.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019187|gb|ELS33316.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 133

 Score = 38.5 bits (88), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 45/90 (50%), Gaps = 8/90 (8%)

Query: 79  AVVASCSSNISALADLNKYEA--ETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTS 135
           A + S S+  SALAD N+     +TR   G   S  +   A+LR A     N R AN  S
Sbjct: 16  AAITSISAIESALADPNQIRQVLQTRECAGCNLSREKLSFANLRGA-----NLRNANLFS 70

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AD++ +D   +   GA L+KA    A+ TG
Sbjct: 71  ADLKLADLREANLIGAILDKADLRGADLTG 100


>gi|167826694|ref|ZP_02458165.1| pentapeptide repeat family protein [Burkholderia pseudomallei 9]
          Length = 326

 Score = 38.5 bits (88), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 244 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 301


>gi|118593342|ref|ZP_01550726.1| hypothetical protein SIAM614_00507 [Stappia aggregata IAM 12614]
 gi|118434020|gb|EAV40677.1| hypothetical protein SIAM614_00507 [Stappia aggregata IAM 12614]
          Length = 313

 Score = 38.5 bits (88), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
           A+   ADL +      +FR ++FT A M+ ++FS + F  A L KA A  ANF GT I 
Sbjct: 128 AKMNDADLSEG-----DFRNSDFTKAKMQRTNFSKATFREADLGKADARDANFDGTEIG 181


>gi|443654625|ref|ZP_21131408.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|443333765|gb|ELS48307.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 97

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  SGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|422303997|ref|ZP_16391346.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9806]
 gi|389790959|emb|CCI13207.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9806]
          Length = 179

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  SGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|386050184|ref|YP_005968175.1| gp51 protein [Listeria monocytogenes FSL R2-561]
 gi|346424030|gb|AEO25555.1| gp51 protein [Listeria monocytogenes FSL R2-561]
          Length = 211

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 28/48 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           S A   +ADLR AV +  +   A+ T AD+RE+   G+   GA LE++
Sbjct: 45  SEAILSNADLRNAVLLNIDLSDADLTRADLRETVLDGADLTGAELERS 92


>gi|167722130|ref|ZP_02405366.1| pentapeptide repeat family protein [Burkholderia pseudomallei DM98]
          Length = 323

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 241 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 298


>gi|159026911|emb|CAO89162.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 179

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  SGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|451981277|ref|ZP_21929641.1| putative Pentapeptide repeat protein [Nitrospina gracilis 3/211]
 gi|451761500|emb|CCQ90895.1| putative Pentapeptide repeat protein [Nitrospina gracilis 3/211]
          Length = 484

 Score = 38.5 bits (88), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  AV+   S   SALA  +  +A+ +G   +   A    A LR A  VK + + A+   
Sbjct: 377 LKEAVLGKASLKNSALAGADLRKAKLKG--AVLEGADLAGARLRHASLVKAHLKGADLHR 434

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFT 164
            ++ E+DFS +   GA L  A  ++AN T
Sbjct: 435 TELDEADFSNADLQGANLTGAKLWEANLT 463


>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
 gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
          Length = 243

 Score = 38.5 bits (88), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 9/63 (14%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
           LNKY+   R          F S  LR+    + N  + NF SAD+R+S    S FNGA L
Sbjct: 7   LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57

Query: 154 EKA 156
           ++A
Sbjct: 58  KQA 60


>gi|443314265|ref|ZP_21043839.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442786137|gb|ELR95903.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 887

 Score = 38.5 bits (88), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S ++F  A L  A  ++ +  R N    ++ E++F  ++F+GA L  +VA KANF
Sbjct: 233 SESEFRGAKLAHAKFIRADLSRTNLIRTNLAEANFERARFHGANLNNSVAKKANF 287



 Score = 36.6 bits (83), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 33/62 (53%), Gaps = 1/62 (1%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT-LI 168
           AA    A+L + + VK   R  N   A++ E+D S S+F GA L  A   +A+ + T LI
Sbjct: 199 AANLMYANLYRCIIVKSRLRGVNLLEANLEEADLSESEFRGAKLAHAKFIRADLSRTNLI 258

Query: 169 AT 170
            T
Sbjct: 259 RT 260



 Score = 35.4 bits (80), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
             SA+L  A   K + R     +AD+ ++D SGSK  GAYL      +AN +G 
Sbjct: 370 LNSANLTGACLHKSDLRGVTARNADLMDADLSGSKIQGAYLAGVNFERANLSGV 423


>gi|427714529|ref|YP_007063153.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
 gi|427378658|gb|AFY62610.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
          Length = 333

 Score = 38.5 bits (88), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A  G A+ R+A     + R AN T AD+ ES    +  +GA LEKA+   A+ T
Sbjct: 51  SFALLGRANFRRANLAGADLRGANLTQADLTESLLQEANLHGASLEKAILVGADIT 106



 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 29/55 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  A L + +  K NF +A FT AD+R ++  GS    A L +A   + + TG
Sbjct: 244 ANFSHAHLDQIIGEKANFTQAIFTKADLRRANLRGSTLKEARLIEAYLARTDLTG 298


>gi|113476301|ref|YP_722362.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110167349|gb|ABG51889.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 517

 Score = 38.5 bits (88), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 28/57 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    A+LR A   + N   AN   AD+  +D SGS  +GA L K+    AN  G
Sbjct: 138 SGANLKGANLRFAFITESNLIEANLEGADLSGADLSGSDLSGAELRKSNLTGANLNG 194


>gi|428777412|ref|YP_007169199.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
 gi|428691691|gb|AFZ44985.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
          Length = 333

 Score = 38.5 bits (88), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 29/53 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           S A   +ADL KA     N R AN   A++  +D SG+   GAYL +A  ++A
Sbjct: 198 SEANLFNADLSKANLKGANLRGANLIRANLERADLSGADLRGAYLNEAKMFEA 250


>gi|167848210|ref|ZP_02473718.1| pentapeptide repeat protein [Burkholderia pseudomallei B7210]
          Length = 333

 Score = 38.5 bits (88), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 251 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 308


>gi|386829057|ref|ZP_10116164.1| putative low-complexity protein [Beggiatoa alba B18LD]
 gi|386429941|gb|EIJ43769.1| putative low-complexity protein [Beggiatoa alba B18LD]
          Length = 284

 Score = 38.5 bits (88), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 28/59 (47%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIATE 171
              ADLR A   + + RRAN   AD++ +D   +    A L      KAN  G L ++E
Sbjct: 194 LQQADLRGAFLTEADLRRANLREADLQGADLQKADLREADLTNTNLRKANLQGALFSSE 252


>gi|434395385|ref|YP_007130332.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428267226|gb|AFZ33172.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 271

 Score = 38.5 bits (88), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           ++ L  +N  EA   G +  G  A    A+L KA     N  +ANFT AD+ +++F+G+ 
Sbjct: 188 MAELVQVNLSEAHLTGAYLFG--ANLSGANLFKADFRWTNLSKANFTGADLSQANFTGAN 245

Query: 148 FNGAYLEKAVAYKANFTG 165
            + A    A+  + +FTG
Sbjct: 246 LSKANFTGAMLNEVDFTG 263


>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
          Length = 273

 Score = 38.5 bits (88), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 8/101 (7%)

Query: 73  STALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFR 129
           +T L +AV +  S N +        ++  R    IG+    A+  ++DL +A   + NF+
Sbjct: 135 ATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAVFALAKLENSDLSEADCQQTNFQ 194

Query: 130 RAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           RAN     F   D RE++F+ +   GA L+K+    ANF G
Sbjct: 195 RANLAGSLFVRTDFREANFTDANLIGALLQKSQLGGANFRG 235


>gi|376003246|ref|ZP_09781060.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375328406|emb|CCE16813.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 264

 Score = 38.5 bits (88), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 5/51 (9%)

Query: 127 NFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGTLIATEH 172
           N RR NFT A++      +++ S + F+ A L  A+ Y+ANF GT +   H
Sbjct: 130 NLRRGNFTQANLAAVNLNQANLSHANFHEAVLINAIGYQANFYGTNLVNSH 180


>gi|428209167|ref|YP_007093520.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428011088|gb|AFY89651.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 163

 Score = 38.5 bits (88), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 29/55 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A   +ADL +A     N + AN  +AD+ E++  G+   GA L +A   KAN 
Sbjct: 61  SGANLQNADLDEANLQGANLQNANLQNADLEEANLQGANLQGANLIRADLEKANL 115


>gi|218437556|ref|YP_002375885.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218170284|gb|ACK69017.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 203

 Score = 38.5 bits (88), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 29/56 (51%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A  F   DLR A  V  N  +A+F+SA++  +D S +    A L  A+    NFTG
Sbjct: 116 ATDFRGTDLRGASLVGSNLGQADFSSANLSGADLSQADLEEAILRGALLRGTNFTG 171


>gi|428318770|ref|YP_007116652.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428242450|gb|AFZ08236.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 238

 Score = 38.5 bits (88), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 29/58 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A   +ADLR+++  + N  RAN + ADM  +D   +    A L      +A  TGT I
Sbjct: 142 ANLSNADLRRSLLFRVNLNRANLSGADMSFADVRDTNLQNAILSNTRLPRAQLTGTNI 199


>gi|425455322|ref|ZP_18835042.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9807]
 gi|389803857|emb|CCI17301.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9807]
          Length = 179

 Score = 38.5 bits (88), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  SGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|337286774|ref|YP_004626247.1| Ion transport 2 domain-containing protein [Thermodesulfatator
           indicus DSM 15286]
 gi|335359602|gb|AEH45283.1| Ion transport 2 domain protein [Thermodesulfatator indicus DSM
           15286]
          Length = 304

 Score = 38.5 bits (88), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 25/82 (30%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFT--------------------SADMRESDFSGSKF 148
            AA FG A+L+KA      FR A+FT                     AD+RE+DFSG+KF
Sbjct: 68  EAAGFGMANLKKARLFNAKFRHASFTKATLKGADAKCADFSLARLREADLREADFSGAKF 127

Query: 149 NGAYL-----EKAVAYKANFTG 165
             A+L     E A+   A+  G
Sbjct: 128 KEAHLNLSRVEGAIFKDADLRG 149



 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 28/52 (53%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            F   DL  A   + N +RA FT A+++ +DF+G+   GA LE   A  A F
Sbjct: 21  DFSGEDLAGAKFFRANLKRALFTGANLKGADFTGADLEGANLEGVDAEAAGF 72


>gi|332704968|ref|ZP_08425054.1| hypothetical protein LYNGBM3L_00780 [Moorea producens 3L]
 gi|332356320|gb|EGJ35774.1| hypothetical protein LYNGBM3L_00780 [Moorea producens 3L]
          Length = 520

 Score = 38.5 bits (88), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 31/58 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           + A    ADLR+A   + N  RA F  A++  +  S +   GA+L     +KA+F+G+
Sbjct: 118 TGANLSGADLREATLRQANLSRATFNEANLGGATLSQANLKGAHLNGTNLHKADFSGS 175


>gi|418019711|ref|ZP_12659144.1| putative low-complexity protein [Candidatus Regiella insecticola
           R5.15]
 gi|347604938|gb|EGY29471.1| putative low-complexity protein [Candidatus Regiella insecticola
           R5.15]
          Length = 381

 Score = 38.5 bits (88), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           S       DL K    + N  +AN T A++RE D +G+   GA LE+A   +A
Sbjct: 76  SHTYLAGLDLSKMDLSRVNLEKANLTGANLREMDLTGANLTGANLERARLVRA 128


>gi|392382619|ref|YP_005031816.1| conserved protein of unknown function; Pentapeptide repeat
           [Azospirillum brasilense Sp245]
 gi|356877584|emb|CCC98426.1| conserved protein of unknown function; Pentapeptide repeat
           [Azospirillum brasilense Sp245]
          Length = 439

 Score = 38.5 bits (88), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 26/47 (55%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           AA    ADLR+A+       +AN T AD+  +D  G+   GA L++A
Sbjct: 383 AANLMGADLRQAMLTDSRMVQANLTDADLESADLDGADLAGAKLQRA 429


>gi|149922858|ref|ZP_01911281.1| serine/threonine kinase [Plesiocystis pacifica SIR-1]
 gi|149816325|gb|EDM75829.1| serine/threonine kinase [Plesiocystis pacifica SIR-1]
          Length = 655

 Score = 38.5 bits (88), Expect = 0.97,   Method: Composition-based stats.
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A+ G   L KA  ++ +  RA+   AD+R + F+ +  +GA L +A+ + A+F
Sbjct: 556 SGARLGGLRLDKAEFIQASMARAHLRGADLRRARFNHADLSGADLREAIVWNADF 610


>gi|76819210|ref|YP_336861.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           1710b]
 gi|76583683|gb|ABA53157.1| pentapeptide repeat family protein [Burkholderia pseudomallei
           1710b]
          Length = 862

 Score = 38.5 bits (88), Expect = 0.99,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 780 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 837


>gi|425447143|ref|ZP_18827135.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9443]
 gi|389732374|emb|CCI03682.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9443]
          Length = 179

 Score = 38.5 bits (88), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  SGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|409993510|ref|ZP_11276649.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291566334|dbj|BAI88606.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409935658|gb|EKN77183.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 259

 Score = 38.5 bits (88), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 2/71 (2%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           N   A+ R E  +G A   G  DLR+A     N  +AN   A + ESD   S    A+L 
Sbjct: 107 NLQRADLR-EADLGQA-MLGRVDLRRADLRDANLFQANLGQAYLEESDLMNSNLQRAFLF 164

Query: 155 KAVAYKANFTG 165
           +A   +AN TG
Sbjct: 165 RANLERANLTG 175


>gi|443478905|ref|ZP_21068593.1| serine/threonine protein kinase with pentapeptide repeats
           [Pseudanabaena biceps PCC 7429]
 gi|443015732|gb|ELS30565.1| serine/threonine protein kinase with pentapeptide repeats
           [Pseudanabaena biceps PCC 7429]
          Length = 545

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 29/51 (56%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           ADL  A  +  + R AN  SA M ++D SG+  +GA L+ A   +AN  GT
Sbjct: 459 ADLGSASMILADMREANLQSAYMSKADLSGANLSGANLKGAYLSQANLNGT 509


>gi|229490072|ref|ZP_04383924.1| pentapeptide repeat protein [Rhodococcus erythropolis SK121]
 gi|229323028|gb|EEN88797.1| pentapeptide repeat protein [Rhodococcus erythropolis SK121]
          Length = 470

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           ++A    ADLR+A     +   AN T AD+ +++   ++   A L KAV   A+  GT +
Sbjct: 324 TSANLSEADLREANLTDAHLSSANLTKADLTKANLKDARMPAANLTKAVLVDADLRGTFL 383

Query: 169 A 169
           A
Sbjct: 384 A 384



 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 35/64 (54%), Gaps = 5/64 (7%)

Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A+  +A+L KAV V  + R      AN T A + +++ +G++F  A L  A  + AN TG
Sbjct: 361 ARMPAANLTKAVLVDADLRGTFLAEANLTGAFLHDANLTGTQFGAANLSGASLHGANLTG 420

Query: 166 TLIA 169
             +A
Sbjct: 421 AWLA 424


>gi|381204843|ref|ZP_09911914.1| hypothetical protein SclubJA_04390 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 214

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 33/67 (49%), Gaps = 6/67 (8%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 165
           A    ADLR+      N   AN   A++R++     D SG+K + A L  AV   AN TG
Sbjct: 65  ASLVGADLRRVDLSGANLSNANLVGANLRKANLTGADLSGAKLSNANLTGAVLSSANLTG 124

Query: 166 T-LIATE 171
           T L+  E
Sbjct: 125 TNLLGVE 131


>gi|428313912|ref|YP_007124889.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255524|gb|AFZ21483.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 149

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 25/54 (46%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           Q  SA L+KA     N   AN   AD+  +D  G+   GA LE A    AN  G
Sbjct: 62  QLSSASLKKAQLTNANLSGANLKGADLENADLRGANLKGANLELANLSGANLEG 115


>gi|186683437|ref|YP_001866633.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186465889|gb|ACC81690.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 176

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 4/119 (3%)

Query: 41  SSKTESDGQFPDCSNNQCAGPY--AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYE 98
           SS  E+D    D +N   +G      + +  +    A+  A ++     ++ L + N  E
Sbjct: 55  SSLIEADLNGADLTNANLSGSNLSGAILDGAILDGAAMEGANLSQADLTVAKLIETNLSE 114

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           A+ +    I  AA    ADL  A     +  +AN T AD+ +++ SG+  +GA +E  +
Sbjct: 115 ADLQEASLI--AANLDGADLSGADLTVADLSQANLTQADLNQTNLSGANLDGANIEGTI 171


>gi|37523524|ref|NP_926901.1| hypothetical protein gll3955 [Gloeobacter violaceus PCC 7421]
 gi|35214528|dbj|BAC91896.1| gll3955 [Gloeobacter violaceus PCC 7421]
          Length = 159

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 58/135 (42%), Gaps = 32/135 (23%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADL-NKY-----EAETRGEFGIGSA---------AQ 112
           WR  V   LAA +V      +SA AD+ N Y     E  +  E  +  A           
Sbjct: 2   WRSGVLAGLAAGLV--LPGLVSAQADIQNNYNGAYLEGRSVAEQNLKQAQFYKANLRGVD 59

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV---AYK--------- 160
           F S+DLR A     + R ANF  A + +++ S +   GA L++AV   AY          
Sbjct: 60  FSSSDLRGASLFAASLRGANFNKARLDDAELSNADLQGAKLDQAVLAGAYMTAARLKDVS 119

Query: 161 ---ANFTGTLIATEH 172
              A+FTGT+I  + 
Sbjct: 120 VDGADFTGTIINNQQ 134


>gi|411117186|ref|ZP_11389673.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410713289|gb|EKQ70790.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 544

 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 30/58 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S A     +LR+A   + N R AN T A++R +D SG+  + A L  A    AN TG 
Sbjct: 173 SGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGV 230



 Score = 37.0 bits (84), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 68  WRVFVSTALAAAVVASC---SSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           W  F S  L+ A +      SS++S  ADL+  E            A    A+LR A   
Sbjct: 149 WATFTSANLSQANLHGTDLSSSDLSG-ADLSYTELRQ---------ANLSRANLRGANLT 198

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
             N R A+ + AD+  +D SG++ +GA L       AN  GT++
Sbjct: 199 GANLRWADLSGADLSWADLSGARLSGANLTGVNLSYANLLGTIL 242


>gi|254264016|ref|ZP_04954881.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
 gi|254215018|gb|EET04403.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
          Length = 825

 Score = 38.5 bits (88), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|254299592|ref|ZP_04967041.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
 gi|418542641|ref|ZP_13108060.1| type VI secretion system [Burkholderia pseudomallei 1258a]
 gi|418549165|ref|ZP_13114243.1| type VI secretion system [Burkholderia pseudomallei 1258b]
 gi|157809489|gb|EDO86659.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
 gi|385355180|gb|EIF61399.1| type VI secretion system [Burkholderia pseudomallei 1258a]
 gi|385356028|gb|EIF62174.1| type VI secretion system [Burkholderia pseudomallei 1258b]
          Length = 825

 Score = 38.5 bits (88), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|431931296|ref|YP_007244342.1| low-complexity protein [Thioflavicoccus mobilis 8321]
 gi|431829599|gb|AGA90712.1| putative low-complexity protein [Thioflavicoccus mobilis 8321]
          Length = 284

 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 14/104 (13%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETR----GEFGIGSAAQFGSADLRKA--VHV---KE 126
           L  A +  C  N + LA  + ++A+      G F + + A F SADLR A   HV     
Sbjct: 162 LCGADLRDCHLNDANLAMASLHDADLSSKQPGGFTVINLANFESADLRGANLRHVLAQDV 221

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIAT 170
           N R AN T     + DF+ +   GA L +A    ANF+G  +A+
Sbjct: 222 NMRNANLT-----DVDFTDAVIGGAILRRADVTNANFSGVELAS 260


>gi|254411535|ref|ZP_05025312.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182036|gb|EDX77023.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 125

 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 28/54 (51%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            A    ADLR+A     N   AN   AD+RE++ +G+   GA++  A   +AN 
Sbjct: 22  GAHLIGADLREANLQGANLSHANLEGADLREANLAGANLTGAFVTNADMKEANL 75


>gi|167913453|ref|ZP_02500544.1| pentapeptide repeat family protein [Burkholderia pseudomallei 112]
 gi|403521532|ref|YP_006657101.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           BPC006]
 gi|403076599|gb|AFR18178.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           BPC006]
          Length = 825

 Score = 38.5 bits (88), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|425472052|ref|ZP_18850903.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389881943|emb|CCI37532.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 330

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 10/87 (11%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
           +C++N+ A    N Y A+ +G       A     +L +A  V  NF+ AN    D+R++D
Sbjct: 143 NCNTNLQAA---NLYRADLQG-------ANMKGVNLVRANLVGANFKEANLCDVDLRKAD 192

Query: 143 FSGSKFNGAYLEKAVAYKANFTGTLIA 169
            + +   GA L  A    A   G  +A
Sbjct: 193 LTNANLQGALLTDANLIGARLVGANLA 219


>gi|256397701|ref|YP_003119265.1| pentapeptide repeat-containing protein [Catenulispora acidiphila
           DSM 44928]
 gi|256363927|gb|ACU77424.1| pentapeptide repeat protein [Catenulispora acidiphila DSM 44928]
          Length = 354

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 9/94 (9%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-- 127
           V VS   A   +A  +     LADL       R    + + A F  ADLR+AV  K    
Sbjct: 218 VSVSLQHAEMRLAKLTEARCVLADLRG----ARMAEAVLNGADFTRADLREAVLRKTQAQ 273

Query: 128 ---FRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
              F  A+  +AD+R +D S ++ +GA  E AVA
Sbjct: 274 NTVFHHADLRNADLRGADLSSAELDGARFEGAVA 307


>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 293

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 10/69 (14%)

Query: 110 AAQFGSADLRKAVHVKENFRRAN----------FTSADMRESDFSGSKFNGAYLEKAVAY 159
           +A    A+L  A+ ++ N ++AN          FT AD+ E D S ++ NG  L +A+  
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222

Query: 160 KANFTGTLI 168
            A   G  I
Sbjct: 223 GAKLRGVSI 231


>gi|254417642|ref|ZP_05031376.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196175560|gb|EDX70590.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 436

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +A    ADLR+A     + R AN + AD+RE++ SG+    A L  A   +A F
Sbjct: 348 SADLSDADLREANLSGADLREANLSGADLREANLSGADLREANLSGANVKQAKF 401


>gi|87125517|ref|ZP_01081362.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
 gi|86166817|gb|EAQ68079.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
          Length = 180

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 5/67 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKA 161
           +G  A F  A+L  A+  +  F  A+F  AD     M  +DFSG+   G  L   +A  +
Sbjct: 75  VGRGADFSDANLHGAIFTQGAFANADFHGADLSDALMDRADFSGTDLRGTLLSGVIASGS 134

Query: 162 NFTGTLI 168
           +F G  I
Sbjct: 135 SFAGAQI 141


>gi|427739890|ref|YP_007059434.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427374931|gb|AFY58887.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 447

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 22/35 (62%)

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           ANF  A +RE+D SG+   G   +KAV Y+ N +G
Sbjct: 372 ANFAEASLREADLSGANLMGTDFQKAVLYETNLSG 406


>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 213

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 30/59 (50%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           G   Q   A+L      + +  RAN   A++  ++F+GSKF GA+LE A    AN   T
Sbjct: 9   GELKQLAGANLEDENLSQTDLSRANLAGANLVGTNFAGSKFEGAHLEGANLMGANLKET 67


>gi|209523485|ref|ZP_03272040.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376006316|ref|ZP_09783597.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
 gi|423064680|ref|ZP_17053470.1| hypothetical protein SPLC1_S204920 [Arthrospira platensis C1]
 gi|209496227|gb|EDZ96527.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375325207|emb|CCE19350.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
 gi|406713923|gb|EKD09091.1| hypothetical protein SPLC1_S204920 [Arthrospira platensis C1]
          Length = 259

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 2/71 (2%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           N   A+ R E  +G A   G  DLR+A     N  +AN   A + ESD   S    A+L 
Sbjct: 107 NLQRADLR-EADLGQA-MLGRVDLRRADLRDANLFQANLGQAYLEESDLMNSNLQRAFLF 164

Query: 155 KAVAYKANFTG 165
           +A   +AN TG
Sbjct: 165 RANLERANLTG 175


>gi|254413108|ref|ZP_05026880.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196180272|gb|EDX75264.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 885

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 5/57 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A+ G+ADLR A  V      A+ + AD+RE+D +     GA L+KA   + NF G
Sbjct: 818 SWAKLGNADLRGAELVGAKLVGASLSGADLREADLT-----GANLDKADLSEVNFEG 869


>gi|443663577|ref|ZP_21133147.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159026610|emb|CAO86542.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443331854|gb|ELS46494.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 145

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 7/68 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A+ RG   IG       ADLRKA     N   AN   AD+ +++  G+    A+L  A  
Sbjct: 46  ADLRGAHLIG-------ADLRKANLQGANLEEANLEGADLTDANLEGANLTAAFLTNASL 98

Query: 159 YKANFTGT 166
            +AN  G 
Sbjct: 99  NQANLNGV 106


>gi|448412419|ref|ZP_21576534.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
 gi|445668180|gb|ELZ20811.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
          Length = 561

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 29/57 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +A    S D   A     +FRRA   +A++R++D  G+ F GA L  A    A+ TG
Sbjct: 251 TAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTG 307


>gi|427420948|ref|ZP_18911131.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425756825|gb|EKU97679.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 193

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 31/58 (53%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A+   A+L K    + N        AD+RE++ +G+  NGA L+ A    A+ TGT++
Sbjct: 71  AELSDANLSKTNLQEANLMLTALQGADLREANLAGASMNGARLQGADLRGADLTGTVL 128


>gi|53715998|ref|YP_106439.1| pentapeptide repeat-containing protein [Burkholderia mallei ATCC
           23344]
 gi|121597894|ref|YP_990510.1| pentapeptide repeat-containing protein [Burkholderia mallei SAVP1]
 gi|124382797|ref|YP_001025000.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
           10229]
 gi|126447556|ref|YP_001079344.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
           10247]
 gi|166999172|ref|ZP_02265018.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
 gi|238561876|ref|ZP_00441284.2| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
           4]
 gi|254176522|ref|ZP_04883180.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
 gi|254203434|ref|ZP_04909795.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
 gi|254205313|ref|ZP_04911666.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
 gi|254356120|ref|ZP_04972397.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
 gi|52421968|gb|AAU45538.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 23344]
 gi|121225692|gb|ABM49223.1| pentapeptide repeat family protein [Burkholderia mallei SAVP1]
 gi|126240410|gb|ABO03522.1| pentapeptide repeat family protein [Burkholderia mallei NCTC 10247]
 gi|147745673|gb|EDK52752.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
 gi|147754899|gb|EDK61963.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
 gi|148025103|gb|EDK83272.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
 gi|160697564|gb|EDP87534.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
 gi|238523698|gb|EEP87135.1| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
           4]
 gi|243064727|gb|EES46913.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
 gi|261826983|gb|ABM99323.2| pentapeptide repeat family protein [Burkholderia mallei NCTC 10229]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|425434939|ref|ZP_18815403.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
           9432]
 gi|389675416|emb|CCH95473.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
           9432]
          Length = 470

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 10/67 (14%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAY 159
           + I S A+   A+LR+A     N R AN + AD+R+++ SG+   GA L +     A+  
Sbjct: 324 WAILSGAKLSGANLREA-----NLREANLSGADLRKANLSGANLWGAILIEANLWGAILI 378

Query: 160 KANFTGT 166
           +AN  G 
Sbjct: 379 EANLRGV 385


>gi|226194659|ref|ZP_03790253.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
 gi|386863935|ref|YP_006276883.1| type VI secretion system [Burkholderia pseudomallei 1026b]
 gi|418534996|ref|ZP_13100802.1| type VI secretion system [Burkholderia pseudomallei 1026a]
 gi|225933225|gb|EEH29218.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
 gi|385357281|gb|EIF63347.1| type VI secretion system [Burkholderia pseudomallei 1026a]
 gi|385661063|gb|AFI68485.1| type VI secretion system [Burkholderia pseudomallei 1026b]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|217423045|ref|ZP_03454547.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
 gi|217393953|gb|EEC33973.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|254182800|ref|ZP_04889393.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
 gi|184213334|gb|EDU10377.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|167921391|ref|ZP_02508482.1| pentapeptide repeat protein [Burkholderia pseudomallei BCC215]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|254189534|ref|ZP_04896044.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
           52237]
 gi|157937212|gb|EDO92882.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
           52237]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|126455703|ref|YP_001074295.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           1106a]
 gi|167896768|ref|ZP_02484170.1| pentapeptide repeat protein [Burkholderia pseudomallei 7894]
 gi|242312992|ref|ZP_04812009.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
 gi|254195379|ref|ZP_04901807.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
 gi|126229471|gb|ABN92884.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106a]
 gi|169652126|gb|EDS84819.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
 gi|242136231|gb|EES22634.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 114

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S+  F  ADLR A     N R A        E DF+G+  + A +++AV  KANFT  ++
Sbjct: 1   SSQNFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFTNAIL 60


>gi|194336259|ref|YP_002018053.1| pentapeptide repeat-containing protein [Pelodictyon
           phaeoclathratiforme BU-1]
 gi|194308736|gb|ACF43436.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 180

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 11/129 (8%)

Query: 35  WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
           ++ C  +S   S  +  DC    C    AKLKN      ++L      +C       +D 
Sbjct: 21  FIHCNFNSADLSGVRMIDCRFEGCDLSLAKLKN------SSLQKVKFVNCKLLGVLFSDC 74

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
            K+  +   +  I   + F    L+    +  + + A+F+ AD+     SG+KF+G+ L 
Sbjct: 75  RKFMLDLDFDRCILKLSLFAGLKLKNTRFINCDLQEADFSEADL-----SGAKFDGSDLL 129

Query: 155 KAVAYKANF 163
           + + + +N 
Sbjct: 130 QTIFFHSNL 138


>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
          Length = 194

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 6/99 (6%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L  W+  V T +   +VA+    + +LA  +      RG       A F   DLR ++  
Sbjct: 32  LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             N R A+FT A+++ + F  +  +GA LE A A   +F
Sbjct: 86  HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDF 124


>gi|167905147|ref|ZP_02492352.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
 gi|237508538|ref|ZP_04521253.1| pentapeptide repeat family protein [Burkholderia pseudomallei
           MSHR346]
 gi|235000743|gb|EEP50167.1| pentapeptide repeat family protein [Burkholderia pseudomallei
           MSHR346]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|17228826|ref|NP_485374.1| hypothetical protein alr1331 [Nostoc sp. PCC 7120]
 gi|17130678|dbj|BAB73288.1| alr1331 [Nostoc sp. PCC 7120]
          Length = 953

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A  G   LR A   + +FR      A+ + AD+ E+D + +  +GAYL  A    A+ 
Sbjct: 850 SGADLGDTSLRGAFLREADFRGAYLDAADLSGADLTEADLTEANLSGAYLSGAYLSNADL 909

Query: 164 TGTLIATE 171
           +G  ++ E
Sbjct: 910 SGAYLSDE 917


>gi|134280632|ref|ZP_01767342.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
 gi|134247654|gb|EBA47738.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|126442493|ref|YP_001061349.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           668]
 gi|126221984|gb|ABN85489.1| pentapeptide repeat protein [Burkholderia pseudomallei 668]
          Length = 825

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANFTGTL 167
           +ADLR A      F RA+ T AD+R++D       G+K +GA L +A  ++AN +  L
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQIL 800


>gi|428217541|ref|YP_007102006.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427989323|gb|AFY69578.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 353

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A FGSA+L  A   + N  +AN   AD+ ++D  G+K  G  L +A   +AN 
Sbjct: 54  ANFGSANLLGANLSEANLTKANLREADLYKADLGGAKLIGTSLIRAYLREANL 106


>gi|300863681|ref|ZP_07108615.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300338313|emb|CBN53761.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 238

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 7/75 (9%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L +++   AE +G        +F  ADLR++   K NF +A+F   D+ ES   G+    
Sbjct: 22  LQEIDLLNAELQG-------IEFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLTE 74

Query: 151 AYLEKAVAYKANFTG 165
           A L +AV  +A+ +G
Sbjct: 75  ANLYRAVLREADLSG 89


>gi|126661305|ref|ZP_01732374.1| hypothetical protein CY0110_08576 [Cyanothece sp. CCY0110]
 gi|126617401|gb|EAZ88201.1| hypothetical protein CY0110_08576 [Cyanothece sp. CCY0110]
          Length = 368

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
             +    +L  A   + NFR AN T AD+ E     + FSG+  +GAYL  A   KA+F
Sbjct: 246 GTELSGVELNGANLTQSNFRGANLTDADLSEAILSYTRFSGADLSGAYLGNANLQKADF 304


>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 340

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 30/58 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S +    A+L  AV +  NF  AN +     ESDFS +  + A L +A+  K N TG+
Sbjct: 102 SESNLSRANLGNAVAIAANFIMANLSGTYFSESDFSRANLSSANLTEAILVKTNLTGS 159



 Score = 36.6 bits (83), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 12/107 (11%)

Query: 66  KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 116
            NWR  VF S        L+AA ++S + +++ L  +N   A  ++      S A  G A
Sbjct: 18  NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +L +A  +  N   ANF  AD+  +  S S  + A L  AVA  ANF
Sbjct: 75  NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANF 121


>gi|218440036|ref|YP_002378365.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172764|gb|ACK71497.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 210

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 26/54 (48%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           F  ADLRK      N   AN   AD+RE +  G+   GA  + A   +AN  GT
Sbjct: 31  FTGADLRKIDLSNVNLINANLAGADLREVNLIGADLTGANFDGADLTEANLIGT 84


>gi|423067871|ref|ZP_17056661.1| endoribonuclease L-PSP [Arthrospira platensis C1]
 gi|406710614|gb|EKD05821.1| endoribonuclease L-PSP [Arthrospira platensis C1]
          Length = 379

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-KFNGAYLEKAVAYKANFTGTLI 168
           + +F  ADL  A  VK + R  +F+ AD+  + F  S +F+   ++KA+    NF+G  +
Sbjct: 69  SVRFVKADLTNACLVKSDLRDIDFSRADLTGAQFQASRRFSDIKIDKAIIKNVNFSGIKL 128

Query: 169 A 169
           A
Sbjct: 129 A 129


>gi|288957041|ref|YP_003447382.1| hypothetical protein AZL_002000 [Azospirillum sp. B510]
 gi|288909349|dbj|BAI70838.1| hypothetical protein AZL_002000 [Azospirillum sp. B510]
          Length = 424

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           AA F +  L  A   + + R ANF+ AD+R +D +GS   GA LE A
Sbjct: 166 AADFTNTRLAGARLDRTDLRDANFSGADLRGADLNGSDLRGAILEGA 212


>gi|428215879|ref|YP_007089023.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004260|gb|AFY85103.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 284

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S +QF SA L+ A  V+ N  +    +AD+R +D S +   G+ L +A   + N TG
Sbjct: 43  SHSQFCSAILQGATLVEANLEQTKLRAADLRRADLSHANLMGSDLSRADMIETNLTG 99



 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 12/73 (16%)

Query: 109 SAAQFGSADLRKAVHVKE------NFRRANFTSADMRESDFSGSKFNGA-----YLEKAV 157
           S      A+L++A H+ E      N  RAN +  D+ E+D SG+  + A      L +A+
Sbjct: 133 SGINLSGANLQEA-HIAEVSFHNANLSRANLSGLDLSETDLSGANLSYADLSDTQLTEAI 191

Query: 158 AYKANFTGTLIAT 170
            Y AN TG ++ +
Sbjct: 192 LYGANLTGAILTS 204


>gi|17230797|ref|NP_487345.1| hypothetical protein all3305 [Nostoc sp. PCC 7120]
 gi|17132400|dbj|BAB75004.1| all3305 [Nostoc sp. PCC 7120]
          Length = 496

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)

Query: 117 DLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +LR+   +  N +RA+     F  AD+R  DFSG+   GA L  ++ ++ANF
Sbjct: 234 NLRRVELLGANLQRADLRGCDFRGADLRGCDFSGANLEGAELAGSILFEANF 285


>gi|218440259|ref|YP_002378588.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172987|gb|ACK71720.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 340

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 24/38 (63%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
           A+LR+A+    N R  N  SAD+ E+DF G+  +GA L
Sbjct: 246 ANLRQAILTYANLRGCNLLSADLAEADFEGANLSGAGL 283


>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
 gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
 gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
 gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
 gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
 gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
          Length = 196

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 6/99 (6%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L  W+  V T +   +VA+    + +LA  +      RG       A F   DLR ++  
Sbjct: 34  LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             N R A+FT A+++ + F  +  +GA LE A A   +F
Sbjct: 88  HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDF 126


>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 169

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 10/111 (9%)

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISALA-DLNKYEAETRGEFGIG---SAAQFGSAD 117
           Y  L +  +FV+ +L A  +    + + ALA D N+ EA    +F  G   + AQF  A+
Sbjct: 10  YTFLFSLVLFVAVSLVAIAI----NPVPALALDYNR-EALVGADFS-GRDLTDAQFTKAN 63

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           LR +     N +  +F +A+M +++F G+   GA L+ A   K N T  ++
Sbjct: 64  LRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLTNAIL 114


>gi|440233072|ref|YP_007346865.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
 gi|440054777|gb|AGB84680.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
          Length = 846

 Score = 38.1 bits (87), Expect = 1.3,   Method: Composition-based stats.
 Identities = 32/141 (22%), Positives = 60/141 (42%), Gaps = 7/141 (4%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           L K  ++   + + + S      CS  +     A+     +    A + +V+     + +
Sbjct: 670 LHKTTFMKTTLEAASFSGASLESCSWVESHAEQARFDGATLVTCAAASESVLNGADFSNA 729

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFS 144
            L   N  +   RG     + A+  ++DL +A     +F RAN     F  +D R+++FS
Sbjct: 730 TLKQCNLRQTPLRG--ARFTLAKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFS 787

Query: 145 GSKFNGAYLEKAVAYKANFTG 165
            +   GA L+K++   A F G
Sbjct: 788 DANLMGAILQKSLLGGARFNG 808


>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 439

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 30/54 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           + F SADL KA     N    NF+ ADM +++  G+   GA L++A   +A+ +
Sbjct: 301 SDFESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLS 354


>gi|116753519|ref|YP_842637.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
           PT]
 gi|116664970|gb|ABK13997.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
          Length = 862

 Score = 38.1 bits (87), Expect = 1.3,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 2/65 (3%)

Query: 101 TRGE-FGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           TR E FG   S      ADL KA  ++ N   A+ T A + ++DFSG+   GA + + V 
Sbjct: 669 TRAELFGADLSGTDLSGADLVKAYALRANLSGADLTDAKLDDADFSGAILRGAKMPELVI 728

Query: 159 YKANF 163
              NF
Sbjct: 729 RSVNF 733



 Score = 37.4 bits (85), Expect = 1.9,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 26/52 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
           A F  A LR A   +   R  NF  AD+ ++D SG +F   Y+  AV   AN
Sbjct: 711 ADFSGAILRGAKMPELVIRSVNFGQADLSDADMSGCRFEALYVSNAVMRSAN 762


>gi|75911159|ref|YP_325455.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704884|gb|ABA24560.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 489

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)

Query: 117 DLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +LR+   +  N +RA+     F  AD+R  DFSG+   GA L  ++ ++ANF
Sbjct: 227 NLRRVELLGANLQRADLRGCDFRGADLRGCDFSGANLEGAELAGSILFEANF 278


>gi|434384824|ref|YP_007095435.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428015814|gb|AFY91908.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 377

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           DL +   ++ N +RAN   A++  +D  G+   GA L+KA   +AN  G
Sbjct: 200 DLAQTNLIRANLKRANLQGANLEGADLEGANLQGANLKKANLKRANLQG 248


>gi|428211266|ref|YP_007084410.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|427999647|gb|AFY80490.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 279

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 31/58 (53%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           + S  + G A+L++A   + N  RAN + AD+  ++  G+    A L +A+  K N T
Sbjct: 116 LASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAILIQAIMEKVNLT 173


>gi|428226754|ref|YP_007110851.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427986655|gb|AFY67799.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 330

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 12/72 (16%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           Y+A+ RG            ADL      + + R AN T A +RE+D SG+  +GA L  A
Sbjct: 154 YKADLRG-------VNLSGADL-----TRVDLREANLTEASLRETDLSGADLSGANLTGA 201

Query: 157 VAYKANFTGTLI 168
           +   A   G ++
Sbjct: 202 LLSDACLEGAIL 213


>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 171

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           F  A+LR +     + R  +F +A+M E++  G+ F GA L+ A   KAN T  ++
Sbjct: 60  FNKANLRNSNFTNADLRGVSFFAANMEEANLEGANFTGATLDLARMMKANLTNAIL 115


>gi|23014351|ref|ZP_00054172.1| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
           magnetotacticum MS-1]
          Length = 164

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 1/71 (1%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           R + G+ S A F  ADL  A  V+ + RRA F  A +R +D +G+K  GA L  A    A
Sbjct: 87  RLDDGLFSDADFTKADLGGASLVRADLRRARFFHASLRGADLTGAKTLGAELLNADLSGA 146

Query: 162 NFT-GTLIATE 171
            +T G  I  E
Sbjct: 147 RWTDGKTICAE 157


>gi|116754331|ref|YP_843449.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
           PT]
 gi|116665782|gb|ABK14809.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
          Length = 389

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 13/125 (10%)

Query: 46  SDGQFPDCSNNQCAGP---YAKLKNWRVFVSTALAAAV-VASCSSNISALADLNKYEAET 101
           +D    D S    +G     AKL+N R+  ++ + A + +A C+  +  + D++  +AE 
Sbjct: 99  ADLSMADLSGANLSGTDLSRAKLRNARLSGASLVNANLTMADCTEAL--MDDVSLEDAEM 156

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
            G        +F   DL  AV    +   ANF  A +  +D S S+F  +   +A  Y A
Sbjct: 157 TG-------TRFFRTDLTGAVFSGASLSHANFVGAHLSWADMSRSRFRESQFSRAELYGA 209

Query: 162 NFTGT 166
           N TGT
Sbjct: 210 NLTGT 214


>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 319

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           Q   ADLR       +FR  + + A++RE DF+G+    AYL +A     NFT
Sbjct: 25  QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFT 77


>gi|442319041|ref|YP_007359062.1| pentapeptide repeat-containing protein [Myxococcus stipitatus DSM
           14675]
 gi|441486683|gb|AGC43378.1| pentapeptide repeat-containing protein [Myxococcus stipitatus DSM
           14675]
          Length = 344

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +  F RA+ + A MR +DF+G+ F GA L  A   ++NF G
Sbjct: 75  RVRFVRADLSEAFMRSADFTGADFTGANLRNAEMGRSNFAG 115


>gi|417147800|ref|ZP_11988300.1| pentapeptide repeat protein [Escherichia coli 1.2264]
 gi|432414449|ref|ZP_19657095.1| hypothetical protein WG9_04959 [Escherichia coli KTE39]
 gi|432449036|ref|ZP_19691321.1| hypothetical protein A13S_05120 [Escherichia coli KTE191]
 gi|432639030|ref|ZP_19874892.1| hypothetical protein A1UY_04405 [Escherichia coli KTE81]
 gi|433026911|ref|ZP_20214794.1| hypothetical protein WI9_05012 [Escherichia coli KTE106]
 gi|433186914|ref|ZP_20371055.1| hypothetical protein WGO_05288 [Escherichia coli KTE85]
 gi|215272912|emb|CAT00693.1| protein mcbG [Escherichia coli]
 gi|386162365|gb|EIH24165.1| pentapeptide repeat protein [Escherichia coli 1.2264]
 gi|430931206|gb|ELC51659.1| hypothetical protein WG9_04959 [Escherichia coli KTE39]
 gi|430969334|gb|ELC86475.1| hypothetical protein A13S_05120 [Escherichia coli KTE191]
 gi|431167788|gb|ELE68043.1| hypothetical protein A1UY_04405 [Escherichia coli KTE81]
 gi|431524910|gb|ELI01731.1| hypothetical protein WI9_05012 [Escherichia coli KTE106]
 gi|431695578|gb|ELJ60883.1| hypothetical protein WGO_05288 [Escherichia coli KTE85]
          Length = 187

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 29/56 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
             F S  L+K++ +   FR   F   D+R+SDF+GS+FN      +     +F+ T
Sbjct: 97  VDFISLRLQKSIFLSSRFRDCLFEETDLRKSDFTGSEFNNTEFRHSDLSHCDFSMT 152


>gi|443324431|ref|ZP_21053184.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442795950|gb|ELS05284.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 239

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 5/56 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTG 165
           SADL +++    +F +A+ + A MRE+D SG+    A LEKA   K     ANF+G
Sbjct: 59  SADLSESILWGTDFTQADLSQAVMREADLSGAILTQANLEKANLIKSILEGANFSG 114



 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 33/60 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S     +ADLR+A   + NF  A   SAD+ ES   G+ F  A L +AV  +A+ +G ++
Sbjct: 33  SNVDLTAADLRQARLGRSNFGHACLRSADLSESILWGTDFTQADLSQAVMREADLSGAIL 92


>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 171

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           F  A+LR +     + R  +F +A+M E++F G+   GA L+ A   KAN T  ++
Sbjct: 60  FNKANLRNSNFTNADLRGVSFFAANMEEANFEGANLTGATLDLARMMKANLTNAIL 115


>gi|254421888|ref|ZP_05035606.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196189377|gb|EDX84341.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 194

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 59/149 (39%), Gaps = 12/149 (8%)

Query: 21  SKGPYQLHALSKPLWVACQIS-SKTESDGQFPDCSNNQCAGPYAKLK--NWRV--FVSTA 75
           S G   L   + P W   Q       +D + P C  N+     A+L   N +V       
Sbjct: 18  SIGLIGLLGFAAPSWAYLQEDVDMLMNDNECPVCILNEADLVGAQLNHANLKVASLTGAN 77

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A ++  +  +S L   N   A   G       AQ   A L+ AV    +   AN T 
Sbjct: 78  LTGADLSETNLMLSELIGTNLTNASLAG-------AQMNGAQLKDAVLKGADLSGANLTQ 130

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A++ +++F G+K     +  AV   ANFT
Sbjct: 131 ANLEDANFVGAKLINTEMTAAVVGVANFT 159


>gi|385679319|ref|ZP_10053247.1| pentapeptide repeat-containing protein [Amycolatopsis sp. ATCC
           39116]
          Length = 194

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 18/119 (15%)

Query: 53  CSNNQCAGPYAKLKNWR---------VFVSTALAAAVVASCSSNISALADLNKYEAETRG 103
           C+ ++C    A L   R         VF  T LA +  ++CS   S+  D        R 
Sbjct: 40  CTFDECDFSGADLGESRHQASAFRSCVFDRTVLADSTWSACSLLGSSFVDGGLRGMSVRD 99

Query: 104 -EFGIGSAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKA 156
            +F   S A F  A+LR+       FR A+F     T AD+R+SDF G++  GA L  A
Sbjct: 100 SDF---SLANFSRANLRRRSLSGLRFREASFVDANLTEADLRDSDFRGARLGGADLTGA 155


>gi|417305110|ref|ZP_12092092.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica WH47]
 gi|327538543|gb|EGF25205.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica WH47]
          Length = 349

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 5/55 (9%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F S +LR+A     NFR AN +++ ++ SD  G+ F GA LE A    A+  G
Sbjct: 244 ADFRSCNLRQA-----NFRDANLSNSKLQRSDLQGANFTGADLEGADLSGADLRG 293


>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 390

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 32/53 (60%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A    ADL +A+ +K NF +A+ +SA++ +S+   + F  AYL KA   +A+ 
Sbjct: 112 AHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADL 164


>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
 gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
          Length = 378

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 30/58 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S A    A L+ A  ++ N   A+ + AD+R +D SG+    A L KA   +AN T T
Sbjct: 43  SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTNT 100


>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 517

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ L   +  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RES    + FNGA L  A
Sbjct: 139 ADLRESKLQQTNFNGANLSGA 159



 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 32/55 (58%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F +A+LR+A     N   A+F+ A++R +D  G+  +GA L +A    AN +G
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSG 243


>gi|282895655|ref|ZP_06303780.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
 gi|281199349|gb|EFA74214.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
          Length = 171

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 16/98 (16%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
           + V ++ +L   +  +C   +++ A   +Y  E      I   A F   DLR +      
Sbjct: 12  FLVILNLSLLVIIPLTCLVGLTSTALALEYNKE------ILIGADFSQRDLRDS------ 59

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
               +FT A++R+SDFSGS   G     A    ANFTG
Sbjct: 60  ----SFTKANLRQSDFSGSNLTGVSFFAANLESANFTG 93


>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 517

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ L   +  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RES    + FNGA L  A
Sbjct: 139 ADLRESKLQQTNFNGANLSGA 159



 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 32/55 (58%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F +A+LR+A     N   A+F+ A++R +D  G+  +GA L +A    AN +G
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSG 243


>gi|376002742|ref|ZP_09780564.1| serine/threonine kinase [Arthrospira sp. PCC 8005]
 gi|375328798|emb|CCE16317.1| serine/threonine kinase [Arthrospira sp. PCC 8005]
          Length = 548

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 7/75 (9%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L+++N   A+ R        A FG A L +A     N   A  ++A++ ++D  G+   G
Sbjct: 447 LSNINFQNADLRD-------ANFGRARLTQAKLKNANLENAYLSTANLEKADLRGANLQG 499

Query: 151 AYLEKAVAYKANFTG 165
           AYL +A    AN  G
Sbjct: 500 AYLTRANLRGANLCG 514


>gi|162456757|ref|YP_001619124.1| pentapeptide repeat-containing protein [Sorangium cellulosum So
           ce56]
 gi|161167339|emb|CAN98644.1| pentapeptide repeats hypothetical protein [Sorangium cellulosum So
           ce56]
          Length = 895

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 15/101 (14%)

Query: 74  TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGS----ADLRKAVHVKENFR 129
           T L  AV+A  +     LA  N  +A  RG   +G AA  G+    ADL++AV  +    
Sbjct: 615 TNLEGAVLARAN-----LAGANLADARLRGA-NLGGAALRGASLDRADLKEAVLSRAELE 668

Query: 130 RANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 165
           RA F+ AD+  +D+      G+ F GA L +    K + +G
Sbjct: 669 RARFSGADLTGADWFETKPGGADFTGATLGQCNLLKVDLSG 709


>gi|440713213|ref|ZP_20893815.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica SWK14]
 gi|436442020|gb|ELP35204.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica SWK14]
          Length = 365

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 5/55 (9%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F S +LR+A     NFR AN +++ ++ SD  G+ F GA LE A    A+  G
Sbjct: 260 ADFRSCNLRQA-----NFRDANLSNSKLQRSDLQGANFTGADLEGADLSGADLRG 309


>gi|359462469|ref|ZP_09251032.1| periplasmic binding protein/LacI transcriptional regulator
           [Acaryochloris sp. CCMEE 5410]
          Length = 702

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 30/58 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A F  A L K++    NF +AN T A M +     +KFN A L +   Y+AN   +++
Sbjct: 275 ANFTDAVLHKSLLNNANFTKANLTRAKMHQVQGIWTKFNHAILHRTDLYQANLNRSIL 332


>gi|291570935|dbj|BAI93207.1| serine/threonine protein kinase [Arthrospira platensis NIES-39]
          Length = 543

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A FG A L +A     N   A  ++A++ ++D  G+   GAYL +A    AN  G
Sbjct: 455 ANFGRARLTEANFKNANLENAYLSTANLEKADLRGANLQGAYLTRANLRGANLCG 509


>gi|425463676|ref|ZP_18843006.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9809]
 gi|389830336|emb|CCI27806.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9809]
          Length = 179

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|172055186|ref|YP_001806513.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|171701467|gb|ACB54447.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
          Length = 280

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 8/98 (8%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKE 126
           + ++    + V    + N + L D N  +A+  G    +   S A   SA+LR A     
Sbjct: 84  ILLNLRFTSKVTKKANLNYADLKDHNLSKADLSGADLNYANLSGANLTSANLRYA----- 138

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           N R A+ + AD+ E++F+ +  +GA L  A   +AN T
Sbjct: 139 NLRGADLSGADLSETNFTYANLSGASLRYANLSRANLT 176


>gi|381207646|ref|ZP_09914717.1| hypothetical protein SclubJA_18738 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 219

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 27/55 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            +A    ADL +A   K + R AN   AD+RES+  G    GA L+ A    AN 
Sbjct: 126 QSADLSEADLYRADLEKSDLRDANLYKADLRESNLQGVNLQGANLQGADLEGANL 180


>gi|434397472|ref|YP_007131476.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428268569|gb|AFZ34510.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 455

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 8/73 (10%)

Query: 99  AETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
           A  RG   +GS    A F  ADLR A     NF  A+ T+A++ +++ SG+ F GA L +
Sbjct: 99  ANPRGARLVGSNLNLANFSGADLRVA-----NFNGADLTAANLSQANLSGADFFGATLIR 153

Query: 156 AVAYKANFTGTLI 168
           A    AN  G ++
Sbjct: 154 ADLSLANLEGAIL 166



 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           ADL +A   K NF+RAN T AD  E++   + F GA L +A
Sbjct: 189 ADLSEANLNKANFQRANLTEADFVEANLVQTNFKGANLSRA 229


>gi|209526096|ref|ZP_03274628.1| serine/threonine protein kinase with pentapeptide repeats
           [Arthrospira maxima CS-328]
 gi|409994186|ref|ZP_11277304.1| serine/threonine protein kinase [Arthrospira platensis str. Paraca]
 gi|209493484|gb|EDZ93807.1| serine/threonine protein kinase with pentapeptide repeats
           [Arthrospira maxima CS-328]
 gi|409934934|gb|EKN76480.1| serine/threonine protein kinase [Arthrospira platensis str. Paraca]
          Length = 548

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 7/75 (9%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L+++N   A+ R        A FG A L +A     N   A  ++A++ ++D  G+   G
Sbjct: 447 LSNINFQNADLRD-------ANFGRARLTQAKLKNANLENAYLSTANLEKADLRGANLQG 499

Query: 151 AYLEKAVAYKANFTG 165
           AYL +A    AN  G
Sbjct: 500 AYLTRANLRGANLCG 514


>gi|427734893|ref|YP_007054437.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369934|gb|AFY53890.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 257

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 27/50 (54%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           ADLR A     N  RAN + AD+R+++ SG+   GA L  A   + N  G
Sbjct: 57  ADLRGANLSGANLSRANLSGADLRDANLSGAGLFGANLSNAKLSRVNLLG 106


>gi|392410675|ref|YP_006447282.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
 gi|390623811|gb|AFM25018.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
          Length = 179

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 32/51 (62%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           SA+L +A+ +  +  RAN  SAD+  ++ + ++  GA L+ A   +A+FTG
Sbjct: 98  SANLHRALLIGADLSRANLASADLIGANLTDARLTGANLKAAFLTRADFTG 148


>gi|425450096|ref|ZP_18829928.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           7941]
 gi|425461428|ref|ZP_18840906.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9808]
 gi|425468340|ref|ZP_18847367.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9701]
 gi|389769245|emb|CCI05876.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           7941]
 gi|389825713|emb|CCI24321.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9808]
 gi|389885019|emb|CCI34748.1| Pentapeptide repeat containing protein [Microcystis aeruginosa PCC
           9701]
          Length = 179

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|159044411|ref|YP_001533205.1| hypothetical protein Dshi_1862 [Dinoroseobacter shibae DFL 12]
 gi|157912171|gb|ABV93604.1| hypothetical protein Dshi_1862 [Dinoroseobacter shibae DFL 12]
          Length = 103

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 27/52 (51%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
            A L +A  +  N R AN T A++  +DF+GS    A L  A    ANFT T
Sbjct: 19  DAKLNRARLIGANLRGANLTGAELVGADFTGSNLENAILHGADISGANFTKT 70


>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 508

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + A F   DLR+A   + N   AN + A++R +D SG+   GA L +A    AN  G
Sbjct: 179 NGADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANLYG 235



 Score = 37.7 bits (86), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    A+LR A     NF  AN   AD+ ++D +G+ F+G  L +A   + N +G
Sbjct: 144 SEANLSGANLRGASGTAANFELANLHGADLSKADLNGADFSGTDLRQANLCQVNLSG 200



 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 55/112 (49%), Gaps = 18/112 (16%)

Query: 73  STALAAAVVASCSSNISAL--ADLNKYE----AETRGEFGIG-------SAAQFGSADLR 119
           S+ L  A++   + N++ L  ADL++ +    A  RGE           S A    ADLR
Sbjct: 75  SSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANLTGADLR 134

Query: 120 KAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGT 166
           +A   + NF  AN + A++R      ++F  +  +GA L KA    A+F+GT
Sbjct: 135 EAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKADLNGADFSGT 186


>gi|428773363|ref|YP_007165151.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
           7202]
 gi|428687642|gb|AFZ47502.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
          Length = 319

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 22/38 (57%)

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           F  ANF+ AD+RE+D   S   G +L +A   +AN  G
Sbjct: 172 FNEANFSRADLREADLKNSILEGVFLHRANLSRANLRG 209


>gi|166367330|ref|YP_001659603.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
           NIES-843]
 gi|166089703|dbj|BAG04411.1| pentapeptide repeat containing protein [Microcystis aeruginosa
           NIES-843]
          Length = 179

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|428203120|ref|YP_007081709.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427980552|gb|AFY78152.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 241

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  +DLR+A   + NF +A F  AD+ E+   GS  + A L +AV   A+ +G
Sbjct: 39  ADFSRSDLRQARLGRTNFMQAIFREADLSEAILWGSDLSQADLSRAVLRDADLSG 93


>gi|323137846|ref|ZP_08072921.1| pentapeptide repeat protein [Methylocystis sp. ATCC 49242]
 gi|322396849|gb|EFX99375.1| pentapeptide repeat protein [Methylocystis sp. ATCC 49242]
          Length = 263

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 29/51 (56%)

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           LR AV     F++AN  SAD+R +  +G+ F GA L  A A  A+  G ++
Sbjct: 178 LRSAVLEGAQFQKANLMSADLRFAHAAGADFTGANLMNADASGADLAGVIL 228


>gi|158337957|ref|YP_001519133.1| periplasmic binding protein/LacI transcriptional regulator
           [Acaryochloris marina MBIC11017]
 gi|158308198|gb|ABW29815.1| periplasmic binding protein/LacI transcriptional regulator,
           putative [Acaryochloris marina MBIC11017]
          Length = 702

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 30/58 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A F  A L K++    NF +AN T A M +     +KFN A L +   Y+AN   +++
Sbjct: 275 ANFTDAVLHKSLLNNANFTKANLTRAKMHQVQGIWTKFNHAILHRTDLYQANLNRSIL 332


>gi|440751694|ref|ZP_20930897.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|440176187|gb|ELP55460.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 179

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|428220362|ref|YP_007104532.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427993702|gb|AFY72397.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 329

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           LA A + + ++ ++   D N  +A+ R +  +   A    A LR A   KE  R AN T 
Sbjct: 92  LAGATMVNANAALANFLDANLIDADMR-DISL-RGANLAGACLRGANLRKELKRSANLTG 149

Query: 136 A-----DMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A     D+R ++F+G+  +GA L  A   +A FTG
Sbjct: 150 ACLHKADLRGANFTGADLSGADLRSANLTEATFTG 184


>gi|254409700|ref|ZP_05023481.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183697|gb|EDX78680.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 448

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 2/78 (2%)

Query: 89  SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
           S L  +N  EA+        +  QF  ADLRKA  +  + + AN T  +++E+D   +K 
Sbjct: 110 SKLVKVNLQEADLEDANLQNANLQF--ADLRKANLMNASLQNANLTRTNLQETDLRQAKL 167

Query: 149 NGAYLEKAVAYKANFTGT 166
             A LE A    AN  GT
Sbjct: 168 ANASLEGANLGDANLEGT 185


>gi|254410638|ref|ZP_05024417.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182844|gb|EDX77829.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 304

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           + A   +A+L KA  V  N   AN  SA + + + SG+  NGA L  A   K+N  G 
Sbjct: 232 TGANLSNANLLKAFLVNANLSEANLNSARLVDINMSGANLNGADLSDAELRKSNLCGV 289


>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
 gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 171

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKA 161
           +G  A F  ADL  A+  +  F  A+F+ AD     M  +DF+G+    A L   +A  +
Sbjct: 66  VGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNLRDAVLTGIIASGS 125

Query: 162 NFTGTLIA 169
           +F+   IA
Sbjct: 126 SFSDAQIA 133


>gi|354556796|ref|ZP_08976083.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|353551246|gb|EHC20655.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 253

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 8/98 (8%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKE 126
           + ++    + V    + N + L D N  +A+  G    +   S A   SA+LR A     
Sbjct: 57  ILLNLRFTSKVTKKANLNYADLKDHNLSKADLSGADLNYANLSGANLTSANLRYA----- 111

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           N R A+ + AD+ E++F+ +  +GA L  A   +AN T
Sbjct: 112 NLRGADLSGADLSETNFTYANLSGASLRYANLSRANLT 149


>gi|307150734|ref|YP_003886118.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306980962|gb|ADN12843.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 231

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 32/60 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A F  +D R +   K NF  A F  AD  E+   G+ F+ A LEKA+  + + +G ++
Sbjct: 33  SRADFSYSDFRSSRLGKTNFSAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGAIL 92


>gi|428201834|ref|YP_007080423.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427979266|gb|AFY76866.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 143

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           SAA    ADLR+A     N   AN T A++  +D +G+   GA L KA    A F
Sbjct: 49  SAAHLIGADLREA-----NLSGANLTEANLEGADLTGANLQGANLTKAFVTNATF 98


>gi|300867247|ref|ZP_07111907.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334724|emb|CBN57073.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 520

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADL   +A  RG       A    A L +A+ V+ NF+ A+ + AD+  +D  GS+   A
Sbjct: 135 ADLT--QANLRG-------AHLSGASLTEALLVEANFQGADLSRADLSHADLRGSELRQA 185

Query: 152 YLEKAVAYKANFTGT 166
            L +A+   A+ +G 
Sbjct: 186 NLTQAILSGADLSGV 200


>gi|167615625|ref|ZP_02384260.1| pentapeptide repeat family protein [Burkholderia thailandensis Bt4]
          Length = 346

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 5/58 (8%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 165
           F    LR A   +    RA+F++AD+RE+ F      G+ F GA L++A    A+FTG
Sbjct: 58  FAGCRLRGASFERALLSRADFSNADLREATFVDASAPGASFRGAALDRARLAHADFTG 115


>gi|307154970|ref|YP_003890354.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306985198|gb|ADN17079.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 231

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 28/47 (59%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           DLR+A   + N   AN   AD+ +++ SG+  + A+LEKA+   AN 
Sbjct: 55  DLREANLTQANLNWANLHKADLTQANLSGANLSQAFLEKAILIAANL 101


>gi|425440521|ref|ZP_18820821.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9717]
 gi|389719024|emb|CCH97087.1| Genome sequencing data, contig C282 [Microcystis aeruginosa PCC
           9717]
          Length = 179

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIVTGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|428315093|ref|YP_007119110.1| pentapeptide repeat-containing protein [Oscillatoria nigro-viridis
           PCC 7112]
 gi|428245128|gb|AFZ10911.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 319

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 29/58 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S     +A+LR    V+ NF +AN T+A ++ +D SG+      L      + N TGT
Sbjct: 86  SDTNLENANLRSTCLVEANFSKANLTNAQLKYADLSGANLENTNLNNVSLAETNLTGT 143


>gi|288960397|ref|YP_003450737.1| pentapeptide repeat protein [Azospirillum sp. B510]
 gi|288912705|dbj|BAI74193.1| pentapeptide repeat protein [Azospirillum sp. B510]
          Length = 431

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 27/55 (49%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
            F  A +R    V+   R ANFT ++M  +D SG+   GA L  AV   A  T T
Sbjct: 167 DFSDAVMRGCKLVRATMRGANFTGSNMEGADLSGADLRGACLRGAVLTGATMTMT 221


>gi|145220459|ref|YP_001131168.1| pentapeptide repeat-containing protein [Chlorobium phaeovibrioides
           DSM 265]
 gi|145206623|gb|ABP37666.1| pentapeptide repeat protein [Chlorobium phaeovibrioides DSM 265]
          Length = 442

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  AD+++    + N ++ANF  A ++ +D SG+  +GA L  A    AN  G
Sbjct: 324 ADFRKADMKRTCLKEANLQKANFDRAFLKNADLSGANLSGAMLYGASLSGANLNG 378


>gi|189347104|ref|YP_001943633.1| pentapeptide repeat-containing protein [Chlorobium limicola DSM
           245]
 gi|189341251|gb|ACD90654.1| pentapeptide repeat protein [Chlorobium limicola DSM 245]
          Length = 408

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 33/55 (60%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A+  SA LR A+ V+ +  +A   +AD+ ++    + F GA+++ AV  KA+ TG
Sbjct: 92  ARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAASFKGAFMQTAVLKKADCTG 146


>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 166

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 46/106 (43%), Gaps = 22/106 (20%)

Query: 72  VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 123
           ++T L A +V  C   + ALA   KY         AE +G+        F    LR A  
Sbjct: 6   LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56

Query: 124 VKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFT 164
              N  R NFT AD+R + FS S       +GA L  A+  + +FT
Sbjct: 57  SNANLERTNFTDADLRGTIFSASVMTHANLHGADLSNAMIDQVSFT 102


>gi|386828484|ref|ZP_10115591.1| putative low-complexity protein [Beggiatoa alba B18LD]
 gi|386429368|gb|EIJ43196.1| putative low-complexity protein [Beggiatoa alba B18LD]
          Length = 986

 Score = 37.4 bits (85), Expect = 2.2,   Method: Composition-based stats.
 Identities = 14/39 (35%), Positives = 23/39 (58%)

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           +N R  +F+  D+R +DFSG+    A  + A+ Y  NF+
Sbjct: 645 QNLRGQDFSGQDLRYADFSGADLTDALFKNAILYHVNFS 683


>gi|218711080|ref|YP_002418700.1| Microcin immunity mcbG [Escherichia coli ED1a]
 gi|218349863|emb|CAQ87265.1| Microcin immunity mcbG [Escherichia coli ED1a]
          Length = 187

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 29/122 (23%), Positives = 48/122 (39%), Gaps = 10/122 (8%)

Query: 49  QFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG 108
           +F DC   +C      +KN +      L    +  C      L  +N  +      F + 
Sbjct: 37  KFRDCEFEKCRFVNCSIKNLK------LNFFKLIDCEFKDCLLQGVNAADIMFPCTFSLV 90

Query: 109 SA----AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S       F    L+K++ +  +FR   F   D+R+SDF+GS FN      +     +F+
Sbjct: 91  SCDLRFVDFIGLRLQKSIFLSSHFRDCLFEETDLRKSDFTGSAFNNTEFRHSDLSHCDFS 150

Query: 165 GT 166
            T
Sbjct: 151 MT 152


>gi|428200510|ref|YP_007079099.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427977942|gb|AFY75542.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 174

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 5/55 (9%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    ADLR+A  +      AN + AD++E++ SG+  + A L  AV  KAN +G
Sbjct: 60  ASLDRADLREACLIV-----ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSG 109


>gi|407781463|ref|ZP_11128681.1| pentapeptide repeat-containing protein [Oceanibaculum indicum P24]
 gi|407207680|gb|EKE77611.1| pentapeptide repeat-containing protein [Oceanibaculum indicum P24]
          Length = 443

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 30/58 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S A   +ADLR A     NFR A  T  ++   + +G+ F+GA L  A  + ANF G 
Sbjct: 171 SEANLSNADLRNADLRMSNFRNAIMTGVNLIGVNAAGADFHGAVLTNARIHDANFDGV 228


>gi|119486371|ref|ZP_01620430.1| hypothetical protein L8106_16994 [Lyngbya sp. PCC 8106]
 gi|119456584|gb|EAW37714.1| hypothetical protein L8106_16994 [Lyngbya sp. PCC 8106]
          Length = 772

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 5/104 (4%)

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRK 120
           +A LKN  +  +  +AA++    S+N+S  A+L+    E     G   + A    A+LR 
Sbjct: 532 HANLKNANLSTANLMAASL---NSANLSD-ANLSHANLECANLKGANLTGANLSYANLRG 587

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A     N R AN + AD+R  + S +  + AYL  A  Y+AN +
Sbjct: 588 ANLSGVNLRDANLSYADLRRVNLSQANLDSAYLRGANLYRANIS 631


>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
          Length = 1263

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 24/50 (48%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
            A F   DL  A+    N R ANFT A +  +DFSGS   GA +     Y
Sbjct: 577 GADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGY 626


>gi|428226949|ref|YP_007111046.1| hypothetical protein GEI7407_3527 [Geitlerinema sp. PCC 7407]
 gi|427986850|gb|AFY67994.1| Tetratricopeptide TPR_1 repeat-containing protein [Geitlerinema sp.
           PCC 7407]
          Length = 575

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 46/102 (45%), Gaps = 1/102 (0%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKE 126
           WR   + AL  A +    + ++   +  K   ETR       S   +G A+L        
Sbjct: 15  WRSLAALALVVAPMVGTDAALAEKPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNS 74

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           N   A+F SAD++ +DFS +    A LE+A   +A+F   ++
Sbjct: 75  NLENADFESADLQRTDFSSANLRRADLERADLERADFQSAIL 116


>gi|381151529|ref|ZP_09863398.1| putative low-complexity protein [Methylomicrobium album BG8]
 gi|380883501|gb|EIC29378.1| putative low-complexity protein [Methylomicrobium album BG8]
          Length = 739

 Score = 37.4 bits (85), Expect = 2.3,   Method: Composition-based stats.
 Identities = 29/103 (28%), Positives = 42/103 (40%), Gaps = 7/103 (6%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           AKL+    +  T L  A++       + L + +   A+ RG       A    ADL  A 
Sbjct: 444 AKLQGVAGWDKTQLQGAILGGTQLQGAVLVEADLQGADLRG-------ADLQGADLSWAN 496

Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
               + R AN    D+R +   G+   GA L+ A   KAN  G
Sbjct: 497 LQSADLRGANLQGVDLRGAKLQGADLRGAKLQGATLRKANLQG 539


>gi|220910319|ref|YP_002485630.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219866930|gb|ACL47269.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 165

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + AQ   ADLRKAV    N   AN   A +R ++F+G+  +GA L  A A ++  
Sbjct: 93  TGAQLPKADLRKAVLSGANLAGANLRDAKLRGANFAGADLHGADLFGAEALRSEL 147


>gi|409989360|ref|ZP_11272974.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409939778|gb|EKN80828.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 333

 Score = 37.4 bits (85), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 39/87 (44%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 242 FIKANLMKADLEECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 294

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L+ A+
Sbjct: 295 ANLENADLRDASFRHATLNGAMLQDAI 321



 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 2/70 (2%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
            EA   G F  G+   +  A LR +  ++ +  +A+ + A + +++  G+K +GA +   
Sbjct: 58  LEANLNGAFLYGTNLNY--AKLRDSCLIEADLTKADLSGAQLHKANLMGAKLSGAVMSWV 115

Query: 157 VAYKANFTGT 166
             Y+ANF G 
Sbjct: 116 TLYRANFPGV 125


>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
 gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
          Length = 268

 Score = 37.4 bits (85), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 23/37 (62%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           N   AN  +AD+ E++   ++ NGAYL KA  YKAN 
Sbjct: 160 NLIEANLINADLSEANLYEAQLNGAYLYKANFYKANL 196


>gi|434395496|ref|YP_007130443.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428267337|gb|AFZ33283.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 249

 Score = 37.4 bits (85), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 5/60 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A    A+L+ A     N   AN + AD+ E+D SG+  +GA L   +   AN +  ++
Sbjct: 128 SGANLAQANLKGA-----NLTEANLSKADLTEADLSGADLSGATLSGVILSDANLSDAIL 182


>gi|390437869|ref|ZP_10226382.1| Pentapeptide repeat containing protein [Microcystis sp. T1-4]
 gi|389838702|emb|CCI30506.1| Pentapeptide repeat containing protein [Microcystis sp. T1-4]
          Length = 179

 Score = 37.4 bits (85), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A    ADL  A +++ N R A+ T A++R +DF  +   GA L  A+   A+  G  +
Sbjct: 25  AGADLAGADLAGA-NLRANLRGADLTGANLRGADFRNADLRGAILLDAIITGASLAGAFL 83

Query: 169 A 169
           A
Sbjct: 84  A 84


>gi|428306568|ref|YP_007143393.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428248103|gb|AFZ13883.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 272

 Score = 37.4 bits (85), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 5/58 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S++Q   ADL +A     N  RAN   AD+  ++ SG+   GA L  A    AN TGT
Sbjct: 71  SSSQLQGADLTRA-----NLSRANLAGADLTGANLSGTSLYGANLTGANLTGANLTGT 123


>gi|434391008|ref|YP_007125955.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428262849|gb|AFZ28795.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 139

 Score = 37.0 bits (84), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    ADL +A  V  NFR A+ + A++  ++ SG+  NGA L  A    AN +G
Sbjct: 40  ASLSEADLSQADLVGTNFREASLSKANLSAANLSGAILNGANLFAANLKGANLSG 94


>gi|334119117|ref|ZP_08493204.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333458588|gb|EGK87205.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 238

 Score = 37.0 bits (84), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 28/58 (48%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A   +ADLR+++  + N  RAN + ADM  +D   +    A L       A  TGT I
Sbjct: 142 ANLSNADLRRSLLFRVNLNRANLSGADMSFADVRDTNLQNAILSNTRLPGAQLTGTNI 199


>gi|16126499|ref|NP_421063.1| pentapeptide repeat-containing protein [Caulobacter crescentus
           CB15]
 gi|221235279|ref|YP_002517716.1| hypothetical protein CCNA_02343 [Caulobacter crescentus NA1000]
 gi|13423771|gb|AAK24231.1| pentapeptide repeat family protein [Caulobacter crescentus CB15]
 gi|220964452|gb|ACL95808.1| hypothetical protein with pentapeptide repeats [Caulobacter
           crescentus NA1000]
          Length = 419

 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 30/55 (54%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           + + A F  A L+ A  V+ N ++ANF  A++  +D SG+   GA L  AV   A
Sbjct: 166 VATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLAGADLRNAVLVGA 220


>gi|414077510|ref|YP_006996828.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
 gi|413970926|gb|AFW95015.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
          Length = 269

 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 17/85 (20%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKA----VHV------KENFRRANFTSADMRE 140
           L++ N YEAE          AQ   A+L KA     H+      K N   AN T AD+  
Sbjct: 171 LSEANLYEAELM-------TAQLYQANLHKANLTKAHLGNAFLSKANLTEANLTEADLSW 223

Query: 141 SDFSGSKFNGAYLEKAVAYKANFTG 165
           ++  G+   GA L+ A    ANF G
Sbjct: 224 ANLKGANLAGANLKGATIRGANFQG 248


>gi|71906323|ref|YP_283910.1| pentapeptide repeat-containing protein [Dechloromonas aromatica
           RCB]
 gi|71845944|gb|AAZ45440.1| Pentapeptide repeat [Dechloromonas aromatica RCB]
          Length = 215

 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 15/112 (13%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA----AQFGSADLRKAVH--- 123
           F    L  A + S     S LAD N   A+ RG   + SA    AQ GSA L KA     
Sbjct: 75  FKGADLRGANLKSARLERSDLADANLEGADLRGA-NLRSASLTRAQLGSAKLSKAFLEGA 133

Query: 124 -------VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
                  +  +  RA    AD++  D  G++ +GA+L  +   +AN +G+ I
Sbjct: 134 DLSSSRLIGADLTRAQLVGADLQLVDLRGAQLDGAFLNYSNLKQANLSGSSI 185


>gi|376007502|ref|ZP_09784697.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
 gi|375324138|emb|CCE20450.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
          Length = 179

 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 29/62 (46%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           RGE+        G AD+          R+AN T+A+M + DF+G+ F  + L  +     
Sbjct: 40  RGEYSSCQGCNLGGADMSNQSRRNAQLRQANLTNANMSDGDFTGAFFTCSNLSNSNLSGG 99

Query: 162 NF 163
           NF
Sbjct: 100 NF 101


>gi|427718922|ref|YP_007066916.1| peptidase C14 caspase catalytic subunit p20 [Calothrix sp. PCC
           7507]
 gi|427351358|gb|AFY34082.1| peptidase C14 caspase catalytic subunit p20 [Calothrix sp. PCC
           7507]
          Length = 1102

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 28/56 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S+A  G ADLR A       R AN + AD+R +D  G+   GA L  A    AN +
Sbjct: 839 SSANLGGADLRGADLSSAYLRGANLSYADLRGADLRGADLRGADLRGANLSSANLS 894


>gi|411116620|ref|ZP_11389107.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410712723|gb|EKQ70224.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 168

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 15/73 (20%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM---------------RESDFSGSKFNGAYL 153
           S+A    A+L+ A   K N R+AN   AD+                E++  G+ FN A L
Sbjct: 82  SSANLARANLKVANLDKANLRKANLEFADLSWAGLIHANLLEANLHEANLHGANFNSATL 141

Query: 154 EKAVAYKANFTGT 166
            +A+  KAN  GT
Sbjct: 142 YRAILTKANLEGT 154


>gi|56751209|ref|YP_171910.1| hypothetical protein syc1200_c [Synechococcus elongatus PCC 6301]
 gi|81299124|ref|YP_399332.1| hypothetical protein Synpcc7942_0313 [Synechococcus elongatus PCC
           7942]
 gi|56686168|dbj|BAD79390.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81168005|gb|ABB56345.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 170

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 10/62 (16%)

Query: 117 DLRKAVHVKENFRR-----ANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGT 166
           D  K + ++ NF       ANFT A++R SDFS     G +F GA LE    + A+ + T
Sbjct: 39  DFTKEILIESNFSNRDLSDANFTKANLRSSDFSNSVLVGVRFYGANLESVDLHGADLSNT 98

Query: 167 LI 168
           ++
Sbjct: 99  IL 100


>gi|428318454|ref|YP_007116336.1| serine/threonine protein kinase with pentapeptide repeats
           [Oscillatoria nigro-viridis PCC 7112]
 gi|428242134|gb|AFZ07920.1| serine/threonine protein kinase with pentapeptide repeats
           [Oscillatoria nigro-viridis PCC 7112]
          Length = 543

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 28/56 (50%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AA    A+L  A  ++ N R AN T A    ++F G+ F GA L  A   KAN  G
Sbjct: 450 AADLSGANLGHARLIQANLRDANLTEAYCSTANFEGADFRGADLTGAYLTKANLRG 505


>gi|428305676|ref|YP_007142501.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428247211|gb|AFZ12991.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 330

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 89  SALADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
           S L+  N  +A+ RG     +    A+   ADLR A     N   AN   AD+R++D SG
Sbjct: 65  SKLSGANLIQADLRGAMLHDADLHGARLQGADLRGADITLANLLDANLMEADLRDADLSG 124

Query: 146 SKFNGAYLEKA 156
           +   GA L  A
Sbjct: 125 ANLTGACLRGA 135


>gi|33862830|ref|NP_894390.1| hypothetical protein PMT0557 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634746|emb|CAE20732.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 198

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 10/72 (13%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYL 153
           EF  G A +  S D+      ++NF +A+    D+ E+D  G+ FN          GA L
Sbjct: 63  EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122

Query: 154 EKAVAYKANFTG 165
           E  VA+ + F G
Sbjct: 123 ENVVAFASRFDG 134


>gi|428222289|ref|YP_007106459.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
 gi|427995629|gb|AFY74324.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
          Length = 563

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 28/51 (54%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           ADL  A   + + R AN  SA M ++D SG+   GA L+ A   +AN  GT
Sbjct: 477 ADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYLSQANLRGT 527


>gi|75910293|ref|YP_324589.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704018|gb|ABA23694.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 143

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 4/70 (5%)

Query: 96  KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
           KYEA  R +F    AA+   ADL++      +   AN  +A + ++D SG+  +G YL K
Sbjct: 18  KYEAGER-DF---RAAELSKADLQETYLEGVDLSGANLDAAKLSKADLSGANLSGVYLRK 73

Query: 156 AVAYKANFTG 165
           A    AN +G
Sbjct: 74  ANLRGANLSG 83


>gi|186681231|ref|YP_001864427.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186463683|gb|ACC79484.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 282

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A+   ADL + V    +   A+   A+++ +D SG+   GAYL +A   +AN +G
Sbjct: 59  SKAKLLGADLSELVLSNADLSGADLRGANLQGADLSGANLQGAYLNRANLQQANLSG 115


>gi|425437827|ref|ZP_18818239.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9432]
 gi|389677087|emb|CCH93934.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9432]
          Length = 976

 Score = 37.0 bits (84), Expect = 2.7,   Method: Composition-based stats.
 Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 8/89 (8%)

Query: 91  LADLNKYEAETRGEFGIGS--------AAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
           L   N YEA   G +  G+         A    A+L  A   + N  RAN   A++  ++
Sbjct: 884 LKRANLYEANLYGAYLAGAYLEGANLERANLYGANLEGANLERANLERANLKGANLEGAN 943

Query: 143 FSGSKFNGAYLEKAVAYKANFTGTLIATE 171
              +   GA+L  A    AN  GT++ TE
Sbjct: 944 LERANLEGAFLRGANFKDANVKGTILDTE 972


>gi|158335471|ref|YP_001516643.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158305712|gb|ABW27329.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 502

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 91  LADLNKYEAE-TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           L D+N   A  +R    +   S A    +DL  A   + N  R NF+ AD+ ++D S ++
Sbjct: 33  LKDINLINANLSRANLSLANLSGAFLAGSDLSDAFLSEANLSRVNFSRADLTKADLSFAR 92

Query: 148 FNGAYLEKAVAYKA 161
             GA L +A  Y+A
Sbjct: 93  LQGATLIEATLYQA 106


>gi|124023397|ref|YP_001017704.1| hypothetical protein P9303_16951 [Prochlorococcus marinus str. MIT
           9303]
 gi|123963683|gb|ABM78439.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 198

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 10/72 (13%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYL 153
           EF  G A +  S D+      ++NF +A+    D+ E+D  G+ FN          GA L
Sbjct: 63  EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122

Query: 154 EKAVAYKANFTG 165
           E  VA+ + F G
Sbjct: 123 ENVVAFASRFDG 134


>gi|427720966|ref|YP_007068960.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427353402|gb|AFY36126.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 168

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 5/72 (6%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK- 160
           R  F + +   + + +L       E+   A F +A+MR ++F G+    A L K V  K 
Sbjct: 27  RPAFALTNVINYNNINLENRDFAHEDLTGATFVAAEMRGANFQGANLTNAVLTKGVLLKA 86

Query: 161 ----ANFTGTLI 168
               AN TG L+
Sbjct: 87  DLSDANLTGALV 98


>gi|307152584|ref|YP_003887968.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982812|gb|ADN14693.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 333

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 7/71 (9%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           N ++A+ +G       A     DL +A     N + AN   AD+R++D S +   G  L 
Sbjct: 152 NLFKADLQG-------ANMKGVDLARANLSGANLKEANLRDADLRKADLSKANLTGTILS 204

Query: 155 KAVAYKANFTG 165
           +A    AN TG
Sbjct: 205 EANLVGANLTG 215


>gi|83717943|ref|YP_439059.1| pentapeptide repeat-containing protein [Burkholderia thailandensis
           E264]
 gi|257142167|ref|ZP_05590429.1| pentapeptide repeat-containing protein [Burkholderia thailandensis
           E264]
 gi|83651768|gb|ABC35832.1| pentapeptide repeat family protein [Burkholderia thailandensis
           E264]
          Length = 872

 Score = 37.0 bits (84), Expect = 2.8,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 5/58 (8%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 165
           F    LR A   +    RA+F++AD+RE+ F      G+ F GA L++A    A+FTG
Sbjct: 584 FAGCRLRGASFERALLSRADFSNADLREATFVDASAPGASFRGAALDRARLAHADFTG 641


>gi|334119964|ref|ZP_08494048.1| serine/threonine protein kinase with pentapeptide repeats
           [Microcoleus vaginatus FGP-2]
 gi|333457605|gb|EGK86228.1| serine/threonine protein kinase with pentapeptide repeats
           [Microcoleus vaginatus FGP-2]
          Length = 543

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 28/56 (50%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AA    A+L  A  ++ N R AN T A    ++F G+ F GA L  A   KAN  G
Sbjct: 450 AADLSGANLGHARLIQANLRDANLTEAYCSTANFEGADFRGADLTGAYLTKANLRG 505


>gi|75911106|ref|YP_325402.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704831|gb|ABA24507.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 268

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 48/113 (42%), Gaps = 39/113 (34%)

Query: 81  VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
           VA+ S +I   ADL      +   F IG  A F  A+LR A+  + N    +F+SAD+R+
Sbjct: 93  VANLSQSILTQADL------SHAHF-IG--ADFSGANLRGAIVAEANLIGTDFSSADLRD 143

Query: 141 SDFSGSKF------------------------------NGAYLEKAVAYKANF 163
           +D +G+K                                GAYL KA  YKAN 
Sbjct: 144 ADLAGAKLIRSNLCFANLIAANLIAADFSEANLYQAEVMGAYLYKANFYKANL 196



 Score = 36.2 bits (82), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 33/68 (48%), Gaps = 10/68 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 158
           S A   SA+L +A   + N   ANF          T AD+  + F G+ F+GA L  A+ 
Sbjct: 67  SGADLSSANLYQAKISEANLSAANFSVANLSQSILTQADLSHAHFIGADFSGANLRGAIV 126

Query: 159 YKANFTGT 166
            +AN  GT
Sbjct: 127 AEANLIGT 134


>gi|298245086|ref|ZP_06968892.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297552567|gb|EFH86432.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 394

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 34/69 (49%), Gaps = 5/69 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANF 163
           S      A+L KA  +  N  R + + AD+ ++DF     SG+  +GA L +A+  KAN 
Sbjct: 318 SRTNLTKANLSKADLISANLSRGDLSGADLSKADFSGANLSGANLSGATLNEAILNKANI 377

Query: 164 TGTLIATEH 172
              L  TE 
Sbjct: 378 QQALNITEE 386


>gi|448684742|ref|ZP_21692829.1| pentapeptide repeat-containing protein [Haloarcula japonica DSM
           6131]
 gi|445782673|gb|EMA33514.1| pentapeptide repeat-containing protein [Haloarcula japonica DSM
           6131]
          Length = 710

 Score = 37.0 bits (84), Expect = 2.9,   Method: Composition-based stats.
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           AQFG+ D   A   + +FR A F +A    + FSG  FNG   ++AV
Sbjct: 230 AQFGTGDFYHATFDEADFRWAEFGTARFYGATFSGGYFNGTSYDEAV 276


>gi|282899050|ref|ZP_06307031.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281195966|gb|EFA70882.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 268

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 41/85 (48%), Gaps = 8/85 (9%)

Query: 89  SALADLNKYEAETRGEFGIG--------SAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
           S L D N YEAE    +           S +  GS+ L +A  ++ N   A+ T A++++
Sbjct: 169 SNLQDCNLYEAEIINSYLYDTNLSRANLSRSHLGSSYLCRANFMEANLTSADLTGANLKD 228

Query: 141 SDFSGSKFNGAYLEKAVAYKANFTG 165
           ++ +G+   GA L  A    AN TG
Sbjct: 229 ANLAGANLQGANLRCANLTGANLTG 253


>gi|430900982|ref|ZP_19484783.1| LPXTG-domain-containing protein cell wall anchor domain
           [Enterococcus faecium E1575]
 gi|430554860|gb|ELA94429.1| LPXTG-domain-containing protein cell wall anchor domain
           [Enterococcus faecium E1575]
          Length = 1074

 Score = 37.0 bits (84), Expect = 2.9,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQF 113
           +KLKNWRV V   +   ++AS  S+I   AD+N  E +   E+G+G+  +F
Sbjct: 4   SKLKNWRVAVVVVMIIQLLASFVSSIIVHADINHPE-QVSIEYGVGTGYRF 53


>gi|303287274|ref|XP_003062926.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455562|gb|EEH52865.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 182

 Score = 37.0 bits (84), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 120 KAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           KA HV E+F  ++     +T  D+R SDFSGS    A   +A+    N  G+
Sbjct: 17  KAEHVNEDFSHSDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAIMPGVNLEGS 68


>gi|390440421|ref|ZP_10228750.1| membrane hypothetical protein [Microcystis sp. T1-4]
 gi|389836163|emb|CCI32876.1| membrane hypothetical protein [Microcystis sp. T1-4]
          Length = 904

 Score = 37.0 bits (84), Expect = 3.0,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 7/67 (10%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L D+N  E++  G       A   +A+L+KA   +    R +FT+AD+ ++D +G+   G
Sbjct: 534 LKDINFTESDLSG-------ALLRNANLKKANLTRTILNRVDFTNADLSDADLTGASVKG 586

Query: 151 AYLEKAV 157
           A  + A+
Sbjct: 587 AKFDNAI 593


>gi|443325444|ref|ZP_21054139.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794954|gb|ELS04346.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 791

 Score = 37.0 bits (84), Expect = 3.0,   Method: Composition-based stats.
 Identities = 18/56 (32%), Positives = 31/56 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A+   A L+K+     NF+ AN   A++  ++ S +   GA L+ A   +ANF+G+
Sbjct: 635 AKLNLASLKKSDFTGSNFKGANLEGANLEGANLSKADLKGANLKSASINQANFSGS 690


>gi|428314067|ref|YP_007125044.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255679|gb|AFZ21638.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 745

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 5/56 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S+A+  SADLR+ V        A+ T AD+ E+ F+ +  +GA L K  A +++FT
Sbjct: 564 SSAKLISADLRQGV-----LENASLTGADLGEAKFARANLHGARLGKVKAVRSDFT 614


>gi|282900610|ref|ZP_06308552.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281194410|gb|EFA69365.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 167

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 25/41 (60%)

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + + R ++FT A++R+SDFSGS   G     A    ANFTG
Sbjct: 49  QRDLRDSSFTKANLRQSDFSGSNLTGVSFFAANLESANFTG 89


>gi|448736468|ref|ZP_21718581.1| Ion transport 2 domain-containing protein [Halococcus thailandensis
           JCM 13552]
 gi|445806103|gb|EMA56272.1| Ion transport 2 domain-containing protein [Halococcus thailandensis
           JCM 13552]
          Length = 345

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 27/55 (49%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
            A    ADLR+A   + + RRA F  AD+  + F  +    A L +A  Y+  FT
Sbjct: 90  GADLSGADLRRATFDRVDARRARFDGADVEGATFENADLRDASLNRAKLYRTGFT 144


>gi|218438459|ref|YP_002376788.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218171187|gb|ACK69920.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 295

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 10/60 (16%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + A F  A L   + ++ +   ANFT+ D+RE +F GS  N          +ANFT ++I
Sbjct: 218 TKANFERATLWGILFIESDLTEANFTNCDIREVNFEGSNLN----------RANFTNSII 267


>gi|113476307|ref|YP_722368.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110167355|gb|ABG51895.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 225

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A F   +L+ A     N    NF  AD+  ++ SG+   GA LEKA  Y+A+ 
Sbjct: 52  ANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADI 104


>gi|428304926|ref|YP_007141751.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428246461|gb|AFZ12241.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 329

 Score = 37.0 bits (84), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           AN   A MR +D +G+   GAYLE    ++AN TG ++
Sbjct: 285 ANLNVAAMRGADLTGASMRGAYLEATDWHQANLTGAIM 322


>gi|332712234|ref|ZP_08432162.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332349040|gb|EGJ28652.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 280

 Score = 37.0 bits (84), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 27/57 (47%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + A F  ADL +A     N   A+F  AD+  +D SG+   GA L       +N TG
Sbjct: 179 TGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNTDLKGSNLTG 235


>gi|194336315|ref|YP_002018109.1| pentapeptide repeat-containing protein [Pelodictyon
           phaeoclathratiforme BU-1]
 gi|194308792|gb|ACF43492.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 441

 Score = 37.0 bits (84), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S AQ   ADL +   V  N R A+ + A +  +D + +  NGA+L  A   KAN   T
Sbjct: 34  SGAQLNKADLSRTDLVGANLRGADLSGAQLNMADLNRADLNGAHLYNANFGKANLIKT 91


>gi|448677922|ref|ZP_21689112.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
           DSM 12282]
 gi|445773597|gb|EMA24630.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
           DSM 12282]
          Length = 428

 Score = 36.6 bits (83), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +A  F +  LR A     +  +AN +SAD+RE+D SG+    A L  A   KA+ +G
Sbjct: 49  NAISFENTGLRGADLSDADLGKANLSSADLREADLSGADLGSADLSGANLQKADLSG 105


>gi|86606624|ref|YP_475387.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86555166|gb|ABD00124.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 371

 Score = 36.6 bits (83), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 33/61 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
             A F  A+LRKA     NF  A+   AD+R+++  G+K +GA L+ A    A+  G  +
Sbjct: 150 GGANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGAVLQGADLRGADLRGAKV 209

Query: 169 A 169
           +
Sbjct: 210 S 210


>gi|126656212|ref|ZP_01727596.1| hypothetical protein CY0110_03979 [Cyanothece sp. CCY0110]
 gi|126622492|gb|EAZ93198.1| hypothetical protein CY0110_03979 [Cyanothece sp. CCY0110]
          Length = 261

 Score = 36.6 bits (83), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 5/67 (7%)

Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A   +  L+KA+  + N +     +ANF  A +RE +   S  N A L+KA+   +N  G
Sbjct: 44  AYLTNTSLKKAILTQANLQNCYLNKANFEQAKLREVNLQNSYLNQANLDKAILDNSNLRG 103

Query: 166 TLIATEH 172
             +   H
Sbjct: 104 AYLTDAH 110


>gi|428205595|ref|YP_007089948.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428007516|gb|AFY86079.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 330

 Score = 36.6 bits (83), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           ++A+ RG    G       ADLR A  V+ N R AN   A++ E++ + +  + A L++A
Sbjct: 154 FQADLRGANMAG--VDLSGADLRCANLVEVNLRGANLYGANLSEANLADAFLSDANLDRA 211

Query: 157 VAYKANFTGT 166
           +  +AN + T
Sbjct: 212 ILREANLSNT 221


>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
 gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
          Length = 261

 Score = 36.6 bits (83), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 28/53 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A F S  L  A   K +    NFT AD++ +DFSG++ N A L  A+   A F
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATF 167


>gi|119487879|ref|ZP_01621376.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
 gi|119455455|gb|EAW36593.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
          Length = 514

 Score = 36.6 bits (83), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 4/102 (3%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
           W+V  STA +   V +  + + AL DLN       G   I S A     +L  A  V+ N
Sbjct: 113 WQVVDSTATSG--VFASRARLKALQDLNNEGVSLDG-LDI-SQAYLKEINLSGANLVEAN 168

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
              AN   A +  ++ SG+   GA L+ A  ++ NF G  +A
Sbjct: 169 LEGANLQGASLSHANLSGANLQGADLQGANLHETNFQGANLA 210



 Score = 36.2 bits (82), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 10/98 (10%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ---FGSADLRKAVHVKE------ 126
           L  A+++  + + S LAD N  +A+  G    G+  +      A L +  H++E      
Sbjct: 265 LKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLRE 324

Query: 127 -NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            N + AN T A++RE +  G+    A L++A+   AN 
Sbjct: 325 ANLKGANLTRANLREVNLQGANLQQANLQQAILQGANL 362


>gi|163797895|ref|ZP_02191839.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
 gi|159176857|gb|EDP61425.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
          Length = 396

 Score = 36.6 bits (83), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 5/57 (8%)

Query: 114 GSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 165
           G+AD + A     N    +FT AD+RE DF+     G++F GA L +AV   A+ +G
Sbjct: 15  GAADGQPASFANANLFGFDFTGADLREVDFAGASLQGARFVGADLTRAVLVGADLSG 71


>gi|108762763|ref|YP_635370.1| pentapeptide repeat-containing protein [Myxococcus xanthus DK 1622]
 gi|108466643|gb|ABF91828.1| pentapeptide repeat domain protein [Myxococcus xanthus DK 1622]
          Length = 203

 Score = 36.6 bits (83), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    A LR A+  + N  RA+F  AD++ +D  G+   GAYL  A    AN 
Sbjct: 87  SKANLDYALLRGAILTQVNALRASFGEADLQGADLQGADLQGAYLVSANLASANL 141


>gi|427416960|ref|ZP_18907143.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425759673|gb|EKV00526.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 321

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKF 148
           + +Y+    G+    S A F  A+L        N   ANFT+A++++  F+G     S F
Sbjct: 129 ITEYQLTAVGKGANLSRANFKQANLTGTDFTGCNLEGANFTAANLKDCKFTGTNLAKSSF 188

Query: 149 NGAYLEKAVAYKANFTG 165
           NGA L  A+   AN +G
Sbjct: 189 NGADLSNAILTGANLSG 205


>gi|268325885|emb|CBH39473.1| conserved hypothetical protein [uncultured archaeon]
          Length = 358

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + A      L KA   K + R A    AD+R +D S +K NGA L  A  Y A+ +G
Sbjct: 258 TGASLNGGKLYKAKLRKADLRGAKMYKADLRWADLSSTKLNGADLTDADLYGADLSG 314


>gi|162456753|ref|YP_001619120.1| pentapeptide repeat-containing protein [Sorangium cellulosum So
           ce56]
 gi|161167335|emb|CAN98640.1| pentapeptide repeats hypothetical protein [Sorangium cellulosum So
           ce56]
          Length = 831

 Score = 36.6 bits (83), Expect = 3.5,   Method: Composition-based stats.
 Identities = 24/71 (33%), Positives = 31/71 (43%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A  RG+  I SAA    ADL  A  V  +F  AN   A     DF  ++F+ A L  A  
Sbjct: 710 ARARGDGAILSAANLAGADLEDARFVGASFAGANLRDAKADRGDFMSARFDRADLGAASF 769

Query: 159 YKANFTGTLIA 169
            K      ++A
Sbjct: 770 CKTRLVAAVLA 780


>gi|332710578|ref|ZP_08430523.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332350633|gb|EGJ30228.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 185

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 162
           G+ A F +A+L  A     N  RA+F+ A     +   +D SGS F GA L     +KAN
Sbjct: 88  GTGATFRNANLDSAYATGANMSRADFSGASVVWANFISADLSGSSFRGADLSNTTFFKAN 147

Query: 163 FTG 165
             G
Sbjct: 148 LNG 150


>gi|150016367|ref|YP_001308621.1| pentapeptide repeat-containing protein [Clostridium beijerinckii
            NCIMB 8052]
 gi|149902832|gb|ABR33665.1| pentapeptide repeat protein [Clostridium beijerinckii NCIMB 8052]
          Length = 1084

 Score = 36.6 bits (83), Expect = 3.5,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 113  FGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
            F  A+L  AV  + NF+ A F +      D+ E+D +G+  + A L  A   KA F GT
Sbjct: 991  FNHANLSSAVMRESNFKNATFINTCLRNVDLEEADLTGADMSNANLSNAKINKAIFEGT 1049


>gi|166363932|ref|YP_001656205.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
           NIES-843]
 gi|166086305|dbj|BAG01013.1| pentapeptide repeat family protein [Microcystis aeruginosa
           NIES-843]
          Length = 164

 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 17/57 (29%), Positives = 33/57 (57%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF
Sbjct: 50  VLNGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANF 106


>gi|443324425|ref|ZP_21053179.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442795970|gb|ELS05303.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 305

 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 18/93 (19%)

Query: 91  LADLNKYEAETRGEFGIGS------------------AAQFGSADLRKAVHVKENFRRAN 132
           L++ N YEAE    FG  +                   A F  A+L K      N  RAN
Sbjct: 173 LSNANLYEAELLNIFGYKTNFCRVQAIATHMSRAYLFQANFSEAELIKIDLRWANCDRAN 232

Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           F +A+++++D  G+  N A L++A   +AN  G
Sbjct: 233 FRNANLQQADLRGTNLNQADLKQANLTRANLRG 265


>gi|359458687|ref|ZP_09247250.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 203

 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 10/66 (15%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAY 159
            A F SADLRKA   + + R A    AD+R           ++ SG+  +GA L  A+ Y
Sbjct: 52  GANFASADLRKAKLFRADLRAACLYRADLRGANLKGANLFGANLSGANLSGANLSNAMLY 111

Query: 160 KANFTG 165
            AN  G
Sbjct: 112 CANLGG 117


>gi|425473009|ref|ZP_18851753.1| Genome sequencing data, contig C314 [Microcystis aeruginosa PCC
           9701]
 gi|389880711|emb|CCI38594.1| Genome sequencing data, contig C314 [Microcystis aeruginosa PCC
           9701]
          Length = 453

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 5/65 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 163
           + A    A+L +A  +  N   AN   A++       ++ +G+  NGAYL  A+ Y AN 
Sbjct: 319 NGANLKGANLNEANLIGANLNEANLIGANLNGAILYRANLNGANLNGAYLNGAILYGANL 378

Query: 164 TGTLI 168
            G ++
Sbjct: 379 YGAIL 383


>gi|119493270|ref|ZP_01624110.1| hypothetical protein L8106_30600 [Lyngbya sp. PCC 8106]
 gi|119452743|gb|EAW33921.1| hypothetical protein L8106_30600 [Lyngbya sp. PCC 8106]
          Length = 332

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 30/56 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    A+L++A  V  N R AN   A+++ ++ +GS   GA L  A+   +N T
Sbjct: 62  SGADLEEANLQEANLVNANLRNANLKKANLQNANLTGSDLRGADLSFAILKGSNLT 117


>gi|428218432|ref|YP_007102897.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427990214|gb|AFY70469.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 403

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A   S +++ A+ V+ N   AN  +A++  ++   +  NGA L +A   +AN +G
Sbjct: 302 SGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSG 358


>gi|443328213|ref|ZP_21056814.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442792183|gb|ELS01669.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 281

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A    ADL  A  V  N  +AN T A++  ++ +G+   GA L  A+   A  TGT
Sbjct: 72  ANLAGADLTGANLVNANLSQANLTGANLTGANLTGASLFGANLSGAILTDATLTGT 127


>gi|298241513|ref|ZP_06965320.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297554567|gb|EFH88431.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 413

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 2/59 (3%)

Query: 98  EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           E E +G    GS  Q   ADLRKA     +F  AN   AD+ +++  G+ F GA LE A
Sbjct: 256 EVEAKGANFTGS--QLAGADLRKANLQGASFLGANLRGADLSQANLEGAVFVGAQLEGA 312


>gi|428209141|ref|YP_007093494.1| serine/threonine protein kinase [Chroococcidiopsis thermalis PCC
           7203]
 gi|428011062|gb|AFY89625.1| serine/threonine protein kinase [Chroococcidiopsis thermalis PCC
           7203]
          Length = 535

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
           G+FG    +Q   A+LR AV  K      + + AD+R +D SG+  + A L  A    AN
Sbjct: 452 GDFGQACLSQ---ANLRNAVLTKAYMSYTDLSGADLRGADLSGAYLSNANLRGANLCGAN 508

Query: 163 FTGTLIATEH 172
            TG  ++ + 
Sbjct: 509 LTGATLSDDQ 518


>gi|254422357|ref|ZP_05036075.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196189846|gb|EDX84810.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 346

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 37/65 (56%), Gaps = 5/65 (7%)

Query: 109 SAAQFGSADLRK-----AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A+  +A+LR      AV  + + R+A  +  D+R SD + +  +GA L +A  +KA+ 
Sbjct: 66  SGAKLDNANLRNSILSGAVLHRVSLRKARMSGIDLRNSDLNEADLSGANLSQAKLHKASL 125

Query: 164 TGTLI 168
           +G ++
Sbjct: 126 SGAIL 130


>gi|88857428|ref|ZP_01132071.1| hypothetical protein PTD2_02671 [Pseudoalteromonas tunicata D2]
 gi|88820625|gb|EAR30437.1| hypothetical protein PTD2_02671 [Pseudoalteromonas tunicata D2]
          Length = 966

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 27/59 (45%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           I    QF  A LR A+ +K +   AN  +AD+R   F G+         A   K +F+G
Sbjct: 892 IAQHTQFIEAKLRNALFLKADLFEANLMNADLRSGQFKGANLYSVSFLNATIGKTDFSG 950


>gi|425439840|ref|ZP_18820154.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9717]
 gi|389719844|emb|CCH96379.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9717]
          Length = 225

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF
Sbjct: 113 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANF 167


>gi|428778554|ref|YP_007170340.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
 gi|428692833|gb|AFZ48983.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
          Length = 256

 Score = 36.6 bits (83), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A    A L++A     N  RA+ T AD+R ++ SG+  +GA L  A    AN   T
Sbjct: 51  ADLAGAQLQRANLTNANLSRADLTGADLRGANLSGASLHGADLRGANLSGANLVAT 106


>gi|428316248|ref|YP_007114130.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428239928|gb|AFZ05714.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 388

 Score = 36.6 bits (83), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 163
           + A   SA L  A+ V  N    +F +AD+   + SG+  NGA L +A     V   ANF
Sbjct: 307 TGANLSSAYLPNAILVNANLTNTDFKNADLSGVNLSGANLNGADLSRADLKNTVVKNANF 366

Query: 164 TGTLIATE 171
           +G L  +E
Sbjct: 367 SGCLGISE 374


>gi|427734465|ref|YP_007054009.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369506|gb|AFY53462.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 269

 Score = 36.6 bits (83), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 28/56 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A  G ADLR+A     N   AN T A +  ++ SGS  +GA L  A    AN T
Sbjct: 68  SEADLGEADLREANLKGANLTGANLTGATLMNANLSGSNLSGACLSGAKLSGANLT 123


>gi|254501374|ref|ZP_05113525.1| Pentapeptide repeat protein [Labrenzia alexandrii DFL-11]
 gi|222437445|gb|EEE44124.1| Pentapeptide repeat protein [Labrenzia alexandrii DFL-11]
          Length = 296

 Score = 36.6 bits (83), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTL 167
           DL++AV  + NF R++F   +   +DFS S F GA +      K+N   ++
Sbjct: 92  DLKEAVMPRSNFERSDFRRTEAERADFSASDFAGASMRAVDLEKSNLNSSI 142


>gi|374293141|ref|YP_005040176.1| hypothetical protein AZOLI_2775 [Azospirillum lipoferum 4B]
 gi|357425080|emb|CBS87961.1| Conserved protein of unknown function; pentapeptide repeat domains
           [Azospirillum lipoferum 4B]
          Length = 425

 Score = 36.6 bits (83), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           AA F +  L  A   + + R ANF+ AD+R +D +GS   GA L+ A
Sbjct: 166 AADFTNTRLAGARLDRTDLRDANFSGADLRGADLNGSDLRGAILDGA 212


>gi|334117108|ref|ZP_08491200.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333461928|gb|EGK90533.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 509

 Score = 36.6 bits (83), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 9/74 (12%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADL K E          S     +AD+R+A   + N   AN + A+++ +D +G+  NGA
Sbjct: 171 ADLTKAEL---------SGVNLSNADMRQASLQQVNLSSANLSGANLKWADLTGANLNGA 221

Query: 152 YLEKAVAYKANFTG 165
            L  A    AN  G
Sbjct: 222 DLSFAKLSGANLNG 235


>gi|291569916|dbj|BAI92188.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 340

 Score = 36.6 bits (83), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 2/70 (2%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
            EA   G F  G+   +  A LR +  ++ +  +A+ + A + +++  G+K +GA +   
Sbjct: 65  LEANLNGAFLYGTNLNY--AKLRDSCLIEADLTKADLSGAQLHKANLMGAKLSGAVMSWV 122

Query: 157 VAYKANFTGT 166
             Y+ANF G 
Sbjct: 123 TLYRANFPGV 132



 Score = 36.2 bits (82), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 39/87 (44%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 249 FIKANLMKADLEECDLINADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 301

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L+ A+
Sbjct: 302 ANLENADLRDASFRHATLNGAMLQDAI 328


>gi|428224795|ref|YP_007108892.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984696|gb|AFY65840.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 284

 Score = 36.6 bits (83), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 26/52 (50%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
            A    ADL +A     N RRANFT+A MR +    S   GA + +   Y+A
Sbjct: 184 GANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRA 235


>gi|374723788|gb|EHR75868.1| Pentapeptide repeats containing protein [uncultured marine group II
           euryarchaeote]
          Length = 148

 Score = 36.6 bits (83), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 23/48 (47%)

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           LRK  H   NFRR     AD+ E DFS   F  A + +    K+ F G
Sbjct: 35  LRKGRHAGSNFRRGILDGADLTEGDFSNCDFRKASMYEVDLMKSAFDG 82


>gi|163797086|ref|ZP_02191041.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
 gi|159177602|gb|EDP62155.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
          Length = 421

 Score = 36.2 bits (82), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           A F  ADLR +V    +  +A F++A + + DF+G+K  GA L  A
Sbjct: 51  ALFAGADLRGSVFAGGHLEQAQFSTARLEQVDFAGAKLMGANLRGA 96


>gi|255083653|ref|XP_002508401.1| predicted protein [Micromonas sp. RCC299]
 gi|226523678|gb|ACO69659.1| predicted protein [Micromonas sp. RCC299]
          Length = 187

 Score = 36.2 bits (82), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 120 KAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           KA H+ E+F   +     +T  D+R SDFSGS    A   +AV    N  G
Sbjct: 30  KAEHINEDFSHEDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAVMPGVNLEG 80


>gi|166364324|ref|YP_001656597.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
           NIES-843]
 gi|166086697|dbj|BAG01405.1| pentapeptide repeat family protein [Microcystis aeruginosa
           NIES-843]
          Length = 330

 Score = 36.2 bits (82), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 5/59 (8%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           I +  +F  A+L +A     NF+ A    AD+R ++ SG  F+ AYL  A   +AN TG
Sbjct: 242 IMTQIKFERANLSQA-----NFQAARMNHADLRRANLSGVNFSEAYLVDAFLARANLTG 295


>gi|159045175|ref|YP_001533969.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
 gi|157912935|gb|ABV94368.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
          Length = 245

 Score = 36.2 bits (82), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 5/62 (8%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 165
           A  G ADL  A     NF  A    A +RE+D +G++  GA L +     AV   A F+G
Sbjct: 167 ADLGGADLSGAFLEGANFGNARLVGAVLREADLTGARLTGADLSEADLTGAVTQAAGFSG 226

Query: 166 TL 167
            +
Sbjct: 227 AV 228


>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 443

 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 5/61 (8%)

Query: 109 SAAQFGSADLRKAVHVKEN-----FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           ++ +F  ADLR+A  V  N     F  AN +  ++  +D SG+  +GAYL  A  Y A+ 
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378

Query: 164 T 164
           +
Sbjct: 379 S 379


>gi|302831317|ref|XP_002947224.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
           nagariensis]
 gi|300267631|gb|EFJ51814.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
           nagariensis]
          Length = 244

 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 24/50 (48%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A L KA  VK NF  A+ T+A +   DFSG+   G      V   A F G
Sbjct: 135 AVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTVVTGAQFAG 184


>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 740

 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A     +LR A     N   A+   AD+R +D  G+ F GA L +A  Y+AN T
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANIT 633


>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
 gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
          Length = 268

 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 35/83 (42%), Gaps = 30/83 (36%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF---------------------- 148
           A F  A+LR A+  + N    +F+SAD+R++D +G+K                       
Sbjct: 114 ADFSGANLRGAIVTEANLIGTDFSSADLRDADLAGAKLIRSNLCFANLIAANFIAVDFSE 173

Query: 149 --------NGAYLEKAVAYKANF 163
                    GAYL KA  YKAN 
Sbjct: 174 ANLYQAEVMGAYLYKANFYKANL 196



 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 10/68 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 158
           S A   SA+L  A   + N   ANF          T AD+  + F G+ F+GA L  A+ 
Sbjct: 67  SGADLSSANLHHAKLSEANLSAANFSVANLSQSVLTHADLSHAHFIGADFSGANLRGAIV 126

Query: 159 YKANFTGT 166
            +AN  GT
Sbjct: 127 TEANLIGT 134


>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 740

 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A     +LR A     N   A+   AD+R +D  G+ F GA L +A  Y+AN T
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANIT 633


>gi|451980423|ref|ZP_21928815.1| conserved hypothetical protein, contains pentapeptide repeats
           [Nitrospina gracilis 3/211]
 gi|451762323|emb|CCQ90046.1| conserved hypothetical protein, contains pentapeptide repeats
           [Nitrospina gracilis 3/211]
          Length = 289

 Score = 36.2 bits (82), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 31/64 (48%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A+F  A L++A     N  R+ F  A M E++ +G +FN + L  A+    N  G  I
Sbjct: 100 SGAKFHQALLKRAQFEGANLVRSEFLEAQMNEANLAGVRFNKSDLRGAMMIGINLAGAQI 159

Query: 169 ATEH 172
              H
Sbjct: 160 PQSH 163


>gi|428775742|ref|YP_007167529.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
 gi|428690021|gb|AFZ43315.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
          Length = 309

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 16/61 (26%), Positives = 31/61 (50%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTL 167
           G    F  A  +KA+ +K  F  +N T  ++RE++   + ++G  +  +   K NFT  +
Sbjct: 129 GKNINFTQAIFQKAILIKSQFEESNLTQVNLREANLKQANWSGIIIPNSKLQKTNFTEAI 188

Query: 168 I 168
           +
Sbjct: 189 L 189


>gi|425467653|ref|ZP_18846932.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9809]
 gi|389829528|emb|CCI29082.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9809]
          Length = 220

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/57 (29%), Positives = 33/57 (57%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF
Sbjct: 106 VLNGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANF 162


>gi|304393841|ref|ZP_07375766.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
 gi|303294040|gb|EFL88415.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
          Length = 247

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 4/60 (6%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           G+G +   GS    + V    +F   +FT A+M  SDFSGS      + K+   +ANFTG
Sbjct: 109 GVGLSKVEGS----RTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTG 164


>gi|209525619|ref|ZP_03274157.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423065193|ref|ZP_17053983.1| hypothetical protein SPLC1_S230580 [Arthrospira platensis C1]
 gi|209493952|gb|EDZ94269.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406713325|gb|EKD08496.1| hypothetical protein SPLC1_S230580 [Arthrospira platensis C1]
          Length = 333

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 242 FIKANLMKADLQECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 294

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L  A+
Sbjct: 295 ANLENADLRDASFRDATLNGAILNGAI 321


>gi|443310759|ref|ZP_21040400.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
 gi|442779202|gb|ELR89454.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
          Length = 330

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           ++A+ RG    G  A    ADLR A   + N R A+ + AD+  +D S +  N A L+ A
Sbjct: 154 FKADVRGANLAG--ANLSRADLRYANFNEVNLRGADLSCADLSNTDLSYALLNDANLDGA 211

Query: 157 VAYKANFT 164
           +   AN +
Sbjct: 212 ILTGANLS 219


>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 576

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 28/51 (54%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           GI S A    ADL  AV +  +F  AN  SA++  S+ SG+  NGA L  A
Sbjct: 475 GILSEADLTGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLSSA 525


>gi|163793836|ref|ZP_02187810.1| hypothetical protein BAL199_12426 [alpha proteobacterium BAL199]
 gi|159180947|gb|EDP65464.1| hypothetical protein BAL199_12426 [alpha proteobacterium BAL199]
          Length = 227

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S A    ADLR A   +  FR A    A +  +    + F+GA L++AV  +A  TGT
Sbjct: 57  SGADLSGADLRGACLNRSRFRLATLRGAHLDGASIDAACFDGANLDRAVFDRARVTGT 114


>gi|376004329|ref|ZP_09782046.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375327291|emb|CCE17799.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 340

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 249 FIKANLMKADLQECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 301

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L  A+
Sbjct: 302 ANLENADLRDASFRDATLNGAILNGAI 328


>gi|300863988|ref|ZP_07108896.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300338009|emb|CBN54042.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 189

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 44/106 (41%), Gaps = 13/106 (12%)

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
           PYA L    ++  T L AA           L D+  Y+A   G    GS      A+LR+
Sbjct: 63  PYANLSQANLY-KTQLTAA----------QLGDVQLYQANLSGADLEGS--NLSRANLRR 109

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           A     N  RA+   AD+  +D SG+    A L +     A  TGT
Sbjct: 110 ANLQGANLSRASLQGADLYNADLSGADLTYADLSRVNLENAKLTGT 155


>gi|83309857|ref|YP_420121.1| hypothetical protein amb0758 [Magnetospirillum magneticum AMB-1]
 gi|82944698|dbj|BAE49562.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
           AMB-1]
          Length = 164

 Score = 36.2 bits (82), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 48/111 (43%), Gaps = 24/111 (21%)

Query: 63  AKLKNWRV-FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
           AK+  +RV F+S AL  A                      R + G+ S A F  ADL  A
Sbjct: 69  AKVDGYRVRFISAALVGA----------------------RLDDGVFSEADFTKADLGGA 106

Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-GTLIATE 171
              + + RRA F  A +R +D +G++  GA L  A    A +T G  I  E
Sbjct: 107 SLARADLRRARFYHASLRGADLTGARTLGAELLNADLSGARWTDGKTICAE 157


>gi|229489413|ref|ZP_04383276.1| pentapeptide repeat protein [Rhodococcus erythropolis SK121]
 gi|229323510|gb|EEN89268.1| pentapeptide repeat protein [Rhodococcus erythropolis SK121]
          Length = 206

 Score = 36.2 bits (82), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 25/53 (47%), Gaps = 5/53 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKA 156
           S    G ADLRK       FR AN   ADMR      +D SG++  G  LE A
Sbjct: 113 SLTSLGGADLRKIDFTSCRFREANLVRADMRGAVLASADLSGARTGGLKLEGA 165


>gi|453067770|ref|ZP_21971056.1| hypothetical protein G418_04063 [Rhodococcus qingshengii BKS 20-40]
 gi|452766713|gb|EME24957.1| hypothetical protein G418_04063 [Rhodococcus qingshengii BKS 20-40]
          Length = 206

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 25/53 (47%), Gaps = 5/53 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKA 156
           S    G ADLRK       FR AN   ADMR      +D SG++  G  LE A
Sbjct: 113 SLTSLGGADLRKIDFTSCRFREANLVRADMRGAVLASADLSGARTGGLKLEGA 165


>gi|409992999|ref|ZP_11276159.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291569979|dbj|BAI92251.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409936146|gb|EKN77650.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 201

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 8/78 (10%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           LAD +  +A+  G       A    A+L +A+ V+ N R AN T A++ ++DF  +   G
Sbjct: 36  LADADLSQAKLMG-------ANLSGANLARAI-VRANLRGANLTGANLIQADFRNADLRG 87

Query: 151 AYLEKAVAYKANFTGTLI 168
           A L      +A F G  +
Sbjct: 88  AILLDTDPREATFAGAFL 105


>gi|425452623|ref|ZP_18832440.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 7941]
 gi|389765493|emb|CCI08619.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 7941]
          Length = 220

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANF 162


>gi|425471163|ref|ZP_18850023.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9701]
 gi|389882952|emb|CCI36586.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9701]
          Length = 220

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF
Sbjct: 108 NGVNLNSANLQQAVLIDADFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANF 162


>gi|316934318|ref|YP_004109300.1| pentapeptide repeat-containing protein [Rhodopseudomonas palustris
           DX-1]
 gi|315602032|gb|ADU44567.1| pentapeptide repeat protein [Rhodopseudomonas palustris DX-1]
          Length = 273

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A    ADL  A     N  RA+ + A++  +D SG+  +GA L +A  + AN +G
Sbjct: 57  SGANLSGADLSGANLSGANLYRADLSGANLSGADLSGANLSGANLYRAKLFSANLSG 113


>gi|310639667|ref|YP_003944425.1| low-complexity protein [Paenibacillus polymyxa SC2]
 gi|386038870|ref|YP_005957824.1| bTB/POZ domain-containing protein KCTD9 [Paenibacillus polymyxa M1]
 gi|309244617|gb|ADO54184.1| Uncharacterized low-complexity protein [Paenibacillus polymyxa SC2]
 gi|343094908|emb|CCC83117.1| bTB/POZ domain-containing protein KCTD9 [Paenibacillus polymyxa M1]
          Length = 430

 Score = 36.2 bits (82), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 32/57 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
             A+F  A LR+AV    + +RA    AD+ ++ F+G+  +GA  + A    A+FTG
Sbjct: 324 QGARFQDAQLREAVFQDASLQRAFLNGADLTDACFAGADVSGASFKGACLIGADFTG 380


>gi|226184599|dbj|BAH32703.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
          Length = 203

 Score = 36.2 bits (82), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 25/53 (47%), Gaps = 5/53 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKA 156
           S    G ADLRK       FR AN   ADMR      +D SG++  G  LE A
Sbjct: 110 SLTSLGGADLRKIDFTSCRFREANLVRADMRGAVLASADLSGARTGGLKLEGA 162


>gi|162453209|ref|YP_001615576.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
 gi|161163791|emb|CAN95096.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
          Length = 890

 Score = 36.2 bits (82), Expect = 4.9,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + A F   DLR        F RA    AD+R +D SG+   GA L KA    AN TG
Sbjct: 577 TGADFRGVDLRGM-----RFARAFLEGADLRGADLSGAVLEGAVLAKADLSGANLTG 628


>gi|425458309|ref|ZP_18837797.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9808]
 gi|389827863|emb|CCI20729.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9808]
          Length = 220

 Score = 36.2 bits (82), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFRGADLNYANLSGSLLYRANF 162


>gi|158340059|ref|YP_001521229.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158310300|gb|ABW31915.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 483

 Score = 36.2 bits (82), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 15/70 (21%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 158
           S +    A+LR A     NFR+AN + AD          + ++D SG+ F+GAYL     
Sbjct: 305 SYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYL----- 359

Query: 159 YKANFTGTLI 168
           YKAN +   +
Sbjct: 360 YKANLSSAFL 369


>gi|443325448|ref|ZP_21054143.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794958|gb|ELS04350.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 308

 Score = 36.2 bits (82), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 30/60 (50%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTL 167
            ++A    ADLR  V    NF +AN    +   +    + F+ A L KA+A + NFT ++
Sbjct: 114 ATSAILTGADLRDVVGTAPNFSQANLEEVNFTNARLEHANFSNASLRKAIAVQTNFTHSI 173


>gi|425467193|ref|ZP_18846477.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389830101|emb|CCI28138.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 330

 Score = 36.2 bits (82), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 5/59 (8%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           I +  +F  A+L +A     NF+ A    AD+R ++ SG  F+ AYL  A   +AN TG
Sbjct: 242 IMTQIKFDRANLSQA-----NFQAARMNHADLRRANLSGVNFSEAYLVDAFFARANLTG 295


>gi|428225078|ref|YP_007109175.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984979|gb|AFY66123.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 303

 Score = 36.2 bits (82), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 92  ADLN-KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           ADL+  + A   G+    + A+   ADL  A   +  F + + T  ++R +DF  +   G
Sbjct: 219 ADLSWAHLARVEGQGSDLTEAKLRGADLSSANFSEAVFLKTDLTKTNLRRADFRLANLTG 278

Query: 151 AYLEKAVAYKANFTGTL 167
           A L  AV Y+  F G++
Sbjct: 279 ADLTDAVLYRTEFAGSI 295


>gi|254456441|ref|ZP_05069870.1| Pentapeptide repeat protein [Candidatus Pelagibacter sp. HTCC7211]
 gi|207083443|gb|EDZ60869.1| Pentapeptide repeat protein [Candidatus Pelagibacter sp. HTCC7211]
          Length = 169

 Score = 36.2 bits (82), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 33/65 (50%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           FG    + F  A+L + V +  NF + NF+ +++   DF G+    A  + +   +ANFT
Sbjct: 74  FGTFPESTFVRANLYETVSIGANFEKTNFSGSNLTRVDFMGATLIEANFQNSNLMEANFT 133

Query: 165 GTLIA 169
            + I 
Sbjct: 134 SSNIT 138


>gi|443477206|ref|ZP_21067069.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443017715|gb|ELS32099.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 167

 Score = 36.2 bits (82), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 34/57 (59%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           QF  +DLR A  V  + +  +F +A+M+E++ +G+  + + L+ A   KAN T  +I
Sbjct: 56  QFNESDLRNASFVNADAQGVSFFAANMKEANLTGANLSYSTLDNARLDKANLTNAVI 112


>gi|39933849|ref|NP_946125.1| pentapeptide repeat-containing protein [Rhodopseudomonas palustris
           CGA009]
 gi|39647696|emb|CAE26216.1| Pentapeptide repeat [Rhodopseudomonas palustris CGA009]
          Length = 437

 Score = 36.2 bits (82), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%)

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
            NF  ++F SADMR  DF+G+ F GA L  A     NF   ++
Sbjct: 28  RNFDCSDFASADMRRVDFTGASFVGADLTAAELQGGNFVNAIL 70


>gi|427702733|ref|YP_007045955.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427345901|gb|AFY28614.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 247

 Score = 36.2 bits (82), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 5/49 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           +AA F  ADLR A     NF  A+ T AD+R +   G++F+GA L + +
Sbjct: 187 TAADFRGADLRGA-----NFSGADLTQADLRGALLDGARFHGAVLSRTL 230


>gi|209525586|ref|ZP_03274124.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376005449|ref|ZP_09782952.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423065230|ref|ZP_17054020.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493919|gb|EDZ94236.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375326163|emb|CCE18705.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406713362|gb|EKD08533.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 210

 Score = 36.2 bits (82), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 8/78 (10%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           LAD +  +A+  G       A    A+L +A+ V+ N R AN T A++ ++DF  +   G
Sbjct: 37  LADADLSQAKLMG-------ANLSGANLARAI-VRANLRGANLTGANLIQADFRNADLRG 88

Query: 151 AYLEKAVAYKANFTGTLI 168
           A L      +A F G  +
Sbjct: 89  AILLDTDPREATFAGAFL 106


>gi|383752519|ref|YP_005427619.1| hypothetical protein RTTH1527_02660 [Rickettsia typhi str. TH1527]
 gi|383843354|ref|YP_005423857.1| hypothetical protein RTB9991CWPP_02660 [Rickettsia typhi str.
           B9991CWPP]
 gi|380759162|gb|AFE54397.1| hypothetical protein RTTH1527_02660 [Rickettsia typhi str. TH1527]
 gi|380760001|gb|AFE55235.1| hypothetical protein RTB9991CWPP_02660 [Rickettsia typhi str.
           B9991CWPP]
          Length = 585

 Score = 36.2 bits (82), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 38/62 (61%), Gaps = 6/62 (9%)

Query: 112 QFGSADLR--KAVHVK---ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           +FGS +L+  K + +K   E+    +FTS ++  +DFSGS  + A L  AV  ++NFT +
Sbjct: 55  KFGS-NLKGVKLIGIKLTNEDLSGIDFTSCEILRTDFSGSNLDKAILTNAVIQESNFTDS 113

Query: 167 LI 168
           +I
Sbjct: 114 VI 115


>gi|193212588|ref|YP_001998541.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193086065|gb|ACF11341.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 430

 Score = 36.2 bits (82), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 9/90 (10%)

Query: 82  ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES 141
           A  ++ + A ADL++ + E          A F   DL +A     N R A+  SA++++ 
Sbjct: 326 APLANRVLAYADLHEADLEK---------ASFKRTDLDEADFRGANLRGADLRSANLQQV 376

Query: 142 DFSGSKFNGAYLEKAVAYKANFTGTLIATE 171
           DF  +   GA L  A   ++ F G +++ E
Sbjct: 377 DFRQADLRGANLWLANTSRSKFDGAIVSPE 406


>gi|68171987|ref|ZP_00545289.1| Pentapeptide repeat [Ehrlichia chaffeensis str. Sapulpa]
 gi|67998589|gb|EAM85340.1| Pentapeptide repeat [Ehrlichia chaffeensis str. Sapulpa]
          Length = 435

 Score = 36.2 bits (82), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 102 RGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
           + EFG   S A F   DLR +V    N   ANFT A++  S F  S   GA    A   K
Sbjct: 83  KKEFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSK 142

Query: 161 AN 162
           +N
Sbjct: 143 SN 144


>gi|440682098|ref|YP_007156893.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428679217|gb|AFZ57983.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 330

 Score = 35.8 bits (81), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 7/90 (7%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
           N  +     L  A ++  + N ++L   N ++A+  G       A    ADLR A     
Sbjct: 53  NRAILDRANLICAKLSGANLNQASLISTNLHDADLHG-------ASLQGADLRNANLTLA 105

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           + R  N   AD+R +D SG+   GA L  A
Sbjct: 106 DLRDVNLMDADLRGADLSGANLKGACLRGA 135


>gi|227498348|ref|ZP_03928498.1| pentapeptide repeat protein [Acidaminococcus sp. D21]
 gi|352685375|ref|YP_004897360.1| pentapeptide repeat-containing protein [Acidaminococcus intestini
           RyC-MR95]
 gi|226903810|gb|EEH89728.1| pentapeptide repeat protein [Acidaminococcus sp. D21]
 gi|350280030|gb|AEQ23220.1| pentapeptide repeat protein [Acidaminococcus intestini RyC-MR95]
          Length = 250

 Score = 35.8 bits (81), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 95  NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           N Y A+ R    ++  G+ A F SA+L ++   + NF  ANFTSA++ +     ++F  A
Sbjct: 138 NLYTADLRESNFDYASGAMANFYSANLARSWFFRSNFMSANFTSANLYD-----ARFRRA 192

Query: 152 YLEKAVAYKANFTGTLI 168
            L +A+   AN T   +
Sbjct: 193 NLSEALLRSANLTSATV 209


>gi|218248608|ref|YP_002373979.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218169086|gb|ACK67823.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 152

 Score = 35.8 bits (81), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 10/67 (14%)

Query: 112 QFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYLEKAVAYKA 161
            F   DLR A+    N R +NF+          SA++  ++F G+   GA LE A   + 
Sbjct: 27  DFSGQDLRDALFDHANLRGSNFSHANLQGVRFFSANLEGANFEGADLRGADLESARLTRV 86

Query: 162 NFTGTLI 168
           NFT  L+
Sbjct: 87  NFTNALL 93


>gi|254409607|ref|ZP_05023388.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183604|gb|EDX78587.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 278

 Score = 35.8 bits (81), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 49/111 (44%), Gaps = 28/111 (25%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEF--------------GIGSA----AQFGSAD 117
           L+ A +     N + L D N +EAE  G +               +G A    A  G+AD
Sbjct: 156 LSDASLIHAHLNRANLTDANLHEAELIGAYLYQADLERANLSQAHLGGAYLFGANLGAAD 215

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           L        + R AN T A+++ ++  G++  GA+L      KAN TG ++
Sbjct: 216 LEGT-----DLRWANITGANLQAANLKGARLEGAHLN-----KANLTGAVL 256


>gi|254409899|ref|ZP_05023679.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182935|gb|EDX77919.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 478

 Score = 35.8 bits (81), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 39/89 (43%), Gaps = 18/89 (20%)

Query: 95  NKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF---------------TSA 136
           N  EA  RG F  G+    A   +ADL ++     NFR A F               + A
Sbjct: 141 NLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSGA 200

Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           ++R +D SG+  + A L +A    AN TG
Sbjct: 201 NLRWTDLSGANLSWANLSEAKLSGANLTG 229


>gi|386016243|ref|YP_005934529.1| hypothetical protein PAJ_1653 [Pantoea ananatis AJ13355]
 gi|327394311|dbj|BAK11733.1| hypothetical protein PAJ_1653 [Pantoea ananatis AJ13355]
          Length = 846

 Score = 35.8 bits (81), Expect = 5.6,   Method: Composition-based stats.
 Identities = 34/147 (23%), Positives = 60/147 (40%), Gaps = 13/147 (8%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           LSK  ++   +     +  +   CS  +    +A         S  L  AV +  S N +
Sbjct: 670 LSKTTFIKSTLEQAVFNRAELESCSWVETQADHATFSG-----SIWLTCAVASGSSLNDA 724

Query: 90  ALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRAN-----FTSADMRES 141
                   ++  R       + + A+  ++DL +A     NF++AN     F   D R++
Sbjct: 725 DFTHATLRQSNLRQTPLNAAVFTQAKLDNSDLSEASCKGANFQQANLAGSLFVRTDFRDA 784

Query: 142 DFSGSKFNGAYLEKAVAYKANFTGTLI 168
           DF+ +   GA L+K+    A F GT +
Sbjct: 785 DFTDANLMGAILQKSQLGGACFRGTTL 811


>gi|300867251|ref|ZP_07111911.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334728|emb|CBN57077.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 520

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 26/55 (47%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A    A LR +  V  N  RAN   AD+  +D  G   + A L +A   +AN +G
Sbjct: 140 ADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSG 194


>gi|409992244|ref|ZP_11275446.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409936908|gb|EKN78370.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 264

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           I        A+LR+      NF RAN ++  + +++ S + F+ A L  A+ Y+ANF  T
Sbjct: 120 IARRINLNGANLRRG-----NFTRANLSAVSLNQANLSYANFHEAVLINAIGYQANFHYT 174

Query: 167 LIATEH 172
            +   H
Sbjct: 175 NLVNSH 180


>gi|359457318|ref|ZP_09245881.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 510

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 91  LADLNKYEAE-TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           L D+N   A  +R    +   S A    +DL  A   + N  R NF+ AD+ ++D S ++
Sbjct: 41  LQDINLINANLSRANLSLANLSGAFLAGSDLSNAFLSEANLSRVNFSRADLTKADLSFAR 100

Query: 148 FNGAYLEKAVAYKA 161
             GA L +A  Y+A
Sbjct: 101 LQGATLIEANLYQA 114


>gi|334145352|ref|YP_004538562.1| pentapeptide repeat-containing protein [Novosphingobium sp. PP1Y]
 gi|333937236|emb|CCA90595.1| pentapeptide repeat-containing protein [Novosphingobium sp. PP1Y]
          Length = 228

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 53/131 (40%), Gaps = 26/131 (19%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWR----VFVSTALAAAVVASCS 85
           L    +VAC  ++ T            +C    A+L   R     F  T L  A++A  S
Sbjct: 85  LGDARFVACDFNNATFKRANLQSARFERCKLTGAELSELRGIDIAFEETLLVNAILAGHS 144

Query: 86  SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
                 A+L + +              F  ADLRK      +FR+A+FT   +RE+   G
Sbjct: 145 FR---RANLKRTD--------------FSQADLRKC-----DFRQAHFTECSLREASMEG 182

Query: 146 SKFNGAYLEKA 156
           ++F GA L  A
Sbjct: 183 ARFEGADLRGA 193


>gi|172039494|ref|YP_001805995.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354552240|ref|ZP_08971548.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171700948|gb|ACB53929.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353555562|gb|EHC24950.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 320

 Score = 35.8 bits (81), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 10/66 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRE----------SDFSGSKFNGAYLEKAVA 158
           S      ADLR       NF RA+ + AD+RE          +D S SK +   L++A  
Sbjct: 138 SRVDLSEADLRGVDLSGANFSRADLSGADLREVDLTNANLYKADISDSKLHNIDLQEAFL 197

Query: 159 YKANFT 164
            KANF+
Sbjct: 198 QKANFS 203


>gi|427415347|ref|ZP_18905532.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425756112|gb|EKU96971.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 358

 Score = 35.8 bits (81), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 8/96 (8%)

Query: 73  STALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR 129
           ST L+ A ++      + L D + +    R     GS   +A   SADLR A     N +
Sbjct: 235 STDLSGADLSGAELAGADLRDADLWSTNLRSALLWGSNLRSANLRSADLRNA-----NLK 289

Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
            AN  SAD+R+++  G+    A L +A    A+ +G
Sbjct: 290 DANLRSADLRDANLKGADLAAANLWRANLESADLSG 325


>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 739

 Score = 35.8 bits (81), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 28/51 (54%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           F SADL ++     N  RAN ++A+++  DF+ ++  GA L  A  Y A  
Sbjct: 610 FSSADLSQSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKL 660


>gi|291569162|dbj|BAI91434.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 264

 Score = 35.8 bits (81), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           I        A+LR+      NF RAN ++  + +++ S + F+ A L  A+ Y+ANF  T
Sbjct: 120 IARRINLNGANLRRG-----NFTRANLSAVSLNQANLSYANFHEAVLINAIGYQANFHYT 174

Query: 167 LIATEH 172
            +   H
Sbjct: 175 NLVNSH 180


>gi|209526319|ref|ZP_03274848.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376001485|ref|ZP_09779353.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423062694|ref|ZP_17051484.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493248|gb|EDZ93574.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375330094|emb|CCE15106.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406715650|gb|EKD10803.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 390

 Score = 35.8 bits (81), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 30/48 (62%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           ADL +A+ +K N  +A+ +SA++ +S+   + F  AYL KA   +A+ 
Sbjct: 117 ADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADL 164


>gi|67924929|ref|ZP_00518320.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
 gi|67853235|gb|EAM48603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
          Length = 366

 Score = 35.8 bits (81), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 29/59 (49%), Gaps = 5/59 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
           A +    +L  A     NFR AN T  D+ E     S FSG+  +GAYL  A   +A+F
Sbjct: 246 ATELSGIELSGANLTHSNFRGANLTDVDLSEAILSYSRFSGADLSGAYLGNANLQQADF 304


>gi|119943823|ref|YP_941503.1| pentapeptide repeat-containing protein [Psychromonas ingrahamii 37]
 gi|119862427|gb|ABM01904.1| pentapeptide repeat protein [Psychromonas ingrahamii 37]
          Length = 976

 Score = 35.8 bits (81), Expect = 5.9,   Method: Composition-based stats.
 Identities = 37/155 (23%), Positives = 58/155 (37%), Gaps = 18/155 (11%)

Query: 32  KPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISAL 91
           KP+      +  T +   F     N      A++KN R      L  A     + N + L
Sbjct: 812 KPILTDASFTGSTLNGTNFVIAELNNSCFERARMKNTRFVGGCLLNEACFNFATINETNL 871

Query: 92  AD--LNKYEAE----TRGEFG------------IGSAAQFGSADLRKAVHVKENFRRANF 133
            D  LN  +      ++ +FG            I    QF  ++L  +   K +F  AN 
Sbjct: 872 RDCQLNNCDFSDADISKSDFGESSIKNSQFNCTIARQVQFIDSNLNGSQFKKADFMEANL 931

Query: 134 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
             AD+R  +FSG+   GA    A     +F G ++
Sbjct: 932 MQADIRGCNFSGANLYGASFLNATLGSTSFYGAIL 966


>gi|434388295|ref|YP_007098906.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428019285|gb|AFY95379.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 384

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 29/54 (53%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
            G+ +L  A+ +  N R A  + A++ ++D SG+   G  LE A   KA+F  T
Sbjct: 290 LGNTNLSGAILIGANLRGATLSKANLTKADLSGADLAGVNLEGANLNKADFRKT 343


>gi|422302315|ref|ZP_16389678.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389788490|emb|CCI15806.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 354

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 15/112 (13%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRG---------EFGIGSAAQF 113
           A+L   R F    L AA ++  S  ++     N Y+A  RG         E   GS A F
Sbjct: 219 AELDPLRDFTGANLLAAELSGISLGMA-----NLYQANLRGANLTDADLSEIN-GSHASF 272

Query: 114 GSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
             ADL  A+    +   A+F  + +  ++  GS   GA L +    +ANF+G
Sbjct: 273 KGADLSGALLANADLSYADFYRSSLALANLIGSNLEGANLVEVNITQANFSG 324


>gi|338734131|ref|YP_004672604.1| hypothetical protein SNE_A22360 [Simkania negevensis Z]
 gi|336483514|emb|CCB90113.1| hypothetical protein SNE_A22360 [Simkania negevensis Z]
          Length = 138

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 39/87 (44%), Gaps = 6/87 (6%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADM 138
           +V  C+S+     D+ K E   R    +  + A  G+ DL+       N R AN T  ++
Sbjct: 2   LVGGCASH-----DITKAEKGQRHLQNVNLTNADLGNLDLKNVNLSNSNLRSANLTQTNL 56

Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTG 165
             +      F GA+L+KA+   AN  G
Sbjct: 57  TGATLVNVNFQGAFLQKAILTNANCQG 83


>gi|416406325|ref|ZP_11688097.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
 gi|357261078|gb|EHJ10386.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
          Length = 366

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 29/59 (49%), Gaps = 5/59 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
           A +    +L  A     NFR AN T  D+ E     S FSG+  +GAYL  A   +A+F
Sbjct: 246 ATELSGIELSGANLTHSNFRGANLTDVDLSEAILSYSRFSGADLSGAYLGNANLQQADF 304


>gi|148240085|ref|YP_001225472.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
 gi|147848624|emb|CAK24175.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
          Length = 174

 Score = 35.8 bits (81), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKAN 162
           G  A F  A+L+ A+  +  F  ANF  AD     M  +DF+G+    A L   +A  ++
Sbjct: 70  GKGADFSGANLQGAIFTQGAFADANFHGADLSDALMDRADFTGTDLRDAVLIGVIASGSS 129

Query: 163 FTG 165
           F G
Sbjct: 130 FAG 132


>gi|451979949|ref|ZP_21928351.1| hypothetical protein NITGR_130031 [Nitrospina gracilis 3/211]
 gi|451762821|emb|CCQ89565.1| hypothetical protein NITGR_130031 [Nitrospina gracilis 3/211]
          Length = 239

 Score = 35.8 bits (81), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 5/62 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
            A F   DLR+A     N   ANF+SA++ +SDFS +   G     AV + AN  G +  
Sbjct: 46  GADFIETDLREANFSNTNLMWANFSSANLYKSDFSQANLKG-----AVFWGANLNGAMFG 100

Query: 170 TE 171
           ++
Sbjct: 101 SK 102


>gi|443476389|ref|ZP_21066298.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443018640|gb|ELS32855.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 152

 Score = 35.8 bits (81), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 20/80 (25%)

Query: 109 SAAQFGSADLRKA---------VHVKE------NFRRANFTSADM-----RESDFSGSKF 148
           S A+   ADL+KA          ++K+      N R ANFT+AD+     R +D  G+KF
Sbjct: 58  SLAKMRGADLQKANLVGANLIGTYLKDANFQGANLRWANFTNADLENVDFRGADLRGAKF 117

Query: 149 NGAYLEKAVAYKANFTGTLI 168
           N A L+   + K+ F  T++
Sbjct: 118 NAALLKDVNSQKSLFCRTIM 137


>gi|88658408|ref|YP_507868.1| pentapeptide repeat-containing protein [Ehrlichia chaffeensis str.
           Arkansas]
 gi|88599865|gb|ABD45334.1| pentapeptide repeat protein [Ehrlichia chaffeensis str. Arkansas]
          Length = 607

 Score = 35.8 bits (81), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 102 RGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
           + EFG   S A F   DLR +V    N   ANFT A++  S F  S   GA    A   K
Sbjct: 64  KKEFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSK 123

Query: 161 AN 162
           +N
Sbjct: 124 SN 125


>gi|254417634|ref|ZP_05031369.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196175575|gb|EDX70604.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 470

 Score = 35.8 bits (81), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 7/99 (7%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
           N  + V T L  AV+   +  I+ L   N   A+ +G   IG    F  A+L KA     
Sbjct: 277 NGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKG---IG----FNRANLTKANLEGA 329

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           +   A    AD+  +  +G+  + AYL  A    AN +G
Sbjct: 330 DLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSG 368


>gi|428219581|ref|YP_007104046.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427991363|gb|AFY71618.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 508

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 5/63 (7%)

Query: 109 SAAQFGSADLRKAVHV-----KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + A F  A+L +A+       + N  RAN   A+M E++ SG+    A L +A A   NF
Sbjct: 387 AGANFVRANLSRAILSGASLSEANLGRANLYGANMSEANLSGANLENADLSRAQAIATNF 446

Query: 164 TGT 166
           T T
Sbjct: 447 TST 449


>gi|385677695|ref|ZP_10051623.1| pentapeptide repeat-containing protein [Amycolatopsis sp. ATCC
           39116]
          Length = 354

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT-LIATEH 172
           +F  A F SA +  +DFSGS F GA  E A   +++F+G  L   EH
Sbjct: 288 DFTGARFDSAKLGSADFSGSTFTGADFEYADLGRSDFSGADLTGAEH 334


>gi|225631183|ref|ZP_03787884.1| pentapeptide repeat domain protein [Wolbachia endosymbiont of
           Muscidifurax uniraptor]
 gi|225591121|gb|EEH12302.1| pentapeptide repeat domain protein [Wolbachia endosymbiont of
           Muscidifurax uniraptor]
          Length = 601

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           + AN   A+++ESDF+GS  + AYL  ++   +NF  T
Sbjct: 231 KNANIYGAELKESDFTGSNLSAAYLNSSIIINSNFDET 268


>gi|22298427|ref|NP_681674.1| hypothetical protein tlr0884 [Thermosynechococcus elongatus BP-1]
 gi|22294606|dbj|BAC08436.1| tlr0884 [Thermosynechococcus elongatus BP-1]
          Length = 182

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 25/48 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           S A    A LR +   + NF  +N T AD+R  DF  +  NGA  E+A
Sbjct: 103 SGANLSRAILRHSHARRANFSNSNLTGADVRYGDFRRAHLNGATFEQA 150


>gi|189499236|ref|YP_001958706.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           BS1]
 gi|189494677|gb|ACE03225.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
          Length = 442

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL+ +   +  F  A+ T  D R ++  G+   GA LE A+ + AN +
Sbjct: 111 SGANLRGADLKNSYAKEAKFINADLTGTDFRYANLEGADLTGAVLENALFFDANLS 166


>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 710

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 5/65 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + A  GSADL KA   + N  +       F  +D+RES++ G+  +GA   +A   KA+ 
Sbjct: 550 TGADLGSADLSKANLYRANLSKVKAEGTTFQLSDLRESNWQGANLSGANFSRANLKKADL 609

Query: 164 TGTLI 168
           +  L+
Sbjct: 610 SLALL 614


>gi|381206757|ref|ZP_09913828.1| hypothetical protein SclubJA_14172 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 155

 Score = 35.8 bits (81), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 28/56 (50%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
            A    ADLR++   + + R A+   AD+RE+D  G+ F  A L +A      F G
Sbjct: 90  GADLRGADLRRSKLAQADLRGADLRGADLREADLFGADFRDADLREANLEMTAFGG 145


>gi|158340189|ref|YP_001521359.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158310430|gb|ABW32045.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 309

 Score = 35.8 bits (81), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 15/54 (27%), Positives = 30/54 (55%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIA 169
           ADL  A+ +  NF+ +N   + + + DFSG+ F    L ++V +++   G + +
Sbjct: 122 ADLSTAIGLGANFKNSNLQESTLSDGDFSGANFRETKLTRSVGHRSILNGAVFS 175


>gi|428227020|ref|YP_007111117.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427986921|gb|AFY68065.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 166

 Score = 35.8 bits (81), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 5/60 (8%)

Query: 111 AQFGSADLRKAVH-----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  ADLR AV      V  N R  +F+      SDFS +  + A L  A+  K+ FTG
Sbjct: 64  ADFSGADLRGAVFNGSTLVHANLRGVDFSDGIAYISDFSDANLSDAVLSSAMLLKSRFTG 123


>gi|17230606|ref|NP_487154.1| hypothetical protein all3114 [Nostoc sp. PCC 7120]
 gi|17132208|dbj|BAB74813.1| all3114 [Nostoc sp. PCC 7120]
          Length = 576

 Score = 35.8 bits (81), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 28/51 (54%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           GI S A    ADL  A+ +  +F  AN  SA++  S+ SG+  NGA L  A
Sbjct: 475 GILSEADLTGADLSDAILLGTDFSFANLNSANLSGSNLSGAILNGADLSSA 525


>gi|381206177|ref|ZP_09913248.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 210

 Score = 35.8 bits (81), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 7/63 (11%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L + N  +A+ RG       A   SADLR+AV V      AN   ADMR+++   +   G
Sbjct: 140 LRETNLQKADLRG-------ADLRSADLREAVLVAAYLNEANLDGADMRKANLYRASMGG 192

Query: 151 AYL 153
           A L
Sbjct: 193 AIL 195


>gi|298249019|ref|ZP_06972823.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297547023|gb|EFH80890.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 153

 Score = 35.8 bits (81), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 25/46 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           + A    ADLR+A     N + AN T ADMRE+    +  +GA L+
Sbjct: 92  TGADVTDADLRQANFSFANLQAANLTRADMRETILDAANLDGAILD 137


>gi|257061674|ref|YP_003139562.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
 gi|256591840|gb|ACV02727.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
          Length = 167

 Score = 35.8 bits (81), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 10/67 (14%)

Query: 112 QFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYLEKAVAYKA 161
            F   DLR A+    N R +NF+          SA++  ++F G+   GA LE A   + 
Sbjct: 46  DFSGQDLRDALFDHANLRGSNFSHANLQGVRFFSANLEGANFEGADLRGADLESARLTRV 105

Query: 162 NFTGTLI 168
           NFT  L+
Sbjct: 106 NFTNALL 112


>gi|158337601|ref|YP_001518776.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158307842|gb|ABW29459.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 172

 Score = 35.8 bits (81), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 19/33 (57%)

Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           NFT AD+R  DF    F GA L  A+  KAN T
Sbjct: 41  NFTFADLRYEDFENKNFEGASLAGAILLKANLT 73


>gi|332709304|ref|ZP_08429266.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332351850|gb|EGJ31428.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 784

 Score = 35.8 bits (81), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/37 (43%), Positives = 22/37 (59%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +   AN   AD+   DF+ +K NGA L K++  KANF
Sbjct: 225 DLEEANLCEADLSRVDFTRTKLNGANLAKSILVKANF 261


>gi|390455029|ref|ZP_10240557.1| hypothetical protein PpeoK3_13479 [Paenibacillus peoriae KCTC 3763]
          Length = 425

 Score = 35.8 bits (81), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 30/55 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A F  A L+ A+    + ++A+F  AD+ ++ F G+   GA    A   +A+FTG
Sbjct: 326 ASFQHAQLQGAIFQDSSLQKASFIGADLTDACFVGADITGASFTDACLVRADFTG 380


>gi|119486435|ref|ZP_01620493.1| hypothetical protein L8106_00535 [Lyngbya sp. PCC 8106]
 gi|119456337|gb|EAW37468.1| hypothetical protein L8106_00535 [Lyngbya sp. PCC 8106]
          Length = 691

 Score = 35.4 bits (80), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 18/65 (27%), Positives = 31/65 (47%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +F    +    +ADLR    ++ +   AN  S D+R ++ +G+   GA L  A     N 
Sbjct: 615 QFANLKSVNLNNADLRGVNMIQAHLGGANLISVDLRAANLTGADLTGADLTNAKLGGVNL 674

Query: 164 TGTLI 168
           T T++
Sbjct: 675 TNTIL 679


>gi|434384803|ref|YP_007095414.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428015793|gb|AFY91887.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 237

 Score = 35.4 bits (80), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
           S A F  ADL +A+    +   ++   A +RE+D +G+K N     GAYL KA     N 
Sbjct: 53  SHADFRGADLSEALLWGTDLTESHLDRAILRETDLTGAKLNRAQLSGAYLAKASLCGVNL 112

Query: 164 TGTLIA 169
            G  +A
Sbjct: 113 AGANLA 118


>gi|423067569|ref|ZP_17056359.1| serine/threonine protein kinase with pentapeptide repeat protein
           [Arthrospira platensis C1]
 gi|406711143|gb|EKD06345.1| serine/threonine protein kinase with pentapeptide repeat protein
           [Arthrospira platensis C1]
          Length = 548

 Score = 35.4 bits (80), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A FG A L +A     N   A  ++A++ ++D  G+   GAYL +A    AN  G
Sbjct: 460 ANFGCARLTQAKLKNANLENAYLSTANLEKADLRGANLQGAYLTRANLRGANLCG 514


>gi|291568925|dbj|BAI91197.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 379

 Score = 35.4 bits (80), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-KFNGAYLEKAVAYKANFTGTLI 168
           + +F +ADL  A  +K +    NF+ AD+  + F  S +F+    +KA+    NF+G  +
Sbjct: 69  SVKFVNADLTNACCIKSDLNTINFSGADLTGAQFQASRRFSDIKRDKAILKNVNFSGIKM 128

Query: 169 ATEH 172
           + E+
Sbjct: 129 SGEN 132


>gi|221234905|ref|YP_002517341.1| hypothetical protein CCNA_01968 [Caulobacter crescentus NA1000]
 gi|220964077|gb|ACL95433.1| pentapeptide repeats containing protein [Caulobacter crescentus
           NA1000]
          Length = 289

 Score = 35.4 bits (80), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 33/58 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A+F +A+L  A     N R A+F++AD+  ++F G++F+GA    A    +N  G + 
Sbjct: 149 ARFANAELIAANLSGANARDADFSNADINHANFQGARFDGARFHNADMTGSNLRGGIF 206


>gi|443325445|ref|ZP_21054140.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794955|gb|ELS04347.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 429

 Score = 35.4 bits (80), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 25/38 (65%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           NF  A+ TSA ++++DFS S F GA LE+A    AN +
Sbjct: 271 NFECADLTSASLQKADFSKSNFKGAKLERANLIGANLS 308


>gi|427738964|ref|YP_007058508.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427374005|gb|AFY57961.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 187

 Score = 35.4 bits (80), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 27/50 (54%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A+L +A   K N R AN   AD+  ++ S +   GA L+ AV   AN +G
Sbjct: 46  ANLSQANLSKVNLRDANLRGADLTGTNLSNADLTGADLQNAVLIGANLSG 95


>gi|428319027|ref|YP_007116909.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428242707|gb|AFZ08493.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 520

 Score = 35.4 bits (80), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 9/74 (12%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADL K E          S     +AD+R+A   + N   AN + A+++ +D +G+  NGA
Sbjct: 171 ADLTKAEL---------SGVNLSNADMRQASLQQVNLSSANLSGANLKWADLTGANLNGA 221

Query: 152 YLEKAVAYKANFTG 165
            L  A    AN  G
Sbjct: 222 DLSCAKLSGANLHG 235


>gi|410464803|ref|ZP_11318199.1| putative low-complexity protein [Desulfovibrio magneticus str.
           Maddingley MBC34]
 gi|409982086|gb|EKO38579.1| putative low-complexity protein [Desulfovibrio magneticus str.
           Maddingley MBC34]
          Length = 408

 Score = 35.4 bits (80), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + F  A L KA     NF  ANF+ A++ E++FSG+    A +     YK N  G
Sbjct: 107 SNFAGAILAKANLTCSNFSEANFSRANLAEANFSGANLTKANMPHTNLYKTNLGG 161


>gi|359460928|ref|ZP_09249491.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 172

 Score = 35.4 bits (80), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 19/33 (57%)

Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           NFT AD+R  DF    F GA L  A+  KAN T
Sbjct: 41  NFTFADLRYEDFENKNFEGASLAGAILLKANLT 73


>gi|251793607|ref|YP_003008336.1| pentapeptide repeat protein [Aggregatibacter aphrophilus NJ8700]
 gi|416892174|ref|ZP_11923604.1| pentapeptide repeat protein [Aggregatibacter aphrophilus ATCC
           33389]
 gi|422337246|ref|ZP_16418217.1| hypothetical protein HMPREF9335_01405 [Aggregatibacter aphrophilus
           F0387]
 gi|247535003|gb|ACS98249.1| pentapeptide repeat protein [Aggregatibacter aphrophilus NJ8700]
 gi|347814938|gb|EGY31582.1| pentapeptide repeat protein [Aggregatibacter aphrophilus ATCC
           33389]
 gi|353345460|gb|EHB89752.1| hypothetical protein HMPREF9335_01405 [Aggregatibacter aphrophilus
           F0387]
          Length = 160

 Score = 35.4 bits (80), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 5/56 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           A  F +A L+     K+NF+  +FT+AD+R ++ S      A LE A+   ANF G
Sbjct: 82  AEDFINAKLQGVNFAKKNFKGKDFTNADLRNANLS-----HANLEDAILINANFAG 132


>gi|427419259|ref|ZP_18909442.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425761972|gb|EKV02825.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 447

 Score = 35.4 bits (80), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 26/57 (45%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A     DL  A     N + AN +SAD+ ++   G+   GA L +A     NF G
Sbjct: 377 SRANLAGTDLEDATLDNANLQEANLSSADLEDASLIGANLLGANLSEADLEDTNFCG 433


>gi|119510688|ref|ZP_01629816.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
 gi|119464642|gb|EAW45551.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
          Length = 152

 Score = 35.4 bits (80), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 26/57 (45%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A     +L  A     N + AN TSA +   +F  +  NGA L +A  Y AN  G
Sbjct: 64  SGANLEGVNLEGADLTNANLKGANLTSAMLTNVNFKAANLNGANLTRAQIYDANVYG 120


>gi|428769912|ref|YP_007161702.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanobacterium aponinum PCC 10605]
 gi|428684191|gb|AFZ53658.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanobacterium aponinum PCC 10605]
          Length = 506

 Score = 35.4 bits (80), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 29/57 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A F SADL  A  +  N  +A F+ +++   DF+G+    A    A    ANF+G
Sbjct: 415 SQANFYSADLTNANFINANLFQAYFSKSNLENVDFTGANLGSADFTNANVTNANFSG 471


>gi|75910285|ref|YP_324581.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704010|gb|ABA23686.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 189

 Score = 35.4 bits (80), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 28/52 (53%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
            A    A+L + +    N R A  T+A++ ESDFSG+   GA L +A  + A
Sbjct: 64  GANLEHANLSEVIFSGANLREATLTTANLNESDFSGAYLCGADLREASLHMA 115


>gi|77404498|ref|YP_345074.1| hypothetical protein pREC1_0013 [Rhodococcus erythropolis PR4]
 gi|77019879|dbj|BAE46254.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
          Length = 589

 Score = 35.4 bits (80), Expect = 8.0,   Method: Composition-based stats.
 Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 3/101 (2%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKEN 127
           F    L+ A +     +++ L + +  EA   G   IG+    A    ADL KA     +
Sbjct: 450 FSGANLSGADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDAD 509

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
              AN T A++ ++D SG+    A L   +  + + TG ++
Sbjct: 510 LYSANLTDANLSDADLSGATLTRAGLMGTILTRVDLTGAVL 550


>gi|338737168|ref|YP_004674130.1| Pentapeptide repeat protein [Hyphomicrobium sp. MC1]
 gi|337757731|emb|CCB63554.1| Pentapeptide repeat protein [Hyphomicrobium sp. MC1]
          Length = 276

 Score = 35.4 bits (80), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 32/64 (50%), Gaps = 5/64 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A+   A++ +AV     F  AN     FT AD+  +DFSG+   GA   +A    ANF
Sbjct: 194 SGARLEDANMTRAVLWFARFNGANLKGTDFTDADLSRADFSGADITGADFTRADLDGANF 253

Query: 164 TGTL 167
            G +
Sbjct: 254 VGAI 257


>gi|428220990|ref|YP_007105160.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994330|gb|AFY73025.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 165

 Score = 35.4 bits (80), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTG 165
           A F +A+L        N   A+FT AD+R S    ++ NG     A LE A  Y  +F G
Sbjct: 62  ANFRNANLAGVSLFGANMTAADFTGADLRYSTLDTARMNGANLTNAVLEGAFVYGTSFVG 121

Query: 166 TLI 168
           T+I
Sbjct: 122 TVI 124


>gi|428209239|ref|YP_007093592.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428011160|gb|AFY89723.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 165

 Score = 35.4 bits (80), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 52/122 (42%), Gaps = 21/122 (17%)

Query: 65  LKNWRVFVSTALAAAVV--------ASCSSNISALADLN----KYEAE--TRGEFGIGSA 110
           ++ +R F+  A+A  V         A+ S+ I A  D+      Y  +   R EF     
Sbjct: 1   MRGFRYFLGIAIALLVFVTLPLPAQAASSAAIRAYDDVQVQTKDYSGQNLVRAEFNNTKL 60

Query: 111 AQ--FGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
           A+  F SADLR AV      R+AN    D        SD S +  + A L  A+  ++NF
Sbjct: 61  AEANFSSADLRGAVFNSAVLRKANLHGVDFSYGIAYLSDLSAADLSDAILTSAMMLRSNF 120

Query: 164 TG 165
            G
Sbjct: 121 KG 122


>gi|427736183|ref|YP_007055727.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427371224|gb|AFY55180.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 247

 Score = 35.4 bits (80), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 5/62 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 163
           S A   SA L  A  +  NF +AN   A ++ +D +G+ F GA+LE      A    ANF
Sbjct: 52  SGANLSSAGLGSAQLIGANFSQANLKEAWLQSADLTGANFCGAFLEDANLGSADLADANF 111

Query: 164 TG 165
           +G
Sbjct: 112 SG 113


>gi|423453389|ref|ZP_17430242.1| hypothetical protein IEE_02133 [Bacillus cereus BAG5X1-1]
 gi|423469527|ref|ZP_17446271.1| hypothetical protein IEM_00833 [Bacillus cereus BAG6O-2]
 gi|401138182|gb|EJQ45755.1| hypothetical protein IEE_02133 [Bacillus cereus BAG5X1-1]
 gi|402439265|gb|EJV71273.1| hypothetical protein IEM_00833 [Bacillus cereus BAG6O-2]
          Length = 237

 Score = 35.4 bits (80), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 32/67 (47%), Gaps = 7/67 (10%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L D N  E + R          F +ADL  A+ +  NF  ANFT A +   D+ G+K  G
Sbjct: 170 LEDANLSEVDARN-------VDFTNADLTGAILLNGNFTDANFTGAKLDNVDWKGAKVEG 222

Query: 151 AYLEKAV 157
           A  ++ V
Sbjct: 223 AKFDENV 229


>gi|302383319|ref|YP_003819142.1| pentapeptide repeat protein [Brevundimonas subvibrioides ATCC
           15264]
 gi|302193947|gb|ADL01519.1| pentapeptide repeat protein [Brevundimonas subvibrioides ATCC
           15264]
          Length = 308

 Score = 35.4 bits (80), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 28/57 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           SA+ F  ADLR A     NF  ANFT A +  ++  GS FN A L  A    A   G
Sbjct: 95  SASDFSRADLRGAQIQGSNFSAANFTDAVLTGAEAQGSNFNRADLTNANLSGAELVG 151


>gi|392410087|ref|YP_006446694.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
 gi|390623223|gb|AFM24430.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
          Length = 490

 Score = 35.4 bits (80), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 109 SAAQFGSADLRKAV-----HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A F   DL + +      V  + +  +F  AD+  +DF+ + F+G  + +  A  AN 
Sbjct: 388 SKASFRDGDLTRVIALATIFVSADLQNTSFKDADVSAADFTNANFSGVTMSQVAAVNANL 447

Query: 164 TGT 166
           TGT
Sbjct: 448 TGT 450


>gi|186683195|ref|YP_001866391.1| pentapeptide repeat-containing serine/threonine kinase [Nostoc
           punctiforme PCC 73102]
 gi|186465647|gb|ACC81448.1| serine/threonine protein kinase with pentapeptide repeats [Nostoc
           punctiforme PCC 73102]
          Length = 534

 Score = 35.4 bits (80), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 25/53 (47%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           FG A L KA     N  +A F  AD+  +D  G+  + AYL  A    AN  G
Sbjct: 450 FGRASLSKANLKDANLTKAYFNHADLEGADLRGADLSNAYLSNANLRGANLCG 502


>gi|442322241|ref|YP_007362262.1| pentapeptide repeat-containing protein [Myxococcus stipitatus DSM
           14675]
 gi|441489883|gb|AGC46578.1| pentapeptide repeat-containing protein [Myxococcus stipitatus DSM
           14675]
          Length = 222

 Score = 35.4 bits (80), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 29/58 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S   F    LRK   +K   R +NF   D+ ++DF+GS   G+     V  +A+F+ T
Sbjct: 113 SYCSFVGLSLRKTPFLKCVARESNFYDLDLTDADFTGSDLGGSNFRGCVLLRADFSDT 170


>gi|425440291|ref|ZP_18820596.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
           9717]
 gi|389719320|emb|CCH96834.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
           9717]
          Length = 450

 Score = 35.4 bits (80), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 10/65 (15%)

Query: 111 AQFGSADLRKA----------VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
           A    ADLRKA          + ++ N R  N + A++R  + SG   +GA L  A    
Sbjct: 345 ANLSGADLRKANLSGANLWGAILIEANLRGVNLSGANLRGVNLSGVNLSGAILRGANLSG 404

Query: 161 ANFTG 165
           AN +G
Sbjct: 405 ANLSG 409


>gi|320156222|ref|YP_004188601.1| hypothetical protein VVMO6_01376 [Vibrio vulnificus MO6-24/O]
 gi|319931534|gb|ADV86398.1| hypothetical protein VVMO6_01376 [Vibrio vulnificus MO6-24/O]
          Length = 689

 Score = 35.4 bits (80), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 29/61 (47%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           S A   SAD + ++ V  NF +A+ T AD    DF+ +   GA L      +A  T + I
Sbjct: 607 SKASLDSADFKSSIFVNANFEKADLTQADFGGCDFTNANLQGAELSGCDLTQARLTSSNI 666

Query: 169 A 169
            
Sbjct: 667 T 667


>gi|16126134|ref|NP_420698.1| pentapeptide repeat-containing protein [Caulobacter crescentus
           CB15]
 gi|13423338|gb|AAK23866.1| pentapeptide repeat family protein [Caulobacter crescentus CB15]
          Length = 250

 Score = 35.4 bits (80), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 33/58 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A+F +A+L  A     N R A+F++AD+  ++F G++F+GA    A    +N  G + 
Sbjct: 110 ARFANAELIAANLSGANARDADFSNADINHANFQGARFDGARFHNADMTGSNLRGGIF 167


>gi|406987204|gb|EKE07615.1| hypothetical protein ACD_18C00027G0001, partial [uncultured
           bacterium]
          Length = 406

 Score = 35.4 bits (80), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 24/46 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           +   F ++DLR       N    NFT++D+R +DF G+ F G   E
Sbjct: 337 TLTNFTNSDLRNVNFRDANLTWTNFTNSDLRNADFRGASFTGTIFE 382


>gi|355340866|gb|AER58204.1| pentapeptide repeat-containing protein [uncultured Acidobacteria
           bacterium]
          Length = 306

 Score = 35.4 bits (80), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 28/57 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A  G++   +AV  + +FR A+ + +D   SD S     GA L  A    AN TG
Sbjct: 210 SRADLGASQFDRAVLTEADFRHADLSGSDFSRSDLSNVHLGGAKLFTADLRAANLTG 266


>gi|428298761|ref|YP_007137067.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428235305|gb|AFZ01095.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 169

 Score = 35.0 bits (79), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 26/111 (23%), Positives = 51/111 (45%), Gaps = 6/111 (5%)

Query: 64  KLKN--WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG----SAAQFGSAD 117
           KL N  WR+ +S  L   +    +  ++ +A   +Y  E   +        S + F  A+
Sbjct: 4   KLSNNFWRIVLSALLGTVIWMISTWGLTPIAFALEYNKEILIQSDFSGRDLSDSSFTKAN 63

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           L+++     N R  +F +A++   D +G+  + + L+ A   KAN T  ++
Sbjct: 64  LKQSNFSNTNLRGVSFFAANLESVDLTGADLSNSTLDSARLVKANLTNAIL 114


>gi|428768931|ref|YP_007160721.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428683210|gb|AFZ52677.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 320

 Score = 35.0 bits (79), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 27/55 (49%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           + F  A+L+     K NF  ANFT A++  +D SG    GA   +A    AN  G
Sbjct: 115 SDFSYANLQNCKLTKANFMGANFTRANLSGADLSGVNLTGADFTRADLSGANLQG 169


>gi|166364719|ref|YP_001656992.1| hypothetical protein MAE_19780 [Microcystis aeruginosa NIES-843]
 gi|166087092|dbj|BAG01800.1| hypothetical protein MAE_19780 [Microcystis aeruginosa NIES-843]
          Length = 354

 Score = 35.0 bits (79), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 15/112 (13%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRG---------EFGIGSAAQF 113
           A+L   R F    L AA ++  S  ++     N Y+A  RG         E   GS A F
Sbjct: 219 AELDPLRDFTGANLLAAELSGISLGMA-----NLYQANLRGANLTDADLSEIN-GSHASF 272

Query: 114 GSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
             ADL  A+    +   A+F  + +  ++  GS   GA L +    +ANF+G
Sbjct: 273 RGADLSGALLANADLSYADFYRSSLALANLIGSNLEGANLVEVNITQANFSG 324


>gi|443326153|ref|ZP_21054816.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794217|gb|ELS03641.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 293

 Score = 35.0 bits (79), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 22/124 (17%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG-------------S 109
           A+ + W+V      AA  + +  + I AL DLN      +G F  G             S
Sbjct: 106 AQYEAWQVVD----AAHGLKTSYARIQALQDLNGDNVTMKGLFASGADLRNIDLHGADLS 161

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FT 164
            A F  ADLR A     N   AN  +AD    + + ++ +G+ L +A   +AN     F 
Sbjct: 162 NADFQDADLRGANLSNTNLSNANLANADFSNVNLANARLSGSDLSEANFVEANLNNVDFV 221

Query: 165 GTLI 168
           G +I
Sbjct: 222 GAII 225


>gi|418055242|ref|ZP_12693297.1| pentapeptide repeat protein [Hyphomicrobium denitrificans 1NES1]
 gi|353210824|gb|EHB76225.1| pentapeptide repeat protein [Hyphomicrobium denitrificans 1NES1]
          Length = 279

 Score = 35.0 bits (79), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 30/57 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           S A+   A +++ V     F  A+ T AD  ++D S + F GA +  AV  +ANF G
Sbjct: 197 SGARLHGAHMQRTVMWFAKFTGADLTGADFTDADLSRADFAGADISGAVFTRANFEG 253


>gi|428218257|ref|YP_007102722.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427990039|gb|AFY70294.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 191

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 13/162 (8%)

Query: 10  SIKSLNFCSSSSKGPYQLHA-LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNW 68
           S+++L+ C ++ +G     A LSK       +     S G    C  +      A LK  
Sbjct: 25  SLQNLDLCGANLRGAKLAGANLSKVDLSGSDLIGTDLSRGNLSSCDLSAACLRGANLKEA 84

Query: 69  RVFVSTALAAAVVASCSSNISAL--ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
               +  L++A +   S N ++L  ADL      +  + G+   A+  +ADL +A     
Sbjct: 85  N-LANADLSSADLRGASLNRASLCKADL------SAADLGV---AELINADLSEANFKGA 134

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           + R ANFT A   ++D  GS  + A L  A    A+F G ++
Sbjct: 135 DLRGANFTGAIFHKTDLRGSDLHEAILTDADLAGADFRGAIM 176


>gi|427713585|ref|YP_007062209.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
 gi|427377714|gb|AFY61666.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
          Length = 178

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 27/108 (25%), Positives = 45/108 (41%), Gaps = 18/108 (16%)

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
           PYA+L+            A++   +   + L D   Y+A         +AA    A+L +
Sbjct: 51  PYAQLQR-----------AILIKANFTEAELGDTQLYQANL-------TAANLAGANLSR 92

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLI 168
           A     N + AN   AD+  +D  G+   GA L  A+  +AN    ++
Sbjct: 93  ANLRAANLQGANLQGADLTRADLRGANLAGADLRDAILSRANLDAAVL 140


>gi|86605838|ref|YP_474601.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86554380|gb|ABC99338.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 158

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 41/96 (42%), Gaps = 7/96 (7%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
            V   L  A +   + +   L+ +N  EA+ RG       A   SA+L  A     N   
Sbjct: 31  LVRATLQGANLRGANLSFGKLSGINLQEADLRG-------ADLSSANLMGANLRGANLWE 83

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           AN   AD+  +D   +  +GAYL +A   +A   G+
Sbjct: 84  ANLIGADLSFADLREANLHGAYLWEAKLTRAQLQGS 119


>gi|293393781|ref|ZP_06638088.1| pentapeptide repeat-containing protein [Serratia odorifera DSM
           4582]
 gi|291423608|gb|EFE96830.1| pentapeptide repeat-containing protein [Serratia odorifera DSM
           4582]
          Length = 353

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 33/142 (23%), Positives = 64/142 (45%), Gaps = 9/142 (6%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           L K ++  C + +   +D +   C+    A P A+     +     +  + ++  +   +
Sbjct: 177 LHKTVFQHCDLQAALFNDARLESCNWVASALPQAQFNGATLLTCAVVMDSDLSGANFGNA 236

Query: 90  ALADLNKYEAETRG-EFGIGSAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDF 143
            L + N  +A   G +F   S A+  ++DL +A   +  F RAN     F   D R+++F
Sbjct: 237 TLIESNLRQASLAGADF---SLAKLENSDLSEANCQRARFIRANLVGSLFIRTDFRQANF 293

Query: 144 SGSKFNGAYLEKAVAYKANFTG 165
           + +   GA L+K     A+F+G
Sbjct: 294 TNANLMGALLQKTQLGGADFSG 315


>gi|295690280|ref|YP_003593973.1| pentapeptide repeat-containing protein [Caulobacter segnis ATCC
           21756]
 gi|295432183|gb|ADG11355.1| pentapeptide repeat protein [Caulobacter segnis ATCC 21756]
          Length = 422

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 29/55 (52%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           + + A F  A L+ A  ++ N ++ANF  A++  +D SG+   GA L   V   A
Sbjct: 169 VATKADFSDAILKDAKLIRANLKQANFNGANLAGADLSGANLAGADLRNTVLVGA 223


>gi|110597243|ref|ZP_01385531.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
 gi|110341079|gb|EAT59547.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
          Length = 447

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 29/58 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGT 166
           S + F SA L +A     N  + NF  ADM+ +   G+   GA L++A    A+ + T
Sbjct: 304 SGSSFKSASLDEANLAGANLSKVNFHKADMKGAHLQGANLQGANLDRAFLKDADLSNT 361


>gi|428218014|ref|YP_007102479.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427989796|gb|AFY70051.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 253

 Score = 35.0 bits (79), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 30/55 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     NF+ AN  +A++R S+  G+  + A LEKA  + AN 
Sbjct: 150 SEAVLNKADLRGANLSCTNFQGANLRAANLRHSNLQGANLSYADLEKADLHGANL 204


>gi|298492040|ref|YP_003722217.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298233958|gb|ADI65094.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 167

 Score = 35.0 bits (79), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 10/68 (14%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
           A F   DLR +   K N R++NFT A++R           ++  G+    A L+ A   +
Sbjct: 45  ADFSRRDLRDSSFTKANLRQSNFTGANLRGVSFFAANLESANLEGADLTNATLDSARLIR 104

Query: 161 ANFTGTLI 168
           AN T  ++
Sbjct: 105 ANLTNAVL 112


>gi|423066191|ref|ZP_17054981.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|406712233|gb|EKD07422.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 264

 Score = 35.0 bits (79), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 5/58 (8%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGTLIATEH 172
            A+LR+      NF +AN  + ++ +++ S + F+ A L  A+ Y+A F GT +   H
Sbjct: 128 GANLRRG-----NFTQANLAAVNLNQANLSHANFHEAVLINAIGYQAYFYGTNLVNSH 180


>gi|392384479|ref|YP_005033675.1| putative Pentapeptide repeat family protein [Azospirillum
           brasilense Sp245]
 gi|356881194|emb|CCD02176.1| putative Pentapeptide repeat family protein [Azospirillum
           brasilense Sp245]
          Length = 428

 Score = 35.0 bits (79), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 21/35 (60%)

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
           AN + AD+R +DFS +K  GA L  AV   A F G
Sbjct: 180 ANLSGADLRGADFSMAKLKGAILNNAVVAGATFQG 214


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.125    0.363 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,489,536,311
Number of Sequences: 23463169
Number of extensions: 87488778
Number of successful extensions: 224763
Number of sequences better than 100.0: 726
Number of HSP's better than 100.0 without gapping: 433
Number of HSP's successfully gapped in prelim test: 293
Number of HSP's that attempted gapping in prelim test: 218456
Number of HSP's gapped (non-prelim): 5798
length of query: 172
length of database: 8,064,228,071
effective HSP length: 132
effective length of query: 40
effective length of database: 9,262,057,059
effective search space: 370482282360
effective search space used: 370482282360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)