BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023545
         (281 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
 gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
          Length = 280

 Score =  433 bits (1114), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 223/282 (79%), Positives = 239/282 (84%), Gaps = 3/282 (1%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA +SISPLSIKS+N   SSS+ PY L + SKP  + CQ++  TE + +  DCS  +   
Sbjct: 1   MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLA--TEREDRILDCSTTRYKV 58

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
            ++K KNWR  VSTALAAA   +    + A ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 59  HHSKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRK 118

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 119 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 178

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTN ITGVSTRKSL
Sbjct: 179 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNSITGVSTRKSL 238

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCD  TGLCDAK
Sbjct: 239 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280


>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
 gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
          Length = 275

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 214/282 (75%), Positives = 236/282 (83%), Gaps = 8/282 (2%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA +SIS +SIKS N  +     P+++ +LSKP  +A Q+   TE   QF DCS N    
Sbjct: 1   MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQL--DTERGNQFADCSKNGYEV 53

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             AK KNW   VST L AA ++  S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54  ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVH+ ENFR ANFT+ADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 173

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           +NLTNAVLVR+VLTRSDLGGA+I GADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 174 SNLTNAVLVRSVLTRSDLGGALIAGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 233

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGNSRRNAYG+PSSPLLSAPPQKLLDRDGFCD GTGLCDAK
Sbjct: 234 GCGNSRRNAYGTPSSPLLSAPPQKLLDRDGFCDQGTGLCDAK 275


>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
          Length = 261

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 217/282 (76%), Positives = 230/282 (81%), Gaps = 22/282 (7%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L + SKP  V C+I  +    G +  C  N    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 43  --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 159

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNAVL RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 160 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 219

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGNSRR+AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 220 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 261


>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
           vinifera]
          Length = 596

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 217/282 (76%), Positives = 230/282 (81%), Gaps = 22/282 (7%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L + SKP  V C+I  +    G +  C  N    
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 377

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 378 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 435 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 494

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNAVL RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 495 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 554

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGNSRR+AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 555 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 596



 Score =  186 bits (473), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 109/179 (60%), Positives = 121/179 (67%), Gaps = 20/179 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L +LSKP  V C+I  + E         NN    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43  ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101

Query: 121 AVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           AVHV ENF RANFTSADMRESDFSGS FNG YLEKAVAYKA+ TG D       +MVL+
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTGPDAPHARPYKMVLH 160


>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus]
 gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus]
          Length = 279

 Score =  399 bits (1025), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 205/281 (72%), Positives = 229/281 (81%), Gaps = 5/281 (1%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSSIS LS+K L   SS S+ P  L    K + +  QI+ + +   Q  DCS  +  G
Sbjct: 1   MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKD---QTQDCSERKHIG 56

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
              + K W+  VSTALAAA V   SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRK
Sbjct: 57  KITEPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRK 116

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVH+ ENFR ANFTSADMRESDFSG  FNGAYLEKAVAYK NF+GADLSDTLMDRMVLNE
Sbjct: 117 AVHINENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNE 176

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           AN TNAVLVR+VLTRSDLGGAII GADFSDAVIDL QKQALCKYA+GTNP+TGVSTR SL
Sbjct: 177 ANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASL 236

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 280
           GCGNSRRNAYG+PSSPLLSAPPQ+LLDRDGFCD  TGLC+A
Sbjct: 237 GCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQDTGLCEA 277


>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
          Length = 273

 Score =  386 bits (991), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 207/287 (72%), Positives = 230/287 (80%), Gaps = 25/287 (8%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPDCS 54
           MAL+S+SPLSI ++N    SS+   +L      H  S P+ V CQ++S  +     P  S
Sbjct: 2   MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRD----HPQES 55

Query: 55  NNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
                      K W   VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFG
Sbjct: 56  -----------KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFG 103

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADLRKAVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMD
Sbjct: 104 SADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMD 163

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
           RMVLNEANLTNA+LVRTVLTRSDLGG+IIEGADFSDAV+DL QK ALCKYA+GTNP+TGV
Sbjct: 164 RMVLNEANLTNAILVRTVLTRSDLGGSIIEGADFSDAVLDLTQKLALCKYASGTNPVTGV 223

Query: 234 STRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 280
           STR SLGCGN RRNAYG+PSSPLLSAPPQKLL+RDGFCD  TGLCD+
Sbjct: 224 STRVSLGCGNKRRNAYGTPSSPLLSAPPQKLLNRDGFCDEATGLCDS 270


>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Glycine max]
          Length = 260

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 206/282 (73%), Positives = 227/282 (80%), Gaps = 23/282 (8%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S+SPLSI SL+  SSS+      H+ S P+ V    ++++                
Sbjct: 1   MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES---------------- 44

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                  W   VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45  -----TKWGKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 98

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR ANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNE
Sbjct: 99  AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 158

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNA+L+RTVLTRSDLGGAIIEGADFSDAV+DL QKQALCKYA+GTNP+TGVSTR SL
Sbjct: 159 ANLTNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKQALCKYASGTNPVTGVSTRVSL 218

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGN RRNAYGSPSSPLLSAPPQKLLDRDGFCD  TGLCDAK
Sbjct: 219 GCGNKRRNAYGSPSSPLLSAPPQKLLDRDGFCDDATGLCDAK 260


>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 262

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 198/282 (70%), Positives = 220/282 (78%), Gaps = 21/282 (7%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S +PLSI S +            +  S  +  + Q+  K   +   P  SN     
Sbjct: 1   MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47  -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
            VHV ENFR ANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNE
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNE 160

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNA+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SL
Sbjct: 161 ANLTNAILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSL 220

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGN RRNAYG+PSSPLLSAPPQKLLDRDGFCD  +GLCD+K
Sbjct: 221 GCGNKRRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 262


>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 232

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 183/217 (84%), Positives = 197/217 (90%), Gaps = 2/217 (0%)

Query: 66  KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV 
Sbjct: 17  KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75

Query: 126 ENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           ENFR ANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNEANLTN
Sbjct: 76  ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNEANLTN 135

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 244
           A+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SLGCGN 
Sbjct: 136 AILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSLGCGNK 195

Query: 245 RRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           RRNAYG+PSSPLLSAPPQKLLDRDGFCD  +GLCD+K
Sbjct: 196 RRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 232


>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
          Length = 291

 Score =  369 bits (947), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 186/242 (76%), Positives = 205/242 (84%), Gaps = 7/242 (2%)

Query: 40  ISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA 99
           I+ K  +D    D    Q A    + KNW+  ++ ALA  V+ +    ++A ADLNKYEA
Sbjct: 52  ITGKISTDQHKKDA---QPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEA 105

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           ETRGEFGIGSAAQFGSA+LRK VH  ENFR ANFTSAD+RESDFSGS FNGAYLEKAVAY
Sbjct: 106 ETRGEFGIGSAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAY 165

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           K NFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVID  QKQ
Sbjct: 166 KTNFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDFTQKQ 225

Query: 219 ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLC 278
           ALCKYA+GTNPITG+STRKSLGCGNSRRNAYG+PS+PLLSAPP+KLLD+DGFCDS TGLC
Sbjct: 226 ALCKYASGTNPITGISTRKSLGCGNSRRNAYGTPSAPLLSAPPEKLLDKDGFCDSSTGLC 285

Query: 279 DA 280
           DA
Sbjct: 286 DA 287


>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
 gi|194694816|gb|ACF81492.1| unknown [Zea mays]
 gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
 gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
          Length = 268

 Score =  364 bits (934), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 177/194 (91%), Positives = 186/194 (95%), Gaps = 1/194 (0%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
           + A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFR ANFTSADMRESDFSGS 
Sbjct: 74  MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
           FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 134 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 193

Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
           FSDAVIDL+QKQALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK+LD
Sbjct: 194 FSDAVIDLSQKQALCKYASGTNPMTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKILD 253

Query: 267 RDGFCDSGTGLCDA 280
           RDGFCD  TG+CDA
Sbjct: 254 RDGFCDPATGMCDA 267


>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
          Length = 280

 Score =  361 bits (927), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 203/285 (71%), Positives = 230/285 (80%), Gaps = 9/285 (3%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
           MA SS+SPL +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN +   
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58

Query: 58  CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           C+   A+   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59  CSS--AESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115

Query: 118 LRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           L K VH  ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMV
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 175

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
           LNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TR
Sbjct: 176 LNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTR 235

Query: 237 KSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           KSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD  TGLCD K
Sbjct: 236 KSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280


>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
           Flags: Precursor
 gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
 gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 280

 Score =  361 bits (927), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 203/285 (71%), Positives = 230/285 (80%), Gaps = 9/285 (3%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
           MA SS+SPL +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN +   
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58

Query: 58  CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           C+   A+   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59  CSS--AESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115

Query: 118 LRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           L K VH  ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMV
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 175

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
           LNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TR
Sbjct: 176 LNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTR 235

Query: 237 KSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           KSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD  TGLCD K
Sbjct: 236 KSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280


>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 280

 Score =  361 bits (927), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 207/288 (71%), Positives = 235/288 (81%), Gaps = 15/288 (5%)

Query: 1   MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ 57
           MA SS+SPL +KSL+   SSS     PY  H    PL    Q+SS++ S  +  D SN +
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNS--EIKDSSNAR 55

Query: 58  ---CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
              C+   ++   W+  +S A+AAAV+AS SS++ A+A+LN++EA+TRGEFGIGSAAQ+G
Sbjct: 56  EGCCS--RSESNTWKRILSAAMAAAVIAS-SSSVPAMAELNRFEADTRGEFGIGSAAQYG 112

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADL K +H  ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMD
Sbjct: 113 SADLSKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMD 172

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
           RMVLNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYANGTNP+TGV
Sbjct: 173 RMVLNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYANGTNPLTGV 232

Query: 234 STRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
            TRKSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD  TGLCDAK
Sbjct: 233 DTRKSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDAK 280


>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
          Length = 276

 Score =  360 bits (924), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 201/282 (71%), Positives = 223/282 (79%), Gaps = 7/282 (2%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL + SPL+  +   C+  +    +   L +   V+CQ +     DG     S +  A 
Sbjct: 1   MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGN--SLSTSAAAA 58

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             +    WR  VS ALAAA+V++      A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 59  AASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 114

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR ANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNE
Sbjct: 115 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNE 174

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSL
Sbjct: 175 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSL 234

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD  TG+CDAK
Sbjct: 235 GCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 276


>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
 gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
          Length = 270

 Score =  360 bits (923), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 177/194 (91%), Positives = 184/194 (94%), Gaps = 1/194 (0%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
           + A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFR ANFTSADMRESDFSGS 
Sbjct: 76  MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
           FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 136 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 195

Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
           FSDAVIDL QKQALCKYA+GTN ITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD
Sbjct: 196 FSDAVIDLPQKQALCKYASGTNSITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 255

Query: 267 RDGFCDSGTGLCDA 280
           RDGFCD  TG+C+A
Sbjct: 256 RDGFCDPATGMCEA 269


>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
 gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
 gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
 gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
 gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 277

 Score =  358 bits (918), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 201/282 (71%), Positives = 222/282 (78%), Gaps = 6/282 (2%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL + SPL+  +   C+  +    +   L +   V+CQ +     DG     S    A 
Sbjct: 1   MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGNSLSTSAAAAAA 60

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                + WR  VS ALAAA+V++      A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 61  ASPPPR-WRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 115

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR ANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNE
Sbjct: 116 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNE 175

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSL
Sbjct: 176 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSL 235

Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           GCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD  TG+CDAK
Sbjct: 236 GCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 277


>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 206

 Score =  357 bits (917), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 177/207 (85%), Positives = 190/207 (91%), Gaps = 2/207 (0%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTS 134
           +AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFR ANFTS
Sbjct: 1   MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct: 60  ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 119

Query: 195 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 254
           SDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSS
Sbjct: 120 SDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSS 179

Query: 255 PLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           PLLSAPPQ+LL RDGFCD  TGLCD K
Sbjct: 180 PLLSAPPQRLLGRDGFCDEKTGLCDVK 206


>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Brachypodium distachyon]
          Length = 268

 Score =  353 bits (906), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 172/195 (88%), Positives = 181/195 (92%), Gaps = 1/195 (0%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
           + A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV ENFR ANFTSADMRESDFSGS 
Sbjct: 74  MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNENFRRANFTSADMRESDFSGST 133

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
           FNGAY+EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL RTVLTRSDLGGA IEGAD
Sbjct: 134 FNGAYMEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLARTVLTRSDLGGATIEGAD 193

Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
           FSDAV+DL QK ALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPP KLLD
Sbjct: 194 FSDAVLDLQQKLALCKYASGTNPVTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPPKLLD 253

Query: 267 RDGFCDSGTGLCDAK 281
           RDGFCD  TG+CDAK
Sbjct: 254 RDGFCDEATGMCDAK 268


>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 267

 Score =  348 bits (892), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 202/284 (71%), Positives = 221/284 (77%), Gaps = 20/284 (7%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPDCSNNQC 58
           MAL+S SPL+        +  K P  L    S+ L  ++CQ ++     G   + SN   
Sbjct: 1   MALASTSPLAA-----TVARPKAPASLTRCRSRRLQRISCQATTDRSGGG---NASNTSP 52

Query: 59  AGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
           A P      WRV VS ALAAAVV +    + A ADLNKYEA+ RGEFGIGSAAQFG+ADL
Sbjct: 53  APP-----RWRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADL 103

Query: 119 RKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           +  VHV ENFR ANFTSADMRESDFSGS FNGAY+EKAVA++ANFTGADLSDTLMDRMVL
Sbjct: 104 KNTVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLMDRMVL 163

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           NEANLTNAVL RTVLTRSDLGGA IEGADFSDAVIDL QK ALCKYA+GTNPITGVSTRK
Sbjct: 164 NEANLTNAVLSRTVLTRSDLGGATIEGADFSDAVIDLPQKLALCKYASGTNPITGVSTRK 223

Query: 238 SLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
           SLGCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD  +GLCDAK
Sbjct: 224 SLGCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEASGLCDAK 267


>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
          Length = 293

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 192/301 (63%), Positives = 215/301 (71%), Gaps = 38/301 (12%)

Query: 11  IKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV 70
           +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN       A+   W+ 
Sbjct: 1   MKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTS-----AESNTWKR 53

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
            +S A  AA V + SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFR 
Sbjct: 54  ILSAA-MAAAVIASSSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRR 112

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR
Sbjct: 113 ANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVR 172

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------------------AL 220
           +VLTRSDLGGA IEGADFSDAVIDL QKQ                             AL
Sbjct: 173 SVLTRSDLGGAKIEGADFSDAVIDLLQKQVTTTHHYIYPSFRSTIKKYFTNGFHNVLKAL 232

Query: 221 CKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 280
           CKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD  TGLCD 
Sbjct: 233 CKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDV 292

Query: 281 K 281
           K
Sbjct: 293 K 293


>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
 gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
          Length = 219

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 160/205 (78%), Positives = 183/205 (89%), Gaps = 5/205 (2%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF--RANFT 133
           LAA V+A+    ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+  H  ENF  RANFT
Sbjct: 14  LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70

Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
           SADMRE+DFSGS FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLT
Sbjct: 71  SADMREADFSGSTFNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLT 130

Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPS 253
           RSDLGGA IEGADFSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS
Sbjct: 131 RSDLGGAKIEGADFSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPS 190

Query: 254 SPLLSAPPQKLLDRDGFCDSGTGLC 278
           +P+LSAPP++LLD+DGFCD  TG C
Sbjct: 191 APILSAPPERLLDKDGFCDDATGKC 215


>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
 gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
          Length = 196

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 155/192 (80%), Positives = 176/192 (91%), Gaps = 1/192 (0%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
           ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+  H  ENFR ANFTSADMRE+DFSGS 
Sbjct: 1   MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
           FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLTRSDLGGA IEGAD
Sbjct: 61  FNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLTRSDLGGAKIEGAD 120

Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
           FSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS+P+LSAPP++LLD
Sbjct: 121 FSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPSAPILSAPPERLLD 180

Query: 267 RDGFCDSGTGLC 278
           +DGFCD  TG C
Sbjct: 181 KDGFCDDATGKC 192


>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 225

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 153/193 (79%), Positives = 170/193 (88%), Gaps = 3/193 (1%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN 148
           +LADLN  EA TRGEFGIGSA QFGSADL+K  H  ENFR  NFTSADM+E++FS S FN
Sbjct: 28  SLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSNSTFN 87

Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           GAYLEKAVAY+ NF+GADLSDTLMDRMVLNEANL+NA+LVR VLTRSDLG AIIEGADFS
Sbjct: 88  GAYLEKAVAYRTNFSGADLSDTLMDRMVLNEANLSNALLVRAVLTRSDLGSAIIEGADFS 147

Query: 209 DAVIDLAQKQ--ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
           DAV+DL QKQ  ALCKYA+GTNP+TG+STRKSLGCGN+RRNAYGSPSSP LSAPP  LLD
Sbjct: 148 DAVLDLTQKQAFALCKYASGTNPVTGMSTRKSLGCGNARRNAYGSPSSPELSAPPPILLD 207

Query: 267 RDGFCDSGTGLCD 279
           ++GFCD+ TG CD
Sbjct: 208 KNGFCDNSTGKCD 220


>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
           At1g12250, chloroplastic-like [Glycine max]
          Length = 222

 Score =  290 bits (741), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 166/251 (66%), Positives = 186/251 (74%), Gaps = 30/251 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S SPLS+ SL+  S SS    +  + S P  V CQ +S  +               
Sbjct: 1   MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-------------- 44

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                +   V VS  LAAA++A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45  -----RQGNV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 97

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AVHV ENFR +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF G DLSDTL DRMVLNE
Sbjct: 98  AVHVNENFRXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANFPGVDLSDTLTDRMVLNE 157

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           ANL+NA+L+RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKY      +T VSTR SL
Sbjct: 158 ANLSNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKHALCKY------VTRVSTRVSL 211

Query: 240 GCGNSRRNAYG 250
           GCGN RRNAYG
Sbjct: 212 GCGNKRRNAYG 222


>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
 gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
          Length = 239

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 100/167 (59%), Positives = 119/167 (71%), Gaps = 1/167 (0%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN 148
           ALADLN YEA T GEFGIGSA Q+G AD++      ++ R +NFTSAD R + F GS   
Sbjct: 51  ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110

Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           GAY  KAV Y+ NF  A+LSD LMDR  + EANL NA+L RTV TRSDL  A+IEGADF+
Sbjct: 111 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVIEGADFT 170

Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
           +A++D  Q  ALCKYA+GTNP+TG  TRKSLGCG  RR     PS+P
Sbjct: 171 NALLDKTQVMALCKYASGTNPVTGADTRKSLGCGGKRRYQASYPSNP 217


>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
           nagariensis]
 gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
           nagariensis]
          Length = 214

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 99/167 (59%), Positives = 120/167 (71%), Gaps = 1/167 (0%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN 148
           A ADLN YEAE  GEFGIGSA Q+G AD++      ++ R +NFTSAD R ++F GS   
Sbjct: 26  AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85

Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           GAY  KAV Y+ NF  A+LSD LMDR  + EANL NAVL R V TRSDL  A++EGADF+
Sbjct: 86  GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLRNAVLQRAVFTRSDLKDAVVEGADFT 145

Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
           +A++D  Q  ALCKYA+G NP+TGVSTRKSLGCG+ RR     PS+P
Sbjct: 146 NALLDKTQVMALCKYADGVNPVTGVSTRKSLGCGSQRRYKASYPSNP 192


>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
          Length = 199

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 115/176 (65%), Positives = 133/176 (75%), Gaps = 18/176 (10%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S+SPLSI SL+  SSS+      H+ S P+ V CQI+S  +         + Q + 
Sbjct: 2   MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRD---------HRQEST 51

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
            + K+      VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 52  KWGKV------VSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 104

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           AVHV ENFR ANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRM
Sbjct: 105 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRM 160


>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
          Length = 217

 Score =  190 bits (482), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 103/185 (55%), Positives = 126/185 (68%), Gaps = 3/185 (1%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLR-KAVHVKENFRANFTSADMRESDFSGSKFN 148
           A+ADLNKYEA   GEFG G+A Q+G ADL+ +  H ++  R+NFT+AD R  +F  S   
Sbjct: 29  AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88

Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           GAY  K+V  KANF  A+LSD LMDR VLNEANL NA   R VLTRSDLGGA I G DF+
Sbjct: 89  GAYFIKSVVPKANFENANLSDVLMDRAVLNEANLRNANFQRAVLTRSDLGGADINGTDFT 148

Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 268
           +A++D  Q+ ALC+YA+GTN  TGV TRKSLGCG+ RR    SPS+P    P    +D+ 
Sbjct: 149 NALLDKTQQIALCRYADGTNTETGVETRKSLGCGSRRRFRESSPSNP--EGPQVADVDKK 206

Query: 269 GFCDS 273
            F  S
Sbjct: 207 AFVKS 211


>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
          Length = 201

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 106/169 (62%), Positives = 116/169 (68%), Gaps = 20/169 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L +LSKP  V C+I  + E         NN    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43  ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101

Query: 121 AVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           AVHV ENF RANFTSADMRESDFSGS FNG YLEKAVAYKA+ T A  S
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTDAQSS 150


>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 277

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 104/198 (52%), Positives = 130/198 (65%), Gaps = 9/198 (4%)

Query: 86  SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFR-ANFTSADMRESD 141
           S+ +A A+LN  EA   GEF  GSA QFG  DLR    V +   + R +NFT A+MR + 
Sbjct: 81  SSPAAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAK 140

Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
             G+   GAYL KAVA++A+F GA+LSD LMDR VLN AN  +A+L R VLT SDLG A 
Sbjct: 141 LRGANLTGAYLMKAVAFEADFEGANLSDALMDRAVLNSANFRDAILTRVVLTSSDLGDAK 200

Query: 202 IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPL---LS 258
           I+GADFSDA+ID +Q+Q LC+YA+GTN +TGVSTR+SL CG   R +  SPS  +    S
Sbjct: 201 IDGADFSDALIDKSQQQKLCQYASGTNSVTGVSTRRSLNCGGGVRTS--SPSRYMTDETS 258

Query: 259 APPQKLLDRDGFCDSGTG 276
           A P+   D   F   GTG
Sbjct: 259 AKPEAAFDASRFSAYGTG 276


>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
          Length = 259

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 90/167 (53%), Positives = 118/167 (70%), Gaps = 1/167 (0%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN 148
           A A+LNKYE    GEF +G+A Q+G AD++      ++  R+NFT+AD R+++F  SK  
Sbjct: 71  ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130

Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            AY  K+V  +AN   ADLSD LMDR V+ +ANL  AVL R +LTRSDL  + I GADF+
Sbjct: 131 AAYFMKSVLARANLENADLSDALMDRAVIVDANLRGAVLQRAILTRSDLDRSDIYGADFT 190

Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
           +A++D  Q+ ALCKYA+G NP+TGVSTRKSL CG+SRR    SPS+P
Sbjct: 191 NALVDKTQQMALCKYADGVNPMTGVSTRKSLNCGSSRRFKASSPSNP 237


>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
          Length = 231

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 105/197 (53%), Positives = 121/197 (61%), Gaps = 7/197 (3%)

Query: 72  VSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF--- 128
           +S A A  V     S   A+A+LN  EA   GEF  GSA QFG  DLR A +V E +   
Sbjct: 21  LSVATAMIVSGIIPSPPFAVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTD 79

Query: 129 --RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
              +NFT A+MR+S   G+K NGAYL KAVA  A+FT ADLSD LMDR V   AN TNA+
Sbjct: 80  LRLSNFTGAEMRDSKLVGAKLNGAYLMKAVAANADFTDADLSDALMDRGVFVNANFTNAI 139

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 246
           L R VLT SDL GA I  ADFSDA++D   +  LCK A GTNP TGV+TRKSL C   R 
Sbjct: 140 LARVVLTSSDLNGANITNADFSDALLDNTMQMKLCKIATGTNPTTGVNTRKSLNCTGGRG 199

Query: 247 NAYGSPSSPLLSAPPQK 263
           N  GSPS  +     QK
Sbjct: 200 NV-GSPSRYMTEEDAQK 215


>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
 gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
          Length = 247

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 97/184 (52%), Positives = 116/184 (63%), Gaps = 7/184 (3%)

Query: 66  KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           K   V  S ALA A   S +    A A+LN+ EA   GEF  GSA QFG  DL K    K
Sbjct: 34  KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90

Query: 126 E---NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
           E   + R +NFT ADMR +   G+   GAY+ K VA + +FTGAD+SD LMDR VL  AN
Sbjct: 91  EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRSVLVGAN 150

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            T+AVL R VLT SD+  AIIE ADF+DA++D   +QALCK A+G NP TGV+TR SLGC
Sbjct: 151 FTDAVLNRVVLTSSDMKDAIIENADFTDALLDPKTQQALCKTASGKNPETGVATRVSLGC 210

Query: 242 GNSR 245
              R
Sbjct: 211 SGGR 214


>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 147

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 95/165 (57%), Positives = 110/165 (66%), Gaps = 21/165 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S +PLSI S +            +  S  +  + Q+  K   +   P  SN     
Sbjct: 1   MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47  -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100

Query: 121 AVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
            VHV ENF RANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 145


>gi|255087366|ref|XP_002505606.1| predicted protein [Micromonas sp. RCC299]
 gi|226520876|gb|ACO66864.1| predicted protein [Micromonas sp. RCC299]
          Length = 146

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 71/108 (65%), Positives = 85/108 (78%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           MR++   G+   GAYL KAVA+ A+F GA+LSD LMDR VLN AN  +A++ R VLT SD
Sbjct: 1   MRKAKLRGANLTGAYLMKAVAFAADFEGANLSDALMDRAVLNNANFKDAIMTRVVLTSSD 60

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 244
           LG A+IEGADFSDA+ID+ Q+QALCKYANG N +TGVSTRKSL CG S
Sbjct: 61  LGDAVIEGADFSDALIDVKQQQALCKYANGVNSVTGVSTRKSLNCGGS 108


>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 114

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 68/111 (61%), Positives = 84/111 (75%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NFT AD+R +   G+   GAY+ K VA + +FTGAD+SD LMDR VL +AN TNA+L R 
Sbjct: 4   NFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFTNAILNRV 63

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           VLT SDL GAI+E ADF+DA++D+  +QALCK A+G NP TGVSTR SLGC
Sbjct: 64  VLTSSDLEGAIVENADFTDALLDVKTQQALCKTASGKNPETGVSTRVSLGC 114


>gi|224125144|ref|XP_002329904.1| predicted protein [Populus trichocarpa]
 gi|222871141|gb|EEF08272.1| predicted protein [Populus trichocarpa]
          Length = 108

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 63/81 (77%), Positives = 68/81 (83%), Gaps = 4/81 (4%)

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
           MV+NEANLTNAVLVR+ LTR DLGGA I GAD SD+VIDL QKQ    YA+GTNP TGVS
Sbjct: 1   MVINEANLTNAVLVRSALTRCDLGGAQIAGADSSDSVIDLPQKQ----YASGTNPTTGVS 56

Query: 235 TRKSLGCGNSRRNAYGSPSSP 255
            R SLGCGNSRRNAYG+PSSP
Sbjct: 57  NRASLGCGNSRRNAYGTPSSP 77


>gi|434390855|ref|YP_007125802.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428262696|gb|AFZ28642.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 176

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 59/110 (53%), Positives = 77/110 (70%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MRE++F G+    A L K V  +AN  GA+L+  L+DR+ L+EANL NA+L   +
Sbjct: 66  FVAAEMREANFQGADLTNAILTKGVLLRANLEGANLTGALVDRVTLDEANLKNAILQEAI 125

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS L  A I GADF+DA+ID  Q   LC  A+G NP+TGVSTR+SLGC
Sbjct: 126 LTRSRLFDADITGADFTDALIDRYQVSLLCDRADGVNPVTGVSTRESLGC 175


>gi|416382245|ref|ZP_11684306.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
           0003]
 gi|357265427|gb|EHJ14194.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
           0003]
          Length = 171

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/129 (46%), Positives = 82/129 (63%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DL K V         F +ADMRE++F GS  + A   +A   KAN  GA+L+ +L+
Sbjct: 51  FSHKDLEKGV---------FAAADMREANFEGSNLSYAIFTEATLLKANLKGANLTSSLL 101

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LT+A+L+  + TR+    A+I GADF+DAVID  Q   +C+ A G NP+TG
Sbjct: 102 DRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCERAEGVNPVTG 161

Query: 233 VSTRKSLGC 241
           VSTR SLGC
Sbjct: 162 VSTRDSLGC 170


>gi|67921246|ref|ZP_00514765.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
 gi|67857363|gb|EAM52603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
          Length = 172

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/129 (46%), Positives = 82/129 (63%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DL K V         F +ADMRE++F GS  + A   +A   KAN  GA+L+ +L+
Sbjct: 52  FSHKDLEKGV---------FAAADMREANFEGSNLSYAIFTEATLLKANLKGANLTSSLL 102

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LT+A+L+  + TR+    A+I GADF+DAVID  Q   +C+ A G NP+TG
Sbjct: 103 DRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCERAEGVNPVTG 162

Query: 233 VSTRKSLGC 241
           VSTR SLGC
Sbjct: 163 VSTRDSLGC 171


>gi|218247318|ref|YP_002372689.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218167796|gb|ACK66533.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 172

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/129 (47%), Positives = 83/129 (64%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DL KAV         F +A+MRE++F GS  + A L + V  KAN   A+L+ +L+
Sbjct: 52  FSHRDLEKAV---------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDANLTGSLL 102

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTNA+LV  + TR+     II GADF+DAVID  Q   +C+ A+G NP+TG
Sbjct: 103 DRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVTG 162

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 163 VATRDSLGC 171


>gi|254421873|ref|ZP_05035591.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196189362|gb|EDX84326.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 187

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 74/110 (67%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +AD R+++F G+  +G  L KA   + N  GAD + T  DR++ + A+LTNA+ V  +
Sbjct: 76  FAAADARDANFEGADMSGTILTKATFLRTNLKGADFTKTFADRVLFDGADLTNAIFVEAI 135

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            T S  G  II GADFSDA+ID  Q + +CK A+G NP+TG+STR+SLGC
Sbjct: 136 ATSSSFGDTIITGADFSDAIIDRFQVKKMCKRADGINPVTGISTRESLGC 185


>gi|257061347|ref|YP_003139235.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
 gi|256591513|gb|ACV02400.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
          Length = 172

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 60/129 (46%), Positives = 82/129 (63%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DL KAV         F +A+MRE++F GS  + A L + V  KAN    +L+ +L+
Sbjct: 52  FSHRDLEKAV---------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDVNLTGSLL 102

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTNA+LV  + TR+     II GADF+DAVID  Q   +C+ A+G NP+TG
Sbjct: 103 DRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVTG 162

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 163 VATRDSLGC 171


>gi|434384986|ref|YP_007095597.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428015976|gb|AFY92070.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 165

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/129 (48%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           FG  DL   V         F S+++R  + SG+    A L  AV  K N +GA+L+  L 
Sbjct: 45  FGGQDLTGGV---------FVSSELRGVNMSGANLTNAMLTMAVLLKTNLSGANLTGALA 95

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR   +EA+LTNA+L    LTRS   GA I GADF+DA+ID AQ + LC  A+G NP+TG
Sbjct: 96  DRATFDEADLTNAILTEATLTRSRFYGAKITGADFTDALIDRAQAKLLCDRADGINPVTG 155

Query: 233 VSTRKSLGC 241
           VSTR SLGC
Sbjct: 156 VSTRDSLGC 164


>gi|428316344|ref|YP_007114226.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428240024|gb|AFZ05810.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 169

 Score =  110 bits (275), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 55/110 (50%), Positives = 72/110 (65%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A L K V   AN +GA+LS  L DR+  + ANLTNA     +
Sbjct: 59  FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFTEAI 118

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +TR+    A I GADFSDA+ID  Q   LC+ A+G NP+TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFSDAIIDAYQVSILCEKADGVNPVTGVSTRESLGC 168


>gi|126658078|ref|ZP_01729230.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
 gi|126620716|gb|EAZ91433.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
          Length = 181

 Score =  110 bits (275), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 74/110 (67%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +ADMRE++F GS  + +   + +   AN  G DLS +L+DR+ L+ A+LTNA+LV  +
Sbjct: 71  FAAADMREANFEGSNLSYSIFTEGILLGANLKGVDLSSSLLDRVTLDFADLTNAILVDAI 130

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            TR+    A I GADF++AVID  Q   +C+ A G NP+TGVSTR SLGC
Sbjct: 131 ATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGVNPVTGVSTRDSLGC 180


>gi|434405844|ref|YP_007148729.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428260099|gb|AFZ26049.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 168

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 55/110 (50%), Positives = 72/110 (65%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A L K V  KAN  GA+L+  L+DR+ L+ ANL NA+     
Sbjct: 58  FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLAGALVDRVTLDGANLKNAIFTEAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A + GADF+DA+ID  Q   LCK A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDADVTGADFTDALIDRYQVALLCKSADGVNPVTGISTRDSLGC 167


>gi|75908890|ref|YP_323186.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75702615|gb|ABA22291.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 168

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 55/112 (49%), Positives = 73/112 (65%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            NF +A+MR ++F G+    A L K V  KAN + A+L+  L+DR+ L+ ANL NA+   
Sbjct: 56  VNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANLTGALVDRVTLDNANLKNAIFTE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             LTRS    A I GADF+DA+ID  Q   LC+ A+G NP+TGV+TR SLGC
Sbjct: 116 ATLTRSRFYDADITGADFTDAIIDRYQVSLLCERADGVNPVTGVATRDSLGC 167


>gi|254412921|ref|ZP_05026693.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196180085|gb|EDX75077.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 180

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 73/110 (66%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  AD+R + F G+   G+ L KA  ++A+ TGA+LS+TL DR+V + ANLTNA+    +
Sbjct: 70  FAGADLRGASFRGASLQGSILTKAAFFEADLTGANLSETLADRVVFDGANLTNAIFTNAI 129

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +RS      I GADFS A++D  Q   +C+ A+G NP+TGVSTR SLGC
Sbjct: 130 ASRSRFFDTTITGADFSGAILDTYQISLMCQRADGVNPVTGVSTRDSLGC 179


>gi|354555882|ref|ZP_08975181.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|353552206|gb|EHC21603.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 182

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 75/110 (68%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +ADMRE++F GS  + +   + +   AN  GA+LS +L+DR+ L+ A+LTNA+LV  +
Sbjct: 72  FAAADMREANFEGSNLSYSIFTEGILLGANLKGANLSSSLLDRVTLDFADLTNAILVDAI 131

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            TR+    A I GADF++AVID  Q   +C+ A G NP+TGVSTR SLGC
Sbjct: 132 ATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGVNPVTGVSTRDSLGC 181


>gi|172037118|ref|YP_001803619.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|171698572|gb|ACB51553.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
          Length = 184

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 75/110 (68%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +ADMRE++F GS  + +   + +   AN  GA+LS +L+DR+ L+ A+LTNA+LV  +
Sbjct: 74  FAAADMREANFEGSNLSYSIFTEGILLGANLKGANLSSSLLDRVTLDFADLTNAILVDAI 133

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            TR+    A I GADF++AVID  Q   +C+ A G NP+TGVSTR SLGC
Sbjct: 134 ATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGVNPVTGVSTRDSLGC 183


>gi|332712340|ref|ZP_08432267.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332348814|gb|EGJ28427.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 169

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/113 (46%), Positives = 75/113 (66%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R  F  A+MR ++F G+  +G+   K    KAN  GA+L+D+L DR++L++ANLTNA+L 
Sbjct: 56  RGVFAGAEMRGTNFQGADLSGSIFTKGNLLKANLEGANLTDSLADRVILDQANLTNAILT 115

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             ++  +    A I GADF+DA+ID  Q + +C  A G NP+TG+STR SLGC
Sbjct: 116 DAIMNSTRFYDAEITGADFTDALIDRYQAKLMCGRATGVNPVTGISTRDSLGC 168


>gi|295293762|gb|ADF88289.1| pentapeptide repeat-containing protein [Aphanizomenon sp. 10E6]
          Length = 168

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A   K V  KAN   A+L+  L+DR+ L+ ANL NA+  +  
Sbjct: 58  FVAAEMRGTNFQGANLTNAIFTKGVLLKANLEAANLTGALVDRVTLDSANLRNAIFTKAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADF+DA+ID  Q   LC+ A+G NP+TGVSTR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPVTGVSTRDSLGC 167


>gi|186685193|ref|YP_001868389.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186467645|gb|ACC83446.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 168

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 72/110 (65%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A L K V  KAN  GA+LS  L+DR+ ++ ANL NA+     
Sbjct: 58  FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLSGALVDRVTMDGANLKNAIFTEAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADF+DA+ID  Q   +C+ A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDAEITGADFTDALIDRYQVSLMCERADGVNPVTGMSTRDSLGC 167


>gi|428224803|ref|YP_007108900.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984704|gb|AFY65848.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 176

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F SA+MR ++F G+  + A L K V   AN  GA+L+  L DR+   +ANL NA+LV   
Sbjct: 67  FVSAEMRNANFEGANLSNAILTKGVLLNANLEGANLTGALADRVFWLDANLRNAILVDVT 126

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            TR+   G  + GADFSDA++D  + + LCK A G NP+TGV+TR SLGC
Sbjct: 127 ATRTSFEGVDVTGADFSDAILDRYELKELCKRAEGVNPVTGVATRDSLGC 176


>gi|428778133|ref|YP_007169920.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
 gi|428692412|gb|AFZ45706.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
          Length = 174

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 62/137 (45%), Positives = 82/137 (59%), Gaps = 9/137 (6%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           + I S   F + DL  AV         F +A+MR+++FSGS    A   K     A+ + 
Sbjct: 46  YTIVSERDFSNKDLVGAV---------FAAAEMRKTNFSGSNLENAMFTKGTLINADLSN 96

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
            +LS  LMDR+ L+ A+L NAVL  T LTRS L G  IEGADF+DA+++  Q + LC+ A
Sbjct: 97  TNLSGALMDRVSLDGADLRNAVLQGTFLTRSTLEGTKIEGADFTDAILNRYQVKLLCERA 156

Query: 225 NGTNPITGVSTRKSLGC 241
            G NP TGV+TR SLGC
Sbjct: 157 EGVNPKTGVATRDSLGC 173


>gi|427728139|ref|YP_007074376.1| putative low-complexity protein [Nostoc sp. PCC 7524]
 gi|427364058|gb|AFY46779.1| putative low-complexity protein [Nostoc sp. PCC 7524]
          Length = 168

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A   K V   AN +GA+L+  L+DR  L+ ANL NA+     
Sbjct: 58  FVAAEMRGTNFQGANLTNAIFTKGVLLNANLSGANLTGALVDRATLDSANLKNAIFTEAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADF+DA+ID  Q   LC+ A+G NP+TGV+TR+SLGC
Sbjct: 118 LTRSRFYDADITGADFTDAIIDRYQVSLLCERADGINPVTGVATRESLGC 167


>gi|334119379|ref|ZP_08493465.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333458167|gb|EGK86786.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 169

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A L K V   AN +GA+LS  L DR+  + ANLTNA     +
Sbjct: 59  FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFSEAI 118

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +TR+    A I GADF+DA+ID  Q   LC+ A+G NP TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFTDAIIDAYQVSILCEKADGVNPATGVSTRESLGC 168


>gi|440681954|ref|YP_007156749.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428679073|gb|AFZ57839.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 168

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+  + A L K V  KAN   A+L+  L+DR+ L+ ANL NA+     
Sbjct: 58  FVAAEMRGANFQGANLSNAILTKGVLLKANLEDANLTGALVDRVTLDSANLKNAIFTEAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADF+DA+ID  Q   LC+ ANG N +TG+STR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNSVTGISTRDSLGC 167


>gi|17227682|ref|NP_484230.1| hypothetical protein all0186 [Nostoc sp. PCC 7120]
 gi|17135164|dbj|BAB77710.1| all0186 [Nostoc sp. PCC 7120]
          Length = 168

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/112 (48%), Positives = 71/112 (63%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            NF +A+MR ++F G+    A L K V  KAN + A+L+  L+DR  L+ ANL NA+   
Sbjct: 56  VNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANLTGALVDRATLDNANLKNAIFTE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             LTRS    A I GADF+DA+ID  Q   LC+ ANG N +TG++TR SLGC
Sbjct: 116 ATLTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNRVTGIATRDSLGC 167


>gi|428299988|ref|YP_007138294.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428236532|gb|AFZ02322.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 193

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/111 (51%), Positives = 71/111 (63%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF +A+MR  +F G+    A L K V  KAN  GA+L+  L+DR+ L+ ANL NA     
Sbjct: 82  NFVAAEMRGINFEGANLTNAMLTKGVMLKANLEGANLTAALVDRVALDGANLKNANFTDA 141

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            LTRS L  A I GADFS+A+ID  Q + LC  A+GTNP+TGV TR SL C
Sbjct: 142 TLTRSRLFDADITGADFSNALIDTYQMKLLCDRASGTNPVTGVDTRDSLEC 192


>gi|428779391|ref|YP_007171177.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
 gi|428693670|gb|AFZ49820.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
          Length = 171

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/137 (44%), Positives = 82/137 (59%), Gaps = 9/137 (6%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           + + S   F + DL  AV         F +A+MR ++FSGS    A   K     A+ + 
Sbjct: 43  YTVVSERDFSNKDLVGAV---------FAAAEMRRTNFSGSNLENAMFTKGTLINADLSN 93

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
            +LS  LMDR+ L+ A+L+NAVL  T LTRS L G  I GADF+DA+++  Q + LC+ A
Sbjct: 94  TNLSGALMDRVNLDGADLSNAVLNGTFLTRSTLEGTKITGADFTDAILNRYQVKLLCEKA 153

Query: 225 NGTNPITGVSTRKSLGC 241
            G NP TGVSTR+SLGC
Sbjct: 154 EGVNPKTGVSTRESLGC 170


>gi|427720966|ref|YP_007068960.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427353402|gb|AFY36126.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 168

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 81/141 (57%), Gaps = 1/141 (0%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           R  F + +   + + +L       E+   A F +A+MR ++F G+    A L K V  KA
Sbjct: 27  RPAFALTNVINYNNINLENRDFAHEDLTGATFVAAEMRGANFQGANLTNAVLTKGVLLKA 86

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
           + + A+L+  L+DR+ L+ ANL NA+     LTRS    A I GADF+DA+ID  Q   +
Sbjct: 87  DLSDANLTGALVDRVTLDGANLKNAIFTEATLTRSRFYDAEITGADFTDALIDRYQVSLM 146

Query: 221 CKYANGTNPITGVSTRKSLGC 241
           C  A G NP+TGVSTR SLGC
Sbjct: 147 CDRAAGINPVTGVSTRDSLGC 167


>gi|16331228|ref|NP_441956.1| hypothetical protein sll0301 [Synechocystis sp. PCC 6803]
 gi|383322971|ref|YP_005383824.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383326140|ref|YP_005386993.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383492024|ref|YP_005409700.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437292|ref|YP_005652016.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
 gi|451815384|ref|YP_007451836.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
 gi|1001404|dbj|BAA10026.1| sll0301 [Synechocystis sp. PCC 6803]
 gi|339274324|dbj|BAK50811.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
 gi|359272290|dbj|BAL29809.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275460|dbj|BAL32978.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278630|dbj|BAL36147.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|451781353|gb|AGF52322.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
          Length = 169

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 82/137 (59%), Gaps = 9/137 (6%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           +G  + + F   DL KAV         F +AD+RES+F GS  + + L  AV   A+  G
Sbjct: 41  YGDLARSDFSHQDLNKAV---------FAAADLRESNFEGSDLSFSILTDAVFLHASLRG 91

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
           A+LS +L+DR+ L+ A+L + +    + TR+      I GADFSDAVID  Q + +C+ A
Sbjct: 92  ANLSGSLVDRVTLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERA 151

Query: 225 NGTNPITGVSTRKSLGC 241
            G NP+TGV+TR SLGC
Sbjct: 152 EGVNPVTGVATRDSLGC 168


>gi|359460928|ref|ZP_09249491.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 172

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/131 (44%), Positives = 73/131 (55%), Gaps = 20/131 (15%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA--------------------DLSDT 170
           NFT AD+R  DF    F GA L  A+  KAN T A                    DL++T
Sbjct: 41  NFTFADLRYEDFENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLTET 100

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
             DR++ NEA+LTNA+    +LT S    A I GADFS A +D  Q   +C+YA+G NP+
Sbjct: 101 FADRVLFNEADLTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVNPV 160

Query: 231 TGVSTRKSLGC 241
           TGVSTR+SL C
Sbjct: 161 TGVSTRESLEC 171


>gi|158337601|ref|YP_001518776.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158307842|gb|ABW29459.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 172

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/131 (44%), Positives = 73/131 (55%), Gaps = 20/131 (15%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA--------------------DLSDT 170
           NFT AD+R  DF    F GA L  A+  KAN T A                    DL++T
Sbjct: 41  NFTFADLRYEDFENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLTET 100

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
             DR++ NEA+LTNA+    +LT S    A I GADFS A +D  Q   +C+YA+G NP+
Sbjct: 101 FADRVLFNEADLTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVNPV 160

Query: 231 TGVSTRKSLGC 241
           TGVSTR+SL C
Sbjct: 161 TGVSTRESLEC 171


>gi|407961395|dbj|BAM54635.1| hypothetical protein BEST7613_5704 [Synechocystis sp. PCC 6803]
          Length = 147

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 82/137 (59%), Gaps = 9/137 (6%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           +G  + + F   DL KAV         F +AD+RES+F GS  + + L  AV   A+  G
Sbjct: 19  YGDLARSDFSHQDLNKAV---------FAAADLRESNFEGSDLSFSILTDAVFLHASLRG 69

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
           A+LS +L+DR+ L+ A+L + +    + TR+      I GADFSDAVID  Q + +C+ A
Sbjct: 70  ANLSGSLVDRVTLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERA 129

Query: 225 NGTNPITGVSTRKSLGC 241
            G NP+TGV+TR SLGC
Sbjct: 130 EGVNPVTGVATRDSLGC 146


>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
 gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
          Length = 203

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 51/111 (45%), Positives = 75/111 (67%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F     R++DFSG+  +G+ L +A   +++F+GADLSD LMDR   +  +L+ A+L   
Sbjct: 92  SFAGVMARDADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSGALLRGV 151

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +   S   GA+I+ ADFSDA++D + ++ALC+ A GTNP TGVSTR SL C
Sbjct: 152 IAAGSSFSGAVIDDADFSDALLDRSDQRALCRRAQGTNPTTGVSTRLSLDC 202


>gi|255083653|ref|XP_002508401.1| predicted protein [Micromonas sp. RCC299]
 gi|226523678|gb|ACO69659.1| predicted protein [Micromonas sp. RCC299]
          Length = 187

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 75/133 (56%), Gaps = 6/133 (4%)

Query: 120 KAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           KA H+ E+F       A +T  D+R SDFSGS    A   +AV    N  GAD+S++ +D
Sbjct: 30  KAEHINEDFSHEDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAVMPGVNLEGADMSNSFLD 89

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
            +VL  +N+   +       RSDLG   +  ADF++AVID  Q   LC  A+GTNP TGV
Sbjct: 90  YVVLRGSNMRGVIAREANFVRSDLGDCDVTDADFTEAVIDRYQAIGLCDSASGTNPFTGV 149

Query: 234 STRKSLGCGNSRR 246
            TR SLGC   +R
Sbjct: 150 DTRDSLGCERLKR 162


>gi|427706655|ref|YP_007049032.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427359160|gb|AFY41882.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 168

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 70/110 (63%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F  +    A   K V  KAN  GA+L+  L+DR+ L+ ANL NA      
Sbjct: 58  FVAAEMRGTNFQAANLTNAIFTKGVLLKANLEGANLTGALVDRVTLDGANLKNANFTEAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADF+DA+ID  Q   LC+ A+G NP+TGV+TR+SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQISLLCERADGVNPVTGVATRESLGC 167


>gi|443313318|ref|ZP_21042930.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
 gi|442776723|gb|ELR87004.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
          Length = 182

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 70/110 (63%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A + K V   AN  GA+LS  L+DR+ L+ ANL NA+     
Sbjct: 72  FVAAEMRGANFQGADLTNAIMTKGVLLGANLEGANLSGALVDRVTLDNANLKNAIFTDAT 131

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADFS+A+ID  Q   LC  A GTNP+TG++T +SLGC
Sbjct: 132 LTRSRFFDADITGADFSNALIDRYQINLLCDRATGTNPVTGITTTESLGC 181


>gi|428203864|ref|YP_007082453.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427981296|gb|AFY78896.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 170

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 72/110 (65%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +ADMR  +F  S  +   L + V   AN  GA+L+++LMDR+ L+ A+LTNA+ V  +
Sbjct: 60  FAAADMRGINFEDSDLSNTILTEGVLLGANLKGANLTNSLMDRVTLDFADLTNAIFVDAI 119

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            TR+      I GADFS AV+D  Q + LC  A+G NP+TG+STR+SLGC
Sbjct: 120 ATRTRFYDTTITGADFSGAVLDRYQVKLLCDRADGVNPVTGISTRESLGC 169


>gi|425438309|ref|ZP_18818714.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
 gi|425452591|ref|ZP_18832408.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
 gi|440756403|ref|ZP_20935604.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|443646807|ref|ZP_21129485.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159025958|emb|CAO87888.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|389676535|emb|CCH94452.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
 gi|389765527|emb|CCI08587.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
 gi|440173625|gb|ELP53083.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|443335636|gb|ELS50100.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 166

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 57/129 (44%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  GS  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGSDLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|443314355|ref|ZP_21043921.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442786047|gb|ELR95821.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 173

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/182 (34%), Positives = 95/182 (52%), Gaps = 14/182 (7%)

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ-FGSADLRK 120
           + +   WR  +   L  A+       I+A A              IG   Q F  +DL +
Sbjct: 3   WQRSGEWRQILRGGLLFAIAIVLWGGIAARA------------IAIGEITQDFTYSDLNR 50

Query: 121 AVHVKENFRANFTSADM-RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
                EN      +A   RE++FSG+  +   L K   YKA   GA+L+ +  DR++ + 
Sbjct: 51  QDFAGENLAGASLAAADAREANFSGADLSQTILTKGNFYKAKLVGANLTQSFADRVIFDG 110

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           A+L+NA++V  ++T +  G A I+GADFS  ++D  Q   +C+YA+G NP+TGV+TR SL
Sbjct: 111 ADLSNALVVDAIMTSTSFGEATIQGADFSGTILDRYQVAQMCEYADGVNPVTGVATRDSL 170

Query: 240 GC 241
           GC
Sbjct: 171 GC 172


>gi|425469693|ref|ZP_18848608.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
 gi|389880432|emb|CCI38813.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
          Length = 166

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|443322626|ref|ZP_21051645.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
 gi|442787675|gb|ELR97389.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
          Length = 164

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 58/129 (44%), Positives = 73/129 (56%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DL  AV         F  AD+R ++F  +    + L + V   AN T A+L+D L 
Sbjct: 43  FSGQDLEGAV---------FADADLRGANFQAANLANSILTQGVFLNANLTKANLTDALA 93

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR    EANLT+A+LV  + +RS    AII GADFS A++D  Q   LC  A GTNP+TG
Sbjct: 94  DRATFAEANLTDAILVNIIASRSSFVDAIITGADFSGAILDKYQVALLCDRAQGTNPVTG 153

Query: 233 VSTRKSLGC 241
           VSTR SL C
Sbjct: 154 VSTRASLNC 162


>gi|422303610|ref|ZP_16390961.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
 gi|389791366|emb|CCI12792.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
          Length = 166

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|425465439|ref|ZP_18844748.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|389832325|emb|CCI24153.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9809]
          Length = 166

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 79/129 (61%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR ++  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGANLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|300868096|ref|ZP_07112733.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
 gi|300333934|emb|CBN57911.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
          Length = 174

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 68/110 (61%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A L K V   AN + A+LS  L DR+  + ANLTNA     +
Sbjct: 64  FVAAEMRNTNFEGADLTNAILTKGVLLNANLSNANLSGALADRVTFDGANLTNANFTEAI 123

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTR+      I GADF+DA+ID  Q   LC+ A G N +TGVSTR+SLGC
Sbjct: 124 LTRTRFYDTAISGADFTDAIIDSYQVNLLCEKAEGVNSVTGVSTRESLGC 173


>gi|443328655|ref|ZP_21057250.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442791786|gb|ELS01278.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 222

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 72/110 (65%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +AD+R S+F GS  + + L KA+    N +G DL+++ MDR+ L+ +NL+NA+L   +
Sbjct: 112 FAAADVRGSNFEGSDLSNSILTKAIFTDTNLSGVDLTNSFMDRVDLSNSNLSNAILQDII 171

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            T ++     I GADFS A+ID  Q   LC+ A G NP+TGVSTR SLGC
Sbjct: 172 ATSTNFYNTDITGADFSGAIIDRYQTYVLCQRAAGVNPVTGVSTRYSLGC 221


>gi|414075538|ref|YP_006994856.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
 gi|413968954|gb|AFW93043.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
          Length = 168

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 68/110 (61%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F  +    A   K V  KAN   A+L+  L+DR+  + ANL NA+     
Sbjct: 58  FVAAEMRGTNFQDANLTNAIFTKGVLLKANLESANLTGALVDRVTFDSANLRNAIFAEAT 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTRS    A I GADF+DA+ID  Q   LC+ A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPVTGISTRDSLGC 167


>gi|425446471|ref|ZP_18826474.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
 gi|389733275|emb|CCI02926.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
          Length = 166

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|166365075|ref|YP_001657348.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
 gi|166087448|dbj|BAG02156.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
          Length = 166

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|411119939|ref|ZP_11392315.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410710095|gb|EKQ67606.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 169

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 72/110 (65%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F SA+MR ++FSG+    A   K     AN +GA+L   L+DR  L +A+L+NA+L+   
Sbjct: 59  FVSAEMRGTNFSGAILTNAMFTKGNLLGANLSGANLEGALLDRTTLYKADLSNAILIDAT 118

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S L  A ++GADF++A++D      LCK A GTNP TG+STR+SLGC
Sbjct: 119 LSNSILDEATVDGADFTNAIVDRYAVSQLCKRAQGTNPTTGISTRESLGC 168


>gi|425439807|ref|ZP_18820122.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
 gi|425456970|ref|ZP_18836676.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
 gi|389719892|emb|CCH96344.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
 gi|389801790|emb|CCI19079.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
          Length = 166

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
 gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
          Length = 167

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 72/111 (64%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  R +DFSG+  +GA   +    +A+F+ ADLSD+LMDR   +  NLTNA+L   
Sbjct: 57  SFAGAVGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGV 116

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + + S   GA IEGADFSDA++D      LC+ A G NPITG++TR SLGC
Sbjct: 117 IASGSSFAGASIEGADFSDALLDRDDVVRLCRDAEGVNPITGMATRDSLGC 167


>gi|318040416|ref|ZP_07972372.1| hypothetical protein SCB01_01865 [Synechococcus sp. CB0101]
          Length = 174

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 52/111 (46%), Positives = 72/111 (64%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  + ++F+G+  +GA   +    +A+F+GADLSD LMDR  ++  NL NAVLV  
Sbjct: 64  SFAGAVGKGANFAGANLHGAIFTQGAFPEADFSGADLSDVLMDRTDMSHTNLRNAVLVGV 123

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +   +   GA + GADFSDA+ID A ++ LC  A+GTNP TG  TR SLGC
Sbjct: 124 IAAGASFSGADVTGADFSDALIDRADQRQLCAKASGTNPSTGADTRASLGC 174


>gi|428308896|ref|YP_007119873.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428250508|gb|AFZ16467.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 176

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 77/134 (57%), Gaps = 1/134 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S+  + S DL       +N   A F +A+MR ++F  S    A L K V   AN   A+L
Sbjct: 42  SSINYSSTDLTNRDFSHKNLVGAVFVAAEMRGTNFQESDLTNAILTKGVMLGANLQDANL 101

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
           +  L+DR+ L+ ANL NA+     + RS    A I GADF+DA+ID  Q   LC+ A+G 
Sbjct: 102 TGALVDRVTLDNANLKNAIFQEATMIRSRFYDADITGADFTDAIIDRYQVSLLCEKASGV 161

Query: 228 NPITGVSTRKSLGC 241
           NPITGV+TR SLGC
Sbjct: 162 NPITGVATRDSLGC 175


>gi|390440134|ref|ZP_10228485.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
 gi|389836418|emb|CCI32609.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
          Length = 166

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR  +  G+  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + +RS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIASRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|434407744|ref|YP_007150629.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428261999|gb|AFZ27949.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 162

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 76/112 (67%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A++  ++F+G+   GA L  +V  KAN  GADL++ ++D++ L  A+L++AV + 
Sbjct: 50  AEFSNANLELTNFTGADLRGAVLSASVMTKANLHGADLTNAMVDQVNLTRADLSDAVFIE 109

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L R+      IEGADF+DA++D AQ + LC+ A+G N  TGV TR SLGC
Sbjct: 110 ALLLRAIFTDVNIEGADFTDAILDRAQVKELCEKASGVNSQTGVQTRDSLGC 161


>gi|170077406|ref|YP_001734044.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
 gi|169885075|gb|ACA98788.1| Pentapeptide repeats protein [Synechococcus sp. PCC 7002]
          Length = 169

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 57/120 (47%), Positives = 77/120 (64%), Gaps = 1/120 (0%)

Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
            EN +A +F  AD+R SDF+GS  + A L +    +AN T A+LS+  MD++ +  ANLT
Sbjct: 50  HENLQAASFARADVRGSDFTGSDLSRAILTEGKFMEANLTEANLSEAFMDQVNMEGANLT 109

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
           NA+ V  V   ++   AII+GADFS A++D  Q   LCK A+GTN ITG+ TR SL C N
Sbjct: 110 NALFVDAVAPGTNFAEAIIDGADFSGALLDRYQLSELCKRASGTNTITGIDTRYSLNCKN 169


>gi|33862602|ref|NP_894162.1| hypothetical protein PMT0329 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634518|emb|CAE20504.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 179

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 53/122 (43%), Positives = 77/122 (63%)

Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           K  H ++   ++F  A  R +DFS S  +GA L +    ++NF+GADLSD LMDR+   +
Sbjct: 58  KDFHAQDLSNSSFAGAVARAADFSNSNLHGAILTQGTFTQSNFSGADLSDALMDRVDFVD 117

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
            +L N VL   + + S   GA I+GADFSDA++DL  ++ LC  A+G N ITG++T +SL
Sbjct: 118 TDLRNCVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFESL 177

Query: 240 GC 241
            C
Sbjct: 178 NC 179


>gi|298489879|ref|YP_003720056.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298231797|gb|ADI62933.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 163

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 76/112 (67%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A++  ++F+G+   G     +V  KAN  GA+L++ +++ + LN A+L++A+L+ 
Sbjct: 51  AEFSNANLEMANFAGADLRGTVFSASVMTKANLHGANLTNAMVNEVKLNGADLSDAILLE 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L RS      IEGADFSDA++D +Q Q LCK A+G N  TGV TR+SLGC
Sbjct: 111 ALLLRSIFTDVNIEGADFSDAILDRSQIQELCKKASGVNSQTGVETRESLGC 162


>gi|440684176|ref|YP_007158971.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428681295|gb|AFZ60061.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 162

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 77/112 (68%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A++  ++F+G+   GA L  +V  +AN  GADL++ ++D++ LN A+L++A+L+ 
Sbjct: 51  AEFSNANLEMANFTGADLRGAVLSASVMTQANLHGADLTNAMIDQVKLNGADLSDAILLE 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L RS      I GADF+DA++D AQ + LC+ A+G N  TGV TR SLGC
Sbjct: 111 ALLLRSIFTDVNIAGADFTDAILDKAQIKELCQKASGVNSRTGVETRDSLGC 162


>gi|434400337|ref|YP_007134341.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428271434|gb|AFZ37375.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 169

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 51/104 (49%), Positives = 68/104 (65%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R ++F GS  + + L KAV   AN    +L+ +LMDR+ L+ +NLTNA++   V T ++ 
Sbjct: 65  RGANFEGSDLSNSILTKAVFSNANLAEINLTKSLMDRVALDNSNLTNAIIREAVATSTNF 124

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            GA I GADFSD+++D  Q   LCK A G NP TGVSTR SLGC
Sbjct: 125 DGATITGADFSDSILDRYQIYLLCKRAEGVNPTTGVSTRDSLGC 168


>gi|124023686|ref|YP_001017993.1| hypothetical protein P9303_19861 [Prochlorococcus marinus str. MIT
           9303]
 gi|123963972|gb|ABM78728.1| Uncharacterized low-complexity proteins [Prochlorococcus marinus
           str. MIT 9303]
          Length = 179

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 53/122 (43%), Positives = 76/122 (62%)

Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           K  H ++    +F  A  R +DFS S   GA L +    ++NF+GADLSD LMDR+   +
Sbjct: 58  KDFHAQDLSNTSFAGAVARAADFSNSNLRGAILTQGTFTQSNFSGADLSDALMDRVDFVD 117

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
            +L N+VL   + + S   GA I+GADFSDA++DL  ++ LC  A+G N ITG++T +SL
Sbjct: 118 TDLRNSVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFESL 177

Query: 240 GC 241
            C
Sbjct: 178 NC 179


>gi|425462969|ref|ZP_18842432.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
 gi|389823905|emb|CCI27601.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
          Length = 166

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR  V         F +A MR ++   +  + + L +AV  KAN  GADL+ +L+
Sbjct: 46  FSHQDLRGGV---------FAAAAMRGANLEEADLSYSILTEAVLLKANLKGADLTASLV 96

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+ L+ A+LTN +    + TRS     II GADF++AVID  Q + +C+ A+G NP+TG
Sbjct: 97  DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156

Query: 233 VSTRKSLGC 241
           V+TR SLGC
Sbjct: 157 VATRDSLGC 165


>gi|86609913|ref|YP_478675.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86558455|gb|ABD03412.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 173

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 83/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +ADL+      +++R ++F SA+++ +D  G+   GA   KA    AN +GADLS++L
Sbjct: 43  FNNADLQGQDLSGQDWRGSSFVSANLQGADLHGANLAGAAFTKANLAGANLSGADLSNSL 102

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D   L  A+L  A L   +  R+   GA I GADFS+A +D A K+ LC+ A G++PIT
Sbjct: 103 LDLANLAGADLRGAKLTGAIAARAVWQGAQIAGADFSEAYVDRAAKRQLCERAEGSHPIT 162

Query: 232 GVSTRKSLGC 241
           GV+TR+SLGC
Sbjct: 163 GVTTRESLGC 172


>gi|17230233|ref|NP_486781.1| hypothetical protein alr2741 [Nostoc sp. PCC 7120]
 gi|17131834|dbj|BAB74440.1| alr2741 [Nostoc sp. PCC 7120]
          Length = 182

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 82/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     E+ +A  F++A++  ++F G+   GA L  +V  +AN  GADL++ +
Sbjct: 52  FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L  ANL++ VL   +L R+      IEGADF+DA++D AQ + LC  A+G N  T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171

Query: 232 GVSTRKSLGC 241
           GV TR SLGC
Sbjct: 172 GVETRDSLGC 181


>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 166

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 70/179 (39%), Positives = 92/179 (51%), Gaps = 28/179 (15%)

Query: 72  VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 123
           ++T L A +V  C   + ALA   KY         AE +G+        F    LR A  
Sbjct: 6   LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56

Query: 124 VKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
              N  R NFT AD+R + FS S          V   AN  GADLS+ ++D++    A+L
Sbjct: 57  SNANLERTNFTDADLRGTIFSAS----------VMTHANLHGADLSNAMIDQVSFTNADL 106

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           ++AVL  +++ RS      I GADFSDA++D AQ + LC  A G N  TGVSTR SLGC
Sbjct: 107 SDAVLTESIMLRSTFDNVDITGADFSDAILDGAQIKELCTKATGVNSQTGVSTRDSLGC 165


>gi|75910505|ref|YP_324801.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704230|gb|ABA23906.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 182

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 82/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     E+ +A  F++A++  ++F G+   GA L  +V  +AN  GADL++ +
Sbjct: 52  FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L  ANL++ VL   +L R+      IEGADF+DA++D AQ + LC  A+G N  T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171

Query: 232 GVSTRKSLGC 241
           GV TR SLGC
Sbjct: 172 GVKTRDSLGC 181


>gi|427722287|ref|YP_007069564.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427354007|gb|AFY36730.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 175

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 71/114 (62%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F  AD+R SDFSGS  + A L +      N +GADL++  MD++ L+ ANLTNA+   
Sbjct: 62  ASFARADVRSSDFSGSDLSRAILSEGKFMDTNLSGADLTEAFMDQVNLSGANLTNAIFTD 121

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
            V   ++   A I GADFS A++D  Q   LCK A+GTN ITG+ TR SL C N
Sbjct: 122 AVAPGTNFTDANIAGADFSGALLDRYQLSQLCKRASGTNAITGIETRYSLNCEN 175


>gi|56750202|ref|YP_170903.1| hypothetical protein syc0193_c [Synechococcus elongatus PCC 6301]
 gi|81300170|ref|YP_400378.1| hypothetical protein Synpcc7942_1361 [Synechococcus elongatus PCC
           7942]
 gi|56685161|dbj|BAD78383.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81169051|gb|ABB57391.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 167

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 69/110 (62%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F S +MR+++   +    A L   V   ANF GADLS  L+DR+ L  A+LT+A+LV   
Sbjct: 57  FVSTEMRKANLEEANLRNAILTLGVFLDANFHGADLSGALLDRVFLVGADLTDALLVDVT 116

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            TR+      I GADF+DA+ID  +++ LC  A+G NP TGV+TR SLGC
Sbjct: 117 ATRTSFQDVKITGADFTDAIIDRYEQKQLCLRADGVNPKTGVATRDSLGC 166


>gi|428224653|ref|YP_007108750.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984554|gb|AFY65698.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 187

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/110 (43%), Positives = 70/110 (63%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F++A++  ++F G+   G     +V   AN  GA+L++ LMD+  L  A+L  A+L   +
Sbjct: 77  FSNANLERANFEGADVRGGVFSASVLTDANLQGANLTNALMDQANLTRADLRGAILSEAI 136

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L  S      I GADFSDA++D AQ +ALC+ A G NP+TG+STR+SLGC
Sbjct: 137 LLGSTFAETAIAGADFSDAILDGAQIKALCQRAEGVNPVTGLSTRESLGC 186


>gi|427716094|ref|YP_007064088.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427348530|gb|AFY31254.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 163

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 83/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L++     E  + A F++A++ +++F+G+   GA L  +V  + N  GADL+D L
Sbjct: 33  FSNAELKRHDFSGETLQGAEFSNANLEQANFAGADLRGAVLSASVMTQTNLHGADLTDAL 92

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L +A+L++AVL   +L R+      I  ADF+DAV+D AQ + LC  A+G N  T
Sbjct: 93  VDQVNLTKADLSDAVLKEALLLRAIFTDVNINSADFTDAVLDRAQIKELCGKASGVNSKT 152

Query: 232 GVSTRKSLGC 241
           GV TR SLGC
Sbjct: 153 GVQTRDSLGC 162


>gi|428206519|ref|YP_007090872.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428008440|gb|AFY87003.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 192

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 76/112 (67%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A+M + +F+ +   GA +  +V  +AN  GADLS  ++D++ +  A+L++AVL  
Sbjct: 80  AEFSNANMEQVNFTDADLRGAIMSASVMTQANLHGADLSIAMVDQVKMTGADLSDAVLQE 139

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L R+   G  I GADFSDA++D AQ + LC+ A+G N  TG++TR+SLGC
Sbjct: 140 ALLLRTIFTGVDITGADFSDAILDGAQVKELCQRASGINSKTGIATRESLGC 191


>gi|428209239|ref|YP_007093592.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428011160|gb|AFY89723.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 165

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 50/113 (44%), Positives = 71/113 (62%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA F +  + E++FS +   GA    AV  KAN  G D S  +     L+ A+L++A+L 
Sbjct: 52  RAEFNNTKLAEANFSSADLRGAVFNSAVLRKANLHGVDFSYGIAYLSDLSAADLSDAILT 111

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             ++ RS+  GA + GADFS+AV+D  Q   LC+YA+G NP+TGV TR+SLGC
Sbjct: 112 SAMMLRSNFKGAKVTGADFSEAVLDREQVVQLCEYASGVNPVTGVDTRESLGC 164


>gi|428316951|ref|YP_007114833.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428240631|gb|AFZ06417.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 165

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 51/130 (39%), Positives = 82/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     +  RA  F++A+M  ++FS +   GA +  +V  +AN  GA+L++ +
Sbjct: 35  FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLRGAIMSASVMTQANLHGANLTNAM 94

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++    A+L++A+L  T+L RS   G  I GADF+DA++D +Q + LC  A G N  T
Sbjct: 95  IDQVKFTNADLSDAILAETILLRSTFDGVDITGADFTDAIMDGSQVKELCTKATGINSQT 154

Query: 232 GVSTRKSLGC 241
           G+STR SLGC
Sbjct: 155 GISTRDSLGC 164


>gi|428306100|ref|YP_007142925.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428247635|gb|AFZ13415.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 174

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 51/129 (39%), Positives = 79/129 (61%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F + DL  AV         F +A M+ ++F GS  + A L +     ANF  A+L++ L+
Sbjct: 54  FSNTDLTGAV---------FAAAQMKGANFQGSNLSNAILSQGTLSNANFADANLTNALV 104

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           D++ L+ A+LTNA+  +  +  ++   + I GADF+DA+ID  Q + LC+ A+G NP+T 
Sbjct: 105 DQVTLDGADLTNAIFRQATMVGTNFNDSAIAGADFTDAIIDRYQLKQLCQRASGVNPVTA 164

Query: 233 VSTRKSLGC 241
           VSTR+SLGC
Sbjct: 165 VSTRESLGC 173


>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
 gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 171

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 52/119 (43%), Positives = 73/119 (61%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H +     +F  A  R ++FSG+  +GA   +    +A+F+GADLSD LMDR      NL
Sbjct: 53  HGQHLANTSFAGAVGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNL 112

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +AVL   + + S    A I GADFSDA++DL  ++ LC+ A+G NP+TGV+T  SLGC
Sbjct: 113 RDAVLTGIIASGSSFSDAQIAGADFSDALLDLDDQRRLCRDADGVNPVTGVATLDSLGC 171


>gi|434390929|ref|YP_007125876.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428262770|gb|AFZ28716.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 163

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 52/130 (40%), Positives = 81/130 (62%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L+      +  RA+ F++A+M +++F+ +   GA    +V  KAN  GA+L++ +
Sbjct: 33  FSNAELKGRDFSGQMLRASEFSNANMEQTNFTDADLRGAIFSASVMTKANLHGANLTNAM 92

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
            D++    A+L+ AVL  T+L RS      I  ADFSDA++D  Q + LC+ A+G NP T
Sbjct: 93  ADQVNFTNADLSAAVLAETILLRSVFDNTDITAADFSDAILDGVQIKELCQRASGVNPTT 152

Query: 232 GVSTRKSLGC 241
           GV TR+SLGC
Sbjct: 153 GVDTRESLGC 162


>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
          Length = 175

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A  + ++FSG+  +GA L +     ANF GADLSD L+DR  ++  +L NAVLV  +
Sbjct: 66  FAGAVGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRNAVLVGVI 125

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + S   GA +E ADF+DA++D A ++  C  A+GTNP TG +TR SLGC
Sbjct: 126 ASGSTFTGAQVENADFTDALLDRADQRNFCISASGTNPTTGANTRASLGC 175


>gi|186684198|ref|YP_001867394.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186466650|gb|ACC82451.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 174

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 76/112 (67%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A+M  ++FS +   GA +  +V  KAN  GADL++ ++D++ L +A+L++A+   
Sbjct: 62  AEFSNANMELANFSNADLRGAVMSASVMTKANLHGADLTNAMVDQVNLTKADLSDAIFKE 121

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L R+      I+GADF+DA++D AQ + LC+ A+G N  TGV TR+SLGC
Sbjct: 122 ALLLRAIFNDVNIDGADFTDAILDRAQIKELCRKASGVNSKTGVQTRESLGC 173


>gi|414079521|ref|YP_007000945.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
 gi|413972800|gb|AFW96888.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
          Length = 162

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 74/112 (66%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A++  ++F+G+   G     +V  KAN  GADL++ +++ + L  A+L+NAVL+ 
Sbjct: 50  AEFSNANLEMANFTGADLRGTVFSASVMTKANLHGADLTNAMVNEVKLAGADLSNAVLIE 109

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L R+      I GADF+DA++D AQ + LC+ A+G N  TGV TR+SLGC
Sbjct: 110 ALLLRTVFTDVNITGADFTDAILDKAQIKELCQKASGVNSQTGVETRESLGC 161


>gi|307153777|ref|YP_003889161.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306984005|gb|ADN15886.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 173

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%)

Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           ++N R A F +ADMR + F  S  + A L + +   AN  GA+L+ TL+DR+ L+ A+L 
Sbjct: 54  EKNLRGAVFAAADMRGASFENSDLSYAILTEGILLNANLKGANLTGTLLDRVTLDFADLR 113

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +A+L   + TR+    + I GADF+ AVID  Q   +C+ A+G N ITGVSTR SLGC
Sbjct: 114 DAILTDAIATRTRFYDSDITGADFTGAVIDTYQISLMCERADGVNSITGVSTRDSLGC 171


>gi|428770661|ref|YP_007162451.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428684940|gb|AFZ54407.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 165

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/112 (47%), Positives = 74/112 (66%), Gaps = 10/112 (8%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF+++D+R     G+ FN A LE+A     NF GADL++  +    LN A+LT+A+L  
Sbjct: 63  ANFSNSDLR-----GAVFNAARLEEA-----NFHGADLTNGFIYVTSLNRADLTDAILRE 112

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+ L GA ++GADF+ AV+D  Q   LCK A G NP+TG STR+SLGC
Sbjct: 113 AIMKRTTLKGANVDGADFTFAVLDNEQVIELCKNAQGINPVTGASTRQSLGC 164


>gi|282900610|ref|ZP_06308552.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281194410|gb|EFA69365.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 167

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 11/142 (7%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           E   E  IG  A F   DLR +         +FT A++R+SDFSGS   G     A    
Sbjct: 36  EYNKEILIG--ADFSQRDLRDS---------SFTKANLRQSDFSGSNLTGVSFFAANLES 84

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           ANFTGADL++  +D      ANLTNA+L  +    +   GAII GADF+D ++   ++  
Sbjct: 85  ANFTGADLTNATLDSARFIGANLTNAILEGSFAASAKFDGAIIAGADFTDVLLRRDEQNK 144

Query: 220 LCKYANGTNPITGVSTRKSLGC 241
           LC+ ANG NP TG  TR++L C
Sbjct: 145 LCQVANGINPTTGRHTRETLFC 166


>gi|428777417|ref|YP_007169204.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
 gi|428691696|gb|AFZ44990.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
          Length = 165

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 5/116 (4%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E +  N  +AD  +++  G+ FNGA L     + AN+ G + S+ +         +LTNA
Sbjct: 54  EFYDENLEAADFHDANLEGAVFNGATL-----HNANWRGVNFSNGIAYLTDFTGVDLTNA 108

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           VL   ++ RS   GAI+EGADF++AV+D  Q + LC+ A+G NP TGVSTR+SLGC
Sbjct: 109 VLTEAMMLRSKFEGAIVEGADFTNAVVDRLQVKKLCERASGVNPTTGVSTRESLGC 164


>gi|148240085|ref|YP_001225472.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
 gi|147848624|emb|CAK24175.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
          Length = 174

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 67/111 (60%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  + +DFSG+   GA   +     ANF GADLSD LMDR      +L +AVL+  
Sbjct: 64  SFAGAAGKGADFSGANLQGAIFTQGAFADANFHGADLSDALMDRADFTGTDLRDAVLIGV 123

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + + S   GA ++GADFSDA++D   ++ LC+ A G NP TGV TR SL C
Sbjct: 124 IASGSSFAGAQVDGADFSDALLDRDDQRRLCQEAEGVNPTTGVLTRDSLSC 174


>gi|119509637|ref|ZP_01628783.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
 gi|119465656|gb|EAW46547.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
          Length = 221

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 49/110 (44%), Positives = 66/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+MR ++F G+    A L K V   AN   A+L   L+DR+ ++ ANL NA+     
Sbjct: 111 FVAAEMRGANFQGANLKNAILTKGVLLNANLENANLEGALVDRVTMDGANLKNAIFTEAT 170

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +TRS    A I GADF+DA+ID  Q   +C  A G N +TGV+TR SLGC
Sbjct: 171 MTRSRFFDADITGADFTDALIDRYQVALMCDRAAGINSVTGVATRDSLGC 220


>gi|334116781|ref|ZP_08490873.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333461601|gb|EGK90206.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 165

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 82/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     +  RA  F++A+M  ++FS +   GA +  +V  +AN  GA+L++ +
Sbjct: 35  FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLQGAIMSASVMTQANLHGANLTNAM 94

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++    A+L++A+L  T+L RS   G  I GADF+DA++D +Q + LC  A+G N  T
Sbjct: 95  IDQVKFTNADLSDAILAETILLRSTFEGVDITGADFTDAIMDGSQIKELCTKASGINSQT 154

Query: 232 GVSTRKSLGC 241
           G+ TR SLGC
Sbjct: 155 GIYTRDSLGC 164


>gi|260435516|ref|ZP_05789486.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
 gi|260413390|gb|EEX06686.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
          Length = 163

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/147 (42%), Positives = 88/147 (59%), Gaps = 18/147 (12%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           E RG+F    A Q  SAD+   + +KE     F  AD+RE + SG+   GA +  +    
Sbjct: 28  ELRGQF----AVQEISADMH-GLDLKEK---EFLKADLREVNLSGTDLRGAVINTSQLQG 79

Query: 160 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A+   ADLSD +      +   L  AN TNA+++++  T      A I+GADF++AVIDL
Sbjct: 80  ADLRDADLSDVVGFASHFEGADLRGANFTNAMMMQSRFT-----DAQIDGADFTNAVIDL 134

Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
            Q++ALC  A+G+NPI+GVSTR+SLGC
Sbjct: 135 PQQRALCVRADGSNPISGVSTRESLGC 161


>gi|354568879|ref|ZP_08988040.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353539391|gb|EHC08878.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 172

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A+M+ ++F G+  +G  L K    +A+ + A+L++   DR++ N+ANLTNA+    +
Sbjct: 62  FAGAEMQGANFQGANLSGTILTKGSFLQADLSNANLAEAFADRVIFNKANLTNAIFRDAM 121

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L  S    A I GADFS A++D  Q + +C  A+G NP+TGVSTR+SLGC
Sbjct: 122 LASSRFFEAEITGADFSGAIVDPYQVKLMCDRADGINPVTGVSTRESLGC 171


>gi|86605651|ref|YP_474414.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86554193|gb|ABC99151.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 165

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +ADL+      +++R ++F SA+++ +D  G+   G    KA    AN  GADLS++L
Sbjct: 35  FSNADLQGQDLSGQDWRGSSFVSANLQGADLQGANLAGVAFTKANLAGANLAGADLSNSL 94

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D   L  A+L  A L   +  R+   GA I GADFSDA +D A  + LC+ A G++PIT
Sbjct: 95  LDLANLAGADLRGANLRGAIAARAVWDGAQIAGADFSDAYVDRAALRQLCQRAEGSHPIT 154

Query: 232 GVSTRKSLGC 241
           GVSTR SLGC
Sbjct: 155 GVSTRASLGC 164


>gi|87303664|ref|ZP_01086439.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
 gi|87281769|gb|EAQ73734.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
          Length = 153

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/146 (41%), Positives = 80/146 (54%), Gaps = 9/146 (6%)

Query: 105 FGIGSAA---------QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKA 155
            G+GSAA         Q    DL+  +H ++  +  F  A M   D SGS   GA    +
Sbjct: 7   MGVGSAAAITAPELRGQRALQDLQPDMHGRDLRQQEFLKASMGGFDLSGSDLRGAVFNSS 66

Query: 156 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                N + A+L D +      + A+L+ AVL   +L +S   GA IEGADFSDAV+DL+
Sbjct: 67  DLTNTNLSAANLEDAVAFATRFDGADLSGAVLRNAMLMQSRFTGAQIEGADFSDAVLDLS 126

Query: 216 QKQALCKYANGTNPITGVSTRKSLGC 241
           Q +ALC  A+G NP TGVST +SLGC
Sbjct: 127 QVKALCSRADGVNPSTGVSTVESLGC 152


>gi|427735661|ref|YP_007055205.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427370702|gb|AFY54658.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 168

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 68/112 (60%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            NF SA+MR ++F G+    A   K     AN  GA+ ++ L+D++ L+ ANL NA   +
Sbjct: 56  VNFISAEMRGTNFQGADLTNAMFTKGNLLGANLEGANFTNALVDQVTLDNANLKNANFTQ 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             ++RS    A I GADF+DA+ID  Q + +C  A+G NP TGV TR SLGC
Sbjct: 116 ATMSRSRFFDADITGADFTDAIIDRYQVKLMCDRASGVNPETGVETRYSLGC 167


>gi|282895655|ref|ZP_06303780.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
 gi|281199349|gb|EFA74214.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
          Length = 171

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 89/174 (51%), Gaps = 15/174 (8%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
           + V ++ +L   +  +C   +++ A   +Y  E      I   A F   DLR +      
Sbjct: 12  FLVILNLSLLVIIPLTCLVGLTSTALALEYNKE------ILIGADFSQRDLRDS------ 59

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
              +FT A++R+SDFSGS   G     A    ANFTGADL++  +D      ANLTNA+L
Sbjct: 60  ---SFTKANLRQSDFSGSNLTGVSFFAANLESANFTGADLTNATLDSARFIGANLTNAIL 116

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                  +   GAII GADF+D ++   ++  LC+ A G NP TG  TRK+L C
Sbjct: 117 EGAFAASAKFDGAIITGADFTDVLLRRDEQNKLCQLAKGINPTTGRHTRKTLFC 170


>gi|220905675|ref|YP_002480986.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219862286|gb|ACL42625.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 162

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/110 (44%), Positives = 68/110 (61%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A M E++F G+    A L KA   +ANF GA+L+D L D +    ++L+NA+L    
Sbjct: 52  FAAAVMPEANFEGANLRNAILSKAELSQANFRGANLTDVLADGVSWANSDLSNAILAGAT 111

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L  +   G  I GADFSDA+ID      LC+ A G NP+TG++TR+SLGC
Sbjct: 112 LIGTTFTGVTITGADFSDALIDRYDVSLLCQRAEGINPVTGIATRESLGC 161


>gi|411116478|ref|ZP_11388965.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410712581|gb|EKQ70082.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 165

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/162 (41%), Positives = 89/162 (54%), Gaps = 21/162 (12%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRE 139
           V A+ S+ I A  D+     +  G+  + S  +FG +DL+ A         NF  AD+R 
Sbjct: 24  VYAASSAAIRAYDDVEATTKDYSGQNLVRS--EFGDSDLQGA---------NFAGADLR- 71

Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
               G+ FNGA L  A     N  G D SD +        A+L++A+L   +L +S   G
Sbjct: 72  ----GAVFNGAKLTNA-----NLHGVDFSDGIAYITDFANADLSDAILNSAMLLKSSFKG 122

Query: 200 AIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           A I GADFSDA ID AQ  ALC+ A+GTNP+TGV TR+SLGC
Sbjct: 123 ANITGADFSDAAIDRAQVLALCQTASGTNPVTGVDTRESLGC 164


>gi|427708609|ref|YP_007050986.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427361114|gb|AFY43836.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 189

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 74/112 (66%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A+M  ++F+G+   GA L  +V  KAN   ADL++ ++D++ L  A+L++AV   
Sbjct: 77  AEFSNANMEMANFTGADLRGAVLSASVMTKANLHQADLTNAMVDQVNLTGADLSDAVFKE 136

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L R+      I+GADF+DAV+D AQ + LC  A+G N  TGV TR+SLGC
Sbjct: 137 ALLLRALFTDVNIQGADFTDAVLDKAQIKELCSKASGVNSKTGVETRESLGC 188


>gi|116072323|ref|ZP_01469590.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
 gi|116064845|gb|EAU70604.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
          Length = 186

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 71/111 (63%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  R +DF  +  +GA L +    +A+F GADLSD LMDR     ++L +AVL+  
Sbjct: 76  SFAGATGRGADFRDAILHGAILTQGAFAEADFRGADLSDALMDRADFVASDLRDAVLIGV 135

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + + S    A+IEGADF+DA++D   ++ LC+ A+G NP TGVST  SLGC
Sbjct: 136 IASGSSFSKALIEGADFTDALLDRDDQRRLCRDADGINPTTGVSTFDSLGC 186


>gi|427729477|ref|YP_007075714.1| putative low-complexity protein [Nostoc sp. PCC 7524]
 gi|427365396|gb|AFY48117.1| putative low-complexity protein [Nostoc sp. PCC 7524]
          Length = 170

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/130 (39%), Positives = 83/130 (63%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     ++ +A  F++A++  +DF+G+   GA L  +V  +AN   ADL++ +
Sbjct: 41  FSNAELARHDFAGDSLQAAEFSNANLEMTDFTGADLRGAVLSASVMTQANLHKADLTNAM 100

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L  A+L++AV    +L R+      IEGADF+DA++D AQ + LC  A+G N  T
Sbjct: 101 VDQVNLTGADLSDAVFKEALLLRAIFNDVNIEGADFTDALLDKAQIKELCTKASGVNSQT 160

Query: 232 GVSTRKSLGC 241
           GV+TR SLGC
Sbjct: 161 GVATRDSLGC 170


>gi|428222027|ref|YP_007106197.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427995367|gb|AFY74062.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 161

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 48/110 (43%), Positives = 71/110 (64%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+M  ++F  +   GA    ++   AN   A+ +  ++D++    A+LT+A+LV T+
Sbjct: 51  FANANMENANFERADLRGAVFSASILRNANLRAANFTTGMLDQIDFANADLTDAILVDTL 110

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L RS    A I+GADF+DA++D AQ + LC  A GTNP TGVSTR+SLGC
Sbjct: 111 LLRSTFDFAKIDGADFTDALLDGAQIKWLCSKAKGTNPFTGVSTRESLGC 160


>gi|218439896|ref|YP_002378225.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172624|gb|ACK71357.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 170

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 49/117 (41%), Positives = 74/117 (63%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           K  + A F +A+MR +    S  + + L +AV   AN  GA+L+ +L+DR+ L+ A+LTN
Sbjct: 52  KNLYGAVFAAANMRGASLENSDLSYSILTEAVLLNANLKGANLTGSLVDRVTLDFADLTN 111

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           A+    + +R+      I GADFS A++D  Q   +C+ A+G NP+TGVSTR+SLGC
Sbjct: 112 AIFTDAIASRTRFYDTTITGADFSGAILDQYQVYLMCERASGVNPVTGVSTRESLGC 168


>gi|427420479|ref|ZP_18910662.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425756356|gb|EKU97210.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 169

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 71/108 (65%)

Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
           +A++R ++F G+  +   L KA   + + TGA+LS+T  DR+    ++LTNAV+   ++T
Sbjct: 60  AAEVRNANFRGADLSATILTKAKFIRTDLTGANLSETFADRVEFTGSDLTNAVVTDALMT 119

Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            S    A I GADFS  ++D  Q + LC+ A+G NP+TGVSTR+SLGC
Sbjct: 120 SSTFADATITGADFSYTILDRFQVKYLCERADGMNPVTGVSTRESLGC 167


>gi|78212794|ref|YP_381573.1| hypothetical protein Syncc9605_1263 [Synechococcus sp. CC9605]
 gi|78197253|gb|ABB35018.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 169

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 88/147 (59%), Gaps = 18/147 (12%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           E RG+F    A Q  SAD+   + +KE     F  AD+RE + SG+   GA +  +    
Sbjct: 34  ELRGQF----AVQEISADM-HGLDLKEK---EFLKADLREVNLSGTDLRGAVINTSQLQG 85

Query: 160 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A+   A+LSD +      +   L  AN TNA+++++  T      A I+GADF++AVIDL
Sbjct: 86  ADLRDANLSDVVGFASHFEGADLRGANFTNAMMMQSRFT-----DAQIDGADFTNAVIDL 140

Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
            Q++ALC  A+G+NPI+GVSTR+SLGC
Sbjct: 141 PQQRALCARADGSNPISGVSTRESLGC 167


>gi|78185103|ref|YP_377538.1| hypothetical protein Syncc9902_1536 [Synechococcus sp. CC9902]
 gi|78169397|gb|ABB26494.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 182

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/111 (44%), Positives = 70/111 (63%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  R +DF  +  +GA L +    +A+F GADLSD LMDR      +L +AVL+  
Sbjct: 72  SFAGATGRGADFRDANLHGAILTQGAFAEADFRGADLSDALMDRADFVATDLRDAVLIGV 131

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + + S    A+IEGADF+DA++D   ++ LC+ A+G NP TG+ST  SLGC
Sbjct: 132 IASGSSFSKALIEGADFTDALLDRDDQRLLCRDADGINPTTGISTFDSLGC 182


>gi|428227020|ref|YP_007111117.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427986921|gb|AFY68065.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 166

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 50/114 (43%), Positives = 71/114 (62%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            +A F++A+++ +DFSG+   GA    +    AN  G D SD +      ++ANL++AVL
Sbjct: 52  LQAEFSNANLKNADFSGADLRGAVFNGSTLVHANLRGVDFSDGIAYISDFSDANLSDAVL 111

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +L +S   GA + GADF+DAV+D AQ   LCK A+G N ITG  TR+SLGC
Sbjct: 112 SSAMLLKSRFTGADVTGADFTDAVLDRAQVLQLCKTASGVNSITGADTRESLGC 165


>gi|16332305|ref|NP_443033.1| hypothetical protein sll0577 [Synechocystis sp. PCC 6803]
 gi|383324046|ref|YP_005384900.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383327215|ref|YP_005388069.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383493099|ref|YP_005410776.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384438367|ref|YP_005653092.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
 gi|451816456|ref|YP_007452908.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
 gi|1653935|dbj|BAA18845.1| sll0577 [Synechocystis sp. PCC 6803]
 gi|339275400|dbj|BAK51887.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
 gi|359273366|dbj|BAL30885.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359276536|dbj|BAL34054.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359279706|dbj|BAL37223.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407960039|dbj|BAM53279.1| hypothetical protein BEST7613_4348 [Synechocystis sp. PCC 6803]
 gi|451782425|gb|AGF53394.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
          Length = 169

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 76/139 (54%), Gaps = 16/139 (11%)

Query: 114 GSADLRKAVHVKENFR------ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 162
           G++     V  + +FR      A FT+ D+  S     D  GS FNGA L  A     N 
Sbjct: 35  GASAFENMVLAETDFRDQDLLTAQFTNVDLTSSIFEAMDLRGSVFNGANLTDA-----NL 89

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
            G DL++ L      N ANL NA+L   ++ R+    A I+GADFS AV+D  Q  ALCK
Sbjct: 90  KGVDLTNGLTYLTSFNGANLENAILAEAIMLRTSFKNAKIQGADFSLAVLDTEQIAALCK 149

Query: 223 YANGTNPITGVSTRKSLGC 241
            A+G NP TG+STR+SLGC
Sbjct: 150 VADGVNPKTGISTRESLGC 168


>gi|282902031|ref|ZP_06309929.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281193118|gb|EFA68117.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 162

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 81/130 (62%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     +N +A  F++A++  ++F+ +   GA    +V  +AN  GADL++ +
Sbjct: 32  FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L +A+L++A+ +  +L RS+     I+GADFS A++D  Q + LCK A G N  T
Sbjct: 92  LDQVKLTDADLSDAIFIEAILLRSNFAKTNIDGADFSKAILDRGQIRDLCKSARGINSRT 151

Query: 232 GVSTRKSLGC 241
            V TR SLGC
Sbjct: 152 HVQTRDSLGC 161


>gi|317969830|ref|ZP_07971220.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
          Length = 178

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 77/135 (57%), Gaps = 10/135 (7%)

Query: 112 QFGSADLRKAVH-----VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           Q  + DL+  +H      +E  +A+    D+ E+D  G+ FN A L+ A     N + AD
Sbjct: 48  QRSAQDLQPDMHGRNLQQQEFLKASMEGFDLSETDLRGAVFNTANLQNA-----NLSAAD 102

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           L D +      + A+L+ AV    +L  S   GA+IEG DF+DAV+DL Q++ALC  A+G
Sbjct: 103 LEDAVAFATRFDNADLSGAVFRNAMLMNSKFTGAVIEGTDFTDAVLDLPQQKALCARASG 162

Query: 227 TNPITGVSTRKSLGC 241
            NP TGV TR+SL C
Sbjct: 163 VNPRTGVDTRESLAC 177


>gi|87125517|ref|ZP_01081362.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
 gi|86166817|gb|EAQ68079.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
          Length = 180

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 71/119 (59%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H ++    +F  A  R +DFS +  +GA   +     A+F GADLSD LMDR   +  +L
Sbjct: 62  HGQDLRNTSFAGAVGRGADFSDANLHGAIFTQGAFANADFHGADLSDALMDRADFSGTDL 121

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +L   + + S   GA IEGADFSDA++D    + LC+ A G++P TGVSTR+SLGC
Sbjct: 122 RGTLLSGVIASGSSFAGAQIEGADFSDALLDRDDVRRLCRDAEGSHPHTGVSTRESLGC 180


>gi|443312247|ref|ZP_21041866.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
 gi|442777717|gb|ELR87991.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
          Length = 162

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 52/130 (40%), Positives = 80/130 (61%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L+      +  RA  F++A+M  ++FS +   GA    +V   A+  GADLS+ +
Sbjct: 32  FSNAELKSRDFSGQTLRAAEFSNANMELANFSNADLRGAVFSASVMTGASLHGADLSNAM 91

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L +A+L++AVL   +L R+      I  ADF+DA++D AQ + LC  A+G NP T
Sbjct: 92  VDQVNLTKADLSDAVLTEALLLRAIFDDVSIVNADFTDAILDRAQIKELCAKASGVNPKT 151

Query: 232 GVSTRKSLGC 241
           GV TR SLGC
Sbjct: 152 GVETRYSLGC 161


>gi|428302010|ref|YP_007140316.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238554|gb|AFZ04344.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 162

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 72/112 (64%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A+M  +DF G+   GA +  +   KAN  GA+L++ L+D++ L  A+L++AVL  
Sbjct: 50  AEFSNANMELADFRGADLRGAVMSASTMTKANLHGANLANALVDQVNLTGADLSDAVLQE 109

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L R+      I GADF+DA++D AQ + LC  A+G N  TGV TR SLGC
Sbjct: 110 ALLLRAIFTDVKINGADFTDAILDGAQIRELCNIASGVNSQTGVETRYSLGC 161


>gi|412992118|emb|CCO19831.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
          Length = 293

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 55/143 (38%), Positives = 80/143 (55%), Gaps = 9/143 (6%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S   F + DLR  + V+         A++R ++FS S   GA + +++   A+  G+D+S
Sbjct: 145 SNEDFSNLDLRGTIWVE---------AELRNTNFSKSDMRGAVMTRSIMPNADVHGSDVS 195

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
           + L D ++L  AN  +AV V     RSD+G   I+ ADF++AVID  Q   LC+ A G N
Sbjct: 196 NVLFDYVLLRGANFEDAVAVGANFIRSDMGEMKIKNADFTEAVIDRYQVLGLCETAEGVN 255

Query: 229 PITGVSTRKSLGCGNSRRNAYGS 251
           P TGV TR SLGC +  +   GS
Sbjct: 256 PYTGVDTRMSLGCDSFVKKYEGS 278


>gi|254415547|ref|ZP_05029307.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196177728|gb|EDX72732.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 165

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 52/112 (46%), Positives = 68/112 (60%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F  AD+RES+FS ++  G     A    ANF GA+LS + +D+  LN ANL NAVL  
Sbjct: 53  ASFNQADLRESNFSHAELQGVSFFGANLKLANFEGANLSYSTLDKARLNGANLKNAVLEG 112

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +   GA IEGADF+DA +D   ++ LC+ A GTNP TG  TR +L C
Sbjct: 113 AYAFNAQFDGATIEGADFTDAFLDPKAEEKLCQMATGTNPTTGRQTRDTLFC 164


>gi|88808683|ref|ZP_01124193.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
 gi|88787671|gb|EAR18828.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
          Length = 176

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 92/179 (51%), Gaps = 8/179 (4%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           A L N R  ++TAL AA+V         L D    EA T  E     A Q    D+   +
Sbjct: 5   ALLCNLRRHLTTALLAALVVFTG----VLIDGPSVEAITAPELRGQRAVQ----DITSDM 56

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H ++     F  AD+RE D   +   GA +  +    A+  GADL D +      + A+L
Sbjct: 57  HGRDLKEKEFLKADLREVDLGEADLRGAVINTSQLQGADLRGADLEDVVAFSSRFDGADL 116

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            NA     +L +S    A IEG DF++AVIDL+Q +ALC  A+G N ++GVST++SLGC
Sbjct: 117 RNANFTNAMLMQSRFNDAEIEGTDFTNAVIDLSQLKALCGRASGVNSLSGVSTKESLGC 175


>gi|318041364|ref|ZP_07973320.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
          Length = 170

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 52/122 (42%), Positives = 73/122 (59%), Gaps = 5/122 (4%)

Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           + +  +E  +AN    D  ESD  G+ FN A L+ A     N   ADL D +      + 
Sbjct: 53  RNLQQQEFLKANLEGFDFSESDLRGAVFNTANLQGA-----NLHAADLEDAVAFASRFDN 107

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           A+L++AVL   +L  S   G++I+GADF+DAV+DL Q++ALC+ A GTN  TGV+TR SL
Sbjct: 108 ADLSDAVLRNAMLMNSKFAGSVIDGADFTDAVLDLPQQKALCERAGGTNARTGVNTRDSL 167

Query: 240 GC 241
            C
Sbjct: 168 NC 169


>gi|443321745|ref|ZP_21050787.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
 gi|442788515|gb|ELR98206.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
          Length = 149

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 71/112 (63%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F  A +RES+FS +   G     A     NF GA+L++  +D   LN+ANL NA+L+ 
Sbjct: 37  ASFDLASLRESNFSHANLTGVRFFSANLESVNFEGANLTNATLDSARLNDANLKNAILIG 96

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             ++ + + G  IEGADF+DA+I   +++ LCK A GTNP+TG  TR++L C
Sbjct: 97  AFVSNAKVQGVNIEGADFTDALILPYEQKLLCKVAQGTNPVTGRDTRETLFC 148


>gi|88809155|ref|ZP_01124664.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
 gi|88787097|gb|EAR18255.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
          Length = 180

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 49/111 (44%), Positives = 66/111 (59%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  + +DFSG+   GA   +     ANF GADLSD LMDR      +L +AVL+  
Sbjct: 70  SFAGAAAKGADFSGANLQGAIFTQGAFADANFRGADLSDALMDRADFTGTDLRDAVLIGV 129

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + + S    A ++GADFSDA++D   ++ LC+ A G NP TGV TR SL C
Sbjct: 130 IASGSSFARAQVDGADFSDALLDRDDQRKLCQEAEGLNPTTGVLTRDSLSC 180


>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 182

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 72/112 (64%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ++F  A  R++ F  +  +GA L +A   +A+F GADLSD LMD++ ++  +LT AVL  
Sbjct: 70  SSFAGATGRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRG 129

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + + S+  GA +  ADF+DA++D   ++ LC+ A GTNP+TG  TR SL C
Sbjct: 130 AIASGSNFTGATVTDADFTDALLDRVDQRNLCREARGTNPVTGADTRLSLDC 181


>gi|298492040|ref|YP_003722217.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298233958|gb|ADI65094.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 167

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 75/131 (57%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F   DLR +         +FT A++R+S+F+G+   G     A    AN  GADL++ 
Sbjct: 45  ADFSRRDLRDS---------SFTKANLRQSNFTGANLRGVSFFAANLESANLEGADLTNA 95

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D   L  ANLTNAVL       +   GAI++GADF+DA++   +++ LC  A GTNPI
Sbjct: 96  TLDSARLIRANLTNAVLEGAFAASAKFDGAIVDGADFTDALLRQDEQKKLCNLAKGTNPI 155

Query: 231 TGVSTRKSLGC 241
           TG  TR++L C
Sbjct: 156 TGRDTRETLFC 166


>gi|411119374|ref|ZP_11391754.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410711237|gb|EKQ68744.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 182

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 73/112 (65%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A++   +F+ +   GA +  +     +  GADL+  ++D++ +   +L++A+L  
Sbjct: 71  AEFSNANLNRVNFTNADLRGAVMSASTMVDTSLHGADLTQAMLDQVKMIRTDLSDAILAN 130

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           T+L R+      +EGADF+DA++D AQ +ALC++A+G N  TGVSTR SLGC
Sbjct: 131 TILLRTTFENINLEGADFTDAILDGAQVKALCQFASGANSKTGVSTRDSLGC 182


>gi|443478408|ref|ZP_21068166.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443016315|gb|ELS31005.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 150

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 70/110 (63%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +A+M  ++F  +   GA    ++  KAN  G D S  L+D+    +A+L+NA+LV T+
Sbjct: 40  FANANMEGANFENADVRGAVFSASILRKANLKGTDFSGGLLDQADFAKADLSNALLVETI 99

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L RS      I+GADF+DA++D AQ++ LC  A GTN  TG++TR+SL C
Sbjct: 100 LLRSTFDFVNIDGADFTDAIMDGAQRKWLCSKAKGTNAKTGINTRESLEC 149


>gi|260435480|ref|ZP_05789450.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
 gi|260413354|gb|EEX06650.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
          Length = 173

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 67/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A  R ++F G+  +GA L +    +A+F GADLSD LMDR      +L NAVL   +
Sbjct: 64  FAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVATDLRNAVLTGII 123

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + S    A IEGADF+DA++D   ++ LC  A+G NP TGVST  SLGC
Sbjct: 124 ASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVSTFDSLGC 173


>gi|428772631|ref|YP_007164419.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
           7202]
 gi|428686910|gb|AFZ46770.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
          Length = 166

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 73/130 (56%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F S  L+      +N + A FT  ++ ++ F+ S   GA      A  ANF+G D+SD L
Sbjct: 36  FESKSLKGEDFTNQNLQLAEFTKVNLEDAKFNDSDLRGAVFNGVNAEGANFSGVDMSDGL 95

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +     N  +L+NA+    ++ R+    A +EGADF+ AV+D  Q   LCK A+G NP+T
Sbjct: 96  VYVTSFNNTDLSNAIFRDAIMLRTTFKNANVEGADFTFAVLDSEQVNQLCKNASGVNPVT 155

Query: 232 GVSTRKSLGC 241
             STR+SLGC
Sbjct: 156 NASTRQSLGC 165


>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
 gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
          Length = 180

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/111 (43%), Positives = 70/111 (63%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  A  R +DFSG+  +GA L +A   +A+F GADLS  LMD++  + A+ T A L   
Sbjct: 70  SFAGAAGRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTGADLSDV 129

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + + S+  GA +  ADF+ A+ID   ++ LC+ A GT+P+TG  TR SLGC
Sbjct: 130 IASGSNFSGATVTNADFTGALIDRVDQRLLCRDAEGTHPLTGADTRLSLGC 180


>gi|33865660|ref|NP_897219.1| hypothetical protein SYNW1126 [Synechococcus sp. WH 8102]
 gi|33632830|emb|CAE07641.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 190

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/163 (36%), Positives = 89/163 (54%), Gaps = 4/163 (2%)

Query: 79  AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR 138
           ++VA+    +S L   N  +A T  E     A Q  +AD+   + +KE     F  AD+R
Sbjct: 30  SLVAAILVVVSTLLWTNSAQAITAPELRGQRAVQEITADM-HGLDLKEK---EFLKADLR 85

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           E + S +   GA +  +    A+  GADLS+ +      + A+L  A     +L +S   
Sbjct: 86  EVNLSDTDLRGAVINTSQLQGADLRGADLSNVVGFASRFDGADLRGATFTNAMLMQSRFA 145

Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            A IEGADF+DAV+DL Q++ LC  A G +P++GVSTR+SLGC
Sbjct: 146 DARIEGADFTDAVLDLPQQKLLCATAAGEHPVSGVSTRESLGC 188


>gi|308814214|ref|XP_003084412.1| COG1357: Uncharacterized low-complexity proteins (ISS)
           [Ostreococcus tauri]
 gi|116056297|emb|CAL56680.1| COG1357: Uncharacterized low-complexity proteins (ISS)
           [Ostreococcus tauri]
          Length = 186

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 69/118 (58%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           +  +D+R ++ S +   GA   +A+         D S+ + D  VL  A++ + V     
Sbjct: 46  YAESDLRNANISNTDARGAVFSRAIMPGVKLNATDASNAMFDYAVLRGADMRDGVFANAN 105

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAY 249
             R+D+G A+IEGADFS+AVID  +   LC+ A+GTNP TG+ TR +LGC +SR + Y
Sbjct: 106 FVRADMGEAMIEGADFSEAVIDRYEAIRLCERASGTNPWTGIETRATLGCDDSRVSKY 163


>gi|87124337|ref|ZP_01080186.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
 gi|86167909|gb|EAQ69167.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
          Length = 178

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/120 (44%), Positives = 68/120 (56%)

Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
           +H  +     F  AD++  D SGS   GA +  +    A+  GADLSD +      + A+
Sbjct: 58  MHGMDLKEKEFLKADLQGVDLSGSDLRGAVINTSSLQGADLQGADLSDVVAFASRFDGAD 117

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L NAV    +L +S  G A I+GADF+DAVIDL Q +ALC  A G N  TGV TR SLGC
Sbjct: 118 LRNAVFTNAMLMQSRFGDAQIDGADFTDAVIDLPQLKALCARAAGENSRTGVLTRDSLGC 177


>gi|123968679|ref|YP_001009537.1| hypothetical protein A9601_11461 [Prochlorococcus marinus str.
           AS9601]
 gi|126696485|ref|YP_001091371.1| hypothetical protein P9301_11471 [Prochlorococcus marinus str. MIT
           9301]
 gi|123198789|gb|ABM70430.1| conserved hypothetical protein [Prochlorococcus marinus str.
           AS9601]
 gi|126543528|gb|ABO17770.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9301]
          Length = 172

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 66/104 (63%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L NAVL+  + + S  
Sbjct: 68  RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRNAVLINMIASGSSF 127

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            GA IEGADFS A++D   ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171


>gi|352096257|ref|ZP_08957137.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
 gi|351676951|gb|EHA60102.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
          Length = 177

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 49/119 (41%), Positives = 71/119 (59%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H +     +F  A  R +DFS +   G    +A   +ANF GA+LSD LMDR   ++ +L
Sbjct: 59  HGQNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDL 118

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +A+L   +   S   GA IEGADF+DA++D   ++ LC+ A+G NP +GV+TR SL C
Sbjct: 119 RDALLQGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNPSSGVATRDSLDC 177


>gi|78779436|ref|YP_397548.1| hypothetical protein PMT9312_1053 [Prochlorococcus marinus str. MIT
           9312]
 gi|78712935|gb|ABB50112.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9312]
          Length = 172

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 66/104 (63%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L NAVL+  + + S  
Sbjct: 68  RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRNAVLINMIASGSSF 127

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            GA IEGADFS A++D   ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 128 AGAKIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171


>gi|172036187|ref|YP_001802688.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354552985|ref|ZP_08972292.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171697641|gb|ACB50622.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353554815|gb|EHC24204.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 179

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 50/112 (44%), Positives = 68/112 (60%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+AD+ +S+FS +   GA    +    A+  GADL++ L        A+LTNAVL  
Sbjct: 67  AQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTE 126

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+    A I GADFS AV+D+ +   LC  A+G NP TGVSTR+SLGC
Sbjct: 127 AIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADGVNPKTGVSTRESLGC 178


>gi|303287274|ref|XP_003062926.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455562|gb|EEH52865.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 182

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 54/142 (38%), Positives = 73/142 (51%), Gaps = 15/142 (10%)

Query: 120 KAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           KA HV E+F       A +T  D+R SDFSGS    A   +A+    N  G+D+ +  +D
Sbjct: 17  KAEHVNEDFSHSDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAIMPGVNLEGSDMQNAFLD 76

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA---------LCKYA 224
            +VL  AN+   +       RSDLG   +  ADF++AVID  Q ++         LC  A
Sbjct: 77  YVVLRGANMRGVIASGANFVRSDLGDVDVTNADFTEAVIDRYQARSISHWSPYDPLCDGA 136

Query: 225 NGTNPITGVSTRKSLGCGNSRR 246
           +G N  TGV TR SLGC   +R
Sbjct: 137 SGVNEFTGVDTRDSLGCDRLKR 158


>gi|72382551|ref|YP_291906.1| hypothetical protein PMN2A_0712 [Prochlorococcus marinus str.
           NATL2A]
 gi|72002401|gb|AAZ58203.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 184

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L N++LV  + + S  
Sbjct: 71  RDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRNSILVNMIASGSSF 130

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
            GA IEGADF+ A++D   ++ LCK A+G NP TGVSTR SL C   +      PS P
Sbjct: 131 AGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGDK------PSMP 182


>gi|282897737|ref|ZP_06305736.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
 gi|281197416|gb|EFA72313.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
          Length = 162

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 80/130 (61%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L +     +N +A  F++A++  ++F+ +   GA    +V  +AN  GADL++ +
Sbjct: 32  FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           +D++ L  A+L++A+ +  +L RS    A I+GADF++A++D  Q   LCK A G N  T
Sbjct: 92  LDQVKLTGADLSDAIFLEAILLRSIFTEANIDGADFTEAILDRGQVGELCKSARGVNSQT 151

Query: 232 GVSTRKSLGC 241
            V TR SLGC
Sbjct: 152 HVQTRDSLGC 161


>gi|113953693|ref|YP_729958.1| hypothetical protein sync_0742 [Synechococcus sp. CC9311]
 gi|113881044|gb|ABI46002.1| conserved hypothetical protein [Synechococcus sp. CC9311]
          Length = 190

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/119 (42%), Positives = 71/119 (59%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H +     +F  A  R +DFS +   G    +A   +ANF GA+LSD LMDR   ++ +L
Sbjct: 72  HGQNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDL 131

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +A+LV  +   S   GA IEGADF+DA++D   ++ LC+ A+G N  +GVSTR SL C
Sbjct: 132 RDALLVGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNSSSGVSTRDSLDC 190


>gi|428211433|ref|YP_007084577.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|427999814|gb|AFY80657.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 166

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 75/112 (66%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++AD++ ++FS  +  GA    ++  +ANF GADL++++++   L  A+ T+AVLV 
Sbjct: 54  AEFSNADLQFTNFSNVQAEGAIFSLSMMKEANFHGADLTNSMLEWTNLTNADFTDAVLVE 113

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +   +++    + GADF+DA++D AQ + LC+ A+G N  TGV TR+SLGC
Sbjct: 114 ALFLGANVKKMKVTGADFTDAILDGAQVKQLCENASGVNSKTGVDTRESLGC 165


>gi|33240611|ref|NP_875553.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
 gi|33238139|gb|AAQ00206.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
          Length = 183

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 2/134 (1%)

Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
           +H ++  +++   A  R+S+ S    +G  +  A    +N  G +L+DTL DR+   + +
Sbjct: 50  LHGQDLSKSSIAGATARDSNLSDVDLHGTVVTLADLKGSNLNGINLTDTLSDRVNFQKTD 109

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L NAVLV  + + S   GA IEGADFS AV+D   ++ LC+ A GTNP TG+STR+SL C
Sbjct: 110 LRNAVLVNMIASGSSFAGAQIEGADFSYAVLDSDDQRNLCEIAEGTNPQTGISTRESLEC 169

Query: 242 GNSRRNAYGSPSSP 255
             S R     P  P
Sbjct: 170 --SERGVGYKPPMP 181


>gi|157413511|ref|YP_001484377.1| hypothetical protein P9215_11761 [Prochlorococcus marinus str. MIT
           9215]
 gi|254526043|ref|ZP_05138095.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
 gi|157388086|gb|ABV50791.1| conserved hpothetical protein [Prochlorococcus marinus str. MIT
           9215]
 gi|221537467|gb|EEE39920.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
          Length = 172

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 65/104 (62%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L NAVL+  + + S  
Sbjct: 68  RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRNAVLINMIASGSSF 127

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            GA IEGADFS A++D   ++ LC+ A+G NP TGVSTR SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRDSLEC 171


>gi|124026254|ref|YP_001015370.1| hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
           NATL1A]
 gi|123961322|gb|ABM76105.1| Hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
           NATL1A]
          Length = 184

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L N++LV  + + S  
Sbjct: 71  RDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRNSILVNMIASGSSF 130

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
            GA IEGADF+ A++D   ++ LCK A+G NP TGVSTR SL C   +      PS P
Sbjct: 131 AGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGDK------PSIP 182


>gi|126657693|ref|ZP_01728847.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
 gi|126620910|gb|EAZ91625.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
          Length = 167

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+AD+ +S+FS +   GA    +    A+  GADL++ L        A+LT+AVL  
Sbjct: 55  AQFTNADLTDSNFSKADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTDAVLTE 114

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+    A I GADFS AV+D+ + + LC  A+G NP TG+STR+SLGC
Sbjct: 115 AIMMRTKFDDAKITGADFSLAVLDIYEVEKLCDRADGVNPKTGISTRESLGC 166


>gi|123966365|ref|YP_001011446.1| hypothetical protein P9515_11321 [Prochlorococcus marinus str. MIT
           9515]
 gi|123200731|gb|ABM72339.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9515]
          Length = 172

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 64/104 (61%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L N++L+  + + S  
Sbjct: 68  RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRNSILINMIASGSSF 127

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            GA IEGADFS A++D   ++ LCK A G NP TGVSTR SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171


>gi|78212400|ref|YP_381179.1| hypothetical protein Syncc9605_0856 [Synechococcus sp. CC9605]
 gi|78196859|gb|ABB34624.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 181

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 50/110 (45%), Positives = 67/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A  R ++F G+  +GA L +    +A+F GADLSD LMDR      +L NAVL   +
Sbjct: 72  FAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVGTDLRNAVLNGII 131

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + S    A IEGADF+DA++D   ++ LC  A+G NP TGV+T  SLGC
Sbjct: 132 ASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVATFDSLGC 181


>gi|116070665|ref|ZP_01467934.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
 gi|116066070|gb|EAU71827.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
          Length = 169

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 78/134 (58%), Gaps = 3/134 (2%)

Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           GS++  G  +    + +KE     F  A++R+ + SG+   GA +       A+   A+L
Sbjct: 37  GSSSYQGITEDMHGMDLKEK---EFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANL 93

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
           SD +      + A+L  AVL   +L +S    A IEGADF+DAVIDL Q++ALC  A+G 
Sbjct: 94  SDVVGFASRFDGADLRGAVLTNAMLMQSRFTDAQIEGADFTDAVIDLPQQRALCSSADGV 153

Query: 228 NPITGVSTRKSLGC 241
           NP +GVSTR+SLGC
Sbjct: 154 NPQSGVSTRESLGC 167


>gi|352094392|ref|ZP_08955563.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
 gi|351680732|gb|EHA63864.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
          Length = 172

 Score = 90.1 bits (222), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 76/135 (56%), Gaps = 10/135 (7%)

Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           QF   D+   +H ++     F  AD+R  D S +   GA +  +    A+  GA+L D +
Sbjct: 42  QFAVQDISNDMHGRDLKEKEFLKADLRGVDLSDTDLRGAVINTSQLQGADLHGANLEDVV 101

Query: 172 -----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
                 D   L++AN TNA+L+++         A IEG DF++AVIDL Q +ALC  A+G
Sbjct: 102 AFSSRFDETDLSDANFTNAMLMQSRFV-----DARIEGTDFTNAVIDLTQMKALCGRASG 156

Query: 227 TNPITGVSTRKSLGC 241
            N ++GVSTR+SLGC
Sbjct: 157 VNSVSGVSTRESLGC 171


>gi|78184792|ref|YP_377227.1| hypothetical protein Syncc9902_1219 [Synechococcus sp. CC9902]
 gi|78169086|gb|ABB26183.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 169

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 78/134 (58%), Gaps = 3/134 (2%)

Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           GS++  G  +    + +KE     F  A++R+ + SG+   GA +       A+   A+L
Sbjct: 37  GSSSYQGITEDMHGMDLKEK---EFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANL 93

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
           SD +      + A+L  AVL   +L +S    A IEGADF+DAVIDL Q++ALC  A+G 
Sbjct: 94  SDVVGFASRFDGADLRGAVLTNAMLMQSRFTDAQIEGADFTDAVIDLPQQRALCSSADGV 153

Query: 228 NPITGVSTRKSLGC 241
           NP +GVSTR+SLGC
Sbjct: 154 NPQSGVSTRESLGC 167


>gi|33861598|ref|NP_893159.1| hypothetical protein PMM1042 [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
 gi|33634175|emb|CAE19501.1| conserved hpothetical protein [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
          Length = 172

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 64/104 (61%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R++DFS    +G  L  +    +N  G DL+DTL DR+   + +L N++L+  + + S  
Sbjct: 68  RDADFSEVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRNSILINMIASGSSF 127

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            GA IEGADFS A++D   ++ LCK A G NP TGVSTR SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171


>gi|113953830|ref|YP_730899.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
 gi|113881181|gb|ABI46139.1| Secreted pentapeptide repeats protein [Synechococcus sp. CC9311]
          Length = 172

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 10/135 (7%)

Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           QF   D+ + +H ++     F  AD+R  D S +   GA +  +    A+  GA+L D +
Sbjct: 42  QFALQDISEDMHGRDLKEKEFLKADLRGIDLSDTDLRGAVINTSQLQGADLHGANLEDVV 101

Query: 172 -----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
                 D   L++AN TNA+L+++         A IEG DF++AVIDL Q +ALC  A+G
Sbjct: 102 AFSSRFDETDLSDANFTNAMLMQSRFV-----DARIEGTDFTNAVIDLTQLKALCGRASG 156

Query: 227 TNPITGVSTRKSLGC 241
            N ++GVSTR+SLGC
Sbjct: 157 VNSVSGVSTRESLGC 171


>gi|254430802|ref|ZP_05044505.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
 gi|197625255|gb|EDY37814.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
          Length = 173

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/130 (39%), Positives = 76/130 (58%), Gaps = 10/130 (7%)

Query: 117 DLRKAVH-----VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           DL+  +H      +E  +A+    D  E+D  G+ FNG+ L +A     + + A+L D +
Sbjct: 48  DLQPDMHGRNLRQQEFLKASLEGFDFSEADLRGAVFNGSSLREA-----DLSAANLEDVV 102

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
                 +++NL  A+L   +L +S   G+ I GADFSDAV+DL +++ALC  A G NP T
Sbjct: 103 AYATRFDDSNLEGAILRNAMLMQSRFKGSSITGADFSDAVLDLPEQKALCARATGVNPST 162

Query: 232 GVSTRKSLGC 241
           GVSTR+SL C
Sbjct: 163 GVSTRESLAC 172


>gi|427701840|ref|YP_007045062.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427345008|gb|AFY27721.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 184

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/126 (39%), Positives = 76/126 (60%), Gaps = 6/126 (4%)

Query: 117 DLR-KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DLR + +  +E  +A+    D+R++D  G+ FN   L +A     +  GADL D +    
Sbjct: 63  DLRGRNLQQQEFLKASMEGFDLRDADLRGAVFNSTDLRQA-----DLRGADLEDVVAFAT 117

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
             + A+L  A     +L +S    A I+GADFSDAV+DL +++ALC  A+G++P+TGV T
Sbjct: 118 RFDGADLRGAQFRNAMLMQSRFRDARIDGADFSDAVLDLPEQKALCARASGSHPLTGVDT 177

Query: 236 RKSLGC 241
           R+SLGC
Sbjct: 178 RESLGC 183


>gi|159903694|ref|YP_001551038.1| hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
           9211]
 gi|159888870|gb|ABX09084.1| Hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
           9211]
          Length = 183

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/120 (38%), Positives = 72/120 (60%)

Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
           +H ++  +++   A  R+++ S    +G  +  A    +N  G DL+DTL DR+   + +
Sbjct: 50  LHGQDLSKSSIAGATARDANLSDVDLHGTVVTLADLKGSNLNGIDLTDTLSDRVNFQKTD 109

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L NAVLV  + + S   GA+I GADFSD+V+D   ++ LC+ A G NP TG++TR SL C
Sbjct: 110 LRNAVLVNMIASGSSFAGALIAGADFSDSVLDRDDQRNLCEIAEGVNPKTGIATRDSLEC 169


>gi|220907989|ref|YP_002483300.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219864600|gb|ACL44939.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 171

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/113 (43%), Positives = 67/113 (59%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  F  A +  ++ SG+   GA    AV   AN  G + SD +      ++A+L NAVL 
Sbjct: 58  QVEFGDARLSGANLSGANLRGAVFNAAVLTGANLQGVNFSDGIGYLCDFSDADLENAVLD 117

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             +L +S+  GA I GADFS A++D  Q   LC+YA+G NP TGVSTR+SLGC
Sbjct: 118 SAMLLKSEFKGAKINGADFSFALLDRPQVLQLCEYASGVNPTTGVSTRESLGC 170


>gi|414079727|ref|YP_007001151.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
 gi|413973006|gb|AFW97094.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
          Length = 167

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 90/178 (50%), Gaps = 15/178 (8%)

Query: 64  KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 123
           K +NW   +S  L A +  +  ++    A   +Y  E      I  +A F   DL  +  
Sbjct: 4   KHRNWISILSLLLWAIISTTALASFVPTAVALEYNKE------ILISADFSGRDLTDS-- 55

Query: 124 VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
                  +FT A++R S+FS S   G     A    AN  GADL++T +D   L +A+LT
Sbjct: 56  -------SFTKANLRYSNFSHSNLRGVSFFAANLESANLQGADLTNTTLDSARLIKADLT 108

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA+L       +   GAII+GADF+D ++   +++ LCK A GTNP+T   TR +L C
Sbjct: 109 NAILEGAFAANARFDGAIIDGADFTDVLLRQDEQKKLCKLAKGTNPVTKRDTRDTLYC 166


>gi|17232102|ref|NP_488650.1| hypothetical protein alr4610 [Nostoc sp. PCC 7120]
 gi|17133747|dbj|BAB76309.1| alr4610 [Nostoc sp. PCC 7120]
          Length = 164

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 70/189 (37%), Positives = 101/189 (53%), Gaps = 38/189 (20%)

Query: 65  LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           +K+WRV VS  LA  +        A+ SS+I+  A       +  G+  IGS  +F + D
Sbjct: 1   MKDWRVVVSFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLM 172
           L       EN  ANF++AD+R   F+G+   G  L      + +AY ANF  ADLSD + 
Sbjct: 59  L-------EN--ANFSNADLRGGVFNGTVLEGVNLHGVDFSEGIAYLANFKNADLSDAI- 108

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
                    LTNA+++R++    +     + GADF++AV+D+ Q + LC  ANG N  TG
Sbjct: 109 ---------LTNAMMLRSIFDNVN-----VTGADFTNAVLDITQVKKLCLKANGVNSKTG 154

Query: 233 VSTRKSLGC 241
           V TR+SLGC
Sbjct: 155 VDTRESLGC 163


>gi|254409676|ref|ZP_05023457.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183673|gb|EDX78656.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 163

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 71/112 (63%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F +A+++ S+F+ +   GA    +V   AN  GADLS  ++D+  L  A+L++ +LV 
Sbjct: 51  AEFANANLQLSNFAYADLRGAIFSGSVMTHANLHGADLSYGMLDQADLTGADLSDVILVE 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           T+L  S     +I GADF+DA++D AQ + LC+ A+G N  TGV+T  SLGC
Sbjct: 111 TLLLGSVFDNTLITGADFTDALLDGAQLKHLCQQASGINSKTGVATSDSLGC 162


>gi|119389531|pdb|2G0Y|A Chain A, Crystal Structure Of A Lumenal Pentapeptide Repeat Protein
           From Cyanothece Sp 51142 At 2.3 Angstrom Resolution.
           Tetragonal Crystal Form
          Length = 184

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 67/112 (59%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+AD+ +S+FS +   GA    +    A+  GADL++ L        A+LTNAVL  
Sbjct: 72  AQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTE 131

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+    A I GADFS AV+D+ +   LC  A+G NP TGVSTR+SL C
Sbjct: 132 AIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADGVNPKTGVSTRESLRC 183


>gi|428203139|ref|YP_007081728.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427980571|gb|AFY78171.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 177

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 66/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A++R S+FS +K  G     A    ANF GADL+   ++   L  AN TNA+LV   
Sbjct: 67  FDHANLRGSNFSNAKLQGVRFFAANLESANFEGADLTGADLESARLVRANFTNAILVGAF 126

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            T +   GAII+GADF+D ++    ++ LC+ A GTNP+TG +TR +L C
Sbjct: 127 ATNTLFNGAIIDGADFTDVLLRPDTEKKLCEIARGTNPVTGRNTRDTLNC 176


>gi|67922694|ref|ZP_00516198.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
 gi|416392485|ref|ZP_11685875.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
 gi|67855476|gb|EAM50731.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
 gi|357263639|gb|EHJ12621.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
          Length = 170

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+AD+ +S+FS +   GA    +     +   ADL++ L        A+LTNAVL  
Sbjct: 58  AQFTNADLTDSNFSDADLRGAVFNGSALIGTDLHQADLTNGLAYLTSFEGADLTNAVLTE 117

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+    A I GADFS AV+DL Q   LCK A+G N  TG+STR+SLGC
Sbjct: 118 AIMMRTTFKNANITGADFSLAVLDLQQVAELCKRADGVNSKTGISTRESLGC 169


>gi|145356305|ref|XP_001422373.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582615|gb|ABP00690.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 123

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 68/118 (57%), Gaps = 1/118 (0%)

Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           +E+ R A +  AD+R SD   S   GA   +AV    +   AD SD + D  +L  ++ T
Sbjct: 6   REDLRGAIYAEADLRRSDLRESDARGAVFSRAVMPGVDARDADFSDAMFDYALLRGSDFT 65

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           N+V V     R+DLG  +   ADF++AVID  Q  +LC+ A+GTNP TG +TR SL C
Sbjct: 66  NSVFVGANFVRADLGEVVATNADFTEAVIDRYQTLSLCERASGTNPYTGANTRDSLLC 123


>gi|443314247|ref|ZP_21043822.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442786146|gb|ELR95911.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 166

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 69/136 (50%), Gaps = 1/136 (0%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           +  A  F   +LR      ++ R N +T ADM E + +G+   G  L      KAN  GA
Sbjct: 30  VAQAESFDRQNLRMRDFSGQDLRGNDYTRADMAEVNLTGANLQGVRLFDTNLTKANLEGA 89

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           DL    +D      ANLTNA+L  +    +D   AII+GADF+D  +D      LC  A 
Sbjct: 90  DLRGATLDGARFLAANLTNAILAGSYAFNTDFRKAIIDGADFTDVFLDPKTNDLLCAVAQ 149

Query: 226 GTNPITGVSTRKSLGC 241
           GTNP+TG  TR +L C
Sbjct: 150 GTNPVTGRDTRDTLYC 165


>gi|440684721|ref|YP_007159516.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428681840|gb|AFZ60606.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 167

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 46/110 (41%), Positives = 66/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT A++R+S+FS +   G     A    AN  GADL++  +D   L  ANLTN +L    
Sbjct: 57  FTKANLRQSNFSHANLRGVSFFAANLESANLEGADLTNATLDSARLIRANLTNTILEGAF 116

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +   GAII+GADF+DA++   +++ LCK A G NP+TG  TR++L C
Sbjct: 117 AASARFDGAIIDGADFTDALLRGDEQKKLCKVAKGNNPVTGRDTRETLFC 166


>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 164

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 93/183 (50%), Gaps = 26/183 (14%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           +K+WRVF    LA  V+      +  L+      + +R         Q  +AD      +
Sbjct: 1   MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50

Query: 125 KENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           +E F      +ANF++AD+R     G+ FN AYLEKA     N  GAD ++ +   +   
Sbjct: 51  REEFTKVKLDKANFSNADLR-----GAVFNNAYLEKA-----NLHGADFTNGIAYLVDFR 100

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
           +A+L++A+   T+L  S      I G DF++AV+D  + + LC  ANG N  TGVSTR+S
Sbjct: 101 DADLSDAIFTDTMLLYSTFDNVEITGTDFTNAVLDGPELKKLCARANGVNSKTGVSTRES 160

Query: 239 LGC 241
           L C
Sbjct: 161 LEC 163


>gi|186686067|ref|YP_001869263.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186468519|gb|ACC84320.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 191

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 67/112 (59%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ++FT A++R+S+FS +  NG     A    AN  G+DL +  +D   L  ANLTNA+L  
Sbjct: 79  SSFTKANLRQSNFSRANLNGVSFFAANLESANLEGSDLRNATLDSARLVRANLTNALLEG 138

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +   GAII+GADF+D ++   +++ LCK A GTNP TG  TR +L C
Sbjct: 139 AFAANARFDGAIIDGADFTDTLLRPDEQKKLCKLAKGTNPTTGRDTRDTLFC 190


>gi|427706684|ref|YP_007049061.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427359189|gb|AFY41911.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 169

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 65/110 (59%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT A++R+S+FS S   G     A    AN  G DL++  +D   L +A+LTNAVL    
Sbjct: 59  FTKANLRQSNFSNSNLRGVSFFAANLESANLQGTDLTNATLDSARLMKADLTNAVLEGAF 118

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +   GAII+GADF+D ++   +++ LCK A GTNP TG  TR +L C
Sbjct: 119 AANAKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTNPTTGRDTRDTLFC 168


>gi|302768839|ref|XP_002967839.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
 gi|300164577|gb|EFJ31186.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
          Length = 126

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 56/136 (41%), Positives = 70/136 (51%), Gaps = 19/136 (13%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A     DLR AV         F + D R+ +  GS      L+ +    A F G DL DT
Sbjct: 4   ADLSGQDLRGAV---------FAACDCRKINLRGSN-----LDSSTDTFAGFEGGDLQDT 49

Query: 171 -----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
                L DR+V    NL NA+    +LT S   GA I GADF++A++D  Q+  LCK A 
Sbjct: 50  SWVQALADRVVFRMTNLQNAIFTNAILTGSQFDGADITGADFTEAILDNYQRLKLCKRAT 109

Query: 226 GTNPITGVSTRKSLGC 241
           GTN ITGV TR+SL C
Sbjct: 110 GTNSITGVETRESLAC 125


>gi|428773304|ref|YP_007165092.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
           7202]
 gi|428687583|gb|AFZ47443.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
          Length = 164

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 67/112 (59%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F ++ MR      S    A + ++V   A+  G + S  L+DR+  + ++L++A+L+ 
Sbjct: 51  AVFAASSMRRVSMRNSDLTNAMMTESVLLDADLHGVNFSGALIDRVTFDFSDLSDAILIG 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + TR+      I GADF+DAVID  Q   +C+ A+G NP+TGV+TR SLGC
Sbjct: 111 AIATRTRFYDTDITGADFTDAVIDRYQVSLMCERADGVNPVTGVATRDSLGC 162


>gi|119389418|pdb|2F3L|A Chain A, Crystal Structure Of A Lumenal Rfr-Domain Protein
           (Contig83.1_1_243_746) From Cyanothece Sp. 51142 At 2.1
           Angstrom Resolution
          Length = 184

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+AD+ +S+FS +   GA    +    A+  GADL++ L        A+LTNAVL  
Sbjct: 72  AQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTE 131

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +  R+    A I GADFS AV+D+ +   LC  A+G NP TGVSTR+SL C
Sbjct: 132 AIXXRTKFDDAKITGADFSLAVLDVYEVDKLCDRADGVNPKTGVSTRESLRC 183


>gi|75909862|ref|YP_324158.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75703587|gb|ABA23263.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 194

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 67/112 (59%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ++FT A++R+S+FS S   G     A    AN  G +L++  +D   L +ANLTNAVL  
Sbjct: 82  SSFTKANLRQSNFSKSNLTGVSFFAANLESANLEGTNLTNATLDSARLIKANLTNAVLEG 141

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +   GAII+GADF+D ++   +++ LCK A GTNP TG  TR +L C
Sbjct: 142 AFAASTKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTNPTTGRETRDTLFC 193


>gi|434400099|ref|YP_007134103.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428271196|gb|AFZ37137.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 167

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 51/129 (39%), Positives = 74/129 (57%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR A+         F  A +R SDFS S  +G  L  +   + NFTGA+LS+  +
Sbjct: 47  FSHQDLRDAI---------FDHASLRGSDFSYSDLSGVRLFGSNLSRVNFTGANLSNADL 97

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           +   L  AN TNA+L    +T + L  AIIEGADF++ ++    ++ LC+ A+GTNP TG
Sbjct: 98  ESCRLTRANFTNAILTGAFMTNTLLDEAIIEGADFTNVLLSPTTEKMLCENASGTNPTTG 157

Query: 233 VSTRKSLGC 241
            +T+ +L C
Sbjct: 158 RNTKDTLFC 166


>gi|427731475|ref|YP_007077712.1| putative low-complexity protein [Nostoc sp. PCC 7524]
 gi|427367394|gb|AFY50115.1| putative low-complexity protein [Nostoc sp. PCC 7524]
          Length = 185

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 10/149 (6%)

Query: 103 GEFGIGSAAQFG----SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYL 152
           G  GI + A F     + D  K + ++ +F       ++FT A++R+S+FS S   G   
Sbjct: 36  GILGITTIAGFAPTALALDYNKEILIEADFSGRDLTDSSFTKANLRQSNFSNSNLQGVSF 95

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             A    AN  G +LS+  +D   L +A+LTNAVL       +   GAII+GADF+D ++
Sbjct: 96  FAANLESANLQGVNLSNATLDSARLIKADLTNAVLEGAFAANAKFDGAIIDGADFTDVLL 155

Query: 213 DLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +++ LCK A GTNP TG  T  +L C
Sbjct: 156 RPDEQKKLCKVAKGTNPTTGRDTHDTLYC 184


>gi|119490210|ref|ZP_01622723.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
 gi|119454096|gb|EAW35249.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
          Length = 177

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           + F  A++R+S+FS S   G  L  A   + NF  ADLS   +D   LN ANLTNA+L  
Sbjct: 65  SEFDFANLRDSNFSHSNLRGVSLFGAKLQRTNFEAADLSYATLDTARLNRANLTNAILEG 124

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +D   A+I GADF+D ++    ++ LC  A GTNP+TG  TR +L C
Sbjct: 125 AFAYNTDFSDAMIAGADFTDVLLRRDMQEKLCALAEGTNPVTGRDTRDTLYC 176


>gi|116074723|ref|ZP_01471984.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
 gi|116067945|gb|EAU73698.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
          Length = 173

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/189 (34%), Positives = 93/189 (49%), Gaps = 29/189 (15%)

Query: 64  KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLR-KAV 122
           +L N R   S  LA  V   C     AL   +  +A T  E     A Q  SAD+  + +
Sbjct: 2   RLLNPRALCSGLLATLV---CCVISVALLPSSPAQAITAPELRGQKAVQDISADMHGRDL 58

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLM 172
             KE  +A+    D+ E+D  G+  N + L+            VA+ + F GADL D   
Sbjct: 59  KEKEFLKADLQGVDLSEADLRGAVINTSLLQGSDLRSADLGDVVAFASRFDGADLRD--- 115

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
                  A   NA+L+++  T ++     IEGADF++AVIDL Q +A+C  A G N  TG
Sbjct: 116 -------ARFVNAMLMQSRFTEAN-----IEGADFTNAVIDLPQLKAMCARAEGVNSATG 163

Query: 233 VSTRKSLGC 241
           +STR+SLGC
Sbjct: 164 ISTRESLGC 172


>gi|148239470|ref|YP_001224857.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
 gi|147848009|emb|CAK23560.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
          Length = 176

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 74/135 (54%), Gaps = 10/135 (7%)

Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           Q    D+   +H ++     F  AD+RE D   +   GA +  +    A+  GADL D +
Sbjct: 46  QRAVQDISSNMHGRDLKEKEFLKADLREVDLGDADLRGAVINTSQLQGADLRGADLEDVV 105

Query: 172 -----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
                 D   L +AN TNA+L++     S    A IEG DF++AVIDL Q +ALC  A+G
Sbjct: 106 AFSSRFDGADLRDANFTNAMLMQ-----SRFNDAQIEGTDFTNAVIDLPQLKALCGRASG 160

Query: 227 TNPITGVSTRKSLGC 241
            N ++GVST++SLGC
Sbjct: 161 VNSLSGVSTKESLGC 175


>gi|170078800|ref|YP_001735438.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
 gi|169886469|gb|ACB00183.1| secreted pentapeptide repeats protein [Synechococcus sp. PCC 7002]
          Length = 165

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 65/113 (57%), Gaps = 5/113 (4%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + N T+AD+  SD  G+ FN   LE       N  GAD ++ +        A+LT+A+ V
Sbjct: 58  QVNLTNADLSGSDLRGAVFNSTLLETT-----NLHGADFTNGIAYLSKFTGADLTDAIFV 112

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             +L RS    A I+GADFS AV+D  Q++ LC  A G NP+TG++T  SLGC
Sbjct: 113 EAILLRSTFENAKIDGADFSFAVLDGPQQKKLCAVATGVNPVTGIATADSLGC 165


>gi|116070732|ref|ZP_01468001.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
 gi|116066137|gb|EAU71894.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
          Length = 165

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 74/135 (54%), Gaps = 6/135 (4%)

Query: 113 FGSADLRKAVHV------KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F + D+ K V +      K+   A F  +++RE+D SGS   GA L  A    A+ +  D
Sbjct: 30  FAAVDVAKQVLIGADYANKDLVGATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTD 89

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           L +  +D  V+   NL+NAV+       +     +I GADF+D  +   Q ++LC  A+G
Sbjct: 90  LREATLDSAVMTGTNLSNAVMEGAFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADG 149

Query: 227 TNPITGVSTRKSLGC 241
           TNP+TG STR+SLGC
Sbjct: 150 TNPVTGRSTRESLGC 164


>gi|220907029|ref|YP_002482340.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219863640|gb|ACL43979.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 174

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 82/160 (51%), Gaps = 15/160 (9%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFS 143
           C  N+ +L     + A+   E  +G    F   DLR +          FT A++  SDFS
Sbjct: 26  CWFNLLSLPIAPGWAADYTKESLVG--VDFSGKDLRDS---------EFTQANLSRSDFS 74

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
            S   G     A    AN +GADL  T +D   L  ANLTNA+L       +   GA I 
Sbjct: 75  QSDLRGVSFFAANLESANLSGADLRLTTLDNARLTHANLTNAILEGAFAFNARFQGATIT 134

Query: 204 GADFSDAVIDLAQ--KQALCKYANGTNPITGVSTRKSLGC 241
           GADF+D  +DL Q  +  LC+ A+GTNP+TG +TR++LGC
Sbjct: 135 GADFTD--VDLRQDAQTILCQGASGTNPVTGRNTRETLGC 172


>gi|428298761|ref|YP_007137067.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428235305|gb|AFZ01095.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 169

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 91/180 (50%), Gaps = 17/180 (9%)

Query: 64  KLKN--WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
           KL N  WR+ +S  L   +    +  ++ +A   +Y  E      I   + F   DL  +
Sbjct: 4   KLSNNFWRIVLSALLGTVIWMISTWGLTPIAFALEYNKE------ILIQSDFSGRDLSDS 57

Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
                    +FT A++++S+FS +   G     A     + TGADLS++ +D   L +AN
Sbjct: 58  ---------SFTKANLKQSNFSNTNLRGVSFFAANLESVDLTGADLSNSTLDSARLVKAN 108

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           LTNA+L       +   GAII+GADF+D ++   ++  LCK A GTNP T  +TR +L C
Sbjct: 109 LTNAILEGAFAISAKFEGAIIDGADFTDILLRDDEQARLCKIATGTNPTTKRNTRDTLMC 168


>gi|17230824|ref|NP_487372.1| hypothetical protein all3332 [Nostoc sp. PCC 7120]
 gi|17132427|dbj|BAB75031.1| all3332 [Nostoc sp. PCC 7120]
          Length = 206

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/147 (36%), Positives = 81/147 (55%), Gaps = 10/147 (6%)

Query: 105 FGIGSAAQFG----SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEK 154
           FG+ + A F     + +  K + V+ +F       ++FT A++R+S+FS S   G     
Sbjct: 59  FGMITIANFTPPAFALEYNKEILVEADFSGRDLTDSSFTKANLRQSNFSKSNLTGVSFFA 118

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A    AN  G++L++  +D   L +ANL NAVL       +   GAII+GADF+D ++  
Sbjct: 119 ANLESANLEGSNLTNATLDSARLIKANLKNAVLEGAFAASTKFDGAIIDGADFTDVLLRP 178

Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
            +++ LCK A GTNP TG  TR +L C
Sbjct: 179 DEQKKLCKVAKGTNPTTGRETRDTLFC 205


>gi|119511413|ref|ZP_01630525.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
 gi|119463958|gb|EAW44883.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
          Length = 126

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 70/112 (62%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F++A+M  ++F+ +   GA +  +V  +AN  GADL++ ++D++    A+L++AV   
Sbjct: 14  AEFSNANMELANFADADLRGAVMSASVMTQANLHGADLTNAMVDQVKFAGADLSDAVFKE 73

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +L RS      I+ ADF+DA++D  Q + LC  A+G N  TGV TR SLGC
Sbjct: 74  ALLLRSTFTDVNIDSADFTDAILDGVQIKELCSKASGVNSKTGVETRYSLGC 125


>gi|119512324|ref|ZP_01631410.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
 gi|119463037|gb|EAW43988.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
          Length = 170

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/110 (41%), Positives = 67/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT A++R+SDF+ +   G     A    AN   ADLS   +D   L +ANLTNA+L    
Sbjct: 60  FTKANLRQSDFNHANLRGVSFFAANLESANLESADLSFATLDSARLIKANLTNAILEGAF 119

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + +   GAII+GADF+D ++   +++ LC+ A GTNP+TG +TR +L C
Sbjct: 120 ASNARFDGAIIDGADFTDILLRQDEEKKLCQLAKGTNPVTGRNTRDTLFC 169


>gi|33240300|ref|NP_875242.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
 gi|33237827|gb|AAP99894.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
          Length = 170

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/134 (41%), Positives = 76/134 (56%), Gaps = 21/134 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S  +F   DLR       NFR +N T A    S  +G+  +GA L+ A+AY ++F  ADL
Sbjct: 57  SGYEFVKFDLRGI-----NFRDSNLTGAVFNNSKLNGADLHGANLKDALAYASDFEDADL 111

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
           +D+          NL+NA+L+      S    AIIEGADF+DAV+   Q++ LC  A+GT
Sbjct: 112 TDS----------NLSNALLME-----SSFNNAIIEGADFTDAVLSRIQQKQLCSIADGT 156

Query: 228 NPITGVSTRKSLGC 241
           N  TG+ST  SLGC
Sbjct: 157 NSSTGISTSYSLGC 170


>gi|434386546|ref|YP_007097157.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428017536|gb|AFY93630.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 212

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 69/112 (61%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+A + +++F+G+   G  +  +   + N  GA+L+  L+D++    A+L++AV V 
Sbjct: 100 AVFTTAKLDDTNFAGADLTGVVISSSTLNRTNLHGANLTQGLLDQVRFVGADLSDAVFVE 159

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ RS      I GADF+DA++   Q++ LC+ A G N  TGV+TR SLGC
Sbjct: 160 AMMLRSTFTDVNIAGADFTDAILGKLQQKELCQIATGVNSKTGVATRDSLGC 211


>gi|428300991|ref|YP_007139297.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428237535|gb|AFZ03325.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 166

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 92/188 (48%), Gaps = 35/188 (18%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           +K W+  V   L   + AS +           Y A +    G   A      D      +
Sbjct: 1   MKFWQFLVGLVLTFVIFASSTP---------AYAASSSAVTGSIVAGSLKGKDFSGQSLI 51

Query: 125 KENF------RANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMD 173
            E F      +ANF++AD+R + F+GS  + A L+     + +AY ++F GA+LSD    
Sbjct: 52  AEEFTSVNLEKANFSAADLRGAVFNGSMLHDANLQGIDFSEGIAYLSDFKGANLSD---- 107

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
                 A  TNA+++R+  +  D     + GADF++AV+D  + Q LC  A+G NP TGV
Sbjct: 108 ------AVFTNAMMLRSAFSDVD-----VTGADFTNAVLDRTEVQKLCVNASGVNPKTGV 156

Query: 234 STRKSLGC 241
            TR+SLGC
Sbjct: 157 ETRQSLGC 164


>gi|123968372|ref|YP_001009230.1| hypothetical protein A9601_08391 [Prochlorococcus marinus str.
           AS9601]
 gi|123198482|gb|ABM70123.1| conserved hypothetical protein [Prochlorococcus marinus str.
           AS9601]
          Length = 170

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/113 (46%), Positives = 63/113 (55%), Gaps = 15/113 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            +N   A    S    SKFNGA L  A+AY  +FT ADLSD           N TNA+L+
Sbjct: 73  ESNLEGAVFNNSKLQNSKFNGANLRDALAYATDFTDADLSDV----------NFTNALLM 122

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                 S+  GA I+GADF+DAV+   Q++ LC  ANGTN  TG ST  SLGC
Sbjct: 123 -----ESNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170


>gi|124023397|ref|YP_001017704.1| hypothetical protein P9303_16951 [Prochlorococcus marinus str. MIT
           9303]
 gi|123963683|gb|ABM78439.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 198

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 79/149 (53%), Gaps = 26/149 (17%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN----------GAYL 152
           EF  G A +  S D+      ++NF +A+    D+ E+D  G+ FN          GA L
Sbjct: 63  EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           E  VA+ + F GADL            AN TNA+L++     S    A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167

Query: 213 DLAQKQALCKYANGTNPITGVSTRKSLGC 241
           D  Q+  LC  ANGTN ++G +T  SLGC
Sbjct: 168 DRRQQNELCSRANGTNAVSGSNTIDSLGC 196


>gi|428202122|ref|YP_007080711.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427979554|gb|AFY77154.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 168

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 74/135 (54%), Gaps = 1/135 (0%)

Query: 108 GSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           G+AA F   +L       +N + A FT+ D+  ++FS +   GA    +   + N  GAD
Sbjct: 33  GAAASFEDKNLSGQDFSGQNLQTAQFTNVDLTSANFSNTDLRGAVFNGSALKETNLHGAD 92

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           L++ L      N A+L++AVL   ++ R+   GA I GADF+ AV+D  Q   LC  A+G
Sbjct: 93  LTNGLAYLSSFNGADLSDAVLTEAIMLRTTFDGANITGADFTLAVLDGDQVAKLCTIASG 152

Query: 227 TNPITGVSTRKSLGC 241
            N  TGV TR SLGC
Sbjct: 153 VNSKTGVETRASLGC 167


>gi|119486074|ref|ZP_01620136.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
 gi|119456849|gb|EAW37977.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
          Length = 161

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 68/112 (60%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F ++++  ++F  S+  G+   KA+   AN  GADL+  ++D++  + A+L+N++   
Sbjct: 49  AEFANSNLESANFDHSQLVGSVFSKAMMKNANMRGADLTYAMLDQVDFSNADLSNSIFTE 108

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +   S      I GADF+DA++D  Q + LC  A+G NP TGVSTR SLGC
Sbjct: 109 VLFFGSTFKDTKITGADFTDALLDGEQLRQLCITASGVNPKTGVSTRYSLGC 160


>gi|443312459|ref|ZP_21042076.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
 gi|442777437|gb|ELR87713.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
          Length = 167

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 65/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+FT A++R S+ S S   G     A    AN  GA+L++  +D   + + NLTNAVL  
Sbjct: 55  ASFTKANLRNSNLSHSDLTGVSFFAANLESANLEGANLTNATLDAARIIKTNLTNAVLTG 114

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +   GAII+GADF+D ++   ++  LCK A GTNP TG  TR++L C
Sbjct: 115 AFAANAKFDGAIIDGADFTDVLLRQDEQDKLCKVAQGTNPTTGKQTRETLMC 166


>gi|428770110|ref|YP_007161900.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428684389|gb|AFZ53856.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 193

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 55/132 (41%), Positives = 71/132 (53%), Gaps = 20/132 (15%)

Query: 130 ANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFT----------GADLSDT---- 170
            NFT A +   DFS     GS F  + L  A  Y++N T          GADL +T    
Sbjct: 59  VNFTYAQLEGEDFSHRDLTGSVFAASNLRNASFYQSNLTNSVMTEGILFGADLRETNFTG 118

Query: 171 -LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
            L+DR+ L+ A+L NA+    + TR+      IEGADF+ AVID  Q   +C  A+G N 
Sbjct: 119 SLIDRVTLDFADLRNAIFTDAIATRTRFYDTNIEGADFTGAVIDRYQVALMCDRASGVNS 178

Query: 230 ITGVSTRKSLGC 241
           ITGV+TR SLGC
Sbjct: 179 ITGVATRDSLGC 190


>gi|427715923|ref|YP_007063917.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427348359|gb|AFY31083.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 169

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 78/141 (55%), Gaps = 6/141 (4%)

Query: 107 IGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           +G A    + +  K + ++ +F       ++FT A++R+S+FS +  +G     A    A
Sbjct: 28  VGGATTALALEYNKEILIEADFSGRDLTDSSFTKANLRQSNFSNANLSGVSFFAANLESA 87

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
           N  GA+L++  +D     + NLTNAVL       +   GAII+GADF+D ++   +++ L
Sbjct: 88  NLQGANLTNATLDSARFIKTNLTNAVLEGAFAANAKFDGAIIDGADFTDVLLRQDEQKKL 147

Query: 221 CKYANGTNPITGVSTRKSLGC 241
           CK A GTNP TG  TR +L C
Sbjct: 148 CKVAKGTNPTTGRDTRDTLFC 168


>gi|425455123|ref|ZP_18834848.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9807]
 gi|389804043|emb|CCI17099.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9807]
          Length = 161

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 68/121 (56%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L  A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNPITG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|425440692|ref|ZP_18820990.1| Pentapeptide repeat family protein (modular protein) [Microcystis
           aeruginosa PCC 9717]
 gi|389718807|emb|CCH97279.1| Pentapeptide repeat family protein (modular protein) [Microcystis
           aeruginosa PCC 9717]
          Length = 213

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 68/121 (56%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L  A
Sbjct: 92  DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 151

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNPITG +TR +L 
Sbjct: 152 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNTRDTLF 211

Query: 241 C 241
           C
Sbjct: 212 C 212


>gi|194476536|ref|YP_002048715.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
 gi|171191543|gb|ACB42505.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
          Length = 167

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 51/138 (36%), Positives = 74/138 (53%), Gaps = 26/138 (18%)

Query: 115 SADLR-KAVHVKENFRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKANFT 163
           +AD+  + +  +E  +A+    D  ESD  G+ FN           A L+  VA+ + F 
Sbjct: 45  NADMHGRKLQQQEFLKADLQKIDFSESDLRGTVFNNSDLRNANLNAADLQDVVAFASRFD 104

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
           GADL  T          NL N +L++     S     +IEGADF+DA++DL Q++ LC +
Sbjct: 105 GADLRQT----------NLRNGMLIQ-----SKFKDTLIEGADFTDAILDLKQQKILCSF 149

Query: 224 ANGTNPITGVSTRKSLGC 241
           ANGTN  TGV T++SL C
Sbjct: 150 ANGTNLKTGVDTKESLRC 167


>gi|425470227|ref|ZP_18849097.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9701]
 gi|389884202|emb|CCI35462.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9701]
          Length = 161

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 69/121 (57%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L +A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNPITG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPITGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|33865584|ref|NP_897143.1| hypothetical protein SYNW1050 [Synechococcus sp. WH 8102]
 gi|33632753|emb|CAE07565.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 162

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 7/139 (5%)

Query: 111 AQFGSA-DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           AQ  +A D+ K V +  ++       A F  +++RE++ SGS   GA L  A    A+ +
Sbjct: 24  AQVSAAMDVAKQVLIGSDYSGKDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLS 83

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
           G DL +  +D  V+   NL+NAVL       +      I GADF+D  +   Q ++LC  
Sbjct: 84  GTDLREATLDAAVMTGTNLSNAVLEGAFAFNTRFVDVTISGADFTDVPMRGDQLKSLCAV 143

Query: 224 ANGTNPITGVSTRKSLGCG 242
           A+GTNP+TG STR SLGCG
Sbjct: 144 ADGTNPVTGRSTRDSLGCG 162


>gi|428225171|ref|YP_007109268.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427985072|gb|AFY66216.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 170

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           +NFT A+MR S+ S +   G     A    AN  GA L+   +D   L +ANLTNA+L  
Sbjct: 58  SNFTKANMRSSNLSRANLQGVSFFGANLESANLEGAQLNYATLDSARLVKANLTNAILEG 117

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           T    +   GA IEGADF+DA++   + + LC+ A+G NP TG +TR+SL C
Sbjct: 118 TYAFNAKFAGATIEGADFTDALLRDDEIEHLCEVASGVNPTTGRATRESLMC 169


>gi|218438105|ref|YP_002376434.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218170833|gb|ACK69566.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 168

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 65/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ D+ E+DFS +   GA    +   +    GADL++ L        A+L++A+L  
Sbjct: 56  AQFTNVDLSEADFSNADLRGAVFNGSALIEGKLRGADLTNALGYLSSFERADLSDAILAE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+    A + GADFS AV+D  Q   LC+ A+G N  TGVSTR+SLGC
Sbjct: 116 VIMKRTSFKNADVTGADFSYAVLDGEQIANLCRTASGVNSKTGVSTRESLGC 167


>gi|78184858|ref|YP_377293.1| hypothetical protein Syncc9902_1285 [Synechococcus sp. CC9902]
 gi|78169152|gb|ABB26249.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 162

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 67/117 (57%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           K+   A F  +++RE+D SGS   GA L  A    A+ +  DL +  +D  V+   NL+N
Sbjct: 45  KDLVGATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTDLREATLDSAVMTGTNLSN 104

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           AV+       +     +I GADF+D  +   Q ++LC  A+GTNP+TG STR+SLGC
Sbjct: 105 AVMEGAFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADGTNPVTGRSTRESLGC 161


>gi|422302957|ref|ZP_16390315.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9806]
 gi|389792132|emb|CCI12113.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9806]
          Length = 161

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L +A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNP+TG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|425434011|ref|ZP_18814483.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9432]
 gi|425451971|ref|ZP_18831790.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           7941]
 gi|440753099|ref|ZP_20932302.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|389678210|emb|CCH92885.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9432]
 gi|389766463|emb|CCI07918.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           7941]
 gi|440177592|gb|ELP56865.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 161

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 68/121 (56%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L  A
Sbjct: 40  DFGGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNPITG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|126696874|ref|YP_001091760.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9301]
 gi|126543917|gb|ABO18159.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
          Length = 186

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 71/124 (57%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L+  +H  +     +   D+   D   +   GAY+    A  ++F GA+++D +      
Sbjct: 50  LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAYATRF 109

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           + A+ T+A L    L +S   GAII+GADF+DA +DL+Q+++LC+ A+GTN  TGV+T  
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNSQTGVNTID 169

Query: 238 SLGC 241
           SL C
Sbjct: 170 SLEC 173


>gi|123969083|ref|YP_001009941.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. AS9601]
 gi|123199193|gb|ABM70834.1| Pentapeptide repeats [Prochlorococcus marinus str. AS9601]
          Length = 186

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 71/124 (57%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L+  +H  +     +   D+   D   +   GAY+    A  ++F GA+++D +      
Sbjct: 50  LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAYATRF 109

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           + A+ T+A L    L +S   GAII+GADF+DA +DL+Q+++LC+ A+GTN  TGV+T  
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNTKTGVNTID 169

Query: 238 SLGC 241
           SL C
Sbjct: 170 SLEC 173


>gi|416389980|ref|ZP_11685429.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
 gi|357264135|gb|EHJ13061.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
          Length = 164

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 51/131 (38%), Positives = 70/131 (53%), Gaps = 8/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
             F   DLRK         A F  A++R+S+FS +   G     A    ANF GADL   
Sbjct: 41  VDFSGQDLRK--------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRYA 92

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            ++   L + N TNA+L     T   + GAII+GADF+D ++D   ++ LC  A GTNPI
Sbjct: 93  DLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNPI 152

Query: 231 TGVSTRKSLGC 241
           TG +T+ +L C
Sbjct: 153 TGRNTKDTLYC 163


>gi|67922307|ref|ZP_00515820.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
 gi|67855883|gb|EAM51129.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
          Length = 164

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 51/131 (38%), Positives = 70/131 (53%), Gaps = 8/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
             F   DLRK         A F  A++R+S+FS +   G     A    ANF GADL   
Sbjct: 41  VDFSGQDLRK--------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRYA 92

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            ++   L + N TNA+L     T   + GAII+GADF+D ++D   ++ LC  A GTNPI
Sbjct: 93  DLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNPI 152

Query: 231 TGVSTRKSLGC 241
           TG +T+ +L C
Sbjct: 153 TGRNTKDTLYC 163


>gi|427723591|ref|YP_007070868.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427355311|gb|AFY38034.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 165

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/128 (36%), Positives = 67/128 (52%), Gaps = 1/128 (0%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           S D        +NF+ A FT  + R +D S +   GA    +     N  GAD+S+ +  
Sbjct: 38  SEDFANENFAGQNFQGAEFTQVNFRNADMSNTDLRGAVFNSSQLQNTNLHGADMSNGIAY 97

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
                 A+L+ A+    +L RS    A I+GADFS AV+D +Q++ LC  A G NP+TG+
Sbjct: 98  LSAFTGADLSGAIFEEAILLRSTFDDANIDGADFSFAVLDGSQQKKLCAAATGVNPVTGI 157

Query: 234 STRKSLGC 241
            T  SLGC
Sbjct: 158 ETADSLGC 165


>gi|390440388|ref|ZP_10228721.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
 gi|389836192|emb|CCI32847.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
          Length = 161

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L  A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNP+TG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|425463375|ref|ZP_18842714.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9809]
 gi|389833543|emb|CCI21857.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9809]
          Length = 161

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L +A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSRANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNP+TG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|166362955|ref|YP_001655228.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
 gi|166085328|dbj|BAG00036.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
          Length = 186

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ ++++S+FS +   GA    A   + NF GADL++ L        ++L++A+   
Sbjct: 73  AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 132

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+   G  I GADFS AV+D  Q + LC+ A G N  TG+ST +SLGC
Sbjct: 133 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTPESLGC 184


>gi|425445790|ref|ZP_18825810.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
 gi|389734131|emb|CCI02174.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
          Length = 169

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ ++++S+FS +   GA    A   + NF GADL++ L        ++L++A+   
Sbjct: 56  AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+   G  I GADFS AV+D  Q + LC+ A G N  TGVST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAEQIKNLCERAEGVNSKTGVSTPESLGC 167


>gi|425458741|ref|ZP_18838229.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9808]
 gi|389824728|emb|CCI26060.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9808]
          Length = 161

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L  A
Sbjct: 40  DFGGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNP+TG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|425447360|ref|ZP_18827349.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9443]
 gi|389732098|emb|CCI03919.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
           9443]
          Length = 161

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 67/121 (55%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GAD SD  M  +      L  A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    +  LC+ A GTNPITG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEIYLCEIAKGTNPITGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|443326265|ref|ZP_21054925.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794122|gb|ELS03549.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 172

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/112 (44%), Positives = 65/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F   ++  SDFS S   G     A   +ANF  A+L    ++   L++ANLTNAVL  
Sbjct: 60  ATFDHTNLIGSDFSDSNLFGVRFFAANLREANFANANLKFADLEAARLSDANLTNAVLAG 119

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             LT + L G IIEGADFS A++D   ++ LC  A GTNP TG +TR +L C
Sbjct: 120 AYLTNALLDGVIIEGADFSGALLDRNDEKMLCDIATGTNPTTGRNTRDTLFC 171


>gi|126696175|ref|YP_001091061.1| hypothetical protein P9301_08371 [Prochlorococcus marinus str. MIT
           9301]
 gi|91070292|gb|ABE11210.1| conserved hypothetical protein [uncultured Prochlorococcus marinus
           clone HF10-88D1]
 gi|126543218|gb|ABO17460.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9301]
          Length = 170

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/113 (45%), Positives = 62/113 (54%), Gaps = 15/113 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            +N   A    S    SKF GA L  A+AY  +FT ADLSD           N TNA+L+
Sbjct: 73  ESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNALLM 122

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                 S+  GA I+GADF+DAV+   Q++ LC  ANGTN  TG ST  SLGC
Sbjct: 123 E-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170


>gi|390438199|ref|ZP_10226689.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
 gi|425441109|ref|ZP_18821396.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9717]
 gi|425454770|ref|ZP_18834496.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9807]
 gi|425466166|ref|ZP_18845469.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|425468563|ref|ZP_18847571.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9701]
 gi|389718271|emb|CCH97753.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9717]
 gi|389804467|emb|CCI16499.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9807]
 gi|389831470|emb|CCI25816.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|389838386|emb|CCI30813.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
 gi|389884775|emb|CCI34954.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9701]
          Length = 169

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ ++++S+FS +   GA    A   + NF GADL++ L        ++L++A+   
Sbjct: 56  AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+   G  I GADFS AV+D  Q + LC+ A G N  TGVST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGVSTPESLGC 167


>gi|354567943|ref|ZP_08987110.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353541617|gb|EHC11084.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 169

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 9/132 (6%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AA F   DL  +         +F  A++R S+FS S   G     +     +FTGADLS 
Sbjct: 46  AADFSKQDLTDS---------SFDHANLRNSNFSNSNLRGVRFFSSNLASVDFTGADLSY 96

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
             ++   + +ANLTNA+L     T +   GAII+GADF+D  I       LC+ A GTNP
Sbjct: 97  ADLESARMTKANLTNAILEGAFTTGTMFDGAIIDGADFTDTYIREDTLNKLCQVAKGTNP 156

Query: 230 ITGVSTRKSLGC 241
           +TG +TR +L C
Sbjct: 157 VTGRNTRDTLAC 168


>gi|443666115|ref|ZP_21133744.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159030126|emb|CAO91018.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443331286|gb|ELS45952.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 169

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ ++++S+FS +   GA    A   + NF GADL++ L        ++L++A+   
Sbjct: 56  AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+   G  I GADFS AV+D  Q + LC+ A G N  TGVST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGVSTPESLGC 167


>gi|157413206|ref|YP_001484072.1| hypothetical protein P9215_08711 [Prochlorococcus marinus str. MIT
           9215]
 gi|254525828|ref|ZP_05137880.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
           MIT 9202]
 gi|157387781|gb|ABV50486.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9215]
 gi|221537252|gb|EEE39705.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
           MIT 9202]
          Length = 170

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/113 (45%), Positives = 62/113 (54%), Gaps = 15/113 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            +N   A    S    SKF GA L  A+AY  +FT ADLSD           N TNA+L+
Sbjct: 73  ESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNALLM 122

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                 S+  GA I+GADF+DAV+   Q++ LC  ANGTN  TG ST  SLGC
Sbjct: 123 E-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170


>gi|434404813|ref|YP_007147698.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428259068|gb|AFZ25018.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 172

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 45/110 (40%), Positives = 64/110 (58%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A++R+S+ S +  NG     A    AN  GADL ++ +D   L  ANLTNA+L    
Sbjct: 62  FAKANLRQSNLSHTNLNGVSFFAANLESANLEGADLRNSTLDSARLVRANLTNALLEGAF 121

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +   GAII+GADF+D ++   +++ LCK A GTNP+T   TR +L C
Sbjct: 122 AANARFDGAIIDGADFTDMLLRQDEQKKLCKLAKGTNPVTLRDTRDTLFC 171


>gi|168067322|ref|XP_001785569.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662809|gb|EDQ49618.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 545

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 67/130 (51%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F  ADLR      +N R   F + D R+ +  GS  +G+    A     N   +      
Sbjct: 416 FDHADLRGRDMSNQNLRGVVFAACDCRKINLEGSTMDGSTDTFAGFEGGNLKNSSWIRAF 475

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
            DR+V   ANL NA     VL+ S   GA I GADF+DA++D  Q+  +C+ A G NP T
Sbjct: 476 ADRVVFRGANLENANFTDAVLSGSQFDGADITGADFTDALVDNYQRLQMCRRAKGVNPTT 535

Query: 232 GVSTRKSLGC 241
           GV+TR+SL C
Sbjct: 536 GVATRESLFC 545


>gi|422301609|ref|ZP_16388976.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9806]
 gi|389789327|emb|CCI14609.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9806]
          Length = 169

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ ++++S+FS +   GA    A   + NF GADL++ L        ++L++A+   
Sbjct: 56  AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+   G  I GADFS AV+D  Q + LC+ A G N  TG+ST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTLESLGC 167


>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 171

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 79/145 (54%), Gaps = 7/145 (4%)

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAV 156
           G  GI  A  F + D  K + V  +F        +F  A++R S+F+ +   G     A 
Sbjct: 26  GAIGINPAPAF-ALDRDKEILVGADFTGKVLTDDSFNKANLRNSNFTNADLRGVSFFAAN 84

Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             +ANF GA+L+   +D   + +ANLTNA+L       + L GA+I+GADF++ ++    
Sbjct: 85  MEEANFEGANLTGATLDLARMMKANLTNAILEGAFAYNTRLEGAVIDGADFTETLLRDDM 144

Query: 217 KQALCKYANGTNPITGVSTRKSLGC 241
            + LCK A GTNP+TG  TR++L C
Sbjct: 145 IEKLCKVAKGTNPVTGRDTRETLFC 169


>gi|428780675|ref|YP_007172461.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
 gi|428694954|gb|AFZ51104.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
          Length = 167

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 82/147 (55%), Gaps = 8/147 (5%)

Query: 98  EAETRGEFGIGS--AAQFGSADLR-KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           EA+T   F   S  +A     DL  + + ++E   AN T AD+  +D  GS F  + ++ 
Sbjct: 25  EAQTSTRFQRQSLISADLSEEDLSGETLQLREISDANLTGADLSNADLRGSIFTASVMKN 84

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A  + ANFT      T+++ +    A+L+ A+L   +L+R+ L    I GADF++AV+D 
Sbjct: 85  ANLHGANFTF-----TVLNGVDFTNADLSQAILEDAILSRAILKDVDITGADFTNAVLDN 139

Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
            Q   LC+ A G N  TGV+TR+SLGC
Sbjct: 140 QQYNQLCEMATGVNEETGVATRESLGC 166


>gi|148242344|ref|YP_001227501.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
 gi|147850654|emb|CAK28148.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
          Length = 164

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/131 (37%), Positives = 70/131 (53%), Gaps = 10/131 (7%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL----- 171
           DL   +H +   +  F   D+  +DFS S   G          +NF+GADL D +     
Sbjct: 39  DLSSDMHGRNLQQKEFLKMDLEGTDFSDSDLRGTVFNTTQLQDSNFSGADLRDVVAFSSR 98

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
            DR  L++A L N +L+++  T      A I+GADF++AV+DL Q + LC  A G N  +
Sbjct: 99  FDRADLSQARLDNGMLLQSKFT-----DATIDGADFTNAVLDLPQIKQLCARATGVNERS 153

Query: 232 GVSTRKSLGCG 242
           G+ST  SLGCG
Sbjct: 154 GLSTADSLGCG 164


>gi|78779169|ref|YP_397281.1| hypothetical protein PMT9312_0785 [Prochlorococcus marinus str. MIT
           9312]
 gi|78712668|gb|ABB49845.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9312]
          Length = 170

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/112 (45%), Positives = 62/112 (55%), Gaps = 15/112 (13%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           +N   A    S    SKF GA L  A+AY  +FT ADLSD           N TNA+L+ 
Sbjct: 74  SNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNALLME 123

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                S+  GA I+GADF+DAV+   Q++ LC  ANGTN  TG ST  SLGC
Sbjct: 124 -----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170


>gi|116074641|ref|ZP_01471902.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
 gi|116067863|gb|EAU73616.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
          Length = 158

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 59/110 (53%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F   ++RE+DFSGS   GA L  A    AN T  +L D  +D  VL+  NLTNAVL    
Sbjct: 49  FNLTNLREADFSGSDLQGASLYGAKLQDANLTDTNLRDATLDSAVLDGTNLTNAVLEDAF 108

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +     II GADF++        + LC  A GTNP+TG  TR +LGC
Sbjct: 109 AFNTRFSNVIITGADFTNVPFRGDALKTLCAAAEGTNPVTGRDTRDTLGC 158


>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 171

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/118 (44%), Positives = 72/118 (61%), Gaps = 11/118 (9%)

Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           K N R +NFT+AD+R     G  F  A +E+A    ANFTGA L    + RM+  +ANLT
Sbjct: 62  KANLRNSNFTNADLR-----GVSFFAANMEEANLEGANFTGATLD---LARMM--KANLT 111

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA+L       + L GA+I+GADF+D ++     + LCK A GTNP+TG  TR++L C
Sbjct: 112 NAILEGAFAYNTRLEGAVIDGADFTDTLLRDDMIEKLCKVAKGTNPVTGRDTRETLFC 169


>gi|443663881|ref|ZP_21133269.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|443331763|gb|ELS46407.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 150

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 68/135 (50%), Gaps = 19/135 (14%)

Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
            FG  DLR +     N RA              S F+ A LE    + AN  GAD SD  
Sbjct: 29  DFGGQDLRDSTFDHSNLRA--------------SNFSHANLEGVRFFSANLEGADFSDAN 74

Query: 172 MDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           M  +      L  AN TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A G
Sbjct: 75  MRNVDLESARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIAKG 134

Query: 227 TNPITGVSTRKSLGC 241
           TNPITG +TR +L C
Sbjct: 135 TNPITGRNTRDTLFC 149


>gi|428305184|ref|YP_007142009.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428246719|gb|AFZ12499.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 169

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 63/112 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT A++R  +FS +   G  L  A     N  GA+LS+  +D     +ANLTNAVL  
Sbjct: 57  ATFTKANLRNCNFSHADLRGVSLFGANLELVNLEGANLSNATLDTAKFTKANLTNAVLEG 116

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +   GAII+GADF+D ++    ++ LCK A GTNP TG  TR +L C
Sbjct: 117 AFAFNAKFDGAIIDGADFTDVLVRQDVQKQLCKIATGTNPTTGRETRDTLLC 168


>gi|428781463|ref|YP_007173249.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
 gi|428695742|gb|AFZ51892.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
          Length = 165

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 71/133 (53%), Gaps = 20/133 (15%)

Query: 114 GSADLRKAVHVKENFRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTGADLS 168
           G + +    + +E  RA+F +A++  + F+G+      + G      +AY  +FTG D  
Sbjct: 47  GESLIEAEFYDEELERADFHNANLEAAVFNGANLTNANWQGVNFTNGIAYLTDFTGVDF- 105

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
                         TNA+L   ++ RS    A +EG DF++AV+D  Q + LC+ A+G N
Sbjct: 106 --------------TNAILTEAMMLRSTFNDATVEGVDFTNAVVDRLQVKRLCERASGVN 151

Query: 229 PITGVSTRKSLGC 241
           P TGVSTR+SLGC
Sbjct: 152 PTTGVSTRESLGC 164


>gi|291566844|dbj|BAI89116.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 174

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           + F  A+++ S+FS +   G  L  A     N   ADL    +D   L  ANLTNA+L  
Sbjct: 62  SEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRFATLDTARLVRANLTNALLEE 121

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +D  GAII GADF+D ++   Q+Q LC+ A+GTNP+TG  TR++L C
Sbjct: 122 AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 173


>gi|425436672|ref|ZP_18817106.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9432]
 gi|425449430|ref|ZP_18829270.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           7941]
 gi|425458879|ref|ZP_18838365.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9808]
 gi|440755734|ref|ZP_20934936.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|389678572|emb|CCH92580.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9432]
 gi|389763888|emb|CCI09674.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           7941]
 gi|389823689|emb|CCI27950.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9808]
 gi|440175940|gb|ELP55309.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 169

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 66/112 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT+ ++++S+FS +   GA    A   + NF GADL++ L        ++L++A+   
Sbjct: 56  AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFSE 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ R+   G  I GADFS AV+D  Q + LC+ A G N  TG+ST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTPESLGC 167


>gi|409992571|ref|ZP_11275753.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409936565|gb|EKN78047.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 149

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           + F  A+++ S+FS +   G  L  A     N   ADL    +D   L  ANLTNA+L  
Sbjct: 37  SEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTNALLEE 96

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +D  GAII GADF+D ++   Q+Q LC+ A+GTNP+TG  TR++L C
Sbjct: 97  AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 148


>gi|428220990|ref|YP_007105160.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994330|gb|AFY73025.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 165

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/110 (40%), Positives = 62/110 (56%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F   D+  ++F  +   G  L  A    A+FTGADL  + +D   +N ANLTNAVL    
Sbjct: 54  FNKTDLHNANFRNANLAGVSLFGANMTAADFTGADLRYSTLDTARMNGANLTNAVLEGAF 113

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +  +   G +I+GADFSD  +    +  LCK A GTNP+TG  TR++L C
Sbjct: 114 VYGTSFVGTVIDGADFSDVDLRNTTRSLLCKVAKGTNPVTGRDTRETLEC 163


>gi|423066922|ref|ZP_17055712.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|406711687|gb|EKD06887.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 137

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           + F  A+++ S+FS +   G  L  A     N   ADL    +D   L  ANLTNA+L  
Sbjct: 25  SEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTNALLEE 84

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +D  GAII GADF+D ++   Q+Q LC+ A+GTNP+TG  TR++L C
Sbjct: 85  AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136


>gi|33862830|ref|NP_894390.1| hypothetical protein PMT0557 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634746|emb|CAE20732.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 198

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/149 (38%), Positives = 77/149 (51%), Gaps = 26/149 (17%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN----------GAYL 152
           EF  G A +  S D+      ++NF +A+    D+ E+D  G+ FN          GA L
Sbjct: 63  EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           E  VA+ + F GADL            AN TNA+L++     S    A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167

Query: 213 DLAQKQALCKYANGTNPITGVSTRKSLGC 241
           D  Q+  LC  A+GTN  +G  T  SLGC
Sbjct: 168 DRRQQNELCARADGTNAASGSQTLDSLGC 196


>gi|359460819|ref|ZP_09249382.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 164

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 73/132 (55%), Gaps = 10/132 (7%)

Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN- 178
           +A+  ++    +F+  D+RE++FS ++  GA   +A      F G DL+   +  + +  
Sbjct: 32  RAIDDEDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTG 91

Query: 179 ---------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
                    EA+L+ A+L   +L +S L  A +  ADFS AVID  Q + LC+ A+G NP
Sbjct: 92  GMAYLSSFAEADLSGAILTEAMLLQSSLRNATVTDADFSFAVIDKDQVKILCETASGVNP 151

Query: 230 ITGVSTRKSLGC 241
           +TGV TR SLGC
Sbjct: 152 VTGVDTRDSLGC 163


>gi|166364098|ref|YP_001656371.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
           NIES-843]
 gi|166086471|dbj|BAG01179.1| pentapeptide repeat family protein [Microcystis aeruginosa
           NIES-843]
          Length = 161

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 51/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)

Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
           +F   D+R+S F      GS F+ A LE    + AN  GA+ SD  M  +      L +A
Sbjct: 40  DFAGQDLRDSTFDHSNLRGSNFSRANLEGVRFFSANLEGANFSDANMRNVDLESARLTKA 99

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
           N TNAVL     T   + GAII+GADF+DA+I    ++ LC+ A GTNP+TG +TR +L 
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159

Query: 241 C 241
           C
Sbjct: 160 C 160


>gi|113475775|ref|YP_721836.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110166823|gb|ABG51363.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 165

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 77/155 (49%), Gaps = 22/155 (14%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           I + A F   ++     V+ N   N+T  D+   DFS    +G     A  +KANF GA+
Sbjct: 12  ILTVAGFWVMNIYSVQAVENN--VNYTLTDLNNRDFSYKDLHGTSFAGATMWKANFQGAN 69

Query: 167 LSDTLM--------------------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
           L +T++                    DR+  ++++LTNA+    +L  S    A + G D
Sbjct: 70  LQNTILTKGDFLRANLTEADFTGTFADRVSFDKSDLTNAIFTDAMLMSSTFRDATVIGTD 129

Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           FS A++D  Q + +C+ A+G N  TGV TR+SLGC
Sbjct: 130 FSGAMVDRYQIKLMCETASGKNKTTGVETRESLGC 164


>gi|158337467|ref|YP_001518642.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158307708|gb|ABW29325.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 164

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 73/132 (55%), Gaps = 10/132 (7%)

Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN- 178
           +A+  ++    +F+  D+RE++FS ++  GA   +A      F G DL+   +  + +  
Sbjct: 32  RAIDDEDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTG 91

Query: 179 ---------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
                    EA+L+ A+L   +L +S L  A +  ADFS AVID  Q + LC+ A+G NP
Sbjct: 92  GMAYLSSFAEADLSGAILTEAMLLQSSLRDATVTDADFSFAVIDKDQVKILCETASGVNP 151

Query: 230 ITGVSTRKSLGC 241
           +TGV TR SLGC
Sbjct: 152 VTGVDTRDSLGC 163


>gi|428313239|ref|YP_007124216.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428254851|gb|AFZ20810.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 169

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 63/112 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ++FT A++R S+FS S   G     A    ANF GA+L +  +D   L  A+L NAVL  
Sbjct: 57  SSFTKANLRSSNFSHSNLEGVSFFSANLESANFEGANLRNATLDTARLTRASLKNAVLEG 116

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +   GA IEGADF++ +     ++ LC  A+GTNP TG STR +L C
Sbjct: 117 AFAFNTKFDGATIEGADFTEVLFRQDVQKQLCHVASGTNPTTGRSTRDTLFC 168


>gi|159903526|ref|YP_001550870.1| hypothetical protein P9211_09851 [Prochlorococcus marinus str. MIT
           9211]
 gi|159888702|gb|ABX08916.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9211]
          Length = 169

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 21/152 (13%)

Query: 96  KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADM-----RESDFSGSKFNG 149
           K   E R +  +  +    + DL     VK + R  NF  +D+       S+ + ++FNG
Sbjct: 33  KRPPEIRNQDDLNISQDMHAQDLSGREFVKFDLRGINFKDSDLSGAVFNNSNLTNAQFNG 92

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A +  ++AY  NF   DLSD          ANLTNA+L+ +    +      I+GADF+D
Sbjct: 93  ADMHDSLAYATNFENTDLSD----------ANLTNALLMESTFVNTK-----IDGADFTD 137

Query: 210 AVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           AV+   Q++ LC  A+GTN  TG+ T  SLGC
Sbjct: 138 AVLSRIQQKQLCSIASGTNSNTGIDTEYSLGC 169


>gi|209527449|ref|ZP_03275954.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376003366|ref|ZP_09781178.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
 gi|209492122|gb|EDZ92472.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375328288|emb|CCE16931.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
          Length = 137

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           + F  A+++ S+FS +   G  L  A     N   ADL    +D   L  ANLTNA+L  
Sbjct: 25  SEFDFANLQGSNFSHTDLRGVSLFGAKMQDINLESADLRLATLDTARLVRANLTNALLEE 84

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +D  GAII GADF+D ++   Q+Q LC+ A+GTNP+TG  TR++L C
Sbjct: 85  AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136


>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 169

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT A++R S+FS +   G     A    ANF GA+L    +D   + + NLTNA+L  
Sbjct: 57  AQFTKANLRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLTNAILEG 116

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +    AI++GADF+D +I     + LCK A GTNP+TG +TR++L C
Sbjct: 117 AFAYNTKFERAIVDGADFTDILIRDDMVEKLCKVARGTNPVTGRNTRETLFC 168


>gi|113474577|ref|YP_720638.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110165625|gb|ABG50165.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 144

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 64/110 (58%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT + +R+S+FS +  +G  L  A    AN  GA+LS + +D  V N+ANLTNA+L    
Sbjct: 34  FTKSILRKSNFSNANLSGVSLFGAHLEGANLEGANLSYSTLDDAVFNKANLTNAILEGAF 93

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +    AII+GADF+DA +     + LCK A G N ITG  TR +L C
Sbjct: 94  AFHTQFRDAIIDGADFTDAFLRKDTTKDLCKIAQGKNSITGKETRDTLFC 143


>gi|434386960|ref|YP_007097571.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428017950|gb|AFY94044.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 168

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 62/110 (56%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT A +R   F  +   G  L       A+ TGA+L++ L+D       N TNA+LV   
Sbjct: 58  FTQASVRNGKFINANLTGVSLIGGNFDSADMTGANLTNALLDTARFTRTNFTNAILVGAF 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            + ++  GAII+GADF+D ++    ++ LCK A GTNP TG  TR+SL C
Sbjct: 118 TSVTNFDGAIIDGADFTDVLLRKDIQKKLCKVAKGTNPTTGRDTRESLEC 167


>gi|427734374|ref|YP_007053918.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369415|gb|AFY53371.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 167

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 71/131 (54%), Gaps = 6/131 (4%)

Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           D  K + ++ +F       ++FT A++R+S+FS S   G          AN   A+L   
Sbjct: 36  DYNKEILIEADFSGQDLTDSSFTKANLRDSNFSNSNLQGVRFFATNLESANLRNANLRYA 95

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D   L +A+LTNAVL     + +   GAII+GADF+D ++   ++  LCK A GTNP 
Sbjct: 96  TLDSARLVKADLTNAVLEGAFASNARFDGAIIDGADFTDVLLRADEQDKLCKLAKGTNPT 155

Query: 231 TGVSTRKSLGC 241
           TG  TR +L C
Sbjct: 156 TGRDTRDTLFC 166


>gi|33861334|ref|NP_892895.1| hypothetical protein PMM0777 [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
 gi|33633911|emb|CAE19236.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
          Length = 170

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 68/125 (54%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           DL + +H ++     F   ++   DFS S   GA    +    A  TGA+LSD L     
Sbjct: 46  DLEEDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATMTGANLSDALAYATD 105

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
             +A+L++      +L  S+  GA I+GADF++AV+   Q++ LC+ ANGTN  TG ST 
Sbjct: 106 FTDADLSDVNFTNALLMESNFEGAKIDGADFTNAVLSRIQQKELCEIANGTNSSTGESTE 165

Query: 237 KSLGC 241
            SLGC
Sbjct: 166 YSLGC 170


>gi|428310976|ref|YP_007121953.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428252588|gb|AFZ18547.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 167

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 20/118 (16%)

Query: 129 RANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           + NF +AD+R   FS S       +GA     +AY ++FTGADLSD              
Sbjct: 64  QTNFNNADLRNVVFSSSTLKQASLHGADFTSGIAYLSDFTGADLSD-------------- 109

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            AVL   ++ RS    A I GADF+DAV+D  Q + LC  A G N  TG++TR+SLGC
Sbjct: 110 -AVLTEAIMLRSRFDEADITGADFTDAVLDGVQIKKLCARATGVNSKTGMATRESLGC 166


>gi|428215647|ref|YP_007088791.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004028|gb|AFY84871.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 183

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 65/113 (57%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A+F  A++R+S+ S +   GA L  A    AN  GA+LS+T +D       NL NA+L 
Sbjct: 70  QASFNHANLRKSNLSHANLQGASLFAAHLEDANLEGANLSNTTLDTARFIRTNLKNAILE 129

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +    +   GA IEGADF+D  +     + LC+ A GTNP+TG +TR +L C
Sbjct: 130 GSFAFSAKFNGANIEGADFTDVFLRDDANEILCELATGTNPVTGRNTRDTLYC 182


>gi|318041291|ref|ZP_07973247.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
          Length = 161

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 49/131 (37%), Positives = 69/131 (52%), Gaps = 6/131 (4%)

Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           D+ K V +  +F       A F   ++RE+DF GS   GA L  A    AN +G DL+D 
Sbjct: 30  DVAKQVLIGHDFAGMDLRGATFNLTNLREADFHGSDLRGASLFGAKLQDANLSGTDLTDA 89

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  VL+  +L NAVL       +     +IEGADF++        + LC  A+GTNP+
Sbjct: 90  TLDSAVLDGTDLRNAVLENAFAFNTRFNNVLIEGADFTNVPFRGDVLKTLCASASGTNPV 149

Query: 231 TGVSTRKSLGC 241
           TG +TR +L C
Sbjct: 150 TGRNTRDTLEC 160


>gi|427420100|ref|ZP_18910283.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425762813|gb|EKV03666.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 165

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 66/133 (49%), Gaps = 1/133 (0%)

Query: 110 AAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           A  +    LR+     ++ R N +TS DM E+D S +   G  L      KAN   AD+S
Sbjct: 32  AKNYDRQSLRQQSFAGQDLRGNNYTSTDMAEADLSNTDLRGVRLFDTNLTKANLESADMS 91

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
              +D      ANL NA+        +D   A IEGADF+D  +D+     LC+ A G N
Sbjct: 92  GATLDGARFIRANLKNAIFEGAYAFSTDFRKANIEGADFTDVDLDVKTNDMLCEVATGVN 151

Query: 229 PITGVSTRKSLGC 241
           P+TG +T+ +L C
Sbjct: 152 PVTGRATKDTLYC 164


>gi|78779832|ref|YP_397944.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9312]
 gi|78713331|gb|ABB50508.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9312]
          Length = 186

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 48/142 (33%), Positives = 74/142 (52%), Gaps = 10/142 (7%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----M 172
           L++ +H  +     F   D+   D   +   GAY+    A  ++F  A++ D +      
Sbjct: 50  LKEDLHGADLQNNEFVKYDLSNQDLGEANLQGAYMSVTTAANSSFKSANMKDLIAYAVRF 109

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           D   L++ANLTN  L+++V       GA I+GADF+DA +DL Q+++LC+ A GTN  TG
Sbjct: 110 DNADLSDANLTNGELMKSVF-----DGATIDGADFTDATLDLPQRKSLCERATGTNSKTG 164

Query: 233 VSTRKSLGCGNSRRNAYGSPSS 254
           V T  SL C   R     +P +
Sbjct: 165 VDTVDSLECSGLRGYIPATPEA 186


>gi|172037018|ref|YP_001803519.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354555787|ref|ZP_08975086.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171698472|gb|ACB51453.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353552111|gb|EHC21508.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 167

 Score = 80.1 bits (196), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 73/130 (56%), Gaps = 11/130 (8%)

Query: 123 HVKENF-RANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 171
           + K+N    +F+S D+R+SDF      G  F+ A L+    +      ANF GADL    
Sbjct: 37  YAKQNLVERDFSSQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           ++   L   N TNA+L     T +   GA+I+GADF+D ++ L  ++ LC+ A GTNPIT
Sbjct: 97  LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCEIAKGTNPIT 156

Query: 232 GVSTRKSLGC 241
           G +T+ +L C
Sbjct: 157 GRNTKDTLFC 166


>gi|434395414|ref|YP_007130361.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428267255|gb|AFZ33201.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 168

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 63/110 (57%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A++R S+FS +   G  L  A    AN  GA+L++  +D   L+ ANL +AVL    
Sbjct: 58  FNHANLRNSNFSHANLEGVSLFAANLESANLEGANLTNATLDSARLSNANLKDAVLEGAF 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +    AII+GADF+D ++   ++  LCK A GTNP TG  TR++L C
Sbjct: 118 AANAKFDKAIIDGADFTDVLLRRDEQDKLCKVAKGTNPTTGRETRETLMC 167


>gi|254526458|ref|ZP_05138510.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
 gi|221537882|gb|EEE40335.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
          Length = 179

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 67/124 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L+  +H  +     +   D+   D   +   GAY+    A  ++F GA++ D +      
Sbjct: 43  LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRF 102

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           + A+ T+A L    L +S   GAII+GADF+DA +DL  +++LC+ A GTN  TGV+T  
Sbjct: 103 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLKTRKSLCERATGTNSQTGVNTAD 162

Query: 238 SLGC 241
           SL C
Sbjct: 163 SLEC 166


>gi|157413912|ref|YP_001484778.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9215]
 gi|157388487|gb|ABV51192.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
           str. MIT 9215]
          Length = 186

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 67/124 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L+  +H  +     +   D+   D   +   GAY+    A  ++F GA++ D +      
Sbjct: 50  LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRF 109

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           + A+ T+A L    L +S   GAII+GADF+DA +DL  +++LC+ A GTN  TGV+T  
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLKTRKSLCERATGTNSQTGVNTAD 169

Query: 238 SLGC 241
           SL C
Sbjct: 170 SLEC 173


>gi|218245449|ref|YP_002370820.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|257058486|ref|YP_003136374.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
 gi|218165927|gb|ACK64664.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
 gi|256588652|gb|ACU99538.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
          Length = 168

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 67/110 (60%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F +AD+ E++FS S   GA        +ANF GA+L++ L       +A+L++A+L   +
Sbjct: 58  FANADLTEANFSDSDLRGAVFNGVELKQANFHGANLTNGLAYLSSFRDADLSDAILSEVI 117

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           + R+    A I GADF+ AV+D  +   LC+ A+G N  TG+STR+SLGC
Sbjct: 118 MLRTVFDNANITGADFTLAVLDGEEVAKLCQRADGVNSKTGMSTRESLGC 167


>gi|91070378|gb|ABE11292.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
           HF10-88H9]
          Length = 186

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 67/124 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L+  +H  +     +   D+   D   +   GAY+    A  ++F GA++ D +      
Sbjct: 50  LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRF 109

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           + A+ T+A L    L +S   GAII+GADF+DA +DL  +++LC+ A GTN  TGV+T  
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLKTRKSLCERATGTNSQTGVNTAD 169

Query: 238 SLGC 241
           SL C
Sbjct: 170 SLEC 173


>gi|254414183|ref|ZP_05027950.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196178858|gb|EDX73855.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 178

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/118 (40%), Positives = 63/118 (53%), Gaps = 20/118 (16%)

Query: 129 RANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           + N ++ D+R     +S  + +   GA    ++AYK NF GADLSD              
Sbjct: 75  QTNLSNTDLRSVVISDSTMTDANLQGADFSYSIAYKVNFKGADLSD-------------- 120

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            AVL   +L  S L    I GADFS+AV+D  Q Q+LC  A+G N  TGV TR+SLGC
Sbjct: 121 -AVLEEAILLGSRLDDVNITGADFSNAVLDRVQVQSLCTKASGVNSKTGVETRESLGC 177


>gi|428771687|ref|YP_007163477.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428685966|gb|AFZ55433.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 159

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/129 (37%), Positives = 69/129 (53%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   +L  A   K N R    SA++ +SD  G  F GA ++       N  GA+L+++++
Sbjct: 39  FSGQNLTDATFNKTNLR----SANLSQSDLQGVSFFGANMDSI-----NLEGANLTNSIL 89

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           D   L  ANL NAVL     T +   GA IEGADF+D ++    ++ LC+ A G NP TG
Sbjct: 90  DSARLTRANLRNAVLEGAFATNTKFEGANIEGADFTDVILRPDVEEMLCEKAKGVNPTTG 149

Query: 233 VSTRKSLGC 241
             TR +L C
Sbjct: 150 RKTRDTLYC 158


>gi|434392213|ref|YP_007127160.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428264054|gb|AFZ30000.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 165

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 75/136 (55%), Gaps = 29/136 (21%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGA 165
           A+F +ADL  A         NF++AD+R   F+G+K      +GA     +AY  +FTGA
Sbjct: 54  AEFANADLEAA---------NFSNADLRGVVFNGAKLIKANLHGADFTNGIAYIVDFTGA 104

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           +LSD +M+           A+++R++    D     I GADF++AV+D    + LC  A+
Sbjct: 105 NLSDAVMEE----------AMMLRSIFNDVD-----ITGADFTNAVLDRTVVKKLCAQAS 149

Query: 226 GTNPITGVSTRKSLGC 241
           G N  TGV+TR SLGC
Sbjct: 150 GVNSKTGVATRDSLGC 165


>gi|428205702|ref|YP_007090055.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428007623|gb|AFY86186.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 169

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 62/110 (56%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F+ A++R S+FS S   G  L  A    ANF GA+L+   +D   L  ANL +A+L    
Sbjct: 59  FSHANLRSSNFSHSNLEGVSLFAANLDSANFEGANLASATLDSARLTRANLKDAILEGAF 118

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +   GA+I+GADF+D ++    +  LC+ A G NP TG +TR +L C
Sbjct: 119 AANTKFDGAVIDGADFTDVLMRRDVQDKLCQVAKGVNPTTGRATRDTLFC 168


>gi|254431831|ref|ZP_05045534.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
 gi|197626284|gb|EDY38843.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
          Length = 174

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 69/135 (51%), Gaps = 6/135 (4%)

Query: 113 FGSADLRKAVHVKENFRAN------FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
             + D+ K V +  ++         F   ++R++D SGS   GA L  A    A+ +  +
Sbjct: 39  LAAVDVAKQVLIGADYHGQDLRGGTFNLTNLRDADLSGSDLQGASLFGAKLQDADLSNTN 98

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           L +T +D  V N  +LTNAVL       +     II+GADF++  +     +ALC  A G
Sbjct: 99  LRETTLDSAVFNGTDLTNAVLEDAFAFNTKFSDVIIDGADFTNVPLRGDALKALCAVARG 158

Query: 227 TNPITGVSTRKSLGC 241
           TNP+TG  TR +LGC
Sbjct: 159 TNPVTGRQTRDTLGC 173


>gi|113954335|ref|YP_730803.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
 gi|113881686|gb|ABI46644.1| pentapeptide repeat protein [Synechococcus sp. CC9311]
          Length = 157

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 6/135 (4%)

Query: 113 FGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F + D  K V +  +F         F   ++RE+D SGS   GA L  A    AN + ++
Sbjct: 22  FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNSN 81

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           L D  +D  V +  NLTNAVL       +      +EGADF++  +     + LC  A G
Sbjct: 82  LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRTDALKVLCANAEG 141

Query: 227 TNPITGVSTRKSLGC 241
            NP+TG  TR++LGC
Sbjct: 142 VNPVTGRDTRETLGC 156


>gi|123966744|ref|YP_001011825.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9515]
 gi|123201110|gb|ABM72718.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
          Length = 192

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 70/132 (53%), Gaps = 21/132 (15%)

Query: 116 ADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           ADL+   +VK +        AN   A M  +    S F GA ++  +AY   F  AD SD
Sbjct: 63  ADLQNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
                     ANLTN  L+++V       GAII+GADF+DA +DL  +++LC+ A GTN 
Sbjct: 123 ----------ANLTNGELMKSVF-----DGAIIDGADFTDANLDLKTRKSLCERATGTNS 167

Query: 230 ITGVSTRKSLGC 241
            TGV T +SL C
Sbjct: 168 RTGVDTFESLEC 179


>gi|56752263|ref|YP_172964.1| hypothetical protein syc2254_d [Synechococcus elongatus PCC 6301]
 gi|24251237|gb|AAN46157.1| unknown protein [Synechococcus elongatus PCC 7942]
 gi|56687222|dbj|BAD80444.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 171

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/114 (40%), Positives = 62/114 (54%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            +A F S  ++   F G+   GA        +ANF  AD +D +     L   N  NA L
Sbjct: 56  IQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGIAYVSDLRNVNFRNANL 115

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +L +S+L G+ + GADFS AV+   Q  ALC+ A+GTNP TG  TR+SLGC
Sbjct: 116 TSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKTGADTRESLGC 169


>gi|443475471|ref|ZP_21065420.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019714|gb|ELS33767.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 164

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           K   RA FTS  ++ ++F+ +   GA     +   AN  G+D S  +         +L++
Sbjct: 47  KNLIRAEFTSVTLKNANFTNADLRGAIFNGVLLDGANLHGSDFSSGIAYISRFKNVDLSD 106

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           AVL  T + RS      + GADF++A++D+ Q + LC  A+GTN  TGVSTR+SLGC
Sbjct: 107 AVLNDTNMLRSTFDNVEVTGADFTNALLDIQQLKKLCINASGTNSKTGVSTRESLGC 163


>gi|81300649|ref|YP_400857.1| hypothetical protein Synpcc7942_1840 [Synechococcus elongatus PCC
           7942]
 gi|81169530|gb|ABB57870.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 168

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/114 (40%), Positives = 62/114 (54%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            +A F S  ++   F G+   GA        +ANF  AD +D +     L   N  NA L
Sbjct: 53  IQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGIAYVSDLRNVNFRNANL 112

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +L +S+L G+ + GADFS AV+   Q  ALC+ A+GTNP TG  TR+SLGC
Sbjct: 113 TSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKTGADTRESLGC 166


>gi|434397761|ref|YP_007131765.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428268858|gb|AFZ34799.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 166

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 68/130 (52%), Gaps = 1/130 (0%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F   D R      +N ++ +F   D+  ++FS +   GA    +    AN  G D S   
Sbjct: 36  FSEVDFRSKDFSGKNLQSIDFAKVDLESANFSNADLRGAVFNASNLANANLQGVDFSYGF 95

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
                 + A+LT+A+   T+L+ S   GA I+ ADF+ AV++  Q + LC  A+G NP T
Sbjct: 96  AYLTNFDGADLTDAIFQETILSFSTFEGAKIKNADFTFAVLEKWQVKQLCANASGVNPKT 155

Query: 232 GVSTRKSLGC 241
           GV TR+SLGC
Sbjct: 156 GVDTRESLGC 165


>gi|78212716|ref|YP_381495.1| hypothetical protein Syncc9605_1185 [Synechococcus sp. CC9605]
 gi|78197175|gb|ABB34940.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 165

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 64/112 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  +++RE++ SGS   GA L  A    A+ +G DL +  +D  V+   NL +AVL  
Sbjct: 53  ATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTNLEDAVLEG 112

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +     +I GADF+D  +   Q ++LC  A+GTN +TG STR+SLGC
Sbjct: 113 AFAFNTRFSDVLITGADFTDVPMRGDQLKSLCAVADGTNSVTGRSTRESLGC 164


>gi|443320013|ref|ZP_21049146.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
 gi|442790267|gb|ELR99867.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
          Length = 164

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 65/131 (49%), Gaps = 1/131 (0%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +F + DLR      +N +   FT   ++  +F+ +   G         +AN  G D S  
Sbjct: 33  RFDNRDLRGESFANQNLQTVEFTKVKLQGVNFANADLIGVVFNSTALDQANLQGVDFSQG 92

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
           +      +  +L +A+LV  +L RS      I GADFS AV+D  Q   LC YA+G N  
Sbjct: 93  IAYLTSFDGVDLRDALLVEALLLRSTFKDTKISGADFSSAVLDQDQLDKLCSYADGVNSK 152

Query: 231 TGVSTRKSLGC 241
           TGV TR+SLGC
Sbjct: 153 TGVKTRESLGC 163


>gi|56751209|ref|YP_171910.1| hypothetical protein syc1200_c [Synechococcus elongatus PCC 6301]
 gi|81299124|ref|YP_399332.1| hypothetical protein Synpcc7942_0313 [Synechococcus elongatus PCC
           7942]
 gi|56686168|dbj|BAD79390.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81168005|gb|ABB56345.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 170

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 68/131 (51%), Gaps = 6/131 (4%)

Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           D  K + ++ NF       ANFT A++R SDFS S   G     A     +  GADLS+T
Sbjct: 39  DFTKEILIESNFSNRDLSDANFTKANLRSSDFSNSVLVGVRFYGANLESVDLHGADLSNT 98

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
           ++D+  +   +LT+A+L       +   GA I GADF+D ++    +  LC  A G N  
Sbjct: 99  ILDQARMTNTDLTDAILEGAYAFNALFQGAKITGADFTDVLMRQDAQDLLCSVAEGVNSK 158

Query: 231 TGVSTRKSLGC 241
           TG +TR +L C
Sbjct: 159 TGRATRDTLDC 169


>gi|123966041|ref|YP_001011122.1| hypothetical protein P9515_08061 [Prochlorococcus marinus str. MIT
           9515]
 gi|123200407|gb|ABM72015.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9515]
          Length = 170

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/140 (38%), Positives = 72/140 (51%), Gaps = 30/140 (21%)

Query: 117 DLRKAVHVK-----ENFRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKAN 161
           DL + +H +     E  + N    D  +S+  G+ FN          GA L  A+AY  +
Sbjct: 46  DLEQDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATLNGANLTDALAYATD 105

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
           FT ADLSD           N TNA+L+      S+  GA I+GADF++AV+   Q++ LC
Sbjct: 106 FTDADLSD----------VNFTNALLME-----SNFEGAKIDGADFTNAVLSRIQQKELC 150

Query: 222 KYANGTNPITGVSTRKSLGC 241
             ANGTN  TG ST  SLGC
Sbjct: 151 AIANGTNSSTGESTEYSLGC 170


>gi|124025420|ref|YP_001014536.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. NATL1A]
 gi|123960488|gb|ABM75271.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
          Length = 156

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 60/112 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  +D+++SDFSGS   GA    A    AN +  ++ D  MD  +LN ANL+N+VL  
Sbjct: 45  ATFYLSDLQDSDFSGSDLQGASFFDAKLENANLSNTNMRDVTMDAAILNGANLSNSVLEG 104

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +     IIEGADF+D +I    +  LC  ANG N +T   T  +L C
Sbjct: 105 AFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGINSVTNKKTSDTLDC 156


>gi|307151213|ref|YP_003886597.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306981441|gb|ADN13322.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 174

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 71/136 (52%), Gaps = 29/136 (21%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGA 165
           AQF + DL +A         NF+ AD+R + F+GS     K +GA L  A+AY ++F GA
Sbjct: 56  AQFTNVDLTQA---------NFSDADLRGAVFNGSALKEVKLHGADLTNALAYLSSFEGA 106

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           DLSD               A+    +L R+    A + G DFS AV+D  +   LCK A+
Sbjct: 107 DLSD---------------AIFAEAILKRTSFKNADVTGTDFSFAVLDGEEIANLCKSAS 151

Query: 226 GTNPITGVSTRKSLGC 241
           G N  TGVSTR SL C
Sbjct: 152 GVNSKTGVSTRDSLRC 167


>gi|218438527|ref|YP_002376856.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218171255|gb|ACK69988.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 172

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 49/131 (37%), Positives = 66/131 (50%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           + F   DLR A          F  A++R S+FS     G     A    ANF GA+L   
Sbjct: 49  SDFSGQDLRDA---------KFDHANLRSSNFSNVNAEGVRFFAANLESANFEGANLRYA 99

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            ++   L   N TNAVL     T +   GAII+GADF+D ++    +Q LC  A GTNP+
Sbjct: 100 DLESARLTRVNFTNAVLEGAFATNTLFKGAIIDGADFTDVLLRPDTEQYLCTIAKGTNPV 159

Query: 231 TGVSTRKSLGC 241
           TG +T+ +L C
Sbjct: 160 TGRNTKDTLYC 170


>gi|449018152|dbj|BAM81554.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 321

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 68/113 (60%), Gaps = 1/113 (0%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT- 190
           F  + +R+ DFSGS    A    A    ANF  A+LS   ++   L +A+L NA+L    
Sbjct: 209 FQQSIVRDVDFSGSNLQDASFFDADCSGANFQNANLSRANLELANLRKADLRNAILTNAY 268

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
           V+ ++ L G  IEG+D++D ++   Q++ LCK A+G NP+T ++T+ SLGC +
Sbjct: 269 VVGQTKLEGIQIEGSDWTDVLLRPDQRRLLCKRASGENPVTHIATKDSLGCAD 321


>gi|22298403|ref|NP_681650.1| hypothetical protein tll0860 [Thermosynechococcus elongatus BP-1]
 gi|22294582|dbj|BAC08412.1| tll0860 [Thermosynechococcus elongatus BP-1]
          Length = 178

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 49/129 (37%), Positives = 68/129 (52%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR +   K    AN   +++  ++  G  F GA LE A     N  GADL    +
Sbjct: 54  FSGRDLRGSEFTK----ANLFHSNLSHTNLQGVSFFGANLETA-----NLEGADLRYATL 104

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           D   L +ANLTNA+L       ++   AII GADF+D  +    ++ LCK A+GTNP+TG
Sbjct: 105 DTARLTKANLTNAILEGAFAFNTNFDDAIITGADFTDVELREDAQRKLCKVASGTNPVTG 164

Query: 233 VSTRKSLGC 241
             T ++L C
Sbjct: 165 RKTWETLHC 173


>gi|427728200|ref|YP_007074437.1| putative low-complexity protein [Nostoc sp. PCC 7524]
 gi|427364119|gb|AFY46840.1| putative low-complexity protein [Nostoc sp. PCC 7524]
          Length = 164

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 67/184 (36%), Positives = 92/184 (50%), Gaps = 28/184 (15%)

Query: 65  LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           ++ WRV  S  LA  ++       A+ SS+I+  A       +  G+  IG  A+F +AD
Sbjct: 1   MRYWRVLASFVLAMILLLFPLSAEAASSSSITRSAGDEVARKDFSGQSLIG--AEFTNAD 58

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L       EN  ANF+ AD+R     G  FNG  LE       N  G D S+ +      
Sbjct: 59  L-------EN--ANFSDADLR-----GGVFNGTVLEGV-----NLHGVDFSNGIAYLAKF 99

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
             ANL++AVL   ++ RS      I G DF++AV+D  Q + LC  A+G N  TGV TR+
Sbjct: 100 KNANLSDAVLTDAMMLRSTFDNVDITGTDFTNAVLDGPQVKKLCTKASGVNSKTGVDTRE 159

Query: 238 SLGC 241
           SLGC
Sbjct: 160 SLGC 163


>gi|428776639|ref|YP_007168426.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
 gi|428690918|gb|AFZ44212.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
          Length = 167

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 42/122 (34%), Positives = 72/122 (59%), Gaps = 5/122 (4%)

Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           + + ++E   AN ++AD+ ++D  GS F  + ++ A  + ANFT      T+++ +    
Sbjct: 50  ETLQLREISDANLSAADLSDTDMRGSIFTASVMKDANLHGANFTF-----TVLNGVDFTN 104

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           A+L+  +L   +L+R+      I GADF++AV+D  Q   LC+ A+G N  TG++TR SL
Sbjct: 105 ADLSQTILEDAILSRATFENTDITGADFTNAVLDSRQIDQLCETASGVNEETGMATRDSL 164

Query: 240 GC 241
           GC
Sbjct: 165 GC 166


>gi|72381929|ref|YP_291284.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. NATL2A]
 gi|72001779|gb|AAZ57581.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 156

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 60/112 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  +D++ SDFSGS   GA    A    AN +  ++ D  MD  +LN ANL+N++L  
Sbjct: 45  ATFYLSDLQNSDFSGSDLQGASFFDAKLENANLSNTNMRDVTMDAAILNGANLSNSILEG 104

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +     IIEGADF+D +I    +  LC  ANG N +T   T ++L C
Sbjct: 105 AFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGINSVTNKKTSETLDC 156


>gi|452821017|gb|EME28052.1| thylakoid lumenal protein [Galdieria sulphuraria]
          Length = 217

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 47/114 (41%), Positives = 64/114 (56%), Gaps = 1/114 (0%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  + +RE+DF G+K   A    A    AN   ADL++  ++   L  A L NAVL R  
Sbjct: 104 FQQSLLRETDFHGAKLVSASFFGAELSYANLEDADLTEANLELANLRSAKLKNAVLRRAY 163

Query: 192 LT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 244
            +  + L    I+GADFS+ ++   QK+ LC  ANGTN  TGV T+ SLGC +S
Sbjct: 164 FSGNTRLENVDIDGADFSEVILRKDQKKYLCNIANGTNSHTGVETKTSLGCNSS 217


>gi|422295781|gb|EKU23080.1| pentapeptide repeat protein [Nannochloropsis gaditana CCMP526]
          Length = 217

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 69/117 (58%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           K+  + +F+ A  + ++F G+K  GA   K+   +A+FTGADL+    +   + +A L +
Sbjct: 100 KDFSKKDFSGAFAQRANFKGAKLMGARFYKSALTEADFTGADLTSASFEGANMVDAILKD 159

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           A++     T + L    IEGADFSD ++D   ++ LC+ A GTNP T V TR+SL C
Sbjct: 160 AIVNNAYFTETVLKVGSIEGADFSDTLLDRFVQKKLCEKATGTNPKTKVDTRESLLC 216


>gi|218248608|ref|YP_002373979.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218169086|gb|ACK67823.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 152

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 66/129 (51%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR A+         F  A++R S+FS +   G     A    ANF GADL    +
Sbjct: 28  FSGQDLRDAL---------FDHANLRGSNFSHANLQGVRFFSANLEGANFEGADLRGADL 78

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           +   L   N TNA+L     T   + G II+GADF+D ++    ++ LC  A GTNP+TG
Sbjct: 79  ESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVTG 138

Query: 233 VSTRKSLGC 241
            +T+ +L C
Sbjct: 139 RNTKDTLFC 147


>gi|317969761|ref|ZP_07971151.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
          Length = 160

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 60/112 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F   ++RE+DF G+   GA L  A    AN  GADLSD  +D  VL   +L NAVL  
Sbjct: 48  ATFNLTNLREADFHGADLRGASLYGAKLQDANLAGADLSDATLDSAVLEGTDLRNAVLEN 107

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +     +I+GADF++        + LC  A+GTNP+TG  T+ +L C
Sbjct: 108 AFAFNTRFKDVLIDGADFTNVPFRGDVLKTLCASASGTNPVTGRVTKDTLEC 159


>gi|72382023|ref|YP_291378.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. NATL2A]
 gi|124025522|ref|YP_001014638.1| hypothetical protein NATL1_08151 [Prochlorococcus marinus str.
           NATL1A]
 gi|72001873|gb|AAZ57675.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
           NATL2A]
 gi|123960590|gb|ABM75373.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL1A]
          Length = 170

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 82/165 (49%), Gaps = 22/165 (13%)

Query: 77  AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSAD 136
           A +V A   + I  L +LN  +  +  +    S   F   DL K ++  E   +N T A 
Sbjct: 28  AKSVFARTPAEIRNLEELNISQDMSSQDL---SGNDFVKLDL-KGINFSE---SNLTGAV 80

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
              S  +G+  +GA L  A+AY ++F GADL D   +  +L E+N T+A           
Sbjct: 81  FNNSKLNGADLHGAQLNDALAYASDFEGADLRDVDFNGALLMESNFTDA----------- 129

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
               +IEGADF+DAVI   Q++ LC  A+GTN  T   T  SLGC
Sbjct: 130 ----LIEGADFTDAVISRIQQKELCNMASGTNSKTDEDTSYSLGC 170


>gi|443328810|ref|ZP_21057403.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442791546|gb|ELS01040.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 170

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 62/113 (54%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R +F   D+ E++FS S   GA    +    AN  GAD +           A+L++A+  
Sbjct: 56  RLDFAKVDLSEANFSNSDLRGAVFNASDLSNANLHGADFTYGFAYLTDFQGADLSDAIFR 115

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            T+L+ S    A+I+GADF+ A+++  Q   LC+ A G N  TGV TR+SLGC
Sbjct: 116 ETILSFSSFEDAMIDGADFTLAILEKWQVNQLCENATGVNSQTGVDTRRSLGC 168


>gi|428774426|ref|YP_007166214.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
           7202]
 gi|428688705|gb|AFZ48565.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
          Length = 158

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 72/131 (54%), Gaps = 20/131 (15%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTNA 185
           ++T A + ESDFSG   +G+   K     ++FT A+LS+       +D   L  ANLTNA
Sbjct: 27  DYTKAHLVESDFSGQDLSGSTFNKTNLRSSDFTNANLSNVSFFGANLDSANLEGANLTNA 86

Query: 186 VLVRTVLTRSDLGGAI---------------IEGADFSDAVIDLAQKQALCKYANGTNPI 230
           VL    +TR++L  A+               IEGADF+D ++    ++ LC+ A+G NP+
Sbjct: 87  VLDSARVTRANLHNAVLEGAFATNTKFEKANIEGADFTDVLLRPDVEEMLCEVASGINPV 146

Query: 231 TGVSTRKSLGC 241
           TG +TR +L C
Sbjct: 147 TGRNTRDTLYC 157


>gi|158337082|ref|YP_001518257.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158307323|gb|ABW28940.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 175

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 60/112 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+FT AD+R SDFS S   G     A     N  GA+LS   +D      ANLTNA L  
Sbjct: 63  ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                ++   AII+GADF+D  +     + LC  A GTNP+TG +TR +L C
Sbjct: 123 AFAFNTEFRRAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174


>gi|257061674|ref|YP_003139562.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
 gi|256591840|gb|ACV02727.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
          Length = 167

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 66/129 (51%), Gaps = 9/129 (6%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DLR A+         F  A++R S+FS +   G     A    ANF GADL    +
Sbjct: 47  FSGQDLRDAL---------FDHANLRGSNFSHANLQGVRFFSANLEGANFEGADLRGADL 97

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           +   L   N TNA+L     T   + G II+GADF+D ++    ++ LC  A GTNP+TG
Sbjct: 98  ESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVTG 157

Query: 233 VSTRKSLGC 241
            +T+ +L C
Sbjct: 158 RNTKDTLFC 166


>gi|359460626|ref|ZP_09249189.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 175

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 60/112 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+FT AD+R SDFS S   G     A     N  GA+LS   +D      ANLTNA L  
Sbjct: 63  ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                ++   AII+GADF+D  +     + LC  A GTNP+TG +TR +L C
Sbjct: 123 AFAFNAEFRKAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174


>gi|75908971|ref|YP_323267.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75702696|gb|ABA22372.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 164

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 68/189 (35%), Positives = 96/189 (50%), Gaps = 38/189 (20%)

Query: 65  LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           +K WRV  S  LA  +        A+ SS+I+  A       +  G+  IGS  +F + D
Sbjct: 1   MKYWRVVASFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLM 172
           L       EN  ANF++AD+R   F+G+   G  L        +AY A F  ADLSD   
Sbjct: 59  L-------EN--ANFSNADLRGGVFNGTVLEGVNLHGVDFSNGIAYLARFKNADLSD--- 106

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
                  A LT+A+++R+V    D     + GADF++AV+D  + + LC  A+G N  TG
Sbjct: 107 -------AVLTDAMMLRSVFDNVD-----VSGADFTNAVLDGTEVKKLCVKASGVNSKTG 154

Query: 233 VSTRKSLGC 241
           V TR+SLGC
Sbjct: 155 VDTRESLGC 163


>gi|352094203|ref|ZP_08955374.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
 gi|351680543|gb|EHA63675.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
          Length = 159

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 6/135 (4%)

Query: 113 FGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F + D  K V +  +F         F   ++RE+D SGS   GA L  A    AN +  +
Sbjct: 24  FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNTN 83

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           L D  +D  V +  NLTNAVL       +      +EGADF++  +     + LC  A G
Sbjct: 84  LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRADALKVLCANAEG 143

Query: 227 TNPITGVSTRKSLGC 241
            NP+TG  T ++LGC
Sbjct: 144 VNPVTGRDTSETLGC 158


>gi|434406341|ref|YP_007149226.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428260596|gb|AFZ26546.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 165

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 67/117 (57%), Gaps = 20/117 (17%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           ANF+ AD+R   F+G+   G  L      + +AY  NF GAD +D          A  T+
Sbjct: 63  ANFSDADLRGVVFNGTLLKGVNLHGVDFSQGIAYLVNFKGADFTD----------AVFTD 112

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           A+++R++    +     + GADF++AV+D+ Q + LC  A+G N  TGV+TR+SLGC
Sbjct: 113 AMMLRSLFDDVN-----VTGADFTNAVLDMQQVKKLCLKASGVNSQTGVNTRESLGC 164


>gi|427736970|ref|YP_007056514.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427372011|gb|AFY55967.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 164

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 20/116 (17%)

Query: 131 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF++ADMR + F+GS       +G      +AY +NF  +DLSD +           TNA
Sbjct: 63  NFSNADMRGAVFNGSLLENSNLHGVDFTDGIAYLSNFKDSDLSDAI----------FTNA 112

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +++RT+    D     + GADFS A++D  + + LC+ A+G N  TGVSTR SL C
Sbjct: 113 MMLRTIFRNVD-----VTGADFSGAILDRVEVKKLCETASGVNSKTGVSTRASLEC 163


>gi|443477206|ref|ZP_21067069.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443017715|gb|ELS32099.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 167

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 61/110 (55%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  +D+R + F  +   G     A   +AN TGA+LS + +D   L++ANLTNAV+  + 
Sbjct: 57  FNESDLRNASFVNADAQGVSFFAANMKEANLTGANLSYSTLDNARLDKANLTNAVIEGSF 116

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +     II+GADF+D  +    +Q LCK A G NP TG  TR +L C
Sbjct: 117 AYGTSFNNVIIDGADFTDVDLRTPIRQKLCKSAKGQNPTTGRLTRDTLEC 166


>gi|88770664|gb|ABD51935.1| chloroplast thylakoid 11 kDa protein [Guillardia theta]
          Length = 242

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 2/107 (1%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M++ DFS  KF  A + K  A +A F GAD S+ +MDR    +++   A+    VL+ S+
Sbjct: 134 MQKGDFSKVKFKDAVMSKVFADEATFDGADFSNAVMDRGTWRKSSFKGAIFANAVLSGSE 193

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
             G+ +  +DFSD  +     + +CK     GTNP+TGV TR S  C
Sbjct: 194 FEGSDLTDSDFSDTYMGDFDNKKICKNPTLQGTNPVTGVDTRASASC 240


>gi|33861906|ref|NP_893467.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           subsp. pastoris str. CCMP1986]
 gi|33640274|emb|CAE19809.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
           CCMP1986]
          Length = 192

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 71/136 (52%), Gaps = 21/136 (15%)

Query: 116 ADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           ADL+   +VK +        AN   A M  +    S F GA ++  +AY   F  AD SD
Sbjct: 63  ADLQNNEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
                     ANLTN  L+++V       GA I+GADF++A +DL  +++LC+ A+GTN 
Sbjct: 123 ----------ANLTNGELMKSVFD-----GATIDGADFTNANLDLKTRKSLCERASGTNS 167

Query: 230 ITGVSTRKSLGCGNSR 245
            TGV T +SL C   R
Sbjct: 168 QTGVDTFESLECSGLR 183


>gi|428164857|gb|EKX33868.1| hypothetical protein GUITHDRAFT_155908 [Guillardia theta CCMP2712]
          Length = 237

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 2/107 (1%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M++ DFS  KF  A + K  A +A F GAD S+ +MDR    +++   A+    VL+ S+
Sbjct: 129 MQKGDFSKVKFKDAVMSKVFADEATFDGADFSNAVMDRGTWRKSSFKGAIFANAVLSGSE 188

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
             G+ +  +DFSD  +     + +CK     GTNP+TGV TR S  C
Sbjct: 189 FEGSDLTDSDFSDTYMGDFDNKKICKNPTLQGTNPVTGVDTRASASC 235


>gi|124023314|ref|YP_001017621.1| hypothetical protein P9303_16121 [Prochlorococcus marinus str. MIT
           9303]
 gi|123963600|gb|ABM78356.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 158

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 66/131 (50%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F + DLR            F  A++RE++ SGS   G+ L  A  + AN +  +L D+
Sbjct: 36  ADFSNQDLRGDT---------FNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDS 86

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  + +  +LTNAVL       +      I GADF++  +       LC+ A GTNPI
Sbjct: 87  TLDSAIFDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPI 146

Query: 231 TGVSTRKSLGC 241
           TG +T  SLGC
Sbjct: 147 TGRNTADSLGC 157


>gi|186683889|ref|YP_001867085.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186466341|gb|ACC82142.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 165

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 63/112 (56%), Gaps = 10/112 (8%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF++AD+R     G  FNG  LE       N  G D S+ +       +A+L++AVL  
Sbjct: 63  ANFSNADLR-----GGVFNGTLLEGV-----NLHGVDFSEGIAYLTRFKDADLSDAVLTD 112

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ RS      + GADF++A++D  Q + LC  A+G N  TGV TR+SLGC
Sbjct: 113 AMMLRSTFDDVNVTGADFTNAILDGTQVKKLCVKASGVNSKTGVDTRQSLGC 164


>gi|224098455|ref|XP_002311180.1| predicted protein [Populus trichocarpa]
 gi|222851000|gb|EEE88547.1| predicted protein [Populus trichocarpa]
          Length = 218

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 68/214 (31%), Positives = 104/214 (48%), Gaps = 28/214 (13%)

Query: 40  ISSKTESDGQFPDCSNNQCAGPYAKLKNWRV---FVSTALAAAVVASCSSNISALA--DL 94
           I+  + S    P  S + C  P A + N ++   F  T   A +  S      ALA    
Sbjct: 20  ITKPSLSIPHLPSLSFSHCDKPQALIPNKQLVEDFAKTGFLAILSVSLFFTDPALAFKGG 79

Query: 95  NKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE 153
             Y +E TRG+   G        D      +K++F+ +     +R+++F G+K  GA   
Sbjct: 80  GPYGSEVTRGQDLTGK-------DFSGRTLIKQDFKTSI----LRQANFKGAKLLGASF- 127

Query: 154 KAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADF 207
               + A+ TGADLSD  +   D  + N  +ANL+NA L   + T  +   G+ I GADF
Sbjct: 128 ----FDADLTGADLSDADLRSADFSLTNVTKANLSNANLEGALATGNTSFRGSNITGADF 183

Query: 208 SDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +D  +   Q++ LCK+A+G NP TG +TR +L C
Sbjct: 184 TDVPLREDQREYLCKFADGVNPTTGNATRDTLLC 217


>gi|254423673|ref|ZP_05037391.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196191162|gb|EDX86126.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 190

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 67/133 (50%), Gaps = 1/133 (0%)

Query: 110 AAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           A  F   +LR+     ++   N +T AD+ E+D S +      L      +AN  GA+L+
Sbjct: 57  ADNFDRMNLRQQDFSGQDLTDNDYTRADLTEADLSHTNLERVRLFTTRLNRANLEGANLT 116

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
              +D   L  ANL +AVL        D  G  IEGADF+D ++D      LC+ A GTN
Sbjct: 117 GATLDGASLVGANLKDAVLEGAYAINIDFRGIDIEGADFTDVLLDPKDNDKLCEIATGTN 176

Query: 229 PITGVSTRKSLGC 241
           P TG  T+++L C
Sbjct: 177 PTTGRKTKETLYC 189


>gi|307154028|ref|YP_003889412.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306984256|gb|ADN16137.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 172

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 60/110 (54%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A++R S+FS +   G     +    ANF GA+L    ++   L   N TNAVL    
Sbjct: 61  FDHANLRSSNFSNANLEGVRFFASNLESANFEGANLRYADLESARLIRVNFTNAVLEGAF 120

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            T +   GAII+GADF+D ++    ++ LC  A GTNP+TG  T+ +L C
Sbjct: 121 ATNTLFKGAIIDGADFTDVLLRPDVEKYLCTIAKGTNPVTGRDTKDTLYC 170


>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 471

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/101 (44%), Positives = 60/101 (59%), Gaps = 14/101 (13%)

Query: 115 SADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSD 169
            ADLRKA         N + A M  +D SG+   GA LE      AVA+KANFTGA+LSD
Sbjct: 133 QADLRKA---------NLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFTGANLSD 183

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L+D+  L+E++L+NA L  ++L  +DL  AI+ G   S A
Sbjct: 184 GLLDQANLSESDLSNANLHNSILDETDLSKAILRGTTLSKA 224



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 10/94 (10%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANL 182
           F+A+ + A +RE++ +G+  +GA L KA     + Y+A   GA+L DT +    L +A+L
Sbjct: 77  FKADLSEASIREANMTGANLSGATLHKADLQRVILYRATLAGANLFDTTLHEANLCQADL 136

Query: 183 TNAVLVRTVLTRSDLGGA-----IIEGADFSDAV 211
             A L    +  +DL GA     I+EG D  DAV
Sbjct: 137 RKANLSMARMHHTDLSGANLTGAILEGIDLKDAV 170



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 49/100 (49%), Gaps = 8/100 (8%)

Query: 128 FRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           +  +   AD+ +++FSG+        GA LE AV Y+ +   ADLS+  +    +  ANL
Sbjct: 37  WEIDLMGADLSQTNFSGANLVRASLQGARLENAVLYRTSLFKADLSEASIREANMTGANL 96

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           + A L +  L R  L  A + GA+  D  +  A    LC+
Sbjct: 97  SGATLHKADLQRVILYRATLAGANLFDTTLHEAN---LCQ 133


>gi|126659509|ref|ZP_01730642.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
 gi|126619243|gb|EAZ89979.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
          Length = 167

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 70/130 (53%), Gaps = 11/130 (8%)

Query: 123 HVKENF-RANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 171
           + K+N    +F+  D+R+SDF      G  F+ A L+    +      ANF GADL    
Sbjct: 37  YAKQNLVERDFSGQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           ++   L   N TNA+L     T +   GA+I+GADF+D ++ L  ++ LC  A GTNP+T
Sbjct: 97  LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCDIAKGTNPVT 156

Query: 232 GVSTRKSLGC 241
             +T+ +L C
Sbjct: 157 RRNTKDTLFC 166


>gi|427719897|ref|YP_007067891.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427352333|gb|AFY35057.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 165

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 61/114 (53%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            +A FTS +++ ++FS +   G      +    N  GAD S+ +         +L++A+L
Sbjct: 51  IQAEFTSVNLKNTNFSNADLRGGVFNSTLLEGVNLHGADFSEGIAYLARFKNTDLSDAIL 110

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              ++ RS      I GADF++AV+D  Q + LC  A+G N  TG  TR+SLGC
Sbjct: 111 TDAMMLRSTFDDVDITGADFTNAVLDGVQIKKLCVNASGVNSKTGTDTRESLGC 164


>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
 gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
 gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
 gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
 gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
 gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
          Length = 196

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 17/178 (9%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L  W+  V T +   +VA+    + +LA  +      RG       A F   DLR ++  
Sbjct: 34  LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87

Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
             N R A+FT A+++ + F  +  +GA LE A A   +F  A L+           ANL 
Sbjct: 88  HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 137

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L  +  T +  G   IEGAD +D ++    +  LC  A GTNP+TG  T+++L C
Sbjct: 138 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 195


>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
          Length = 194

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 17/178 (9%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L  W+  V T +   +VA+    + +LA  +      RG       A F   DLR ++  
Sbjct: 32  LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85

Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
             N R A+FT A+++ + F  +  +GA LE A A   +F  A L+           ANL 
Sbjct: 86  HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 135

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L  +  T +  G   IEGAD +D ++    +  LC  A GTNP+TG  T+++L C
Sbjct: 136 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 193


>gi|282897571|ref|ZP_06305571.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
 gi|281197494|gb|EFA72390.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
          Length = 164

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/184 (30%), Positives = 87/184 (47%), Gaps = 28/184 (15%)

Query: 65  LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           +K W++FV   L A          A+ SS+I+  A       +  G+  +G   +F +  
Sbjct: 1   MKYWQIFVGLVLTAVFFVSNLPAQAASSSSITRSAGSEIEIQDYSGKSLVGK--EFTNIK 58

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L  A         NF++AD+R     G  FNG  L       AN  G + SD +      
Sbjct: 59  LENA---------NFSNADLR-----GVVFNGTLL-----IDANLHGVNFSDGISYLSNF 99

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
             +NL++A+    ++ RS      + GADF++A++D  + + LC  A+G N  TGV TRK
Sbjct: 100 KNSNLSDAIFTNAMMLRSTFNNVDVTGADFTNAILDGVEVKKLCANASGVNSQTGVDTRK 159

Query: 238 SLGC 241
           SLGC
Sbjct: 160 SLGC 163


>gi|440680470|ref|YP_007155265.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428677589|gb|AFZ56355.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 168

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 64/121 (52%), Gaps = 30/121 (24%)

Query: 131 NFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLMDRMVLNEA 180
           NF++AD+R     G  FNGA LE          + +AY A F   D SD          A
Sbjct: 67  NFSNADLR-----GGVFNGALLEGVNLHGVDFRQGIAYLARFKNTDFSD----------A 111

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
            LT+A+++RT     D     + GADF++A++D+ Q + LC  A G N  TGV TR+SLG
Sbjct: 112 VLTDAMMLRTTFDDVD-----VTGADFTNAILDMTQVKKLCVNARGVNSQTGVDTRESLG 166

Query: 241 C 241
           C
Sbjct: 167 C 167


>gi|86605126|ref|YP_473889.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86553668|gb|ABC98626.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 176

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 62/110 (56%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A++R+SD S  K  GA L  A   KAN  GADL    +D   L  A+L  A L  ++
Sbjct: 66  FLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLRGATLDMANLQGADLREAQLQDSM 125

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +  + + G  I+GADF++A+I       LC+ A G NP+TG +TR +L C
Sbjct: 126 MWLARVEGIQIDGADFTNALIRQDALSILCERATGVNPVTGRATRDTLEC 175


>gi|409993003|ref|ZP_11276163.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409936150|gb|EKN77654.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 162

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 62/112 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F ++++  ++F  ++  G+   +A+        ADL+  ++D++  ++A+L++++   
Sbjct: 50  AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 109

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +   S      I GADF+DA+ D  Q + LC  A G N  TGV TR SLGC
Sbjct: 110 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 161


>gi|72382760|ref|YP_292115.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. NATL2A]
 gi|72002610|gb|AAZ58412.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 182

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 65/125 (52%), Gaps = 16/125 (12%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           DL+   +VK          D+   D  G+ F GAY   +    ++ TGA++++ +     
Sbjct: 54  DLQNTEYVKY---------DLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATR 104

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
            + ANLTN  L    L +S   G  I+GADF+DAV+D +Q++ LCK A G       ST 
Sbjct: 105 FDNANLTNVNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STA 157

Query: 237 KSLGC 241
           +SLGC
Sbjct: 158 ESLGC 162


>gi|124026482|ref|YP_001015597.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. NATL1A]
 gi|123961550|gb|ABM76333.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
          Length = 182

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 65/125 (52%), Gaps = 16/125 (12%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           DL+   +VK          D+   D  G+ F GAY   +    ++ TGA++++ +     
Sbjct: 54  DLQNTEYVKY---------DLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATR 104

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
            + ANLTN  L    L +S   G  I+GADF+DAV+D +Q++ LCK A G       ST 
Sbjct: 105 FDNANLTNVNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STA 157

Query: 237 KSLGC 241
           +SLGC
Sbjct: 158 ESLGC 162


>gi|33862899|ref|NP_894459.1| hypothetical protein PMT0626 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634815|emb|CAE20801.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 158

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 66/131 (50%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F + DLR            F  A++RE++ SGS   G+ L  A  + AN +  +L D+
Sbjct: 36  ADFSNQDLRGDT---------FNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDS 86

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  + +  +LTNAVL       +      I GADF++  +       LC+ A GTNPI
Sbjct: 87  TLDSAIFDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPI 146

Query: 231 TGVSTRKSLGC 241
           TG +T  +LGC
Sbjct: 147 TGRNTADTLGC 157


>gi|148239424|ref|YP_001224811.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
 gi|147847963|emb|CAK23514.1| Secreted pentapeptide repeat protein [Synechococcus sp. WH 7803]
          Length = 158

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 69/143 (48%), Gaps = 6/143 (4%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           FG+   +   + D  K V +  +F         F   ++RE+D SGS   GA L  A   
Sbjct: 16  FGLLLPSAEAAMDYAKQVLIGADFSNRDMQGVTFNLTNLREADLSGSDLQGASLYGAKLQ 75

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
            AN +  +L D  +D  VLN  +LT+AVL       +      I GADF++  +     +
Sbjct: 76  DANLSRTNLRDATLDSAVLNGTDLTDAVLEDAFAFNTRFIDVTISGADFTNVPLRGDVLK 135

Query: 219 ALCKYANGTNPITGVSTRKSLGC 241
            LC  A GTNP+TG  TR +LGC
Sbjct: 136 TLCAAAEGTNPVTGRDTRDTLGC 158


>gi|88808450|ref|ZP_01123960.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
 gi|88787438|gb|EAR18595.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
          Length = 159

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 58/110 (52%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F   ++RE+D SGS   GA L  A    AN +  +L D  +D  VLN  +LT+AVL    
Sbjct: 50  FNLTNLREADLSGSDLQGASLYGAKLQDANLSRTNLRDATLDSAVLNGTDLTDAVLEDAF 109

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +      I GADF++  +     + LC  A GTNP+TG  TR +LGC
Sbjct: 110 AFNTRFIDVTISGADFTNVPLRGDVLKTLCAAAEGTNPVTGRDTRDTLGC 159


>gi|87124267|ref|ZP_01080116.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
 gi|86167839|gb|EAQ69097.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
          Length = 183

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 61/117 (52%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           +E     F   ++RE+D SGS   GA L  A    A+ +  +L D  +D  VL+  NL+N
Sbjct: 67  REMQGVTFNLTNLREADLSGSDLQGASLFGAKLQDADLSNTNLRDATLDSAVLDGTNLSN 126

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           AVL       +      I GADF++  +     + LC  A GTNP+TG +TR +LGC
Sbjct: 127 AVLEDAFAFNTRFINVTISGADFTNVPLRGDVLKTLCAVAEGTNPVTGRNTRDTLGC 183


>gi|86609869|ref|YP_478631.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86558411|gb|ABD03368.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 176

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 62/110 (56%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A++R+SD S  K  GA L  A   KAN  GADL    +D   L  A+L  A L  ++
Sbjct: 66  FLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLRGATLDMANLQGADLREAQLQDSM 125

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +  + + G  I+GADF++A+I       LC+ A G NP+TG +TR +L C
Sbjct: 126 MWLARVEGIQIDGADFTNALIRQDALSILCERATGVNPVTGRATRDTLEC 175


>gi|291569983|dbj|BAI92255.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 170

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 62/112 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F ++++  ++F  ++  G+   +A+        ADL+  ++D++  ++A+L++++   
Sbjct: 58  AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 117

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +   S      I GADF+DA+ D  Q + LC  A G N  TGV TR SLGC
Sbjct: 118 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 169


>gi|148241708|ref|YP_001226865.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
 gi|147850018|emb|CAK27512.1| Secreted pentapeptide repeats protein [Synechococcus sp. RCC307]
          Length = 156

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 59/112 (52%), Gaps = 7/112 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ++F  A +R +DFSG+K +GA   +     +NF GADLSD LMDR      NL+   L  
Sbjct: 51  SSFAGAVVRNADFSGAKLHGAIFTQGAFAGSNFAGADLSDVLMDRADFTGTNLSGTNLSG 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            V   S    A IEGADF+ A++D   +  LC+ A G        TR SL C
Sbjct: 111 VVANGSSFAKAEIEGADFTGALLDRDDQITLCRKAKG-------ETRLSLDC 155


>gi|411119230|ref|ZP_11391610.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410711093|gb|EKQ68600.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 192

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 62/112 (55%), Gaps = 4/112 (3%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT A++RES+F G+  +G     A    AN  GADL +  +D   L+ +NL NA L    
Sbjct: 82  FTKANLRESNFRGADLHGVSFFGANLEGANLEGADLRNATLDTARLSRSNLKNANLEGAF 141

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGTNPITGVSTRKSLGC 241
              +   GA I+GADF+   +D+ Q  + ALC  A GTNP T  +TR +L C
Sbjct: 142 AFNAKFDGATIDGADFTG--VDMRQDVQHALCDRAAGTNPTTKRNTRDTLNC 191


>gi|209525582|ref|ZP_03274120.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423065234|ref|ZP_17054024.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493915|gb|EDZ94232.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406713366|gb|EKD08537.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 177

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 62/112 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F ++++  ++F  S+  G+   +A+        ADL+  ++D++  ++A+L++++   
Sbjct: 65  AEFANSNLEYANFDESELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +   S      I GADF+DA+ D  Q + LC  A G N  TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176


>gi|33863821|ref|NP_895381.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9313]
 gi|33635404|emb|CAE21729.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9313]
          Length = 209

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 67/122 (54%), Gaps = 6/122 (4%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E  + +    D+ E+D  GS F+   L+ A     N  G +L D L      + A+L+ +
Sbjct: 87  EFVKYDLAGYDLSEADLRGSTFSVTSLKNA-----NLHGTNLEDVLAYATRFDNADLSES 141

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNS 244
           +L    L +S+  GA+I+GADF++A++D  +++ALC  A G N  TGV T  SL C G S
Sbjct: 142 ILRNANLRKSEFAGALIDGADFTNALLDKQEQKALCARATGKNSKTGVDTYSSLDCSGIS 201

Query: 245 RR 246
            R
Sbjct: 202 ER 203


>gi|119511352|ref|ZP_01630465.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
 gi|119463974|gb|EAW44898.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
          Length = 164

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 20/116 (17%)

Query: 131 NFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF++AD R   F+GS+  G  L        +AY   F GADL+D          A  TNA
Sbjct: 63  NFSNADFRGGVFNGSRLEGVNLHGVDFSDGIAYLTQFKGADLTD----------AVFTNA 112

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +++R+V    D     I GADF++A++D  Q + LC  A+G N  TG  TR+SL C
Sbjct: 113 MMLRSVFDDVD-----ITGADFTNAILDGTQIKKLCTQASGVNSQTGADTRESLEC 163


>gi|384252144|gb|EIE25621.1| hypothetical protein COCSUDRAFT_83628, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 122

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 66/117 (56%), Gaps = 1/117 (0%)

Query: 126 ENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           ++FR      AD+R ++FS +   GA L  A    A F GA L++  ++ +    A+L+ 
Sbjct: 5   KDFRGQKLYKADLRGTNFSKANMEGASLFGAFCKDAKFVGAHLNNADLESVDFENADLSE 64

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           A+L    +T +      I G+D++D V+    +Q LCK A+GTNPITG  TR++L C
Sbjct: 65  AILEGAQVTNAKFKNVNIAGSDWTDVVLRRDVQQQLCKIASGTNPITGQDTRETLIC 121


>gi|428180855|gb|EKX49721.1| hypothetical protein GUITHDRAFT_135885 [Guillardia theta CCMP2712]
          Length = 244

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/235 (29%), Positives = 105/235 (44%), Gaps = 35/235 (14%)

Query: 20  SSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAA 79
           S KGP+ L  + KP+  +   + + E+D    D                +  VS AL +A
Sbjct: 28  SLKGPHALSGM-KPVTRSHPAAVRMEADADAFDAK--------------KFAVSLALGSA 72

Query: 80  VVASCSSNISALADLNKYEAETRGEFGI--GSAAQFGSAD----LRKAVHVKENFRAN-- 131
           ++ S    I A A       +  G F +  G+A+   S       R A+    NF     
Sbjct: 73  LLFSSGMPIPAFA-------QQGGSFKVLKGAASTQDSGSRRTITRGALLEGSNFDGQNL 125

Query: 132 ----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
               F  +  R+  F G+   GA         AN  GAD+S+   +   + +ANL NA++
Sbjct: 126 PGISFQQSLCRDCSFVGTNLKGASFFDGDLTNANMEGADVSNVNFELTCMKDANLKNAIV 185

Query: 188 VRT-VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
               + + + L G  IEGADF+D  +   Q++ LCK A+GTNP TGV T+ SL C
Sbjct: 186 NNAYIQSTTKLDGINIEGADFTDTELRKDQQRYLCKRASGTNPKTGVDTKDSLRC 240


>gi|124022089|ref|YP_001016396.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9303]
 gi|123962375|gb|ABM77131.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9303]
          Length = 202

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 67/122 (54%), Gaps = 6/122 (4%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E  + +    D+ E+D  GS F+   L+ A     N  G +L D L      + A+L+ +
Sbjct: 80  EFVKYDLAGYDLSEADLRGSTFSVTTLKNA-----NLHGTNLEDVLAYATRFDNADLSES 134

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNS 244
           +L    L +S+  GA+I+GADF++A++D  +++ALC  A G N  TGV T  SL C G S
Sbjct: 135 ILRNANLRKSEFAGALIDGADFTNALLDRQEQKALCARATGKNSKTGVDTYTSLDCSGIS 194

Query: 245 RR 246
            R
Sbjct: 195 ER 196


>gi|37523524|ref|NP_926901.1| hypothetical protein gll3955 [Gloeobacter violaceus PCC 7421]
 gi|35214528|dbj|BAC91896.1| gll3955 [Gloeobacter violaceus PCC 7421]
          Length = 159

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 78/175 (44%), Gaps = 18/175 (10%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
           WR  V   LAA +V      +SA AD+                  +  A L      ++N
Sbjct: 2   WRSGVLAGLAAGLV--LPGLVSAQADIQN---------------NYNGAYLEGRSVAEQN 44

Query: 128 FR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
            + A F  A++R  DFS S   GA L  A    ANF  A L D  +    L  A L  AV
Sbjct: 45  LKQAQFYKANLRGVDFSSSDLRGASLFAASLRGANFNKARLDDAELSNADLQGAKLDQAV 104

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L    +T + L    ++GADF+  +I+  QK   C  A GTN +T   TR++LGC
Sbjct: 105 LAGAYMTAARLKDVSVDGADFTGTIINNQQKTYQCGRATGTNGLTKRQTRRTLGC 159


>gi|427710138|ref|YP_007052515.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427362643|gb|AFY45365.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 164

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 61/112 (54%), Gaps = 10/112 (8%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF++AD+R     G  FNG  LE       N  G D S+ +        A+L++AVL  
Sbjct: 62  ANFSNADLR-----GGVFNGIVLEGV-----NMHGVDFSNGIAYLARFKNADLSDAVLTD 111

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            ++ RS      I GADF++AV+D  Q + LC  A+G N  T V TR+SLGC
Sbjct: 112 AMMLRSTFDNVEITGADFTNAVLDGTQVKKLCAKASGVNSKTSVDTRESLGC 163


>gi|427701765|ref|YP_007044987.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427344933|gb|AFY27646.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 175

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  ADLR            F   ++R++D SG+   GA L  A    A+ +G+DL D 
Sbjct: 53  ADFHGADLRGVT---------FNLTNLRDADLSGADLRGASLFGAKLQDADLSGSDLRDA 103

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  V    +L NA L       +   G +I+GADF++  +      +LC  A+GTNP+
Sbjct: 104 TLDSAVFEGTDLRNARLDDAFAFNTKFRGVLIDGADFTNVPLRGDALTSLCAAASGTNPV 163

Query: 231 TGVSTRKSLGC 241
           TG  TR +L C
Sbjct: 164 TGRLTRDTLNC 174


>gi|376005445|ref|ZP_09782948.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375326159|emb|CCE18701.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 177

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 62/112 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F ++++  ++F  ++  G+   +A+        ADL+  ++D++  ++A+L++++   
Sbjct: 65  AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            +   S      I GADF+DA+ D  Q + LC  A G N  TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176


>gi|414077638|ref|YP_006996956.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
 gi|413971054|gb|AFW95143.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
          Length = 165

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 20/116 (17%)

Query: 131 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF +AD+R + F+G+      F+G    + +AY + F  +DLSD          A  T A
Sbjct: 64  NFNNADLRGAVFNGTLLDTVNFHGVDFSQGIAYLSRFKNSDLSD----------AVFTEA 113

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +++R+   + D     + GADF++A++D+ Q + +C  A+G N  TGV TR SLGC
Sbjct: 114 MMLRSTFDQVD-----VTGADFTNAILDMIQIKKICINASGVNSKTGVDTRASLGC 164


>gi|148242416|ref|YP_001227573.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
 gi|147850726|emb|CAK28220.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
          Length = 162

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 62/131 (47%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F S DL+            F   ++RE+D SGS    A L  A    AN +G+DL + 
Sbjct: 40  ADFSSRDLKGVT---------FNLTNLREADLSGSDLRAASLFGAKLQDANLSGSDLREA 90

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  V N  +L++A L       +   G  I GADFSD  +       LC  A GTN +
Sbjct: 91  TLDSAVFNGTDLSDARLEGAFAFNTRFSGVTITGADFSDVPLRGDALSTLCAVAEGTNSV 150

Query: 231 TGVSTRKSLGC 241
           TG  TR +LGC
Sbjct: 151 TGRDTRDTLGC 161


>gi|224112717|ref|XP_002316270.1| predicted protein [Populus trichocarpa]
 gi|222865310|gb|EEF02441.1| predicted protein [Populus trichocarpa]
          Length = 219

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 101/210 (48%), Gaps = 38/210 (18%)

Query: 49  QFPDCSNNQCAGPYAKLKNWRV---FVSTALAAAVVASCSSNISALA--DLNKYEAE-TR 102
           +F   S+++C  P A + N ++   F  T L A +  S      ALA      Y +E TR
Sbjct: 30  RFLSLSHSRCPNPQALILNKQLLEDFAKTGLLALLSVSLFFTDPALAFKGGGPYGSEVTR 89

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           G+   G        D       K++F+ +     +R+++F G+K  GA       + A+ 
Sbjct: 90  GQDLTGK-------DFSGRTLTKQDFKTSI----LRQANFKGAKLLGASF-----FDADL 133

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----------IEGADFSDAV 211
           TGADLSD       L  A+L+ A + +  L+ ++L GA+           I GADF+D  
Sbjct: 134 TGADLSDA-----DLRSADLSLANVAKVNLSNANLEGALATGNTSFRGSNITGADFTDVP 188

Query: 212 IDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +   Q++ LCK A+G NP TG +TR +L C
Sbjct: 189 LREDQREYLCKVADGVNPTTGNATRDTLLC 218


>gi|449018747|dbj|BAM82149.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 269

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 64/115 (55%), Gaps = 2/115 (1%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + +F+ +  R+++FSGS  +GA   KA   +ANF  A L    +++ VL  +N  NAVL 
Sbjct: 153 QKDFSGSTCRKTNFSGSDLSGARFFKADLTEANFENAQLIGASLEQTVLRGSNFQNAVLR 212

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
            T  T S L  A IE  D++DA+++   +  LC    A G N +T   TR+SL C
Sbjct: 213 STYWTESVLTIANIENTDWTDALLEPTWQMKLCSRSDAKGMNTLTNTDTRESLMC 267


>gi|33240260|ref|NP_875202.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
 gi|33237787|gb|AAP99854.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
          Length = 158

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 68/139 (48%), Gaps = 6/139 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           + + F S D  K   V E+F       A F   +++++D SGS   GA L  A    +N 
Sbjct: 19  TQSSFASIDYGKQTLVGEDFSKLDLKGATFYLTNLQDADLSGSDLEGASLFGAKLLNSNL 78

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           + A+L +  +D  V    NL NAVL    +  +      I+G+DF++ ++       LC 
Sbjct: 79  SNANLHNATLDSAVFEGTNLENAVLEDAFVFNARFSDVNIQGSDFTNVILRNQDLSYLCS 138

Query: 223 YANGTNPITGVSTRKSLGC 241
            ANGTNP+T   T+ +L C
Sbjct: 139 IANGTNPVTKRKTKDTLQC 157


>gi|282900932|ref|ZP_06308865.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281194023|gb|EFA68987.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 164

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 65/117 (55%), Gaps = 20/117 (17%)

Query: 130 ANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           ANF++AD+R   F+G+       +G      ++Y +NF  ++LSD +           TN
Sbjct: 62  ANFSNADLRGVVFNGTLLIDTNLHGVNFSDGISYLSNFKNSNLSDAI----------FTN 111

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           A+++R+     D     I GADF++A++D  + + LC  A+G N  TGV TR+SLGC
Sbjct: 112 AMMLRSTFNNVD-----ITGADFTNAILDGVEVKKLCADASGVNSQTGVDTRESLGC 163


>gi|427714384|ref|YP_007063008.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
 gi|427378513|gb|AFY62465.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
          Length = 177

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 65/131 (49%), Gaps = 11/131 (8%)

Query: 112 QFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
            F   DLR +   K N F +N +  D+R     G  F  A LE A     + TGADL   
Sbjct: 54  DFSGKDLRDSEFTKANLFHSNLSHTDLR-----GVSFFAANLETA-----DLTGADLRVA 103

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D     +ANLT+A L       +   GAII+GADF+D  +    ++ LC  A G NP+
Sbjct: 104 TLDTARFTKANLTDANLEGAFAFNTIFDGAIIDGADFTDVDLRPDARKMLCSVAKGVNPV 163

Query: 231 TGVSTRKSLGC 241
           TG +T  +L C
Sbjct: 164 TGRATHDTLEC 174


>gi|332706397|ref|ZP_08426459.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332354834|gb|EGJ34312.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 126

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 63/114 (55%), Gaps = 1/114 (0%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +   T  ++ +++ + +K   A ++      AN  GADL+ +    +  N+A+LT+ +  
Sbjct: 12  KVKITYCNLDQANLADAKLIQASIKHTTLNNANLHGADLTKSDTYNISFNDADLTDVIFT 71

Query: 189 RTVLTRSDLGGAIIEGADFSDAVID-LAQKQALCKYANGTNPITGVSTRKSLGC 241
             +L R+   GA I GADF+  +I  + ++  LC  A+G NP TGV TR SLGC
Sbjct: 72  GALLQRASFDGADITGADFTSTLIQPVRERLKLCDVASGVNPTTGVVTRDSLGC 125


>gi|428306980|ref|YP_007143805.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428248515|gb|AFZ14295.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 160

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 66/139 (47%), Gaps = 21/139 (15%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 162
           S   F    L  +  V+ N    NF +AD+R   F+GS   G+ L  A     +AY A+F
Sbjct: 36  SGKDFSGQTLISSEFVEANLDNTNFNNADIRGVVFNGSTLKGSSLHSADFTNGLAYAADF 95

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           + ADLSD               AV   ++L +S      I G DFS  V+D    + LC 
Sbjct: 96  SNADLSD---------------AVFSESILLKSRFDEVNINGTDFSGVVLDGTNVKKLCD 140

Query: 223 YANGTNPITGVSTRKSLGC 241
            A+G N  TGV+TR SLGC
Sbjct: 141 VADGVNSKTGVATRASLGC 159


>gi|224006618|ref|XP_002292269.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971911|gb|EED90244.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 255

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 4/114 (3%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  + +R+SDFS S   GA    A    +NF  AD++   ++    N ANL NA++    
Sbjct: 140 FQQSIVRDSDFSNSNLYGASFFDATLDGSNFENADMTLCNVEMAQFNRANLKNAIVKDMY 199

Query: 192 LTRSDL--GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
           ++ + L  G   IEG+D+S+  +   Q++ LC +  A GTNP+TGV+TR+SL C
Sbjct: 200 VSGATLFEGVKDIEGSDWSETQLRKDQQKYLCNHPTAKGTNPVTGVNTRESLMC 253


>gi|449016903|dbj|BAM80305.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 341

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 68/128 (53%), Gaps = 11/128 (8%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA---------- 180
           + +S D+  +  +G+  +GA L  A  ++   +GA+L     D  +L+EA          
Sbjct: 211 DLSSVDLSTAALAGADLHGAALSHANLFQVQLSGANLRGAKFDASILDEAALDGADLSGA 270

Query: 181 NLTNAVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           +L  A++ RT+L  + L   I I+GADFS A+ID   ++ LC+ A G N  TGV+T  SL
Sbjct: 271 DLRQALVRRTLLLGARLDANISIDGADFSGALIDRTNQRLLCELAQGVNSRTGVATATSL 330

Query: 240 GCGNSRRN 247
            C   + N
Sbjct: 331 ACPEPKTN 338


>gi|302756827|ref|XP_002961837.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
 gi|300170496|gb|EFJ37097.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
          Length = 180

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 11/119 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN-----EANLT 183
           + +F ++ +R+++F G+K  GA       + AN TGAD SD  +    L+     +AN T
Sbjct: 66  KQDFKTSILRQANFKGAKLFGASF-----FDANLTGADFSDADLRGADLSLADATKANFT 120

Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L   ++T  + L GA I GADF+D +    Q+  LC+ A+G NP+T  STR++L C
Sbjct: 121 NANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIADGINPVTSNSTRETLLC 179


>gi|225449424|ref|XP_002282933.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic [Vitis
           vinifera]
 gi|296086195|emb|CBI31636.3| unnamed protein product [Vitis vinifera]
          Length = 221

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 72/123 (58%), Gaps = 11/123 (8%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
           K   + +F ++ +R+++F G+   GA       + A+ TGADLSD  +   D  + N  +
Sbjct: 103 KSLIKQDFKTSILRQANFKGANLLGASF-----FDADLTGADLSDADLRGADFSLANVTK 157

Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
           ANL+NA L   + T  +   G+II GADF+D  +   Q++ LCK A+G NP TG +TR++
Sbjct: 158 ANLSNANLEGALATGNTSFRGSIITGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 217

Query: 239 LGC 241
           L C
Sbjct: 218 LLC 220


>gi|302798106|ref|XP_002980813.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
 gi|300151352|gb|EFJ17998.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
          Length = 180

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 11/119 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN-----EANLT 183
           + +F ++ +R+++F G+K  GA       + AN TGAD SD  +    L+     +AN T
Sbjct: 66  KQDFKTSILRQANFKGAKLFGASF-----FDANLTGADFSDADLRGADLSLADATKANFT 120

Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L   ++T  + L GA I GADF+D +    Q+  LC+ A+G NP+T  STR++L C
Sbjct: 121 NANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIADGINPVTSNSTRETLLC 179


>gi|87302765|ref|ZP_01085576.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
 gi|87282648|gb|EAQ74606.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
          Length = 168

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 62/132 (46%), Gaps = 11/132 (8%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  ADLR       N R AN + ADMR +   G+K             A+  G DL +
Sbjct: 46  ADFHDADLRGVTFNLTNLRDANLSGADMRNASLFGAKLQ----------DADMHGVDLRE 95

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
             +D  VL   +L  AVL       +      IEGADF++  +      +LC  A+GTNP
Sbjct: 96  ATLDSAVLEGTDLREAVLEDAFAFNTKFVDVAIEGADFTNVPLRGDVLTSLCAIASGTNP 155

Query: 230 ITGVSTRKSLGC 241
           +TG  TR +LGC
Sbjct: 156 VTGRVTRDTLGC 167


>gi|332705869|ref|ZP_08425945.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332355661|gb|EGJ35125.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 150

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 57/115 (49%), Gaps = 2/115 (1%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A F + D+   D SG  F+ A         AN +  +     + ++    ANL  A   
Sbjct: 35  KATFANTDLSGQDLSGQDFHNAVFSSVNLQSANLSNVNFKGANITKVNFTNANLQGADFS 94

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI--TGVSTRKSLGC 241
              +   +  GA I GADF+ A++D  Q + LCK A+ TNPI  TGV TR SLGC
Sbjct: 95  YAFINVCNFKGANITGADFTFAILDSKQYRELCKNASATNPITDTGVDTRYSLGC 149


>gi|427723472|ref|YP_007070749.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427355192|gb|AFY37915.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 170

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 66/134 (49%), Gaps = 7/134 (5%)

Query: 115 SADLRKAVHVKENF-----RAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + D  K   ++E+F     R N +  + +R SDFS     G     A     NF GAD+ 
Sbjct: 36  AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGSDFSYCDLRGVRFFSANLEFVNFEGADMR 95

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 227
             ++D   +  AN TNA L    L    +    +I+GADF+DA+I   +   LC  A GT
Sbjct: 96  GAVLDSARIGHANFTNANLEGAYLASVKITPSTVIDGADFTDALILKNENDKLCDLATGT 155

Query: 228 NPITGVSTRKSLGC 241
           NP TGV T +SL C
Sbjct: 156 NPDTGVDTAESLYC 169


>gi|255645177|gb|ACU23086.1| unknown [Glycine max]
          Length = 222

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 72/123 (58%), Gaps = 11/123 (8%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
           K   + +F ++ +R+++F G+K  GA       + A+ TGADLSD  +   D  + N  +
Sbjct: 104 KTLIKQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTK 158

Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
           ANL+NA L   ++T  +   G+ + GADF+D  +   Q++ LCK A+G NP TG +TR +
Sbjct: 159 ANLSNANLEGALVTGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDT 218

Query: 239 LGC 241
           L C
Sbjct: 219 LFC 221


>gi|298492954|ref|YP_003723131.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298234872|gb|ADI66008.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 164

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 63/116 (54%), Gaps = 20/116 (17%)

Query: 131 NFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF+++D+R   F+G+   G  L      + +AY   F  AD SD +          LT+A
Sbjct: 63  NFSNSDLRGGVFNGTLLEGVNLHGVDFSQGIAYLVKFNNADFSDAI----------LTDA 112

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +++R+V    D     + GADF++A++D  + + LC  A+G N  T V TR+SLGC
Sbjct: 113 MMLRSVFDNVD-----VTGADFTNAILDGVEIKKLCLKASGVNSKTAVDTRESLGC 163


>gi|449441422|ref|XP_004138481.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
           [Cucumis sativus]
          Length = 214

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 75/142 (52%), Gaps = 15/142 (10%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G+         D      +K++F+ +     +R+++F G+   GA       + A+ TGA
Sbjct: 81  GVTRGQDLSGKDFSGKTLIKQDFKTSI----LRQANFKGANLLGASF-----FDADLTGA 131

Query: 166 DLSDTLM---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQA 219
           DLSD  +   D  + N  +ANL+NA L   + T  +   G+ I GADF+D  +   Q++ 
Sbjct: 132 DLSDADLRGADFSLANVTKANLSNANLEGALATGNTSFRGSTINGADFTDVPLREDQREY 191

Query: 220 LCKYANGTNPITGVSTRKSLGC 241
           LCK A+G NP TG +TR++L C
Sbjct: 192 LCKVADGVNPTTGNATRETLLC 213


>gi|170079322|ref|YP_001735960.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
 gi|169886991|gb|ACB00705.1| Pentapeptide repeat containing protein [Synechococcus sp. PCC 7002]
          Length = 166

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 65/134 (48%), Gaps = 7/134 (5%)

Query: 115 SADLRKAVHVKENF-----RAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + D  K   ++E+F     R N +  + +R  DFS S   G     A     NF GADL 
Sbjct: 32  AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGCDFSYSDLRGVRFFSANLEFVNFEGADLR 91

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 227
             ++D   +  AN  NA L    L    +    +IEGADF+DA+I   +   LC+ A+GT
Sbjct: 92  GAVLDSARIGHANFKNANLEGAFLASVKITPSTVIEGADFTDALILARENDKLCELASGT 151

Query: 228 NPITGVSTRKSLGC 241
           NP TG  T  +L C
Sbjct: 152 NPTTGRDTAATLYC 165


>gi|351722845|ref|NP_001236746.1| uncharacterized protein LOC100500352 [Glycine max]
 gi|255630103|gb|ACU15405.1| unknown [Glycine max]
          Length = 224

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 71/123 (57%), Gaps = 11/123 (8%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
           K   + +F ++ +R+++F G+K  GA       + A+ TGADLSD  +   D  + N  +
Sbjct: 106 KTLIKQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTK 160

Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
           ANL+NA L   + T  +   G+ I GADF+D  +   Q++ LCK A+G NP TG +TR +
Sbjct: 161 ANLSNANLEGALATGNTSFKGSNITGADFTDVPLREDQREYLCKVADGVNPTTGNATRDA 220

Query: 239 LGC 241
           L C
Sbjct: 221 LFC 223


>gi|298705858|emb|CBJ29003.1| thylakoid lumenal protein [Ectocarpus siliculosus]
          Length = 199

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 59/105 (56%), Gaps = 2/105 (1%)

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           E++FS   F    + KA A  +N+  AD ++ ++DR+  + +++  A+    VLT +   
Sbjct: 92  EANFSKGDFKEVVMSKAYARSSNWEEADFTNAVVDRVSFDGSSMKGAIFQNAVLTSTSFT 151

Query: 199 GAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
           GA +E ADF++A +    ++ LCK     GTNP+T   TR S GC
Sbjct: 152 GADVENADFTEAYMGDFDQKNLCKNPTLKGTNPVTNADTRASAGC 196


>gi|168022043|ref|XP_001763550.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685343|gb|EDQ71739.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 165

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 42/115 (36%), Positives = 68/115 (59%), Gaps = 1/115 (0%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            + +F ++ +R+++F G+K  GA    +    A+ T ADL    +    L++ANLTNA L
Sbjct: 50  IKQDFKTSILRQANFKGAKLLGASFFDSDLTGADLTDADLRGADLSLARLSKANLTNANL 109

Query: 188 VRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
               +T  + L G+II GADF++      Q++ LC  A+G NP+TG +TR++L C
Sbjct: 110 EGASVTGNTYLKGSIITGADFTEVNWRDDQRKELCLIADGVNPVTGNATRETLLC 164


>gi|388521435|gb|AFK48779.1| unknown [Lotus japonicus]
          Length = 225

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
           + +F ++ +R+++F G+K  GA       + ++ TGADLSD  +   D  + N  +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFFLANVTKANLS 165

Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L   + T  +   G+ I GADF+D  +   Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224


>gi|397595313|gb|EJK56448.1| hypothetical protein THAOC_23663 [Thalassiosira oceanica]
          Length = 238

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 2/111 (1%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M ++D S + F  A   K     +NF  AD ++ ++DR     ++L   +    VLT + 
Sbjct: 128 MSKTDLSKANFREAQFSKGYLRDSNFEEADFTNAIVDRATFKGSSLKGTIFSNAVLTATS 187

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGCGNSR 245
             GA +E ADF+DA I     + LCK     G NP+TG  TR S  CG  R
Sbjct: 188 FEGADVENADFTDAYIGDFDIRNLCKNPTLKGENPLTGADTRLSANCGPGR 238


>gi|359806262|ref|NP_001240959.1| uncharacterized protein LOC100806792 [Glycine max]
 gi|255626639|gb|ACU13664.1| unknown [Glycine max]
          Length = 222

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 71/123 (57%), Gaps = 11/123 (8%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
           K   + +F ++ +R+++F G+K  GA       + A+ TGADLSD  +   D  + N  +
Sbjct: 104 KTLIKQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTK 158

Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
           ANL+NA L   + T  +   G+ + GADF+D  +   Q++ LCK A+G NP TG +TR +
Sbjct: 159 ANLSNANLEGALATGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDT 218

Query: 239 LGC 241
           L C
Sbjct: 219 LFC 221


>gi|302837694|ref|XP_002950406.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
           nagariensis]
 gi|300264411|gb|EFJ48607.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
           nagariensis]
          Length = 182

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 63/113 (55%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +   T A++R+++F+ +   G  L  +++  A F GA+L +  ++      A+ TNAVL 
Sbjct: 69  KLKLTKANLRQTNFTDANLEGVSLFGSLSESAIFRGANLRNADLESGNYEFADFTNAVLE 128

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
              +  +      I G+D++D V+    ++ LC  A+G NP TGVSTR+SL C
Sbjct: 129 GAFVNNAQFVKVTITGSDWTDVVLRKDVQKELCAIADGVNPTTGVSTRESLLC 181


>gi|255570589|ref|XP_002526251.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
           [Ricinus communis]
 gi|223534416|gb|EEF36120.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
           [Ricinus communis]
          Length = 228

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 109/245 (44%), Gaps = 24/245 (9%)

Query: 3   LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGP 61
           +++IS PLS++SL    SS +  + +  L  P+ + C  S+       F +  +  C   
Sbjct: 1   MATISFPLSVRSL----SSERSRFPVPQLHPPIKIICSGSADGSKSKPFKELQSVACG-- 54

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGSADLR 119
              L  W V      +A+ V + S  +  L+ + N+ E    G   G  +       DLR
Sbjct: 55  --LLAAWAV-----TSASPVIAASQRLPPLSTEPNRCEKAFVGNTIGQANGVYDKPIDLR 107

Query: 120 KAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
              +  E  N +  +  +A M ++ F G+  +   + KA A  A+F G D S+ ++DR+ 
Sbjct: 108 FCDYTNEKSNLKGKSLAAALMSDAKFDGADMSEVVMSKAYAVGASFKGVDFSNAVLDRVN 167

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
             +ANL  AV   TVL+ S    A +  A F D +I     Q LCK     N    +  R
Sbjct: 168 FGKANLQGAVFKNTVLSGSTFDEAQLADAVFEDTIIGYIDLQKLCK-----NTSINLEGR 222

Query: 237 KSLGC 241
           + LGC
Sbjct: 223 EILGC 227


>gi|147774410|emb|CAN74472.1| hypothetical protein VITISV_013914 [Vitis vinifera]
          Length = 221

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 71/123 (57%), Gaps = 11/123 (8%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
           K   + +F ++ +R+++F  +   GA       + A+ TGADLSD  +   D  + N  +
Sbjct: 103 KSLIKQDFKTSILRQANFKXANLLGASF-----FDADLTGADLSDADLRGADFSLANVTK 157

Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
           ANL+NA L   + T  +   G+II GADF+D  +   Q++ LCK A+G NP TG +TR++
Sbjct: 158 ANLSNANLEGALATGNTSFRGSIITGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 217

Query: 239 LGC 241
           L C
Sbjct: 218 LLC 220


>gi|388510406|gb|AFK43269.1| unknown [Lotus japonicus]
          Length = 225

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
           + +F ++ +R+++F G+K  GA       + ++ TGADLSD  +   D  + N  +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFSLANVTKANLS 165

Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L   + T  +   G+ I GADF+D  +   Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224


>gi|428166498|gb|EKX35473.1| hypothetical protein GUITHDRAFT_97823, partial [Guillardia theta
           CCMP2712]
          Length = 230

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 2/119 (1%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           K+  + +F+    +E+ F G+K  G    KA    A+FTGADLS   ++   L+   L N
Sbjct: 112 KDFSKKDFSGCAAKEAKFVGTKLRGTRFFKADLTGADFTGADLSTASLEDAKLDGVVLKN 171

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
           A+L  +           I GADF+DA++       LCK   A GTNP+T   TR+SLGC
Sbjct: 172 AILSNSYTNLGLDKVKDISGADFTDALVRPDILAKLCKRSDATGTNPVTKADTRESLGC 230


>gi|413968546|gb|AFW90610.1| chloroplast thylakoid lumenal 17.4 kDa protein [Solanum tuberosum]
          Length = 228

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 66/231 (28%), Positives = 101/231 (43%), Gaps = 26/231 (11%)

Query: 3   LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGP 61
           ++SIS PL+ KS +   S    P QLH+   P+ + C  S          DCSN++ +  
Sbjct: 1   MASISIPLAYKSHSLRRSPIYRPSQLHS---PIQIKCSASK---------DCSNSEESS- 47

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISA-------LADLNKYEAETRGE-FGIGSAAQF 113
             + K  R      LA   ++S S  I+A         D N+ E    G   G  +    
Sbjct: 48  -TQFKQLRNVACGFLAVWALSSVSPVIAAGQRLPPLSTDPNRCERAFVGSTIGQANGVYD 106

Query: 114 GSADLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
              DLR   +  E  N +  +  +A M ++ F G+      + KA A  A+F   D S+ 
Sbjct: 107 KPLDLRFCDYTNEKTNLKGKSLAAALMSDAKFDGADMTEVIMSKAYAVGASFKAMDFSNA 166

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
           ++DR+   +ANL  A    TVL+ S    A ++G DF D +I     Q +C
Sbjct: 167 VLDRVNFEKANLQGASFKNTVLSGSTFNDAQLDGVDFEDTIIGYIDLQKIC 217


>gi|159467845|ref|XP_001692102.1| hypothetical protein CHLREDRAFT_115715 [Chlamydomonas reinhardtii]
 gi|158278829|gb|EDP04592.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 124

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 65/110 (59%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            T A++R+++ +G+   G  L  +++  A F GA+L +  ++     +A+ ++A+L    
Sbjct: 14  LTKANLRQTNLTGANLEGVSLFGSLSEGAVFKGANLRNADLESGNYEDADFSDAILEGAF 73

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +  +      I+G+D++D V+    ++ALC  A+G NP TGVSTR+SL C
Sbjct: 74  VNNAQFVRVNIKGSDWTDVVLRKDIQKALCAIADGVNPTTGVSTRESLMC 123


>gi|298715141|emb|CBJ27829.1| Thylakoid lumenal 15 kDa protein, chloroplast precursor (p15)
           [Ectocarpus siliculosus]
          Length = 245

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 52/146 (35%), Positives = 71/146 (48%), Gaps = 14/146 (9%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           AA  G    RK V    N   A++   D+    F  S   G   + A    A F  ADLS
Sbjct: 99  AASTGDKGARKTVTRGVNIENADYHDKDLSSVSFQQSLVRGTNFKNAKLVAAGFFDADLS 158

Query: 169 D-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGA------IIEGADFSDAVIDLAQK 217
           +       M++  L  ANL+ A +   ++T + + GA      IIEGADF+D  +   Q 
Sbjct: 159 NCNFESANMNQANLELANLSGANMKNALVTEAYVSGATKMEPAIIEGADFTDTFLRKDQV 218

Query: 218 QALC--KYANGTNPITGVSTRKSLGC 241
           + LC  + A GTNP++GV TR SLGC
Sbjct: 219 RYLCGLETAKGTNPVSGVDTRDSLGC 244


>gi|219116042|ref|XP_002178816.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409583|gb|EEC49514.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 109

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           + ++F  S   G    KA   +A+F+GADL    ++   ++EA L + V V    + S +
Sbjct: 3   KSTNFGKSNLKGCRFYKAYLVRADFSGADLRGASLEDTSMDEALLKDTVAVGAYFSASIM 62

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
               +E ADF+DA   +     LC+   A GTNP+TGV TR+SL C
Sbjct: 63  DTLTVENADFTDAQFPIKTLPLLCERSDATGTNPVTGVDTRESLMC 108


>gi|357133836|ref|XP_003568528.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
           [Brachypodium distachyon]
          Length = 200

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 11/119 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
           + +F ++ +R+++F G+K  GA       + A+ TGADLSDT +   D  + N  + NLT
Sbjct: 86  KQDFKTSILRQTNFKGAKLLGASF-----FDADLTGADLSDTDLRNADFSLANVTKVNLT 140

Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L   ++T  +   G+ I GADF+D  +   Q+  LCK A+G N  TG +T+++L C
Sbjct: 141 NANLEGALVTGNTSFKGSTIYGADFTDVPLRDDQRDYLCKIADGVNTTTGNATKETLFC 199


>gi|298711847|emb|CBJ32870.1| Pentapeptide repeat [Ectocarpus siliculosus]
          Length = 238

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/132 (37%), Positives = 64/132 (48%), Gaps = 16/132 (12%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           DL K  +  ++F  +      +E +FSGS   G    KA   KA+FTGA+L         
Sbjct: 113 DLSKGKYKSKDFSGSIA----KEVNFSGSDLRGVRFFKADLKKADFTGANLGTA-----S 163

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQKQALCKY--ANGTNP 229
           L EA+L   ++   V T S  G  +     I GADF+DA+I     + LC    A GTNP
Sbjct: 164 LEEADLEGTIMTNAVATGSYFGNNMNNVGDISGADFTDALIRKDVAKILCARPDAKGTNP 223

Query: 230 ITGVSTRKSLGC 241
            TG  TR SL C
Sbjct: 224 TTGTDTRDSLLC 235


>gi|219116308|ref|XP_002178949.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409716|gb|EEC49647.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 131

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 62/116 (53%), Gaps = 4/116 (3%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  + +R  DFS S   GA    A    +NF  ++L +  ++   L   +  NAV+    
Sbjct: 16  FQQSIVRNCDFSNSDLRGASFFDATLTDSNFENSNLENVNLEMAQLTRVSFKNAVVTDAY 75

Query: 192 LTRSDLGGAI--IEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGCGN 243
           ++ + +   +  +EG+D+S+  +   QK+ LC +  A GTNP+TGV TR+SL C N
Sbjct: 76  VSGATIFDGVKDVEGSDWSETYLRADQKKLLCNHPTAKGTNPVTGVDTRESLMCPN 131


>gi|255073547|ref|XP_002500448.1| predicted protein [Micromonas sp. RCC299]
 gi|226515711|gb|ACO61706.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 66/128 (51%), Gaps = 6/128 (4%)

Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DLR+  +V ++      + A M ++ F G+      + KA A  A+FTGA+ ++ ++DR+
Sbjct: 93  DLRQCNYVDKDLSTKTLSGALMVDATFKGANMTEVVMSKAYAVNADFTGANFTNAVVDRV 152

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
             + ANL+NA     V+T +   G  + GA F +A+I     + LC+     NP     T
Sbjct: 153 TFDGANLSNANFFNAVITGATFEGTNLAGAQFDEALIGKEDVKKLCE-----NPTLVEET 207

Query: 236 RKSLGCGN 243
           R  +GC N
Sbjct: 208 RFQVGCRN 215


>gi|217071608|gb|ACJ84164.1| unknown [Medicago truncatula]
          Length = 240

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 27/242 (11%)

Query: 13  SLNFCSSSSKGP-YQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVF 71
           SL+  + S+K P +   AL  P  + C +  + E DG      N        K+K     
Sbjct: 12  SLSIRNFSTKRPCFTTSAL--PFTITCSVVGEAELDGT----ENKPRLLSLNKIKGVACG 65

Query: 72  VSTALAAAVVASCSSNISALA--------DLNKYEAETRGE-FGIGSAAQFGSADLRKA- 121
           +   LAA  V S S  ++A          D N+ E    G   G  +     + DLRK  
Sbjct: 66  I---LAAYAVTSASFPVTAATQRLPPLSTDPNRCERAFVGNTIGQANGVYDKALDLRKCD 122

Query: 122 -VHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
             + K N +    ++A M ++ F G+      + KA A   +F G D S+ ++DR+   +
Sbjct: 123 FTNEKSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFGK 182

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           A+L  AV   TVL+ S    A +EGA F D +I     Q +C+     N   G   R  L
Sbjct: 183 ADLQGAVFRNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKICR-----NTTIGDEGRAEL 237

Query: 240 GC 241
           GC
Sbjct: 238 GC 239


>gi|159474024|ref|XP_001695129.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
 gi|158276063|gb|EDP01837.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
          Length = 185

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 5/110 (4%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           + ++D S +    A L KA A KANF GAD+++ ++DR+    ANL     + TV+T + 
Sbjct: 81  LADADLSNTNLQEAVLTKAYAVKANFEGADMTNAVVDRVDFTNANLKRVKFINTVVTGAS 140

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 246
             GA +EG+ + DA+I       LC+     NP     +R  +GC   R+
Sbjct: 141 FAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGESRAQVGCRAVRK 185


>gi|115434488|ref|NP_001042002.1| Os01g0144100 [Oryza sativa Japonica Group]
 gi|13486898|dbj|BAB40127.1| unknown protein [Oryza sativa Japonica Group]
 gi|113531533|dbj|BAF03916.1| Os01g0144100 [Oryza sativa Japonica Group]
 gi|215678959|dbj|BAG96389.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765141|dbj|BAG86838.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 198

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 69/120 (57%), Gaps = 11/120 (9%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
            R +F ++ +R+++F G+K  GA       + A+ TGADLSD  +   D  + N  + NL
Sbjct: 83  IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDADLRGADFSLANVSKVNL 137

Query: 183 TNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           TNA L   + T  +   G+ I GADF+D  +   Q++ LCK A+G N  TG +T+++L C
Sbjct: 138 TNANLEGALATGNTTFKGSNIYGADFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 197


>gi|443326649|ref|ZP_21055296.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442793770|gb|ELS03210.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 920

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 39/104 (37%), Positives = 56/104 (53%), Gaps = 1/104 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L  A  V+ N  RAN   A++  ++ +G+   GA LEKA+   ANF GA+L++
Sbjct: 801 ANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAILEGANFRGANLNE 860

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             +    L+EAN   A   R  L R D   A  +GADF  A++D
Sbjct: 861 ANLRGAHLSEANFQEADFDRADLQRVDFDRADFQGADFDRAIMD 904



 Score = 43.9 bits (102), Expect = 0.069,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%)

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A L +A  Y+AN   A+L    ++   L  ANL  A LVR  L  ++L GAI+EGA+   
Sbjct: 786 ANLYRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEK 845

Query: 210 AVIDLA 215
           A+++ A
Sbjct: 846 AILEGA 851



 Score = 43.9 bits (102), Expect = 0.081,   Method: Composition-based stats.
 Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           +RAN   A++  ++  G+   GA L +A   +AN   A+L    ++  +L  ANL  A+L
Sbjct: 789 YRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAIL 848

Query: 188 ----VRTV-LTRSDLGGAIIEGADFSDAVIDLAQKQ 218
                R   L  ++L GA +  A+F +A  D A  Q
Sbjct: 849 EGANFRGANLNEANLRGAHLSEANFQEADFDRADLQ 884


>gi|116785879|gb|ABK23895.1| unknown [Picea sitchensis]
 gi|116792150|gb|ABK26251.1| unknown [Picea sitchensis]
          Length = 239

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 6/118 (5%)

Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           K N R  +  +A M ++ F G+  +   + KA A  A+F G D S+ ++DR+   +AN+ 
Sbjct: 126 KTNLRGKSLAAALMSDAKFDGADMSEVIMSKAYAVGASFKGVDFSNAVIDRVNFGKANMQ 185

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +AV   TVL+ S    A +EGA F D +I     Q LC     TN       R  LGC
Sbjct: 186 DAVFRNTVLSGSTFVDANLEGAKFEDTIIGYIDLQKLC-----TNQTLSDEGRDILGC 238


>gi|307108672|gb|EFN56912.1| hypothetical protein CHLNCDRAFT_51710 [Chlorella variabilis]
          Length = 155

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 57/110 (51%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            + A M E+D SG+      L KA A  AN  GADL++ ++DR+  +  +L  A LV  V
Sbjct: 49  LSGAYMNEADMSGANMREVVLTKAYAVGANLRGADLTNAVIDRVAFDGVDLEGAQLVNAV 108

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +T +   GA ++ A+F DA+I     + LC      NP     +R  +GC
Sbjct: 109 ITGTTFTGANLKDANFEDALIGSEDAKRLC-----ANPTLVGESRDQVGC 153


>gi|425437827|ref|ZP_18818239.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9432]
 gi|389677087|emb|CCH93934.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9432]
          Length = 976

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 40/106 (37%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A   SA+L +A     N       RAN   A++ E++  G+   GAYLE A   +AN  G
Sbjct: 857 ANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERANLYG 916

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+L    ++R  L  ANL  A L    L R++L GA + GA+F DA
Sbjct: 917 ANLEGANLERANLERANLKGANLEGANLERANLEGAFLRGANFKDA 962



 Score = 43.9 bits (102), Expect = 0.081,   Method: Composition-based stats.
 Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 1/102 (0%)

Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           ++ +RAN   A++  ++  G+   GA LE+A    AN   A+L    + R  L  A L  
Sbjct: 787 RDLYRANLERANLERANLYGAYLYGANLERANLKGANLYMANLERANLYRAYLYRAYLYR 846

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
           A L R  L R++L  A +E A+   A ++ A  ++A  K AN
Sbjct: 847 AYLERAYLERANLYSANLERANLYMANLERANLERANLKRAN 888



 Score = 43.9 bits (102), Expect = 0.083,   Method: Composition-based stats.
 Identities = 34/102 (33%), Positives = 46/102 (45%), Gaps = 1/102 (0%)

Query: 110 AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
            A    A+L+ A     N  RAN   A +  +    +    AYLE+A  Y AN   A+L 
Sbjct: 811 GANLERANLKGANLYMANLERANLYRAYLYRAYLYRAYLERAYLERANLYSANLERANLY 870

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              ++R  L  ANL  A L    L  + L GA +EGA+   A
Sbjct: 871 MANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERA 912



 Score = 39.3 bits (90), Expect = 1.8,   Method: Composition-based stats.
 Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 5/93 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           +RA    A +  +    +    A LE+A  Y AN   A+L    + R  L EANL  A L
Sbjct: 840 YRAYLYRAYLERAYLERANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYL 899

Query: 188 VRTV-----LTRSDLGGAIIEGADFSDAVIDLA 215
                    L R++L GA +EGA+   A ++ A
Sbjct: 900 AGAYLEGANLERANLYGANLEGANLERANLERA 932



 Score = 38.1 bits (87), Expect = 4.5,   Method: Composition-based stats.
 Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 5/125 (4%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGA 150
           A+L +   E    +G    A    A+L +A     N + AN   A++  +    +    A
Sbjct: 792 ANLERANLERANLYG----AYLYGANLERANLKGANLYMANLERANLYRAYLYRAYLYRA 847

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           YLE+A   +AN   A+L    +    L  ANL  A L R  L  ++L GA + GA    A
Sbjct: 848 YLERAYLERANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGA 907

Query: 211 VIDLA 215
            ++ A
Sbjct: 908 NLERA 912


>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 189

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 65/123 (52%), Gaps = 13/123 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVA 157
           A  RG   + +   F  A+L++A     N  RAN + AD+  +D SG+   GA L  A  
Sbjct: 30  AHLRGAHLVEADLSF--ANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARL 87

Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGG-----AIIEGADF 207
            +AN TGA+L D L++R  L E     ANL NA  V + L R+DLG      A+ +GAD 
Sbjct: 88  MRANLTGANLRDALVNRADLTEALLVDANLRNAHFVESTLVRADLGDANALKAVFKGADL 147

Query: 208 SDA 210
           S A
Sbjct: 148 SGA 150



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   A + E+D S +    A L  A   +AN +GADL    +    L  ANLT A L+R
Sbjct: 30  AHLRGAHLVEADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMR 89

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             LT ++L  A++  AD ++A++
Sbjct: 90  ANLTGANLRDALVNRADLTEALL 112


>gi|159903302|ref|YP_001550646.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9211]
 gi|159888478|gb|ABX08692.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9211]
          Length = 158

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F   DLR A          F   +++ ++ SGS   GA L  A   K + +  +L + 
Sbjct: 36  ADFSDTDLRGAT---------FYLTNLQNANLSGSNLEGASLFGAKLLKTDLSNTNLKNA 86

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  +L+ A+LTNA L       +      I G+DF++ +I   Q+  LC  A+GTN +
Sbjct: 87  TLDSSILDGADLTNAYLEDAFAFNTQFKDVKISGSDFTNVLITNDQRNYLCSIASGTNSV 146

Query: 231 TGVSTRKSLGC 241
           +  +TR SL C
Sbjct: 147 STRNTRDSLEC 157


>gi|428219116|ref|YP_007103581.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427990898|gb|AFY71153.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 179

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 65/131 (49%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F   DLR A          F  A +R  +F+ +  +G  L  +    AN +GA+L  +
Sbjct: 57  ADFSGKDLRDA---------QFNKAVLRSVNFANANLSGVSLFGSDLTNANLSGANLRYS 107

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D   +   +L+NA+L    +  +      I GADF+D  +    ++ LC+ A GTNP 
Sbjct: 108 SLDTSRMVGTDLSNAILEGAFVYGAKFKNLKIAGADFTDVDLRETIREELCEVATGTNPT 167

Query: 231 TGVSTRKSLGC 241
           TG  TR++LGC
Sbjct: 168 TGRDTRETLGC 178


>gi|18406661|ref|NP_566030.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
 gi|20141847|sp|O22160.2|TL15A_ARATH RecName: Full=Thylakoid lumenal 15 kDa protein 1, chloroplastic;
           AltName: Full=p15; Flags: Precursor
 gi|20196925|gb|AAM14836.1| pentapeptide repeat family protein [Arabidopsis thaliana]
 gi|330255391|gb|AEC10485.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
          Length = 224

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 11/120 (9%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
            R +F ++ +R+++F G+K  GA       + A+ TGADLS+  +   D  + N  + NL
Sbjct: 109 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNL 163

Query: 183 TNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           TNA L   TV   +   G+ I GADF+D  +   Q+  LCK A+G N  TG +TR +L C
Sbjct: 164 TNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223


>gi|222423354|dbj|BAH19651.1| AT2G44920 [Arabidopsis thaliana]
          Length = 224

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 11/120 (9%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
            R +F ++ +R+++F G+K  GA       + A+ TGADLS+  +   D  + N  + NL
Sbjct: 109 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGGDFSLANVTKVNL 163

Query: 183 TNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           TNA L   TV   +   G+ I GADF+D  +   Q+  LCK A+G N  TG +TR +L C
Sbjct: 164 TNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223


>gi|260434702|ref|ZP_05788672.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
 gi|260412576|gb|EEX05872.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
          Length = 160

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 1/112 (0%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  +++RE++ SGS   GA L  A    A+ +G DL +  +D  V+   NL +AVL  
Sbjct: 49  ATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTNLEDAVLEG 108

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
                +     +I GADF+D        + L +    TN +TG STR+SLGC
Sbjct: 109 AFAFNTRFRDVLITGADFTDVPCAGTNSKPL-RRCRRTNSVTGRSTRESLGC 159


>gi|449456995|ref|XP_004146234.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
           [Cucumis sativus]
 gi|449522387|ref|XP_004168208.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
           [Cucumis sativus]
          Length = 237

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 5/108 (4%)

Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
           +A M ++ F G+  +   + KA A  A+F G D S+ ++DR+   +ANL  A+   TVL+
Sbjct: 134 AALMSDAKFDGADLSEVVMSKAYAVGASFKGVDFSNAVLDRVNFGKANLQGALFKNTVLS 193

Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            S    A +E A F D +I     Q LC      NP      R  LGC
Sbjct: 194 GSTFDDAQLEDAVFEDTIIGYIDLQKLC-----VNPTISPEGRAELGC 236


>gi|219130181|ref|XP_002185250.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403429|gb|EEC43382.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 235

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 2/107 (1%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M  +D S + F  AY  K     +   GAD ++ ++DR     ++L  A+    VLT + 
Sbjct: 128 MTNTDASNANFAEAYFSKGYLRDSMLDGADFTNAIVDRATFKGSSLRGAIFANAVLTGTG 187

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
             GA +E ADF+DA I     + LCK     G NP TG  TR S  C
Sbjct: 188 FEGADVENADFTDAYIGDFDIRLLCKNPTLKGENPKTGADTRMSANC 234


>gi|297824527|ref|XP_002880146.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
           subsp. lyrata]
 gi|297325985|gb|EFH56405.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
           subsp. lyrata]
          Length = 226

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 67/120 (55%), Gaps = 11/120 (9%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
            R +F ++ +R+++F G+K  GA       + A+ TGADLS+  +   D  + N  + NL
Sbjct: 111 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNL 165

Query: 183 TNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           TNA L     T  +   G+ I GADF+D  +   Q++ LCK A+G N  TG +TR +L C
Sbjct: 166 TNANLEGATATGNTSFKGSNITGADFTDVPLRDDQREYLCKIADGVNATTGNATRDTLLC 225


>gi|126656956|ref|ZP_01728134.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
 gi|126621794|gb|EAZ92503.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
          Length = 1084

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 20/160 (12%)

Query: 99   AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
            A+ RG +  G  A  G ADL  A    +   A+ T AD+R +D +G+   GAYLE A   
Sbjct: 931  ADLRGAYLEG--ADLGGADLTGA----DLEGADLTGADLRGADLTGAYLEGAYLEGADLT 984

Query: 159  KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLA 215
             A+ TGA L    ++   L  A+LT A L    L  +DLGGA + GAD + A +   DL 
Sbjct: 985  GADLTGAYLEGAYLEGADLGGADLTGADLEGADLRGADLGGADLGGADLTGADLRGADLT 1044

Query: 216  Q-----------KQALCKYANGTNPITGVSTRKSLGCGNS 244
            +           KQ      NG + I      K LG G++
Sbjct: 1045 KTDLNEARYLTVKQVQEAKNNGKDAIYDEEMEKKLGLGDN 1084



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 33/86 (38%), Positives = 44/86 (51%)

Query: 130  ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            A+ T AD+  +D  G+   GAYLE A    A+ TGADL    +    L  A+LT A L  
Sbjct: 916  ADLTGADLTGADLEGADLRGAYLEGADLGGADLTGADLEGADLTGADLRGADLTGAYLEG 975

Query: 190  TVLTRSDLGGAIIEGADFSDAVIDLA 215
              L  +DL GA + GA    A ++ A
Sbjct: 976  AYLEGADLTGADLTGAYLEGAYLEGA 1001



 Score = 45.1 bits (105), Expect = 0.034,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 40/75 (53%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           ++ E+  +G+   GAYLE A    A+ TGADL+   ++   L  A L  A L    LT +
Sbjct: 892 ELYEAKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGA 951

Query: 196 DLGGAIIEGADFSDA 210
           DL GA + GAD   A
Sbjct: 952 DLEGADLTGADLRGA 966



 Score = 42.7 bits (99), Expect = 0.18,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 43/90 (47%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E + A  T AD+  +   G+   GA L  A    A+  GADL    ++   L  A+LT A
Sbjct: 892 ELYEAKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGA 951

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
            L    LT +DL GA + GA    A ++ A
Sbjct: 952 DLEGADLTGADLRGADLTGAYLEGAYLEGA 981


>gi|116792169|gb|ABK26257.1| unknown [Picea sitchensis]
          Length = 237

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 63/117 (53%), Gaps = 6/117 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNA 185
           N    D + S    +KF GA L  A  + A+ TGADLSD  +   D  + N  + NL+NA
Sbjct: 121 NLIQQDFKTSILRQAKFKGAKLIGASFFDADLTGADLSDADLRGADFSLANVTKVNLSNA 180

Query: 186 VLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            L   ++T  +   G+ I GADF+D  +   Q++ LC  A+G N  TG +TR +L C
Sbjct: 181 NLEGALVTGNTSFKGSNISGADFTDVPLRDDQRRYLCNIADGVNLTTGNATRDTLLC 237


>gi|302143933|emb|CBI23038.3| unnamed protein product [Vitis vinifera]
          Length = 232

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 71/249 (28%), Positives = 109/249 (43%), Gaps = 26/249 (10%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA  SI PLS++     SS  +  + +  L  P  ++C  S     D      S++Q   
Sbjct: 1   MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASW----DSPELKASSSQ--- 48

Query: 61  PYAKLKNWR---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGS 115
            + +LKN     + V    AA+ V + S  +  L+ + N+ E    G   G  +      
Sbjct: 49  -FKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKP 107

Query: 116 ADLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
            DLR   +  E  N +  +  +A M E+ F G+  +   + KA A  A+F G D ++ ++
Sbjct: 108 IDLRFCDYTNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVL 167

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+   +ANL  AV   TVL+ S    A +E A F D +I     Q +C     TN    
Sbjct: 168 DRVNFGKANLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSIN 222

Query: 233 VSTRKSLGC 241
              R  LGC
Sbjct: 223 ADGRAELGC 231


>gi|359490718|ref|XP_002275994.2| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic [Vitis
           vinifera]
          Length = 244

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 71/249 (28%), Positives = 109/249 (43%), Gaps = 26/249 (10%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA  SI PLS++     SS  +  + +  L  P  ++C  S     D      S++Q   
Sbjct: 13  MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASW----DSPELKASSSQ--- 60

Query: 61  PYAKLKNWR---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGS 115
            + +LKN     + V    AA+ V + S  +  L+ + N+ E    G   G  +      
Sbjct: 61  -FKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKP 119

Query: 116 ADLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
            DLR   +  E  N +  +  +A M E+ F G+  +   + KA A  A+F G D ++ ++
Sbjct: 120 IDLRFCDYTNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVL 179

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
           DR+   +ANL  AV   TVL+ S    A +E A F D +I     Q +C     TN    
Sbjct: 180 DRVNFGKANLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSIN 234

Query: 233 VSTRKSLGC 241
              R  LGC
Sbjct: 235 ADGRAELGC 243


>gi|397570889|gb|EJK47511.1| hypothetical protein THAOC_33758, partial [Thalassiosira oceanica]
          Length = 122

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 10/112 (8%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  + +R++DF G+   GA    A    ++F GAD++   ++  ++ E  ++ A L   V
Sbjct: 17  FQQSIVRDTDFRGTNLFGASFFDATLDGSDFEGADMTLCNVENAIVKEMYVSGATLFEGV 76

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
            +        IE +D+SD  +   Q++ LC++  A GTNP+TGV TR+SL C
Sbjct: 77  KS--------IENSDWSDTQLRKDQQKYLCEHPTAKGTNPVTGVDTRESLMC 120


>gi|356509222|ref|XP_003523350.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
           [Glycine max]
          Length = 240

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 8/128 (6%)

Query: 117 DLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           DLR+     E  N +  + ++A M ++ F G+      + KA A  A+F G D S+ ++D
Sbjct: 117 DLRQCDFTDEKTNLKGKSLSAALMSDAKFDGADMTEVVMSKAYAVGASFKGVDFSNAVLD 176

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
           R+   +A+L  AV   TVL+ S    A ++ A F D +I     Q LC     TN   G 
Sbjct: 177 RVNFEKADLEGAVFKNTVLSGSTFDDAKLDNAVFEDTIIGYIDLQKLC-----TNKTIGD 231

Query: 234 STRKSLGC 241
             R  LGC
Sbjct: 232 EWRVELGC 239


>gi|340707640|pdb|3N90|A Chain A, The 1.7 Angstrom Resolution Crystal Structure Of
           At2g44920, A Pentapeptide Repeat Protein From
           Arabidopsis Thaliana Thylakoid Lumen
          Length = 152

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 68/119 (57%), Gaps = 11/119 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
           R +F ++ +R+++F G+K  GA       + A+ TGADLS+  +   D  + N  + NLT
Sbjct: 30  RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNLT 84

Query: 184 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           NA L   T++  +   G+ I GADF+D  +   Q+  LCK A+G N  TG +TR +L C
Sbjct: 85  NANLEGATMMGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 143


>gi|428219581|ref|YP_007104046.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427991363|gb|AFY71618.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 508

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 74/140 (52%), Gaps = 9/140 (6%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A F  A+L  A   K +        ANF+ AD+R ++ SG+  NGA L +A   +AN 
Sbjct: 172 SVASFNGANLTGASLAKLDLSGLDLSDANFSGADLRGANLSGANLNGADLSRANLSRANL 231

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALC 221
           + A+LS T   R  LNEANL+ A L  + L+R+DL  A +  AD   A + +++   A  
Sbjct: 232 SRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANLSMSKLAGAYL 291

Query: 222 KYAN--GTNPITGVSTRKSL 239
             AN  GTN I+   TR  L
Sbjct: 292 VRANLLGTNLISADLTRAVL 311



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 54/99 (54%), Gaps = 1/99 (1%)

Query: 115 SADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADL +AV ++ + FRAN T A++  +D + +    A   +A    AN  G DL+   + 
Sbjct: 303 SADLTRAVLIEADLFRANLTEANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLT 362

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +   +A +  A+L++T L+ + L GA    A+ S A++
Sbjct: 363 GVYAIDAEIVGAILIKTNLSEASLAGANFVRANLSRAIL 401



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   AD+  ++ S SK  GAYL +A     N   ADL+     R VL EA+L  A L 
Sbjct: 268 RANLIKADLHGANLSMSKLAGAYLVRANLLGTNLISADLT-----RAVLIEADLFRANLT 322

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
              L+R+DL  A +  A F +A
Sbjct: 323 EANLSRADLNRANLTEASFIEA 344



 Score = 41.6 bits (96), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 6/101 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+    DL +A   + N  RAN T A +  +D        A L +A   +AN  GA+LS 
Sbjct: 49  AELSRIDLSRADLSESNLKRANLTEAVLVGADLISINLGRATLTEANLNRANLIGANLSG 108

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                 +L EA+L    L  + LT++DL GA + GAD S A
Sbjct: 109 A-----ILVEADLARCDLRVSNLTKADLMGANLSGADLSVA 144



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 6/114 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L +A   + NF R     A++ E+  SGS  + A L +A   KA+  GA+L
Sbjct: 222 SRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANL 281

Query: 168 SDTLMDRMVLNEANL--TNAV---LVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           S + +    L  ANL  TN +   L R VL  +DL  A +  A+ S A ++ A 
Sbjct: 282 SMSKLAGAYLVRANLLGTNLISADLTRAVLIEADLFRANLTEANLSRADLNRAN 335



 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 54/184 (29%), Positives = 77/184 (41%), Gaps = 35/184 (19%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK-------------- 154
           S A     DL KA  V+    AN   A +  + F+G+   GA L K              
Sbjct: 147 SGANLSQVDLSKATLVE----ANLKDAKLSVASFNGANLTGASLAKLDLSGLDLSDANFS 202

Query: 155 ------AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
                 A    AN  GADLS   + R  L+ ANL+    VRT L  ++L  A + G++ S
Sbjct: 203 GADLRGANLSGANLNGADLSRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLS 262

Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK--LLD 266
            A  DL++   +    +G N    +S  K  G    R N  G   + L+SA   +  L++
Sbjct: 263 RA--DLSRANLIKADLHGAN----LSMSKLAGAYLVRANLLG---TNLISADLTRAVLIE 313

Query: 267 RDGF 270
            D F
Sbjct: 314 ADLF 317



 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 16/116 (13%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAY---------------LEK 154
           A    ADL +A   + +F  AN  SA++  +D + +   G Y               L +
Sbjct: 324 ANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLTGVYAIDAEIVGAILIKTNLSE 383

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A    ANF  A+LS  ++    L+EANL  A L    ++ ++L GA +E AD S A
Sbjct: 384 ASLAGANFVRANLSRAILSGASLSEANLGRANLYGANMSEANLSGANLENADLSRA 439


>gi|298243143|ref|ZP_06966950.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297556197|gb|EFH90061.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 338

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 54/90 (60%), Gaps = 10/90 (11%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++RE+DFSG+  +G+          + +GADLS  ++ R +L  A+L+ A+L  
Sbjct: 95  ANLVGANLREADFSGNDLSGS----------DLSGADLSRAILRRAILRRADLSEAILRD 144

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
            VL R+DL  A + GAD +DA +  A++ A
Sbjct: 145 AVLRRADLTDADLRGADLTDADLTGAKRDA 174


>gi|172036979|ref|YP_001803480.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354554778|ref|ZP_08974082.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171698433|gb|ACB51414.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353553587|gb|EHC22979.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 325

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 105/224 (46%), Gaps = 40/224 (17%)

Query: 3   LSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPY 62
           L+ IS +++K      +    P+QL  L++ +          E+D QF         G  
Sbjct: 95  LTQISGVTVKQFKLVKTH---PFQLEDLAEQI---------DENDPQFLLIERIMSQGG- 141

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
               N + F    L+ A++  C++N+  LADL   EA   G     S A    ADL  A 
Sbjct: 142 ----NDQDFREANLSGAIL--CNANL-ILADL--REANLMGTDL--SGANLMGADLSGAD 190

Query: 123 HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVA---------------YKANFTGAD 166
            +  N   AN   A++ E++ +G+    A L++A                  +AN  GA 
Sbjct: 191 LLGANLTGANLMGANLTEANLTGADLGDAILQEADLCWADLSEVNLIGADLSQANLKGAI 250

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           L+D+L+    LNEANL+ A+L R++L++++L G+I+   D ++A
Sbjct: 251 LTDSLLSHTNLNEANLSEAILNRSILSKTNLSGSILSQTDLTNA 294



 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 27/114 (23%)

Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           G+   F  A+L  A+       AN   AD+RE++  G+  +GA L       A+ +GADL
Sbjct: 141 GNDQDFREANLSGAILC----NANLILADLREANLMGTDLSGANL-----MGADLSGADL 191

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
                       ANLT A L+   LT ++L GA     D  DA++   Q+  LC
Sbjct: 192 LG----------ANLTGANLMGANLTEANLTGA-----DLGDAIL---QEADLC 227


>gi|326523645|dbj|BAJ92993.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524189|dbj|BAJ97105.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 200

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 15/131 (11%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---D 173
           D      +K++F+ +     +R+++F G+   GA       + A+ TGADLSD  +   D
Sbjct: 78  DFSGQTLIKQDFKTSI----LRQTNFKGANLLGASF-----FDADLTGADLSDADLRNAD 128

Query: 174 RMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
             + N  + NLTNA L   ++T  +   G+ I GADF+D  +   Q+  LCK A+G N  
Sbjct: 129 FSLANVTKVNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIADGVNTT 188

Query: 231 TGVSTRKSLGC 241
           TG +T+++L C
Sbjct: 189 TGNATKETLFC 199


>gi|449016876|dbj|BAM80278.1| similar to thylakoid lumenal 17.4 kD protein, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 288

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 59/128 (46%), Gaps = 18/128 (14%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLS---------------DTLMDRMVLNEA 180
           D+R  DFSG   +G  LE A A +A F    LS               D ++DR+    A
Sbjct: 161 DLRGRDFSGYDLSGVLLEGATADEARFRSTQLSKAYAPGFKCRRCDFEDAVVDRVNFENA 220

Query: 181 NLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRK 237
           +L+ +V    VL+ S    G  +   DF+D  I     + LC+    +G NP+TG  TR 
Sbjct: 221 DLSGSVFRNAVLSDSMFSDGTNVRDVDFTDVYIGEYGLRRLCRNPTLDGENPLTGAPTRA 280

Query: 238 SLGCGNSR 245
           SLGC   R
Sbjct: 281 SLGCRAER 288


>gi|413947393|gb|AFW80042.1| putative homeobox DNA-binding domain superfamily protein [Zea mays]
          Length = 202

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 66/126 (52%), Gaps = 5/126 (3%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           D      +K++F+ +     +R+++F G+   GA    A    A+ + ADL    +    
Sbjct: 80  DFSGQTLIKQDFKTSI----LRQANFKGANLLGASFFDADLTSADLSDADLRGADLSLAN 135

Query: 177 LNEANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
           L +ANL+NA L   + T  +   GA I GADF+D  +   Q++ LCK A+G N  TG  T
Sbjct: 136 LTKANLSNANLEGALATGNTSFKGADITGADFTDVPLRDDQREYLCKIADGVNSTTGNPT 195

Query: 236 RKSLGC 241
           +++L C
Sbjct: 196 KETLFC 201


>gi|384246084|gb|EIE19575.1| hypothetical protein COCSUDRAFT_31020 [Coccomyxa subellipsoidea
           C-169]
          Length = 203

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 60/126 (47%), Gaps = 25/126 (19%)

Query: 136 DMRESDFSGSKFNG--------------------AYLEKAVAYKANFTGADLSDTLMDRM 175
           D+R  DF+G   +G                      L KA A  ANF+GAD+++ ++DR+
Sbjct: 80  DLRMCDFTGKDLSGKTLSGALLKDAILPNSTMRETVLTKAYAVGANFSGADMTNAVIDRV 139

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
              +ANL+N   +  V+T +   GA ++GA F DA+I     + LC      NP     +
Sbjct: 140 DFRKANLSNVKFINAVITGTAFDGANLDGAIFEDALIGNEDVKRLC-----LNPTLTGES 194

Query: 236 RKSLGC 241
           R  +GC
Sbjct: 195 RMGVGC 200


>gi|218187501|gb|EEC69928.1| hypothetical protein OsI_00358 [Oryza sativa Indica Group]
          Length = 191

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 14/117 (11%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R +F ++ +R+++F G+K  GA       + A+ TGADLSD       L  A+ + A + 
Sbjct: 84  RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDA-----DLRGADFSLANVS 133

Query: 189 RTVLTRSDLGGAIIEG----ADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +  LT ++L GA+  G     DF+D  +   Q++ LCK A+G N  TG +T+++L C
Sbjct: 134 KVNLTNANLEGALATGNTTFKDFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 190


>gi|363807626|ref|NP_001241901.1| uncharacterized protein LOC100785667 [Glycine max]
 gi|255647148|gb|ACU24042.1| unknown [Glycine max]
          Length = 239

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 61/128 (47%), Gaps = 8/128 (6%)

Query: 117 DLRKA--VHVKENFRANFTSAD-MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           DLR+    + K N +    SA  M ++ F G+      + KA A  A+F G D S+ ++D
Sbjct: 116 DLRQCDFTNEKTNLKGKSPSAALMSDAKFDGADMTEVVMSKAYAAGASFKGVDFSNAVLD 175

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
           R+   +A+L  A+   TVL+ S    A ++ A F D +I     Q LC     TN   G 
Sbjct: 176 RVNFEKADLEGAIFKNTVLSGSPFDDAKLDNAVFEDTIIGYIDFQKLC-----TNKTIGD 230

Query: 234 STRKSLGC 241
             R  LGC
Sbjct: 231 EWRVELGC 238


>gi|302831317|ref|XP_002947224.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
           nagariensis]
 gi|300267631|gb|EFJ51814.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
           nagariensis]
          Length = 244

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 5/114 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
              A + ++D S +    A L KA A KANF  AD+++ ++DR+  + ANL       TV
Sbjct: 117 LAGALLADADLSNTNLQEAVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTV 176

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSR 245
           +T +   GA +EG+ + DA+I       LC+     NP     +R  +GC  SR
Sbjct: 177 VTGAQFAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGESRMQVGCRVSR 225



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 41/81 (50%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D+R   +SG   +G  L  A+   A+ +  +L + ++ +    +AN  NA +   V+ R 
Sbjct: 101 DLRLCSYSGKDLHGRVLAGALLADADLSNTNLQEAVLTKAYAVKANFENADMTNAVVDRV 160

Query: 196 DLGGAIIEGADFSDAVIDLAQ 216
           D  GA + G  F++ V+  AQ
Sbjct: 161 DFSGANLRGVRFNNTVVTGAQ 181



 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           A L KA  VK NF  A+ T+A +   DFSG+   G      V   A F GADL  ++ +
Sbjct: 135 AVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTVVTGAQFAGADLEGSVWE 193


>gi|168060251|ref|XP_001782111.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666451|gb|EDQ53105.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 158

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 57/110 (51%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++A M E+ F G+      + KA A  A+F G+  ++ ++DR+  +++++     + TV
Sbjct: 52  LSAALMSEAKFDGADLTEVIMSKAYAVGASFKGSVFTNAVVDRVAFDKSDMQGVQFINTV 111

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S   GA +EGA F +A+I     Q LCK     NP     +R  L C
Sbjct: 112 LSGSTFEGANLEGASFENALIGYVDIQKLCK-----NPTLPEESRIDLAC 156


>gi|307109822|gb|EFN58059.1| hypothetical protein CHLNCDRAFT_57123 [Chlorella variabilis]
          Length = 608

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 58/101 (57%), Gaps = 2/101 (1%)

Query: 126 ENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           ++ R N +T AD+R ++ S +   G  L  A+A  ANF+GA+L +  ++ + L  A+L+N
Sbjct: 49  QDLRKNKYTKADLRGTNLSNANLEGVTLFGALATNANFSGANLRNADLELVELEGADLSN 108

Query: 185 AVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYA 224
           AVL   +LT + LG    I GADF+D V        LC+ A
Sbjct: 109 AVLEGAMLTNAQLGRVKSITGADFTDVVFRKDVMMGLCRIA 149


>gi|440804190|gb|ELR25067.1| pentapeptide repeatcontaining protein [Acanthamoeba castellanii
           str. Neff]
          Length = 293

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 57/108 (52%), Gaps = 6/108 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AQ   ADLR+A        +AN   AD+RE++ SG+    A L  A+  +A+ +GA L +
Sbjct: 162 AQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAILRRADLSGAALPN 221

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 212
             + R  L  ANLT A L    LT +D     L GA + GAD S++ +
Sbjct: 222 VELQRASLRRANLTGANLTWATLTDADCTQANLSGANLSGADLSNSTL 269



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 15/98 (15%)

Query: 130 ANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AN   A+MRE          ++ SG+  + A L KA   +AN +GA+L +  ++   L +
Sbjct: 112 ANLKGANMREVQLASTNLTRANLSGANLHLARLGKAQLRRANLSGANLEEAQLEDADLRQ 171

Query: 180 ANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 212
           ANL NA + +  L  +D     L GA++  AD   A++
Sbjct: 172 ANLANAKMTKANLMHADLREANLSGAVMLRADLRSAIL 209



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 7/118 (5%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           Q  S +L +A     N   A    A +R ++ SG+    A LE A   +AN   A ++  
Sbjct: 123 QLASTNLTRANLSGANLHLARLGKAQLRRANLSGANLEEAQLEDADLRQANLANAKMTKA 182

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGT 227
            +    L EANL+ AV++     R+DL  AI+  AD S A + ++  ++A  + AN T
Sbjct: 183 NLMHADLREANLSGAVML-----RADLRSAILRRADLSGAALPNVELQRASLRRANLT 235



 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 44/96 (45%), Gaps = 12/96 (12%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD----------LSDTLMDRMVLNEA 180
           + T A +   D  G  F  A L +A     N TGA+          L+ T + R  L+ A
Sbjct: 78  DLTGARLFRCDLRGVDFQWANLTEATLTDCNLTGANLKGANMREVQLASTNLTRANLSGA 137

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           NL  A L +  L R++L GA +E A   DA  DL Q
Sbjct: 138 NLHLARLGKAQLRRANLSGANLEEAQLEDA--DLRQ 171


>gi|303279747|ref|XP_003059166.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459002|gb|EEH56298.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 213

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 6/126 (4%)

Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DLRK  +  ++      + A M ++ F G+      + KA A  A+FTGA+ ++ ++DR+
Sbjct: 91  DLRKCEYDGKDLSTKTLSGALMVDASFKGTNLTEVVMSKAYALNADFTGANFTNAVVDRV 150

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
             + ANL NA     V+T +   G  + GA F +A+I     + LC      NP     T
Sbjct: 151 TFDGANLANADFHNAVITGTTYEGTDLTGATFEEALIGKEDVKRLCD-----NPTVKGPT 205

Query: 236 RKSLGC 241
           R  +GC
Sbjct: 206 RFEVGC 211


>gi|388504750|gb|AFK40441.1| unknown [Lotus japonicus]
          Length = 239

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 6/118 (5%)

Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           K N +    ++A M ++ F G+      + KA A   +F G D S+ ++DR+   +A+L 
Sbjct: 126 KSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFEKADLQ 185

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            AV   TVL+ S    A +EGA F D +I     Q LC+     N       R  LGC
Sbjct: 186 GAVFKNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKLCR-----NKTIADDWRVELGC 238


>gi|302819846|ref|XP_002991592.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
 gi|300140625|gb|EFJ07346.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
          Length = 157

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 57/110 (51%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++A M ++ F G+      + KA A  A+F G D ++ ++DR+V ++A++  AV   TV
Sbjct: 51  LSAALMADAKFDGADMTEVVMSKAYAVGASFKGTDFTNAVLDRVVFDKADMKGAVFRNTV 110

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S   GA +E ADF +A+I     + LC      NP     +   L C
Sbjct: 111 LSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155


>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 351

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/148 (33%), Positives = 75/148 (50%), Gaps = 8/148 (5%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
           R F   +L AA+    + N   L+  N  EA       IG   S +Q   ADL  AV + 
Sbjct: 21  RNFSDISLVAAIFNEVTLNRINLSGANLSEALMVHTRLIGANLSRSQLSYADLSMAVLID 80

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-TLMDRMVLNEANLTN 184
               AN T A M E+    +  +GA L  A+  + N TG +L+  +L+   +LN + LT+
Sbjct: 81  ----ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCLLNGSQLTD 136

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+LV   LTRS L GA + GA+ + +++
Sbjct: 137 AILVGATLTRSVLSGAHMTGANLNRSIL 164



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 1/100 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A    AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLTGANL 249

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +   +    L  ANL+ A L    LT ++L GA +  AD 
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289



 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 40/81 (49%), Gaps = 15/81 (18%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
              A +  S  SG+   GA L +++  + + +GA+L+   + R+ LN+ NL+        
Sbjct: 139 LVGATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLS-------- 190

Query: 192 LTRSDLGGAIIEGADFSDAVI 212
                  GA + GAD S++VI
Sbjct: 191 -------GANLTGADLSESVI 204



 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 159
           A    A+L  A     N   AN T A++  ++ +G+  NG          A L KA    
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTGANLTGANLNGLTLQSADLRLANLSKADLRG 271

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN TGA+L+   +    L  ANLT+A L    L  + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322


>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
           platensis str. Paraca]
 gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
           platensis str. Paraca]
          Length = 220

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/167 (32%), Positives = 83/167 (49%), Gaps = 13/167 (7%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
           N+    YA+    R F   +L AA+    + N   L+  N  EA       IG   S +Q
Sbjct: 10  NKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-TL 171
              ADL  AV +     AN T A M E+    +  +GA L  A+  + N TG +L+  +L
Sbjct: 68  LSYADLSMAVLID----ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASL 123

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLA 215
           +   +LN + LT+A+LV   +TRS L GA + GA+ + ++   IDL+
Sbjct: 124 IGTCLLNGSQLTDAILVGATMTRSVLSGAHMTGANLNRSILSEIDLS 170



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 42/76 (55%), Gaps = 5/76 (6%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
              A M  S  SG+   GA L +++  + + +GA+L+   + R+ LN+ NL+ A      
Sbjct: 139 LVGATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGA-----N 193

Query: 192 LTRSDLGGAIIEGADF 207
           LT +DL  ++I+ ++F
Sbjct: 194 LTGADLSESVIQNSNF 209


>gi|224120874|ref|XP_002318440.1| predicted protein [Populus trichocarpa]
 gi|222859113|gb|EEE96660.1| predicted protein [Populus trichocarpa]
          Length = 240

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           K N +  +  +A M ++ F G+      + KA A  A+F G D S+ ++DR+   +A+L 
Sbjct: 127 KSNLKGKSLAAALMSDAKFDGADMTEVVMSKAYAVGASFRGVDFSNAVLDRVNFGKADLK 186

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            AV   TVL+ S    A +E A F D +I     Q +C+     N   G   R  LGC
Sbjct: 187 GAVFKNTVLSGSTFDEAQLEDAIFEDTIIGYIDLQKICR-----NTSIGPDGRAELGC 239


>gi|33240880|ref|NP_875822.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
 gi|33238409|gb|AAQ00475.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
           subsp. marinus str. CCMP1375]
          Length = 184

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 65/117 (55%), Gaps = 12/117 (10%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E  + + +  D+ ++D SGS F+ + L+KA     +  GA++ + +      + A+L+NA
Sbjct: 59  EYVKYDLSGRDLGDADLSGSYFSVSNLQKA-----DLRGANMQNVIAYATRFDNADLSNA 113

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 242
                 L +S   GA+I+G +F++AV+DL Q ++LC+ A G        T +SL CG
Sbjct: 114 NFSGAELLKSRFDGAVIDGTNFTNAVLDLPQVKSLCERATG-------QTAESLECG 163


>gi|242052129|ref|XP_002455210.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
 gi|241927185|gb|EES00330.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
          Length = 200

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 62/114 (54%), Gaps = 1/114 (0%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + +F ++ +R+++F G+   GA    A    A+ + ADL         L + NL+NA L 
Sbjct: 86  KQDFKTSILRQANFKGANLLGASFFDADLTSADLSDADLRGADFSLANLTKTNLSNANLE 145

Query: 189 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             ++T  +   GA I GADF+D  +   Q++ LCK A+G N  TG  T+++L C
Sbjct: 146 GALVTGNTSFKGANITGADFTDVPLRDDQREYLCKIADGVNSTTGNPTKETLFC 199


>gi|159903945|ref|YP_001551289.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9211]
 gi|159889121|gb|ABX09335.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9211]
          Length = 184

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 32/126 (25%)

Query: 126 ENFRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKANFTGADLSDTLMDRM 175
           E  + + +  D+ +++ SGS F+          GA L+  +AY   F  ADLS       
Sbjct: 59  EYVKYDLSGRDLGDANLSGSYFSVSSLKNADLRGANLQNVIAYATRFDNADLSG------ 112

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
               ANL+ A L+++V       GA+IEG DF++AV+DL Q ++LC+ A G        T
Sbjct: 113 ----ANLSGAELLKSVFN-----GAVIEGTDFTNAVLDLPQVKSLCERATG-------KT 156

Query: 236 RKSLGC 241
            +SL C
Sbjct: 157 AESLQC 162


>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
 gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
          Length = 378

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 6/111 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L+ A  ++ N   A+ + AD+R +D SG+    A L KA   +AN T  DL
Sbjct: 43  SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTNTDL 102

Query: 168 SDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           S   ++R      +L++ANL NA L  T L  +DLG A +E AD S+A +D
Sbjct: 103 SSANLNRASLDYALLSKANLINADLSGTNLVGADLGRANLENADLSNATLD 153



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 1/104 (0%)

Query: 111 AQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADL R ++   +  R N ++AD+  +D  G+  +GA LE A   KA+   A+L++
Sbjct: 40  ADLSNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTN 99

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           T +    LN A+L  A+L +  L  +DL G  + GAD   A ++
Sbjct: 100 TDLSSANLNRASLDYALLSKANLINADLSGTNLVGADLGRANLE 143



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 63/134 (47%), Gaps = 14/134 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADLR A     N       +A+   A++  +D S +  N A L+ A+  KAN 
Sbjct: 63  SNADLSWADLRGADLSGANLENANLSKASLDQANLTNTDLSSANLNRASLDYALLSKANL 122

Query: 163 TGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
             ADLS T      + R  L  A+L+NA L  ++L  ++ G A ++ A   +A I+ A  
Sbjct: 123 INADLSGTNLVGADLGRANLENADLSNATLDNSILISANFGAANLKKASLCNANIERASL 182

Query: 218 QA---LCKYANGTN 228
           +    +    NGTN
Sbjct: 183 EGANLISANLNGTN 196


>gi|223995969|ref|XP_002287658.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
           [Thalassiosira pseudonana CCMP1335]
 gi|220976774|gb|EED95101.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
           [Thalassiosira pseudonana CCMP1335]
          Length = 245

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 5/110 (4%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M ++D S  +F  A   K     +NF GAD ++ ++DR     ++L  AV    VLT + 
Sbjct: 128 MTKTDVSNGQFKEAQFSKGYLRDSNFDGADFTNAIVDRASFKGSSLKGAVFKNAVLTATS 187

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 246
             GA +E ADF+DA I     + LCK     NP   VS    +   N++R
Sbjct: 188 FEGADVENADFTDAYIGDFDIRTLCK-----NPTLKVSRFYRMTYRNAQR 232


>gi|254424332|ref|ZP_05038050.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
 gi|196191821|gb|EDX86785.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
          Length = 411

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 10/77 (12%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + + A+++E DFSG   +GA          N +GADLSDT M ++ LN ANL  A L R 
Sbjct: 298 DMSGANLKEKDFSGRNLSGA----------NLSGADLSDTFMHKVNLNRANLRKARLFRA 347

Query: 191 VLTRSDLGGAIIEGADF 207
            L ++DL  A + GAD 
Sbjct: 348 NLLQADLSHADLSGADL 364


>gi|323452967|gb|EGB08840.1| hypothetical protein AURANDRAFT_25565 [Aureococcus anophagefferens]
          Length = 176

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 70/139 (50%), Gaps = 11/139 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           SA   G  D  +A    ++F        +F+  D  +++F+ SK  GA   KA   +A+F
Sbjct: 42  SAVSGGGKDYAEATIKGQDFSGKTFNNKDFSGCDAVDTNFAKSKLRGARFFKADLARADF 101

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           +GADLS   ++   L    LT A+   T  +++ L    + GADF+DAVI    ++ LC 
Sbjct: 102 SGADLSAASLEGANLEGTKLTGALAEGTAFSQTILDAGDLTGADFTDAVIQPYVQKGLC- 160

Query: 223 YANGTNPITGVSTRKSLGC 241
              G   +TG +TR SL C
Sbjct: 161 ---GRKDVTG-ATRDSLFC 175


>gi|427713339|ref|YP_007061963.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
 gi|427377468|gb|AFY61420.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
          Length = 327

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 59/121 (48%), Gaps = 15/121 (12%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A+   ADLR AV        N   AD+  +D  G+   GA L K    KAN TGADL+
Sbjct: 48  SGAKLQRADLRGAVLSA----INLNHADLIGADLRGAMLMGADLRKVNLRKANLTGADLT 103

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 227
                      ANLT A+L    LT +D+  AI+ GAD +   + LA+ +Q     AN T
Sbjct: 104 ----------RANLTGAILSEANLTAADMSQAILRGADLTLTDLTLAELEQVNLSQANLT 153

Query: 228 N 228
           N
Sbjct: 154 N 154



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 16/108 (14%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+   A+LR+A   + N R      A    AD+R +  + +   GA L +A+   AN  G
Sbjct: 205 ARLEGANLREATLTEANLRYACLDEACLIGADLRGASLARAMLRGAQLNEAILTGANLMG 264

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+LS          EA L  A L+  +LT ++L G  + G D S+ V+
Sbjct: 265 ANLS----------EAQLRGANLIEAILTGANLTGVDLTGVDLSETVM 302



 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 36/147 (24%)

Query: 111 AQFGSADLRKAVHVKENF----------------RANFTSADM-----RESDFSGSKFNG 149
           A    ADLRK    K N                  AN T+ADM     R +D + +    
Sbjct: 80  AMLMGADLRKVNLRKANLTGADLTRANLTGAILSEANLTAADMSQAILRGADLTLTDLTL 139

Query: 150 AYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTR 194
           A LE+    +AN T     GAD++D ++    L +A          NL  A L +T L  
Sbjct: 140 AELEQVNLSQANLTNAYLRGADMADAILLEATLIQANLRGANLRNCNLQGANLQKTNLRG 199

Query: 195 SDLGGAIIEGADFSDAVIDLAQKQALC 221
           ++L  A +EGA+  +A +  A  +  C
Sbjct: 200 ANLRQARLEGANLREATLTEANLRYAC 226


>gi|427731151|ref|YP_007077388.1| putative low-complexity protein [Nostoc sp. PCC 7524]
 gi|427367070|gb|AFY49791.1| putative low-complexity protein [Nostoc sp. PCC 7524]
          Length = 572

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 6/104 (5%)

Query: 111 AQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADL  A+    N      F  N T A +  +D S +K NGA L  A    A F G
Sbjct: 391 ADLSGADLSHAILNGTNLSDTILFSTNLTDASLMAADLSYAKLNGAKLIDAKLNGAMFLG 450

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           ADLS   + R+VLN+A+L+ ++L    L+ +DL  AI+ G D S
Sbjct: 451 ADLSGVDLSRVVLNDADLSGSILSEADLSSADLSDAILLGTDLS 494



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 47/81 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+  +D S +   GA L  A  Y+ +F+ ADLS   ++   +  A+L+ A L  
Sbjct: 296 ANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGANLRD 355

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T L R++L  AI+ GA+ SDA
Sbjct: 356 TQLCRTNLTNAILFGANLSDA 376



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A     N   AN T A +  +DFS +  +  +L  A    A+ +GA+L
Sbjct: 294 SGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGANL 353

Query: 168 SDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            DT + R      +L  ANL++A L    L+ +DL  A + GAD S A+++
Sbjct: 354 RDTQLCRTNLTNAILFGANLSDANLKHINLSHADLCRADLSGADLSHAILN 404



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF  A + +++ +G+ F GA L  A    AN TGA+  D  +    L +ANL+ A L  
Sbjct: 241 ANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDANLSGANLSG 300

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+ +DL  A + GA+ + A +
Sbjct: 301 ADLSSADLSSANLTGANLTGATL 323



 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 35/69 (50%)

Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
             G+ F GAYL  A    ANF GA+LS   +    L  AN  +A L    L  ++L GA 
Sbjct: 238 LKGANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDANLSGAN 297

Query: 202 IEGADFSDA 210
           + GAD S A
Sbjct: 298 LSGADLSSA 306



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 45/81 (55%), Gaps = 5/81 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  AD+   D S    N A L  ++  +A+ + ADLSD ++    L+ ANL +A    
Sbjct: 446 AMFLGADLSGVDLSRVVLNDADLSGSILSEADLSSADLSDAILLGTDLSFANLNSA---- 501

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ S+L GA++ GAD S+A
Sbjct: 502 -NLSGSNLSGAMLNGADLSEA 521


>gi|302779862|ref|XP_002971706.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
 gi|300160838|gb|EFJ27455.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
          Length = 157

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++A M ++ F G+      + KA A   +F G D ++ ++DR+V ++A++  AV   TV
Sbjct: 51  LSAALMADAKFDGADMTEVVMSKAYAVGGSFKGTDFTNAVLDRVVFDKADMKGAVFRNTV 110

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S   GA +E ADF +A+I     + LC      NP     +   L C
Sbjct: 111 LSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155


>gi|428226754|ref|YP_007110851.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427986655|gb|AFY67799.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 330

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 50/148 (33%), Positives = 70/148 (47%), Gaps = 24/148 (16%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNG 149
           L D N ++A+  G       A    ADLR A + +     AN   AD+R ++ SG+   G
Sbjct: 77  LVDANLHDADLHG-------ASLRGADLRGADLSLAVLLDANLMDADLRNANLSGADLTG 129

Query: 150 AYLEKA----------------VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
           A L  A                + YKA+  G +LS   + R+ L EANLT A L  T L+
Sbjct: 130 ACLRGANLRQEMRSQHTNLRGSILYKADLRGVNLSGADLTRVDLREANLTEASLRETDLS 189

Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALC 221
            +DL GA + GA  SDA ++ A  +  C
Sbjct: 190 GADLSGANLTGALLSDACLEGAILEGAC 217



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 1/94 (1%)

Query: 118 LRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           LR A   + N  +AN   A+++ +    ++  GA L++ +  +A  T  DLS   +    
Sbjct: 218 LRNAKLERANLSQANLFRANLQNALLPQARLTGAGLQQTIFAQAKLTDVDLSRADLFEAD 277

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           L EANLT A L RT LTR++L  A++  A+ S A
Sbjct: 278 LREANLTGAYLARTNLTRANLSDALLVRAELSSA 311



 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 21/109 (19%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAV 156
           Y+A+ RG            ADL + V ++E   AN T A +RE+D SG+  +G       
Sbjct: 154 YKADLRG-------VNLSGADLTR-VDLRE---ANLTEASLRETDLSGADLSG------- 195

Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
              AN TGA LSD  ++  +L  A L NA L R  L++++L  A ++ A
Sbjct: 196 ---ANLTGALLSDACLEGAILEGACLRNAKLERANLSQANLFRANLQNA 241



 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 4/100 (4%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
              ADLR +    + F A    A++R+++  G++ +GA L +A    AN   ADL    +
Sbjct: 37  LSQADLRSS----DLFFAYLNRANLRQANLLGARLSGANLSQATLVDANLHDADLHGASL 92

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
               L  A+L+ AVL+   L  +DL  A + GAD + A +
Sbjct: 93  RGADLRGADLSLAVLLDANLMDADLRNANLSGADLTGACL 132



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 42/85 (49%), Gaps = 15/85 (17%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N + AD+R SD     F  AYL +A   +AN  GA LS           ANL+ A LV  
Sbjct: 36  NLSQADLRSSDLF---F--AYLNRANLRQANLLGARLSG----------ANLSQATLVDA 80

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
            L  +DL GA + GAD   A + LA
Sbjct: 81  NLHDADLHGASLRGADLRGADLSLA 105



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 47/84 (55%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A  T A ++++ F+ +K     L +A  ++A+   A+L+   + R  L  ANL++A+LV
Sbjct: 245 QARLTGAGLQQTIFAQAKLTDVDLSRADLFEADLREANLTGAYLARTNLTRANLSDALLV 304

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
           R  L+ ++L  A ++ A   D  +
Sbjct: 305 RAELSSANLMDANLQRAVLPDGKV 328


>gi|158340319|ref|YP_001521675.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158310560|gb|ABW32174.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 284

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 73/141 (51%), Gaps = 16/141 (11%)

Query: 109 SAAQFGSADLRKAVHVK-ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S   F ++ L++++ +  + + A+F+ AD+R +DFS +K + A L++    +AN  GADL
Sbjct: 68  SGVNFKASKLQRSLAIWVQAYWADFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127

Query: 168 SDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
           SD++                  ANLTNA L    +  + L GA +  +D S + +     
Sbjct: 128 SDSVAQDSCFKGANLWGVWAQRANLTNACLSHVDMATAKLTGAQLLDSDLSWSCL----S 183

Query: 218 QALCKYANGTNP-ITGVSTRK 237
           QA+CK AN T+  + G   RK
Sbjct: 184 QAVCKGANLTSACLEGSDLRK 204


>gi|85860772|ref|YP_462974.1| pentapeptide repeat-containing protein [Syntrophus aciditrophicus
           SB]
 gi|85723863|gb|ABC78806.1| pentapeptide repeat domain protein [Syntrophus aciditrophicus SB]
          Length = 306

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   + DLR+A +H  +   AN T AD+++S+ S +  N   L      +AN +GADL
Sbjct: 157 SEANLSNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWTRL-----REANLSGADL 211

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S+  + R  L +ANL+ A LV   L R++L G  + GAD  +A +
Sbjct: 212 SEAYLKRADLRKANLSRANLVDANLNRANLRGTDLRGADLGNANL 256



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 50/85 (58%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A+ + A+++E+D SG+  + A L  A     N  GADLS+  ++   L+EA+L  A L 
Sbjct: 53  KADLSEANLQETDLSGANLHKADLNGANLKGVNLVGADLSEACLNGADLSEADLGKADLR 112

Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
           RT L++ +L G  +  A+ S+  +D
Sbjct: 113 RTCLSKVNLRGTKLIEANLSNTDLD 137



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 15/128 (11%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A  G ADLR+    K N R      AN ++ D+ E +  G       L  A   +AN 
Sbjct: 102 SEADLGKADLRRTCLSKVNLRGTKLIEANLSNTDLDEVELRGQNLRRTKLIGANLSEANL 161

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDAVIDLAQK 217
           +  DL +  +    L++ANLT A L ++ L++++L       A + GAD S+A +    K
Sbjct: 162 SNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWTRLREANLSGADLSEAYL----K 217

Query: 218 QALCKYAN 225
           +A  + AN
Sbjct: 218 RADLRKAN 225



 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 6/105 (5%)

Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +    +LR+   +  N   AN ++ D+RE+D  G+  + A L  A   K+N + A+L+ T
Sbjct: 140 ELRGQNLRRTKLIGANLSEANLSNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWT 199

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                 L EANL+ A L    L R+DL  A +  A+  DA ++ A
Sbjct: 200 -----RLREANLSGADLSEAYLKRADLRKANLSRANLVDANLNRA 239



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 50/110 (45%), Gaps = 26/110 (23%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL+K+   K N        AN + AD+ E          AYL++A   KAN 
Sbjct: 177 SDANLTGADLQKSNLSKANLNWTRLREANLSGADLSE----------AYLKRADLRKANL 226

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           + A+L D          ANL  A L  T L  +DLG A + GAD  +A +
Sbjct: 227 SRANLVD----------ANLNRANLRGTDLRGADLGNANLAGADLREANL 266



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 11/100 (11%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADLRKA     N  RAN   A++  ++  G+   GA L       AN  GADL
Sbjct: 212 SEAYLKRADLRKA-----NLSRANLVDANLNRANLRGTDLRGADL-----GNANLAGADL 261

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            +  + +  L  A L  A L  T L+ +D  G  +  AD 
Sbjct: 262 REANLGKTCLRGARLQGAKLNETDLSDADFTGVDLSEADL 301


>gi|212721648|ref|NP_001132583.1| uncharacterized protein LOC100194054 [Zea mays]
 gi|194694818|gb|ACF81493.1| unknown [Zea mays]
 gi|413933909|gb|AFW68460.1| hypothetical protein ZEAMMB73_478838 [Zea mays]
          Length = 225

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 8/128 (6%)

Query: 117 DLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           DLR   +  E  N +  +  +A M E+ F G+  +   + KA A  A+F G D ++ ++D
Sbjct: 102 DLRFCDYTNEKTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVID 161

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
           R+   +A+LT A+   TVL+ S    A ++   F D +I     Q LC     TN     
Sbjct: 162 RVNFEKADLTGAIFKNTVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISP 216

Query: 234 STRKSLGC 241
             R  LGC
Sbjct: 217 DARLELGC 224


>gi|323454309|gb|EGB10179.1| hypothetical protein AURANDRAFT_23610 [Aureococcus anophagefferens]
          Length = 107

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 6/101 (5%)

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII---- 202
           FN A L  A  + A+  G    D  M ++ L  A+L+NA L    LT + + GA+I    
Sbjct: 6   FNKAQLFSASFFDADLAGTTFVDADMKQVNLEMADLSNADLTNADLTEAYMAGAVIKDLK 65

Query: 203 --EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             +  D++D  +   Q+  LC  A GTNP TG+ TR +L C
Sbjct: 66  KIDNTDWTDVDMRKDQRTYLCSIAKGTNPKTGMDTRDTLMC 106


>gi|312195986|ref|YP_004016047.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
 gi|311227322|gb|ADP80177.1| pentapeptide repeat protein [Frankia sp. EuI1c]
          Length = 377

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 56/104 (53%), Gaps = 9/104 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AA+   ADL  ++ +K    A         +D +G++ + A L+ A    AN TGA L D
Sbjct: 237 AARLTGADLTGSILIKTKLTA---------TDLAGARLSQANLDGADLANANLTGARLDD 287

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            ++  + L+E  L +AVL R  L R+DL GA + GAD + A +D
Sbjct: 288 AILTGVHLSEGRLVDAVLTRANLHRADLVGADLTGADLTGARLD 331


>gi|91070460|gb|ABE11370.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
           HOT0M-10G7]
          Length = 157

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 64/131 (48%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  +DL+ A          F   D+++++ SG +   A L  A     N + ++L + 
Sbjct: 33  ADFSGSDLKGAT---------FYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNLREV 83

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  VL+  +L+N  L  +    +      I+GADF++  +     +  C+ A+GTNPI
Sbjct: 84  TLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVREFCEIASGTNPI 143

Query: 231 TGVSTRKSLGC 241
           T   TR++L C
Sbjct: 144 TNRDTRETLEC 154


>gi|358458677|ref|ZP_09168884.1| pentapeptide repeat protein [Frankia sp. CN3]
 gi|357077988|gb|EHI87440.1| pentapeptide repeat protein [Frankia sp. CN3]
          Length = 377

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 55/109 (50%), Gaps = 9/109 (8%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           F    AA+   ADL  AV  K    A         +D +G++ + A L+ A    AN TG
Sbjct: 232 FATFVAARLTGADLTGAVLAKTKLTA---------TDLAGTRLSRANLDGADLANANLTG 282

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A L D ++    L+EA L  A+L R  L R+DL GA + GAD + A +D
Sbjct: 283 ARLDDAVLTGAHLSEARLVGAILTRADLHRADLVGADLTGADLTGARLD 331



 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 38/82 (46%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           ++T A    +   G++  G  L  A    A  TGADL+  ++ +  L   +L    L R 
Sbjct: 209 DWTIAHYPGAQLVGARLAGRDLTFATFVAARLTGADLTGAVLAKTKLTATDLAGTRLSRA 268

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L  +DL  A + GA   DAV+
Sbjct: 269 NLDGADLANANLTGARLDDAVL 290


>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 517

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 29/180 (16%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F +A+LR+A     N   A+F+ A+MR  D  G+  +GA L +A    AN +GA+LS 
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 215
            ++ +  L  A+L+ A L+R   + +DL GA + GA           +D +    +DL+ 
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308

Query: 216 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 261
                      +++L K+ N T PI  +    SL      +  N Y   +   P++  PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368



 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
           L  A++   + N++ LA ++  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138

Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
           AD+RE+    + FNGA L                      A   K N   AD S+  + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+NA      +   DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 67/156 (42%), Gaps = 40/156 (25%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSA---------------DMRESDFSGSKF 147
           S A   ++DLR+    + N        AN T A               D+ E+    S  
Sbjct: 57  SVANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLL 116

Query: 148 NGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDL 197
             A L +A   KANFT     GADL +T + +   N ANL+ A L       T  T++DL
Sbjct: 117 IRAELIRAKLTKANFTQANLNGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDL 176

Query: 198 GGAI-----IEGADFSDAVIDLAQKQALCKYANGTN 228
            GA      +  ADFS+A +    +QA   YAN +N
Sbjct: 177 RGADLVKVNLPKADFSNAEL----RQANLTYANLSN 208



 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 3/102 (2%)

Query: 112 QFGSADLRKAVHVKENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           Q   +D+ K   + + +R    +F   ++ E + S     GA L  A    AN + +DL 
Sbjct: 8   QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +  + R  LN A L+NA L + +L ++ +  A +   D ++A
Sbjct: 68  EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109


>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 517

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 29/180 (16%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F +A+LR+A     N   A+F+ A+MR  D  G+  +GA L +A    AN +GA+LS 
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 215
            ++ +  L  A+L+ A L+R   + +DL GA + GA           +D +    +DL+ 
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308

Query: 216 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 261
                      +++L K+ N T PI  +    SL      +  N Y   +   P++  PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368



 Score = 44.3 bits (103), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
           L  A++   + N++ LA ++  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138

Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
           AD+RE+    + FNGA L                      A   K N   AD S+  + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+NA      +   DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 67/156 (42%), Gaps = 40/156 (25%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSA---------------DMRESDFSGSKF 147
           S A   ++DLR+    + N        AN T A               D+ E+    S  
Sbjct: 57  SVANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLL 116

Query: 148 NGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDL 197
             A L +A   KANFT     GADL +T + +   N ANL+ A L       T  T++DL
Sbjct: 117 IRAELIRAKLTKANFTQANLNGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDL 176

Query: 198 GGAI-----IEGADFSDAVIDLAQKQALCKYANGTN 228
            GA      +  ADFS+A +    +QA   YAN +N
Sbjct: 177 RGADLVKVNLPKADFSNAEL----RQANLTYANLSN 208



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 3/102 (2%)

Query: 112 QFGSADLRKAVHVKENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           Q   +D+ K   + + +R    +F   ++ E + S     GA L  A    AN + +DL 
Sbjct: 8   QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +  + R  LN A L+NA L + +L ++ +  A +   D ++A
Sbjct: 68  EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109


>gi|427417538|ref|ZP_18907721.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425760251|gb|EKV01104.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 397

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 25/103 (24%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F  A+++E DFSG   +          K+N  GADLSDT + ++ LN+ANL  A L R
Sbjct: 283 ADFKGANLKEKDFSGRNLS----------KSNLEGADLSDTFLHKVNLNQANLHKAKLFR 332

Query: 190 TVLTR---------------SDLGGAIIEGADFSDAVIDLAQK 217
             L +               +DL GA + GAD S A+I    K
Sbjct: 333 ANLLQANLSHANLREANLIGADLSGADLSGADLSGAIIGYGDK 375


>gi|186682860|ref|YP_001866056.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186465312|gb|ACC81113.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 589

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADL  A+    N      F  N + A +  +D S +K NGA L  A    A F G
Sbjct: 408 ADLSGADLSHAILNGTNLSDTILFSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLG 467

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ADLS   + R+ LNEA+L+  +L    L+ +DL  AI+ G DFS A
Sbjct: 468 ADLSGVDLSRVSLNEADLSGVILSEADLSGADLTDAILFGTDFSYA 513



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 9/102 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A FG A+L  A         N + A++  +D S +  +GA L +A   +A+   ADLS
Sbjct: 301 SGANFGDANLSGA---------NLSGANLSGADLSSTNLSGANLSRANLSRADLNRADLS 351

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            T ++R  L+  NL+ A L  T  +R+DL  AI+ GA+ S+A
Sbjct: 352 STNLNRADLSNTNLSRADLSSTNFSRADLSNAILFGANLSEA 393



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANF 162
           S A    ADL        N  RAN + AD+  +D S +  N A      L +A     NF
Sbjct: 316 SGANLSGADLSSTNLSGANLSRANLSRADLNRADLSSTNLNRADLSNTNLSRADLSSTNF 375

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           + ADLS+ ++    L+EANL+N  L    L R+DL      GAD S A+++
Sbjct: 376 SRADLSNAILFGANLSEANLSNVSLNHADLCRADL-----SGADLSHAILN 421



 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A  G A+L     +  N   ANF  A++  ++ SG+  +GA L       AN + A+L
Sbjct: 281 SLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGANLSGADLSSTNLSGANLSRANL 340

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S   ++R  L+  NL  A L  T L+R+DL       AD S+A++
Sbjct: 341 SRADLNRADLSSTNLNRADLSNTNLSRADLSSTNFSRADLSNAIL 385



 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 42/115 (36%), Positives = 57/115 (49%), Gaps = 9/115 (7%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           AKL N R+  +  L A +     S +S    LN  EA+  G   I S A    ADL  A+
Sbjct: 453 AKLNNARLNGAMFLGADLSGVDLSRVS----LN--EADLSGV--ILSEADLSGADLTDAI 504

Query: 123 HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
               +F  AN  SA++  S+ SG+  NGA L  +    A  +GADLSD  M++M 
Sbjct: 505 LFGTDFSYANLNSANLSGSNLSGAILNGANLSHSNLSYAILSGADLSDANMEKMT 559



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 21/126 (16%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL +A     N  RA+ ++ ++  +D S + F+ A L  A+ + AN + A+L
Sbjct: 336 SRANLSRADLNRADLSSTNLNRADLSNTNLSRADLSSTNFSRADLSNAILFGANLSEANL 395

Query: 168 SDTLMDRM---------------VLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADF 207
           S+  ++                 +LN  NL++ +L  T     +L  +DL  A + GA  
Sbjct: 396 SNVSLNHADLCRADLSGADLSHAILNGTNLSDTILFSTNLSDAILMAADLSYAKLNGAKL 455

Query: 208 SDAVID 213
           ++A ++
Sbjct: 456 NNARLN 461



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 1/90 (1%)

Query: 127 NFRANFT-SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NFR+ +   A++  +DFSG+  + AYL  A     NF GA+LS        L+ ANL+ A
Sbjct: 259 NFRSAYLGDANLTGADFSGADLSLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGA 318

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
            L    L+ ++L GA +  A+ S A ++ A
Sbjct: 319 NLSGADLSSTNLSGANLSRANLSRADLNRA 348



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 46/101 (45%), Gaps = 1/101 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S+  F  ADL  A+    N   AN ++  +  +D   +  +GA L  A+    N +   L
Sbjct: 371 SSTNFSRADLSNAILFGANLSEANLSNVSLNHADLCRADLSGADLSHAILNGTNLSDTIL 430

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
             T +   +L  A+L+ A L    L  + L GA+  GAD S
Sbjct: 431 FSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLGADLS 471


>gi|428310629|ref|YP_007121606.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
 gi|428252241|gb|AFZ18200.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
          Length = 542

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 46/87 (52%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R +F S D+   D      +G    +A   K NF GADLS+    R  LN +NL +A L 
Sbjct: 415 RRDFASQDLSGLDLHKVDLSGGIFHQAKLAKTNFQGADLSNADFGRASLNRSNLRDANLG 474

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
           R  L+ +DL GA + GAD S A ++ A
Sbjct: 475 RAYLSYADLEGADLRGADLSYAYLNHA 501


>gi|119486763|ref|ZP_01620738.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
 gi|119456056|gb|EAW37189.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
          Length = 331

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 61/128 (47%), Gaps = 20/128 (15%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR--------ANFTSADMRE 139
           ++ L D N  +A+ RG         F  ADLR A     N R         N   AD+R 
Sbjct: 104 LAILLDANLIQADLRG-------VNFQGADLRGACLRGANLRYERRIYDGVNLRGADLRG 156

Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
           +D  G    GA L +A     N  GA+L++T++   +L +ANLT A L    LT +DL G
Sbjct: 157 ADLQGVNLTGADLTRA-----NLRGANLAETVLRGAILKQANLTQANLQSAFLTEADLSG 211

Query: 200 AIIEGADF 207
           A + GA+ 
Sbjct: 212 ARLIGANL 219



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 60/106 (56%), Gaps = 11/106 (10%)

Query: 118 LRKAVHVKENF-RANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTL 171
           LR A+  + N  +AN  SA + E+D SG++  GA      LE+A+  +A   G +L D++
Sbjct: 184 LRGAILKQANLTQANLQSAFLTEADLSGARLIGANLRKVKLERAILIEAQLPGVELCDSI 243

Query: 172 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 212
           +  + L+ ANL+ A L RT L     TR+DL  A +  AD +DA +
Sbjct: 244 LPDVKLSSANLSGADLSRTNLVRADLTRTDLSNANLTQADLTDASV 289



 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 59/132 (44%), Gaps = 25/132 (18%)

Query: 95  NKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRANFTSADMRESDF----------- 142
           N+Y+A  R    I    A   SADL           ANF  AD++ S+F           
Sbjct: 8   NRYQAGERDFRDIHLRNANLNSADL---------IDANFNHADLQGSEFVFAYLNSVNFV 58

Query: 143 ----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
                 +K +GAYL KA    AN + ADL   ++      +ANL+ A+L+   L ++DL 
Sbjct: 59  RANLGSAKLSGAYLNKANLSGANLSDADLHGAVLQGADFRKANLSLAILLDANLIQADLR 118

Query: 199 GAIIEGADFSDA 210
           G   +GAD   A
Sbjct: 119 GVNFQGADLRGA 130


>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 1011

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 37/102 (36%), Positives = 54/102 (52%), Gaps = 14/102 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A   +ADLR A  +    RAN + A++R ++ SG+  +G YL  A   +AN   A+  
Sbjct: 850 SGADLRTADLRSANLI----RANLSDANLRSANLSGANLSGVYLNSADLRRANLNDAN-- 903

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                   LN+A+L+ A L    L+ +DL GA +  ADFS A
Sbjct: 904 --------LNDADLSGANLRSADLSGADLSGADLSVADFSSA 937



 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 6/90 (6%)

Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD-----RMVLNEA 180
           N R ++ + AD+R +D   +    A L  A    AN +GA+LS   ++     R  LN+A
Sbjct: 843 NLRTSDLSGADLRTADLRSANLIRANLSDANLRSANLSGANLSGVYLNSADLRRANLNDA 902

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           NL +A L    L  +DL GA + GAD S A
Sbjct: 903 NLNDADLSGANLRSADLSGADLSGADLSVA 932


>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
 gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
          Length = 268

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 56/106 (52%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A     DL  A  ++ N        AN  +AD+ E++   ++ NGAYL KA  YKAN   
Sbjct: 139 ANLRETDLSTAKLIRANLGFANLIEANLINADLSEANLYEAQLNGAYLYKANFYKANLHQ 198

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A LS   + R   +EANL+ A L  + LT ++L GA ++GA+   A
Sbjct: 199 AHLSGAYLFRANFSEANLSCANLTWSNLTGANLAGANLQGANLRGA 244



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 52/101 (51%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +A+L+ A+  + N    +   A++RE+D S +K   A L  A   +AN   ADLS+
Sbjct: 114 ADLSTANLQGAIIAEANLIGTDLRDANLRETDLSTAKLIRANLGFANLIEANLINADLSE 173

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             +    LN A L  A   +  L ++ L GA +  A+FS+A
Sbjct: 174 ANLYEAQLNGAYLYKANFYKANLHQAHLSGAYLFRANFSEA 214



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD--RMV---LNEANLTN 184
           AN    ++R ++  G   N   L  A+  +AN + ADLS   +   +++   L+EANL+ 
Sbjct: 34  ANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEANLSEANLSV 93

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           A L    LT+++L  A + GAD S A
Sbjct: 94  ANLSGATLTQANLSYAHLIGADLSTA 119



 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           SAA     +LR A     N  + + + A +  ++ S +  +GA L +A   +AN + A+L
Sbjct: 32  SAANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEANLSEANL 91

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 210
           S   +    L +ANL+ A L+   L+ ++L GAII      G D  DA
Sbjct: 92  SVANLSGATLTQANLSYAHLIGADLSTANLQGAIIAEANLIGTDLRDA 139


>gi|359459933|ref|ZP_09248496.1| hypothetical protein ACCM5_14478 [Acaryochloris sp. CCMEE 5410]
          Length = 315

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           ++  AD++E DFSG   + A L     + A  +K N  GA+L++  + R  L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
            L    LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293


>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 331

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 10/138 (7%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA 130
           F  T L AA +   +  ++ L D N  +A+ RG       A    ADLR A     N R 
Sbjct: 87  FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139

Query: 131 N---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
               + S ++R +D  G+   G  L  A   +AN TGA+L++ ++   +LN+ NL+   L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLTECVLRGAILNQTNLSETNL 199

Query: 188 VRTVLTRSDLGGAIIEGA 205
              +LT  +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217



 Score = 43.5 bits (101), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 7/126 (5%)

Query: 94  LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSA-----DMRESDFSGSK 146
           LN+Y +  +   G+    A+  +ADL  A     +F+ ANF  A     ++  ++   ++
Sbjct: 7   LNQYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAR 66

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
             GA L +A    A  T AD   T++    L +ANLT A LV   L ++DL GA ++GAD
Sbjct: 67  LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126

Query: 207 FSDAVI 212
              A +
Sbjct: 127 LRGACL 132


>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
 gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
          Length = 309

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 71/143 (49%), Gaps = 9/143 (6%)

Query: 89  SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGS 145
           +AL   N   A+ RG    G   S A    ADLR  + V  + R  + S  +R+++ +G+
Sbjct: 45  AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLR--WVS--LRKANLTGA 100

Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
               A L  A   +AN TGA LS+ ++    L   +LT A L R  LTR++L  A + GA
Sbjct: 101 DLTRANLANADLSEANLTGAQLSEAIVRDANLTLTDLTLAELERANLTRANLTEAYLRGA 160

Query: 206 DFSDAVIDLAQKQALCKYANGTN 228
           D +DAV  L + Q L     G N
Sbjct: 161 DLTDAV--LRESQLLQANLRGAN 181



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           SA     A+L +A+ +  N R A    A++RE  F  +    A L+KA     N  GADL
Sbjct: 183 SATNLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKA-----NLVGADL 237

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
               + + +L  ANL++A+L+   L  ++L GA + GA+  +A++
Sbjct: 238 RGVSLAQALLRGANLSSAILIGANLMGANLSGADLRGANLIEAIL 282



 Score = 44.3 bits (103), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 16/108 (14%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+   A+LR+    + N R      AN   AD+R    + +   GA L  A+   AN  G
Sbjct: 205 ARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQALLRGANLSSAILIGANLMG 264

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+LS           A+L  A L+  +LT + L G  +   D S+A++
Sbjct: 265 ANLSG----------ADLRGANLIEAILTGASLNGVDLSAVDMSEAIL 302



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 62/131 (47%), Gaps = 16/131 (12%)

Query: 91  LADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSK 146
           L DL   E E    TR      + A    ADL  AV ++E   +    A++R ++ S + 
Sbjct: 134 LTDLTLAELERANLTRANL---TEAYLRGADLTDAV-LRE---SQLLQANLRGANLSATN 186

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AI 201
              A LE+A+   AN   A L +  +  +   EANL +A L +  L  +DL G     A+
Sbjct: 187 LQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQAL 246

Query: 202 IEGADFSDAVI 212
           + GA+ S A++
Sbjct: 247 LRGANLSSAIL 257



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD----RMV-LNEANLT 183
           RA+ T A ++ ++   +   GA L  A   +A+  GADL   ++     R V L +ANLT
Sbjct: 39  RADLTDAALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVSLRKANLT 98

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A L R  L  +DL  A + GA  S+A++
Sbjct: 99  GADLTRANLANADLSEANLTGAQLSEAIV 127



 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKA------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           + AQ   A +R A      + + E  RAN T A++ E+   G+    A L ++   +AN 
Sbjct: 118 TGAQLSEAIVRDANLTLTDLTLAELERANLTRANLTEAYLRGADLTDAVLRESQLLQANL 177

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            GA+LS T      L +ANL  A+L+   L R+ L  A +    F +A
Sbjct: 178 RGANLSAT-----NLQQANLERAILIGANLRRARLEEANLREVAFKEA 220



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 15/72 (20%)

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           + DF+G     A+L + +      TG DLS           A+LT+A L  T L R+DL 
Sbjct: 14  DRDFAGIHLRRAHLSRCI-----LTGIDLS----------RADLTDAALQSTNLQRADLR 58

Query: 199 GAIIEGADFSDA 210
           GAI+ GA+ S A
Sbjct: 59  GAILTGANLSQA 70


>gi|308813604|ref|XP_003084108.1| COG1357: Uncharacterized low-complexity proteins (ISS)
           [Ostreococcus tauri]
 gi|116055991|emb|CAL58524.1| COG1357: Uncharacterized low-complexity proteins (ISS)
           [Ostreococcus tauri]
          Length = 177

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 63/129 (48%), Gaps = 17/129 (13%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT   ++ ++F G+   G     A    A F GA+L +  + +  L++A+LT+A+L  
Sbjct: 48  AFFTKGSLKRANFDGANLEGITFFGADLTGATFRGANLQNANLGQANLSKADLTDAILSG 107

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------ALCKYANGTNPITG 232
            +++ +      IEG+D+S+ ++   + +                  LCK A G NP+TG
Sbjct: 108 AIVSSAQFDDVKIEGSDWSEVIVRKREAKDDTTDDLFCVAYQDILTGLCKVAKGENPVTG 167

Query: 233 VSTRKSLGC 241
           + T  +L C
Sbjct: 168 LPTELTLMC 176


>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 508

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 11/143 (7%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F   DLR+A   + N   AN + A++R +D SG+   GA L +A    AN  GA+LS+
Sbjct: 181 ADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANLYGANLSN 240

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
                     ANLTNA LV   LT ++L GA   GAD S + +  A+   + ++      
Sbjct: 241 ----------ANLTNASLVHADLTLANLNGADWVGADLSGSTLSGAKLYDVPRFGIKAEE 290

Query: 230 ITGVSTRKSLGCGNSRRNAYGSP 252
           +T      S    NS+   +GSP
Sbjct: 291 VTCEWVDLSSNGDNSQVYRFGSP 313



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 14/152 (9%)

Query: 73  STALAAAVVASCSSNISAL--ADLNKYE----AETRGEFGIG-------SAAQFGSADLR 119
           S+ L  A++   + N++ L  ADL++ +    A  RGE           S A    ADLR
Sbjct: 75  SSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANLTGADLR 134

Query: 120 KAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           +A   + NF  AN + A++R +  + + F  A L  A   KA+  GAD S T + +  L 
Sbjct: 135 EAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKADLNGADFSGTDLRQANLC 194

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + NL+ A L    L  +DL GA + GAD ++A
Sbjct: 195 QVNLSGANLSGANLRWADLSGANLRGADLNEA 226



 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 45/80 (56%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NFT  ++ E++ S    + A L +A  +  N +GA+L++  +    LN A L+++ LVR 
Sbjct: 22  NFTGINLNEANLSRINLSQANLSEASLFVTNLSGANLNEVNLSNANLNVARLSSSHLVRA 81

Query: 191 VLTRSDLGGAIIEGADFSDA 210
           +L  + L  A +  AD S+A
Sbjct: 82  ILQGATLNVANLVRADLSEA 101



 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F  N + A++ E + S +  N A L  +   +A   GA L+   + R  L+EA L  A L
Sbjct: 49  FVTNLSGANLNEVNLSNANLNVARLSSSHLVRAILQGATLNVANLVRADLSEAQLMGAAL 108

Query: 188 VRTVLTRSDLGGAI-----IEGADFSDA 210
           +R  L R++L  A      + GAD  +A
Sbjct: 109 IRGELIRAELSKANFSKANLTGADLREA 136


>gi|158336687|ref|YP_001517861.1| hypothetical protein AM1_3555 [Acaryochloris marina MBIC11017]
 gi|158306928|gb|ABW28545.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 315

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           ++  AD++E DFSG   + A L     + A  +K N  GA+L++  + R  L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
            L    LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293


>gi|78779034|ref|YP_397146.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9312]
 gi|78712533|gb|ABB49710.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9312]
          Length = 157

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 64/131 (48%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  +DL+ A          F   D+++++ SG +   A L  A     N + ++L + 
Sbjct: 33  ADFSGSDLKGAT---------FYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNLREV 83

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  VL+  +L+N  L  +    +      I+GADF++  +     +  C+ A+GTNPI
Sbjct: 84  TLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVRKFCESASGTNPI 143

Query: 231 TGVSTRKSLGC 241
           T   TR++L C
Sbjct: 144 TNRDTRETLEC 154


>gi|94266259|ref|ZP_01289965.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
 gi|93453141|gb|EAT03609.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
          Length = 818

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 54/98 (55%), Gaps = 12/98 (12%)

Query: 127 NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMD------R 174
           +FRA N + AD   +DFS + F GA L  AV  + + TG     A+L+D  +D      R
Sbjct: 372 DFRAANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSR 431

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             L  ANLTNA L    LT +DL  AI+ GAD  +AV+
Sbjct: 432 ATLIRANLTNASLREADLTGADLSNAILTGADLREAVL 469



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N    D+RE DF G++ +G   ++A    A+F+GADL    +    L  A L  A L R 
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 224
            L R+DLG A +  AD + A +  A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/138 (35%), Positives = 65/138 (47%), Gaps = 14/138 (10%)

Query: 109 SAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFSG-SKFNGAYLEKAVAYKAN 161
           S A F  A+L  AV  +      E   AN T A + ++D S  +    A L  A   +A+
Sbjct: 389 SKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREAD 448

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
            TGADLS+      +L  A+L  AVLVRT LT + L  A +  +  SDA  DL+      
Sbjct: 449 LTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTLSDA--DLSNADLKE 501

Query: 222 KYANGTNPITGVSTRKSL 239
              NG N   G S  +SL
Sbjct: 502 ASLNGVNLGAGASVLQSL 519



 Score = 43.9 bits (102), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 6/99 (6%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           LR A+    ++ + F   D+R +D  G+    A L  A    A+   ADLS   + R  L
Sbjct: 519 LRSAI----SWSSRFVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDL 574

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             ANLT+A+L  T+L+ + L  A    A+F++A  DL Q
Sbjct: 575 RWANLTDAILQGTILSNASLNDANFNRANFAEA--DLTQ 611



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)

Query: 111 AQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A FG  D RK    + +FR       NF+ AD+  + F+ +  +GA L++     A   G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           +DLS   +  + L +ANL  A L    L  +DL  A +  AD S A  DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 8/123 (6%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
           LA ++  E + RG    G    F  A+LR A     +F  A+   AD+ E+D  G+K  G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A L +    +A+   ADLS+  + R  L  A L  A+L R +   +D        ADF  
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255

Query: 210 AVI 212
           A  
Sbjct: 256 ATF 258



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 53/112 (47%), Gaps = 24/112 (21%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-- 168
           A  G A LR+A+      RA F   D R+ D   + F GA  +     + NF+GADLS  
Sbjct: 221 ANLGGARLRQAILR----RALFGETDARKVDARQADFRGATFQ-----RGNFSGADLSRA 271

Query: 169 ---DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 207
              DT +   +L E +L  A L  + L+R          ++LGGA + GAD 
Sbjct: 272 RFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 3/121 (2%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADL  A  V+ +  A +   A +++S  +G+  +G+ L    A  A+F  A+LS 
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 227
                   ++AN   A L   VL ++DL G  +  A+ +DA +D A    +A    AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440

Query: 228 N 228
           N
Sbjct: 441 N 441


>gi|94266194|ref|ZP_01289904.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
 gi|93453242|gb|EAT03697.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
          Length = 818

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 54/98 (55%), Gaps = 12/98 (12%)

Query: 127 NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMD------R 174
           +FRA N + AD   +DFS + F GA L  AV  + + TG     A+L+D  +D      R
Sbjct: 372 DFRAANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSR 431

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             L  ANLTNA L    LT +DL  AI+ GAD  +AV+
Sbjct: 432 ATLIRANLTNASLREADLTGADLSNAILTGADLREAVL 469



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N    D+RE DF G++ +G   ++A    A+F+GADL    +    L  A L  A L R 
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 224
            L R+DLG A +  AD + A +  A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/138 (35%), Positives = 65/138 (47%), Gaps = 14/138 (10%)

Query: 109 SAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFSG-SKFNGAYLEKAVAYKAN 161
           S A F  A+L  AV  +      E   AN T A + ++D S  +    A L  A   +A+
Sbjct: 389 SKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREAD 448

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
            TGADLS+      +L  A+L  AVLVRT LT + L  A +  +  SDA  DL+      
Sbjct: 449 LTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTLSDA--DLSNADLKE 501

Query: 222 KYANGTNPITGVSTRKSL 239
              NG N   G S  +SL
Sbjct: 502 ASLNGVNLGAGASVLQSL 519



 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 6/99 (6%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           LR A+    ++ + F   D+R +D  G+    A L  A    A+   ADLS   + R  L
Sbjct: 519 LRSAI----SWSSRFVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDL 574

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             ANLT+A+L  T+L+ + L  A    A+F++A  DL Q
Sbjct: 575 RWANLTDAILQGTILSNASLNDANFNRANFAEA--DLTQ 611



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)

Query: 111 AQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A FG  D RK    + +FR       NF+ AD+  + F+ +  +GA L++     A   G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           +DLS   +  + L +ANL  A L    L  +DL  A +  AD S A  DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344



 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 8/123 (6%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
           LA ++  E + RG    G    F  A+LR A     +F  A+   AD+ E+D  G+K  G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A L +    +A+   ADLS+  + R  L  A L  A+L R +   +D        ADF  
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255

Query: 210 AVI 212
           A  
Sbjct: 256 ATF 258



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 53/112 (47%), Gaps = 24/112 (21%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-- 168
           A  G A LR+A+      RA F   D R+ D   + F GA  +     + NF+GADLS  
Sbjct: 221 ANLGGARLRQAILR----RALFGETDARKVDARQADFRGATFQ-----RGNFSGADLSRA 271

Query: 169 ---DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 207
              DT +   +L E +L  A L  + L+R          ++LGGA + GAD 
Sbjct: 272 RFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 3/121 (2%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADL  A  V+ +  A +   A +++S  +G+  +G+ L    A  A+F  A+LS 
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 227
                   ++AN   A L   VL ++DL G  +  A+ +DA +D A    +A    AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440

Query: 228 N 228
           N
Sbjct: 441 N 441


>gi|428222198|ref|YP_007106368.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427995538|gb|AFY74233.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 225

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 9/105 (8%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A+   ADL  A   K    A  + A++  ++ SG+  +  +L +AV   AN   ADL+  
Sbjct: 26  AELNDADLSGANLSK----ARMSGAELNRANMSGANLHSTHLNRAVMKNANLENADLTGA 81

Query: 171 LMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAIIEGADFSDA 210
            M  + L+EANLTNA L     V + LT ++L GAI+  ADFS++
Sbjct: 82  KMMEVNLSEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNS 126



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 58/122 (47%), Gaps = 11/122 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 162
           S A   +A+L     V+ N   AN   A +  +DFS S  +     GA L+ A+    N 
Sbjct: 89  SEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNL 148

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           TGADLS   +  + L+ ANL+ A L   +     LGGA I  A+F+   +  A  + +  
Sbjct: 149 TGADLSGINLKGVNLSGANLSMANLSGAI-----LGGANITKANFAQTDLSNADLRDVNI 203

Query: 223 YA 224
           YA
Sbjct: 204 YA 205



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 53/119 (44%), Gaps = 11/119 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 163
           S A   S  L +AV    N   A+ T A M E + S +    A L      ++N T    
Sbjct: 54  SGANLHSTHLNRAVMKNANLENADLTGAKMMEVNLSEANLTNANLSNVSGVESNLTMANL 113

Query: 164 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
                  AD S++ + ++ L  A+L  A+   T LT +DL G  ++G + S A + +A 
Sbjct: 114 AGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNLTGADLSGINLKGVNLSGANLSMAN 172


>gi|428308708|ref|YP_007119685.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428250320|gb|AFZ16279.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 294

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 44/79 (55%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF  A +  +   G+   GA L  A   + N  GADLS   ++R  L +ANLT A+L RT
Sbjct: 186 NFRRAKLTAATLEGANLTGANLTDAQLNRVNLQGADLSGANLERACLEDANLTGAILRRT 245

Query: 191 VLTRSDLGGAIIEGADFSD 209
            L+ +++ G  + G DFSD
Sbjct: 246 QLSEANMSGTKLYGVDFSD 264



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 6/106 (5%)

Query: 109 SAAQFGSADLR--KAVHVK----ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S AQ   A+L   K   VK    E  RAN + A M +S   G+K +GA L  A    AN 
Sbjct: 58  SGAQMNWANLSFVKMNEVKLIETELTRANLSGAFMVKSLLPGAKMSGADLMGANLRGANL 117

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            GA+L  + ++R+ L +ANL         L+ + L GA++ G+  +
Sbjct: 118 WGANLCGSQLERVNLRDANLMGVNFKWANLSEARLMGAMLYGSSLN 163


>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
 gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
          Length = 439

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 16/118 (13%)

Query: 111 AQFGSADLRKAVHVKENFR----------------ANFTSADMRESDFSGSKFNGAYLEK 154
           A+ G  DLRKA   K +F                  NF  ADM+E++  G+   GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A    A+ +GA+L   ++   +L  ANL  A+L    L  ++L  A ++GAD + A +
Sbjct: 345 AFLKGADLSGANLKGAILYGAMLYGANLDGAILTNVSLFDANLEKASLKGADLTGATL 402



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L  A   K N  +A+ + A + +++  G+  +  YL+KA     N   A L
Sbjct: 56  SKANLEDAKLNGANLSKANLSKADLSGASLDKANLEGANLSMTYLKKANMKAVNAAHAWL 115

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +D  ++   + +A+L  A L R  L  + + GA +E A   DAV+
Sbjct: 116 ADANLNGAFMKDASLKAANLARANLRWAKMSGADLEQASLKDAVL 160


>gi|427729960|ref|YP_007076197.1| putative low-complexity protein [Nostoc sp. PCC 7524]
 gi|427365879|gb|AFY48600.1| putative low-complexity protein [Nostoc sp. PCC 7524]
          Length = 937

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 33/104 (31%), Positives = 57/104 (54%), Gaps = 1/104 (0%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
            A    A+L+ A   + N + AN   A++  ++  G+   GA L++A+  +A   GA+L 
Sbjct: 812 GANLYGANLQGANLQRANLQGANLQRANLYGANLEGANLYGANLQRAILQRAILEGANLQ 871

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++ R  L  ANL  A+L R  L  ++L GA +EGA+  +A++
Sbjct: 872 RAILQRANLEGANLQRAILQRANLEGANLEGANLEGANLQEAIL 915



 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  A+L+ A     N + AN   A+++ ++  G+    A L  A    AN  GA+L  
Sbjct: 798 ANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGANLEGANLYGANLQR 857

Query: 170 TLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEGADFSDA 210
            ++ R +L  ANL  A+          L R +L R++L GA +EGA+   A
Sbjct: 858 AILQRAILEGANLQRAILQRANLEGANLQRAILQRANLEGANLEGANLEGA 908



 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 6/110 (5%)

Query: 107 IGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           I S+  F  A+ ++A     N + AN   A+++ ++   +   GA L++A  Y AN  GA
Sbjct: 789 ILSSKDFYMANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGANLEGA 848

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           +L    + R +L  A L  A L R +L R++L     EGA+   A++  A
Sbjct: 849 NLYGANLQRAILQRAILEGANLQRAILQRANL-----EGANLQRAILQRA 893


>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 267

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 56/105 (53%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   +A+ +KA  +      +N T AD+ ++D +G   + A L +A   + NFTG DL
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTGGDL 191

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S  ++    L E NLT   L    L+R++L G ++ GA+ +  ++
Sbjct: 192 SRVMLVGANLAETNLTAVNLSDANLSRAELNGVVLAGANLNRVIL 236



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 6/104 (5%)

Query: 112 QFGSADLRKA--VHVK----ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
            F  A+L KA  VH        F A  ++A++ +++ S + F  A L  A  + +N T A
Sbjct: 100 NFSEANLIKANLVHAALYCANFFMAMMSAANLSQANMSAANFQKATLISAYLHNSNLTQA 159

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           DLSD  +  + L++ANL+ A L+RT  T  DL   ++ GA+ ++
Sbjct: 160 DLSDADLTGINLSDANLSQATLIRTNFTGGDLSRVMLVGANLAE 203


>gi|242034055|ref|XP_002464422.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
 gi|241918276|gb|EER91420.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
          Length = 221

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           K N +  +  +A M E+ F G+  +   + KA A  A+F G D ++ ++DR+   +A+LT
Sbjct: 108 KTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLT 167

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            A+    VL+ S    A ++   F D +I     Q LC     TN      +R  LGC
Sbjct: 168 GAIFKNAVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISPDSRLELGC 220


>gi|397645344|gb|EJK76787.1| hypothetical protein THAOC_01435 [Thalassiosira oceanica]
          Length = 224

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 2/113 (1%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +FT    + + FS S   G    KA    A+F+GAD      +   ++ ANL N V   +
Sbjct: 111 DFTQIIAKGTIFSKSNLQGCRFYKAYLVNADFSGADARGAAFEDTSMDGANLRNIVASGS 170

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 241
              +S L    +EG DF+DA I     + +C   +  GTNP TG  TR SL C
Sbjct: 171 YFGQSLLDVESLEGGDFTDAQIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 223


>gi|422295276|gb|EKU22575.1| hypothetical protein NGA_0469800 [Nannochloropsis gaditana CCMP526]
          Length = 90

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 2/83 (2%)

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
           NF GAD S+ ++DR+  + +NL  ++    VL+ +   GA +  +DF+D  +     + L
Sbjct: 7   NFEGADFSNAVVDRVSFDGSNLKGSIFSNAVLSGTSFVGADLTDSDFTDTYMGEFNLREL 66

Query: 221 CKYAN--GTNPITGVSTRKSLGC 241
           CK     GTNP+T   T++S GC
Sbjct: 67  CKNPTLKGTNPVTQAPTKESAGC 89


>gi|83955651|ref|ZP_00964231.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
 gi|83839945|gb|EAP79121.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
          Length = 189

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 10/116 (8%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           N  EA+ RG   +  A   G ADLR A  ++E   A+ + A++  +D SG+K  GA L +
Sbjct: 12  NLTEADLRGA-DLREADLSGRADLRGA-DLRE---ADLSGAELFYADLSGAKLIGAILSR 66

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+   AN +GADL      R+ L+ A+L+  +L+   LT +DL GA +  AD S A
Sbjct: 67  AILISANLSGADLR-----RVDLSGADLSGTILIGANLTGADLTGANLSSADLSGA 117



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 11/123 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 164
           S A+   A L +A+ +     AN + AD+R  D SG+  +G  L  A    A+ TG    
Sbjct: 55  SGAKLIGAILSRAILIS----ANLSGADLRRVDLSGADLSGTILIGANLTGADLTGANLS 110

Query: 165 -ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
            ADLS   +  M+L  ANL+ A L R  L+ ++L GA +  AD   A  +L +      Y
Sbjct: 111 SADLSGANLSGMILRGANLSGANLSRADLSGANLSGASVTEADLGGA--NLTEANLTRTY 168

Query: 224 ANG 226
            NG
Sbjct: 169 LNG 171



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 1/95 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL   + +  N   A+ T A++  +D SG+  +G  L  A    AN + ADLS   +  
Sbjct: 87  ADLSGTILIGANLTGADLTGANLSSADLSGANLSGMILRGANLSGANLSRADLSGANLSG 146

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
             + EA+L  A L    LTR+ L GA +     SD
Sbjct: 147 ASVTEADLGGANLTEANLTRTYLNGATLCNTTMSD 181


>gi|150016367|ref|YP_001308621.1| pentapeptide repeat-containing protein [Clostridium beijerinckii
            NCIMB 8052]
 gi|149902832|gb|ABR33665.1| pentapeptide repeat protein [Clostridium beijerinckii NCIMB 8052]
          Length = 1084

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 11/144 (7%)

Query: 92   ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
            ADL++   +  G     S   F  ADL  A+ V+    +A+F+ A + E+   G+ FN +
Sbjct: 914  ADLSRASMDYTGL----SYCNFEKADLSYAILVESGVSKADFSEASLSEAHIEGTFFNKS 969

Query: 151  YLEKAV-----AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
              EKA       ++++F   + +   +   V+ E+N  NA  + T L   DL  A + GA
Sbjct: 970  KFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESNFKNATFINTCLRNVDLEEADLTGA 1029

Query: 206  DFSDAVIDLAQ-KQALCKYANGTN 228
            D S+A +  A+  +A+ +  N TN
Sbjct: 1030 DMSNANLSNAKINKAIFEGTNLTN 1053



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 41/127 (32%), Positives = 54/127 (42%), Gaps = 23/127 (18%)

Query: 109  SAAQFGSADLRKAVHVKENF-------RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
            S A F  A L +A H++  F       +A+     M  SDF    FN A L  AV  ++N
Sbjct: 947  SKADFSEASLSEA-HIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESN 1005

Query: 162  FTGADLSDTLMDRMVLNEANLT---------------NAVLVRTVLTRSDLGGAIIEGAD 206
            F  A   +T +  + L EA+LT                A+   T LT  DL    IE  D
Sbjct: 1006 FKNATFINTCLRNVDLEEADLTGADMSNANLSNAKINKAIFEGTNLTNVDLTNVDIENID 1065

Query: 207  FSDAVID 213
            FS  +ID
Sbjct: 1066 FSKTIID 1072



 Score = 40.4 bits (93), Expect = 0.88,   Method: Composition-based stats.
 Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 10/112 (8%)

Query: 111  AQFGSADLRKAVHVKENFRANFTSADMRES--DFSG---SKFNGAYLEKAV-----AYKA 160
            A FG A+L  +      +  NF  AD+  +  D++G     F  A L  A+       KA
Sbjct: 890  ANFGYANLNDSHISGTLYNCNFKEADLSRASMDYTGLSYCNFEKADLSYAILVESGVSKA 949

Query: 161  NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +F+ A LS+  ++    N++    A L+ T + RSD        A+ S AV+
Sbjct: 950  DFSEASLSEAHIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVM 1001


>gi|145355959|ref|XP_001422212.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582452|gb|ABP00529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 125

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 57/110 (51%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           FT   ++ ++F+ +   G  L  A    A F  A+LS+  + +  L  A+ TNA+L   +
Sbjct: 16  FTKGSLKRANFNDANLTGITLFGADLSNATFVNANLSNANLGQANLTGADFTNAILSGAI 75

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           ++ + L    +  +D+SD ++       LCK A+G NP+TG  T  SL C
Sbjct: 76  VSSAQLDEVKLTNSDWSDVIVRKDVLTGLCKVADGENPVTGNITALSLMC 125


>gi|167771967|ref|ZP_02444020.1| hypothetical protein ANACOL_03340 [Anaerotruncus colihominis DSM
           17241]
 gi|167665765|gb|EDS09895.1| pentapeptide repeat protein [Anaerotruncus colihominis DSM 17241]
          Length = 314

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 67/129 (51%), Gaps = 10/129 (7%)

Query: 94  LNKYEAETRGEF-GIG---SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
           L+K+ A  RGE  G+    + A    ADL KA     N       +AN + A++  ++ S
Sbjct: 7   LDKHAAWLRGEPEGVKADLTGANLPGADLSKANLSGANLFGANLSKANLSGANLFGANLS 66

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           G+   GA L KA    AN +GADLS T +    L++ANL+ A L    L+R+ L GA + 
Sbjct: 67  GANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLS 126

Query: 204 GADFSDAVI 212
            A+ S A +
Sbjct: 127 KANLSKANL 135



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 65/133 (48%), Gaps = 12/133 (9%)

Query: 92  ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSG 144
           ADL+K        FG        S A    A+L  A     N  +AN + A++  +D S 
Sbjct: 33  ADLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLSR 92

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGG 199
           +   GA L KA    AN +GADLS T      + +  L++ANL+ A L    L++++L G
Sbjct: 93  THLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSG 152

Query: 200 AIIEGADFSDAVI 212
           A + GA+ S A +
Sbjct: 153 ANLFGANLSGANL 165



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 109 SAAQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL R  +   +  +AN + A++  +D S +   GA L KA   KAN +GA+L
Sbjct: 81  SGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANL 140

Query: 168 SDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 208
               + +  L+ ANL     + A L    L++++L GA + GAD S
Sbjct: 141 FGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLS 186



 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADL +  H+     A+ + A++ +++ SG+   GA L KA    AN  GA+LS
Sbjct: 106 SGANLSGADLSR-THLPG---ADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLS 161

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
              +    L++ANL+ A L    L+R+ L GA +  A+ S A +
Sbjct: 162 GANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKANL 205



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 50/102 (49%), Gaps = 6/102 (5%)

Query: 109 SAAQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S      ADL KA   K N      F AN + A++  ++  G+  +GA L  A   KAN 
Sbjct: 116 SRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANL 175

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           +GA+LS   + R  L  A+L+ A L +  L+ ++L G    G
Sbjct: 176 SGANLSGADLSRTHLPGADLSKANLSKANLSGANLSGPTCPG 217


>gi|123965950|ref|YP_001011031.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9515]
 gi|123200316|gb|ABM71924.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
          Length = 157

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 55/112 (49%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F   D+++++ S      A L  A     N + ++L +  +D  VL+  +LTN  L  
Sbjct: 43  ATFYLTDLQDANLSDCDLQNASLYGAKLKDTNLSNSNLREVTLDSAVLDGTDLTNTNLED 102

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +    +      I+GADF++  +     +  CK A+GTNP T   TR++L C
Sbjct: 103 SFAYSTQFENVKIQGADFTNVYLPKDIVREFCKEASGTNPFTNRETRETLEC 154


>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
 gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
           WPP163]
          Length = 846

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 45/160 (28%), Positives = 76/160 (47%), Gaps = 12/160 (7%)

Query: 78  AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADM 137
            A++ SCS  +   A+  ++   T     + S +   SAD  +A   + N R     A +
Sbjct: 687 GALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLR----QASL 741

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
             + F+ +K   + L +A   + NF  A+L+ +L  R    EAN T+A L+  +L +S L
Sbjct: 742 IGAVFALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQL 801

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           GGA   GA+   A  DL+Q      + + T  + G  T++
Sbjct: 802 GGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834


>gi|58613539|gb|AAW79356.1| chloroplast thylakoid 11kDa protein [Heterocapsa triquetra]
          Length = 91

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 14/103 (13%)

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           ++  +G+ F GA L +A    A  TGADL++ ++    +N    T  + V++        
Sbjct: 1   DAGLAGADFTGAVLTQANLELAQLTGADLTNAIVTEAYING---TTKLEVKSA------- 50

Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
               +GADF+D  +   Q+  LC  A GTNP+T V TR+S+ C
Sbjct: 51  ----DGADFTDTPLRKDQQMYLCGIAKGTNPVTKVDTRESMAC 89


>gi|115482792|ref|NP_001064989.1| Os10g0502000 [Oryza sativa Japonica Group]
 gi|22165076|gb|AAM93693.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|31432906|gb|AAP54482.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113639598|dbj|BAF26903.1| Os10g0502000 [Oryza sativa Japonica Group]
 gi|125532544|gb|EAY79109.1| hypothetical protein OsI_34214 [Oryza sativa Indica Group]
 gi|125575308|gb|EAZ16592.1| hypothetical protein OsJ_32066 [Oryza sativa Japonica Group]
 gi|215704684|dbj|BAG94312.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 236

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           K N +  +  +A M +S F G+  +   + KA A  A+F G D ++ ++DR+   +A+L 
Sbjct: 123 KTNLKGKSLAAALMSDSKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLQ 182

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            A+   TVL+ S    A ++   F D +I     Q LC     TN      +R  LGC
Sbjct: 183 GAIFRNTVLSGSTFDDAKMQDVVFEDTIIGYIDLQKLC-----TNTSISADSRLELGC 235


>gi|383763954|ref|YP_005442936.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381384222|dbj|BAM01039.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 244

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 66/128 (51%), Gaps = 13/128 (10%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK--- 146
           L + N YEA+        S A    ADLR A   +   R A    AD+R+++ +G+    
Sbjct: 87  LREANLYEADL-------SNAVLDQADLRYATLERAVLRSATLRGADLRDANLAGADLRV 139

Query: 147 --FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
             F+GA +E+A+   A+   A+L++ ++ R  L  ANL NAVL    L  +DL GA + G
Sbjct: 140 ADFSGAQMERAILTGASLVDANLANAVLRRADLRNANLRNAVLRYADLRGADLSGADLMG 199

Query: 205 ADFSDAVI 212
           AD   A +
Sbjct: 200 ADLMGARL 207



 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 46/90 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R NFT A + +++ S +    A L +A   +AN   ADLS+ ++D+  L  A L  AVL 
Sbjct: 59  RVNFTEASLNQANLSRATLLMAILSRAQLREANLYEADLSNAVLDQADLRYATLERAVLR 118

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
              L  +DL  A + GAD   A    AQ +
Sbjct: 119 SATLRGADLRDANLAGADLRVADFSGAQME 148



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 9/71 (12%)

Query: 159 KANFTGADLSDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           + N  G DLS   ++R+      LN+ANL+ A L+  +L+R+ L  A +  AD S+AV+D
Sbjct: 44  QVNLDGHDLSRADLNRVNFTEASLNQANLSRATLLMAILSRAQLREANLYEADLSNAVLD 103

Query: 214 LAQKQALCKYA 224
               QA  +YA
Sbjct: 104 ----QADLRYA 110


>gi|17230606|ref|NP_487154.1| hypothetical protein all3114 [Nostoc sp. PCC 7120]
 gi|17132208|dbj|BAB74813.1| all3114 [Nostoc sp. PCC 7120]
          Length = 576

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 46/81 (56%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F  N + A +  +D S +K NGA L  A    A F GADLS   +  +VLN+A+L+  +L
Sbjct: 418 FSTNLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGIL 477

Query: 188 VRTVLTRSDLGGAIIEGADFS 208
               LT +DL  AI+ G DFS
Sbjct: 478 SEADLTGADLSDAILLGTDFS 498



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 67/132 (50%), Gaps = 23/132 (17%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A  G A+L  A     NF+ AN T AD  +++ S    +GA L  A    AN TGA+L
Sbjct: 268 SGAYLGDANLTGA-----NFQDANLTGADFGDANLSSVNLSGANLSSADLSSANLTGANL 322

Query: 168 SDTLMDRMVLNEANLTNAVL--------------VRTV-LTRSDLGGAIIEGADFSDAVI 212
           S   + R  L+ A+L++++L              +R   L R++L  AI+ GA+ SDA +
Sbjct: 323 SGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAELRRANLSNAILFGANLSDANL 382

Query: 213 DLAQ--KQALCK 222
           + A   +  LC+
Sbjct: 383 NHADLSRADLCR 394



 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  AD+   D +G   N A L   +  +A+ TGADLSD ++     + ANL +A    
Sbjct: 450 AMFLGADLSGVDLTGVVLNDADLSGGILSEADLTGADLSDAILLGTDFSFANLNSA---- 505

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
             L+ S+L GAI+ GAD S A +  A
Sbjct: 506 -NLSGSNLSGAILNGADLSSANLSYA 530



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 51/90 (56%), Gaps = 6/90 (6%)

Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 180
           NF+ A   +A++   +FSG+  +GAYL  A    ANF     TGAD  D  +  + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANLSSVNLSGA 305

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           NL++A L    LT ++L GA ++ AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLQRADLSRA 335



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 53/115 (46%), Gaps = 11/115 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
           S A   SADL  A     N            RA+ +S+ + + +FS +  +G  L  A  
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAEL 362

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +AN + A L    +    LN A+L+ A L R  L+ +DL  A + G + SD ++
Sbjct: 363 RRANLSNAILFGANLSDANLNHADLSRADLCRADLSGADLTHATLNGTNLSDTIL 417



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 17/114 (14%)

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           GEF   S A     +LR A    E  RAN ++A +  ++ S +  N A L +A   +A+ 
Sbjct: 345 GEF---SHANLSGVNLRDA----ELRRANLSNAILFGANLSDANLNHADLSRADLCRADL 397

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           +GADL+        LN  NL++ +L  T     +L  AI+E AD S A ++ A+
Sbjct: 398 SGADLT-----HATLNGTNLSDTILFST-----NLSDAILEAADLSYAKLNGAK 441



 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 1/88 (1%)

Query: 124 VKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           V E  R  NF  A +  ++ +G  F+GA L  A    AN TGA+  D  +      +ANL
Sbjct: 238 VGEFLRGGNFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANL 297

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ++  L    L+ +DL  A + GA+ S A
Sbjct: 298 SSVNLSGANLSSADLSSANLTGANLSGA 325



 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 5/89 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLT 183
           RA+   AD+  +D + +  NG  L   + +  N +      ADLS   ++   LN A L 
Sbjct: 389 RADLCRADLSGADLTHATLNGTNLSDTILFSTNLSDAILEAADLSYAKLNGAKLNYARLN 448

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A+ +   L+  DL G ++  AD S  ++
Sbjct: 449 GAMFLGADLSGVDLTGVVLNDADLSGGIL 477


>gi|443476541|ref|ZP_21066442.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443018491|gb|ELS32731.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 400

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 89/188 (47%), Gaps = 30/188 (15%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFS-----G 144
           L + N  EA     F I   A    A L +A  V  N   AN TSA M  +D S     G
Sbjct: 61  LVEANLAEANLTSAFLI--RADLQRACLNQAYLVAANLNSANLTSASMVNADLSLATLTG 118

Query: 145 SKFNGAYLEKA-----VAYKANFTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTR 194
           +  NGA L +A        ++N  GADLSD+     LM +  L+ ANL+ A L+   LT 
Sbjct: 119 ACLNGANLSRAKLNGTFFIESNLLGADLSDSDFTGALMIKANLSGANLSQACLMNVDLTE 178

Query: 195 SDLGGAIIEGADFSDAVIDLAQKQAL-CKYANGTNPITGVSTRKS-------LGCGNSRR 246
           ++L GA ++G D + A+++ A   A+   YAN    ++GVS  ++       LG    + 
Sbjct: 179 ANLTGAELQGVDLAGAILNAANLNAVDLVYAN----LSGVSLSRANLSWANLLGTNLEKT 234

Query: 247 NAYGSPSS 254
           N  GS  S
Sbjct: 235 NLVGSDLS 242



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 40/73 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+   D  GS      L  A+  +AN TGA+L + +++   LN ANL  A L R
Sbjct: 299 ANLSGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLNRASLTR 358

Query: 190 TVLTRSDLGGAII 202
             LT ++L GA +
Sbjct: 359 ASLTGANLKGAFM 371



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            ++N + A++   + S +  +GA L  A    AN +GADLS+  +    L   NL NA+L
Sbjct: 267 MKSNLSGANLNGVNLSNANLSGANLSGANLMGANLSGADLSNVDLRGSYLIRTNLHNAIL 326

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVID 213
               LT ++L  A++ GA  + A ++
Sbjct: 327 NEANLTGANLDEAVLNGASLNRANLN 352



 Score = 40.4 bits (93), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N T A + +S+ SG+  NG  L  A    AN +GA+L    +    L+  +L  + L+RT
Sbjct: 260 NLTGAFLMKSNLSGANLNGVNLSNANLSGANLSGANLMGANLSGADLSNVDLRGSYLIRT 319

Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
            L  + L  A + GA+  +AV++
Sbjct: 320 NLHNAILNEANLTGANLDEAVLN 342



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 11/100 (11%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A   + DLR +  ++ N        AN T A++ E+  +G+  N A L +A   +A+ 
Sbjct: 302 SGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLNRASLTRASL 361

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           TGA+L    M        NL  A ++ T L  +++ GAI+
Sbjct: 362 TGANLKGAFMLW-----TNLRGAFMLWTNLDGANMTGAIL 396



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 16/104 (15%)

Query: 113 FGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F  A+L K++    N       R N + A + ++  S +   GA+L +A   +AN   A+
Sbjct: 6   FTKANLTKSILEGINLKGADLKRVNLSEAKLADAKLSKANLTGAFLHRADLNRANLVEAN 65

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           L+          EANLT+A L+R  L R+ L  A +  A+ + A
Sbjct: 66  LA----------EANLTSAFLIRADLQRACLNQAYLVAANLNSA 99



 Score = 37.4 bits (85), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 44/85 (51%), Gaps = 17/85 (20%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN    ++ E+D S +   GA+L      K+N +GA+L          N  NL+NA L  
Sbjct: 244 ANLNETNLAEADLSWTNLTGAFL-----MKSNLSGANL----------NGVNLSNANLSG 288

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDL 214
             L+ ++L GA + GAD S+  +DL
Sbjct: 289 ANLSGANLMGANLSGADLSN--VDL 311


>gi|307592031|ref|YP_003899622.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306985676|gb|ADN17556.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 161

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 56/108 (51%), Gaps = 5/108 (4%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  F  +++   +  K+    +   AD+ E D +G K  GA L KA  Y AN +GA LS 
Sbjct: 28  AYAFVQSNIDTLLSTKDCHNCDLVEADLHEKDLAGVKLYGADLSKAKLYGANLSGASLSG 87

Query: 170 TLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDAVI 212
             +    L+ ANL+ + L +       L +++L GA + GAD SDAV+
Sbjct: 88  ANLSGASLSGANLSGSYLQKANLKGAYLQKANLEGAALYGADLSDAVL 135



 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 4/98 (4%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           ADL KA    + + AN + A +  ++ SG+  +GA L  +   KAN  GA L    ++  
Sbjct: 68  ADLSKA----KLYGANLSGASLSGANLSGASLSGANLSGSYLQKANLKGAYLQKANLEGA 123

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            L  A+L++AVL    L  + L GA +EGA    A+ D
Sbjct: 124 ALYGADLSDAVLYGANLKGAKLKGANLEGAKTKGAIFD 161


>gi|126696014|ref|YP_001090900.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9301]
 gi|126543057|gb|ABO17299.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
          Length = 157

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 62/131 (47%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  +DL+ A          F   D+++++ SG +   A L  A     N + ++L + 
Sbjct: 33  ADFSGSDLKGAT---------FYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNLREV 83

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  VL+  +L+N  L  +    +      I+GADF++  +     +  C+ A GTNP 
Sbjct: 84  TLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIKKFCESATGTNPF 143

Query: 231 TGVSTRKSLGC 241
           T   TR++L C
Sbjct: 144 TNRETRETLEC 154


>gi|386828484|ref|ZP_10115591.1| putative low-complexity protein [Beggiatoa alba B18LD]
 gi|386429368|gb|EIJ43196.1| putative low-complexity protein [Beggiatoa alba B18LD]
          Length = 986

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 26/109 (23%)

Query: 126 ENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTG 164
           +N R  +F+  D+R +DFSG+    A  + A+ Y                     ANF+ 
Sbjct: 645 QNLRGQDFSGQDLRYADFSGADLTDALFKNAILYHVNFSNATLKNADFTKTDLSNANFSD 704

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           ADL+D L    +L  AN ++A L     T++DL       A+F+DA+ D
Sbjct: 705 ADLTDALFKNAILQHANFSDATLKNADFTKTDL-----SNANFTDAICD 748



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 14/148 (9%)

Query: 78  AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADM 137
           +A+++    +++      K+ +    + G+ S  Q  +  + K   +K N R    S D 
Sbjct: 588 SALMSQFFVDLAGREQATKWASRIIKQRGVASIIQNNADSILK--QLKNNQR---NSLDR 642

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R  +  G  F+G  L  A     +F+GADL+D L    +L   N +NA L     T++DL
Sbjct: 643 RGQNLRGQDFSGQDLRYA-----DFSGADLTDALFKNAILYHVNFSNATLKNADFTKTDL 697

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYAN 225
             A    AD +DA+     K A+ ++AN
Sbjct: 698 SNANFSDADLTDALF----KNAILQHAN 721


>gi|332712234|ref|ZP_08432162.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332349040|gb|EGJ28652.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 280

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 52/95 (54%), Gaps = 1/95 (1%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL  A     NF RA+ + A++  ++ +G+ F GA L  A    AN TGA+LS+T +  
Sbjct: 171 ADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNTDLKG 230

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
             L  ANL    L R  L RSDL  A+  GA+F +
Sbjct: 231 SNLTGANLNGTDLARADLERSDLRDAMTNGANFEN 265



 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 2/98 (2%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + + AD+  ++ +G+ F+ A L +A    AN TGAD +   +    L+ ANLT A L  T
Sbjct: 167 DLSGADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNT 226

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
            L  S+L GA + G D + A  DL +        NG N
Sbjct: 227 DLKGSNLTGANLNGTDLARA--DLERSDLRDAMTNGAN 262


>gi|428296910|ref|YP_007135216.1| RDD domain-containing protein [Calothrix sp. PCC 6303]
 gi|428233454|gb|AFY99243.1| RDD domain containing protein [Calothrix sp. PCC 6303]
          Length = 718

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 61/112 (54%), Gaps = 14/112 (12%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY---------- 158
           S+AQ   ADLR AV   EN  A+ T AD+ E+  + ++  GA L +A+A           
Sbjct: 540 SSAQMVGADLRNAVL--EN--ASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLT 595

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           K ++  ADLS + +DR+ L  ANL+ A L   +L  ++L GA +  AD + A
Sbjct: 596 KTDWQAADLSGSYLDRVNLTNANLSTARLTGAILRSANLEGANLRNADLTLA 647



 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 14/98 (14%)

Query: 130 ANFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM 175
            NF  A++ ++ F  S+F G              A L +A   +ANF+ A+LS  L+++ 
Sbjct: 458 VNFKGANLDQASFKNSRFRGPGDDGLWDTFDDAIADLSQAQLKQANFSEANLSRVLLNKS 517

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            L+ + L  A L  + L  ++L  A + GAD  +AV++
Sbjct: 518 DLSRSTLNKANLAGSRLIGANLSSAQMVGADLRNAVLE 555



 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 61/130 (46%), Gaps = 15/130 (11%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
           A+ADL++ + +          A F  A+L + +  K +       +AN   + +  ++ S
Sbjct: 490 AIADLSQAQLK---------QANFSEANLSRVLLNKSDLSRSTLNKANLAGSRLIGANLS 540

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
            ++  GA L  AV   A+ TGADL +  ++   L  A L  A+ +   L+ ++L     +
Sbjct: 541 SAQMVGADLRNAVLENASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLTKTDWQ 600

Query: 204 GADFSDAVID 213
            AD S + +D
Sbjct: 601 AADLSGSYLD 610


>gi|428216484|ref|YP_007100949.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427988266|gb|AFY68521.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 673

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 55/91 (60%), Gaps = 7/91 (7%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLNEANLTNA 185
           N  +A + E+DFS ++  GA L  A+A  A+     F+GADL++  +   +++E NLT A
Sbjct: 435 NLQNALLSETDFSDARLGGANLTGAIATGADLRGVDFSGADLTEANLTNAIMSEVNLTGA 494

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            L+R  L ++DL  A++ GA+   A  DL+Q
Sbjct: 495 RLLRANLKQADLNFAVLRGAELMRA--DLSQ 523



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 45/89 (50%), Gaps = 9/89 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A+ G A+L  A+          T AD+R  DFSG+    A L  A+  + N TGA L 
Sbjct: 447 SDARLGGANLTGAIA---------TGADLRGVDFSGADLTEANLTNAIMSEVNLTGARLL 497

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
              + +  LN A L  A L+R  L+++DL
Sbjct: 498 RANLKQADLNFAVLRGAELMRADLSQTDL 526



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 28/131 (21%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           A+++ES+ S ++   A LE AV   A+   A+L    ++   L E +L++A L     T+
Sbjct: 549 ANLQESNLSAAELENAQLEAAVLLLADLRSANLKLANLNYADLREVDLSSADL-----TQ 603

Query: 195 SDLGG----------------AIIEGADFSDAVIDLAQ--KQALCKYANGT----NPITG 232
           ++L G                A I+GADF+D V++LA   K   CK A G     +P   
Sbjct: 604 ANLIGANLSGANLRGTDVNQLASIDGADFTD-VVNLADTSKTYFCKIAAGQTFAESPEQR 662

Query: 233 VSTRKSLGCGN 243
            +TR +L C N
Sbjct: 663 RATRATLDCPN 673


>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 576

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 46/81 (56%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F  N + A +  +D S +K NGA L  A    A F GADLS   +  +VLN+A+L+  +L
Sbjct: 418 FSTNLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGIL 477

Query: 188 VRTVLTRSDLGGAIIEGADFS 208
               LT +DL  A++ G DFS
Sbjct: 478 SEADLTGADLSDAVLLGTDFS 498



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 67/117 (57%), Gaps = 13/117 (11%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + A FG A+L  +V++     AN +SAD+  ++ +G+  +GA LE+A     + + ADLS
Sbjct: 288 TGADFGDANL-SSVNLS---GANLSSADLSSANLTGANLSGANLERA-----DLSRADLS 338

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---VIDLAQKQALCK 222
             +++   L+ ANL+        L R++L  AI+ GA+ SDA    +DL++   LC+
Sbjct: 339 SCILNDGELSHANLSGVNFRDAELCRANLSNAILFGANLSDANLNHVDLSRAD-LCR 394



 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 47/84 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  AD+   D +G   N A L   +  +A+ TGADLSD ++     + ANL +A L  
Sbjct: 450 AMFLGADLSGVDLTGVVLNDADLSGGILSEADLTGADLSDAVLLGTDFSFANLNSANLSG 509

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
           + L+ + L GA +  A+FS A++D
Sbjct: 510 SNLSGAILNGADLSSANFSYAILD 533



 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 6/90 (6%)

Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 180
           NF+ A   +A++   +FSG+  +GAYL  A    ANF     TGAD  D  +  + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQGANLTGADFGDANLSSVNLSGA 305

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           NL++A L    LT ++L GA +E AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLERADLSRA 335



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 49/106 (46%), Gaps = 1/106 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   SADL  A     N   AN   AD+  +D S    N   L  A     NF  A+L
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLERADLSRADLSSCILNDGELSHANLSGVNFRDAEL 362

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
               +   +L  ANL++A L    L+R+DL  A + GAD + A ++
Sbjct: 363 CRANLSNAILFGANLSDANLNHVDLSRADLCRADLSGADLTHATLN 408



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 5/89 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLT 183
           RA+   AD+  +D + +  NG  L   + +  N +      ADLS   ++   LN A L 
Sbjct: 389 RADLCRADLSGADLTHATLNGTNLSDTILFSTNLSDAILEAADLSYAKLNGAKLNYARLN 448

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A+ +   L+  DL G ++  AD S  ++
Sbjct: 449 GAMFLGADLSGVDLTGVVLNDADLSGGIL 477


>gi|428225059|ref|YP_007109156.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984960|gb|AFY66104.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 315

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 10/119 (8%)

Query: 131 NFTSADMRESDFSGSKFNGAYL----------EKAVAYKANFTGADLSDTLMDRMVLNEA 180
           N    D+R +  SG+  +GA L                 AN +GA+LS   + R  LN A
Sbjct: 181 NLDGVDLRSTKLSGATLHGANLAATNFSDAKMHGGSFTGANLSGANLSRAFLKRANLNWA 240

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
           NLT A L    LT ++L GA IEGA+F+   +    ++ L   A G  P +   TR +L
Sbjct: 241 NLTRADLTDADLTEANLLGARIEGAEFTGVTLSDPTRRYLRLIATGVTPWSQQPTRSTL 299



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 23/125 (18%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSG----------SKFNGAYLEKAVAYKANFTGA- 165
           D+   +   E    NF   D+R +D SG          +   GA L +A   +AN +GA 
Sbjct: 2   DVNYLLRAYEAGERNFAGVDLRGADLSGVTLIAVDLSDANLMGANLSRAFLTQANLSGAF 61

Query: 166 ---------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
                     LS+  +  + L +ANL+ A +V++   R+ L GA + GA+   + +   Q
Sbjct: 62  LNWADLRYVKLSEGCLTHVDLTKANLSGAFMVKSDFNRAKLSGANLNGANLRGSHL---Q 118

Query: 217 KQALC 221
              LC
Sbjct: 119 HANLC 123


>gi|78189684|ref|YP_380022.1| pentapeptide repeat-containing protein [Chlorobium chlorochromatii
           CaD3]
 gi|78171883|gb|ABB28979.1| pentapeptide repeat family protein [Chlorobium chlorochromatii
           CaD3]
          Length = 389

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 5/88 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 184
           ANF  ADM+ +   G+   GA+ ++A   +AN  GA+L+  L     +D+  L  ANLT 
Sbjct: 270 ANFYKADMKGAQLQGANLQGAHCDRAFLLQANLQGANLTKALLFGATLDKADLRNANLTE 329

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A L       +DL GAI+  A+ +DAV+
Sbjct: 330 ASLFGANCEGADLRGAILTRANVTDAVL 357


>gi|220910076|ref|YP_002485387.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219866687|gb|ACL47026.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 332

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN    D+RE+D SG+   GA L     ++AN  GADLS++ +  + L  ANL  A L  
Sbjct: 181 ANLREVDLREADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQAKLTG 240

Query: 190 TVLTRSDLGGAIIEGADF 207
             LT ++L G +++ A  
Sbjct: 241 ATLTEANLAGVMMQRAQM 258



 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 1/99 (1%)

Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+LR A+    N F+AN   AD+  S+  G     A L++A    A  T A+L+ 
Sbjct: 191 ADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQAKLTGATLTEANLAG 250

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            +M R  + +  L  A L R  L  +DL GA + GA+ +
Sbjct: 251 VMMQRAQMFQVRLNRANLSRANLQGADLRGASLIGANLA 289



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 16/95 (16%)

Query: 131 NFTSADMRESDFSGSKFNGA----------------YLEKAVAYKANFTGADLSDTLMDR 174
           N    D+RE+D SG+   GA                 L  A+    +  GA+LS   + R
Sbjct: 111 NLIETDLREADLSGANLTGACLRSANLRTERRGTPVNLRGAILAGVDLRGANLSGASLVR 170

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           + L  ANL  A L    L  +DL GA + GA  +D
Sbjct: 171 VNLQGANLEEANLREVDLREADLSGANLRGALLTD 205


>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 331

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 66/138 (47%), Gaps = 10/138 (7%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA 130
           F  T L AA +   +  ++ L D N  +A+ RG       A    ADLR A     N R 
Sbjct: 87  FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139

Query: 131 N---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
               + S ++R +D  G+   G  L  A   +AN  GA+L++ ++   +LN+ NL+   L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLTECVLRGAILNQTNLSETNL 199

Query: 188 VRTVLTRSDLGGAIIEGA 205
              +LT  +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 7/126 (5%)

Query: 94  LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSA-----DMRESDFSGSK 146
           LNKY +  +   G+    A+  +ADL  A     +F+ ANF  A     ++  ++   +K
Sbjct: 7   LNKYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAK 66

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
             GA L +A    A  T AD   T++    L +ANLT A LV   L ++DL GA ++GAD
Sbjct: 67  LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126

Query: 207 FSDAVI 212
              A +
Sbjct: 127 LRGACL 132



 Score = 37.7 bits (86), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 5/85 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN ++A++  ++ S +    A L +     AN T ADL+D  + R  L  ANL+ A L R
Sbjct: 247 ANLSNANLSHANLSRANLVRAELNRTNLSSANLTQADLTDASLGRTNLRNANLSYAYLTR 306

Query: 190 TVLTRS-----DLGGAIIEGADFSD 209
           T  + +     +L GAI+   +  D
Sbjct: 307 TEFSSANTIGVNLHGAIMPNGEIHD 331


>gi|428314577|ref|YP_007151024.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428256301|gb|AFZ22256.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 281

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 6/111 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL +A   + N        AN + A + +++ S +  + A+L +A    AN 
Sbjct: 123 SRANLSRADLSEANLSRANLSRADLSDANLSPASLSDANLSRANLSRAFLSRANLSDANL 182

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           + A+LSD  + R  L+ ANL+ A L R  L+ ++LGGA + GA+F ++ ID
Sbjct: 183 SRANLSDANLSRADLSRANLSRANLSRADLSGANLGGANLSGANFRNSEID 233



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 49/86 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN    ++ E++ S +  + A L +A   +AN + ADLSD  +    L++ANL+ A L R
Sbjct: 110 ANLREINLSEANLSRANLSRADLSEANLSRANLSRADLSDANLSPASLSDANLSRANLSR 169

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
             L+R++L  A +  A+ SDA +  A
Sbjct: 170 AFLSRANLSDANLSRANLSDANLSRA 195



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A   +A++RE + S +  + A L +A   +AN + A+LS     R  L++ANL+ A L  
Sbjct: 105 APLENANLREINLSEANLSRANLSRADLSEANLSRANLS-----RADLSDANLSPASLSD 159

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
             L+R++L  A +  A+ SDA +  A
Sbjct: 160 ANLSRANLSRAFLSRANLSDANLSRA 185


>gi|33861206|ref|NP_892767.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           subsp. pastoris str. CCMP1986]
 gi|33639938|emb|CAE19108.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
           CCMP1986]
          Length = 157

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 55/112 (49%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F   D+++++ S      A L  A     N + ++L +  +D  VL+  +LTN  L  
Sbjct: 43  ATFYLTDLQDANLSDCDLQNASLYGAKLKDTNLSNSNLREVTLDSAVLDGTDLTNTNLED 102

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           +    +      I+GADF++  +     +  CK A+GTNP T   TR++L C
Sbjct: 103 SFAYSTQFENVKIQGADFTNVYLPKDVLREFCKDASGTNPFTNRETRETLEC 154


>gi|218438018|ref|YP_002376347.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218170746|gb|ACK69479.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 333

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 69/135 (51%), Gaps = 6/135 (4%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANF 132
           LA A++      ++ L D N   A+ RG    G   S A    A++R+    K++F  N 
Sbjct: 92  LAGAILQETDLTLALLIDANLIGADLRGADLSGANLSGACLKGANMRQE---KKSFNTNL 148

Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
             A++ ++D SG+   G  L KA    AN T A+L D  + ++ L  ANLTN +L    L
Sbjct: 149 QGANLFKADLSGANMKGVDLAKANLSGANLTEANLRDADLRKVDLTNANLTNTILSEANL 208

Query: 193 TRSDLGGAIIEGADF 207
           + ++L GA ++ A+ 
Sbjct: 209 SEANLTGATLKKANL 223



 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 11/115 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYL-----EKAVA 157
           S A    A L+KA  V+           NFT A M  ++   +   GA L       A  
Sbjct: 209 SEANLTGATLKKANLVRAKMMHTQLSEVNFTEAIMTHANLKAANLKGANLSLTRMNHADL 268

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +AN +GA L +  +  +    ANLT A L  T LTR+DL  A +  A+ + A++
Sbjct: 269 TRANLSGAILKEAELIEVFFARANLTGADLQGTNLTRADLMSANLSNANLTGAIM 323



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 1/101 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A     DL KA     N   AN   AD+R+ D + +      L +A   +AN TGA L
Sbjct: 159 SGANMKGVDLAKANLSGANLTEANLRDADLRKVDLTNANLTNTILSEANLSEANLTGATL 218

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
               + R  +    L+       ++T ++L  A ++GA+ S
Sbjct: 219 KKANLVRAKMMHTQLSEVNFTEAIMTHANLKAANLKGANLS 259



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 15/90 (16%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T A++R++D                 K + T A+L++T++    L+EANLT A L +
Sbjct: 176 ANLTEANLRDADL---------------RKVDLTNANLTNTILSEANLSEANLTGATLKK 220

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
             L R+ +    +   +F++A++  A  +A
Sbjct: 221 ANLVRAKMMHTQLSEVNFTEAIMTHANLKA 250



 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 15/87 (17%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL-----NEANLT 183
           RAN    ++  +D SG+  +          +A+ TGADL   ++  ++L      E +LT
Sbjct: 54  RANLAHTNLVTTDLSGANLS----------QADLTGADLRSAILHGIILAGAILQETDLT 103

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
            A+L+   L  +DL GA + GA+ S A
Sbjct: 104 LALLIDANLIGADLRGADLSGANLSGA 130


>gi|440793397|gb|ELR14582.1| K+ channel tetramerisation subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 381

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 17/108 (15%)

Query: 112 QFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           +F   DLR     A+H++   RANF   D+   D   +K NGA L +     AN +GA  
Sbjct: 229 KFNGCDLRGFDFHAMHLR---RANFHRCDLTGVDLRHAKLNGACLVECCLRDANLSGA-- 283

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                   VL+  +LT+A   R  LT +DL GA++ GAD S+A +D A
Sbjct: 284 --------VLSGVDLTDADCRRADLTNADLRGAVLSGADLSEAKLDRA 323


>gi|434384824|ref|YP_007095435.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428015814|gb|AFY91908.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 377

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 61/115 (53%), Gaps = 6/115 (5%)

Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DL +   ++ N  RAN   A++  +D  G+   GA L+KA   +AN  GA+L    ++ +
Sbjct: 200 DLAQTNLIRANLKRANLQGANLEGADLEGANLQGANLKKANLKRANLQGANLMIANLEGI 259

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDAVIDLAQKQALCKYAN 225
            L  ANL  A+L+R  L  ++L GA +EG     A+F  A +  A  QA   +AN
Sbjct: 260 NLVRANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANLQACHGHAN 314



 Score = 40.4 bits (93), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 1/82 (1%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   A +  ++  G+   GA LE A+   ANF GA LS   + +     AN   A L 
Sbjct: 263 RANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANL-QACHGHANFAGAYLS 321

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
           +     +DL GA +EGA+   A
Sbjct: 322 KANFEGADLEGANLEGANLQRA 343


>gi|254526129|ref|ZP_05138181.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9202]
 gi|221537553|gb|EEE40006.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9202]
          Length = 148

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  +DL+ A          F   D+++++ S  +   A L  A     N + ++L + 
Sbjct: 24  ADFSGSDLKGAT---------FYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNLREV 74

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  +L+  +L+N  L  +    +      I+GADF++  +     +  C+ A GTNPI
Sbjct: 75  TLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGTNPI 134

Query: 231 TGVSTRKSLGC 241
           T   TR++L C
Sbjct: 135 TNRDTRETLEC 145


>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 351

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 80/184 (43%), Gaps = 40/184 (21%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
           R F   +L AA+    + N   L+  N  EA       IG   S +Q   ADL  AV + 
Sbjct: 21  RNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80

Query: 126 ENFR-ANFTSADMRESDFSGSKFNGAYLE------------------------------- 153
            N   A+ T   + ++D SG+  +GA L                                
Sbjct: 81  ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCLLNGSQLTDAILV 140

Query: 154 -----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
                ++V   A+ TGA+L+ +++  + L+ ANLT A L+R  L + +L GA + GAD S
Sbjct: 141 GATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLS 200

Query: 209 DAVI 212
           ++VI
Sbjct: 201 ESVI 204



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 1/100 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A    AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLTGANL 249

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +   +    L  ANL+ A L    LT ++L GA +  AD 
Sbjct: 250 NGLTLQCADLRLANLSKADLRGANLTGANLAGANLLEADL 289



 Score = 37.0 bits (84), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 45/91 (49%), Gaps = 10/91 (10%)

Query: 130 ANFTSADMRESDFSGSKFNG----------AYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AN T A++  ++ +G+  NG          A L KA    AN TGA+L+   +    L  
Sbjct: 232 ANLTGANLTGANLTGANLNGLTLQCADLRLANLSKADLRGANLTGANLAGANLLEADLRL 291

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ANLT+A L    L  + L GA + GA+ + A
Sbjct: 292 ANLTDANLCGAGLLLTSLRGANLAGANLNQA 322


>gi|158340188|ref|YP_001521358.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158310429|gb|ABW32044.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 292

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 74/142 (52%), Gaps = 16/142 (11%)

Query: 109 SAAQFGSADLRKAVHVK-ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A F ++ L++++ +  + + ++F+ AD+R +DFS +K + A L++    +AN  GADL
Sbjct: 68  SGANFKASKLQRSLAIWVQAYWSDFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127

Query: 168 SDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
           SD+         A          NLTN  L +  +T SDL  A +  +D S + +     
Sbjct: 128 SDSEAQDACFKGANLWGVWAQRTNLTNVCLSQVDMTTSDLTEAQLSESDLSWSFL----S 183

Query: 218 QALCKYANGTNP-ITGVSTRKS 238
           QA+C  AN T+  + G   +K+
Sbjct: 184 QAVCVGANLTSACLEGSDLKKT 205



 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLN 178
           + + T++D+ E+  S S  + ++L +AV   AN T A          D  D  + R  L+
Sbjct: 159 QVDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACLSRADLS 218

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            A+  NA      L ++DL GA + GADF  A
Sbjct: 219 AADCENACFFNANLYKADLRGAKLCGADFRGA 250



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 11/113 (9%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  A L  A   + +F +AN   AD+ +S+   + F GA L    A + N T   LS 
Sbjct: 100 ADFSCAKLSAAQLKRTDFSQANLMGADLSDSEAQDACFKGANLWGVWAQRTNLTNVCLSQ 159

Query: 170 TLMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             M    L EA           L+ AV V   LT + L G+ ++  DF DA +
Sbjct: 160 VDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACL 212


>gi|448473532|ref|ZP_21601674.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
 gi|445819044|gb|EMA68893.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
          Length = 348

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 56/111 (50%), Gaps = 7/111 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++  +D S +    A L KA  Y AN +GADL+  L+D+  L  A+L       
Sbjct: 63  ANLRGANITGADLSSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGVGFTE 122

Query: 190 TVLTRSDLGGAIIEGADFSD------AVIDLAQKQALCKYAN-GTNPITGV 233
             LTR+DL  A + GA+FSD      AV D   + A    AN G   +TGV
Sbjct: 123 AHLTRADLHSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGV 173



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 11/105 (10%)

Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 157
           + A   SA+L  A+  K N + AN + AD          +R +D  G  F  A+L +A  
Sbjct: 71  TGADLSSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGVGFTEAHLTRADL 130

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           + A+  GA+ SD  +    + +A+L  A L    L  +DL G I+
Sbjct: 131 HSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGVIL 175


>gi|398354158|ref|YP_006399622.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
 gi|390129484|gb|AFL52865.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
          Length = 249

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 12/124 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A+  +A+L KA  V+ +       +ANF+  +    DFSG    GA    +   +A+F
Sbjct: 85  SGAELTAANLEKATLVRASLAGAKADKANFSRVEAYRGDFSGISAEGALFVSSELQRADF 144

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQ 216
           TGA L+    ++  L  AN   AVL  T      L+R++L GA+ EG  DF  A + L +
Sbjct: 145 TGARLTGADFEKAELGRANFGKAVLTGTRFSVANLSRANLSGALFEGPLDFDRAFLFLTR 204

Query: 217 KQAL 220
            + L
Sbjct: 205 IEGL 208



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----N 184
           ++    D   +D SG++   A LEKA   +A+  GA        R+     + +      
Sbjct: 72  SHLVDTDFASTDLSGAELTAANLEKATLVRASLAGAKADKANFSRVEAYRGDFSGISAEG 131

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+ V + L R+D  GA + GADF  A +
Sbjct: 132 ALFVSSELQRADFTGARLTGADFEKAEL 159


>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
 gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
          Length = 275

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 69/152 (45%), Gaps = 35/152 (23%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESD--------- 141
           +D  + EAE R +F   S + F  A++R     K N  +ANF  AD+R+ D         
Sbjct: 85  SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140

Query: 142 -FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR------ 194
            F G+        ++VA KA+F GA + D  ++R+ LN AN  +A + +  L R      
Sbjct: 141 IFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGANFQDARMRQAKLDRVKAQNA 200

Query: 195 --------------SDLGGAIIEGADFSDAVI 212
                         SDL GA + G DF  A++
Sbjct: 201 NFSGADFSGVRLVSSDLTGANLTGVDFDGALL 232



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 62/124 (50%), Gaps = 9/124 (7%)

Query: 99  AETRG---EFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY--- 151
           AE RG   E G  +       DL++A+    NF+ ++F   +   +DFSGS F+GA    
Sbjct: 50  AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109

Query: 152 --LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
             LEKA   KANF  ADL D  ++ +  NEA    A +   + TRS    A  +GA   D
Sbjct: 110 VDLEKANLNKANFQDADLRDGDLNTVEANEAIFDGADMRNVLFTRSVANKASFKGAKMDD 169

Query: 210 AVID 213
           A ++
Sbjct: 170 ANLE 173



 Score = 43.9 bits (102), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 20/124 (16%)

Query: 93  DLNKYEAETRGEFGIGSAAQFGSADLR-----KAVHVKENFR------ANFTSADMRESD 141
           DLN  EA         + A F  AD+R     ++V  K +F+      AN    D+  ++
Sbjct: 131 DLNTVEA---------NEAIFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGAN 181

Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
           F  ++   A L++  A  ANF+GAD S   +    L  ANLT       +L R+ L GA 
Sbjct: 182 FQDARMRQAKLDRVKAQNANFSGADFSGVRLVSSDLTGANLTGVDFDGALLRRTRLAGAD 241

Query: 202 IEGA 205
           + GA
Sbjct: 242 LSGA 245


>gi|357014784|ref|ZP_09079783.1| hypothetical protein PelgB_35370 [Paenibacillus elgii B69]
          Length = 843

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 14/119 (11%)

Query: 114 GSADLR--KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL---- 167
           G AD++  KAV   +   A   SAD++   F  +  + A L     Y A FTG DL    
Sbjct: 140 GLADIQATKAVVQTDLTWAYMASADLKSVSFEDADLSHADLSGCNLYGALFTGDDLKLSH 199

Query: 168 ----SDTL----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
               S TL    M+ +V++ A+ TNAV+    LT S+L G  + GAD +DA+I+ AQ Q
Sbjct: 200 TVFASATLSYARMNEIVIDSADFTNAVMTNVYLTNSNLQGNSLTGADMTDALINGAQFQ 258


>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 439

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 71/128 (55%), Gaps = 6/128 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S  + G A+L +      N + ++F SAD+ +++ +G+   G    +A   KAN  GA+L
Sbjct: 279 SEEKLGDANLEEVDLSNANLKQSDFESADLDKANLAGANLAGGNFSRADMEKANLKGANL 338

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANG 226
              ++DR  + +A+L+NA L    L  + L GA ++GAD ++A + D   ++A  K   G
Sbjct: 339 EGAVLDRAFMKQADLSNANLRNANLFGAMLSGANLDGADLTNASLFDANLEKASLK---G 395

Query: 227 TNPITGVS 234
           TN +TG +
Sbjct: 396 TN-LTGAN 402



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 14/152 (9%)

Query: 109 SAAQFGSADLRKA----VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           S A    A+LRKA     ++K   RA+   AD+ E+    +    A+L+ A   +AN +G
Sbjct: 81  SGASLDQANLRKANLSMTYLK---RADLKKADLSEAWMVSANLRDAFLKDARLSRANLSG 137

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQALCKY 223
            +L    +    L +ANL +A L  T   R++L G +   A F  +AV++ A      K 
Sbjct: 138 TNLRWAKLWDADLGQANLKDANLFETSFERANLKGTLFTKARFLENAVMNDA------KV 191

Query: 224 ANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
           +N T   +G    +     ++ R     PS+P
Sbjct: 192 SNNTVIPSGEPASRGWAMRHNSRFVQEEPSAP 223



 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 16/128 (12%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-T 170
           F SADL KA     N    NF+ ADM +++  G+   GA L++A   +A+ + A+L +  
Sbjct: 303 FESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLSNANLRNAN 362

Query: 171 LMDRMV--------------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           L   M+              L +ANL  A L  T LT ++L G  + GA  S + +  + 
Sbjct: 363 LFGAMLSGANLDGADLTNASLFDANLEKASLKGTNLTGANLIGINLTGAAISSSTLTPSG 422

Query: 217 KQALCKYA 224
           K A   +A
Sbjct: 423 KPATRSWA 430



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 6/90 (6%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           D S +    A L+ A   +AN + ADLS   +D+  L +ANL+   L R  L ++DL  A
Sbjct: 54  DLSKANLEDANLDGANLSEANLSKADLSGASLDQANLRKANLSMTYLKRADLKKADLSEA 113

Query: 201 IIEGADFSDAVIDLAQKQALCKYAN--GTN 228
            +  A+  DA +    K A    AN  GTN
Sbjct: 114 WMVSANLRDAFL----KDARLSRANLSGTN 139



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANFTGADLSDT 170
           DL KA     N   AN + A++ ++D SG+  + A L KA   + Y  +A+   ADLS+ 
Sbjct: 54  DLSKANLEDANLDGANLSEANLSKADLSGASLDQANLRKANLSMTYLKRADLKKADLSEA 113

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            M       ANL +A L    L+R++L G  +  A   DA  DL Q
Sbjct: 114 WMV-----SANLRDAFLKDARLSRANLSGTNLRWAKLWDA--DLGQ 152


>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 2544

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 5/113 (4%)

Query: 131  NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
            ++   D+   DFS + + G    +      NF GAD+     +   ++ ANL + V V +
Sbjct: 2434 DYAGIDISGQDFSNASYKGKDFTQV---NTNFEGADVRGVSFEDTSMDNANLKDIVAVGS 2490

Query: 191  VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 241
               +S +    +E  DF+DA I     + +C   +  GTNP TG  TR SL C
Sbjct: 2491 YFGQSLVDVKTLENGDFTDATIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 2543


>gi|298245086|ref|ZP_06968892.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297552567|gb|EFH86432.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 394

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 11/120 (9%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
           L  +N Y+++ R        A     DLR+A    +  RAN   A++RE+    +    A
Sbjct: 247 LYKINLYKSDLR-------EANLSKTDLREA----DISRANLYKANLRETFLLKANLYEA 295

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L +A   +AN + A+LS T + R  L +ANL+ A L+   L+R DL GA +  ADFS A
Sbjct: 296 DLHRANLSEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADLSKADFSGA 355



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 18/117 (15%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE 153
           N Y+A+ RG            AD  KA     N R AN   A++RE+D S      A+L 
Sbjct: 206 NLYKADLRG------------ADFSKATLCGANLREANLCEANLREADIS-----RAFLY 248

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           K   YK++   A+LS T +    ++ ANL  A L  T L +++L  A +  A+ S+A
Sbjct: 249 KINLYKSDLREANLSKTDLREADISRANLYKANLRETFLLKANLYEADLHRANLSEA 305



 Score = 40.8 bits (94), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   S DL+       +F  AN   AD+R +DFS +   GA L +A   +AN   AD+
Sbjct: 183 SQADMKSMDLKGVKAHNIDFSGANLYKADLRGADFSKATLCGANLREANLCEANLREADI 242

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S   + ++ L +++L  A L +T L  +D+  A +  A+  +  +
Sbjct: 243 SRAFLYKINLYKSDLREANLSKTDLREADISRANLYKANLRETFL 287



 Score = 40.4 bits (93), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 49/94 (52%), Gaps = 5/94 (5%)

Query: 95  NKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL 152
           N YEA+  R      S A    A+L K    + N  +AN + AD+  ++ S    +GA L
Sbjct: 291 NLYEADLHRANL---SEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADL 347

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
            KA    AN +GA+LS   ++  +LN+AN+  A+
Sbjct: 348 SKADFSGANLSGANLSGATLNEAILNKANIQQAL 381


>gi|37520785|ref|NP_924162.1| hypothetical protein gll1216 [Gloeobacter violaceus PCC 7421]
 gi|35211780|dbj|BAC89157.1| gll1216 [Gloeobacter violaceus PCC 7421]
          Length = 287

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 44/130 (33%), Positives = 63/130 (48%), Gaps = 9/130 (6%)

Query: 105 FGIGSAAQFGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           F +   A    ADL ++V++K  + R A    AD+R +   G+  +G+ LE A   K   
Sbjct: 137 FAVLPFADLSGADLSRSVNLKRADLRGARLVGADLRGAFLHGANLSGSRLEAADLMKVAL 196

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQK 217
            GA+LS   + R  L  A+L  A L RT L  +DL GA      +EGAD   A ++ A  
Sbjct: 197 AGANLSGADLSRANLRAAHLEGADLRRTNLGEADLAGAFLRGARLEGADLRRARLEGADL 256

Query: 218 QALCKYANGT 227
           +  C    GT
Sbjct: 257 E--CAATEGT 264


>gi|418020640|ref|ZP_12659878.1| putative low-complexity protein [Candidatus Regiella insecticola
           R5.15]
 gi|347604005|gb|EGY28733.1| putative low-complexity protein [Candidatus Regiella insecticola
           R5.15]
          Length = 148

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 21/116 (18%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFN------------GAYLEKAVAY 158
           A    AD+R+          +   ADMRE+   G K N            GA L   +  
Sbjct: 9   ATLNDADMREV---------DLVGADMREAKLIGKKTNLEGANLSGADLQGAELYHTILI 59

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           KA  + ADLS+  ++R+ L EANL +A+L  T L  + L  A +EG +  DAV+++
Sbjct: 60  KAVLSWADLSNAKLERVNLREANLYHAILEETSLYITKLENANLEGVNLKDAVLEV 115


>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
          Length = 351

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 84/197 (42%), Gaps = 42/197 (21%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
           N+    YA+    R F   +L AA+    + N   L+  N  EA       IG   S +Q
Sbjct: 10  NKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE------------------ 153
              ADL  AV +  N   A  T   + ++D SG+  +GA L                   
Sbjct: 68  LSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTC 127

Query: 154 ------------------KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
                             ++V   A+ TGA+L+ +++  + L+ ANLT A L+R  L + 
Sbjct: 128 LLNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQG 187

Query: 196 DLGGAIIEGADFSDAVI 212
           +L GA + GAD S++VI
Sbjct: 188 NLSGANLTGADLSESVI 204



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 1/100 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L +A   +AN T A+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLTRANL 249

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +   +    L  ANL+ A L    LT ++L GA +  AD 
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 49/101 (48%), Gaps = 13/101 (12%)

Query: 109 SAAQFGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           S A    A L + VH+ + N   AN T AD+ ES    S F  A L  A    AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                     LN ANLT A L R  LTR++L G  ++ AD 
Sbjct: 229 ----------LNGANLTRANLTRANLTRANLNGLTLQSADL 259


>gi|158316060|ref|YP_001508568.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
 gi|158111465|gb|ABW13662.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
          Length = 411

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 6/95 (6%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN T A++ ++D +G++   A L  A+ ++A  TGA L    +    L  A+LTNAVL 
Sbjct: 287 RANLTDAELVDADLTGARLADATLAGALLFRATLTGAQLGRADLTGAQLGGADLTNAVLD 346

Query: 189 RTVLTRSDLGG-----AIIEGADFSDAVIDLAQKQ 218
             +L  + L G     A ++GAD + A   LAQKQ
Sbjct: 347 EAILADAVLSGANLTNARLDGADLT-AATGLAQKQ 380



 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 60/128 (46%), Gaps = 18/128 (14%)

Query: 114 GSADLRKAVHVKENFRANFTSADMRES--DFSGSKFNGAYLEKAVAYKANFT-------- 163
           G ADL  A  +      N T AD R +  DF+G   +   L +A   +AN T        
Sbjct: 240 GHADLPGAPSLAHLTLTNATLADARLAGVDFTGGSLDDVDLARADLRRANLTDAELVDAD 299

Query: 164 --GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA-QKQAL 220
             GA L+D  +   +L  A LT A      L R+DL GA + GAD ++AV+D A    A+
Sbjct: 300 LTGARLADATLAGALLFRATLTGA-----QLGRADLTGAQLGGADLTNAVLDEAILADAV 354

Query: 221 CKYANGTN 228
              AN TN
Sbjct: 355 LSGANLTN 362


>gi|411118568|ref|ZP_11390949.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410712292|gb|EKQ69798.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 321

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 56/105 (53%), Gaps = 6/105 (5%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           AA    A+L +A+    N   AN T A++ E+  S ++  GA L++A   KAN T ADLS
Sbjct: 194 AANLSGANLGRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKANLTEADLS 253

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
                   L+ A L  AV+V  V     L  AI+ GADFSDA ID
Sbjct: 254 WASFRGTNLSAATLHKAVMVDVV-----LDAAILRGADFSDATID 293



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 41/80 (51%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF + D+   +   +   GA L KAV ++A+ TGA+L D  +  + L   NLT A L   
Sbjct: 16  NFDTVDLSGVNLRQADLRGASLRKAVLFEADLTGANLVDVELHGVALRHTNLTAACLAGV 75

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L  +DL  A +  AD S A
Sbjct: 76  KLVGADLSAAQLVRADLSGA 95



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 61/126 (48%), Gaps = 21/126 (16%)

Query: 109 SAAQFGSADLRKA-----------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
           SAAQ   ADL  A           +H     R N  +A++ E+D + ++ + A L +A  
Sbjct: 83  SAAQLVRADLSGANLWRSLLRNANLHAANLERTNLHAANLVEADLTTARLSHANLAEANL 142

Query: 158 YKANFTGADL----------SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             A+ TGA L          S + +  + L +A+L  AVLV   L+R++L  A + GA+ 
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202

Query: 208 SDAVID 213
             A+++
Sbjct: 203 GRALLE 208



 Score = 40.4 bits (93), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 6/93 (6%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTG 164
           A+   A++R A +   E  +AN T AD+  + F G+  + A L KAV        A   G
Sbjct: 225 ARLSLAEMRGAKLDQAELTKANLTEADLSWASFRGTNLSAATLHKAVMVDVVLDAAILRG 284

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           AD SD  +D   LN+++LT  +L   VL  S L
Sbjct: 285 ADFSDATIDPACLNQSSLTWVILPSGVLQISSL 317



 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A LR    V+  F R+     D+ ++D   +      L +A    AN +GA+L
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              L++ + L  ANLT A L+   L+ +++ GA ++ A+ + A
Sbjct: 203 GRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKA 245


>gi|393766611|ref|ZP_10355166.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
 gi|392727929|gb|EIZ85239.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
          Length = 448

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 5/89 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 184
           A F  A MR +D SG+  +GA   +A  + A+F+GAD  DT+     +D   L +ANLT+
Sbjct: 133 ARFGQAAMRFADLSGALLDGASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTH 192

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A      LT++ L G+ + GA F+ A +D
Sbjct: 193 ADFEGASLTKASLAGSRLRGAKFTGAKLD 221



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 54/182 (29%), Positives = 75/182 (41%), Gaps = 16/182 (8%)

Query: 74  TALAAAVVASCSSNISAL----ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
           TALAA   A   +    L    ADL++   E          A    A+LR+A       R
Sbjct: 41  TALAAGGTAPADAESGGLPLAEADLSRARIEE---------ADLSGANLRRASLTGAVGR 91

Query: 130 AN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  F  A + E+D S +  +GA     VA +  F  A L D    +  +  A+L+ A+L 
Sbjct: 92  STRFVGAILEETDLSEADMSGADFTGIVAGQVKFASAMLEDARFGQAAMRFADLSGALLD 151

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNP-ITGVSTRKSLGCGNSRR 246
                 +DL GA   GAD  D V  D    +A    AN T+    G S  K+   G+  R
Sbjct: 152 GASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTHADFEGASLTKASLAGSRLR 211

Query: 247 NA 248
            A
Sbjct: 212 GA 213



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEANLTN 184
           A+F  A + ++  +GS+  GA    A    A+ +GADLSDT + R+      L  A    
Sbjct: 193 ADFEGASLTKASLAGSRLRGAKFTGAKLDGADLSGADLSDTDLVRLNLATCRLRHARFAG 252

Query: 185 AVLVRTVLTRSDLGGAIIE 203
           A L  T ++   LGGA+ E
Sbjct: 253 AWLNGTRMSVEQLGGAVGE 271


>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 351

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 84/197 (42%), Gaps = 42/197 (21%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
           N+    YA+    R F   +L AA+    + N   L+  N  EA       IG   S +Q
Sbjct: 10  NKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE------------------ 153
              ADL  AV +  N   A  T   + ++D SG+  +GA L                   
Sbjct: 68  LSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTC 127

Query: 154 ------------------KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
                             ++V   A+ TGA+L+ +++  + L+ ANLT A L+R  L + 
Sbjct: 128 LLNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQG 187

Query: 196 DLGGAIIEGADFSDAVI 212
           +L GA + GAD S++VI
Sbjct: 188 NLSGANLTGADLSESVI 204



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 1/100 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A   +AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLTGANL 249

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +   +    L  ANL+ A L    LT ++L GA +  AD 
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289



 Score = 43.9 bits (102), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 48/101 (47%), Gaps = 13/101 (12%)

Query: 109 SAAQFGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           S A    A L + VH+ + N   AN T AD+ ES    S F  A L  A    AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                     LN ANLT A L R  LT ++L G  ++ AD 
Sbjct: 229 ----------LNGANLTGANLTRANLTGANLNGLTLQSADL 259



 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN   + + E D SG+   GA     +L +     AN TGADLS++++       ANLT 
Sbjct: 157 ANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSESVIQNSNFCIANLTG 216

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           A L    LT ++L GA + GA+ + A
Sbjct: 217 ANLTGANLTGANLNGANLTGANLTRA 242



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 20/151 (13%)

Query: 78  AAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSA 135
           A+++ +C  N S L D     A  TR      S A    A+L +++  + +   AN T A
Sbjct: 121 ASLIGTCLLNGSQLTDAILVGATLTRSVL---SGAHMTGANLNRSILSEIDLSGANLTGA 177

Query: 136 -----DMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLTNA 185
                 + + + SG+   GA L ++V   +NF     TGA+L+   +    LN ANLT A
Sbjct: 178 TLIRVHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGA 237

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            L     TR++L GA + G     A + LA 
Sbjct: 238 NL-----TRANLTGANLNGLTLQSADLRLAN 263



 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 159
           A    A+L  A     N   AN T A++  ++ +G+  NG          A L KA    
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTRANLTGANLNGLTLQSADLRLANLSKADLRG 271

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN TGA+L+   +    L  ANLT+A L    L  + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322


>gi|86605838|ref|YP_474601.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86554380|gb|ABC99338.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 158

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N   AD+R +D S +   GA L  A  ++AN  GADLS   +    L+ A L  A L R 
Sbjct: 55  NLQEADLRGADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHGAYLWEAKLTRA 114

Query: 191 VLTRSDL-----GGAIIEGADFSDAVI 212
            L  SDL     GGA++ GAD S A++
Sbjct: 115 QLQGSDLSGAKIGGAVLTGADLSGAIL 141



 Score = 43.9 bits (102), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 8/118 (6%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FR 129
            V   L  A +   + +   L+ +N  EA+ RG       A   SA+L  A     N + 
Sbjct: 31  LVRATLQGANLRGANLSFGKLSGINLQEADLRG-------ADLSSANLMGANLRGANLWE 83

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           AN   AD+  +D   +  +GAYL +A   +A   G+DLS   +   VL  A+L+ A+L
Sbjct: 84  ANLIGADLSFADLREANLHGAYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLSGAIL 141


>gi|300023195|ref|YP_003755806.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299525016|gb|ADJ23485.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
           51888]
          Length = 282

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 60/112 (53%), Gaps = 2/112 (1%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           FG+ + + F  ADL  A+    N +  F     R +D SG+  +GA L +A   +  F+ 
Sbjct: 149 FGVFAGSNFAGADLTDAISAPLN-KTGFIEYIWR-TDLSGANLSGAQLTRANMTQTRFSF 206

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           A L D  +   +L EA+L+ AVL    L+ +DL GA + GAD + A +D A+
Sbjct: 207 AVLRDASLHDTILREADLSGAVLTGADLSGADLTGADLSGADVTGANLDGAK 258


>gi|119487930|ref|ZP_01621427.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119455506|gb|EAW36644.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 276

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   SADL  A   + N R  N T++ + E+   G+    AYL +A     NFT ADL
Sbjct: 38  SGANLISADLSHANLCQTNLRGINLTNSTLSEARLRGADLCDAYLSEA-----NFTRADL 92

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S+  +    L EANLT+A LV T L  ++L  A ++ A+ S+A
Sbjct: 93  SEAQLLNAYLKEANLTHAQLVNTNLNGANLSNAKLQNANLSNA 135



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 5/80 (6%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            ANFT AD+ E+    +    A L  A     N  GA+LS+       L  ANL+NA L+
Sbjct: 84  EANFTRADLSEAQLLNAYLKEANLTHAQLVNTNLNGANLSNA-----KLQNANLSNANLL 138

Query: 189 RTVLTRSDLGGAIIEGADFS 208
            TVLT  +L GA + GA+ +
Sbjct: 139 NTVLTGVNLTGANLNGANLT 158



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 50/114 (43%), Gaps = 10/114 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A F  ADL +A    +   A    A++  +    +  NGA L  A    AN + A+L 
Sbjct: 83  SEANFTRADLSEA----QLLNAYLKEANLTHAQLVNTNLNGANLSNAKLQNANLSNANLL 138

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           +T++  + L  ANL  A L    L R +L G  I      D    L+QK  L +
Sbjct: 139 NTVLTGVNLTGANLNGANLTGVELCRVNLNGTQI------DENTQLSQKWLLVQ 186


>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
 gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
          Length = 830

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 55/104 (52%), Gaps = 16/104 (15%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 164
           A  G ADLR+A   + NF +A    AD+R     ESDF  +    A   +A   +ANF+G
Sbjct: 227 ADLGGADLRRADLSRANFSQARLRQADLRQVLFSESDFRHADARRADFREATLRQANFSG 286

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           ADLS     R + +  +LT  V       +++L GA++EGAD S
Sbjct: 287 ADLS-----RAIFSGTDLTGGVF-----QQANLAGAVLEGADLS 320



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 2/111 (1%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           E R +  +G   Q  + D  + +  K+    N    D+R  +F+ S+ +G  L+ A    
Sbjct: 134 EAREQIAMGQVQQALAGD--RNLQGKDLSTLNLAGLDLRGVNFADSRLHGVNLQGANLRG 191

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+F+ ADL    +    L EA L +A L R  L  +DLGGA +  AD S A
Sbjct: 192 ADFSRADLMHADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRA 242



 Score = 43.9 bits (102), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 63/143 (44%), Gaps = 33/143 (23%)

Query: 90  ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFR---------------- 129
           ALADL   +      +R  F   S A+   ADLR+ +  + +FR                
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF+ AD+  + FSG+   G   ++A    A   GADLS     R+      L    +V+
Sbjct: 282 ANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLEGADLS-----RLA-----LAGVKMVK 331

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L  S+L GA + G D +DA +
Sbjct: 332 ANLAGSNLYGADLRGVDLTDASL 354



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 11/114 (9%)

Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A    ADLR+A  V  N             A+   AD+  ++FS ++   A L + +  +
Sbjct: 202 ADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQADLRQVLFSE 261

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           ++F  AD          L +AN + A L R + + +DL G + + A+ + AV++
Sbjct: 262 SDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLE 315



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 53/109 (48%), Gaps = 9/109 (8%)

Query: 109 SAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFSG--SKFNGAYLEKAVAYKA 160
           S A F  A+L  AV  +      +   AN T+A++  +D +   S   G  L  A   KA
Sbjct: 410 SQADFTGANLTAAVFSEAIMAGAKLLEANLTNANLDGADLTSRVSMIRG-NLTNASLQKA 468

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           +  GADLS+ ++   VL EANL    L    L R+DL  A I  AD S+
Sbjct: 469 DLHGADLSNAIVTGAVLREANLRRVRLSHASLNRADLSWATIVDADLSN 517



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 77/180 (42%), Gaps = 39/180 (21%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
           VF    LA AV+     +  ALA +   +A   G    G  A     DL  A  ++ +  
Sbjct: 303 VFQQANLAGAVLEGADLSRLALAGVKMVKANLAGSNLYG--ADLRGVDLTDASLLEADLS 360

Query: 130 A-NFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTGADL- 167
           A +   A + ++ F+G   +GA L  AVA                     +A+FTGA+L 
Sbjct: 361 AADLAGARLDKAVFAGGTLHGARLLSAVARNADFRAANLTRVAAQQADFSQADFTGANLT 420

Query: 168 ----SDTLMDRMVLNEANLTNAVL-----------VRTVLTRSDLGGAIIEGADFSDAVI 212
               S+ +M    L EANLTNA L           +R  LT + L  A + GAD S+A++
Sbjct: 421 AAVFSEAIMAGAKLLEANLTNANLDGADLTSRVSMIRGNLTNASLQKADLHGADLSNAIV 480



 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 41/84 (48%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+   AD+ E+D   +K   A L +A    A+  GADL    + R   ++A L  A L 
Sbjct: 196 RADLMHADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQADLR 255

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
           + + + SD   A    ADF +A +
Sbjct: 256 QVLFSESDFRHADARRADFREATL 279



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 1/82 (1%)

Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DLR A  V  N R A+   AD+  +D   +    A L ++     N T ADLS  ++   
Sbjct: 554 DLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDLRWVNLTDADLSGAILSGA 613

Query: 176 VLNEANLTNAVLVRTVLTRSDL 197
            LN+A+   AV     LTR+ L
Sbjct: 614 SLNDADFNRAVFAEANLTRASL 635


>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 740

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+LR       N R      A+   AD+R +D  G+ F GA L +A  Y+AN T 
Sbjct: 575 ANLAHANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITE 634

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + +   + R+  N ++L +A L+R  L++S L  A ++GA+ S +
Sbjct: 635 GNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSG 144
           L  +N   A  RG  G    A    ADLR A     NF      RANF  A++ E +F+G
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           +        ++    A     DLS + +    L  ANL+ + L  T  TR+DL  A   G
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSNAKFNG 699

Query: 205 ADFSDAVI 212
           AD S  +I
Sbjct: 700 ADLSFTLI 707



 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)

Query: 111 AQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           +QF   DLR    K V++K     +F  ADMRE +  G       L      KAN + A 
Sbjct: 430 SQFQGQDLRQKNLKGVNLKT---IDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAI 486

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
           L+ + +    L  AN+  A LV+T L R+DL    +  A  + A +  A  ++ C
Sbjct: 487 LNGSKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 4/111 (3%)

Query: 95  NKYEAE-TRGEFGIGSAAQ--FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
           N Y+A  T G F   +  +  F  +DLR A  ++ +  ++   SA ++ ++ S S   G 
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
              +A    A F GADLS TL+    L+ A+LTNA L +  L  S+  G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736


>gi|303289212|ref|XP_003063894.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454962|gb|EEH52267.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 124

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 22/122 (18%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           +T   M+ ++FS S  +G  L       ANFTGADLS+          AN+    L  T+
Sbjct: 12  YTKGSMKRANFSNSNLSGVTLFGGDLSYANFTGADLSN----------ANIGQCNLTGTI 61

Query: 192 LTRSDLGGAIIEGA-----------DFSDAVIDLAQKQALC-KYANGTNPITGVSTRKSL 239
            T ++L GAI+ GA           D++D ++       +C K  +G NP+TG  T  +L
Sbjct: 62  FTNANLSGAIVSGANMDELGDITGSDWTDVIVRKDVNDKICAKGVSGENPVTGNPTAMTL 121

Query: 240 GC 241
            C
Sbjct: 122 FC 123


>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 740

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+LR       N R      A+   AD+R +D  G+ F GA L +A  Y+AN T 
Sbjct: 575 ANLAHANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITE 634

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + +   + R+  N ++L +A L+R  L++S L  A ++GA+ S +
Sbjct: 635 GNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSG 144
           L  +N   A  RG  G    A    ADLR A     NF      RANF  A++ E +F+G
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           +        ++    A     DLS + +    L  ANL+ + L  T  TR+DL  A   G
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSNAKFNG 699

Query: 205 ADFSDAVI 212
           AD S  +I
Sbjct: 700 ADLSFTLI 707



 Score = 40.4 bits (93), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)

Query: 111 AQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           +QF   DLR    K V++K     +F  ADMRE +  G       L      KAN + A 
Sbjct: 430 SQFQGQDLRQKNLKGVNLKT---IDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAI 486

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
           L+ + +    L  AN+  A LV+T L R+DL    +  A  + A +  A  ++ C
Sbjct: 487 LNGSKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 4/111 (3%)

Query: 95  NKYEAE-TRGEFGIGSAAQ--FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
           N Y+A  T G F   +  +  F  +DLR A  ++ +  ++   SA ++ ++ S S   G 
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
              +A    A F GADLS TL+    L+ A+LTNA L +  L  S+  G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736


>gi|334121546|ref|ZP_08495612.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333454932|gb|EGK83604.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 388

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 62/118 (52%), Gaps = 19/118 (16%)

Query: 111 AQFGSADLRKAVHVKENFR-ANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+L KA  +K N   ANF     + A ++E+D + ++  GA L KA    AN T 
Sbjct: 143 AVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTGAELSKANLAGANLTR 202

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           A+LS          +ANL  A L RT LT++ L GA + G+D S+A +D A    LCK
Sbjct: 203 ANLS----------KANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRAN---LCK 247



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL  A     N  A      N   A++  ++ +G+  N A L  A+   AN 
Sbjct: 71  SRANLSKADLSGANLTGANLMAASLSGANLIGANLTGANLAGAHLNWANLTGAILPNANL 130

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
            GAD+S   + + VL EANL+ A L++          A + GA+F DA + LA
Sbjct: 131 IGADMSAANLTKAVLTEANLSKAYLIK----------ANLNGANFQDAYLSLA 173



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 42/81 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   ADM  ++ + +    A L KA   KAN  GA+  D  +    L EA+LT A L  
Sbjct: 128 ANLIGADMSAANLTKAVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTG 187

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L++++L GA +  A+ S A
Sbjct: 188 AELSKANLAGANLTRANLSKA 208



 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 8/108 (7%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AQ   A+L KA     N  RAN + A++ +++   +    AYL  A        G+DLS+
Sbjct: 183 AQLTGAELSKANLAGANLTRANLSKANLLKANLRRTNLTQAYLNGAC-----LIGSDLSE 237

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
             +DR  L +A+L+   L    L  S L G    GAD S   +DL++K
Sbjct: 238 ACLDRANLCKADLSKTYLRNITLNGSHLSGINFSGADLSG--VDLSRK 283



 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A     DL + +    N        AN + A + E++ SG+  + A L  ++AY  N 
Sbjct: 271 SGADLSGVDLSRKLLTGINMAEALLNEANLSGAYLMEANLSGANLSKANL--SLAYLIN- 327

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             ADLS++ +  + L++ANL+ A L +  LT ++L GAI+  AD + A
Sbjct: 328 --ADLSNSCLHEINLSKANLSKASLQKADLTGANLRGAILTEADLTGA 373



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   AD+ ++       NG++L        NF+GADLS   + R +L   N+  A+L 
Sbjct: 242 RANLCKADLSKTYLRNITLNGSHLS-----GINFSGADLSGVDLSRKLLTGINMAEALLN 296

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
              L+ + L  A + GA+ S A + LA
Sbjct: 297 EANLSGAYLMEANLSGANLSKANLSLA 323


>gi|347735787|ref|ZP_08868588.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
 gi|346920906|gb|EGY01818.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
          Length = 451

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 9/93 (9%)

Query: 117 DLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           DLR A+ VK +         N   AD+ E++ SG+K +GA L +A+   AN + A L   
Sbjct: 178 DLRGAIFVKADLSGSDLTGCNLEGADLSEANLSGTKLDGAVLTRALLRSANLSKASLLGA 237

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           L+D + L+ ANLT A LVR +    D+ G I E
Sbjct: 238 LLDDVDLSMANLTGADLVRRL---DDIEGTIGE 267



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 69/155 (44%), Gaps = 51/155 (32%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
           L+D +  EA+ R      SA  FG ADLR+A       R N T AD+R     G+ F GA
Sbjct: 69  LSDADLSEADLR------SACLFG-ADLRRATLE----RTNLTRADLR-----GAAFRGA 112

Query: 151 YLEKAVAYKANFT--------------------GADLSDTLMDRMVLNEANLTNAVLVRT 190
            + + V  +A+                       A+L++  M +  L+ A L+NA +V+T
Sbjct: 113 SMRRVVMVEADLRDGHLMRSKNGELTPNVQGNPSAELAEASMTKADLSYAKLSNAFVVQT 172

Query: 191 VLTRSDLGGAI---------------IEGADFSDA 210
            L  +DL GAI               +EGAD S+A
Sbjct: 173 DLRDTDLRGAIFVKADLSGSDLTGCNLEGADLSEA 207



 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 64/140 (45%), Gaps = 25/140 (17%)

Query: 82  ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSA----- 135
           A+ S  + A+ADL+   A+ RG       A+   ADLR A   +     AN  +A     
Sbjct: 316 ANLSGTVLAMADLSM--ADLRG-------AELAGADLRGACLERATLNEANLANAVACPM 366

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV-----LVRT 190
           D+R      + F+     KA   + N  GADL+   ++   L++ NL NAV     L   
Sbjct: 367 DLRNGHEWPTNFS-----KARMVRVNLAGADLTMARLEDGDLSQGNLRNAVLAGACLTDA 421

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            LT +DL GA I  ADF  A
Sbjct: 422 TLTMADLRGADIRNADFRGA 441



 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 44/103 (42%), Gaps = 25/103 (24%)

Query: 131 NFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           N ++A +R++   G+  +G     A L  A    A   GADL    ++R  LNEANL NA
Sbjct: 302 NLSAAILRQTTLKGANLSGTVLAMADLSMADLRGAELAGADLRGACLERATLNEANLANA 361

Query: 186 V--------------------LVRTVLTRSDLGGAIIEGADFS 208
           V                    +VR  L  +DL  A +E  D S
Sbjct: 362 VACPMDLRNGHEWPTNFSKARMVRVNLAGADLTMARLEDGDLS 404


>gi|123968240|ref|YP_001009098.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. AS9601]
 gi|123198350|gb|ABM69991.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. AS9601]
          Length = 157

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  +DL+ A          F   D+++++ S  +   A L  A     N + ++L + 
Sbjct: 33  ADFSGSDLKGAT---------FYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNLREV 83

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  +L+  +L+N  L  +    +      I+GADF++  +     +  C+ A GTNPI
Sbjct: 84  TLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIRKFCESATGTNPI 143

Query: 231 TGVSTRKSLGC 241
           T   TR++L C
Sbjct: 144 TNRETRETLEC 154


>gi|334117749|ref|ZP_08491840.1| stress protein [Microcoleus vaginatus FGP-2]
 gi|333460858|gb|EGK89466.1| stress protein [Microcoleus vaginatus FGP-2]
          Length = 578

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 57/105 (54%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S+A   +A L +   +  N + AN  S +++ +D   +  +GA L KA+ Y A    A+L
Sbjct: 312 SSANLANAKLIQVNLIGSNLQGANLNSTNLQSADLIEANLSGANLTKAILYYARLIHANL 371

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S   +    L++ANLT A L R  LT++ LG A + GAD S + +
Sbjct: 372 SQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADLSQSKV 416



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 1/101 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L  A  +  N  +AN + A + +++ + +  + A L +A    AN TGADL
Sbjct: 352 SGANLTKAILYYARLIHANLSQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADL 411

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           S + + ++ L+ ANL+   L    LT  +L G  + G + S
Sbjct: 412 SQSKVTKVNLSGANLSGVNLTGVSLTGVNLQGVNLSGMNLS 452


>gi|434392029|ref|YP_007126976.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428263870|gb|AFZ29816.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 532

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 11/115 (9%)

Query: 110 AAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           A Q  +A+L  +  +  NF            A+ + AD+R++D SG+   GA L  A   
Sbjct: 310 ATQLNNANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANLT 369

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
                GADLS+  +   +LN A L NA++ +T LT +D   A + GAD  +A+ D
Sbjct: 370 GVELDGADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNATLTGADLKEAIGD 424



 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 17/117 (14%)

Query: 111 AQFGSADLRKAV-HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 163
           A    ADL++A+     NF  AN   A +    F GS F  A L      KANFT     
Sbjct: 411 ATLTGADLKEAIGDSLTNFTGANLNGASLEVGSFIGSNFTDAALRDTNLIKANFTDALFI 470

Query: 164 --------GADL-SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
                   GADL S T +D + +N  +  NA+LV   LT+++  GA + GA+ S A+
Sbjct: 471 DGSDANSVGADLTSSTFIDGIAIN-GDFRNALLVNANLTKANFTGANLAGANLSGAI 526



 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 6/117 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F     +A++ +S   G+ F+    E      A+ +GADL D  +    L  ANL+ A L
Sbjct: 309 FATQLNNANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANL 368

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVID--LAQKQAL--CKYANGTNPITGVSTRKSLG 240
               L  +DL  A + GA  + AV+D  L QK  L    + N T  +TG   ++++G
Sbjct: 369 TGVELDGADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNAT--LTGADLKEAIG 423


>gi|163797791|ref|ZP_02191737.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
 gi|159176913|gb|EDP61479.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
          Length = 427

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 70/157 (44%), Gaps = 38/157 (24%)

Query: 92  ADLNK---YEAETRGEFGIGS---AAQFGSADLRKAVHVKENF-------RANFTSADMR 138
           ADLN      A+ RG F  GS    A    ADLR    +  N        R+N   +DM 
Sbjct: 78  ADLNHALLIRADLRGAFMRGSNLAGANLKEADLRGGALISGNLAAPATIIRSNIGQSDMD 137

Query: 139 ESDFSGSKFN----------GAYLEKAV----------AYKANFTGADLSDTLMD--RMV 176
           E+D  G+  +          GA LEK +             AN  GADLS   +   R++
Sbjct: 138 EADMGGANLSGTDLSHSSMIGATLEKTLLCGANLSGVNLEGANLQGADLSGANLSSARII 197

Query: 177 ---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              L+ ANL+ A++ RT   +S+L GAI+E  D S A
Sbjct: 198 GANLSGANLSGALIHRTQFQKSELHGAILENVDLSTA 234



 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 16/115 (13%)

Query: 116 ADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           ADLR+A+ V    R           +N + AD+R+S+ +G    GA L  A    A+ T 
Sbjct: 294 ADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDSELAGINLAGANLTNARIAGADLTS 353

Query: 165 ADLSD---TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            +L         R+ +  +NL+ AVLV   LT + L GA ++GAD + A +  AQ
Sbjct: 354 VELKGPDGQATGRLWV--SNLSGAVLVNADLTGARLTGANLKGADLTGAKLARAQ 406



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 3/94 (3%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN +  ++  ++  G+  +GA L  A    AN +GA+LS  L+ R    ++ L  A+L  
Sbjct: 169 ANLSGVNLEGANLQGADLSGANLSSARIIGANLSGANLSGALIHRTQFQKSELHGAILEN 228

Query: 190 TVLTRSDLGGAII---EGADFSDAVIDLAQKQAL 220
             L+ +DL GA +   +G   S ++ D+  + A+
Sbjct: 229 VDLSTADLSGANLTSGDGRGLSRSLRDILHEHAV 262



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 32/57 (56%)

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +A   K   TG D+SD  +    L EA L +AV+ RT L  SDL G+ + GAD  D+
Sbjct: 273 RAQLAKTELTGIDVSDVNLSGADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDS 329


>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 517

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 57/98 (58%), Gaps = 1/98 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F +A+LR+A     N   A+F+ A++R +D  G+  +GA L +A    AN +GA+LS 
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            ++ +  L  A+L+ A L+R   + +DL GA + GA  
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKL 286



 Score = 44.7 bits (104), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
           L  A++   + N++ L   +  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138

Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
           AD+RES    + FNGA L                      A   KAN   AD ++  + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+NA      L  +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234



 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 23/144 (15%)

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           TR    +   A+  +A+L KA+  +     AN   AD+ E+    +    A L +A   K
Sbjct: 72  TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128

Query: 160 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 204
           ANFT     GADL ++ + +   N ANL+ A L       T  T++DL GA      +  
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188

Query: 205 ADFSDAVIDLAQKQALCKYANGTN 228
           ADF++A +    +QA   YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208


>gi|428218432|ref|YP_007102897.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427990214|gb|AFY70469.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 403

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 51/102 (50%), Gaps = 14/102 (13%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    ADL+    VK    AN T A + ++D S     GAYL  A   +AN  GA     
Sbjct: 14  ASLTRADLKGVDLVK----ANLTGASLSDADLSQVNLTGAYLNGADLNRANLAGA----- 64

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                +L+EANL  A L+R  L R+ L  AI+ GA+F +A +
Sbjct: 65  -----ILDEANLAAAFLIRANLQRASLNEAILAGANFHEASL 101



 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 5/82 (6%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN + A++R +D +G+  + A +  A  ++ + TGA+L     D+  LN A+L NA L 
Sbjct: 158 KANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANL-----DQTNLNAADLVNASLD 212

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
              L+R++LG A + G    +A
Sbjct: 213 GAFLSRANLGWANLIGTTMKEA 234



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 11/108 (10%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVA-----YK 159
           A    A L +A+    NF       AN  SAD+  +D +G+   GA L  A        +
Sbjct: 79  ANLQRASLNEAILAGANFHEASLTGANLRSADLSLADLAGADLAGANLSDACMNSAFFIE 138

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           AN  GADLS T +    L +ANL+ A L    LT +DL  A + GA+ 
Sbjct: 139 ANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSHATMTGAEL 186



 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 1/95 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   S +++ A+ V+ N   AN  +A++  ++   +  NGA L +A   +AN +GA L
Sbjct: 302 SGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSGASL 361

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           +   +    L  ANL    L+   L  ++L GAI+
Sbjct: 362 TGANLKGAFLLWANLKGTFLLWANLDEANLTGAIL 396



 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 46/89 (51%), Gaps = 5/89 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
            AN + AD+  ++  G+  +GA L     + A+  + N  GA+L++  +    L +ANL 
Sbjct: 283 NANLSGADLSNTNLMGTSLSGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLN 342

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A L    L R++L GA + GA+   A +
Sbjct: 343 GANLGEANLNRANLSGASLTGANLKGAFL 371



 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 10/89 (11%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN----- 184
           AN T A +  ++ +G+  NGA L       AN +GADLS+T +    L+ A+L++     
Sbjct: 259 ANLTGAFLMGANLNGANLNGANL-----TNANLSGADLSNTNLMGTSLSGADLSSTEMKG 313

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A+LVRT L  ++L  A + GA+   A ++
Sbjct: 314 AILVRTNLNGANLANANLTGANLEQANLN 342



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 15/90 (16%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN T+A +  +D  G              KAN TGA LSD       L++ NLT A L 
Sbjct: 8   KANLTNASLTRADLKGVDL----------VKANLTGASLSDA-----DLSQVNLTGAYLN 52

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
              L R++L GAI++ A+ + A +  A  Q
Sbjct: 53  GADLNRANLAGAILDEANLAAAFLIRANLQ 82



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 41/83 (49%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
             AN   AD+  +   G+    A L  A    A+ TGADLS   M    L++ +LT A L
Sbjct: 137 IEANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANL 196

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
            +T L  +DL  A ++GA  S A
Sbjct: 197 DQTNLNAADLVNASLDGAFLSRA 219



 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 6/98 (6%)

Query: 118 LRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGADLSDTL 171
           LR A   K N   AN  SAD+  +D S +   GA L +     A   + N   ADL +  
Sbjct: 151 LRGASLAKANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANLDQTNLNAADLVNAS 210

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           +D   L+ ANL  A L+ T +  ++L GA +  A+ ++
Sbjct: 211 LDGAFLSRANLGWANLIGTTMKEANLVGADLSWANLNE 248


>gi|411117892|ref|ZP_11390273.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410711616|gb|EKQ69122.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 577

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 58/128 (45%), Gaps = 26/128 (20%)

Query: 111 AQFGSADLRKAVHVKENF-----------RANFTSADMRE---------------SDFSG 144
           AQ   A+LR+A  V  N            +AN T AD+                 +D S 
Sbjct: 165 AQLDEANLREATLVGTNLNEASLIGAYLRQANLTEADLHRVVLSSADLSEAILANADLSR 224

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           +   GAYL KA  +KA+   ADL D  + R  L+EANL  A L R  L+ + L   I+  
Sbjct: 225 ANLAGAYLLKASFHKAHLLRADLQDVYLLRADLSEANLRGANLQRADLSGAYLNHTILSE 284

Query: 205 ADFSDAVI 212
           AD S+A +
Sbjct: 285 ADLSEAYL 292



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 45/83 (54%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D   SD SG+  +G  L  A   +AN T A+LS   +++ +L  ANL  A L    L+ +
Sbjct: 16  DFSHSDLSGANLSGFNLRGANFTEANLTEANLSWAFLNQAILTGANLRRADLRNASLSGA 75

Query: 196 DLGGAIIEGADFSDAVIDLAQKQ 218
           DL  AI+ GA+ S   + LAQ Q
Sbjct: 76  DLNHAILHGANLSKIDLRLAQLQ 98



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 41/81 (50%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A     ++  +  + ++  GA L +A   +AN  GA+L    +    L+EANL  A LV 
Sbjct: 120 AKLDQVNLERAKLNSAQLKGAELMEANLRRANLAGANLDQANLREAQLDEANLREATLVG 179

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T L  + L GA +  A+ ++A
Sbjct: 180 TNLNEASLIGAYLRQANLTEA 200



 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 46/111 (41%), Gaps = 25/111 (22%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLM------- 172
           AN   AD+R +  SG+  N A L  A   K          AN   A L D  M       
Sbjct: 60  ANLRRADLRNASLSGADLNHAILHGANLSKIDLRLAQLQQANLNWATLQDADMGGANLAF 119

Query: 173 --------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                   +R  LN A L  A L+   L R++L GA ++ A+  +A +D A
Sbjct: 120 AKLDQVNLERAKLNSAQLKGAELMEANLRRANLAGANLDQANLREAQLDEA 170


>gi|157413067|ref|YP_001483933.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
           str. MIT 9215]
 gi|157387642|gb|ABV50347.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
           str. MIT 9215]
          Length = 157

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 9/131 (6%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  +DL+ A          F   D+++++ S  +   A L  A     N + ++L + 
Sbjct: 33  ADFSGSDLKGAT---------FYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNLREV 83

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            +D  +L+  +L+N  L  +    +      I+GADF++  +     +  C+ A GTNPI
Sbjct: 84  TLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGTNPI 143

Query: 231 TGVSTRKSLGC 241
           T   TR++L C
Sbjct: 144 TNRDTRETLEC 154


>gi|119491336|ref|ZP_01623390.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
 gi|119453500|gb|EAW34662.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
          Length = 122

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 45/75 (60%), Gaps = 6/75 (8%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H + NF AN T AD+R+SD S ++  GA LE      AN TGA+LS T + +  L +A+L
Sbjct: 47  HAQLNF-ANLTHADLRDSDLSHAQLIGATLE-----GANLTGANLSHTNLSQANLKQADL 100

Query: 183 TNAVLVRTVLTRSDL 197
           T A L  T+ + S L
Sbjct: 101 TEATLQDTIYSHSTL 115


>gi|428305945|ref|YP_007142770.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428247480|gb|AFZ13260.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 273

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/167 (32%), Positives = 77/167 (46%), Gaps = 23/167 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL +   V  N        A   SAD+  +D S +  N AYL  A    AN 
Sbjct: 123 SGASLLGADLSRINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLANLSNANL 182

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           TGA+LS +      L+ A+L+NA L    L  ++L  A + GAD S+AV      +A  +
Sbjct: 183 TGANLSGS-----ELHIADLSNANLSEAQLNSAELNNANLLGADLSNAVF----AEANLR 233

Query: 223 YANGT-NPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 268
             N T N I+  +   ++G G       G+ +S +L   P  L DRD
Sbjct: 234 GTNLTSNQISSANLEGAIGLG------EGASASTVLD-QPTILEDRD 273



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S  Q   ADL     V  N  + N   A++R++D  G     A L +A    A+  GADL
Sbjct: 78  SRVQLSGADL-----VDANLNSSNLIQANLRDTDMLGVDLREANLSEADLSGASLLGADL 132

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           S     R+ L  ANL+NA L    +  +DL  A +   + +DA + LA
Sbjct: 133 S-----RINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLA 175


>gi|428298482|ref|YP_007136788.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428235026|gb|AFZ00816.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 567

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 9/100 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A+  SADL          RA+F  A++R +DFSG+  N A    A    ANF+ ADL+
Sbjct: 83  SDAKLNSADLS---------RADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADLA 133

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           +     + L   N +NA +  T L R +L G  + GAD S
Sbjct: 134 NADFSGLDLYGVNFSNAKMRGTRLDRVNLSGVNLSGADLS 173



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 1/96 (1%)

Query: 116 ADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL RK +   + + AN   +D+R +D S +K N A L +A  Y+AN    D S   ++ 
Sbjct: 55  ADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGANLNS 114

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                A+L NA      L  +D  G  + G +FS+A
Sbjct: 115 ANFRNADLRNANFSNADLANADFSGLDLYGVNFSNA 150



 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 63/140 (45%), Gaps = 30/140 (21%)

Query: 92  ADLNK---YEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSG- 144
           ADL++   Y+A  R     G   ++A F +ADLR A         NF++AD+  +DFSG 
Sbjct: 90  ADLSRADFYQANLRNTDFSGANLNSANFRNADLRNA---------NFSNADLANADFSGL 140

Query: 145 ---------SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
                    +K  G  L++      N +GADLS      + L   NL    L R  L+ +
Sbjct: 141 DLYGVNFSNAKMRGTRLDRVNLSGVNLSGADLSG-----IDLRNVNLRGINLTRINLSHA 195

Query: 196 DLGGAIIEGADFSDAVIDLA 215
           +L G    G D  +A +  A
Sbjct: 196 NLIGFDFRGTDLRNANLSYA 215



 Score = 37.0 bits (84), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 37/70 (52%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           D SG+  +   L++A  Y AN   +DL +T +    LN A+L+ A   +  L  +D  GA
Sbjct: 51  DLSGADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGA 110

Query: 201 IIEGADFSDA 210
            +  A+F +A
Sbjct: 111 NLNSANFRNA 120


>gi|170751525|ref|YP_001757785.1| pentapeptide repeat-containing protein [Methylobacterium
           radiotolerans JCM 2831]
 gi|170658047|gb|ACB27102.1| pentapeptide repeat protein [Methylobacterium radiotolerans JCM
           2831]
          Length = 456

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 5/89 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 184
           A F  A MR +D SG+  +G     A  + A+F+GAD  DT+     +D   L +ANLT+
Sbjct: 141 ARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTH 200

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A      LT++ L G+ + GA F+ A +D
Sbjct: 201 ADFAEASLTKASLAGSRLRGAHFTGAKLD 229



 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + A M E+D SG+   GA L  AV     FTGA     +++   L+EA+L+ A    
Sbjct: 71  ADLSRARMEEADLSGANLRGASLTGAVGRSTRFTGA-----ILEAADLSEADLSGADFTG 125

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
            V  +    GA++E A F +A +  A
Sbjct: 126 IVAGQVKFAGAMLEDARFGEAAMRFA 151



 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 47/105 (44%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L  AV     F  A   +AD+ E+D SG+ F G      VA +  F GA L
Sbjct: 84  SGANLRGASLTGAVGRSTRFTGAILEAADLSEADLSGADFTG-----IVAGQVKFAGAML 138

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            D       +  A+L+ A+L  T    +DL GA   GAD  D V 
Sbjct: 139 EDARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVF 183



 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEANLTN 184
           A+F  A + ++  +GS+  GA+   A    A+ +GADLSDT + R+      L  A    
Sbjct: 201 ADFAEASLTKASLAGSRLRGAHFTGAKLDGADLSGADLSDTDLVRLNLATCRLRHARFAG 260

Query: 185 AVLVRTVLTRSDLGGAIIE 203
           A L  T ++   LGGA+ E
Sbjct: 261 AWLNGTRMSVEQLGGAVGE 279



 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 15/85 (17%)

Query: 131 NFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           +F  AD+  +DFSG+      F GA L++A    AN T AD +          EA+LT A
Sbjct: 162 DFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTHADFA----------EASLTKA 211

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
            L  + L  +   GA ++GAD S A
Sbjct: 212 SLAGSRLRGAHFTGAKLDGADLSGA 236


>gi|428304969|ref|YP_007141794.1| heat shock protein DnaJ domain-containing protein [Crinalium
           epipsammum PCC 9333]
 gi|428246504|gb|AFZ12284.1| heat shock protein DnaJ domain protein [Crinalium epipsammum PCC
           9333]
          Length = 242

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 5/92 (5%)

Query: 131 NFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           N + AD++E DFSG     +  + A L  A  +K N  GA+L    + R  L +ANL+NA
Sbjct: 128 NMSGADLKEKDFSGRNLSDANLSHANLSDAFLHKVNLQGANLYKANLFRANLLQANLSNA 187

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
            L    L  +DL GA + GAD + A I    K
Sbjct: 188 CLREANLIGADLSGADLRGADLTGAKIGFNDK 219


>gi|428313200|ref|YP_007124177.1| pentapeptide repeat protein,protein kinase family protein
           [Microcoleus sp. PCC 7113]
 gi|428254812|gb|AFZ20771.1| pentapeptide repeat protein,protein kinase family protein
           [Microcoleus sp. PCC 7113]
          Length = 464

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 48/85 (56%), Gaps = 6/85 (7%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           FR N +  +++E++ SG  F  A L K      NF GADLSD    +  LN+ANL NA L
Sbjct: 326 FR-NISGLNLQEANLSGGLFYSAKLAKT-----NFQGADLSDAYFGQANLNQANLRNANL 379

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVI 212
             T  + +DL GA ++GAD   A +
Sbjct: 380 GGTSFSNADLSGADLQGADLRFAYL 404



 Score = 41.6 bits (96), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 51/109 (46%), Gaps = 26/109 (23%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A FG A+L +A     N R AN        +D SG+   GA L  A   KAN  GA+L
Sbjct: 360 SDAYFGQANLNQA-----NLRNANLGGTSFSNADLSGADLQGADLRFAYLSKANLKGANL 414

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
                      EANL+NA          ++ GA + GA+ S+A+I  AQ
Sbjct: 415 C----------EANLSNA----------NIKGANLCGANLSNAIITEAQ 443


>gi|411117186|ref|ZP_11389673.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410713289|gb|EKQ70790.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 544

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 59/123 (47%), Gaps = 11/123 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A     +LR+A   + N R AN T A++R +D SG+  + A L  A    AN TG +L
Sbjct: 173 SGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNL 232

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
           S           ANL   +LV   LTR+ L GA   G+D S A +  A+   + ++   T
Sbjct: 233 S----------YANLLGTILVHADLTRASLIGADWAGSDLSGATLTGAKLHGVLRFGVKT 282

Query: 228 NPI 230
             I
Sbjct: 283 EGI 285



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 4/135 (2%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A     DL+ A   + N   AN + A ++ + F+ +  + A L       ++ +GADL
Sbjct: 118 SFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANLSQANLHGTDLSSSDLSGADL 177

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-CKYAN- 225
           S T + +  L+ ANL  A L    L  +DL GA +  AD S A +  A    +   YAN 
Sbjct: 178 SYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNLSYANL 237

Query: 226 -GTNPITGVSTRKSL 239
            GT  +    TR SL
Sbjct: 238 LGTILVHADLTRASL 252



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 51/96 (53%), Gaps = 4/96 (4%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           +N + A++ ++  + +K +GA L KA   +AN   A+L+   +    L +A+L  A + R
Sbjct: 50  SNLSEANLSKAKLNVAKLSGANLSKANLEEANLNVANLTLADLSHAELRQASLVRAEMAR 109

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
             L+ ++L  A + G D  DA +    +QA   +AN
Sbjct: 110 AELSEANLSFANLSGVDLKDAKL----RQANLSHAN 141



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 6/126 (4%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDF 142
           C SN+S  A+L+K     +      S A    A+L +A ++V     A+ + A++R++  
Sbjct: 48  CGSNLSE-ANLSK----AKLNVAKLSGANLSKANLEEANLNVANLTLADLSHAELRQASL 102

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
             ++   A L +A    AN +G DL D  + +  L+ AN++ A L     T ++L  A +
Sbjct: 103 VRAEMARAELSEANLSFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANLSQANL 162

Query: 203 EGADFS 208
            G D S
Sbjct: 163 HGTDLS 168


>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 517

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 1/96 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F +A+LR+A     N   A+F+ A++R +D  G+  +GA L +A    AN +GA+LS 
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            ++ +  L  A+L+ A L+R   + +DL GA + GA
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGA 284



 Score = 44.3 bits (103), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
           L  A++   + N++ L   +  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138

Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
           AD+RES    + FNGA L                      A   KAN   AD ++  + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+NA      L  +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234



 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 23/144 (15%)

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           TR    +   A+  +A+L KA+  +     AN   AD+ E+    +    A L +A   K
Sbjct: 72  TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128

Query: 160 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 204
           ANFT     GADL ++ + +   N ANL+ A L       T  T++DL GA      +  
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188

Query: 205 ADFSDAVIDLAQKQALCKYANGTN 228
           ADF++A +    +QA   YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 50/102 (49%), Gaps = 3/102 (2%)

Query: 112 QFGSADLRKAVHVKENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           Q   +D+ K   + + +R    +F   ++ E + S     GA L  A    AN + +DL 
Sbjct: 8   QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +  + R  LN A L+NA L + +L ++ +  A +  AD ++A
Sbjct: 68  EVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEA 109



 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 55/128 (42%), Gaps = 22/128 (17%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A   ++DLR+          N T A++  +  S +    A L +A    AN   ADL+
Sbjct: 57  SVANLSASDLREV---------NLTRANLNVARLSNANLTKAILNQATINVANLVRADLT 107

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
                     EA L N +L+R  L R+ L  A    A+ + A  DL + +      NG N
Sbjct: 108 ----------EAQLINTLLIRAELVRAKLSKANFTQANLNGA--DLRESKLQQTNFNGAN 155

Query: 229 PITGVSTR 236
            ++G + R
Sbjct: 156 -LSGANLR 162


>gi|157803630|ref|YP_001492179.1| hypothetical protein A1E_02245 [Rickettsia canadensis str. McKiel]
 gi|157784893|gb|ABV73394.1| Uncharacterized low-complexity protein [Rickettsia canadensis str.
           McKiel]
          Length = 956

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 7/113 (6%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +A++ KA+  K N   AN T A + ++    +K + A LEKA A      G +++D +  
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
            M   EAN  NA++ R  LT+++L  AI+E AD   A  +D   K+A  K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666



 Score = 38.9 bits (89), Expect = 2.4,   Method: Composition-based stats.
 Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 7/117 (5%)

Query: 121 AVHVKENFRANFTSADMRESDFSGSKFNG------AYLEKAVAYKANFTGADLSDTLMDR 174
           A+  K   + N  S   R + FS ++F        A L  A+  + N   A+++  L+D+
Sbjct: 510 ALEAKFKKQCNMKSITARNAYFSDAEFENILSLEEADLRNAIMERVNLVNANMNKALLDK 569

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNPI 230
             L  ANLT A+L       + L  A +E A+     + D   K    K AN  N I
Sbjct: 570 ANLEYANLTGAILTDASAQFAKLSNATLEKAEAEGLNIADAIAKNMNAKEANFKNAI 626



 Score = 38.5 bits (88), Expect = 3.1,   Method: Composition-based stats.
 Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 26/147 (17%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
           L  A++   S+  + L++    +AE  G   I  A       + K ++ KE   ANF +A
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG-LNIADA-------IAKNMNAKE---ANFKNA 625

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
            M+ +D +      A LEKA+   A+   A+  D      +  EANL  A L    L R 
Sbjct: 626 IMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLARI 675

Query: 196 DLGGAIIEGADFSDAVIDLAQKQALCK 222
           +       GADF  A +D A K    K
Sbjct: 676 NKA-----GADFDQAKVDDATKMHYTK 697


>gi|347755497|ref|YP_004863061.1| putative low-complexity protein [Candidatus Chloracidobacterium
           thermophilum B]
 gi|347588015|gb|AEP12545.1| putative low-complexity protein [Candidatus Chloracidobacterium
           thermophilum B]
          Length = 419

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 60/123 (48%), Gaps = 9/123 (7%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 167
           A   SA LR A  V+ N   AN   AD+  ++  G+   GA L +A    AN  GADL  
Sbjct: 57  ANLASASLRDAFLVRANLEGANLRGADLESANLEGANLRGADLSRANLEGANLEGADLTG 116

Query: 168 ----SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCK 222
               S  L+D   L  A L NAV     L  + LGGA +   DF +A+++ A  ++AL  
Sbjct: 117 ARLPSAQLID-AKLGVATLENAVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLT 175

Query: 223 YAN 225
            AN
Sbjct: 176 GAN 178



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 1/99 (1%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F +ADLR A     N  A +F +A +  ++F  +   GA L  AV  +A   GADLS 
Sbjct: 137 AVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLTGANLRDAVLRRAVLPGADLSG 196

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
             ++R VL  A+L+   L+      +   GA ++GA FS
Sbjct: 197 AKLERAVLEGADLSQVSLLEADCRHATFQGARLKGAKFS 235



 Score = 43.9 bits (102), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 45/92 (48%), Gaps = 14/92 (15%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGA 165
           A+ G A L  AV         F +AD+R +   G+       + A+   ANF     TGA
Sbjct: 127 AKLGVATLENAV---------FANADLRNAYLGGANLTAVDFQNAILEAANFEEALLTGA 177

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           +L D ++ R VL  A+L+ A L R VL  +DL
Sbjct: 178 NLRDAVLRRAVLPGADLSGAKLERAVLEGADL 209



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRE-----SDFSGSKFNGAYLEKAVAYK 159
           A   +A+LR+A     N       RAN  SA +R+     ++  G+   GA LE A    
Sbjct: 32  ANLDNANLRRADLEGANLEEASLRRANLASASLRDAFLVRANLEGANLRGADLESANLEG 91

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN  GADLS   ++   L  A+LT A L    L  + LG A +E A F++A
Sbjct: 92  ANLRGADLSRANLEGANLEGADLTGARLPSAQLIDAKLGVATLENAVFANA 142



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 55/111 (49%), Gaps = 19/111 (17%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           ADLR+    K    AN  +A++R +D  G     A LE+A   +AN   A L D  + R 
Sbjct: 22  ADLRELDLAK----ANLDNANLRRADLEG-----ANLEEASLRRANLASASLRDAFLVRA 72

Query: 176 VLNEANLTNAVL---------VRTV-LTRSDLGGAIIEGADFSDAVIDLAQ 216
            L  ANL  A L         +R   L+R++L GA +EGAD + A +  AQ
Sbjct: 73  NLEGANLRGADLESANLEGANLRGADLSRANLEGANLEGADLTGARLPSAQ 123


>gi|379022817|ref|YP_005299478.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
 gi|376323755|gb|AFB20996.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
          Length = 956

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 7/113 (6%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +A++ KA+  K N   AN T A + ++    +K + A LEKA A      G +++D +  
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
            M   EAN  NA++ R  LT+++L  AI+E AD   A  +D   K+A  K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666



 Score = 38.9 bits (89), Expect = 2.4,   Method: Composition-based stats.
 Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 7/117 (5%)

Query: 121 AVHVKENFRANFTSADMRESDFSGSKFNG------AYLEKAVAYKANFTGADLSDTLMDR 174
           A+  K   + N  S   R + FS ++F        A L  A+  + N   A+++  L+D+
Sbjct: 510 ALEAKFKKQCNMKSITARNAYFSDAEFENILSLEEADLRNAIMERVNLVNANMNKALLDK 569

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNPI 230
             L  ANLT A+L       + L  A +E A+     + D   K    K AN  N I
Sbjct: 570 ANLEYANLTGAILTDASAQFAKLSNATLEKAEAEGLNIADAIAKNMNAKEANFKNAI 626



 Score = 38.5 bits (88), Expect = 3.2,   Method: Composition-based stats.
 Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 26/147 (17%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
           L  A++   S+  + L++    +AE  G   I  A       + K ++ KE   ANF +A
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG-LNIADA-------IAKNMNAKE---ANFKNA 625

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
            M+ +D +      A LEKA+   A+   A+  D      +  EANL  A L    L R 
Sbjct: 626 IMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLARI 675

Query: 196 DLGGAIIEGADFSDAVIDLAQKQALCK 222
           +       GADF  A +D A K    K
Sbjct: 676 NKA-----GADFDQAKVDDATKMHYTK 697


>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 332

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 13/121 (10%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA 155
           Y A+ RG   I        ADLR A  +K N R AN    ++RE+D  G+  +GA L  A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201

Query: 156 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
              + N  GA+L   +          L N  L R +L+ +DL G  ++GA   D  +  A
Sbjct: 202 FLTEVNLMGANLRGAI----------LKNVKLERAILSEADLTGVNLQGAVMPDVRLSKA 251

Query: 216 Q 216
           Q
Sbjct: 252 Q 252



 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 57/124 (45%), Gaps = 7/124 (5%)

Query: 94  LNKYEAETRGEFGIG-SAAQFGSADLRKAV------HVKENFRANFTSADMRESDFSGSK 146
           L++YEA      GI  S      ADL   V      H      A  + A+ R+++  G++
Sbjct: 7   LHRYEAGETKFTGISLSGVNLFGADLIGIVLNGADLHGATLIFAYLSRANFRKANLVGTR 66

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
            +GA L +A     N + ADL    +    L  ANLT A L+   L  +DL GA + GAD
Sbjct: 67  LSGANLNQAWLSGVNLSNADLHGASLQSADLRSANLTLASLLDANLMDADLRGANLSGAD 126

Query: 207 FSDA 210
            + A
Sbjct: 127 LTGA 130



 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 59/130 (45%), Gaps = 12/130 (9%)

Query: 91  LADLNKYEAETRG--------EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
           L ++N   A  RG        E  I S A     +L+ AV          + A +   + 
Sbjct: 203 LTEVNLMGANLRGAILKNVKLERAILSEADLTGVNLQGAVMPD----VRLSKAQVSGGNL 258

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           S ++ N A L +    +AN + +DL +  + R  L  ANL+NA L R  L+ ++L GA +
Sbjct: 259 SFARLNRADLSRTNLREANLSDSDLIEAYLARTNLMGANLSNANLTRAELSTTNLMGANL 318

Query: 203 EGADFSDAVI 212
           +GA   D  I
Sbjct: 319 QGATMPDGRI 328


>gi|436841883|ref|YP_007326261.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
            14728]
 gi|432170789|emb|CCO24160.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
            14728]
          Length = 1278

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 7/100 (7%)

Query: 116  ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
            AD R A   K  F+ +    AD R++D   + FNGA   K V  K NF GA+L      R
Sbjct: 1094 ADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGA---KGV--KVNFAGANLDKLRTGR 1148

Query: 175  MV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
                 EA+ T A L  +    +DL GA+  GAD  +A++D
Sbjct: 1149 NAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVD 1188



 Score = 45.8 bits (107), Expect = 0.022,   Method: Composition-based stats.
 Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 24/135 (17%)

Query: 86   SNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKA-VH---------VKENFR-AN 131
            S +S  AD    EA+ R  F    I   +    AD RKA VH         VK NF  AN
Sbjct: 1085 SMVSGKAD----EADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGAKGVKVNFAGAN 1140

Query: 132  FT------SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
                    +A+  E+DF+G+    +   +     A F GADL + L+D  +L +ANL  A
Sbjct: 1141 LDKLRTGRNAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVDNCMLVDANLNGA 1200

Query: 186  VLVRTVLTRSDLGGA 200
                   T+S+L GA
Sbjct: 1201 SAKGARFTKSNLEGA 1215



 Score = 42.7 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 24/86 (27%), Positives = 41/86 (47%)

Query: 130  ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            AN T   ++ +DF  +  + A L + +   A+FT A L     +R +L  A    + L  
Sbjct: 980  ANLTGCQLKNTDFKETCLDNAKLIQTMGRSADFTKASLKGVNFERAMLGNAIFEESDLTG 1039

Query: 190  TVLTRSDLGGAIIEGADFSDAVIDLA 215
                ++   G+  +GA  +DAV D+A
Sbjct: 1040 AQARQASFKGSSFKGATLADAVFDMA 1065



 Score = 38.9 bits (89), Expect = 2.4,   Method: Composition-based stats.
 Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 7/99 (7%)

Query: 115  SADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
            SAD  KA     NF RA   +A   ESD +G++   A  + +     +F GA L+D + D
Sbjct: 1009 SADFTKASLKGVNFERAMLGNAIFEESDLTGAQARQASFKGS-----SFKGATLADAVFD 1063

Query: 174  RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
              +L + + + A L    +  S + G   E ADF +A I
Sbjct: 1064 MAILEKTDFSKANLSGARINMSMVSGKADE-ADFRNAFI 1101


>gi|163797895|ref|ZP_02191839.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
 gi|159176857|gb|EDP61425.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
          Length = 396

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 48/95 (50%), Gaps = 11/95 (11%)

Query: 114 GSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           G+AD + A     N F  +FT AD+RE DF+G+   GA    A   +A   GADLS    
Sbjct: 15  GAADGQPASFANANLFGFDFTGADLREVDFAGASLQGARFVGADLTRAVLVGADLSGVSF 74

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
              VL EA+LT A LV          GA+ EGAD 
Sbjct: 75  RNAVLLEADLTGARLV----------GAVFEGADL 99



 Score = 44.3 bits (103), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F  A M  ++ +G+KF     E  V  + + TGA+L    + R  ++ A L NA+L+   
Sbjct: 127 FAGARMHRANLTGAKF-----ENVVLAQTDLTGANLERASLRRASMSGAVLRNAILIDAD 181

Query: 192 LTRSDLGGAIIEGADFSDAVID 213
           L+ +DL  +++ GAD S A +D
Sbjct: 182 LSHADLTDSLVTGADLSGAQLD 203



 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + T AD+R  + SG+   GA L +AV   A    ADLS   +    L   NL+ A L   
Sbjct: 270 DLTDADLRSLNLSGADLRGAVLRRAVLTDALLVLADLSGADLTLASLARCNLSGANLAGA 329

Query: 191 VLTRSDLGGAIIEGAD-FSDAVIDLAQKQ 218
            L+R+DL  AI+  A   S A  D  ++Q
Sbjct: 330 NLSRADLTDAILTAAPILSQAGADTGRRQ 358



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 1/108 (0%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN T A       + +   GA LE+A   +A+ +GA L + ++    L+ A+LT++++ 
Sbjct: 134 RANLTGAKFENVVLAQTDLTGANLERASLRRASMSGAVLRNAILIDADLSHADLTDSLVT 193

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNPITGVST 235
              L+ + L GA +E A+F  A + D+   +     A  T P   V+T
Sbjct: 194 GADLSGAQLDGATVERANFVGARLRDVDLSRVDTSKARLTPPTDSVTT 241


>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 340

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 13/155 (8%)

Query: 66  KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 116
            NWR  VF S        L+AA ++S + +++ L  +N   A  ++      S A  G A
Sbjct: 18  NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74

Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           +L +A  +  N   ANF  AD+  +  S S  + A L  AVA  ANF  A+LS T     
Sbjct: 75  NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANFIMANLSGTYFSES 134

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + ANL++A L   +L +++L G+ +  A+F+ A
Sbjct: 135 DFSRANLSSANLTEAILVKTNLTGSYLSKANFTSA 169



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A   SA+L +A+ VK N       +ANFTSA++  +D S +  + A +  A    AN 
Sbjct: 137 SRANLSSANLTEAILVKTNLTGSYLSKANFTSANLSMTDLSEADLSSANMHLADLSMANL 196

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + A+L   ++  + L +ANLT A L    LT +DL  + + GA+F  A
Sbjct: 197 SSANLIGAILTDVDLRQANLTGAYLNTANLTGADLATSTLVGANFYQA 244



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 11/103 (10%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  A+LR+A   + N + AN + A +  ++ +G+   GA L  A    AN  GA    
Sbjct: 239 ANFYQANLREANLDRANAQNANLSEAYLSNANLTGTILEGANLSSAYISNANLVGA---- 294

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                 VL  A+LT A+L+   LT+++  GA ++GADF+ A++
Sbjct: 295 ------VLKGADLTGAILIGANLTKANFSGAKLDGADFTSAIM 331



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 21/120 (17%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSA----------DMRESDFSGSKFN-----GAYL 152
           S      ADL  A +H+ +   AN +SA          D+R+++ +G+  N     GA L
Sbjct: 172 SMTDLSEADLSSANMHLADLSMANLSSANLIGAILTDVDLRQANLTGAYLNTANLTGADL 231

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             +    ANF  A+L +  +DR     AN  NA L    L+ ++L G I+EGA+ S A I
Sbjct: 232 ATSTLVGANFYQANLREANLDR-----ANAQNANLSEAYLSNANLTGTILEGANLSSAYI 286



 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 11/103 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S +    A+L  AV +  NF  AN +     ESDFS +  + A L +A+  K N TG+ L
Sbjct: 102 SESNLSRANLGNAVAIAANFIMANLSGTYFSESDFSRANLSSANLTEAILVKTNLTGSYL 161

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S          +AN T+A L  T L+ +DL  A +  AD S A
Sbjct: 162 S----------KANFTSANLSMTDLSEADLSSANMHLADLSMA 194


>gi|427735932|ref|YP_007055476.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427370973|gb|AFY54929.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 713

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 70/152 (46%), Gaps = 31/152 (20%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY---------- 158
           S+A    ADLR AV   EN  A+ T AD+ E+  + +   GA L + VA           
Sbjct: 534 SSASLAKADLRNAVL--EN--ASLTGADLGEARLNDADLYGARLGRVVAIGTQLSNANLI 589

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL---------------TRSDLGGAIIE 203
           K  + GADLS   +DR  L+ ANL+ A L   +L               + +DL GA + 
Sbjct: 590 KTEWQGADLSSAYLDRANLSNANLSAARLTGAILRSTNLQNVNLRNADLSLADLRGANLA 649

Query: 204 GADFSDAVIDLAQKQALCKYANGTNPITGVST 235
           GADF   ++   Q+    K+ +   P TG+ +
Sbjct: 650 GADFQGTILSARQQNPADKFVD--TPTTGIQS 679



 Score = 43.9 bits (102), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 14/97 (14%)

Query: 131 NFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRMV 176
           +F  A++ ++ F+GS+F G              A L +A   +AN +GA+LS  LM R  
Sbjct: 453 DFKYANLDKASFTGSRFRGPGKDGRWDTYDDWIANLSQAQLKQANLSGANLSRVLMVRTN 512

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           L+ +NL  A L    L  ++L  A +  AD  +AV++
Sbjct: 513 LSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLE 549



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+L + + V+ N       +AN ++A +  ++ S +    A L  AV   A+ TG
Sbjct: 496 ANLSGANLSRVLMVRTNLSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLENASLTG 555

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           ADL +  ++   L  A L   V + T L+ ++L     +GAD S A +D A
Sbjct: 556 ADLGEARLNDADLYGARLGRVVAIGTQLSNANLIKTEWQGADLSSAYLDRA 606


>gi|308205942|gb|ADO19342.1| pentapeptide repeat protein [Nostoc flagelliforme str. Sunitezuoqi]
          Length = 146

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 10/106 (9%)

Query: 115 SADLRKAVHVKENFRANFTSA----------DMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           SA +R+ +  +E F  N T A          D+R ++  G+   GA LE A    AN   
Sbjct: 28  SAPVRRLLETRECFGCNLTGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKY 87

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+L+   +   +LN ANLTN  L  + L  SD+ GA++   D S A
Sbjct: 88  ANLTKAFVSDTILNNANLTNVNLSNSRLYNSDVDGAVMANIDLSGA 133


>gi|427715911|ref|YP_007063905.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427348347|gb|AFY31071.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 589

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 6/111 (5%)

Query: 111 AQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    AD+ KA+    N      F  N + A +  +D S +K NGA L  A    A F G
Sbjct: 408 ADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLNGAMFLG 467

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           ADLS   +  ++LN+A+L+  +L    L+ +DL  AI+ G D S A ++ A
Sbjct: 468 ADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRA 518



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 6/110 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   SA+L  A     +  R N +SAD+  +D S +  N A L  A    AN + ADL
Sbjct: 321 SHADLSSANLSGANLTNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSADL 380

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 212
           S T +    L++ANL+   L    L R+DL G     AI+ G + SD ++
Sbjct: 381 SHTHLFGANLSDANLSGVNLSHADLCRADLSGADMSKAILNGTNLSDTIL 430



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 9/110 (8%)

Query: 103 GEFGIGS---AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           GEF  G     A  G A+L  A     NF  AN + A + +++ +G  F+GA L  A   
Sbjct: 252 GEFLRGGNFRGAYLGDANLTGA-----NFSGANLSGAYLGDANLTGVNFSGANLSGANLG 306

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            AN +GA+LS+  +    L+ ANL+ A L  T L R++L  A +  AD S
Sbjct: 307 DANLSGANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADLS 356



 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 58/103 (56%), Gaps = 6/103 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A  G A+L        NF  AN + A++ +++ SG+  + A L  A    AN +GA+L
Sbjct: 281 SGAYLGDANLTGV-----NFSGANLSGANLGDANLSGANLSNANLSHADLSSANLSGANL 335

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ++T ++R  L+ A+L++A L  T L  +DL  A ++ A+ S A
Sbjct: 336 TNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSA 378



 Score = 43.5 bits (101), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 10/94 (10%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM----------DRMVLNE 179
           A F  AD+   D SG   N A L   +  +A+ + ADLSD ++          +R  L+ 
Sbjct: 463 AMFLGADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRANLSG 522

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           +NL+ A+L    L+ ++L  AI+ GAD SDA ++
Sbjct: 523 SNLSGALLNGADLSHTNLSCAILGGADVSDANLE 556



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 44/81 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN ++A++  +D S +  +GA L      + N + ADLS   +    LN A+L++A L  
Sbjct: 313 ANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKD 372

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ +DL    + GA+ SDA
Sbjct: 373 ANLSSADLSHTHLFGANLSDA 393



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S+    SADL  A ++K+   AN +SAD+  +   G+  + A L       A+   ADLS
Sbjct: 356 SSTNLNSADLSSA-NLKD---ANLSSADLSHTHLFGANLSDANLSGVNLSHADLCRADLS 411

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
              M + +LN  NL++ +L  T     +L  AI+  AD S A ++
Sbjct: 412 GADMSKAILNGTNLSDTILFST-----NLSDAILIAADLSYAKLN 451



 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 5/89 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLT 183
            A+   AD+  +D S +  NG  L   + +  N +      ADLS   ++   LN A L 
Sbjct: 402 HADLCRADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLN 461

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A+ +   L+  DL G I+  AD S  ++
Sbjct: 462 GAMFLGADLSGVDLSGVILNDADLSGVLL 490


>gi|425447182|ref|ZP_18827173.1| Genome sequencing data, contig C314 (fragment) [Microcystis
           aeruginosa PCC 9443]
 gi|389732326|emb|CCI03724.1| Genome sequencing data, contig C314 (fragment) [Microcystis
           aeruginosa PCC 9443]
          Length = 285

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 46/78 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T A++ E+  +G+  NGA LE+A    A+  GA+L +  ++   L EANL  A L+R
Sbjct: 137 ADLTEANLTEAKLNGADLNGANLEEAKLNGADLNGANLEEAKLNGAFLEEANLKRANLIR 196

Query: 190 TVLTRSDLGGAIIEGADF 207
             L  S L GA ++GA+ 
Sbjct: 197 ANLIGSGLWGANLKGANL 214


>gi|428312148|ref|YP_007123125.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428253760|gb|AFZ19719.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 223

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 49/83 (59%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N +  D+R +DF G+    A L +A    AN  GA LS  +++R VLN A L++A+L   
Sbjct: 26  NLSDTDLRGADFRGADLFDANLARADLSDANLGGAILSRAVLNRAVLNRAVLSSALLSNA 85

Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
            L R+ L GA++ GA  + A+++
Sbjct: 86  FLNRAVLCGAVLRGAILNGAILN 108



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 41/83 (49%), Gaps = 3/83 (3%)

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           E  F      G  L       A+F GADL D  + R  L++ANL  A+L R VL R+ L 
Sbjct: 14  ERSFDFPNLEGINLSDTDLRGADFRGADLFDANLARADLSDANLGGAILSRAVLNRAVLN 73

Query: 199 GAIIEGADFSDAVIDLAQKQALC 221
            A++  A  S+A ++   +  LC
Sbjct: 74  RAVLSSALLSNAFLN---RAVLC 93



 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLT 183
           RA    A +R +  +G+  NGA L  A  Y AN +G     ADL    ++  +L EA+L 
Sbjct: 89  RAVLCGAVLRGAILNGAILNGANLSGADLYHANLSGALLGYADLYHAYLNSALLREADLY 148

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +A L    L  ++L  A + GAD + A
Sbjct: 149 HAYLREANLFGANLRSANLSGADLTGA 175


>gi|428216301|ref|YP_007100766.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427988083|gb|AFY68338.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 188

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 56/127 (44%), Gaps = 12/127 (9%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN    ++R +    S FN A L  A     N TGA   D  MD   L+ ANL +A L  
Sbjct: 60  ANLADTNLRGASLKNSNFNRANLSWANMSWTNLTGASFMDARMDVTNLSSANLIDADLRG 119

Query: 190 TVLTRSDLGGAIIEG-----------ADFSDAV-IDLAQKQALCKYANGTNPITGVSTRK 237
             L  ++L G  + G           ADFS    +D   +  LC  A G +P T  STR 
Sbjct: 120 ANLQGANLRGTNLRGTQIEPLRSIDNADFSRVKNLDQRVRVYLCSIATGAHPFTKNSTRA 179

Query: 238 SLGCGNS 244
           +L C NS
Sbjct: 180 TLECNNS 186


>gi|254409695|ref|ZP_05023476.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183692|gb|EDX78675.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 350

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/118 (37%), Positives = 62/118 (52%), Gaps = 5/118 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A   +ADL+  +      RA  + AD+R ++  G+ F  AYL +A    AN TGADLS
Sbjct: 144 SRANLKAADLQGVIL----NRAILSQADLRGANLRGACFIRAYLHRADLRDANLTGADLS 199

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA-LCKYAN 225
           D  +    L+ ANL+ A L    L+ ++L GA + GA   +A + LA     L K AN
Sbjct: 200 DADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSGLLLKKAN 257



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 36/136 (26%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL----------------- 152
           A    ADL+ A+ ++    +A+ T+A +RE+D SG+   GA L                 
Sbjct: 66  ADLSKADLKNALLIEATLSQADLTAAILREADLSGAILTGATLLDADLRHATLIGTSLID 125

Query: 153 ---EKAVAYKANFTG----------ADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTR 194
              ++A   KAN TG          ADL   +++R +L++     ANL  A  +R  L R
Sbjct: 126 AKMKRAKLAKANCTGASFSRANLKAADLQGVILNRAILSQADLRGANLRGACFIRAYLHR 185

Query: 195 SDLGGAIIEGADFSDA 210
           +DL  A + GAD SDA
Sbjct: 186 ADLRDANLTGADLSDA 201



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 47/82 (57%), Gaps = 5/82 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N   AD+ E++ S    + A+L++A   KA   GA L D       L++A+L NA+L+  
Sbjct: 27  NLIRADLTEANLSRINLSAAHLQRANLAKAKLIGAQLKDA-----DLSKADLKNALLIEA 81

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L+++DL  AI+  AD S A++
Sbjct: 82  TLSQADLTAAILREADLSGAIL 103



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 55/106 (51%), Gaps = 11/106 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    ADL+ A     N  RAN + A++  ++ +G+   GA+L+ A    AN +G   
Sbjct: 194 TGADLSDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSG--- 250

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
                  ++L +ANL +A L +  L R++L  A + GA+  +A ++
Sbjct: 251 -------LLLKKANLQSAQLSKANLNRANLYKANLSGANLLEANLE 289



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 48/90 (53%), Gaps = 10/90 (11%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN +   +++++   ++ + A L +A  YKAN +GA+L +  ++   L E+NL  A L+ 
Sbjct: 246 ANLSGLLLKKANLQSAQLSKANLNRANLYKANLSGANLLEANLEHANLAESNLQRAGLLL 305

Query: 190 TVLTRSDLG----------GAIIEGADFSD 209
             LT ++L           GA + GAD SD
Sbjct: 306 AYLTDANLSHANLNGANLIGANLMGADLSD 335


>gi|316934318|ref|YP_004109300.1| pentapeptide repeat-containing protein [Rhodopseudomonas palustris
           DX-1]
 gi|315602032|gb|ADU44567.1| pentapeptide repeat protein [Rhodopseudomonas palustris DX-1]
          Length = 273

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A     N +RA+ + A++  +D SG+  +GA L +A  + AN +GADL
Sbjct: 57  SGANLSGADLSGANLSGANLYRADLSGANLSGADLSGANLSGANLYRAKLFSANLSGADL 116

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S        L+ ANL  A L    L R+DL GA + GAD S A
Sbjct: 117 SGA-----NLSGANLYRADLSGANLYRADLSGANLSGADLSGA 154



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 46/80 (57%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N + AD+  ++ SG+  +GA L  A  Y+A   GA+LS   +    L+ ANL+ A L R 
Sbjct: 20  NLSGADLSGANLSGADLSGANLSGANLYRAKLFGANLSGANLSGADLSGANLSGANLYRA 79

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L+ ++L GA + GA+ S A
Sbjct: 80  DLSGANLSGADLSGANLSGA 99



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 6/95 (6%)

Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 162
           S A    ADL  A     N +RA   SA++  +D SG+  +GA L +A       Y+A+ 
Sbjct: 82  SGANLSGADLSGANLSGANLYRAKLFSANLSGADLSGANLSGANLYRADLSGANLYRADL 141

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           +GA+LS   +    L+ ANL+ A  V   L R+ +
Sbjct: 142 SGANLSGADLSGANLHRANLSGAKGVDLSLARTRI 176


>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
           17734]
 gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
           17734]
          Length = 367

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL +A   + +   AN + A++ E+D S +  +GA L +A    AN +GA+L
Sbjct: 98  SGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSGANL 157

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S+  + R  L+ ANL  A L    L+ +DL GA + GA+ S+A
Sbjct: 158 SEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEA 200



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 50/86 (58%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN + AD+  +D SG+       +GA L +A    AN +GA+LS+  + R  L+ ANL+ 
Sbjct: 155 ANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANLSR 214

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           A L    L+ +DL GA + GA+ S+A
Sbjct: 215 ADLSGANLSEADLSGANLSGANLSEA 240



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN + AD+  +D SG+  +     GA L +A    AN +GA+LS+  + R  L+ ANL  
Sbjct: 195 ANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRR 254

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           A L    L R+DL GA +  AD S+A
Sbjct: 255 ADLSGANLRRADLSGANLRRADLSEA 280



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL +A     N        AN + A++ E+D S +  +GA L +A    AN 
Sbjct: 123 SGANLSEADLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRRANLSGANL 182

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + ADLS   +    L+EA+L+ A L    L+R+DL GA +  AD S A
Sbjct: 183 SEADLSGANLSGANLSEADLSRADLSGANLSRADLSGANLSEADLSGA 230



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 16/118 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 165
           S A    ADL +A     N R AN + A++ E+D SG+  +GA L +A   +A+ +GA  
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 212

Query: 166 -------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                        DLS   +    L+EA+L+ A L    L R+DL GA +  AD S A
Sbjct: 213 SRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRRADLSGANLRRADLSGA 270



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 1/100 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL +A     N  RA+ + A++ E+D SG+  +GA L +A   +A+ +GA+L
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 252

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
               +    L  A+L+ A L R  L+ ++L  A + GAD 
Sbjct: 253 RRADLSGANLRRADLSGANLRRADLSEANLSEANLSGADL 292



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 17/128 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN---- 184
           RA+ + A++ E+D SG+  +GA L +A   +A+ +GA+L      R  L+ ANL+     
Sbjct: 134 RADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLR-----RANLSGANLSEADLS 188

Query: 185 ------AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
                 A L    L+R+DL GA +  AD S A  +L++        +G N      +R  
Sbjct: 189 GANLSGANLSEADLSRADLSGANLSRADLSGA--NLSEADLSGANLSGANLSEADLSRAD 246

Query: 239 LGCGNSRR 246
           L   N RR
Sbjct: 247 LSGANLRR 254



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 51/91 (56%), Gaps = 10/91 (10%)

Query: 130 ANFTSAD----------MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AN + A+          + E+D SG+  +GA L +A   +A+ +GA+LS+  +    L+ 
Sbjct: 95  ANLSGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSG 154

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ANL+ A L R  L+ ++L  A + GA+ S+A
Sbjct: 155 ANLSEADLSRADLSGANLRRANLSGANLSEA 185



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 42/70 (60%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           + SG+  +GA L +A   +A+ + ADLS   +    L+EA+L+ A L    L+ +DL GA
Sbjct: 91  NLSGANLSGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGA 150

Query: 201 IIEGADFSDA 210
            + GA+ S+A
Sbjct: 151 NLSGANLSEA 160


>gi|119487545|ref|ZP_01621155.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
 gi|119455714|gb|EAW36850.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
          Length = 277

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 64/116 (55%), Gaps = 5/116 (4%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+    DL  A  ++ N   AN T+A     D +GS   G+  +     +AN T A+L++
Sbjct: 60  AKLMGVDLSDANLMEANLIGANLTNAKFDRCDLTGSNLRGSSSKLVSLTQANLTDANLTE 119

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
             +       ANLTNA L+RT L +++L GA++EGA+ ++ ++    ++++ + AN
Sbjct: 120 ANLAEANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVIL----RESILEGAN 171



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 15/86 (17%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T A++ E++F G+    A L +    KAN TGA          VL  ANLTN +L  
Sbjct: 115 ANLTEANLAEANFVGANLTNATLIRTNLMKANLTGA----------VLEGANLTNVILRE 164

Query: 190 TV-----LTRSDLGGAIIEGADFSDA 210
           ++     L  + L GA++  A+F+DA
Sbjct: 165 SILEGANLIHATLSGALLISANFTDA 190



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 31/137 (22%)

Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A F  A+L  A  ++ N             AN T+  +RES   G+    A L  A+   
Sbjct: 125 ANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVILRESILEGANLIHATLSGALLIS 184

Query: 160 ANFT----------GADLSDTLMDRM----------VLNEANLTNAVLVRTVLTRSDLGG 199
           ANFT          GADLSD  +  +           L  ANL+ A L RT L+ S+L G
Sbjct: 185 ANFTDADMSRVTMIGADLSDANLSGVNLRAANVSWTTLRGANLSRARLYRTKLSWSNLSG 244

Query: 200 AIIEGADFSDAVIDLAQ 216
           A +  A   D  +D A 
Sbjct: 245 ANLIEAVLLDTRLDHAN 261


>gi|312194409|ref|YP_004014470.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
 gi|311225745|gb|ADP78600.1| pentapeptide repeat protein [Frankia sp. EuI1c]
          Length = 2027

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 30/83 (36%), Positives = 45/83 (54%)

Query: 130  ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            A+ T  D+ ++D +G+    A L+ A    AN TGA L+     R+ L  ANLT+A L R
Sbjct: 1243 ADLTGLDLSDADLAGANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLTDADLRR 1302

Query: 190  TVLTRSDLGGAIIEGADFSDAVI 212
              LT  DL G ++ G+ +  A +
Sbjct: 1303 ARLTDPDLTGTVLTGSKWERAAL 1325



 Score = 45.8 bits (107), Expect = 0.019,   Method: Composition-based stats.
 Identities = 25/71 (35%), Positives = 39/71 (54%)

Query: 130  ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
            AN T AD+ +++ +G+   GA L    A +   TGA+L+D  + R  L + +LT  VL  
Sbjct: 1258 ANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLTDADLRRARLTDPDLTGTVLTG 1317

Query: 190  TVLTRSDLGGA 200
            +   R+ L GA
Sbjct: 1318 SKWERAALLGA 1328


>gi|392410087|ref|YP_006446694.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
 gi|390623223|gb|AFM24430.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
          Length = 490

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 10/95 (10%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANL 182
           FRAN + A +  ++F+G+  N A L      KANFT ADLS+ +     M R+ L+   L
Sbjct: 123 FRANLSKAIIDTANFTGANLNCANLAGNKLSKANFTKADLSEAVLTSSDMSRIQLSGNKL 182

Query: 183 TNA-----VLVRTVLTRSDLGGAIIEGADFSDAVI 212
           T A     VL +  + R+DL GA +E AD SDA +
Sbjct: 183 TKADLSWGVLSKARIERADLTGANLERADLSDAKL 217



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            A+ +  D+ E+DFSGS  + + LE +   K  F+  +LS+T M R  L++ NLT A L 
Sbjct: 64  EADLSEIDLTEADFSGSNLSKSKLEGSCLKKGIFSRCNLSNTDMTRTTLSDCNLTEANLF 123

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
           R  L++     AII+ A+F+ A ++ A
Sbjct: 124 RANLSK-----AIIDTANFTGANLNCA 145



 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 11/112 (9%)

Query: 110 AAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           +A F +A ++ AV  + N +           A+F+ A   + DFSG+   GA L++A  +
Sbjct: 314 SANFSNAQMQGAVLTRTNLQEADFQKAAAQNADFSQASGEKVDFSGAVLQGANLQEANFF 373

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           KA    ADLS   + +    + +LT  + + T+   +DL     + AD S A
Sbjct: 374 KAKLERADLSSANVSKASFRDGDLTRVIALATIFVSADLQNTSFKDADVSAA 425



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 48/95 (50%), Gaps = 10/95 (10%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEA 180
           E F  + + A +R+SD +G+ F+ + L      K++   ANF+ A +   ++ R  L EA
Sbjct: 276 EGFNCDLSGAAVRDSDLTGANFSSSQLVETDFSKSILVSANFSNAQMQGAVLTRTNLQEA 335

Query: 181 NL-----TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +       NA   +    + D  GA+++GA+  +A
Sbjct: 336 DFQKAAAQNADFSQASGEKVDFSGAVLQGANLQEA 370


>gi|330509039|ref|YP_004385467.1| pentapeptide repeat-containing protein [Methanosaeta concilii GP6]
 gi|328929847|gb|AEB69649.1| pentapeptide repeat protein [Methanosaeta concilii GP6]
          Length = 386

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 57/110 (51%), Gaps = 6/110 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA-VAY----KANF 162
           S +    AD  +A  ++ N   AN   ADM  +D + +   GA L+ A + Y    KANF
Sbjct: 204 SGSDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVLTGASLKSAKMPYSDLTKANF 263

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           TGADLS+  +D  +L  A L NA L R  L   DL G  + GA   ++V+
Sbjct: 264 TGADLSEAYLDGAILAGATLRNAKLDRVNLREVDLRGLEMGGASLKNSVL 313



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 5/88 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLTN 184
           A+   +D++  + +GS  +GAYL  A    ++  G     ADL+  ++    L  A+LT 
Sbjct: 51  AHLNQSDLQGCNLNGSNLDGAYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTG 110

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A L+R  ++++ L GA I  AD ++A I
Sbjct: 111 ANLIRVQMSKAKLNGARIVKADLTEADI 138



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A   SA +  S  +GS    A L  AV  +A+ TGADL+   + R+ +++A L  A +V+
Sbjct: 71  AYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTGANLIRVQMSKAKLNGARIVK 130

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             LT +D+  + +  AD +DA
Sbjct: 131 ADLTEADISDSDLSDADLTDA 151



 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 6/104 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    AD+  A   +  F RA   S ++  SD S + F  AYL      ++N TGA++  
Sbjct: 176 AHISWADMSVAYLSQGQFSRAELYSTNLSGSDLSDADFTRAYL-----MRSNLTGANIDW 230

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             M    L EA LT A L    +  SDL  A   GAD S+A +D
Sbjct: 231 ADMAYADLTEAVLTGASLKSAKMPYSDLTKANFTGADLSEAYLD 274



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 1/103 (0%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +ADL  AV  + +   A+ T A++     S +K NGA + KA   +A+ + +DLSD  + 
Sbjct: 90  NADLTGAVLTEADLTGADLTGANLIRVQMSKAKLNGARIVKADLTEADISDSDLSDADLT 149

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
              L   +L+ A L    LT +++ GA I  AD S A +   Q
Sbjct: 150 DARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSVAYLSQGQ 192



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 15/98 (15%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA-------------VAY--KANFTGADLSDTLMDR 174
           A+ T A +  +D SG+K  G YL  A             VAY  +  F+ A+L  T +  
Sbjct: 146 ADLTDARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSVAYLSQGQFSRAELYSTNLSG 205

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             L++A+ T A L+R+ LT +++  A +  AD ++AV+
Sbjct: 206 SDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVL 243



 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 19/112 (16%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    ADL +AV       A+  SA M  SD + + F GA L +A    A   GA L + 
Sbjct: 231 ADMAYADLTEAVLTG----ASLKSAKMPYSDLTKANFTGADLSEAYLDGAILAGATLRNA 286

Query: 171 LMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +DR+ L E          A+L N+VL    +  +DL      GAD  DA +
Sbjct: 287 KLDRVNLREVDLRGLEMGGASLKNSVLTGVFMAMTDLA-----GADLRDATL 333


>gi|110597243|ref|ZP_01385531.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
 gi|110341079|gb|EAT59547.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
          Length = 447

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 57/108 (52%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S + F SA L +A     N  + NF  ADM+ +   G+   GA L++A    A+ +  +L
Sbjct: 304 SGSSFKSASLDEANLAGANLSKVNFHKADMKGAHLQGANLQGANLDRAFLKDADLSNTNL 363

Query: 168 SD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S+     T++    L  ANL NA L    L  ++LGGA ++GA+ +DA
Sbjct: 364 SNAVLFGTILTGANLQNANLENASLFEADLEEANLGGANLKGANITDA 411



 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 46/83 (55%), Gaps = 5/83 (6%)

Query: 135 ADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ADM+ +D SG     +   G+++++A+   AN  GA+L   +++   + +ANL N VL  
Sbjct: 103 ADMKGTDLSGACLIKANMKGSFMKEAIFRGANLQGANLRWVMLEEADMEDANLANTVLFE 162

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L  ++L GA ++ A F D  +
Sbjct: 163 ANLENANLKGANLKDAVFLDQAL 185


>gi|434395496|ref|YP_007130443.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428267337|gb|AFZ33283.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 249

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 57/104 (54%), Gaps = 9/104 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    A+L+ A ++ E   AN + AD+ E+D SG+  +GA L   +   AN + A LS
Sbjct: 128 SGANLAQANLKGA-NLTE---ANLSKADLTEADLSGADLSGATLSGVILSDANLSDAILS 183

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++   VL  ANL+ AVL    LT  +L     EGA+ S+AV+
Sbjct: 184 RAILTLAVLQGANLSGAVLSGVNLTEVNL-----EGANLSNAVL 222



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 52/82 (63%), Gaps = 5/82 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N + A++ +++  G+    A L KA   +A+ +GADLS   +  ++L++ANL++A+L R 
Sbjct: 126 NLSGANLAQANLKGANLTEANLSKADLTEADLSGADLSGATLSGVILSDANLSDAILSRA 185

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
           +LT      A+++GA+ S AV+
Sbjct: 186 ILTL-----AVLQGANLSGAVL 202



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 44/78 (56%)

Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            F+G  L  A   +A    ADLS+ ++   +L +A L+ A L RT+LT++DL  A++ GA
Sbjct: 16  NFSGENLRSADLTRATLNAADLSEAILSEAILTQAELSEANLSRTILTKADLTEAVLAGA 75

Query: 206 DFSDAVIDLAQKQALCKY 223
             + A++  A+   +  Y
Sbjct: 76  KLTGAILTEAELSRVNLY 93


>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
 gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
          Length = 243

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 15/125 (12%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL 152
           LNKY+   R          F S  LR+    + N  + NF SAD+R+S    S FNGA L
Sbjct: 7   LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57

Query: 153 EKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           ++A     + +  N    +LS  ++    L+ A LTNA L    L+++ L GA +  A+ 
Sbjct: 58  KQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANLAKANL 117

Query: 208 SDAVI 212
           S AV+
Sbjct: 118 SHAVL 122



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    ADL +++    N    N + A +R++D SG++   A L  A   KA+  GA+L
Sbjct: 53  NGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANL 112

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +   +   VL E +L      RT L R++L    +  A  S A++
Sbjct: 113 AKANLSHAVLYEVDLRPLSNRRTNLGRANLSSTDLSYAKLSSALL 157



 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 42/91 (46%), Gaps = 6/91 (6%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGAD 166
           F SAD+R++   K NF  A    AD+ ES   G+      L KA+        A  T AD
Sbjct: 37  FESADIRQSRLGKSNFNGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNAD 96

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           L++  + +  L  ANL  A L   VL   DL
Sbjct: 97  LTNAYLSKASLCGANLAKANLSHAVLYEVDL 127


>gi|434400818|ref|YP_007134822.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428271915|gb|AFZ37856.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 209

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 31/137 (22%)

Query: 127 NF-RANFTSADMRESDFSGS---------------KFNGAYLEKAVAYKANFTGADLSDT 170
           NF +AN T AD RE D + +                   A LE+AV Y+A+    +LS +
Sbjct: 36  NFSQANLTGADFREIDLTQAILCEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQS 95

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAI----------IEGADFSDAVIDLAQKQAL 220
           ++    L EANLT A+L +T L ++ L GA+          + GA+ S A++     QA 
Sbjct: 96  ILTEADLREANLTEALLYKTSLGKAQLQGAVLNRAILQRTFLRGANLSQAIL----SQAN 151

Query: 221 CKYANGTNP-ITGVSTR 236
            + AN T+  +TG + R
Sbjct: 152 LQEANLTDADLTGANLR 168



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 64/131 (48%), Gaps = 1/131 (0%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF 142
           C +N+S    +     +   E  +   A     +L +++  + + R AN T A + ++  
Sbjct: 58  CEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQSILTEADLREANLTEALLYKTSL 117

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
             ++  GA L +A+  +    GA+LS  ++ +  L EANLT+A L    L  ++L GA +
Sbjct: 118 GKAQLQGAVLNRAILQRTFLRGANLSQAILSQANLQEANLTDADLTGANLRGANLQGAFL 177

Query: 203 EGADFSDAVID 213
             A+  +A ++
Sbjct: 178 VEANLFEASLE 188



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           + R+ D S     G  L+     +AN TGAD  +  + + +L EANL+  +L+   LT++
Sbjct: 16  EFRQVDLSYRVLRGVDLQAINFSQANLTGADFREIDLTQAILCEANLSQTILIEANLTKA 75

Query: 196 DLGGAIIEGADFSDAVIDLAQ 216
           +L  A++  A     +++L+Q
Sbjct: 76  NLERAVLYRASLQ--LVNLSQ 94


>gi|428223553|ref|YP_007107650.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427983454|gb|AFY64598.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 521

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 12/128 (9%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLE 153
           E    +G G    FG  DLR+A   + N        A+ + A++  ++ SG+   GA L 
Sbjct: 5   ELLKRYGAGER-NFGGMDLREANLSRANLSHIDLSGADLSVANLSGANLSGADLRGARLN 63

Query: 154 KAVAYKANFTGADLSDTLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            A    AN +GA+LS  +++     R  L  ANL  A L+R  L R+DL  A ++ AD  
Sbjct: 64  VAKLSGANLSGANLSSCILNVANLVRADLTGANLNQAALIRAELMRADLKQATLDSADLG 123

Query: 209 DAVIDLAQ 216
            A +  AQ
Sbjct: 124 GAQLQEAQ 131



 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 11/124 (8%)

Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 164
           A F  A+LR A   + N  RANF +A++R ++ S     G+  +GA L  A    A   G
Sbjct: 155 AVFDQANLRGADLNRANATRANFRNAELRLANLSEILLIGADLHGANLRWANLTGARLRG 214

Query: 165 ADLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           ADL++  +         L   NLT+A L+   L+R++L G    GA+ + A +  A+   
Sbjct: 215 ADLTEAKLSGAAIVGADLRNVNLTHASLIHADLSRANLIGTDWIGAELTGATLTGAKLHG 274

Query: 220 LCKY 223
           + +Y
Sbjct: 275 VSRY 278



 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 5/90 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR-----MVLNEANL 182
            RA    AD++++    +   GA L++A  ++ANF+ A+LS+    R      V ++ANL
Sbjct: 103 IRAELMRADLKQATLDSADLGGAQLQEAQLHQANFSRANLSEVNFHRATLADAVFDQANL 162

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             A L R   TR++   A +  A+ S+ ++
Sbjct: 163 RGADLNRANATRANFRNAELRLANLSEILL 192



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 49/164 (29%), Positives = 72/164 (43%), Gaps = 31/164 (18%)

Query: 76  LAAAVVASCSSNISAL-------ADLNKYEAETRGEF-------GIGSAAQFGSADLRKA 121
           L+ A ++SC  N++ L       A+LN+  A  R E            +A  G A L++A
Sbjct: 72  LSGANLSSCILNVANLVRADLTGANLNQ-AALIRAELMRADLKQATLDSADLGGAQLQEA 130

Query: 122 VHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
              + NF        NF  A + ++ F  +   GA L +A A +ANF  A+L    +  +
Sbjct: 131 QLHQANFSRANLSEVNFHRATLADAVFDQANLRGADLNRANATRANFRNAELRLANLSEI 190

Query: 176 V----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           +          L  ANLT A L    LT + L GA I GAD  +
Sbjct: 191 LLIGADLHGANLRWANLTGARLRGADLTEAKLSGAAIVGADLRN 234


>gi|220907082|ref|YP_002482393.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219863693|gb|ACL44032.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 309

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  A+L ++     N R A  T AD+RE+     K N A L ++   +AN TGADL  
Sbjct: 185 ADFQGANLSRSTLTGANLRGAYLTGADLREA-----KLNEANLRRSDLSQANLTGADLRG 239

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++R  L  ANL  ++L+   L  ++L  A ++GA+  +AV+
Sbjct: 240 ANLNRATLRGANLRESILIGASLMGANLSQASLQGANLLEAVL 282



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 54/106 (50%), Gaps = 1/106 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A++  A+  + N   A+ T A++ +++  G+   GAYL  A       TGA+L
Sbjct: 113 SEANLTGAEISAAILREANLTLADLTLAELSQTNLRGANLTGAYLRGAELLGTQLTGAEL 172

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           S        L EA+   A L R+ LT ++L GA + GAD  +A ++
Sbjct: 173 SQANFRGTNLTEADFQGANLSRSTLTGANLRGAYLTGADLREAKLN 218



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 16/103 (15%)

Query: 116 ADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           ADLR+A   + N R      AN T AD+R ++ + +   GA L +++   A+  GA+LS 
Sbjct: 210 ADLREAKLNEANLRRSDLSQANLTGADLRGANLNRATLRGANLRESILIGASLMGANLS- 268

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                    +A+L  A L+  VLT ++L G  + G D S  V+
Sbjct: 269 ---------QASLQGANLLEAVLTGANLTGVDLTGVDLSATVM 302



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 14/108 (12%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADLR A        AN ++AD+R +D  G     A L K    KAN TGADL+
Sbjct: 48  SGANLQGADLRGATLAA----ANLSNADLRGADLRGVLLMEADLRKVNLRKANLTGADLT 103

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
                      ANLT A L    LT +++  AI+  A+ + A + LA+
Sbjct: 104 G----------ANLTGADLSEANLTGAEISAAILREANLTLADLTLAE 141



 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 109 SAAQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           +AA   +ADLR    + V + E   A+    ++R+++ +G+   GA L  A   +AN TG
Sbjct: 63  AAANLSNADLRGADLRGVLLME---ADLRKVNLRKANLTGADLTGANLTGADLSEANLTG 119

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           A++S  ++    L  A+LT A L +T L  ++L GA + GA+ 
Sbjct: 120 AEISAAILREANLTLADLTLAELSQTNLRGANLTGAYLRGAEL 162



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 4/105 (3%)

Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
           VH+ +    +   A++R +  SG+   GA L  A    AN + ADL    +  ++L EA+
Sbjct: 30  VHLSQ---VDLQGANLRGAGLSGANLQGADLRGATLAAANLSNADLRGADLRGVLLMEAD 86

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
           L    L +  LT +DL GA + GAD S+A +  A+   A+ + AN
Sbjct: 87  LRKVNLRKANLTGADLTGANLTGADLSEANLTGAEISAAILREAN 131


>gi|428220994|ref|YP_007105164.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994334|gb|AFY73029.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 283

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 55/108 (50%), Gaps = 6/108 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 164
           A   +A++  A     N R A    A++R S  +G+      F GA L +AV    N T 
Sbjct: 169 ANLDTANISDADLTNANLRWATLRDANLRGSILTGANGNLANFTGANLSQAVLRGINLTN 228

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           ADLS+  ++   L+ ANL  A LV   LT +DL GA I  AD S AV+
Sbjct: 229 ADLSNAKLNAADLSNANLVGASLVGANLTSADLTGANITNADLSGAVM 276



 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 45/77 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T+A++R +    S  + A L +A   +A+ + A  ++  +D   +++A+LTNA L  
Sbjct: 129 ANLTAANLRSASLYKSNLSLAILTQATLAEADLSDASFTEANLDTANISDADLTNANLRW 188

Query: 190 TVLTRSDLGGAIIEGAD 206
             L  ++L G+I+ GA+
Sbjct: 189 ATLRDANLRGSILTGAN 205


>gi|378579963|ref|ZP_09828623.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
           DC283]
 gi|377817422|gb|EHU00518.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
           DC283]
          Length = 272

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           GS A    ADLR A        A+ + AD+  +D SG+   GAYL  A    A+ +GADL
Sbjct: 24  GSRADLRGADLRGAYLRG----ADLSGADLSGADLSGADLRGAYLRDADLRGADLSGADL 79

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           SD  +    L +A+L  A      L+ +DL GA + GAD S A +
Sbjct: 80  SDADLRGAYLRDADLRGA-----DLSDADLSGAYLRGADLSGADL 119



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 51/110 (46%), Gaps = 6/110 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADLR A     + R      A+ + A +R +D SG+   GAYL  A    A+ 
Sbjct: 75  SGADLSDADLRGAYLRDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDADLRGADL 134

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           + ADLS   +    L  A+L  A L    L  +DL GA +  AD S A +
Sbjct: 135 SDADLSGAYLRDADLRGADLRGADLRGAYLRDADLRGADLSDADLSGAYL 184



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 55/115 (47%), Gaps = 3/115 (2%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVA 157
           A+ RG +  G  A    ADL  A     + R A    AD+R +D SG+  + A L  A  
Sbjct: 32  ADLRGAYLRG--ADLSGADLSGADLSGADLRGAYLRDADLRGADLSGADLSDADLRGAYL 89

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             A+  GADLSD  +    L  A+L+ A L    L  +DL GA +  AD S A +
Sbjct: 90  RDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDADLRGADLSDADLSGAYL 144



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 46/91 (50%), Gaps = 10/91 (10%)

Query: 127 NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
           + R N + AD+R     G+   GAYL  A    A+ +GADLS   +    L +A+L  A 
Sbjct: 19  SLRQNGSRADLR-----GADLRGAYLRGADLSGADLSGADLSGADLRGAYLRDADLRGAD 73

Query: 187 LVRTVLTRSDLGGAIIE-----GADFSDAVI 212
           L    L+ +DL GA +      GAD SDA +
Sbjct: 74  LSGADLSDADLRGAYLRDADLRGADLSDADL 104


>gi|86606854|ref|YP_475617.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86555396|gb|ABD00354.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 248

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 59/128 (46%), Gaps = 15/128 (11%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD---------------LSDTLMDR 174
           +NFT+A + +S F G  F+ +   +A    AN T  +               LS  ++  
Sbjct: 109 SNFTAAKLDKSSFQGGHFSHSIFREASLVAANLTEGNFFAADFRQANLFRCNLSQAILSS 168

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
             L  AN   A+LV   L  + + GA   GADF+DA +    ++ L + A+GTN +T   
Sbjct: 169 CQLQNANFDQALLVGANLQEAQIEGASFVGADFTDAKLSDEMRKFLLERASGTNELTQRD 228

Query: 235 TRKSLGCG 242
           T  +L  G
Sbjct: 229 TLNTLLAG 236


>gi|325106774|ref|YP_004267842.1| pentapeptide repeat-containing protein [Planctomyces brasiliensis
           DSM 5305]
 gi|324967042|gb|ADY57820.1| pentapeptide repeat protein [Planctomyces brasiliensis DSM 5305]
          Length = 194

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 16/148 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN + AD+ E+D  G+  +GA L +A   +A+  GADLS   +    L+ ANL+ A L 
Sbjct: 25  RANLSEADLSEADLRGADLSGANLSEADLSEADLRGADLSGANLSWANLSWANLSEADLS 84

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYANGTNPITGVSTRKSLGCG 242
              L+ +DL  A + GAD S A +  A        +A+ +   G   I       S+GC 
Sbjct: 85  GANLSEADLSEADLRGADLSGANLRGANLSGANLSEAVARLDFGAWSICVRKDVTSIGCR 144

Query: 243 NSRRNAYGSPSSPLLSAPPQKLLDRDGF 270
             R + +       L   P    D DGF
Sbjct: 145 TYRNDRW-------LEWTPD---DVDGF 162


>gi|428222289|ref|YP_007106459.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
 gi|427995629|gb|AFY74324.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
          Length = 563

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 5/97 (5%)

Query: 119 RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMD 173
           RK +    + + NF + D+ ++  +G+  +G  + ++   + +F  +DL+       +M 
Sbjct: 396 RKVIVEYGHGKRNFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMT 455

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ++ LN ANL  A + R +LT++DLGGA +  AD  +A
Sbjct: 456 QVKLNGANLAQAKMQRAILTKADLGGACLNQADLREA 492



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 17/123 (13%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGS-----KF 147
           E+G G    F + DL KA     N              +F  +D+  + F+G+     K 
Sbjct: 401 EYGHGKR-NFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMTQVKL 459

Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           NGA L +A   +A  T ADL    +++  L EANL +A + +  L+ +DL GA ++GA  
Sbjct: 460 NGANLAQAKMQRAILTKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYL 519

Query: 208 SDA 210
           S A
Sbjct: 520 SQA 522


>gi|378826441|ref|YP_005189173.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
           HH103]
 gi|365179493|emb|CCE96348.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
           HH103]
          Length = 250

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 12/122 (9%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A   +A+L KA  V+ +       +ANF+  +    DFSG    GA    +   +A+FTG
Sbjct: 88  ADLTAANLEKATLVRASLAGAKADKANFSRVEGYRGDFSGISAEGALFVSSELQRADFTG 147

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQKQ 218
           A L+    ++  L  AN   AV+  T      L+R+DL GA+ EG  DF  A + L + +
Sbjct: 148 ARLTGADFEKAELGRANFGKAVVTGTRFSVANLSRADLSGAVFEGPIDFDRAFLFLTRIE 207

Query: 219 AL 220
            L
Sbjct: 208 GL 209



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 5/87 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN-----A 185
           N    D   +D +G+    A LEKA   +A+  GA        R+     + +      A
Sbjct: 74  NLVDTDFASTDLNGADLTAANLEKATLVRASLAGAKADKANFSRVEGYRGDFSGISAEGA 133

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVI 212
           + V + L R+D  GA + GADF  A +
Sbjct: 134 LFVSSELQRADFTGARLTGADFEKAEL 160


>gi|425458953|ref|ZP_18838439.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
           9808]
 gi|389823440|emb|CCI28334.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
           9808]
          Length = 425

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 5/120 (4%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A L  A+ ++ N R A  + AD+ E+D SG+    A L KA+  +A    A LS+
Sbjct: 285 ANLIKAILSWAILIEANLRGAILSEADLSEADLSGANLRRANLIKAILRRAILIEAILSE 344

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
             +    L  ANL  A+L+  +L  +DL GA +  A+ S+A I+     A+   A G  P
Sbjct: 345 ADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFIDATGITP 400



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + + + AD+ E+D SG+  +GA L +A    AN +GA+LS   +    L  ANL  A+L 
Sbjct: 234 QVDLSGADLSEADLSGAILSGANLSEANLSGANLSGANLSWANLIDANLRRANLIKAILS 293

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
             +L  ++L GAI+  AD S+A
Sbjct: 294 WAILIEANLRGAILSEADLSEA 315



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 9/104 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADL  A+          + A++ E++ SG+  +GA L  A    AN   A+L 
Sbjct: 238 SGADLSEADLSGAI---------LSGANLSEANLSGANLSGANLSWANLIDANLRRANLI 288

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++   +L EANL  A+L    L+ +DL GA +  A+   A++
Sbjct: 289 KAILSWAILIEANLRGAILSEADLSEADLSGANLRRANLIKAIL 332



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 2/103 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+LR+A  +K   R A    A + E+D SG+    A L KA+  +A    ADL
Sbjct: 313 SEADLSGANLRRANLIKAILRRAILIEAILSEADLSGANLRRANLIKAILIEAILIEADL 372

Query: 168 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 209
               +    L+EA++ NA+ +  T +T       I  GA F D
Sbjct: 373 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 415


>gi|116754331|ref|YP_843449.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
           PT]
 gi|116665782|gb|ABK14809.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
          Length = 389

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 1/106 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A F  A L  A   +  FR + F+ A++  ++ +G+  +G+   ++   +A  TGADL
Sbjct: 177 SHANFVGAHLSWADMSRSRFRESQFSRAELYGANLTGTDLSGSDFTRSYMMRARMTGADL 236

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           SD  +D   L EA L +  L    +  +DL GA + GAD S+ V+D
Sbjct: 237 SDASLDYADLTEAELRDTDLSGCKMRYADLSGANLAGADISEVVLD 282



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 78/176 (44%), Gaps = 34/176 (19%)

Query: 46  SDGQFPDCSNNQCAGP---YAKLKNWRVFVSTALAAAV-VASCSSNISALADLNKYEAET 101
           +D    D S    +G     AKL+N R+  ++ + A + +A C+  +  + D++  +AE 
Sbjct: 99  ADLSMADLSGANLSGTDLSRAKLRNARLSGASLVNANLTMADCTEAL--MDDVSLEDAEM 156

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
            G        +F   DL  AV    +   ANF  A +  +D S S+F  +   +A  Y A
Sbjct: 157 TG-------TRFFRTDLTGAVFSGASLSHANFVGAHLSWADMSRSRFRESQFSRAELYGA 209

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           N TG DLS +          + T + ++R  +T          GAD SDA +D A 
Sbjct: 210 NLTGTDLSGS----------DFTRSYMMRARMT----------GADLSDASLDYAD 245



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 49/111 (44%), Gaps = 16/111 (14%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 159
           A    A LR A  V  N   A+   AD+  +D SG+  +G          A L  A    
Sbjct: 74  ANLNGAYLRSAWLVNANLEGASLAGADLSMADLSGANLSGTDLSRAKLRNARLSGASLVN 133

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN T AD ++ LMD + L +A +T     RT     DL GA+  GA  S A
Sbjct: 134 ANLTMADCTEALMDDVSLEDAEMTGTRFFRT-----DLTGAVFSGASLSHA 179



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 5/84 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T A++R++D SG K   A L  A     N  GAD+S+ ++D +     NL+ A+L +
Sbjct: 244 ADLTEAELRDTDLSGCKMRYADLSGA-----NLAGADISEVVLDSVKTTGVNLSGAILYK 298

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
           T L   DL    + G     A +D
Sbjct: 299 TSLFNLDLRDIDMHGVQIKKAKMD 322


>gi|113477234|ref|YP_723295.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168282|gb|ABG52822.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 227

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 11/122 (9%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMR-----ESDFSGSKFNGAYLEKAVAYK 159
           A+F  ADL +A  ++ +       + N   AD+      E D  G+   G    +A+  K
Sbjct: 90  AKFNKADLTRAKLIRADLSCADFSQVNMVDADLSRAILYEIDLHGANLYGVNFRRAILNK 149

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           A+  GA+L    M  + L EANLT A L   +L+ +DL GA + GA+ SD  +  A  QA
Sbjct: 150 ADLIGANLIRANMTGVDLIEANLTRANLTEAILSGADLNGASLLGANISDVNLVGAALQA 209

Query: 220 LC 221
           + 
Sbjct: 210 VI 211



 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 5/77 (6%)

Query: 139 ESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
           E +FSG     A+L     E A  ++AN TGA+LS   + R+ L +ANLT A L+ T L+
Sbjct: 19  EKNFSGLYLQEAHLLKANLEGANFFEANLTGANLSQANLSRVNLAKANLTGANLIGTDLS 78

Query: 194 RSDLGGAIIEGADFSDA 210
            ++L   ++ GA F+ A
Sbjct: 79  EANLSDTLLVGAKFNKA 95



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 49/95 (51%), Gaps = 10/95 (10%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRMVL 177
            +AN   A+  E++ +G+  + A L +    KAN TGA+L          SDTL+     
Sbjct: 33  LKANLEGANFFEANLTGANLSQANLSRVNLAKANLTGANLIGTDLSEANLSDTLLVGAKF 92

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           N+A+LT A L+R  L+ +D     +  AD S A++
Sbjct: 93  NKADLTRAKLIRADLSCADFSQVNMVDADLSRAIL 127



 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 66/154 (42%), Gaps = 26/154 (16%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFS-----GSKFN 148
           N +EA   G       A    A+L +    K N   AN    D+ E++ S     G+KFN
Sbjct: 41  NFFEANLTG-------ANLSQANLSRVNLAKANLTGANLIGTDLSEANLSDTLLVGAKFN 93

Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            A L +A   +A+ + AD S          + N+ +A L R +L   DL GA + G +F 
Sbjct: 94  KADLTRAKLIRADLSCADFS----------QVNMVDADLSRAILYEIDLHGANLYGVNFR 143

Query: 209 DAVI---DLAQKQALCKYANGTNPITGVSTRKSL 239
            A++   DL     +     G + I    TR +L
Sbjct: 144 RAILNKADLIGANLIRANMTGVDLIEANLTRANL 177


>gi|309792396|ref|ZP_07686863.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
           DG-6]
 gi|308225551|gb|EFO79312.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
           DG6]
          Length = 314

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 60/125 (48%), Gaps = 10/125 (8%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLRK      N   AN T A++R ++ S + F+GA L  A     N +G DL D
Sbjct: 89  ADLSDADLRKGDLAWANLEFANLTGANLRGANLSAADFSGANLYGANLSLCNLSGVDLRD 148

Query: 170 TLMDRMVLNEANLTNAVLVRTV--------LTRSDLGGAIIEGADFSDA-VIDLAQKQAL 220
           T+M    L EA L  A LV           L +  LGGA ++G + S A ++    ++A 
Sbjct: 149 TIMIGANLTEAQLREAQLVNLSGANLSGANLNKVSLGGASMQGVNLSGASLLSANLREAT 208

Query: 221 CKYAN 225
            + AN
Sbjct: 209 LREAN 213



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 68/146 (46%), Gaps = 13/146 (8%)

Query: 78  AAVVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKEN------FRA 130
           A +V    +N+S  A+LNK         G+  S A   SA+LR+A   + N      + A
Sbjct: 164 AQLVNLSGANLSG-ANLNKVSLGGASMQGVNLSGASLLSANLREATLREANLIGANLYEA 222

Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           N + AD+  +D S +  +G YL     E A+   AN + A+LS   +    LN  NL  A
Sbjct: 223 NLSEADLSAADLSMANLSGIYLSGANLEGAILTHANLSRANLSGCNLRGAQLNGCNLREA 282

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAV 211
            L    LT +DL GA +   D S  +
Sbjct: 283 SLADADLTGADLTGADLSECDLSGVI 308


>gi|359458687|ref|ZP_09247250.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 203

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 66/146 (45%), Gaps = 23/146 (15%)

Query: 111 AQFGSADLRKAVHVKENFRA-----------NFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A F SADLRKA   + + RA           N   A++  ++ SG+  +GA L  A+ Y 
Sbjct: 53  ANFASADLRKAKLFRADLRAACLYRADLRGANLKGANLFGANLSGANLSGANLSNAMLYC 112

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF--SDAV-IDLAQ 216
           AN  GA+L  T++D   L   N ++  L   +L  + L G   EG     +D + I+L Q
Sbjct: 113 ANLGGANLRGTILDSANLMRVNFSHGDLRNAMLRNAKLQGTHFEGTRMLQTDLIEINLNQ 172

Query: 217 KQALCKY---------ANGTNPITGV 233
            Q    Y         A G   ITG+
Sbjct: 173 AQIDGVYLMDPDANNTAMGNTAITGI 198


>gi|428771470|ref|YP_007163260.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428685749|gb|AFZ55216.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 195

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 49/82 (59%), Gaps = 6/82 (7%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +  +AD+R +D  G    GA L+KA    AN +GADLS     +  L EANL+ A+L  T
Sbjct: 113 DLCNADLRGADLRGVNLVGACLQKADLSNANLSGADLS-----QADLEEANLSGAILHGT 167

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            LT+++L  AI+EG  F D VI
Sbjct: 168 NLTQANLLCAIVEGVSF-DYVI 188



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 21/98 (21%)

Query: 130 ANFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN + AD+ +S+F+GS   G     A LEKA+  + N  GADL+   +    L  A+L  
Sbjct: 28  ANLSGADLAQSNFTGSNLTGVNLTGANLEKAI-LRCNLRGADLTGASLQGADLRGADLRG 86

Query: 185 AVLVRT---------------VLTRSDLGGAIIEGADF 207
           A+L+ +               +LT  DL  A + GAD 
Sbjct: 87  AILLSSQVENISLAGSFLAGAILTNLDLCNADLRGADL 124


>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 331

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 11/111 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 163
           S A    ++L KA  ++ NF RAN T A + ++D SG     A L  A+  K N T    
Sbjct: 65  SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLSG-----AILSSAIGTKTNLTAAIL 119

Query: 164 -GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            G  L  T + +  L EANLT A L   +LT S+L  AI+  A  S+A ++
Sbjct: 120 IGCSLVGTQLLKSKLKEANLTGASLTGAILTGSNLTRAILTRAILSNANLE 170



 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 6/89 (6%)

Query: 123 HVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           H + NF      +ANF  A +   D   +    A   +A    AN +GADLS + +++  
Sbjct: 19  HGQRNFQAIKLIKANFQRASLNNIDLKMAVLKKANFNQAQLINANLSGADLSQSNLEKAQ 78

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           L E N + A L    L ++DL GAI+  A
Sbjct: 79  LIETNFSRANLTEASLIQADLSGAILSSA 107



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 6/101 (5%)

Query: 116 ADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           ADLR A     N +       N   AD+ E++ S +   GA L  A   +    G +L+ 
Sbjct: 202 ADLRGANLEGANLQGANLEGVNLQDADLTEANLSAANLEGAVLSNANLQQVILKGTNLTG 261

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           T +    L +ANL+ A L +  L  +DL GA + GAD + A
Sbjct: 262 TNLLNANLGQANLSQANLCQAGLLFTDLTGANLMGADLTSA 302



 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 52/94 (55%), Gaps = 9/94 (9%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A   + DL+ AV  K    ANF  A +  ++ SG+  + + LEKA   + NF+ A+L++ 
Sbjct: 37  ASLNNIDLKMAVLKK----ANFNQAQLINANLSGADLSQSNLEKAQLIETNFSRANLTEA 92

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
                 L +A+L+ A+L   + T+++L  AI+ G
Sbjct: 93  -----SLIQADLSGAILSSAIGTKTNLTAAILIG 121



 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 10/91 (10%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           +R  +H     +AN   AD+R +D  G+   GA L+ A     N   ADL+         
Sbjct: 180 IRAYLHRVNLKKANLEKADLRFADLRGANLEGANLQGANLEGVNLQDADLT--------- 230

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            EANL+ A L   VL+ ++L   I++G + +
Sbjct: 231 -EANLSAANLEGAVLSNANLQQVILKGTNLT 260


>gi|298492301|ref|YP_003722478.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298234219|gb|ADI65355.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 264

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTG 164
           A    ADL  A  ++ N   AN   A++  +D SG+    A     YL +A  YKAN T 
Sbjct: 139 ANLKDADLAAAKLIRSNLSFANLVGANLITTDLSGANLYEAELMQTYLYQANLYKANLTN 198

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + L  + + R  L+EANLTNA L    LT ++L GA + GA+   A
Sbjct: 199 SHLGSSYLFRANLSEANLTNADLTCANLTGANLRGANLRGANLRGA 244



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 66/139 (47%), Gaps = 24/139 (17%)

Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTGA 165
           DL  A    ENFR AN    ++ + DFS          G+  + A L +A   +AN + A
Sbjct: 30  DLSTANLQGENFRGANLQGVNLTKVDFSHALLVRTNLSGANLSIANLHQAKLIEANLSEA 89

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----DFSDAVI---DLAQK 217
           +LS   +    L +ANL+   L+   L+ ++L GA I GA     DF +A +   DLA  
Sbjct: 90  NLSIANLRNATLTQANLSQVNLIGADLSEANLIGAAITGANLIGTDFRNANLKDADLAAA 149

Query: 218 QAL---CKYAN--GTNPIT 231
           + +     +AN  G N IT
Sbjct: 150 KLIRSNLSFANLVGANLIT 168


>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 402

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 53/103 (51%), Gaps = 4/103 (3%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADL KA   K NF  ANFT A + E+   G+ F  AYL +A    AN TG +L+ 
Sbjct: 281 AILAGADLTKA---KANFTGANFTGAILTEAILIGANFEKAYLIRADLTGANLTGTNLTR 337

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             +    L  ANLT A L++ +L  + L   I+ GA    A++
Sbjct: 338 ADLTEADLTGANLTRAYLIKAILEEAILEEVILRGAILRGAIL 380



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 18/149 (12%)

Query: 71  FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
           F    L  A++  A+    I A ADL K +A   G       A F  A L +A+ +    
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIG--- 312

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEANLT 183
            ANF  A +  +D +G+   G  L +A   +A+ TGA+L+       +++  +L E  L 
Sbjct: 313 -ANFEKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEAILEEVILR 371

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A+L   +LTR+ L GA ++GA   D  I
Sbjct: 372 GAILRGAILTRAILRGANLKGATMPDGSI 400



 Score = 43.9 bits (102), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N + A++ E++F  +    A L++A+   ANF GA     +  R  L EAN T A+L   
Sbjct: 217 NISKANLTEANFKRAILAEANLKRAILIGANFEGA-----IFTRADLAEANFTRAILTEA 271

Query: 191 VLTRSDLGGAIIEGADFSDA 210
           +L  ++   AI+ GAD + A
Sbjct: 272 ILIGANFEEAILAGADLTKA 291


>gi|86605499|ref|YP_474262.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86554041|gb|ABC98999.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 330

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 58/118 (49%), Gaps = 4/118 (3%)

Query: 99  AETRGEFGIG---SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEK 154
           A+ RG   +G      Q G A+L++A+  + N   AN + AD+  +D S +    A L +
Sbjct: 207 ADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAACLRSAKLAR 266

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
               +AN  GADL    +    L   NL NA L   +LTR+DL  A + GA+   A +
Sbjct: 267 TDLSRANLAGADLRSASLVDAYLGRTNLENADLREAILTRADLSTANLAGANLRGATL 324



 Score = 44.7 bits (104), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 4/104 (3%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+    D+ ++D  G     AYL +A   KAN  GA+LS   + +  L+EA+L +A L 
Sbjct: 31  RADLIGIDLSQADLHGINLIFAYLGRAKLQKANLVGANLSGANLSQADLSEADLRDAHLH 90

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
            T L  +DL GA +  A   DA +     +A  ++AN T+   G
Sbjct: 91  GTTLQGADLHGANLALALLIDANL----LEADLRWANLTSANLG 130



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 11/98 (11%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G A L+KA  V  N   AN + AD+ E+D   +  +G  L+ A  + AN   A    
Sbjct: 52  AYLGRAKLQKANLVGANLSGANLSQADLSEADLRDAHLHGTTLQGADLHGANLALA---- 107

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                 +L +ANL  A L    LT ++LGGA + GA+ 
Sbjct: 108 ------LLIDANLLEADLRWANLTSANLGGACLRGANL 139



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 8/135 (5%)

Query: 74  TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRAN-F 132
           T L  A +   +  ++ L D N  EA+ R  +   ++A  G A LR A    E+ RA   
Sbjct: 92  TTLQGADLHGANLALALLIDANLLEADLR--WANLTSANLGGACLRGANLRFESRRAAVL 149

Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
            SA++  +D SG+   GA L      +A+  GA+L +  +    L  ANL  A L   +L
Sbjct: 150 RSANLSRADLSGANLAGADL-----TRADLRGANLKEASLIGAHLQGANLQRACLRGALL 204

Query: 193 TRSDLGGAIIEGADF 207
           + +DL GA   G D 
Sbjct: 205 SNADLRGASFLGGDL 219



 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 10/87 (11%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKA----------NFTGADLSDTLMDRMVLN 178
           RA+   A+++E+   G+   GA L++A    A          +F G DL    M R  L 
Sbjct: 171 RADLRGANLKEASLIGAHLQGANLQRACLRGALLSNADLRGASFLGGDLQGVQMGRANLK 230

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGA 205
           EA L+   L    L+ +DL GA +  A
Sbjct: 231 EAMLSQVNLAEANLSEADLAGADLSAA 257



 Score = 37.0 bits (84), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 132 FTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
            ++AD+R + F G    G     A L++A+  + N   A+LS+  +    L+ A L +A 
Sbjct: 204 LSNADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAACLRSAK 263

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVI 212
           L RT L+R++L GA +  A   DA +
Sbjct: 264 LARTDLSRANLAGADLRSASLVDAYL 289


>gi|75910595|ref|YP_324891.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704320|gb|ABA23996.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 521

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 59/116 (50%), Gaps = 7/116 (6%)

Query: 115 SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           SA+LR A   + NFR      A+ + A++R +D SG   + A L  A    AN  GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 223
           +  +    L  ANL  A L+R     +DL  AI+ GA  +S +   L  +  +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289



 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 9/101 (8%)

Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN + +++ E+DFS +K N     GA L  A+   ++   A+L  + + R  L  A+L  
Sbjct: 45  ANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRSDLSRAQLRGASLVR 104

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           A L+R  L+R DL  A +  AD  +A +    + A  ++AN
Sbjct: 105 AELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141



 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A   SADLR+A             A++R ++ +G+   GA L  A    AN  G+DLS
Sbjct: 118 SEANLNSADLREAT---------LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS 168

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                R  L  ANL +A L +     ++L GA + GA+ 
Sbjct: 169 -----RCDLTSANLRDAELKQVNFRHANLSGADLSGANL 202



 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 26/121 (21%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 185
           NF+  D+ E++ SG K  G    +A    AN +G++LS+       LN A     NLTNA
Sbjct: 16  NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75

Query: 186 V----------LVRTVLTRSDLGGAIIEGA----------DFSDAVIDLAQ-KQALCKYA 224
           +          L+R+ L+R+ L GA +  A          D S+A ++ A  ++A  ++A
Sbjct: 76  IFNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135

Query: 225 N 225
           N
Sbjct: 136 N 136


>gi|153873268|ref|ZP_02001907.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
 gi|152070268|gb|EDN68095.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
          Length = 159

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 49/83 (59%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           FRAN +  D+  +D SG+  +GA L +A    ANFT A+LS+  +      +ANLT+A L
Sbjct: 47  FRANLSHVDLTNTDLSGANLSGANLNEANLTNANFTKANLSEANLCESYFAKANLTDANL 106

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
               LT++ L  + + GA+ S+A
Sbjct: 107 SEANLTKAYLIESFLSGANLSEA 129



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANFT A++ E++   S F  A L  A   +AN T A L ++ +    L+EANL  + L  
Sbjct: 79  ANFTKANLSEANLCESYFAKANLTDANLSEANLTKAYLIESFLSGANLSEANLFRSNLFE 138

Query: 190 TVLTRSDLGGAIIEGADF 207
           + L R++L GA +  A F
Sbjct: 139 SDLFRANLTGANLYKAKF 156


>gi|194337742|ref|YP_002019536.1| pentapeptide repeat-containing protein [Pelodictyon
           phaeoclathratiforme BU-1]
 gi|194310219|gb|ACF44919.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 408

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 5/78 (6%)

Query: 135 ADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A++R+SDF+GS   GA     +++ AV  +AN  GA+L   +++   LN ANLT A L  
Sbjct: 111 ANLRKSDFTGSSLTGANLQGSFMKGAVLREANLEGANLRWAMLENGDLNRANLTGATLFE 170

Query: 190 TVLTRSDLGGAIIEGADF 207
             L  +DL GA ++ A F
Sbjct: 171 ANLAGADLKGANLKNAHF 188



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 46/84 (54%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A+   A+M+ +   G+   GA L++A    A+ + ++LS+ L+    L  ANL+ A L 
Sbjct: 285 KADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLSNALLYGAKLGNANLSGANLE 344

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              L  +DL GA +EGA+   A I
Sbjct: 345 GASLFEADLEGANLEGANLKGANI 368



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLT 183
           R     +  + ++ +G+ F+ A L KA    A   GADL    +DR  L  A     NL+
Sbjct: 265 RTRVEQSSFQNTNMAGADFHKADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLS 324

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           NA+L    L  ++L GA +EGA   +A ++
Sbjct: 325 NALLYGAKLGNANLSGANLEGASLFEADLE 354



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 5/88 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANL 182
            R     + +  +D  G+    A ++KA   K++FTG     A+L  + M   VL EANL
Sbjct: 84  IRVKLNGSKLDMADLKGANLTMALIKKANLRKSDFTGSSLTGANLQGSFMKGAVLREANL 143

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             A L   +L   DL  A + GA   +A
Sbjct: 144 EGANLRWAMLENGDLNRANLTGATLFEA 171


>gi|428219623|ref|YP_007104088.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427991405|gb|AFY71660.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 172

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/101 (40%), Positives = 54/101 (53%), Gaps = 6/101 (5%)

Query: 117 DLRKAVHVKENFRANF-TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DLR A     N R  F  +A +R SD +G+    A L  A    AN TGADL+   M+  
Sbjct: 69  DLRGA-----NLRGAFLKNARLRGSDLTGADLRDATLTGAYFTGANLTGADLAGAEMEWA 123

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            L +ANL +A L    L+RSDL GA ++GAD   A +  A+
Sbjct: 124 NLRDANLQDANLQDANLSRSDLDGANLDGADLRGANLSRAK 164


>gi|443324431|ref|ZP_21053184.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442795950|gb|ELS05284.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 239

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 60/105 (57%), Gaps = 11/105 (10%)

Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGA 165
           QF   +L++A  +K N    + T+AD+R++    S F  A L  A     + +  +FT A
Sbjct: 16  QFSRINLQEAELIKVNLSNVDLTAADLRQARLGRSNFGHACLRSADLSESILWGTDFTQA 75

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           DLS     + V+ EA+L+ A+L +  L +++L  +I+EGA+FS A
Sbjct: 76  DLS-----QAVMREADLSGAILTQANLEKANLIKSILEGANFSGA 115



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 60/112 (53%), Gaps = 4/112 (3%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           R  FG    A   SADL +++    +F +A+ + A MRE+D SG+    A LEKA   K+
Sbjct: 49  RSNFG---HACLRSADLSESILWGTDFTQADLSQAVMREADLSGAILTQANLEKANLIKS 105

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
              GA+ S   +   ++ E +L  A   RT L+++DL  A +  A+ S A++
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALL 157



 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 19/123 (15%)

Query: 107 IGSAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFS----------GSKFNGA 150
           I   A F  A LR A+ ++       ++R N + AD+  +D S           +K +GA
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALLYQAKLDGA 165

Query: 151 YLEKA---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            L +A        N    DL++  +    L+ ANLT A+L R  LT +DL G I+   D 
Sbjct: 166 RLSRANLSAGRGENALATDLTEASLRDADLSYANLTGAILHRADLTGADLTGTILTNTDL 225

Query: 208 SDA 210
            +A
Sbjct: 226 REA 228


>gi|359464087|ref|ZP_09252650.1| hypothetical protein ACCM5_35600 [Acaryochloris sp. CCMEE 5410]
          Length = 237

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 60/123 (48%), Gaps = 21/123 (17%)

Query: 111 AQFGSADLRKAVHVKENF------RANF----------TSADMR-----ESDFSGSKFNG 149
           A F  ADLR++   + NF      RAN           TSADMR     E+D SG+K   
Sbjct: 35  ADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSADMRQANLREADLSGAKLIQ 94

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
             L +A   KA   GA+LS   MD  +L E +L      RT L R++L GA +  A+ S 
Sbjct: 95  TQLTEANLLKACLCGANLSAVQMDGAILIEVDLRPTSDQRTDLGRANLAGADLSYANLSQ 154

Query: 210 AVI 212
           A++
Sbjct: 155 ALL 157



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 1/102 (0%)

Query: 113 FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F   +LR+A  +      A+F+ AD+R+S F  + F+     +A   +  F GADL+   
Sbjct: 17  FHRIELREAELINSELCGADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSAD 76

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           M +  L EA+L+ A L++T LT ++L  A + GA+ S   +D
Sbjct: 77  MRQANLREADLSGAKLIQTQLTEANLLKACLCGANLSAVQMD 118


>gi|359151325|ref|ZP_09184042.1| pentapeptide repeat-containing protein [Streptomyces sp. S4]
          Length = 240

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 65/142 (45%), Gaps = 2/142 (1%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
           R  +  A   A  A+ S    A+A+  +  ++T   + +  AA   +   R +       
Sbjct: 14  RSLLYLACPGAPPAAISDTARAIAE--RSGSQTSPTYAVAEAASLTAVPPRNSGRFHNLS 71

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN  SAD+   + +G+   GA L +     AN TGADL    +    L   NLT A + 
Sbjct: 72  RANLISADLARVNLTGANLTGADLARVNLTGANLTGADLIYANLAGADLTRVNLTRARMK 131

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
            T LT +DL GA + G D ++A
Sbjct: 132 LTNLTGADLTGADLAGGDLTNA 153



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 61/145 (42%), Gaps = 11/145 (7%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-----------ANFTSAD 136
           ++  A L        G F   S A   SADL +      N             AN T AD
Sbjct: 50  VAEAASLTAVPPRNSGRFHNLSRANLISADLARVNLTGANLTGADLARVNLTGANLTGAD 109

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           +  ++ +G+      L +A     N TGADL+   +    L  A+LTNA L    LT  D
Sbjct: 110 LIYANLAGADLTRVNLTRARMKLTNLTGADLTGADLAGGDLTNADLTNADLTGAHLTNVD 169

Query: 197 LGGAIIEGADFSDAVIDLAQKQALC 221
           L GAI+ GA+   A +  A++  L 
Sbjct: 170 LTGAILTGANLGGANLAAARQLRLV 194


>gi|428314172|ref|YP_007125149.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255784|gb|AFZ21743.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 276

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 11/114 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           SAA    A LR+A     N + AN  + D++ +D  G+   GA L++A     N +GADL
Sbjct: 104 SAATLKGAKLREA-----NLQGANLRAVDLKNADLCGANLQGADLKRADLINTNLSGADL 158

Query: 168 S-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           S     D + +++ L EANL  A L    L+ +DL GA +  A+ + A +  AQ
Sbjct: 159 SGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQ 212



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 16/119 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGS-----KFNGAYLEKAVA 157
           S A    A+L   +  K N R      AN    D+ E+D +G+       NGA L++A  
Sbjct: 154 SGADLSGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQL 213

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            +AN +G D     M  + L+ ANL  A L    L+++ L G  + GA+  +A++D A+
Sbjct: 214 SQANLSGLD-----MTHLNLSGANLRQANLSEAQLSQAQLYGTDLRGANLDEAILDQAK 267


>gi|254409899|ref|ZP_05023679.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182935|gb|EDX77919.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 478

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 69/151 (45%), Gaps = 20/151 (13%)

Query: 95  NKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR----------------ANFTSA 135
           N  EA  RG F  G+    A   +ADL ++     NFR                A+ + A
Sbjct: 141 NLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSGA 200

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           ++R +D SG+  + A L +A    AN TGADL+   +    L  A+LT A L+      +
Sbjct: 201 NLRWTDLSGANLSWANLSEAKLSGANLTGADLTHANLLNTSLVHADLTQARLIHADWIGA 260

Query: 196 DLGGAIIEGADFSD-AVIDLAQKQALCKYAN 225
           DL GA + GA     + + L  +  +C++ +
Sbjct: 261 DLTGATLTGAKLHGVSRVGLKTQGIVCEWVD 291



 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 4/129 (3%)

Query: 91  LADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSK 146
           L++ N   A   G + IG   S A+   A L  A   K N  +AN   A++  +D  G++
Sbjct: 37  LSEANLSVANLSGAYLIGTNLSRARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQ 96

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
              A + +A   +A  +GA L++  +    L EA L +A L R  L+ ++L GA + GA+
Sbjct: 97  LTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRGAFVTGAN 156

Query: 207 FSDAVIDLA 215
              A ++ A
Sbjct: 157 LEGANLNAA 165



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 45/80 (56%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF+ A++ E++ SG   +GA L +A    AN +GA L  T + R  LN A L+ A L + 
Sbjct: 16  NFSGANLAEANLSGINLSGADLSEANLSVANLSGAYLIGTNLSRARLNVARLSGANLTKA 75

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            LT+++L  A +  AD   A
Sbjct: 76  NLTKANLNVANLIRADLGGA 95



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 49/101 (48%), Gaps = 6/101 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+LR A     N   AN  +AD+  SD S S F  A  ++A    AN  GADLS 
Sbjct: 140 ANLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSG 199

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             +    L+ ANL+ A      L+ + L GA + GAD + A
Sbjct: 200 ANLRWTDLSGANLSWA-----NLSEAKLSGANLTGADLTHA 235



 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A +  ++ S ++ N A L  A   KAN T A+L+   + R  L  A LT A ++R
Sbjct: 45  ANLSGAYLIGTNLSRARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQLTQAAMIR 104

Query: 190 TVLTRSDLGGAII-----EGADFSDAVIDLAQKQ 218
             L R+ L GA +      GAD  +A +  A+ Q
Sbjct: 105 AELIRAKLSGATLTEANLSGADLREAALRDAKLQ 138



 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 47/101 (46%), Gaps = 6/101 (5%)

Query: 111 AQFGSADLRKAVHVK-ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G A L +A  ++ E  RA  + A + E++ SG+    A L  A   +AN + A+L  
Sbjct: 90  ADLGGAQLTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRG 149

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             +    L  ANL  A L     +RSDL  +    A+F  A
Sbjct: 150 AFVTGANLEGANLNAADL-----SRSDLSNSNFRHAEFKQA 185


>gi|17228637|ref|NP_485185.1| hypothetical protein alr1142 [Nostoc sp. PCC 7120]
 gi|17130488|dbj|BAB73099.1| alr1142 [Nostoc sp. PCC 7120]
          Length = 521

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 59/116 (50%), Gaps = 7/116 (6%)

Query: 115 SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           SA+LR A   + NFR      A+ + A++R +D SG   + A L  A    AN  GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 223
           +  +    L  ANL  A L+R     +DL  AI+ GA  +S +   L  +  +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289



 Score = 44.3 bits (103), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 9/101 (8%)

Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN + +++ E+DFS +K N     GA L  A+   ++   A+L    + R  L  A+L  
Sbjct: 45  ANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRADLSRAQLRGASLVR 104

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           A L+R  L+R DL  A +  AD  +A +    + A  ++AN
Sbjct: 105 AELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141



 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A   SADLR+A             A++R ++ +G+   GA L  A    AN  G+DLS
Sbjct: 118 SEANLNSADLREAT---------LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS 168

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                R  L  ANL +A L +     ++L GA + GA+ 
Sbjct: 169 -----RCDLTSANLRDAELKQVNFRHANLSGADLSGANL 202



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 26/121 (21%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 185
           NF+  D+ E++ SG K  G    +A    AN +G++LS+       LN A     NLTNA
Sbjct: 16  NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75

Query: 186 V----------LVRTVLTRSDLGGAIIEGA----------DFSDAVIDLAQ-KQALCKYA 224
           +          L+R  L+R+ L GA +  A          D S+A ++ A  ++A  ++A
Sbjct: 76  IFNHSSLNVANLIRADLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135

Query: 225 N 225
           N
Sbjct: 136 N 136


>gi|90419937|ref|ZP_01227846.1| conserved hypothetical protein with pentapeptide repeats
           [Aurantimonas manganoxydans SI85-9A1]
 gi|90335978|gb|EAS49726.1| conserved hypothetical protein with pentapeptide repeats
           [Aurantimonas manganoxydans SI85-9A1]
          Length = 292

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 57/108 (52%), Gaps = 6/108 (5%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAY-LEKAVAYKANFTGADLSD 169
           A F  ADL  A    +  RA+F  A+M+ +DFS    N +  L + V   A+ TGADLS 
Sbjct: 168 ATFDGADLSAARIAGDFSRASFVRANMKGADFSADMRNQSMGLMRGVLNSADLTGADLSG 227

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVI 212
             + R     A+ T+A L    LTR     ++  G ++EGADF+DA +
Sbjct: 228 ANLSRAAAEFADFTDADLSGADLTRFEASGANFNGTMVEGADFADAEL 275



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL-- 187
           A+ TSA +  +D S ++  GA L++A    ANFTGADLS   + +  + +A    A L  
Sbjct: 118 ADLTSAYLNGTDLSNARLAGAKLDQAWGLGANFTGADLSGASLFQSQMQDATFDGADLSA 177

Query: 188 --VRTVLTRSDLGGAIIEGADFS 208
             +    +R+    A ++GADFS
Sbjct: 178 ARIAGDFSRASFVRANMKGADFS 200


>gi|428770507|ref|YP_007162297.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428684786|gb|AFZ54253.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 355

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 59/114 (51%), Gaps = 10/114 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S+A F  A+LR A         + T  D+ E++   +K NG  L  A    AN T A+L+
Sbjct: 245 SSANFQDANLRGA---------DLTDVDLSEANLQNTKLNGVDLSGAYLEGANLTNANLT 295

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 221
           +  +    L  ANLTNA L  T L  + LG  I++GA F++ + ++  +KQ L 
Sbjct: 296 NASLALSNLIGANLTNANLTNTNLQNTSLGQTIVKGAIFANNLGLNEEKKQELI 349


>gi|392410624|ref|YP_006447231.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
 gi|390623760|gb|AFM24967.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
          Length = 285

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 1/102 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L+KA     +F RA+ + AD+  +D SG+  +GA L  A   + + +  DL
Sbjct: 161 SGADLFGAKLKKAALSAVDFSRADLSGADLSGADLSGAILSGARLNGANLSRVDLSFTDL 220

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           S   +    L+ ANLT A L  + L+ +DL GA ++GAD +D
Sbjct: 221 SGAHLSGANLSAANLTGAYLPGSDLSGADLSGANLQGADITD 262



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 10/83 (12%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + A++ ++D S +  +GA L KA+   A+ +GADL            A L  A L  
Sbjct: 128 ADLSKANLSQADLSRAILSGANLSKALLPFADLSGADLF----------GAKLKKAALSA 177

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
              +R+DL GA + GAD S A++
Sbjct: 178 VDFSRADLSGADLSGADLSGAIL 200


>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
 gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
          Length = 427

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 65/147 (44%), Gaps = 25/147 (17%)

Query: 94  LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
           LN Y    R +   G  + AQ    DLR+A+    +FR A F  A++ E+  +GS+   A
Sbjct: 23  LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82

Query: 151 YLEKAVAYKANFTGADL------SDTLMDR----------------MVLNEANLTNAVLV 188
            L  A   K +F GADL      S  + D                   L+ A+L +   V
Sbjct: 83  DLSGAKLVKTDFRGADLEQAKLTSSDITDADFRATTIGAPAGSDIATKLDGADLDHVKAV 142

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
           RT LTR+ L GA   GA F  A +D A
Sbjct: 143 RTNLTRASLMGATARGAHFDGASLDRA 169



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 9/108 (8%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A +   ADL    HVK   R N T A +  +   G+ F+GA L++A    AN   A    
Sbjct: 128 ATKLDGADLD---HVKA-VRTNLTRASLMGATARGAHFDGASLDRANFKGANLEHATFVS 183

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVI 212
           + +    L E N  +A L  T LT +DL      GA + GAD +D VI
Sbjct: 184 SSLRGANLQEVNFADATLSNTDLTGADLRSCHLDGADMSGADLTDCVI 231



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 6/98 (6%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLM 172
           +RK  H   N+      ADMR    +G++ NG  L +A+   A+     F GA+LS+  +
Sbjct: 16  IRKHGHFLNNYPGG-QRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATL 74

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
               L  A+L+ A LV+T    +DL  A +  +D +DA
Sbjct: 75  AGSQLRVADLSGAKLVKTDFRGADLEQAKLTSSDITDA 112


>gi|282896932|ref|ZP_06304938.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
 gi|281198341|gb|EFA73231.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
          Length = 689

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 58/105 (55%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQ   ADL  A   + +   +  + +++ ++++ G+  + +YL  A    ANF+ A+L
Sbjct: 536 SGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQGADLSESYLNHANLNSANFSAANL 595

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S       +L  AN+TNA L    ++R+DL GA +EG DF  A++
Sbjct: 596 SGA-----ILRSANMTNANLRNADISRADLRGANLEGTDFQGAIL 635



 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 53/117 (45%), Gaps = 24/117 (20%)

Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM-- 175
             SA++ ++ F  S+F                A L KA    +N + A+LS  LM R+  
Sbjct: 431 LKSANLNQASFKSSRFRSVGEDGRWDTYDDIIADLSKAQLKGSNLSSANLSRVLMSRVDL 490

Query: 176 ---VLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
              VLN ANL N+ L+     R  L  SDL  AI++ A  + A I  AQ Q    YA
Sbjct: 491 SFSVLNRANLANSKLIGANLSRAQLVGSDLQQAILQDAILTGADISGAQLQEADLYA 547



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 65/131 (49%), Gaps = 15/131 (11%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSG 144
           +ADL+K  A+ +G       +   SA+L + +  + +       RAN  ++ +  ++ S 
Sbjct: 462 IADLSK--AQLKG-------SNLSSANLSRVLMSRVDLSFSVLNRANLANSKLIGANLSR 512

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           ++  G+ L++A+   A  TGAD+S   +    L  A L     + + L+ S+L     +G
Sbjct: 513 AQLVGSDLQQAILQDAILTGADISGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQG 572

Query: 205 ADFSDAVIDLA 215
           AD S++ ++ A
Sbjct: 573 ADLSESYLNHA 583


>gi|220907270|ref|YP_002482581.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219863881|gb|ACL44220.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 369

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 44/88 (50%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN    D+   + S +  + A L  A  +K NF GA+L    + R  L +ANLTNA L  
Sbjct: 260 ANLAEKDLAGRNLSNANLSSANLSDAFLHKTNFHGANLFRANLFRANLLQANLTNANLRE 319

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQK 217
           T L  +DL GA + GAD   A I    K
Sbjct: 320 TNLIGADLSGADLRGADLRGAKIGFDNK 347


>gi|434393337|ref|YP_007128284.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428265178|gb|AFZ31124.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 213

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 6/102 (5%)

Query: 107 IGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           I +   F   DL +A   K N R       NFT A + ++D SGS  +   L +A    A
Sbjct: 105 IATQVGFLETDLERANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNA 164

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           N +GA+LS+  + R  L  ANL  A L    L+R+ L GAI+
Sbjct: 165 NLSGANLSNADLSRADLRNANLIGANLDGANLSRAKLEGAIM 206



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 48/85 (56%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN    ++R+ D S + F  A LEKA    +N +  +LS   +    L+ ANL+NA L 
Sbjct: 118 RANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNANLSGANLSNADLS 177

Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
           R  L  ++L GA ++GA+ S A ++
Sbjct: 178 RADLRNANLIGANLDGANLSRAKLE 202


>gi|75911045|ref|YP_325341.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704770|gb|ABA24446.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 973

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 47/96 (48%), Gaps = 1/96 (1%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADL  A     + R A    AD+  +D SG+  NGAYL  A    A  + ADLS   + 
Sbjct: 841 SADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRADLR 900

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
              L  ANL +A L+   L  +DL GA +  A+  D
Sbjct: 901 SADLRSANLISADLISADLISADLNGADLSHANLGD 936



 Score = 47.4 bits (111), Expect = 0.008,   Method: Composition-based stats.
 Identities = 32/86 (37%), Positives = 43/86 (50%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           A    AD+R++     D SG+  +GAYL  A    A   GA LS   + R  L  A+L +
Sbjct: 847 AYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRADLRSADLRS 906

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           A L+   L  +DL  A + GAD S A
Sbjct: 907 ANLISADLISADLISADLNGADLSHA 932



 Score = 46.2 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 2/92 (2%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           +R +D SG+   GA L  A    A+ +GADLS   ++   LN A L  A L    L+R+D
Sbjct: 839 LRSADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRAD 898

Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
           L  A +  A+   A  DL     +    NG +
Sbjct: 899 LRSADLRSANLISA--DLISADLISADLNGAD 928



 Score = 43.1 bits (100), Expect = 0.15,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 38/76 (50%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           AD+  +D SG+   G +L  A    A   GADL D  ++   L+ A+L+ A L    L  
Sbjct: 822 ADLSGADLSGAFLKGVFLRSADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNG 881

Query: 195 SDLGGAIIEGADFSDA 210
           + L GA +  AD S A
Sbjct: 882 AYLNGAYLSHADLSRA 897


>gi|428310592|ref|YP_007121569.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428252204|gb|AFZ18163.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 522

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 3/134 (2%)

Query: 94  LNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY 151
           L KY A  R   G+  A     +A+L  A   + N   AN + A++  ++ S +K N A 
Sbjct: 7   LKKYAAGDRDFSGLNLAEVNLSAANLSGANLSEVNLSVANLSGANLSGANLSRAKLNVAR 66

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
           L  A   KAN   A L+ T + R  L  ANLT A L+R  L R++L GA ++ A+ S A 
Sbjct: 67  LSGANISKANLIQASLNVTNLIRADLRRANLTQAALIRAELIRAELSGATLKEANLSGAD 126

Query: 212 I-DLAQKQALCKYA 224
           + + A +QA+   A
Sbjct: 127 LREAALRQAILSRA 140



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 67/147 (45%), Gaps = 17/147 (11%)

Query: 98  EAETRGEF---GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           EA  RG F    I        ADL +A     N R     AD+R+++ S +  +GA L +
Sbjct: 144 EANLRGAFLTASILEGTNLNKADLNRADLSDSNIR----EADLRQANLSFANLSGADLSR 199

Query: 155 AVAYKANFTGADLS-DTLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           A    A+ +GADL    L D  +         L+ ANL NA LV   LT++ L      G
Sbjct: 200 ANLRWADLSGADLRWANLSDAKLSGANLMGADLSHANLHNASLVHADLTQASLIKVDWIG 259

Query: 205 ADFSDAVIDLAQKQALCKYANGTNPIT 231
           AD S A +  A+  A+ ++   T  IT
Sbjct: 260 ADLSGATMTGAKLYAVSRFGLKTTGIT 286



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 20/116 (17%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----T 170
           ADLR         RAN T A +  ++   ++ +GA L++A     N +GADL +      
Sbjct: 90  ADLR---------RANLTQAALIRAELIRAELSGATLKEA-----NLSGADLREAALRQA 135

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
           ++ R  L+EANL  A L  ++L  ++L  A +  AD SD+ I  A  +QA   +AN
Sbjct: 136 ILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFAN 191



 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR+A   +    RA  + A++R +  + S   G  L KA   +A+ + +++ +
Sbjct: 120 ANLSGADLREAALRQAILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIRE 179

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + +  L+ ANL+ A L R  L  +DL GA +  A+ SDA
Sbjct: 180 ADLRQANLSFANLSGADLSRANLRWADLSGADLRWANLSDA 220


>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 447

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 11/101 (10%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    +DLR A  +  +  + N T AD+RE+D + +   GA L  A   +A+ TGA    
Sbjct: 330 ANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS--- 386

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                  LN+ NL  A L    LTR+DL GA + GAD  +A
Sbjct: 387 -------LNQVNLAEADLRGVDLTRADLRGANLSGADLREA 420



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 50/95 (52%), Gaps = 9/95 (9%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           ADLR A+          +SA++ ++D +G+  + A L KA    AN  G+DL    +   
Sbjct: 295 ADLRGAM---------LSSANLSQADMTGTDLSRANLRKAYLADANMKGSDLRGADLIGA 345

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            LN+ NLT A L    LTR+DL GA +  AD  +A
Sbjct: 346 SLNKVNLTQADLREADLTRADLRGANLRLADLREA 380



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 25/102 (24%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKA-----------VAYK---------ANFTGADLSDT 170
           NF + D+   D  G+   G+YL +A           + Y          AN +GADLSD 
Sbjct: 29  NFMTPDLSNKDLIGASLRGSYLREAKLSGANLSEAILCYADLIGADLKGANLSGADLSDA 88

Query: 171 LMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADF 207
            ++   L+E+NLT A     +LV T L+ +DL GA ++GA+ 
Sbjct: 89  NLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANL 130



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 58/122 (47%), Gaps = 12/122 (9%)

Query: 91  LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF 147
           LAD N   ++ RG   IG++        ADLR+A         + T AD+R ++   +  
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREA---------DLTRADLRGANLRLADL 377

Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             A L  A   + N   ADL    + R  L  ANL+ A L    LT+++L  A ++GA+ 
Sbjct: 378 READLTGASLNQVNLAEADLRGVDLTRADLRGANLSGADLREADLTKANLHWANLDGANL 437

Query: 208 SD 209
           +D
Sbjct: 438 TD 439



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 10/102 (9%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ ES+ +G+ F G+ L      +A+  GA+L    +    L EANL+ A L  
Sbjct: 88  ANLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANLIGAKLAEANLSGANLSG 147

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
           T L+ +DL G I++      AV DL       ++  G +P T
Sbjct: 148 TDLSEADLRGTILQ-----KAVYDLR-----TRFCEGLDPQT 179


>gi|300864770|ref|ZP_07109621.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
 gi|300337239|emb|CBN54769.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
          Length = 334

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A     DLR    ++ N  + N T AD+RE+D S +  N A L+ A    AN  GA L  
Sbjct: 230 ADLHDTDLRGGNLIQANLMKTNLTEADLREADLSHTNLNLANLKGADLSGANLQGAYLWA 289

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           T +D   L  A+L  A L   +++ +DL  AI+ GA   D  I
Sbjct: 290 TNLDGACLKGADLRGASLRNAIISGADLRDAILTGATMPDGKI 332



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 50/110 (45%), Gaps = 6/110 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S AQ   A+L   V      R      AN   AD+ ++D  G     A L K    +A+ 
Sbjct: 198 SGAQLSGANLSGTVLSGARMRFTKLEQANLKQADLHDTDLRGGNLIQANLMKTNLTEADL 257

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ADLS T ++   L  A+L+ A L    L  ++L GA ++GAD   A +
Sbjct: 258 READLSHTNLNLANLKGADLSGANLQGAYLWATNLDGACLKGADLRGASL 307



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 11/114 (9%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR-----ESDFSGSKFNGAY 151
            EA   G F  G+   F      K  H+     A+ T AD+R     + D +G++ +GA 
Sbjct: 58  LEANLNGAFLYGANLSFAKL---KGSHL---LGADLTKADLRGAQLAKVDLTGAQLSGAI 111

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           L     ++AN  G +L    +  + L  ANL  A L    LT + L GA ++GA
Sbjct: 112 LSWVSLFQANLPGVNLCGANLSGINLRSANLAGANLNWANLTGARLSGANLKGA 165


>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 184

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 47/80 (58%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F+  ++ +   + +K  GA L  A    A+  G DL+   +++  LN+ANL  A +++ 
Sbjct: 16  DFSHVNLVQVCLTNAKLVGARLNGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQA 75

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            LTR+DL GA + GAD +DA
Sbjct: 76  CLTRADLSGAYLAGADLTDA 95



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 43/79 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+  +D SG+   GA L KA   KA+ +GADL    +    L E +L++A L  
Sbjct: 90  ADLTDADLSGADLSGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDG 149

Query: 190 TVLTRSDLGGAIIEGADFS 208
             L  +DL GA +E   F+
Sbjct: 150 AYLGHADLTGADVERTRFN 168



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 36/68 (52%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   AD+R++D S +  +GA L  A    AN    DLSD  +D   L  A+LT A + R
Sbjct: 105 ANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDGAYLGHADLTGADVER 164

Query: 190 TVLTRSDL 197
           T   +S L
Sbjct: 165 TRFNQSQL 172



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 10/82 (12%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN   A+M ++  + +  +GAYL  A    A+ +GADLS           ANL  A L 
Sbjct: 64  QANLAGAEMIQACLTRADLSGAYLAGADLTDADLSGADLS----------GANLGGADLR 113

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
           +  L+++DL GA + GAD S A
Sbjct: 114 KADLSKADLSGADLRGADLSGA 135



 Score = 37.4 bits (85), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 39/81 (48%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A    A++  +D  G     A+L +A   +AN  GA++    + R  L+ A L  A L  
Sbjct: 35  ARLNGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQACLTRADLSGAYLAGADLTD 94

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ +DL GA + GAD   A
Sbjct: 95  ADLSGADLSGANLGGADLRKA 115


>gi|239909009|ref|YP_002955751.1| hypothetical protein DMR_43740 [Desulfovibrio magneticus RS-1]
 gi|239798876|dbj|BAH77865.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 972

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 34/90 (37%), Positives = 50/90 (55%), Gaps = 10/90 (11%)

Query: 129 RANFTSADMRESDFSGS-----KFNGAYLEKAVAYKA-----NFTGADLSDTLMDRMVLN 178
           + NF SA +RES+F+ +      F  A +EK+  +KA     NF  ADL++T      L 
Sbjct: 828 KTNFESASLRESNFTNAICNNANFKKARMEKSNLHKATLINTNFEKADLTNTNFSEASLE 887

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            ANL+N+ L    LTR++L  A + GA+ S
Sbjct: 888 GANLSNSKLKEANLTRANLCDANLVGANLS 917



 Score = 44.7 bits (104), Expect = 0.045,   Method: Composition-based stats.
 Identities = 23/82 (28%), Positives = 41/82 (50%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F ANF  +++ E +F+G+K +      A+  K NF  A L ++     + N AN   A +
Sbjct: 797 FNANFMFSNLSEVNFNGAKLDDVEFANAILNKTNFESASLRESNFTNAICNNANFKKARM 856

Query: 188 VRTVLTRSDLGGAIIEGADFSD 209
            ++ L ++ L     E AD ++
Sbjct: 857 EKSNLHKATLINTNFEKADLTN 878



 Score = 42.4 bits (98), Expect = 0.24,   Method: Composition-based stats.
 Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 7/119 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+   ++L KA  +  NF        NF+ A +  ++ S SK   A L +A    AN  G
Sbjct: 854 ARMEKSNLHKATLINTNFEKADLTNTNFSEASLEGANLSNSKLKEANLTRANLCDANLVG 913

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCK 222
           A+LS + + +   N+ANL NA L+      S   GA ++ A F D V IDL   Q  C+
Sbjct: 914 ANLSGSDLSKANFNKANLANANLLNCKFNFSKFLGANLDNAKFDDDVDIDLLTNQKRCQ 972


>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
           [Acaryochloris marina MBIC11017]
 gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
           marina MBIC11017]
          Length = 532

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 21/116 (18%)

Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +F + DLR A+ +  NF RANFT A++R ++        AY+  A    A+  GA+LSD 
Sbjct: 429 KFQNTDLRDAILINANFGRANFTGANLRNANLMQ-----AYMSHADLANADLRGANLSDA 483

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
                 L+ ANL  A          +L GA + GA  S++ +  AQ   L  Y NG
Sbjct: 484 -----YLSHANLRGA----------NLCGADLSGAKLSESQLSFAQTNWLTVYPNG 524



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 21/104 (20%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F   DLR       N R     SA+  E  F  +    A L  A   +ANFTGA      
Sbjct: 405 FSGQDLRNL-----NLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGA------ 453

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                    NL NA L++  ++ +DL  A + GA+ SDA +  A
Sbjct: 454 ---------NLRNANLMQAYMSHADLANADLRGANLSDAYLSHA 488


>gi|254489813|ref|ZP_05103008.1| Pentapeptide repeat protein [Methylophaga thiooxidans DMS010]
 gi|224464898|gb|EEF81152.1| Pentapeptide repeat protein [Methylophaga thiooxydans DMS010]
          Length = 154

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 63/137 (45%), Gaps = 21/137 (15%)

Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSG-----SKFNGAYLEKAV------ 156
           GSAA F    + + +  ++    N + AD+   DFSG     S  NG  L +A       
Sbjct: 15  GSAAAFEQIYVDRLLETRQCHHCNLSEADLSGKDFSGADMSESILNGINLSQATLVGVWF 74

Query: 157 ----AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                  AN  GAD S++LMD  +LN ANL  A L  + L  +DL       AD + A +
Sbjct: 75  THSKMQGANLEGADASNSLMDYALLNGANLKGANLNGSQLIFADL-----TDADLTGASV 129

Query: 213 DLAQKQALCKYANGTNP 229
           D AQ + +  Y N T P
Sbjct: 130 DNAQMRGVL-YCNTTMP 145


>gi|452964739|gb|EME69773.1| serine/threonine protein kinase [Magnetospirillum sp. SO-1]
          Length = 137

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 46/89 (51%), Gaps = 15/89 (16%)

Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLNEAN-----LTN 184
           SDFSGS  N A L +AV   ANF GA          DL++    R VLN AN     L  
Sbjct: 8   SDFSGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKG 67

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A+L   V+  +DL  A +EGAD   A+I+
Sbjct: 68  AILAGAVMNNADLSCATLEGADLRGAIIN 96



 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 59/111 (53%), Gaps = 2/111 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S +   +ADLR+AV +  NF  A    A + ++D + ++F  + L  A  + A   GA L
Sbjct: 11  SGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKGAIL 70

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           +  +M+   L+ A L  A L   ++  +DL GA + GAD + A ++L + Q
Sbjct: 71  AGAVMNNADLSCATLEGADLRGAIINNADLSGADLRGADLTGA-LNLTRDQ 120


>gi|16331795|ref|NP_442523.1| hypothetical protein slr0516 [Synechocystis sp. PCC 6803]
 gi|383323538|ref|YP_005384392.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383326707|ref|YP_005387561.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383492591|ref|YP_005410268.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437859|ref|YP_005652584.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
 gi|451815947|ref|YP_007452399.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
 gi|6226382|sp|Q55837.1|Y516_SYNY3 RecName: Full=Uncharacterized protein slr0516
 gi|1001755|dbj|BAA10593.1| slr0516 [Synechocystis sp. PCC 6803]
 gi|339274892|dbj|BAK51379.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
 gi|359272858|dbj|BAL30377.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359276028|dbj|BAL33546.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359279198|dbj|BAL36715.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407960570|dbj|BAM53810.1| hypothetical protein BEST7613_4879 [Bacillus subtilis BEST7613]
 gi|451781916|gb|AGF52885.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
          Length = 166

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 5/83 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN-----LTNA 185
           N  +A +  SD SG+  +G  L +A+  +AN TGA+LS+T +    L EAN     L+ A
Sbjct: 54  NLENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSETDLTEAALTEANLAGADLSGA 113

Query: 186 VLVRTVLTRSDLGGAIIEGADFS 208
            L R+ L   DL GA ++GA+ +
Sbjct: 114 NLERSFLRDVDLTGANLKGANLA 136



 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 10/83 (12%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N   AD+RE       FN   LE A   +++ +GA+LS   + R +L+ ANLT A L  T
Sbjct: 44  NLAGADLRE-------FN---LENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSET 93

Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
            LT + L  A + GAD S A ++
Sbjct: 94  DLTEAALTEANLAGADLSGANLE 116



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 15/120 (12%)

Query: 92  ADLNKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
           ADL ++  E  R      S A     +LR+A+      RAN T A++ E+D +       
Sbjct: 48  ADLREFNLENARLNRSDLSGANLSGVNLRRALL----DRANLTGANLSETDLT------- 96

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              +A   +AN  GADLS   ++R  L + +LT A L    L  ++L  A +   D  +A
Sbjct: 97  ---EAALTEANLAGADLSGANLERSFLRDVDLTGANLKGANLAWANLTAANLTDVDLEEA 153


>gi|381205231|ref|ZP_09912302.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 236

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 6/107 (5%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AQ   ADL  A +   + F AN   A+++ ++ +G+    A L  A  YKAN  GADL  
Sbjct: 99  AQLVGADLEGADLDRADLFEANLEIANLQWANLAGASLENANLGLANLYKANLQGADLRG 158

Query: 170 TLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAV 211
             +   +L EANL+N     A L+   L+R++L GA ++GA   +A+
Sbjct: 159 ANLTGAMLGEANLSNANLEGARLMVVNLSRANLKGANLKGAKIHEAI 205


>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 739

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 4/111 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S+AQ  +AD R+A+   EN  A+ T A++ E+ FS S  +GA L K  A +++F+ ADLS
Sbjct: 561 SSAQLINADFRRAI--LEN--ASLTGANLGEAKFSLSSLHGARLGKVSAVRSDFSSADLS 616

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
            +      L+ ANL+NA L       + L GA +  A   +A +  A   A
Sbjct: 617 QSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKLRYANLSA 667



 Score = 37.0 bits (84), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 14/111 (12%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           +G G    FG+ D         ++ ++F+ AD+R  + +G     A L+  +  + N  G
Sbjct: 497 YGPGEDQHFGTFD---------DWVSDFSGADLRAVNLTG-----AILDNVLMNRTNLIG 542

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           A L+        L  ANL++A L+     R+ L  A + GA+  +A   L+
Sbjct: 543 ATLNRARFYNSSLIGANLSSAQLINADFRRAILENASLTGANLGEAKFSLS 593


>gi|158341150|ref|YP_001522487.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158311391|gb|ABW33002.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 150

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN T+A +  + F G+ F  A L+ A    AN +GA+L +  +   +L  ANLT A L 
Sbjct: 22  KANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTGADLR 81

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
            ++L R+ L  AI++GA+  DA
Sbjct: 82  SSILCRAVLTDAILQGANLRDA 103



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 6/110 (5%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A F  A+L  A+     F      +AN  +A +  ++ SG+    A L  A+   AN TG
Sbjct: 18  ASFAKANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTG 77

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           ADL  +++ R VL +A L  A L    L  +D   A + GAD S A ++L
Sbjct: 78  ADLRSSILCRAVLTDAILQGANLRDADLRETDFKNADLTGADLSGAKVNL 127


>gi|154251684|ref|YP_001412508.1| pentapeptide repeat-containing protein [Parvibaculum
           lavamentivorans DS-1]
 gi|154155634|gb|ABS62851.1| pentapeptide repeat protein [Parvibaculum lavamentivorans DS-1]
          Length = 363

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+FT  D+   DFS +   GA+  +A+   ANF          ++ +L  A+ +NA+L 
Sbjct: 273 RADFTRMDLSRKDFSRAVLAGAHFREAILADANF----------EKAILAAADFSNAILF 322

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
           R  L  +DL GA + GAD  +A  D  +K  L
Sbjct: 323 RANLAGADLRGADLRGADLKNARQDDTKKGEL 354



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 37/77 (48%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M+E D SG  F             +FTG DL D       L  AN  +A L RT  +R+D
Sbjct: 62  MKECDLSGLDFRNLNFSHGHFIGCDFTGCDLEDAHFSGANLFSANFDHANLTRTNFSRAD 121

Query: 197 LGGAIIEGADFSDAVID 213
           L GA  E A+ +DA +D
Sbjct: 122 LRGANFEDAEMADAQLD 138



 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 46/104 (44%), Gaps = 17/104 (16%)

Query: 111 AQFGSADLRKAVHVK---------EN--FRA------NFTSADMRESDFSGSKFNGAYLE 153
           AQ   ADLR+   ++         EN  FR       N     + ++DF G+  +GA L+
Sbjct: 135 AQLDGADLRRGAVIRRGASAPVGRENSSFRGARMYGTNMAECKLLDADFEGASISGASLQ 194

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
            A    ANF GA+L    +    L +A+   AV+    + R D+
Sbjct: 195 GADLRGANFAGAELKGVELSGANLADADFRRAVMDEATIARGDM 238


>gi|434407898|ref|YP_007150783.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428262153|gb|AFZ28103.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 182

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F  A +  +D SG+K  GA  E  +  +AN  GADLS+       L  + LT A LVR
Sbjct: 90  ADFRGAQLNHADLSGAKLCGANFEGCLMVRANLAGADLSNA-----SLAGSALTGANLVR 144

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
              +++DL  A++ GA+  DAV D
Sbjct: 145 ANFSQADLTNAVLFGAETEDAVFD 168


>gi|434398906|ref|YP_007132910.1| heat shock protein DnaJ domain protein [Stanieria cyanosphaera PCC
           7437]
 gi|428270003|gb|AFZ35944.1| heat shock protein DnaJ domain protein [Stanieria cyanosphaera PCC
           7437]
          Length = 272

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 15/85 (17%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + + A+++E DFSG   +G          AN  GADLSD+ + ++ L EANL  A L R 
Sbjct: 159 DLSRANLKEKDFSGRNLSG----------ANLQGADLSDSFLHKVNLEEANLQEANLFRA 208

Query: 191 VLTRSDLGGAIIE-----GADFSDA 210
            L +++L  A +      GADFS A
Sbjct: 209 NLLKANLRKANLRDTNLIGADFSGA 233


>gi|428200510|ref|YP_007079099.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427977942|gb|AFY75542.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 174

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 4/98 (4%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    ADLR+A  +     AN + AD++E++ SG+  + A L  AV  KAN +GA L   
Sbjct: 60  ASLDRADLREACLIV----ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRGA 115

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           ++  + L E+NL  A L    L  +DL GA +  AD S
Sbjct: 116 ILAGVNLAESNLRGANLQGANLYGADLRGADLRNADLS 153



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 5/101 (4%)

Query: 128 FRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
            RAN + A +R ++ SG+       + A L +A    AN +GADL +  +    L+EA L
Sbjct: 38  IRANLSGALLRGANLSGAFLVVASLDRADLREACLIVANLSGADLKEANLSGANLSEAVL 97

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
           T AVL +  L+ + L GAI+ G + +++ +  A  Q    Y
Sbjct: 98  TGAVLQKANLSGAKLRGAILAGVNLAESNLRGANLQGANLY 138



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 45/82 (54%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F+  D+   D + +K +GA L +A    A   GA+LS   +    L+ A+L  A L+  
Sbjct: 16  DFSRIDLHGVDLAQAKLSGANLIRANLSGALLRGANLSGAFLVVASLDRADLREACLIVA 75

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L+ +DL  A + GA+ S+AV+
Sbjct: 76  NLSGADLKEANLSGANLSEAVL 97


>gi|119485665|ref|ZP_01619940.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
 gi|119456990|gb|EAW38117.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
          Length = 433

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 60/111 (54%), Gaps = 8/111 (7%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA------VAYKAN 161
           S A F  A+LR+A   K N   A+ + A + ++D  G K  GA L  A      + Y AN
Sbjct: 116 SGANFRDANLREAYLWKANLSNADLSDAYLEKADLRGVKLEGADLGYAMLKGANLGY-AN 174

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           F  A L++T +    L +ANL  A LV   L ++DL GA +EGA+ S+A +
Sbjct: 175 FVRARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKL 225



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 55/103 (53%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+  + DL  A   + N R A+   A+++++D  G+K  GA L  A   +AN   A    
Sbjct: 178 ARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVG 237

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++   L++A+L  A L +T +TR+DLG A ++ A   DA +
Sbjct: 238 ANLENANLHQASLKGANLAKTQMTRADLGFANLQKASLGDAQL 280



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 62/137 (45%), Gaps = 14/137 (10%)

Query: 91  LADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFR-----------ANFTSAD 136
           L D N  +A+ RG   E    S A+   A+L  A+ V  N             AN     
Sbjct: 200 LVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVGANLENANLHQASLKGANLAKTQ 259

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           M  +D   +    A L  A   +AN   ADL++  +    L +ANL NA+L +  L  + 
Sbjct: 260 MTRADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQ 319

Query: 197 LGGAIIEGADFSDAVID 213
           L GA +E A+ +DA+++
Sbjct: 320 LKGANLEDANLTDAILE 336



 Score = 44.7 bits (104), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 58/111 (52%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYK---------- 159
           A  G A+L+KA        +AN  SAD+ E+    +K   A L  A+  K          
Sbjct: 263 ADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQLKG 322

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN   A+L+D +++ ++L +ANL +A L    L +++L GA ++ A+ ++A
Sbjct: 323 ANLEDANLTDAILEGVILEDANLEDANLEGAKLEQANLIGAYLKDANLTEA 373



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 7/85 (8%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++  +    +   GAYL+ A   +AN  GADL         L +ANL NA L  
Sbjct: 343 ANLEDANLEGAKLEQANLIGAYLKDANLTEANLQGADLRGA-----NLTKANLRNAYLQG 397

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDL 214
             L  ++L GA ++GA+  D  +DL
Sbjct: 398 ANLRGANLKGASLKGANLRD--VDL 420


>gi|319791261|ref|YP_004152901.1| hypothetical protein Varpa_0569 [Variovorax paradoxus EPS]
 gi|315593724|gb|ADU34790.1| Protein of unknown function DUF2169 [Variovorax paradoxus EPS]
          Length = 865

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 33/80 (41%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + T AD    D  G  F GA+LE A    AN +GA+LS       VL  ANL  A+ V T
Sbjct: 550 DLTGADFSGLDLRGVNFTGAWLESANFENANLSGANLS-----HAVLAHANLRGAIAVET 604

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L  ++LGGA +  A   DA
Sbjct: 605 SLVGANLGGARLASAVLEDA 624



 Score = 39.3 bits (90), Expect = 2.0,   Method: Composition-based stats.
 Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 10/82 (12%)

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT----------VLT 193
           G    GA L  A    ANF G DLS   +   +L+ ANL    L R+          +L 
Sbjct: 734 GCSLVGADLGHAAMGSANFGGMDLSQVSLVGSMLDGANLIGTRLARSDWRLASAKGVLLC 793

Query: 194 RSDLGGAIIEGADFSDAVIDLA 215
           ++DL  A + GA+FS+AV+  A
Sbjct: 794 KADLAHARMAGANFSNAVLQHA 815


>gi|163797086|ref|ZP_02191041.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
 gi|159177602|gb|EDP62155.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
          Length = 421

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 14/116 (12%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  ADLR +V    +  +A F++A + + DF+G+K  GA L  A    A    ADL+D
Sbjct: 51  ALFAGADLRGSVFAGGHLEQAQFSTARLEQVDFAGAKLMGANLRGANLKGAKLMAADLTD 110

Query: 170 --------TLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     M+R +     L++A+L+NA  VRT L+ +++   I  G  F  AV+
Sbjct: 111 ADLRPAKIVDMNRTIEQSANLHKADLSNAQFVRTNLSGANMSAIIAVGTAFQSAVL 166


>gi|86610069|ref|YP_478831.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86558611|gb|ABD03568.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 160

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 5/87 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N   AD+R +D S +   GA L  A  ++AN  GADLS   +    L+ A L  A L R 
Sbjct: 67  NLQEADLRGADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHGAYLWEAKLTRA 126

Query: 191 VLTRSDL-----GGAIIEGADFSDAVI 212
            L  SDL     GGA++ GAD   A++
Sbjct: 127 QLQGSDLSGAKIGGAVLTGADLRGAIL 153



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 8/98 (8%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNG 149
           L+ +N  EA+ RG       A   SA+L  A     N + AN   AD+  +D   +  +G
Sbjct: 63  LSGINLQEADLRG-------ADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHG 115

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           AYL +A   +A   G+DLS   +   VL  A+L  A+L
Sbjct: 116 AYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLRGAIL 153


>gi|434399306|ref|YP_007133310.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428270403|gb|AFZ36344.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 298

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/143 (33%), Positives = 68/143 (47%), Gaps = 18/143 (12%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA----VHVKEN--FR 129
           L+ A +   + N + L + N Y+AE  G F       F  A+L K     VH  +   F 
Sbjct: 151 LSEANLVEANLNQAELINANLYDAELIGAF-------FYQANLTKVNAIKVHASKTYCFA 203

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++++SDF  S    A L  A    AN  GA+LS     +  L  ANL  A    
Sbjct: 204 ANLSEANLKKSDFRWSNLTYANLRDANLIGANLRGANLS-----QADLKGANLEGANFKG 258

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             LT++DL GA  +GA+  DA+ 
Sbjct: 259 ANLTKADLRGANFKGANLQDAIF 281



 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADLR+A  +  +    N  +AD RE++   +    A L   +  K N   A+L+ 
Sbjct: 84  ANLSNADLRQAYLIDADLTEINAIAADFREANCRCANLKEANLIGTLMRKVNLQQANLTA 143

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + R  L+EANL  A L +  L  ++L  A + GA F  A
Sbjct: 144 VKLHRSNLSEANLVEANLNQAELINANLYDAELIGAFFYQA 184


>gi|434407711|ref|YP_007150596.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428261966|gb|AFZ27916.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 268

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 53/101 (52%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G++ L +      N  A N  +AD+ E+    ++  GAYL K   YKAN T A LS 
Sbjct: 144 ADLGTSKLHRTNLCFANLIAVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSG 203

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + R  L EA+L+NA L  T L  ++L GA + GA+   A
Sbjct: 204 AYLLRANLTEADLSNADLSWTNLRGANLTGANLRGANLRGA 244



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 43/97 (44%), Gaps = 14/97 (14%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A  G ADL  A      + A    A++  ++ S +   GA L +A    AN  G+DLS  
Sbjct: 64  ANLGGADLTGA----NLYNAKLIEANLSAANLSAANLRGATLTQADMNCANLIGSDLS-- 117

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                   EANL  AV+    L  +DL GA +  AD 
Sbjct: 118 --------EANLKGAVITDANLIGADLRGANLRDADL 146



 Score = 37.0 bits (84), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 6/104 (5%)

Query: 110 AAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           A    +ADL +A +H  E   A     D+ +++ + +  +GAYL      +AN T ADLS
Sbjct: 163 AVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSGAYL-----LRANLTEADLS 217

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +  +    L  ANLT A L    L  ++L GA +   +  + ++
Sbjct: 218 NADLSWTNLRGANLTGANLRGANLRGANLTGANLSSVNLHETIM 261


>gi|443310213|ref|ZP_21039874.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
 gi|442779757|gb|ELR89989.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
          Length = 253

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   SA+L +A  ++ N   AN T A +  +D S +    A L  A+ YKA    A+L+D
Sbjct: 139 ANLKSANLSEAKLIRANLNEANLTEAHLNYADLSHANLGSASLVGAILYKAELRQANLND 198

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + +  L +ANL+ A L+   L  ++L GA + GA+ + A
Sbjct: 199 AYLHKAYLFDANLSQARLINADLRWANLRGANLRGANLTGA 239



 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 40/76 (52%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   AD++ ++ S +      L  A    AN   A+LS+  + R  LNEANLT A L  
Sbjct: 109 ANLIGADLQGANLSNADLENVNLIGANLQNANLKSANLSEAKLIRANLNEANLTEAHLNY 168

Query: 190 TVLTRSDLGGAIIEGA 205
             L+ ++LG A + GA
Sbjct: 169 ADLSHANLGSASLVGA 184


>gi|436670209|ref|YP_007317948.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428262481|gb|AFZ28430.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 309

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 57/120 (47%), Gaps = 8/120 (6%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVA 157
           +T  E  I S A     DL K+  + E    RA+ T AD+ E+D   +    A L +   
Sbjct: 162 QTNWEGAILSQASLQRVDLEKS-QLNETILRRADLTEADLVEADLRYADLTEAILCRVAL 220

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 212
             AN  GADLS   + R  L  A+L  AVL  T L  +DL  A      + G+DFSD+ +
Sbjct: 221 ELANLVGADLSRATLKRASLFRADLEGAVLQDTNLVETDLRYANFKDTQLMGSDFSDSRV 280



 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 5/84 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN     ++E+D +G+ F+ A L      + N+ GA LS   + R+ L ++ L   +L 
Sbjct: 137 RANLFKVSLKEADCTGANFDEANLR-----QTNWEGAILSQASLQRVDLEKSQLNETILR 191

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
           R  LT +DL  A +  AD ++A++
Sbjct: 192 RADLTEADLVEADLRYADLTEAIL 215



 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 49/85 (57%), Gaps = 5/85 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R +    +++ SDFS +  N A L    +Y AN +G  L  T ++R  L +ANLT A L+
Sbjct: 27  RVDLKGTNLKSSDFSHANLNSADL----SY-ANLSGTSLIWTDLNRANLRQANLTQACLL 81

Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
           R+ L  +DL  A +  A+ S+A+++
Sbjct: 82  RSSLFWADLQEATLVNANLSNALLN 106



 Score = 40.4 bits (93), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 44/85 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            AN  SAD+  ++ SG+      L +A   +AN T A L  + +    L EA L NA L 
Sbjct: 42  HANLNSADLSYANLSGTSLIWTDLNRANLRQANLTQACLLRSSLFWADLQEATLVNANLS 101

Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
             +L   +L  A ++GAD S+A ++
Sbjct: 102 NALLNHVNLTSACLKGADLSEASLE 126


>gi|158338487|ref|YP_001519664.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158308728|gb|ABW30345.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 464

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 66/129 (51%), Gaps = 21/129 (16%)

Query: 111 AQFGSADLR----KAVHVKEN------------FRANFTSADMRESDFSGSKFNGAYLEK 154
           A+ G ADLR    K  ++KE              RA+   AD+RE++ S ++   + LEK
Sbjct: 36  AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95

Query: 155 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
                A+ ++AN + A L+ + ++   L +ANL+ A L    L R++LG A +  A+ + 
Sbjct: 96  SQLGAAILFRANLSQAQLTLSNLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155

Query: 210 AVIDLAQKQ 218
           A +  A+ Q
Sbjct: 156 ANLSQARLQ 164



 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT---- 183
           FRAN + A +  S+   ++   A L +A   +AN   A+L    +++  L  ANL+    
Sbjct: 104 FRANLSQAQLTLSNLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163

Query: 184 -NAVLVRTVLTRSDLGGAIIEGADF 207
            NA LV T L  ++L GA ++GA+ 
Sbjct: 164 QNASLVGTQLINANLEGASLKGANL 188



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A    AD+R ++       GA L++A    A   GADL    + +  L EANL++A L 
Sbjct: 35  KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
            + L +S LG AI+  A+ S A + L+
Sbjct: 90  LSNLEKSQLGAAILFRANLSQAQLTLS 116



 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 41/74 (55%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + +F  A + +++ + S  +GA L +A  ++A+ TGA L    +    L EANL NA + 
Sbjct: 382 QVDFFRAQLPQANLAQSILDGANLTEANLFRADLTGASLKAATLKNANLAEANLENANIE 441

Query: 189 RTVLTRSDLGGAII 202
            T L  + L GAI+
Sbjct: 442 GTNLDDAYLCGAIM 455


>gi|390438023|ref|ZP_10226524.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
 gi|389838556|emb|CCI30648.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
          Length = 275

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 69/139 (49%), Gaps = 19/139 (13%)

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           A+  K N   A   +A++R +D SG+   GAYL  A    AN   A LS   + R  L  
Sbjct: 135 AIGPKANLTGAYLNNANLRFADLSGANLRGAYLSGADLTGANLAAAALSGANLQRASLTG 194

Query: 180 ANLTNAVLVRTVLTRSDLGGAI-----------IEGADFS--DAVIDLAQKQALCKYAN- 225
           A L +A LV   L  +DL GA            +EGADFS  + + DL ++  LC  ++ 
Sbjct: 195 AFLRDARLVGVELQFADLRGADLTGAILEQIQNLEGADFSQVEGLSDL-ERSYLCGRSSR 253

Query: 226 --GT-NPITGVSTRKSLGC 241
             GT NP T  +T +SLGC
Sbjct: 254 ELGTWNPYTRSNTGQSLGC 272



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 1/111 (0%)

Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           D+RKA    ++  A N    D+ + D   + F GA L  A    AN TGA+L    + R 
Sbjct: 17  DVRKARDKGQSLSAANLEGIDLSQMDLKNADFTGAILLGADLAGANLTGANLEAADLRRA 76

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
            L  ++L  A L  T+L R+ L GA ++GAD + A I L+       +  G
Sbjct: 77  NLRGSDLRGANLRDTLLYRAILCGANLQGADLTGAKISLSVYDGTTSWPEG 127


>gi|167921391|ref|ZP_02508482.1| pentapeptide repeat protein [Burkholderia pseudomallei BCC215]
          Length = 825

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
            C+ +  A A L+   A  R E  + SAA  G     +++ V     A+ T AD+   D 
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
            G++  GA LE A    A+ TGADLS     R VL  A+LT A LV   LT ++L  A  
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579

Query: 203 EGADFS 208
           E  DFS
Sbjct: 580 ERTDFS 585


>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 328

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 4/119 (3%)

Query: 98  EAETRGEFGIGS---AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE 153
           E + RG   +G+    AQ   A+L++A+  + N   AN + AD+  +D S S    A L 
Sbjct: 204 ETDLRGVSFLGADLQGAQMARANLKEAILRQVNLTEANLSEADLAGADLSASSLCSAKLA 263

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +    +AN  GADL    +    L   NL NA L   +LTR+DL  A + GA+   A +
Sbjct: 264 RTDLSRANLAGADLRCANLVDAYLGRTNLENADLGEAILTRADLSTANLSGANLRGATL 322



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 11/96 (11%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
            G A L+KA  V  N   AN + AD+ E+D   ++ +GA L+ A  + AN T A      
Sbjct: 52  LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTLA------ 105

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
               +L +ANL +A L    LT ++LGGA + GA+ 
Sbjct: 106 ----LLIDANLLDADLRWANLTSANLGGACLRGANL 137



 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 61/118 (51%), Gaps = 16/118 (13%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLE----------K 154
           A    A+L++A  +K N + AN   A ++     E+D  G  F GA L+          +
Sbjct: 170 ADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKE 229

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+  + N T A+LS+  +    L+ ++L +A L RT L+R++L GA +  A+  DA +
Sbjct: 230 AILRQVNLTEANLSEADLAGADLSASSLCSAKLARTDLSRANLAGADLRCANLVDAYL 287



 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 84/167 (50%), Gaps = 10/167 (5%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           A L++ ++  +T L  A +   +  ++ L D N  +A+ R  +   ++A  G A LR A 
Sbjct: 80  ADLRDAQLHGAT-LQGADLHGANLTLALLIDANLLDADLR--WANLTSANLGGACLRGAN 136

Query: 123 HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
              ++ R A   +A++  +D SG+  +GA L      +A+ +GA+L +  + +  L  AN
Sbjct: 137 LRFDSRRGAVLRNANLSRADLSGANLSGADL-----TRADLSGANLKEASLIKANLQGAN 191

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 227
           L  A L   +L+ +DL G    GAD   A +  A  K+A+ +  N T
Sbjct: 192 LQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKEAILRQVNLT 238



 Score = 40.4 bits (93), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 6/101 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL +A     N + A+   A+++ ++   ++  GA L +      +F GADL
Sbjct: 158 SGANLSGADLTRADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADL 217

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
               M R     ANL  A+L +  LT ++L  A + GAD S
Sbjct: 218 QGAQMAR-----ANLKEAILRQVNLTEANLSEADLAGADLS 253


>gi|119488469|ref|ZP_01621642.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
 gi|119455280|gb|EAW36420.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
          Length = 463

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 15/123 (12%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY--------- 158
           + A    A+L++A  H  +    N  SAD+R +D S +    ++ +K VA          
Sbjct: 107 TGASLNHANLKQANFHNADLDAVNLISADLRGADLSSASL--SWYDKVVANLSRADLTEA 164

Query: 159 ---KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
              +AN  GA+L +T + R  LN+ANL +A L+RT+L  SDL  A +  A   DA ++ A
Sbjct: 165 NLSEANLCGANLLETNLTRANLNKANLQDANLIRTILLESDLSLAELSNARLQDANLEGA 224

Query: 216 QKQ 218
           + Q
Sbjct: 225 KLQ 227



 Score = 43.9 bits (102), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 15/98 (15%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            R     +D+  ++ S ++   A LE A   +AN TG +LS   + R+ LN ANL NA L
Sbjct: 197 IRTILLESDLSLAELSNARLQDANLEGAKLQQANLTGINLSRLNLARVNLNRANLKNANL 256

Query: 188 VRTV---------------LTRSDLGGAIIEGADFSDA 210
           + T                L R++L  A + GAD +DA
Sbjct: 257 LETSFEGANLRIVNLNQANLIRANLSRASLIGADLTDA 294



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 46/84 (54%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN T  ++   + +    N A L+ A   + +F GA+L    +++  L  ANL+ A L+
Sbjct: 228 QANLTGINLSRLNLARVNLNRANLKNANLLETSFEGANLRIVNLNQANLIRANLSRASLI 287

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              LT ++L GA +E A+F  AV+
Sbjct: 288 GADLTDANLYGANLENAEFLGAVM 311


>gi|17227929|ref|NP_484477.1| hypothetical protein alr0433 [Nostoc sp. PCC 7120]
 gi|17129778|dbj|BAB72391.1| alr0433 [Nostoc sp. PCC 7120]
          Length = 143

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 44/84 (52%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A+   AD+R ++ +G+    A LE A    AN  GA+LS        L+  NLTN  L+
Sbjct: 50  QAHLIGADLRNANLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTNVKLI 109

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              L  +DL GA++  AD   A++
Sbjct: 110 NAELYNADLEGAVLANADLRGAIL 133



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 51/99 (51%), Gaps = 11/99 (11%)

Query: 116 ADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+L++A  +  + R      AN   A++  +D +G+   GA L +  A  A+ +  +L++
Sbjct: 46  ANLQQAHLIGADLRNANLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTN 105

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
                + L  A L NA L   VL  +DL GAI+ GA +S
Sbjct: 106 -----VKLINAELYNADLEGAVLANADLRGAILFGALYS 139


>gi|434394300|ref|YP_007129247.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
 gi|428266141|gb|AFZ32087.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
          Length = 213

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 46/89 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN    D+   DF  + F GA L  A  +K N +GA+L    + R  L +ANL++A L 
Sbjct: 103 RANLKEKDLSGRDFRNANFTGANLSDAFMHKVNLSGANLFQANLFRANLLQANLSHANLR 162

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
              L  +DL G+ + GAD   A I +  +
Sbjct: 163 EANLVGADLSGSDLSGADLRGARIGVGDR 191


>gi|126455703|ref|YP_001074295.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           1106a]
 gi|167896768|ref|ZP_02484170.1| pentapeptide repeat protein [Burkholderia pseudomallei 7894]
 gi|242312992|ref|ZP_04812009.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
 gi|254195379|ref|ZP_04901807.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
 gi|126229471|gb|ABN92884.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106a]
 gi|169652126|gb|EDS84819.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
 gi|242136231|gb|EES22634.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
          Length = 825

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
            C+ +  A A L+   A  R E  + SAA  G     +++ V     A+ T AD+   D 
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
            G++  GA LE A    A+ TGADLS     R VL  A+LT A LV   LT ++L  A  
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579

Query: 203 EGADFS 208
           E  DFS
Sbjct: 580 ERTDFS 585


>gi|332706458|ref|ZP_08426519.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332354342|gb|EGJ33821.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 345

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 46/88 (52%), Gaps = 5/88 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTN 184
           A+F  AD++E DFS      A L +A       ++ N  GA+L    + R  L +ANL+N
Sbjct: 231 ADFRGADLKERDFSNRNLQSANLSQANLKDAFLHRVNLAGANLEGANLFRANLFQANLSN 290

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A L    L  +D+ GA + GAD S A +
Sbjct: 291 ANLREANLIGADMSGADLSGADLSGAKV 318


>gi|53721218|ref|YP_110203.1| hypothetical protein BPSS0182 [Burkholderia pseudomallei K96243]
 gi|167818308|ref|ZP_02449988.1| hypothetical protein Bpse9_24431 [Burkholderia pseudomallei 91]
 gi|418395056|ref|ZP_12969100.1| type VI secretion system [Burkholderia pseudomallei 354a]
 gi|418554994|ref|ZP_13119746.1| type VI secretion system [Burkholderia pseudomallei 354e]
 gi|52211632|emb|CAH37627.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
 gi|385369399|gb|EIF74730.1| type VI secretion system [Burkholderia pseudomallei 354e]
 gi|385374364|gb|EIF79254.1| type VI secretion system [Burkholderia pseudomallei 354a]
          Length = 825

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
            C+ +  A A L+   A  R E  + SAA  G     +++ V     A+ T AD+   D 
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
            G++  GA LE A    A+ TGADLS     R VL  A+LT A LV   LT ++L  A  
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579

Query: 203 EGADFS 208
           E  DFS
Sbjct: 580 ERTDFS 585



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 115 SADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +ADLR A      F RA+ T AD+R++D   +   GA L+ A   +AN   A+LS  L+D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILID 802


>gi|344171276|emb|CCA83758.1| hypothetical protein, Pentapeptide repeat domains [blood disease
           bacterium R229]
          Length = 325

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 47/83 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + AD+  +  SG+  +GAYL  A    A+ +GADLS   +    L+ ANL+ A L  
Sbjct: 54  ADLSGADLSGAYLSGAYLSGAYLSDADLSGADLSGADLSGAYLSGAYLSGANLSGADLSG 113

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+ +DL GA + GAD S A +
Sbjct: 114 ANLSGADLSGADLSGADLSGAYL 136



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A          + S AD+  +D SG+  +GAYL  A    AN +GADL
Sbjct: 52  SGADLSGADLSGAYLSGAYLSGAYLSDADLSGADLSGADLSGAYLSGAYLSGANLSGADL 111

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   +    L+ A+L+ A L    L+ + L GA + GAD S A
Sbjct: 112 SGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGA 154



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+  +D SG+  +GAYL  A    A  +GADLS   +    L+ A L+ A L  
Sbjct: 114 ANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADLSGAYLSGAYLSS 173

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+ +DL GA + GA+ S A +
Sbjct: 174 ANLSGADLSGANLSGANLSGAYL 196



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 11/105 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A          + S AD+  +D SG+  +GAYL  A    AN +GADL
Sbjct: 122 SGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADLSGAYLSGAYLSSANLSGADL 181

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S           ANL+ A L    L+ +DL GA + GA+ S A +
Sbjct: 182 SG----------ANLSGANLSGAYLSSADLSGANLSGANLSGAYL 216



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A     +   A+ + AD+  +  SG+  +GAYL  A    A+ +GADL
Sbjct: 102 SGANLSGADLSGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADL 161

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   +    L+ ANL+ A L    L+ ++L GA +  AD S A
Sbjct: 162 SGAYLSGAYLSSANLSGADLSGANLSGANLSGAYLSSADLSGA 204



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 46/93 (49%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L KAV       AN + A + ++D S +  +GA L  A    A  +GA LS   +    L
Sbjct: 22  LMKAVEQAVKGSANLSGAYLSDADLSDADLSGADLSGADLSGAYLSGAYLSGAYLSDADL 81

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + A+L+ A L    L+ + L GA + GAD S A
Sbjct: 82  SGADLSGADLSGAYLSGAYLSGANLSGADLSGA 114



 Score = 37.0 bits (84), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 5/81 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A  + A++  +D SG+  +GA L       A+ +GADLS   +    L+ A L+ A L  
Sbjct: 99  AYLSGANLSGADLSGANLSGADLS-----GADLSGADLSGAYLSGAYLSGAYLSGADLSG 153

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ +DL GA + GA  S A
Sbjct: 154 ADLSGADLSGAYLSGAYLSSA 174


>gi|412993172|emb|CCO16705.1| predicted protein [Bathycoccus prasinos]
          Length = 163

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 6/140 (4%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
            G  +A    + DLRK  +   + +    + A M +S F  S F+   + K  A KA+F 
Sbjct: 29  IGQANAVSDKTLDLRKCQYDNVSVKGITLSGALMVDSVFDNSDFSETVMSKVYATKASFK 88

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
             + ++ ++DR   + +++T A     VLT     GA + GA+F +A+I     + LC  
Sbjct: 89  NVNFTNAVIDRATFDGSDMTGANFQNAVLTGVSYEGANLTGANFEEALIGDQDVKLLC-- 146

Query: 224 ANGTNPITGVSTRKSLGCGN 243
               NP     +R  +GC N
Sbjct: 147 ---LNPTVVDESRMQIGCKN 163


>gi|428214427|ref|YP_007087571.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428002808|gb|AFY83651.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 155

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/88 (39%), Positives = 48/88 (54%), Gaps = 5/88 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +  S D+ ++D SG   + A L  A   +AN TGA+LS   +    L EANLT+A L  T
Sbjct: 61  DLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANLQNT 120

Query: 191 VLTRSDLGGAI-----IEGADFSDAVID 213
             T +DL  AI     + GADF+ A +D
Sbjct: 121 NFTSADLEDAILTNANVTGADFTGADLD 148



 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 12/109 (11%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S     S DL KA     +   AN T+AD+ E++ +G+  + A L  A   +AN T A+L
Sbjct: 58  SGCDLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANL 117

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            +T          N T+A L   +LT +++ GA   GAD  D+VI L +
Sbjct: 118 QNT----------NFTSADLEDAILTNANVTGADFTGADL-DSVIGLTR 155


>gi|304404631|ref|ZP_07386292.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
 gi|304346438|gb|EFM12271.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
          Length = 288

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 48/86 (55%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  F  + +  SDFSG+   G+  + +   +ANF GA+L+D     + L +A+   ++LV
Sbjct: 100 KGQFKGSALHGSDFSGADLTGSSFKGSDVREANFDGANLTDCSFTALDLTKASFNKSILV 159

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
           RT  ++S L GA  +G   +D V+ L
Sbjct: 160 RTNFSKSGLDGAAFKGVKLTDVVLTL 185


>gi|452966664|gb|EME71673.1| putative low-complexity protein [Magnetospirillum sp. SO-1]
          Length = 241

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 49/101 (48%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A L+ AV    + F A F  ADM  +D S +   GA L       A F GA L D
Sbjct: 70  ANLSGASLKGAVFAGADLFHAIFDEADMTGADLSDTYLFGANLIATRLVGAEFKGAFLKD 129

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            LM+R  L++A +    ++R V   + L GA + GAD + A
Sbjct: 130 VLMERADLSQAKMAGVYMLRGVFEEAKLAGADLSGADMTGA 170



 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 49/103 (47%), Gaps = 6/103 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   + DLR A      F +AN   A++  +   G+ F GA L  A+  +A+ TGADL
Sbjct: 43  SGAMLENVDLRGARLDGARFAKANLKWANLSGASLKGAVFAGADLFHAIFDEADMTGADL 102

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           SDT      L  ANL    LV      + L   ++E AD S A
Sbjct: 103 SDT-----YLFGANLIATRLVGAEFKGAFLKDVLMERADLSQA 140


>gi|381205548|ref|ZP_09912619.1| hypothetical protein SclubJA_07991 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 253

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 2/89 (2%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F A+    ++  ++ SG    GA L  +    AN +G+DLS+       L +A+L+NA+L
Sbjct: 88  FSASMEGCNLENANLSGVDLQGADLSHSYLPGANLSGSDLSNANFSGATLRDADLSNAIL 147

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             T+L  +DL GA + GA+ +DA  DLA+
Sbjct: 148 KGTLLKEADLSGANLSGANLTDA--DLAK 174



 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 5/81 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF+ A +R++D S +   G  L++A    AN +GA+L+D       L +ANL+ A L+ 
Sbjct: 130 ANFSGATLRDADLSNAILKGTLLKEADLSGANLSGANLTDA-----DLAKANLSPATLLG 184

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             LTR++L    +  A+F +A
Sbjct: 185 ATLTRTNLSDTNLVKANFEEA 205


>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 710

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 60/127 (47%), Gaps = 15/127 (11%)

Query: 91  LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFS 143
           L + N ++A  T   F   + A  GSADL KA   + N          F  +D+RES++ 
Sbjct: 534 LIETNLHQANLTEATF---TGADLGSADLSKANLYRANLSKVKAEGTTFQLSDLRESNWQ 590

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           G+  +GA        +AN   ADLS  L+       A L NA L  T ++ +DL GA + 
Sbjct: 591 GANLSGANFS-----RANLKKADLSLALLTNANFRNAQLQNANLRNTDISLADLRGANLS 645

Query: 204 GADFSDA 210
           G DF  A
Sbjct: 646 GTDFKGA 652



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 14/107 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + A F SA L            N   A++ E+ F+G+    A L KA  Y+AN +     
Sbjct: 525 TQANFSSAKL---------IETNLHQANLTEATFTGADLGSADLSKANLYRANLSKVKAE 575

Query: 169 DTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            T      L E     ANL+ A   R  L ++DL  A++  A+F +A
Sbjct: 576 GTTFQLSDLRESNWQGANLSGANFSRANLKKADLSLALLTNANFRNA 622



 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 44/93 (47%), Gaps = 10/93 (10%)

Query: 128 FRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           FRA  + A M      +++FS +K     L +A   +A FTGADL    + +  L  ANL
Sbjct: 510 FRATLSKAIMPGSTITQANFSSAKLIETNLHQANLTEATFTGADLGSADLSKANLYRANL 569

Query: 183 TNAVLVRTVLTRSDL-----GGAIIEGADFSDA 210
           +      T    SDL      GA + GA+FS A
Sbjct: 570 SKVKAEGTTFQLSDLRESNWQGANLSGANFSRA 602


>gi|86606624|ref|YP_475387.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86555166|gb|ABD00124.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 371

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 65/152 (42%), Gaps = 15/152 (9%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGI----GSAAQFGSADLRKAVHVKENFR-- 129
           L  A V+  S   S L++  + E   R  + +    G    F   DL KA       R  
Sbjct: 204 LRGAKVSGTSLRGSRLSEETRLEERLRHIWQLQNWGGQGQDFSGQDLSKADLRGLGLRQI 263

Query: 130 ----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 180
               AN    D+R S+  G+   GA L++A    AN       GADL    + +  L  A
Sbjct: 264 RLRGANLKRVDLRGSNLEGADLRGANLQRADLRGANLQNADLEGADLGGAELRQAQLQGA 323

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           NL  A L R  LT+++L GA IEG   S + I
Sbjct: 324 NLRRADLSRANLTQANLEGAQIEGLKHSGSQI 355



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 5/100 (5%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
            A F  A+LRKA     NF  A+   AD+R+++  G+K +GA L+ A    A+  GA +S
Sbjct: 151 GANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGAVLQGADLRGADLRGAKVS 210

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            T +    L+E       L R +    + GG   +G DFS
Sbjct: 211 GTSLRGSRLSEETRLEERL-RHIWQLQNWGG---QGQDFS 246



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+    D++E+   G+ F  A L KA     NF GA L         L +ANL  A L  
Sbjct: 137 ADLEGVDLQEARLGGANFYEANLRKANLGLCNFNGAHLHQA-----DLRQANLQGAKLSG 191

Query: 190 TVLTRSDLGGAIIEGADFS 208
            VL  +DL GA + GA  S
Sbjct: 192 AVLQGADLRGADLRGAKVS 210


>gi|428319029|ref|YP_007116911.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428242709|gb|AFZ08495.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 520

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 2/121 (1%)

Query: 94  LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY 151
           L KY A  R   GI  + A     +L  A     N   AN + A++ +++ +G+K N A 
Sbjct: 7   LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKTNLTGAKLNIAR 66

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
           L  A    A+ T ADL+   + R+ L +A L  A L+R  L R++L GA + GA+ S A 
Sbjct: 67  LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126

Query: 212 I 212
           +
Sbjct: 127 L 127



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 1/95 (1%)

Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DL+KA+ +     RA    A++  ++ SG+  +GA L +A    AN   A+L    +   
Sbjct: 91  DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L EANL  A L    L+R+DL GA + G +   A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L  A   +   R AN   A++R +  SG+    A LE+A    A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 210
           S   +    L +ANLT AVL    L+  +L  AI+ G     AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 10/94 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV----------LN 178
           +AN   AD+  +D SG+   G  L +A   +A  +GADLS   +   +          L+
Sbjct: 159 QANLQGADLSRADLSGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLS 218

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           EA L+ A L R  L  ++L  A +  AD S+A +
Sbjct: 219 EAKLSGADLSRADLCHANLLNASLVHADLSNAYL 252



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 4/123 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADLR      E  +AN T A +  +D SG     A L       A+ + A LS
Sbjct: 168 SRADLSGADLRGT----ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKLS 223

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
              + R  L  ANL NA LV   L+ + L  A   GAD + A +  A+  A+ +    T 
Sbjct: 224 GADLSRADLCHANLLNASLVHADLSNAYLIRADWIGADLTGATLTGAKLHAVSRLGIKTE 283

Query: 229 PIT 231
            +T
Sbjct: 284 GMT 286



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 15/85 (17%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF   ++ E++ SG   +GA L+ A    AN +GA+LS T          NLT A L   
Sbjct: 16  NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKT----------NLTGAKL--- 62

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
               + L GA + GAD +DA +++A
Sbjct: 63  --NIARLSGAHLGGADLTDADLNVA 85



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A  G ADL  A ++V     A     D++++   G+K   A L +A    AN +GA+L
Sbjct: 68  SGAHLGGADLTDADLNV-----AYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122

Query: 168 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   +    L      +ANL  A L    LT ++L  A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170


>gi|359457996|ref|ZP_09246559.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 464

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 66/129 (51%), Gaps = 21/129 (16%)

Query: 111 AQFGSADLR----KAVHVKEN------------FRANFTSADMRESDFSGSKFNGAYLEK 154
           A+ G ADLR    K  ++KE              RA+   AD+RE++ S ++   + LEK
Sbjct: 36  AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95

Query: 155 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
                A+ ++AN + A L+ + ++   L +ANL+ A L    L R++LG A +  A+ + 
Sbjct: 96  SQLGAAILFRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155

Query: 210 AVIDLAQKQ 218
           A +  A+ Q
Sbjct: 156 ANLSQARLQ 164



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT---- 183
           FRAN + A +  SD   ++   A L +A   +AN   A+L    +++  L  ANL+    
Sbjct: 104 FRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163

Query: 184 -NAVLVRTVLTRSDLGGAIIEGADF 207
            NA LV T L  ++L GA ++GA+ 
Sbjct: 164 QNASLVGTQLINANLEGASLKGANL 188



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +A    AD+R ++       GA L++A    A   GADL    + +  L EANL++A L 
Sbjct: 35  KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
            + L +S LG AI+  A+ S A + L+
Sbjct: 90  LSNLEKSQLGAAILFRANLSQAQLTLS 116


>gi|119489371|ref|ZP_01622151.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
 gi|119454644|gb|EAW35790.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
          Length = 166

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 9/109 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A+   +DL   V++K    AN + A +   +FS +  +GA L  A   +ANFT A+LS
Sbjct: 55  SGAKLNGSDLS-GVNLK---GANLSGALLDNVNFSQADLSGANLSSAALTQANFTEANLS 110

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDAVI 212
           +  +    L  A LTNA L    L ++DL      GA I+GADF +A++
Sbjct: 111 EANLTGAFLRSAILTNAKLTNASLNKADLNTAKLEGAEIKGADFKEAIM 159


>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
           [Acaryochloris sp. CCMEE 5410]
          Length = 514

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 21/116 (18%)

Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +F + DLR A+ +  NF RANFT A++R ++        AY+  A    A+  GA+LSD 
Sbjct: 411 KFQNTDLRDAILINANFGRANFTGANLRNANLMQ-----AYMSHADLANADLRGANLSDA 465

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
                 L+ ANL  A          +L GA + GA  S++ +  AQ   L  Y NG
Sbjct: 466 -----YLSHANLRGA----------NLCGADLSGAKLSESQLSFAQTNWLTVYPNG 506



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 21/104 (20%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F   DLR       N R     SA+  E  F  +    A L  A   +ANFTGA      
Sbjct: 387 FSGQDLRNL-----NLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGA------ 435

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                    NL NA L++  ++ +DL  A + GA+ SDA +  A
Sbjct: 436 ---------NLRNANLMQAYMSHADLANADLRGANLSDAYLSHA 470


>gi|428203771|ref|YP_007082360.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427981203|gb|AFY78803.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 180

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 52/97 (53%), Gaps = 1/97 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADL KA  V  N    N   AD+  ++ SG+   GA L  A  + AN + A+L +
Sbjct: 60  ANLTDADLIKANLVGANLIEINLIGADLTSANLSGADLTGADLRCANLHNANLSQANLRE 119

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
             +D   L+ ANL+ A+LV T L+ +D  GA ++G D
Sbjct: 120 VHLDGADLSGANLSGAILVNTDLSVADTVGAKLDGID 156



 Score = 41.6 bits (96), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF   ++  ++  G KF G  L      +A+ +GADLS+T +    L +ANLT+A L++ 
Sbjct: 16  NFEEVNLHIANLQGLKFQGINL-----TRADLSGADLSETDLSGACLKQANLTDADLIKA 70

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L  ++L    + GAD + A
Sbjct: 71  NLVGANLIEINLIGADLTSA 90



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 46/99 (46%), Gaps = 15/99 (15%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT----- 183
           RA+ + AD+ E+D SG+    A L  A   KAN  GA+L +  +    L  ANL+     
Sbjct: 39  RADLSGADLSETDLSGACLKQANLTDADLIKANLVGANLIEINLIGADLTSANLSGADLT 98

Query: 184 ----------NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     NA L +  L    L GA + GA+ S A++
Sbjct: 99  GADLRCANLHNANLSQANLREVHLDGADLSGANLSGAIL 137


>gi|254189534|ref|ZP_04896044.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
           52237]
 gi|157937212|gb|EDO92882.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
           52237]
          Length = 825

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
            C+ +  A A L+   A  R E  + SAA  G     +++ V     A+ T AD+   D 
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
            G++  GA LE A    A+ TGADLS     R VL  A+LT A LV   LT ++L  A  
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579

Query: 203 EGADFS 208
           E  DFS
Sbjct: 580 ERTDFS 585


>gi|119487879|ref|ZP_01621376.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
 gi|119455455|gb|EAW36593.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
          Length = 514

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 72/146 (49%), Gaps = 16/146 (10%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ---FGSADLRKAVHVKE--NFR- 129
           L  A+++  + + S LAD N  +A+  G    G+  +      A L +  H++E  N R 
Sbjct: 265 LKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLRE 324

Query: 130 -----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
                AN T A++RE +  G+    A L++A+       GA+L D  + R  L EA L +
Sbjct: 325 ANLKGANLTRANLREVNLQGANLQQANLQQAI-----LQGANLKDANLIRANLREAKLQD 379

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           A L R  L R++L  A +  A+ S+A
Sbjct: 380 AKLQRVNLERANLQAANLTDANLSNA 405



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 22/126 (17%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYK--- 159
           S A    ADL+ A   + NF+      AN   A+++ +DF G+    ++L++A + +   
Sbjct: 185 SGANLQGADLQGANLHETNFQGANLAGANLGGANLKCTDFQGTNLQESHLKQAYSVRKAK 244

Query: 160 -------------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
                         N  GA+L   ++  + L+E+NL +A L +  L  ++L GA ++G +
Sbjct: 245 FAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTN 304

Query: 207 FSDAVI 212
            S A +
Sbjct: 305 LSQAYL 310



 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 11/98 (11%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L++A+    N + AN   A++RE+    +K     LE+A    AN T A+LS+
Sbjct: 345 ANLQQANLQQAILQGANLKDANLIRANLREAKLQDAKLQRVNLERANLQAANLTDANLSN 404

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                     ANLT+A L  T L ++    A++   DF
Sbjct: 405 ----------ANLTDASLCDTCLNQTQFYQAVLIRVDF 432



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 58/113 (51%), Gaps = 16/113 (14%)

Query: 113 FGSADLRKAVHVKENF---RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           F   +L+++ H+K+ +   +A F  A++   DF G    GA L++A+  + N + ++L+D
Sbjct: 224 FQGTNLQES-HLKQAYSVRKAKFAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLAD 282

Query: 170 TLMDR----------MVLNEANLTNAVLVRTVLTRS--DLGGAIIEGADFSDA 210
             +++            L   NL+ A LVRT   R   +L  A ++GA+ + A
Sbjct: 283 ANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLREANLKGANLTRA 335


>gi|428216913|ref|YP_007101378.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427988695|gb|AFY68950.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 227

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 57/98 (58%), Gaps = 4/98 (4%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ ++ ++  +D S S  + A L +     ANF+ A LS+  +  + LN+ANL++A+L  
Sbjct: 43  ADLSAGNLNHADLSNSDLSRANLYRCSLKHANFSAAKLSNANLKDVQLNDANLSDAILSC 102

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
             L  +DL GAI+ GAD S A  DL   + LC +AN T
Sbjct: 103 ANLAEADLSGAILVGADLSGA--DLTNAE-LC-HANLT 136



 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 17/178 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           SA     ADL  +   + N        ANF++A +  ++    + N A L  A+   AN 
Sbjct: 46  SAGNLNHADLSNSDLSRANLYRCSLKHANFSAAKLSNANLKDVQLNDANLSDAILSCANL 105

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQK 217
             ADLS  ++    L+ A+LTNA L    LT ++L G ++      GA+F++A ++ AQ 
Sbjct: 106 AEADLSGAILVGADLSGADLTNAELCHANLTGANLEGVLLHNANLTGANFTNANMENAQL 165

Query: 218 QALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKL--LDRDGFCDS 273
                 A+ TN     +T  ++   NS   A    ++ L     Q    L+    CD+
Sbjct: 166 DG----ADLTNANLSGTTLHNVNLANSNLQAVNLTNADLRGVNLQHTHNLETANLCDA 219


>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 521

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/99 (38%), Positives = 52/99 (52%), Gaps = 1/99 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G ADL  A     N  RANF  A ++E+D + +  +GA+L  A    AN +GA LS 
Sbjct: 88  AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLSGALLSR 147

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
             ++   L+ ANL NA L    L  +DL  A ++ AD S
Sbjct: 148 ANLENADLSYANLENADLSYANLENADLSHANLKNADLS 186



 Score = 40.4 bits (93), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 48/84 (57%), Gaps = 5/84 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+  SA++R ++   + FN A+L++A    AN +GA L        +LN ANL+ A+L R
Sbjct: 93  ADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGA----NLLN-ANLSGALLSR 147

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
             L  +DL  A +E AD S A ++
Sbjct: 148 ANLENADLSYANLENADLSYANLE 171



 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 7/109 (6%)

Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF  A  +SA +  ++  G   N AYL  A  Y AN  GA+L     +   L EA+LTNA
Sbjct: 64  NFEIAYLSSAKLSCANLEGINLNRAYLGGADLYSANLRGANLIRANFNDAHLKEADLTNA 123

Query: 186 VL----VRTV-LTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTN 228
            L    +R   L  ++L GA++  A+  +A +  A  + A   YAN  N
Sbjct: 124 NLSGAHLRGANLLNANLSGALLSRANLENADLSYANLENADLSYANLEN 172


>gi|119356056|ref|YP_910700.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           DSM 266]
 gi|119353405|gb|ABL64276.1| pentapeptide repeat protein [Chlorobium phaeobacteroides DSM 266]
          Length = 446

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 3/99 (3%)

Query: 109 SAAQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           S A   +ADLR + ++++ F  +A+   AD+RE+         A++EK++  KAN   A+
Sbjct: 82  SGANLNNADLRGS-NLQQAFIKKADLKGADLREAYLVKVNLKEAFMEKSMLQKANLQSAN 140

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           L  T   R  L  +NL +AVL  T    +DL GA ++GA
Sbjct: 141 LRWTRFHRADLAGSNLQDAVLFETSFVDADLRGANLKGA 179



 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTG 164
           A F  A+L +A+    +  +A+F  ADM++    G+  +GA     ++E A    AN +G
Sbjct: 307 ADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDRSFMEGADLRNANLSG 366

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           A+L   ++    L+ ANL+ A L  T L  ++L GA ++GA+ 
Sbjct: 367 ANLFGAMLKDANLSGANLSGASLFETDLEGANLSGANLKGANL 409



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+    +L+KA     +F  AN   A M  +D S + F  A ++K     AN +GA+L  
Sbjct: 292 ARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDR 351

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           + M+   L  ANL+ A L   +L  ++L GA + GA
Sbjct: 352 SFMEGADLRNANLSGANLFGAMLKDANLSGANLSGA 387



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 15/91 (16%)

Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGAD----------LSDTLMDR 174
           AN +++ +  ++ SG+  N     G+ L++A   KA+  GAD          L +  M++
Sbjct: 69  ANLSNSSLVRAELSGANLNNADLRGSNLQQAFIKKADLKGADLREAYLVKVNLKEAFMEK 128

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            +L +ANL +A L  T   R+DL G+ ++ A
Sbjct: 129 SMLQKANLQSANLRWTRFHRADLAGSNLQDA 159



 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 8/90 (8%)

Query: 126 ENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           EN R    N   A M  +DF  +  + A +E A   KA+F  AD     M ++ L  ANL
Sbjct: 290 ENARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKAD-----MKKVKLQGANL 344

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           + A L R+ +  +DL  A + GA+   A++
Sbjct: 345 SGANLDRSFMEGADLRNANLSGANLFGAML 374


>gi|428313290|ref|YP_007124267.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428254902|gb|AFZ20861.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 283

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 40/78 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN    ++R +   G+    A L+  +   AN TGA+LS   +    LNEANL  A L+ 
Sbjct: 34  ANLIGVNLRGAHLQGTNLRKALLDHTLLIAANLTGANLSQANLSHASLNEANLVEACLID 93

Query: 190 TVLTRSDLGGAIIEGADF 207
           T L  +DL  A + GA+ 
Sbjct: 94  TTLISADLSHAELTGANL 111



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 58/113 (51%), Gaps = 11/113 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQ    +L +A  V+ N    N ++A++ E++  G+     YL KA   KAN + A L
Sbjct: 147 SGAQLLRTNLSEAKLVQANLSHTNLSNANLHEAELIGT-----YLYKAELQKANLSEAHL 201

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDAVIDLA 215
           S   + R  L EA+L  A L    L+RS     DL GA + GA+ S A ++ A
Sbjct: 202 SGAYLSRANLREADLERADLRWANLSRSNLCEADLKGANLRGANLSKANLERA 254



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 10/90 (11%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMDRMVLNEANLTNAV 186
             SAD+  ++ +G+   GA L     Y AN  G DLSD     T + R+ L  A+L+ A 
Sbjct: 96  LISADLSHAELTGANLIGADL-----YGANLKGVDLSDANLIGTNLRRVNLQGADLSGAQ 150

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           L+RT L+ + L  A +   + S+A +  A+
Sbjct: 151 LLRTNLSEAKLVQANLSHTNLSNANLHEAE 180


>gi|354569053|ref|ZP_08988212.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353539057|gb|EHC08553.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 519

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 2/117 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   +A++R+A     NF  AN + A++R +D +G+  + A L +A    AN  GADL
Sbjct: 173 SGANCRNAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSEAKLSGANLIGADL 232

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 223
           S+  +    L  A+LT A L++     +DL GA + GA  +S +   L  +  +C++
Sbjct: 233 SNANLTNASLVHADLTQAKLIKAEWVGADLSGATLTGAKLYSTSRFGLKTEGMICEW 289



 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 71/147 (48%), Gaps = 27/147 (18%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKA--VHVKENFRANFTSADMR----------ESD 141
           L KYEA  R          F S DL +A    VK N  ANF+ A++            +D
Sbjct: 7   LAKYEAGER---------DFRSVDLSEANLSGVKLN-EANFSHANLSIVNLSGSHLCGTD 56

Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
           FS ++ N A L  A  ++AN   A L+   + R  L+ A L +A L+R  L R+DL  A 
Sbjct: 57  FSHAQINVARLSGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRAD 116

Query: 202 IEGADFSDAVIDLAQ---KQALCKYAN 225
           +  A+ + A  DL +   + A+ +YAN
Sbjct: 117 LFAANLNCA--DLREASLRHAILRYAN 141



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+LR ++  + N   AN  + D+  +D SG+    A + +A    +NF+GA+LS 
Sbjct: 140 ANLNEANLRDSLLTEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSG 199

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQAL 220
             +    LN ANL+ A L    L+ ++L GA +  A+ ++A +   DL Q + +
Sbjct: 200 ANLRWADLNGANLSWADLSEAKLSGANLIGADLSNANLTNASLVHADLTQAKLI 253



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 11/110 (10%)

Query: 109 SAAQFGSADLRKAVHVKEN------FRANFTSADMRESDFSG-----SKFNGAYLEKAVA 157
           S AQ  SA L +A  ++ +      F AN   AD+RE+         +  N A L  ++ 
Sbjct: 93  SRAQLQSASLIRAELIRADLSRADLFAANLNCADLREASLRHAILRYANLNEANLRDSLL 152

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            +AN  GA+L++T + R   + AN  NA + +  L+ S+  GA + GA+ 
Sbjct: 153 TEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSGANL 202



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 1/101 (0%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L  A ++V    RA+ + A ++ +    ++   A L +A  + AN   ADL
Sbjct: 68  SGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRADLFAANLNCADL 127

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            +  +   +L  ANL  A L  ++LT ++L GA +   D S
Sbjct: 128 REASLRHAILRYANLNEANLRDSLLTEANLEGANLNNTDLS 168


>gi|119488080|ref|ZP_01621524.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
 gi|119455369|gb|EAW36508.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
          Length = 351

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 6/99 (6%)

Query: 118 LRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           LR+    + NF       A   +A++   + S +   GA L K     AN +GADLS+  
Sbjct: 13  LRRYAKGERNFSEINLMAAQLNAANLNRVNLSYANLTGANLSKTRLICANLSGADLSNAN 72

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + + +L EA L  A L +T+L +++L GA++ G+  S+A
Sbjct: 73  LSQAILIEATLNGASLTQTLLVQANLSGALLSGSILSEA 111



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 48/87 (55%), Gaps = 6/87 (6%)

Query: 130 ANFTSADM-RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANLT 183
           AN T A +   S  +GSK   A L  A   +A  +  DLS   + R +L+E     ANL+
Sbjct: 116 ANLTGASLIGTSLLNGSKLIEATLIGATLSRATLSAIDLSGVNLTRAILSESELGGANLS 175

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +A L+R  L RS+L GA + GAD S+A
Sbjct: 176 SACLIRAYLNRSNLSGANLMGADLSEA 202



 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 58/103 (56%), Gaps = 14/103 (13%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 164
           AAQ  +A+L + V++     AN T A++ ++     + SG+  + A L +A+  +A   G
Sbjct: 30  AAQLNAANLNR-VNLS---YANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNG 85

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           A L+ TL+      +ANL+ A+L  ++L+ +DL GA + GA  
Sbjct: 86  ASLTQTLLV-----QANLSGALLSGSILSEADLSGANLTGASL 123



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N T A + ES+  G+  + A L +A   ++N +GA+L         L+EA+L NA L   
Sbjct: 158 NLTRAILSESELGGANLSSACLIRAYLNRSNLSGANLMGA-----DLSEASLCNANLCVA 212

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            LTR++L GA +EGA+ + A
Sbjct: 213 NLTRANLQGADLEGANLNGA 232



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL +A     N   AN T A+++ +D  G+  NGA L  A     N   A+L
Sbjct: 190 SGANLMGADLSEASLCNANLCVANLTRANLQGADLEGANLNGAQLSGANLKSTNLKNANL 249

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +  ++    L  A+L+ A L    LT ++L GA +  AD   A
Sbjct: 250 NGLILHEADLRLADLSQANLRGANLTGANLAGASLLEADLRGA 292



 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 56/106 (52%), Gaps = 2/106 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L K   +  N   A+ ++A++ ++    +  NGA L + +  +AN +GA L
Sbjct: 44  SYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGASLTQTLLVQANLSGALL 103

Query: 168 SDTLMDRMVLNEANLTNAVLVRT-VLTRSDLGGAIIEGADFSDAVI 212
           S +++    L+ ANLT A L+ T +L  S L  A + GA  S A +
Sbjct: 104 SGSILSEADLSGANLTGASLIGTSLLNGSKLIEATLIGATLSRATL 149



 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 8/111 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           I S ++ G A+L  A  ++    R+N + A++  +D S +    A L  A   +AN  GA
Sbjct: 163 ILSESELGGANLSSACLIRAYLNRSNLSGANLMGADLSEASLCNANLCVANLTRANLQGA 222

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           DL     +   LN A L+ A L  T L  ++L G I+  AD   A  DL+Q
Sbjct: 223 DL-----EGANLNGAQLSGANLKSTNLKNANLNGLILHEADLRLA--DLSQ 266


>gi|86608529|ref|YP_477291.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86557071|gb|ABD02028.1| pentapeptide repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 248

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 15/128 (11%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLNEA---- 180
           +NFT+A + +S F G +F+ +   +A    AN     F  AD     + R  L++A    
Sbjct: 109 SNFTAAKLDKSSFQGGRFSHSIFREASLVAANLAEGNFFAADFRQANLSRCNLSQAALVS 168

Query: 181 ------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
                 NL  A+LV   L  + +   +  GADF+DA +    ++ L + A+GTN +T   
Sbjct: 169 CQLQFANLEQAILVGANLRDAQIEDTLFSGADFTDAKLSDETRKLLIERASGTNELTQRD 228

Query: 235 TRKSLGCG 242
           T  +L  G
Sbjct: 229 TLNTLLAG 236


>gi|357146891|ref|XP_003574148.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
           [Brachypodium distachyon]
          Length = 227

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 8/128 (6%)

Query: 117 DLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           DLR   +  E  N +    ++A M ++ F G+      + KA A  A+F G D ++ ++D
Sbjct: 104 DLRFCDYTNEKNNLKGKTLSAALMSDAKFDGADLTEVVMSKAYAVGASFKGTDFTNAVID 163

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
           R    +A+L  A+   TVL+ S    A ++   F D +I     Q LC+     N     
Sbjct: 164 RANFGKADLEGAIFKNTVLSGSTFDDANMKDVVFEDTIIGYIDLQKLCR-----NMSINE 218

Query: 234 STRKSLGC 241
             R  LGC
Sbjct: 219 DARLDLGC 226


>gi|313681545|ref|YP_004059283.1| pentapeptide repeat-containing protein [Sulfuricurvum kujiense DSM
           16994]
 gi|313154405|gb|ADR33083.1| pentapeptide repeat protein [Sulfuricurvum kujiense DSM 16994]
          Length = 198

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 57/104 (54%), Gaps = 6/104 (5%)

Query: 105 FGIG-SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           F IG S + + +A L+KA+  KE    +   AD+ ++DFSG  F+G+ L +A  +++ F 
Sbjct: 8   FWIGVSLSAYDAAHLKKALEDKECIGCDLRGADLSQNDFSGGDFHGSDLSEADLHESIFE 67

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             DLSD       L+ AN  NA+  +  + R+DL      GA+F
Sbjct: 68  MGDLSDC-----NLSGANAENALFWKGTMERADLTRIHARGANF 106


>gi|158340059|ref|YP_001521229.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158310300|gb|ABW31915.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 483

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 54/100 (54%), Gaps = 1/100 (1%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+ RKA + + +  +A+   A + ++D SG+ F+GAYL KA    A   GADLS 
Sbjct: 317 AHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANLSSAFLIGADLSR 376

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
             +  ++L  ANL +A L    L+ +DL  AI+   D  +
Sbjct: 377 ANLSDVILRGANLLSANLSDASLSSADLNNAILLNTDLRE 416



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           +N   A++R +  SG+ F  A L      KA    A+   ADLS        L +ANL++
Sbjct: 307 SNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANLSS 366

Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
           A L+   L+R++L   I+ GA+ 
Sbjct: 367 AFLIGADLSRANLSDVILRGANL 389



 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 41/81 (50%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++  S+   +    A+L  A   KAN + AD+S   +    LN+A+L+ A    
Sbjct: 297 ANLGGANLSYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSG 356

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L +++L  A + GAD S A
Sbjct: 357 AYLYKANLSSAFLIGADLSRA 377


>gi|428221053|ref|YP_007105223.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994393|gb|AFY73088.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 270

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 55/98 (56%), Gaps = 9/98 (9%)

Query: 115 SADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           +ADLR+A         N T+AD+  ++   +  +GA L  A    AN + A+L D L+ +
Sbjct: 137 NADLRQA---------NLTNADLIYANLKNANLSGANLSGANLSGANLSDANLEDALLHK 187

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             L+ ANL +A    T+L R++L GA + GA F +A++
Sbjct: 188 AKLSNANLKSANFSGTILVRANLIGADLTGAIFKEAIL 225



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 11/117 (9%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           A   G + IG  A     DL  +  V  N R    S ++ ++D  G+      L  A   
Sbjct: 63  ANLMGAYLIG--ANLSHVDLSGSNLVGANLR----SINLNDTDLKGADLRETILRNARMA 116

Query: 159 KANFTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + N TG++LS+       ++   L +ANLTNA L+   L  ++L GA + GA+ S A
Sbjct: 117 RVNLTGSNLSNADLVYVNLENADLRQANLTNADLIYANLKNANLSGANLSGANLSGA 173



 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 11/103 (10%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L  A  +  N + AN + A++  ++ SG+  + A LE A+ +KA  + A+L  
Sbjct: 138 ADLRQANLTNADLIYANLKNANLSGANLSGANLSGANLSDANLEDALLHKAKLSNANLK- 196

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     AN +  +LVR  L  +DL GAI + A    A +
Sbjct: 197 ---------SANFSGTILVRANLIGADLTGAIFKEAILVHATM 230


>gi|424513094|emb|CCO66678.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
          Length = 140

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 63/130 (48%), Gaps = 21/130 (16%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H ++  +  FT   ++ ++F G+  +G  L  A   +A+FTGA+L +          ANL
Sbjct: 20  HDQDLTQTYFTKGSLKRANFRGANLSGISLFGANLEEADFTGANLEN----------ANL 69

Query: 183 TNAVLVRTVLTRSDLGGAIIEGA-----------DFSDAVIDLAQKQALCKYANGTNPIT 231
               L++T  T ++L  AI+ GA           D+S  +I       +C  A+G +P++
Sbjct: 70  GQCNLLKTNFTGANLTNAIVSGASNLETVKANDSDWSQVIIRKDVLMGICANADGVSPVS 129

Query: 232 GVSTRKSLGC 241
           G  T+ +L C
Sbjct: 130 GDPTKMTLEC 139


>gi|186685487|ref|YP_001868683.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186467939|gb|ACC83740.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 146

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 10/106 (9%)

Query: 115 SADLRKAVHVKE---------NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           SA +R+ +  +E         N + A+    D+R ++  G+   GA LE A    AN   
Sbjct: 28  SAPVRRLLETRECLGCNLAGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKS 87

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+L++  +   +LN ANLTN  L  + L  +D+ GA++   D S A
Sbjct: 88  ANLTEAFVSDTILNNANLTNVNLSNSRLYNTDVDGAVLANIDLSGA 133


>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 479

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 11/105 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL ++     N  RA+ T A +RE++  G++F GA L++A   KAN  GA+L
Sbjct: 60  SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANLVGANL 119

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     +EANLT A L    L  S L GAI++ A +++  I
Sbjct: 120 ----------HEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 51/97 (52%), Gaps = 11/97 (11%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADLR     + +   AN + AD+RE+DF+G          A    AN +GADL    + 
Sbjct: 338 SADLRGVDLTRADLSGANLSDADLRETDFTG----------ATLLFANLSGADLRGVDLT 387

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +  L+ ANLT A L +  L R +L GA +  AD SDA
Sbjct: 388 KADLSGANLTEADLRKADLMRVNLEGADLTEADLSDA 424



 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 46/84 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+ ES  + +    A L  AV  +AN  GA+ +   + +  L +ANL  A L  
Sbjct: 62  ANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANLVGANLHE 121

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
             LTR++L GA + G+  S A++D
Sbjct: 122 ANLTRANLSGADLRGSQLSGAILD 145



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           +N +S +++  DFS +    AYL+ A   + +  GADLS       +L++ NL++A L  
Sbjct: 289 SNLSSVNLKNVDFSRASLKKAYLKGANLEQTDLRGADLSGA-----ILHQVNLSSADLRG 343

Query: 190 TVLTRSDLGGAIIEGADFSD 209
             LTR+DL GA +  AD  +
Sbjct: 344 VDLTRADLSGANLSDADLRE 363



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 68/159 (42%), Gaps = 37/159 (23%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAV-------- 156
           A+F  A+L++A  +K N        AN T A++  +D  GS+ +GA L+KAV        
Sbjct: 97  AEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFP 156

Query: 157 ------AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
                 A  A            N    DL++  +    L   NL  A+L    L R++L 
Sbjct: 157 EDIDPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLA 216

Query: 199 GAIIEGADFSDAVID--LAQKQALCK---YANGTNPITG 232
           GA +  AD  +A +   + +K    K   ++ G +P  G
Sbjct: 217 GANLSAADLREASLSGTILEKAVYSKKTLFSEGIDPALG 255



 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 16/103 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADLR+          AN + AD+R     ++D SG+    A L KA   + N 
Sbjct: 352 SGANLSDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGANLTEADLRKADLMRVNL 411

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            GADL+          EA+L++A L R  L  ++L G  ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444


>gi|428307284|ref|YP_007144109.1| serine/threonine protein kinase with pentapeptide repeats
           [Crinalium epipsammum PCC 9333]
 gi|428248819|gb|AFZ14599.1| serine/threonine protein kinase with pentapeptide repeats
           [Crinalium epipsammum PCC 9333]
          Length = 564

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 51/85 (60%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N ++ ++++++ SG  F+ A L +      NF GA+LS+T M +  L+ A L +A LVR 
Sbjct: 444 NLSNLNLQKANLSGGNFHQANLTQT-----NFQGANLSNTDMGQTSLSGAMLRDANLVRA 498

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
            L+ +DL GA + GAD S A  + A
Sbjct: 499 YLSYADLEGADLRGADLSFAYFNYA 523


>gi|428215789|ref|YP_007088933.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004170|gb|AFY85013.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 222

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTR 194
           ++ S +  +GA L+ A    AN +GA+LS+  M  + L+EANLT     NA L   ++++
Sbjct: 68  ANLSNANLSGALLKDAKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENALMSK 127

Query: 195 SDLGGAIIEGADFSDAVI 212
            DL GA + GAD  DA+I
Sbjct: 128 VDLTGADLTGADLIDAII 145



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 51/84 (60%), Gaps = 5/84 (5%)

Query: 130 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN ++A+M      E++ +G+  + A LE A+  K + TGADL+   +   ++++ANL+N
Sbjct: 93  ANLSNAEMSGITLSEANLTGANLSNAELENALMSKVDLTGADLTGADLIDAIISDANLSN 152

Query: 185 AVLVRTVLTRSDLGGAIIEGADFS 208
           A + +  L ++ L  + + GADFS
Sbjct: 153 ASVTQAQLKKAILSRSNLSGADFS 176



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 53/92 (57%), Gaps = 6/92 (6%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           A ++ ++ SG+  + A +      +AN TGA+LS+  ++  ++++ +LT A      LT 
Sbjct: 83  AKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENALMSKVDLTGA-----DLTG 137

Query: 195 SDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
           +DL  AII  A+ S+A +  AQ K+A+   +N
Sbjct: 138 ADLIDAIISDANLSNASVTQAQLKKAILSRSN 169



 Score = 37.7 bits (86), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 5/81 (6%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            +  D+   D  GS  NGA L  A     N +GA L D  +    L+ ANL+NA +    
Sbjct: 50  LSGVDLSGKDLYGSALNGANLSNA-----NLSGALLKDAKLQTANLSGANLSNAEMSGIT 104

Query: 192 LTRSDLGGAIIEGADFSDAVI 212
           L+ ++L GA +  A+  +A++
Sbjct: 105 LSEANLTGANLSNAELENALM 125


>gi|376002766|ref|ZP_09780588.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
 gi|375328822|emb|CCE16341.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
          Length = 529

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 59/107 (55%), Gaps = 14/107 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
           +ANFT A +  ++FSG+   G  L +A    +  +GA L    ++  VLN          
Sbjct: 44  QANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 103

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           +ANL +A L+R  L R++L  AI+ GA+ ++A  DL  ++A  ++A+
Sbjct: 104 QANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 146



 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           DFS      A L +    +ANFT A LS T       + ANLT   L R  L  S L GA
Sbjct: 26  DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 80

Query: 201 IIEGADFSDAVIDLA 215
           I++GA+ ++AV+++A
Sbjct: 81  ILQGANLNEAVLNVA 95



 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 64/129 (49%), Gaps = 13/129 (10%)

Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
           TR +  +   S A    A+L +AV +V    RA+ + A++ ++    ++   A L +A+ 
Sbjct: 68  TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 127

Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 207
             AN T ADL +  +    L +     ANL+ A L+     R+ LTR+DL  A + G + 
Sbjct: 128 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 187

Query: 208 SDAVIDLAQ 216
            +A +  A+
Sbjct: 188 RNAELRQAE 196



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN T AD+RE+     D   +  +GA L +A    +N   ++L+   + R  L   NL N
Sbjct: 130 ANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNLRN 189

Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
           A L +  L  +DL GA + GA+ 
Sbjct: 190 AELRQAELNGADLRGANLSGANL 212



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 1/91 (1%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           A+L +A  +  N  R+N T AD+  +D  G     A L +A    A+  GA+LS   +  
Sbjct: 155 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 214

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
             L+ ANL+ A L  T L+ + L GA + GA
Sbjct: 215 ANLSGANLSGANLEATQLSGASLRGANLSGA 245



 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 1/96 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL +A     N R A    A++  +D  G+  +GA L  A    AN +GA+L  T +  
Sbjct: 175 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 234

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+ A L+      +DL  A +   D++DA
Sbjct: 235 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 270


>gi|354567192|ref|ZP_08986362.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353543493|gb|EHC12951.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 206

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/88 (39%), Positives = 47/88 (53%), Gaps = 5/88 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R NF  AD+R ++ SG+   GA L +A   + NF  ADLS     +  L +ANL  A L 
Sbjct: 42  RINFKGADLRSANLSGAILTGANLREANLQQVNFCDADLS-----QADLTQANLCGACLW 96

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           R  L+ S L GA +  AD  +A +  AQ
Sbjct: 97  RVQLSDSQLWGASLCNADLREADLSAAQ 124



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 47/81 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+  +AD+RE+D S ++   A L +A   +AN T A L  +++     N+ANLTNA L  
Sbjct: 108 ASLCNADLREADLSAAQLIEASLVEANLVRANLTKAKLCGSVLIEANFNQANLTNADLKW 167

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T L  ++   A +E A+F +A
Sbjct: 168 TNLMAANFSEANLENANFKNA 188


>gi|428309179|ref|YP_007120156.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428250791|gb|AFZ16750.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 303

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 6/106 (5%)

Query: 113 FGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F  + L +AV  + +F        +F  AD+RE+DF+   F+ A L +A    AN   A 
Sbjct: 136 FWRSHLMRAVLRRVDFHEAILQETSFRQADLREADFTRVYFSEASLSEANLRGANLDQAL 195

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +  T   R  L +A+L  A L R V  ++DL GA  +GA    AV 
Sbjct: 196 VKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 15/100 (15%)

Query: 129 RANFTSADMRESDFSGSKF----------NGAYLEKAVAYK-----ANFTGADLSDTLMD 173
           RAN + A++  ++ SG++           N A LE A+ ++     AN  GA L +T + 
Sbjct: 58  RANLSRANLSHANLSGARLECVSLSRANLNQADLEGAILFQSNLSQANLIGASLPETDLQ 117

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
              L +ANLT A L  T+  RS L  A++   DF +A++ 
Sbjct: 118 VATLFQANLTGACLRGTIFWRSHLMRAVLRRVDFHEAILQ 157



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 21/116 (18%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMR----------ESDFSGSKFNGAYLEKAVA 157
           S A    A+L +A+  + +F R N   A ++          ++D SG+ F GA L+ AV 
Sbjct: 182 SEANLRGANLDQALVKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             AN TGA+     ++R V   ANLT           ++L GA ++ A F +  I+
Sbjct: 242 RGANLTGANFEGANLERAVFRGANLTG----------TNLKGASLQWAVFKEVNIE 287



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 21/103 (20%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A+  + N  +AN   A + E+D          L+ A  ++AN TGA L
Sbjct: 82  SRANLNQADLEGAILFQSNLSQANLIGASLPETD----------LQVATLFQANLTGACL 131

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             T+  R          + L+R VL R D   AI++   F  A
Sbjct: 132 RGTIFWR----------SHLMRAVLRRVDFHEAILQETSFRQA 164


>gi|209526072|ref|ZP_03274604.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423067543|ref|ZP_17056333.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493460|gb|EDZ93783.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406711117|gb|EKD06319.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 519

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 59/107 (55%), Gaps = 14/107 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
           +ANFT A +  ++FSG+   G  L +A    +  +GA L    ++  VLN          
Sbjct: 34  QANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 93

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           +ANL +A L+R  L R++L  AI+ GA+ ++A  DL  ++A  ++A+
Sbjct: 94  QANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 136



 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           DFS      A L +    +ANFT A LS T       + ANLT   L R  L  S L GA
Sbjct: 16  DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70

Query: 201 IIEGADFSDAVIDLA 215
           I++GA+ ++AV+++A
Sbjct: 71  ILQGANLNEAVLNVA 85



 Score = 40.4 bits (93), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 64/129 (49%), Gaps = 13/129 (10%)

Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
           TR +  +   S A    A+L +AV +V    RA+ + A++ ++    ++   A L +A+ 
Sbjct: 58  TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 117

Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 207
             AN T ADL +  +    L +     ANL+ A L+     R+ LTR+DL  A + G + 
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177

Query: 208 SDAVIDLAQ 216
            +A +  A+
Sbjct: 178 RNAELRQAE 186



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN T AD+RE+     D   +  +GA L +A    +N   ++L+   + R  L   NL N
Sbjct: 120 ANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNLRN 179

Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
           A L +  L  +DL GA + GA+ 
Sbjct: 180 AELRQAELNGADLRGANLSGANL 202



 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 1/91 (1%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           A+L +A  +  N  R+N T AD+  +D  G     A L +A    A+  GA+LS   +  
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
             L+ ANL+ A L  T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGA 235



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 1/96 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL +A     N R A    A++  +D  G+  +GA L  A    AN +GA+L  T +  
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+ A L+      +DL  A +   D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260


>gi|307152584|ref|YP_003887968.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982812|gb|ADN14693.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 333

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 47/171 (27%), Positives = 85/171 (49%), Gaps = 13/171 (7%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANF 132
           L+ A++      ++ L D N   A+ RG    G   + A    A++R+    K+NF  N 
Sbjct: 92  LSGAILQETDLTLAMLLDANLIGADLRGSDLSGANLTGACLRGANMRQE---KKNFNTNL 148

Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
            +A++ ++D  G+   G  L +A     N +GA+L +  +    L +A+L+ A L  T+L
Sbjct: 149 QAANLFKADLQGANMKGVDLARA-----NLSGANLKEANLRDADLRKADLSKANLTGTIL 203

Query: 193 TRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
           + ++L GA + GAD ++A  +L + + +   A G N    + T  +L   N
Sbjct: 204 SEANLVGANLTGADLNNA--NLVRAKMMQAEAGGANFKGAIMTHINLNATN 252



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 1/87 (1%)

Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF+ A  T  ++  ++ SG+  +   L  A   +AN +GA L +  +  +   +ANLT A
Sbjct: 237 NFKGAIMTHINLNATNLSGANLSFTRLNHADLTRANLSGAYLKEAELIEVFFAKANLTGA 296

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVI 212
            L  T LTRSDL  A +   + S+A++
Sbjct: 297 DLSNTNLTRSDLMSANLSRVNLSEAIM 323



 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 10/82 (12%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   A++  SD SG+  +          +A+ TGADL   ++  ++L+ A L    L 
Sbjct: 54  RANLAQANLVASDLSGANLS----------QADLTGADLRSAMLHGIILSGAILQETDLT 103

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
             +L  ++L GA + G+D S A
Sbjct: 104 LAMLLDANLIGADLRGSDLSGA 125


>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 305

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 26/137 (18%)

Query: 90  ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK- 146
           A+ DL NKY+A  R      S  +    DLR     + NF+ A+F+ A++RE DFSG+  
Sbjct: 6   AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61

Query: 147 ----FN---------------GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
               FN               G+YL KA   K N   A+LS   +    L+++NLTNA L
Sbjct: 62  SEAFFNEADLTGANLQEANLQGSYLMKAYLMKTNLQSANLSKAYLTGAYLSKSNLTNANL 121

Query: 188 VRTVLTRSDLGGAIIEG 204
               L  S L GA + G
Sbjct: 122 TGAYLNGSKLNGADLTG 138


>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 390

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 64/122 (52%), Gaps = 21/122 (17%)

Query: 110 AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL- 167
           +A    ADL +A+ +K NF +A+ +SA++ +S+   + F  AYL      KAN + ADL 
Sbjct: 111 SAHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYL-----IKANLSEADLF 165

Query: 168 -----SDTLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
                S  L D  +         +  ANL  A L    LT+++LG A + GA+ +DA ++
Sbjct: 166 QADLSSANLKDVNLSAANLTECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLN 225

Query: 214 LA 215
           LA
Sbjct: 226 LA 227



 Score = 44.3 bits (103), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 26/160 (16%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE---------------- 153
           A F  ADL  A     N  + NF+ A++  ++ SGS  NGA L+                
Sbjct: 57  ADFSEADLSGAHLSLANLSKVNFSGANLTGANLSGSSLNGANLQGATLSAVNLESAHLNW 116

Query: 154 ----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
               +A+  K NF  ADLS   + +  L  AN   A L++  L+ +DL  A +  A+  D
Sbjct: 117 ADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLSSANLKD 176

Query: 210 AVIDLAQKQALCKY--AN--GTNPITGVSTRKSLGCGNSR 245
             +  A     CK   AN  G N      T+ +LG  N R
Sbjct: 177 VNLSAANLTE-CKMTRANLMGANLTEADLTKANLGRANLR 215



 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 66/153 (43%), Gaps = 23/153 (15%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-------------- 121
           L+AA +  C    + L   N  EA+        + A  G A+LR A              
Sbjct: 179 LSAANLTECKMTRANLMGANLTEADL-------TKANLGRANLRGANLTDAYLNLASLVE 231

Query: 122 --VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
             +H     RAN + A++ ++       NGA+L K     A+  G DLS  L+  + L  
Sbjct: 232 ADLHQANLTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSHKLLTGINLAG 291

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A L+ A LV  +L  ++L  A + GA+  +A +
Sbjct: 292 AYLSEATLVGALLMEANLSAANLSGANLQNACL 324



 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 6/102 (5%)

Query: 115 SADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           SA+  +A  +K N      F+A+ +SA++++ + S +      + +A    AN T ADL+
Sbjct: 146 SANFVRAYLIKANLSEADLFQADLSSANLKDVNLSAANLTECKMTRANLMGANLTEADLT 205

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              + R  L  ANLT+A L    L  +DL  A +  A+ S A
Sbjct: 206 KANLGRANLRGANLTDAYLNLASLVEADLHQANLTRANLSRA 247


>gi|441147419|ref|ZP_20964505.1| OxyO [Streptomyces rimosus subsp. rimosus ATCC 10970]
 gi|440620240|gb|ELQ83273.1| OxyO [Streptomyces rimosus subsp. rimosus ATCC 10970]
          Length = 345

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 6/86 (6%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADLR+A   + N R      AN   AD+R +D  G    G  L  AV Y+A   G
Sbjct: 223 ADLREADLREATPARANLRDADLSDANVRKADLRFADLRGVDLWGTDLRGAVLYRAKLAG 282

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRT 190
            +LS+  +D   L  A+LT+A + R+
Sbjct: 283 LELSEAHLDGADLRGADLTDAAVARS 308



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 1/93 (1%)

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           A H +   R A     D+R++  SG+   GA L +A    A+   ADL +    R  L +
Sbjct: 183 ADHKRAQLRGAILRDCDLRDARLSGADLRGARLARADLADADLREADLREATPARANLRD 242

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+L++A + +  L  +DL G  + G D   AV+
Sbjct: 243 ADLSDANVRKADLRFADLRGVDLWGTDLRGAVL 275


>gi|217423045|ref|ZP_03454547.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
 gi|217393953|gb|EEC33973.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
          Length = 825

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
          Length = 1263

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 56/105 (53%)

Query: 114 GSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           GS   +  +H  +  + + ++ ++  +D S S    A L +++ Y+AN + A+L  + M+
Sbjct: 487 GSKLEKSNLHKSKLSKVDLSNCNLTLTDMSSSDLQKADLSRSLFYRANLSSANLKSSNMN 546

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
              L+  NL++A L R  L  S L GA +EGADFS   +  A  Q
Sbjct: 547 GADLSHCNLSSACLERASLYGSKLEGANLEGADFSHCDLSFAMLQ 591



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 9/99 (9%)

Query: 104  EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
            +F   +   F   DLR            F ++D R  DFSGSK +G  L KA    +N +
Sbjct: 1020 KFAGATGLNFKDVDLRSC---------KFANSDFRGQDFSGSKLSGVQLSKANLTGSNLS 1070

Query: 164  GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
              DL+ + M +  L  ANL  AVL  + L+++ L GA++
Sbjct: 1071 SCDLTGSDMSKCHLERANLLGAVLKGSDLSQARLKGAVL 1109



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 22/115 (19%)

Query: 120  KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
            +++  K+   +  + AD+   DF+G+ F+G+ L +A   ++   G DLS           
Sbjct: 926  RSLKGKDLRNSKLSEADLSHQDFAGADFSGSKLSRANLRQSKLDGCDLS----------- 974

Query: 180  ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-------CKYANGT 227
                N  L R++L  + L GA+I G DFS+A ++ A   A        CK+A  T
Sbjct: 975  ----NCDLSRSILEGASLQGAVIRGTDFSNAKLEGAALPAWVEVDFECCKFAGAT 1025



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 6/107 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANF 162
           S++    ADL +++  + N   AN  S++M  +D S    + A LE+A  Y      AN 
Sbjct: 516 SSSDLQKADLSRSLFYRANLSSANLKSSNMNGADLSHCNLSSACLERASLYGSKLEGANL 575

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
            GAD S   +   +L   NL  A      LT +D  G+ +EGA   D
Sbjct: 576 EGADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPD 622



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 5/88 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNA 185
           N +  D+ +++  G+K  GA L  +   + N + A      L  ++M R  LN+ +  +A
Sbjct: 274 NLSYNDLSDANLEGAKLEGADLSYSNLSQCNLSQASCSRIMLQFSVMTRARLNDGDFGSA 333

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVID 213
            L    LT S L  +  EGADF D+V+D
Sbjct: 334 NLSECDLTHSQLSSSCFEGADFRDSVLD 361



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 51/123 (41%), Gaps = 21/123 (17%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY----------- 158
           A F   DL  A+    N R ANFT A +  +DFSGS   GA +     Y           
Sbjct: 578 ADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGYDLQGVCLSGTS 637

Query: 159 ---------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
                    +AN   ADL    +  + L +A+L+ A L    L  +DL G  + G + S 
Sbjct: 638 GFFKDKSARRANLCDADLRGQELSGVNLQQADLSFADLTGANLQGADLTGTKLNGTNLSQ 697

Query: 210 AVI 212
           + +
Sbjct: 698 SRL 700



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L +A     N   +N +S D+ ++  SG+   GA L  A  +  + +   L
Sbjct: 191 SRADLSEAKLCRADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKLFNCDLSRTSL 250

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            D  + + +L +A L  A L    L+ +DL  A +EGA    A
Sbjct: 251 MDVNLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLEGA 293



 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 15/82 (18%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF  AD+R++DFS +   G  L  A   +AN   AD +          E NLT   L   
Sbjct: 826 NFVGADLRKADFSQAVLKGHDLSAADLSQANLRNADFT----------ECNLTGCNL--- 872

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
             T+S+L G   +GA  S A+I
Sbjct: 873 --TQSNLSGCNFDGAILSGAII 892



 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 1/90 (1%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQF +A+L    ++     R+NF+ +++   DFSG+  N +  E+A   KA F G ++
Sbjct: 386 SDAQFVNANLSNVKLNAARVLRSNFSESNLTACDFSGAVMNDSNFERANLTKARFVGCEM 445

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
            +          A  ++  +    LT  DL
Sbjct: 446 RNASFQHATFASATFSDVKMEGVDLTGCDL 475



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 46/90 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+ T A++ ES+ S    +   L  A    A+ +GA L +  + R  L + NL+ A+L 
Sbjct: 202 RADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKLFNCDLSRTSLMDVNLSKAMLQ 261

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           +  L  + L G  +   D SDA ++ A+ +
Sbjct: 262 QARLQGAQLQGCNLSYNDLSDANLEGAKLE 291



 Score = 37.0 bits (84), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 16/101 (15%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+   A LR A     +F      + NF+  D+ +++ S S  + A L +A   +A+ T 
Sbjct: 148 ARLDRATLRMATLRGSSFVSSSCAQTNFSRCDLSDANLSMSTLSRADLSEAKLCRADLTH 207

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           A+L+          E+NL++  L  T+L+ ++LGGA + GA
Sbjct: 208 ANLT----------ESNLSSCDLSDTILSGANLGGADLSGA 238


>gi|167905147|ref|ZP_02492352.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
 gi|237508538|ref|ZP_04521253.1| pentapeptide repeat family protein [Burkholderia pseudomallei
           MSHR346]
 gi|235000743|gb|EEP50167.1| pentapeptide repeat family protein [Burkholderia pseudomallei
           MSHR346]
          Length = 825

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
 gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
          Length = 567

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 52/106 (49%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 164
           A    A+L KAV V  N R  N + A++  ++   + F+GAYL +A   +AN  G     
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLEGANLKK 477

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+LS   M    L  A+L  A L    L R DL GA + G  F DA
Sbjct: 478 ANLSGANMSHASLRGADLRRATLKDANLKRVDLVGANLAGVTFLDA 523



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 43/77 (55%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           + R ++F+  K   AYL  A  ++AN  G +L    +    L +A L  ++L++  L ++
Sbjct: 354 NFRRANFAALKLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKA 413

Query: 196 DLGGAIIEGADFSDAVI 212
           +L  A +EGA+ + AV+
Sbjct: 414 NLYRASLEGANLTKAVL 430



 Score = 40.4 bits (93), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           +A +   A LR A   + N R      A   +A+++++   GS    A L+KA  Y+A+ 
Sbjct: 361 AALKLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKANLYRASL 420

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            GA+L+  ++    L   NL+ A L  T L  ++  GA +  A  S A ++
Sbjct: 421 EGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLE 471



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 11/111 (9%)

Query: 110 AAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           AA F  A LR+A   + N       +AN + A+M  +   G+    A L+ A   + +  
Sbjct: 452 AANFSGAYLREAKLSRANLEGANLKKANLSGANMSHASLRGADLRRATLKDANLKRVDLV 511

Query: 164 GADLSD-TLMDRMV----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           GA+L+  T +D  +    L  ANL NA L+   L   +L GA ++GA   D
Sbjct: 512 GANLAGVTFLDADLQGANLKGANLKNANLLGANLENVNLQGANLQGAIMPD 562


>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 204

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 61/131 (46%), Gaps = 14/131 (10%)

Query: 96  KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR-----------ANFTSADMRESD 141
           K  A  RG    G+    A F +ADLR A+ +    R           A F + D+   D
Sbjct: 63  KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAIFNNLDLSGID 122

Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
           F G+   G  L KA  ++A  + A+LS   +    L EANL+ AVL  T L  S+L  A 
Sbjct: 123 FRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEANLSGAVLRGTNLQSSNLLCAS 182

Query: 202 IEGADFSDAVI 212
           +E AD +  ++
Sbjct: 183 VEQADLTGTLL 193



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 12/109 (11%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F  A+L+KA  ++ N R A+FT AD+R +DF  +   GA L  A   +A+F GA L+  +
Sbjct: 54  FAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAI 112

Query: 172 MDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + + L+            NL+ A L R  L+ ++L GA +  AD  +A
Sbjct: 113 FNNLDLSGIDFRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEA 161


>gi|354564725|ref|ZP_08983901.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353549851|gb|EHC19290.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 564

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 11/106 (10%)

Query: 109 SAAQFGSADLR-----KAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S      ADLR       +    N R A   +AD+  +  +G+K NGA L  A+   A+ 
Sbjct: 386 SGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGAKLNGANLRSAILLGADL 445

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            G DL+D     ++LNEA+L+  VL    L+ +D+  AI+ G D S
Sbjct: 446 GGVDLTD-----VILNEADLSGVVLNEADLSGADISDAILFGTDLS 486



 Score = 44.3 bits (103), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 57/114 (50%), Gaps = 8/114 (7%)

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
           GEF  GS   F  A L  A     NF A N TSA + +++ +G  F+ A L  A    AN
Sbjct: 227 GEFLQGS--NFSGAYLGDANLTGVNFSAANLTSAYLGDANLTGVNFSAANLNAANLGDAN 284

Query: 162 FTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            +GA+LS      T +    L+ ANL  A L R  L+ +DL  A + GAD S A
Sbjct: 285 LSGANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHA 338



 Score = 43.5 bits (101), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 47/81 (58%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++R +D S +  +GA L  A  Y+A+ + ADLS   +    L+ ANL++A L  
Sbjct: 288 ANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHANLSSANLRD 347

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ S L  AI+  A+ SDA
Sbjct: 348 AELSSSYLSHAILFAANLSDA 368



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 25/108 (23%)

Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF-----TGADLSDTLMDRMVLNE 179
           AN + AD+  +D SG+  N     G+ L   + +  N        ADLS   ++   LN 
Sbjct: 373 ANLSYADLCRADLSGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGAKLNG 432

Query: 180 ANLTNAVLV----------RTVLTRSDLGGAIIE-----GADFSDAVI 212
           ANL +A+L+            +L  +DL G ++      GAD SDA++
Sbjct: 433 ANLRSAILLGADLGGVDLTDVILNEADLSGVVLNEADLSGADISDAIL 480



 Score = 37.0 bits (84), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 53/109 (48%), Gaps = 14/109 (12%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADL         +RA+ + AD+  ++ SG+  + A L  A    A  + + LS
Sbjct: 306 SGANLAGADL---------YRADLSHADLSSANLSGADLSHANLSSANLRDAELSSSYLS 356

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 212
             ++    L++ANL +A L    L R+DL G     A + G++ SD ++
Sbjct: 357 HAILFAANLSDANLNSANLSYADLCRADLSGTNLNHADLRGSNLSDTIL 405


>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 403

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 11/117 (9%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           A+ RG F   S A    ADLR+A   +    AN + AD+ E++ SG+   GA L  A+ +
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAFLSE----ANLSGADLSEANLSGADLRGAILSGAILW 296

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 210
            AN  GA LS   +   +L+ ANL  A L         L+ ++L GAI+  AD  +A
Sbjct: 297 GANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 353



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 10/103 (9%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           ++KA  +K           +++ D SG+   GA L  A   +AN  GADL      R  L
Sbjct: 211 IKKAELIKAIREGTIDKTTLQQVDLSGAILRGADLRGAFLSEANLKGADLR-----RAFL 265

Query: 178 NEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDLA 215
           +EANL+ A L    L+ +DL      GAI+ GA+   A + LA
Sbjct: 266 SEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSLA 308



 Score = 37.0 bits (84), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 10/78 (12%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           AD+R +  S +   GA L +A   +AN +GADLS          EANL+ A L   +L+ 
Sbjct: 243 ADLRGAFLSEANLKGADLRRAFLSEANLSGADLS----------EANLSGADLRGAILSG 292

Query: 195 SDLGGAIIEGADFSDAVI 212
           + L GA ++GA  S A +
Sbjct: 293 AILWGANLKGAGLSLAFL 310


>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 405

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 11/117 (9%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           A+ RG F   S A    ADLR+A   +    AN + AD+ E++ SG+   GA L  A+ +
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAFLSE----ANLSGADLSEANLSGADLRGAILSGAILW 298

Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 210
            AN  GA LS   +   +L+ ANL  A L         L+ ++L GAI+  AD  +A
Sbjct: 299 GANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 355



 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 10/103 (9%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           ++KA  +K           +++ D SG+   GA L  A   +AN  GADL      R  L
Sbjct: 213 IKKAELIKAIREGTIDKTTLQQVDLSGAILRGADLRGAFLSEANLKGADLR-----RAFL 267

Query: 178 NEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDLA 215
           +EANL+ A L    L+ +DL      GAI+ GA+   A + LA
Sbjct: 268 SEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSLA 310



 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 10/78 (12%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           AD+R +  S +   GA L +A   +AN +GADLS          EANL+ A L   +L+ 
Sbjct: 245 ADLRGAFLSEANLKGADLRRAFLSEANLSGADLS----------EANLSGADLRGAILSG 294

Query: 195 SDLGGAIIEGADFSDAVI 212
           + L GA ++GA  S A +
Sbjct: 295 AILWGANLKGAGLSLAFL 312


>gi|152980852|ref|YP_001353914.1| pentapeptide repeat-containing protein [Janthinobacterium sp.
           Marseille]
 gi|151280929|gb|ABR89339.1| Uncharacterized conserved protein, pentapeptide repeat family
           [Janthinobacterium sp. Marseille]
          Length = 243

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 6/116 (5%)

Query: 96  KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK 154
           +++ E         AA    A+LR A     N R AN   AD+R++D SG+    A L  
Sbjct: 16  EHDIEDNTMLATVKAALAAGANLRDADLSGANLRGANLRDADLRDADLSGANLRDADLSG 75

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A    A+ +GA+LSD       L+ ANL+ A L    L  ++LGGA + GAD S A
Sbjct: 76  ANLRDADLSGANLSDA-----DLSGANLSGADLSGANLGGANLGGANLSGADLSGA 126



 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 6/105 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 164
           A    A+LR A     + R A+ + A++R++D SG+       +GA L  A    AN +G
Sbjct: 41  ADLSGANLRGANLRDADLRDADLSGANLRDADLSGANLRDADLSGANLSDADLSGANLSG 100

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           ADLS   +    L  ANL+ A L    L+ ++L GA + GA+  D
Sbjct: 101 ADLSGANLGGANLGGANLSGADLSGANLSGANLRGANLSGANLRD 145



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 6/101 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+LR A     N R      AN + AD+  ++ SG+  +GA L  A    AN +G
Sbjct: 61  ADLSGANLRDADLSGANLRDADLSGANLSDADLSGANLSGADLSGANLGGANLGGANLSG 120

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           ADLS   +    L  ANL+ A L    +   D+  A+ E A
Sbjct: 121 ADLSGANLSGANLRGANLSGANLRDYPVKIKDIHKAVYEAA 161


>gi|126442493|ref|YP_001061349.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           668]
 gi|126221984|gb|ABN85489.1| pentapeptide repeat protein [Burkholderia pseudomallei 668]
          Length = 825

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|113474166|ref|YP_720227.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110165214|gb|ABG49754.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 1033

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 54/184 (29%), Positives = 84/184 (45%), Gaps = 18/184 (9%)

Query: 56  NQCAGPYAKLKNWRVFVSTA------LAAAVVASCSSNISALADLNKYEAETRGEFGIGS 109
           +QC G  A  +    F+S A      L+ A +   +   + L+      A+  G + IG+
Sbjct: 816 SQCLGVGAFWETVGQFLSGADLRYADLSGAYLIVANLRYADLSGAYLISADLSGAYLIGA 875

Query: 110 ---AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
               A    ADLR A    +   AN + A +  ++ S +K +GA L  A    A+ +GAD
Sbjct: 876 NLIGADLSRADLRYA----DLSGANLSDAKLSGANLSDAKLSGAGLSGADLRYADLSGAD 931

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALC 221
           LS   +    L+ ANL+ A L    L  +DL GA +      GAD SDA +   +  +  
Sbjct: 932 LSRAKLSDAGLSGANLSVAGLSGADLRYADLSGADLRYADLSGADLSDANLSNVRWNSQT 991

Query: 222 KYAN 225
           K++N
Sbjct: 992 KWSN 995


>gi|326506328|dbj|BAJ86482.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 181

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 61/116 (52%), Gaps = 15/116 (12%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---D 173
           D      +K++F+ +     +R+++F G+   GA       + A+ TGADLSD  +   D
Sbjct: 75  DFSGQTLIKQDFKTSI----LRQTNFKGANLLGASF-----FDADLTGADLSDADLRNAD 125

Query: 174 RMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
             + N  + NLTNA L   ++T  +   G+ I GADF+D  +   Q+  LCK A+G
Sbjct: 126 FSLANVTKVNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIADG 181


>gi|163760882|ref|ZP_02167961.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
 gi|162281926|gb|EDQ32218.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
          Length = 239

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 4/101 (3%)

Query: 115 SADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           S DLR++  ++    AN   A +  +  +GSK  GA  ++  AY+A+F+  D +      
Sbjct: 71  STDLRESNLIE----ANLEKATLFRASLAGSKATGARFDRIEAYRADFSNLDATGASFGS 126

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
             +  A L N++L  T  T++DLG A  +GAD S +   LA
Sbjct: 127 AEMQRAKLNNSMLANTDFTKADLGRAQFDGADISGSRFSLA 167


>gi|334120837|ref|ZP_08494914.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333455836|gb|EGK84476.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 197

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 59/111 (53%), Gaps = 7/111 (6%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF----- 162
           S A    A+L++AV ++ N R A+ + AD+R +DF  +   GA    A+   A+F     
Sbjct: 42  SGANLAGANLQRAV-LRANLRGADLSGADLRGADFRNADLRGASFANALVRDASFGGAFL 100

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           TGA + +  +  + L  A+L  A L R +L  +DL  A + GAD S A ++
Sbjct: 101 TGASIGNLDLSGVDLRGADLRGAALARAILHSADLSNANLSGADLSGADLE 151



 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 54/114 (47%), Gaps = 21/114 (18%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 159
           A F +ADLR A       R A+F  A          D+   D  G+   GA L +A+ + 
Sbjct: 73  ADFRNADLRGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHS 132

Query: 160 ANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           A+ +          GADL + +++  VL  ANLT A L+  ++ ++   GA+++
Sbjct: 133 ADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCAMIEQTLWDGALLD 186



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 45/84 (53%), Gaps = 9/84 (10%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGADLSDT 170
           ADLR A       RA   SAD+  ++ SG+  +GA LE+     AV   AN TGA+L   
Sbjct: 118 ADLRGAALA----RAILHSADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCA 173

Query: 171 LMDRMVLNEANLTNAVLVRTVLTR 194
           ++++ + + A L  A L  T L+R
Sbjct: 174 MIEQTLWDGALLDRACLQGTPLSR 197


>gi|443475216|ref|ZP_21065173.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443020003|gb|ELS34017.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 352

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A    A + ++DF+G   NGA +    A +    GADL+D  + R  L+ ANL  A++VR
Sbjct: 243 AKLERAILIDADFNGVTLNGAIMADIKASRVQMQGADLTDAKLSRADLSRANLKGAIMVR 302

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             L  + L    +  AD +DA+++ A+
Sbjct: 303 ANLIEAYLARTNLADADLTDAILNRAE 329



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 60/120 (50%), Gaps = 7/120 (5%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF-----NGAYLE 153
           A  R E  +G   +   +  RK   +     AN + AD+R ++ SG+        GA L+
Sbjct: 144 ANLRQERAVGDRDEIDVS--RKKRSIASLIGANLSGADLRGANLSGADLYKADLRGANLQ 201

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           +A    AN + A L++  +  + L EANL++A LV  VL  + L  AI+  ADF+   ++
Sbjct: 202 EATLSGANLSEAKLNNAYLQGVFLTEANLSSASLVGAVLNNAKLERAILIDADFNGVTLN 261



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 69/137 (50%), Gaps = 10/137 (7%)

Query: 88  ISALADLNKYEAETR----GEFGIGSAAQFGSADLR--KAVHVKENF---RANFTSADMR 138
           ++ L D N  +A+ R    G   +G A   G A+LR  +AV  ++     R   + A + 
Sbjct: 113 LANLMDANLIDADMRTINLGGANLGGACMRG-ANLRQERAVGDRDEIDVSRKKRSIASLI 171

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
            ++ SG+   GA L  A  YKA+  GA+L +  +    L+EA L NA L    LT ++L 
Sbjct: 172 GANLSGADLRGANLSGADLYKADLRGANLQEATLSGANLSEAKLNNAYLQGVFLTEANLS 231

Query: 199 GAIIEGADFSDAVIDLA 215
            A + GA  ++A ++ A
Sbjct: 232 SASLVGAVLNNAKLERA 248



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 5/84 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R     AD+ ++  S +  + A L+ A+  +AN   A L+     R  L +A+LT+A+L 
Sbjct: 272 RVQMQGADLTDAKLSRADLSRANLKGAIMVRANLIEAYLA-----RTNLADADLTDAILN 326

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
           R  L+ ++L GAI++GA   D  +
Sbjct: 327 RAELSSANLVGAILKGATLPDGKV 350


>gi|418744036|ref|ZP_13300395.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           CBC379]
 gi|418751631|ref|ZP_13307915.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           MOR084]
 gi|409968104|gb|EKO35917.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           MOR084]
 gi|410795431|gb|EKR93328.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           CBC379]
          Length = 263

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 4/92 (4%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + +S  + + +F G  F+GA L  A    ++F GA+ S   +    LN ANL N      
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNANLRNTNFRGA 210

Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
            L  + L GA +EGADF+DA+ D    L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242


>gi|334117106|ref|ZP_08491198.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333461926|gb|EGK90531.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 520

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 61/121 (50%), Gaps = 2/121 (1%)

Query: 94  LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY 151
           L KY A  R   GI  + A     +L  A     N   AN + A++ +++  G+K N A 
Sbjct: 7   LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIAR 66

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
           L  A    A+ T ADL+   + R+ L +A L  A L+R  L R++L GA + GA+ S A 
Sbjct: 67  LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126

Query: 212 I 212
           +
Sbjct: 127 L 127



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 1/95 (1%)

Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DL+KA+ +     RA    A++  ++ SG+  +GA L +A    AN   A+L    +   
Sbjct: 91  DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L EANL  A L    L+R+DL GA + G +   A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L  A   +   R AN   A++R +  SG+    A LE+A    A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 210
           S   +    L +ANLT AVL    L+  +L  AI+ G     AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220



 Score = 40.8 bits (94), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 4/123 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADLR      E  +AN T A +  +D SG     A L       A+ + A LS
Sbjct: 168 SRADLSGADLRGT----ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKLS 223

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
              + R  L  ANL NA LV   LT + L  A   GAD + A +  A+  A+ +    T 
Sbjct: 224 GADLSRADLCHANLLNASLVHADLTNAYLIRADWIGADLTGATLTGAKLHAVSRLGIKTE 283

Query: 229 PIT 231
            +T
Sbjct: 284 GMT 286



 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A  G ADL  A ++V     A     D++++   G+K   A L +A    AN +GA+L
Sbjct: 68  SGAHLGGADLTDADLNV-----AYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122

Query: 168 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   +    L      +ANL  A L    LT ++L  A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170



 Score = 37.4 bits (85), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 15/85 (17%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF   ++ E++ SG   +GA L+ A    AN +GA+LS T +    LN A L+       
Sbjct: 16  NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIARLS------- 68

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
                   GA + GAD +DA +++A
Sbjct: 69  --------GAHLGGADLTDADLNVA 85


>gi|300866933|ref|ZP_07111605.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335037|emb|CBN56767.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 253

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 9/118 (7%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           N+ + E   E G  S      A+LR A         N   AD+R ++  G+   GA L  
Sbjct: 26  NRRDVEKLKETGQCSRCDLRDANLRNA---------NLQGADLRNANLRGANLRGAALRN 76

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A    A+  GADL D  + R  L  ANL++A L    L R+++ G   +G D   A +
Sbjct: 77  ADLSNADLRGADLRDADLSRSNLRNANLSDANLRNADLERAEVRGVNFQGTDLRGANV 134


>gi|226365701|ref|YP_002783484.1| hypothetical protein ROP_62920 [Rhodococcus opacus B4]
 gi|226244191|dbj|BAH54539.1| hypothetical protein [Rhodococcus opacus B4]
          Length = 201

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 61/131 (46%), Gaps = 16/131 (12%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
           +E R E  I +   F  ADL ++ HV   FR+ +FT   +  S+F      GS+F+   L
Sbjct: 38  SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97

Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
              V  + +FT     GADL           EANL    L R VL  +DL     GGA +
Sbjct: 98  RPMVFDECDFTLVSLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKL 157

Query: 203 EGADFSDAVID 213
           +GAD   A +D
Sbjct: 158 DGADLRGAHVD 168


>gi|73621284|gb|AAZ78338.1| OxyO [Streptomyces rimosus]
          Length = 353

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 6/86 (6%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADLR+A   + N R      AN   AD+R +D  G    G  L  AV Y+A   G
Sbjct: 231 ADLREADLREATPARANLRDADLSDANVRKADLRFADLRGVDLWGTDLRGAVLYRAKLAG 290

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRT 190
            +LS+  +D   L  A+LT+A + R+
Sbjct: 291 LELSEAHLDGADLRGADLTDAAVARS 316



 Score = 41.6 bits (96), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 1/93 (1%)

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           A H +   R A     D+R++  SG+   GA L +A    A+   ADL +    R  L +
Sbjct: 191 ADHKRAQLRGAILRDCDLRDARLSGADLRGARLARADLADADLREADLREATPARANLRD 250

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+L++A + +  L  +DL G  + G D   AV+
Sbjct: 251 ADLSDANVRKADLRFADLRGVDLWGTDLRGAVL 283


>gi|162453209|ref|YP_001615576.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
 gi|161163791|emb|CAN95096.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
          Length = 890

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + T AD R  D  G +F  A+LE A    A+ +GA L   ++ +  L+ ANLT A L   
Sbjct: 575 DLTGADFRGVDLRGMRFARAFLEGADLRGADLSGAVLEGAVLAKADLSGANLTGARLRGA 634

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L +++L GA+ + AD ++AV+
Sbjct: 635 NLGKANLEGAVFDDADLTEAVL 656



 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + A F   DLR            F  A +  +D  G+  +GA LE AV  KA+ +GA+L+
Sbjct: 577 TGADFRGVDLRGM---------RFARAFLEGADLRGADLSGAVLEGAVLAKADLSGANLT 627

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
              +    L +ANL  AV     LT + L GA + GA    A ++ A
Sbjct: 628 GARLRGANLGKANLEGAVFDDADLTEAVLMGARLAGASLKRAKLERA 674



 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 38/84 (45%), Gaps = 5/84 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A    A + ++D SG+   GA L  A   KAN  GA   D  +   VL  A L  A L R
Sbjct: 609 AVLEGAVLAKADLSGANLTGARLRGANLGKANLEGAVFDDADLTEAVLMGARLAGASLKR 668

Query: 190 TVLTRSD-----LGGAIIEGADFS 208
             L R+D      GG  + GAD S
Sbjct: 669 AKLERADALQVSWGGVDLSGADLS 692



 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 17/123 (13%)

Query: 108 GSAAQFGSADLRK--AVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G+ A+F  A   +  AVH      A+F  A + ++ F  +   GA  ++A     + + A
Sbjct: 741 GAKARFAGARFSEGVAVHKSGLPEADFRDAVLDKTCFRTTDLRGARFDRAQMTMTDLSEA 800

Query: 166 DLSDTLMDRMVLNEA---------------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           D +D   DR V+  A               NLT A+L ++ L  +D  GA +  ADFS A
Sbjct: 801 DATDATFDRAVMKNALLIRTNLDRASLRGCNLTEAILSKSRLAGADFTGAQLCRADFSRA 860

Query: 211 VID 213
             D
Sbjct: 861 RGD 863



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTL-MDRMVLNEANLT 183
           AN   A + E    G+ F+GA L K         KA F GA  S+ + + +  L EA+  
Sbjct: 709 ANLERAMLLECSLDGTDFSGARLHKTSLMSCTGAKARFAGARFSEGVAVHKSGLPEADFR 768

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           +AVL +T    +DL GA  + A  +  + DL++  A
Sbjct: 769 DAVLDKTCFRTTDLRGARFDRAQMT--MTDLSEADA 802


>gi|428311554|ref|YP_007122531.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428253166|gb|AFZ19125.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 411

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 70/137 (51%), Gaps = 18/137 (13%)

Query: 77  AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSAD 136
           A  +VAS S+  + L D N ++A   G       A+ G A+LR A    +   AN + A+
Sbjct: 65  ADLIVASLSA--ADLRDANLHDANLIG-------AKLGVANLRDA----DLSGANLSGAE 111

Query: 137 MRESDFSGSKFNGAY-----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           +  +D + S  NGAY     L KA   +AN  GA+LS T M    L+ ANL  A L    
Sbjct: 112 LSCTDLTCSNLNGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGAN 171

Query: 192 LTRSDLGGAIIEGADFS 208
           L  +DLGGA ++GA  S
Sbjct: 172 LIEADLGGANLQGAKLS 188



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 1/102 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    A+L KA   + N + AN +  +M  +D SG+   GA L  A   +A+  GA+L
Sbjct: 123 NGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGANLIEADLGGANL 182

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
               + R  L   NL N+ L    L+ S+L G  +  AD  +
Sbjct: 183 QGAKLSRSNLAYVNLANSDLSNADLSDSNLAGTNLTNADLDN 224



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 66/154 (42%), Gaps = 18/154 (11%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNG 149
           L +Y A  R   G    A    ADLR    + + + +   AN +  D+ ++D   +   G
Sbjct: 7   LERYAAGERCLRG----ADLHGADLRGVDLRGIDLSD---ANLSDTDLSDADLRDADLIG 59

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A L  A    A+ + ADL D       L++ANL  A L    L  +DL GA + GA+ S 
Sbjct: 60  ANLRGADLIVASLSAADLRDA-----NLHDANLIGAKLGVANLRDADLSGANLSGAELS- 113

Query: 210 AVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
              DL        Y +G N I    +R +L   N
Sbjct: 114 -CTDLTCSNLNGAYISGANLIKAKLSRANLQGAN 146


>gi|428319993|ref|YP_007117875.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428243673|gb|AFZ09459.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 146

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 46/83 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   AD+RE++  G+  + A LE A    AN TGA+L+   +    LN+A+L  A L  
Sbjct: 51  AHLIGADLREANLQGANLSSANLEGADLTGANLTGANLTQVFLTNASLNDADLDRANLTA 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
            ++  +D+ GA ++    +DA I
Sbjct: 111 AIINTADVSGASMQDMTITDAKI 133



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 4/102 (3%)

Query: 114 GSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           G A      HVK+      ++    + D SG+   GA+L  A   +AN  GA+LS   ++
Sbjct: 19  GPARAENPAHVKQLL----STGQCFQCDLSGADLIGAHLIGADLREANLQGANLSSANLE 74

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
              L  ANLT A L +  LT + L  A ++ A+ + A+I+ A
Sbjct: 75  GADLTGANLTGANLTQVFLTNASLNDADLDRANLTAAIINTA 116



 Score = 40.4 bits (93), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 47/95 (49%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           A +++ +   + F+ + + AD+  +   G+    A L+ A    AN  GADL+   +   
Sbjct: 27  AHVKQLLSTGQCFQCDLSGADLIGAHLIGADLREANLQGANLSSANLEGADLTGANLTGA 86

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L +  LTNA L    L R++L  AII  AD S A
Sbjct: 87  NLTQVFLTNASLNDADLDRANLTAAIINTADVSGA 121


>gi|428211266|ref|YP_007084410.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|427999647|gb|AFY80490.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 279

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 68/129 (52%), Gaps = 14/129 (10%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           A+LR+A  +     AN + A++ E+D S +  + A +E+A   +A    A  S+T +   
Sbjct: 70  ANLREANLIN----ANLSKANLSEADLSLANISRAIVERANLERAKLVQALASETRLGWA 125

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP------ 229
            L EA +  A L R  L+ +DL GA +EGA+ + A++     QA+ +  N TN       
Sbjct: 126 NLKEATMNQANLSRANLSEADLTGANLEGANLTIAIL----IQAIMEKVNLTNATLNGAN 181

Query: 230 ITGVSTRKS 238
           +TGV+ R S
Sbjct: 182 LTGVNLRDS 190



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 18/113 (15%)

Query: 107 IGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           + S  + G A+L++A   + N  RAN + AD+  ++  G+    A L +A+  K N T A
Sbjct: 116 LASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAILIQAIMEKVNLTNA 175

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
                      LN ANLT        L  SDL  A + G++ + A  DL + Q
Sbjct: 176 ----------TLNGANLTG-----VNLRDSDLSRANMSGSNLAGA--DLTKSQ 211



 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 15/83 (18%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T + +R ++ S +   G  LE A  Y+AN   ++LS           ANLTNA+L+ 
Sbjct: 205 ADLTKSQLRGTNVSWTTMRGTNLEGASLYRANLGWSNLSG----------ANLTNAILMD 254

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
           T L R++L        DF+ A++
Sbjct: 255 TNLYRTNL-----RDVDFTGAIM 272


>gi|16329465|ref|NP_440193.1| hypothetical protein slr0967 [Synechocystis sp. PCC 6803]
 gi|383321206|ref|YP_005382059.1| hypothetical protein SYNGTI_0297 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383324376|ref|YP_005385229.1| hypothetical protein SYNPCCP_0297 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383490260|ref|YP_005407936.1| hypothetical protein SYNPCCN_0297 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384435526|ref|YP_005650250.1| hypothetical protein SYNGTS_0297 [Synechocystis sp. PCC 6803]
 gi|451813624|ref|YP_007450076.1| hypothetical protein MYO_13000 [Synechocystis sp. PCC 6803]
 gi|1651947|dbj|BAA16873.1| slr0967 [Synechocystis sp. PCC 6803]
 gi|339272558|dbj|BAK49045.1| hypothetical protein SYNGTS_0297 [Synechocystis sp. PCC 6803]
 gi|359270525|dbj|BAL28044.1| hypothetical protein SYNGTI_0297 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359273696|dbj|BAL31214.1| hypothetical protein SYNPCCN_0297 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359276866|dbj|BAL34383.1| hypothetical protein SYNPCCP_0297 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407957344|dbj|BAM50584.1| hypothetical protein BEST7613_1653 [Synechocystis sp. PCC 6803]
 gi|451779593|gb|AGF50562.1| hypothetical protein MYO_13000 [Synechocystis sp. PCC 6803]
          Length = 150

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 43/83 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   AD+R ++ SG+  N A LE A    AN   ADL   ++    LN ANLT+A    
Sbjct: 56  AHLIGADLRNANLSGTNLNEANLEGADLTGANLQNADLRGAMVTNATLNRANLTSANFAF 115

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L   D+ GA +EG +  +A I
Sbjct: 116 AKLYDVDVTGATVEGMNIQNAEI 138


>gi|428307821|ref|YP_007144646.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428249356|gb|AFZ15136.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 263

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 23/107 (21%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKA----VHVKEN--FRANFTSADMRESD--------- 141
           YEAE  G       A F  ADL KA     H+ E   F AN + A+++++D         
Sbjct: 157 YEAELIG-------AYFYKADLFKANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKA 209

Query: 142 -FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            F+G+   GA L  A   KANFTGA+L+D  +D + L++ANL  A++
Sbjct: 210 NFTGANLVGANLRGANLSKANFTGANLTDANLDTVNLHKANLEGAIM 256



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN----------GAYLEKAVAYK 159
           A    A+L+ A  ++ N R A+F +A++  ++ S S  N          GAY  KA  +K
Sbjct: 114 ANLMGANLKGADLIEANMRGADFINANLMSANLSNSFLNYAKFYEAELIGAYFYKADLFK 173

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN + A L +  +    L++A L  A L  T L++++  GA + GA+   A
Sbjct: 174 ANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKANFTGANLVGANLRGA 224


>gi|30696344|ref|NP_851183.1| thylakoid lumenal protein [Arabidopsis thaliana]
 gi|38503418|sp|P81760.2|TL17_ARATH RecName: Full=Thylakoid lumenal 17.4 kDa protein, chloroplastic;
           AltName: Full=P17.4; Flags: Precursor
 gi|13899115|gb|AAK48979.1|AF370552_1 thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
           [Arabidopsis thaliana]
 gi|9759188|dbj|BAB09725.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
           [Arabidopsis thaliana]
 gi|28059599|gb|AAO30073.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
           [Arabidopsis thaliana]
 gi|332008985|gb|AED96368.1| thylakoid lumenal protein [Arabidopsis thaliana]
          Length = 236

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++A M  + F G+      + KA A +A+F G + ++ ++DR+   ++NL  AV   TV
Sbjct: 131 LSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTV 190

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S    A +E   F D +I     Q +C+     N       R  LGC
Sbjct: 191 LSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235


>gi|86608719|ref|YP_477481.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86557261|gb|ABD02218.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 207

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 66/132 (50%), Gaps = 15/132 (11%)

Query: 111 AQFGSADLRKAVHVKENFRANFTS------ADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+L++A+  + N +A   S      AD+R ++ SGS   GA+L +A   +AN   
Sbjct: 68  ADLSGANLKEAILRQANLQAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQ 127

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
            DL+        L  ANL+ A L R +LTR+DL GA +  AD   A  DL  + A  + A
Sbjct: 128 TDLTGA-----NLQVANLSGADLRRAILTRADLTGAKLHNADLRGA--DL--RGAFLEGA 178

Query: 225 NGTNPITGVSTR 236
           + T  +    TR
Sbjct: 179 DLTGALYNAQTR 190



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 55/116 (47%), Gaps = 8/116 (6%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYK 159
            GI +AA F   +L   +   +     F S D     +  ++  G+  +GA L++A+  +
Sbjct: 26  LGIPTAAAFAQLELDAQLGRSQIV---FPSKDCPACNLTGAELPGADLSGANLKEAILRQ 82

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           AN   ADLS  +++   L  ANL+ +      L  +DL  A ++  D + A + +A
Sbjct: 83  ANLQAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQTDLTGANLQVA 138


>gi|332704952|ref|ZP_08425038.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
 gi|332356304|gb|EGJ35758.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
          Length = 544

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 48/81 (59%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + AD  +++ SG+  + A L +A   +AN +GA+LSD  +    L  ANL+NA    
Sbjct: 266 ADLSGADFNDANLSGADLSSANLIRANLIRANLSGANLSDVKVIGGNLGNANLSNANFSS 325

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L R++L GA + GAD S+A
Sbjct: 326 AKLIRANLSGADLSGADLSNA 346



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 55/119 (46%), Gaps = 24/119 (20%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 163
           S A F SA L          RAN + AD+  +D S + F+GA L  A    AN +     
Sbjct: 319 SNANFSSAKL---------IRANLSGADLSGADLSNANFSGASLYSANLSNANLSSANLR 369

Query: 164 ----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     GADL  T +    L+ ANL+NA L+ + L  ++L GA + GA+   A +
Sbjct: 370 GTELSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASL 428



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 61/145 (42%), Gaps = 22/145 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    A+L  A  +  N R      AN + A++R +    +  +GA L  A  Y AN 
Sbjct: 389 SGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASLYSANLSGANLRGASLYSANL 448

Query: 163 TGADLSD---------------TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +GA+LS                T      L+ ANL  A L R  L  +DL  A + GAD 
Sbjct: 449 SGANLSGANLSLANLCPMRVSGTDFSAANLSGANLGGAYLYRADLKDTDLSSANLTGADL 508

Query: 208 SDAVIDLAQ-KQALCKYANGTNPIT 231
           S A ++ A  K A   Y  G +  T
Sbjct: 509 SSANLNGADVKNARFGYIVGIDEST 533



 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 53/108 (49%), Gaps = 11/108 (10%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRE----------SDFSGSKFNGAYLEKAVAYK 159
           A    ADL  A  ++ N  RAN + A++ +          ++ S + F+ A L +A    
Sbjct: 276 ANLSGADLSSANLIRANLIRANLSGANLSDVKVIGGNLGNANLSNANFSSAKLIRANLSG 335

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           A+ +GADLS+       L  ANL+NA L    L  ++L GA + GAD 
Sbjct: 336 ADLSGADLSNANFSGASLYSANLSNANLSSANLRGTELSGANLSGADL 383



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 11/111 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN- 161
           S A   SA+L  A     N R      AN + AD+R +  SG+  +GA L  A    +N 
Sbjct: 349 SGASLYSANLSNANLSSANLRGTELSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNL 408

Query: 162 ----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
                +GA+LS   +    L  ANL+ A L    L  ++L GA + GA+ S
Sbjct: 409 RGTELSGANLSGANLRGASLYSANLSGANLRGASLYSANLSGANLSGANLS 459


>gi|304414054|ref|ZP_07395422.1| pentapeptide repeat-containing protein [Candidatus Regiella
           insecticola LSR1]
 gi|304283268|gb|EFL91664.1| pentapeptide repeat-containing protein [Candidatus Regiella
           insecticola LSR1]
          Length = 283

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 57/128 (44%), Gaps = 23/128 (17%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF----------------------SGS 145
           S A   +ADLR A     N + A    ADMRE D                       SG+
Sbjct: 122 SNATLSNADLRGAYMSWANLQNATLNDADMREVDLVGADMREAKLIGKKTNLEGANLSGA 181

Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
              GA L   +  KA  + ADLS   ++R+ L EANL +A+L  T L  + L  A +E  
Sbjct: 182 DLRGAELCHTILIKAALSWADLSYAKLERVNLREANLYHAILEETSLYLTKLENANLESV 241

Query: 206 DFSDAVID 213
           +  DAV++
Sbjct: 242 NLKDAVLE 249


>gi|30696347|ref|NP_200161.2| thylakoid lumenal protein [Arabidopsis thaliana]
 gi|332008984|gb|AED96367.1| thylakoid lumenal protein [Arabidopsis thaliana]
          Length = 235

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++A M  + F G+      + KA A +A+F G + ++ ++DR+   ++NL  AV   TV
Sbjct: 130 LSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTV 189

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S    A +E   F D +I     Q +C+     N       R  LGC
Sbjct: 190 LSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 234


>gi|282898711|ref|ZP_06306699.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
           CS-505]
 gi|281196579|gb|EFA71488.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
           CS-505]
          Length = 682

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQ   ADL  A   + +   +  + A++ ++++ G+  + +YL  A    ANF+ A+L
Sbjct: 529 SGAQLQEADLYAAQLARVSAIGSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANL 588

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           S       +L  AN+TN  L    ++R+DL GA +EG DF  A++
Sbjct: 589 SGA-----ILRYANMTNTNLRSADISRADLRGANLEGTDFQGAIL 628



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 29/108 (26%)

Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM-- 175
           F SA++ +S F GS+F                A L KA   ++N + A+LS  LM R+  
Sbjct: 424 FKSANLNQSSFKGSRFRSVGEDGRWDTYDDIIADLSKAQLKRSNLSNANLSRVLMSRVDL 483

Query: 176 ---VLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              VLN          +ANL++A LV + L ++ L  A++ GAD S A
Sbjct: 484 SRSVLNRANLASSKLIDANLSSAQLVGSDLQQATLQDAVLTGADISGA 531



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN  S+ + +++ S ++  G+ L++A    A  TGAD+S   +    L  A L     +
Sbjct: 490 RANLASSKLIDANLSSAQLVGSDLQQATLQDAVLTGADISGAQLQEADLYAAQLARVSAI 549

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------ALCKYANGTN 228
            + L+ ++L     +GAD S++ ++ A              A+ +YAN TN
Sbjct: 550 GSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANLSGAILRYANMTN 600


>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 377

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT- 163
           A    A+L  A+ VK +       RAN T AD+RE+D SG++   A L KA   KAN + 
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSL 199

Query: 164 ----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                A+L +  ++  +  EANL NA L ++ L  ++L  A +  A+ S A
Sbjct: 200 ANLDSANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKA 250



 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 15/102 (14%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM----------DRMVLN 178
           R+N   A++ E+D SG+      L  A+   AN +  DLS + +          DR  L 
Sbjct: 84  RSNLVRANLYEADLSGASLVNINLSNAICASANLSHVDLSQSNLSSTNLSLANLDRADLT 143

Query: 179 EANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDAVIDLA 215
           +ANL+ A+LV+      +L R++L  A +  AD S A + LA
Sbjct: 144 QANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLA 185



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 71/154 (46%), Gaps = 18/154 (11%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTS 134
           L+AA++   S     L   N  EA+ R E  + S AQ   A L KA   K N   AN  S
Sbjct: 147 LSAAILVKASLKQVILNRANLTEADLR-EADL-SGAQLYLAVLSKANLAKANLSLANLDS 204

Query: 135 ADMRESDFSGSKFNGAYLE---------------KAVAYKANFTGADLSDTLMDRMVLNE 179
           A++ E+   GS F  A LE               KA   KAN + A+L+  ++ +  L  
Sbjct: 205 ANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKANLTSAILSQANLLG 264

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           ANL  A L +  L  SD  GA ++G + S A ++
Sbjct: 265 ANLAGASLAKANLAESDCFGANLQGTNLSQANVE 298



 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 11/143 (7%)

Query: 73  STALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANF 132
           ST L  A + S   + S L   N YEA+  G       A   + +L  A+       AN 
Sbjct: 69  STDLVRANLRSARLDRSNLVRANLYEADLSG-------ASLVNINLSNAICAS----ANL 117

Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
           +  D+ +S+ S +  + A L++A   +AN + A L    + +++LN ANLT A L    L
Sbjct: 118 SHVDLSQSNLSSTNLSLANLDRADLTQANLSAAILVKASLKQVILNRANLTEADLREADL 177

Query: 193 TRSDLGGAIIEGADFSDAVIDLA 215
           + + L  A++  A+ + A + LA
Sbjct: 178 SGAQLYLAVLSKANLAKANLSLA 200



 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 31/129 (24%), Positives = 65/129 (50%), Gaps = 12/129 (9%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           AQ    +L++A   + N +      AN +S D+  ++   ++ + + L +A  Y+A+ +G
Sbjct: 40  AQLAGINLKQANLFRANLQNAVLAIANLSSTDLVRANLRSARLDRSNLVRANLYEADLSG 99

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA------VIDLAQKQ 218
           A L +  +   +   ANL++  L ++ L+ ++L  A ++ AD + A      ++  + KQ
Sbjct: 100 ASLVNINLSNAICASANLSHVDLSQSNLSSTNLSLANLDRADLTQANLSAAILVKASLKQ 159

Query: 219 ALCKYANGT 227
            +   AN T
Sbjct: 160 VILNRANLT 168



 Score = 37.0 bits (84), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 14/105 (13%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFTGA 165
           A    A+LRKA   K    AN TSA + +++  G+   GA L KA       + AN  G 
Sbjct: 235 ANLTKANLRKANLSK----ANLTSAILSQANLLGANLAGASLAKANLAESDCFGANLQGT 290

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +LS   ++ + L E++L  A LV      ++L GA + GA+  DA
Sbjct: 291 NLSQANVEAVDLRESDLAKANLV-----GANLAGANLFGAELLDA 330


>gi|421082377|ref|ZP_15543263.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
 gi|401702907|gb|EJS93144.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
          Length = 846

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 43/160 (26%), Positives = 74/160 (46%), Gaps = 12/160 (7%)

Query: 78  AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADM 137
            A++ SCS  +   A+  ++   T     + S +    AD  +A   + N R     A +
Sbjct: 687 GALLDSCSW-VETQANEARFVGATWLTSAVASGSSMNGADFTQATLRQSNLR----QASL 741

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
             + F+ +K   + L +A   + NF  A+L+ +L  R    EAN T+A L+  +L +S L
Sbjct: 742 IGAVFARAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLMGALLQKSQL 801

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
            GA   GA+   A  DL+Q      + + T  + G  T++
Sbjct: 802 SGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834



 Score = 40.4 bits (93), Expect = 0.90,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 36/84 (42%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F+  D+R +DFS +    A L       ANF  A LS T +    L   N  NA L  
Sbjct: 538 ADFSGMDLRGADFSKALLECADLSNCKLDGANFHSAMLSRTELHNTSLCGCNFENASLAL 597

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
                SD  GA  +     +A+ D
Sbjct: 598 AQCCHSDFSGAHFKNTQLQEALFD 621


>gi|254299592|ref|ZP_04967041.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
 gi|418542641|ref|ZP_13108060.1| type VI secretion system [Burkholderia pseudomallei 1258a]
 gi|418549165|ref|ZP_13114243.1| type VI secretion system [Burkholderia pseudomallei 1258b]
 gi|157809489|gb|EDO86659.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
 gi|385355180|gb|EIF61399.1| type VI secretion system [Burkholderia pseudomallei 1258a]
 gi|385356028|gb|EIF62174.1| type VI secretion system [Burkholderia pseudomallei 1258b]
          Length = 825

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|427738633|ref|YP_007058177.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427373674|gb|AFY57630.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 436

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 60/115 (52%), Gaps = 7/115 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANFT+ ++  ++F+ +   GA LE A    AN T ADLS T + +  L  ANL N+ L  
Sbjct: 259 ANFTNVNLEGANFTNANLEGANLENAKLNNANLTNADLSYTNLRKADLRCANLINSDLSN 318

Query: 190 TVLTRSDLGGAIIEGA-----DFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
              +R++L  AI+ GA     +FSDA  +L     +  Y +G N I     R +L
Sbjct: 319 ADASRANLSDAIVNGANLIQSNFSDA--NLRGCNLIKTYLSGANLIRADLKRANL 371



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
             A+ +  D+ +++FS +   GA         ANFT A+L    ++   LN ANLTNA L
Sbjct: 237 INADLSGIDLCDANFSDANLEGANFTNVNLEGANFTNANLEGANLENAKLNNANLTNADL 296

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
             T L ++DL  A +  +D S+A
Sbjct: 297 SYTNLRKADLRCANLINSDLSNA 319


>gi|381207604|ref|ZP_09914675.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 255

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 16/111 (14%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL +A   + N + A+ T  D+  ++  G+  +GA L  A    AN  GADL+D  +  
Sbjct: 96  ADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSE 155

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAI---------------IEGADFSDA 210
             L+EANL+ A L    L  +DLG A+               ++GAD +DA
Sbjct: 156 ANLSEANLSEADLSGADLREADLGKAVLSQAKLVGANLHRIRLQGADLTDA 206



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 59/123 (47%), Gaps = 16/123 (13%)

Query: 102 RGE-FGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEK 154
           +GE FG+        ADL KAV    + R      AN   AD++E++  G+  +   L  
Sbjct: 20  KGELFGV----DLSEADLPKAVLYSSDLREAKLSKANLAKADLQEANLVGAGLHRVDLNG 75

Query: 155 AVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A  ++AN   ADLS  L+   D    N  EANL NA L    L  ++LGG  + GA  S 
Sbjct: 76  ANLHQANLAQADLSGALLFFADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSG 135

Query: 210 AVI 212
           A +
Sbjct: 136 AKL 138



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 51/110 (46%), Gaps = 11/110 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A+   A LR A  V  +        AN + A++ E+D SG+    A L KAV  +A  
Sbjct: 129 SGAKLSGAKLRGANLVGADLTDADLSEANLSEANLSEADLSGADLREADLGKAVLSQAKL 188

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            GA+L      R+ L  A+LT+A L    L   DL  AI E   F  A +
Sbjct: 189 VGANLH-----RIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKL 233



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 57/119 (47%), Gaps = 13/119 (10%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G  DL  A       R AN   AD+ ++D S +  + A L      +A+ +GADL +
Sbjct: 121 ANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSEANLSEANLS-----EADLSGADLRE 175

Query: 170 TLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVID--LAQKQALC 221
             + + VL++A L  A L R       LT +DL  A + G D  +A+ +  L +K  LC
Sbjct: 176 ADLGKAVLSQAKLVGANLHRIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKLC 234



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKA------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL++A      +H  +   AN   A++ ++D SG+    A L +A A +AN 
Sbjct: 49  SKANLAKADLQEANLVGAGLHRVDLNGANLHQANLAQADLSGALLFFADLHEANAPEANL 108

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             ADL++     + L  ANL    L    L+ + L GA + GAD +DA
Sbjct: 109 KNADLTE-----VDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDA 151


>gi|218440553|ref|YP_002378882.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218173281|gb|ACK72014.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 320

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 14/110 (12%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A   +A+L++A+    +F+    SA++  ++  G K NGA L +A   KAN +G DL+  
Sbjct: 100 ANLSNANLKQAILTNVDFK----SANLSGANLVGVKLNGANLSRADLSKANLSGIDLTGA 155

Query: 171 LMDRMVLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + R+ L+ ANL            A L R+ L   DL GAI++G++   A
Sbjct: 156 NLSRVDLSRANLNGADLSGANLYKADLSRSNLRNGDLQGAILQGSNLHKA 205


>gi|254182800|ref|ZP_04889393.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
 gi|184213334|gb|EDU10377.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
          Length = 825

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|51246498|ref|YP_066382.1| hypothetical protein DP2646 [Desulfotalea psychrophila LSv54]
 gi|50877535|emb|CAG37375.1| hypothetical protein DP2646 [Desulfotalea psychrophila LSv54]
          Length = 446

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 47/96 (48%), Gaps = 5/96 (5%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           + K V   E ++      D+   DFSG    GA LE+A    AN   ADL+D  +    L
Sbjct: 337 IEKVVEAGECYQC-----DLAGLDFSGESLTGADLEQADLSGANLAEADLADANLRGANL 391

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             ANLT A L R  L + DL GA + GA+  D  +D
Sbjct: 392 RGANLTGADLRRADLYKGDLRGADLTGANLEDTQMD 427



 Score = 37.7 bits (86), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGADLSD 169
           ADL +A     N   A+   A++R ++  G+   GA L +A  YK     A+ TGA+L D
Sbjct: 364 ADLEQADLSGANLAEADLADANLRGANLRGANLTGADLRRADLYKGDLRGADLTGANLED 423

Query: 170 TLMDRMVLNEA 180
           T MD ++  +A
Sbjct: 424 TQMDGVLQTDA 434


>gi|334117107|ref|ZP_08491199.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333461927|gb|EGK90532.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 520

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 57/95 (60%), Gaps = 9/95 (9%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N ++A+MR++  + ++ +GA L     YKAN +GA L+   + R  L EA L  A ++R+
Sbjct: 51  NLSNANMRKAKLNVARLSGANL-----YKANLSGAILNVANLIRADLREAQLVEATMIRS 105

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
            L R++L  A + GA+ S+A  DL  ++A  + AN
Sbjct: 106 ELIRANLSSANLTGANLSEA--DL--REATLREAN 136



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 3/138 (2%)

Query: 74  TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FRANF 132
           T L+ A +     N++ L+  N Y+A   G   I + A    ADLR+A  V+    R+  
Sbjct: 50  TNLSNANMRKAKLNVARLSGANLYKANLSG--AILNVANLIRADLREAQLVEATMIRSEL 107

Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
             A++  ++ +G+  + A L +A   +AN   ADLS   +    L  ANL  A L R  L
Sbjct: 108 IRANLSSANLTGANLSEADLREATLREANLEQADLSGAHLRGASLTAANLERANLHRADL 167

Query: 193 TRSDLGGAIIEGADFSDA 210
           +R+DL G  +  A+   A
Sbjct: 168 SRADLRGVNLCNAELRQA 185



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +A+LR+A   + N   A+   A++R +D SG+   GA L++A    AN  GA+LS+  + 
Sbjct: 179 NAELRQANLSQANLSGADLRGANLRWADLSGANLTGADLDEARLSGANLYGANLSNVNLL 238

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
              L  A+LT A L+      +DL GA + GA
Sbjct: 239 NATLVHADLTQANLIHADWVGADLTGAALTGA 270



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 65/134 (48%), Gaps = 11/134 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           +AA    A+L +A   + + R      A    A++ +++ SG+   GA L  A    AN 
Sbjct: 153 TAANLERANLHRADLSRADLRGVNLCNAELRQANLSQANLSGADLRGANLRWADLSGANL 212

Query: 163 TGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
           TGADL +  +    L  ANL+     NA LV   LT+++L  A   GAD + A +  A+ 
Sbjct: 213 TGADLDEARLSGANLYGANLSNVNLLNATLVHADLTQANLIHADWVGADLTGAALTGAKI 272

Query: 218 QALCKYANGTNPIT 231
            A+ ++    + IT
Sbjct: 273 YAVSRFDVKADDIT 286


>gi|209526319|ref|ZP_03274848.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376001485|ref|ZP_09779353.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423062694|ref|ZP_17051484.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493248|gb|EDZ93574.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375330094|emb|CCE15106.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406715650|gb|EKD10803.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 390

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 64/115 (55%), Gaps = 11/115 (9%)

Query: 110 AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFT 163
           +A    ADL +A+ +K N  +A+ +SA++ +S+   + F  AYL KA       ++A+ +
Sbjct: 111 SAHLNWADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLS 170

Query: 164 GADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            A+L D  +    L+E     ANL  A L    LT+++LG A + GA+ +DA ++
Sbjct: 171 SANLKDVNLSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLN 225



 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 70/146 (47%), Gaps = 9/146 (6%)

Query: 76  LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGS--AAQFGSADLRKAVHVKEN-FRAN 131
           L+AA ++ C    + L   N  EA+ T+   G  +   A    A L  A  V+ + ++AN
Sbjct: 179 LSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLNSASLVEADLYQAN 238

Query: 132 FTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
            T A++  ++ S +       NGA+L K     A+  G DLS  L+  + L  A L+ A 
Sbjct: 239 LTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSQKLLTGINLAGAYLSEAT 298

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVI 212
           LV  +L  ++L  A + GA+   A +
Sbjct: 299 LVGALLMEANLSAANLSGANLQSACL 324



 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A  G  DL + +    N        A    A + E++ S +  +GA L+ A    A+ 
Sbjct: 270 SGADLGGVDLSQKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQSACLIHADL 329

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            GA      +DR+ L +ANLT A L +  L  ++L  AI+ G +   A
Sbjct: 330 GGA-----YLDRVDLTDANLTGANLTKADLREANLRAAILAGVELKGA 372



 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 6/102 (5%)

Query: 115 SADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           SA+  +A  +K N      F+A+ +SA++++ + S +  +   + +A    AN T ADL+
Sbjct: 146 SANFVRAYLIKANLSEADLFQADLSSANLKDVNLSAANLSECKMTRANLMGANLTEADLT 205

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              + R  L  ANLT+A L    L  +DL  A +  A+ S A
Sbjct: 206 KANLGRANLRGANLTDAYLNSASLVEADLYQANLTRANLSRA 247



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 67/160 (41%), Gaps = 26/160 (16%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE---------------- 153
           A F  ADL  A     N  + N + A++  ++ SGS  NGA L+                
Sbjct: 57  ADFSEADLSGAHLSLANLSKVNLSGANLTGANLSGSSLNGANLQGATLSGVNLESAHLNW 116

Query: 154 ----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
               +A+  K N   ADLS   + +  L  AN   A L++  L+ +DL  A +  A+  D
Sbjct: 117 ADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLSSANLKD 176

Query: 210 AVIDLAQKQALCKY--AN--GTNPITGVSTRKSLGCGNSR 245
             +  A   + CK   AN  G N      T+ +LG  N R
Sbjct: 177 VNLS-AANLSECKMTRANLMGANLTEADLTKANLGRANLR 215


>gi|428216569|ref|YP_007101034.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427988351|gb|AFY68606.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 330

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 15/164 (9%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR------ 129
           L  A +   S NI++L + N   A+ RG     S A    ADLR A   + N        
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRG--ADLSEANLAGADLRGADLSEANLANTDLTG 189

Query: 130 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN   A MR     E++ +G+    AY++     +AN + ADL  T +D  V++ ANL+ 
Sbjct: 190 ANLAEAIMRGTGLTEANLTGANLANAYMQNVRTERANLSEADLQGTNLDLAVMSMANLSK 249

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
           + L    L R++L G  +   + S A  +L + Q +  Y   TN
Sbjct: 250 SNLSEASLYRANLNGTDLSRTNLSGA--NLREAQLVESYMARTN 291



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 54/177 (30%), Positives = 81/177 (45%), Gaps = 19/177 (10%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQ 112
           NQ     AKL +  +    AL  A + +     + L D N   A+ R     G+    A 
Sbjct: 73  NQAHLSEAKLNDVDLH-GAALVGATLVNADLTFAVLIDANLMNADLRSANLSGANLAGAC 131

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGAD 166
              A LR+A     + R AN   AD+R     E++ +G+   GA L +A     + TGA+
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRGADLSEANLAGADLRGADLSEANLANTDLTGAN 191

Query: 167 LSDTLMDRMVLNEANLTNAVL-------VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           L++ +M    L EANLT A L       VRT   R++L  A ++G +   AV+ +A 
Sbjct: 192 LAEAIMRGTGLTEANLTGANLANAYMQNVRT--ERANLSEADLQGTNLDLAVMSMAN 246



 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 7/96 (7%)

Query: 95  NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAY 151
           N  EA+ +G   +  + S A    ++L +A      +RAN    D+  ++ SG+    A 
Sbjct: 226 NLSEADLQGTNLDLAVMSMANLSKSNLSEASL----YRANLNGTDLSRTNLSGANLREAQ 281

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           L ++   + N T ADL+D L+ R  L+ ANL NA L
Sbjct: 282 LVESYMARTNLTNADLADALLARAELSSANLLNANL 317



 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           RAN + AD++ ++      S +  + + L +A  Y+AN  G DLS T +    L EA L 
Sbjct: 224 RANLSEADLQGTNLDLAVMSMANLSKSNLSEASLYRANLNGTDLSRTNLSGANLREAQLV 283

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + + RT LT +DL  A++  A+ S A
Sbjct: 284 ESYMARTNLTNADLADALLARAELSSA 310


>gi|167913453|ref|ZP_02500544.1| pentapeptide repeat family protein [Burkholderia pseudomallei 112]
 gi|403521532|ref|YP_006657101.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           BPC006]
 gi|403076599|gb|AFR18178.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           BPC006]
          Length = 825

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|443651776|ref|ZP_21130709.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159027471|emb|CAO89436.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443334417|gb|ELS48929.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 931

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 3/117 (2%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
            G A+L +A   + +   AN   A++  ++  G+   GA L  A   +AN  GA+L    
Sbjct: 789 LGGANLERANLAEADIGGANLEGANLEGANLKGANLEGANLAMAFLKRANLEGANLRGAN 848

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
           ++   L  ANL  A L R  L  ++L GA + GA+   A +D A  +    Y  G N
Sbjct: 849 LEEAYLEGANLAMAFLKRANLEGANLRGANLYGANLKGANLDWANLEG--AYLEGAN 903



 Score = 43.1 bits (100), Expect = 0.13,   Method: Composition-based stats.
 Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 10/123 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           + A  G A+L  A     N +      AN   A ++ ++  G+   GA LE+A    AN 
Sbjct: 800 AEADIGGANLEGANLEGANLKGANLEGANLAMAFLKRANLEGANLRGANLEEAYLEGANL 859

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
             A L    ++   L  ANL  A L    L  ++L GA +EGA+     +D A      K
Sbjct: 860 AMAFLKRANLEGANLRGANLYGANLKGANLDWANLEGAYLEGANLRGVFLDGAN----FK 915

Query: 223 YAN 225
           YAN
Sbjct: 916 YAN 918


>gi|334188366|ref|NP_001190531.1| thylakoid lumenal protein [Arabidopsis thaliana]
 gi|332008986|gb|AED96369.1| thylakoid lumenal protein [Arabidopsis thaliana]
          Length = 250

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++A M  + F G+      + KA A +A+F G + ++ ++DR+   ++NL  AV   TV
Sbjct: 145 LSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTV 204

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
           L+ S    A +E   F D +I     Q +C+     N       R  LGC
Sbjct: 205 LSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 249


>gi|226194659|ref|ZP_03790253.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
 gi|386863935|ref|YP_006276883.1| type VI secretion system [Burkholderia pseudomallei 1026b]
 gi|418534996|ref|ZP_13100802.1| type VI secretion system [Burkholderia pseudomallei 1026a]
 gi|225933225|gb|EEH29218.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
 gi|385357281|gb|EIF63347.1| type VI secretion system [Burkholderia pseudomallei 1026a]
 gi|385661063|gb|AFI68485.1| type VI secretion system [Burkholderia pseudomallei 1026b]
          Length = 825

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
          Length = 273

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 76/159 (47%), Gaps = 12/159 (7%)

Query: 79  AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR 138
           A++ SCS  +   A+  ++   T     + S +   SAD  +A   + N R     A + 
Sbjct: 115 ALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLR----QASLI 169

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
            + F+ +K   + L +A   + NF  A+L+ +L  R    EAN T+A L+  +L +S LG
Sbjct: 170 GAVFALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLG 229

Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           GA   GA+   A  DL+Q      + + T  + G  T++
Sbjct: 230 GANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 261


>gi|6226483|sp|Q52118.1|YMO3_ERWST RecName: Full=Uncharacterized protein in mobD 3'region
 gi|886362|gb|AAA69501.1| unknown [Plasmid pSW200]
          Length = 295

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 162
           S A   +ADL++A     N   A+ T+A++ ++D      +GA L  A   +AY  +A+ 
Sbjct: 170 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 229

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           + A+LS+  + R  L++ANL++A L    L R+DL  AI++GA+ 
Sbjct: 230 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 274



 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 11/116 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYK--- 159
           S A    ADL  A     N        AN T A + E+D S +  +GA L  A   +   
Sbjct: 85  SDANLSDADLSDANLSDANLSGANLAHANLTMAYLSEADLSNANLSGADLTNANLNQTDL 144

Query: 160 --ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
              N +GA+L+   +    L+EA+L+NA L    L R+DL  A + GAD ++A ++
Sbjct: 145 PNVNLSGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLN 200



 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM----------VLN 178
            AN T A + E+D S +  + A L++A    AN +GADL++  +++            L 
Sbjct: 156 HANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLA 215

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            ANLT A L    L+ ++L  A ++ AD SDA
Sbjct: 216 HANLTMAYLSEADLSNANLSNADLKRADLSDA 247



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           +++  + S +   GAYL  A     N + ADLSD  +    L+ ANL +A L    L+ +
Sbjct: 68  NLKGVNLSDTDLKGAYLSDA-----NLSDADLSDANLSDANLSGANLAHANLTMAYLSEA 122

Query: 196 DLGGAIIEGADFSDAVID 213
           DL  A + GAD ++A ++
Sbjct: 123 DLSNANLSGADLTNANLN 140



 Score = 37.4 bits (85), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 51/91 (56%), Gaps = 5/91 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANFTGADLSDTLMDRMVLNEANLTN 184
           A+ T+A++ ++D      +GA L  A   +AY  +A+ + A+LS+  + R  L+ ANL+ 
Sbjct: 132 ADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSG 191

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           A L    L ++DL    + GA+ + A + +A
Sbjct: 192 ADLTNANLNQTDLPNVNLSGANLAHANLTMA 222


>gi|443319118|ref|ZP_21048355.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442781316|gb|ELR91419.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 331

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 59/130 (45%), Gaps = 20/130 (15%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A     D+R  D SG   + A L  A     N TGA+LS   + R  L +ANLT  +L  
Sbjct: 191 AYLNGVDLRGMDLSGVNLSQARLNGAKLDLVNLTGANLSQATLRRASLQQANLTGTILTG 250

Query: 190 TV----------LTRSD-----LGGAIIE-----GADFSDAVIDLAQKQALCKYANGTNP 229
            V          LTR+D     L GA+++     GA+F+DA++    +  L   A G   
Sbjct: 251 AVLWHADMQGVNLTRADLSQANLAGALLQATSITGAEFTDAILPEESRNGLYALATGETL 310

Query: 230 ITGVSTRKSL 239
            +   TR++L
Sbjct: 311 WSHRLTRETL 320



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 13/117 (11%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE 153
           L  YEA  R   GI       S +L + + V         + D+ E+   G+    A+L 
Sbjct: 28  LEMYEAGYRDFAGI----HLNSVNLSQRILV---------AVDLAEASLVGADLARAFLT 74

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           KA  Y+AN   A+LS T +  + L +++L+ A L  T + ++ L GA + GA+   A
Sbjct: 75  KANLYRANLHRANLSFTKLSDVNLRQSDLSKADLRSTFMVKAHLEGANLSGANLGQA 131



 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 11/109 (10%)

Query: 116 ADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           ADLR    VK +            +AN   A++  ++  G+   GA L  A   +AN + 
Sbjct: 106 ADLRSTFMVKAHLEGANLSGANLGQANLRGANLEGANLCGANLQGANLRGANLSQANLSW 165

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A+LS + M  + L+   L +  L    L   DL G  + G + S A ++
Sbjct: 166 ANLSGSRMGGVALDRTQLADVTLEGAYLNGVDLRGMDLSGVNLSQARLN 214


>gi|443476809|ref|ZP_21066696.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443018179|gb|ELS32476.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 330

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 68/131 (51%), Gaps = 13/131 (9%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
           L+ +N   A   G   IG+   F   +L +A   + N + AN   AD++ ++   +   G
Sbjct: 183 LSRVNLQGANLSGAIAIGTI--FTEVNLSQANLTEVNLKGANLMKADLKNANLRLANLFG 240

Query: 150 AYLEKA---VAYKAN-------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
           A L KA   +A  +N        TG+DLS +L+DR  L++A+L +A LVR  L  +DL  
Sbjct: 241 ANLSKANLSMATLSNAGLIQAILTGSDLSRSLLDRANLSQASLVDAYLVRANLDGADLSN 300

Query: 200 AIIEGADFSDA 210
           AI+  A+ S A
Sbjct: 301 AILTRAELSGA 311



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 60/125 (48%), Gaps = 21/125 (16%)

Query: 131 NFTSADMRESDF---------------SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           N T AD+R ++F               SG+K  GA L +A+   AN T ADL   ++ R+
Sbjct: 36  NLTKADLRRTNFVFAYLNKVTFNHANLSGAKLGGATLNQAIMMSANLTEADLHGAMLQRV 95

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP--ITGV 233
            L  ANL+ A L+   L+ +DL    + GA+   A++      AL +   G  P  + G 
Sbjct: 96  NLFGANLSLANLMDANLSEADLRSVNLRGANLRCAIL----SAALMREERGYPPTNMVGA 151

Query: 234 STRKS 238
           + RK+
Sbjct: 152 NLRKA 156



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 6/94 (6%)

Query: 124 VKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           V  N R A+   A++  SD +G   +GA L +A   + N  GA+LS  +    +  E NL
Sbjct: 149 VGANLRKADLRGANLSGSDLTGVDLSGANLSEATLSRVNLQGANLSGAIAIGTIFTEVNL 208

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           + A      LT  +L GA +  AD  +A + LA 
Sbjct: 209 SQA-----NLTEVNLKGANLMKADLKNANLRLAN 237


>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 179

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 49/95 (51%), Gaps = 1/95 (1%)

Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DL+ A     N   AN  +AD+ E++  G+   GA L+ A   K N  GA+L    +   
Sbjct: 65  DLQNANLQGANLEGANLQNADLEEANLQGANLAGANLQGADLEKGNLAGANLQTANLINA 124

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L EANL NA L    L R+DL  A + GA+ ++A
Sbjct: 125 DLEEANLQNANLQGASLQRADLEKANLTGANTNEA 159



 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 68/142 (47%), Gaps = 8/142 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A   S +LR+ +  KE    N +  D++ ++  G+   GA L+ A   +AN  GA+L+
Sbjct: 38  STAPEASTELRRLLDTKECAGCNLSGVDLQNANLQGANLEGANLQNADLEEANLQGANLA 97

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
              +    L + NL  A L    L  +DL  A ++ A+   A +    ++A  + AN   
Sbjct: 98  GANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASL----QRADLEKAN--- 150

Query: 229 PITGVSTRKSLGCGNSRRNAYG 250
            +TG +T ++   G +  NA G
Sbjct: 151 -LTGANTNEANLQGANLENAIG 171


>gi|424851694|ref|ZP_18276091.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
 gi|356666359|gb|EHI46430.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
          Length = 194

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 60/131 (45%), Gaps = 16/131 (12%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
           +E R E  I +   F  ADL ++ HV   FR+ +FT   +  S+F      GS+F+   L
Sbjct: 31  SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 90

Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
              V  + +FT     GADL           EANL    L R VL  +DL     GGA  
Sbjct: 91  RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 150

Query: 203 EGADFSDAVID 213
           +GAD   A ID
Sbjct: 151 DGADLRGARID 161


>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
 gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
          Length = 436

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 9/149 (6%)

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           T+ EF    A     A+L KA+              +++ D SG+   GA L  A+   A
Sbjct: 203 TKAEFT-TDAKVIEKAELIKAIR-----EGTIDKTTLQQVDLSGAILRGAILIGAILRGA 256

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQK 217
           N +GA+LSD ++   +L+ A L+ A L    L+ +DL GA + GA+ S+A +   DL++ 
Sbjct: 257 NLSGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSEA 316

Query: 218 QALCKYANGTNPITGVSTRKSLGCGNSRR 246
                  +G N I     R +L   N RR
Sbjct: 317 DLSEADLSGANLIDANLRRANLIKANLRR 345



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 10/127 (7%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+LR A   + +   A+ + AD+ E+D SG+    A L +A   KAN   A+L
Sbjct: 289 SGADLSGANLRGANLSEADLSEADLSEADLSEADLSGANLIDANLRRANLIKANLRRANL 348

Query: 168 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
            + ++    L+      ANL  A+L+  +L  +DL GA +  A+ S+A I+     A+  
Sbjct: 349 IEAILSEADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFI 404

Query: 223 YANGTNP 229
            A G  P
Sbjct: 405 DATGITP 411



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A LR A+  +      F S AD+  +D SG+   GA L +A   +A+ + ADL
Sbjct: 259 SGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSEADL 318

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S+  +    L +ANL  A L++  L R++L  AI+  AD S A
Sbjct: 319 SEADLSGANLIDANLRRANLIKANLRRANLIEAILSEADLSGA 361



 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 2/103 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+LR+A  +K N R AN   A + E+D SG+    A L KA+  +A    ADL
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEAILIEADL 383

Query: 168 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 209
               +    L+EA++ NA+ +  T +T       I  GA F D
Sbjct: 384 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 426


>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           BS1]
 gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
          Length = 300

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 64/121 (52%), Gaps = 3/121 (2%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNG 149
           L+D N  EA+  G   +   A    A+L R  V   +   AN +     E+DF+ S+   
Sbjct: 93  LSDANLVEADLSGSMLV--EANLRGANLSRGKVRDVDLTSANLSDGFFIETDFTRSQMVR 150

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           + +++A   +A  TG +LS + ++++ L+ A+L NAVLV   +T SDL  A   GAD  D
Sbjct: 151 SKMQRAFLGRATLTGTNLSWSNLEKVNLDNADLQNAVLVDVDITSSDLVAANFSGADLRD 210

Query: 210 A 210
           A
Sbjct: 211 A 211


>gi|254417634|ref|ZP_05031369.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196175575|gb|EDX70604.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 470

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 9/110 (8%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A   SADLR A        A    AD+R +D  G+K N A L+   A  AN +GA+LS  
Sbjct: 214 ANLVSADLRNANLTD----AQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLS-- 267

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
              +  +  A+   A+LVRT L  + L G+  + AD + A +  AQ + +
Sbjct: 268 ---QAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGI 314



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 70/158 (44%), Gaps = 23/158 (14%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVK 125
           N  + V T L  AV+   +  I+ L   N   A+ +G   IG    F  A+L KA +   
Sbjct: 277 NGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKG---IG----FNRANLTKANLEGA 329

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRM 175
           +   A    AD+  +  +G+  + AYL  A    AN +G DL          S+  +   
Sbjct: 330 DLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQLREANLSNVTLVGA 389

Query: 176 VLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 208
            L +ANL     T A L  T LTR DL GA + GAD S
Sbjct: 390 TLEDANLIRSTLTGANLTYTNLTRCDLRGANLTGADLS 427



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 6/110 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A   +AD   A+ V+   R      +NF  AD+ +++  G++  G    +A   KAN 
Sbjct: 267 SQAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGIGFNRANLTKANL 326

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            GADL++  +    L  A LT A+L    L  + L  A + G D   A +
Sbjct: 327 EGADLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQL 376



 Score = 43.5 bits (101), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 6/144 (4%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
           F++   AA V     +N+   ADL  Y A   G   + S A    A L  A  V+   R 
Sbjct: 154 FIANWYAAVVTDLRDTNLQG-ADL--YRANLDG--ALLSRANLQDAQLDYANLVRTYLRE 208

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A  T+A++  +D   +    A LE A    A+  GA L++  +D +  + ANL+ A L +
Sbjct: 209 ATLTNANLVSADLRNANLTDAQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLSQ 268

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
             +T +D  GAI+      +AV++
Sbjct: 269 AYITNADFNGAILVRTTLREAVLN 292


>gi|428215879|ref|YP_007089023.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004260|gb|AFY85103.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 284

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 19/101 (18%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN +  D+ E+D SG               AN + ADLSDT +   +L  ANLT A+L 
Sbjct: 159 RANLSGLDLSETDLSG---------------ANLSYADLSDTQLTEAILYGANLTGAILT 203

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
              L  + + G++++GAD S A +    + A  K+ + TN 
Sbjct: 204 SAQLDGAKMNGSLVDGADLSQANL----QDAEVKWVDLTNA 240



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S +QF SA L+ A  V+ N  +    +AD+R +D S +   G+ L +A   + N TGA+L
Sbjct: 43  SHSQFCSAILQGATLVEANLEQTKLRAADLRRADLSHANLMGSDLSRADMIETNLTGANL 102

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
               +  ++ +E    +A L R  L   +L G  + GA+  +A I
Sbjct: 103 EQANLTEVIFSEVIFADANLSRANLQGLNLSGINLSGANLQEAHI 147



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 39/128 (30%), Positives = 59/128 (46%), Gaps = 10/128 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANF 162
           S A    +DL +A  ++ N   AN   A++ E  FS   F  A L +A          N 
Sbjct: 78  SHANLMGSDLSRADMIETNLTGANLEQANLTEVIFSEVIFADANLSRANLQGLNLSGINL 137

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
           +GA+L +  +  +  + ANL+ A L    L+ +DL GA +  AD SD  +     +A+  
Sbjct: 138 SGANLQEAHIAEVSFHNANLSRANLSGLDLSETDLSGANLSYADLSDTQL----TEAILY 193

Query: 223 YANGTNPI 230
            AN T  I
Sbjct: 194 GANLTGAI 201


>gi|53715998|ref|YP_106439.1| pentapeptide repeat-containing protein [Burkholderia mallei ATCC
           23344]
 gi|121597894|ref|YP_990510.1| pentapeptide repeat-containing protein [Burkholderia mallei SAVP1]
 gi|124382797|ref|YP_001025000.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
           10229]
 gi|126447556|ref|YP_001079344.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
           10247]
 gi|166999172|ref|ZP_02265018.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
 gi|238561876|ref|ZP_00441284.2| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
           4]
 gi|254176522|ref|ZP_04883180.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
 gi|254203434|ref|ZP_04909795.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
 gi|254205313|ref|ZP_04911666.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
 gi|254356120|ref|ZP_04972397.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
 gi|52421968|gb|AAU45538.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 23344]
 gi|121225692|gb|ABM49223.1| pentapeptide repeat family protein [Burkholderia mallei SAVP1]
 gi|126240410|gb|ABO03522.1| pentapeptide repeat family protein [Burkholderia mallei NCTC 10247]
 gi|147745673|gb|EDK52752.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
 gi|147754899|gb|EDK61963.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
 gi|148025103|gb|EDK83272.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
 gi|160697564|gb|EDP87534.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
 gi|238523698|gb|EEP87135.1| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
           4]
 gi|243064727|gb|EES46913.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
 gi|261826983|gb|ABM99323.2| pentapeptide repeat family protein [Burkholderia mallei NCTC 10229]
          Length = 825

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|374300595|ref|YP_005052234.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332553531|gb|EGJ50575.1| Protein of unknown function DUF2169 [Desulfovibrio africanus str.
            Walvis Bay]
          Length = 1248

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 10/93 (10%)

Query: 136  DMRESDFSGSKFN----------GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
            D+R  D SG++            GA L KA+  +A+F+GA LS   +   VL + +L  A
Sbjct: 949  DLRGIDLSGTQLGKTLMCGTNLAGANLSKAMGQEADFSGACLSGANLTGAVLQKTSLVEA 1008

Query: 186  VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
            +L    L ++ L G+ + GAD SDA +D+   Q
Sbjct: 1009 ILSGACLKQAVLNGSDLSGADLSDATLDMVVIQ 1041



 Score = 46.6 bits (109), Expect = 0.013,   Method: Composition-based stats.
 Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 25/135 (18%)

Query: 110  AAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
                  A+L KA+  + +F       AN T A ++++    +  +GA L++AV   ++ +
Sbjct: 967  GTNLAGANLSKAMGQEADFSGACLSGANLTGAVLQKTSLVEAILSGACLKQAVLNGSDLS 1026

Query: 164  GADLSDTLMDRMVLNEANLTNAVLVRTVLTR---------SDLGGA----------IIEG 204
            GADLSD  +D +V+ +A L  A + R  L           +D  GA          +++G
Sbjct: 1027 GADLSDATLDMVVIQKAKLDGADVRRASLKMCVIEGPAAGADFRGARFTQCVLKRMLLDG 1086

Query: 205  ADFSDAVIDLAQKQA 219
            ADFS A ++    QA
Sbjct: 1087 ADFSGAALNSTVLQA 1101


>gi|291570913|dbj|BAI93185.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 484

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 59/107 (55%), Gaps = 14/107 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
           +ANFT A +  ++FSG+   G  L +A    +  +GA L    ++  VLN          
Sbjct: 34  QANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 93

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           +ANL +A L+R  L R++L  A++ GA+ ++A  DL  ++A  ++A+
Sbjct: 94  QANLVDASLIRAELMRAELSEAVVNGANLTEA--DL--REATLRHAD 136



 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 13/129 (10%)

Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
           TR +  +   S A    A+L +AV +V    RA+ + A++ ++    ++   A L +AV 
Sbjct: 58  TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAVV 117

Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 207
             AN T ADL +  +    L +     ANL+ A L+     R+ LTR+DL  A + G + 
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177

Query: 208 SDAVIDLAQ 216
            +A +  A+
Sbjct: 178 RNAELRQAE 186



 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           DFS      A L +    +ANFT A LS T       + ANLT   L R  L  S L GA
Sbjct: 16  DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70

Query: 201 IIEGADFSDAVIDLA 215
           I++GA+ ++AV+++A
Sbjct: 71  ILQGANLNEAVLNVA 85



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN T AD+RE+     D   +  +GA L +A    +N   ++L+   + R  L   NL N
Sbjct: 120 ANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNLRN 179

Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
           A L +  L  +DL GA + GA+ 
Sbjct: 180 AELRQAELNGADLRGANLSGANL 202



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 47/93 (50%), Gaps = 1/93 (1%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           A+L +A  +  N  R+N T AD+  +D  G     A L +A    A+  GA+LS   +  
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             L+ ANL+ A L  T L+ + L GA + GA  
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGASL 237



 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 1/96 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL +A     N R A    A++  +D  G+  +GA L  A    AN +GA+L  T +  
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  ANL+ A L+      +DL  A +   D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260


>gi|307944130|ref|ZP_07659471.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
 gi|307772476|gb|EFO31696.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
          Length = 534

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 1/113 (0%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
             I   A+   ADLR A   + + R A    AD+R +    +K  GA L++A   +A+  
Sbjct: 63  LAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKLQQAKLWGADLQEADLQEADLR 122

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           GADL    +    L  A L  A L    L  +DL GA + GAD   A ++ A+
Sbjct: 123 GADLRGAKLQEADLRGAKLQEADLRGAKLQEADLRGAKLRGADLRGAKLEWAK 175



 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 76/190 (40%), Gaps = 34/190 (17%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
            W       L  A +      ++ L +    EA+ RG       A+   ADLR A   + 
Sbjct: 42  EWADLWGANLQQAKLQQADLRLAILQEAKLQEADLRG-------AKLQQADLRGAKLQQA 94

Query: 127 NFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           + R           A+   AD++E+D  G+   GA L+     +A+  GA L +  +   
Sbjct: 95  DLRLAKLQQAKLWGADLQEADLQEADLRGADLRGAKLQ-----EADLRGAKLQEADLRGA 149

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ------ALCKYANGTNP 229
            L EA+L  A      L  +DL GA +E A    A ++ A  +      A+  +A     
Sbjct: 150 KLQEADLRGA-----KLRGADLRGAKLEWAKLEWAKLEWADVRTVKSSLAVSGFARADFT 204

Query: 230 ITGVSTRKSL 239
            TG  T+K +
Sbjct: 205 HTGYLTQKQV 214


>gi|297796179|ref|XP_002865974.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
           subsp. lyrata]
 gi|297311809|gb|EFH42233.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
           subsp. lyrata]
          Length = 236

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 10/103 (9%)

Query: 144 GSKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           G+KF+GA      + KA A +A+F G + ++ ++DR+   ++NL  AV   TVL+ S   
Sbjct: 138 GAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTVLSGSTFE 197

Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
            A +E   F D +I     Q +C+     N       R  LGC
Sbjct: 198 EANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235


>gi|78187857|ref|YP_375900.1| pentapeptide repeat-containing protein [Chlorobium luteolum DSM
           273]
 gi|78167759|gb|ABB24857.1| pentapeptide repeat family protein [Chlorobium luteolum DSM 273]
          Length = 447

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 59/118 (50%), Gaps = 21/118 (17%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRE----------SDFSGSKFNGAYLEKA---- 155
           A+   ADLR+ V ++ +   AN   A++RE          +D  G+   GA+L KA    
Sbjct: 63  AELAGADLRRTVLIRADLSGANLNGANLREANLAMAFIRKADMKGADMTGAWLVKANLKS 122

Query: 156 -----VAYK-ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                 +++ AN  GA+L  + + +  L  ANL+NAVL    L  +DL GA + GA F
Sbjct: 123 SFMNGASFRGANLLGANLRWSSLRKADLTGANLSNAVLFEANLAGADLSGANLSGATF 180



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 6/98 (6%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADLR+A     +F  A   +ADMR     G+    AY++KA    A   GA L    +DR
Sbjct: 297 ADLRQADLGASSFNGATLDNADMR-----GANLRNAYMKKADLKSAKLGGACLEGANLDR 351

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             L +A+L+ A L  T+L  + L GA +EGAD + A +
Sbjct: 352 AFLKDADLSGANLRGTMLYGATLSGANLEGADLAGASL 389



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 2/106 (1%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           A+ F  A L  A     N R A    AD++ +   G+   GA L++A    A+ +GA+L 
Sbjct: 306 ASSFNGATLDNADMRGANLRNAYMKKADLKSAKLGGACLEGANLDRAFLKDADLSGANLR 365

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 213
            T++    L+ ANL  A L    L  +DL GA ++GAD   A V+D
Sbjct: 366 GTMLYGATLSGANLEGADLAGASLFDADLRGANLDGADLEGANVMD 411



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 21/113 (18%)

Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A    A+LR A   K + +           AN   A ++++D SG+   G  L  A    
Sbjct: 317 ADMRGANLRNAYMKKADLKSAKLGGACLEGANLDRAFLKDADLSGANLRGTMLYGATLSG 376

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           AN  GADL+           A+L +A L    L  +DL GA +  ADF+DAV 
Sbjct: 377 ANLEGADLAG----------ASLFDADLRGANLDGADLEGANVMDADFTDAVF 419



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 20/81 (24%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
              AD+R++D   S FNGA L+ A                     +  ANL NA + +  
Sbjct: 294 LEGADLRQADLGASSFNGATLDNAD--------------------MRGANLRNAYMKKAD 333

Query: 192 LTRSDLGGAIIEGADFSDAVI 212
           L  + LGGA +EGA+   A +
Sbjct: 334 LKSAKLGGACLEGANLDRAFL 354


>gi|386828886|ref|ZP_10115993.1| putative low-complexity protein [Beggiatoa alba B18LD]
 gi|386429770|gb|EIJ43598.1| putative low-complexity protein [Beggiatoa alba B18LD]
          Length = 199

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 47/83 (56%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           FRAN    D+  ++ SG+  +GA L +A   KAN + ADLS+  +    L   NLT+A L
Sbjct: 48  FRANLNKVDLTNANLSGANLSGANLSEANLSKANLSKADLSEANLSESYLARTNLTDANL 107

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
               LT++ L  + + GA+ S+A
Sbjct: 108 SEANLTKAYLIESYLSGANLSEA 130



 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 11/115 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAY---- 158
           S A    A+L ++   + N        AN T A + ES  SG+  + A L +A  +    
Sbjct: 83  SKADLSEANLSESYLARTNLTDANLSEANLTKAYLIESYLSGANLSEANLFRANLFESDL 142

Query: 159 -KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +AN TGA+L  T +    L EA LT A L +  +T +DL GA ++     +  I
Sbjct: 143 FRANLTGANLFKTNLTETNLIEAYLTGASLFKATMTEADLTGAKMDDTHLDENAI 197


>gi|134280632|ref|ZP_01767342.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
 gi|134247654|gb|EBA47738.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
          Length = 825

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|189347104|ref|YP_001943633.1| pentapeptide repeat-containing protein [Chlorobium limicola DSM
           245]
 gi|189341251|gb|ACD90654.1| pentapeptide repeat protein [Chlorobium limicola DSM 245]
          Length = 408

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 6/110 (5%)

Query: 109 SAAQFGSAD---LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           SA+ F +AD   L+  V    ++RA       R +D SG++  G  L  A    A+ +GA
Sbjct: 21  SASAFNTADFNALKTGVKPWNSYRAGLGG---RVADLSGAQLKGMNLRGADLSYADLSGA 77

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           DL+ + + +  L+ A L +AVL   +L R+ L  A +  AD  DAV++ A
Sbjct: 78  DLASSDLSKARLDHARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAA 127



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 6/108 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTG 164
           A+  SA LR A+ V+ +  +A   +AD+ ++    + F GA+++ AV  KA+     F+G
Sbjct: 92  ARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAASFKGAFMQTAVLKKADCTGADFSG 151

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           ADL +T      L  A LT A L  T L R+D+  +++ G+  S + +
Sbjct: 152 ADLRETNFREARLAGALLTGADLRATYLWRADMSRSVLSGSRVSPSTV 199



 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 56/120 (46%), Gaps = 2/120 (1%)

Query: 113 FGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           F   D+RK     E N R A F   ++  +D + ++  GA   KA  + A+   ADLS  
Sbjct: 285 FAWNDMRKRNRAMEVNLRQAKFDQKNLSYADLAHARLQGASFRKADLFDADLRNADLSGC 344

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
            M    L +A+L  A L    L R++LG A + G   S + +    K+A  K+A   + +
Sbjct: 345 DMREANLEKADLGGADLSGVNLWRANLGRARLNGVKVSASTVLDTGKKADQKWAERHDAV 404



 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 52/125 (41%), Gaps = 19/125 (15%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           N Y A   G     S AQ    +LR A    +   A+ + AD+  SD S ++ + A L+ 
Sbjct: 41  NSYRAGLGGRVADLSGAQLKGMNLRGA----DLSYADLSGADLASSDLSKARLDHARLDS 96

Query: 155 AVAY----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           AV            KA    ADL D      VL  A+   A +   VL ++D  GA   G
Sbjct: 97  AVLRSALLVRASLDKARLHNADLEDA-----VLEAASFKGAFMQTAVLKKADCTGADFSG 151

Query: 205 ADFSD 209
           AD  +
Sbjct: 152 ADLRE 156


>gi|428314781|ref|YP_007150965.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428256164|gb|AFZ22121.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 237

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 6/96 (6%)

Query: 113 FGSADLRKAVHVKENF-RANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F  ADL  A  +  N  +AN + A     ++R++D +G+K   A L  A    A+ TGA+
Sbjct: 130 FQGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATLVGAHMTGAN 189

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           L+    +  VL  A+LT AVL++T L  +DL  AI+
Sbjct: 190 LTGANFNNAVLRYADLTKAVLIKTNLKGADLSLAIM 225



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 57/109 (52%), Gaps = 9/109 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFT 163
           S      ADLR      +  RAN  +  ++++  SG++ N A L     + A   + +F 
Sbjct: 76  SGLDLSGADLRNT----DLSRANLKNTKLKDAKMSGARLNQANLTYADLDGADFQECDFQ 131

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           GADLS+  +    L +ANL+ A L RT L  +DL GA +E A+ S+A +
Sbjct: 132 GADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATL 180



 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 4/102 (3%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN T AD+  +DF    F GA L  A     N   A+LS   ++R  L +A+LT A L 
Sbjct: 112 QANLTYADLDGADFQECDFQGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLE 171

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
              L+ + L GA + GA+ + A  +     A+ +YA+ T  +
Sbjct: 172 SANLSNATLVGAHMTGANLTGANFN----NAVLRYADLTKAV 209



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 12/85 (14%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D+RE + SG   +GA L      +AN     L D  M    LN+ANLT A          
Sbjct: 69  DLREINLSGLDLSGADLRNTDLSRANLKNTKLKDAKMSGARLNQANLTYA---------- 118

Query: 196 DLGGAIIEGADFSDAVIDLAQKQAL 220
           DL GA  +  DF  A  DL+  Q L
Sbjct: 119 DLDGADFQECDFQGA--DLSNAQLL 141


>gi|451980423|ref|ZP_21928815.1| conserved hypothetical protein, contains pentapeptide repeats
           [Nitrospina gracilis 3/211]
 gi|451762323|emb|CCQ90046.1| conserved hypothetical protein, contains pentapeptide repeats
           [Nitrospina gracilis 3/211]
          Length = 289

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 64/136 (47%), Gaps = 31/136 (22%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN-----GA------------ 150
           S A+F  A L++A     N  R+ F  A M E++ +G +FN     GA            
Sbjct: 100 SGAKFHQALLKRAQFEGANLVRSEFLEAQMNEANLAGVRFNKSDLRGAMMIGINLAGAQI 159

Query: 151 ---YLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDL 197
              +L K    K + TG D     L+ + +   VL E N  NA+L RT      LT ++L
Sbjct: 160 PQSHLSKTNISKGDLTGTDVSGCNLTGSDLREAVLRETNFQNAILDRTFLKGADLTGANL 219

Query: 198 GGAIIEGADFSDAVID 213
            GA + GADF++ V+D
Sbjct: 220 TGARLRGADFAETVLD 235



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 16/117 (13%)

Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
           +  +F  +DLR A+ +  N            + N +  D+  +D SG    G+ L +AV 
Sbjct: 135 AGVRFNKSDLRGAMMIGINLAGAQIPQSHLSKTNISKGDLTGTDVSGCNLTGSDLREAVL 194

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
            + NF  A     ++DR  L  A+LT A L    L  +D    +++GA+FS A + L
Sbjct: 195 RETNFQNA-----ILDRTFLKGADLTGANLTGARLRGADFAETVLDGANFSGADLSL 246



 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 10/84 (11%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R N +  + R +D SG+KF+ A L+     +A F GA+L  +      +NEANL     V
Sbjct: 86  RTNLSGVNFRNTDLSGAKFHQALLK-----RAQFEGANLVRSEFLEAQMNEANLAG---V 137

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
           R    +SDL GA++ G + + A I
Sbjct: 138 R--FNKSDLRGAMMIGINLAGAQI 159


>gi|428224453|ref|YP_007108550.1| heat shock protein DnaJ domain-containing protein [Geitlerinema sp.
           PCC 7407]
 gi|427984354|gb|AFY65498.1| heat shock protein DnaJ domain protein [Geitlerinema sp. PCC 7407]
          Length = 297

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           N + A++ E DFSG   + A L +A       +K N +GA+LS   + R  L +ANL NA
Sbjct: 183 NLSGANLAEKDFSGRNLSNADLSQADLSDTFLHKVNLSGANLSGAKLFRANLLQANLRNA 242

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
            L    L  +DL GA + GAD + A +  A +
Sbjct: 243 NLQNANLVGADLSGADLTGADLTGARVGTADR 274


>gi|440681919|ref|YP_007156714.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428679038|gb|AFZ57804.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 269

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 69/141 (48%), Gaps = 18/141 (12%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFS-----GSKFNGAYLEKAVAYK 159
           A   +A+L +A  ++ N        ANF+ AD+ ++D S     G+  + A L  AV   
Sbjct: 69  ANLTNANLSQAKLIEANLSQANLSIANFSGADLTQADLSQVNLIGANLSDANLRNAVITD 128

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           AN  G D S+      +LN+A+L  A L+R+ L+ ++L GA +  AD S+A  +L   + 
Sbjct: 129 ANLIGTDFSNA-----ILNDADLAAAKLIRSNLSFANLIGANLIAADLSEA--NLYDAEV 181

Query: 220 LCKYANGTNPITGVSTRKSLG 240
           +  Y    N      TR  LG
Sbjct: 182 MTAYLYKANLSKANLTRVHLG 202



 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADL  A  ++ N        AN  +AD+ E++   ++   AYL KA   KAN 
Sbjct: 137 SNAILNDADLAAAKLIRSNLSFANLIGANLIAADLSEANLYDAEVMTAYLYKANLSKANL 196

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           T   L  + + +  L+EANLTNA L  + L  ++L GA ++ A+   A
Sbjct: 197 TRVHLGSSYLFKANLSEANLTNADLSWSNLRYANLAGANLQRANLRGA 244



 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 12/118 (10%)

Query: 99  AETRGEFGIGSAAQ---FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKA 155
           A  +GE   G+  Q   F   DL  A+ V    R N   A++  ++ S +K   A L +A
Sbjct: 34  ANLQGENLRGANLQGVNFTKVDLSHALLV----RTNLMFANLTNANLSQAKLIEANLSQA 89

Query: 156 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
               ANF+GADL+   + ++ L  ANL++A L   V+T ++L      G DFS+A+++
Sbjct: 90  NLSIANFSGADLTQADLSQVNLIGANLSDANLRNAVITDANL-----IGTDFSNAILN 142



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           E D S +   G  L  A     NFT  DLS  L+ R  L  ANLTNA L +  L  ++L 
Sbjct: 28  EIDLSTANLQGENLRGANLQGVNFTKVDLSHALLVRTNLMFANLTNANLSQAKLIEANLS 87

Query: 199 GAIIEGADFSDAVIDLAQ 216
            A +  A+FS A  DL Q
Sbjct: 88  QANLSIANFSGA--DLTQ 103



 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 53/103 (51%), Gaps = 6/103 (5%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADL +A ++  E   A    A++ +++ +      +YL KA   +AN T ADLS 
Sbjct: 164 ANLIAADLSEANLYDAEVMTAYLYKANLSKANLTRVHLGSSYLFKANLSEANLTNADLSW 223

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +      L  ANL  A L R  L  ++L GA ++GA+  D ++
Sbjct: 224 S-----NLRYANLAGANLQRANLRGANLQGANLKGANLQDTIM 261


>gi|75906828|ref|YP_321124.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75700553|gb|ABA20229.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 727

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A  + A++ ++D+  S  +GA LE+A     N + ADLS T M   +L  A L NA L  
Sbjct: 596 AQLSFANLTKTDWQSSDLSGADLERA-----NLSNADLSATRMTGAILRSAQLENANLRN 650

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+  DL GA + GADF D ++
Sbjct: 651 ADLSLVDLRGANVAGADFKDTIL 673



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)

Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 172
           F SA++ ++ F GS+F                A L +A   +ANFT A+LS  LM     
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRLDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528

Query: 173 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 210
            R  LN ANL+NA L+        L  +DL G ++E     GAD  DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 25/137 (18%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
           A+ADL++ + +          A F  A+L + +  + +       RAN ++A +  ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 193
            ++  GA L   V   A+ TGADL D  +           R++   A L+ A L +T   
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609

Query: 194 RSDLGGAIIEGADFSDA 210
            SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626


>gi|425454434|ref|ZP_18834174.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9807]
 gi|389804880|emb|CCI15729.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9807]
          Length = 962

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A LR A+    N + AN   A+++E++   + F GA L +A   +AN  GA+L +
Sbjct: 798 ANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGAILAEANLERANLYGANLGE 857

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++   L  ANL  A L R  L  + L GA +E A+   A +
Sbjct: 858 ANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFL 900



 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 39/117 (33%), Positives = 58/117 (49%), Gaps = 9/117 (7%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           E  I   A    A+L++A ++KE   AN   A++ E+ F G+    A LE+A  Y AN  
Sbjct: 801 EGAILRGAILEGANLKEA-NLKE---ANLKEANLEEAFFEGAILAEANLERANLYGANLG 856

Query: 164 GADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
            A+L +       ++   L  ANL  A L+   L R++L GA + GA    A I+ A
Sbjct: 857 EANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQWADIERA 913



 Score = 46.6 bits (109), Expect = 0.013,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 45/87 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   A++ E++   +   GA LE+A   +AN  GA L    ++R  L  A L  A L 
Sbjct: 847 RANLYGANLGEANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQ 906

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
              + R++L GA +E A F  A ++ A
Sbjct: 907 WADIERANLDGANLETASFYGANLERA 933



 Score = 45.4 bits (106), Expect = 0.026,   Method: Composition-based stats.
 Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 1/100 (1%)

Query: 117 DLRKAVHV-KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DL+  + + ++ ++AN   A++  +   G+   GA L++A   +AN   A+L +   +  
Sbjct: 779 DLKNCLLICRDLYKANLERANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGA 838

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           +L EANL  A L    L  ++L  A + GA+  +A ++ A
Sbjct: 839 ILAEANLERANLYGANLGEANLEEAFLAGANLEEAFLERA 878



 Score = 43.9 bits (102), Expect = 0.070,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
            A  G A+L +A        AN   A +  ++  G+   GA+LE+A    A   GA L  
Sbjct: 852 GANLGEANLEEAFLAG----ANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQW 907

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             ++R  L+ ANL  A      L R++L  A + GA+F DA
Sbjct: 908 ADIERANLDGANLETASFYGANLERANLERANLVGANFKDA 948


>gi|220906448|ref|YP_002481759.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219863059|gb|ACL43398.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 309

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 56/115 (48%), Gaps = 10/115 (8%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYL 152
           L +YEA  R   GI         +LR A   + N RA N   A + +++F G+   GA L
Sbjct: 134 LQRYEAGERNFQGI---------NLRGAQLNQLNLRAINLEQAQLEDANFQGTVLEGANL 184

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            +A   +AN  GA L  + +D   L  A+L  A L  T L R++L  A + G +F
Sbjct: 185 RQANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNF 239



 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 53/126 (42%), Gaps = 16/126 (12%)

Query: 116 ADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+LR+A   + N +           AN TSAD+  +    +  + A L  A     NF  
Sbjct: 182 ANLRQANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNFWL 241

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
           ADL      +  L  ANL    + R     ++L G  + GAD  DA+ D    Q  C + 
Sbjct: 242 ADLQSVNFTQANLTGANLGGTDVSRANFKAANLTGVNLSGADRRDAIYD----QFTC-FP 296

Query: 225 NGTNPI 230
            G NP+
Sbjct: 297 EGFNPL 302


>gi|17228308|ref|NP_484856.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
           7120]
 gi|535436|gb|AAB59979.1| HglK [Nostoc sp. PCC 7120]
 gi|17130158|dbj|BAB72770.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
           7120]
 gi|1585247|prf||2124368C hglK gene
          Length = 727

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A  + A++ ++D+  S  +GA LE+A     N + ADLS T M   +L  A L NA L  
Sbjct: 596 AQLSFANLTKTDWQSSDLSGADLERA-----NLSNADLSATRMTGAILRSAQLENANLRN 650

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+  DL GA + GADF D ++
Sbjct: 651 ADLSLVDLRGANVAGADFKDTIL 673



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)

Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 172
           F SA++ ++ F GS+F                A L +A   +ANFT A+LS  LM     
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRWDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528

Query: 173 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 210
            R  LN ANL+NA L+        L  +DL G ++E     GAD  DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 25/137 (18%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
           A+ADL++ + +          A F  A+L + +  + +       RAN ++A +  ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 193
            ++  GA L   V   A+ TGADL D  +           R++   A L+ A L +T   
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609

Query: 194 RSDLGGAIIEGADFSDA 210
            SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626


>gi|428211194|ref|YP_007084338.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|427999575|gb|AFY80418.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 190

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+  +D + +   GA L  A  +  +FTGA+L      R +L +A L +A LVR
Sbjct: 85  ANLSGADLSGADLTDADLGGADLSYATLHYTDFTGANLF-----RAMLVDAKLNHAKLVR 139

Query: 190 TVLTRSDLGGAIIEGADFS 208
             L  ++L GAI+EGA FS
Sbjct: 140 VRLRSANLNGAIVEGAIFS 158


>gi|381204220|ref|ZP_09911291.1| hypothetical protein SclubJA_01165 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 155

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 45/74 (60%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           FR NF  +D+ +++ S +    + L++ +   AN  GADL+ T + + +L EANLT A++
Sbjct: 65  FRTNFYKSDLTDANLSETNLVRSNLKQTILQGANLQGADLTRTDLRKAILFEANLTGALI 124

Query: 188 VRTVLTRSDLGGAI 201
             T LT + L GAI
Sbjct: 125 KDTKLTGTVLKGAI 138



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 44/81 (54%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            T A + + +  G+   G+ L +   YK++ T A+LS+T + R  L +  L  A L    
Sbjct: 44  LTKAKLSKMELQGANLRGSNLFRTNFYKSDLTDANLSETNLVRSNLKQTILQGANLQGAD 103

Query: 192 LTRSDLGGAIIEGADFSDAVI 212
           LTR+DL  AI+  A+ + A+I
Sbjct: 104 LTRTDLRKAILFEANLTGALI 124


>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
 gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
          Length = 319

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 16/118 (13%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           Q   ADLR       +FR  +F+ A++RE DF+G+    AYL +A     N TGA+L  T
Sbjct: 25  QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLTGANLEGT 84

Query: 171 LMDRMVLNEAN-----LTNAVLVRTVLTRSD----------LGGAIIEGADFSDAVID 213
            + ++ L +AN      + A L    LT+SD          L G  + GA   DA  D
Sbjct: 85  SLIKIYLIKANCYQTDFSGAYLTGAYLTKSDFKEAKFNGAYLNGTKLSGAKLGDAYYD 142


>gi|451338330|ref|ZP_21908865.1| hypothetical protein C791_5803 [Amycolatopsis azurea DSM 43854]
 gi|449419237|gb|EMD24783.1| hypothetical protein C791_5803 [Amycolatopsis azurea DSM 43854]
          Length = 424

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 67/138 (48%), Gaps = 17/138 (12%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEANLT 183
           R N   AD+  SD SG     A L +A    A+ +GADL+     D  +D +VL    LT
Sbjct: 269 RINLRGADLAGSDLSGINLTSAILNEANLVGADLSGADLTNADLADAKLDGIVLRRTTLT 328

Query: 184 NAVLVRTVLTRS-----DLGGAIIEGADFSDAVIDLA---QKQALCKYANGTNP-ITGVS 234
             VL RT L+       +L GA +EG + S A  DLA    + A+ + AN T   +TG  
Sbjct: 329 GVVLDRTDLSEQALPGLNLVGAHLEGTNLSRA--DLAGVILRDAVLRGANLTEADLTGAD 386

Query: 235 TRK-SLGCGNSRRNAYGS 251
            R  +L   ++ R  +GS
Sbjct: 387 LRNVTLRTVDTTRTIFGS 404


>gi|428308662|ref|YP_007119639.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428250274|gb|AFZ16233.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 360

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 11/125 (8%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
           L  +N   A  +G + I   A  G ADLR A             AD+ E+D S +K N A
Sbjct: 165 LGRVNLSHANLKGAYLI--RAYLGGADLRCA---------EIDGADLTEADLSEAKLNCA 213

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L       AN + ADLSD  + R  L+ A+L  A L    L R++L GA + GAD S A
Sbjct: 214 KLRGTNLKAANLSLADLSDVNLIRANLSSADLMRANLRDADLIRTNLSGADLRGADLSLA 273

Query: 211 VIDLA 215
            + LA
Sbjct: 274 DLSLA 278



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 16/118 (13%)

Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A     DL +A  ++ +FR           A+   AD+R +D  G+  + A L  A    
Sbjct: 43  ADLSGTDLSEADLIEVDFRGCNLRGTHLKGAHLQGADLRGADLRGAHLDNANLRGANLRG 102

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDAVI 212
           AN  GADL  T ++   L++ NL+  +L    L+R+DL GA I     +G   SDA +
Sbjct: 103 ANLRGADLQSTELNSANLSDTNLSETILCSANLSRADLRGADIRDSNLQGVSLSDAKL 160



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 65/144 (45%), Gaps = 28/144 (19%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK----------AVAYK 159
           A    ADLR A     N R AN   A++R +D   ++ N A L            A   +
Sbjct: 78  ADLRGADLRGAHLDNANLRGANLRGANLRGADLQSTELNSANLSDTNLSETILCSANLSR 137

Query: 160 ANFTGADLSDTLMD---------------RMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           A+  GAD+ D+ +                R+ L+ ANL  A L+R  L  +DL  A I+G
Sbjct: 138 ADLRGADIRDSNLQGVSLSDAKLRGANLGRVNLSHANLKGAYLIRAYLGGADLRCAEIDG 197

Query: 205 ADFSDAVIDLAQKQALCKYANGTN 228
           AD ++A  DL++ +  C    GTN
Sbjct: 198 ADLTEA--DLSEAKLNCAKLRGTN 219



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 50/125 (40%), Gaps = 29/125 (23%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G+  A  F  ADL            + + AD+ E DF G    G +L+ A    A+  GA
Sbjct: 33  GLNLAEDFAEADLSGT---------DLSEADLIEVDFRGCNLRGTHLKGAHLQGADLRGA 83

Query: 166 DLSDTLMDR--------------------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           DL    +D                       LN ANL++  L  T+L  ++L  A + GA
Sbjct: 84  DLRGAHLDNANLRGANLRGANLRGADLQSTELNSANLSDTNLSETILCSANLSRADLRGA 143

Query: 206 DFSDA 210
           D  D+
Sbjct: 144 DIRDS 148


>gi|407782050|ref|ZP_11129265.1| hypothetical protein P24_07514 [Oceanibaculum indicum P24]
 gi|407206523|gb|EKE76474.1| hypothetical protein P24_07514 [Oceanibaculum indicum P24]
          Length = 422

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 11/105 (10%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADL  A  V+ N        AN +  ++R +   G+  +GA L  A    AN TG
Sbjct: 136 ANMSGADLSNATMVEANLESALLCGANLSGVNLRGAQLEGADLSGANLTGANLADANLTG 195

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
            +L+  ++ R      N+  A + R +LT  DLGGA + GA+ ++
Sbjct: 196 VNLTGAVISR-----TNMARAEMNRAILTNVDLGGADLTGANMAE 235



 Score = 43.9 bits (102), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 8/110 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADLR A H++    ANFT A+++E+D  G    GA +    A  A    ADLS
Sbjct: 73  SNAVLHRADLRGA-HLRN---ANFTGANLKEADLRG----GALISGNPANPATMLRADLS 124

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
              MD  +L  AN++ A L    +  ++L  A++ GA+ S   +  AQ +
Sbjct: 125 FAEMDAAMLQSANMSGADLSNATMVEANLESALLCGANLSGVNLRGAQLE 174



 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 53/115 (46%), Gaps = 17/115 (14%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKF-NGAYLEKAVAYKANFTGAD 166
           S A    ADL  AV    N   AN T+A +R +  +  +  N         + AN  GAD
Sbjct: 308 SEANLEGADLEGAVMDGVNLSNANMTAARLRGATLASVEIKNSDGKPTGRLWPANLAGAD 367

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQ 216
           LS           A+LTNA+L    L ++DL GA +      GA+  DAVID AQ
Sbjct: 368 LS----------RADLTNAILSGANLAKTDLTGAKLHNTNLIGANLRDAVIDPAQ 412



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 67/148 (45%), Gaps = 6/148 (4%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           N  EA+ RG   I       +  LR  +   E   A   SA+M  +D S +    A LE 
Sbjct: 96  NLKEADLRGGALISGNPANPATMLRADLSFAEMDAAMLQSANMSGADLSNATMVEANLES 155

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A+   AN +G +L    ++   L+ ANLT A L    LT  +L GA+I   + + A ++ 
Sbjct: 156 ALLCGANLSGVNLRGAQLEGADLSGANLTGANLADANLTGVNLTGAVISRTNMARAEMN- 214

Query: 215 AQKQALCKYANGTNPITGVS---TRKSL 239
             +  L     G   +TG +   TR++L
Sbjct: 215 --RAILTNVDLGGADLTGANMAETRRAL 240


>gi|409993775|ref|ZP_11276905.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
           Paraca]
 gi|291572160|dbj|BAI94432.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409935380|gb|EKN76914.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
           Paraca]
          Length = 741

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+LR       N R      A+   AD+R +D  G+   GA L +A  Y+AN T 
Sbjct: 576 ANLAHANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITE 635

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + +   + R+  N ++L +A L+R  L++S L  A + GA+ S +
Sbjct: 636 GNFNGAKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQS 681



 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN------FRANFTSADMRESDFSG 144
           L  +N   A  RG  G    A    ADLR A     N      +RANF  A++ E +F+G
Sbjct: 583 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITEGNFNG 640

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           +K       ++    A     DLS + +    L  ANL+ + L    LTR+DL      G
Sbjct: 641 AKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGADLTRADLSNVKFTG 700

Query: 205 ADFSDAVI 212
           AD S  +I
Sbjct: 701 ADLSCTLI 708



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 4/111 (3%)

Query: 95  NKYEAE-TRGEFGIGSAAQ--FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
           N Y+A  T G F      +  F  +DLR A  ++ +  ++   SA +R ++ S S   GA
Sbjct: 627 NFYQANITEGNFNGAKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGA 686

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
            L +A      FTGADLS TL+    L+ A+L NA L +  L  S+  G I
Sbjct: 687 DLTRADLSNVKFTGADLSCTLIRHANLSGADLRNAKLEKANLFGSNTVGCI 737



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)

Query: 111 AQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           +QF   DLR    K V++K   + + T ADMRE +  G       L      KAN + A 
Sbjct: 431 SQFQGLDLRQTNLKGVNLK---KMDLTGADMREKNLEGMSLIQLDLRLVNLAKANLSHAI 487

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
           L+ + +    L  ANL  A LV+  L R+DL    +  A  + A +  A  ++ C
Sbjct: 488 LNGSKLAVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSAC 542



 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 11/113 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 163
           + A    A+L++A  VK + R A+    ++  +  + +K   A L  A   +AN      
Sbjct: 494 AVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSACLIEANLMAASL 553

Query: 164 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                 GADLS+  ++   LN+ANL +A L    L  ++L G  +EGA    A
Sbjct: 554 EGCDLKGADLSNANLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 606


>gi|428311553|ref|YP_007122530.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428253165|gb|AFZ19124.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 234

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 11/103 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVA 157
           S A    ADLR+A     N             AN + AD+R+++  G+K + A L     
Sbjct: 33  SGANLSEADLREANLSGANLSGADLIGSSLTDANLSDADLRDANLIGAKLSVAILSNVNL 92

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
             AN +GA+LS   ++  +L  ANL  A+L+R  L  ++L GA
Sbjct: 93  VGANLSGAELSGANLNEAMLGAANLIGAILIRAKLHAANLNGA 135



 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 9/100 (9%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A  G+A+L  A+ +    RA   +A++  ++ S S   GA L  A    AN +GA+L + 
Sbjct: 110 AMLGAANLIGAILI----RAKLHAANLNGANLSISNLIGANLSGANLIGANLSGANLIEA 165

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                 LN ANL  A L R  L  + L GA +  ADFSDA
Sbjct: 166 -----NLNGANLNGARLYRANLAHAKLNGANLSNADFSDA 200



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 14/96 (14%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+RE++ SG+  +GA L  +    AN + ADL D          ANL  A L  
Sbjct: 35  ANLSEADLREANLSGANLSGADLIGSSLTDANLSDADLRD----------ANLIGAKLSV 84

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
            +L+  +L GA + GA+ S A ++    +A+   AN
Sbjct: 85  AILSNVNLVGANLSGAELSGANLN----EAMLGAAN 116



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 5/117 (4%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           E  +G+A   G+  +R  +H      AN + +++  ++ SG+   GA L  A   +AN  
Sbjct: 109 EAMLGAANLIGAILIRAKLHAANLNGANLSISNLIGANLSGANLIGANLSGANLIEANLN 168

Query: 164 GADLSDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           GA+L+   + R       LN ANL+NA      L ++DL  A +E A+    ++++A
Sbjct: 169 GANLNGARLYRANLAHAKLNGANLSNADFSDANLAKTDLTDANLENANLEGTILNVA 225


>gi|428314300|ref|YP_007125277.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255912|gb|AFZ21871.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 355

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 54/112 (48%), Gaps = 8/112 (7%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A   +ADLR A   K         RA+ T A + E+D SG+  +GA L  A    A   G
Sbjct: 61  ANLSNADLRVANFTKAQLIETTLSRADLTQAILSEADLSGAILSGALLSGADLKGATLIG 120

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             L   L+    L + NLT A L R +L ++DL  AI+  A   +A  DL++
Sbjct: 121 VSLIGALIKGAKLTKVNLTGATLSRAILVQADLKKAILNRAILGEA--DLSE 170



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 41/82 (50%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF+ A +   D SGS  N   L  A    ANFT   L    +    L  AN T A L+ T
Sbjct: 22  NFSGAKLSGVDLSGSNLNRINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIET 81

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L+R+DL  AI+  AD S A++
Sbjct: 82  TLSRADLTQAILSEADLSGAIL 103



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 21/126 (16%)

Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
           S A    ADLR+A     +            RAN T   +++++  G++ + A L KA  
Sbjct: 214 SGANLSGADLREANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKANLHKANL 273

Query: 158 YKANFTGADLSDTLMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADF 207
            KAN +GA+L +  +    L++AN          LTNA L  T L  ++L GA +EGA+ 
Sbjct: 274 SKANLSGANLLEANLLDANLSQANLLRSGLLLTYLTNANLSSTNLNEANLIGANLEGANL 333

Query: 208 SDAVID 213
           S+A ++
Sbjct: 334 SEASLE 339



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 60/113 (53%), Gaps = 7/113 (6%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R N +SA +  ++F+ +K   A L  A    ANFT A L +T + R     A+LT A+L 
Sbjct: 40  RINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIETTLSR-----ADLTQAILS 94

Query: 189 RTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNP-ITGVSTRKSL 239
              L+ + L GA++ GAD   A +I ++   AL K A  T   +TG +  +++
Sbjct: 95  EADLSGAILSGALLSGADLKGATLIGVSLIGALIKGAKLTKVNLTGATLSRAI 147



 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 6/103 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A L +A   + N R AN   AD+ E+D  G+  +GA L       AN +GADL
Sbjct: 169 SEANLSGASLVRAYLNRVNLRQANLEEADLSEADLKGANLSGANLS-----GANLSGADL 223

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            +  +    L+ A+L  A L R  LT   L  A + GA+ S A
Sbjct: 224 REANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKA 266


>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 215

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 54/107 (50%), Gaps = 1/107 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L  A   K N   AN + AD+ ESD S +   GA L  A    A+ +GADL  
Sbjct: 15  ANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQNADLSGADLRS 74

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             + R  L+EANL +A L    L  +DL GA + GA+   A + +A 
Sbjct: 75  ADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIAN 121



 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 46/81 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + AD+R ++ S +  +GA L+KA    AN + ADLS++ +    L  A L NA L  
Sbjct: 5   ADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQN 64

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ +DL  A +  AD S+A
Sbjct: 65  ADLSGADLRSADLFRADLSEA 85



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 40/76 (52%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            ++AD+  +D  G+  + A L+ A   KAN  GA+LS+  +    L+ A+L  A L    
Sbjct: 2   LSNADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNAT 61

Query: 192 LTRSDLGGAIIEGADF 207
           L  +DL GA +  AD 
Sbjct: 62  LQNADLSGADLRSADL 77


>gi|427707050|ref|YP_007049427.1| RDD domain-containing protein [Nostoc sp. PCC 7107]
 gi|427359555|gb|AFY42277.1| RDD domain containing protein [Nostoc sp. PCC 7107]
          Length = 711

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 46/78 (58%), Gaps = 5/78 (6%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           A++ ++D+ G+  +GAYL+ A     N + A+LS + M   VL  A L NA L    L+ 
Sbjct: 585 ANLTKTDWQGADLSGAYLDHA-----NLSNANLSTSRMTGAVLRSAQLENADLRNADLSF 639

Query: 195 SDLGGAIIEGADFSDAVI 212
           +DL GA + GADF D ++
Sbjct: 640 ADLRGANVAGADFKDTIL 657



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 47/87 (54%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN  SA +  ++ S ++  GA L+ AV   A+ TGADL D  ++   L  A L   + +
Sbjct: 519 RANLESARLIGANLSSAQLVGADLQGAVLENASLTGADLGDAKLNEANLYAARLGRVIAI 578

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
            T L+ ++L     +GAD S A +D A
Sbjct: 579 GTQLSFANLTKTDWQGADLSGAYLDHA 605



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 51/113 (45%), Gaps = 29/113 (25%)

Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 172
             SAD+ ++ F GS+F                A L +    +AN T A+LS  L+     
Sbjct: 453 LKSADLNQASFKGSRFRSVGEDGRWDTYDDAIADLTQVQMKQANLTDANLSRVLLTGSDL 512

Query: 173 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLA 215
            R  LN ANL +A L+        L  +DL GA++E     GAD  DA ++ A
Sbjct: 513 SRASLNRANLESARLIGANLSSAQLVGADLQGAVLENASLTGADLGDAKLNEA 565


>gi|381206177|ref|ZP_09913248.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 210

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 71/145 (48%), Gaps = 7/145 (4%)

Query: 98  EAETRGEFGIGS---AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE 153
           EA+  G   +G+   +     A L++A     N   AN + A++ E++  G+   G  L 
Sbjct: 42  EADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANLSEANLSEANLFGANLTGTNLT 101

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           +A   +A+ + ADLS+  +    L+EAN + A L RT L  ++L  A + GAD   A  D
Sbjct: 102 EANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSA--D 159

Query: 214 LAQKQALCKYANGTNPITGVSTRKS 238
           L +   +  Y N  N + G   RK+
Sbjct: 160 LREAVLVAAYLNEAN-LDGADMRKA 183



 Score = 41.2 bits (95), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 48/98 (48%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           +   DL K +   E  + + + AD+  S   G+      L  A   +AN T A+LS+  +
Sbjct: 21  YDRKDLDKLLSTSECVKCDLSEADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANL 80

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
               L+EANL  A L  T LT ++L  A +  AD S+A
Sbjct: 81  SEANLSEANLFGANLTGTNLTEANLSEADLSWADLSEA 118



 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 1/95 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    ADL  A   + N   AN + A+  +++ S +      L+KA    A+   ADL
Sbjct: 101 TEANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSADL 160

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
            + ++    LNEANL  A + +  L R+ +GGAI+
Sbjct: 161 REAVLVAAYLNEANLDGADMRKANLYRASMGGAIL 195


>gi|300867251|ref|ZP_07111911.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334728|emb|CBN57077.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 520

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 10/117 (8%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
           ADLN+  A+ RG           +A+LR+A   + N   A+   A++R +D +G+   GA
Sbjct: 165 ADLNR--ADLRG-------VNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLTGA 215

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            L++A    AN  GA+LS   +   +L  A+LT A L+R     +DL GA + GA  
Sbjct: 216 DLDEARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKL 272



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 47/80 (58%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N + A++ + +F  +K N A L  A   +AN +GA L+   + R  LN A+L+ A L+R 
Sbjct: 46  NMSGANLSDVNFRKAKLNVARLSGANLSRANLSGAILNVANLIRADLNSADLSEATLIRA 105

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L R+D+  A + GA+ S+A
Sbjct: 106 ELIRADMSNASLSGANLSEA 125



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 4/110 (3%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A LR +  V  N  RAN   AD+  +D  G   + A L +A   +AN +GADL  
Sbjct: 140 ADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRG 199

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 216
             +    LN A+LT A L    L+ ++L GA +  A+  +A++   DL Q
Sbjct: 200 ANLRWADLNGADLTGADLDEARLSGANLYGANLSSANLLNAILVHADLTQ 249



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 11/133 (8%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFT 163
           +A    A+L +A   + + R  N ++A++R+++ S     G+   GA L  A    A+ T
Sbjct: 154 SANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLT 213

Query: 164 GADLSDTLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           GADL +  +         L+ ANL NA+LV   LT+++L  A   GAD + A +  A+  
Sbjct: 214 GADLDEARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKLY 273

Query: 219 ALCKYANGTNPIT 231
            + ++    + IT
Sbjct: 274 GVSRFGLKADDIT 286



 Score = 37.4 bits (85), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 52/118 (44%), Gaps = 16/118 (13%)

Query: 109 SAAQFGSADLRKAV-HVKENFRANFTSADMRE----------SDFSGSKFNGAYLEKA-- 155
           S A    A+L  A+ +V    RA+  SAD+ E          +D S +  +GA L +A  
Sbjct: 68  SGANLSRANLSGAILNVANLIRADLNSADLSEATLIRAELIRADMSNASLSGANLSEADL 127

Query: 156 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                 +AN   ADLS   +    L  ANL  A L R  L R+DL G  +  A+   A
Sbjct: 128 REGTLRQANLEQADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQA 185


>gi|443475539|ref|ZP_21065485.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019605|gb|ELS33670.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 222

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 64/118 (54%), Gaps = 16/118 (13%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA----------VAYK 159
           A    A+L + + +  +F RA+ T A++ ++D   S  + A L KA          +A  
Sbjct: 88  ANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDSSIATD 147

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVI 212
           ANF+ A +++T +  +VL+ ANL+NA      +  + LT SDL GA   GAD S++V+
Sbjct: 148 ANFSNAIVTETTLKSIVLSRANLSNADFSNSKMRNSRLTNSDLRGAKFGGADLSNSVM 205



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 64/141 (45%), Gaps = 16/141 (11%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  F + D +K +        + + AD+   D  GS  NGA L  A     N +GA L+D
Sbjct: 28  AHAFVATDYQKLLITNACNNCDLSGADLSYKDLYGSALNGANLSGA-----NLSGALLND 82

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-----------KQ 218
           + +    L+ ANL+  +L+    +R+DL  A +  AD  ++++  A              
Sbjct: 83  SKLRGANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDS 142

Query: 219 ALCKYANGTNPITGVSTRKSL 239
           ++   AN +N I   +T KS+
Sbjct: 143 SIATDANFSNAIVTETTLKSI 163


>gi|427720942|ref|YP_007068936.1| RDD domain-containing protein [Calothrix sp. PCC 7507]
 gi|427353378|gb|AFY36102.1| RDD domain containing protein [Calothrix sp. PCC 7507]
          Length = 716

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 5/81 (6%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            + A++ ++D+ G+  +G YL+ A     N + A+LS T +   V+  ANL NA L    
Sbjct: 587 LSYANLTKTDWQGADLSGVYLDHA-----NLSNANLSATRLTGAVMRSANLENANLQNAD 641

Query: 192 LTRSDLGGAIIEGADFSDAVI 212
           L+ +DL GA + GADF  A++
Sbjct: 642 LSHADLQGANLAGADFRGAIL 662



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 29/113 (25%)

Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMD---- 173
           F SA++ +  F GS+F                A L +    +AN T A+LS  +M+    
Sbjct: 458 FKSANLSQGSFKGSRFRSPGEDGRWDTYDDVIADLSQVEMKQANLTDANLSRVVMNRSDL 517

Query: 174 -RMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLA 215
            R  LN ANL+N  L+      T L  +DL GA++E     GAD SDA ++ A
Sbjct: 518 SRATLNRANLSNTRLIAANLSSTQLVGADLTGAVLENASLTGADLSDAKLNEA 570



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 62/132 (46%), Gaps = 15/132 (11%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
            +ADL++ E +          A    A+L + V  + +       RAN ++  +  ++ S
Sbjct: 488 VIADLSQVEMKQ---------ANLTDANLSRVVMNRSDLSRATLNRANLSNTRLIAANLS 538

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
            ++  GA L  AV   A+ TGADLSD  ++   L  A+L     + T L+ ++L     +
Sbjct: 539 STQLVGADLTGAVLENASLTGADLSDAKLNEADLFAAHLGRVTAIGTQLSYANLTKTDWQ 598

Query: 204 GADFSDAVIDLA 215
           GAD S   +D A
Sbjct: 599 GADLSGVYLDHA 610


>gi|383312720|ref|YP_005365521.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
           str. GAT-30V]
 gi|378931380|gb|AFC69889.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
           str. GAT-30V]
          Length = 958

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 12/121 (9%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +  SADL KA   K N   A+ T+A +  +    +K + A LEKA A      G ++SD 
Sbjct: 555 KLKSADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVI-DLAQKQALCKYA 224
           +   +   EAN  NA++ R  LT++D   A++E AD      ++A+  ++  KQA  K A
Sbjct: 610 IAQNINAKEANFKNAIMQRADLTKADFTKAVLENADMQAVAAAEAIFKEVNLKQANLKAA 669

Query: 225 N 225
           N
Sbjct: 670 N 670


>gi|440233072|ref|YP_007346865.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
 gi|440054777|gb|AGB84680.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
          Length = 846

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 13/154 (8%)

Query: 71  FVSTALAAAVVASCS----SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
           F+ T L AA  +  S    S + + A+  +++  T       S +    AD   A   + 
Sbjct: 675 FMKTTLEAASFSGASLESCSWVESHAEQARFDGATLVTCAAASESVLNGADFSNATLKQC 734

Query: 127 NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
           N R       +R + F+ +K   + L +A    A+FT A+L  +L  R    +AN ++A 
Sbjct: 735 NLR----QTPLRGARFTLAKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFSDAN 790

Query: 187 LVRTVLTRSDLGGAIIEG-----ADFSDAVIDLA 215
           L+  +L +S LGGA   G     AD S A+ D A
Sbjct: 791 LMGAILQKSLLGGARFNGANLFRADLSQAITDDA 824



 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 38/170 (22%), Positives = 73/170 (42%), Gaps = 8/170 (4%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           L K  ++   + + + S      CS  +     A+     +    A + +V+     + +
Sbjct: 670 LHKTTFMKTTLEAASFSGASLESCSWVESHAEQARFDGATLVTCAAASESVLNGADFSNA 729

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RAN-----FTSADMRESDFS 143
            L   N  +   RG     + A+  ++DL +A     +F RAN     F  +D R+++FS
Sbjct: 730 TLKQCNLRQTPLRG--ARFTLAKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFS 787

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
            +   GA L+K++   A F GA+L    + + + ++A   N    + V T
Sbjct: 788 DANLMGAILQKSLLGGARFNGANLFRADLSQAITDDATSLNGAWTKRVKT 837


>gi|298249936|ref|ZP_06973740.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297547940|gb|EFH81807.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 170

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 51/89 (57%), Gaps = 5/89 (5%)

Query: 130 ANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN + AD+RES F      G+ F+ A L  A   KAN   A LSDT +   +L  A++++
Sbjct: 54  ANLSEADLRESLFIEADCGGANFHRARLNSANFQKANLRAAILSDTDLRNALLANADVSD 113

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A L  T+L  ++L  AI  GA F DA+++
Sbjct: 114 ADLRGTILAGANLEQAIFCGAVFKDAILN 142



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 15/97 (15%)

Query: 124 VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM----------D 173
           +    R N   A++R    +    +G YL     ++AN + ADL ++L            
Sbjct: 23  LHHEIRPNLAGANLRGWSLAHINLSGVYL-----HEANLSEADLRESLFIEADCGGANFH 77

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           R  LN AN   A L   +L+ +DL  A++  AD SDA
Sbjct: 78  RARLNSANFQKANLRAAILSDTDLRNALLANADVSDA 114


>gi|427415392|ref|ZP_18905576.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425756225|gb|EKU97081.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 389

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 3/111 (2%)

Query: 117 DLRKAVHVKE---NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           D++ A+ + E   ++  NF+S D+R  +FSG K  GA         A F   +L      
Sbjct: 189 DVQAALSIFERQLDYAPNFSSLDLRGLNFSGLKLEGAMFNHTRLNMAEFKKTNLKRASFQ 248

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
             +LN+A+  +AVL   +   + L GA++ GA  ++  +  AQ Q    Y+
Sbjct: 249 GAILNDAHFEDAVLTNALFMNAKLKGAVLNGAKLNEVWLTGAQLQGAHLYS 299



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 1/90 (1%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H + N  A F   +++ + F G+  N A+ E AV   A F  A L   +++   LNE  L
Sbjct: 229 HTRLNM-AEFKKTNLKRASFQGAILNDAHFEDAVLTNALFMNAKLKGAVLNGAKLNEVWL 287

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           T A L    L  ++L  A +  A+   AV+
Sbjct: 288 TGAQLQGAHLYSTNLHLAKLNSANLETAVL 317


>gi|428773363|ref|YP_007165151.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
           7202]
 gi|428687642|gb|AFZ47502.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
          Length = 319

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 46/83 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANFT AD+ E++ SG     A L +A    +N  G   ++    R  L EA+L N++L  
Sbjct: 135 ANFTRADLTEANLSGLNLMEADLTRANLSASNLQGCSFNEANFSRADLREADLKNSILEG 194

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L R++L  A + GA+FS AV+
Sbjct: 195 VFLHRANLSRANLRGANFSGAVL 217



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 75/169 (44%), Gaps = 35/169 (20%)

Query: 45  ESDGQFPDCSNNQCAGP---YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAET 101
            SD  + + S+   +G    +A L N R+ +S+ L  A++  C        DL       
Sbjct: 39  HSDLSWSNLSSTDLSGANFCHADLVNTRI-ISSRLIGALMQHC--------DL------- 82

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
              +G  S     S DL  A    +   AN ++  +  ++ +G+   GA L  A    AN
Sbjct: 83  --SYGDLSWTNLNSVDLSYA----DLSYANLSNTFLSNANLTGANLTGATLTGATLTGAN 136

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           FT ADL+          EANL+   L+   LTR++L  + ++G  F++A
Sbjct: 137 FTRADLT----------EANLSGLNLMEADLTRANLSASNLQGCSFNEA 175


>gi|254413837|ref|ZP_05027606.1| protein kinase domain [Coleofasciculus chthonoplastes PCC 7420]
 gi|196179434|gb|EDX74429.1| protein kinase domain [Coleofasciculus chthonoplastes PCC 7420]
          Length = 546

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 44/87 (50%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R +F S D+   D   S  +G    ++   + N  GADLS     +  L  ANL +A L 
Sbjct: 418 RRDFASHDLSGLDLQKSDLSGGIFYQSKLTRINLQGADLSSADFGQASLTRANLRDANLG 477

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
           R  L+ SDL GA + GAD S A ++ A
Sbjct: 478 RAYLSNSDLEGADLRGADLSFAYLNHA 504


>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
 gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
          Length = 251

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 63/227 (27%), Positives = 95/227 (41%), Gaps = 38/227 (16%)

Query: 33  PLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV--------------FVSTALAA 78
           P W  CQ       DG  P    + C+     L N  +              F S+ +A 
Sbjct: 22  PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74

Query: 79  AVVASCSSNISAL--ADLNKYEAET----RGEFGIG--SAAQFGSADLRKA--VHVKENF 128
           A ++    + +    ADL+K         R  FG    + A FGSAD+ ++    VK   
Sbjct: 75  ANLSETEVSRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNFAQVKAA- 133

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKA----VAYK-ANFTGADLSDTLMDRMVLNEANLT 183
            ANF+ +++  SDFSG+  +GA + KA    V ++ A   G D S + + R  L+  NL 
Sbjct: 134 GANFSKSELNRSDFSGADLSGANISKAELARVLFQSAKIAGVDFSYSNLSRSRLDGLNLQ 193

Query: 184 NAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKYANGTNP 229
                 + L  + +GGA + GA   +   ID+A   A  K     NP
Sbjct: 194 GVNFTGSYLYLTQIGGADLSGATGLTQEQIDIACGSAQTKLPPSINP 240


>gi|119485597|ref|ZP_01619872.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
 gi|119456922|gb|EAW38049.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
          Length = 253

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 59/120 (49%), Gaps = 16/120 (13%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
           LAD N YEA  R        A    ADLR+A    +  RA+ T AD+R++D   +     
Sbjct: 93  LADANLYEANLR-------YANLQGADLRQA----DLSRASLTRADLRKADLQDANLFKV 141

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              +A   +ANF  ADL      +  L +AN T+A L       SDL  A ++GADFS+A
Sbjct: 142 NFSEAYLSEANFENADLRQVTFFKANLADANFTDANLF-----GSDLRLANLKGADFSNA 196



 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 40/80 (50%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F A    A +R++D   +   GA L  A  ++AN   A+L +  +    L  A+L  A L
Sbjct: 59  FNAKLQGAILRDADLRSANLYGANLYVADLFRANLADANLYEANLRYANLQGADLRQADL 118

Query: 188 VRTVLTRSDLGGAIIEGADF 207
            R  LTR+DL  A ++ A+ 
Sbjct: 119 SRASLTRADLRKADLQDANL 138


>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
            2638]
 gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
          Length = 1277

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 38/155 (24%), Positives = 68/155 (43%), Gaps = 17/155 (10%)

Query: 70   VFVSTALAAAVVASCSSNISALADLNKYEAETRGE-------FGIGSAAQFGSADLRKAV 122
            +F       AV+   + +++ L   +  EAE +G         G    A F  ++++K++
Sbjct: 1045 IFKGAQFPKAVLRDTNFDMAILEKTDFSEAELKGARINMCMISGKADKADFSQSNIKKSI 1104

Query: 123  HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV-LNEAN 181
                     F ++ +  +DFS +  N +      A+K NFT A+L      R     +++
Sbjct: 1105 ---------FKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSD 1155

Query: 182  LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
              +A L    L  SD  G+   GADF + +ID +Q
Sbjct: 1156 FRHATLHGAALRESDFTGSDFRGADFENGLIDNSQ 1190



 Score = 42.0 bits (97), Expect = 0.27,   Method: Composition-based stats.
 Identities = 36/141 (25%), Positives = 57/141 (40%), Gaps = 28/141 (19%)

Query: 70   VFVSTALAAAVVASCSSNISALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVK 125
            +F +++L  A  +  S N S   D++ ++         +   G  + F  +D R      
Sbjct: 1104 IFKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSDFR------ 1157

Query: 126  ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
                A    A +RESDF+GS               +F GAD  + L+D   L  ANL   
Sbjct: 1158 ---HATLHGAALRESDFTGS---------------DFRGADFENGLIDNSQLVRANLNGV 1199

Query: 186  VLVRTVLTRSDLGGAIIEGAD 206
                   T+S+L GA +  A+
Sbjct: 1200 SAKGARFTKSNLEGASMRAAN 1220



 Score = 40.0 bits (92), Expect = 1.2,   Method: Composition-based stats.
 Identities = 35/129 (27%), Positives = 52/129 (40%), Gaps = 25/129 (19%)

Query: 109  SAAQFGSADLRKAVHVKENFR----------------ANFTSADMRESDFSGSKFNGAYL 152
            S      ADL K    K NF+                A+F+ A +R +D S   FN A  
Sbjct: 972  SGLDLSGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALF 1031

Query: 153  EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII---------E 203
             ++   +AN   A        + VL + N   A+L +T  + ++L GA I         +
Sbjct: 1032 VESDLSEANGAQAIFKGAQFPKAVLRDTNFDMAILEKTDFSEAELKGARINMCMISGKAD 1091

Query: 204  GADFSDAVI 212
             ADFS + I
Sbjct: 1092 KADFSQSNI 1100



 Score = 38.1 bits (87), Expect = 3.8,   Method: Composition-based stats.
 Identities = 36/123 (29%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 94   LNKYEAETRGEFGIGSAAQFG-SADLRKAVHVKENFR-----ANFTSADMRESDFSGSKF 147
            L K EA+   +      A+ G SAD  +A+  +E  R      +   A +   D SG   
Sbjct: 917  LKKLEAKELPDAAKAKLAEHGLSADSLRALTREEVQRYHEQGKSLVGAVLSGVDLSGLDL 976

Query: 148  NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            +GA L K    K NF GA L +    + +   A+ + A L R  L+R     A+   +D 
Sbjct: 977  SGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALFVESDL 1036

Query: 208  SDA 210
            S+A
Sbjct: 1037 SEA 1039


>gi|428223745|ref|YP_007107842.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427983646|gb|AFY64790.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 183

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 11/114 (9%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMR----------ESDFSGSKFNGAYLEKAVAYKAN 161
           F   DLR+A     N  A +  ++D+R          +++  G+K  GA +  A  Y+AN
Sbjct: 20  FDEIDLREANLFNANLEAVSLQNSDLRSTYLPYTNLNKANLQGAKLQGAEMSDAQLYQAN 79

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
             GADL  + + R  L  A+L  A L    L  +DL GA ++GA+  DA +  A
Sbjct: 80  LAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYGANLQGANLQDADLQRA 133



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 13/89 (14%)

Query: 128 FRANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           ++AN   AD+R S+ S           +   GA L+ A  Y AN  GA+L D  + R  L
Sbjct: 76  YQANLAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYGANLQGANLQDADLQRADL 135

Query: 178 NEANLTNAVLVRTVLTRS---DLGGAIIE 203
           ++A L   +L    L R+   D  GA ++
Sbjct: 136 DQATLKATILANANLFRAQNIDWTGAAVD 164


>gi|227496450|ref|ZP_03926734.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
 gi|226834032|gb|EEH66415.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
          Length = 222

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 1/104 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR+++  +   R AN   +DMR +D  G+   G +L       A+  GADL D
Sbjct: 98  ADMAGADLRRSILPRAELRNANLVDSDMRGADLRGADLRGTWLPYTDMRGADLAGADLRD 157

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             ++   L+ A+L ++ L    LT ++L  A + GAD   A ID
Sbjct: 158 ADLEGADLHGASLQSSDLRGADLTDAELTDADLRGADLRGADID 201



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 34/70 (48%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N    D+ ++D  G+  +GA L  +    A+ T ADL    + R VL  A LT A L + 
Sbjct: 34  NLRELDLTDADLRGANLDGADLSWSTLSTADLTDADLRGATLRRTVLTRAVLTRAALTQV 93

Query: 191 VLTRSDLGGA 200
               +D+ GA
Sbjct: 94  YARDADMAGA 103



 Score = 37.4 bits (85), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 32/60 (53%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D+R++D S        L  A    AN  GADLS + +    L +A+L  A L RTVLTR+
Sbjct: 24  DLRDTDLSNLNLRELDLTDADLRGANLDGADLSWSTLSTADLTDADLRGATLRRTVLTRA 83


>gi|427707611|ref|YP_007049988.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427360116|gb|AFY42838.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 521

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADLR+A   K N R AN + A ++ S  +G+    A L  A  ++ + +GA+L D
Sbjct: 120 ANLSNADLREATLRKANLRRANLSEASLKGSSLAGTNLEMANLNAADLHRTDLSGANLRD 179

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + +  L  ANL+ A L    L  +DL GA +  AD S A
Sbjct: 180 AELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGA 220



 Score = 43.9 bits (102), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L++      N   A+ + A++R +D SG+  + A L  A    AN  GA+L
Sbjct: 173 SGANLRDAELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGAKLSGANLMGANL 232

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S+  +       ANLT A L++     +DL GA + GA    A
Sbjct: 233 SNANLTNTSFVHANLTEATLIKAEWIGADLTGATLTGAKLHSA 275



 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 41/83 (49%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A +  +  SG  F GA +  A    AN   ADLS     R  L  A+L  A L+R
Sbjct: 55  ANLSHAKLNVARLSGVNFVGAIMNYASLNVANLIRADLS-----RAQLRGASLVRAELIR 109

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+R+DL  A +  AD  +A +
Sbjct: 110 AELSRADLFEANLSNADLREATL 132


>gi|428204342|ref|YP_007082931.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427981774|gb|AFY79374.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 203

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 6/97 (6%)

Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           DLR+A    EN        AN   AD+R+++ S +   GA L  A   +AN  GA+L+  
Sbjct: 30  DLREANLAGENLSGASLPWANCIKADLRKTNLSQANLGGADLRWANLEEANLEGANLNRA 89

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            + +  L+ ANLT   LV+  L +++L  A ++GAD 
Sbjct: 90  DLSQANLSRANLTQVKLVKADLRKTNLSEANLQGADL 126



 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A  G ADLR A   + N       RA+ + A++  ++ +  K   A L K    +AN 
Sbjct: 62  SQANLGGADLRWANLEEANLEGANLNRADLSQANLSRANLTQVKLVKADLRKTNLSEANL 121

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            GADL    +    L   NL+ A L      +++L  A +E AD + A
Sbjct: 122 QGADLRWANLGEANLERTNLSQANLQWVNFAKANLSEANLEDADLNQA 169



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 6/96 (6%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADLRK      N  +AN   AD+R ++   +   GA L +A   +AN + A+L+   + +
Sbjct: 54  ADLRKT-----NLSQANLGGADLRWANLEEANLEGANLNRADLSQANLSRANLTQVKLVK 108

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L + NL+ A L    L  ++LG A +E  + S A
Sbjct: 109 ADLRKTNLSEANLQGADLRWANLGEANLERTNLSQA 144



 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 6/97 (6%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    A+L +   VK + R      AN   AD+R ++   +      L +A     NF
Sbjct: 92  SQANLSRANLTQVKLVKADLRKTNLSEANLQGADLRWANLGEANLERTNLSQANLQWVNF 151

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
             A+LS+  ++   LN+ANLT A L  T    ++L G
Sbjct: 152 AKANLSEANLEDADLNQANLTEAKLKGTNFEGANLQG 188


>gi|428770347|ref|YP_007162137.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
           10605]
 gi|428684626|gb|AFZ54093.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
          Length = 278

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 11/129 (8%)

Query: 112 QFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           Q    DLR A     N    + + AD+R++D SG+  +  YL +A    AN TGA+L+  
Sbjct: 25  QLRRIDLRNAQLKGVNLGGCDLSYADLRDADLSGADLSKCYLNEANLSGANLTGANLTGA 84

Query: 171 LM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
            +           + ++ EA  T + L R    ++DL GA + GA  +  +   A     
Sbjct: 85  YLIKAYLTKVNFQKAIVKEAYFTGSFLTRANFYKADLSGAFLNGAHLNGGIFKDASYDNT 144

Query: 221 CKYANGTNP 229
            ++  G NP
Sbjct: 145 TRFDKGFNP 153


>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 479

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 11/105 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL ++     N  RA+ T A +RE++  G +F GA L++A   KAN  GA+L
Sbjct: 60  SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVGANL 119

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     +EANLT A L    L  S L GAI++ A +++  I
Sbjct: 120 ----------HEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154



 Score = 44.3 bits (103), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 16/103 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADLR+          AN + AD+R     ++D SG+K N A L KA   + N 
Sbjct: 352 SGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGAKLNEADLRKADLMRVNL 411

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            GADL+          EA+L++A L R  L  ++L G  ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 49/97 (50%), Gaps = 11/97 (11%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADLR     + +   AN   AD+RE+DF+G+    A L  A     + T ADLS     
Sbjct: 338 SADLRGVDLTRADLSGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGA--- 394

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              LNEA+L  A L+R      +L GA +  AD SDA
Sbjct: 395 --KLNEADLRKADLMRV-----NLEGADLTEADLSDA 424



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 45/84 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+ ES  + +    A L  AV  +AN  G + +   + +  L +ANL  A L  
Sbjct: 62  ANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVGANLHE 121

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
             LTR++L GA + G+  S A++D
Sbjct: 122 ANLTRANLSGADLRGSQLSGAILD 145



 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 15/94 (15%)

Query: 131 NFTSADMRESDFSGSKFNG---------------AYLEKAVAYKANFTGADLSDTLMDRM 175
           N T AD+  SD SG+  +                A L+KA    AN  G DL    +   
Sbjct: 270 NLTGADLNGSDLSGANLSASNLTSVNLKNVDLSRASLKKAYLKGANLEGTDLRGADLSGA 329

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           +L++ NL++A L    LTR+DL GA +  AD  +
Sbjct: 330 ILHQVNLSSADLRGVDLTRADLSGANLRDADLRE 363



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 37/158 (23%)

Query: 112 QFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAV--------- 156
           +F  A+L++A  +K N        AN T A++  +D  GS+ +GA L+KAV         
Sbjct: 98  EFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFPE 157

Query: 157 -----AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
                A  A            N    DL++  +    L   NL  A+L    L R++L G
Sbjct: 158 DIDPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLAG 217

Query: 200 AIIEGADFSDA-----VIDLAQKQALCKYANGTNPITG 232
           A +  AD  +A     +++ A       ++ G +P  G
Sbjct: 218 ANLSAADLREASLSGTILEKAVYSNKTLFSEGIDPALG 255



 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 14/95 (14%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           ADL  A+      + N +SAD+R  D + +  +GA L  A   + +FTGA    TL+   
Sbjct: 324 ADLSGAIL----HQVNLSSADLRGVDLTRADLSGANLRDADLRETDFTGA----TLL--- 372

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
               ANL+ A L    LT++DL GA +  AD   A
Sbjct: 373 ---FANLSGADLRGVDLTKADLSGAKLNEADLRKA 404



 Score = 37.0 bits (84), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+   A ++ ++  G+   GA L  A+ ++ N + ADL    + R  L+ ANL +A L 
Sbjct: 303 RASLKKAYLKGANLEGTDLRGADLSGAILHQVNLSSADLRGVDLTRADLSGANLRDADLR 362

Query: 189 RT-----VLTRSDLGGAIIEGADFSDA 210
            T      L  ++L GA + G D + A
Sbjct: 363 ETDFTGATLLFANLSGADLRGVDLTKA 389


>gi|304393841|ref|ZP_07375766.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
 gi|303294040|gb|EFL88415.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
          Length = 247

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G+G +   GS   R  +   +    +FT A+M  SDFSGS      + K+   +ANFTGA
Sbjct: 109 GVGLSKVEGS---RTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTGA 165

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           DLS  ++    ++ ANL +A L  T  + S +  A + G D S A   L Q+Q 
Sbjct: 166 DLSGAMITFANISRANLADAKLDGTDFSSSWMYLAKVAGVDMS-ATKGLTQEQV 218



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 51/118 (43%), Gaps = 21/118 (17%)

Query: 119 RKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLM 172
           R  +    NF  AN    D+  SD    KF+GA + K++  +AN +     G  LS    
Sbjct: 58  RNVILSGYNFSLANLNQTDLFGSDLRDVKFDGADMTKSILTRANLSNSSLKGVGLSKVEG 117

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIE---------------GADFSDAVIDLA 215
            R VL  ++ T+    +  + RSD  G+I++               GAD S A+I  A
Sbjct: 118 SRTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTGADLSGAMITFA 175


>gi|219849225|ref|YP_002463658.1| pentapeptide repeat-containing protein [Chloroflexus aggregans DSM
           9485]
 gi|219543484|gb|ACL25222.1| pentapeptide repeat protein [Chloroflexus aggregans DSM 9485]
          Length = 311

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 56/114 (49%), Gaps = 14/114 (12%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLRKA     N   A    A++R ++ S + F+GA L  A     N +GADL D
Sbjct: 89  ADLSDADLRKADLSWANLEFATLIGANLRGANLSAADFSGANLYGANLSLCNLSGADLRD 148

Query: 170 TLMDRMVLNE-------------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           T+M    L+E             ANL+ A+L+R  L  ++L GA + GA+   A
Sbjct: 149 TVMIGANLSEAQLREAQLVNLSGANLSGAILLRVSLNGANLNGANLAGANLMHA 202



 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++RE+        GA L +    +A+   AD SD  +  + L+ A+L NA+  R
Sbjct: 197 ANLMHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNAIFTR 256

Query: 190 TVLTRSDLGGAIIEGADF 207
             L+R++L GA + GA+ 
Sbjct: 257 ANLSRANLSGANLRGANL 274



 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++  ++ SG+  + A L +A    AN   ADLS   +    L+ ANL  A L  
Sbjct: 24  ANLSGANLSAANLSGANLSEAKLSRARLTDANLYRADLSICELGEANLSWANLREAKLNW 83

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
             L R+DL  A +  AD S A ++ A
Sbjct: 84  AQLVRADLSDADLRKADLSWANLEFA 109


>gi|428314592|ref|YP_007151039.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428256316|gb|AFZ22271.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 237

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 21/119 (17%)

Query: 113 FGSADLRKAVHVKENF----------------RANFTSADMRESDFSGSKFNGAYLEKAV 156
           F +A+LR AV V++N                   N +  D+  +D S +  NGA L +A 
Sbjct: 105 FANANLRCAVLVEQNLCQCNFSYVKLNFANLSGINLSGVDLTSADLSDACLNGANLSQAS 164

Query: 157 AYK-----ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            Y+     AN + A+L  T + +  LN+ANLT A L    L+ +DL GAI++ A  S A
Sbjct: 165 LYRTLLTRANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADLRGAILDEATLSGA 223



 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 34/61 (55%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN + A++R ++   +  N A L +A    AN + ADL   ++D   L+ ANLT A L 
Sbjct: 172 RANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADLRGAILDEATLSGANLTGAKLT 231

Query: 189 R 189
           +
Sbjct: 232 Q 232


>gi|440681606|ref|YP_007156401.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428678725|gb|AFZ57491.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 943

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 14/108 (12%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           +FG    A    ADL +A     NF R N + A M  ++FS + FN A L +A   +AN 
Sbjct: 808 DFG---GANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANL 864

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           TGADLS   ++   L+ A+L+ A          +L GA +E A+FS A
Sbjct: 865 TGADLSSADLNYADLSLADLSGA----------NLSGANLEDANFSGA 902



 Score = 44.7 bits (104), Expect = 0.040,   Method: Composition-based stats.
 Identities = 30/85 (35%), Positives = 40/85 (47%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-----DRMVLNEANLTNA 185
           N   AD+ E DF G+  + A L +A    ANF+  + S   M        + N ANL  A
Sbjct: 798 NLRGADLSEVDFGGANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEA 857

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
             +R  LT +DL  A +  AD S A
Sbjct: 858 NFIRANLTGADLSSADLNYADLSLA 882



 Score = 43.5 bits (101), Expect = 0.11,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A F  A+L +A  ++ N   A+ +SAD+  +D S +  +GA L  A    ANF+GA L
Sbjct: 845 SEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSGANLSGANLEDANFSGAKL 904

Query: 168 SDTLMDRMVLNE 179
           S+ L+  +  +E
Sbjct: 905 SNGLLGDICWDE 916


>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 213

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 60/112 (53%), Gaps = 17/112 (15%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   A++  ++F+GSKF GA+LE      AN  GA+L +T +       ANL  A L+
Sbjct: 31  RANLAGANLVGTNFAGSKFEGAHLE-----GANLMGANLKETDL------RANLMGANLM 79

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT----GVSTR 236
           +  LT +D+ G+ + GA+   AVI  ++      + +GTN I     GV  R
Sbjct: 80  QADLTGADVRGSNLRGANLMGAVI--SEVSFAGAFLSGTNLINVDLQGVDLR 129



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 19/122 (15%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----------E 179
           AN   AD+  +D  GS   GA L  AV  + +F GA LS T +  + L            
Sbjct: 76  ANLMQADLTGADVRGSNLRGANLMGAVISEVSFAGAFLSGTNLINVDLQGVDLRGADLRG 135

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DLAQKQALCKYANGTNPIT 231
           ANLT A L    L+R+DL GA++  A+  +A +        +LA    LC    G N + 
Sbjct: 136 ANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGANLAGANLLCAELEGAN-VN 194

Query: 232 GV 233
           GV
Sbjct: 195 GV 196



 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 43/85 (50%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
              +    D+R +D  G+   GA L+ A   +A+  GA LS+  ++   L +ANL+ A L
Sbjct: 119 INVDLQGVDLRGADLRGANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGANL 178

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVI 212
               L  ++L GA + G DF  A +
Sbjct: 179 AGANLLCAELEGANVNGVDFDRACL 203


>gi|428302093|ref|YP_007140399.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238637|gb|AFZ04427.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 146

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 71/143 (49%), Gaps = 18/143 (12%)

Query: 74  TALAAAVVASCSSNISALADLN---KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
           TAL  A   + S  ISA AD+    ++  ETR  +         + +LR A     N + 
Sbjct: 7   TALTIASTITLSLPISAQADMKSDVQHLLETRECY---------ACNLRGA-----NLKG 52

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   AD+R ++  G+   GA LE A    A+   A+LS   ++   LN ANLTNA L  
Sbjct: 53  AHLIGADLRNANLKGANLAGANLEGADLTGADLEEANLSYAFVNSTSLNYANLTNANLSN 112

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L  ++L GA++ GAD + A I
Sbjct: 113 AHLYSAELDGAVMVGADLAGADI 135


>gi|428213326|ref|YP_007086470.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428001707|gb|AFY82550.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 340

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 8/106 (7%)

Query: 109 SAAQFGSADLRKAVHVKENFR--ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           S A   SA+L  A  V++ F   A    A +R ++ S S   GA L      +A+ +GAD
Sbjct: 192 SGAVLNSANLSGA-SVRQAFLQGAQMEGASLRNTNMSTSNLRGALL-----TQADLSGAD 245

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           L D  M  +VLNEA L N  L    L  + L G I+ GAD   A++
Sbjct: 246 LLDADMQGVVLNEAILINTQLRNVQLQGASLEGTILSGADLEGAIL 291



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 45/83 (54%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F  N  +AD+ E++ SG+  +  Y+E+A    A   G++L+   +  + LN ANL+ AVL
Sbjct: 137 FLINLANADLTEANLSGTDLSRIYIEQANLNGAQLQGSNLTGAELFGVTLNNANLSGAVL 196

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
               L+ + +  A ++GA    A
Sbjct: 197 NSANLSGASVRQAFLQGAQMEGA 219



 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 66/149 (44%), Gaps = 29/149 (19%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK-----AVAY------ 158
           A F  +DLR A     N   AN    ++R ++ +G    GA L +     AV +      
Sbjct: 84  ASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTGANLSRSQLVGAVLFLINLAN 143

Query: 159 ----KANFTGADLSDTLMDRMVLNEA-----NLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
               +AN +G DLS   +++  LN A     NLT A L    L  ++L GA++  A+ S 
Sbjct: 144 ADLTEANLSGTDLSRIYIEQANLNGAQLQGSNLTGAELFGVTLNNANLSGAVLNSANLSG 203

Query: 210 AVIDLAQKQALCKYANGTNPITGVSTRKS 238
           A +    +QA  + A     + G S R +
Sbjct: 204 ASV----RQAFLQGA----QMEGASLRNT 224



 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 40/86 (46%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           +N T+A  R SD  G+   GA L  A     N  GA+L+   +    L+ + L  AVL  
Sbjct: 79  SNLTNASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTGANLSRSQLVGAVLFL 138

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
             L  +DL  A + G D S   I+ A
Sbjct: 139 INLANADLTEANLSGTDLSRIYIEQA 164



 Score = 40.0 bits (92), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN ++ D+  +D SGS    A    +    AN TGA+L+   +  + L  ANLT   L  
Sbjct: 64  ANLSNTDLTGADLSGSNLTNASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTG 123

Query: 190 TVLTRSDLGGAI-----IEGADFSDA 210
             L+RS L GA+     +  AD ++A
Sbjct: 124 ANLSRSQLVGAVLFLINLANADLTEA 149


>gi|429106957|ref|ZP_19168826.1| FIG01055523: hypothetical protein [Cronobacter malonaticus 681]
 gi|426293680|emb|CCJ94939.1| FIG01055523: hypothetical protein [Cronobacter malonaticus 681]
          Length = 846

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 38/119 (31%), Positives = 57/119 (47%), Gaps = 17/119 (14%)

Query: 129 RANFTSADMRESD----------FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           RA+FT A +R+S+          F  +K     L +A    ANF  A L  +L  R    
Sbjct: 723 RADFTHATLRQSNLRQTALCCARFELAKLENTDLSEANCRGANFQRASLVGSLFIRTDFR 782

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           E + T+A L+  +L +S LGGA   GA+   A  DL+Q      + NG   ++G  T++
Sbjct: 783 EVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----TFTNGETRMSGAFTKR 834



 Score = 38.1 bits (87), Expect = 4.2,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 44/105 (41%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 165
           S A    ADL        NFR A    A++  +      F GA L  A    ++F+GA  
Sbjct: 551 SKALLECADLSHCQLDGANFRGAMLARAELHHTSLRDCNFEGASLALAQCCHSDFSGARF 610

Query: 166 ---DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
               L +TL+D  V ++A L   +   T  TR     A ++G  F
Sbjct: 611 KDTQLQETLLDDCVFDDATLEGLLFRETWFTRCRFHRATLDGCVF 655



 Score = 37.0 bits (84), Expect = 8.8,   Method: Composition-based stats.
 Identities = 28/112 (25%), Positives = 41/112 (36%), Gaps = 9/112 (8%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
           R E  +     F   DL            +F+  D+R +DFS +    A L       AN
Sbjct: 519 RAERTLAQGGDFSGMDLTGV---------DFSGMDLRGADFSKALLECADLSHCQLDGAN 569

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           F GA L+   +    L + N   A L       SD  GA  +     + ++D
Sbjct: 570 FRGAMLARAELHHTSLRDCNFEGASLALAQCCHSDFSGARFKDTQLQETLLD 621


>gi|193212588|ref|YP_001998541.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193086065|gb|ACF11341.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 430

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + +D+  SDFS +  +GA L++A    +   GADLS    +R    EA+ + A +  
Sbjct: 81  ADLSQSDLGGSDFSDADLHGAMLDEAYLGGSRMAGADLSGASFERASAAEADFSRAKMPS 140

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           +VL RS+L GA   GAD   A
Sbjct: 141 SVLRRSELTGARFAGADLRGA 161


>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
          Length = 831

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A F  A + E+ FSGS+  GA      A KANF  A  +D +    +L  ANL  A  +R
Sbjct: 541 AKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAADAVFRGAILANANLRAATFLR 600

Query: 190 TVLTRSDLGGA-----IIEGADFSDAVI 212
           T     DL GA      + GADF+ A +
Sbjct: 601 TNFQNVDLTGADFAFSDLRGADFTGATL 628



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 5/97 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + + + A++ +S F G  F GA L  A   K +FT A+L+        L   N TNA L 
Sbjct: 233 KTDLSGAELEQSHFGGCDFTGADLSHAKLQKTDFTAANLAGATCVDADLRGTNFTNADLR 292

Query: 189 R-----TVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
           +       L  +DL GA + GADF+ A +  A+   L
Sbjct: 293 KANFRGANLAGADLTGANVAGADFTGANLTGAKVDGL 329



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 38/62 (61%), Gaps = 1/62 (1%)

Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L  A  V  + R  NFT+AD+R+++F G+   GA L  A    A+FTGA+L+   
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTGANLTGAK 325

Query: 172 MD 173
           +D
Sbjct: 326 VD 327


>gi|254486622|ref|ZP_05099827.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
 gi|214043491|gb|EEB84129.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
          Length = 200

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 53/112 (47%), Gaps = 22/112 (19%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    ADL  AV          T A++  S+ SG+   GAYLE A    A  TGADL+  
Sbjct: 98  ADLSGADLTGAV---------LTQANLEMSNLSGATLTGAYLELANLAGARVTGADLT-- 146

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
                   +ANLT+A L   VL  + L GA++ GAD   A +   +   LCK
Sbjct: 147 --------KANLTSANLRGAVLLEAKLVGAVLLGADLDGASL---EGAILCK 187



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 15/112 (13%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  ADLR+ +  K   + + + +D++  D +G+   GA L  A  + A+ + ADLS  
Sbjct: 4   AAFDEADLRQLLDTKVCQKCDLSGSDLKGVDLAGANLAGANLSGAKLWAADLSKADLSGV 63

Query: 171 LMD----------RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            ++             L +ANL+ A      LT ++LGGA + GAD + AV+
Sbjct: 64  NLEAATLTAANLAGANLADANLSGA-----YLTTTNLGGADLSGADLTGAVL 110



 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 20/101 (19%)

Query: 130 ANFTSADMRESDFSG--------------------SKFNGAYLEKAVAYKANFTGADLSD 169
           A   +AD+ ++D SG                    +  +GAYL       A+ +GADL+ 
Sbjct: 48  AKLWAADLSKADLSGVNLEAATLTAANLAGANLADANLSGAYLTTTNLGGADLSGADLTG 107

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            ++ +  L  +NL+ A L    L  ++L GA + GAD + A
Sbjct: 108 AVLTQANLEMSNLSGATLTGAYLELANLAGARVTGADLTKA 148


>gi|428301995|ref|YP_007140301.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238539|gb|AFZ04329.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 342

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 47/92 (51%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFT-----GADLSDTLMDRMVLN 178
            AN    D ++++ SGSKF  A LE A       + AN +     G +LSD  M  + LN
Sbjct: 130 HANLAGTDFQDANLSGSKFVSANLEYAALKNVYLWNANISDACLIGTNLSDAYMHSVKLN 189

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            ANLTNA+L R  L+   L    +  AD SDA
Sbjct: 190 GANLTNAILHRVKLSDGKLRDTNLINADLSDA 221


>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
 gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
          Length = 221

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 16/107 (14%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR  +    + R AN T AD+R +D  G+   GA L +A   +AN   ADLS 
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANLCDADLS- 169

Query: 170 TLMDRMVLNEANLTNAV-----LVRTVLTRSDLGGAIIEGADFSDAV 211
                    +ANL+ A+     L R  L+  DLG A + GA   D +
Sbjct: 170 ---------QANLSGAILSQVDLRRVTLSNVDLGQAELSGATVPDQL 207



 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 52/100 (52%), Gaps = 4/100 (4%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F   DL++ +++ E    N T    R  + + +  + A L++    +AN TGA L  T +
Sbjct: 28  FRGVDLQQ-INLSE---VNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAKLWRTNL 83

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +  L EANL+ A ++R  LTR +L  AI+  AD    ++
Sbjct: 84  KKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTIL 123



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 7/121 (5%)

Query: 94  LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAY 151
           L +Y A  R   G+          +L   +  + N   AN + A ++E + + +   GA 
Sbjct: 18  LERYSAGERDFRGVDLQQINLSEVNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAK 77

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGAD 206
           L +    K +   A+LS   M R  L   NL  A+L R     T+L  +DL GA +  AD
Sbjct: 78  LWRTNLKKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTILQDTDLRGANLTRAD 137

Query: 207 F 207
            
Sbjct: 138 L 138


>gi|318042736|ref|ZP_07974692.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
          Length = 164

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   AD+R S+  G+  +GA L  A+   +  + ADLSD     + L +ANL +AVL++
Sbjct: 69  ADLRGADLRGSNLEGADLSGADLRGAMLQDSWLSNADLSD-----VDLRQANLRDAVLIQ 123

Query: 190 TVLTRSDLGGAIIEGADFS 208
            +     L GA++ GADF+
Sbjct: 124 ALTPGLQLEGAVLIGADFT 142


>gi|300863681|ref|ZP_07108615.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300338313|emb|CBN53761.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 238

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 53/108 (49%), Gaps = 7/108 (6%)

Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           +F  ADLR++   K     NFT A  +E+D S S   G  L +A  Y+A    ADLS   
Sbjct: 36  EFDRADLRQSRLGK----TNFTQASFQETDLSESILWGTDLTEANLYRAVLREADLSGAK 91

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---SDAVIDLAQ 216
           +    L EANL  A L    L R+ L  AI+  AD    SD + DL Q
Sbjct: 92  LTDANLEEANLMKACLSGANLVRAKLLRAILFEADLRSTSDQITDLGQ 139



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 8/81 (9%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYK--------ANFTGADLSDTLMDRMVLNEAN 181
           +N + A + +++  G+K   A+L + +  +        A+  GADLS   +   +L +AN
Sbjct: 150 SNLSGALLYQANLDGAKLCRAHLNETIQQRFLATNLSEASLQGADLSYADLSGAILRKAN 209

Query: 182 LTNAVLVRTVLTRSDLGGAII 202
           L  A + RT+LT +DL GAI+
Sbjct: 210 LRGADMTRTILTNTDLEGAIM 230



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 40/75 (53%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D+  ++  G +F+ A L ++   K NFT A   +T +   +L   +LT A L R VL  +
Sbjct: 26  DLLNAELQGIEFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLTEANLYRAVLREA 85

Query: 196 DLGGAIIEGADFSDA 210
           DL GA +  A+  +A
Sbjct: 86  DLSGAKLTDANLEEA 100


>gi|428224583|ref|YP_007108680.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984484|gb|AFY65628.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 156

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 70/138 (50%), Gaps = 9/138 (6%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F  A L +A   + N + AN +SAD+  +D S +  +GA L +A    A+ T ADL    
Sbjct: 17  FQQAALHQADLEEVNLQQANLSSADLSSADLSHANLSGANLSRANLSNADLTNADLRSAD 76

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLAQKQALCKYANGTN 228
           +  + L  ANL+ A L R  L ++DL  AI+  ADF+ A    +DL+         +GTN
Sbjct: 77  LSEVNLIGANLSGAKLGRANLFQADLRSAILTDADFTGANLEDVDLSGAD-----LSGTN 131

Query: 229 PITGVSTRKSLGCGNSRR 246
             T   ++ +   G SRR
Sbjct: 132 LRTAELSKAASSHGVSRR 149


>gi|427724799|ref|YP_007072076.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427356519|gb|AFY39242.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 276

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 64/138 (46%), Gaps = 17/138 (12%)

Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDR 174
           AV  K N   A   +A++R +D  G+   GAYL       AN     F+GA+L  + +  
Sbjct: 135 AVGPKANLSGAYLNTANLRGADLQGANLRGAYLSGTDFTGANLTGVAFSGANLKRSFLTG 194

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIE------GADFSDAV-IDLAQKQALC----KY 223
             L EA L N  L    L  +DL GA++E      GADFSD   +  +++  LC    K 
Sbjct: 195 ACLREARLINVELEMADLRGADLTGAMLEQIESLAGADFSDVRGLSDSERSYLCSRSPKE 254

Query: 224 ANGTNPITGVSTRKSLGC 241
               N  T  +TR SL C
Sbjct: 255 LGTWNSFTRKNTRASLNC 272



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 12/98 (12%)

Query: 123 HVKENF-RAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 180
            VKE   R N   +A++ + D +G   + A L+ A+    NFTGA L+   +    L  A
Sbjct: 17  EVKEILERGNSLENANLEDLDLAGYDLSDANLQGAILIGVNFTGATLAGAQLQNADLRRA 76

Query: 181 NLTN----------AVLVRTVLTRSDLGGAIIEGADFS 208
           NLTN          A L RT+L   DL GA+++GA+ +
Sbjct: 77  NLTNASLKGATLSEAYLQRTILNDCDLAGAVLDGANLT 114


>gi|111023196|ref|YP_706168.1| hypothetical protein RHA1_ro06233 [Rhodococcus jostii RHA1]
 gi|110822726|gb|ABG98010.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
          Length = 201

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 16/131 (12%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
           +E R E  I +   F  ADL ++ HV   FR+ +FT   +  S+F      GS+F+   L
Sbjct: 38  SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97

Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
              V  + +FT     GADL           EANL    L R VL  +DL     GGA  
Sbjct: 98  RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157

Query: 203 EGADFSDAVID 213
           +GAD   A +D
Sbjct: 158 DGADLRGAHVD 168


>gi|425469207|ref|ZP_18848164.1| Tetratricopeptide repeat protein [Microcystis aeruginosa PCC 9701]
 gi|389882794|emb|CCI36776.1| Tetratricopeptide repeat protein [Microcystis aeruginosa PCC 9701]
          Length = 262

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 55/101 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L++ +  ++  + + + + + +S+  G+K NGA L  A   +AN +GADLS   +    L
Sbjct: 29  LQQLLSTRKCPQCDLSGSGLVQSNLVGAKLNGANLVGANLSQANLSGADLSGANLTGASL 88

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 89  FGANLTGANLTGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+L+  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASLFGANLTGANLTGAILTGADLRGAYLNNANLDN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|22297676|ref|NP_680923.1| hypothetical protein tlr0132 [Thermosynechococcus elongatus BP-1]
 gi|22293853|dbj|BAC07685.1| tlr0132 [Thermosynechococcus elongatus BP-1]
          Length = 274

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 10/77 (12%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N   A++ E DF G   +           AN + ADLSD  + R++L+ ANL +A L R 
Sbjct: 161 NLQGANLSEKDFEGHNLS----------HANLSHADLSDAFLHRVILHRANLRHANLFRA 210

Query: 191 VLTRSDLGGAIIEGADF 207
            L ++DL  A ++GA+ 
Sbjct: 211 NLLQADLSYADLQGANL 227


>gi|443321008|ref|ZP_21050077.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
 gi|442789287|gb|ELR98951.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
          Length = 333

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV--- 186
           A+  S+ M  S  S SK   A L  AV  KAN   ADLS   ++R +L EANL  A+   
Sbjct: 116 ASLISSSMIGSCLSKSKLKLANLTSAVLAKANLQYADLSFAGLNRAILTEANLRGAILKQ 175

Query: 187 --LVRTVLTRSDLGGAIIEGADFSDA 210
             L+R+ L R DL GA ++G + S A
Sbjct: 176 ATLIRSYLNRVDLSGANLQGCNLSLA 201



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 6/102 (5%)

Query: 107 IGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           I + A    A L++A  ++    R + + A+++  + S +   GA L  A    AN  GA
Sbjct: 162 ILTEANLRGAILKQATLIRSYLNRVDLSGANLQGCNLSLADLRGANLTGANLQGANLEGA 221

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +LSD     + L+ ANLT A LV T L R++L GA +  A+ 
Sbjct: 222 NLSD-----VNLSGANLTKANLVGTQLVRANLTGAKLSYANL 258


>gi|386721242|ref|YP_006187567.1| hypothetical protein B2K_03510 [Paenibacillus mucilaginosus K02]
 gi|384088366|gb|AFH59802.1| hypothetical protein B2K_03510 [Paenibacillus mucilaginosus K02]
          Length = 219

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  FT + +  SDFSG+   G+  + +   +ANF GA+L+D  +  + L  A+    +LV
Sbjct: 31  KGQFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSALDLANASFHKTILV 90

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
           RT  ++S L GA   G   +D  + +
Sbjct: 91  RTNFSKSGLDGAQFTGVRLTDVTLTM 116


>gi|158341491|ref|YP_001522656.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
           MBIC11017]
 gi|158311732|gb|ABW33342.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
           MBIC11017]
          Length = 1037

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 5/108 (4%)

Query: 115 SADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADLR A+ ++ N F  N ++ ++  +D S +  + A L  A   +AN +GADL +T + 
Sbjct: 884 SADLRNAILIRANLFSTNLSNVNLYSADLSSTDMSSANLSNADLIRANLSGADLHNTDLF 943

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
              L+ ANL+NA L   +L  S+L     E  + + A ++ A+   +C
Sbjct: 944 YANLSNANLSNANLSNAILLSSNLR----ETKNLTQAQLEGAEHPLIC 987



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 11/109 (10%)

Query: 111 AQFGSADLRKAVHVKENFRA---NFTS--------ADMRESDFSGSKFNGAYLEKAVAYK 159
           A+   ADLR A+ ++ N  A   NFT         AD+R +D + + FN A L       
Sbjct: 805 AKLRHADLRSAILIRANLFAADLNFTDFSDADLRYADLRRTDLNFTDFNHANLNFTKLGN 864

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           AN  G +LSD  +    L  A+L NA+L+R  L  ++L    +  AD S
Sbjct: 865 ANLNGTNLSDANLIGTNLYSADLRNAILIRANLFSTNLSNVNLYSADLS 913


>gi|428217414|ref|YP_007101879.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427989196|gb|AFY69451.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 225

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 5/89 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + ++A +  SD SG+  + A L  A+    N +GA+L D  +    L +ANLT A LV  
Sbjct: 108 DLSAATLNRSDLSGANLSEANLSDALMDSVNLSGANLDDANLSFAALTDANLTAASLV-- 165

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
               +DL GA ++GAD +DA  + A  +A
Sbjct: 166 ---EADLNGAFLKGADLTDANFEGANLEA 191



 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 4/102 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           SAA    +DL  A ++ E   AN + A M   + SG+  + A L  A    AN T A L 
Sbjct: 110 SAATLNRSDLSGA-NLSE---ANLSDALMDSVNLSGANLDDANLSFAALTDANLTAASLV 165

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +  ++   L  A+LT+A      L  ++L  A IEGA+   A
Sbjct: 166 EADLNGAFLKGADLTDANFEGANLEAANLSTATIEGANLEQA 207


>gi|300863629|ref|ZP_07108569.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
 gi|300338371|emb|CBN53713.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
          Length = 386

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 74/146 (50%), Gaps = 10/146 (6%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
           L  +VV S S +   L   N  E + R  + IG  A    ADL KA H+    RAN + A
Sbjct: 25  LVLSVVDSHSGDTPTLVLANINEQQNR-PYLIG--ANLSEADLSKA-HLS---RANLSKA 77

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D+  ++  G+   GA L  A    AN TGA+L+   ++   L+ ANL+ A L  T ++ +
Sbjct: 78  DLSGANLCGANLVGASLSGANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDMSAA 137

Query: 196 DLGGAIIEGADFSDAVI---DLAQKQ 218
           +  GAI+  A+   A +   +L+Q Q
Sbjct: 138 NFSGAILNDANLGKAYLIKSNLSQAQ 163



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 47/82 (57%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN    DM  ++FSG+  N A L KA   K+N + A L+D  + +  L +A+LT+A L 
Sbjct: 126 KANLKGTDMSAANFSGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDADLTDANLS 185

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
              L R++L GA +  AD + A
Sbjct: 186 GAELARANLAGANLTRADLTKA 207



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 51/90 (56%), Gaps = 1/90 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQ   ADL +A     +   AN + A++  ++ +G+    A L KA   KAN   ADL
Sbjct: 160 SQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANLTRADLTKANLLKANLRRADL 219

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           +++ ++   L EA+L+ A+L R  L+++DL
Sbjct: 220 TESYLNWASLGEADLSEAILTRANLSKADL 249



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T A++  ++ +G+  N A L  A   KAN  G D+S       +LN+ANL  A L++
Sbjct: 97  ANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDMSAANFSGAILNDANLGKAYLIK 156

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           + L+++ L  A +  A+  DA
Sbjct: 157 SNLSQAQLNDADLTQANLKDA 177



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 55/110 (50%), Gaps = 16/110 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    A+L KA  +K N        A+ T A+++++D + +  +GA L +A    AN 
Sbjct: 140 SGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANL 199

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           T ADL+          +ANL  A L R  LT S L  A +  AD S+A++
Sbjct: 200 TRADLT----------KANLLKANLRRADLTESYLNWASLGEADLSEAIL 239



 Score = 44.3 bits (103), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A  G  DL K +    N        AN + A + E++ S +   GA L  A   KANF
Sbjct: 270 SGADLGGLDLSKKLLTGINLASAYLSEANLSGAYLIEANLSDANLCGADLSDACLMKANF 329

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            GA      M  + L+ ANLT A L +  L  ++L GAI+  AD   A
Sbjct: 330 IGAR-----MGNINLSNANLTGAKLCKADLMGANLRGAILTEADMRGA 372



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L +A   K N  +AN   AD+ ES  + +    A L +A+  +AN + ADLS 
Sbjct: 192 ANLAGANLTRADLTKANLLKANLRRADLTESYLNWASLGEADLSEAILTRANLSKADLSK 251

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           T + ++VL+  +L+   L    L   DL   ++ G + + A +
Sbjct: 252 TYLRKIVLHGCHLSGINLSGADLGGLDLSKKLLTGINLASAYL 294



 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 4/102 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A  G ADL +A+      RAN + AD+ ++       +G +L       A+  G DLS  
Sbjct: 227 ASLGEADLSEAILT----RANLSKADLSKTYLRKIVLHGCHLSGINLSGADLGGLDLSKK 282

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           L+  + L  A L+ A L    L  ++L  A + GAD SDA +
Sbjct: 283 LLTGINLASAYLSEANLSGAYLIEANLSDANLCGADLSDACL 324


>gi|254413321|ref|ZP_05027092.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196179941|gb|EDX74934.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 636

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 1/97 (1%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           AA   +A+LR+ +  K N R A    A + E++   +    A L +A  Y+A  T ADLS
Sbjct: 210 AANLTTANLREVLLEKANLRDAILVGATLTEANLRQACLRRANLTQAELYRAILTDADLS 269

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           +   DR+ L+ ANL  A L+R  L  ++L   +++  
Sbjct: 270 EVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNV 306



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------MVLN 178
            +N T A + ++    ++   A L +A    AN T A+L + L+++            L 
Sbjct: 180 HSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGATLT 239

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           EANL  A L R  LT+++L  AI+  AD S+   D
Sbjct: 240 EANLRQACLRRANLTQAELYRAILTDADLSEVTGD 274



 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 68/175 (38%), Gaps = 43/175 (24%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYE-----AETRGEFGIGSAAQFGSADLRKAVHVK 125
            +ST L AA +   S   + L   N  E     A  R    +G  A    A+LR+A   +
Sbjct: 193 LISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVG--ATLTEANLRQACLRR 250

Query: 126 EN------FRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKAN------------- 161
            N      +RA  T AD+ E      + S +   GAYL +A    AN             
Sbjct: 251 ANLTQAELYRAILTDADLSEVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNVYCLQ 310

Query: 162 ------------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
                          ADLS   ++  +L EANLT+A L+ + L R  L  A + G
Sbjct: 311 TNLTAANLQGADLRQADLSGAYLNETILTEANLTDAYLIGSYLIRPKLEQAQLTG 365



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 47/90 (52%), Gaps = 10/90 (11%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEK----------AVAYKANFTGADLSDTLMDRMVLNEA 180
           N + A+++ +  + S   GA L+K          A  Y+A+   A+L+   +  ++L +A
Sbjct: 167 NLSGANLQAAQLNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKA 226

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           NL +A+LV   LT ++L  A +  A+ + A
Sbjct: 227 NLRDAILVGATLTEANLRQACLRRANLTQA 256



 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 50/120 (41%), Gaps = 16/120 (13%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFN----------GAYL 152
           S A    ADL  A+    N       R N +  D + +    +  +          GA L
Sbjct: 114 SGACLHQADLHNAILKHSNLNQAILTRVNLSKVDGQSASLCQANLSWVEAPYCNLSGANL 173

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           + A    +N TGA L  T +    L  ANL  A L+   LT ++L   ++E A+  DA++
Sbjct: 174 QAAQLNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAIL 233


>gi|448449600|ref|ZP_21591825.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
           13561]
 gi|445813229|gb|EMA63210.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
           13561]
          Length = 822

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 1/98 (1%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADL  AV    +   A+     + E+D SG+   GA L      +A+ T ADLS+
Sbjct: 178 ASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKEADLTNADLSN 237

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             + R+ L +A+L  AVL    +T +DL GA++  AD 
Sbjct: 238 ADLYRVDLTDADLEGAVLTDADITDADLEGAVLTDADL 275



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 6/105 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A     DL   V    N R      ++ T A +R SD S +   GA+LE      A+   
Sbjct: 378 ADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTDASLRE 437

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           ADL+D  ++ + L  ANL  A L    L  +DL  A +  AD +D
Sbjct: 438 ADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTD 482



 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 52/114 (45%), Gaps = 19/114 (16%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           E  + + A    ADL  AV          T AD+  +D +G+    A L  A    A+ T
Sbjct: 251 EGAVLTDADITDADLEGAV---------LTDADLEGTDLTGANLKVADLTGANLKVADLT 301

Query: 164 GADLSDTLM-----DRMVLNEA-----NLTNAVLVRTVLTRSDLGGAIIEGADF 207
           GADL D ++     +R  L EA     +LT A L    LT  DLGGA++  AD 
Sbjct: 302 GADLEDAVLTDADLERTDLIEASLLSADLTGASLKEADLTEVDLGGAVLTDADL 355



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 69/147 (46%), Gaps = 11/147 (7%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNG 149
           L + N  EA+  G    G+      A LR+A     N    + T+A +RE+D +G+   G
Sbjct: 450 LTNANLREADLTGAHLKGT--DLTDASLREADLTDVNLEEIDLTNASLREADLTGAHLEG 507

Query: 150 -----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
                A+LE      AN   ADL+   +++  L  ANLT+A L    L+ +DL    + G
Sbjct: 508 VDLTGAHLEGIDLTSANLNQADLTSANLNQADLRGANLTDASLREANLSGADLTDTELSG 567

Query: 205 ADFSDAVI---DLAQKQALCKYANGTN 228
           AD S   +   DL + ++L    +G N
Sbjct: 568 ADLSRTDLEKSDLHKSKSLPTNLSGAN 594



 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 12/135 (8%)

Query: 85  SSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF 142
           S +I   ADL+K +       G   + A  G A+L  A  V+ +   AN   AD+ ++D 
Sbjct: 16  SEDIEPSADLSKVDLSDADLSGADLTNAYLGGANLSNATLVEADLTGANLRDADLTDADL 75

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDR-----MVLNEANLTNAVLVRTVLTRSDL 197
             +    AYLE      A    ADL+D  + R      +L EA+LT+A L RT     D 
Sbjct: 76  YRTDLTDAYLEGVNLSGATPVEADLTDASLKRANLSSTILMEADLTDADLYRT-----DF 130

Query: 198 GGAIIEGADFSDAVI 212
             A +EGA+ ++A +
Sbjct: 131 TDAYLEGANLTNAYL 145



 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 51/96 (53%), Gaps = 1/96 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           A+L +AV    +   A+   A + ++D SG+      L +A    A+ TGA+L    +  
Sbjct: 168 AELPRAVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKE 227

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L  A+L+NA L R  LT +DL GA++  AD +DA
Sbjct: 228 ADLTNADLSNADLYRVDLTDADLEGAVLTDADITDA 263



 Score = 43.5 bits (101), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    A+LR    +KE   A+ T+AD+  +D        A LE AV   A+ T ADL 
Sbjct: 211 SGADLTGANLRHG-RLKE---ADLTNADLSNADLYRVDLTDADLEGAVLTDADITDADLE 266

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++    L   +LT A L    LT ++L  A + GAD  DAV+
Sbjct: 267 GAVLTDADLEGTDLTGANLKVADLTGANLKVADLTGADLEDAVL 310



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 6/96 (6%)

Query: 116 ADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           A LR+A     N    + T+A++RE+D +G+   G  L  A   +A+ T  +L +  +  
Sbjct: 433 ASLREADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTDVNLEEIDLTN 492

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L EA+LT A L        DL GA +EG D + A
Sbjct: 493 ASLREADLTGAHLEGV-----DLTGAHLEGIDLTSA 523



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 129 RANFTS-----ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           RAN +S     AD+ ++D   + F  AYLE A    A  +G+DL++  ++   L +A+  
Sbjct: 107 RANLSSTILMEADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEGANLTDASPI 166

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSD 209
            A L R VLT + L GA + GA  +D
Sbjct: 167 GAELPRAVLTDASLLGADLPGAVLTD 192



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 55/126 (43%), Gaps = 34/126 (26%)

Query: 128 FRANFTSADMRESDF---------------SGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
             A+ T AD+  +DF               SGS    AYLE A    A+  GA+L     
Sbjct: 116 MEADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEGANLTDASPIGAELP---- 171

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGA------IIE----GADFSDAVIDLAQKQALCK 222
            R VL +A+L  A L   VLT +DL GA      +IE    GAD + A +    +    K
Sbjct: 172 -RAVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANL----RHGRLK 226

Query: 223 YANGTN 228
            A+ TN
Sbjct: 227 EADLTN 232



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 60/128 (46%), Gaps = 26/128 (20%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA--------VAY--K 159
           A    ADL +   ++ +   A+ T A ++E+D +     GA L  A         AY   
Sbjct: 308 AVLTDADLERTDLIEASLLSADLTGASLKEADLTEVDLGGAVLTDADLEGTALTEAYLPS 367

Query: 160 ANFTG-----ADLSDTLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIEG 204
            + TG     ADL++  ++  VL +ANL          T+A L  + L+ +DL GA +EG
Sbjct: 368 PDLTGASLKEADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEG 427

Query: 205 ADFSDAVI 212
            D +DA +
Sbjct: 428 IDLTDASL 435


>gi|427710065|ref|YP_007052442.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
 gi|427362570|gb|AFY45292.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
          Length = 575

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 48/81 (59%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN ++AD+  ++      + A L +A A++A+   A+LSD  +    L+ A+L NA L R
Sbjct: 78  ANLSNADLSGANLRNINLSKAKLSRANAFRADLVSANLSDADLSSTNLSGADLRNANLTR 137

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             LT +DL GA + GA+ +DA
Sbjct: 138 ADLTNADLSGANLNGANLTDA 158



 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 6/103 (5%)

Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   + +L KA   + N FRA+  SA++ ++D S +  +GA L  A   +A+ T ADL
Sbjct: 86  SGANLRNINLSKAKLSRANAFRADLVSANLSDADLSSTNLSGADLRNANLTRADLTNADL 145

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S        LN ANLT+A +        +L G  + G D S+A
Sbjct: 146 SGA-----NLNGANLTDANMRGVRFDNVNLQGVNLNGVDLSNA 183



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           AN + A+++ +D S ++ N A L+ A  Y AN  GADL  +      LN ANL NA L
Sbjct: 218 ANLSYANLQNADLSNARLNNADLQNANLYNANLQGADLIGS-----KLNSANLDNADL 270



 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + ++AD+R  +F G   NG  L +      N  G +L +  +    L  A+L+NA L   
Sbjct: 179 DLSNADLRNFNFRGVSLNGVNLSRVNLNGYNLRGVELKNANLSYANLQNADLSNARLNNA 238

Query: 191 VLTRSDLGGAIIEGADF 207
            L  ++L  A ++GAD 
Sbjct: 239 DLQNANLYNANLQGADL 255


>gi|298246992|ref|ZP_06970797.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297549651|gb|EFH83517.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 381

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 57/121 (47%), Gaps = 13/121 (10%)

Query: 103 GEFGIGSAAQFGSA---DLRKAVHVKENFRANFTSADMRES-----DFSGSKFNGAYLEK 154
           G   +GS  + GSA   DL+   H+     A    A MR S     D S +   GA L K
Sbjct: 236 GHDALGSQGERGSARHPDLQ--AHLSH---AQLAGAKMRGSYLSGVDLSQANLRGADLSK 290

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A  Y AN  GADLS   +    L EAN+  A L    L+++ L GA +  AD S A + L
Sbjct: 291 AYFYGANLQGADLSGANLTETTLTEANIEGANLTEANLSKATLIGANLRQADLSGARLTL 350

Query: 215 A 215
           A
Sbjct: 351 A 351



 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 20/103 (19%)

Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           GS A  G ADL+K V  +     N    D+R  +F                +AN  GADL
Sbjct: 146 GSKALVG-ADLQKIVLPQ----INLAQMDLRRVNFR---------------EANLQGADL 185

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   + R  L+ ANL++A L    L  +DL G  + GAD SD+
Sbjct: 186 SGVNLYRADLSGANLSHATLKGADLRGADLRGTDLTGADLSDS 228


>gi|254409513|ref|ZP_05023294.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183510|gb|EDX78493.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 209

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 60/118 (50%), Gaps = 16/118 (13%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 159
           A   +A+L +A  ++ N  RAN T A +RE+              +  +GA L +A+ + 
Sbjct: 80  ANLTAAELVRATLIECNLKRANLTEAHLREASLMFANLAQACLYQADLHGAMLHQAILHW 139

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADFSDAVI 212
           A+   ADL   ++    +  A+L+ A L+R      +L  +DL GAI+ GA+F  A++
Sbjct: 140 ASLKNADLIGAILQGADMRGADLSQACLIRADVSKAILMVADLRGAIVMGANFKAAIL 197



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 49/95 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            A    A++R SD SG+  +GA L+ +   +AN + A+LS   + +  LN+ANLT A LV
Sbjct: 29  EAILNGANLRRSDLSGANLSGASLKGSNLSEANLSQANLSVANLSKAELNDANLTAAELV 88

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
           R  L   +L  A +  A   +A +  A     C Y
Sbjct: 89  RATLIECNLKRANLTEAHLREASLMFANLAQACLY 123


>gi|443310759|ref|ZP_21040400.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
 gi|442779202|gb|ELR89454.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
          Length = 330

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 53/117 (45%), Gaps = 15/117 (12%)

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           E    + +G    F  A+LR A      F A+  S  +  +D  G+    AYL +A  YK
Sbjct: 5   ELLERYAVGEI-DFSGANLRGA----NLFAADLISIILIHADLHGANLTFAYLNRAQLYK 59

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           AN  GA L            ANLT A L    L  +DL GAI++GAD   A + LA 
Sbjct: 60  ANLIGAKLC----------GANLTQADLRAAALHDADLHGAILQGADLRSADMSLAN 106



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 62/132 (46%), Gaps = 18/132 (13%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA 155
           ++A+ RG    G  A    ADLR A   + N R A+ + AD+  +D S +  N A L+ A
Sbjct: 154 FKADVRGANLAG--ANLSRADLRYANFNEVNLRGADLSCADLSNTDLSYALLNDANLDGA 211

Query: 156 VAYKANFTGA----------DLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGA 200
           +   AN + A          DL+D  +         LN ANLT A L +  L R DL  A
Sbjct: 212 ILTGANLSNARCERASMIDTDLTDVNLSGAAIPDGKLNRANLTGANLSKASLNRIDLSRA 271

Query: 201 IIEGADFSDAVI 212
            +  AD SDA +
Sbjct: 272 NLSYADLSDAYL 283


>gi|428215892|ref|YP_007089036.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004273|gb|AFY85116.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 449

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 4/102 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + A+   ADLR A    E F A    A++ E+D   +KF+ A L KA     N +G++LS
Sbjct: 173 TGAKLEKADLRNA----ELFSAKLIEANLVEADLRNAKFSEANLSKAKLDGTNLSGSNLS 228

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            T +    L EANLT A L    L +++L G  +  A+ S A
Sbjct: 229 RTNLSEASLTEANLTEANLSEATLRKANLSGVKLCDANLSRA 270



 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 6/108 (5%)

Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 164
           A+   ADL +A   + + FRA  T AD+  +     D SG+    A L +A   +AN + 
Sbjct: 80  AKLSYADLSRADLFRADLFRAELTDADLHRANLTRADLSGANLTRANLNEATLSQANLSD 139

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           ++LS   ++   LN A L  A L    L  SDL GA +E AD  +A +
Sbjct: 140 SNLSFASLNNTKLNGAKLNGANLSEARLFDSDLTGAKLEKADLRNAEL 187



 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 10/95 (10%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----------ADLSDTLMDRMVL 177
           + A+ + AD+ E+  SG K +GA LE A   +A+ +           ADLS   + R  L
Sbjct: 38  YDADLSCADLFEAKLSGIKLSGANLENAHLSRADLSNGKLFGAKLSYADLSRADLFRADL 97

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             A LT+A L R  LTR+DL GA +  A+ ++A +
Sbjct: 98  FRAELTDADLHRANLTRADLSGANLTRANLNEATL 132



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 5/90 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANL 182
           F A  + AD+  +D   +    A L  A  ++AN T ADLS   + R  LNE     ANL
Sbjct: 78  FGAKLSYADLSRADLFRADLFRAELTDADLHRANLTRADLSGANLTRANLNEATLSQANL 137

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +++ L    L  + L GA + GA+ S+A +
Sbjct: 138 SDSNLSFASLNNTKLNGAKLNGANLSEARL 167



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 43/86 (50%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN  +A +  +D S  K  GA L  A   +A+   ADL    +    L+ ANLT A L  
Sbjct: 60  ANLENAHLSRADLSNGKLFGAKLSYADLSRADLFRADLFRAELTDADLHRANLTRADLSG 119

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
             LTR++L  A +  A+ SD+ +  A
Sbjct: 120 ANLTRANLNEATLSQANLSDSNLSFA 145



 Score = 37.0 bits (84), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 10/123 (8%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
           ADL++    TR +    S A    A+L +A   + N   +N + A +  +  +G+K NGA
Sbjct: 105 ADLHRANL-TRADL---SGANLTRANLNEATLSQANLSDSNLSFASLNNTKLNGAKLNGA 160

Query: 151 YLEKAVAYKANFTGA-----DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            L +A  + ++ TGA     DL +  +    L EANL  A L     + ++L  A ++G 
Sbjct: 161 NLSEARLFDSDLTGAKLEKADLRNAELFSAKLIEANLVEADLRNAKFSEANLSKAKLDGT 220

Query: 206 DFS 208
           + S
Sbjct: 221 NLS 223


>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
 gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
          Length = 229

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 10/116 (8%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR A +H     RAN T A +     SG+K + A L +A+A KAN  G DLS 
Sbjct: 120 ANLDRADLRDADLHGTILHRANLTGAIL-----SGAKLDKASLIQAIAQKANLQGVDLSG 174

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
             +  M L+  + T   L   + T ++L GAI  GA    A +     QA+ + AN
Sbjct: 175 ADLTDMNLSRVDFTAVNLKGAIFTGTNLTGAIFSGAKLDKASL----IQAIAQKAN 226



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 31/125 (24%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDF-----SGSKFNGAYLEKAVAYK 159
           A F  A+L+ A     N R      ANFT AD++ +D       G+ F GA LE AV   
Sbjct: 60  ANFTEANLKGA-----NLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQH 114

Query: 160 -----ANFTGADLSDTLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEG 204
                AN   ADL D  +   +L+ ANLT A+          L++ +  +++L G  + G
Sbjct: 115 TDLTNANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASLIQAIAQKANLQGVDLSG 174

Query: 205 ADFSD 209
           AD +D
Sbjct: 175 ADLTD 179



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 113 FGSADLRK----AVHVKE-NF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           F  ADL +       +KE NF  AN   A++R +D  G+ F  A L+ A    A+  GA+
Sbjct: 42  FAGADLEQVRLAGASLKEANFTEANLKGANLRGADCDGANFTRADLKSADLRWADCDGAN 101

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +   ++  VL   +LTNA L R  L  +DL G I+  A+ + A++
Sbjct: 102 FTGANLESAVLQHTDLTNANLDRADLRDADLHGTILHRANLTGAIL 147


>gi|189500184|ref|YP_001959654.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           BS1]
 gi|189495625|gb|ACE04173.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
          Length = 412

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 55/110 (50%), Gaps = 8/110 (7%)

Query: 118 LRKAVHVKENFRANFTSA--DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           +RK+V    + R N+  A  D+  +D  G    GA L  A    AN  GADLSDT +   
Sbjct: 33  IRKSVTSWNSMRENYPEAAIDLSGADLKGRNLKGADLHNANLQGANLHGADLSDTDLRGA 92

Query: 176 VLNEANLTNAVL----VRTVLTR-SDLGGAIIEGADFSDAVIDLA-QKQA 219
             + A+L  A+L    +R    R +DL  A  EGAD   AV+D A  KQA
Sbjct: 93  SFDHASLKGALLFDADLREATVREADLEDAAFEGADLRGAVLDGAVMKQA 142



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 20/103 (19%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN---------------FTGADLSDTLMDR 174
           AN   AD+ ++D  G+ F+ A L+ A+ + A+               F GADL   ++D 
Sbjct: 77  ANLHGADLSDTDLRGASFDHASLKGALLFDADLREATVREADLEDAAFEGADLRGAVLDG 136

Query: 175 MV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            V     L E+NL NA L  T L  ++L  A + G D S A +
Sbjct: 137 AVMKQADLGESNLRNASLRGTDLRAANLKMADLAGCDLSGAYL 179



 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 53/103 (51%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  A L+ A+    + R A    AD+ ++ F G+   GA L+ AV  +A+   ++L +
Sbjct: 92  ASFDHASLKGALLFDADLREATVREADLEDAAFEGADLRGAVLDGAVMKQADLGESNLRN 151

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             +    L  ANL  A L    L+ + L  A+++GA+  ++V+
Sbjct: 152 ASLRGTDLRAANLKMADLAGCDLSGAYLWRAVLDGANLENSVV 194



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 39/85 (45%), Gaps = 4/85 (4%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A F  ADLR AV       A    AD+ ES+   +   G  L  A    A+  G DLS  
Sbjct: 122 AAFEGADLRGAVLDG----AVMKQADLGESNLRNASLRGTDLRAANLKMADLAGCDLSGA 177

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRS 195
            + R VL+ ANL N+V+    +  +
Sbjct: 178 YLWRAVLDGANLENSVVTSVTIVET 202


>gi|443329141|ref|ZP_21057730.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442791290|gb|ELS00788.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 174

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 51/88 (57%), Gaps = 5/88 (5%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANL 182
            RAN + A +R S+ SG+ F  A L+KA     +    NF+GA+L +  + +  L+EA L
Sbjct: 38  IRANLSQAILRNSNLSGAFFVLADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEAVL 97

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           T   L    LT ++L GAI+ GA+ S+A
Sbjct: 98  TGTQLQGANLTEANLQGAILAGANLSEA 125



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 54/113 (47%), Gaps = 11/113 (9%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADL  A+ +  NF       AN T + + E+  +G++  GA L +A    A   G
Sbjct: 60  ADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEAVLTGTQLQGANLTEANLQGAILAG 119

Query: 165 ADLSDTLMDRMVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A+LS+  +    L  AN     L NA L    +T ++L GA +EGA   + +I
Sbjct: 120 ANLSEANLRGGDLRGANLYGVDLRNADLTDAKITHANLRGANLEGAIMPEQLI 172


>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 293

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 21/124 (16%)

Query: 110 AAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
           +A    A+L  A+ ++ N +           ANFT AD+ E D S ++ NG  L +A+  
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222

Query: 159 KA----------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            A             GA+LS   + R  L  +NLT A+L+ TVL  +++    + GA  +
Sbjct: 223 GAKLRGVSICWTTLRGANLSKANLYRAKLCWSNLTEAILLETVLLDANMDQVNLRGATLT 282

Query: 209 DAVI 212
            A++
Sbjct: 283 GAIL 286



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 64/127 (50%), Gaps = 24/127 (18%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTG 164
           ADL +A  +  N  +AN + A++  +    S  NGA           L +A+  +AN   
Sbjct: 84  ADLVEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAILSEAIMREANLNQ 143

Query: 165 ADLSDTLMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---V 211
           A L D  + R  L+          +ANLTNA+L+ T L +++L  A++ GA+F+ A    
Sbjct: 144 AKLIDASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTE 203

Query: 212 IDLAQKQ 218
           +DL+Q +
Sbjct: 204 VDLSQAR 210



 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 59/118 (50%), Gaps = 13/118 (11%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADL 167
           F   +LR A  +     A+ T A++R SD S S   GA L+     +AN T     GADL
Sbjct: 31  FRRVNLRNASLIG----ADLTHANLRGSDLSQSNLTGASLKLVNFREANLTQITLRGADL 86

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
            +  +    L +ANL+ A L+   L  S L GA +  A+ S+A++     +A+ + AN
Sbjct: 87  VEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAIL----SEAIMREAN 140



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 6/101 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A     +L  A  +  N  +AN T+A + E++   +  N     KA+ + ANFT ADL++
Sbjct: 149 ASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLN-----KALLHGANFTQADLTE 203

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + +  LN  NLT A+LV   L    +    + GA+ S A
Sbjct: 204 VDLSQARLNGVNLTRAILVGAKLRGVSICWTTLRGANLSKA 244


>gi|390438685|ref|ZP_10227130.1| Tetratricopeptide repeat protein [Microcystis sp. T1-4]
 gi|389837879|emb|CCI31254.1| Tetratricopeptide repeat protein [Microcystis sp. T1-4]
          Length = 262

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 55/101 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L++ +  ++  + + + + + +S+  G+K NGA L  A   +AN +GADLS   +    L
Sbjct: 29  LQQLLSTRKCPQCDLSGSGLVQSNLVGAKLNGANLVGANLSQANLSGADLSGANLTGASL 88

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 89  FGANLTGANLTGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+L+  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASLFGANLTGANLTGAILTGADLRGAYLNNANLDN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|428201752|ref|YP_007080341.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427979184|gb|AFY76784.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 187

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 46/87 (52%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E ++ +   A++ E+D SG+  NGAYL KA    AN   A L +  + +  L  ANL  A
Sbjct: 83  EMWKIDLGQANLEETDLSGANLNGAYLWKAKLCIANLERAYLKEVNLVQCDLWRANLRGA 142

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVI 212
            L+   LT + L GA +E A + +  I
Sbjct: 143 YLIGANLTGASLKGACLERAKYDEKTI 169



 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 7/95 (7%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R N  +AD++E     +    A LE A  Y+AN  G+ L  T + R  L EANL+ A + 
Sbjct: 26  RINLHAADLKEVCLIDADLEEANLEGANLYRANLKGSCLYRTNLARSNLREANLSGAEMW 85

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA--QKQALC 221
           +      DLG A +E  D S A ++ A   K  LC
Sbjct: 86  KI-----DLGQANLEETDLSGANLNGAYLWKAKLC 115


>gi|76819210|ref|YP_336861.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
           1710b]
 gi|76583683|gb|ABA53157.1| pentapeptide repeat family protein [Burkholderia pseudomallei
           1710b]
          Length = 862

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 34/79 (43%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T  D+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 549 ADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 603

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 604 ARLTAANLSLAHCERTDFS 622


>gi|56751008|ref|YP_171709.1| hypothetical protein syc0999_c [Synechococcus elongatus PCC 6301]
 gi|81299332|ref|YP_399540.1| hypothetical protein Synpcc7942_0521 [Synechococcus elongatus PCC
           7942]
 gi|56685967|dbj|BAD79189.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81168213|gb|ABB56553.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 195

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL  A+ V  + R A    A +RE+D SG+   GA L ++   +A   G++L   +++ 
Sbjct: 49  ADLTGAILVGADLRRAWLRGAILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLND 108

Query: 175 MVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +L EAN     L  A LVR +L R+D   A +  AD S A I
Sbjct: 109 SILAEANCRQACLQQADLVRAILYRTDFTAADLHEADLSHAFI 151



 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 3/110 (2%)

Query: 118 LRKAVHVKENFRANFTSA--DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           LR+   V   +R+   +   D+R++D S        LE+A    A   GADL    +   
Sbjct: 10  LRRGTAVWSRWRSQNPTVIPDLRQADLSFVDLVNVDLERADLTGAILVGADLRRAWLRGA 69

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYA 224
           +L EA+ + A L+   L RSDL  A + G++   A++ D    +A C+ A
Sbjct: 70  ILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLNDSILAEANCRQA 119


>gi|448661888|ref|ZP_21683780.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
 gi|445758247|gb|EMA09568.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
          Length = 480

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 17/128 (13%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A      LR+A     N + AN T A +R++D + +   GA L  A   +A+ T A L +
Sbjct: 168 ANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLRE 227

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRS---------------DLGGAIIEGADFSDA-VID 213
             +    LN ANLT  +L +  LT +               DL GA + GADFS+A +I+
Sbjct: 228 VNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRYADLTGATLPGADFSEANLIN 287

Query: 214 LAQKQALC 221
              ++ L 
Sbjct: 288 TTLREVLL 295



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF  AD+ +++  GS F  A L +A    A    ADL+D  +    L +A+L  A L  
Sbjct: 113 ANFLRADLHDANLKGSDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTD 172

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T L ++DL  A ++GA+ +DA
Sbjct: 173 TSLRQADLTDANLKGANLTDA 193



 Score = 44.3 bits (103), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 5/119 (4%)

Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AA    ADL+ A     + R     AD+ +++  G+    A L +A    AN  GADL  
Sbjct: 157 AAALPDADLKGANLTDTSLR----QADLTDANLKGANLTDASLRQADLTDANLKGADLPG 212

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 227
             + R  L +A L    L    L R++L G I+  AD +D  + +A    A  +YA+ T
Sbjct: 213 ASLLRADLTDAFLREVNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRYADLT 271



 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 56/108 (51%), Gaps = 25/108 (23%)

Query: 130 ANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDT------LMDRMV-- 176
           AN + A ++E+D +G     +   GA L+ AV    NF GADL +       L D ++  
Sbjct: 28  ANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADLVNANIKEAELTDAILRQ 87

Query: 177 -------LNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 212
                  L +ANLT + L+RT L      R+DL  A ++G+DF+DA +
Sbjct: 88  ADLTDAALWDANLTGSNLLRTDLPGANFLRADLHDANLKGSDFTDAAL 135



 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 6/112 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 164
           + F  A LR+A       R A+ T AD+      ++D  G+      L +A    AN  G
Sbjct: 128 SDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTDTSLRQADLTDANLKG 187

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           A+L+D  + +  L +ANL  A L    L R+DL  A +   + +DA ++ A 
Sbjct: 188 ANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLREVNLTDAALNRAN 239



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 39/73 (53%)

Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
           +D + +  +GA+L++A    AN T  DL+   +   VL + N   A LV   +  ++L  
Sbjct: 23  ADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADLVNANIKEAELTD 82

Query: 200 AIIEGADFSDAVI 212
           AI+  AD +DA +
Sbjct: 83  AILRQADLTDAAL 95



 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 6/82 (7%)

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADF 207
           + +V+ K  + GADL+D  +    L EA+LT A L RT LT ++L GA++      GAD 
Sbjct: 11  DDSVSDKDIYPGADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADL 70

Query: 208 SDAVIDLAQ-KQALCKYANGTN 228
            +A I  A+   A+ + A+ T+
Sbjct: 71  VNANIKEAELTDAILRQADLTD 92


>gi|392382587|ref|YP_005031784.1| conserved protein of unknown function; pentapeptide repeats
           [Azospirillum brasilense Sp245]
 gi|356877552|emb|CCC98392.1| conserved protein of unknown function; pentapeptide repeats
           [Azospirillum brasilense Sp245]
          Length = 493

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 61/124 (49%), Gaps = 27/124 (21%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDF-SGSKFNG---------------AY 151
           +A+    ADLR A +H     RA  T A++R +DF +GS  NG               A 
Sbjct: 84  TASTLIGADLRGANLH-----RAILTDANLRGADFRAGSLMNGTDDKPRSDGVTRLTEAK 138

Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
           +E+++   ANFTG DLS        LN+A+LT A +   VL  +D  GA ++G  F    
Sbjct: 139 MERSILAGANFTGCDLSGA-----DLNDADLTGADMTAAVLVGADFWGATLDGVTFDGTT 193

Query: 212 IDLA 215
           ID A
Sbjct: 194 IDEA 197


>gi|332707026|ref|ZP_08427086.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332354291|gb|EGJ33771.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 239

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 51/105 (48%), Gaps = 1/105 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S   F  AD  +A     N   AN   A +  + F  +K   A L+ A     N  GADL
Sbjct: 78  SGVDFSRADFSQANLSDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADL 137

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           SD  +  + L  A+L+ A+L RT L  +DL GA +E AD S A I
Sbjct: 138 SDANLLNIRLANADLSTAILNRTELREADLTGANMEHADLSHASI 182



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 59/118 (50%), Gaps = 5/118 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S +   +A+L+ A  +   F  A  TSAD+  +DF  +   GA L  A         ADL
Sbjct: 93  SDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADLSDANLLNIRLANADL 152

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
           S  +++R  L EA+LT A +    L+ + + GAI+  A+ + A +     +A  +YAN
Sbjct: 153 STAILNRTELREADLTGANMEHADLSHASIYGAILREANLTGANL----YKANLRYAN 206



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 6/111 (5%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+  SADL  A     N +      AN  +  +  +D S +  N   L +A    AN   
Sbjct: 115 AKLTSADLDGADFKDTNLKGADLSDANLLNIRLANADLSTAILNRTELREADLTGANMEH 174

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           ADLS   +   +L EANLT A L +  L  ++L  A+++G +   A +  A
Sbjct: 175 ADLSHASIYGAILREANLTGANLYKANLRYANLQDAVLKGTNLKGADLQFA 225


>gi|86607938|ref|YP_476700.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86556480|gb|ABD01437.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 154

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 63/127 (49%), Gaps = 15/127 (11%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 164
           S AQ   A+L K + +++   A+ + AD+RE+D SG+  +GA L  A   + N  G    
Sbjct: 32  SGAQLSGANL-KGIILRD---ADLSGADLREADLSGADLSGADLRGAKLRRVNLIGAKLV 87

Query: 165 -ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
            ADL    + R  L  A+L+ A L R  L   +DL GAII    F  A+ D        K
Sbjct: 88  KADLRGANLYRAKLLRADLSEAELNRADLRIGADLRGAIITNTHFRGALYD-----EYTK 142

Query: 223 YANGTNP 229
           + +G NP
Sbjct: 143 FPDGFNP 149


>gi|383482351|ref|YP_005391265.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
           85-930]
 gi|378934705|gb|AFC73206.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
           85-930]
          Length = 959

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 11/118 (9%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +  +ADL KA   K N   A+ T+A +  +    +K + A LEKA A      G ++SD 
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ---KQALCKYAN 225
           +   +   EAN  NA++ R  LT++D   A++E AD     ++ A+   K+A+ K AN
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKADFTKALLENADMQ--AVEAAEAIFKEAILKQAN 665



 Score = 41.2 bits (95), Expect = 0.54,   Method: Composition-based stats.
 Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A+  +A L KA    E    N + A  +  +   + F  A +++A   KA+FT A L + 
Sbjct: 589 AKLSNATLEKA----EAEGLNISDAIAKNINAKEANFKNAIMQRADLTKADFTKALLENA 644

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
            M  +   EA    A+L +  L  ++L G   EGADF  A I+ A K
Sbjct: 645 DMQAVEAAEAIFKEAILKQANLKAANLAGINKEGADFDKAKINDATK 691


>gi|427712429|ref|YP_007061053.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
 gi|427376558|gb|AFY60510.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
          Length = 316

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 44/88 (50%), Gaps = 5/88 (5%)

Query: 130 ANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN   AD+RE DF G     +   GA L  A  ++ +F  A+LS   + R  L  ANL+N
Sbjct: 202 ANLRGADLREKDFEGRNLSYADLTGADLSDAFLHRVSFYRANLSQATLFRANLLNANLSN 261

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A L    L  +D  GA + GAD   A +
Sbjct: 262 ANLRDANLIGADFSGADLRGADLRGAKV 289



 Score = 44.3 bits (103), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 43/84 (51%), Gaps = 12/84 (14%)

Query: 138 RESDFSGSKFNGAYLE------KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           R  D SG+   GA L       + ++Y A+ TGADLSD  + R+    ANL+ A L R  
Sbjct: 195 RGKDLSGANLRGADLREKDFEGRNLSY-ADLTGADLSDAFLHRVSFYRANLSQATLFRAN 253

Query: 192 LTRSDLGGAIIE-----GADFSDA 210
           L  ++L  A +      GADFS A
Sbjct: 254 LLNANLSNANLRDANLIGADFSGA 277


>gi|397736621|ref|ZP_10503302.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
 gi|396927531|gb|EJI94759.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
          Length = 201

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 16/131 (12%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
           +E R E  I +   F  ADL ++ HV   FR+ +FT   +  S+F      GS+F+   L
Sbjct: 38  SELRTESVIFTECDFTGADLAESNHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97

Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
              V  + +FT     GADL           EANL    L R VL  +DL     GGA  
Sbjct: 98  RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157

Query: 203 EGADFSDAVID 213
           +GAD   A +D
Sbjct: 158 DGADLRGAHVD 168


>gi|390442549|ref|ZP_10230537.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
 gi|389834137|emb|CCI34663.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
          Length = 179

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 56/127 (44%), Gaps = 21/127 (16%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFS 143
           C  N  +L DLN   A   G       A    ADL          R N   A++R +D  
Sbjct: 61  CDFNGISLKDLNLSSANLEG-------ANLSQADLE---------RTNLQGANLRGTDLR 104

Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           G+      L  A   KAN  GADL     ++  L  ANLTNA L +  L +++L  A ++
Sbjct: 105 GADLGKTLLAGADLSKANLLGADL-----EKANLQGANLTNANLQKADLEKANLTNARLD 159

Query: 204 GADFSDA 210
           GA+  DA
Sbjct: 160 GANLQDA 166


>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 418

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 162
           S A F  +DL  A+ ++ + R AN + A++ E     +D SG  F+G+ L +A   +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            G + S     R  L EAN +N       L+ SDL GA +  A+F++A
Sbjct: 203 LGTNFS-----RTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEA 245



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/139 (35%), Positives = 70/139 (50%), Gaps = 19/139 (13%)

Query: 95  NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAY 151
           N  E +  G   IG   S A F  ADLR+A  V     ANF +A+++E+D SG+   GA 
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRANLVG----ANFNNANLKEADLSGAYLIGAT 276

Query: 152 LEKAVAYKANF----------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
           L  A   +A+F          TGADL+   +    L+ ANL++  L    LT +DL  A 
Sbjct: 277 LVNANIVRADFRRANLIGADLTGADLTGADLVGANLSGANLSDCNLTSVSLTSADLSMAN 336

Query: 202 IEGADFSDAVIDLAQKQAL 220
               D ++A  +L++ QAL
Sbjct: 337 FANCDLTNA--NLSRVQAL 353



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  ADL +A   +  F   NF++ ++ E+D      +GA L  A    A+  GADL  
Sbjct: 55  ANFSGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTADLIGADLRR 114

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             ++  +L EA+L+   LV T +T ++L  A   G+D S A++
Sbjct: 115 ATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIM 157



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 9/102 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADLR         RA    A + E+D S +   G  +  A    ANFTG+DLS
Sbjct: 103 STADLIGADLR---------RATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLS 153

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             +M R  L  AN++ A L    ++R+DL G    G++ S A
Sbjct: 154 GAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQA 195



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 10/80 (12%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +FT A++ E DF+G+             KANF+GADLS   + R    E N +N  L   
Sbjct: 36  DFTGANLSEVDFAGTDL----------QKANFSGADLSRAKLRRATFGETNFSNTNLSEA 85

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L R +L GA + GA+ S A
Sbjct: 86  DLRRVNLSGADLRGANLSTA 105



 Score = 44.3 bits (103), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 16/118 (13%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYL---------- 152
           S +    A+  +A  +  NF       ANF++ + RE D SGS   GA L          
Sbjct: 188 SGSNLSQANFEEANFLGTNFSRTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEADL 247

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            +A    ANF  A+L +  +    L  A L NA +VR    R++L GA + GAD + A
Sbjct: 248 RRANLVGANFNNANLKEADLSGAYLIGATLVNANIVRADFRRANLIGADLTGADLTGA 305



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 5/83 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN-----EANLTNA 185
           +F   D+++++FSG+  + A L +A   + NF+  +LS+  + R+ L+      ANL+ A
Sbjct: 46  DFAGTDLQKANFSGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTA 105

Query: 186 VLVRTVLTRSDLGGAIIEGADFS 208
            L+   L R+ L GAI+  AD S
Sbjct: 106 DLIGADLRRATLEGAILAEADLS 128



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 16/120 (13%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA---------------YL 152
           S A    A LR+A   + NF   N + AD+R  + SG+   GA                L
Sbjct: 58  SGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTADLIGADLRRATL 117

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           E A+  +A+ +  +L  T M    L+ AN T + L   ++ R+DL  A I  A+ ++A I
Sbjct: 118 EGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIMIRADLRRANISRANLNEADI 177


>gi|389842816|ref|YP_006344900.1| hypothetical protein ES15_3816 [Cronobacter sakazakii ES15]
 gi|387853292|gb|AFK01390.1| hypothetical protein ES15_3816 [Cronobacter sakazakii ES15]
          Length = 846

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 17/119 (14%)

Query: 129 RANFTSADMRESDFS-----GSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLN 178
           RA+FT A +R+S+       G++F  A LE     +AN     F  A L  +L  R    
Sbjct: 723 RADFTHATLRQSNLRQTALCGARFELAKLENTDLSEANCRGASFQRASLVGSLFIRTDFR 782

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
           E + T+A L+  +L +S LGGA   GA+   A  DL+Q      + NG   ++G  T++
Sbjct: 783 EVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----SFTNGETRMSGAFTKR 834


>gi|359462953|ref|ZP_09251516.1| hypothetical protein ACCM5_29760 [Acaryochloris sp. CCMEE 5410]
          Length = 435

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 65/128 (50%), Gaps = 11/128 (8%)

Query: 99  AETRGEFGIGSA----AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           A  RG + +GSA    A   SADL   V++ +   AN + A +  ++   +K  GA L  
Sbjct: 287 ANLRGAY-LGSANLLGANLNSADL-IGVYLSD---ANLSQAKLVGANLRTAKLIGAKLTD 341

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
               +ANFTGADLSD  ++     +ANL      RT    +DL GA + GA F +  +D 
Sbjct: 342 TDLSEANFTGADLSDANLEGADFTDANLREVSFQRTQFREADLSGADLRGAIFLE--VDQ 399

Query: 215 AQKQALCK 222
            ++  LC+
Sbjct: 400 LEECKLCR 407



 Score = 41.2 bits (95), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 61/143 (42%), Gaps = 32/143 (22%)

Query: 99  AETRGEFGIGS---AAQFGSADLRKAVHVKENFRANFTSADMRE---------------- 139
           A  +G + IG+    A    A+LR A    +   AN + AD+ +                
Sbjct: 222 ANFQGTYLIGTNLREANLREANLRNA----DLLSANLSEADLTQANLSSANLLGTNLNSA 277

Query: 140 ----SDFSGSKFNGAYLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRT 190
               +D +G+   GAYL  A    AN   AD     LSD  + +  L  ANL  A L+  
Sbjct: 278 NFQNADLTGANLRGAYLGSANLLGANLNSADLIGVYLSDANLSQAKLVGANLRTAKLIGA 337

Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
            LT +DL  A   GAD SDA ++
Sbjct: 338 KLTDTDLSEANFTGADLSDANLE 360


>gi|443665875|ref|ZP_21133688.1| tetratricopeptide repeat family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159027171|emb|CAO86803.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443331319|gb|ELS45983.1| tetratricopeptide repeat family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 262

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 55/101 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L++ +  ++  + + + + + +S+ +G+K NGA L  A   +AN +GADLS   +     
Sbjct: 29  LQQLLSTRKCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 89  FGANLTGANLTGAILTGADLRGAYLNNANLENTKLDTAYVQ 129



 Score = 40.4 bits (93), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+L+  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLTGAILTGADLRGAYLNNANLEN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|451981569|ref|ZP_21929921.1| hypothetical protein NITGR_590064 [Nitrospina gracilis 3/211]
 gi|451761242|emb|CCQ91185.1| hypothetical protein NITGR_590064 [Nitrospina gracilis 3/211]
          Length = 241

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 135 ADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AD+R S+F+ + F+     GAYLE A    ANF  A+L    + + V   ANL  A L  
Sbjct: 91  ADLRHSNFTNANFSEANLTGAYLEGANLEGANFQRAELKAGALKQAVFRNANLFEADLRY 150

Query: 190 TVLTRSDLGGAIIEGADFS 208
           T +  +D  GA +EGADF+
Sbjct: 151 TRVDEADFTGANLEGADFT 169


>gi|427715910|ref|YP_007063904.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427348346|gb|AFY31070.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 1031

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN +  +   ++ SG+  +GA L  A   +AN  G  LS   ++R  L+ AN + A L 
Sbjct: 864 RANLSGTNFSRANLSGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLS 923

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
           R  L+ +DL GA + GAD SDA ++ A
Sbjct: 924 RANLSGADLSGADLSGADLSDANLNRA 950



 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 46/80 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF+ AD+  ++ SG+  +GA L  A    AN   A+LS   + R  L++ANL++A L  
Sbjct: 915 ANFSRADLSRANLSGADLSGADLSGADLSDANLNRANLSRANLKRANLSDANLSSANLSG 974

Query: 190 TVLTRSDLGGAIIEGADFSD 209
             L+R++L  A +  A+  D
Sbjct: 975 DNLSRANLSRANLSDANLGD 994



 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A   + N    + S A++  ++ SG+ F+ A L +A    A+ +GADL
Sbjct: 878 SGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADLSGADL 937

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   +    LN ANL+ A L R  L+ ++L  A + G + S A
Sbjct: 938 SGADLSDANLNRANLSRANLKRANLSDANLSSANLSGDNLSRA 980



 Score = 40.4 bits (93), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 9/101 (8%)

Query: 109  SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
            S A F  ADL          RAN + AD+  +D SG+  + A L +A   +AN   A+LS
Sbjct: 913  SGANFSRADLS---------RANLSGADLSGADLSGADLSDANLNRANLSRANLKRANLS 963

Query: 169  DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
            D  +    L+  NL+ A L R  L+ ++LG    +   + +
Sbjct: 964  DANLSSANLSGDNLSRANLSRANLSDANLGDEFFKAIHWDE 1004


>gi|418939072|ref|ZP_13492497.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
 gi|375054219|gb|EHS50602.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
          Length = 202

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 11/102 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    ADLR A     NF  AN  SAD++ +D + +   GA L  A   +AN TGA  
Sbjct: 63  TGANLTGADLRWADCDGANFTGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGA-- 120

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
                   +L EA L  A L++ +  +++L G  + GAD +D
Sbjct: 121 --------ILKEARLDKASLIQAIKQKANLQGVDLSGADLTD 154



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 50/106 (47%), Gaps = 9/106 (8%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           F  ADL       E  R     A +  +DF+G+   GA L  A    ANFTGA+L    +
Sbjct: 42  FAGADL-------EQVR--LAGASLEGADFTGANLTGADLRWADCDGANFTGANLKSADL 92

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
               L  ANLT A L    LT ++L GAI++ A    A +  A KQ
Sbjct: 93  QHTDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQ 138



 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 16/110 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 162
           + A   SADL+       N   AN T A++ E++ +G+    A L+K     A+  KAN 
Sbjct: 83  TGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQKANL 142

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            G DLS           A+LT+  L R   T  +L GAI++GA  + A++
Sbjct: 143 QGVDLSG----------ADLTDMNLSRVDFTGVNLKGAILKGAILTGAIL 182


>gi|254513085|ref|ZP_05125151.1| Pentapeptide repeat protein [Rhodobacteraceae bacterium KLH11]
 gi|221533084|gb|EEE36079.1| Pentapeptide repeat protein [Rhodobacteraceae bacterium KLH11]
          Length = 353

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 47/84 (55%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN  SA + +++F  S F+ A L  A+    +F+GA L+     R    +++ +NA L 
Sbjct: 74  RANLISATLSKANFKHSNFDSADLTCAICKDTDFSGASLTTVNAPRADFEKSDFSNAFLF 133

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
             +L RS+L GA   GA+ SDA +
Sbjct: 134 GALLQRSNLSGASFFGANLSDAYL 157



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 58/113 (51%), Gaps = 1/113 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S +    A+L  A   K NF+ +NF SAD+  +    + F+GA L    A +A+F  +D 
Sbjct: 68  SNSSLARANLISATLSKANFKHSNFDSADLTCAICKDTDFSGASLTTVNAPRADFEKSDF 127

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
           S+  +   +L  +NL+ A      L+ + L G+I++   F   +++  Q ++L
Sbjct: 128 SNAFLFGALLQRSNLSGASFFGANLSDAYLAGSIMKETIFERTIMNGIQAKSL 180


>gi|220906761|ref|YP_002482072.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219863372|gb|ACL43711.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 190

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/93 (40%), Positives = 48/93 (51%), Gaps = 6/93 (6%)

Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           R  D SG+   GA L +A    AN +GA+L D L+    L  ANLTNA L    + R+DL
Sbjct: 39  RGCDLSGADLRGAILTRADLRGANLSGANLQDALLLLTDLRGANLTNANLTAAYMNRTDL 98

Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYAN--GTN 228
             A + GA   DA +    +  L K AN  GTN
Sbjct: 99  REANLSGATLVDAGL----RNTLFKGANLQGTN 127



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 9/93 (9%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           ADLR A+          T AD+R ++ SG+    A L       AN T A+L+   M+R 
Sbjct: 46  ADLRGAI---------LTRADLRGANLSGANLQDALLLLTDLRGANLTNANLTAAYMNRT 96

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            L EANL+ A LV   L  +   GA ++G +F+
Sbjct: 97  DLREANLSGATLVDAGLRNTLFKGANLQGTNFA 129



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 6/95 (6%)

Query: 117 DLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DLR A     N  A + +  D+RE++ SG+    A L   +   AN  G + + + +   
Sbjct: 77  DLRGANLTNANLTAAYMNRTDLREANLSGATLVDAGLRNTLFKGANLQGTNFAGSDLSYA 136

Query: 176 VLNEANLTNA-----VLVRTVLTRSDLGGAIIEGA 205
            L + NLTNA      L+ T L +++L GA +EGA
Sbjct: 137 DLRDTNLTNANLTATNLLFTRLNKTNLQGANLEGA 171


>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 371

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 67/136 (49%), Gaps = 11/136 (8%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMR 138
           + A+ + N++ L  L  +   T    G   AA+  + +L  A   + NFR AN T A++ 
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277

Query: 139 ES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
           E+      FSG+  +GAYL  A   KA+F  A L+   +    L EANL  A L+ T   
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADFHRASLAVANLIGANLTEANLREANLIDT--- 334

Query: 194 RSDLGGAIIEGADFSD 209
             +L GA ++ A F +
Sbjct: 335 --NLSGATVKNAKFGE 348


>gi|443317576|ref|ZP_21046968.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442782825|gb|ELR92773.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 303

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 55/103 (53%), Gaps = 11/103 (10%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G  DL +A+ V+ N  R++ +  ++ +++ + +   G  L +A   +ANFT A+L  
Sbjct: 99  ADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFTEANLRR 158

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             + R  L +ANL          TR++L  A +  ADFSDA++
Sbjct: 159 VELQRAQLGKANL----------TRANLADARMLHADFSDAIL 191



 Score = 43.9 bits (102), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 68/144 (47%), Gaps = 16/144 (11%)

Query: 74  TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFT 133
           T L+ A++   + N S L+ +N ++A       IG   +   A+LR+A         NFT
Sbjct: 104 TDLSQAILVEANLNRSDLSGVNLHQANLTKASLIG--VELNRANLREA---------NFT 152

Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEANLTNAVLV 188
            A++R  +   ++   A L +A    A    AD SD ++         LN ANLT   L 
Sbjct: 153 EANLRRVELQRAQLGKANLTRANLADARMLHADFSDAILQETNLSGARLNRANLTRTDLT 212

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              L  ++L GA +  A+F++A++
Sbjct: 213 AANLKETNLLGADLSYANFTEALL 236



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 2/119 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G+ DL+ A     +  RAN     + E+D SG+      L  A   KAN +GA+L+ 
Sbjct: 34  ANLGNFDLKGANLSGADLTRANCIGVILSEADLSGATLVRTDLSGADINKANLSGANLTK 93

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGT 227
             +    L E +L+ A+LV   L RSDL G  +  A+ + A +I +   +A  + AN T
Sbjct: 94  ANLLGADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFT 152



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 49/84 (58%), Gaps = 5/84 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R + T+A+++E++  G+  + A   +A+  +AN +GADLS   +  +     +LT   L 
Sbjct: 208 RTDLTAANLKETNLLGADLSYANFTEALLAEANLSGADLSYANLAGL-----DLTGLNLA 262

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
            T LT+++L GA +  A+  +AV+
Sbjct: 263 GTNLTQANLAGANLTEANLEEAVL 286



 Score = 37.0 bits (84), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 32/59 (54%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
            AN + AD+  ++ +G    G  L      +AN  GA+L++  ++  VL EANLT A +
Sbjct: 238 EANLSGADLSYANLAGLDLTGLNLAGTNLTQANLAGANLTEANLEEAVLTEANLTQATM 296


>gi|298246994|ref|ZP_06970799.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297549653|gb|EFH83519.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 285

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 45/82 (54%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + NF  + +R SDF+G+   G+    +   +ANF GA+L+D     + L +AN   ++LV
Sbjct: 100 KGNFKGSVLRGSDFTGADVTGSSFRGSDVREANFAGANLTDCDFSTLDLVDANFRESILV 159

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
           RT  ++S L GA   G   +D 
Sbjct: 160 RTNFSKSGLVGAQFIGVTLTDV 181


>gi|291570908|dbj|BAI93180.1| TPR domain protein [Arthrospira platensis NIES-39]
          Length = 256

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 51/100 (51%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           D+R+ +  KE    N T+A +  +D SG+   GA L  A   +AN TGA+L+   +    
Sbjct: 28  DIRQLLSTKECENCNLTNAGLVLADLSGANLTGANLTGANLSRANLTGANLTGANLTGAS 87

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           L  ANLT A L    L  SDL GA +  A   DA I  AQ
Sbjct: 88  LFGANLTGANLTGANLAGSDLRGAYLANAIAVDANITEAQ 127



 Score = 38.5 bits (88), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 38/72 (52%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T A++  ++ +G+   GA L  A  + AN TGA+L+   +    L  A L NA+ V 
Sbjct: 61  ANLTGANLSRANLTGANLTGANLTGASLFGANLTGANLTGANLAGSDLRGAYLANAIAVD 120

Query: 190 TVLTRSDLGGAI 201
             +T + L G +
Sbjct: 121 ANITEAQLIGVV 132



 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 24/40 (60%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           F AN T A++  ++ +GS   GAYL  A+A  AN T A L
Sbjct: 89  FGANLTGANLTGANLAGSDLRGAYLANAIAVDANITEAQL 128



 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN T A++  ++ +G+   GA L  A    AN  G+DL    +   +  +AN+T A L+
Sbjct: 70  RANLTGANLTGANLTGASLFGANLTGANLTGANLAGSDLRGAYLANAIAVDANITEAQLI 129

Query: 189 RTVLTRSDLGGA 200
             V   +++G A
Sbjct: 130 GVVGLPTNIGNA 141


>gi|428210339|ref|YP_007094692.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
           PCC 7203]
 gi|428012260|gb|AFY90823.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
          Length = 164

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 57/115 (49%), Gaps = 12/115 (10%)

Query: 107 IGSAAQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           I S A+   A+L     K V + E   AN T A +  +D S +    A L +A+  KAN 
Sbjct: 36  ILSKAELAGANLNGANLKGVKLSE---ANLTGATLWRTDLSNATLYKAILSRAILIKANL 92

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA---IIE--GADFSDAVI 212
           +  DL DTL++R  L   NLT A L    LT +DL  A   ++E  G D S A I
Sbjct: 93  SSVDLRDTLLNRADLRLTNLTGANLSGANLTGTDLRYAQLKLVELTGVDLSQACI 147



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 61/122 (50%), Gaps = 14/122 (11%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
            +T  E        F   DLR+ V + E    + + A + +++ +G+  NGA L+     
Sbjct: 3   VDTLLELYTAGKRDFSCFDLRR-VDLSE---IDLSGAILSKAELAGANLNGANLKGVKLS 58

Query: 159 KANFTGA-----DLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           +AN TGA     DLS+      ++ R +L +ANL++  L  T+L R+DL    + GA+ S
Sbjct: 59  EANLTGATLWRTDLSNATLYKAILSRAILIKANLSSVDLRDTLLNRADLRLTNLTGANLS 118

Query: 209 DA 210
            A
Sbjct: 119 GA 120



 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 15/97 (15%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT----- 183
           + +F+  D+R  D S    +GA L KA    AN  GA+L       + L+EANLT     
Sbjct: 14  KRDFSCFDLRRVDLSEIDLSGAILSKAELAGANLNGANLKG-----VKLSEANLTGATLW 68

Query: 184 -----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                NA L + +L+R+ L  A +   D  D +++ A
Sbjct: 69  RTDLSNATLYKAILSRAILIKANLSSVDLRDTLLNRA 105


>gi|67459256|ref|YP_246880.1| hypothetical protein RF_0864 [Rickettsia felis URRWXCal2]
 gi|67004789|gb|AAY61715.1| Uncharacterized low-complexity protein [Rickettsia felis URRWXCal2]
          Length = 959

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 12/121 (9%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +  +ADL KA   K N   A+ T+A +  +    +K + A LEKA A      G ++SD 
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 224
           +   +   EAN  NA++ R  LT++D   A++E AD      ++A+   A  KQA  K A
Sbjct: 610 IAKNINAQEANFKNAIMQRADLTKADFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669

Query: 225 N 225
           N
Sbjct: 670 N 670


>gi|218439263|ref|YP_002377592.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218171991|gb|ACK70724.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 294

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 76/157 (48%), Gaps = 18/157 (11%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE 153
           NKY+A  R    +    +    DLR       NF+ A+ + A++RE + SG+    A+L 
Sbjct: 12  NKYDAGERDFCNL----ELRRIDLRGLNLSHANFKGADLSYANLREINLSGADLREAFLN 67

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIE 203
           +A    AN  GA+L  T + +  L + NL          T A L ++ LT+++L GA + 
Sbjct: 68  EADLTGANLQGANLEGTYLIKAYLMKTNLQEANLSKAYLTGAYLSKSNLTKANLSGAYLN 127

Query: 204 GADFSDA-VIDLAQKQALCKYANGTNPITGVSTRKSL 239
           GA  S A + D++  +    + +   P+  V T+K L
Sbjct: 128 GAKLSGADLTDISYDE--TTHFDVNFPLNKVETKKEL 162


>gi|443477350|ref|ZP_21067204.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443017546|gb|ELS31963.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 670

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 23/151 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   SA+L+ A  V  N R AN   A++ +S+   +  N A LE A    A+   A+L
Sbjct: 521 SEADLNSANLKGANLVLTNLRKANLVKANLSDSNLGAANLNDAILEGADLSAADLRSAEL 580

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------------- 212
           + T +    L+ ANLT A LV       +  GA + GA+F +A++               
Sbjct: 581 NLTNLSNANLSSANLTAAKLVLI-----EFAGANLNGANFRNAIVENIGSIESADFTNAV 635

Query: 213 --DLAQKQALCKYANGTNPITGVSTRKSLGC 241
             D   ++  C  A+G    +G ST+ +L C
Sbjct: 636 NLDPIVRKYFCSLASGNVADSGNSTKSTLNC 666



 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++   +   +K   A L+KA   K N + ADL+   +    L   NL  A LV+
Sbjct: 488 ANLSQANLLRVNLFQAKLGSANLQKAELMKTNLSEADLNSANLKGANLVLTNLRKANLVK 547

Query: 190 TVLTRSDLGGA-----IIEGADFSDA 210
             L+ S+LG A     I+EGAD S A
Sbjct: 548 ANLSDSNLGAANLNDAILEGADLSAA 573


>gi|334121293|ref|ZP_08495365.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333455228|gb|EGK83883.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 299

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 10/120 (8%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL 152
           LN YE   R +F   + A    A+L  A+ +  N  RAN + A++  +  + +   GA L
Sbjct: 7   LNNYEKGHR-DF---TGADLSGANLSGAILIGVNLSRANLSGANLSRAFLTKATLQGAVL 62

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           ++      N + A + +T +    L +ANL+ A LV+  L R+ L GA + GA+   AV+
Sbjct: 63  QRT-----NLSFAKMGETQLSGADLTKANLSGAFLVKAKLPRAKLSGATLTGANLRGAVL 117


>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 443

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 59/124 (47%), Gaps = 16/124 (12%)

Query: 109 SAAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVA 157
           ++ +F  ADLR+A  V  N              N + AD+  +D SG+  +GAY   A  
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378

Query: 158 YKANFTGADLS-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             AN  GADLS     D  +    L  A+L+ A      L+ ++L GA + GAD +D  I
Sbjct: 379 SDANLQGADLSGAYFYDADLSGANLQGADLSGAYFYDADLSGANLQGANLNGADLTDTYI 438

Query: 213 DLAQ 216
           D A+
Sbjct: 439 DRAK 442



 Score = 44.7 bits (104), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 5/91 (5%)

Query: 130 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN   AD+R     +SDF+ +   GA L  A    AN  GADLS+  +    LN   L  
Sbjct: 171 ANLARADLRGTKLNQSDFTNANLAGADLRDADLTNANLAGADLSNADLTNANLNSVQLVK 230

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           A L+   L  +DL  A + GA   DA I+ A
Sbjct: 231 AQLINARLVDTDLRKANLNGAYLIDANINRA 261



 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 42/122 (34%), Positives = 56/122 (45%), Gaps = 13/122 (10%)

Query: 109 SAAQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN----- 161
           S     +ADL  A  ++E F    NF  A++   DFSG   NG  L  A    AN     
Sbjct: 264 SGTNLSNADLTSA-KLRETFPSNTNFCGANLSGIDFSGFILNGINLRWAKLIGANLTSTK 322

Query: 162 FTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           F GADL +       +D +  + ANL+   L    L+ +DL GA + GA F DA +  A 
Sbjct: 323 FIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADLSDAN 382

Query: 217 KQ 218
            Q
Sbjct: 383 LQ 384


>gi|424801888|ref|ZP_18227430.1| FIG01055523: hypothetical protein [Cronobacter sakazakii 696]
 gi|423237609|emb|CCK09300.1| FIG01055523: hypothetical protein [Cronobacter sakazakii 696]
          Length = 846

 Score = 49.7 bits (117), Expect = 0.002,   Method: Composition-based stats.
 Identities = 54/188 (28%), Positives = 85/188 (45%), Gaps = 32/188 (17%)

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
           +A+L N   FV + L AA  +  + +  +  + N  EA            +F SA     
Sbjct: 667 HARL-NKTTFVKSTLEAADFSDATLDSCSFVETNADEA------------RFISATWITC 713

Query: 122 VHVKENF--RANFTSADMRESDFS-----GSKFNGAYLE-----KAVAYKANFTGADLSD 169
               E+   RA+FT A +R+S+       G++F  A LE     +A    A+F  A L  
Sbjct: 714 AAASESTLNRADFTHATLRQSNLRQTALCGARFELAKLENTDLSEADCRGASFQRASLVG 773

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
           +L  R    E + T+A L+  +L +S LGGA   GA+   A  DL+Q      + NG   
Sbjct: 774 SLFIRTDFREVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----SFTNGETR 826

Query: 230 ITGVSTRK 237
           ++G  T++
Sbjct: 827 MSGAFTKR 834



 Score = 37.7 bits (86), Expect = 5.5,   Method: Composition-based stats.
 Identities = 30/105 (28%), Positives = 44/105 (41%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 165
           S A    ADL        NFR    + A++  +      F GA L  A    ++F+GA  
Sbjct: 551 SKALLECADLSHCQLDGANFRGTMLARAELHHTSLRDCNFEGASLSLAQCCHSDFSGARF 610

Query: 166 ---DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
               L +TL+D  V ++A L   +   T  TR     A ++G  F
Sbjct: 611 KDTQLQETLLDDCVFDDATLEGLLFRETWFTRCRFHRATLDGCVF 655


>gi|218442709|ref|YP_002381029.1| hypothetical protein PCC7424_5734 [Cyanothece sp. PCC 7424]
 gi|218175067|gb|ACK73799.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 266

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/143 (33%), Positives = 66/143 (46%), Gaps = 27/143 (18%)

Query: 121 AVHVKENFRANF-TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
           AV  K N    F  +A+++ +D  G+   GAYL  A     + TGA+L D  +    L  
Sbjct: 125 AVGPKANLNGAFLNTANLKNADLKGANLRGAYLSGA-----DLTGANLEDAALSGANLQG 179

Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID-----------LAQ------KQALCK 222
           A LT A L +  L  ++L GA +  AD +DA ++           LAQ      K  LC 
Sbjct: 180 ALLTGAYLRKARLIGAELQGADLRAADLTDANLEQLQNLAGADFTLAQGLTEDTKAMLCS 239

Query: 223 YAN---GT-NPITGVSTRKSLGC 241
                 GT NP T  +T +SLGC
Sbjct: 240 RPAQELGTWNPFTRSNTAQSLGC 262



 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 1/113 (0%)

Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           +L KA+   +N  +AN    ++ + D S +  + A L  A   + N  GA+L    +  +
Sbjct: 7   ELTKALSEGKNLAKANLQGINLAQMDLSNADLSAANLIGANLSETNLKGANLEGADLRGV 66

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
            L++ANL  A L  + L RS+L G  ++ A    A I LA+  +   +  G N
Sbjct: 67  NLSKANLEGANLQNSYLFRSNLEGCCLKEAQLQGAKIQLARYDSYTVWPEGYN 119


>gi|186683195|ref|YP_001866391.1| pentapeptide repeat-containing serine/threonine kinase [Nostoc
           punctiforme PCC 73102]
 gi|186465647|gb|ACC81448.1| serine/threonine protein kinase with pentapeptide repeats [Nostoc
           punctiforme PCC 73102]
          Length = 534

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 14/107 (13%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G+ +A Q G  D   A+H       N +  +++ +D SG+ F+   L+K      N  GA
Sbjct: 398 GLLTAYQKGRRDF--ALH-------NLSLLNLQGADLSGTNFHSTQLQKT-----NLQGA 443

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +L ++   R  L++ANL +A L +     +DL GA + GAD S+A +
Sbjct: 444 NLHNSDFGRASLSKANLKDANLTKAYFNHADLEGADLRGADLSNAYL 490



 Score = 37.0 bits (84), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 5/104 (4%)

Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
           H  +  + N   A++  SDF  +  + A L+ A   KA F  ADL         L  A+L
Sbjct: 431 HSTQLQKTNLQGANLHNSDFGRASLSKANLKDANLTKAYFNHADLEGA-----DLRGADL 485

Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
           +NA L    L  ++L GA +  A  SD  + LA+   +    NG
Sbjct: 486 SNAYLSNANLRGANLCGANLTSAKISDEQLALAKTNWMTIRPNG 529


>gi|189499236|ref|YP_001958706.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           BS1]
 gi|189494677|gb|ACE03225.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
          Length = 442

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+      R ++  G+ FN A+++KA    A+ TGA L +T +    L ++NL+   L 
Sbjct: 326 RASLVETVFRNANLQGADFNRAFMKKADLSGADLTGAQLRETRLQEADLKKSNLSKTNLY 385

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
            T LT +DL GA + GA+    ++D A   A     +G    TG +  K
Sbjct: 386 DTDLTCADLRGADLTGANLLYTILDNALISAETITPSGEKATTGWAVLK 434



 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 4/102 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A+   AD R A    + F A+    D++++D SG+   GA L+ + A +A F  ADL+ T
Sbjct: 83  AKLNGADFRNA----KLFSASLKRTDLKQTDLSGANLRGADLKNSYAKEAKFINADLTGT 138

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                 L  A+LT AVL   +   ++L  A + G + + A +
Sbjct: 139 DFRYANLEGADLTGAVLENALFFDANLSSADLRGVNLTGAKM 180



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 10/85 (11%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANLTN 184
           A    ADM++ D + S  NGA L+ A     +F+ +DLS T   R  L E     ANL  
Sbjct: 287 AGLKGADMKKLDMTSSTMNGAKLDHA-----DFSESDLSSTSWKRASLVETVFRNANLQG 341

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSD 209
           A   R  + ++DL GA + GA   +
Sbjct: 342 ADFNRAFMKKADLSGADLTGAQLRE 366


>gi|428313439|ref|YP_007124416.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255051|gb|AFZ21010.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 167

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 51/93 (54%), Gaps = 10/93 (10%)

Query: 128 FRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKA-----NFTGADLSDTLMDRMVL 177
           ++AN + AD+R++ F+     G++  GA L +A   KA     N  GA L+ T +    L
Sbjct: 58  YQANLSKADLRQTIFNEAILHGAELTGANLHRASLIKADLCEANLKGASLTHTNLGAAKL 117

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + ANL NA L    L ++DL  A +EGAD S A
Sbjct: 118 SGANLNNANLTWANLRKADLKNANLEGADLSGA 150



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 11/101 (10%)

Query: 119 RKAVHVKENF-RANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADL 167
           R+ +  + NF RAN   +D+R+          ++   S  +GA L +   Y+AN + ADL
Sbjct: 8   RRYLAGERNFHRANLNGSDLRKIPLMRADLLKANLHNSNLSGANLTRVNLYQANLSKADL 67

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
             T+ +  +L+ A LT A L R  L ++DL  A ++GA  +
Sbjct: 68  RQTIFNEAILHGAELTGANLHRASLIKADLCEANLKGASLT 108



 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 7/112 (6%)

Query: 116 ADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL KA +H      AN T  ++ +++ S +        +A+ + A  TGA+L    + +
Sbjct: 35  ADLLKANLHNSNLSGANLTRVNLYQANLSKADLRQTIFNEAILHGAELTGANLHRASLIK 94

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
             L EANL  A      LT ++LG A + GA+ ++A +  A  ++A  K AN
Sbjct: 95  ADLCEANLKGA-----SLTHTNLGAAKLSGANLNNANLTWANLRKADLKNAN 141



 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF  A++  SD        A L KA  + +N +GA+L+     R+ L +ANL+ A L +T
Sbjct: 16  NFHRANLNGSDLRKIPLMRADLLKANLHNSNLSGANLT-----RVNLYQANLSKADLRQT 70

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
           +   + L GA + GA+   A +
Sbjct: 71  IFNEAILHGAELTGANLHRASL 92


>gi|428299412|ref|YP_007137718.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428235956|gb|AFZ01746.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 677

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 47/85 (55%), Gaps = 11/85 (12%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-DRMVLNEANLTNAVLV 188
           ANFT A++ E+ FSG+K  GA          +F GA L  T M +   L+E+NL NA L 
Sbjct: 561 ANFTHANLTEAQFSGAKLVGA----------DFHGAILIATKMKNDTNLDESNLYNANLH 610

Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
           R + T   + GA + GAD S A +D
Sbjct: 611 RAIFTNVTMRGADLFGADLSRATLD 635



 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 15/96 (15%)

Query: 131 NFTSADMRESDFSGSKFNG-----AYLEKAVAYKA----------NFTGADLSDTLMDRM 175
           NF+  ++   DFSG+  NG     A L KA   KA          N  GA+LS   +   
Sbjct: 456 NFSGQNLIGQDFSGNNLNGRNFSNANLSKANLNKASLINADLSNANLEGANLSHADLSGA 515

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
            L+  NL  A+L+  +L R DL  A + GA+ + A+
Sbjct: 516 NLSNVNLVGAILIEAILNRVDLCNANLNGANLTLAL 551



 Score = 37.4 bits (85), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 54/117 (46%), Gaps = 22/117 (18%)

Query: 113 FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F +A+L KA     N  +A+  +AD+  ++  G+  + A L  A     N  GA L + +
Sbjct: 477 FSNANLSKA-----NLNKASLINADLSNANLEGANLSHADLSGANLSNVNLVGAILIEAI 531

Query: 172 MDRM-----VLNEANLT-----------NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           ++R+      LN ANLT           NA      LT +   GA + GADF  A++
Sbjct: 532 LNRVDLCNANLNGANLTLALFRDEPDLCNANFTHANLTEAQFSGAKLVGADFHGAIL 588


>gi|298491495|ref|YP_003721672.1| serine/threonine protein kinase ['Nostoc azollae' 0708]
 gi|298233413|gb|ADI64549.1| serine/threonine protein kinase ['Nostoc azollae' 0708]
          Length = 533

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N +  ++  +D SG+ F+ A L K      N  GA+L +T   R  L + NL +A L + 
Sbjct: 413 NISLLNLEGADLSGTNFHSAQLRKT-----NLQGANLENTDFGRASLMQTNLRDANLTKA 467

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
            L+ +DL GA + GAD S A I  A
Sbjct: 468 YLSHADLEGADLRGADLSYAYISQA 492


>gi|220909908|ref|YP_002485219.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanothece sp. PCC 7425]
 gi|219866519|gb|ACL46858.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanothece sp. PCC 7425]
          Length = 526

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 60/116 (51%), Gaps = 13/116 (11%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA- 160
           +G+ G+ S  +  +A L   +      R +FT+ D+R           AYL +AV ++A 
Sbjct: 383 KGQTGVASKTKLDAAKL---IEAYRKGRRDFTNQDLRSL-----VLRKAYLAEAVFHQAQ 434

Query: 161 ----NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
               +  GA+L +  + R  L +ANL +A L +  L+ +DL GA + GA+ SDA +
Sbjct: 435 LNNTDLQGANLFNANLGRASLTKANLRDANLQKAYLSYADLAGADLRGANLSDAYL 490


>gi|414077930|ref|YP_006997248.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
 gi|413971346|gb|AFW95435.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
          Length = 189

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 43/89 (48%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N    D   +D  G+    A L  A   KAN  GA L+   +  ++L  A+LT A L   
Sbjct: 31  NLGGVDFGRADLRGANLTAASLSGANLSKANLQGAILARAHLSEVILCGADLTQATLTTA 90

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
            L  SDL GA++ GA+  DA + +A   A
Sbjct: 91  HLNESDLSGALLSGANLCDANLHMASISA 119



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 55/113 (48%), Gaps = 21/113 (18%)

Query: 109 SAAQFGSADLRKAV----HVKENF-------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
           S A    A+L+ A+    H+ E         +A  T+A + ESD SG+  +GA L  A  
Sbjct: 53  SGANLSKANLQGAILARAHLSEVILCGADLTQATLTTAHLNESDLSGALLSGANLCDANL 112

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + A+ + A+L            ANL+ A +    + ++DL GA + GAD S+A
Sbjct: 113 HMASISAANLQG----------ANLSGAKMGGVRMWKADLQGADLSGADLSEA 155



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 1/95 (1%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    +DL  A+    N   AN   A +  ++  G+  +GA +     +KA+  GADL
Sbjct: 88  TTAHLNESDLSGALLSGANLCDANLHMASISAANLQGANLSGAKMGGVRMWKADLQGADL 147

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           S   +    L E NLT A L  T ++ + L GAI+
Sbjct: 148 SGADLSEANLCEVNLTGANLDDTDMSETFLTGAIM 182



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 47/98 (47%), Gaps = 9/98 (9%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           FG ADLR A         N T+A +  ++ S +   GA L +A   +    GADL+   +
Sbjct: 37  FGRADLRGA---------NLTAASLSGANLSKANLQGAILARAHLSEVILCGADLTQATL 87

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
               LNE++L+ A+L    L  ++L  A I  A+   A
Sbjct: 88  TTAHLNESDLSGALLSGANLCDANLHMASISAANLQGA 125


>gi|419963472|ref|ZP_14479445.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
 gi|432333027|ref|ZP_19584842.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
           IFP 2016]
 gi|414571123|gb|EKT81843.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
 gi|430780078|gb|ELB95186.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
           IFP 2016]
          Length = 201

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 16/131 (12%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
           +E R E  I +   F  ADL ++ HV   FR+ +FT   +  S+F      GS+F+   L
Sbjct: 38  SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97

Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
              V  + +FT     GADL           EANL    L R VL  +DL     GGA  
Sbjct: 98  RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157

Query: 203 EGADFSDAVID 213
           +GAD   A +D
Sbjct: 158 DGADLRGAHVD 168


>gi|300863652|ref|ZP_07108591.1| conserved exported hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300338360|emb|CBN53735.1| conserved exported hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 329

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 47/93 (50%), Gaps = 6/93 (6%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADLR A   K N        AN   A+++++D  G+   GAYL ++   +AN +G
Sbjct: 149 ANLQGADLRGANLYKTNLTTTNLTEANLLYANLQQADLRGTNLQGAYLVRSHLQRANLSG 208

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           ADLS   +    L EANLT A L+   L   DL
Sbjct: 209 ADLSGADLGGAYLTEANLTRANLIGAKLNLIDL 241



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 12/115 (10%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR   +      R++   A++  +D SG+   GAYL +A   +AN  GA L+ 
Sbjct: 179 ANLQQADLRGTNLQGAYLVRSHLQRANLSGADLSGADLGGAYLTEANLTRANLIGAKLNL 238

Query: 170 TLMDR-----------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             +D+             L  A L+ A L+   LT ++L GA + GAD   A +D
Sbjct: 239 IDLDKPSCINVCEVYPTQLQGAILSQASLIGADLTGANLSGADLRGADLRSANLD 293



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 6/96 (6%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
           D RK  H       N  +AD+R ++  G+   GA L K      N T A+L    + +  
Sbjct: 132 DRRKPNHT------NLQNADLRYANLQGADLRGANLYKTNLTTTNLTEANLLYANLQQAD 185

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           L   NL  A LVR+ L R++L GA + GAD   A +
Sbjct: 186 LRGTNLQGAYLVRSHLQRANLSGADLSGADLGGAYL 221



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 49/122 (40%), Gaps = 12/122 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSG-----------SKFNGAYLEKAV 156
           S A    ADL  A   + N  RAN   A +   D              ++  GA L +A 
Sbjct: 207 SGADLSGADLGGAYLTEANLTRANLIGAKLNLIDLDKPSCINVCEVYPTQLQGAILSQAS 266

Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
              A+ TGA+LS   +    L  ANL  AVL    L+ + L G  + G D   A +    
Sbjct: 267 LIGADLTGANLSGADLRGADLRSANLDGAVLTNADLSFAALAGTSLSGTDLKGATLTNGM 326

Query: 217 KQ 218
           +Q
Sbjct: 327 RQ 328


>gi|434387412|ref|YP_007098023.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428018402|gb|AFY94496.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 263

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 11/120 (9%)

Query: 105 FGIGSAA-QFGSADLRKAVHVKENF----------RANFTSADMRESDFSGSKFNGAYLE 153
           FGI   A  +  ADL+K +   +            R    +AD++ +   G+  +GA L 
Sbjct: 31  FGIAPVALAYNPADLKKLIATNKCIGCDLSGADLSRQQLVNADLQAATLVGANLSGANLA 90

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            A    AN TGA+L+ T +   VL  A+L +     T LTR+DL  A +   +F  A+++
Sbjct: 91  SAKLGGANLTGANLTRTNLTGAVLQAASLIDVNFANTNLTRTDLSYANLVNTNFRSAILN 150


>gi|254264016|ref|ZP_04954881.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
 gi|254215018|gb|EET04403.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
          Length = 825

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T  D+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 512 ADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585


>gi|158334009|ref|YP_001515181.1| hypothetical protein AM1_0823 [Acaryochloris marina MBIC11017]
 gi|158304250|gb|ABW25867.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 421

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 65/128 (50%), Gaps = 11/128 (8%)

Query: 99  AETRGEFGIGSA----AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           A  RG + +GSA    A   SADL   V++ +   AN + A +  ++   +K  GA L  
Sbjct: 273 ANLRGAY-LGSANLLGANLNSADL-IGVYLSD---ANLSHAKLVGANLRTAKLIGAQLAD 327

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
               +ANFTGADLSD  ++     +ANL      RT    +DL GA + GA F +  +D 
Sbjct: 328 TDLSEANFTGADLSDANLEGADFTDANLREVSFQRTQFREADLSGADLRGAIFLE--VDQ 385

Query: 215 AQKQALCK 222
            ++  LC+
Sbjct: 386 LEECKLCR 393



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 60/141 (42%), Gaps = 28/141 (19%)

Query: 99  AETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRE------------------ 139
           A  +G + IG+      ADLR+A +   +   AN + AD+ +                  
Sbjct: 208 ANFQGTYLIGT--NLREADLREANLRNADLLSANLSEADLTQANLSSANLLGTNLNSANF 265

Query: 140 --SDFSGSKFNGAYLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVL 192
             +D +G+   GAYL  A    AN   AD     LSD  +    L  ANL  A L+   L
Sbjct: 266 QNADLTGANLRGAYLGSANLLGANLNSADLIGVYLSDANLSHAKLVGANLRTAKLIGAQL 325

Query: 193 TRSDLGGAIIEGADFSDAVID 213
             +DL  A   GAD SDA ++
Sbjct: 326 ADTDLSEANFTGADLSDANLE 346


>gi|434394477|ref|YP_007129424.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428266318|gb|AFZ32264.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 132

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 5/88 (5%)

Query: 115 SADLRKAVHVKE----NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           S++L++ ++ K+    N R AN  +A++ E++ SG+   GA L+ A   KAN  GA+L  
Sbjct: 40  SSELQRLLNTKQCPGCNLRGANLRNANLEEANLSGANLQGANLQNADLEKANLQGANLQQ 99

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDL 197
             +    L EANL NA L    L  +DL
Sbjct: 100 ANLSDADLQEANLQNANLQNANLRSADL 127


>gi|119509719|ref|ZP_01628864.1| hypothetical protein N9414_00180 [Nodularia spumigena CCY9414]
 gi|119465585|gb|EAW46477.1| hypothetical protein N9414_00180 [Nodularia spumigena CCY9414]
          Length = 212

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 15/100 (15%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           ++   T+ D+   DFSGS  +   L       A+ +GADLSDT M +++L  ANL+ A L
Sbjct: 86  YKPPKTNPDLSGKDFSGSNLSNKDLSGRNLSYADLSGADLSDTFMHKVILRGANLSEANL 145

Query: 188 VR---------------TVLTRSDLGGAIIEGADFSDAVI 212
            R               + L  +DL GA + GAD + A I
Sbjct: 146 FRANLLLADMREANLRSSYLIGADLSGADLRGADLTGARI 185


>gi|440756225|ref|ZP_20935426.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|440173447|gb|ELP52905.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 433

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 12/111 (10%)

Query: 109 SAAQFGSADLRKAV---------HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           S A    ADL +A+         H+   + A+ + A++ E+D S +  + A L +A+   
Sbjct: 261 SEAILSEADLSEAILWTAKLSWAHL---WGADLSGANLSEADLSEADLSEADLSEAILRG 317

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           AN + ADLS   +    L +ANL  A+L   +L+ +DL GAI+ GAD S A
Sbjct: 318 ANLSEADLSWANLRGANLIQANLRGAILSWAILSGADLSGAILRGADLSGA 368



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 59/121 (48%), Gaps = 12/121 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL +A+      R AN + AD+  ++  G+    A L  A+   A  +GADL
Sbjct: 301 SEADLSEADLSEAI-----LRGANLSEADLSWANLRGANLIQANLRGAILSWAILSGADL 355

Query: 168 SDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 221
           S  +     +    L+EA+L  A L   +L+ + L GA +E A F DA  I   QKQ L 
Sbjct: 356 SGAILRGADLSGADLSEADLRGAFLSEAILSGAILSGAKVENAIFIDATGITPEQKQDLI 415

Query: 222 K 222
           +
Sbjct: 416 R 416



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A + E+D S +    A L  A  + A+ +GA+LS+  +    L+EA+L+ A+L  
Sbjct: 258 ANLSEAILSEADLSEAILWTAKLSWAHLWGADLSGANLSEADLSEADLSEADLSEAILRG 317

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ +DL  A + GA+   A
Sbjct: 318 ANLSEADLSWANLRGANLIQA 338



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 11/110 (10%)

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           T+ EF    A     A+L KA+              +R  D SG+    A L  A   +A
Sbjct: 200 TKAEFTT-DAKVIEKAELIKAIR-----EGTIDETTLRFVDLSGAILIEADLSWANLSEA 253

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + +GA+LS+      +L+EA+L+ A+L    L+ + L GA + GA+ S+A
Sbjct: 254 DLSGANLSEA-----ILSEADLSEAILWTAKLSWAHLWGADLSGANLSEA 298



 Score = 37.0 bits (84), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
             A+ + A++ E+D SG+  + A L +A   +A    A LS   +    L+ ANL+ A L
Sbjct: 241 IEADLSWANLSEADLSGANLSEAILSEADLSEAILWTAKLSWAHLWGADLSGANLSEADL 300

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
               L+ +DL  AI+ GA+ S+A
Sbjct: 301 SEADLSEADLSEAILRGANLSEA 323


>gi|428311473|ref|YP_007122450.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428253085|gb|AFZ19044.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 580

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 45/79 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T A++R+ + +G+  +GA L       ANFTGA+L    +   +L+ AN T ++L R
Sbjct: 25  ANLTGANLRKINLTGANLSGANLSWCCFSHANFTGANLHQANLHSAILDNANFTQSILSR 84

Query: 190 TVLTRSDLGGAIIEGADFS 208
             L++ DL  A +  AD +
Sbjct: 85  AKLSKVDLRLANLREADLN 103



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
             AN    ++  ++F+G+    A LE+A   +A   G +L++  ++ + L  ANL  A L
Sbjct: 143 MEANLCRTNLIATNFTGANLREANLEQANLQEATLVGVNLTEANLNNVYLRGANLRQADL 202

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
            R +LT +D+  A  EGAD S A
Sbjct: 203 HRAILTGADMSEANCEGADLSRA 225



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 53/95 (55%), Gaps = 1/95 (1%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           A  F  A+LR+A   + N + A     ++ E++ +     GA L +A  ++A  TGAD+S
Sbjct: 154 ATNFTGANLREANLEQANLQEATLVGVNLTEANLNNVYLRGANLRQADLHRAILTGADMS 213

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           +   +   L+ ANLT A L+R  L ++DL  A+++
Sbjct: 214 EANCEGADLSRANLTGAYLLRASLRKADLLRAVLQ 248



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 16/101 (15%)

Query: 116 ADLRKA-VHVKENFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+LR+A +H     RA  T ADM E+     D S +   GAYL +A   KA+   A L +
Sbjct: 195 ANLRQADLH-----RAILTGADMSEANCEGADLSRANLTGAYLLRASLRKADLLRAVLQE 249

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + R  L+EANL  A      L ++DL GA ++    S+A
Sbjct: 250 VYLLRTDLSEANLRGA-----DLRKADLSGAYLKDTLLSEA 285



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F  + L +A   K + R AN   AD+  +D S S  +GA L+     + N   A+L+ 
Sbjct: 75  ANFTQSILSRAKLSKVDLRLANLREADLNWADLSASNLSGADLQNTQLDQINLEHANLNH 134

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            L+    L EANL    L+ T  T ++L  A +E A+  +A +
Sbjct: 135 ALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATL 177



 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 54/104 (51%), Gaps = 7/104 (6%)

Query: 111 AQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           A    ADL +AV ++E +  R + + A++R +D   +  +GAYL+  +  +AN +GA L 
Sbjct: 235 ASLRKADLLRAV-LQEVYLLRTDLSEANLRGADLRKADLSGAYLKDTLLSEANLSGAYLL 293

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA----IIEGADFS 208
           ++ + R  L+ A LT   + +  L   DL       +  G D+S
Sbjct: 294 ESYLIRTKLDRAELTGCCIHQWHLEEVDLSYVECRYVFTGFDYS 337



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 44/90 (48%), Gaps = 10/90 (11%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------MVLNEA 180
           N   A++  +   G++   A L +      NFTGA+L +  +++          + L EA
Sbjct: 126 NLEHANLNHALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATLVGVNLTEA 185

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           NL N  L    L ++DL  AI+ GAD S+A
Sbjct: 186 NLNNVYLRGANLRQADLHRAILTGADMSEA 215


>gi|298250682|ref|ZP_06974486.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297548686|gb|EFH82553.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 287

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 43/82 (52%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           + NF  + +R SDF+G+   G+    +   +A F GA+L+D     + L +AN   A+LV
Sbjct: 100 KGNFKGSALRGSDFTGADLTGSSFRGSDVREATFAGANLTDCDFSTLDLTDANFREAILV 159

Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
           RT   +S L GA   G   +D 
Sbjct: 160 RTNFNKSGLVGAKFIGVTLTDV 181


>gi|409994207|ref|ZP_11277325.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
           Paraca]
 gi|409934955|gb|EKN76501.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
           Paraca]
          Length = 519

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 50/92 (54%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
           +ANFT A +  ++FSG+   G  L +A    +  +GA L    ++  VLN          
Sbjct: 34  QANFTEAILSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 93

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +ANL +A L+R  L R++L  A++ GA+ ++A
Sbjct: 94  QANLIDASLIRAELMRAELSEAVVNGANLTEA 125



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 63/129 (48%), Gaps = 13/129 (10%)

Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
           TR +  +   S A    A+L +AV +V    RA+ + A++ ++    ++   A L +AV 
Sbjct: 58  TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLIDASLIRAELMRAELSEAVV 117

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNA----------VLVRTVLTRSDLGGAIIEGADF 207
             AN T ADL +  +    L +ANL+ A           L R+ LTRSDL  A + G + 
Sbjct: 118 NGANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNL 177

Query: 208 SDAVIDLAQ 216
            +A +  A+
Sbjct: 178 RNAELRQAE 186



 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           DFS      A L +    +ANFT A LS T       + ANLT   L R  L  S L GA
Sbjct: 16  DFSAILLCEANLSRVNLSQANFTEAILSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70

Query: 201 IIEGADFSDAVIDLA 215
           I++GA+ ++AV+++A
Sbjct: 71  ILQGANLNEAVLNVA 85



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 1/96 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L +A  +  N  R+N T +D+  +D  G     A L +A    A+  GA+LS 
Sbjct: 140 ANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNLRNAELRQAELSGADLRGANLSG 199

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
             +    L+ ANL+ A L  T L+ + L GA + GA
Sbjct: 200 ANLRWANLSGANLSGANLEATQLSGASLRGANLSGA 235



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           AN T AD+RE     ++   +  +GA L +A    +N   ++L+ + + R  L   NL N
Sbjct: 120 ANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNLRN 179

Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
           A L +  L+ +DL GA + GA+ 
Sbjct: 180 AELRQAELSGADLRGANLSGANL 202



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +A+LR+A     + R AN + A++R ++ SG+  +GA LE      A+  GA+LS   + 
Sbjct: 179 NAELRQAELSGADLRGANLSGANLRWANLSGANLSGANLEATQLSGASLRGANLSGASLL 238

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
                 A+LT A L+    T +DL G+ + G
Sbjct: 239 NCTAIHADLTQANLIECDWTDADLRGSALTG 269


>gi|409912856|ref|YP_006891321.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
           KN400]
 gi|298506440|gb|ADI85163.1| pentapeptide repeat domain protein [Geobacter sulfurreducens KN400]
          Length = 259

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 4/97 (4%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    AD+RK V+V+   + NF+ A++  ++FSG+K   A L  AV    NF+ ADLS T
Sbjct: 122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            +  + L  AN   A    T+L  + L GA   GAD 
Sbjct: 178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADFTGADL 214



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 14/100 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + AQ   A L +A+         F +ADMR +  SG     AY+  A    AN +GAD+ 
Sbjct: 85  TGAQMDGASLDEAI---------FDTADMRSAHCSG-----AYIHHAKFVGANLSGADMR 130

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
              +++   ++ANLTNA      L  ++LGGA++ G +FS
Sbjct: 131 KVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFS 170



 Score = 37.0 bits (84), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 52/117 (44%), Gaps = 22/117 (18%)

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           G   I +A  F    + +A  + E    +F SAD++  D  G K +          ++NF
Sbjct: 12  GLLSIATAHAFDPLVIERAKSLGECEHCDFVSADLKGVDLKGIKLD----------ESNF 61

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           TGADLS   +D     E N T A          +L GA ++GA   +A+ D A  ++
Sbjct: 62  TGADLSAAAIDD--CGECNFTGA----------NLTGAQMDGASLDEAIFDTADMRS 106


>gi|119490886|ref|ZP_01623169.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119453704|gb|EAW34863.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 517

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 1/106 (0%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
            I +AA    ADLR+A   + + R AN  SA++R++    S   G  L  A   +A+  G
Sbjct: 115 AILTAANLSEADLREATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRADLRG 174

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+L +  + +  L++ANL+ A L    L  +DL GA + GA+   A
Sbjct: 175 ANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGANLEQA 220



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 10/91 (10%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN-------- 181
           ANF+ A +  ++ SG+  +G  L +A    A  +GA+LS   +   +LN AN        
Sbjct: 35  ANFSQAVLSITNLSGANLSGTNLSQAKLNVAKLSGANLSGANLTGAILNVANLIRADLSH 94

Query: 182 --LTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             L NA  +R+ L R+DL  AI+  A+ S+A
Sbjct: 95  ATLINASAIRSELIRADLSHAILTAANLSEA 125



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 21/121 (17%)

Query: 111 AQFGSADLRKAVHVKENFR----------------ANFTSADMRESDFSGSKFNGAYLEK 154
           A   SA+LR AV +  N                  AN  +A++R+++ S +  +GA L+ 
Sbjct: 140 ANLKSANLRDAVLIASNLEGTNLHGADLTRADLRGANLVNAELRQANLSQANLSGANLKG 199

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEA-----NLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A    A+  GADL    +++  L+ A     +L++A L+ T L  +DL  A + GAD++ 
Sbjct: 200 ANLRWADLNGADLRGANLEQARLSGASLYGADLSHASLLYTHLIHADLTQANLTGADWTG 259

Query: 210 A 210
           A
Sbjct: 260 A 260



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 60/120 (50%), Gaps = 15/120 (12%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
           ADL +  A+ RG       A   +A+LR+A   + N   AN   A++R +D +G+   GA
Sbjct: 165 ADLTR--ADLRG-------ANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGA 215

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            LE+A    A+  GADLS   +    L  A+LT A L     T +D  GA + GA  + A
Sbjct: 216 NLEQARLSGASLYGADLSHASLLYTHLIHADLTQANL-----TGADWTGAELTGAALTGA 270



 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L  A+ +V    RA+ + A +  +    S+   A L  A+   AN + ADL
Sbjct: 68  SGANLSGANLTGAILNVANLIRADLSHATLINASAIRSELIRADLSHAILTAANLSEADL 127

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            +  + ++ L +ANL +A L   VL  S+L G  + GAD + A
Sbjct: 128 REATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRA 170


>gi|428227093|ref|YP_007111190.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427986994|gb|AFY68138.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 225

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 15/147 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A    ADLR A  +  NFR      AN   AD+R +D   ++  G  L  A+ +  N 
Sbjct: 70  SYANLKRADLRGATLLGANFRGVNLEQANLCGADLRGADLRCAQMQGVQLRGALMHGVNL 129

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DL 214
            GA+L+ + +    LN A    ++L R  L  + L  A + G + +DA +        DL
Sbjct: 130 VGANLAASELAGSNLNHARCMGSLLGRANLRGATLVKADLRGVELTDASLRSADLANADL 189

Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
            +   +    +  N +TG + R++  C
Sbjct: 190 ERANLIGADLDRAN-LTGTNLRRAFVC 215


>gi|428301952|ref|YP_007140258.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238496|gb|AFZ04286.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 267

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 46/92 (50%), Gaps = 15/92 (16%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   A++  +D SG+    A LEKA     N  GA LS   + +  L  ANL+NA L 
Sbjct: 78  RANLEGANLSNADLSGTFLGEANLEKA-----NLQGAKLSQAFLYKANLEGANLSNAYLS 132

Query: 189 RTVLTRSDLGGA----------IIEGADFSDA 210
            T LTR++L GA          I+  AD  DA
Sbjct: 133 GTALTRANLRGANLRKSVIFVSILSEADLQDA 164



 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 56/105 (53%), Gaps = 9/105 (8%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    A+LRK+V     F +  + AD+++++   +K   + LE+A   +AN T A L + 
Sbjct: 139 ANLRGANLRKSVI----FVSILSEADLQDANLMEAKLLSSNLERANLARANLTKAQLHNA 194

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
                +L +ANLT A LV+  L ++ L  A +  AD + A++  A
Sbjct: 195 -----ILQDANLTQAKLVKAELNQASLARANLLNADLTGAILQQA 234



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 46/83 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   A + +++  G+   GA L  A+  +AN  GA+LS+  +    L EANL  A L  
Sbjct: 49  ADLYGAKLSKANLQGANLQGAILNYALLGRANLEGANLSNADLSGTFLGEANLEKANLQG 108

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L+++ L  A +EGA+ S+A +
Sbjct: 109 AKLSQAFLYKANLEGANLSNAYL 131



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 61/122 (50%), Gaps = 8/122 (6%)

Query: 95  NKYEAETRGEFGIGSA----AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
           N   A+  G F +G A    A    A L +A   K N   AN ++A +  +  + +   G
Sbjct: 85  NLSNADLSGTF-LGEANLEKANLQGAKLSQAFLYKANLEGANLSNAYLSGTALTRANLRG 143

Query: 150 AYLEKAVAYKANFTGADLSD-TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           A L K+V + +  + ADL D  LM+  +L  +NL  A L R  LT++ L  AI++ A+ +
Sbjct: 144 ANLRKSVIFVSILSEADLQDANLMEAKLL-SSNLERANLARANLTKAQLHNAILQDANLT 202

Query: 209 DA 210
            A
Sbjct: 203 QA 204



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 6/91 (6%)

Query: 123 HVKENFRANF-TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
           HVK+    N   S D+  +D  G+K + A L+ A     N  GA L+  L+ R  L  AN
Sbjct: 31  HVKQLLNTNSCPSCDLSNADLYGAKLSKANLQGA-----NLQGAILNYALLGRANLEGAN 85

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           L+NA L  T L  ++L  A ++GA  S A +
Sbjct: 86  LSNADLSGTFLGEANLEKANLQGAKLSQAFL 116


>gi|359459044|ref|ZP_09247607.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 256

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 46/78 (58%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           AD+R ++ +G+    A L K    +AN +GA LS   +   V+ +A+L  A+L++T + +
Sbjct: 63  ADLRGTNLAGANLQAANLMKTDFCQANLSGAILSGASLQDAVMTQADLNGAILIKTSMIQ 122

Query: 195 SDLGGAIIEGADFSDAVI 212
           + L GAI+ GA+   A I
Sbjct: 123 TRLRGAILRGANLKQARI 140



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 6/98 (6%)

Query: 116 ADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADLR       N +A N    D  +++ SG+  +GA L+ AV  +A+  GA L  T M +
Sbjct: 63  ADLRGTNLAGANLQAANLMKTDFCQANLSGAILSGASLQDAVMTQADLNGAILIKTSMIQ 122

Query: 175 M-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                 +L  ANL  A ++ + L   +L   ++E AD 
Sbjct: 123 TRLRGAILRGANLKQARILGSFLEDVNLKKGVLEKADL 160


>gi|239816752|ref|YP_002945662.1| pentapeptide repeat-containing protein [Variovorax paradoxus S110]
 gi|239803329|gb|ACS20396.1| pentapeptide repeat protein [Variovorax paradoxus S110]
          Length = 866

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 44/129 (34%), Positives = 59/129 (45%), Gaps = 19/129 (14%)

Query: 116 ADLRKAVHVKENFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A L  A     +FRA      + E+     +FSG +  GA L       A+F+GA L D 
Sbjct: 517 AHLSDAAPPMPSFRAAKIRRRLAEAAPGARNFSGMRLVGADLSDMDLRGADFSGAALEDA 576

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRS----------DLGGAIIEGADFSDAVIDLAQ---- 216
            +D   L++AN   AVL R  L+R+          +LGGA  E ADFS A +  A     
Sbjct: 577 NLDNAQLSDANFNGAVLARARLSRTSLASATFRNANLGGAHCEFADFSGADLSSANCEKT 636

Query: 217 KQALCKYAN 225
           + A C  AN
Sbjct: 637 RFASCSMAN 645



 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 22/144 (15%)

Query: 111 AQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A F  ADL  A   K  F           +  FT+++M   DF GS ++  +L K     
Sbjct: 621 ADFSGADLSSANCEKTRFASCSMANTVLDQTRFTASEMSHCDFRGSDWHQVFLTKLRMSG 680

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
             F GA     +     L +    NA LVR     SD   ++    DFSDA +D      
Sbjct: 681 MAFDGASFQQVVWLECTLADVRFANASLVRCSFVTSDCSRSV----DFSDARLD------ 730

Query: 220 LCKYANGTNPITGVSTRKSLG-CG 242
            C +A+G+     V  R +L  CG
Sbjct: 731 ACSFAHGSTLAGAVLRRAALKQCG 754



 Score = 44.7 bits (104), Expect = 0.044,   Method: Composition-based stats.
 Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 11/110 (10%)

Query: 110 AAQFGSADLRKAVHVKENFRAN-FTSADMRES-----DFSGSKFNGAYLEKAVAYKANFT 163
            +    A LR+A   +   R      AD+RE+     DFS     GA LE+ VA ++ F 
Sbjct: 737 GSTLAGAVLRRAALKQCGLRTTPLQQADLREARLDNCDFSECALQGAKLERLVAGESLFV 796

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            ADL+        L  ANL +A   + V  ++DL GA +   D S ++ID
Sbjct: 797 RADLTGA-----SLRGANLIDANFSKAVFVQADLSGANLFRTDVSQSLID 841



 Score = 37.4 bits (85), Expect = 8.3,   Method: Composition-based stats.
 Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 1/100 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L  A     NF  A    A +  +  + + F  A L  A    A+F+GADL
Sbjct: 569 SGAALEDANLDNAQLSDANFNGAVLARARLSRTSLASATFRNANLGGAHCEFADFSGADL 628

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           S    ++      ++ N VL +T  T S++      G+D+
Sbjct: 629 SSANCEKTRFASCSMANTVLDQTRFTASEMSHCDFRGSDW 668


>gi|359727541|ref|ZP_09266237.1| hypothetical protein Lwei2_11644 [Leptospira weilii str.
           2006001855]
          Length = 263

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 47/92 (51%), Gaps = 4/92 (4%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N +S  + +  F G  F+GA L  A    ++F GA+     +    LN A+L NA     
Sbjct: 151 NLSSIILEKLKFDGVNFSGANLGHAFLQNSSFVGANFEGAKLRGSFLNNADLRNANFRGA 210

Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
            L  + L GA +EGADF+DA+ D    L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242


>gi|378582929|ref|ZP_09831540.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
           DC283]
 gi|377814439|gb|EHT97579.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
           DC283]
          Length = 375

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 6/105 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 162
           S A   +ADL++A     N   A+ T+A++ ++D      +GA L  A   +AY  +A+ 
Sbjct: 250 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 309

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           + A+LS+  + R  L++ANL++A L    L R+DL  AI++GA+ 
Sbjct: 310 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 354



 Score = 43.5 bits (101), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGADLSDTLMDRMVLNEANLT 183
            AN T A + E+D S +  +GA L  A   +      N +GA+L+   +    L+EA+L+
Sbjct: 191 HANLTMAYLSEADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLS 250

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
           NA L    L R+DL  A + GAD ++A
Sbjct: 251 NANLSNADLKRADLSNANLSGADLTNA 277



 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM----------VLN 178
            AN T A + E+D S +  + A L++A    AN +GADL++  +++            L 
Sbjct: 236 HANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLA 295

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            ANLT A L    L+ ++L  A ++ AD SDA
Sbjct: 296 HANLTMAYLSEADLSNANLSNADLKRADLSDA 327



 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           +++  + S +   GAYL  A     N + ADLSD  +    L+ ANL +A L    L+ +
Sbjct: 148 NLKGVNLSDTDLKGAYLSDA-----NLSDADLSDANLSDANLSGANLAHANLTMAYLSEA 202

Query: 196 DLGGAIIEGADFSDA 210
           DL  A + GAD ++A
Sbjct: 203 DLSNANLSGADLTNA 217


>gi|300869593|ref|ZP_07114173.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300332371|emb|CBN59373.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 214

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           E   AN   +D+  ++ S +K N A L KA     N +G DL    M   +L EANL  A
Sbjct: 31  ELIGANLCESDITGANLSKAKLNRANLSKANLSNTNLSGTDLGGADMTEAILTEANLCRA 90

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY-ANGTNPITGVSTRKSL 239
            L+ T L+++DL  A +  A+F  A I    +  LC+   +G N + GV+ R+++
Sbjct: 91  DLIGTNLSKADLSRAFLTQANFIGANI---SRAILCQTDLHGVN-LYGVNLRRAI 141



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 51/102 (50%), Gaps = 9/102 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S    G AD+ +A+          T A++  +D  G+  + A L +A   +ANF GA++S
Sbjct: 68  SGTDLGGADMTEAI---------LTEANLCRADLIGTNLSKADLSRAFLTQANFIGANIS 118

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             ++ +  L+  NL    L R +LT +DL GA +   D S A
Sbjct: 119 RAILCQTDLHGVNLYGVNLRRAILTEADLIGANLTKVDLSGA 160



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLT 183
           RAN + A++  ++ SG+   GA + +A+  +AN       G +LS   + R  L +AN  
Sbjct: 54  RANLSKANLSNTNLSGTDLGGADMTEAILTEANLCRADLIGTNLSKADLSRAFLTQANFI 113

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            A + R +L ++DL G  + G +   A++
Sbjct: 114 GANISRAILCQTDLHGVNLYGVNLRRAIL 142



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 1/94 (1%)

Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
           ADL +A   + NF  AN + A + ++D  G    G  L +A+  +A+  GA+L+   +  
Sbjct: 100 ADLSRAFLTQANFIGANISRAILCQTDLHGVNLYGVNLRRAILTEADLIGANLTKVDLSG 159

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
             L  A+L  A L   +L+ +DL GA + GA+ +
Sbjct: 160 ADLMGASLIRADLTEAILSAADLTGANLLGANLT 193


>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 319

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 61/133 (45%), Gaps = 11/133 (8%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           Q   ADLR       +FR  + + A++RE DF+G+    AYL +A     NFT A+L   
Sbjct: 25  QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFTDANLEGA 84

Query: 171 LMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
            + ++ L +AN          LT A L +T    +   GA + GA  S A ++ A     
Sbjct: 85  SLIKIYLIKANCYQTNFSGAYLTGAYLTKTNFKEAKFHGAYLNGAKLSGAKLEDAYYDHQ 144

Query: 221 CKYANGTNPITGV 233
            ++    +P T +
Sbjct: 145 TRFDTSFDPKTAL 157


>gi|94263119|ref|ZP_01286937.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
 gi|93456490|gb|EAT06604.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
          Length = 355

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 43/82 (52%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + T  D+R+ +  G+ F GA L K +   AN  G D S   +   +L EA+L+ A   + 
Sbjct: 70  DLTMVDLRQLELPGASFKGARLHKTLLGGANLAGCDFSQARIFWSLLQEADLSRASFRQA 129

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
              RS L  A  E ADFS+AV+
Sbjct: 130 EFERSILQDANCEEADFSEAVL 151



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 11/104 (10%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTG 164
           ADL +A   +  F R+    A+  E+DFS           S+  G  L +A  +K   +G
Sbjct: 119 ADLSRASFRQAEFERSILQDANCEEADFSEAVLFKTILLNSRLKGINLRQAKMHKVLLSG 178

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            DL+      M   E N  NA L     +R+D+ G +  GAD S
Sbjct: 179 CDLAGQDFSDMRFREVNFANAKLGGADFSRADISGCVFTGADLS 222



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 10/96 (10%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV----------LNEA 180
           +F+    RE +F+ +K  GA   +A      FTGADLS + +  ++          L  A
Sbjct: 185 DFSDMRFREVNFANAKLGGADFSRADISGCVFTGADLSASRLSGVIARQSMFAGTNLQGA 244

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           +L  A LV+  L  S+L GA + GA+   A ++ A+
Sbjct: 245 DLEGAGLVQAYLGESNLEGASLVGANLESASLEKAR 280


>gi|451979948|ref|ZP_21928350.1| hypothetical protein NITGR_130030 [Nitrospina gracilis 3/211]
 gi|451762820|emb|CCQ89564.1| hypothetical protein NITGR_130030 [Nitrospina gracilis 3/211]
          Length = 360

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 50/114 (43%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAV 156
           +E  T  E  +  A    S          + F+A F  A +  +DFSG     A   +A 
Sbjct: 86  FEGSTLKETNLSEALLHNSNFTNTKFQNTDLFQAQFHDAILTNADFSGETIPNALFFRAN 145

Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
              +NFT + L D   D   L  A LTNA+L RT+   S  G A  E A+F ++
Sbjct: 146 LKHSNFTNSYLEDCQFDDADLTNAVLTNAILTRTIENLSSPGKAKFENANFKNS 199



 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 18/143 (12%)

Query: 96  KYEAETRGEFG---------IGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSK 146
           +Y++ T+ EF          + ++ Q     L  A +V    + +F+  D+R+ D     
Sbjct: 19  EYKSITQEEFDRLYEKHHNWLEASKQIKDTQLESANNV---LKPDFSYHDLRDIDLKDKN 75

Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
                L+KA     NF G+ L +T +   +L+ +N TN     T L ++    AI+  AD
Sbjct: 76  -----LQKANLKNCNFEGSTLKETNLSEALLHNSNFTNTKFQNTDLFQAQFHDAILTNAD 130

Query: 207 FSDAVIDLAQ-KQALCKYANGTN 228
           FS   I  A   +A  K++N TN
Sbjct: 131 FSGETIPNALFFRANLKHSNFTN 153



 Score = 37.4 bits (85), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 32/155 (20%), Positives = 68/155 (43%), Gaps = 18/155 (11%)

Query: 74  TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANF 132
           T L+ A++ + +   +   + + ++A+      I + A F    +  A+  + N + +NF
Sbjct: 94  TNLSEALLHNSNFTNTKFQNTDLFQAQFHD--AILTNADFSGETIPNALFFRANLKHSNF 151

Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYK---------------ANFTGADLSDTLMDRMVL 177
           T++ + +  F  +    A L  A+  +               ANF  ++L++  +    L
Sbjct: 152 TNSYLEDCQFDDADLTNAVLTNAILTRTIENLSSPGKAKFENANFKNSNLNNATLSSSDL 211

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
             AN  N+ +VR  L  ++  G    GAD ++A+ 
Sbjct: 212 TNANFQNSTMVRVKLENTNTAGTHFGGADITNALF 246


>gi|77404498|ref|YP_345074.1| hypothetical protein pREC1_0013 [Rhodococcus erythropolis PR4]
 gi|77019879|dbj|BAE46254.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
          Length = 589

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 9/102 (8%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A  G ADLR A         N   AD++ ++ SG+    A L+ A+  +A+ TGA+L+D 
Sbjct: 228 ASLGFADLRAA---------NLQGADLQTAELSGATLRLANLKGAILREADLTGANLTDA 278

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +    L EA L  A+LV   L   DL    +E A+ S A +
Sbjct: 279 TLTEADLAEAKLQGAILVNVNLQNFDLSRLDLEKANLSGATL 320



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 10/141 (7%)

Query: 95  NKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
           N  EA   G +  G+A   A    A L KA   K     A   +AD++E+   G+    A
Sbjct: 349 NLAEANLTGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATLEGADLEDA 408

Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            LE A   KAN   A LS        L EA+LT AVL+   LT +   GA + GAD +DA
Sbjct: 409 DLESAKLSKANLRLAILSGA-----TLPEADLTGAVLIGANLTNTTFSGANLSGADLTDA 463

Query: 211 VIDLAQ-KQALCKYANGTNPI 230
            + +A  ++A    AN T  +
Sbjct: 464 DLSVADLEEADLTEANLTGAV 484



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 16/121 (13%)

Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKA-- 155
           + A    A L+ A+ V  N            +AN + A + E+D   +   GA LE+A  
Sbjct: 281 TEADLAEAKLQGAILVNVNLQNFDLSRLDLEKANLSGATLFEADLRSATLTGANLERANL 340

Query: 156 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                ++AN   A+L+   M    L EA LT+A L +  L ++ L GA++  AD  +A +
Sbjct: 341 AHAKLFEANLAEANLTGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATL 400

Query: 213 D 213
           +
Sbjct: 401 E 401



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 16/98 (16%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    A+L  AV +  N   AN T AD+ +++ S +            Y AN T A+LSD
Sbjct: 473 ADLTEANLTGAVLIGANLAHANLTDADLSKANLSDADL----------YSANLTDANLSD 522

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
                  L+ A LT A L+ T+LTR DL GA++ G D 
Sbjct: 523 A-----DLSGATLTRAGLMGTILTRVDLTGAVLTGLDL 555



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 9/144 (6%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQFGSADLRKA-VHVK 125
            F    L+ A +     +++ L + +  EA   G   IG+    A    ADL KA +   
Sbjct: 449 TFSGANLSGADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           + + AN T A++ ++D SG+    A L   +  + + TGA L+      + L   NLT+ 
Sbjct: 509 DLYSANLTDANLSDADLSGATLTRAGLMGTILTRVDLTGAVLTG-----LDLVGVNLTDV 563

Query: 186 VLVRTVLTRSDLGGAIIEGADFSD 209
            L    +   DL GAI+ G D S+
Sbjct: 564 NLDNVNMDDVDLSGAILPGTDTSE 587



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 51/111 (45%), Gaps = 16/111 (14%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 164
           A    ADL  A   K N R A  + A + E+D +G+   GA L         F+G     
Sbjct: 403 ADLEDADLESAKLSKANLRLAILSGATLPEADLTGAVLIGANLTNTT-----FSGANLSG 457

Query: 165 -----ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
                ADLS   ++   L EANLT AVL+   L  ++L  A +  A+ SDA
Sbjct: 458 ADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508


>gi|428211575|ref|YP_007084719.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|427999956|gb|AFY80799.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 514

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 13/120 (10%)

Query: 109 SAAQFGSADLRKAVHVKEN-------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
           S A  GSA L +A H+++        FRAN   A++ +++  G+  N A LE A   +AN
Sbjct: 101 SNATLGSATLEQA-HLEKAIFNGATLFRANLHQANLEKAELLGANLNSANLELANLKEAN 159

Query: 162 FTGADLSD-TL----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
              ADL D TL    +++  L  ANL NA L    L R +L  A +E A+ S   ++ A+
Sbjct: 160 LENADLQDATLPLANLEKANLKNANLKNANLSGANLKRVNLENANLESANLSSTNLEEAK 219



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 54/184 (29%), Positives = 82/184 (44%), Gaps = 21/184 (11%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADL------NKYEAETRGEFGIGSAAQFGSADLRKA 121
           W +F    L   V A+  S+++ L +       N  EA   G       AQ   AD+  +
Sbjct: 21  WSLFCLIFLPNPVFAARGSDVAKLEETGQCTRCNLQEANLMG-------AQLQGADMSDS 73

Query: 122 VHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
                N R      ANF+ + M ++D S +    A LE+A   KA F GA L    + + 
Sbjct: 74  NLRLANLRGAKLDGANFSRSRMFQADLSNATLGSATLEQAHLEKAIFNGATLFRANLHQA 133

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTNP-ITGV 233
            L +A L  A L    L  ++L  A +E AD  DA + LA  ++A  K AN  N  ++G 
Sbjct: 134 NLEKAELLGANLNSANLELANLKEANLENADLQDATLPLANLEKANLKNANLKNANLSGA 193

Query: 234 STRK 237
           + ++
Sbjct: 194 NLKR 197



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            T A++ E++ SG    GA L       AN  GADLS   ++   L  ANL NA L    
Sbjct: 412 LTEANLVEANLSGINLKGARL-----ANANLQGADLSLANLETAHLFGANLQNANLSGAN 466

Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQ 216
           LT ++LGGA + GA+     ++L+Q
Sbjct: 467 LTGANLGGANLTGANLEG--VNLSQ 489



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 15/94 (15%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTGADLSDTLMDRMVLNEANLTNA- 185
            + A++   + +G    GA LE    +     +AN TG +LS + +    L  ANL  A 
Sbjct: 261 LSGANLEAMNLTGINLEGANLEGTSLFMSNLERANLTGVNLSQSYLHYTDLTSANLVGAN 320

Query: 186 ---------VLVRTVLTRSDLGGAIIEGADFSDA 210
                    +L+ T LTR+DL  A  +GA+  D+
Sbjct: 321 LHRADLRHSILLGTDLTRADLSHANFKGANLQDS 354


>gi|167848210|ref|ZP_02473718.1| pentapeptide repeat protein [Burkholderia pseudomallei B7210]
          Length = 333

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 20  ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 74

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 75  ARLTAANLSLAHCERTDFS 93


>gi|434392917|ref|YP_007127864.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428264758|gb|AFZ30704.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 313

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 57/108 (52%), Gaps = 6/108 (5%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S      A+L KA+  + N        AN T AD+ +++ SG + + A L +AV   A+ 
Sbjct: 93  SGVNLWRANLNKAILCEANLSRANLDEANLTGADLSKANLSGIQLSKANLTEAVIVDAHL 152

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             A+L++T + R  L    L  A L+ + LT +DL  A +EGA+ S+A
Sbjct: 153 NRANLTETKLMRSHLCGTQLERAELIASDLTAADLSRANLEGANLSEA 200



 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 52/100 (52%), Gaps = 9/100 (9%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A   + DL +++ V     A+     ++++  + ++FNG++L       AN TGADLS  
Sbjct: 40  AILEATDLSRSILVG----ADLNGVILKQATMTATRFNGSHLVGVDLTAANLTGADLSGV 95

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            + R     ANL  A+L    L+R++L  A + GAD S A
Sbjct: 96  NLWR-----ANLNKAILCEANLSRANLDEANLTGADLSKA 130



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 16/116 (13%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA--VAYKA---NFTG 164
           A+  ++DL  A   + N   AN + A++ +++ SG+   G  L +A  +A KA   N  G
Sbjct: 175 AELIASDLTAADLSRANLEGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLRG 234

Query: 165 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           A+L    +    L EA          NL+ A L R +LT  +L  AI+ GA+  DA
Sbjct: 235 ANLEQAELITTNLTEADLSWANLSKTNLSGADLHRAILTDVNLNSAILRGANLIDA 290



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 16/113 (14%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR--MV--------LN 178
           RA   ++D+  +D S +   GA L +A   +AN +GA+L+   + R  ++        L 
Sbjct: 174 RAELIASDLTAADLSRANLEGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLR 233

Query: 179 EANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI-DLAQKQALCKYAN 225
            ANL  A L+ T LT +DL  A      + GAD   A++ D+    A+ + AN
Sbjct: 234 GANLEQAELITTNLTEADLSWANLSKTNLSGADLHRAILTDVNLNSAILRGAN 286


>gi|162456757|ref|YP_001619124.1| pentapeptide repeat-containing protein [Sorangium cellulosum So
           ce56]
 gi|161167339|emb|CAN98644.1| pentapeptide repeats hypothetical protein [Sorangium cellulosum So
           ce56]
          Length = 895

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 12/111 (10%)

Query: 108 GSAAQFGSADLRKAV--HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G    F SA L++ V  H      A+F+ ADM  ++  G+   GA L++A    A+ +G 
Sbjct: 747 GERVSFRSACLQQGVVVHGSSFPEADFSDADMERANLRGTVLAGARLDRANLRGADLSGC 806

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           D S          EA+L  AVL   +L R+DL  A ++GA+  DA+   A+
Sbjct: 807 DAS----------EASLERAVLQGGLLIRTDLVNASLQGANLMDALASKAR 847



 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 12/118 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGS-----KFNGAYLEKAVA-YKAN 161
           S  +F  ADL +A  V+     A+F+SA +R++ F         F  A L++ V  + ++
Sbjct: 708 SGVRFTGADLSEANLVESTLDGADFSSATLRKTTFVACHGERVSFRSACLQQGVVVHGSS 767

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
           F  AD SD  M+R     ANL   VL    L R++L GA + G D S+A ++ A  Q 
Sbjct: 768 FPEADFSDADMER-----ANLRGTVLAGARLDRANLRGADLSGCDASEASLERAVLQG 820



 Score = 47.0 bits (110), Expect = 0.010,   Method: Composition-based stats.
 Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 10/91 (10%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +FT A++     SG   +GA+LE A     + +G DLS T ++  VL  ANL  A L   
Sbjct: 581 DFTGANLAGMCLSGVDLSGAFLESA-----DLSGCDLSRTNLEGAVLARANLAGANLADA 635

Query: 191 VLTRSDLGGAIIEG-----ADFSDAVIDLAQ 216
            L  ++LGGA + G     AD  +AV+  A+
Sbjct: 636 RLRGANLGGAALRGASLDRADLKEAVLSRAE 666



 Score = 45.4 bits (106), Expect = 0.024,   Method: Composition-based stats.
 Identities = 55/173 (31%), Positives = 81/173 (46%), Gaps = 37/173 (21%)

Query: 74  TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGS----ADLRKAVHVKENF- 128
           T L  AV+A  +     LA  N  +A  RG   +G AA  G+    ADL++AV  +    
Sbjct: 615 TNLEGAVLARAN-----LAGANLADARLRGA-NLGGAALRGASLDRADLKEAVLSRAELE 668

Query: 129 RANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLN 178
           RA F+ AD+  +D+      G+ F GA L +    K +     FTGADLS+  +    L+
Sbjct: 669 RARFSGADLTGADWFETKPGGADFTGATLGQCNLLKVDLSGVRFTGADLSEANLVESTLD 728

Query: 179 EANLTNAVLVRTVLT---------RSDL--GGAIIEG-----ADFSDAVIDLA 215
            A+ ++A L +T            RS     G ++ G     ADFSDA ++ A
Sbjct: 729 GADFSSATLRKTTFVACHGERVSFRSACLQQGVVVHGSSFPEADFSDADMERA 781



 Score = 42.0 bits (97), Expect = 0.33,   Method: Composition-based stats.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 10/87 (11%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT----------LMDRMVLNEA 180
           + + A +  +D SG   +   LE AV  +AN  GA+L+D            +    L+ A
Sbjct: 596 DLSGAFLESADLSGCDLSRTNLEGAVLARANLAGANLADARLRGANLGGAALRGASLDRA 655

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +L  AVL R  L R+   GA + GAD+
Sbjct: 656 DLKEAVLSRAELERARFSGADLTGADW 682


>gi|427723149|ref|YP_007070426.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427354869|gb|AFY37592.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 508

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 6/106 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTS------ADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S A+     LR+A     N R    S      AD+ +++  G+   GAYL  A  Y AN 
Sbjct: 67  SGAKLSKVHLRQAYLYGTNLRRTHLSEAFLFKADLSKTNLYGAYLYGAYLYGANLYGANL 126

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           + ADLS+  +    L+EA+L+ A L    L+ +DL G  + G + S
Sbjct: 127 SKADLSEADLSEADLSEADLSEADLSGVSLSEADLSGVNLSGVNLS 172



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 43/81 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + AD+  +D SG+K +  +L +A  Y  N     LS+  + +  L++ NL  A L  
Sbjct: 54  ADLSGADLSGADLSGAKLSKVHLRQAYLYGTNLRRTHLSEAFLFKADLSKTNLYGAYLYG 113

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L  ++L GA +  AD S+A
Sbjct: 114 AYLYGANLYGANLSKADLSEA 134



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 44/84 (52%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            A+ + AD+ E+D S +  +G  L +A     N +G +LS   +  + L+  NL+ A L 
Sbjct: 133 EADLSEADLSEADLSEADLSGVSLSEADLSGVNLSGVNLSGVNLSGVNLSGVNLSGAKLC 192

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
            T+   S L GA ++ AD + A I
Sbjct: 193 HTLCKLSTLVGASLKSADLTGACI 216



 Score = 37.4 bits (85), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 37/73 (50%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
           D+R + FSG+  +G  L       A+ +GADLS   +    L++ +L  A L  T L R+
Sbjct: 30  DLRRAQFSGAHLSGVNLSGVNLSGADLSGADLSGADLSGAKLSKVHLRQAYLYGTNLRRT 89

Query: 196 DLGGAIIEGADFS 208
            L  A +  AD S
Sbjct: 90  HLSEAFLFKADLS 102


>gi|337746223|ref|YP_004640385.1| hypothetical protein KNP414_01954 [Paenibacillus mucilaginosus
           KNP414]
 gi|336297412|gb|AEI40515.1| Uncharacterized low-complexity protein-like protein [Paenibacillus
           mucilaginosus KNP414]
          Length = 289

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  FT + +  SDFSG+   G+  + +   +ANF GA+L+D  +  + L  A+    +LV
Sbjct: 101 KGQFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSTLDLANASFHKTILV 160

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
           RT  ++S L GA   G   +D  + +
Sbjct: 161 RTNFSKSGLDGAQFTGVRLTDVTLTM 186


>gi|425454308|ref|ZP_18834054.1| Pentapeptide repeat protein [Microcystis aeruginosa PCC 9807]
 gi|389805079|emb|CCI15409.1| Pentapeptide repeat protein [Microcystis aeruginosa PCC 9807]
          Length = 222

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 16/102 (15%)

Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT---------------N 184
           ++ SG+  +GA LE+A   KAN TGA+LS   +  + L+EA+LT                
Sbjct: 68  ANLSGANLSGALLEEAKLGKANLTGANLSKADLSAITLSEADLTEADLSEAVLSNALMDQ 127

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
           A+LV   L  +DL  AII  A+ S+AV + AQ K A+   +N
Sbjct: 128 AILVDATLIGADLESAIISKANLSNAVANKAQFKNAILSESN 169



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A+ G A+L  A   K +  A   + AD+ E+D S +  + A +++A+   A   GADL  
Sbjct: 83  AKLGKANLTGANLSKADLSAITLSEADLTEADLSEAVLSNALMDQAILVDATLIGADL-- 140

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
              +  ++++ANL+NAV  +     + L  + + G DFS A +
Sbjct: 141 ---ESAIISKANLSNAVANKAQFKNAILSESNLSGTDFSQATM 180


>gi|440754482|ref|ZP_20933684.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|440174688|gb|ELP54057.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 469

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 54/101 (53%)

Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
           +G+   R  ++    +RAN   A+++ ++  G+  NGA L  A   +A   GA L+   +
Sbjct: 294 YGAYLYRANLYRANLYRANLKGANLKGANLKGANLNGANLILANLNRAYLNGAILNRANL 353

Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           +  +LN ANL  A L    L +++L GA + GAD + A ++
Sbjct: 354 NGAILNRANLNGAYLNGAYLIQANLNGADLNGADLNRANLN 394



 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 10/95 (10%)

Query: 131 NFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTGADLSDTLMDRMVLNEA 180
           + +  ++RE++ +G+  NGA          YL +A  Y+AN   A+L    +    L  A
Sbjct: 267 DLSRTNLREANLNGANLNGAQLYRANLYGAYLYRANLYRANLYRANLKGANLKGANLKGA 326

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
           NL  A L+   L R+ L GAI+  A+ + A+++ A
Sbjct: 327 NLNGANLILANLNRAYLNGAILNRANLNGAILNRA 361



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           RAN   A +  ++ +G+  NGAYL      +AN  GADL+   ++R  LN ANL  A L
Sbjct: 350 RANLNGAILNRANLNGAYLNGAYL-----IQANLNGADLNGADLNRANLNGANLNGANL 403


>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 266

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 54/109 (49%), Gaps = 4/109 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    ADLR+A    +  RAN +  D+  +D   +   G  L  A   KA+ + A+LS+ 
Sbjct: 153 ADLNDADLREA----QLIRANLSEVDLSGADLRAANLKGVNLRGADLNKADLSRANLSEA 208

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
            +    LNEANL+ A L    L  ++L    + GA F + ++D   K++
Sbjct: 209 YLYLANLNEANLSRADLSEANLHEANLSRVDLRGAIFCETIMDDGHKES 257



 Score = 40.4 bits (93), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 42/77 (54%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N +  D+  ++ SG+  +GA L +A   + N +  +L+   ++   LN+A+L  A L+R 
Sbjct: 109 NLSGVDLSGANLSGADLSGADLSEADLSRVNLSRVNLNGANLNDADLNDADLREAQLIRA 168

Query: 191 VLTRSDLGGAIIEGADF 207
            L+  DL GA +  A+ 
Sbjct: 169 NLSEVDLSGADLRAANL 185



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 10/80 (12%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           ++E D S S  +G  L       AN +GADLS           A+L+ A L R  L+R +
Sbjct: 95  LKEFDLSQSNLSGVNLSGVDLSGANLSGADLSG----------ADLSEADLSRVNLSRVN 144

Query: 197 LGGAIIEGADFSDAVIDLAQ 216
           L GA +  AD +DA +  AQ
Sbjct: 145 LNGANLNDADLNDADLREAQ 164


>gi|158313419|ref|YP_001505927.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
 gi|158108824|gb|ABW11021.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
          Length = 299

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 53/118 (44%), Gaps = 10/118 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A     DLR A             AD+R++D S +   GA L  A+   A  TGADL 
Sbjct: 106 SGADLRGTDLRDAC---------LRGADLRDADLSQAALGGADLAGALLAGAFLTGADLH 156

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
            T +    L+ A+L  A L R  L  +D  G I+ GAD   A   D   +QA  + A+
Sbjct: 157 GTDLHGAFLHNADLRKAFLARADLRGADADGIIMRGADLRAADATDAVLRQADLRAAD 214



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A  G ADL  A+       A  T AD+  +D  G+  + A L KA   +A+  GAD  
Sbjct: 131 SQAALGGADLAGALLAG----AFLTGADLHGTDLHGAFLHNADLRKAFLARADLRGADAD 186

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDA 210
             +M    L  A+ T+AVL +  L  +D     L GAI+ G D   A
Sbjct: 187 GIIMRGADLRAADATDAVLRQADLRAADLRGIRLAGAILRGVDLRGA 233



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 1/99 (1%)

Query: 108 GSAAQFGS-ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
           G+ A+  S  DL  A+  +         AD+  +D +G    G  L  A  + A  +GAD
Sbjct: 50  GAPARLSSLGDLLAALRGRPRTGGYAAGADLTGADLAGVCLTGRILRGAQLHGAYLSGAD 109

Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
           L  T +    L  A+L +A L +  L  +DL GA++ GA
Sbjct: 110 LRGTDLRDACLRGADLRDADLSQAALGGADLAGALLAGA 148



 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 36/78 (46%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+     +G    GA L  A    A+  G DL D  +    L +A+L+ A L  
Sbjct: 78  ADLTGADLAGVCLTGRILRGAQLHGAYLSGADLRGTDLRDACLRGADLRDADLSQAALGG 137

Query: 190 TVLTRSDLGGAIIEGADF 207
             L  + L GA + GAD 
Sbjct: 138 ADLAGALLAGAFLTGADL 155


>gi|448242763|ref|YP_007406816.1| hypothetical protein SMWW4_v1c30030 [Serratia marcescens WW4]
 gi|445213127|gb|AGE18797.1| hypothetical protein SMWW4_v1c30030 [Serratia marcescens WW4]
          Length = 850

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 5/130 (3%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNG 149
           A +D  + E+E         +AQ  S  LR    + +  R    +A +R+ D SG    G
Sbjct: 482 AFSDKQRGESERALHQMYLMSAQAQSPALRLRGDLAQIIRQRVAAAMLRDKDLSGLDLTG 541

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A L      +AN  GA     L++   L +  L    L   +L R+DL GA+++ AD S 
Sbjct: 542 ADLSGMDLCQANLRGA-----LLENANLRQTQLVGCDLREAMLARADLSGAVLQQADLSH 596

Query: 210 AVIDLAQKQA 219
           A + LA+ +A
Sbjct: 597 ASLALAKCEA 606



 Score = 42.0 bits (97), Expect = 0.33,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 12/90 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RANF+ A +   D S ++ N          +A+F  A+ S +L  R  L++ANL +A  +
Sbjct: 747 RANFSRARLDNCDLSEARLN----------EADFRQANGSGSLFIRCDLSKANLRDANFI 796

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             +L +  L GA ++G +   A  DL+Q Q
Sbjct: 797 AAILQKCVLSGADLQGTNLFRA--DLSQSQ 824



 Score = 39.3 bits (90), Expect = 2.1,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + T AD+   D   +   GA LE A   +    G DL + ++ R     A+L+ AVL + 
Sbjct: 538 DLTGADLSGMDLCQANLRGALLENANLRQTQLVGCDLREAMLAR-----ADLSGAVLQQA 592

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L+ + L  A  E  DF  A
Sbjct: 593 DLSHASLALAKCEATDFGGA 612


>gi|432333149|ref|ZP_19584958.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis
           IFP 2016]
 gi|430779982|gb|ELB95096.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis
           IFP 2016]
          Length = 220

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 51/108 (47%), Gaps = 6/108 (5%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   +ADLR         R A+ TS +M E D SG+    A L  A    AN   ADL+D
Sbjct: 34  ANLRNADLRLGFLRDATLRNADLTSCNMYEVDLSGANLYLAQLSGAHMTGANLNNADLTD 93

Query: 170 TLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           T + +      +L E  L  A L R  L  +DL GA + G D SDA +
Sbjct: 94  TKLIKTQLSGAMLIEVELDGADLSRAFLQNADLTGAHLRGTDLSDATL 141



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 73/162 (45%), Gaps = 19/162 (11%)

Query: 79  AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR 138
           A + SC+     L+  N Y A+  G    G  A   +ADL     +K       + A + 
Sbjct: 54  ADLTSCNMYEVDLSGANLYLAQLSGAHMTG--ANLNNADLTDTKLIK----TQLSGAMLI 107

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLT 193
           E +  G+  + A+L+ A    A+  G DLSD  +   + M  N  EA L +A L    LT
Sbjct: 108 EVELDGADLSRAFLQNADLTGAHLRGTDLSDATLVGAELMATNLAEAELVDADLTDADLT 167

Query: 194 RSDLGGAIIEGA-----DFSDAVI---DLAQKQALCKYANGT 227
            +DL GA + GA     DF+DA +   DL   Q   +Y + T
Sbjct: 168 FADLTGADLRGANLTRTDFTDADLTGADLGTTQDKARYDDTT 209


>gi|20090742|ref|NP_616817.1| hypothetical protein MA1892 [Methanosarcina acetivorans C2A]
 gi|19915798|gb|AAM05297.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans
           C2A]
          Length = 560

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 62/122 (50%), Gaps = 12/122 (9%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 164
           A    A+LR     K N R  + + AD+RE+D SG   +GA L  A        +AN  G
Sbjct: 389 ANLSGANLRGTNLSKANLREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNG 448

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKY 223
           ADL+   + R  LNEANL+     +T L  +DL  A + GA  S+A +  A+ K A  + 
Sbjct: 449 ADLNGIDLRRANLNEANLS-----KTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRK 503

Query: 224 AN 225
           AN
Sbjct: 504 AN 505



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 44/79 (55%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N   AD+ ESD   +  + A+L +A   KAN + A+LS+  +    +  ANL+ A L + 
Sbjct: 279 NLIGADLSESDLRDAFLHEAHLNEADLSKANLSKANLSEADLKGAYMRRANLSEANLSKA 338

Query: 191 VLTRSDLGGAIIEGADFSD 209
            L+  DL GA + GAD ++
Sbjct: 339 KLSGVDLSGANLSGADLNE 357



 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 12/114 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR--ANFTSADMRESD----------FSGSKFNGAYLEKAV 156
           S A    ADL +    K  +   AN + AD+ E+D           SG+   G  L KA 
Sbjct: 346 SGANLSGADLNEFYLNKATYTRGANLSEADLSEADLSEANLKGANLSGANLRGTNLSKAN 405

Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             + + +GADL +  +  + L+ ANL+ A L    L+R++L GA + G D   A
Sbjct: 406 LREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNGADLNGIDLRRA 459



 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 9/102 (8%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----A 165
           A     DLR+A ++ E   AN +  ++ E+D S +K +GAYL +A    A   G     A
Sbjct: 449 ADLNGIDLRRA-NLNE---ANLSKTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRKA 504

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
           +LS+  ++   L EANL+ A L    L+  DL GA + G + 
Sbjct: 505 NLSEADLNGADLREANLSEANLNGVDLSVIDLRGANLNGVNI 546



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + AD+   D S +  NGA L      +AN   A+LS T ++   L++A L+ A L  
Sbjct: 429 ANLSGADLSGVDLSRANLNGADLNGIDLRRANLNEANLSKTNLNEADLSKAKLSGAYLSE 488

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L  + L GA +  A+ S+A
Sbjct: 489 AKLKGAKLKGAYMRKANLSEA 509


>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 346

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 74/151 (49%), Gaps = 3/151 (1%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVK 125
           NW       L+ A +A+   + + L+  N   A+    + IG+     S DLR+A + + 
Sbjct: 95  NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
              +A+ T A++R++D +G+K   + L  A    AN TGA+L    + +  LN ANLT A
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTGANLKQANLSQAHLNWANLTKA 212

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            L    L  ++L  A +   D ++  +  AQ
Sbjct: 213 DLREANLCGANLSKANLSQTDLTEVCLKDAQ 243



 Score = 44.7 bits (104), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 44/84 (52%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           R +   AD+ E++ SG    GA L KA    AN + A+LS   +    L  ANLT A L 
Sbjct: 31  RLSLAKADLSEANLSGVYLGGASLTKANLSGANLSRANLSGASLSGANLTGANLTGANLA 90

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              L  +DL GA + GA+ ++A +
Sbjct: 91  GAHLNWADLSGANLSGANLANADV 114



 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 16/118 (13%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRES----------DFSGSKFNGAYLEKAVAYK 159
           A    ADLR+A     N  +AN +  D+ E           +FSG+   G  L   +   
Sbjct: 207 ANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSGINFSGANLTGVDLSNKLLTG 266

Query: 160 ANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           AN +GA+     LS   + +  L EANL+ A L+ + L  +DL  A + GA+ S A +
Sbjct: 267 ANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMDADLTKANLSGANLSQANV 324



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 21/121 (17%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA----VAYK------ 159
           A    A+L++A   + +   AN T AD+RE++  G+  + A L +     V  K      
Sbjct: 187 ANLTGANLKQANLSQAHLNWANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSG 246

Query: 160 -----ANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
                AN TG DLS+ L     +    L+ ANL+ A L++T L  ++L  A + G+   D
Sbjct: 247 INFSGANLTGVDLSNKLLTGANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMD 306

Query: 210 A 210
           A
Sbjct: 307 A 307



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
           F    + ++D S +  +G YL  A   KAN +GA+LS     R  L+ A+L+ A L    
Sbjct: 29  FNRLSLAKADLSEANLSGVYLGGASLTKANLSGANLS-----RANLSGASLSGANLTGAN 83

Query: 192 LTRSDLGGAIIEGADFSDA 210
           LT ++L GA +  AD S A
Sbjct: 84  LTGANLAGAHLNWADLSGA 102


>gi|443314210|ref|ZP_21043788.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442786182|gb|ELR95944.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 516

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 2/124 (1%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           E  + S A    A+LR A  +  +   AN   A++R +D SG+    A L  A    A  
Sbjct: 174 EDTVLSGAVLQRAELRHATLMGADLSGANLRGANLRWADLSGANLQEADLTDAKLSGATL 233

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 221
            GADLS   +   +L   +L+   L R     SDL GA + GA  + AV  DL   +  C
Sbjct: 234 VGADLSGATLVNTILVHTDLSRTRLQRVYCVDSDLSGATLNGAFLAGAVCYDLVTAETTC 293

Query: 222 KYAN 225
            + +
Sbjct: 294 DWVD 297



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 50/92 (54%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRMVLN 178
           +AN + AD+RE+    ++ +GA L ++   K+NF GA+L           DT++   VL 
Sbjct: 125 QANLSEADLREARLRWARLSGANLSQSDLRKSNFLGANLEGAQLYAAQMEDTVLSGAVLQ 184

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            A L +A L+   L+ ++L GA +  AD S A
Sbjct: 185 RAELRHATLMGADLSGANLRGANLRWADLSGA 216



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR+A +       AN + +D+R+S+F G+   GA L  A       +GA L  
Sbjct: 126 ANLSEADLREARLRWARLSGANLSQSDLRKSNFLGANLEGAQLYAAQMEDTVLSGAVLQR 185

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             +    L  A+L+ A L    L  +DL GA ++ AD +DA
Sbjct: 186 AELRHATLMGADLSGANLRGANLRWADLSGANLQEADLTDA 226


>gi|337745078|ref|YP_004639240.1| hypothetical protein KNP414_00780 [Paenibacillus mucilaginosus
           KNP414]
 gi|336296267|gb|AEI39370.1| Uncharacterized low-complexity protein-like protein [Paenibacillus
           mucilaginosus KNP414]
          Length = 289

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  FT + +  SDFSG+   G+  + +   +ANF GA+L+D  +  + L  A+    +LV
Sbjct: 101 KGQFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSTLDLANASFHKTILV 160

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
           RT  ++S L GA   G   +D  + +
Sbjct: 161 RTNFSKSGLDGAQFTGVRLTDVTLTM 186


>gi|425452817|ref|ZP_18832632.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 7941]
 gi|425459927|ref|ZP_18839413.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9808]
 gi|440756386|ref|ZP_20935587.1| tetratricopeptide repeat family protein [Microcystis aeruginosa
           TAIHU98]
 gi|389765245|emb|CCI08832.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 7941]
 gi|389827515|emb|CCI21150.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9808]
 gi|440173608|gb|ELP53066.1| tetratricopeptide repeat family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 262

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 55/101 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L++ +  ++  + + + + + +S+ +G+K NGA L  A   +AN +GADLS   +     
Sbjct: 29  LQQLLSTRQCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 89  FGANLTGANLSGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+LS  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLDN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|46205596|ref|ZP_00048308.2| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
           magnetotacticum MS-1]
          Length = 195

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 48/89 (53%), Gaps = 5/89 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTN 184
           A+F+ A MR +    +  +GA  E A  +  +FTGAD  D+L  R  L+EA     NLT 
Sbjct: 16  ADFSGATMRFARLDKALLDGARFEGADLWGTDFTGADADDSLFRRARLDEANLSDCNLTG 75

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           A      L ++ L GA + GA+F+ A +D
Sbjct: 76  ADFEGASLKKARLVGARLRGANFTGARLD 104



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 42/85 (49%), Gaps = 5/85 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA    A++ + + +G+ F GA L+KA    A   GA+ +   +D   L+EA+ +   LV
Sbjct: 60  RARLDEANLSDCNLTGADFEGASLKKARLVGARLRGANFTGARLDGADLSEADFSRTSLV 119

Query: 189 RTVLT-----RSDLGGAIIEGADFS 208
           R  LT      +   GA +EG   S
Sbjct: 120 RLDLTACKLRHARFAGAWLEGVRLS 144


>gi|453064141|gb|EMF05113.1| putative low-complexity protein [Serratia marcescens VGH107]
          Length = 850

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 5/130 (3%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNG 149
           A +D  + E+E         +AQ  S  LR    + +  R    +A +R+ D SG    G
Sbjct: 482 AFSDKQRGESERALHQMYLMSAQAQSPALRLRGDLAQIIRQRVAAAMLRDKDLSGLDLTG 541

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
           A L      +AN  GA     L++   L +  L    L   +L R+DL GA+++ AD S 
Sbjct: 542 ADLSGMDLCQANLRGA-----LLESANLRQTQLVGCDLREAMLARADLSGAVLQQADLSH 596

Query: 210 AVIDLAQKQA 219
           A + LA+ +A
Sbjct: 597 ASLALAKCEA 606



 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 11/108 (10%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A FG A L +          N     ++ ++FS ++ +   L +A   +A+F  A+ + +
Sbjct: 728 ADFGDATLNQC---------NLRQMPLQRANFSRARLDNCDLSEARLNEADFRQANGNGS 778

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           L  R  L++ANL +A  +  +L +  L GA ++G +   A  DL+Q Q
Sbjct: 779 LFIRCDLSQANLRDANFIAAILQKCVLSGADLQGTNLFRA--DLSQSQ 824



 Score = 39.3 bits (90), Expect = 2.1,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + T AD+   D   +   GA LE A   +    G DL + ++ R     A+L+ AVL + 
Sbjct: 538 DLTGADLSGMDLCQANLRGALLESANLRQTQLVGCDLREAMLAR-----ADLSGAVLQQA 592

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L+ + L  A  E  DF  A
Sbjct: 593 DLSHASLALAKCEATDFGGA 612


>gi|428316016|ref|YP_007113898.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428239696|gb|AFZ05482.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 168

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 9/109 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           ++A    A L  AV   EN + N   A +   +  G+   GA LEKA  + A+ T ADLS
Sbjct: 55  TSANLNGAKLEGAVL--ENVKLN--EALLDSVNLKGANLKGASLEKAGLFSADLTKADLS 110

Query: 169 DT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +       +    LN ANL+NA L  T L  +DL GA ++GA+   A++
Sbjct: 111 NANLKGAFLRGAKLNNANLSNADLSETDLNIADLTGANLKGANLKGAIM 159


>gi|427734496|ref|YP_007054040.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369537|gb|AFY53493.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 116

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-- 189
            + AD+ E D SG+K   A L  A  Y AN +GA LS   +    L+ ANL+ A L +  
Sbjct: 1   MSGADLHEKDLSGAKLYRANLSGAKLYGANLSGASLSGADLSGSSLSAANLSGAYLQKAN 60

Query: 190 ---TVLTRSDLGGAIIEGADFSDAVI 212
                L ++DL  A + GAD  +AV+
Sbjct: 61  LSGAYLQKADLSKATLYGADLQNAVL 86



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 46/90 (51%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           + AN + A +  +D SGS  + A L  A   KAN +GA L    + +  L  A+L NAVL
Sbjct: 27  YGANLSGASLSGADLSGSSLSAANLSGAYLQKANLSGAYLQKADLSKATLYGADLQNAVL 86

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
               L  + L GA +EGA    A I+ A K
Sbjct: 87  FGANLEGAKLKGANLEGAKLKGANIEEAIK 116


>gi|425437233|ref|ZP_18817656.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9432]
 gi|389677805|emb|CCH93269.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9432]
          Length = 262

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 55/101 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L++ +  ++  + + + + + +S+ +G+K NGA L  A   +AN +GADLS   +     
Sbjct: 29  LQQLLSTRQCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 89  FGANLTGANLSGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129



 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+LS  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLDN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|448677922|ref|ZP_21689112.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
           DSM 12282]
 gi|445773597|gb|EMA24630.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
           DSM 12282]
          Length = 428

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 5/86 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN +SAD+RE+D SG+    A L  A   KA+ +GADLS   +    L  A+L++A L 
Sbjct: 70  KANLSSADLREADLSGADLGSADLSGANLQKADLSGADLSYANLSGADLENADLSSADLR 129

Query: 189 RTVLT-----RSDLGGAIIEGADFSD 209
           RT L+      +DL  A +   DFSD
Sbjct: 130 RTNLSGVKFVETDLADADLRNIDFSD 155



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 6/106 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   SADLR+       F   +   AD+R  DFS ++  G  L  A  +  + +GADL  
Sbjct: 121 ADLSSADLRRTNLSGVKFVETDLADADLRNIDFSDTELVGTDLSGADFFATDLSGADLRV 180

Query: 170 TLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 210
             M  + L EA+L+ A L  T L+      +DL GA + G D SDA
Sbjct: 181 ADMSNVNLREADLSGADLGGTDLSDANLREADLSGADLGGVDLSDA 226


>gi|427724651|ref|YP_007071928.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427356371|gb|AFY39094.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 281

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 66/116 (56%), Gaps = 10/116 (8%)

Query: 107 IGSAAQFGSADLRKA----VHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
           I + A    A+LR+A      ++ N  +AN  S+++ E++ + +K   + +  A   +A 
Sbjct: 46  IFTGATLDQANLREADLSYASLQGNLSQANLISSNLTEANLTAAKMAYSGMRAANLTRAK 105

Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 212
            T ADLS  +++  ++ EANL+ A LV     R  LT+++L GA ++GA+ + A++
Sbjct: 106 LTSADLSYCILNEAIMREANLSKATLVDAFIGRANLTQANLEGANLQGANLTSAIL 161



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 39/81 (48%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++  +   G+   GA L  A  +  N TG+   D  + +  LN ANLTN  L  
Sbjct: 149 ANLQGANLTSAILIGANLRGANLANATLHGINATGSTADDADLSKSKLNSANLTNVKLRG 208

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T L  + L    + GAD ++A
Sbjct: 209 TNLREAQLAWTTMRGADLTEA 229


>gi|386016243|ref|YP_005934529.1| hypothetical protein PAJ_1653 [Pantoea ananatis AJ13355]
 gi|327394311|dbj|BAK11733.1| hypothetical protein PAJ_1653 [Pantoea ananatis AJ13355]
          Length = 846

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 45/172 (26%), Positives = 75/172 (43%), Gaps = 17/172 (9%)

Query: 71  FVSTALAAAV-----VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           F+ + L  AV     + SCS  +   AD   +         + S +    AD   A   +
Sbjct: 675 FIKSTLEQAVFNRAELESCSW-VETQADHATFSGSIWLTCAVASGSSLNDADFTHATLRQ 733

Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
            N R    +A +    F+ +K + + L +A    ANF  A+L+ +L  R    +A+ T+A
Sbjct: 734 SNLRQTPLNAAV----FTQAKLDNSDLSEASCKGANFQQANLAGSLFVRTDFRDADFTDA 789

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
            L+  +L +S LGGA   G     A  DL+Q      + + T  + G  T++
Sbjct: 790 NLMGAILQKSQLGGACFRGTTLFRA--DLSQ-----AFTSETTELDGAFTKR 834



 Score = 37.7 bits (86), Expect = 5.3,   Method: Composition-based stats.
 Identities = 34/107 (31%), Positives = 45/107 (42%), Gaps = 11/107 (10%)

Query: 110 AAQFGSADL-RKAVHVKE----NF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
            A F SA L R  +H       NF RA+F  A   +SDFSGS F    L++ +     F 
Sbjct: 567 GANFNSAMLARTELHHSSLRNCNFERASFALAQCCQSDFSGSYFKDTQLQETLFDNCTFN 626

Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            A  S+ L       +     A+L   V    DL      G DF++A
Sbjct: 627 EATFSELLFRETWFTQCRFQRAILQACVFMELDL-----PGLDFTEA 668


>gi|428219102|ref|YP_007103567.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427990884|gb|AFY71139.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 698

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 1/101 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    A+L  A     NF +AN   A++R  + SG   +GA L  A    AN +GA+L
Sbjct: 67  TGANLTGANLTGANLTGANFSKANLRGANLRGVNLSGVNLSGANLSGANLSGANLSGANL 126

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
           S   + R+ L+ AN +NA L    L+  DL GA + GA+FS
Sbjct: 127 SGVNLSRVNLSGANFSNANLNNFDLSGFDLTGANLTGANFS 167



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 46/81 (56%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF++A++   D SG   +G  L  A    AN +GA+LS+  +  + L + NL+ A L R
Sbjct: 214 ANFSNANLNNFDLSGFDLSGVNLSGANLSGANLSGANLSEANLSEVDLYQINLSGANLSR 273

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             LT ++L GA   GA+ S A
Sbjct: 274 IDLTGANLSGANFSGANLSGA 294



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 43/81 (53%), Gaps = 5/81 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           ANF++A++   D SG    GA L  A     NF+G +LS   + R  L+ AN +NA L  
Sbjct: 139 ANFSNANLNNFDLSGFDLTGANLTGA-----NFSGVNLSGVNLSRANLSGANFSNANLNN 193

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+  DL G  + GA+ S A
Sbjct: 194 FDLSGFDLSGVNLSGANLSGA 214



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 4/100 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S      A+L +A +VK   RA    AD+R +D +G+   GA L  A    AN TGA+ S
Sbjct: 32  SYTNLNEANLSEA-YVK---RAYLRGADLRGADLTGANLTGANLTGANLTGANLTGANFS 87

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
              +    L   NL+   L    L+ ++L GA + GA+ S
Sbjct: 88  KANLRGANLRGVNLSGVNLSGANLSGANLSGANLSGANLS 127



 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++   D SG    G  L  A     N +GA+LS+  +  + L + NL+ A L R
Sbjct: 324 ANLSGANLNNFDLSGFDLRGINLSGADLGGTNLSGANLSEANLSEVDLYQINLSGANLSR 383

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             LT ++L GA +  A+ ++  +DL Q
Sbjct: 384 IDLTGANLTGANLSEANLNE--VDLYQ 408



 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 2/91 (2%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++ E D      +GA L +     AN TGA+LS+  ++ + L + NL+ A L +
Sbjct: 359 ANLSEANLSEVDLYQINLSGANLSRIDLTGANLTGANLSEANLNEVDLYQINLSGANLSK 418

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
                 DLGG  ++  + + A  +L + +AL
Sbjct: 419 VNFQGFDLGGFDLKNVNLTGA--NLREVKAL 447


>gi|300866166|ref|ZP_07110885.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335845|emb|CBN56045.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 351

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 42/81 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++  ++ S +   GA L +     AN +GADLS   + +  L E NL  A L  
Sbjct: 31  ANLGEANLNRTNLSNANLRGANLTRTKLIGANLSGADLSGANLSKAKLIEINLGGASLTG 90

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T+L   DL GA + GA FS A
Sbjct: 91  TILLGVDLSGANLSGAIFSQA 111



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 62/135 (45%), Gaps = 16/135 (11%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
           + A++V +C  N S L D N   A         S A     DL +A+            A
Sbjct: 119 IGASLVGACLLNGSKLVDANLSGATL-------SRATANGVDLSRAI---------LNRA 162

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
            + E D SG+  +GA L +A A + N +GA+L  + +    L EANL  A L    L  +
Sbjct: 163 ILSEVDLSGANLSGATLIRAYANRGNLSGANLHSSNLSEASLREANLCVANLSGAELQGT 222

Query: 196 DLGGAIIEGADFSDA 210
           DL GA + GA+ S A
Sbjct: 223 DLSGANLNGANLSGA 237



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 53/97 (54%), Gaps = 1/97 (1%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           AA  G A+L +      N R AN T   +  ++ SG+  +GA L KA   + N  GA L+
Sbjct: 30  AANLGEANLNRTNLSNANLRGANLTRTKLIGANLSGADLSGANLSKAKLIEINLGGASLT 89

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
            T++  + L+ ANL+ A+  +  L+++ L GA + GA
Sbjct: 90  GTILLGVDLSGANLSGAIFSQADLSKAVLIGASLVGA 126



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 1/114 (0%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L  A     N R AN   A + ++D   ++ N A L  A    AN +GA L
Sbjct: 225 SGANLNGANLSGADLQGANLRGANLNGASLHKADLRTAELNKANLRGANLSGANLSGASL 284

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
            +  +    LN ANL+ A L+ T L  +DL G  +  A+   A +++A     C
Sbjct: 285 LEADLRGANLNGANLSGAGLLLTSLAGADLTGTNLSEANLIGATLNVANLNEAC 338



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 41/93 (44%), Gaps = 11/93 (11%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    ADLR A   K N R      AN + A + E+D  G+  NGA L  A     +  G
Sbjct: 252 ASLHKADLRTAELNKANLRGANLSGANLSGASLLEADLRGANLNGANLSGAGLLLTSLAG 311

Query: 165 ADLSDTLMDRM-----VLNEANLTNAVLVRTVL 192
           ADL+ T +         LN ANL  A L   +L
Sbjct: 312 ADLTGTNLSEANLIGATLNVANLNEACLGGAIL 344



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 52/116 (44%), Gaps = 17/116 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY-----LEKAVAYKANF 162
           S A    A+L KA  ++ N   A+ T   +   D SG+  +GA      L KAV   A+ 
Sbjct: 64  SGADLSGANLSKAKLIEINLGGASLTGTILLGVDLSGANLSGAIFSQADLSKAVLIGASL 123

Query: 163 TG-----------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            G           A+LS   + R   N  +L+ A+L R +L+  DL GA + GA  
Sbjct: 124 VGACLLNGSKLVDANLSGATLSRATANGVDLSRAILNRAILSEVDLSGANLSGATL 179



 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 14/112 (12%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           RG     S A   S++L +A   + N   AN + A+++ +D SG+  NGA          
Sbjct: 186 RGNL---SGANLHSSNLSEASLREANLCVANLSGAELQGTDLSGANLNGA---------- 232

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           N +GADL    +    LN A+L  A L    L +++L GA + GA+ S A +
Sbjct: 233 NLSGADLQGANLRGANLNGASLHKADLRTAELNKANLRGANLSGANLSGASL 284


>gi|163798116|ref|ZP_02192053.1| hypothetical protein BAL199_09395 [alpha proteobacterium BAL199]
 gi|159176607|gb|EDP61184.1| hypothetical protein BAL199_09395 [alpha proteobacterium BAL199]
          Length = 1025

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 46/123 (37%), Positives = 59/123 (47%), Gaps = 21/123 (17%)

Query: 110 AAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
            A F +ADL  A     NFR      A FT A +  +DFS     GA L K  A  A FT
Sbjct: 733 GALFENADLTNA-----NFRGATLEDAVFTGAVLTGADFSDCAMRGANLSKVEAKGARFT 787

Query: 164 GADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGA-----IIEGADFSDAVID 213
            ++L+D  +    L EA+LT     NAV +   LTR+DL  A     I   A   +AV+D
Sbjct: 788 RSELTDAKLVAAKLVEADLTATTMENAVALNADLTRADLSKARFTKVIFMTATMDEAVLD 847

Query: 214 LAQ 216
            A+
Sbjct: 848 SAE 850



 Score = 42.0 bits (97), Expect = 0.29,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 39/80 (48%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           ++  A +  +DFSG    GA  E A    ANF GA L D +    VL  A+ ++  +   
Sbjct: 715 DWAGAVLANTDFSGRDLRGALFENADLTNANFRGATLEDAVFTGAVLTGADFSDCAMRGA 774

Query: 191 VLTRSDLGGAIIEGADFSDA 210
            L++ +  GA    ++ +DA
Sbjct: 775 NLSKVEAKGARFTRSELTDA 794



 Score = 38.9 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 35/83 (42%), Gaps = 5/83 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A +    +RE    G    G     AV    +F+G DL   L +   L  AN   A L  
Sbjct: 694 ARYLGQVVRECLAGGGDLTGRDWAGAVLANTDFSGRDLRGALFENADLTNANFRGATLED 753

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
            V T     GA++ GADFSD  +
Sbjct: 754 AVFT-----GAVLTGADFSDCAM 771


>gi|186680850|ref|YP_001864046.1| RDD domain-containing protein [Nostoc punctiforme PCC 73102]
 gi|186463302|gb|ACC79103.1| RDD domain containing protein [Nostoc punctiforme PCC 73102]
          Length = 717

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 65/137 (47%), Gaps = 34/137 (24%)

Query: 132 FTSADMRESDFSGSKFNGA--------Y------LEKAVAYKANFTGADLSDTLMDRM-- 175
           F SA++ ++ F GS+F GA        Y      L +A   +AN T A+LS  LM+R+  
Sbjct: 459 FKSANLNQASFKGSRFRGAGEDGRWDTYDDVIADLSQAQLQQANLTDANLSRVLMNRIDL 518

Query: 176 ---VLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALCK 222
               LN ANL+NA L       T L  +DL  A++E     GAD  DA ++ A       
Sbjct: 519 SRATLNRANLSNARLYDAKLNSTQLVGADLRNAVLERASLTGADLGDAKLNEAN-----L 573

Query: 223 YANGTNPITGVSTRKSL 239
           YA     +T + T+ S 
Sbjct: 574 YAARLGRVTAIGTQLSF 590



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
           A++  +D+ G+  +GAYL+     +AN + A+LS T +   VL  A + N  L    L+ 
Sbjct: 591 ANLTNTDWQGADLSGAYLD-----RANLSNANLSATRLAGAVLRSAQMENVNLQNADLSL 645

Query: 195 SDLGGAIIEGADFSDAVI 212
           +DL GA + GADF  A++
Sbjct: 646 ADLRGANVAGADFKGAIL 663



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 10/92 (10%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN ++A + ++  + ++  GA L  AV  +A+ TGADL D       LNEANL  A L 
Sbjct: 525 RANLSNARLYDAKLNSTQLVGADLRNAVLERASLTGADLGDA-----KLNEANLYAARLG 579

Query: 189 R-----TVLTRSDLGGAIIEGADFSDAVIDLA 215
           R     T L+ ++L     +GAD S A +D A
Sbjct: 580 RVTAIGTQLSFANLTNTDWQGADLSGAYLDRA 611



 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 9/79 (11%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGA------DLSDTL---MDRMVLNEANLTNAVLVRTV 191
           D SG KF  A L +A    + F GA      D  D +   + +  L +ANLT+A L R +
Sbjct: 453 DLSGVKFKSANLNQASFKGSRFRGAGEDGRWDTYDDVIADLSQAQLQQANLTDANLSRVL 512

Query: 192 LTRSDLGGAIIEGADFSDA 210
           + R DL  A +  A+ S+A
Sbjct: 513 MNRIDLSRATLNRANLSNA 531


>gi|434405486|ref|YP_007148371.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
 gi|428259741|gb|AFZ25691.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
          Length = 808

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    ADL  A+    N  +AN + A++R ++  G+  +GAY   A    A+ +GA L
Sbjct: 103 SGANLSGADLSGAILFGANLSQANLSQANLRGANLRGADLSGAYPSGADLRGADLSGAYL 162

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S+  + +  L++ANL+ A L +  L+ + L GA + GAD S A
Sbjct: 163 SEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYLSGADLSGA 205



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 50/81 (61%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++ E+   G+K + A L +A    AN +GA+LS+ ++    L++ANL+ A L  
Sbjct: 45  ANLSQANLSEAILFGAKLSQANLSQANLSGANLSGANLSEAILFGAKLSQANLSQANLSG 104

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ +DL GAI+ GA+ S A
Sbjct: 105 ANLSGADLSGAILFGANLSQA 125



 Score = 43.5 bits (101), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +AN + A++ ++D SG+   GAYL       A+ +GADLS   + R  L+ A+L+ A L 
Sbjct: 174 QANLSQANLSQADLSGAYLTGAYLS-----GADLSGADLSGARLSRADLSRADLSAADLR 228

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              L+ +DL  A + GA  S A +
Sbjct: 229 GAYLSAADLSAAYLSGAYLSAAYL 252



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++ E+   G+K + A L +A    AN +GADLS  ++    L++ANL+ A L  
Sbjct: 75  ANLSGANLSEAILFGAKLSQANLSQANLSGANLSGADLSGAILFGANLSQANLSQANLRG 134

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L  +DL GA   GAD   A
Sbjct: 135 ANLRGADLSGAYPSGADLRGA 155



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 58/132 (43%), Gaps = 34/132 (25%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 163
           S A    A+L +A+     F A  + A++ +++ SG+  +GA L  A+ + AN +     
Sbjct: 73  SGANLSGANLSEAIL----FGAKLSQANLSQANLSGANLSGADLSGAILFGANLSQANLS 128

Query: 164 -------------------------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
                                    GADLS   +    L++A L+ A L +  L+++DL 
Sbjct: 129 QANLRGANLRGADLSGAYPSGADLRGADLSGAYLSEAKLSQAKLSQANLSQANLSQADLS 188

Query: 199 GAIIEGADFSDA 210
           GA + GA  S A
Sbjct: 189 GAYLTGAYLSGA 200


>gi|416374431|ref|ZP_11683193.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
           0003]
 gi|357266721|gb|EHJ15312.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
           0003]
          Length = 279

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 11/111 (9%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A   SADLR A   + N   A+ TSA++  ++ +G+  NGA L +     AN +G DLS 
Sbjct: 54  ATLASADLRGANLKQVNLSYADLTSANLSGANLTGAILNGAKLNRVDLSYANLSGVDLSG 113

Query: 170 TLMDR-----MVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDA 210
             + R     + L EA+LTNA L +  +++S     D   A ++GA+FS A
Sbjct: 114 ANLSRSDLSYVDLREADLTNANLYKADISQSKLHNTDFQEAFLQGANFSRA 164


>gi|443310610|ref|ZP_21040256.1| serine/threonine protein kinase [Synechocystis sp. PCC 7509]
 gi|442779315|gb|ELR89562.1| serine/threonine protein kinase [Synechocystis sp. PCC 7509]
          Length = 533

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N +  D+  +D +   F+G  L K   +KA    +DL      +  LN+A+L +A L R 
Sbjct: 413 NLSMLDLERADLTEVNFHGCNLHKTNLHKAILFNSDLG-----QASLNQASLKDANLSRA 467

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
            L+ +DL GA + GAD SDA ++ A
Sbjct: 468 YLSHADLEGADLRGADLSDAYLNHA 492


>gi|37522461|ref|NP_925838.1| hypothetical protein gll2892 [Gloeobacter violaceus PCC 7421]
 gi|35213462|dbj|BAC90833.1| gll2892 [Gloeobacter violaceus PCC 7421]
          Length = 457

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 1/101 (0%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADLR A     N   AN   AD+  +D +G+  N A+L  A   +AN  GA+L+ 
Sbjct: 79  ANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNR 138

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             +    L  ANL+   L R  ++ +DL GA + GA+ S+A
Sbjct: 139 ANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEA 179



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 6/100 (6%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A+   A+LR+A     N       RA  + AD+R +D  G+  + A L  A    AN  G
Sbjct: 134 AELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEADLGGANLGGANLKG 193

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           ADL    ++R  L  A+L  A L RT LT   L GA++EG
Sbjct: 194 ADLGGANLERTSLRGADLRGADLRRTRLTGCSLEGAVLEG 233



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 45/84 (53%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   AD+RE++  G++ N A L +A    AN +G  LS   M    L  A+L  A L  
Sbjct: 119 AHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSE 178

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
             L  ++LGGA ++GAD   A ++
Sbjct: 179 ADLGGANLGGANLKGADLGGANLE 202



 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 50/107 (46%), Gaps = 14/107 (13%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A  G ADL  A         N   A++ E+D  G+  N A L  A    A+ +GADL+  
Sbjct: 64  ADLGGADLEGA---------NLGGANLSEADLRGANLNWANLNWANLNWADLSGADLNGA 114

Query: 171 LMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            ++   LN     EANL  A L R  L  ++LGGA + G   S A +
Sbjct: 115 NLNWAHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFM 161



 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 58/121 (47%), Gaps = 2/121 (1%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   AD+  +D  G+   GA LE A    AN + ADL    ++   LN ANL  A L  
Sbjct: 49  ANLGGADLDGADLGGADLGGADLEGANLGGANLSEADLRGANLNWANLNWANLNWADLSG 108

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN-GTNPITGVSTRKSLGCGNSRRN 247
             L  ++L  A +  AD  +A +  A+  +A  + AN G   ++GVS  ++   G   R 
Sbjct: 109 ADLNGANLNWAHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFMSGADLRG 168

Query: 248 A 248
           A
Sbjct: 169 A 169


>gi|425454784|ref|ZP_18834510.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9807]
 gi|389804455|emb|CCI16535.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9807]
          Length = 262

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 55/102 (53%)

Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
            L++ +  ++  + + + + + +S+ +G+K NGA L  A   +AN +GADLS   +    
Sbjct: 28  HLQQLLSTRKCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGAS 87

Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
              ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 88  FFGANLTGANLSGAILTGADLRGAYLNNANLENTKLDTAYVQ 129



 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+LS  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLEN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|158338433|ref|YP_001519610.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158308674|gb|ABW30291.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 219

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 11/105 (10%)

Query: 111 AQFGSADLRKAVHVKENFRA-----------NFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A F SAD RKA   + + RA           N   A++  ++ SG+  +GA L  A+ Y 
Sbjct: 71  ANFASADFRKAKLFRADLRATCLYRADLRGANLRGANLFGANLSGANLSGANLSNAMLYC 130

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           AN  GA+L  T++D   L  AN ++  L   +L  + L G   +G
Sbjct: 131 ANLGGANLRGTILDSANLMRANFSHGDLRNAILRNAKLQGTHFDG 175



 Score = 37.0 bits (84), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 22/84 (26%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A  G A+LR  +    N  RANF+  D+R +    +K  G + +     + +    +LS 
Sbjct: 131 ANLGGANLRGTILDSANLMRANFSHGDLRNAILRNAKLQGTHFDGTRMLRTDLDEINLSK 190

Query: 170 TLMDRMVLNEANLTNAVLVRTVLT 193
           T +D + L + +L N+ +    +T
Sbjct: 191 TQIDGVHLMDIDLNNSAMENAAIT 214


>gi|149179551|ref|ZP_01858089.1| pentapeptide repeat domain protein [Planctomyces maris DSM 8797]
 gi|148841608|gb|EDL56033.1| pentapeptide repeat domain protein [Planctomyces maris DSM 8797]
          Length = 343

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 51/88 (57%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN   AD+   + +G+  N A LE A   ++NF+ ADL++T +    L EAN  NA L 
Sbjct: 83  RANLQKADLTGGNLTGAILNEANLEAAYLNQSNFSHADLNETKLAHTKLMEANFFNADLR 142

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           +  L+ +DL GA ++ ++ S A +  A+
Sbjct: 143 KADLSGADLRGANLKWSNLSGARLSAAE 170



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 11/92 (11%)

Query: 117 DLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTGA 165
           DL KA   ++N        A+  +AD+R+++  GS  +GAYL +A        +AN   A
Sbjct: 30  DLFKADLRRDNLSDLDLSEADLRNADLRDANLEGSDLSGAYLGQARLCQTNLCRANLQKA 89

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
           DL+   +   +LNEANL  A L ++  + +DL
Sbjct: 90  DLTGGNLTGAILNEANLEAAYLNQSNFSHADL 121



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 111 AQFGSADLR--KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + F  ADL   K  H K    ANF +AD+R++D SG+   GA L+      +N +GA LS
Sbjct: 114 SNFSHADLNETKLAHTKL-MEANFFNADLRKADLSGADLRGANLK-----WSNLSGARLS 167

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
              + +  L E +L++A L   + T + L
Sbjct: 168 AAELSKANLIETDLSDADLTEAIFTDAKL 196


>gi|39997499|ref|NP_953450.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
           PCA]
 gi|39984390|gb|AAR35777.1| pentapeptide repeat domain protein [Geobacter sulfurreducens PCA]
          Length = 254

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 14/100 (14%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           + AQ   A L +A+         F +ADMR +  SG     AY+  A    AN +GAD+ 
Sbjct: 85  TGAQMDGASLDEAI---------FDTADMRSAHCSG-----AYIHHAKFVGANLSGADMR 130

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
              +++   ++ANLTNA      L  ++LGGA++ G +FS
Sbjct: 131 KVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFS 170



 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 4/102 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    AD+RK V+V+   + NF+ A++  ++FSG+K   A L  AV    NF+ ADLS T
Sbjct: 122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +  + L  AN   A    T+L  + L GA +  + F    I
Sbjct: 178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSI 219



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 50/108 (46%), Gaps = 11/108 (10%)

Query: 111 AQFGSADLRKA------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A F +AD+R A      +H  +   AN + ADMR+ +     F+ A L  A     NF+G
Sbjct: 97  AIFDTADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNA-----NFSG 151

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A L    +   VL   N + A L  T L   DL GA   GA F+  ++
Sbjct: 152 AKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLL 199


>gi|288920260|ref|ZP_06414574.1| pentapeptide repeat protein [Frankia sp. EUN1f]
 gi|288348364|gb|EFC82627.1| pentapeptide repeat protein [Frankia sp. EUN1f]
          Length = 287

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 59/132 (44%), Gaps = 11/132 (8%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           G+G A     ADL        N R      A  + AD+R +D  G+   GA L  A    
Sbjct: 74  GVGRA----GADLAGRTFTGRNLRGADLRGAFLSGADLRGADLRGACLRGADLRDADLSS 129

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQ 218
           A  +GADL   L+    L+ A+L  A L R  L R+DL GA +  AD   A   ++  + 
Sbjct: 130 AALSGADLHGALLVGTYLSRADLRGADLGRVYLRRADLRGAFLGRADLRGADAAEIVLRG 189

Query: 219 ALCKYANGTNPI 230
           A+ + A  T  +
Sbjct: 190 AVLRGAEATGAV 201



 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 16/123 (13%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           N   A+ RG F   S A    ADLR A             AD+R++D S +  +GA L  
Sbjct: 91  NLRGADLRGAFL--SGADLRGADLRGAC---------LRGADLRDADLSSAALSGADLHG 139

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSD 209
           A+      + ADL    + R+ L  A+L  A L R  L  +D     L GA++ GA+ + 
Sbjct: 140 ALLVGTYLSRADLRGADLGRVYLRRADLRGAFLGRADLRGADAAEIVLRGAVLRGAEATG 199

Query: 210 AVI 212
           AV+
Sbjct: 200 AVL 202


>gi|307352983|ref|YP_003894034.1| pentapeptide repeat-containing protein [Methanoplanus petrolearius
           DSM 11571]
 gi|307156216|gb|ADN35596.1| pentapeptide repeat protein [Methanoplanus petrolearius DSM 11571]
          Length = 165

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 10/86 (11%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           ++A F   D+R +D  G  F+           A+FTGADL+D  +     + A+L+ AVL
Sbjct: 77  YKAVFQGTDLRNADLHGGIFS----------LADFTGADLTDADLVGAAFDYADLSGAVL 126

Query: 188 VRTVLTRSDLGGAIIEGADFSDAVID 213
           +   +  +DL GA + GAD +DA+I+
Sbjct: 127 IGADMRYADLRGADLSGADLTDALIE 152


>gi|119486130|ref|ZP_01620190.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
 gi|119456621|gb|EAW37750.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
          Length = 207

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 24/137 (17%)

Query: 96  KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRA----------------NFTSAD 136
           K  A  RG    G+    A   +ADLR A+ +  + R                 + T  D
Sbjct: 62  KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGALDLTGVD 121

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           +R +D  G   + A L++A     N +GADLS     +  L EANL+ AVL  T L R++
Sbjct: 122 LRGADLRGVSLSQAILQQADLRNTNLSGADLS-----QADLEEANLSGAVLRGTNLERAN 176

Query: 197 LGGAIIEGADFSDAVID 213
           L  AI+E   +   ++D
Sbjct: 177 LLCAIVEQTQWFGTILD 193



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 60/131 (45%), Gaps = 20/131 (15%)

Query: 116 ADLRKAVHVKENFRA------NFTSADMRESDFSG----------SKFNGAYLEKAVAYK 159
           A+L++A  ++ N R       N   AD+R +D  G          + F GA+L  A    
Sbjct: 56  ANLQRA-KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGA 114

Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 216
            + TG DL    +  + L++A L  A L  T L+ +DL  A +E A+ S AV+   +L +
Sbjct: 115 LDLTGVDLRGADLRGVSLSQAILQQADLRNTNLSGADLSQADLEEANLSGAVLRGTNLER 174

Query: 217 KQALCKYANGT 227
              LC     T
Sbjct: 175 ANLLCAIVEQT 185


>gi|119493532|ref|ZP_01624198.1| hypothetical protein L8106_18192 [Lyngbya sp. PCC 8106]
 gi|119452649|gb|EAW33830.1| hypothetical protein L8106_18192 [Lyngbya sp. PCC 8106]
          Length = 192

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 53/95 (55%), Gaps = 6/95 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN T AD+  +D SG+  +GA L  A+   AN + ADLS + + R  L  A LT+A L  
Sbjct: 80  ANLTRADLTGADLSGADLHGADLSGAILSGANLSYADLSKSTLFRAELLNATLTHANLKG 139

Query: 190 TVLTRSDLGGAIIEGADFSDA------VIDLAQKQ 218
             L +++L GA+++ A F  A      V+ L ++Q
Sbjct: 140 ANLKQTNLEGAVVQDAVFVKAMGLAFEVVSLLKQQ 174



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 10/92 (10%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-SDTLMDRMV---------LNEA 180
           NF   D+  ++ S +  +GA L +A  ++AN   A+L   +L +  +         L+ A
Sbjct: 16  NFRDTDLFRAELSNANLSGANLFRANLFRANLFRANLLGVSLFNANLIGANLYCANLSGA 75

Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           NL+ A L R  LT +DL GA + GAD S A++
Sbjct: 76  NLSGANLTRADLTGADLSGADLHGADLSGAIL 107



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           F AN   A++  ++ SG+  +GA L +A    A+ +GADL    +   +L+ ANL+ A L
Sbjct: 58  FNANLIGANLYCANLSGANLSGANLTRADLTGADLSGADLHGADLSGAILSGANLSYADL 117

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
            ++ L R++L  A +  A+   A
Sbjct: 118 SKSTLFRAELLNATLTHANLKGA 140



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 5/83 (6%)

Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
           FRAN   A++  ++  G     A L  A  Y AN +GA+LS   + R     A+LT A L
Sbjct: 38  FRANLFRANLFRANLLGVSLFNANLIGANLYCANLSGANLSGANLTR-----ADLTGADL 92

Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
               L  +DL GAI+ GA+ S A
Sbjct: 93  SGADLHGADLSGAILSGANLSYA 115


>gi|418719603|ref|ZP_13278802.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
           UI 09149]
 gi|418737331|ref|ZP_13293728.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii
           serovar Castellonis str. 200801910]
 gi|421093686|ref|ZP_15554410.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
           200801926]
 gi|410363669|gb|EKP14698.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
           200801926]
 gi|410743646|gb|EKQ92388.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
           UI 09149]
 gi|410746525|gb|EKQ99431.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii
           serovar Castellonis str. 200801910]
 gi|456889646|gb|EMG00529.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
           200701203]
          Length = 263

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 44/82 (53%), Gaps = 9/82 (10%)

Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
           DFSG+    A+L+ +    ANF GA L  +      LN A+L NA      L  + L GA
Sbjct: 166 DFSGANLGHAFLQNSSFVGANFEGAKLRGSF-----LNNADLRNANFRGADLRWAKLAGA 220

Query: 201 IIEGADFSDAVID----LAQKQ 218
            +EGADF+DA+ D    L QKQ
Sbjct: 221 NVEGADFTDAIYDIGTRLDQKQ 242


>gi|381395251|ref|ZP_09920956.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
           611]
 gi|379329152|dbj|GAB56089.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
           611]
          Length = 258

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 60/116 (51%), Gaps = 10/116 (8%)

Query: 107 IGSAAQFGSADLR----KAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           IGS   F  AD+R    K V  +     R+  T+ADMR  DF G  F+ A LE A    A
Sbjct: 139 IGST--FIDADMRDSSLKNVRARSAMFTRSVLTNADMRWGDFEGVDFSNANLEGADLTMA 196

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
           N  GA+L+   +   +L   NL  A+L  T++  + + GA ++  DF+   +DL+Q
Sbjct: 197 NLRGANLTAANLKNAMLLYTNLEGAILNGTIMDGAQIVGANMKRVDFTK--VDLSQ 250



 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 39/84 (46%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A    AD+ ES+   + FN A L+      A   G+   D  M    L      +A+  R
Sbjct: 106 AQLLGADLSESNLRNANFNKAVLQYTGFIDATLIGSTFIDADMRDSSLKNVRARSAMFTR 165

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
           +VLT +D+     EG DFS+A ++
Sbjct: 166 SVLTNADMRWGDFEGVDFSNANLE 189


>gi|159045175|ref|YP_001533969.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
 gi|157912935|gb|ABV94368.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
          Length = 245

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 53/110 (48%), Gaps = 11/110 (10%)

Query: 111 AQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           AQF  A +R  +  + N R   F  AD+R +        G  L +A   +A+  GADLS 
Sbjct: 122 AQFSGARMRGILFDRTNARDTVFAGADLRAA-----SMVGVALPRATLTEADLGGADLSG 176

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
                  L  AN  NA LV  VL  +DL GA + GAD S+A +  A  QA
Sbjct: 177 AF-----LEGANFGNARLVGAVLREADLTGARLTGADLSEADLTGAVTQA 221



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 44/105 (41%), Gaps = 30/105 (28%)

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---------------DRM----- 175
           D+  ++ +G+    AYL  AV   AN  GADL D  M               DR      
Sbjct: 83  DLAGAELAGADLRDAYLTYAVFDGANLEGADLRDAFMPFAQFSGARMRGILFDRTNARDT 142

Query: 176 -----VLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDA 210
                 L  A++    L R  LT +DLG     GA +EGA+F +A
Sbjct: 143 VFAGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNA 187


>gi|315497235|ref|YP_004086039.1| pentapeptide repeat protein [Asticcacaulis excentricus CB 48]
 gi|315415247|gb|ADU11888.1| pentapeptide repeat protein [Asticcacaulis excentricus CB 48]
          Length = 224

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 48/95 (50%), Gaps = 10/95 (10%)

Query: 129 RANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLN 178
           +A+FTSAD+ E+     DF+ + F G+ L +    +   TGAD S     D   +   LN
Sbjct: 73  QADFTSADLTEAQFTACDFNNTPFKGSGLAQVRFLRCKLTGADFSHSRNMDVSFEDCRLN 132

Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
           +A L N   V+  L   DL  A ++G DF  AV +
Sbjct: 133 DARLKNFAFVKQTLKSLDLTNADLQGCDFRQAVFE 167


>gi|122920845|pdb|2J8K|A Chain A, Structure Of The Fusion Of Np275 And Np276, Pentapeptide
           Repeat Proteins From Nostoc Punctiforme
          Length = 201

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 65/147 (44%), Gaps = 37/147 (25%)

Query: 113 FGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
           F   DLR AV    N             AN   A++  +D SG+  NGA L       AN
Sbjct: 37  FSIVDLRGAVLENINLSGAILHGAMLDEANLQQANLSRADLSGATLNGADL-----RGAN 91

Query: 162 FTGADLSD----------TLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGAD 206
            + ADLSD           ++D  VLN+ANL  A L + +L+      +DL  A +E AD
Sbjct: 92  LSKADLSDAILDNAILEGAILDEAVLNQANLKAANLEQAILSHANIREADLSEANLEAAD 151

Query: 207 FSD---AVIDLAQ---KQALCKYANGT 227
            S    A+ DL Q    QA  + AN T
Sbjct: 152 LSGADLAIADLHQANLHQAALERANLT 178



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 57/106 (53%), Gaps = 16/106 (15%)

Query: 131 NFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           +F+  D+R +     + SG+  +GA L++A   +AN + ADLS        LN A+L  A
Sbjct: 36  DFSIVDLRGAVLENINLSGAILHGAMLDEANLQQANLSRADLSGA-----TLNGADLRGA 90

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYAN 225
            L +  L+ + L  AI+EGA   +AV++ A       +QA+  +AN
Sbjct: 91  NLSKADLSDAILDNAILEGAILDEAVLNQANLKAANLEQAILSHAN 136



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 6/77 (7%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A   +A+L +A+    N R      AN  +AD+  +D + +  + A L +A   +AN TG
Sbjct: 120 ANLKAANLEQAILSHANIREADLSEANLEAADLSGADLAIADLHQANLHQAALERANLTG 179

Query: 165 ADLSDTLMDRMVLNEAN 181
           A+L D  ++  +L   N
Sbjct: 180 ANLEDANLEGTILEGGN 196


>gi|86606920|ref|YP_475683.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86555462|gb|ABD00420.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
          Length = 154

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/127 (35%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 164
           S AQ   A+LR  V       A+ + AD+RE D SG+  +GA L  A   + N  G    
Sbjct: 32  SGAQLSGANLRGIVLRD----ADLSGADLREGDLSGADLSGADLRGAKLRRVNLIGAKLV 87

Query: 165 -ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
            ADL    + R  L  A+L+ A L R  L   +DL GAII    F  A+ D        K
Sbjct: 88  KADLRGANLYRAKLLRADLSEADLSRADLRIGADLRGAIITNTRFRGALYD-----EYTK 142

Query: 223 YANGTNP 229
           +  G NP
Sbjct: 143 FPEGFNP 149


>gi|398337534|ref|ZP_10522239.1| hypothetical protein LkmesMB_19432 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 263

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 4/92 (4%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + +S  + + +F G  F+GA L  A    ++F GA+ S   +    LN A+L N+     
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSGAKLRGSFLNNADLRNSNFRGA 210

Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
            L  + L GA +EGADF+DA+ D    L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242


>gi|162450992|ref|YP_001613359.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
 gi|161161574|emb|CAN92879.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
          Length = 579

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 6/102 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFRAN-FTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 162
           + A+   A+LR+A+      R      AD+      ++D  G+   GA LE+A+   AN 
Sbjct: 286 TGAELTGANLRRALLQGAILRGQRLAGADLEMTLLVDADLEGADLQGARLERAILDGANL 345

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
            GADL+  L+ + +L  A L   +L + +  R DL G  ++G
Sbjct: 346 RGADLTRALLLQTLLRGAALDGVILDKAIFDRVDLTGTDLQG 387



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 44/84 (52%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A ++ +   G +  GA LE  +   A+  GADL    ++R +L+ ANL  A L R
Sbjct: 293 ANLRRALLQGAILRGQRLAGADLEMTLLVDADLEGADLQGARLERAILDGANLRGADLTR 352

Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
            +L ++ L GA ++G     A+ D
Sbjct: 353 ALLLQTLLRGAALDGVILDKAIFD 376



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A FT +D+R +   G+  +GA L +A    A+  GADL+ TL+    L  A LT A L R
Sbjct: 79  ATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGAKLDR 138

Query: 190 TVLT-----RSDLGGAIIEGADFSDA 210
             L       ++L GA+++GA  + A
Sbjct: 139 IRLDFAKLPGAELAGAVLQGASLNKA 164



 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 17/110 (15%)

Query: 112 QFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           Q G A LR    K +H+ E   A+   +D++++ +      GA L++     A FTG+DL
Sbjct: 30  QLGGARLRGAKLKDIHLDE---ADLAGSDLQDTQWFRCPLRGASLDRCDLRGATFTGSDL 86

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 212
                    L  ANL+ A L+R  L  +DL GA     ++ GAD + A +
Sbjct: 87  RGA-----RLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARL 131



 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 1/97 (1%)

Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           DL + +   E F     T  D+R     G++  GA L+     +A+  G+DL DT   R 
Sbjct: 5   DLARRLRAGEPFAGKTITRFDLRGKQLGGARLRGAKLKDIHLDEADLAGSDLQDTQWFRC 64

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            L  A+L    L     T SDL GA + GA+ S A +
Sbjct: 65  PLRGASLDRCDLRGATFTGSDLRGARLRGANLSGAKL 101



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 16/112 (14%)

Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           DLR A     + R      AN + A +  ++ +G+   GA L   +   A+ TGA L+  
Sbjct: 75  DLRGATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGA 134

Query: 171 LMDRMVLNEANLTNAVLVRTV----------LTRSDLGGAIIEGADFSDAVI 212
            +DR+ L+ A L  A L   V          LTR+ L  A I G+ F DA +
Sbjct: 135 KLDRIRLDFAKLPGAELAGAVLQGASLNKADLTRALLRDARITGSTFYDARL 186



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 16/119 (13%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFS----------GSKFNGAYLEK 154
           A F  +DLR A     N       RAN   AD+  +D +          G++  GA L++
Sbjct: 79  ATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGAKLDR 138

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
                A   GA+L+  ++    LN+A+LT A+L    +T S    A + GAD   A ++
Sbjct: 139 IRLDFAKLPGAELAGAVLQGASLNKADLTRALLRDARITGSTFYDARLGGADLGGATLE 197



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 14/89 (15%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL- 187
           +A+ T A +R++  +GS F          Y A   GADL    ++++VL  A+L  A+L 
Sbjct: 163 KADLTRALLRDARITGSTF----------YDARLGGADLGGATLEKVVLVRADLRGAILP 212

Query: 188 ---VRTVLTRSDLGGAIIEGADFSDAVID 213
               R+VL  + L    + GAD + + +D
Sbjct: 213 KSMTRSVLDEARLDRPDLSGADLAASELD 241



 Score = 37.0 bits (84), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 48/102 (47%), Gaps = 4/102 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A+    DLR+A        +NFT AD+R +D   S    A L +A   +A+ +GA   + 
Sbjct: 403 AKLAGMDLREADFTG----SNFTRADLRGADLRSSVLTRATLMEADLARADLSGATAKEA 458

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
                 L  A   +A L R   TR+DL  A + GAD  D V+
Sbjct: 459 FFGDAALAGARARDARLRRATFTRADLDHADLSGADLGDVVM 500


>gi|448412419|ref|ZP_21576534.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
 gi|445668180|gb|ELZ20811.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
          Length = 561

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 11/108 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           +A    S D   A     +FR A   +A++R++D  G+ F GA L  A    A+ TGA+ 
Sbjct: 251 TAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTGANF 310

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
            D          A+LT+A L    L+ +DL  A + GAD  DA +  A
Sbjct: 311 RD----------ADLTDAHLRGADLSEADLKDATLCGADLKDATLTRA 348



 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 75/161 (46%), Gaps = 20/161 (12%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADM-----RESDFSGSKFNGAYLEKAVAYK 159
           A   +A+LR A  V  +F+      A+ T+AD+     R++D + +   GA L +A    
Sbjct: 273 AGLQNAELRDADLVGADFQGADLRNASLTNADLTGANFRDADLTDAHLRGADLSEADLKD 332

Query: 160 ANFTGADLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           A   GADL D  + R       L EA L NA L    L R DL  A +  AD +    DL
Sbjct: 333 ATLCGADLKDATLTRASLWNSDLTEAYLRNADLSDGYLRRVDLTDADLPAADLTG---DL 389

Query: 215 AQKQALCK-YANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 254
             + +L + ++     I+  + R+SL C ++     G P++
Sbjct: 390 NARCSLGRTFSMPRCAISDHTGRRSLTCRSTSARPSGRPTT 430



 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 4/111 (3%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           A LR     K++       A +RE+D SG+   G+ L+ A+   A+    DL+   M   
Sbjct: 127 AQLRGVALPKQSL---LERAVLREADLSGANLAGSTLKGAILTDASLREVDLTGADMMGA 183

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYAN 225
           VL EA+LT+  L +    ++ + GAI++ A+   A + DL   +A+ K A 
Sbjct: 184 VLVEADLTSGTLAQLSGDKAVMRGAILKDANLERAHLWDLTAPEAVFKRAT 234



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 40/94 (42%), Gaps = 15/94 (15%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA    A MR++   G+ F    LE       +F GA L+D    R  L  A L +A LV
Sbjct: 232 RATLCEATMRDAVLPGASFTAGTLESV-----DFGGATLTDASFRRAGLQNAELRDADLV 286

Query: 189 ----------RTVLTRSDLGGAIIEGADFSDAVI 212
                        LT +DL GA    AD +DA +
Sbjct: 287 GADFQGADLRNASLTNADLTGANFRDADLTDAHL 320



 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 36/81 (44%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+FT+  +   DF G+    A   +A    A    ADL         L  A+LTNA L  
Sbjct: 248 ASFTAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTG 307

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
                +DL  A + GAD S+A
Sbjct: 308 ANFRDADLTDAHLRGADLSEA 328


>gi|410449702|ref|ZP_11303755.1| NifU-like N-terminal domain protein [Leptospira sp. Fiocruz LV3954]
 gi|421111700|ref|ZP_15572173.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           JET]
 gi|410016459|gb|EKO78538.1| NifU-like N-terminal domain protein [Leptospira sp. Fiocruz LV3954]
 gi|410802896|gb|EKS09041.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           JET]
 gi|456874476|gb|EMF89769.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
           ST188]
          Length = 263

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 4/92 (4%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + +S  + + +F G  F+GA L  A    ++F GA+ S   +    LN A+L N      
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNADLRNTNFRGA 210

Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
            L  + L GA +EGADF+DA+ D    L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242


>gi|307152112|ref|YP_003887496.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982340|gb|ADN14221.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 180

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADL KA  V  N    N   AD+RE++ SG+    A L  A    AN TGA+L +
Sbjct: 60  ANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGANLRE 119

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSD 196
             +D   L  ANLT+A ++ T L  +D
Sbjct: 120 VNLDGANLMGANLTDAQIINTDLNMAD 146



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 66/128 (51%), Gaps = 7/128 (5%)

Query: 87  NISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGS 145
           NI  L  L +Y+A+ R +F     +    A+L  A   + N  RA+ + AD+ E+D SG+
Sbjct: 2   NIQEL--LKRYKAKER-DF---QGSNLHQANLEGANLQRINLTRADLSGADLSEADLSGA 55

Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
               A L  A   KA+  GA+L +  +    L EANL+ A L +  L  ++L GA + GA
Sbjct: 56  CLMQANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGA 115

Query: 206 DFSDAVID 213
           +  +  +D
Sbjct: 116 NLREVNLD 123



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
           TR +    S A    ADL  A  ++    AN T AD+ ++   G+      L  A   +A
Sbjct: 38  TRADL---SGADLSEADLSGACLMQ----ANLTDADLLKAHLVGANLVEINLIGADLREA 90

Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           N +GADL+     +  L  ANLT A L    L   +L GA + GA+ +DA I
Sbjct: 91  NLSGADLT-----KADLRCANLTGANLTGANLREVNLDGANLMGANLTDAQI 137


>gi|148972698|ref|ZP_01811409.1| hypothetical protein LVAL_00031 [Leptolyngbya valderiana BDU 20041]
 gi|148872721|gb|EDL71121.1| hypothetical protein LVAL_00031 [Leptolyngbya valderiana BDU 20041]
          Length = 170

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 55/98 (56%), Gaps = 7/98 (7%)

Query: 132 FTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
           FT AD+     R++D SG+   G  LE+A   KA  +GAD S  +++R +L EA+L +  
Sbjct: 5   FTDADLYGALLRDADLSGAHLVGVRLERANLIKAILSGADFSRAVLERALLIEADLRSTA 64

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA--LCK 222
             RT L  ++L  A +  A  ++A+++ A  +   LC+
Sbjct: 65  DQRTTLREANLREADLSYAHLNEAILEGANLEGAKLCR 102



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 54/125 (43%), Gaps = 29/125 (23%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRAN------FTSADMRESDFSGSKFNGAYLEKAVAY-- 158
           I S A F  A L +A+ ++ + R+          A++RE+D S +  N A LE A     
Sbjct: 39  ILSGADFSRAVLERALLIEADLRSTADQRTTLREANLREADLSYAHLNEAILEGANLEGA 98

Query: 159 ---------------------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
                                 AN  GADLS       +L  ANL +A L RTV  R+DL
Sbjct: 99  KLCRANLSSEAGTDALPTDLSNANLRGADLSYADFSGAILRNANLRDADLTRTVFDRTDL 158

Query: 198 GGAII 202
            GAI+
Sbjct: 159 TGAIL 163


>gi|114799805|ref|YP_760951.1| pentapeptide repeat-containing protein [Hyphomonas neptunium ATCC
           15444]
 gi|114739979|gb|ABI78104.1| pentapeptide repeat domain protein [Hyphomonas neptunium ATCC
           15444]
          Length = 245

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 54/101 (53%), Gaps = 11/101 (10%)

Query: 116 ADLRKAVHVKENF-RANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           ADLR A      F  A F +A M++     +DFS ++  GA LEKA     NF GA L  
Sbjct: 88  ADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFEGASL-- 145

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            L  R  L  A+L+ A    T+L R++L G I +GA+ S+A
Sbjct: 146 -LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183


>gi|428225932|ref|YP_007110029.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427985833|gb|AFY66977.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 180

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 56/118 (47%), Gaps = 26/118 (22%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
           N Y A+ RG       +  G A+LR+A         N   AD+++S+  G+    A L  
Sbjct: 69  NLYSAKLRG-------SDLGLANLREA---------NLGDADLKQSNLRGADLRNANLLG 112

Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           A   +A+  GADL D          ANLTNA L    L +++L GA++ G  F  AV+
Sbjct: 113 ASLIEADLRGADLRD----------ANLTNANLDGADLRQTNLQGAVLTGVSFRGAVL 160



 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 62/136 (45%), Gaps = 19/136 (13%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD----------LSDTLMDRMVLNEAN 181
              AD+R    +G K   A L K+  Y A   G+D          L D  + +  L  A+
Sbjct: 45  LRKADLRYFQLNGVKLLAANLSKSNLYSAKLRGSDLGLANLREANLGDADLKQSNLRGAD 104

Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTNPITGVSTRKSLG 240
           L NA L+   L  +DL GA +  A+ ++A +D A  +Q   + A     +TGVS R ++ 
Sbjct: 105 LRNANLLGASLIEADLRGADLRDANLTNANLDGADLRQTNLQGA----VLTGVSFRGAVL 160

Query: 241 CGNSRRNA----YGSP 252
           CG +  N     YG P
Sbjct: 161 CGATMPNGLAARYGCP 176


>gi|15965782|ref|NP_386135.1| signal peptide protein [Sinorhizobium meliloti 1021]
 gi|334316724|ref|YP_004549343.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
           AK83]
 gi|384529911|ref|YP_005713999.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
           BL225C]
 gi|384535747|ref|YP_005719832.1| hypothetical protein SM11_chr1295 [Sinorhizobium meliloti SM11]
 gi|407720970|ref|YP_006840632.1| signal peptide protein [Sinorhizobium meliloti Rm41]
 gi|418401673|ref|ZP_12975198.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
           CCNWSX0020]
 gi|433613810|ref|YP_007190608.1| putative low-complexity protein [Sinorhizobium meliloti GR4]
 gi|15075051|emb|CAC46608.1| Hypothetical protein signal peptide [Sinorhizobium meliloti 1021]
 gi|333812087|gb|AEG04756.1| pentapeptide repeat protein [Sinorhizobium meliloti BL225C]
 gi|334095718|gb|AEG53729.1| pentapeptide repeat protein [Sinorhizobium meliloti AK83]
 gi|336032639|gb|AEH78571.1| hypothetical protein signal peptide [Sinorhizobium meliloti SM11]
 gi|359504345|gb|EHK76882.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
           CCNWSX0020]
 gi|407319202|emb|CCM67806.1| signal peptide protein [Sinorhizobium meliloti Rm41]
 gi|429552000|gb|AGA07009.1| putative low-complexity protein [Sinorhizobium meliloti GR4]
          Length = 241

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 2/90 (2%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+F SA+++ +DF+G++  GA  EKA   +ANF  A L+ T      L+ A L+ AVL  
Sbjct: 124 ASFASAELQRTDFTGARLTGADFEKAELGRANFDKAVLTGTRFSMANLSRAKLSGAVLEG 183

Query: 190 TV-LTRSDLGGAIIEGADFSDAVIDLAQKQ 218
            + L R+ L    IEG D S A   L Q+Q
Sbjct: 184 PIDLDRAFLFLTRIEGVDLSSAS-GLTQEQ 212


>gi|443324425|ref|ZP_21053179.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442795970|gb|ELS05303.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 305

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 71/136 (52%), Gaps = 10/136 (7%)

Query: 111 AQFGSADLRKAVHVKE-NFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 164
           A   +A+L+ AV +      AN ++AD+ ++     D S +   GA L  A    ANF+ 
Sbjct: 71  ADLATANLQAAVLIGICLIEANLSNADLSDAYLMDGDLSNANLIGADLRDANCDHANFSN 130

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
           A+L  TLM ++ L  ANLT A L RT L+ ++L  A +  AD S+A  +L + + L  + 
Sbjct: 131 ANLIGTLMRKVRLRHANLTGAKLQRTNLSEAELIEAHLSEADLSNA--NLYEAELLNIFG 188

Query: 225 NGTN--PITGVSTRKS 238
             TN   +  ++T  S
Sbjct: 189 YKTNFCRVQAIATHMS 204



 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 60/125 (48%), Gaps = 12/125 (9%)

Query: 91  LADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF 147
           L++ N YEAE    FG  +     Q  +  + +A      F+ANF+ A++ + D      
Sbjct: 173 LSNANLYEAELLNIFGYKTNFCRVQAIATHMSRAYL----FQANFSEAELIKIDLRW--- 225

Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             A  ++A    AN   ADL  T +++  L +ANLT A L    L  +DL GA +  A+ 
Sbjct: 226 --ANCDRANFRNANLQQADLRGTNLNQADLKQANLTRANLRGANLNHADLRGANLTDANI 283

Query: 208 SDAVI 212
            DA+ 
Sbjct: 284 QDAIF 288


>gi|428218533|ref|YP_007102998.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427990315|gb|AFY70570.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 348

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 16/116 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           SA     A+L  A  ++ N   AN   A+  E++ S +  N AYL KA  + AN T A+L
Sbjct: 47  SAVNLRGANLSMANLIRANLSGANLIEANFDEANLSMAYLNCAYLNKAYLHGANLTWANL 106

Query: 168 SDTLMDRMVLNEANLTNAVLVRT---------------VLTRSDLGGAIIEGADFS 208
           S + +     +EANL+ AVL  T                L+ +DLGGA + GA+ S
Sbjct: 107 SQSCLIDTDASEANLSGAVLSGTDAYGSNFSGANLSEAYLSVADLGGANLHGANLS 162



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 21/114 (18%)

Query: 115 SADLRKAVHVKENF-RANFTS-----ADMRESDFSGSKFNGA---------------YLE 153
           +AD+R A  ++ +  RA+ T      AD+ ++   G++ +GA               +LE
Sbjct: 208 AADIRGASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLE 267

Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
            A  Y A+   ADLS   ++   LNEA L  A+L  T L  +DL GA + GA+ 
Sbjct: 268 AANLYGASLYSADLSQANLNAAYLNEAFLFGAILKWTNLADADLSGAHLGGANL 321



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 6/119 (5%)

Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 164
           A    A+L     +  NF RAN ++A+M +++ + SKF  A L++A       Y A+  G
Sbjct: 154 ANLHGANLSSVYAIATNFERANLSNANMSKANCAKSKFGSAILDRANLSMSYLYAADIRG 213

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
           A L +T + R  L + +L  A L    L  ++L GA +  A+   A + L+  +A   Y
Sbjct: 214 ASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLEAANLY 272



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 9/109 (8%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
           G+ +  Q+ SA+ R  V +     A+ TS D+ ++D S     GA L  A   +AN +GA
Sbjct: 13  GVSTWNQWRSANSRIQVDLT---GADLTSVDLLDADLSAVNLRGANLSMANLIRANLSGA 69

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 213
           +L +   D     EANL+ A L    L ++ L GA +  A+ S + +ID
Sbjct: 70  NLIEANFD-----EANLSMAYLNCAYLNKAYLHGANLTWANLSQSCLID 113



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 38/77 (49%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
            +  D   S+FSG+  + AYL  A    AN  GA+LS           ANL+NA + +  
Sbjct: 126 LSGTDAYGSNFSGANLSEAYLSVADLGGANLHGANLSSVYAIATNFERANLSNANMSKAN 185

Query: 192 LTRSDLGGAIIEGADFS 208
             +S  G AI++ A+ S
Sbjct: 186 CAKSKFGSAILDRANLS 202


>gi|159030580|emb|CAO88243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 354

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 1/133 (0%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMR 138
           + A+ S +   LA L + +  T        AA+     L  A   + N R AN T AD+ 
Sbjct: 204 IYAAVSDDFLELAQLAELDPLTDFTGANLLAAELSGISLGMANLYQANLRGANLTDADLS 263

Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
           E + S + F GA L  A+   A+ + AD   + +    L  +NLT A LV   +T+++L 
Sbjct: 264 EINGSHASFKGADLSGALLANADLSYADFYRSSLALANLIGSNLTGANLVEVNITQANLS 323

Query: 199 GAIIEGADFSDAV 211
           GA ++GA F+D V
Sbjct: 324 GAKVQGAKFADNV 336


>gi|381204843|ref|ZP_09911914.1| hypothetical protein SclubJA_04390 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 214

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 50/106 (47%), Gaps = 11/106 (10%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A LRK    K + R      A+   AD+R  D SG+  + A L  A   KAN TG
Sbjct: 40  ANLSGATLRKVNLNKSSLRQATLKEASLVGADLRRVDLSGANLSNANLVGANLRKANLTG 99

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ADLS        L+ ANLT AVL    LT ++L G  + GA    A
Sbjct: 100 ADLSGA-----KLSNANLTGAVLSSANLTGTNLLGVELIGAKLERA 140



 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 55/125 (44%), Gaps = 15/125 (12%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
              A+L  A  V  N R AN T AD+  +  S +   GA L  A     N  G +L    
Sbjct: 77  LSGANLSNANLVGANLRKANLTGADLSGAKLSNANLTGAVLSSANLTGTNLLGVELIGAK 136

Query: 172 MDR-----MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DLAQKQ 218
           ++R      +L  ANL+   LV +  T +DL  A + GA   D  +        +L++KQ
Sbjct: 137 LERANARGAILKNANLSMTNLVLSNFTEADLSNANLSGAKLIDTDLTRATLRNANLSRKQ 196

Query: 219 ALCKY 223
            LC+ 
Sbjct: 197 -LCRV 200


>gi|282898833|ref|ZP_06306820.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281196360|gb|EFA71270.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 189

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 7/86 (8%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           +F  AD+  SD +G   +GA L      +AN  GA L +  +  ++L  A+LT A+L+  
Sbjct: 36  DFARADLSWSDLTGISLSGANLS-----QANLRGAKLENAHLSEVILCGADLTQAILINA 90

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQ 216
            L  SDL GA++  A+  DA  DL Q
Sbjct: 91  HLNESDLSGALLVDANLCDA--DLHQ 114



 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 11/98 (11%)

Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    +DL  A+ V  N       +A+ T+A+++ +  +G+K  G  + KA    A+ TG
Sbjct: 90  AHLNESDLSGALLVDANLCDADLHQASITAANLQSAKLNGAKMGGVRMWKADLQGADLTG 149

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           ADLS+  M  + L+ ANL+   +  T LT     GAI+
Sbjct: 150 ADLSEANMCGVNLSMANLSATDMSETFLT-----GAIM 182


>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 231

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 60/107 (56%), Gaps = 6/107 (5%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGA 165
           QF +   ++A  +K N   A+F+ AD R S     +F+ + F GA L +A+ +  +FTGA
Sbjct: 16  QFKTCKFQEAELIKVNLSGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGA 75

Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
           +L   ++  + L+ A L+ A L    L ++ LGGA +  A+  +A++
Sbjct: 76  NLEKAILREVELSGAILSQANLTGVNLMKATLGGANLSLANLREAIL 122



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 54/109 (49%), Gaps = 14/109 (12%)

Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
           A    A+LR+A+  + +FR       N   AD+ E+D S +K NG  L +A    A    
Sbjct: 110 ANLSLANLREAILYEADFRPTSEHITNLQQADLSEADLSYAKLNGVNLRQAKLMGAKLCR 169

Query: 165 ADLSDTLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           ADLS  +    +   L EANL NA      L+ +DL GAI+  AD + A
Sbjct: 170 ADLSKGIWQNSLPTDLCEANLRNA-----DLSYADLSGAILSYADLTGA 213


>gi|425455658|ref|ZP_18835373.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
           9807]
 gi|389803408|emb|CCI17656.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
           9807]
          Length = 354

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 55/103 (53%), Gaps = 1/103 (0%)

Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           AA+     L  A   + N R AN T AD+ E + S + F GA L  A+   A+ + AD  
Sbjct: 234 AAELSGISLGMANLYQANLRGANLTDADLSEINGSHASFKGADLSGALLANADLSYADFY 293

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
            + +    L  +NLT A LV   +T+++L GA ++GA F+D V
Sbjct: 294 RSSLALANLIGSNLTGANLVEVNITQANLSGAKVQGAKFADNV 336


>gi|383501588|ref|YP_005414947.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
 gi|378932599|gb|AFC71104.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
          Length = 960

 Score = 48.9 bits (115), Expect = 0.003,   Method: Composition-based stats.
 Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 7/116 (6%)

Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           +  +ADL KA   K N   A+ T+A +  +     K + A LEKA A      G ++SD 
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFVKLSNATLEKAEA-----EGLNISDV 609

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
           +   +   EAN  N ++ R  LT++D   A++E AD      +D   K+A  K AN
Sbjct: 610 IAKNINAKEANFKNVIMQRADLTKADFTKAVLENADMQAVEALDAIFKEATLKQAN 665



 Score = 37.7 bits (86), Expect = 5.8,   Method: Composition-based stats.
 Identities = 43/169 (25%), Positives = 72/169 (42%), Gaps = 17/169 (10%)

Query: 51  PDCSNNQCAGPYA---KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGI 107
           PD S+   +G       LKN  +F S  L    +++C+ + +     N   A  +     
Sbjct: 342 PDLSDINLSGKTLTNLNLKN-TLFASANLENINISNCNLDFTNFEGANLQNAVFQDVTAR 400

Query: 108 GSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK------- 159
            +   F  ADL+K+ +   +  RA     D+ E++ + SKFN   +  A A K       
Sbjct: 401 NTGFLF--ADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIMQDSE 458

Query: 160 ---ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
              +N TG  L+   M R+ +    L NA+L +  +  +DL  A +  A
Sbjct: 459 WKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIISTDLENAFMNNA 507


>gi|379720162|ref|YP_005312293.1| hypothetical protein PM3016_2256 [Paenibacillus mucilaginosus 3016]
 gi|378568834|gb|AFC29144.1| hypothetical protein PM3016_2256 [Paenibacillus mucilaginosus 3016]
          Length = 288

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           +  FT + +  SDFSG+   G+  + +   +ANF GA+L+D  +  + L  A+    +LV
Sbjct: 100 KGRFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSTLDLANASFHKTILV 159

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
           RT  ++S L GA   G   +D  + +
Sbjct: 160 RTNFSKSGLDGAQFTGVRLTDVTLTM 185


>gi|218247899|ref|YP_002373270.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218168377|gb|ACK67114.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 222

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 44/74 (59%)

Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
           + +++ S +  +G  L  A   +AN TGADLS+ +M  + L+EANLT+A L    L  + 
Sbjct: 65  LLDANLSNANLSGTLLNDAKLTRANLTGADLSNAIMMGITLSEANLTDANLTHADLYNAL 124

Query: 197 LGGAIIEGADFSDA 210
           +  AI+ GA  +DA
Sbjct: 125 MSKAILSGATLTDA 138



 Score = 44.3 bits (103), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 46/83 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A  T A++  +D S +   G  L +A    AN T ADL + LM + +L+ A LT+A L  
Sbjct: 83  AKLTRANLTGADLSNAIMMGITLSEANLTDANLTHADLYNALMSKAILSGATLTDADLES 142

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
            V++ +DL  AI + A  + A++
Sbjct: 143 AVISDADLTHAIAQNAILNQAIL 165



 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 15/92 (16%)

Query: 129 RANFTSADMR----------ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           RAN T AD+           E++ + +    A L  A+  KA  +GA L+D  ++  V++
Sbjct: 87  RANLTGADLSNAIMMGITLSEANLTDANLTHADLYNALMSKAILSGATLTDADLESAVIS 146

Query: 179 EANLT-----NAVLVRTVLTRSDLGGAIIEGA 205
           +A+LT     NA+L + +L+RS+L      GA
Sbjct: 147 DADLTHAIAQNAILNQAILSRSNLSDGDFSGA 178


>gi|359685228|ref|ZP_09255229.1| hypothetical protein Lsan2_11384 [Leptospira santarosai str.
           2000030832]
          Length = 263

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 4/92 (4%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           + +S  + + +F G  F+GA L  A    ++F GA+ S   +    LN A+L N      
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNADLRNTNFRGA 210

Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
            L  + L GA +EGADF+DA+ D    L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242


>gi|194336315|ref|YP_002018109.1| pentapeptide repeat-containing protein [Pelodictyon
           phaeoclathratiforme BU-1]
 gi|194308792|gb|ACF43492.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 441

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 1/109 (0%)

Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQ   ADL +A ++    + ANF  A++ +++ S +  +GA L  A    A+ +GA L
Sbjct: 59  SGAQLNMADLNRADLNGAHLYNANFGKANLIKTNLSKANLSGATLWDANLSGADLSGAQL 118

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
              ++    L  ANLT A L    LTR++L G     A FS A +D  Q
Sbjct: 119 ICAILTNATLTGANLTEACLNSADLTRANLIGGDFTRASFSGATLDEVQ 167



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 50/89 (56%), Gaps = 6/89 (6%)

Query: 115 SADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           SADL +A  +  +F RA+F+ A + E   +G+    A+L +A  Y+++ +GA+L    ++
Sbjct: 140 SADLTRANLIGGDFTRASFSGATLDEVQLAGADLTMAFLGQAKLYRSDLSGANLCGAKLN 199

Query: 174 RMVLNEANLTNA-----VLVRTVLTRSDL 197
           R  L EANL+ A     ++  T+    DL
Sbjct: 200 RATLIEANLSKADMHGVIIWHTIFVNVDL 228



 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 68/132 (51%), Gaps = 20/132 (15%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNG 149
           +ADLN+  A+  G       A   +A+  KA  +K N  +AN + A + +++ SG+  +G
Sbjct: 65  MADLNR--ADLNG-------AHLYNANFGKANLIKTNLSKANLSGATLWDANLSGADLSG 115

Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----G 204
           A L  A+   A  TGA+L++       LN A+LT A L+    TR+   GA ++     G
Sbjct: 116 AQLICAILTNATLTGANLTEA-----CLNSADLTRANLIGGDFTRASFSGATLDEVQLAG 170

Query: 205 ADFSDAVIDLAQ 216
           AD + A +  A+
Sbjct: 171 ADLTMAFLGQAK 182



 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 63/128 (49%), Gaps = 22/128 (17%)

Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
           E  +GS  ++ +A  RKA    +  R N   AD+     SG++ N A L +     AN  
Sbjct: 5   ELLLGSVTEWNAA--RKA---HQKGRPNLKGADL-----SGAQLNKADLSRTDLVGANLR 54

Query: 164 GADLSDTLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEGADFSDAVID 213
           GADLS   ++   LN A+L  A           L++T L++++L GA +  A+ S A  D
Sbjct: 55  GADLSGAQLNMADLNRADLNGAHLYNANFGKANLIKTNLSKANLSGATLWDANLSGA--D 112

Query: 214 LAQKQALC 221
           L+  Q +C
Sbjct: 113 LSGAQLIC 120



 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 11/114 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S AQ   ADL +   V  N R A+ + A +  +D + +  NGA+L     Y ANF  A+L
Sbjct: 34  SGAQLNKADLSRTDLVGANLRGADLSGAQLNMADLNRADLNGAHL-----YNANFGKANL 88

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
             T      L++ANL+ A L    L+ +DL GA +  A  ++A +  A     C
Sbjct: 89  IKT-----NLSKANLSGATLWDANLSGADLSGAQLICAILTNATLTGANLTEAC 137


>gi|239947676|ref|ZP_04699429.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
           scapularis]
 gi|239921952|gb|EER21976.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
           scapularis]
          Length = 953

 Score = 48.9 bits (115), Expect = 0.003,   Method: Composition-based stats.
 Identities = 48/178 (26%), Positives = 77/178 (43%), Gaps = 23/178 (12%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L+N  +  + AL A     C  N+  +   N Y ++T  E    +      ADLR+A+  
Sbjct: 494 LENAFMNKTHALEAKFKEQC--NMQGITARNAYFSDTEFE----NILSLKEADLREAIMQ 547

Query: 125 KENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
           +   +           A    AD+  +  + +    A L  A   KA   G ++SD +  
Sbjct: 548 RVKLKNADLTKAKLDKAKLEYADLTNATLTNATAQFAKLSNATLEKAEAEGLNISDAIAK 607

Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYAN 225
            +   EAN  NA++ R  LT++D   A++E AD      ++A+   A  KQA  K AN
Sbjct: 608 NINAKEANFKNAIMQRADLTKADFTKAVLENADMQAMEAAEAIFKEANLKQANLKVAN 665



 Score = 42.4 bits (98), Expect = 0.21,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 4/107 (3%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A+  +A L KA    E    N + A  +  +   + F  A +++A   KA+FT A L + 
Sbjct: 584 AKLSNATLEKA----EAEGLNISDAIAKNINAKEANFKNAIMQRADLTKADFTKAVLENA 639

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
            M  M   EA    A L +  L  ++L G   EGADF  A ID A K
Sbjct: 640 DMQAMEAAEAIFKEANLKQANLKVANLAGINKEGADFDKAKIDDATK 686


>gi|218441428|ref|YP_002379757.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218174156|gb|ACK72889.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 362

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 6/102 (5%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S  Q G A+L    H+  N R A  T AD+ E+D +  K +GA L  A    AN + +DL
Sbjct: 245 SGVQLGGANL---YHI--NLRGAVLTDADLGEADLNHGKLSGADLSGAYLGNANLSYSDL 299

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
               +    L  A+L  A L    L++++L GAI+EG  F+D
Sbjct: 300 HKASLALTNLIGADLRGANLTEVNLSQANLSGAIVEGTRFAD 341


>gi|119486749|ref|ZP_01620724.1| hypothetical protein L8106_10882 [Lyngbya sp. PCC 8106]
 gi|119456042|gb|EAW37175.1| hypothetical protein L8106_10882 [Lyngbya sp. PCC 8106]
          Length = 160

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 52/98 (53%)

Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
           AD+R+ +  K   + + + AD+ E++ +G+   GA L+K +   A   GADLS   +   
Sbjct: 26  ADMRRLLDTKRCQQCDLSEADLSEAELTGADLLGANLQKTILRGAKLKGADLSSANLIEA 85

Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            L  A+L +A L  T L +++L  A +  AD   A ++
Sbjct: 86  DLTGADLRDAKLHSTTLRKANLSAANLTWADLYRAFLE 123



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
           + V T L+    A   +++  L D  + +     E  + S A+   ADL  A   K   R
Sbjct: 10  LLVLTGLSIPASAELQADMRRLLDTKRCQQCDLSEADL-SEAELTGADLLGANLQKTILR 68

Query: 130 ------ANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
                 A+ +SA++ E+D +G+     K +   L KA    AN T ADL    ++  + N
Sbjct: 69  GAKLKGADLSSANLIEADLTGADLRDAKLHSTTLRKANLSAANLTWADLYRAFLEEAIFN 128

Query: 179 EANLTNAVLVRTVLTRSDLGGAII 202
           +ANL NA L    L  ++  GA +
Sbjct: 129 DANLENANLNDAKLDGTNFCGATM 152


>gi|428299465|ref|YP_007137771.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428236009|gb|AFZ01799.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 731

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 46/85 (54%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           N TSA + +SDF+GS F+GA L      K     ANFTGAD+S  L    + + AN TNA
Sbjct: 253 NLTSAKLVDSDFTGSNFSGAKLINTDLSKTNLTNANFTGADMSGVLTTDAIASGANFTNA 312

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
            L    L++ +   A   GA+ + A
Sbjct: 313 NLSNANLSKGNFTDATFFGANLTGA 337


>gi|307155293|ref|YP_003890677.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306985521|gb|ADN17402.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 145

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+   AD+R ++ SG+    A LE A    AN  GADL+  ++    LN +NL +     
Sbjct: 51  AHLIGADLRNANLSGANLVEANLEGADLTGANLQGADLTGAMVTNASLNNSNLKDVNFTN 110

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
            +L  +D+ GA++EG +  +A I
Sbjct: 111 AMLYDADVTGALMEGLNLKNAQI 133


>gi|193083812|gb|ACF09494.1| pentapeptide repeat protein [uncultured marine crenarchaeote
           SAT1000-23-F7]
          Length = 741

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 61/110 (55%), Gaps = 14/110 (12%)

Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NFR +NFTS ++  ++F+    +GA L      +   TGADL +       L+ A+L+N 
Sbjct: 495 NFRESNFTSTNIANANFTSVNLSGADLSMKDLTENILTGADLRNA-----NLSGADLSNN 549

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA----VID---LAQKQALCKYANGTN 228
            LV T+LT +DL  AI+ GAD S A    +ID   + QK  L K AN TN
Sbjct: 550 QLVNTILTGADLTDAILSGADLSTANIFGIIDGINILQKTKL-KGANFTN 598



 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 5/80 (6%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N    D+ E+   G+   G  LEKA    +N    DLS   + ++ L ++NL+     RT
Sbjct: 605 NLIGVDISETILKGADLTGVKLEKAKVNNSNLEDLDLSFKNLSKIRLVDSNLS-----RT 659

Query: 191 VLTRSDLGGAIIEGADFSDA 210
           +L+ +DL  A + GA+ SDA
Sbjct: 660 ILSGADLSNAELMGANLSDA 679



 Score = 40.8 bits (94), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 41/82 (50%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NF   ++  S+F  S F    +  A     N +GADLS   +   +L  A+L NA L   
Sbjct: 485 NFEHINLSYSNFRESNFTSTNIANANFTSVNLSGADLSMKDLTENILTGADLRNANLSGA 544

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L+ + L   I+ GAD +DA++
Sbjct: 545 DLSNNQLVNTILTGADLTDAIL 566


>gi|434391142|ref|YP_007126089.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428262983|gb|AFZ28929.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 516

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 5/92 (5%)

Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV-----LNEANLTNAV 186
            T AD+RE++F  +   GA L +A   +ANF+ A+L+  +M +       L+EANL  A 
Sbjct: 325 LTKADLRETNFYTTNLTGANLSEANCDRANFSAANLNGAIMLQTSFRAANLSEANLKYAN 384

Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           L+   LT ++L  A +EGA+ + A +  A  Q
Sbjct: 385 LIAANLTEANLSRASLEGANLTAANLSHANLQ 416



 Score = 37.0 bits (84), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 5/84 (5%)

Query: 129 RANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
           RANF++A+     M ++ F  +  + A L+ A    AN T A+LS   ++   L  ANL+
Sbjct: 352 RANFSAANLNGAIMLQTSFRAANLSEANLKYANLIAANLTEANLSRASLEGANLTAANLS 411

Query: 184 NAVLVRTVLTRSDLGGAIIEGADF 207
           +A L  T L + +L GA +  A+ 
Sbjct: 412 HANLQNTYLNKINLSGATLIQANL 435


>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
 gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
          Length = 268

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 68/145 (46%), Gaps = 19/145 (13%)

Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S+A    ADL  A  ++ N        ANF + D  E++   ++  GAYL KA  YKAN 
Sbjct: 137 SSADLRDADLAGAKLIRSNLCFANLIAANFIAVDFSEANLYQAEVMGAYLYKANFYKANL 196

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
             A L    + R  L  A+L  A L    LT ++L GA + GA+   A +          
Sbjct: 197 HQAHLGGAYLFRANLTAADLRGADLAWANLTSANLAGANLSGANLRGANL---------- 246

Query: 223 YANGTNPITGVSTRKSLGCGNSRRN 247
             NG N + GV+ ++++   +SR +
Sbjct: 247 --NGAN-LNGVNLQETIMPDSSRHD 268



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 16/115 (13%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           SAA F  A+L ++V          T AD+  + F G+ F+GA L  A+  +AN  G D S
Sbjct: 87  SAANFSVANLSQSV---------LTHADLSHAHFIGADFSGANLRGAIVTEANLIGTDFS 137

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
                   L +A+L  A L+R+ L  ++L  A     DFS+A  +L Q + +  Y
Sbjct: 138 SA-----DLRDADLAGAKLIRSNLCFANLIAANFIAVDFSEA--NLYQAEVMGAY 185



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN-----A 185
           N    ++R ++ +G+  +   L +A+  +AN +GADLS   +    L+EANL+      A
Sbjct: 35  NLQENNLRGANLAGANLSRVDLSRALLIRANLSGADLSSANLHHAKLSEANLSAANFSVA 94

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
            L ++VLT +DL  A   GADFS A
Sbjct: 95  NLSQSVLTHADLSHAHFIGADFSGA 119


>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
 gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
          Length = 261

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/139 (33%), Positives = 61/139 (43%), Gaps = 36/139 (25%)

Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A F S  L  A   K +  A NFT AD++ +DFSG++ N A L  A+   A F  ADLS+
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATFADADLSN 174

Query: 170 T---------------LMD---------------RMVLNEAN-----LTNAVLVRTVLTR 194
                           L+D               R  L +AN     LT A L   VLT 
Sbjct: 175 ARIIGGGKGVNFRNAKLIDADLGADPANQGMAPVRAELPDANFDGADLTRANLTHAVLTG 234

Query: 195 SDLGGAIIEGADFSDAVID 213
           ++   AI+ GA F  AV+D
Sbjct: 235 ANFTAAIVSGARFDYAVLD 253


>gi|425441123|ref|ZP_18821410.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9717]
 gi|389718260|emb|CCH97767.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9717]
          Length = 262

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 55/101 (54%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
           L++ +  ++  + + + + + +S+ +G+K NGA L  A   +AN +GADLS   +     
Sbjct: 29  LQQLLSTRKCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88

Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
             ANLT A L   +LT +DL GA +  A+  +  +D A  Q
Sbjct: 89  FGANLTGANLSGAILTGADLRGAYLNNANLENTKLDTAYVQ 129



 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN   A++ +++ SG+  +GA L  A  + AN TGA+LS  ++    L  A L NA L  
Sbjct: 61  ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLEN 120

Query: 190 TVLTRSDLGGAI 201
           T L  + + GA+
Sbjct: 121 TKLDTAYVQGAV 132


>gi|300869620|ref|ZP_07114200.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
 gi|300332398|emb|CBN59400.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
          Length = 580

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 68/144 (47%), Gaps = 21/144 (14%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
           NW +F    L+ A +    S+      +N  +A+  G   +G  A+   A+L +A     
Sbjct: 103 NWAIFQEADLSGADLQRAKSD-----QINLEKAKLDGARLMG--AELMEANLNRASLAG- 154

Query: 127 NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
              AN T A++RE+  + +    A L+ A   +A+  GA          +L  ANLT A 
Sbjct: 155 ---ANLTGANLREAHLAEANLREAILKGANLIEADLNGA----------ILRSANLTEAD 201

Query: 187 LVRTVLTRSDLGGAIIEGADFSDA 210
           + R VLT +DL  A++ GAD S A
Sbjct: 202 MHRVVLTGADLTEAVLNGADLSRA 225



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 6/137 (4%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
           L  A +A  +   + L   N  EA+  G   I  +A    AD+ + V       A+ T A
Sbjct: 162 LREAHLAEANLREAILKGANLIEADLNG--AILRSANLTEADMHRVVLTG----ADLTEA 215

Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
            +  +D S +   GAYL KA   KA+   ++L +  +    L EANL  A L +  L+ +
Sbjct: 216 VLNGADLSRANLTGAYLLKASFKKAHLLRSNLQEVYLLWADLTEANLRGADLRKADLSGA 275

Query: 196 DLGGAIIEGADFSDAVI 212
            L  AI+  AD  DA++
Sbjct: 276 YLSDAILSEADLRDALL 292



 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 42/82 (51%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + A++RE D +G+   GA L  A    A   GA L    +   +L  ANL  ++L  
Sbjct: 25  ASLSGANLREIDLTGANLTGANLSWAFLSHAKLVGACLRRADLRSAMLTSANLNQSILSG 84

Query: 190 TVLTRSDLGGAIIEGADFSDAV 211
             LT+ DL  A ++ AD + A+
Sbjct: 85  ANLTKVDLRLAYLQEADLNWAI 106



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 53/100 (53%), Gaps = 6/100 (6%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
            + + A    A+L  A  +K +F+ A+   ++++E     +    A L  A   KA+ +G
Sbjct: 215 AVLNGADLSRANLTGAYLLKASFKKAHLLRSNLQEVYLLWADLTEANLRGADLRKADLSG 274

Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           A LSD      +L+EA+L +A+L+   L R++L GA + G
Sbjct: 275 AYLSDA-----ILSEADLRDALLIEAHLIRTNLEGAQLTG 309


>gi|427735760|ref|YP_007055304.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427370801|gb|AFY54757.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 263

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 42/84 (50%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RAN    D+   + S +K NGA L  A  +K    GADLSD  + R  L  A++  A L 
Sbjct: 153 RANLKGRDLSGRNLSYAKLNGANLSDAFMHKVVLRGADLSDANLFRANLLLADMKEANLQ 212

Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
              L  +DL GA + GAD   A I
Sbjct: 213 GADLIGADLSGADLRGADLRGARI 236



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 131 NFTSADMRESDFSGSKFNGAYLE------KAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
           N T    ++SD SG  F  A L+      + ++Y A   GA+LSD  M ++VL  A+L++
Sbjct: 135 NKTYQPPQQSDLSGQDFRRANLKGRDLSGRNLSY-AKLNGANLSDAFMHKVVLRGADLSD 193

Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
           A L R  L  +D+  A ++GAD 
Sbjct: 194 ANLFRANLLLADMKEANLQGADL 216


>gi|332708407|ref|ZP_08428384.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332352810|gb|EGJ32373.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 309

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 109 SAAQFGSADLRKAV-----HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
           S AQ   ADLR+A       +  N + AN  + +  E++FSG+  + A LE A      +
Sbjct: 115 SLAQLQKADLREATGKGITFINANLKMANLGAVNFPEANFSGASLDIASLEAANLMDTKW 174

Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
            GADL    + R  L  A+LT+A L+   L  +DL   I+ GA   ++ ++
Sbjct: 175 VGADLERANLSRASLVRADLTSANLIVANLRAADLTEVILRGAQLLESSLE 225



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 15/102 (14%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEK----AVAY-KANFTGADL------SDTLMD-RMV- 176
           A    AD+RE+   G  F  A L+     AV + +ANF+GA L      +  LMD + V 
Sbjct: 117 AQLQKADLREATGKGITFINANLKMANLGAVNFPEANFSGASLDIASLEAANLMDTKWVG 176

Query: 177 --LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
             L  ANL+ A LVR  LT ++L  A +  AD ++ ++  AQ
Sbjct: 177 ADLERANLSRASLVRADLTSANLIVANLRAADLTEVILRGAQ 218


>gi|167826694|ref|ZP_02458165.1| pentapeptide repeat family protein [Burkholderia pseudomallei 9]
          Length = 326

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 13  ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 67

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 68  ARLTAANLSLAHCERTDFS 86


>gi|428317459|ref|YP_007115341.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428241139|gb|AFZ06925.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 197

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 19/136 (13%)

Query: 89  SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKF 147
           S LAD N  +A   G       A    A+L++AV ++ N R A+ + AD+R +DF  +  
Sbjct: 29  SDLADANLSQANLSG-------ANLVGANLQRAV-LRANLRGADLSGADLRGADFRNADL 80

Query: 148 NGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG---- 198
            GA    A+   A+F     TGA + +  +  + L  A+L  A L R +L  +DL     
Sbjct: 81  RGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHSADLSHANL 140

Query: 199 -GAIIEGADFSDAVID 213
            GA + GAD  +A+++
Sbjct: 141 SGADLSGADLEEAILN 156



 Score = 37.0 bits (84), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 53/114 (46%), Gaps = 21/114 (18%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 159
           A F +ADLR A       R A+F  A          D+   D  G+   GA L +A+ + 
Sbjct: 73  ADFRNADLRGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHS 132

Query: 160 ANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
           A+ +          GADL + +++  VL  ANLT A L+   + ++   GA+++
Sbjct: 133 ADLSHANLSGADLSGADLEEAILNGAVLRGANLTGANLLCATIEQTLWDGALLD 186


>gi|443476936|ref|ZP_21066816.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443018029|gb|ELS32353.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 180

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 66/148 (44%), Gaps = 21/148 (14%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    ADL  A  +     AN T A++ E++  G+   GA    A    AN T ++ S++
Sbjct: 35  ATLNKADLSSANLID----ANLTGANLIETNLRGAMLRGANFADADLSWANLTWSNSSNS 90

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-----------------ID 213
              R  L+ AN + A L+    T + L GA + G +   A+                 +D
Sbjct: 91  RFVRSNLSVANFSGANLIEADFTGAILKGANLRGTNLRGAMLKNLRTCADTDFTGVRNLD 150

Query: 214 LAQKQALCKYANGTNPITGVSTRKSLGC 241
              +  LC  A+GT+P T   +R++LGC
Sbjct: 151 ERMRLYLCTVASGTHPFTKNDSRQTLGC 178


>gi|167907368|ref|ZP_02494573.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
          Length = 269

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/97 (40%), Positives = 53/97 (54%), Gaps = 6/97 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + AD+R +D SG+   GA L  A    AN +GADLSD       L  A+L++A L  
Sbjct: 39  ADLSDADLRGADLSGADLCGANLSGADLCGANLSGADLSDA-----DLRGADLSDADLRG 93

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
             L+ ++L GA + GAD SDA +  A    A   YAN
Sbjct: 94  ADLSVANLSGANLSGADLSDADLSGANLSGAYLSYAN 130



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 61/120 (50%), Gaps = 26/120 (21%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF 142
           C +N+S  ADL+  +A+ RG       A    ADLR A     N   AN + AD+ ++D 
Sbjct: 67  CGANLSG-ADLS--DADLRG-------ADLSDADLRGADLSVANLSGANLSGADLSDADL 116

Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           SG+  +GAYL       AN +GA+LSD          ANL+ A L    L+ +DL GA +
Sbjct: 117 SGANLSGAYLS-----YANLSGANLSD----------ANLSGANLRGADLSGADLSGAYL 161



 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + AD+R +D S +   GA L  A    AN +GADLSD  +    L+ A L+ A L  
Sbjct: 74  ADLSDADLRGADLSDADLRGADLSVANLSGANLSGADLSDADLSGANLSGAYLSYANLSG 133

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+ ++L GA + GAD S A
Sbjct: 134 ANLSDANLSGANLRGADLSGA 154



 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 52/112 (46%), Gaps = 19/112 (16%)

Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
           S A    ADLR A         + + AD+  ++ SG+   GA L  A    A+  GADLS
Sbjct: 37  SGADLSDADLRGA---------DLSGADLCGANLSGADLCGANLSGADLSDADLRGADLS 87

Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA----------DFSDA 210
           D  +    L+ ANL+ A L    L+ +DL GA + GA          + SDA
Sbjct: 88  DADLRGADLSVANLSGANLSGADLSDADLSGANLSGAYLSYANLSGANLSDA 139


>gi|220910596|ref|YP_002485907.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219867207|gb|ACL47546.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 449

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 53/100 (53%), Gaps = 4/100 (4%)

Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           A    A+LR+     E F AN + AD+ E++ S ++  GA L ++   +AN + A LS  
Sbjct: 320 ANLSGANLREV----ELFEANLSRADLLEANLSRARLTGANLSRSTLSEANLSRATLSGA 375

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
            ++R  L+   L    L    L+R+DLG A + GA+ S A
Sbjct: 376 HLNRATLSGGTLYKVDLSGVNLSRADLGDANLSGANLSRA 415



 Score = 43.9 bits (102), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 55/107 (51%), Gaps = 8/107 (7%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A    A+L +A     N  R+  + A++  +  SG+  N A L     YK + +G +L
Sbjct: 338 SRADLLEANLSRARLTGANLSRSTLSEANLSRATLSGAHLNRATLSGGTLYKVDLSGVNL 397

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
           S     R  L +ANL+ A L R  L+R++L  A + GA+ S+  +DL
Sbjct: 398 S-----RADLGDANLSGANLSRADLSRANLTAADLSGANLSE--VDL 437



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 45/81 (55%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++   D S  K + A L KA    A+   ADLS+  +  + L+ ANL  A L  
Sbjct: 190 ANLSGANLSRVDLSEVKLSQANLTKANLSGADLDKADLSNLELIEVDLSGANLAGANLSS 249

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
           T L+R+DL GA + GA+ + A
Sbjct: 250 TNLSRADLSGANLRGANLARA 270



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 53/104 (50%), Gaps = 15/104 (14%)

Query: 119 RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           R A   ++  RA  T+A++  +D  G   + A LE A     N +GA L+D L++   L 
Sbjct: 9   RYAAGERDFHRAELTNAELITADLKGINLSRADLEWA-----NLSGAKLNDALLNGAELV 63

Query: 179 EANLTN-----AVLVRTVLTRSD-----LGGAIIEGADFSDAVI 212
            ANL N     A L+   L+RSD     LGGA +  AD S+A +
Sbjct: 64  NANLINVDLSGASLIGINLSRSDLSWANLGGANLSRADLSEATL 107



 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 6/103 (5%)

Query: 112 QFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
            F  A+L  A  +  + +  N + AD+  ++ SG+K N A L  A    AN    DLS  
Sbjct: 16  DFHRAELTNAELITADLKGINLSRADLEWANLSGAKLNDALLNGAELVNANLINVDLSGA 75

Query: 171 LM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
            +      R  L+ ANL  A L R  L+ + L GA + GAD S
Sbjct: 76  SLIGINLSRSDLSWANLGGANLSRADLSEATLRGADLRGADLS 118



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 11/128 (8%)

Query: 92  ADLNK---YEAETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKF 147
           ADL++    EA+ RG   I +      A+LR A +   E   AN    D+ E+D SG+  
Sbjct: 115 ADLSRVEMIEADLRGL--ILNGVNLRGANLRGANLSGTELTYANLGRVDLIEADLSGANL 172

Query: 148 NGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
           +GA L      +     AN +GA+LS   +  + L++ANLT A L    L ++DL    +
Sbjct: 173 SGATLCGANLSRVNLSNANLSGANLSRVDLSEVKLSQANLTKANLSGADLDKADLSNLEL 232

Query: 203 EGADFSDA 210
              D S A
Sbjct: 233 IEVDLSGA 240



 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 41/81 (50%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN +S ++  +D SG+   GA L +A     N  GA+L+  ++    L   +L+ A L  
Sbjct: 245 ANLSSTNLSRADLSGANLRGANLARAKLIGTNLRGANLTGAILTGANLEGTDLSQADLRS 304

Query: 190 TVLTRSDLGGAIIEGADFSDA 210
             L+   L G I+ GA+ S A
Sbjct: 305 ANLSGLILNGTILRGANLSGA 325



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 28/133 (21%)

Query: 98  EAETRGEFGIGSAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAY 151
           EA  RG       A    ADL +   ++ + R       N   A++R ++ SG++   A 
Sbjct: 104 EATLRG-------ADLRGADLSRVEMIEADLRGLILNGVNLRGANLRGANLSGTELTYAN 156

Query: 152 LEKAVAYKANFTGADLSD-TL----MDRMVLNEANLTNAVLVRT----------VLTRSD 196
           L +    +A+ +GA+LS  TL    + R+ L+ ANL+ A L R            LT+++
Sbjct: 157 LGRVDLIEADLSGANLSGATLCGANLSRVNLSNANLSGANLSRVDLSEVKLSQANLTKAN 216

Query: 197 LGGAIIEGADFSD 209
           L GA ++ AD S+
Sbjct: 217 LSGADLDKADLSN 229



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 54/125 (43%), Gaps = 21/125 (16%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFT---------------SADMRESDFSGSKFNGAYL 152
           S A    A+L +A  +  N R AN T                AD+R ++ SG   NG  L
Sbjct: 258 SGANLRGANLARAKLIGTNLRGANLTGAILTGANLEGTDLSQADLRSANLSGLILNGTIL 317

Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADF 207
             A    AN    +L +  + R  L EAN     LT A L R+ L+ ++L  A + GA  
Sbjct: 318 RGANLSGANLREVELFEANLSRADLLEANLSRARLTGANLSRSTLSEANLSRATLSGAHL 377

Query: 208 SDAVI 212
           + A +
Sbjct: 378 NRATL 382



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 9/112 (8%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A     DL +    + N  +AN + AD+ ++D S    N   +E  ++  AN  GA+L
Sbjct: 193 SGANLSRVDLSEVKLSQANLTKANLSGADLDKADLS----NLELIEVDLS-GANLAGANL 247

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 216
           S T + R  L+ ANL  A L R  L  ++L GA + GA  + A +   DL+Q
Sbjct: 248 SSTNLSRADLSGANLRGANLARAKLIGTNLRGANLTGAILTGANLEGTDLSQ 299


>gi|300867247|ref|ZP_07111907.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334724|emb|CBN57073.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 520

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 48/83 (57%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           AN + A++  ++ +G+K N A L  A   +AN T ADL+   + R+ L  A L  A L+R
Sbjct: 45  ANLSGANLCGANLTGAKLNIARLSGAHLGEANLTDADLNVAYLVRVDLKGAILIRAKLIR 104

Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
             L R++L GA + GA+ S A +
Sbjct: 105 AELIRAELSGANLSGANLSGATL 127



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 6/102 (5%)

Query: 117 DLRKAVHVK------ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
           DL+ A+ ++      E  RA  + A++  ++ SG+    A L KA   +AN  GA LS  
Sbjct: 91  DLKGAILIRAKLIRAELIRAELSGANLSGANLSGATLTEATLRKADLTQANLRGAHLSGA 150

Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
            +   +L EAN   A L R  L+ +DL G+ +  A+ + A++
Sbjct: 151 SLTEALLVEANFQGADLSRADLSHADLRGSELRQANLTQAIL 192



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 21/142 (14%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 164
           A    A L +A+ V+ NF+ A+ + AD+  +D  GS+   A L +A+   A+ +G     
Sbjct: 145 AHLSGASLTEALLVEANFQGADLSRADLSHADLRGSELRQANLTQAILSGADLSGVNLRW 204

Query: 165 ----------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSD 209
                     ADLS+  +    L+ A+L NA L+ T L  +DL  A +      GAD + 
Sbjct: 205 AILSGCNLRWADLSEAKLSGADLSRADLCNANLLNTSLVHADLSNAYLIKADWVGADLTG 264

Query: 210 AVIDLAQKQALCKYANGTNPIT 231
           A +  A+  A+ +    T  +T
Sbjct: 265 ATLTGAKLHAVSRLGIKTEGMT 286



 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 10/80 (12%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            A    AD+ +++  G+  +GA L +A+  +ANF GADLS           A+L++A L 
Sbjct: 129 EATLRKADLTQANLRGAHLSGASLTEALLVEANFQGADLS----------RADLSHADLR 178

Query: 189 RTVLTRSDLGGAIIEGADFS 208
            + L +++L  AI+ GAD S
Sbjct: 179 GSELRQANLTQAILSGADLS 198



 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 38/74 (51%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
            AN +  ++   + SG+  + A L  A    AN TGA L+   +    L EANLT+A L 
Sbjct: 24  EANLSGVNLSGINLSGANLSVANLSGANLCGANLTGAKLNIARLSGAHLGEANLTDADLN 83

Query: 189 RTVLTRSDLGGAII 202
              L R DL GAI+
Sbjct: 84  VAYLVRVDLKGAIL 97


>gi|113475153|ref|YP_721214.1| periplasmic binding protein/LacI transcriptional regulator
           [Trichodesmium erythraeum IMS101]
 gi|110166201|gb|ABG50741.1| periplasmic binding protein/LacI transcriptional regulator
           [Trichodesmium erythraeum IMS101]
          Length = 525

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 56/101 (55%), Gaps = 4/101 (3%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           +N T A++  ++ +G    G+ L+ A    AN  GA+L D  ++   L  ANL  A+L  
Sbjct: 31  SNLTGANLSGANLAGINLQGSNLQGANLVNANLEGANLKDVNLEGANLARANLKKAILQN 90

Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
           + L  S+L G+ ++ ADFS+A  +L   +AL  +AN  N I
Sbjct: 91  SNLDNSNLYGSDLQAADFSEA--NLVNMKAL--WANFHNAI 127


>gi|399073585|ref|ZP_10750574.1| putative low-complexity protein [Caulobacter sp. AP07]
 gi|398041367|gb|EJL34432.1| putative low-complexity protein [Caulobacter sp. AP07]
          Length = 313

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 6/97 (6%)

Query: 116 ADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           ADLR A     NF      RA+   A+MR ++F G+ F  A L    +  ANF GADL+ 
Sbjct: 69  ADLRGASFFGSNFTGADLSRADLRGAEMRGANFVGAIFTDAKLSGIESSGANFQGADLAR 128

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
             +    L+ AN   A L +  L+ S+L GA ++G +
Sbjct: 129 VDLSSSELHGANFIGANLEKANLSSSELVGANLQGVN 165



 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 43/87 (49%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGADLSDTLMDRMVLNEANLT 183
           +AN   AD+R + F GS F GA L +     A    ANF GA  +D  +  +  + AN  
Sbjct: 63  KANLMGADLRGASFFGSNFTGADLSRADLRGAEMRGANFVGAIFTDAKLSGIESSGANFQ 122

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
            A L R  L+ S+L GA   GA+   A
Sbjct: 123 GADLARVDLSSSELHGANFIGANLEKA 149



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 50/103 (48%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A F  ADL R  +   E   ANF  A++ +++ S S+  GA L+   A  A+F  A+L
Sbjct: 117 SGANFQGADLARVDLSSSELHGANFIGANLEKANLSSSELVGANLQGVNARYASFQSAEL 176

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           + + M       AN  NA    + L  +  GGA   GAD S A
Sbjct: 177 NASNMIGGNFERANFRNAEFPGSTLRGAIFGGADFHGADLSGA 219


>gi|302522367|ref|ZP_07274709.1| OxyO [Streptomyces sp. SPB78]
 gi|302431262|gb|EFL03078.1| OxyO [Streptomyces sp. SPB78]
          Length = 233

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLE---------------KAVAYKANFTGADLSDTLMDR 174
           AN T A+++ S  S +  N A+L                KA  ++A+ T AD+S   +  
Sbjct: 93  ANLTDANLKYSSLSSTHLNEAWLSHSVLSHASLSLADLSKANLHEADLTKADVSGANLSE 152

Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
             L  A +TNA   RT L+ ++L GA + GAD S  V +L QKQ      N T  +    
Sbjct: 153 ADLAGAKMTNANFFRTNLSGAELTGADLSGADLS-TVKNLTQKQVSSARTNRTTRLPSGL 211

Query: 235 TRKS 238
           TR S
Sbjct: 212 TRAS 215


>gi|307150734|ref|YP_003886118.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306980962|gb|ADN12843.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 231

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 16/105 (15%)

Query: 109 SAAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A F  +D R +   K NF A  F  AD  E+   G+ F+ A LEKA+  + + +GA  
Sbjct: 33  SRADFSYSDFRSSRLGKTNFSAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGA-- 90

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADF 207
                   +L EANLT   L++  L     + + L GAI+  ADF
Sbjct: 91  --------ILTEANLTQVNLIKATLGGANLSLAQLPGAIVYEADF 127



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 13/97 (13%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV---LNEANLTNA 185
           R N T A++  ++ S +K NGA L +A    A    ADLS  +    +   L+EANL NA
Sbjct: 134 RTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCRADLSKGIWQNCLPTDLSEANLQNA 193

Query: 186 ----------VLVRTVLTRSDLGGAIIEGADFSDAVI 212
                     +L    LT +DL G I+   D + A++
Sbjct: 194 DLSYADLSGAILCYADLTGADLTGTILTNVDLTGAIL 230



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 3/117 (2%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           SAA F  AD  +A+    +F +AN   A +RE D SG+    A L +    KA   GA+L
Sbjct: 53  SAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGAILTEANLTQVNLIKATLGGANL 112

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCK 222
           S   +   ++ EA+       RT LT+++L  A +  A  + A +  AQ     LC+
Sbjct: 113 SLAQLPGAIVYEADFRPTSEQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCR 169



 Score = 37.0 bits (84), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)

Query: 113 FGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF---- 162
           F  A+L KA+  + +        AN T  ++ ++   G+  + A L  A+ Y+A+F    
Sbjct: 72  FSKANLEKAILREVDLSGAILTEANLTQVNLIKATLGGANLSLAQLPGAIVYEADFRPTS 131

Query: 163 ------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG---ADFSDA 210
                 T A+LS   +    LN ANL  A L+   L R+DL   I +     D S+A
Sbjct: 132 EQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCRADLSKGIWQNCLPTDLSEA 188


>gi|443327376|ref|ZP_21056002.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442792998|gb|ELS02459.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 187

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 55/102 (53%), Gaps = 1/102 (0%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F  A+L KAV  +      NF+ +D+ E+D   + F+ +   KA  +K+    A+L+  +
Sbjct: 47  FTGANLGKAVFYRTVVELGNFSQSDLGEADLREANFSQSLFYKASLFKSQLQKANLNQVI 106

Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
             R    +ANL +AVL    L +++L  A + GAD S+A ++
Sbjct: 107 AIRAFFRDANLNHAVLTSANLQQANLTNADLRGADLSNANLE 148


>gi|427416432|ref|ZP_18906615.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425759145|gb|EKU99997.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 237

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 54/106 (50%), Gaps = 16/106 (15%)

Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
           F + +L +++    + R AN   A +RESD S +    A LEKA   KA+  GA+LSD  
Sbjct: 57  FENCNLSESILWGSDLRNANLKQAQLRESDLSSALLTQANLEKANLIKASLCGANLSD-- 114

Query: 172 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 212
                   ANL NA L+   L      R+DLG + + GAD S A +
Sbjct: 115 --------ANLANACLLDADLRSNSDQRTDLGQSNLSGADLSYAFL 152



 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 9/112 (8%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A F  +DLR++   + +F R NF + ++ ES   GS    A L++A         +DL
Sbjct: 33  SDANFSQSDLRQSRLGRTHFCRVNFENCNLSESILWGSDLRNANLKQA-----QLRESDL 87

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---SDAVIDLAQ 216
           S  L+ +  L +ANL  A L    L+ ++L  A +  AD    SD   DL Q
Sbjct: 88  SSALLTQANLEKANLIKASLCGANLSDANLANACLLDADLRSNSDQRTDLGQ 139


>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 256

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 52/104 (50%), Gaps = 10/104 (9%)

Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-SDTL----- 171
           +R+ +  K+       +A +  +D SG+   GA LE A   +AN TGADL S  L     
Sbjct: 29  IRQLLATKKCQNCQLINAGLALADLSGADLRGANLEGANLSRANLTGADLRSANLAGASL 88

Query: 172 ----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
               + R  LNEANLT A L  T L   +L  A + GA+F  AV
Sbjct: 89  FGVNLSRAKLNEANLTGADLRNTYLMNIELTNANLNGANFQGAV 132


>gi|440683010|ref|YP_007157805.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
           cylindrica PCC 7122]
 gi|428680129|gb|AFZ58895.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
           cylindrica PCC 7122]
          Length = 535

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           N +   ++ +D SG+ F+ A L++      N  GA+L +T   R  L +ANL +A L + 
Sbjct: 415 NISMLSLQGADLSGTNFHHAQLKQT-----NLQGANLQNTDFGRASLMQANLRDANLTKA 469

Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
            L+ +DL GA + GAD S A +  A
Sbjct: 470 YLSNADLEGADLRGADLSYAYMSQA 494



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 5/101 (4%)

Query: 131 NFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
           NF  A +++++  G     + F  A L +A    AN T A LS+  ++   L  A+L+ A
Sbjct: 430 NFHHAQLKQTNLQGANLQNTDFGRASLMQANLRDANLTKAYLSNADLEGADLRGADLSYA 489

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
            + +  L  ++L GA + GA  +D  I LA+   L    NG
Sbjct: 490 YMSQANLRGANLCGANLTGAKVTDEQIALAKTNWLTVRPNG 530


>gi|387129013|ref|YP_006291903.1| Pentapeptide repeat protein [Methylophaga sp. JAM7]
 gi|386270302|gb|AFJ01216.1| Pentapeptide repeat protein [Methylophaga sp. JAM7]
          Length = 153

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 43/79 (54%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ + + +++ +   S+  GA+   +  ++AN  GADL   L+D  +LN ANL N  L  
Sbjct: 51  ADLSESKLQKINLQNSQLQGAWFTHSKMHEANLEGADLQGALLDYTLLNHANLKNTNLDN 110

Query: 190 TVLTRSDLGGAIIEGADFS 208
             +  S+L GA + GA  +
Sbjct: 111 AQMIFSNLTGADLSGASMN 129


>gi|302039057|ref|YP_003799379.1| hypothetical protein NIDE3778 [Candidatus Nitrospira defluvii]
 gi|300607121|emb|CBK43454.1| conserved exported protein of unknown function [Candidatus
           Nitrospira defluvii]
          Length = 476

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 26/129 (20%)

Query: 111 AQFGSADLRKAVHVKENF--------------------------RANFTSADMRESDFSG 144
           A    ADLRKA+ VK +                           RA+F  AD++ +D S 
Sbjct: 133 ANLEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSN 192

Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
           +     Y   A   K N T ADL+ T + R  L +ANL  A L   +L  ++L GA +  
Sbjct: 193 ATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLDSANLDGASLIE 252

Query: 205 ADFSDAVID 213
           AD   A +D
Sbjct: 253 ADLESAYLD 261



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 33/138 (23%)

Query: 89  SALADLNKYEAETRG-EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF 147
           ++LA+ + +EA  RG +F        G A+L+         R N  +A+M  +    S+ 
Sbjct: 263 ASLANADLHEASLRGADFRF---THLGGANLQ---------RVNLENANMEGATLVKSRL 310

Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG--------- 198
           + A L   V YKAN + A+          L+ ANL +AVL+ T L R+DL          
Sbjct: 311 DSATLTMTVLYKANLSAAN----------LHGANLHHAVLIGTQLARADLRKADLTEIYG 360

Query: 199 -GAIIEGADFSDAVIDLA 215
             A ++ A  S+A ++LA
Sbjct: 361 PNAHLQQARLSEANLELA 378



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 46/103 (44%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           SAA    A+L  AV +     RA+   AD+ E     +    A L +A    AN   ADL
Sbjct: 326 SAANLHGANLHHAVLIGTQLARADLRKADLTEIYGPNAHLQQARLSEANLELANLVAADL 385

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   +   V+ + NL    L    L+ SDL GA++  AD   A
Sbjct: 386 SQADISHAVVVQTNLQETNLRGANLSASDLTGALLNNADLGQA 428



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 1/98 (1%)

Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
           A    ADL  A  +   F  AN +  ++ ++D +G+      L +A   +AN  GA L  
Sbjct: 183 ADLQGADLSNATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLDS 242

Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
             +D   L EA+L +A L    L  +DL  A + GADF
Sbjct: 243 ANLDGASLIEADLESAYLDDASLANADLHEASLRGADF 280



 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 16/121 (13%)

Query: 108 GSAAQFGSADLRKAVHVKENF-RANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 161
           G  A     DLR+   V  N  R N   A     ++R +    +   GA   +AV   AN
Sbjct: 75  GRRANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDAN 134

Query: 162 FTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
             GADL   L+ +  LN           ANL  A+    +L R+    A ++GAD S+A 
Sbjct: 135 LEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSNAT 194

Query: 212 I 212
           +
Sbjct: 195 L 195



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 129 RANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
           RAN    D+R+           +  G+   G+ L  A   +A+  GAD S  ++D   L 
Sbjct: 77  RANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDANLE 136

Query: 179 EANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVIDLAQ 216
            A+L  A+LV+  L R     +   GA ++GA F +A+++ A 
Sbjct: 137 GADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAH 179



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 54/128 (42%), Gaps = 26/128 (20%)

Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
           A     DLR+    + N R           AN   A + E+D   +  + A L  A  ++
Sbjct: 213 ADLAGTDLRRTNLRQANLRRANLQGALLDSANLDGASLIEADLESAYLDDASLANADLHE 272

Query: 160 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVR----------TVLTRSDLGGAIIEG 204
           A+  GAD   T      + R+ L  AN+  A LV+          TVL +++L  A + G
Sbjct: 273 ASLRGADFRFTHLGGANLQRVNLENANMEGATLVKSRLDSATLTMTVLYKANLSAANLHG 332

Query: 205 ADFSDAVI 212
           A+   AV+
Sbjct: 333 ANLHHAVL 340


>gi|167722130|ref|ZP_02405366.1| pentapeptide repeat family protein [Burkholderia pseudomallei DM98]
          Length = 323

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
           A+ T AD+   D  G++  GA LE A    A+ TGADLS     R VL  A+LT A LV 
Sbjct: 10  ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 64

Query: 190 TVLTRSDLGGAIIEGADFS 208
             LT ++L  A  E  DFS
Sbjct: 65  ARLTAANLSLAHCERTDFS 83


>gi|119490887|ref|ZP_01623170.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119453705|gb|EAW34864.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 517

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 11/116 (9%)

Query: 108 GSAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAV 156
           G++     ADLR+A  VK N            + N T AD+R+++ SG+    A L  A 
Sbjct: 157 GASTNLQRADLRRANLVKANLPKADFSHAEMRQTNLTYADLRQANLSGANLRWADLRGAN 216

Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
              A+ +GA+LS   +    L+ A L  A LV   LT+++L  A   GAD S A +
Sbjct: 217 LLGADLSGANLSGANLSGANLSRATLAKASLVHVDLTQANLIKADWMGADISGATL 272



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           + A    ADLR+A   + NF +AN + A++R    + +    A L +A   KAN   AD 
Sbjct: 123 TKANLNGADLREARVGQANFSQANLSGANLRGVSGASTNLQRADLRRANLVKANLPKADF 182

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
           S   M +  L  A+L  A L    L  +DL GA + GAD S A
Sbjct: 183 SHAEMRQTNLTYADLRQANLSGANLRWADLRGANLLGADLSGA 225



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 49/87 (56%), Gaps = 5/87 (5%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLT 183
           +AN + A +  ++ SG+  +G  L +A        +AN TGA+LS   ++   L  A+L+
Sbjct: 39  QANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGANLSRATLNVANLVRADLS 98

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
           +A+LV T+  RS+L  A +  A+ + A
Sbjct: 99  DAILVETLAIRSELIRARLNNANLTKA 125



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 43/82 (52%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
           NFT  ++ E++ S    + A L  A     N +GA+LS   + R  LN + L+ A L   
Sbjct: 21  NFTGINLNEANLSRINLSQANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGA 80

Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
            L+R+ L  A +  AD SDA++
Sbjct: 81  NLSRATLNVANLVRADLSDAIL 102



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 41/85 (48%), Gaps = 5/85 (5%)

Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMDRMVLNEANLTNA 185
           N + A++  S  S +   GA L +A    AN   ADLSD     TL  R  L  A L NA
Sbjct: 61  NLSRANLNVSRLSQANLTGANLSRATLNVANLVRADLSDAILVETLAIRSELIRARLNNA 120

Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
            L +  L  +DL  A +  A+FS A
Sbjct: 121 NLTKANLNGADLREARVGQANFSQA 145


>gi|158335891|ref|YP_001517065.1| hypothetical protein AM1_2749 [Acaryochloris marina MBIC11017]
 gi|158306132|gb|ABW27749.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 1055

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 17/110 (15%)

Query: 109  SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
            S+A   SA+L +A  ++ N            RAN  SAD+R ++ S +  + A L +A  
Sbjct: 899  SSANLSSANLIRANLIRANLSSADLSSANLIRANLRSADLRSANLSSANLSSANLIRANL 958

Query: 158  YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS-DLGGAIIEGAD 206
             +AN + ADLS   + R     ANL+N  L+RTVL+ + +L    +EG D
Sbjct: 959  IRANLSSADLSSANLIR-----ANLSNTFLIRTVLSDAQNLTSDQLEGVD 1003



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 3/103 (2%)

Query: 99  AETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
           A+T G + I   A   SADLR A +   +  RA+ +SAD+  ++ S +  + A L  A  
Sbjct: 826 AKTVGPYLI--RADLRSADLRSADLSSADLIRADLSSADLSSANLSSANLSSANLSSANL 883

Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
             AN   A+LS   +    L+ ANL  A L+R  L+ +DL  A
Sbjct: 884 SSANLIRANLSSADLSSANLSSANLIRANLIRANLSSADLSSA 926


>gi|411120639|ref|ZP_11393011.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410709308|gb|EKQ66823.1| putative low-complexity protein [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 181

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 129 RANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
            AN T+AD+ ++D SGS  +     GA L +A    AN  GADL    + R  L  ANL 
Sbjct: 59  HANLTNADLSQADLSGSNLSDVNLIGADLSQASLVGANLVGADLRSADLHRADLRGANLQ 118

Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSD 209
           +A L    L+++ L GA + G D +D
Sbjct: 119 DADLNGANLSQTALAGANLAGVDLTD 144



 Score = 37.0 bits (84), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 50/109 (45%), Gaps = 13/109 (11%)

Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
           S A   +ADL +A     N    N   AD+ ++   G+   GA L  A  ++A+  GA+L
Sbjct: 58  SHANLTNADLSQADLSGSNLSDVNLIGADLSQASLVGANLVGADLRSADLHRADLRGANL 117

Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
            D          A+L  A L +T L  ++L G  +   D  D  +DL++
Sbjct: 118 QD----------ADLNGANLSQTALAGANLAGVDLTDVDMQD--VDLSE 154


>gi|358461868|ref|ZP_09172018.1| pentapeptide repeat protein [Frankia sp. CN3]
 gi|357072553|gb|EHI82089.1| pentapeptide repeat protein [Frankia sp. CN3]
          Length = 376

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 15/86 (17%)

Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN----- 184
           AN T+AD+ ++D S ++ +GA L  A   +A+ + A+L          NEANLTN     
Sbjct: 252 ANLTNADLYQADLSFARLHGANLTSARLERADLSTAEL----------NEANLTNGQLHE 301

Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
           AVL   VL  ++L GA + GA+ +DA
Sbjct: 302 AVLYSAVLHGANLTGARLHGANLTDA 327



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 16/90 (17%)

Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
           RA+ ++A++ E++ +  + + A L  AV + AN TGA           L+ ANLT+A   
Sbjct: 281 RADLSTAELNEANLTNGQLHEAVLYSAVLHGANLTGAR----------LHGANLTDAQPY 330

Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
           R  LT     GA + G D S  V++L Q+Q
Sbjct: 331 RANLT-----GAQLHGVDLSR-VVNLTQEQ 354


>gi|354565480|ref|ZP_08984655.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353549439|gb|EHC18881.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 182

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 73/161 (45%), Gaps = 20/161 (12%)

Query: 94  LNKYEAETRGEFGIG-----------SAAQFGSADLRKAVHVKENFRA-NFTSADMRESD 141
           L++YE   R   G+            S A F  ADL  A   + N    NF+ A++ ++D
Sbjct: 7   LSRYETGERDFVGVNLHKVNLREVDLSGANFCGADLSGADLSQANLSGCNFSRANLTDAD 66

Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
            + +  NGA L      + N  GADL +  ++   L+ A+L  A LVR  LT+++L  A 
Sbjct: 67  LTRADLNGANLS-----EINLIGADLINANLEGTNLSRADLRGANLVRANLTKANLSEAE 121

Query: 202 IEGADFSDAVI---DLAQKQALCKYANGTNPITGVSTRKSL 239
           + GAD S A +   +L +        NG N      T K +
Sbjct: 122 LSGADLSGANLNQANLIETNLNEAELNGVNITGATVTEKEM 162


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.128    0.372 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,277,805,160
Number of Sequences: 23463169
Number of extensions: 166556190
Number of successful extensions: 463396
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3648
Number of HSP's successfully gapped in prelim test: 727
Number of HSP's that attempted gapping in prelim test: 397437
Number of HSP's gapped (non-prelim): 36834
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)