BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023545
(281 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
Length = 280
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 223/282 (79%), Positives = 239/282 (84%), Gaps = 3/282 (1%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA +SISPLSIKS+N SSS+ PY L + SKP + CQ++ TE + + DCS +
Sbjct: 1 MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLA--TEREDRILDCSTTRYKV 58
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
++K KNWR VSTALAAA + + A ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 59 HHSKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRK 118
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 119 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 178
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTN ITGVSTRKSL
Sbjct: 179 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNSITGVSTRKSL 238
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCD TGLCDAK
Sbjct: 239 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280
>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
Length = 275
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 214/282 (75%), Positives = 236/282 (83%), Gaps = 8/282 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA +SIS +SIKS N + P+++ +LSKP +A Q+ TE QF DCS N
Sbjct: 1 MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQL--DTERGNQFADCSKNGYEV 53
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
AK KNW VST L AA ++ S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54 ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVH+ ENFR ANFT+ADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 173
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
+NLTNAVLVR+VLTRSDLGGA+I GADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 174 SNLTNAVLVRSVLTRSDLGGALIAGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 233
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGNSRRNAYG+PSSPLLSAPPQKLLDRDGFCD GTGLCDAK
Sbjct: 234 GCGNSRRNAYGTPSSPLLSAPPQKLLDRDGFCDQGTGLCDAK 275
>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
Length = 261
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 217/282 (76%), Positives = 230/282 (81%), Gaps = 22/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L + SKP V C+I + G + C N
Sbjct: 1 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 43 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 159
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNAVL RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 160 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 219
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGNSRR+AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 220 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 261
>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
vinifera]
Length = 596
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 217/282 (76%), Positives = 230/282 (81%), Gaps = 22/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L + SKP V C+I + G + C N
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 377
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 378 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 435 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 494
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNAVL RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 495 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 554
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGNSRR+AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 555 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 596
Score = 186 bits (473), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 109/179 (60%), Positives = 121/179 (67%), Gaps = 20/179 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L +LSKP V C+I + E NN
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43 ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101
Query: 121 AVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
AVHV ENF RANFTSADMRESDFSGS FNG YLEKAVAYKA+ TG D +MVL+
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTGPDAPHARPYKMVLH 160
>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
Length = 279
Score = 399 bits (1025), Expect = e-109, Method: Compositional matrix adjust.
Identities = 205/281 (72%), Positives = 229/281 (81%), Gaps = 5/281 (1%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSSIS LS+K L SS S+ P L K + + QI+ + + Q DCS + G
Sbjct: 1 MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKD---QTQDCSERKHIG 56
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ K W+ VSTALAAA V SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRK
Sbjct: 57 KITEPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRK 116
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVH+ ENFR ANFTSADMRESDFSG FNGAYLEKAVAYK NF+GADLSDTLMDRMVLNE
Sbjct: 117 AVHINENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNE 176
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
AN TNAVLVR+VLTRSDLGGAII GADFSDAVIDL QKQALCKYA+GTNP+TGVSTR SL
Sbjct: 177 ANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASL 236
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 280
GCGNSRRNAYG+PSSPLLSAPPQ+LLDRDGFCD TGLC+A
Sbjct: 237 GCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQDTGLCEA 277
>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
Length = 273
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 207/287 (72%), Positives = 230/287 (80%), Gaps = 25/287 (8%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPDCS 54
MAL+S+SPLSI ++N SS+ +L H S P+ V CQ++S + P S
Sbjct: 2 MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRD----HPQES 55
Query: 55 NNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
K W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFG
Sbjct: 56 -----------KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFG 103
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADLRKAVHV ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMD
Sbjct: 104 SADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMD 163
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
RMVLNEANLTNA+LVRTVLTRSDLGG+IIEGADFSDAV+DL QK ALCKYA+GTNP+TGV
Sbjct: 164 RMVLNEANLTNAILVRTVLTRSDLGGSIIEGADFSDAVLDLTQKLALCKYASGTNPVTGV 223
Query: 234 STRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 280
STR SLGCGN RRNAYG+PSSPLLSAPPQKLL+RDGFCD TGLCD+
Sbjct: 224 STRVSLGCGNKRRNAYGTPSSPLLSAPPQKLLNRDGFCDEATGLCDS 270
>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Glycine max]
Length = 260
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 206/282 (73%), Positives = 227/282 (80%), Gaps = 23/282 (8%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V ++++
Sbjct: 1 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES---------------- 44
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45 -----TKWGKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 98
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR ANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNE
Sbjct: 99 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 158
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNA+L+RTVLTRSDLGGAIIEGADFSDAV+DL QKQALCKYA+GTNP+TGVSTR SL
Sbjct: 159 ANLTNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKQALCKYASGTNPVTGVSTRVSL 218
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGN RRNAYGSPSSPLLSAPPQKLLDRDGFCD TGLCDAK
Sbjct: 219 GCGNKRRNAYGSPSSPLLSAPPQKLLDRDGFCDDATGLCDAK 260
>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 262
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/282 (70%), Positives = 220/282 (78%), Gaps = 21/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S +PLSI S + + S + + Q+ K + P SN
Sbjct: 1 MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47 -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
VHV ENFR ANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNE
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNE 160
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNA+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SL
Sbjct: 161 ANLTNAILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSL 220
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGN RRNAYG+PSSPLLSAPPQKLLDRDGFCD +GLCD+K
Sbjct: 221 GCGNKRRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 262
>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 232
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/217 (84%), Positives = 197/217 (90%), Gaps = 2/217 (0%)
Query: 66 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV
Sbjct: 17 KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75
Query: 126 ENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
ENFR ANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNEANLTN
Sbjct: 76 ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNEANLTN 135
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 244
A+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SLGCGN
Sbjct: 136 AILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSLGCGNK 195
Query: 245 RRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
RRNAYG+PSSPLLSAPPQKLLDRDGFCD +GLCD+K
Sbjct: 196 RRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 232
>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
Length = 291
Score = 369 bits (947), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/242 (76%), Positives = 205/242 (84%), Gaps = 7/242 (2%)
Query: 40 ISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA 99
I+ K +D D Q A + KNW+ ++ ALA V+ + ++A ADLNKYEA
Sbjct: 52 ITGKISTDQHKKDA---QPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEA 105
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
ETRGEFGIGSAAQFGSA+LRK VH ENFR ANFTSAD+RESDFSGS FNGAYLEKAVAY
Sbjct: 106 ETRGEFGIGSAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAY 165
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
K NFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVID QKQ
Sbjct: 166 KTNFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDFTQKQ 225
Query: 219 ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLC 278
ALCKYA+GTNPITG+STRKSLGCGNSRRNAYG+PS+PLLSAPP+KLLD+DGFCDS TGLC
Sbjct: 226 ALCKYASGTNPITGISTRKSLGCGNSRRNAYGTPSAPLLSAPPEKLLDKDGFCDSSTGLC 285
Query: 279 DA 280
DA
Sbjct: 286 DA 287
>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
gi|194694816|gb|ACF81492.1| unknown [Zea mays]
gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
Length = 268
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 177/194 (91%), Positives = 186/194 (95%), Gaps = 1/194 (0%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFR ANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 134 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 193
Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
FSDAVIDL+QKQALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK+LD
Sbjct: 194 FSDAVIDLSQKQALCKYASGTNPMTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKILD 253
Query: 267 RDGFCDSGTGLCDA 280
RDGFCD TG+CDA
Sbjct: 254 RDGFCDPATGMCDA 267
>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
Length = 280
Score = 361 bits (927), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 203/285 (71%), Positives = 230/285 (80%), Gaps = 9/285 (3%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
MA SS+SPL +KSL+ SSS + + L Q+SS+ S+ + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58
Query: 58 CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
C+ A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59 CSS--AESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115
Query: 118 LRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
L K VH ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMV
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 175
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
LNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TR
Sbjct: 176 LNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTR 235
Query: 237 KSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
KSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 236 KSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
Flags: Precursor
gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 280
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 203/285 (71%), Positives = 230/285 (80%), Gaps = 9/285 (3%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
MA SS+SPL +KSL+ SSS + + L Q+SS+ S+ + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58
Query: 58 CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
C+ A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59 CSS--AESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115
Query: 118 LRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
L K VH ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMV
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 175
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
LNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TR
Sbjct: 176 LNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTR 235
Query: 237 KSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
KSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 236 KSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
Length = 280
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 207/288 (71%), Positives = 235/288 (81%), Gaps = 15/288 (5%)
Query: 1 MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ 57
MA SS+SPL +KSL+ SSS PY H PL Q+SS++ S + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNS--EIKDSSNAR 55
Query: 58 ---CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
C+ ++ W+ +S A+AAAV+AS SS++ A+A+LN++EA+TRGEFGIGSAAQ+G
Sbjct: 56 EGCCS--RSESNTWKRILSAAMAAAVIAS-SSSVPAMAELNRFEADTRGEFGIGSAAQYG 112
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADL K +H ENFR ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMD
Sbjct: 113 SADLSKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMD 172
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
RMVLNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYANGTNP+TGV
Sbjct: 173 RMVLNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYANGTNPLTGV 232
Query: 234 STRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
TRKSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCDAK
Sbjct: 233 DTRKSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDAK 280
>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
Length = 276
Score = 360 bits (924), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 201/282 (71%), Positives = 223/282 (79%), Gaps = 7/282 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL + SPL+ + C+ + + L + V+CQ + DG S + A
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGN--SLSTSAAAA 58
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 59 AASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 114
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR ANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNE
Sbjct: 115 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNE 174
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSL
Sbjct: 175 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSL 234
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 235 GCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 276
>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
Length = 270
Score = 360 bits (923), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 177/194 (91%), Positives = 184/194 (94%), Gaps = 1/194 (0%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFR ANFTSADMRESDFSGS
Sbjct: 76 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 136 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 195
Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
FSDAVIDL QKQALCKYA+GTN ITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD
Sbjct: 196 FSDAVIDLPQKQALCKYASGTNSITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 255
Query: 267 RDGFCDSGTGLCDA 280
RDGFCD TG+C+A
Sbjct: 256 RDGFCDPATGMCEA 269
>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 277
Score = 358 bits (918), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 201/282 (71%), Positives = 222/282 (78%), Gaps = 6/282 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL + SPL+ + C+ + + L + V+CQ + DG S A
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGNSLSTSAAAAAA 60
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 61 ASPPPR-WRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 115
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR ANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNE
Sbjct: 116 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNE 175
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSL
Sbjct: 176 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSL 235
Query: 240 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
GCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 236 GCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 277
>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 206
Score = 357 bits (917), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 177/207 (85%), Positives = 190/207 (91%), Gaps = 2/207 (0%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTS 134
+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFR ANFTS
Sbjct: 1 MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct: 60 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 119
Query: 195 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 254
SDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSS
Sbjct: 120 SDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSS 179
Query: 255 PLLSAPPQKLLDRDGFCDSGTGLCDAK 281
PLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 180 PLLSAPPQRLLGRDGFCDEKTGLCDVK 206
>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Brachypodium distachyon]
Length = 268
Score = 353 bits (906), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 172/195 (88%), Positives = 181/195 (92%), Gaps = 1/195 (0%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
+ A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV ENFR ANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNENFRRANFTSADMRESDFSGST 133
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
FNGAY+EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL RTVLTRSDLGGA IEGAD
Sbjct: 134 FNGAYMEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLARTVLTRSDLGGATIEGAD 193
Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
FSDAV+DL QK ALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPP KLLD
Sbjct: 194 FSDAVLDLQQKLALCKYASGTNPVTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPPKLLD 253
Query: 267 RDGFCDSGTGLCDAK 281
RDGFCD TG+CDAK
Sbjct: 254 RDGFCDEATGMCDAK 268
>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 348 bits (892), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 202/284 (71%), Positives = 221/284 (77%), Gaps = 20/284 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPDCSNNQC 58
MAL+S SPL+ + K P L S+ L ++CQ ++ G + SN
Sbjct: 1 MALASTSPLAA-----TVARPKAPASLTRCRSRRLQRISCQATTDRSGGG---NASNTSP 52
Query: 59 AGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
A P WRV VS ALAAAVV + + A ADLNKYEA+ RGEFGIGSAAQFG+ADL
Sbjct: 53 APP-----RWRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADL 103
Query: 119 RKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
+ VHV ENFR ANFTSADMRESDFSGS FNGAY+EKAVA++ANFTGADLSDTLMDRMVL
Sbjct: 104 KNTVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLMDRMVL 163
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
NEANLTNAVL RTVLTRSDLGGA IEGADFSDAVIDL QK ALCKYA+GTNPITGVSTRK
Sbjct: 164 NEANLTNAVLSRTVLTRSDLGGATIEGADFSDAVIDLPQKLALCKYASGTNPITGVSTRK 223
Query: 238 SLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 281
SLGCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD +GLCDAK
Sbjct: 224 SLGCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEASGLCDAK 267
>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
Length = 293
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 192/301 (63%), Positives = 215/301 (71%), Gaps = 38/301 (12%)
Query: 11 IKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV 70
+KSL+ SSS + + L Q+SS+ S+ + D SN A+ W+
Sbjct: 1 MKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTS-----AESNTWKR 53
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
+S A AA V + SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFR
Sbjct: 54 ILSAA-MAAAVIASSSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRR 112
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR
Sbjct: 113 ANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVR 172
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------------------AL 220
+VLTRSDLGGA IEGADFSDAVIDL QKQ AL
Sbjct: 173 SVLTRSDLGGAKIEGADFSDAVIDLLQKQVTTTHHYIYPSFRSTIKKYFTNGFHNVLKAL 232
Query: 221 CKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 280
CKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD
Sbjct: 233 CKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDV 292
Query: 281 K 281
K
Sbjct: 293 K 293
>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
Length = 219
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 160/205 (78%), Positives = 183/205 (89%), Gaps = 5/205 (2%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF--RANFT 133
LAA V+A+ ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENF RANFT
Sbjct: 14 LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70
Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
SADMRE+DFSGS FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLT
Sbjct: 71 SADMREADFSGSTFNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLT 130
Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPS 253
RSDLGGA IEGADFSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS
Sbjct: 131 RSDLGGAKIEGADFSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPS 190
Query: 254 SPLLSAPPQKLLDRDGFCDSGTGLC 278
+P+LSAPP++LLD+DGFCD TG C
Sbjct: 191 APILSAPPERLLDKDGFCDDATGKC 215
>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
Length = 196
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 155/192 (80%), Positives = 176/192 (91%), Gaps = 1/192 (0%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK 146
++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENFR ANFTSADMRE+DFSGS
Sbjct: 1 MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLTRSDLGGA IEGAD
Sbjct: 61 FNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLTRSDLGGAKIEGAD 120
Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
FSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS+P+LSAPP++LLD
Sbjct: 121 FSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPSAPILSAPPERLLD 180
Query: 267 RDGFCDSGTGLC 278
+DGFCD TG C
Sbjct: 181 KDGFCDDATGKC 192
>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 225
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/193 (79%), Positives = 170/193 (88%), Gaps = 3/193 (1%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN 148
+LADLN EA TRGEFGIGSA QFGSADL+K H ENFR NFTSADM+E++FS S FN
Sbjct: 28 SLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSNSTFN 87
Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
GAYLEKAVAY+ NF+GADLSDTLMDRMVLNEANL+NA+LVR VLTRSDLG AIIEGADFS
Sbjct: 88 GAYLEKAVAYRTNFSGADLSDTLMDRMVLNEANLSNALLVRAVLTRSDLGSAIIEGADFS 147
Query: 209 DAVIDLAQKQ--ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 266
DAV+DL QKQ ALCKYA+GTNP+TG+STRKSLGCGN+RRNAYGSPSSP LSAPP LLD
Sbjct: 148 DAVLDLTQKQAFALCKYASGTNPVTGMSTRKSLGCGNARRNAYGSPSSPELSAPPPILLD 207
Query: 267 RDGFCDSGTGLCD 279
++GFCD+ TG CD
Sbjct: 208 KNGFCDNSTGKCD 220
>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
At1g12250, chloroplastic-like [Glycine max]
Length = 222
Score = 290 bits (741), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 166/251 (66%), Positives = 186/251 (74%), Gaps = 30/251 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S SPLS+ SL+ S SS + + S P V CQ +S +
Sbjct: 1 MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-------------- 44
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ V VS LAAA++A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45 -----RQGNV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 97
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AVHV ENFR +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF G DLSDTL DRMVLNE
Sbjct: 98 AVHVNENFRXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANFPGVDLSDTLTDRMVLNE 157
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
ANL+NA+L+RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKY +T VSTR SL
Sbjct: 158 ANLSNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKHALCKY------VTRVSTRVSL 211
Query: 240 GCGNSRRNAYG 250
GCGN RRNAYG
Sbjct: 212 GCGNKRRNAYG 222
>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
Length = 239
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/167 (59%), Positives = 119/167 (71%), Gaps = 1/167 (0%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN 148
ALADLN YEA T GEFGIGSA Q+G AD++ ++ R +NFTSAD R + F GS
Sbjct: 51 ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110
Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
GAY KAV Y+ NF A+LSD LMDR + EANL NA+L RTV TRSDL A+IEGADF+
Sbjct: 111 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVIEGADFT 170
Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
+A++D Q ALCKYA+GTNP+TG TRKSLGCG RR PS+P
Sbjct: 171 NALLDKTQVMALCKYASGTNPVTGADTRKSLGCGGKRRYQASYPSNP 217
>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
Length = 214
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/167 (59%), Positives = 120/167 (71%), Gaps = 1/167 (0%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN 148
A ADLN YEAE GEFGIGSA Q+G AD++ ++ R +NFTSAD R ++F GS
Sbjct: 26 AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85
Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
GAY KAV Y+ NF A+LSD LMDR + EANL NAVL R V TRSDL A++EGADF+
Sbjct: 86 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLRNAVLQRAVFTRSDLKDAVVEGADFT 145
Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
+A++D Q ALCKYA+G NP+TGVSTRKSLGCG+ RR PS+P
Sbjct: 146 NALLDKTQVMALCKYADGVNPVTGVSTRKSLGCGSQRRYKASYPSNP 192
>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
Length = 199
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/176 (65%), Positives = 133/176 (75%), Gaps = 18/176 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V CQI+S + + Q +
Sbjct: 2 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRD---------HRQEST 51
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ K+ VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 52 KWGKV------VSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 104
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
AVHV ENFR ANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRM
Sbjct: 105 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRM 160
>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
Length = 217
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 103/185 (55%), Positives = 126/185 (68%), Gaps = 3/185 (1%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLR-KAVHVKENFRANFTSADMRESDFSGSKFN 148
A+ADLNKYEA GEFG G+A Q+G ADL+ + H ++ R+NFT+AD R +F S
Sbjct: 29 AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88
Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
GAY K+V KANF A+LSD LMDR VLNEANL NA R VLTRSDLGGA I G DF+
Sbjct: 89 GAYFIKSVVPKANFENANLSDVLMDRAVLNEANLRNANFQRAVLTRSDLGGADINGTDFT 148
Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 268
+A++D Q+ ALC+YA+GTN TGV TRKSLGCG+ RR SPS+P P +D+
Sbjct: 149 NALLDKTQQIALCRYADGTNTETGVETRKSLGCGSRRRFRESSPSNP--EGPQVADVDKK 206
Query: 269 GFCDS 273
F S
Sbjct: 207 AFVKS 211
>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
Length = 201
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 106/169 (62%), Positives = 116/169 (68%), Gaps = 20/169 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L +LSKP V C+I + E NN
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43 ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101
Query: 121 AVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
AVHV ENF RANFTSADMRESDFSGS FNG YLEKAVAYKA+ T A S
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTDAQSS 150
>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 277
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 104/198 (52%), Positives = 130/198 (65%), Gaps = 9/198 (4%)
Query: 86 SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFR-ANFTSADMRESD 141
S+ +A A+LN EA GEF GSA QFG DLR V + + R +NFT A+MR +
Sbjct: 81 SSPAAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAK 140
Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
G+ GAYL KAVA++A+F GA+LSD LMDR VLN AN +A+L R VLT SDLG A
Sbjct: 141 LRGANLTGAYLMKAVAFEADFEGANLSDALMDRAVLNSANFRDAILTRVVLTSSDLGDAK 200
Query: 202 IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPL---LS 258
I+GADFSDA+ID +Q+Q LC+YA+GTN +TGVSTR+SL CG R + SPS + S
Sbjct: 201 IDGADFSDALIDKSQQQKLCQYASGTNSVTGVSTRRSLNCGGGVRTS--SPSRYMTDETS 258
Query: 259 APPQKLLDRDGFCDSGTG 276
A P+ D F GTG
Sbjct: 259 AKPEAAFDASRFSAYGTG 276
>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
Length = 259
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 90/167 (53%), Positives = 118/167 (70%), Gaps = 1/167 (0%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN 148
A A+LNKYE GEF +G+A Q+G AD++ ++ R+NFT+AD R+++F SK
Sbjct: 71 ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130
Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
AY K+V +AN ADLSD LMDR V+ +ANL AVL R +LTRSDL + I GADF+
Sbjct: 131 AAYFMKSVLARANLENADLSDALMDRAVIVDANLRGAVLQRAILTRSDLDRSDIYGADFT 190
Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
+A++D Q+ ALCKYA+G NP+TGVSTRKSL CG+SRR SPS+P
Sbjct: 191 NALVDKTQQMALCKYADGVNPMTGVSTRKSLNCGSSRRFKASSPSNP 237
>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 231
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/197 (53%), Positives = 121/197 (61%), Gaps = 7/197 (3%)
Query: 72 VSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF--- 128
+S A A V S A+A+LN EA GEF GSA QFG DLR A +V E +
Sbjct: 21 LSVATAMIVSGIIPSPPFAVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTD 79
Query: 129 --RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
+NFT A+MR+S G+K NGAYL KAVA A+FT ADLSD LMDR V AN TNA+
Sbjct: 80 LRLSNFTGAEMRDSKLVGAKLNGAYLMKAVAANADFTDADLSDALMDRGVFVNANFTNAI 139
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 246
L R VLT SDL GA I ADFSDA++D + LCK A GTNP TGV+TRKSL C R
Sbjct: 140 LARVVLTSSDLNGANITNADFSDALLDNTMQMKLCKIATGTNPTTGVNTRKSLNCTGGRG 199
Query: 247 NAYGSPSSPLLSAPPQK 263
N GSPS + QK
Sbjct: 200 NV-GSPSRYMTEEDAQK 215
>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
Length = 247
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 97/184 (52%), Positives = 116/184 (63%), Gaps = 7/184 (3%)
Query: 66 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
K V S ALA A S + A A+LN+ EA GEF GSA QFG DL K K
Sbjct: 34 KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90
Query: 126 E---NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
E + R +NFT ADMR + G+ GAY+ K VA + +FTGAD+SD LMDR VL AN
Sbjct: 91 EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRSVLVGAN 150
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T+AVL R VLT SD+ AIIE ADF+DA++D +QALCK A+G NP TGV+TR SLGC
Sbjct: 151 FTDAVLNRVVLTSSDMKDAIIENADFTDALLDPKTQQALCKTASGKNPETGVATRVSLGC 210
Query: 242 GNSR 245
R
Sbjct: 211 SGGR 214
>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 147
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 95/165 (57%), Positives = 110/165 (66%), Gaps = 21/165 (12%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S +PLSI S + + S + + Q+ K + P SN
Sbjct: 1 MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47 -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100
Query: 121 AVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
VHV ENF RANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 145
>gi|255087366|ref|XP_002505606.1| predicted protein [Micromonas sp. RCC299]
gi|226520876|gb|ACO66864.1| predicted protein [Micromonas sp. RCC299]
Length = 146
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 71/108 (65%), Positives = 85/108 (78%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
MR++ G+ GAYL KAVA+ A+F GA+LSD LMDR VLN AN +A++ R VLT SD
Sbjct: 1 MRKAKLRGANLTGAYLMKAVAFAADFEGANLSDALMDRAVLNNANFKDAIMTRVVLTSSD 60
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 244
LG A+IEGADFSDA+ID+ Q+QALCKYANG N +TGVSTRKSL CG S
Sbjct: 61 LGDAVIEGADFSDALIDVKQQQALCKYANGVNSVTGVSTRKSLNCGGS 108
>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 114
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/111 (61%), Positives = 84/111 (75%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NFT AD+R + G+ GAY+ K VA + +FTGAD+SD LMDR VL +AN TNA+L R
Sbjct: 4 NFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFTNAILNRV 63
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
VLT SDL GAI+E ADF+DA++D+ +QALCK A+G NP TGVSTR SLGC
Sbjct: 64 VLTSSDLEGAIVENADFTDALLDVKTQQALCKTASGKNPETGVSTRVSLGC 114
>gi|224125144|ref|XP_002329904.1| predicted protein [Populus trichocarpa]
gi|222871141|gb|EEF08272.1| predicted protein [Populus trichocarpa]
Length = 108
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/81 (77%), Positives = 68/81 (83%), Gaps = 4/81 (4%)
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
MV+NEANLTNAVLVR+ LTR DLGGA I GAD SD+VIDL QKQ YA+GTNP TGVS
Sbjct: 1 MVINEANLTNAVLVRSALTRCDLGGAQIAGADSSDSVIDLPQKQ----YASGTNPTTGVS 56
Query: 235 TRKSLGCGNSRRNAYGSPSSP 255
R SLGCGNSRRNAYG+PSSP
Sbjct: 57 NRASLGCGNSRRNAYGTPSSP 77
>gi|434390855|ref|YP_007125802.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262696|gb|AFZ28642.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 176
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 59/110 (53%), Positives = 77/110 (70%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MRE++F G+ A L K V +AN GA+L+ L+DR+ L+EANL NA+L +
Sbjct: 66 FVAAEMREANFQGADLTNAILTKGVLLRANLEGANLTGALVDRVTLDEANLKNAILQEAI 125
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS L A I GADF+DA+ID Q LC A+G NP+TGVSTR+SLGC
Sbjct: 126 LTRSRLFDADITGADFTDALIDRYQVSLLCDRADGVNPVTGVSTRESLGC 175
>gi|416382245|ref|ZP_11684306.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
0003]
gi|357265427|gb|EHJ14194.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
0003]
Length = 171
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/129 (46%), Positives = 82/129 (63%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DL K V F +ADMRE++F GS + A +A KAN GA+L+ +L+
Sbjct: 51 FSHKDLEKGV---------FAAADMREANFEGSNLSYAIFTEATLLKANLKGANLTSSLL 101
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LT+A+L+ + TR+ A+I GADF+DAVID Q +C+ A G NP+TG
Sbjct: 102 DRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCERAEGVNPVTG 161
Query: 233 VSTRKSLGC 241
VSTR SLGC
Sbjct: 162 VSTRDSLGC 170
>gi|67921246|ref|ZP_00514765.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67857363|gb|EAM52603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 172
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/129 (46%), Positives = 82/129 (63%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DL K V F +ADMRE++F GS + A +A KAN GA+L+ +L+
Sbjct: 52 FSHKDLEKGV---------FAAADMREANFEGSNLSYAIFTEATLLKANLKGANLTSSLL 102
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LT+A+L+ + TR+ A+I GADF+DAVID Q +C+ A G NP+TG
Sbjct: 103 DRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCERAEGVNPVTG 162
Query: 233 VSTRKSLGC 241
VSTR SLGC
Sbjct: 163 VSTRDSLGC 171
>gi|218247318|ref|YP_002372689.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167796|gb|ACK66533.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 172
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/129 (47%), Positives = 83/129 (64%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DL KAV F +A+MRE++F GS + A L + V KAN A+L+ +L+
Sbjct: 52 FSHRDLEKAV---------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDANLTGSLL 102
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTNA+LV + TR+ II GADF+DAVID Q +C+ A+G NP+TG
Sbjct: 103 DRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVTG 162
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 163 VATRDSLGC 171
>gi|254421873|ref|ZP_05035591.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196189362|gb|EDX84326.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 187
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 74/110 (67%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +AD R+++F G+ +G L KA + N GAD + T DR++ + A+LTNA+ V +
Sbjct: 76 FAAADARDANFEGADMSGTILTKATFLRTNLKGADFTKTFADRVLFDGADLTNAIFVEAI 135
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T S G II GADFSDA+ID Q + +CK A+G NP+TG+STR+SLGC
Sbjct: 136 ATSSSFGDTIITGADFSDAIIDRFQVKKMCKRADGINPVTGISTRESLGC 185
>gi|257061347|ref|YP_003139235.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591513|gb|ACV02400.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 172
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/129 (46%), Positives = 82/129 (63%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DL KAV F +A+MRE++F GS + A L + V KAN +L+ +L+
Sbjct: 52 FSHRDLEKAV---------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDVNLTGSLL 102
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTNA+LV + TR+ II GADF+DAVID Q +C+ A+G NP+TG
Sbjct: 103 DRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVTG 162
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 163 VATRDSLGC 171
>gi|434384986|ref|YP_007095597.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428015976|gb|AFY92070.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 165
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/129 (48%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
FG DL V F S+++R + SG+ A L AV K N +GA+L+ L
Sbjct: 45 FGGQDLTGGV---------FVSSELRGVNMSGANLTNAMLTMAVLLKTNLSGANLTGALA 95
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR +EA+LTNA+L LTRS GA I GADF+DA+ID AQ + LC A+G NP+TG
Sbjct: 96 DRATFDEADLTNAILTEATLTRSRFYGAKITGADFTDALIDRAQAKLLCDRADGINPVTG 155
Query: 233 VSTRKSLGC 241
VSTR SLGC
Sbjct: 156 VSTRDSLGC 164
>gi|428316344|ref|YP_007114226.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240024|gb|AFZ05810.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 169
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 72/110 (65%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A L K V AN +GA+LS L DR+ + ANLTNA +
Sbjct: 59 FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFTEAI 118
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+TR+ A I GADFSDA+ID Q LC+ A+G NP+TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFSDAIIDAYQVSILCEKADGVNPVTGVSTRESLGC 168
>gi|126658078|ref|ZP_01729230.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
gi|126620716|gb|EAZ91433.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
Length = 181
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 74/110 (67%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +ADMRE++F GS + + + + AN G DLS +L+DR+ L+ A+LTNA+LV +
Sbjct: 71 FAAADMREANFEGSNLSYSIFTEGILLGANLKGVDLSSSLLDRVTLDFADLTNAILVDAI 130
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TR+ A I GADF++AVID Q +C+ A G NP+TGVSTR SLGC
Sbjct: 131 ATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGVNPVTGVSTRDSLGC 180
>gi|434405844|ref|YP_007148729.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428260099|gb|AFZ26049.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 168
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 72/110 (65%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A L K V KAN GA+L+ L+DR+ L+ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLAGALVDRVTLDGANLKNAIFTEAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A + GADF+DA+ID Q LCK A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDADVTGADFTDALIDRYQVALLCKSADGVNPVTGISTRDSLGC 167
>gi|75908890|ref|YP_323186.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75702615|gb|ABA22291.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 168
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 73/112 (65%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
NF +A+MR ++F G+ A L K V KAN + A+L+ L+DR+ L+ ANL NA+
Sbjct: 56 VNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANLTGALVDRVTLDNANLKNAIFTE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGV+TR SLGC
Sbjct: 116 ATLTRSRFYDADITGADFTDAIIDRYQVSLLCERADGVNPVTGVATRDSLGC 167
>gi|254412921|ref|ZP_05026693.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196180085|gb|EDX75077.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 180
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 73/110 (66%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F AD+R + F G+ G+ L KA ++A+ TGA+LS+TL DR+V + ANLTNA+ +
Sbjct: 70 FAGADLRGASFRGASLQGSILTKAAFFEADLTGANLSETLADRVVFDGANLTNAIFTNAI 129
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+RS I GADFS A++D Q +C+ A+G NP+TGVSTR SLGC
Sbjct: 130 ASRSRFFDTTITGADFSGAILDTYQISLMCQRADGVNPVTGVSTRDSLGC 179
>gi|354555882|ref|ZP_08975181.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|353552206|gb|EHC21603.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 182
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 75/110 (68%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +ADMRE++F GS + + + + AN GA+LS +L+DR+ L+ A+LTNA+LV +
Sbjct: 72 FAAADMREANFEGSNLSYSIFTEGILLGANLKGANLSSSLLDRVTLDFADLTNAILVDAI 131
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TR+ A I GADF++AVID Q +C+ A G NP+TGVSTR SLGC
Sbjct: 132 ATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGVNPVTGVSTRDSLGC 181
>gi|172037118|ref|YP_001803619.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|171698572|gb|ACB51553.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
Length = 184
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 75/110 (68%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +ADMRE++F GS + + + + AN GA+LS +L+DR+ L+ A+LTNA+LV +
Sbjct: 74 FAAADMREANFEGSNLSYSIFTEGILLGANLKGANLSSSLLDRVTLDFADLTNAILVDAI 133
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TR+ A I GADF++AVID Q +C+ A G NP+TGVSTR SLGC
Sbjct: 134 ATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGVNPVTGVSTRDSLGC 183
>gi|332712340|ref|ZP_08432267.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332348814|gb|EGJ28427.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 169
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/113 (46%), Positives = 75/113 (66%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R F A+MR ++F G+ +G+ K KAN GA+L+D+L DR++L++ANLTNA+L
Sbjct: 56 RGVFAGAEMRGTNFQGADLSGSIFTKGNLLKANLEGANLTDSLADRVILDQANLTNAILT 115
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ + A I GADF+DA+ID Q + +C A G NP+TG+STR SLGC
Sbjct: 116 DAIMNSTRFYDAEITGADFTDALIDRYQAKLMCGRATGVNPVTGISTRDSLGC 168
>gi|295293762|gb|ADF88289.1| pentapeptide repeat-containing protein [Aphanizomenon sp. 10E6]
Length = 168
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A K V KAN A+L+ L+DR+ L+ ANL NA+ +
Sbjct: 58 FVAAEMRGTNFQGANLTNAIFTKGVLLKANLEAANLTGALVDRVTLDSANLRNAIFTKAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGVSTR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPVTGVSTRDSLGC 167
>gi|186685193|ref|YP_001868389.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186467645|gb|ACC83446.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 168
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 72/110 (65%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A L K V KAN GA+LS L+DR+ ++ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLSGALVDRVTMDGANLKNAIFTEAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q +C+ A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDAEITGADFTDALIDRYQVSLMCERADGVNPVTGMSTRDSLGC 167
>gi|428224803|ref|YP_007108900.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984704|gb|AFY65848.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 176
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F SA+MR ++F G+ + A L K V AN GA+L+ L DR+ +ANL NA+LV
Sbjct: 67 FVSAEMRNANFEGANLSNAILTKGVLLNANLEGANLTGALADRVFWLDANLRNAILVDVT 126
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TR+ G + GADFSDA++D + + LCK A G NP+TGV+TR SLGC
Sbjct: 127 ATRTSFEGVDVTGADFSDAILDRYELKELCKRAEGVNPVTGVATRDSLGC 176
>gi|428778133|ref|YP_007169920.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428692412|gb|AFZ45706.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 174
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 62/137 (45%), Positives = 82/137 (59%), Gaps = 9/137 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+ I S F + DL AV F +A+MR+++FSGS A K A+ +
Sbjct: 46 YTIVSERDFSNKDLVGAV---------FAAAEMRKTNFSGSNLENAMFTKGTLINADLSN 96
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
+LS LMDR+ L+ A+L NAVL T LTRS L G IEGADF+DA+++ Q + LC+ A
Sbjct: 97 TNLSGALMDRVSLDGADLRNAVLQGTFLTRSTLEGTKIEGADFTDAILNRYQVKLLCERA 156
Query: 225 NGTNPITGVSTRKSLGC 241
G NP TGV+TR SLGC
Sbjct: 157 EGVNPKTGVATRDSLGC 173
>gi|427728139|ref|YP_007074376.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364058|gb|AFY46779.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 168
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A K V AN +GA+L+ L+DR L+ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAIFTKGVLLNANLSGANLTGALVDRATLDSANLKNAIFTEAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGV+TR+SLGC
Sbjct: 118 LTRSRFYDADITGADFTDAIIDRYQVSLLCERADGINPVTGVATRESLGC 167
>gi|334119379|ref|ZP_08493465.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333458167|gb|EGK86786.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 169
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A L K V AN +GA+LS L DR+ + ANLTNA +
Sbjct: 59 FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFSEAI 118
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+TR+ A I GADF+DA+ID Q LC+ A+G NP TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFTDAIIDAYQVSILCEKADGVNPATGVSTRESLGC 168
>gi|440681954|ref|YP_007156749.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428679073|gb|AFZ57839.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 168
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ + A L K V KAN A+L+ L+DR+ L+ ANL NA+
Sbjct: 58 FVAAEMRGANFQGANLSNAILTKGVLLKANLEDANLTGALVDRVTLDSANLKNAIFTEAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ ANG N +TG+STR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNSVTGISTRDSLGC 167
>gi|17227682|ref|NP_484230.1| hypothetical protein all0186 [Nostoc sp. PCC 7120]
gi|17135164|dbj|BAB77710.1| all0186 [Nostoc sp. PCC 7120]
Length = 168
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 71/112 (63%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
NF +A+MR ++F G+ A L K V KAN + A+L+ L+DR L+ ANL NA+
Sbjct: 56 VNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANLTGALVDRATLDNANLKNAIFTE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ ANG N +TG++TR SLGC
Sbjct: 116 ATLTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNRVTGIATRDSLGC 167
>gi|428299988|ref|YP_007138294.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428236532|gb|AFZ02322.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 193
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 71/111 (63%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF +A+MR +F G+ A L K V KAN GA+L+ L+DR+ L+ ANL NA
Sbjct: 82 NFVAAEMRGINFEGANLTNAMLTKGVMLKANLEGANLTAALVDRVALDGANLKNANFTDA 141
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS L A I GADFS+A+ID Q + LC A+GTNP+TGV TR SL C
Sbjct: 142 TLTRSRLFDADITGADFSNALIDTYQMKLLCDRASGTNPVTGVDTRDSLEC 192
>gi|428779391|ref|YP_007171177.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428693670|gb|AFZ49820.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 171
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/137 (44%), Positives = 82/137 (59%), Gaps = 9/137 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+ + S F + DL AV F +A+MR ++FSGS A K A+ +
Sbjct: 43 YTVVSERDFSNKDLVGAV---------FAAAEMRRTNFSGSNLENAMFTKGTLINADLSN 93
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
+LS LMDR+ L+ A+L+NAVL T LTRS L G I GADF+DA+++ Q + LC+ A
Sbjct: 94 TNLSGALMDRVNLDGADLSNAVLNGTFLTRSTLEGTKITGADFTDAILNRYQVKLLCEKA 153
Query: 225 NGTNPITGVSTRKSLGC 241
G NP TGVSTR+SLGC
Sbjct: 154 EGVNPKTGVSTRESLGC 170
>gi|427720966|ref|YP_007068960.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427353402|gb|AFY36126.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 168
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 81/141 (57%), Gaps = 1/141 (0%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
R F + + + + +L E+ A F +A+MR ++F G+ A L K V KA
Sbjct: 27 RPAFALTNVINYNNINLENRDFAHEDLTGATFVAAEMRGANFQGANLTNAVLTKGVLLKA 86
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
+ + A+L+ L+DR+ L+ ANL NA+ LTRS A I GADF+DA+ID Q +
Sbjct: 87 DLSDANLTGALVDRVTLDGANLKNAIFTEATLTRSRFYDAEITGADFTDALIDRYQVSLM 146
Query: 221 CKYANGTNPITGVSTRKSLGC 241
C A G NP+TGVSTR SLGC
Sbjct: 147 CDRAAGINPVTGVSTRDSLGC 167
>gi|16331228|ref|NP_441956.1| hypothetical protein sll0301 [Synechocystis sp. PCC 6803]
gi|383322971|ref|YP_005383824.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326140|ref|YP_005386993.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492024|ref|YP_005409700.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437292|ref|YP_005652016.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
gi|451815384|ref|YP_007451836.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
gi|1001404|dbj|BAA10026.1| sll0301 [Synechocystis sp. PCC 6803]
gi|339274324|dbj|BAK50811.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
gi|359272290|dbj|BAL29809.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275460|dbj|BAL32978.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278630|dbj|BAL36147.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781353|gb|AGF52322.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
Length = 169
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 82/137 (59%), Gaps = 9/137 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+G + + F DL KAV F +AD+RES+F GS + + L AV A+ G
Sbjct: 41 YGDLARSDFSHQDLNKAV---------FAAADLRESNFEGSDLSFSILTDAVFLHASLRG 91
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
A+LS +L+DR+ L+ A+L + + + TR+ I GADFSDAVID Q + +C+ A
Sbjct: 92 ANLSGSLVDRVTLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERA 151
Query: 225 NGTNPITGVSTRKSLGC 241
G NP+TGV+TR SLGC
Sbjct: 152 EGVNPVTGVATRDSLGC 168
>gi|359460928|ref|ZP_09249491.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 172
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 73/131 (55%), Gaps = 20/131 (15%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA--------------------DLSDT 170
NFT AD+R DF F GA L A+ KAN T A DL++T
Sbjct: 41 NFTFADLRYEDFENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLTET 100
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
DR++ NEA+LTNA+ +LT S A I GADFS A +D Q +C+YA+G NP+
Sbjct: 101 FADRVLFNEADLTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVNPV 160
Query: 231 TGVSTRKSLGC 241
TGVSTR+SL C
Sbjct: 161 TGVSTRESLEC 171
>gi|158337601|ref|YP_001518776.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307842|gb|ABW29459.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 172
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 73/131 (55%), Gaps = 20/131 (15%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA--------------------DLSDT 170
NFT AD+R DF F GA L A+ KAN T A DL++T
Sbjct: 41 NFTFADLRYEDFENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLTET 100
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
DR++ NEA+LTNA+ +LT S A I GADFS A +D Q +C+YA+G NP+
Sbjct: 101 FADRVLFNEADLTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVNPV 160
Query: 231 TGVSTRKSLGC 241
TGVSTR+SL C
Sbjct: 161 TGVSTRESLEC 171
>gi|407961395|dbj|BAM54635.1| hypothetical protein BEST7613_5704 [Synechocystis sp. PCC 6803]
Length = 147
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 82/137 (59%), Gaps = 9/137 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+G + + F DL KAV F +AD+RES+F GS + + L AV A+ G
Sbjct: 19 YGDLARSDFSHQDLNKAV---------FAAADLRESNFEGSDLSFSILTDAVFLHASLRG 69
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
A+LS +L+DR+ L+ A+L + + + TR+ I GADFSDAVID Q + +C+ A
Sbjct: 70 ANLSGSLVDRVTLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERA 129
Query: 225 NGTNPITGVSTRKSLGC 241
G NP+TGV+TR SLGC
Sbjct: 130 EGVNPVTGVATRDSLGC 146
>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
Length = 203
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 75/111 (67%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F R++DFSG+ +G+ L +A +++F+GADLSD LMDR + +L+ A+L
Sbjct: 92 SFAGVMARDADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSGALLRGV 151
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S GA+I+ ADFSDA++D + ++ALC+ A GTNP TGVSTR SL C
Sbjct: 152 IAAGSSFSGAVIDDADFSDALLDRSDQRALCRRAQGTNPTTGVSTRLSLDC 202
>gi|255083653|ref|XP_002508401.1| predicted protein [Micromonas sp. RCC299]
gi|226523678|gb|ACO69659.1| predicted protein [Micromonas sp. RCC299]
Length = 187
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 75/133 (56%), Gaps = 6/133 (4%)
Query: 120 KAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
KA H+ E+F A +T D+R SDFSGS A +AV N GAD+S++ +D
Sbjct: 30 KAEHINEDFSHEDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAVMPGVNLEGADMSNSFLD 89
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
+VL +N+ + RSDLG + ADF++AVID Q LC A+GTNP TGV
Sbjct: 90 YVVLRGSNMRGVIAREANFVRSDLGDCDVTDADFTEAVIDRYQAIGLCDSASGTNPFTGV 149
Query: 234 STRKSLGCGNSRR 246
TR SLGC +R
Sbjct: 150 DTRDSLGCERLKR 162
>gi|427706655|ref|YP_007049032.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359160|gb|AFY41882.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 168
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 70/110 (63%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F + A K V KAN GA+L+ L+DR+ L+ ANL NA
Sbjct: 58 FVAAEMRGTNFQAANLTNAIFTKGVLLKANLEGANLTGALVDRVTLDGANLKNANFTEAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGV+TR+SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQISLLCERADGVNPVTGVATRESLGC 167
>gi|443313318|ref|ZP_21042930.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442776723|gb|ELR87004.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 182
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 70/110 (63%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A + K V AN GA+LS L+DR+ L+ ANL NA+
Sbjct: 72 FVAAEMRGANFQGADLTNAIMTKGVLLGANLEGANLSGALVDRVTLDNANLKNAIFTDAT 131
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADFS+A+ID Q LC A GTNP+TG++T +SLGC
Sbjct: 132 LTRSRFFDADITGADFSNALIDRYQINLLCDRATGTNPVTGITTTESLGC 181
>gi|428203864|ref|YP_007082453.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981296|gb|AFY78896.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 170
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 72/110 (65%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +ADMR +F S + L + V AN GA+L+++LMDR+ L+ A+LTNA+ V +
Sbjct: 60 FAAADMRGINFEDSDLSNTILTEGVLLGANLKGANLTNSLMDRVTLDFADLTNAIFVDAI 119
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TR+ I GADFS AV+D Q + LC A+G NP+TG+STR+SLGC
Sbjct: 120 ATRTRFYDTTITGADFSGAVLDRYQVKLLCDRADGVNPVTGISTRESLGC 169
>gi|425438309|ref|ZP_18818714.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
gi|425452591|ref|ZP_18832408.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
gi|440756403|ref|ZP_20935604.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|443646807|ref|ZP_21129485.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159025958|emb|CAO87888.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|389676535|emb|CCH94452.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
gi|389765527|emb|CCI08587.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
gi|440173625|gb|ELP53083.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|443335636|gb|ELS50100.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 166
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/129 (44%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + GS + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGSDLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|443314355|ref|ZP_21043921.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786047|gb|ELR95821.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 173
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 95/182 (52%), Gaps = 14/182 (7%)
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ-FGSADLRK 120
+ + WR + L A+ I+A A IG Q F +DL +
Sbjct: 3 WQRSGEWRQILRGGLLFAIAIVLWGGIAARA------------IAIGEITQDFTYSDLNR 50
Query: 121 AVHVKENFRANFTSADM-RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
EN +A RE++FSG+ + L K YKA GA+L+ + DR++ +
Sbjct: 51 QDFAGENLAGASLAAADAREANFSGADLSQTILTKGNFYKAKLVGANLTQSFADRVIFDG 110
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
A+L+NA++V ++T + G A I+GADFS ++D Q +C+YA+G NP+TGV+TR SL
Sbjct: 111 ADLSNALVVDAIMTSTSFGEATIQGADFSGTILDRYQVAQMCEYADGVNPVTGVATRDSL 170
Query: 240 GC 241
GC
Sbjct: 171 GC 172
>gi|425469693|ref|ZP_18848608.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
gi|389880432|emb|CCI38813.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
Length = 166
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|443322626|ref|ZP_21051645.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442787675|gb|ELR97389.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 164
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/129 (44%), Positives = 73/129 (56%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DL AV F AD+R ++F + + L + V AN T A+L+D L
Sbjct: 43 FSGQDLEGAV---------FADADLRGANFQAANLANSILTQGVFLNANLTKANLTDALA 93
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR EANLT+A+LV + +RS AII GADFS A++D Q LC A GTNP+TG
Sbjct: 94 DRATFAEANLTDAILVNIIASRSSFVDAIITGADFSGAILDKYQVALLCDRAQGTNPVTG 153
Query: 233 VSTRKSLGC 241
VSTR SL C
Sbjct: 154 VSTRASLNC 162
>gi|422303610|ref|ZP_16390961.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
gi|389791366|emb|CCI12792.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
Length = 166
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|425465439|ref|ZP_18844748.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389832325|emb|CCI24153.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 166
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 79/129 (61%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR ++ G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGANLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|300868096|ref|ZP_07112733.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300333934|emb|CBN57911.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 174
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 68/110 (61%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A L K V AN + A+LS L DR+ + ANLTNA +
Sbjct: 64 FVAAEMRNTNFEGADLTNAILTKGVLLNANLSNANLSGALADRVTFDGANLTNANFTEAI 123
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTR+ I GADF+DA+ID Q LC+ A G N +TGVSTR+SLGC
Sbjct: 124 LTRTRFYDTAISGADFTDAIIDSYQVNLLCEKAEGVNSVTGVSTRESLGC 173
>gi|443328655|ref|ZP_21057250.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791786|gb|ELS01278.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 222
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 72/110 (65%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +AD+R S+F GS + + L KA+ N +G DL+++ MDR+ L+ +NL+NA+L +
Sbjct: 112 FAAADVRGSNFEGSDLSNSILTKAIFTDTNLSGVDLTNSFMDRVDLSNSNLSNAILQDII 171
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T ++ I GADFS A+ID Q LC+ A G NP+TGVSTR SLGC
Sbjct: 172 ATSTNFYNTDITGADFSGAIIDRYQTYVLCQRAAGVNPVTGVSTRYSLGC 221
>gi|414075538|ref|YP_006994856.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413968954|gb|AFW93043.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 168
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 68/110 (61%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F + A K V KAN A+L+ L+DR+ + ANL NA+
Sbjct: 58 FVAAEMRGTNFQDANLTNAIFTKGVLLKANLESANLTGALVDRVTFDSANLRNAIFAEAT 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTRS A I GADF+DA+ID Q LC+ A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPVTGISTRDSLGC 167
>gi|425446471|ref|ZP_18826474.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
gi|389733275|emb|CCI02926.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
Length = 166
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|166365075|ref|YP_001657348.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
gi|166087448|dbj|BAG02156.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
Length = 166
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|411119939|ref|ZP_11392315.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410710095|gb|EKQ67606.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 169
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 72/110 (65%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F SA+MR ++FSG+ A K AN +GA+L L+DR L +A+L+NA+L+
Sbjct: 59 FVSAEMRGTNFSGAILTNAMFTKGNLLGANLSGANLEGALLDRTTLYKADLSNAILIDAT 118
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S L A ++GADF++A++D LCK A GTNP TG+STR+SLGC
Sbjct: 119 LSNSILDEATVDGADFTNAIVDRYAVSQLCKRAQGTNPTTGISTRESLGC 168
>gi|425439807|ref|ZP_18820122.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
gi|425456970|ref|ZP_18836676.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
gi|389719892|emb|CCH96344.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
gi|389801790|emb|CCI19079.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
Length = 166
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
Length = 167
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 72/111 (64%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A R +DFSG+ +GA + +A+F+ ADLSD+LMDR + NLTNA+L
Sbjct: 57 SFAGAVGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGV 116
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S GA IEGADFSDA++D LC+ A G NPITG++TR SLGC
Sbjct: 117 IASGSSFAGASIEGADFSDALLDRDDVVRLCRDAEGVNPITGMATRDSLGC 167
>gi|318040416|ref|ZP_07972372.1| hypothetical protein SCB01_01865 [Synechococcus sp. CB0101]
Length = 174
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 72/111 (64%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A + ++F+G+ +GA + +A+F+GADLSD LMDR ++ NL NAVLV
Sbjct: 64 SFAGAVGKGANFAGANLHGAIFTQGAFPEADFSGADLSDVLMDRTDMSHTNLRNAVLVGV 123
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + GA + GADFSDA+ID A ++ LC A+GTNP TG TR SLGC
Sbjct: 124 IAAGASFSGADVTGADFSDALIDRADQRQLCAKASGTNPSTGADTRASLGC 174
>gi|428308896|ref|YP_007119873.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250508|gb|AFZ16467.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 176
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 77/134 (57%), Gaps = 1/134 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S+ + S DL +N A F +A+MR ++F S A L K V AN A+L
Sbjct: 42 SSINYSSTDLTNRDFSHKNLVGAVFVAAEMRGTNFQESDLTNAILTKGVMLGANLQDANL 101
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
+ L+DR+ L+ ANL NA+ + RS A I GADF+DA+ID Q LC+ A+G
Sbjct: 102 TGALVDRVTLDNANLKNAIFQEATMIRSRFYDADITGADFTDAIIDRYQVSLLCEKASGV 161
Query: 228 NPITGVSTRKSLGC 241
NPITGV+TR SLGC
Sbjct: 162 NPITGVATRDSLGC 175
>gi|390440134|ref|ZP_10228485.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
gi|389836418|emb|CCI32609.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
Length = 166
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR + G+ + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + +RS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIASRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|434407744|ref|YP_007150629.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428261999|gb|AFZ27949.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 162
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 76/112 (67%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A++ ++F+G+ GA L +V KAN GADL++ ++D++ L A+L++AV +
Sbjct: 50 AEFSNANLELTNFTGADLRGAVLSASVMTKANLHGADLTNAMVDQVNLTRADLSDAVFIE 109
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ IEGADF+DA++D AQ + LC+ A+G N TGV TR SLGC
Sbjct: 110 ALLLRAIFTDVNIEGADFTDAILDRAQVKELCEKASGVNSQTGVQTRDSLGC 161
>gi|170077406|ref|YP_001734044.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169885075|gb|ACA98788.1| Pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 169
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 77/120 (64%), Gaps = 1/120 (0%)
Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
EN +A +F AD+R SDF+GS + A L + +AN T A+LS+ MD++ + ANLT
Sbjct: 50 HENLQAASFARADVRGSDFTGSDLSRAILTEGKFMEANLTEANLSEAFMDQVNMEGANLT 109
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
NA+ V V ++ AII+GADFS A++D Q LCK A+GTN ITG+ TR SL C N
Sbjct: 110 NALFVDAVAPGTNFAEAIIDGADFSGALLDRYQLSELCKRASGTNTITGIDTRYSLNCKN 169
>gi|33862602|ref|NP_894162.1| hypothetical protein PMT0329 [Prochlorococcus marinus str. MIT
9313]
gi|33634518|emb|CAE20504.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 179
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 77/122 (63%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
K H ++ ++F A R +DFS S +GA L + ++NF+GADLSD LMDR+ +
Sbjct: 58 KDFHAQDLSNSSFAGAVARAADFSNSNLHGAILTQGTFTQSNFSGADLSDALMDRVDFVD 117
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
+L N VL + + S GA I+GADFSDA++DL ++ LC A+G N ITG++T +SL
Sbjct: 118 TDLRNCVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFESL 177
Query: 240 GC 241
C
Sbjct: 178 NC 179
>gi|298489879|ref|YP_003720056.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231797|gb|ADI62933.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 163
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 76/112 (67%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A++ ++F+G+ G +V KAN GA+L++ +++ + LN A+L++A+L+
Sbjct: 51 AEFSNANLEMANFAGADLRGTVFSASVMTKANLHGANLTNAMVNEVKLNGADLSDAILLE 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L RS IEGADFSDA++D +Q Q LCK A+G N TGV TR+SLGC
Sbjct: 111 ALLLRSIFTDVNIEGADFSDAILDRSQIQELCKKASGVNSQTGVETRESLGC 162
>gi|440684176|ref|YP_007158971.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428681295|gb|AFZ60061.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 162
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 77/112 (68%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A++ ++F+G+ GA L +V +AN GADL++ ++D++ LN A+L++A+L+
Sbjct: 51 AEFSNANLEMANFTGADLRGAVLSASVMTQANLHGADLTNAMIDQVKLNGADLSDAILLE 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L RS I GADF+DA++D AQ + LC+ A+G N TGV TR SLGC
Sbjct: 111 ALLLRSIFTDVNIAGADFTDAILDKAQIKELCQKASGVNSRTGVETRDSLGC 162
>gi|434400337|ref|YP_007134341.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271434|gb|AFZ37375.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 169
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/104 (49%), Positives = 68/104 (65%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R ++F GS + + L KAV AN +L+ +LMDR+ L+ +NLTNA++ V T ++
Sbjct: 65 RGANFEGSDLSNSILTKAVFSNANLAEINLTKSLMDRVALDNSNLTNAIIREAVATSTNF 124
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
GA I GADFSD+++D Q LCK A G NP TGVSTR SLGC
Sbjct: 125 DGATITGADFSDSILDRYQIYLLCKRAEGVNPTTGVSTRDSLGC 168
>gi|124023686|ref|YP_001017993.1| hypothetical protein P9303_19861 [Prochlorococcus marinus str. MIT
9303]
gi|123963972|gb|ABM78728.1| Uncharacterized low-complexity proteins [Prochlorococcus marinus
str. MIT 9303]
Length = 179
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 76/122 (62%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
K H ++ +F A R +DFS S GA L + ++NF+GADLSD LMDR+ +
Sbjct: 58 KDFHAQDLSNTSFAGAVARAADFSNSNLRGAILTQGTFTQSNFSGADLSDALMDRVDFVD 117
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
+L N+VL + + S GA I+GADFSDA++DL ++ LC A+G N ITG++T +SL
Sbjct: 118 TDLRNSVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFESL 177
Query: 240 GC 241
C
Sbjct: 178 NC 179
>gi|425462969|ref|ZP_18842432.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
gi|389823905|emb|CCI27601.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
Length = 166
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR V F +A MR ++ + + + L +AV KAN GADL+ +L+
Sbjct: 46 FSHQDLRGGV---------FAAAAMRGANLEEADLSYSILTEAVLLKANLKGADLTASLV 96
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+TG
Sbjct: 97 DRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTG 156
Query: 233 VSTRKSLGC 241
V+TR SLGC
Sbjct: 157 VATRDSLGC 165
>gi|86609913|ref|YP_478675.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558455|gb|ABD03412.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 173
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 83/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +ADL+ +++R ++F SA+++ +D G+ GA KA AN +GADLS++L
Sbjct: 43 FNNADLQGQDLSGQDWRGSSFVSANLQGADLHGANLAGAAFTKANLAGANLSGADLSNSL 102
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D L A+L A L + R+ GA I GADFS+A +D A K+ LC+ A G++PIT
Sbjct: 103 LDLANLAGADLRGAKLTGAIAARAVWQGAQIAGADFSEAYVDRAAKRQLCERAEGSHPIT 162
Query: 232 GVSTRKSLGC 241
GV+TR+SLGC
Sbjct: 163 GVTTRESLGC 172
>gi|17230233|ref|NP_486781.1| hypothetical protein alr2741 [Nostoc sp. PCC 7120]
gi|17131834|dbj|BAB74440.1| alr2741 [Nostoc sp. PCC 7120]
Length = 182
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 82/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + E+ +A F++A++ ++F G+ GA L +V +AN GADL++ +
Sbjct: 52 FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L ANL++ VL +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171
Query: 232 GVSTRKSLGC 241
GV TR SLGC
Sbjct: 172 GVETRDSLGC 181
>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 166
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 92/179 (51%), Gaps = 28/179 (15%)
Query: 72 VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 123
++T L A +V C + ALA KY AE +G+ F LR A
Sbjct: 6 LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56
Query: 124 VKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
N R NFT AD+R + FS S V AN GADLS+ ++D++ A+L
Sbjct: 57 SNANLERTNFTDADLRGTIFSAS----------VMTHANLHGADLSNAMIDQVSFTNADL 106
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++AVL +++ RS I GADFSDA++D AQ + LC A G N TGVSTR SLGC
Sbjct: 107 SDAVLTESIMLRSTFDNVDITGADFSDAILDGAQIKELCTKATGVNSQTGVSTRDSLGC 165
>gi|75910505|ref|YP_324801.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704230|gb|ABA23906.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 182
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 82/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + E+ +A F++A++ ++F G+ GA L +V +AN GADL++ +
Sbjct: 52 FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L ANL++ VL +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171
Query: 232 GVSTRKSLGC 241
GV TR SLGC
Sbjct: 172 GVKTRDSLGC 181
>gi|427722287|ref|YP_007069564.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354007|gb|AFY36730.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 175
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 71/114 (62%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F AD+R SDFSGS + A L + N +GADL++ MD++ L+ ANLTNA+
Sbjct: 62 ASFARADVRSSDFSGSDLSRAILSEGKFMDTNLSGADLTEAFMDQVNLSGANLTNAIFTD 121
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
V ++ A I GADFS A++D Q LCK A+GTN ITG+ TR SL C N
Sbjct: 122 AVAPGTNFTDANIAGADFSGALLDRYQLSQLCKRASGTNAITGIETRYSLNCEN 175
>gi|56750202|ref|YP_170903.1| hypothetical protein syc0193_c [Synechococcus elongatus PCC 6301]
gi|81300170|ref|YP_400378.1| hypothetical protein Synpcc7942_1361 [Synechococcus elongatus PCC
7942]
gi|56685161|dbj|BAD78383.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169051|gb|ABB57391.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 167
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 69/110 (62%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F S +MR+++ + A L V ANF GADLS L+DR+ L A+LT+A+LV
Sbjct: 57 FVSTEMRKANLEEANLRNAILTLGVFLDANFHGADLSGALLDRVFLVGADLTDALLVDVT 116
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TR+ I GADF+DA+ID +++ LC A+G NP TGV+TR SLGC
Sbjct: 117 ATRTSFQDVKITGADFTDAIIDRYEQKQLCLRADGVNPKTGVATRDSLGC 166
>gi|428224653|ref|YP_007108750.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984554|gb|AFY65698.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 187
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 70/110 (63%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F++A++ ++F G+ G +V AN GA+L++ LMD+ L A+L A+L +
Sbjct: 77 FSNANLERANFEGADVRGGVFSASVLTDANLQGANLTNALMDQANLTRADLRGAILSEAI 136
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L S I GADFSDA++D AQ +ALC+ A G NP+TG+STR+SLGC
Sbjct: 137 LLGSTFAETAIAGADFSDAILDGAQIKALCQRAEGVNPVTGLSTRESLGC 186
>gi|427716094|ref|YP_007064088.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348530|gb|AFY31254.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 163
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 83/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L++ E + A F++A++ +++F+G+ GA L +V + N GADL+D L
Sbjct: 33 FSNAELKRHDFSGETLQGAEFSNANLEQANFAGADLRGAVLSASVMTQTNLHGADLTDAL 92
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L +A+L++AVL +L R+ I ADF+DAV+D AQ + LC A+G N T
Sbjct: 93 VDQVNLTKADLSDAVLKEALLLRAIFTDVNINSADFTDAVLDRAQIKELCGKASGVNSKT 152
Query: 232 GVSTRKSLGC 241
GV TR SLGC
Sbjct: 153 GVQTRDSLGC 162
>gi|428206519|ref|YP_007090872.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428008440|gb|AFY87003.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 192
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 76/112 (67%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A+M + +F+ + GA + +V +AN GADLS ++D++ + A+L++AVL
Sbjct: 80 AEFSNANMEQVNFTDADLRGAIMSASVMTQANLHGADLSIAMVDQVKMTGADLSDAVLQE 139
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ G I GADFSDA++D AQ + LC+ A+G N TG++TR+SLGC
Sbjct: 140 ALLLRTIFTGVDITGADFSDAILDGAQVKELCQRASGINSKTGIATRESLGC 191
>gi|428209239|ref|YP_007093592.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428011160|gb|AFY89723.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 165
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 71/113 (62%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA F + + E++FS + GA AV KAN G D S + L+ A+L++A+L
Sbjct: 52 RAEFNNTKLAEANFSSADLRGAVFNSAVLRKANLHGVDFSYGIAYLSDLSAADLSDAILT 111
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ RS+ GA + GADFS+AV+D Q LC+YA+G NP+TGV TR+SLGC
Sbjct: 112 SAMMLRSNFKGAKVTGADFSEAVLDREQVVQLCEYASGVNPVTGVDTRESLGC 164
>gi|428316951|ref|YP_007114833.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240631|gb|AFZ06417.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 165
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 82/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + + RA F++A+M ++FS + GA + +V +AN GA+L++ +
Sbjct: 35 FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLRGAIMSASVMTQANLHGANLTNAM 94
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ A+L++A+L T+L RS G I GADF+DA++D +Q + LC A G N T
Sbjct: 95 IDQVKFTNADLSDAILAETILLRSTFDGVDITGADFTDAIMDGSQVKELCTKATGINSQT 154
Query: 232 GVSTRKSLGC 241
G+STR SLGC
Sbjct: 155 GISTRDSLGC 164
>gi|428306100|ref|YP_007142925.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247635|gb|AFZ13415.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 174
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 79/129 (61%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F + DL AV F +A M+ ++F GS + A L + ANF A+L++ L+
Sbjct: 54 FSNTDLTGAV---------FAAAQMKGANFQGSNLSNAILSQGTLSNANFADANLTNALV 104
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
D++ L+ A+LTNA+ + + ++ + I GADF+DA+ID Q + LC+ A+G NP+T
Sbjct: 105 DQVTLDGADLTNAIFRQATMVGTNFNDSAIAGADFTDAIIDRYQLKQLCQRASGVNPVTA 164
Query: 233 VSTRKSLGC 241
VSTR+SLGC
Sbjct: 165 VSTRESLGC 173
>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 171
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 73/119 (61%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H + +F A R ++FSG+ +GA + +A+F+GADLSD LMDR NL
Sbjct: 53 HGQHLANTSFAGAVGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNL 112
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+AVL + + S A I GADFSDA++DL ++ LC+ A+G NP+TGV+T SLGC
Sbjct: 113 RDAVLTGIIASGSSFSDAQIAGADFSDALLDLDDQRRLCRDADGVNPVTGVATLDSLGC 171
>gi|434390929|ref|YP_007125876.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262770|gb|AFZ28716.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 163
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 81/130 (62%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L+ + RA+ F++A+M +++F+ + GA +V KAN GA+L++ +
Sbjct: 33 FSNAELKGRDFSGQMLRASEFSNANMEQTNFTDADLRGAIFSASVMTKANLHGANLTNAM 92
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
D++ A+L+ AVL T+L RS I ADFSDA++D Q + LC+ A+G NP T
Sbjct: 93 ADQVNFTNADLSAAVLAETILLRSVFDNTDITAADFSDAILDGVQIKELCQRASGVNPTT 152
Query: 232 GVSTRKSLGC 241
GV TR+SLGC
Sbjct: 153 GVDTRESLGC 162
>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 175
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A + ++FSG+ +GA L + ANF GADLSD L+DR ++ +L NAVLV +
Sbjct: 66 FAGAVGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRNAVLVGVI 125
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S GA +E ADF+DA++D A ++ C A+GTNP TG +TR SLGC
Sbjct: 126 ASGSTFTGAQVENADFTDALLDRADQRNFCISASGTNPTTGANTRASLGC 175
>gi|186684198|ref|YP_001867394.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466650|gb|ACC82451.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 174
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 76/112 (67%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A+M ++FS + GA + +V KAN GADL++ ++D++ L +A+L++A+
Sbjct: 62 AEFSNANMELANFSNADLRGAVMSASVMTKANLHGADLTNAMVDQVNLTKADLSDAIFKE 121
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ I+GADF+DA++D AQ + LC+ A+G N TGV TR+SLGC
Sbjct: 122 ALLLRAIFNDVNIDGADFTDAILDRAQIKELCRKASGVNSKTGVQTRESLGC 173
>gi|414079521|ref|YP_007000945.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413972800|gb|AFW96888.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 162
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 74/112 (66%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A++ ++F+G+ G +V KAN GADL++ +++ + L A+L+NAVL+
Sbjct: 50 AEFSNANLEMANFTGADLRGTVFSASVMTKANLHGADLTNAMVNEVKLAGADLSNAVLIE 109
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ I GADF+DA++D AQ + LC+ A+G N TGV TR+SLGC
Sbjct: 110 ALLLRTVFTDVNITGADFTDAILDKAQIKELCQKASGVNSQTGVETRESLGC 161
>gi|307153777|ref|YP_003889161.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984005|gb|ADN15886.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 173
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%)
Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
++N R A F +ADMR + F S + A L + + AN GA+L+ TL+DR+ L+ A+L
Sbjct: 54 EKNLRGAVFAAADMRGASFENSDLSYAILTEGILLNANLKGANLTGTLLDRVTLDFADLR 113
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+A+L + TR+ + I GADF+ AVID Q +C+ A+G N ITGVSTR SLGC
Sbjct: 114 DAILTDAIATRTRFYDSDITGADFTGAVIDTYQISLMCERADGVNSITGVSTRDSLGC 171
>gi|428770661|ref|YP_007162451.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684940|gb|AFZ54407.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 165
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 74/112 (66%), Gaps = 10/112 (8%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF+++D+R G+ FN A LE+A NF GADL++ + LN A+LT+A+L
Sbjct: 63 ANFSNSDLR-----GAVFNAARLEEA-----NFHGADLTNGFIYVTSLNRADLTDAILRE 112
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ L GA ++GADF+ AV+D Q LCK A G NP+TG STR+SLGC
Sbjct: 113 AIMKRTTLKGANVDGADFTFAVLDNEQVIELCKNAQGINPVTGASTRQSLGC 164
>gi|282900610|ref|ZP_06308552.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281194410|gb|EFA69365.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 167
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 11/142 (7%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
E E IG A F DLR + +FT A++R+SDFSGS G A
Sbjct: 36 EYNKEILIG--ADFSQRDLRDS---------SFTKANLRQSDFSGSNLTGVSFFAANLES 84
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
ANFTGADL++ +D ANLTNA+L + + GAII GADF+D ++ ++
Sbjct: 85 ANFTGADLTNATLDSARFIGANLTNAILEGSFAASAKFDGAIIAGADFTDVLLRRDEQNK 144
Query: 220 LCKYANGTNPITGVSTRKSLGC 241
LC+ ANG NP TG TR++L C
Sbjct: 145 LCQVANGINPTTGRHTRETLFC 166
>gi|428777417|ref|YP_007169204.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691696|gb|AFZ44990.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 165
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 5/116 (4%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E + N +AD +++ G+ FNGA L + AN+ G + S+ + +LTNA
Sbjct: 54 EFYDENLEAADFHDANLEGAVFNGATL-----HNANWRGVNFSNGIAYLTDFTGVDLTNA 108
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
VL ++ RS GAI+EGADF++AV+D Q + LC+ A+G NP TGVSTR+SLGC
Sbjct: 109 VLTEAMMLRSKFEGAIVEGADFTNAVVDRLQVKKLCERASGVNPTTGVSTRESLGC 164
>gi|148240085|ref|YP_001225472.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147848624|emb|CAK24175.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
Length = 174
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 67/111 (60%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A + +DFSG+ GA + ANF GADLSD LMDR +L +AVL+
Sbjct: 64 SFAGAAGKGADFSGANLQGAIFTQGAFADANFHGADLSDALMDRADFTGTDLRDAVLIGV 123
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S GA ++GADFSDA++D ++ LC+ A G NP TGV TR SL C
Sbjct: 124 IASGSSFAGAQVDGADFSDALLDRDDQRRLCQEAEGVNPTTGVLTRDSLSC 174
>gi|119509637|ref|ZP_01628783.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
gi|119465656|gb|EAW46547.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
Length = 221
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 66/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+MR ++F G+ A L K V AN A+L L+DR+ ++ ANL NA+
Sbjct: 111 FVAAEMRGANFQGANLKNAILTKGVLLNANLENANLEGALVDRVTMDGANLKNAIFTEAT 170
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+TRS A I GADF+DA+ID Q +C A G N +TGV+TR SLGC
Sbjct: 171 MTRSRFFDADITGADFTDALIDRYQVALMCDRAAGINSVTGVATRDSLGC 220
>gi|334116781|ref|ZP_08490873.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461601|gb|EGK90206.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 165
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 82/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + + RA F++A+M ++FS + GA + +V +AN GA+L++ +
Sbjct: 35 FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLQGAIMSASVMTQANLHGANLTNAM 94
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ A+L++A+L T+L RS G I GADF+DA++D +Q + LC A+G N T
Sbjct: 95 IDQVKFTNADLSDAILAETILLRSTFEGVDITGADFTDAIMDGSQIKELCTKASGINSQT 154
Query: 232 GVSTRKSLGC 241
G+ TR SLGC
Sbjct: 155 GIYTRDSLGC 164
>gi|260435516|ref|ZP_05789486.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
gi|260413390|gb|EEX06686.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
Length = 163
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 88/147 (59%), Gaps = 18/147 (12%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
E RG+F A Q SAD+ + +KE F AD+RE + SG+ GA + +
Sbjct: 28 ELRGQF----AVQEISADMH-GLDLKEK---EFLKADLREVNLSGTDLRGAVINTSQLQG 79
Query: 160 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A+ ADLSD + + L AN TNA+++++ T A I+GADF++AVIDL
Sbjct: 80 ADLRDADLSDVVGFASHFEGADLRGANFTNAMMMQSRFT-----DAQIDGADFTNAVIDL 134
Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
Q++ALC A+G+NPI+GVSTR+SLGC
Sbjct: 135 PQQRALCVRADGSNPISGVSTRESLGC 161
>gi|354568879|ref|ZP_08988040.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353539391|gb|EHC08878.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 172
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A+M+ ++F G+ +G L K +A+ + A+L++ DR++ N+ANLTNA+ +
Sbjct: 62 FAGAEMQGANFQGANLSGTILTKGSFLQADLSNANLAEAFADRVIFNKANLTNAIFRDAM 121
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L S A I GADFS A++D Q + +C A+G NP+TGVSTR+SLGC
Sbjct: 122 LASSRFFEAEITGADFSGAIVDPYQVKLMCDRADGINPVTGVSTRESLGC 171
>gi|86605651|ref|YP_474414.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554193|gb|ABC99151.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 165
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +ADL+ +++R ++F SA+++ +D G+ G KA AN GADLS++L
Sbjct: 35 FSNADLQGQDLSGQDWRGSSFVSANLQGADLQGANLAGVAFTKANLAGANLAGADLSNSL 94
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D L A+L A L + R+ GA I GADFSDA +D A + LC+ A G++PIT
Sbjct: 95 LDLANLAGADLRGANLRGAIAARAVWDGAQIAGADFSDAYVDRAALRQLCQRAEGSHPIT 154
Query: 232 GVSTRKSLGC 241
GVSTR SLGC
Sbjct: 155 GVSTRASLGC 164
>gi|87303664|ref|ZP_01086439.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
gi|87281769|gb|EAQ73734.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
Length = 153
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 80/146 (54%), Gaps = 9/146 (6%)
Query: 105 FGIGSAA---------QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKA 155
G+GSAA Q DL+ +H ++ + F A M D SGS GA +
Sbjct: 7 MGVGSAAAITAPELRGQRALQDLQPDMHGRDLRQQEFLKASMGGFDLSGSDLRGAVFNSS 66
Query: 156 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
N + A+L D + + A+L+ AVL +L +S GA IEGADFSDAV+DL+
Sbjct: 67 DLTNTNLSAANLEDAVAFATRFDGADLSGAVLRNAMLMQSRFTGAQIEGADFSDAVLDLS 126
Query: 216 QKQALCKYANGTNPITGVSTRKSLGC 241
Q +ALC A+G NP TGVST +SLGC
Sbjct: 127 QVKALCSRADGVNPSTGVSTVESLGC 152
>gi|427735661|ref|YP_007055205.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370702|gb|AFY54658.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 168
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 68/112 (60%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
NF SA+MR ++F G+ A K AN GA+ ++ L+D++ L+ ANL NA +
Sbjct: 56 VNFISAEMRGTNFQGADLTNAMFTKGNLLGANLEGANFTNALVDQVTLDNANLKNANFTQ 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++RS A I GADF+DA+ID Q + +C A+G NP TGV TR SLGC
Sbjct: 116 ATMSRSRFFDADITGADFTDAIIDRYQVKLMCDRASGVNPETGVETRYSLGC 167
>gi|282895655|ref|ZP_06303780.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281199349|gb|EFA74214.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 171
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 89/174 (51%), Gaps = 15/174 (8%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
+ V ++ +L + +C +++ A +Y E I A F DLR +
Sbjct: 12 FLVILNLSLLVIIPLTCLVGLTSTALALEYNKE------ILIGADFSQRDLRDS------ 59
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+FT A++R+SDFSGS G A ANFTGADL++ +D ANLTNA+L
Sbjct: 60 ---SFTKANLRQSDFSGSNLTGVSFFAANLESANFTGADLTNATLDSARFIGANLTNAIL 116
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII GADF+D ++ ++ LC+ A G NP TG TRK+L C
Sbjct: 117 EGAFAASAKFDGAIITGADFTDVLLRRDEQNKLCQLAKGINPTTGRHTRKTLFC 170
>gi|220905675|ref|YP_002480986.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219862286|gb|ACL42625.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 162
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 68/110 (61%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A M E++F G+ A L KA +ANF GA+L+D L D + ++L+NA+L
Sbjct: 52 FAAAVMPEANFEGANLRNAILSKAELSQANFRGANLTDVLADGVSWANSDLSNAILAGAT 111
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L + G I GADFSDA+ID LC+ A G NP+TG++TR+SLGC
Sbjct: 112 LIGTTFTGVTITGADFSDALIDRYDVSLLCQRAEGINPVTGIATRESLGC 161
>gi|411116478|ref|ZP_11388965.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410712581|gb|EKQ70082.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 165
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/162 (41%), Positives = 89/162 (54%), Gaps = 21/162 (12%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRE 139
V A+ S+ I A D+ + G+ + S +FG +DL+ A NF AD+R
Sbjct: 24 VYAASSAAIRAYDDVEATTKDYSGQNLVRS--EFGDSDLQGA---------NFAGADLR- 71
Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
G+ FNGA L A N G D SD + A+L++A+L +L +S G
Sbjct: 72 ----GAVFNGAKLTNA-----NLHGVDFSDGIAYITDFANADLSDAILNSAMLLKSSFKG 122
Query: 200 AIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A I GADFSDA ID AQ ALC+ A+GTNP+TGV TR+SLGC
Sbjct: 123 ANITGADFSDAAIDRAQVLALCQTASGTNPVTGVDTRESLGC 164
>gi|427708609|ref|YP_007050986.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427361114|gb|AFY43836.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 189
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 74/112 (66%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A+M ++F+G+ GA L +V KAN ADL++ ++D++ L A+L++AV
Sbjct: 77 AEFSNANMEMANFTGADLRGAVLSASVMTKANLHQADLTNAMVDQVNLTGADLSDAVFKE 136
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ I+GADF+DAV+D AQ + LC A+G N TGV TR+SLGC
Sbjct: 137 ALLLRALFTDVNIQGADFTDAVLDKAQIKELCSKASGVNSKTGVETRESLGC 188
>gi|116072323|ref|ZP_01469590.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
gi|116064845|gb|EAU70604.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
Length = 186
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 71/111 (63%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A R +DF + +GA L + +A+F GADLSD LMDR ++L +AVL+
Sbjct: 76 SFAGATGRGADFRDAILHGAILTQGAFAEADFRGADLSDALMDRADFVASDLRDAVLIGV 135
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S A+IEGADF+DA++D ++ LC+ A+G NP TGVST SLGC
Sbjct: 136 IASGSSFSKALIEGADFTDALLDRDDQRRLCRDADGINPTTGVSTFDSLGC 186
>gi|427729477|ref|YP_007075714.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427365396|gb|AFY48117.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 170
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 83/130 (63%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + ++ +A F++A++ +DF+G+ GA L +V +AN ADL++ +
Sbjct: 41 FSNAELARHDFAGDSLQAAEFSNANLEMTDFTGADLRGAVLSASVMTQANLHKADLTNAM 100
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L A+L++AV +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 101 VDQVNLTGADLSDAVFKEALLLRAIFNDVNIEGADFTDALLDKAQIKELCTKASGVNSQT 160
Query: 232 GVSTRKSLGC 241
GV+TR SLGC
Sbjct: 161 GVATRDSLGC 170
>gi|428222027|ref|YP_007106197.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995367|gb|AFY74062.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 161
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 71/110 (64%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+M ++F + GA ++ AN A+ + ++D++ A+LT+A+LV T+
Sbjct: 51 FANANMENANFERADLRGAVFSASILRNANLRAANFTTGMLDQIDFANADLTDAILVDTL 110
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L RS A I+GADF+DA++D AQ + LC A GTNP TGVSTR+SLGC
Sbjct: 111 LLRSTFDFAKIDGADFTDALLDGAQIKWLCSKAKGTNPFTGVSTRESLGC 160
>gi|218439896|ref|YP_002378225.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172624|gb|ACK71357.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 170
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 74/117 (63%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
K + A F +A+MR + S + + L +AV AN GA+L+ +L+DR+ L+ A+LTN
Sbjct: 52 KNLYGAVFAAANMRGASLENSDLSYSILTEAVLLNANLKGANLTGSLVDRVTLDFADLTN 111
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A+ + +R+ I GADFS A++D Q +C+ A+G NP+TGVSTR+SLGC
Sbjct: 112 AIFTDAIASRTRFYDTTITGADFSGAILDQYQVYLMCERASGVNPVTGVSTRESLGC 168
>gi|427420479|ref|ZP_18910662.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425756356|gb|EKU97210.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 169
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 71/108 (65%)
Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
+A++R ++F G+ + L KA + + TGA+LS+T DR+ ++LTNAV+ ++T
Sbjct: 60 AAEVRNANFRGADLSATILTKAKFIRTDLTGANLSETFADRVEFTGSDLTNAVVTDALMT 119
Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
S A I GADFS ++D Q + LC+ A+G NP+TGVSTR+SLGC
Sbjct: 120 SSTFADATITGADFSYTILDRFQVKYLCERADGMNPVTGVSTRESLGC 167
>gi|78212794|ref|YP_381573.1| hypothetical protein Syncc9605_1263 [Synechococcus sp. CC9605]
gi|78197253|gb|ABB35018.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 169
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 88/147 (59%), Gaps = 18/147 (12%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
E RG+F A Q SAD+ + +KE F AD+RE + SG+ GA + +
Sbjct: 34 ELRGQF----AVQEISADM-HGLDLKEK---EFLKADLREVNLSGTDLRGAVINTSQLQG 85
Query: 160 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A+ A+LSD + + L AN TNA+++++ T A I+GADF++AVIDL
Sbjct: 86 ADLRDANLSDVVGFASHFEGADLRGANFTNAMMMQSRFT-----DAQIDGADFTNAVIDL 140
Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
Q++ALC A+G+NPI+GVSTR+SLGC
Sbjct: 141 PQQRALCARADGSNPISGVSTRESLGC 167
>gi|78185103|ref|YP_377538.1| hypothetical protein Syncc9902_1536 [Synechococcus sp. CC9902]
gi|78169397|gb|ABB26494.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 182
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 70/111 (63%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A R +DF + +GA L + +A+F GADLSD LMDR +L +AVL+
Sbjct: 72 SFAGATGRGADFRDANLHGAILTQGAFAEADFRGADLSDALMDRADFVATDLRDAVLIGV 131
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S A+IEGADF+DA++D ++ LC+ A+G NP TG+ST SLGC
Sbjct: 132 IASGSSFSKALIEGADFTDALLDRDDQRLLCRDADGINPTTGISTFDSLGC 182
>gi|428227020|ref|YP_007111117.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986921|gb|AFY68065.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 166
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 71/114 (62%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+A F++A+++ +DFSG+ GA + AN G D SD + ++ANL++AVL
Sbjct: 52 LQAEFSNANLKNADFSGADLRGAVFNGSTLVHANLRGVDFSDGIAYISDFSDANLSDAVL 111
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L +S GA + GADF+DAV+D AQ LCK A+G N ITG TR+SLGC
Sbjct: 112 SSAMLLKSRFTGADVTGADFTDAVLDRAQVLQLCKTASGVNSITGADTRESLGC 165
>gi|16332305|ref|NP_443033.1| hypothetical protein sll0577 [Synechocystis sp. PCC 6803]
gi|383324046|ref|YP_005384900.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383327215|ref|YP_005388069.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383493099|ref|YP_005410776.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384438367|ref|YP_005653092.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
gi|451816456|ref|YP_007452908.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
gi|1653935|dbj|BAA18845.1| sll0577 [Synechocystis sp. PCC 6803]
gi|339275400|dbj|BAK51887.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
gi|359273366|dbj|BAL30885.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276536|dbj|BAL34054.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279706|dbj|BAL37223.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407960039|dbj|BAM53279.1| hypothetical protein BEST7613_4348 [Synechocystis sp. PCC 6803]
gi|451782425|gb|AGF53394.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
Length = 169
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 76/139 (54%), Gaps = 16/139 (11%)
Query: 114 GSADLRKAVHVKENFR------ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 162
G++ V + +FR A FT+ D+ S D GS FNGA L A N
Sbjct: 35 GASAFENMVLAETDFRDQDLLTAQFTNVDLTSSIFEAMDLRGSVFNGANLTDA-----NL 89
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
G DL++ L N ANL NA+L ++ R+ A I+GADFS AV+D Q ALCK
Sbjct: 90 KGVDLTNGLTYLTSFNGANLENAILAEAIMLRTSFKNAKIQGADFSLAVLDTEQIAALCK 149
Query: 223 YANGTNPITGVSTRKSLGC 241
A+G NP TG+STR+SLGC
Sbjct: 150 VADGVNPKTGISTRESLGC 168
>gi|282902031|ref|ZP_06309929.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281193118|gb|EFA68117.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 162
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 81/130 (62%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + +N +A F++A++ ++F+ + GA +V +AN GADL++ +
Sbjct: 32 FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L +A+L++A+ + +L RS+ I+GADFS A++D Q + LCK A G N T
Sbjct: 92 LDQVKLTDADLSDAIFIEAILLRSNFAKTNIDGADFSKAILDRGQIRDLCKSARGINSRT 151
Query: 232 GVSTRKSLGC 241
V TR SLGC
Sbjct: 152 HVQTRDSLGC 161
>gi|317969830|ref|ZP_07971220.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 178
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 77/135 (57%), Gaps = 10/135 (7%)
Query: 112 QFGSADLRKAVH-----VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
Q + DL+ +H +E +A+ D+ E+D G+ FN A L+ A N + AD
Sbjct: 48 QRSAQDLQPDMHGRNLQQQEFLKASMEGFDLSETDLRGAVFNTANLQNA-----NLSAAD 102
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L D + + A+L+ AV +L S GA+IEG DF+DAV+DL Q++ALC A+G
Sbjct: 103 LEDAVAFATRFDNADLSGAVFRNAMLMNSKFTGAVIEGTDFTDAVLDLPQQKALCARASG 162
Query: 227 TNPITGVSTRKSLGC 241
NP TGV TR+SL C
Sbjct: 163 VNPRTGVDTRESLAC 177
>gi|87125517|ref|ZP_01081362.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
gi|86166817|gb|EAQ68079.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
Length = 180
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 71/119 (59%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H ++ +F A R +DFS + +GA + A+F GADLSD LMDR + +L
Sbjct: 62 HGQDLRNTSFAGAVGRGADFSDANLHGAIFTQGAFANADFHGADLSDALMDRADFSGTDL 121
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L + + S GA IEGADFSDA++D + LC+ A G++P TGVSTR+SLGC
Sbjct: 122 RGTLLSGVIASGSSFAGAQIEGADFSDALLDRDDVRRLCRDAEGSHPHTGVSTRESLGC 180
>gi|443312247|ref|ZP_21041866.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442777717|gb|ELR87991.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 162
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 80/130 (61%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L+ + RA F++A+M ++FS + GA +V A+ GADLS+ +
Sbjct: 32 FSNAELKSRDFSGQTLRAAEFSNANMELANFSNADLRGAVFSASVMTGASLHGADLSNAM 91
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L +A+L++AVL +L R+ I ADF+DA++D AQ + LC A+G NP T
Sbjct: 92 VDQVNLTKADLSDAVLTEALLLRAIFDDVSIVNADFTDAILDRAQIKELCAKASGVNPKT 151
Query: 232 GVSTRKSLGC 241
GV TR SLGC
Sbjct: 152 GVETRYSLGC 161
>gi|428302010|ref|YP_007140316.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238554|gb|AFZ04344.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 162
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 72/112 (64%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A+M +DF G+ GA + + KAN GA+L++ L+D++ L A+L++AVL
Sbjct: 50 AEFSNANMELADFRGADLRGAVMSASTMTKANLHGANLANALVDQVNLTGADLSDAVLQE 109
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ I GADF+DA++D AQ + LC A+G N TGV TR SLGC
Sbjct: 110 ALLLRAIFTDVKINGADFTDAILDGAQIRELCNIASGVNSQTGVETRYSLGC 161
>gi|412992118|emb|CCO19831.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 293
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 80/143 (55%), Gaps = 9/143 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S F + DLR + V+ A++R ++FS S GA + +++ A+ G+D+S
Sbjct: 145 SNEDFSNLDLRGTIWVE---------AELRNTNFSKSDMRGAVMTRSIMPNADVHGSDVS 195
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+ L D ++L AN +AV V RSD+G I+ ADF++AVID Q LC+ A G N
Sbjct: 196 NVLFDYVLLRGANFEDAVAVGANFIRSDMGEMKIKNADFTEAVIDRYQVLGLCETAEGVN 255
Query: 229 PITGVSTRKSLGCGNSRRNAYGS 251
P TGV TR SLGC + + GS
Sbjct: 256 PYTGVDTRMSLGCDSFVKKYEGS 278
>gi|254415547|ref|ZP_05029307.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196177728|gb|EDX72732.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 165
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 68/112 (60%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F AD+RES+FS ++ G A ANF GA+LS + +D+ LN ANL NAVL
Sbjct: 53 ASFNQADLRESNFSHAELQGVSFFGANLKLANFEGANLSYSTLDKARLNGANLKNAVLEG 112
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GA IEGADF+DA +D ++ LC+ A GTNP TG TR +L C
Sbjct: 113 AYAFNAQFDGATIEGADFTDAFLDPKAEEKLCQMATGTNPTTGRQTRDTLFC 164
>gi|88808683|ref|ZP_01124193.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
gi|88787671|gb|EAR18828.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
Length = 176
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 92/179 (51%), Gaps = 8/179 (4%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
A L N R ++TAL AA+V L D EA T E A Q D+ +
Sbjct: 5 ALLCNLRRHLTTALLAALVVFTG----VLIDGPSVEAITAPELRGQRAVQ----DITSDM 56
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H ++ F AD+RE D + GA + + A+ GADL D + + A+L
Sbjct: 57 HGRDLKEKEFLKADLREVDLGEADLRGAVINTSQLQGADLRGADLEDVVAFSSRFDGADL 116
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA +L +S A IEG DF++AVIDL+Q +ALC A+G N ++GVST++SLGC
Sbjct: 117 RNANFTNAMLMQSRFNDAEIEGTDFTNAVIDLSQLKALCGRASGVNSLSGVSTKESLGC 175
>gi|318041364|ref|ZP_07973320.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 170
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 73/122 (59%), Gaps = 5/122 (4%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
+ + +E +AN D ESD G+ FN A L+ A N ADL D + +
Sbjct: 53 RNLQQQEFLKANLEGFDFSESDLRGAVFNTANLQGA-----NLHAADLEDAVAFASRFDN 107
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
A+L++AVL +L S G++I+GADF+DAV+DL Q++ALC+ A GTN TGV+TR SL
Sbjct: 108 ADLSDAVLRNAMLMNSKFAGSVIDGADFTDAVLDLPQQKALCERAGGTNARTGVNTRDSL 167
Query: 240 GC 241
C
Sbjct: 168 NC 169
>gi|443321745|ref|ZP_21050787.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442788515|gb|ELR98206.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 149
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 71/112 (63%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F A +RES+FS + G A NF GA+L++ +D LN+ANL NA+L+
Sbjct: 37 ASFDLASLRESNFSHANLTGVRFFSANLESVNFEGANLTNATLDSARLNDANLKNAILIG 96
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ + + G IEGADF+DA+I +++ LCK A GTNP+TG TR++L C
Sbjct: 97 AFVSNAKVQGVNIEGADFTDALILPYEQKLLCKVAQGTNPVTGRDTRETLFC 148
>gi|88809155|ref|ZP_01124664.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
gi|88787097|gb|EAR18255.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
Length = 180
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 66/111 (59%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A + +DFSG+ GA + ANF GADLSD LMDR +L +AVL+
Sbjct: 70 SFAGAAAKGADFSGANLQGAIFTQGAFADANFRGADLSDALMDRADFTGTDLRDAVLIGV 129
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S A ++GADFSDA++D ++ LC+ A G NP TGV TR SL C
Sbjct: 130 IASGSSFARAQVDGADFSDALLDRDDQRKLCQEAEGLNPTTGVLTRDSLSC 180
>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 182
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 72/112 (64%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
++F A R++ F + +GA L +A +A+F GADLSD LMD++ ++ +LT AVL
Sbjct: 70 SSFAGATGRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRG 129
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S+ GA + ADF+DA++D ++ LC+ A GTNP+TG TR SL C
Sbjct: 130 AIASGSNFTGATVTDADFTDALLDRVDQRNLCREARGTNPVTGADTRLSLDC 181
>gi|298492040|ref|YP_003722217.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298233958|gb|ADI65094.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 167
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/131 (40%), Positives = 75/131 (57%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F DLR + +FT A++R+S+F+G+ G A AN GADL++
Sbjct: 45 ADFSRRDLRDS---------SFTKANLRQSNFTGANLRGVSFFAANLESANLEGADLTNA 95
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D L ANLTNAVL + GAI++GADF+DA++ +++ LC A GTNPI
Sbjct: 96 TLDSARLIRANLTNAVLEGAFAASAKFDGAIVDGADFTDALLRQDEQKKLCNLAKGTNPI 155
Query: 231 TGVSTRKSLGC 241
TG TR++L C
Sbjct: 156 TGRDTRETLFC 166
>gi|411119374|ref|ZP_11391754.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711237|gb|EKQ68744.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 182
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 73/112 (65%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A++ +F+ + GA + + + GADL+ ++D++ + +L++A+L
Sbjct: 71 AEFSNANLNRVNFTNADLRGAVMSASTMVDTSLHGADLTQAMLDQVKMIRTDLSDAILAN 130
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T+L R+ +EGADF+DA++D AQ +ALC++A+G N TGVSTR SLGC
Sbjct: 131 TILLRTTFENINLEGADFTDAILDGAQVKALCQFASGANSKTGVSTRDSLGC 182
>gi|443478408|ref|ZP_21068166.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443016315|gb|ELS31005.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 150
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 70/110 (63%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +A+M ++F + GA ++ KAN G D S L+D+ +A+L+NA+LV T+
Sbjct: 40 FANANMEGANFENADVRGAVFSASILRKANLKGTDFSGGLLDQADFAKADLSNALLVETI 99
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L RS I+GADF+DA++D AQ++ LC A GTN TG++TR+SL C
Sbjct: 100 LLRSTFDFVNIDGADFTDAIMDGAQRKWLCSKAKGTNAKTGINTRESLEC 149
>gi|260435480|ref|ZP_05789450.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
gi|260413354|gb|EEX06650.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
Length = 173
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 67/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A R ++F G+ +GA L + +A+F GADLSD LMDR +L NAVL +
Sbjct: 64 FAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVATDLRNAVLTGII 123
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S A IEGADF+DA++D ++ LC A+G NP TGVST SLGC
Sbjct: 124 ASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVSTFDSLGC 173
>gi|428772631|ref|YP_007164419.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428686910|gb|AFZ46770.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 166
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 73/130 (56%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F S L+ +N + A FT ++ ++ F+ S GA A ANF+G D+SD L
Sbjct: 36 FESKSLKGEDFTNQNLQLAEFTKVNLEDAKFNDSDLRGAVFNGVNAEGANFSGVDMSDGL 95
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+ N +L+NA+ ++ R+ A +EGADF+ AV+D Q LCK A+G NP+T
Sbjct: 96 VYVTSFNNTDLSNAIFRDAIMLRTTFKNANVEGADFTFAVLDSEQVNQLCKNASGVNPVT 155
Query: 232 GVSTRKSLGC 241
STR+SLGC
Sbjct: 156 NASTRQSLGC 165
>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
Length = 180
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 70/111 (63%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F A R +DFSG+ +GA L +A +A+F GADLS LMD++ + A+ T A L
Sbjct: 70 SFAGAAGRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTGADLSDV 129
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + S+ GA + ADF+ A+ID ++ LC+ A GT+P+TG TR SLGC
Sbjct: 130 IASGSNFSGATVTNADFTGALIDRVDQRLLCRDAEGTHPLTGADTRLSLGC 180
>gi|33865660|ref|NP_897219.1| hypothetical protein SYNW1126 [Synechococcus sp. WH 8102]
gi|33632830|emb|CAE07641.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 190
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 89/163 (54%), Gaps = 4/163 (2%)
Query: 79 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR 138
++VA+ +S L N +A T E A Q +AD+ + +KE F AD+R
Sbjct: 30 SLVAAILVVVSTLLWTNSAQAITAPELRGQRAVQEITADM-HGLDLKEK---EFLKADLR 85
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
E + S + GA + + A+ GADLS+ + + A+L A +L +S
Sbjct: 86 EVNLSDTDLRGAVINTSQLQGADLRGADLSNVVGFASRFDGADLRGATFTNAMLMQSRFA 145
Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A IEGADF+DAV+DL Q++ LC A G +P++GVSTR+SLGC
Sbjct: 146 DARIEGADFTDAVLDLPQQKLLCATAAGEHPVSGVSTRESLGC 188
>gi|308814214|ref|XP_003084412.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
gi|116056297|emb|CAL56680.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
Length = 186
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 69/118 (58%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+ +D+R ++ S + GA +A+ D S+ + D VL A++ + V
Sbjct: 46 YAESDLRNANISNTDARGAVFSRAIMPGVKLNATDASNAMFDYAVLRGADMRDGVFANAN 105
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAY 249
R+D+G A+IEGADFS+AVID + LC+ A+GTNP TG+ TR +LGC +SR + Y
Sbjct: 106 FVRADMGEAMIEGADFSEAVIDRYEAIRLCERASGTNPWTGIETRATLGCDDSRVSKY 163
>gi|87124337|ref|ZP_01080186.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
gi|86167909|gb|EAQ69167.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
Length = 178
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 68/120 (56%)
Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
+H + F AD++ D SGS GA + + A+ GADLSD + + A+
Sbjct: 58 MHGMDLKEKEFLKADLQGVDLSGSDLRGAVINTSSLQGADLQGADLSDVVAFASRFDGAD 117
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L NAV +L +S G A I+GADF+DAVIDL Q +ALC A G N TGV TR SLGC
Sbjct: 118 LRNAVFTNAMLMQSRFGDAQIDGADFTDAVIDLPQLKALCARAAGENSRTGVLTRDSLGC 177
>gi|123968679|ref|YP_001009537.1| hypothetical protein A9601_11461 [Prochlorococcus marinus str.
AS9601]
gi|126696485|ref|YP_001091371.1| hypothetical protein P9301_11471 [Prochlorococcus marinus str. MIT
9301]
gi|123198789|gb|ABM70430.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
gi|126543528|gb|ABO17770.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 172
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 66/104 (63%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L NAVL+ + + S
Sbjct: 68 RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRNAVLINMIASGSSF 127
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
GA IEGADFS A++D ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171
>gi|352096257|ref|ZP_08957137.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351676951|gb|EHA60102.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 177
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 71/119 (59%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H + +F A R +DFS + G +A +ANF GA+LSD LMDR ++ +L
Sbjct: 59 HGQNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDL 118
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+A+L + S GA IEGADF+DA++D ++ LC+ A+G NP +GV+TR SL C
Sbjct: 119 RDALLQGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNPSSGVATRDSLDC 177
>gi|78779436|ref|YP_397548.1| hypothetical protein PMT9312_1053 [Prochlorococcus marinus str. MIT
9312]
gi|78712935|gb|ABB50112.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 172
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 66/104 (63%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L NAVL+ + + S
Sbjct: 68 RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRNAVLINMIASGSSF 127
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
GA IEGADFS A++D ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 128 AGAKIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171
>gi|172036187|ref|YP_001802688.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354552985|ref|ZP_08972292.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171697641|gb|ACB50622.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353554815|gb|EHC24204.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 179
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 68/112 (60%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+AD+ +S+FS + GA + A+ GADL++ L A+LTNAVL
Sbjct: 67 AQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTE 126
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ A I GADFS AV+D+ + LC A+G NP TGVSTR+SLGC
Sbjct: 127 AIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADGVNPKTGVSTRESLGC 178
>gi|303287274|ref|XP_003062926.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455562|gb|EEH52865.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 182
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 73/142 (51%), Gaps = 15/142 (10%)
Query: 120 KAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
KA HV E+F A +T D+R SDFSGS A +A+ N G+D+ + +D
Sbjct: 17 KAEHVNEDFSHSDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAIMPGVNLEGSDMQNAFLD 76
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA---------LCKYA 224
+VL AN+ + RSDLG + ADF++AVID Q ++ LC A
Sbjct: 77 YVVLRGANMRGVIASGANFVRSDLGDVDVTNADFTEAVIDRYQARSISHWSPYDPLCDGA 136
Query: 225 NGTNPITGVSTRKSLGCGNSRR 246
+G N TGV TR SLGC +R
Sbjct: 137 SGVNEFTGVDTRDSLGCDRLKR 158
>gi|72382551|ref|YP_291906.1| hypothetical protein PMN2A_0712 [Prochlorococcus marinus str.
NATL2A]
gi|72002401|gb|AAZ58203.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 184
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L N++LV + + S
Sbjct: 71 RDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRNSILVNMIASGSSF 130
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
GA IEGADF+ A++D ++ LCK A+G NP TGVSTR SL C + PS P
Sbjct: 131 AGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGDK------PSMP 182
>gi|282897737|ref|ZP_06305736.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281197416|gb|EFA72313.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 162
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 80/130 (61%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L + +N +A F++A++ ++F+ + GA +V +AN GADL++ +
Sbjct: 32 FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+D++ L A+L++A+ + +L RS A I+GADF++A++D Q LCK A G N T
Sbjct: 92 LDQVKLTGADLSDAIFLEAILLRSIFTEANIDGADFTEAILDRGQVGELCKSARGVNSQT 151
Query: 232 GVSTRKSLGC 241
V TR SLGC
Sbjct: 152 HVQTRDSLGC 161
>gi|113953693|ref|YP_729958.1| hypothetical protein sync_0742 [Synechococcus sp. CC9311]
gi|113881044|gb|ABI46002.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 190
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 71/119 (59%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H + +F A R +DFS + G +A +ANF GA+LSD LMDR ++ +L
Sbjct: 72 HGQNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDL 131
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+A+LV + S GA IEGADF+DA++D ++ LC+ A+G N +GVSTR SL C
Sbjct: 132 RDALLVGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNSSSGVSTRDSLDC 190
>gi|428211433|ref|YP_007084577.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999814|gb|AFY80657.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 166
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 75/112 (66%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++AD++ ++FS + GA ++ +ANF GADL++++++ L A+ T+AVLV
Sbjct: 54 AEFSNADLQFTNFSNVQAEGAIFSLSMMKEANFHGADLTNSMLEWTNLTNADFTDAVLVE 113
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ +++ + GADF+DA++D AQ + LC+ A+G N TGV TR+SLGC
Sbjct: 114 ALFLGANVKKMKVTGADFTDAILDGAQVKQLCENASGVNSKTGVDTRESLGC 165
>gi|33240611|ref|NP_875553.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33238139|gb|AAQ00206.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 183
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 2/134 (1%)
Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
+H ++ +++ A R+S+ S +G + A +N G +L+DTL DR+ + +
Sbjct: 50 LHGQDLSKSSIAGATARDSNLSDVDLHGTVVTLADLKGSNLNGINLTDTLSDRVNFQKTD 109
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L NAVLV + + S GA IEGADFS AV+D ++ LC+ A GTNP TG+STR+SL C
Sbjct: 110 LRNAVLVNMIASGSSFAGAQIEGADFSYAVLDSDDQRNLCEIAEGTNPQTGISTRESLEC 169
Query: 242 GNSRRNAYGSPSSP 255
S R P P
Sbjct: 170 --SERGVGYKPPMP 181
>gi|157413511|ref|YP_001484377.1| hypothetical protein P9215_11761 [Prochlorococcus marinus str. MIT
9215]
gi|254526043|ref|ZP_05138095.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
gi|157388086|gb|ABV50791.1| conserved hpothetical protein [Prochlorococcus marinus str. MIT
9215]
gi|221537467|gb|EEE39920.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
Length = 172
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 65/104 (62%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L NAVL+ + + S
Sbjct: 68 RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRNAVLINMIASGSSF 127
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
GA IEGADFS A++D ++ LC+ A+G NP TGVSTR SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRDSLEC 171
>gi|124026254|ref|YP_001015370.1| hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
NATL1A]
gi|123961322|gb|ABM76105.1| Hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
NATL1A]
Length = 184
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L N++LV + + S
Sbjct: 71 RDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRNSILVNMIASGSSF 130
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
GA IEGADF+ A++D ++ LCK A+G NP TGVSTR SL C + PS P
Sbjct: 131 AGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGDK------PSIP 182
>gi|126657693|ref|ZP_01728847.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
gi|126620910|gb|EAZ91625.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
Length = 167
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+AD+ +S+FS + GA + A+ GADL++ L A+LT+AVL
Sbjct: 55 AQFTNADLTDSNFSKADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTDAVLTE 114
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ A I GADFS AV+D+ + + LC A+G NP TG+STR+SLGC
Sbjct: 115 AIMMRTKFDDAKITGADFSLAVLDIYEVEKLCDRADGVNPKTGISTRESLGC 166
>gi|123966365|ref|YP_001011446.1| hypothetical protein P9515_11321 [Prochlorococcus marinus str. MIT
9515]
gi|123200731|gb|ABM72339.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 172
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 64/104 (61%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L N++L+ + + S
Sbjct: 68 RDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRNSILINMIASGSSF 127
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
GA IEGADFS A++D ++ LCK A G NP TGVSTR SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171
>gi|78212400|ref|YP_381179.1| hypothetical protein Syncc9605_0856 [Synechococcus sp. CC9605]
gi|78196859|gb|ABB34624.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 181
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 50/110 (45%), Positives = 67/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A R ++F G+ +GA L + +A+F GADLSD LMDR +L NAVL +
Sbjct: 72 FAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVGTDLRNAVLNGII 131
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S A IEGADF+DA++D ++ LC A+G NP TGV+T SLGC
Sbjct: 132 ASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVATFDSLGC 181
>gi|116070665|ref|ZP_01467934.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
gi|116066070|gb|EAU71827.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
Length = 169
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 78/134 (58%), Gaps = 3/134 (2%)
Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
GS++ G + + +KE F A++R+ + SG+ GA + A+ A+L
Sbjct: 37 GSSSYQGITEDMHGMDLKEK---EFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANL 93
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
SD + + A+L AVL +L +S A IEGADF+DAVIDL Q++ALC A+G
Sbjct: 94 SDVVGFASRFDGADLRGAVLTNAMLMQSRFTDAQIEGADFTDAVIDLPQQRALCSSADGV 153
Query: 228 NPITGVSTRKSLGC 241
NP +GVSTR+SLGC
Sbjct: 154 NPQSGVSTRESLGC 167
>gi|352094392|ref|ZP_08955563.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351680732|gb|EHA63864.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 172
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 76/135 (56%), Gaps = 10/135 (7%)
Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
QF D+ +H ++ F AD+R D S + GA + + A+ GA+L D +
Sbjct: 42 QFAVQDISNDMHGRDLKEKEFLKADLRGVDLSDTDLRGAVINTSQLQGADLHGANLEDVV 101
Query: 172 -----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
D L++AN TNA+L+++ A IEG DF++AVIDL Q +ALC A+G
Sbjct: 102 AFSSRFDETDLSDANFTNAMLMQSRFV-----DARIEGTDFTNAVIDLTQMKALCGRASG 156
Query: 227 TNPITGVSTRKSLGC 241
N ++GVSTR+SLGC
Sbjct: 157 VNSVSGVSTRESLGC 171
>gi|78184792|ref|YP_377227.1| hypothetical protein Syncc9902_1219 [Synechococcus sp. CC9902]
gi|78169086|gb|ABB26183.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 169
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 78/134 (58%), Gaps = 3/134 (2%)
Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
GS++ G + + +KE F A++R+ + SG+ GA + A+ A+L
Sbjct: 37 GSSSYQGITEDMHGMDLKEK---EFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANL 93
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
SD + + A+L AVL +L +S A IEGADF+DAVIDL Q++ALC A+G
Sbjct: 94 SDVVGFASRFDGADLRGAVLTNAMLMQSRFTDAQIEGADFTDAVIDLPQQRALCSSADGV 153
Query: 228 NPITGVSTRKSLGC 241
NP +GVSTR+SLGC
Sbjct: 154 NPQSGVSTRESLGC 167
>gi|33861598|ref|NP_893159.1| hypothetical protein PMM1042 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33634175|emb|CAE19501.1| conserved hpothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 172
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 64/104 (61%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R++DFS +G L + +N G DL+DTL DR+ + +L N++L+ + + S
Sbjct: 68 RDADFSEVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRNSILINMIASGSSF 127
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
GA IEGADFS A++D ++ LCK A G NP TGVSTR SL C
Sbjct: 128 AGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171
>gi|113953830|ref|YP_730899.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
gi|113881181|gb|ABI46139.1| Secreted pentapeptide repeats protein [Synechococcus sp. CC9311]
Length = 172
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 10/135 (7%)
Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
QF D+ + +H ++ F AD+R D S + GA + + A+ GA+L D +
Sbjct: 42 QFALQDISEDMHGRDLKEKEFLKADLRGIDLSDTDLRGAVINTSQLQGADLHGANLEDVV 101
Query: 172 -----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
D L++AN TNA+L+++ A IEG DF++AVIDL Q +ALC A+G
Sbjct: 102 AFSSRFDETDLSDANFTNAMLMQSRFV-----DARIEGTDFTNAVIDLTQLKALCGRASG 156
Query: 227 TNPITGVSTRKSLGC 241
N ++GVSTR+SLGC
Sbjct: 157 VNSVSGVSTRESLGC 171
>gi|254430802|ref|ZP_05044505.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
gi|197625255|gb|EDY37814.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
Length = 173
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 76/130 (58%), Gaps = 10/130 (7%)
Query: 117 DLRKAVH-----VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
DL+ +H +E +A+ D E+D G+ FNG+ L +A + + A+L D +
Sbjct: 48 DLQPDMHGRNLRQQEFLKASLEGFDFSEADLRGAVFNGSSLREA-----DLSAANLEDVV 102
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+++NL A+L +L +S G+ I GADFSDAV+DL +++ALC A G NP T
Sbjct: 103 AYATRFDDSNLEGAILRNAMLMQSRFKGSSITGADFSDAVLDLPEQKALCARATGVNPST 162
Query: 232 GVSTRKSLGC 241
GVSTR+SL C
Sbjct: 163 GVSTRESLAC 172
>gi|427701840|ref|YP_007045062.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345008|gb|AFY27721.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 184
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 76/126 (60%), Gaps = 6/126 (4%)
Query: 117 DLR-KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DLR + + +E +A+ D+R++D G+ FN L +A + GADL D +
Sbjct: 63 DLRGRNLQQQEFLKASMEGFDLRDADLRGAVFNSTDLRQA-----DLRGADLEDVVAFAT 117
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
+ A+L A +L +S A I+GADFSDAV+DL +++ALC A+G++P+TGV T
Sbjct: 118 RFDGADLRGAQFRNAMLMQSRFRDARIDGADFSDAVLDLPEQKALCARASGSHPLTGVDT 177
Query: 236 RKSLGC 241
R+SLGC
Sbjct: 178 RESLGC 183
>gi|159903694|ref|YP_001551038.1| hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
9211]
gi|159888870|gb|ABX09084.1| Hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
9211]
Length = 183
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 72/120 (60%)
Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
+H ++ +++ A R+++ S +G + A +N G DL+DTL DR+ + +
Sbjct: 50 LHGQDLSKSSIAGATARDANLSDVDLHGTVVTLADLKGSNLNGIDLTDTLSDRVNFQKTD 109
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L NAVLV + + S GA+I GADFSD+V+D ++ LC+ A G NP TG++TR SL C
Sbjct: 110 LRNAVLVNMIASGSSFAGALIAGADFSDSVLDRDDQRNLCEIAEGVNPKTGIATRDSLEC 169
>gi|220907989|ref|YP_002483300.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864600|gb|ACL44939.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 171
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 67/113 (59%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ F A + ++ SG+ GA AV AN G + SD + ++A+L NAVL
Sbjct: 58 QVEFGDARLSGANLSGANLRGAVFNAAVLTGANLQGVNFSDGIGYLCDFSDADLENAVLD 117
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L +S+ GA I GADFS A++D Q LC+YA+G NP TGVSTR+SLGC
Sbjct: 118 SAMLLKSEFKGAKINGADFSFALLDRPQVLQLCEYASGVNPTTGVSTRESLGC 170
>gi|414079727|ref|YP_007001151.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413973006|gb|AFW97094.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 167
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 90/178 (50%), Gaps = 15/178 (8%)
Query: 64 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 123
K +NW +S L A + + ++ A +Y E I +A F DL +
Sbjct: 4 KHRNWISILSLLLWAIISTTALASFVPTAVALEYNKE------ILISADFSGRDLTDS-- 55
Query: 124 VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
+FT A++R S+FS S G A AN GADL++T +D L +A+LT
Sbjct: 56 -------SFTKANLRYSNFSHSNLRGVSFFAANLESANLQGADLTNTTLDSARLIKADLT 108
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA+L + GAII+GADF+D ++ +++ LCK A GTNP+T TR +L C
Sbjct: 109 NAILEGAFAANARFDGAIIDGADFTDVLLRQDEQKKLCKLAKGTNPVTKRDTRDTLYC 166
>gi|17232102|ref|NP_488650.1| hypothetical protein alr4610 [Nostoc sp. PCC 7120]
gi|17133747|dbj|BAB76309.1| alr4610 [Nostoc sp. PCC 7120]
Length = 164
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/189 (37%), Positives = 101/189 (53%), Gaps = 38/189 (20%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K+WRV VS LA + A+ SS+I+ A + G+ IGS +F + D
Sbjct: 1 MKDWRVVVSFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLM 172
L EN ANF++AD+R F+G+ G L + +AY ANF ADLSD +
Sbjct: 59 L-------EN--ANFSNADLRGGVFNGTVLEGVNLHGVDFSEGIAYLANFKNADLSDAI- 108
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
LTNA+++R++ + + GADF++AV+D+ Q + LC ANG N TG
Sbjct: 109 ---------LTNAMMLRSIFDNVN-----VTGADFTNAVLDITQVKKLCLKANGVNSKTG 154
Query: 233 VSTRKSLGC 241
V TR+SLGC
Sbjct: 155 VDTRESLGC 163
>gi|254409676|ref|ZP_05023457.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183673|gb|EDX78656.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 163
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 71/112 (63%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +A+++ S+F+ + GA +V AN GADLS ++D+ L A+L++ +LV
Sbjct: 51 AEFANANLQLSNFAYADLRGAIFSGSVMTHANLHGADLSYGMLDQADLTGADLSDVILVE 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T+L S +I GADF+DA++D AQ + LC+ A+G N TGV+T SLGC
Sbjct: 111 TLLLGSVFDNTLITGADFTDALLDGAQLKHLCQQASGINSKTGVATSDSLGC 162
>gi|119389531|pdb|2G0Y|A Chain A, Crystal Structure Of A Lumenal Pentapeptide Repeat Protein
From Cyanothece Sp 51142 At 2.3 Angstrom Resolution.
Tetragonal Crystal Form
Length = 184
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 67/112 (59%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+AD+ +S+FS + GA + A+ GADL++ L A+LTNAVL
Sbjct: 72 AQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTE 131
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ A I GADFS AV+D+ + LC A+G NP TGVSTR+SL C
Sbjct: 132 AIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADGVNPKTGVSTRESLRC 183
>gi|428203139|ref|YP_007081728.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980571|gb|AFY78171.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 177
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A++R S+FS +K G A ANF GADL+ ++ L AN TNA+LV
Sbjct: 67 FDHANLRGSNFSNAKLQGVRFFAANLESANFEGADLTGADLESARLVRANFTNAILVGAF 126
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T + GAII+GADF+D ++ ++ LC+ A GTNP+TG +TR +L C
Sbjct: 127 ATNTLFNGAIIDGADFTDVLLRPDTEKKLCEIARGTNPVTGRNTRDTLNC 176
>gi|67922694|ref|ZP_00516198.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|416392485|ref|ZP_11685875.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
gi|67855476|gb|EAM50731.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|357263639|gb|EHJ12621.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
Length = 170
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+AD+ +S+FS + GA + + ADL++ L A+LTNAVL
Sbjct: 58 AQFTNADLTDSNFSDADLRGAVFNGSALIGTDLHQADLTNGLAYLTSFEGADLTNAVLTE 117
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ A I GADFS AV+DL Q LCK A+G N TG+STR+SLGC
Sbjct: 118 AIMMRTTFKNANITGADFSLAVLDLQQVAELCKRADGVNSKTGISTRESLGC 169
>gi|145356305|ref|XP_001422373.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582615|gb|ABP00690.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 123
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 68/118 (57%), Gaps = 1/118 (0%)
Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
+E+ R A + AD+R SD S GA +AV + AD SD + D +L ++ T
Sbjct: 6 REDLRGAIYAEADLRRSDLRESDARGAVFSRAVMPGVDARDADFSDAMFDYALLRGSDFT 65
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
N+V V R+DLG + ADF++AVID Q +LC+ A+GTNP TG +TR SL C
Sbjct: 66 NSVFVGANFVRADLGEVVATNADFTEAVIDRYQTLSLCERASGTNPYTGANTRDSLLC 123
>gi|443314247|ref|ZP_21043822.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786146|gb|ELR95911.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 166
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 69/136 (50%), Gaps = 1/136 (0%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
+ A F +LR ++ R N +T ADM E + +G+ G L KAN GA
Sbjct: 30 VAQAESFDRQNLRMRDFSGQDLRGNDYTRADMAEVNLTGANLQGVRLFDTNLTKANLEGA 89
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
DL +D ANLTNA+L + +D AII+GADF+D +D LC A
Sbjct: 90 DLRGATLDGARFLAANLTNAILAGSYAFNTDFRKAIIDGADFTDVFLDPKTNDLLCAVAQ 149
Query: 226 GTNPITGVSTRKSLGC 241
GTNP+TG TR +L C
Sbjct: 150 GTNPVTGRDTRDTLYC 165
>gi|440684721|ref|YP_007159516.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428681840|gb|AFZ60606.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 167
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 66/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT A++R+S+FS + G A AN GADL++ +D L ANLTN +L
Sbjct: 57 FTKANLRQSNFSHANLRGVSFFAANLESANLEGADLTNATLDSARLIRANLTNTILEGAF 116
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+DA++ +++ LCK A G NP+TG TR++L C
Sbjct: 117 AASARFDGAIIDGADFTDALLRGDEQKKLCKVAKGNNPVTGRDTRETLFC 166
>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 164
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 93/183 (50%), Gaps = 26/183 (14%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
+K+WRVF LA V+ + L+ + +R Q +AD +
Sbjct: 1 MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50
Query: 125 KENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
+E F +ANF++AD+R G+ FN AYLEKA N GAD ++ + +
Sbjct: 51 REEFTKVKLDKANFSNADLR-----GAVFNNAYLEKA-----NLHGADFTNGIAYLVDFR 100
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
+A+L++A+ T+L S I G DF++AV+D + + LC ANG N TGVSTR+S
Sbjct: 101 DADLSDAIFTDTMLLYSTFDNVEITGTDFTNAVLDGPELKKLCARANGVNSKTGVSTRES 160
Query: 239 LGC 241
L C
Sbjct: 161 LEC 163
>gi|186686067|ref|YP_001869263.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186468519|gb|ACC84320.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 191
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
++FT A++R+S+FS + NG A AN G+DL + +D L ANLTNA+L
Sbjct: 79 SSFTKANLRQSNFSRANLNGVSFFAANLESANLEGSDLRNATLDSARLVRANLTNALLEG 138
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+D ++ +++ LCK A GTNP TG TR +L C
Sbjct: 139 AFAANARFDGAIIDGADFTDTLLRPDEQKKLCKLAKGTNPTTGRDTRDTLFC 190
>gi|427706684|ref|YP_007049061.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359189|gb|AFY41911.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 169
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 65/110 (59%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT A++R+S+FS S G A AN G DL++ +D L +A+LTNAVL
Sbjct: 59 FTKANLRQSNFSNSNLRGVSFFAANLESANLQGTDLTNATLDSARLMKADLTNAVLEGAF 118
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+D ++ +++ LCK A GTNP TG TR +L C
Sbjct: 119 AANAKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTNPTTGRDTRDTLFC 168
>gi|302768839|ref|XP_002967839.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
gi|300164577|gb|EFJ31186.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
Length = 126
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 70/136 (51%), Gaps = 19/136 (13%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A DLR AV F + D R+ + GS L+ + A F G DL DT
Sbjct: 4 ADLSGQDLRGAV---------FAACDCRKINLRGSN-----LDSSTDTFAGFEGGDLQDT 49
Query: 171 -----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
L DR+V NL NA+ +LT S GA I GADF++A++D Q+ LCK A
Sbjct: 50 SWVQALADRVVFRMTNLQNAIFTNAILTGSQFDGADITGADFTEAILDNYQRLKLCKRAT 109
Query: 226 GTNPITGVSTRKSLGC 241
GTN ITGV TR+SL C
Sbjct: 110 GTNSITGVETRESLAC 125
>gi|428773304|ref|YP_007165092.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428687583|gb|AFZ47443.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 164
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 67/112 (59%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++ MR S A + ++V A+ G + S L+DR+ + ++L++A+L+
Sbjct: 51 AVFAASSMRRVSMRNSDLTNAMMTESVLLDADLHGVNFSGALIDRVTFDFSDLSDAILIG 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ TR+ I GADF+DAVID Q +C+ A+G NP+TGV+TR SLGC
Sbjct: 111 AIATRTRFYDTDITGADFTDAVIDRYQVSLMCERADGVNPVTGVATRDSLGC 162
>gi|119389418|pdb|2F3L|A Chain A, Crystal Structure Of A Lumenal Rfr-Domain Protein
(Contig83.1_1_243_746) From Cyanothece Sp. 51142 At 2.1
Angstrom Resolution
Length = 184
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+AD+ +S+FS + GA + A+ GADL++ L A+LTNAVL
Sbjct: 72 AQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTE 131
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ R+ A I GADFS AV+D+ + LC A+G NP TGVSTR+SL C
Sbjct: 132 AIXXRTKFDDAKITGADFSLAVLDVYEVDKLCDRADGVNPKTGVSTRESLRC 183
>gi|75909862|ref|YP_324158.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75703587|gb|ABA23263.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 194
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
++FT A++R+S+FS S G A AN G +L++ +D L +ANLTNAVL
Sbjct: 82 SSFTKANLRQSNFSKSNLTGVSFFAANLESANLEGTNLTNATLDSARLIKANLTNAVLEG 141
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+D ++ +++ LCK A GTNP TG TR +L C
Sbjct: 142 AFAASTKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTNPTTGRETRDTLFC 193
>gi|434400099|ref|YP_007134103.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271196|gb|AFZ37137.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 167
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 74/129 (57%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR A+ F A +R SDFS S +G L + + NFTGA+LS+ +
Sbjct: 47 FSHQDLRDAI---------FDHASLRGSDFSYSDLSGVRLFGSNLSRVNFTGANLSNADL 97
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
+ L AN TNA+L +T + L AIIEGADF++ ++ ++ LC+ A+GTNP TG
Sbjct: 98 ESCRLTRANFTNAILTGAFMTNTLLDEAIIEGADFTNVLLSPTTEKMLCENASGTNPTTG 157
Query: 233 VSTRKSLGC 241
+T+ +L C
Sbjct: 158 RNTKDTLFC 166
>gi|427731475|ref|YP_007077712.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427367394|gb|AFY50115.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 185
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 10/149 (6%)
Query: 103 GEFGIGSAAQFG----SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYL 152
G GI + A F + D K + ++ +F ++FT A++R+S+FS S G
Sbjct: 36 GILGITTIAGFAPTALALDYNKEILIEADFSGRDLTDSSFTKANLRQSNFSNSNLQGVSF 95
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A AN G +LS+ +D L +A+LTNAVL + GAII+GADF+D ++
Sbjct: 96 FAANLESANLQGVNLSNATLDSARLIKADLTNAVLEGAFAANAKFDGAIIDGADFTDVLL 155
Query: 213 DLAQKQALCKYANGTNPITGVSTRKSLGC 241
+++ LCK A GTNP TG T +L C
Sbjct: 156 RPDEQKKLCKVAKGTNPTTGRDTHDTLYC 184
>gi|119490210|ref|ZP_01622723.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
gi|119454096|gb|EAW35249.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
Length = 177
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+ F A++R+S+FS S G L A + NF ADLS +D LN ANLTNA+L
Sbjct: 65 SEFDFANLRDSNFSHSNLRGVSLFGAKLQRTNFEAADLSYATLDTARLNRANLTNAILEG 124
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+D A+I GADF+D ++ ++ LC A GTNP+TG TR +L C
Sbjct: 125 AFAYNTDFSDAMIAGADFTDVLLRRDMQEKLCALAEGTNPVTGRDTRDTLYC 176
>gi|116074723|ref|ZP_01471984.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
gi|116067945|gb|EAU73698.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
Length = 173
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 93/189 (49%), Gaps = 29/189 (15%)
Query: 64 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLR-KAV 122
+L N R S LA V C AL + +A T E A Q SAD+ + +
Sbjct: 2 RLLNPRALCSGLLATLV---CCVISVALLPSSPAQAITAPELRGQKAVQDISADMHGRDL 58
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLM 172
KE +A+ D+ E+D G+ N + L+ VA+ + F GADL D
Sbjct: 59 KEKEFLKADLQGVDLSEADLRGAVINTSLLQGSDLRSADLGDVVAFASRFDGADLRD--- 115
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
A NA+L+++ T ++ IEGADF++AVIDL Q +A+C A G N TG
Sbjct: 116 -------ARFVNAMLMQSRFTEAN-----IEGADFTNAVIDLPQLKAMCARAEGVNSATG 163
Query: 233 VSTRKSLGC 241
+STR+SLGC
Sbjct: 164 ISTRESLGC 172
>gi|148239470|ref|YP_001224857.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147848009|emb|CAK23560.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
Length = 176
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 74/135 (54%), Gaps = 10/135 (7%)
Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
Q D+ +H ++ F AD+RE D + GA + + A+ GADL D +
Sbjct: 46 QRAVQDISSNMHGRDLKEKEFLKADLREVDLGDADLRGAVINTSQLQGADLRGADLEDVV 105
Query: 172 -----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
D L +AN TNA+L++ S A IEG DF++AVIDL Q +ALC A+G
Sbjct: 106 AFSSRFDGADLRDANFTNAMLMQ-----SRFNDAQIEGTDFTNAVIDLPQLKALCGRASG 160
Query: 227 TNPITGVSTRKSLGC 241
N ++GVST++SLGC
Sbjct: 161 VNSLSGVSTKESLGC 175
>gi|170078800|ref|YP_001735438.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169886469|gb|ACB00183.1| secreted pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 165
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 65/113 (57%), Gaps = 5/113 (4%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ N T+AD+ SD G+ FN LE N GAD ++ + A+LT+A+ V
Sbjct: 58 QVNLTNADLSGSDLRGAVFNSTLLETT-----NLHGADFTNGIAYLSKFTGADLTDAIFV 112
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L RS A I+GADFS AV+D Q++ LC A G NP+TG++T SLGC
Sbjct: 113 EAILLRSTFENAKIDGADFSFAVLDGPQQKKLCAVATGVNPVTGIATADSLGC 165
>gi|116070732|ref|ZP_01468001.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
gi|116066137|gb|EAU71894.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
Length = 165
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 74/135 (54%), Gaps = 6/135 (4%)
Query: 113 FGSADLRKAVHV------KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F + D+ K V + K+ A F +++RE+D SGS GA L A A+ + D
Sbjct: 30 FAAVDVAKQVLIGADYANKDLVGATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTD 89
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L + +D V+ NL+NAV+ + +I GADF+D + Q ++LC A+G
Sbjct: 90 LREATLDSAVMTGTNLSNAVMEGAFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADG 149
Query: 227 TNPITGVSTRKSLGC 241
TNP+TG STR+SLGC
Sbjct: 150 TNPVTGRSTRESLGC 164
>gi|220907029|ref|YP_002482340.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863640|gb|ACL43979.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 174
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 82/160 (51%), Gaps = 15/160 (9%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFS 143
C N+ +L + A+ E +G F DLR + FT A++ SDFS
Sbjct: 26 CWFNLLSLPIAPGWAADYTKESLVG--VDFSGKDLRDS---------EFTQANLSRSDFS 74
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
S G A AN +GADL T +D L ANLTNA+L + GA I
Sbjct: 75 QSDLRGVSFFAANLESANLSGADLRLTTLDNARLTHANLTNAILEGAFAFNARFQGATIT 134
Query: 204 GADFSDAVIDLAQ--KQALCKYANGTNPITGVSTRKSLGC 241
GADF+D +DL Q + LC+ A+GTNP+TG +TR++LGC
Sbjct: 135 GADFTD--VDLRQDAQTILCQGASGTNPVTGRNTRETLGC 172
>gi|428298761|ref|YP_007137067.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235305|gb|AFZ01095.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 169
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 91/180 (50%), Gaps = 17/180 (9%)
Query: 64 KLKN--WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
KL N WR+ +S L + + ++ +A +Y E I + F DL +
Sbjct: 4 KLSNNFWRIVLSALLGTVIWMISTWGLTPIAFALEYNKE------ILIQSDFSGRDLSDS 57
Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
+FT A++++S+FS + G A + TGADLS++ +D L +AN
Sbjct: 58 ---------SFTKANLKQSNFSNTNLRGVSFFAANLESVDLTGADLSNSTLDSARLVKAN 108
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LTNA+L + GAII+GADF+D ++ ++ LCK A GTNP T +TR +L C
Sbjct: 109 LTNAILEGAFAISAKFEGAIIDGADFTDILLRDDEQARLCKIATGTNPTTKRNTRDTLMC 168
>gi|17230824|ref|NP_487372.1| hypothetical protein all3332 [Nostoc sp. PCC 7120]
gi|17132427|dbj|BAB75031.1| all3332 [Nostoc sp. PCC 7120]
Length = 206
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 81/147 (55%), Gaps = 10/147 (6%)
Query: 105 FGIGSAAQFG----SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEK 154
FG+ + A F + + K + V+ +F ++FT A++R+S+FS S G
Sbjct: 59 FGMITIANFTPPAFALEYNKEILVEADFSGRDLTDSSFTKANLRQSNFSKSNLTGVSFFA 118
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A AN G++L++ +D L +ANL NAVL + GAII+GADF+D ++
Sbjct: 119 ANLESANLEGSNLTNATLDSARLIKANLKNAVLEGAFAASTKFDGAIIDGADFTDVLLRP 178
Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
+++ LCK A GTNP TG TR +L C
Sbjct: 179 DEQKKLCKVAKGTNPTTGRETRDTLFC 205
>gi|119511413|ref|ZP_01630525.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
gi|119463958|gb|EAW44883.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
Length = 126
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 70/112 (62%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F++A+M ++F+ + GA + +V +AN GADL++ ++D++ A+L++AV
Sbjct: 14 AEFSNANMELANFADADLRGAVMSASVMTQANLHGADLTNAMVDQVKFAGADLSDAVFKE 73
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L RS I+ ADF+DA++D Q + LC A+G N TGV TR SLGC
Sbjct: 74 ALLLRSTFTDVNIDSADFTDAILDGVQIKELCSKASGVNSKTGVETRYSLGC 125
>gi|119512324|ref|ZP_01631410.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119463037|gb|EAW43988.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 170
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 67/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT A++R+SDF+ + G A AN ADLS +D L +ANLTNA+L
Sbjct: 60 FTKANLRQSDFNHANLRGVSFFAANLESANLESADLSFATLDSARLIKANLTNAILEGAF 119
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + GAII+GADF+D ++ +++ LC+ A GTNP+TG +TR +L C
Sbjct: 120 ASNARFDGAIIDGADFTDILLRQDEEKKLCQLAKGTNPVTGRNTRDTLFC 169
>gi|33240300|ref|NP_875242.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33237827|gb|AAP99894.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 170
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 76/134 (56%), Gaps = 21/134 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S +F DLR NFR +N T A S +G+ +GA L+ A+AY ++F ADL
Sbjct: 57 SGYEFVKFDLRGI-----NFRDSNLTGAVFNNSKLNGADLHGANLKDALAYASDFEDADL 111
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
+D+ NL+NA+L+ S AIIEGADF+DAV+ Q++ LC A+GT
Sbjct: 112 TDS----------NLSNALLME-----SSFNNAIIEGADFTDAVLSRIQQKQLCSIADGT 156
Query: 228 NPITGVSTRKSLGC 241
N TG+ST SLGC
Sbjct: 157 NSSTGISTSYSLGC 170
>gi|434386546|ref|YP_007097157.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428017536|gb|AFY93630.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 212
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 69/112 (61%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+A + +++F+G+ G + + + N GA+L+ L+D++ A+L++AV V
Sbjct: 100 AVFTTAKLDDTNFAGADLTGVVISSSTLNRTNLHGANLTQGLLDQVRFVGADLSDAVFVE 159
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ RS I GADF+DA++ Q++ LC+ A G N TGV+TR SLGC
Sbjct: 160 AMMLRSTFTDVNIAGADFTDAILGKLQQKELCQIATGVNSKTGVATRDSLGC 211
>gi|428300991|ref|YP_007139297.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428237535|gb|AFZ03325.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 166
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 92/188 (48%), Gaps = 35/188 (18%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
+K W+ V L + AS + Y A + G A D +
Sbjct: 1 MKFWQFLVGLVLTFVIFASSTP---------AYAASSSAVTGSIVAGSLKGKDFSGQSLI 51
Query: 125 KENF------RANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMD 173
E F +ANF++AD+R + F+GS + A L+ + +AY ++F GA+LSD
Sbjct: 52 AEEFTSVNLEKANFSAADLRGAVFNGSMLHDANLQGIDFSEGIAYLSDFKGANLSD---- 107
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
A TNA+++R+ + D + GADF++AV+D + Q LC A+G NP TGV
Sbjct: 108 ------AVFTNAMMLRSAFSDVD-----VTGADFTNAVLDRTEVQKLCVNASGVNPKTGV 156
Query: 234 STRKSLGC 241
TR+SLGC
Sbjct: 157 ETRQSLGC 164
>gi|123968372|ref|YP_001009230.1| hypothetical protein A9601_08391 [Prochlorococcus marinus str.
AS9601]
gi|123198482|gb|ABM70123.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 170
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/113 (46%), Positives = 63/113 (55%), Gaps = 15/113 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+N A S SKFNGA L A+AY +FT ADLSD N TNA+L+
Sbjct: 73 ESNLEGAVFNNSKLQNSKFNGANLRDALAYATDFTDADLSDV----------NFTNALLM 122
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 123 -----ESNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|124023397|ref|YP_001017704.1| hypothetical protein P9303_16951 [Prochlorococcus marinus str. MIT
9303]
gi|123963683|gb|ABM78439.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 198
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 79/149 (53%), Gaps = 26/149 (17%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN----------GAYL 152
EF G A + S D+ ++NF +A+ D+ E+D G+ FN GA L
Sbjct: 63 EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
E VA+ + F GADL AN TNA+L++ S A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167
Query: 213 DLAQKQALCKYANGTNPITGVSTRKSLGC 241
D Q+ LC ANGTN ++G +T SLGC
Sbjct: 168 DRRQQNELCSRANGTNAVSGSNTIDSLGC 196
>gi|428202122|ref|YP_007080711.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427979554|gb|AFY77154.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 168
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 74/135 (54%), Gaps = 1/135 (0%)
Query: 108 GSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
G+AA F +L +N + A FT+ D+ ++FS + GA + + N GAD
Sbjct: 33 GAAASFEDKNLSGQDFSGQNLQTAQFTNVDLTSANFSNTDLRGAVFNGSALKETNLHGAD 92
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L++ L N A+L++AVL ++ R+ GA I GADF+ AV+D Q LC A+G
Sbjct: 93 LTNGLAYLSSFNGADLSDAVLTEAIMLRTTFDGANITGADFTLAVLDGDQVAKLCTIASG 152
Query: 227 TNPITGVSTRKSLGC 241
N TGV TR SLGC
Sbjct: 153 VNSKTGVETRASLGC 167
>gi|119486074|ref|ZP_01620136.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
gi|119456849|gb|EAW37977.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
Length = 161
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 68/112 (60%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++++ ++F S+ G+ KA+ AN GADL+ ++D++ + A+L+N++
Sbjct: 49 AEFANSNLESANFDHSQLVGSVFSKAMMKNANMRGADLTYAMLDQVDFSNADLSNSIFTE 108
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S I GADF+DA++D Q + LC A+G NP TGVSTR SLGC
Sbjct: 109 VLFFGSTFKDTKITGADFTDALLDGEQLRQLCITASGVNPKTGVSTRYSLGC 160
>gi|443312459|ref|ZP_21042076.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442777437|gb|ELR87713.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 167
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 65/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+FT A++R S+ S S G A AN GA+L++ +D + + NLTNAVL
Sbjct: 55 ASFTKANLRNSNLSHSDLTGVSFFAANLESANLEGANLTNATLDAARIIKTNLTNAVLTG 114
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+D ++ ++ LCK A GTNP TG TR++L C
Sbjct: 115 AFAANAKFDGAIIDGADFTDVLLRQDEQDKLCKVAQGTNPTTGKQTRETLMC 166
>gi|428770110|ref|YP_007161900.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684389|gb|AFZ53856.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 193
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/132 (41%), Positives = 71/132 (53%), Gaps = 20/132 (15%)
Query: 130 ANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFT----------GADLSDT---- 170
NFT A + DFS GS F + L A Y++N T GADL +T
Sbjct: 59 VNFTYAQLEGEDFSHRDLTGSVFAASNLRNASFYQSNLTNSVMTEGILFGADLRETNFTG 118
Query: 171 -LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
L+DR+ L+ A+L NA+ + TR+ IEGADF+ AVID Q +C A+G N
Sbjct: 119 SLIDRVTLDFADLRNAIFTDAIATRTRFYDTNIEGADFTGAVIDRYQVALMCDRASGVNS 178
Query: 230 ITGVSTRKSLGC 241
ITGV+TR SLGC
Sbjct: 179 ITGVATRDSLGC 190
>gi|427715923|ref|YP_007063917.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348359|gb|AFY31083.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 169
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 78/141 (55%), Gaps = 6/141 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
+G A + + K + ++ +F ++FT A++R+S+FS + +G A A
Sbjct: 28 VGGATTALALEYNKEILIEADFSGRDLTDSSFTKANLRQSNFSNANLSGVSFFAANLESA 87
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
N GA+L++ +D + NLTNAVL + GAII+GADF+D ++ +++ L
Sbjct: 88 NLQGANLTNATLDSARFIKTNLTNAVLEGAFAANAKFDGAIIDGADFTDVLLRQDEQKKL 147
Query: 221 CKYANGTNPITGVSTRKSLGC 241
CK A GTNP TG TR +L C
Sbjct: 148 CKVAKGTNPTTGRDTRDTLFC 168
>gi|425455123|ref|ZP_18834848.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9807]
gi|389804043|emb|CCI17099.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9807]
Length = 161
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|425440692|ref|ZP_18820990.1| Pentapeptide repeat family protein (modular protein) [Microcystis
aeruginosa PCC 9717]
gi|389718807|emb|CCH97279.1| Pentapeptide repeat family protein (modular protein) [Microcystis
aeruginosa PCC 9717]
Length = 213
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 92 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 151
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +TR +L
Sbjct: 152 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNTRDTLF 211
Query: 241 C 241
C
Sbjct: 212 C 212
>gi|194476536|ref|YP_002048715.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
gi|171191543|gb|ACB42505.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
Length = 167
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 74/138 (53%), Gaps = 26/138 (18%)
Query: 115 SADLR-KAVHVKENFRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKANFT 163
+AD+ + + +E +A+ D ESD G+ FN A L+ VA+ + F
Sbjct: 45 NADMHGRKLQQQEFLKADLQKIDFSESDLRGTVFNNSDLRNANLNAADLQDVVAFASRFD 104
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
GADL T NL N +L++ S +IEGADF+DA++DL Q++ LC +
Sbjct: 105 GADLRQT----------NLRNGMLIQ-----SKFKDTLIEGADFTDAILDLKQQKILCSF 149
Query: 224 ANGTNPITGVSTRKSLGC 241
ANGTN TGV T++SL C
Sbjct: 150 ANGTNLKTGVDTKESLRC 167
>gi|425470227|ref|ZP_18849097.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9701]
gi|389884202|emb|CCI35462.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9701]
Length = 161
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPITGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|33865584|ref|NP_897143.1| hypothetical protein SYNW1050 [Synechococcus sp. WH 8102]
gi|33632753|emb|CAE07565.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 162
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 7/139 (5%)
Query: 111 AQFGSA-DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
AQ +A D+ K V + ++ A F +++RE++ SGS GA L A A+ +
Sbjct: 24 AQVSAAMDVAKQVLIGSDYSGKDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLS 83
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
G DL + +D V+ NL+NAVL + I GADF+D + Q ++LC
Sbjct: 84 GTDLREATLDAAVMTGTNLSNAVLEGAFAFNTRFVDVTISGADFTDVPMRGDQLKSLCAV 143
Query: 224 ANGTNPITGVSTRKSLGCG 242
A+GTNP+TG STR SLGCG
Sbjct: 144 ADGTNPVTGRSTRDSLGCG 162
>gi|428225171|ref|YP_007109268.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427985072|gb|AFY66216.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 170
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+NFT A+MR S+ S + G A AN GA L+ +D L +ANLTNA+L
Sbjct: 58 SNFTKANMRSSNLSRANLQGVSFFGANLESANLEGAQLNYATLDSARLVKANLTNAILEG 117
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T + GA IEGADF+DA++ + + LC+ A+G NP TG +TR+SL C
Sbjct: 118 TYAFNAKFAGATIEGADFTDALLRDDEIEHLCEVASGVNPTTGRATRESLMC 169
>gi|218438105|ref|YP_002376434.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218170833|gb|ACK69566.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 168
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 65/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ D+ E+DFS + GA + + GADL++ L A+L++A+L
Sbjct: 56 AQFTNVDLSEADFSNADLRGAVFNGSALIEGKLRGADLTNALGYLSSFERADLSDAILAE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ A + GADFS AV+D Q LC+ A+G N TGVSTR+SLGC
Sbjct: 116 VIMKRTSFKNADVTGADFSYAVLDGEQIANLCRTASGVNSKTGVSTRESLGC 167
>gi|78184858|ref|YP_377293.1| hypothetical protein Syncc9902_1285 [Synechococcus sp. CC9902]
gi|78169152|gb|ABB26249.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 162
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
K+ A F +++RE+D SGS GA L A A+ + DL + +D V+ NL+N
Sbjct: 45 KDLVGATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTDLREATLDSAVMTGTNLSN 104
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AV+ + +I GADF+D + Q ++LC A+GTNP+TG STR+SLGC
Sbjct: 105 AVMEGAFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADGTNPVTGRSTRESLGC 161
>gi|422302957|ref|ZP_16390315.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9806]
gi|389792132|emb|CCI12113.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9806]
Length = 161
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|425434011|ref|ZP_18814483.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9432]
gi|425451971|ref|ZP_18831790.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
7941]
gi|440753099|ref|ZP_20932302.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389678210|emb|CCH92885.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9432]
gi|389766463|emb|CCI07918.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
7941]
gi|440177592|gb|ELP56865.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 161
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFGGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|126696874|ref|YP_001091760.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9301]
gi|126543917|gb|ABO18159.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
Length = 186
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 71/124 (57%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L+ +H + + D+ D + GAY+ A ++F GA+++D +
Sbjct: 50 LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAYATRF 109
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
+ A+ T+A L L +S GAII+GADF+DA +DL+Q+++LC+ A+GTN TGV+T
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNSQTGVNTID 169
Query: 238 SLGC 241
SL C
Sbjct: 170 SLEC 173
>gi|123969083|ref|YP_001009941.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
gi|123199193|gb|ABM70834.1| Pentapeptide repeats [Prochlorococcus marinus str. AS9601]
Length = 186
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 71/124 (57%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L+ +H + + D+ D + GAY+ A ++F GA+++D +
Sbjct: 50 LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAYATRF 109
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
+ A+ T+A L L +S GAII+GADF+DA +DL+Q+++LC+ A+GTN TGV+T
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNTKTGVNTID 169
Query: 238 SLGC 241
SL C
Sbjct: 170 SLEC 173
>gi|416389980|ref|ZP_11685429.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
gi|357264135|gb|EHJ13061.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
Length = 164
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 70/131 (53%), Gaps = 8/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
F DLRK A F A++R+S+FS + G A ANF GADL
Sbjct: 41 VDFSGQDLRK--------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRYA 92
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
++ L + N TNA+L T + GAII+GADF+D ++D ++ LC A GTNPI
Sbjct: 93 DLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNPI 152
Query: 231 TGVSTRKSLGC 241
TG +T+ +L C
Sbjct: 153 TGRNTKDTLYC 163
>gi|67922307|ref|ZP_00515820.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67855883|gb|EAM51129.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 164
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 70/131 (53%), Gaps = 8/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
F DLRK A F A++R+S+FS + G A ANF GADL
Sbjct: 41 VDFSGQDLRK--------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRYA 92
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
++ L + N TNA+L T + GAII+GADF+D ++D ++ LC A GTNPI
Sbjct: 93 DLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNPI 152
Query: 231 TGVSTRKSLGC 241
TG +T+ +L C
Sbjct: 153 TGRNTKDTLYC 163
>gi|427723591|ref|YP_007070868.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427355311|gb|AFY38034.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 165
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 67/128 (52%), Gaps = 1/128 (0%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
S D +NF+ A FT + R +D S + GA + N GAD+S+ +
Sbjct: 38 SEDFANENFAGQNFQGAEFTQVNFRNADMSNTDLRGAVFNSSQLQNTNLHGADMSNGIAY 97
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
A+L+ A+ +L RS A I+GADFS AV+D +Q++ LC A G NP+TG+
Sbjct: 98 LSAFTGADLSGAIFEEAILLRSTFDDANIDGADFSFAVLDGSQQKKLCAAATGVNPVTGI 157
Query: 234 STRKSLGC 241
T SLGC
Sbjct: 158 ETADSLGC 165
>gi|390440388|ref|ZP_10228721.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
gi|389836192|emb|CCI32847.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
Length = 161
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|425463375|ref|ZP_18842714.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9809]
gi|389833543|emb|CCI21857.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9809]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSRANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|166362955|ref|YP_001655228.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
gi|166085328|dbj|BAG00036.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
Length = 186
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ ++++S+FS + GA A + NF GADL++ L ++L++A+
Sbjct: 73 AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 132
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ G I GADFS AV+D Q + LC+ A G N TG+ST +SLGC
Sbjct: 133 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTPESLGC 184
>gi|425445790|ref|ZP_18825810.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389734131|emb|CCI02174.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 169
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ ++++S+FS + GA A + NF GADL++ L ++L++A+
Sbjct: 56 AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ G I GADFS AV+D Q + LC+ A G N TGVST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAEQIKNLCERAEGVNSKTGVSTPESLGC 167
>gi|425458741|ref|ZP_18838229.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9808]
gi|389824728|emb|CCI26060.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9808]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFGGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|425447360|ref|ZP_18827349.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9443]
gi|389732098|emb|CCI03919.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9443]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 67/121 (55%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I + LC+ A GTNPITG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEIYLCEIAKGTNPITGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|443326265|ref|ZP_21054925.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442794122|gb|ELS03549.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 172
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 65/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++ SDFS S G A +ANF A+L ++ L++ANLTNAVL
Sbjct: 60 ATFDHTNLIGSDFSDSNLFGVRFFAANLREANFANANLKFADLEAARLSDANLTNAVLAG 119
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
LT + L G IIEGADFS A++D ++ LC A GTNP TG +TR +L C
Sbjct: 120 AYLTNALLDGVIIEGADFSGALLDRNDEKMLCDIATGTNPTTGRNTRDTLFC 171
>gi|126696175|ref|YP_001091061.1| hypothetical protein P9301_08371 [Prochlorococcus marinus str. MIT
9301]
gi|91070292|gb|ABE11210.1| conserved hypothetical protein [uncultured Prochlorococcus marinus
clone HF10-88D1]
gi|126543218|gb|ABO17460.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 170
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 62/113 (54%), Gaps = 15/113 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+N A S SKF GA L A+AY +FT ADLSD N TNA+L+
Sbjct: 73 ESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNALLM 122
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 123 E-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|390438199|ref|ZP_10226689.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|425441109|ref|ZP_18821396.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|425454770|ref|ZP_18834496.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|425466166|ref|ZP_18845469.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|425468563|ref|ZP_18847571.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389718271|emb|CCH97753.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389804467|emb|CCI16499.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|389831470|emb|CCI25816.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389838386|emb|CCI30813.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|389884775|emb|CCI34954.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 169
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ ++++S+FS + GA A + NF GADL++ L ++L++A+
Sbjct: 56 AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ G I GADFS AV+D Q + LC+ A G N TGVST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGVSTPESLGC 167
>gi|354567943|ref|ZP_08987110.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353541617|gb|EHC11084.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 169
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AA F DL + +F A++R S+FS S G + +FTGADLS
Sbjct: 46 AADFSKQDLTDS---------SFDHANLRNSNFSNSNLRGVRFFSSNLASVDFTGADLSY 96
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
++ + +ANLTNA+L T + GAII+GADF+D I LC+ A GTNP
Sbjct: 97 ADLESARMTKANLTNAILEGAFTTGTMFDGAIIDGADFTDTYIREDTLNKLCQVAKGTNP 156
Query: 230 ITGVSTRKSLGC 241
+TG +TR +L C
Sbjct: 157 VTGRNTRDTLAC 168
>gi|443666115|ref|ZP_21133744.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159030126|emb|CAO91018.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331286|gb|ELS45952.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 169
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ ++++S+FS + GA A + NF GADL++ L ++L++A+
Sbjct: 56 AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ G I GADFS AV+D Q + LC+ A G N TGVST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGVSTPESLGC 167
>gi|157413206|ref|YP_001484072.1| hypothetical protein P9215_08711 [Prochlorococcus marinus str. MIT
9215]
gi|254525828|ref|ZP_05137880.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
MIT 9202]
gi|157387781|gb|ABV50486.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
gi|221537252|gb|EEE39705.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
MIT 9202]
Length = 170
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 62/113 (54%), Gaps = 15/113 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+N A S SKF GA L A+AY +FT ADLSD N TNA+L+
Sbjct: 73 ESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNALLM 122
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 123 E-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|434404813|ref|YP_007147698.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428259068|gb|AFZ25018.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 172
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 64/110 (58%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A++R+S+ S + NG A AN GADL ++ +D L ANLTNA+L
Sbjct: 62 FAKANLRQSNLSHTNLNGVSFFAANLESANLEGADLRNSTLDSARLVRANLTNALLEGAF 121
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+D ++ +++ LCK A GTNP+T TR +L C
Sbjct: 122 AANARFDGAIIDGADFTDMLLRQDEQKKLCKLAKGTNPVTLRDTRDTLFC 171
>gi|168067322|ref|XP_001785569.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662809|gb|EDQ49618.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 545
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 67/130 (51%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F ADLR +N R F + D R+ + GS +G+ A N +
Sbjct: 416 FDHADLRGRDMSNQNLRGVVFAACDCRKINLEGSTMDGSTDTFAGFEGGNLKNSSWIRAF 475
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
DR+V ANL NA VL+ S GA I GADF+DA++D Q+ +C+ A G NP T
Sbjct: 476 ADRVVFRGANLENANFTDAVLSGSQFDGADITGADFTDALVDNYQRLQMCRRAKGVNPTT 535
Query: 232 GVSTRKSLGC 241
GV+TR+SL C
Sbjct: 536 GVATRESLFC 545
>gi|422301609|ref|ZP_16388976.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
gi|389789327|emb|CCI14609.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
Length = 169
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ ++++S+FS + GA A + NF GADL++ L ++L++A+
Sbjct: 56 AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFAE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ G I GADFS AV+D Q + LC+ A G N TG+ST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTLESLGC 167
>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 171
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 79/145 (54%), Gaps = 7/145 (4%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAV 156
G GI A F + D K + V +F +F A++R S+F+ + G A
Sbjct: 26 GAIGINPAPAF-ALDRDKEILVGADFTGKVLTDDSFNKANLRNSNFTNADLRGVSFFAAN 84
Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+ANF GA+L+ +D + +ANLTNA+L + L GA+I+GADF++ ++
Sbjct: 85 MEEANFEGANLTGATLDLARMMKANLTNAILEGAFAYNTRLEGAVIDGADFTETLLRDDM 144
Query: 217 KQALCKYANGTNPITGVSTRKSLGC 241
+ LCK A GTNP+TG TR++L C
Sbjct: 145 IEKLCKVAKGTNPVTGRDTRETLFC 169
>gi|428780675|ref|YP_007172461.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428694954|gb|AFZ51104.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 167
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 82/147 (55%), Gaps = 8/147 (5%)
Query: 98 EAETRGEFGIGS--AAQFGSADLR-KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
EA+T F S +A DL + + ++E AN T AD+ +D GS F + ++
Sbjct: 25 EAQTSTRFQRQSLISADLSEEDLSGETLQLREISDANLTGADLSNADLRGSIFTASVMKN 84
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A + ANFT T+++ + A+L+ A+L +L+R+ L I GADF++AV+D
Sbjct: 85 ANLHGANFTF-----TVLNGVDFTNADLSQAILEDAILSRAILKDVDITGADFTNAVLDN 139
Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
Q LC+ A G N TGV+TR+SLGC
Sbjct: 140 QQYNQLCEMATGVNEETGVATRESLGC 166
>gi|148242344|ref|YP_001227501.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850654|emb|CAK28148.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
Length = 164
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 70/131 (53%), Gaps = 10/131 (7%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL----- 171
DL +H + + F D+ +DFS S G +NF+GADL D +
Sbjct: 39 DLSSDMHGRNLQQKEFLKMDLEGTDFSDSDLRGTVFNTTQLQDSNFSGADLRDVVAFSSR 98
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
DR L++A L N +L+++ T A I+GADF++AV+DL Q + LC A G N +
Sbjct: 99 FDRADLSQARLDNGMLLQSKFT-----DATIDGADFTNAVLDLPQIKQLCARATGVNERS 153
Query: 232 GVSTRKSLGCG 242
G+ST SLGCG
Sbjct: 154 GLSTADSLGCG 164
>gi|78779169|ref|YP_397281.1| hypothetical protein PMT9312_0785 [Prochlorococcus marinus str. MIT
9312]
gi|78712668|gb|ABB49845.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 170
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/112 (45%), Positives = 62/112 (55%), Gaps = 15/112 (13%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+N A S SKF GA L A+AY +FT ADLSD N TNA+L+
Sbjct: 74 SNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNALLME 123
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 124 -----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|116074641|ref|ZP_01471902.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
gi|116067863|gb|EAU73616.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
Length = 158
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 59/110 (53%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F ++RE+DFSGS GA L A AN T +L D +D VL+ NLTNAVL
Sbjct: 49 FNLTNLREADFSGSDLQGASLYGAKLQDANLTDTNLRDATLDSAVLDGTNLTNAVLEDAF 108
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ II GADF++ + LC A GTNP+TG TR +LGC
Sbjct: 109 AFNTRFSNVIITGADFTNVPFRGDALKTLCAAAEGTNPVTGRDTRDTLGC 158
>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 171
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 72/118 (61%), Gaps = 11/118 (9%)
Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
K N R +NFT+AD+R G F A +E+A ANFTGA L + RM+ +ANLT
Sbjct: 62 KANLRNSNFTNADLR-----GVSFFAANMEEANLEGANFTGATLD---LARMM--KANLT 111
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA+L + L GA+I+GADF+D ++ + LCK A GTNP+TG TR++L C
Sbjct: 112 NAILEGAFAYNTRLEGAVIDGADFTDTLLRDDMIEKLCKVAKGTNPVTGRDTRETLFC 169
>gi|443663881|ref|ZP_21133269.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443331763|gb|ELS46407.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 150
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 68/135 (50%), Gaps = 19/135 (14%)
Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
FG DLR + N RA S F+ A LE + AN GAD SD
Sbjct: 29 DFGGQDLRDSTFDHSNLRA--------------SNFSHANLEGVRFFSANLEGADFSDAN 74
Query: 172 MDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
M + L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A G
Sbjct: 75 MRNVDLESARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIAKG 134
Query: 227 TNPITGVSTRKSLGC 241
TNPITG +TR +L C
Sbjct: 135 TNPITGRNTRDTLFC 149
>gi|428305184|ref|YP_007142009.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428246719|gb|AFZ12499.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 169
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 63/112 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT A++R +FS + G L A N GA+LS+ +D +ANLTNAVL
Sbjct: 57 ATFTKANLRNCNFSHADLRGVSLFGANLELVNLEGANLSNATLDTAKFTKANLTNAVLEG 116
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GAII+GADF+D ++ ++ LCK A GTNP TG TR +L C
Sbjct: 117 AFAFNAKFDGAIIDGADFTDVLVRQDVQKQLCKIATGTNPTTGRETRDTLLC 168
>gi|428781463|ref|YP_007173249.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428695742|gb|AFZ51892.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 165
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 71/133 (53%), Gaps = 20/133 (15%)
Query: 114 GSADLRKAVHVKENFRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTGADLS 168
G + + + +E RA+F +A++ + F+G+ + G +AY +FTG D
Sbjct: 47 GESLIEAEFYDEELERADFHNANLEAAVFNGANLTNANWQGVNFTNGIAYLTDFTGVDF- 105
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
TNA+L ++ RS A +EG DF++AV+D Q + LC+ A+G N
Sbjct: 106 --------------TNAILTEAMMLRSTFNDATVEGVDFTNAVVDRLQVKRLCERASGVN 151
Query: 229 PITGVSTRKSLGC 241
P TGVSTR+SLGC
Sbjct: 152 PTTGVSTRESLGC 164
>gi|291566844|dbj|BAI89116.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 174
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+ F A+++ S+FS + G L A N ADL +D L ANLTNA+L
Sbjct: 62 SEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRFATLDTARLVRANLTNALLEE 121
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 122 AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 173
>gi|425436672|ref|ZP_18817106.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|425449430|ref|ZP_18829270.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|425458879|ref|ZP_18838365.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440755734|ref|ZP_20934936.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389678572|emb|CCH92580.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|389763888|emb|CCI09674.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|389823689|emb|CCI27950.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440175940|gb|ELP55309.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 169
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 66/112 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT+ ++++S+FS + GA A + NF GADL++ L ++L++A+
Sbjct: 56 AQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSDAIFSE 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ R+ G I GADFS AV+D Q + LC+ A G N TG+ST +SLGC
Sbjct: 116 AIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTPESLGC 167
>gi|409992571|ref|ZP_11275753.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409936565|gb|EKN78047.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 149
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+ F A+++ S+FS + G L A N ADL +D L ANLTNA+L
Sbjct: 37 SEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTNALLEE 96
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 97 AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 148
>gi|428220990|ref|YP_007105160.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994330|gb|AFY73025.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 165
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 62/110 (56%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F D+ ++F + G L A A+FTGADL + +D +N ANLTNAVL
Sbjct: 54 FNKTDLHNANFRNANLAGVSLFGANMTAADFTGADLRYSTLDTARMNGANLTNAVLEGAF 113
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + G +I+GADFSD + + LCK A GTNP+TG TR++L C
Sbjct: 114 VYGTSFVGTVIDGADFSDVDLRNTTRSLLCKVAKGTNPVTGRDTRETLEC 163
>gi|423066922|ref|ZP_17055712.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711687|gb|EKD06887.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 137
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+ F A+++ S+FS + G L A N ADL +D L ANLTNA+L
Sbjct: 25 SEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTNALLEE 84
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 85 AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136
>gi|33862830|ref|NP_894390.1| hypothetical protein PMT0557 [Prochlorococcus marinus str. MIT
9313]
gi|33634746|emb|CAE20732.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 198
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 77/149 (51%), Gaps = 26/149 (17%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN----------GAYL 152
EF G A + S D+ ++NF +A+ D+ E+D G+ FN GA L
Sbjct: 63 EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
E VA+ + F GADL AN TNA+L++ S A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167
Query: 213 DLAQKQALCKYANGTNPITGVSTRKSLGC 241
D Q+ LC A+GTN +G T SLGC
Sbjct: 168 DRRQQNELCARADGTNAASGSQTLDSLGC 196
>gi|359460819|ref|ZP_09249382.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 164
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 73/132 (55%), Gaps = 10/132 (7%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN- 178
+A+ ++ +F+ D+RE++FS ++ GA +A F G DL+ + + +
Sbjct: 32 RAIDDEDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTG 91
Query: 179 ---------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
EA+L+ A+L +L +S L A + ADFS AVID Q + LC+ A+G NP
Sbjct: 92 GMAYLSSFAEADLSGAILTEAMLLQSSLRNATVTDADFSFAVIDKDQVKILCETASGVNP 151
Query: 230 ITGVSTRKSLGC 241
+TGV TR SLGC
Sbjct: 152 VTGVDTRDSLGC 163
>gi|166364098|ref|YP_001656371.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
NIES-843]
gi|166086471|dbj|BAG01179.1| pentapeptide repeat family protein [Microcystis aeruginosa
NIES-843]
Length = 161
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 131 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 180
+F D+R+S F GS F+ A LE + AN GA+ SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSRANLEGVRFFSANLEGANFSDANMRNVDLESARLTKA 99
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 241 C 241
C
Sbjct: 160 C 160
>gi|113475775|ref|YP_721836.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110166823|gb|ABG51363.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 165
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 77/155 (49%), Gaps = 22/155 (14%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
I + A F ++ V+ N N+T D+ DFS +G A +KANF GA+
Sbjct: 12 ILTVAGFWVMNIYSVQAVENN--VNYTLTDLNNRDFSYKDLHGTSFAGATMWKANFQGAN 69
Query: 167 LSDTLM--------------------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
L +T++ DR+ ++++LTNA+ +L S A + G D
Sbjct: 70 LQNTILTKGDFLRANLTEADFTGTFADRVSFDKSDLTNAIFTDAMLMSSTFRDATVIGTD 129
Query: 207 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
FS A++D Q + +C+ A+G N TGV TR+SLGC
Sbjct: 130 FSGAMVDRYQIKLMCETASGKNKTTGVETRESLGC 164
>gi|158337467|ref|YP_001518642.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307708|gb|ABW29325.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 164
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 73/132 (55%), Gaps = 10/132 (7%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN- 178
+A+ ++ +F+ D+RE++FS ++ GA +A F G DL+ + + +
Sbjct: 32 RAIDDEDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTG 91
Query: 179 ---------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
EA+L+ A+L +L +S L A + ADFS AVID Q + LC+ A+G NP
Sbjct: 92 GMAYLSSFAEADLSGAILTEAMLLQSSLRDATVTDADFSFAVIDKDQVKILCETASGVNP 151
Query: 230 ITGVSTRKSLGC 241
+TGV TR SLGC
Sbjct: 152 VTGVDTRDSLGC 163
>gi|428313239|ref|YP_007124216.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254851|gb|AFZ20810.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 169
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 63/112 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
++FT A++R S+FS S G A ANF GA+L + +D L A+L NAVL
Sbjct: 57 SSFTKANLRSSNFSHSNLEGVSFFSANLESANFEGANLRNATLDTARLTRASLKNAVLEG 116
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GA IEGADF++ + ++ LC A+GTNP TG STR +L C
Sbjct: 117 AFAFNTKFDGATIEGADFTEVLFRQDVQKQLCHVASGTNPTTGRSTRDTLFC 168
>gi|159903526|ref|YP_001550870.1| hypothetical protein P9211_09851 [Prochlorococcus marinus str. MIT
9211]
gi|159888702|gb|ABX08916.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 169
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 21/152 (13%)
Query: 96 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADM-----RESDFSGSKFNG 149
K E R + + + + DL VK + R NF +D+ S+ + ++FNG
Sbjct: 33 KRPPEIRNQDDLNISQDMHAQDLSGREFVKFDLRGINFKDSDLSGAVFNNSNLTNAQFNG 92
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A + ++AY NF DLSD ANLTNA+L+ + + I+GADF+D
Sbjct: 93 ADMHDSLAYATNFENTDLSD----------ANLTNALLMESTFVNTK-----IDGADFTD 137
Query: 210 AVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AV+ Q++ LC A+GTN TG+ T SLGC
Sbjct: 138 AVLSRIQQKQLCSIASGTNSNTGIDTEYSLGC 169
>gi|209527449|ref|ZP_03275954.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376003366|ref|ZP_09781178.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
gi|209492122|gb|EDZ92472.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375328288|emb|CCE16931.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
Length = 137
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+ F A+++ S+FS + G L A N ADL +D L ANLTNA+L
Sbjct: 25 SEFDFANLQGSNFSHTDLRGVSLFGAKMQDINLESADLRLATLDTARLVRANLTNALLEE 84
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 85 AYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136
>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 169
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT A++R S+FS + G A ANF GA+L +D + + NLTNA+L
Sbjct: 57 AQFTKANLRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLTNAILEG 116
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ AI++GADF+D +I + LCK A GTNP+TG +TR++L C
Sbjct: 117 AFAYNTKFERAIVDGADFTDILIRDDMVEKLCKVARGTNPVTGRNTRETLFC 168
>gi|113474577|ref|YP_720638.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110165625|gb|ABG50165.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 144
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 64/110 (58%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT + +R+S+FS + +G L A AN GA+LS + +D V N+ANLTNA+L
Sbjct: 34 FTKSILRKSNFSNANLSGVSLFGAHLEGANLEGANLSYSTLDDAVFNKANLTNAILEGAF 93
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ AII+GADF+DA + + LCK A G N ITG TR +L C
Sbjct: 94 AFHTQFRDAIIDGADFTDAFLRKDTTKDLCKIAQGKNSITGKETRDTLFC 143
>gi|434386960|ref|YP_007097571.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428017950|gb|AFY94044.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 168
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 62/110 (56%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT A +R F + G L A+ TGA+L++ L+D N TNA+LV
Sbjct: 58 FTQASVRNGKFINANLTGVSLIGGNFDSADMTGANLTNALLDTARFTRTNFTNAILVGAF 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ ++ GAII+GADF+D ++ ++ LCK A GTNP TG TR+SL C
Sbjct: 118 TSVTNFDGAIIDGADFTDVLLRKDIQKKLCKVAKGTNPTTGRDTRESLEC 167
>gi|427734374|ref|YP_007053918.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369415|gb|AFY53371.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 167
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 71/131 (54%), Gaps = 6/131 (4%)
Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
D K + ++ +F ++FT A++R+S+FS S G AN A+L
Sbjct: 36 DYNKEILIEADFSGQDLTDSSFTKANLRDSNFSNSNLQGVRFFATNLESANLRNANLRYA 95
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D L +A+LTNAVL + + GAII+GADF+D ++ ++ LCK A GTNP
Sbjct: 96 TLDSARLVKADLTNAVLEGAFASNARFDGAIIDGADFTDVLLRADEQDKLCKLAKGTNPT 155
Query: 231 TGVSTRKSLGC 241
TG TR +L C
Sbjct: 156 TGRDTRDTLFC 166
>gi|33861334|ref|NP_892895.1| hypothetical protein PMM0777 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33633911|emb|CAE19236.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 170
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 68/125 (54%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
DL + +H ++ F ++ DFS S GA + A TGA+LSD L
Sbjct: 46 DLEEDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATMTGANLSDALAYATD 105
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
+A+L++ +L S+ GA I+GADF++AV+ Q++ LC+ ANGTN TG ST
Sbjct: 106 FTDADLSDVNFTNALLMESNFEGAKIDGADFTNAVLSRIQQKELCEIANGTNSSTGESTE 165
Query: 237 KSLGC 241
SLGC
Sbjct: 166 YSLGC 170
>gi|428310976|ref|YP_007121953.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428252588|gb|AFZ18547.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 167
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 20/118 (16%)
Query: 129 RANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
+ NF +AD+R FS S +GA +AY ++FTGADLSD
Sbjct: 64 QTNFNNADLRNVVFSSSTLKQASLHGADFTSGIAYLSDFTGADLSD-------------- 109
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AVL ++ RS A I GADF+DAV+D Q + LC A G N TG++TR+SLGC
Sbjct: 110 -AVLTEAIMLRSRFDEADITGADFTDAVLDGVQIKKLCARATGVNSKTGMATRESLGC 166
>gi|428215647|ref|YP_007088791.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004028|gb|AFY84871.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 183
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 65/113 (57%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A+F A++R+S+ S + GA L A AN GA+LS+T +D NL NA+L
Sbjct: 70 QASFNHANLRKSNLSHANLQGASLFAAHLEDANLEGANLSNTTLDTARFIRTNLKNAILE 129
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + GA IEGADF+D + + LC+ A GTNP+TG +TR +L C
Sbjct: 130 GSFAFSAKFNGANIEGADFTDVFLRDDANEILCELATGTNPVTGRNTRDTLYC 182
>gi|318041291|ref|ZP_07973247.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 161
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
D+ K V + +F A F ++RE+DF GS GA L A AN +G DL+D
Sbjct: 30 DVAKQVLIGHDFAGMDLRGATFNLTNLREADFHGSDLRGASLFGAKLQDANLSGTDLTDA 89
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D VL+ +L NAVL + +IEGADF++ + LC A+GTNP+
Sbjct: 90 TLDSAVLDGTDLRNAVLENAFAFNTRFNNVLIEGADFTNVPFRGDVLKTLCASASGTNPV 149
Query: 231 TGVSTRKSLGC 241
TG +TR +L C
Sbjct: 150 TGRNTRDTLEC 160
>gi|427420100|ref|ZP_18910283.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425762813|gb|EKV03666.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 165
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 66/133 (49%), Gaps = 1/133 (0%)
Query: 110 AAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A + LR+ ++ R N +TS DM E+D S + G L KAN AD+S
Sbjct: 32 AKNYDRQSLRQQSFAGQDLRGNNYTSTDMAEADLSNTDLRGVRLFDTNLTKANLESADMS 91
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+D ANL NA+ +D A IEGADF+D +D+ LC+ A G N
Sbjct: 92 GATLDGARFIRANLKNAIFEGAYAFSTDFRKANIEGADFTDVDLDVKTNDMLCEVATGVN 151
Query: 229 PITGVSTRKSLGC 241
P+TG +T+ +L C
Sbjct: 152 PVTGRATKDTLYC 164
>gi|78779832|ref|YP_397944.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
gi|78713331|gb|ABB50508.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
Length = 186
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 74/142 (52%), Gaps = 10/142 (7%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----M 172
L++ +H + F D+ D + GAY+ A ++F A++ D +
Sbjct: 50 LKEDLHGADLQNNEFVKYDLSNQDLGEANLQGAYMSVTTAANSSFKSANMKDLIAYAVRF 109
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
D L++ANLTN L+++V GA I+GADF+DA +DL Q+++LC+ A GTN TG
Sbjct: 110 DNADLSDANLTNGELMKSVF-----DGATIDGADFTDATLDLPQRKSLCERATGTNSKTG 164
Query: 233 VSTRKSLGCGNSRRNAYGSPSS 254
V T SL C R +P +
Sbjct: 165 VDTVDSLECSGLRGYIPATPEA 186
>gi|172037018|ref|YP_001803519.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354555787|ref|ZP_08975086.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171698472|gb|ACB51453.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353552111|gb|EHC21508.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 167
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 73/130 (56%), Gaps = 11/130 (8%)
Query: 123 HVKENF-RANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 171
+ K+N +F+S D+R+SDF G F+ A L+ + ANF GADL
Sbjct: 37 YAKQNLVERDFSSQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
++ L N TNA+L T + GA+I+GADF+D ++ L ++ LC+ A GTNPIT
Sbjct: 97 LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCEIAKGTNPIT 156
Query: 232 GVSTRKSLGC 241
G +T+ +L C
Sbjct: 157 GRNTKDTLFC 166
>gi|434395414|ref|YP_007130361.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428267255|gb|AFZ33201.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 168
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 63/110 (57%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A++R S+FS + G L A AN GA+L++ +D L+ ANL +AVL
Sbjct: 58 FNHANLRNSNFSHANLEGVSLFAANLESANLEGANLTNATLDSARLSNANLKDAVLEGAF 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ AII+GADF+D ++ ++ LCK A GTNP TG TR++L C
Sbjct: 118 AANAKFDKAIIDGADFTDVLLRRDEQDKLCKVAKGTNPTTGRETRETLMC 167
>gi|254526458|ref|ZP_05138510.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
gi|221537882|gb|EEE40335.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
Length = 179
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 67/124 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L+ +H + + D+ D + GAY+ A ++F GA++ D +
Sbjct: 43 LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRF 102
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
+ A+ T+A L L +S GAII+GADF+DA +DL +++LC+ A GTN TGV+T
Sbjct: 103 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLKTRKSLCERATGTNSQTGVNTAD 162
Query: 238 SLGC 241
SL C
Sbjct: 163 SLEC 166
>gi|157413912|ref|YP_001484778.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9215]
gi|157388487|gb|ABV51192.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
str. MIT 9215]
Length = 186
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 67/124 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L+ +H + + D+ D + GAY+ A ++F GA++ D +
Sbjct: 50 LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRF 109
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
+ A+ T+A L L +S GAII+GADF+DA +DL +++LC+ A GTN TGV+T
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLKTRKSLCERATGTNSQTGVNTAD 169
Query: 238 SLGC 241
SL C
Sbjct: 170 SLEC 173
>gi|218245449|ref|YP_002370820.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|257058486|ref|YP_003136374.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|218165927|gb|ACK64664.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
gi|256588652|gb|ACU99538.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 168
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 67/110 (60%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +AD+ E++FS S GA +ANF GA+L++ L +A+L++A+L +
Sbjct: 58 FANADLTEANFSDSDLRGAVFNGVELKQANFHGANLTNGLAYLSSFRDADLSDAILSEVI 117
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ R+ A I GADF+ AV+D + LC+ A+G N TG+STR+SLGC
Sbjct: 118 MLRTVFDNANITGADFTLAVLDGEEVAKLCQRADGVNSKTGMSTRESLGC 167
>gi|91070378|gb|ABE11292.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
HF10-88H9]
Length = 186
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 67/124 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L+ +H + + D+ D + GAY+ A ++F GA++ D +
Sbjct: 50 LKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRF 109
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
+ A+ T+A L L +S GAII+GADF+DA +DL +++LC+ A GTN TGV+T
Sbjct: 110 DNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLKTRKSLCERATGTNSQTGVNTAD 169
Query: 238 SLGC 241
SL C
Sbjct: 170 SLEC 173
>gi|254414183|ref|ZP_05027950.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196178858|gb|EDX73855.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 178
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 63/118 (53%), Gaps = 20/118 (16%)
Query: 129 RANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
+ N ++ D+R +S + + GA ++AYK NF GADLSD
Sbjct: 75 QTNLSNTDLRSVVISDSTMTDANLQGADFSYSIAYKVNFKGADLSD-------------- 120
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AVL +L S L I GADFS+AV+D Q Q+LC A+G N TGV TR+SLGC
Sbjct: 121 -AVLEEAILLGSRLDDVNITGADFSNAVLDRVQVQSLCTKASGVNSKTGVETRESLGC 177
>gi|428771687|ref|YP_007163477.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428685966|gb|AFZ55433.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 159
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 69/129 (53%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F +L A K N R SA++ +SD G F GA ++ N GA+L+++++
Sbjct: 39 FSGQNLTDATFNKTNLR----SANLSQSDLQGVSFFGANMDSI-----NLEGANLTNSIL 89
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
D L ANL NAVL T + GA IEGADF+D ++ ++ LC+ A G NP TG
Sbjct: 90 DSARLTRANLRNAVLEGAFATNTKFEGANIEGADFTDVILRPDVEEMLCEKAKGVNPTTG 149
Query: 233 VSTRKSLGC 241
TR +L C
Sbjct: 150 RKTRDTLYC 158
>gi|434392213|ref|YP_007127160.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428264054|gb|AFZ30000.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 165
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 75/136 (55%), Gaps = 29/136 (21%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGA 165
A+F +ADL A NF++AD+R F+G+K +GA +AY +FTGA
Sbjct: 54 AEFANADLEAA---------NFSNADLRGVVFNGAKLIKANLHGADFTNGIAYIVDFTGA 104
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+LSD +M+ A+++R++ D I GADF++AV+D + LC A+
Sbjct: 105 NLSDAVMEE----------AMMLRSIFNDVD-----ITGADFTNAVLDRTVVKKLCAQAS 149
Query: 226 GTNPITGVSTRKSLGC 241
G N TGV+TR SLGC
Sbjct: 150 GVNSKTGVATRDSLGC 165
>gi|428205702|ref|YP_007090055.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428007623|gb|AFY86186.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 169
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 62/110 (56%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F+ A++R S+FS S G L A ANF GA+L+ +D L ANL +A+L
Sbjct: 59 FSHANLRSSNFSHSNLEGVSLFAANLDSANFEGANLASATLDSARLTRANLKDAILEGAF 118
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ GA+I+GADF+D ++ + LC+ A G NP TG +TR +L C
Sbjct: 119 AANTKFDGAVIDGADFTDVLMRRDVQDKLCQVAKGVNPTTGRATRDTLFC 168
>gi|254431831|ref|ZP_05045534.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
gi|197626284|gb|EDY38843.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
Length = 174
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 69/135 (51%), Gaps = 6/135 (4%)
Query: 113 FGSADLRKAVHVKENFRAN------FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
+ D+ K V + ++ F ++R++D SGS GA L A A+ + +
Sbjct: 39 LAAVDVAKQVLIGADYHGQDLRGGTFNLTNLRDADLSGSDLQGASLFGAKLQDADLSNTN 98
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L +T +D V N +LTNAVL + II+GADF++ + +ALC A G
Sbjct: 99 LRETTLDSAVFNGTDLTNAVLEDAFAFNTKFSDVIIDGADFTNVPLRGDALKALCAVARG 158
Query: 227 TNPITGVSTRKSLGC 241
TNP+TG TR +LGC
Sbjct: 159 TNPVTGRQTRDTLGC 173
>gi|113954335|ref|YP_730803.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
gi|113881686|gb|ABI46644.1| pentapeptide repeat protein [Synechococcus sp. CC9311]
Length = 157
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 6/135 (4%)
Query: 113 FGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F + D K V + +F F ++RE+D SGS GA L A AN + ++
Sbjct: 22 FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNSN 81
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L D +D V + NLTNAVL + +EGADF++ + + LC A G
Sbjct: 82 LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRTDALKVLCANAEG 141
Query: 227 TNPITGVSTRKSLGC 241
NP+TG TR++LGC
Sbjct: 142 VNPVTGRDTRETLGC 156
>gi|123966744|ref|YP_001011825.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9515]
gi|123201110|gb|ABM72718.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
Length = 192
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 70/132 (53%), Gaps = 21/132 (15%)
Query: 116 ADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
ADL+ +VK + AN A M + S F GA ++ +AY F AD SD
Sbjct: 63 ADLQNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 123 ----------ANLTNGELMKSVF-----DGAIIDGADFTDANLDLKTRKSLCERATGTNS 167
Query: 230 ITGVSTRKSLGC 241
TGV T +SL C
Sbjct: 168 RTGVDTFESLEC 179
>gi|56752263|ref|YP_172964.1| hypothetical protein syc2254_d [Synechococcus elongatus PCC 6301]
gi|24251237|gb|AAN46157.1| unknown protein [Synechococcus elongatus PCC 7942]
gi|56687222|dbj|BAD80444.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 171
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 62/114 (54%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+A F S ++ F G+ GA +ANF AD +D + L N NA L
Sbjct: 56 IQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGIAYVSDLRNVNFRNANL 115
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L +S+L G+ + GADFS AV+ Q ALC+ A+GTNP TG TR+SLGC
Sbjct: 116 TSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKTGADTRESLGC 169
>gi|443475471|ref|ZP_21065420.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019714|gb|ELS33767.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 164
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
K RA FTS ++ ++F+ + GA + AN G+D S + +L++
Sbjct: 47 KNLIRAEFTSVTLKNANFTNADLRGAIFNGVLLDGANLHGSDFSSGIAYISRFKNVDLSD 106
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AVL T + RS + GADF++A++D+ Q + LC A+GTN TGVSTR+SLGC
Sbjct: 107 AVLNDTNMLRSTFDNVEVTGADFTNALLDIQQLKKLCINASGTNSKTGVSTRESLGC 163
>gi|81300649|ref|YP_400857.1| hypothetical protein Synpcc7942_1840 [Synechococcus elongatus PCC
7942]
gi|81169530|gb|ABB57870.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 168
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 62/114 (54%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+A F S ++ F G+ GA +ANF AD +D + L N NA L
Sbjct: 53 IQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGIAYVSDLRNVNFRNANL 112
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+L +S+L G+ + GADFS AV+ Q ALC+ A+GTNP TG TR+SLGC
Sbjct: 113 TSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKTGADTRESLGC 166
>gi|434397761|ref|YP_007131765.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428268858|gb|AFZ34799.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 166
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 68/130 (52%), Gaps = 1/130 (0%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F D R +N ++ +F D+ ++FS + GA + AN G D S
Sbjct: 36 FSEVDFRSKDFSGKNLQSIDFAKVDLESANFSNADLRGAVFNASNLANANLQGVDFSYGF 95
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
+ A+LT+A+ T+L+ S GA I+ ADF+ AV++ Q + LC A+G NP T
Sbjct: 96 AYLTNFDGADLTDAIFQETILSFSTFEGAKIKNADFTFAVLEKWQVKQLCANASGVNPKT 155
Query: 232 GVSTRKSLGC 241
GV TR+SLGC
Sbjct: 156 GVDTRESLGC 165
>gi|78212716|ref|YP_381495.1| hypothetical protein Syncc9605_1185 [Synechococcus sp. CC9605]
gi|78197175|gb|ABB34940.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 165
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 64/112 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +++RE++ SGS GA L A A+ +G DL + +D V+ NL +AVL
Sbjct: 53 ATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTNLEDAVLEG 112
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ +I GADF+D + Q ++LC A+GTN +TG STR+SLGC
Sbjct: 113 AFAFNTRFSDVLITGADFTDVPMRGDQLKSLCAVADGTNSVTGRSTRESLGC 164
>gi|443320013|ref|ZP_21049146.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442790267|gb|ELR99867.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 164
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 65/131 (49%), Gaps = 1/131 (0%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+F + DLR +N + FT ++ +F+ + G +AN G D S
Sbjct: 33 RFDNRDLRGESFANQNLQTVEFTKVKLQGVNFANADLIGVVFNSTALDQANLQGVDFSQG 92
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+ + +L +A+LV +L RS I GADFS AV+D Q LC YA+G N
Sbjct: 93 IAYLTSFDGVDLRDALLVEALLLRSTFKDTKISGADFSSAVLDQDQLDKLCSYADGVNSK 152
Query: 231 TGVSTRKSLGC 241
TGV TR+SLGC
Sbjct: 153 TGVKTRESLGC 163
>gi|56751209|ref|YP_171910.1| hypothetical protein syc1200_c [Synechococcus elongatus PCC 6301]
gi|81299124|ref|YP_399332.1| hypothetical protein Synpcc7942_0313 [Synechococcus elongatus PCC
7942]
gi|56686168|dbj|BAD79390.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168005|gb|ABB56345.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 170
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 68/131 (51%), Gaps = 6/131 (4%)
Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
D K + ++ NF ANFT A++R SDFS S G A + GADLS+T
Sbjct: 39 DFTKEILIESNFSNRDLSDANFTKANLRSSDFSNSVLVGVRFYGANLESVDLHGADLSNT 98
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
++D+ + +LT+A+L + GA I GADF+D ++ + LC A G N
Sbjct: 99 ILDQARMTNTDLTDAILEGAYAFNALFQGAKITGADFTDVLMRQDAQDLLCSVAEGVNSK 158
Query: 231 TGVSTRKSLGC 241
TG +TR +L C
Sbjct: 159 TGRATRDTLDC 169
>gi|123966041|ref|YP_001011122.1| hypothetical protein P9515_08061 [Prochlorococcus marinus str. MIT
9515]
gi|123200407|gb|ABM72015.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 170
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 72/140 (51%), Gaps = 30/140 (21%)
Query: 117 DLRKAVHVK-----ENFRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKAN 161
DL + +H + E + N D +S+ G+ FN GA L A+AY +
Sbjct: 46 DLEQDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATLNGANLTDALAYATD 105
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
FT ADLSD N TNA+L+ S+ GA I+GADF++AV+ Q++ LC
Sbjct: 106 FTDADLSD----------VNFTNALLME-----SNFEGAKIDGADFTNAVLSRIQQKELC 150
Query: 222 KYANGTNPITGVSTRKSLGC 241
ANGTN TG ST SLGC
Sbjct: 151 AIANGTNSSTGESTEYSLGC 170
>gi|124025420|ref|YP_001014536.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL1A]
gi|123960488|gb|ABM75271.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
Length = 156
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 60/112 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +D+++SDFSGS GA A AN + ++ D MD +LN ANL+N+VL
Sbjct: 45 ATFYLSDLQDSDFSGSDLQGASFFDAKLENANLSNTNMRDVTMDAAILNGANLSNSVLEG 104
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ IIEGADF+D +I + LC ANG N +T T +L C
Sbjct: 105 AFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGINSVTNKKTSDTLDC 156
>gi|307151213|ref|YP_003886597.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306981441|gb|ADN13322.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 174
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 71/136 (52%), Gaps = 29/136 (21%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGA 165
AQF + DL +A NF+ AD+R + F+GS K +GA L A+AY ++F GA
Sbjct: 56 AQFTNVDLTQA---------NFSDADLRGAVFNGSALKEVKLHGADLTNALAYLSSFEGA 106
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
DLSD A+ +L R+ A + G DFS AV+D + LCK A+
Sbjct: 107 DLSD---------------AIFAEAILKRTSFKNADVTGTDFSFAVLDGEEIANLCKSAS 151
Query: 226 GTNPITGVSTRKSLGC 241
G N TGVSTR SL C
Sbjct: 152 GVNSKTGVSTRDSLRC 167
>gi|218438527|ref|YP_002376856.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218171255|gb|ACK69988.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 172
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 66/131 (50%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+ F DLR A F A++R S+FS G A ANF GA+L
Sbjct: 49 SDFSGQDLRDA---------KFDHANLRSSNFSNVNAEGVRFFAANLESANFEGANLRYA 99
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
++ L N TNAVL T + GAII+GADF+D ++ +Q LC A GTNP+
Sbjct: 100 DLESARLTRVNFTNAVLEGAFATNTLFKGAIIDGADFTDVLLRPDTEQYLCTIAKGTNPV 159
Query: 231 TGVSTRKSLGC 241
TG +T+ +L C
Sbjct: 160 TGRNTKDTLYC 170
>gi|449018152|dbj|BAM81554.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 321
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT- 190
F + +R+ DFSGS A A ANF A+LS ++ L +A+L NA+L
Sbjct: 209 FQQSIVRDVDFSGSNLQDASFFDADCSGANFQNANLSRANLELANLRKADLRNAILTNAY 268
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
V+ ++ L G IEG+D++D ++ Q++ LCK A+G NP+T ++T+ SLGC +
Sbjct: 269 VVGQTKLEGIQIEGSDWTDVLLRPDQRRLLCKRASGENPVTHIATKDSLGCAD 321
>gi|22298403|ref|NP_681650.1| hypothetical protein tll0860 [Thermosynechococcus elongatus BP-1]
gi|22294582|dbj|BAC08412.1| tll0860 [Thermosynechococcus elongatus BP-1]
Length = 178
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 68/129 (52%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR + K AN +++ ++ G F GA LE A N GADL +
Sbjct: 54 FSGRDLRGSEFTK----ANLFHSNLSHTNLQGVSFFGANLETA-----NLEGADLRYATL 104
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
D L +ANLTNA+L ++ AII GADF+D + ++ LCK A+GTNP+TG
Sbjct: 105 DTARLTKANLTNAILEGAFAFNTNFDDAIITGADFTDVELREDAQRKLCKVASGTNPVTG 164
Query: 233 VSTRKSLGC 241
T ++L C
Sbjct: 165 RKTWETLHC 173
>gi|427728200|ref|YP_007074437.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364119|gb|AFY46840.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 164
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 92/184 (50%), Gaps = 28/184 (15%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
++ WRV S LA ++ A+ SS+I+ A + G+ IG A+F +AD
Sbjct: 1 MRYWRVLASFVLAMILLLFPLSAEAASSSSITRSAGDEVARKDFSGQSLIG--AEFTNAD 58
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L EN ANF+ AD+R G FNG LE N G D S+ +
Sbjct: 59 L-------EN--ANFSDADLR-----GGVFNGTVLEGV-----NLHGVDFSNGIAYLAKF 99
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
ANL++AVL ++ RS I G DF++AV+D Q + LC A+G N TGV TR+
Sbjct: 100 KNANLSDAVLTDAMMLRSTFDNVDITGTDFTNAVLDGPQVKKLCTKASGVNSKTGVDTRE 159
Query: 238 SLGC 241
SLGC
Sbjct: 160 SLGC 163
>gi|428776639|ref|YP_007168426.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428690918|gb|AFZ44212.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 167
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 72/122 (59%), Gaps = 5/122 (4%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
+ + ++E AN ++AD+ ++D GS F + ++ A + ANFT T+++ +
Sbjct: 50 ETLQLREISDANLSAADLSDTDMRGSIFTASVMKDANLHGANFTF-----TVLNGVDFTN 104
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
A+L+ +L +L+R+ I GADF++AV+D Q LC+ A+G N TG++TR SL
Sbjct: 105 ADLSQTILEDAILSRATFENTDITGADFTNAVLDSRQIDQLCETASGVNEETGMATRDSL 164
Query: 240 GC 241
GC
Sbjct: 165 GC 166
>gi|72381929|ref|YP_291284.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|72001779|gb|AAZ57581.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
Length = 156
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 60/112 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +D++ SDFSGS GA A AN + ++ D MD +LN ANL+N++L
Sbjct: 45 ATFYLSDLQNSDFSGSDLQGASFFDAKLENANLSNTNMRDVTMDAAILNGANLSNSILEG 104
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ IIEGADF+D +I + LC ANG N +T T ++L C
Sbjct: 105 AFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGINSVTNKKTSETLDC 156
>gi|452821017|gb|EME28052.1| thylakoid lumenal protein [Galdieria sulphuraria]
Length = 217
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 64/114 (56%), Gaps = 1/114 (0%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F + +RE+DF G+K A A AN ADL++ ++ L A L NAVL R
Sbjct: 104 FQQSLLRETDFHGAKLVSASFFGAELSYANLEDADLTEANLELANLRSAKLKNAVLRRAY 163
Query: 192 LT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 244
+ + L I+GADFS+ ++ QK+ LC ANGTN TGV T+ SLGC +S
Sbjct: 164 FSGNTRLENVDIDGADFSEVILRKDQKKYLCNIANGTNSHTGVETKTSLGCNSS 217
>gi|422295781|gb|EKU23080.1| pentapeptide repeat protein [Nannochloropsis gaditana CCMP526]
Length = 217
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 69/117 (58%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
K+ + +F+ A + ++F G+K GA K+ +A+FTGADL+ + + +A L +
Sbjct: 100 KDFSKKDFSGAFAQRANFKGAKLMGARFYKSALTEADFTGADLTSASFEGANMVDAILKD 159
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A++ T + L IEGADFSD ++D ++ LC+ A GTNP T V TR+SL C
Sbjct: 160 AIVNNAYFTETVLKVGSIEGADFSDTLLDRFVQKKLCEKATGTNPKTKVDTRESLLC 216
>gi|218248608|ref|YP_002373979.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218169086|gb|ACK67823.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 152
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 66/129 (51%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR A+ F A++R S+FS + G A ANF GADL +
Sbjct: 28 FSGQDLRDAL---------FDHANLRGSNFSHANLQGVRFFSANLEGANFEGADLRGADL 78
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
+ L N TNA+L T + G II+GADF+D ++ ++ LC A GTNP+TG
Sbjct: 79 ESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVTG 138
Query: 233 VSTRKSLGC 241
+T+ +L C
Sbjct: 139 RNTKDTLFC 147
>gi|317969761|ref|ZP_07971151.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 160
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 60/112 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++RE+DF G+ GA L A AN GADLSD +D VL +L NAVL
Sbjct: 48 ATFNLTNLREADFHGADLRGASLYGAKLQDANLAGADLSDATLDSAVLEGTDLRNAVLEN 107
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ +I+GADF++ + LC A+GTNP+TG T+ +L C
Sbjct: 108 AFAFNTRFKDVLIDGADFTNVPFRGDVLKTLCASASGTNPVTGRVTKDTLEC 159
>gi|72382023|ref|YP_291378.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|124025522|ref|YP_001014638.1| hypothetical protein NATL1_08151 [Prochlorococcus marinus str.
NATL1A]
gi|72001873|gb|AAZ57675.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
gi|123960590|gb|ABM75373.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 170
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 82/165 (49%), Gaps = 22/165 (13%)
Query: 77 AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSAD 136
A +V A + I L +LN + + + S F DL K ++ E +N T A
Sbjct: 28 AKSVFARTPAEIRNLEELNISQDMSSQDL---SGNDFVKLDL-KGINFSE---SNLTGAV 80
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
S +G+ +GA L A+AY ++F GADL D + +L E+N T+A
Sbjct: 81 FNNSKLNGADLHGAQLNDALAYASDFEGADLRDVDFNGALLMESNFTDA----------- 129
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+IEGADF+DAVI Q++ LC A+GTN T T SLGC
Sbjct: 130 ----LIEGADFTDAVISRIQQKELCNMASGTNSKTDEDTSYSLGC 170
>gi|443328810|ref|ZP_21057403.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791546|gb|ELS01040.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 170
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 62/113 (54%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R +F D+ E++FS S GA + AN GAD + A+L++A+
Sbjct: 56 RLDFAKVDLSEANFSNSDLRGAVFNASDLSNANLHGADFTYGFAYLTDFQGADLSDAIFR 115
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T+L+ S A+I+GADF+ A+++ Q LC+ A G N TGV TR+SLGC
Sbjct: 116 ETILSFSSFEDAMIDGADFTLAILEKWQVNQLCENATGVNSQTGVDTRRSLGC 168
>gi|428774426|ref|YP_007166214.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428688705|gb|AFZ48565.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 158
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 72/131 (54%), Gaps = 20/131 (15%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTNA 185
++T A + ESDFSG +G+ K ++FT A+LS+ +D L ANLTNA
Sbjct: 27 DYTKAHLVESDFSGQDLSGSTFNKTNLRSSDFTNANLSNVSFFGANLDSANLEGANLTNA 86
Query: 186 VLVRTVLTRSDLGGAI---------------IEGADFSDAVIDLAQKQALCKYANGTNPI 230
VL +TR++L A+ IEGADF+D ++ ++ LC+ A+G NP+
Sbjct: 87 VLDSARVTRANLHNAVLEGAFATNTKFEKANIEGADFTDVLLRPDVEEMLCEVASGINPV 146
Query: 231 TGVSTRKSLGC 241
TG +TR +L C
Sbjct: 147 TGRNTRDTLYC 157
>gi|158337082|ref|YP_001518257.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307323|gb|ABW28940.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 175
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 60/112 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+FT AD+R SDFS S G A N GA+LS +D ANLTNA L
Sbjct: 63 ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ AII+GADF+D + + LC A GTNP+TG +TR +L C
Sbjct: 123 AFAFNTEFRRAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174
>gi|257061674|ref|YP_003139562.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591840|gb|ACV02727.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 167
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 66/129 (51%), Gaps = 9/129 (6%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DLR A+ F A++R S+FS + G A ANF GADL +
Sbjct: 47 FSGQDLRDAL---------FDHANLRGSNFSHANLQGVRFFSANLEGANFEGADLRGADL 97
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
+ L N TNA+L T + G II+GADF+D ++ ++ LC A GTNP+TG
Sbjct: 98 ESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVTG 157
Query: 233 VSTRKSLGC 241
+T+ +L C
Sbjct: 158 RNTKDTLFC 166
>gi|359460626|ref|ZP_09249189.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 175
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 60/112 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+FT AD+R SDFS S G A N GA+LS +D ANLTNA L
Sbjct: 63 ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ AII+GADF+D + + LC A GTNP+TG +TR +L C
Sbjct: 123 AFAFNAEFRKAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174
>gi|75908971|ref|YP_323267.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75702696|gb|ABA22372.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 164
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/189 (35%), Positives = 96/189 (50%), Gaps = 38/189 (20%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K WRV S LA + A+ SS+I+ A + G+ IGS +F + D
Sbjct: 1 MKYWRVVASFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLM 172
L EN ANF++AD+R F+G+ G L +AY A F ADLSD
Sbjct: 59 L-------EN--ANFSNADLRGGVFNGTVLEGVNLHGVDFSNGIAYLARFKNADLSD--- 106
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
A LT+A+++R+V D + GADF++AV+D + + LC A+G N TG
Sbjct: 107 -------AVLTDAMMLRSVFDNVD-----VSGADFTNAVLDGTEVKKLCVKASGVNSKTG 154
Query: 233 VSTRKSLGC 241
V TR+SLGC
Sbjct: 155 VDTRESLGC 163
>gi|352094203|ref|ZP_08955374.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351680543|gb|EHA63675.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 159
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 6/135 (4%)
Query: 113 FGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F + D K V + +F F ++RE+D SGS GA L A AN + +
Sbjct: 24 FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNTN 83
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L D +D V + NLTNAVL + +EGADF++ + + LC A G
Sbjct: 84 LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRADALKVLCANAEG 143
Query: 227 TNPITGVSTRKSLGC 241
NP+TG T ++LGC
Sbjct: 144 VNPVTGRDTSETLGC 158
>gi|434406341|ref|YP_007149226.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428260596|gb|AFZ26546.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 165
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 67/117 (57%), Gaps = 20/117 (17%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
ANF+ AD+R F+G+ G L + +AY NF GAD +D A T+
Sbjct: 63 ANFSDADLRGVVFNGTLLKGVNLHGVDFSQGIAYLVNFKGADFTD----------AVFTD 112
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A+++R++ + + GADF++AV+D+ Q + LC A+G N TGV+TR+SLGC
Sbjct: 113 AMMLRSLFDDVN-----VTGADFTNAVLDMQQVKKLCLKASGVNSQTGVNTRESLGC 164
>gi|427736970|ref|YP_007056514.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427372011|gb|AFY55967.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 164
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 20/116 (17%)
Query: 131 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF++ADMR + F+GS +G +AY +NF +DLSD + TNA
Sbjct: 63 NFSNADMRGAVFNGSLLENSNLHGVDFTDGIAYLSNFKDSDLSDAI----------FTNA 112
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+++RT+ D + GADFS A++D + + LC+ A+G N TGVSTR SL C
Sbjct: 113 MMLRTIFRNVD-----VTGADFSGAILDRVEVKKLCETASGVNSKTGVSTRASLEC 163
>gi|443477206|ref|ZP_21067069.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443017715|gb|ELS32099.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 167
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 61/110 (55%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F +D+R + F + G A +AN TGA+LS + +D L++ANLTNAV+ +
Sbjct: 57 FNESDLRNASFVNADAQGVSFFAANMKEANLTGANLSYSTLDNARLDKANLTNAVIEGSF 116
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ II+GADF+D + +Q LCK A G NP TG TR +L C
Sbjct: 117 AYGTSFNNVIIDGADFTDVDLRTPIRQKLCKSAKGQNPTTGRLTRDTLEC 166
>gi|88770664|gb|ABD51935.1| chloroplast thylakoid 11 kDa protein [Guillardia theta]
Length = 242
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 2/107 (1%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M++ DFS KF A + K A +A F GAD S+ +MDR +++ A+ VL+ S+
Sbjct: 134 MQKGDFSKVKFKDAVMSKVFADEATFDGADFSNAVMDRGTWRKSSFKGAIFANAVLSGSE 193
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
G+ + +DFSD + + +CK GTNP+TGV TR S C
Sbjct: 194 FEGSDLTDSDFSDTYMGDFDNKKICKNPTLQGTNPVTGVDTRASASC 240
>gi|33861906|ref|NP_893467.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
gi|33640274|emb|CAE19809.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
Length = 192
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 71/136 (52%), Gaps = 21/136 (15%)
Query: 116 ADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
ADL+ +VK + AN A M + S F GA ++ +AY F AD SD
Sbjct: 63 ADLQNNEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
ANLTN L+++V GA I+GADF++A +DL +++LC+ A+GTN
Sbjct: 123 ----------ANLTNGELMKSVFD-----GATIDGADFTNANLDLKTRKSLCERASGTNS 167
Query: 230 ITGVSTRKSLGCGNSR 245
TGV T +SL C R
Sbjct: 168 QTGVDTFESLECSGLR 183
>gi|428164857|gb|EKX33868.1| hypothetical protein GUITHDRAFT_155908 [Guillardia theta CCMP2712]
Length = 237
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 2/107 (1%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M++ DFS KF A + K A +A F GAD S+ +MDR +++ A+ VL+ S+
Sbjct: 129 MQKGDFSKVKFKDAVMSKVFADEATFDGADFSNAVMDRGTWRKSSFKGAIFANAVLSGSE 188
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
G+ + +DFSD + + +CK GTNP+TGV TR S C
Sbjct: 189 FEGSDLTDSDFSDTYMGDFDNKKICKNPTLQGTNPVTGVDTRASASC 235
>gi|124023314|ref|YP_001017621.1| hypothetical protein P9303_16121 [Prochlorococcus marinus str. MIT
9303]
gi|123963600|gb|ABM78356.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 158
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 66/131 (50%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F + DLR F A++RE++ SGS G+ L A + AN + +L D+
Sbjct: 36 ADFSNQDLRGDT---------FNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDS 86
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D + + +LTNAVL + I GADF++ + LC+ A GTNPI
Sbjct: 87 TLDSAIFDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPI 146
Query: 231 TGVSTRKSLGC 241
TG +T SLGC
Sbjct: 147 TGRNTADSLGC 157
>gi|186683889|ref|YP_001867085.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466341|gb|ACC82142.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 165
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 63/112 (56%), Gaps = 10/112 (8%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF++AD+R G FNG LE N G D S+ + +A+L++AVL
Sbjct: 63 ANFSNADLR-----GGVFNGTLLEGV-----NLHGVDFSEGIAYLTRFKDADLSDAVLTD 112
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ RS + GADF++A++D Q + LC A+G N TGV TR+SLGC
Sbjct: 113 AMMLRSTFDDVNVTGADFTNAILDGTQVKKLCVKASGVNSKTGVDTRQSLGC 164
>gi|224098455|ref|XP_002311180.1| predicted protein [Populus trichocarpa]
gi|222851000|gb|EEE88547.1| predicted protein [Populus trichocarpa]
Length = 218
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 104/214 (48%), Gaps = 28/214 (13%)
Query: 40 ISSKTESDGQFPDCSNNQCAGPYAKLKNWRV---FVSTALAAAVVASCSSNISALA--DL 94
I+ + S P S + C P A + N ++ F T A + S ALA
Sbjct: 20 ITKPSLSIPHLPSLSFSHCDKPQALIPNKQLVEDFAKTGFLAILSVSLFFTDPALAFKGG 79
Query: 95 NKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE 153
Y +E TRG+ G D +K++F+ + +R+++F G+K GA
Sbjct: 80 GPYGSEVTRGQDLTGK-------DFSGRTLIKQDFKTSI----LRQANFKGAKLLGASF- 127
Query: 154 KAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADF 207
+ A+ TGADLSD + D + N +ANL+NA L + T + G+ I GADF
Sbjct: 128 ----FDADLTGADLSDADLRSADFSLTNVTKANLSNANLEGALATGNTSFRGSNITGADF 183
Query: 208 SDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+D + Q++ LCK+A+G NP TG +TR +L C
Sbjct: 184 TDVPLREDQREYLCKFADGVNPTTGNATRDTLLC 217
>gi|254423673|ref|ZP_05037391.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196191162|gb|EDX86126.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 190
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 67/133 (50%), Gaps = 1/133 (0%)
Query: 110 AAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A F +LR+ ++ N +T AD+ E+D S + L +AN GA+L+
Sbjct: 57 ADNFDRMNLRQQDFSGQDLTDNDYTRADLTEADLSHTNLERVRLFTTRLNRANLEGANLT 116
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+D L ANL +AVL D G IEGADF+D ++D LC+ A GTN
Sbjct: 117 GATLDGASLVGANLKDAVLEGAYAINIDFRGIDIEGADFTDVLLDPKDNDKLCEIATGTN 176
Query: 229 PITGVSTRKSLGC 241
P TG T+++L C
Sbjct: 177 PTTGRKTKETLYC 189
>gi|307154028|ref|YP_003889412.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984256|gb|ADN16137.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 172
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 60/110 (54%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A++R S+FS + G + ANF GA+L ++ L N TNAVL
Sbjct: 61 FDHANLRSSNFSNANLEGVRFFASNLESANFEGANLRYADLESARLIRVNFTNAVLEGAF 120
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
T + GAII+GADF+D ++ ++ LC A GTNP+TG T+ +L C
Sbjct: 121 ATNTLFKGAIIDGADFTDVLLRPDVEKYLCTIAKGTNPVTGRDTKDTLYC 170
>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 471
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 60/101 (59%), Gaps = 14/101 (13%)
Query: 115 SADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSD 169
ADLRKA N + A M +D SG+ GA LE AVA+KANFTGA+LSD
Sbjct: 133 QADLRKA---------NLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFTGANLSD 183
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L+D+ L+E++L+NA L ++L +DL AI+ G S A
Sbjct: 184 GLLDQANLSESDLSNANLHNSILDETDLSKAILRGTTLSKA 224
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 10/94 (10%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANL 182
F+A+ + A +RE++ +G+ +GA L KA + Y+A GA+L DT + L +A+L
Sbjct: 77 FKADLSEASIREANMTGANLSGATLHKADLQRVILYRATLAGANLFDTTLHEANLCQADL 136
Query: 183 TNAVLVRTVLTRSDLGGA-----IIEGADFSDAV 211
A L + +DL GA I+EG D DAV
Sbjct: 137 RKANLSMARMHHTDLSGANLTGAILEGIDLKDAV 170
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 49/100 (49%), Gaps = 8/100 (8%)
Query: 128 FRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
+ + AD+ +++FSG+ GA LE AV Y+ + ADLS+ + + ANL
Sbjct: 37 WEIDLMGADLSQTNFSGANLVRASLQGARLENAVLYRTSLFKADLSEASIREANMTGANL 96
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+ A L + L R L A + GA+ D + A LC+
Sbjct: 97 SGATLHKADLQRVILYRATLAGANLFDTTLHEAN---LCQ 133
>gi|126659509|ref|ZP_01730642.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
gi|126619243|gb|EAZ89979.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
Length = 167
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 70/130 (53%), Gaps = 11/130 (8%)
Query: 123 HVKENF-RANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 171
+ K+N +F+ D+R+SDF G F+ A L+ + ANF GADL
Sbjct: 37 YAKQNLVERDFSGQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
++ L N TNA+L T + GA+I+GADF+D ++ L ++ LC A GTNP+T
Sbjct: 97 LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCDIAKGTNPVT 156
Query: 232 GVSTRKSLGC 241
+T+ +L C
Sbjct: 157 RRNTKDTLFC 166
>gi|427719897|ref|YP_007067891.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427352333|gb|AFY35057.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 165
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 61/114 (53%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+A FTS +++ ++FS + G + N GAD S+ + +L++A+L
Sbjct: 51 IQAEFTSVNLKNTNFSNADLRGGVFNSTLLEGVNLHGADFSEGIAYLARFKNTDLSDAIL 110
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ RS I GADF++AV+D Q + LC A+G N TG TR+SLGC
Sbjct: 111 TDAMMLRSTFDDVDITGADFTNAVLDGVQIKKLCVNASGVNSKTGTDTRESLGC 164
>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
Length = 196
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 17/178 (9%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 34 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87
Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
N R A+FT A+++ + F + +GA LE A A +F A L+ ANL
Sbjct: 88 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 137
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L + T + G IEGAD +D ++ + LC A GTNP+TG T+++L C
Sbjct: 138 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 195
>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
Length = 194
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 17/178 (9%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 32 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85
Query: 125 KENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
N R A+FT A+++ + F + +GA LE A A +F A L+ ANL
Sbjct: 86 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 135
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L + T + G IEGAD +D ++ + LC A GTNP+TG T+++L C
Sbjct: 136 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 193
>gi|282897571|ref|ZP_06305571.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281197494|gb|EFA72390.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 164
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 87/184 (47%), Gaps = 28/184 (15%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K W++FV L A A+ SS+I+ A + G+ +G +F +
Sbjct: 1 MKYWQIFVGLVLTAVFFVSNLPAQAASSSSITRSAGSEIEIQDYSGKSLVGK--EFTNIK 58
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L A NF++AD+R G FNG L AN G + SD +
Sbjct: 59 LENA---------NFSNADLR-----GVVFNGTLL-----IDANLHGVNFSDGISYLSNF 99
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
+NL++A+ ++ RS + GADF++A++D + + LC A+G N TGV TRK
Sbjct: 100 KNSNLSDAIFTNAMMLRSTFNNVDVTGADFTNAILDGVEVKKLCANASGVNSQTGVDTRK 159
Query: 238 SLGC 241
SLGC
Sbjct: 160 SLGC 163
>gi|440680470|ref|YP_007155265.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428677589|gb|AFZ56355.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 168
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 64/121 (52%), Gaps = 30/121 (24%)
Query: 131 NFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLMDRMVLNEA 180
NF++AD+R G FNGA LE + +AY A F D SD A
Sbjct: 67 NFSNADLR-----GGVFNGALLEGVNLHGVDFRQGIAYLARFKNTDFSD----------A 111
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 240
LT+A+++RT D + GADF++A++D+ Q + LC A G N TGV TR+SLG
Sbjct: 112 VLTDAMMLRTTFDDVD-----VTGADFTNAILDMTQVKKLCVNARGVNSQTGVDTRESLG 166
Query: 241 C 241
C
Sbjct: 167 C 167
>gi|86605126|ref|YP_473889.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86553668|gb|ABC98626.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 176
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 62/110 (56%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A++R+SD S K GA L A KAN GADL +D L A+L A L ++
Sbjct: 66 FLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLRGATLDMANLQGADLREAQLQDSM 125
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + + G I+GADF++A+I LC+ A G NP+TG +TR +L C
Sbjct: 126 MWLARVEGIQIDGADFTNALIRQDALSILCERATGVNPVTGRATRDTLEC 175
>gi|409993003|ref|ZP_11276163.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409936150|gb|EKN77654.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 162
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 50 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 109
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 110 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 161
>gi|72382760|ref|YP_292115.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|72002610|gb|AAZ58412.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
Length = 182
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 65/125 (52%), Gaps = 16/125 (12%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
DL+ +VK D+ D G+ F GAY + ++ TGA++++ +
Sbjct: 54 DLQNTEYVKY---------DLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATR 104
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
+ ANLTN L L +S G I+GADF+DAV+D +Q++ LCK A G ST
Sbjct: 105 FDNANLTNVNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STA 157
Query: 237 KSLGC 241
+SLGC
Sbjct: 158 ESLGC 162
>gi|124026482|ref|YP_001015597.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL1A]
gi|123961550|gb|ABM76333.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
Length = 182
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 65/125 (52%), Gaps = 16/125 (12%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
DL+ +VK D+ D G+ F GAY + ++ TGA++++ +
Sbjct: 54 DLQNTEYVKY---------DLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATR 104
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
+ ANLTN L L +S G I+GADF+DAV+D +Q++ LCK A G ST
Sbjct: 105 FDNANLTNVNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STA 157
Query: 237 KSLGC 241
+SLGC
Sbjct: 158 ESLGC 162
>gi|33862899|ref|NP_894459.1| hypothetical protein PMT0626 [Prochlorococcus marinus str. MIT
9313]
gi|33634815|emb|CAE20801.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 158
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 66/131 (50%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F + DLR F A++RE++ SGS G+ L A + AN + +L D+
Sbjct: 36 ADFSNQDLRGDT---------FNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDS 86
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D + + +LTNAVL + I GADF++ + LC+ A GTNPI
Sbjct: 87 TLDSAIFDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPI 146
Query: 231 TGVSTRKSLGC 241
TG +T +LGC
Sbjct: 147 TGRNTADTLGC 157
>gi|148239424|ref|YP_001224811.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147847963|emb|CAK23514.1| Secreted pentapeptide repeat protein [Synechococcus sp. WH 7803]
Length = 158
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 69/143 (48%), Gaps = 6/143 (4%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
FG+ + + D K V + +F F ++RE+D SGS GA L A
Sbjct: 16 FGLLLPSAEAAMDYAKQVLIGADFSNRDMQGVTFNLTNLREADLSGSDLQGASLYGAKLQ 75
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
AN + +L D +D VLN +LT+AVL + I GADF++ + +
Sbjct: 76 DANLSRTNLRDATLDSAVLNGTDLTDAVLEDAFAFNTRFIDVTISGADFTNVPLRGDVLK 135
Query: 219 ALCKYANGTNPITGVSTRKSLGC 241
LC A GTNP+TG TR +LGC
Sbjct: 136 TLCAAAEGTNPVTGRDTRDTLGC 158
>gi|88808450|ref|ZP_01123960.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
gi|88787438|gb|EAR18595.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
Length = 159
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 58/110 (52%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F ++RE+D SGS GA L A AN + +L D +D VLN +LT+AVL
Sbjct: 50 FNLTNLREADLSGSDLQGASLYGAKLQDANLSRTNLRDATLDSAVLNGTDLTDAVLEDAF 109
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ I GADF++ + + LC A GTNP+TG TR +LGC
Sbjct: 110 AFNTRFIDVTISGADFTNVPLRGDVLKTLCAAAEGTNPVTGRDTRDTLGC 159
>gi|87124267|ref|ZP_01080116.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
gi|86167839|gb|EAQ69097.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
Length = 183
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 61/117 (52%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
+E F ++RE+D SGS GA L A A+ + +L D +D VL+ NL+N
Sbjct: 67 REMQGVTFNLTNLREADLSGSDLQGASLFGAKLQDADLSNTNLRDATLDSAVLDGTNLSN 126
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AVL + I GADF++ + + LC A GTNP+TG +TR +LGC
Sbjct: 127 AVLEDAFAFNTRFINVTISGADFTNVPLRGDVLKTLCAVAEGTNPVTGRNTRDTLGC 183
>gi|86609869|ref|YP_478631.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558411|gb|ABD03368.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 176
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 62/110 (56%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A++R+SD S K GA L A KAN GADL +D L A+L A L ++
Sbjct: 66 FLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLRGATLDMANLQGADLREAQLQDSM 125
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + + G I+GADF++A+I LC+ A G NP+TG +TR +L C
Sbjct: 126 MWLARVEGIQIDGADFTNALIRQDALSILCERATGVNPVTGRATRDTLEC 175
>gi|291569983|dbj|BAI92255.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 170
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 58 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 117
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 118 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 169
>gi|148241708|ref|YP_001226865.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850018|emb|CAK27512.1| Secreted pentapeptide repeats protein [Synechococcus sp. RCC307]
Length = 156
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 59/112 (52%), Gaps = 7/112 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
++F A +R +DFSG+K +GA + +NF GADLSD LMDR NL+ L
Sbjct: 51 SSFAGAVVRNADFSGAKLHGAIFTQGAFAGSNFAGADLSDVLMDRADFTGTNLSGTNLSG 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
V S A IEGADF+ A++D + LC+ A G TR SL C
Sbjct: 111 VVANGSSFAKAEIEGADFTGALLDRDDQITLCRKAKG-------ETRLSLDC 155
>gi|411119230|ref|ZP_11391610.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711093|gb|EKQ68600.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 192
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 62/112 (55%), Gaps = 4/112 (3%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT A++RES+F G+ +G A AN GADL + +D L+ +NL NA L
Sbjct: 82 FTKANLRESNFRGADLHGVSFFGANLEGANLEGADLRNATLDTARLSRSNLKNANLEGAF 141
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGTNPITGVSTRKSLGC 241
+ GA I+GADF+ +D+ Q + ALC A GTNP T +TR +L C
Sbjct: 142 AFNAKFDGATIDGADFTG--VDMRQDVQHALCDRAAGTNPTTKRNTRDTLNC 191
>gi|209525582|ref|ZP_03274120.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423065234|ref|ZP_17054024.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493915|gb|EDZ94232.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406713366|gb|EKD08537.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 177
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 62/112 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++++ ++F S+ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 65 AEFANSNLEYANFDESELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176
>gi|33863821|ref|NP_895381.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9313]
gi|33635404|emb|CAE21729.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9313]
Length = 209
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 67/122 (54%), Gaps = 6/122 (4%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E + + D+ E+D GS F+ L+ A N G +L D L + A+L+ +
Sbjct: 87 EFVKYDLAGYDLSEADLRGSTFSVTSLKNA-----NLHGTNLEDVLAYATRFDNADLSES 141
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNS 244
+L L +S+ GA+I+GADF++A++D +++ALC A G N TGV T SL C G S
Sbjct: 142 ILRNANLRKSEFAGALIDGADFTNALLDKQEQKALCARATGKNSKTGVDTYSSLDCSGIS 201
Query: 245 RR 246
R
Sbjct: 202 ER 203
>gi|119511352|ref|ZP_01630465.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119463974|gb|EAW44898.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 164
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 20/116 (17%)
Query: 131 NFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF++AD R F+GS+ G L +AY F GADL+D A TNA
Sbjct: 63 NFSNADFRGGVFNGSRLEGVNLHGVDFSDGIAYLTQFKGADLTD----------AVFTNA 112
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+++R+V D I GADF++A++D Q + LC A+G N TG TR+SL C
Sbjct: 113 MMLRSVFDDVD-----ITGADFTNAILDGTQIKKLCTQASGVNSQTGADTRESLEC 163
>gi|384252144|gb|EIE25621.1| hypothetical protein COCSUDRAFT_83628, partial [Coccomyxa
subellipsoidea C-169]
Length = 122
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 66/117 (56%), Gaps = 1/117 (0%)
Query: 126 ENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
++FR AD+R ++FS + GA L A A F GA L++ ++ + A+L+
Sbjct: 5 KDFRGQKLYKADLRGTNFSKANMEGASLFGAFCKDAKFVGAHLNNADLESVDFENADLSE 64
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A+L +T + I G+D++D V+ +Q LCK A+GTNPITG TR++L C
Sbjct: 65 AILEGAQVTNAKFKNVNIAGSDWTDVVLRRDVQQQLCKIASGTNPITGQDTRETLIC 121
>gi|428180855|gb|EKX49721.1| hypothetical protein GUITHDRAFT_135885 [Guillardia theta CCMP2712]
Length = 244
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 105/235 (44%), Gaps = 35/235 (14%)
Query: 20 SSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAA 79
S KGP+ L + KP+ + + + E+D D + VS AL +A
Sbjct: 28 SLKGPHALSGM-KPVTRSHPAAVRMEADADAFDAK--------------KFAVSLALGSA 72
Query: 80 VVASCSSNISALADLNKYEAETRGEFGI--GSAAQFGSAD----LRKAVHVKENFRAN-- 131
++ S I A A + G F + G+A+ S R A+ NF
Sbjct: 73 LLFSSGMPIPAFA-------QQGGSFKVLKGAASTQDSGSRRTITRGALLEGSNFDGQNL 125
Query: 132 ----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F + R+ F G+ GA AN GAD+S+ + + +ANL NA++
Sbjct: 126 PGISFQQSLCRDCSFVGTNLKGASFFDGDLTNANMEGADVSNVNFELTCMKDANLKNAIV 185
Query: 188 VRT-VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + + L G IEGADF+D + Q++ LCK A+GTNP TGV T+ SL C
Sbjct: 186 NNAYIQSTTKLDGINIEGADFTDTELRKDQQRYLCKRASGTNPKTGVDTKDSLRC 240
>gi|124022089|ref|YP_001016396.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9303]
gi|123962375|gb|ABM77131.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9303]
Length = 202
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 67/122 (54%), Gaps = 6/122 (4%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E + + D+ E+D GS F+ L+ A N G +L D L + A+L+ +
Sbjct: 80 EFVKYDLAGYDLSEADLRGSTFSVTTLKNA-----NLHGTNLEDVLAYATRFDNADLSES 134
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNS 244
+L L +S+ GA+I+GADF++A++D +++ALC A G N TGV T SL C G S
Sbjct: 135 ILRNANLRKSEFAGALIDGADFTNALLDRQEQKALCARATGKNSKTGVDTYTSLDCSGIS 194
Query: 245 RR 246
R
Sbjct: 195 ER 196
>gi|37523524|ref|NP_926901.1| hypothetical protein gll3955 [Gloeobacter violaceus PCC 7421]
gi|35214528|dbj|BAC91896.1| gll3955 [Gloeobacter violaceus PCC 7421]
Length = 159
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 78/175 (44%), Gaps = 18/175 (10%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
WR V LAA +V +SA AD+ + A L ++N
Sbjct: 2 WRSGVLAGLAAGLV--LPGLVSAQADIQN---------------NYNGAYLEGRSVAEQN 44
Query: 128 FR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
+ A F A++R DFS S GA L A ANF A L D + L A L AV
Sbjct: 45 LKQAQFYKANLRGVDFSSSDLRGASLFAASLRGANFNKARLDDAELSNADLQGAKLDQAV 104
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L +T + L ++GADF+ +I+ QK C A GTN +T TR++LGC
Sbjct: 105 LAGAYMTAARLKDVSVDGADFTGTIINNQQKTYQCGRATGTNGLTKRQTRRTLGC 159
>gi|427710138|ref|YP_007052515.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427362643|gb|AFY45365.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 164
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 61/112 (54%), Gaps = 10/112 (8%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF++AD+R G FNG LE N G D S+ + A+L++AVL
Sbjct: 62 ANFSNADLR-----GGVFNGIVLEGV-----NMHGVDFSNGIAYLARFKNADLSDAVLTD 111
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ RS I GADF++AV+D Q + LC A+G N T V TR+SLGC
Sbjct: 112 AMMLRSTFDNVEITGADFTNAVLDGTQVKKLCAKASGVNSKTSVDTRESLGC 163
>gi|427701765|ref|YP_007044987.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427344933|gb|AFY27646.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 175
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F ADLR F ++R++D SG+ GA L A A+ +G+DL D
Sbjct: 53 ADFHGADLRGVT---------FNLTNLRDADLSGADLRGASLFGAKLQDADLSGSDLRDA 103
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D V +L NA L + G +I+GADF++ + +LC A+GTNP+
Sbjct: 104 TLDSAVFEGTDLRNARLDDAFAFNTKFRGVLIDGADFTNVPLRGDALTSLCAAASGTNPV 163
Query: 231 TGVSTRKSLGC 241
TG TR +L C
Sbjct: 164 TGRLTRDTLNC 174
>gi|376005445|ref|ZP_09782948.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
gi|375326159|emb|CCE18701.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
Length = 177
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 65 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176
>gi|414077638|ref|YP_006996956.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413971054|gb|AFW95143.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 165
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 20/116 (17%)
Query: 131 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF +AD+R + F+G+ F+G + +AY + F +DLSD A T A
Sbjct: 64 NFNNADLRGAVFNGTLLDTVNFHGVDFSQGIAYLSRFKNSDLSD----------AVFTEA 113
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+++R+ + D + GADF++A++D+ Q + +C A+G N TGV TR SLGC
Sbjct: 114 MMLRSTFDQVD-----VTGADFTNAILDMIQIKKICINASGVNSKTGVDTRASLGC 164
>gi|148242416|ref|YP_001227573.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850726|emb|CAK28220.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
Length = 162
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 62/131 (47%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F S DL+ F ++RE+D SGS A L A AN +G+DL +
Sbjct: 40 ADFSSRDLKGVT---------FNLTNLREADLSGSDLRAASLFGAKLQDANLSGSDLREA 90
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D V N +L++A L + G I GADFSD + LC A GTN +
Sbjct: 91 TLDSAVFNGTDLSDARLEGAFAFNTRFSGVTITGADFSDVPLRGDALSTLCAVAEGTNSV 150
Query: 231 TGVSTRKSLGC 241
TG TR +LGC
Sbjct: 151 TGRDTRDTLGC 161
>gi|224112717|ref|XP_002316270.1| predicted protein [Populus trichocarpa]
gi|222865310|gb|EEF02441.1| predicted protein [Populus trichocarpa]
Length = 219
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 101/210 (48%), Gaps = 38/210 (18%)
Query: 49 QFPDCSNNQCAGPYAKLKNWRV---FVSTALAAAVVASCSSNISALA--DLNKYEAE-TR 102
+F S+++C P A + N ++ F T L A + S ALA Y +E TR
Sbjct: 30 RFLSLSHSRCPNPQALILNKQLLEDFAKTGLLALLSVSLFFTDPALAFKGGGPYGSEVTR 89
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
G+ G D K++F+ + +R+++F G+K GA + A+
Sbjct: 90 GQDLTGK-------DFSGRTLTKQDFKTSI----LRQANFKGAKLLGASF-----FDADL 133
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----------IEGADFSDAV 211
TGADLSD L A+L+ A + + L+ ++L GA+ I GADF+D
Sbjct: 134 TGADLSDA-----DLRSADLSLANVAKVNLSNANLEGALATGNTSFRGSNITGADFTDVP 188
Query: 212 IDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ Q++ LCK A+G NP TG +TR +L C
Sbjct: 189 LREDQREYLCKVADGVNPTTGNATRDTLLC 218
>gi|449018747|dbj|BAM82149.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 269
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 64/115 (55%), Gaps = 2/115 (1%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ +F+ + R+++FSGS +GA KA +ANF A L +++ VL +N NAVL
Sbjct: 153 QKDFSGSTCRKTNFSGSDLSGARFFKADLTEANFENAQLIGASLEQTVLRGSNFQNAVLR 212
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
T T S L A IE D++DA+++ + LC A G N +T TR+SL C
Sbjct: 213 STYWTESVLTIANIENTDWTDALLEPTWQMKLCSRSDAKGMNTLTNTDTRESLMC 267
>gi|33240260|ref|NP_875202.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33237787|gb|AAP99854.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 158
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 68/139 (48%), Gaps = 6/139 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
+ + F S D K V E+F A F +++++D SGS GA L A +N
Sbjct: 19 TQSSFASIDYGKQTLVGEDFSKLDLKGATFYLTNLQDADLSGSDLEGASLFGAKLLNSNL 78
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+ A+L + +D V NL NAVL + + I+G+DF++ ++ LC
Sbjct: 79 SNANLHNATLDSAVFEGTNLENAVLEDAFVFNARFSDVNIQGSDFTNVILRNQDLSYLCS 138
Query: 223 YANGTNPITGVSTRKSLGC 241
ANGTNP+T T+ +L C
Sbjct: 139 IANGTNPVTKRKTKDTLQC 157
>gi|282900932|ref|ZP_06308865.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281194023|gb|EFA68987.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 164
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 65/117 (55%), Gaps = 20/117 (17%)
Query: 130 ANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
ANF++AD+R F+G+ +G ++Y +NF ++LSD + TN
Sbjct: 62 ANFSNADLRGVVFNGTLLIDTNLHGVNFSDGISYLSNFKNSNLSDAI----------FTN 111
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A+++R+ D I GADF++A++D + + LC A+G N TGV TR+SLGC
Sbjct: 112 AMMLRSTFNNVD-----ITGADFTNAILDGVEVKKLCADASGVNSQTGVDTRESLGC 163
>gi|427714384|ref|YP_007063008.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427378513|gb|AFY62465.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 177
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 65/131 (49%), Gaps = 11/131 (8%)
Query: 112 QFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
F DLR + K N F +N + D+R G F A LE A + TGADL
Sbjct: 54 DFSGKDLRDSEFTKANLFHSNLSHTDLR-----GVSFFAANLETA-----DLTGADLRVA 103
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D +ANLT+A L + GAII+GADF+D + ++ LC A G NP+
Sbjct: 104 TLDTARFTKANLTDANLEGAFAFNTIFDGAIIDGADFTDVDLRPDARKMLCSVAKGVNPV 163
Query: 231 TGVSTRKSLGC 241
TG +T +L C
Sbjct: 164 TGRATHDTLEC 174
>gi|332706397|ref|ZP_08426459.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354834|gb|EGJ34312.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 126
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 63/114 (55%), Gaps = 1/114 (0%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ T ++ +++ + +K A ++ AN GADL+ + + N+A+LT+ +
Sbjct: 12 KVKITYCNLDQANLADAKLIQASIKHTTLNNANLHGADLTKSDTYNISFNDADLTDVIFT 71
Query: 189 RTVLTRSDLGGAIIEGADFSDAVID-LAQKQALCKYANGTNPITGVSTRKSLGC 241
+L R+ GA I GADF+ +I + ++ LC A+G NP TGV TR SLGC
Sbjct: 72 GALLQRASFDGADITGADFTSTLIQPVRERLKLCDVASGVNPTTGVVTRDSLGC 125
>gi|428306980|ref|YP_007143805.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428248515|gb|AFZ14295.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 160
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 66/139 (47%), Gaps = 21/139 (15%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 162
S F L + V+ N NF +AD+R F+GS G+ L A +AY A+F
Sbjct: 36 SGKDFSGQTLISSEFVEANLDNTNFNNADIRGVVFNGSTLKGSSLHSADFTNGLAYAADF 95
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+ ADLSD AV ++L +S I G DFS V+D + LC
Sbjct: 96 SNADLSD---------------AVFSESILLKSRFDEVNINGTDFSGVVLDGTNVKKLCD 140
Query: 223 YANGTNPITGVSTRKSLGC 241
A+G N TGV+TR SLGC
Sbjct: 141 VADGVNSKTGVATRASLGC 159
>gi|224006618|ref|XP_002292269.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971911|gb|EED90244.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 255
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 4/114 (3%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F + +R+SDFS S GA A +NF AD++ ++ N ANL NA++
Sbjct: 140 FQQSIVRDSDFSNSNLYGASFFDATLDGSNFENADMTLCNVEMAQFNRANLKNAIVKDMY 199
Query: 192 LTRSDL--GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
++ + L G IEG+D+S+ + Q++ LC + A GTNP+TGV+TR+SL C
Sbjct: 200 VSGATLFEGVKDIEGSDWSETQLRKDQQKYLCNHPTAKGTNPVTGVNTRESLMC 253
>gi|449016903|dbj|BAM80305.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 341
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 68/128 (53%), Gaps = 11/128 (8%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA---------- 180
+ +S D+ + +G+ +GA L A ++ +GA+L D +L+EA
Sbjct: 211 DLSSVDLSTAALAGADLHGAALSHANLFQVQLSGANLRGAKFDASILDEAALDGADLSGA 270
Query: 181 NLTNAVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
+L A++ RT+L + L I I+GADFS A+ID ++ LC+ A G N TGV+T SL
Sbjct: 271 DLRQALVRRTLLLGARLDANISIDGADFSGALIDRTNQRLLCELAQGVNSRTGVATATSL 330
Query: 240 GCGNSRRN 247
C + N
Sbjct: 331 ACPEPKTN 338
>gi|302756827|ref|XP_002961837.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
gi|300170496|gb|EFJ37097.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
Length = 180
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 11/119 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN-----EANLT 183
+ +F ++ +R+++F G+K GA + AN TGAD SD + L+ +AN T
Sbjct: 66 KQDFKTSILRQANFKGAKLFGASF-----FDANLTGADFSDADLRGADLSLADATKANFT 120
Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L ++T + L GA I GADF+D + Q+ LC+ A+G NP+T STR++L C
Sbjct: 121 NANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIADGINPVTSNSTRETLLC 179
>gi|225449424|ref|XP_002282933.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic [Vitis
vinifera]
gi|296086195|emb|CBI31636.3| unnamed protein product [Vitis vinifera]
Length = 221
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 72/123 (58%), Gaps = 11/123 (8%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
K + +F ++ +R+++F G+ GA + A+ TGADLSD + D + N +
Sbjct: 103 KSLIKQDFKTSILRQANFKGANLLGASF-----FDADLTGADLSDADLRGADFSLANVTK 157
Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
ANL+NA L + T + G+II GADF+D + Q++ LCK A+G NP TG +TR++
Sbjct: 158 ANLSNANLEGALATGNTSFRGSIITGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 217
Query: 239 LGC 241
L C
Sbjct: 218 LLC 220
>gi|302798106|ref|XP_002980813.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
gi|300151352|gb|EFJ17998.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
Length = 180
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 11/119 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN-----EANLT 183
+ +F ++ +R+++F G+K GA + AN TGAD SD + L+ +AN T
Sbjct: 66 KQDFKTSILRQANFKGAKLFGASF-----FDANLTGADFSDADLRGADLSLADATKANFT 120
Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L ++T + L GA I GADF+D + Q+ LC+ A+G NP+T STR++L C
Sbjct: 121 NANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIADGINPVTSNSTRETLLC 179
>gi|87302765|ref|ZP_01085576.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
gi|87282648|gb|EAQ74606.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
Length = 168
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 62/132 (46%), Gaps = 11/132 (8%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F ADLR N R AN + ADMR + G+K A+ G DL +
Sbjct: 46 ADFHDADLRGVTFNLTNLRDANLSGADMRNASLFGAKLQ----------DADMHGVDLRE 95
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
+D VL +L AVL + IEGADF++ + +LC A+GTNP
Sbjct: 96 ATLDSAVLEGTDLREAVLEDAFAFNTKFVDVAIEGADFTNVPLRGDVLTSLCAIASGTNP 155
Query: 230 ITGVSTRKSLGC 241
+TG TR +LGC
Sbjct: 156 VTGRVTRDTLGC 167
>gi|332705869|ref|ZP_08425945.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332355661|gb|EGJ35125.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 150
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 57/115 (49%), Gaps = 2/115 (1%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A F + D+ D SG F+ A AN + + + ++ ANL A
Sbjct: 35 KATFANTDLSGQDLSGQDFHNAVFSSVNLQSANLSNVNFKGANITKVNFTNANLQGADFS 94
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI--TGVSTRKSLGC 241
+ + GA I GADF+ A++D Q + LCK A+ TNPI TGV TR SLGC
Sbjct: 95 YAFINVCNFKGANITGADFTFAILDSKQYRELCKNASATNPITDTGVDTRYSLGC 149
>gi|427723472|ref|YP_007070749.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427355192|gb|AFY37915.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 170
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 66/134 (49%), Gaps = 7/134 (5%)
Query: 115 SADLRKAVHVKENF-----RAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ D K ++E+F R N + + +R SDFS G A NF GAD+
Sbjct: 36 AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGSDFSYCDLRGVRFFSANLEFVNFEGADMR 95
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 227
++D + AN TNA L L + +I+GADF+DA+I + LC A GT
Sbjct: 96 GAVLDSARIGHANFTNANLEGAYLASVKITPSTVIDGADFTDALILKNENDKLCDLATGT 155
Query: 228 NPITGVSTRKSLGC 241
NP TGV T +SL C
Sbjct: 156 NPDTGVDTAESLYC 169
>gi|255645177|gb|ACU23086.1| unknown [Glycine max]
Length = 222
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 72/123 (58%), Gaps = 11/123 (8%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
K + +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +
Sbjct: 104 KTLIKQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTK 158
Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
ANL+NA L ++T + G+ + GADF+D + Q++ LCK A+G NP TG +TR +
Sbjct: 159 ANLSNANLEGALVTGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDT 218
Query: 239 LGC 241
L C
Sbjct: 219 LFC 221
>gi|298492954|ref|YP_003723131.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298234872|gb|ADI66008.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 164
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 63/116 (54%), Gaps = 20/116 (17%)
Query: 131 NFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF+++D+R F+G+ G L + +AY F AD SD + LT+A
Sbjct: 63 NFSNSDLRGGVFNGTLLEGVNLHGVDFSQGIAYLVKFNNADFSDAI----------LTDA 112
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+++R+V D + GADF++A++D + + LC A+G N T V TR+SLGC
Sbjct: 113 MMLRSVFDNVD-----VTGADFTNAILDGVEIKKLCLKASGVNSKTAVDTRESLGC 163
>gi|449441422|ref|XP_004138481.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
[Cucumis sativus]
Length = 214
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 75/142 (52%), Gaps = 15/142 (10%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G+ D +K++F+ + +R+++F G+ GA + A+ TGA
Sbjct: 81 GVTRGQDLSGKDFSGKTLIKQDFKTSI----LRQANFKGANLLGASF-----FDADLTGA 131
Query: 166 DLSDTLM---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQA 219
DLSD + D + N +ANL+NA L + T + G+ I GADF+D + Q++
Sbjct: 132 DLSDADLRGADFSLANVTKANLSNANLEGALATGNTSFRGSTINGADFTDVPLREDQREY 191
Query: 220 LCKYANGTNPITGVSTRKSLGC 241
LCK A+G NP TG +TR++L C
Sbjct: 192 LCKVADGVNPTTGNATRETLLC 213
>gi|170079322|ref|YP_001735960.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169886991|gb|ACB00705.1| Pentapeptide repeat containing protein [Synechococcus sp. PCC 7002]
Length = 166
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 65/134 (48%), Gaps = 7/134 (5%)
Query: 115 SADLRKAVHVKENF-----RAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ D K ++E+F R N + + +R DFS S G A NF GADL
Sbjct: 32 AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGCDFSYSDLRGVRFFSANLEFVNFEGADLR 91
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 227
++D + AN NA L L + +IEGADF+DA+I + LC+ A+GT
Sbjct: 92 GAVLDSARIGHANFKNANLEGAFLASVKITPSTVIEGADFTDALILARENDKLCELASGT 151
Query: 228 NPITGVSTRKSLGC 241
NP TG T +L C
Sbjct: 152 NPTTGRDTAATLYC 165
>gi|351722845|ref|NP_001236746.1| uncharacterized protein LOC100500352 [Glycine max]
gi|255630103|gb|ACU15405.1| unknown [Glycine max]
Length = 224
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 71/123 (57%), Gaps = 11/123 (8%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
K + +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +
Sbjct: 106 KTLIKQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTK 160
Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
ANL+NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR +
Sbjct: 161 ANLSNANLEGALATGNTSFKGSNITGADFTDVPLREDQREYLCKVADGVNPTTGNATRDA 220
Query: 239 LGC 241
L C
Sbjct: 221 LFC 223
>gi|298705858|emb|CBJ29003.1| thylakoid lumenal protein [Ectocarpus siliculosus]
Length = 199
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 59/105 (56%), Gaps = 2/105 (1%)
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
E++FS F + KA A +N+ AD ++ ++DR+ + +++ A+ VLT +
Sbjct: 92 EANFSKGDFKEVVMSKAYARSSNWEEADFTNAVVDRVSFDGSSMKGAIFQNAVLTSTSFT 151
Query: 199 GAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
GA +E ADF++A + ++ LCK GTNP+T TR S GC
Sbjct: 152 GADVENADFTEAYMGDFDQKNLCKNPTLKGTNPVTNADTRASAGC 196
>gi|168022043|ref|XP_001763550.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685343|gb|EDQ71739.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 68/115 (59%), Gaps = 1/115 (0%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+ +F ++ +R+++F G+K GA + A+ T ADL + L++ANLTNA L
Sbjct: 50 IKQDFKTSILRQANFKGAKLLGASFFDSDLTGADLTDADLRGADLSLARLSKANLTNANL 109
Query: 188 VRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+T + L G+II GADF++ Q++ LC A+G NP+TG +TR++L C
Sbjct: 110 EGASVTGNTYLKGSIITGADFTEVNWRDDQRKELCLIADGVNPVTGNATRETLLC 164
>gi|388521435|gb|AFK48779.1| unknown [Lotus japonicus]
Length = 225
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
+ +F ++ +R+++F G+K GA + ++ TGADLSD + D + N +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFFLANVTKANLS 165
Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224
>gi|397595313|gb|EJK56448.1| hypothetical protein THAOC_23663 [Thalassiosira oceanica]
Length = 238
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 2/111 (1%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M ++D S + F A K +NF AD ++ ++DR ++L + VLT +
Sbjct: 128 MSKTDLSKANFREAQFSKGYLRDSNFEEADFTNAIVDRATFKGSSLKGTIFSNAVLTATS 187
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGCGNSR 245
GA +E ADF+DA I + LCK G NP+TG TR S CG R
Sbjct: 188 FEGADVENADFTDAYIGDFDIRNLCKNPTLKGENPLTGADTRLSANCGPGR 238
>gi|359806262|ref|NP_001240959.1| uncharacterized protein LOC100806792 [Glycine max]
gi|255626639|gb|ACU13664.1| unknown [Glycine max]
Length = 222
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 71/123 (57%), Gaps = 11/123 (8%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
K + +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +
Sbjct: 104 KTLIKQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTK 158
Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
ANL+NA L + T + G+ + GADF+D + Q++ LCK A+G NP TG +TR +
Sbjct: 159 ANLSNANLEGALATGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDT 218
Query: 239 LGC 241
L C
Sbjct: 219 LFC 221
>gi|302837694|ref|XP_002950406.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
nagariensis]
gi|300264411|gb|EFJ48607.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
nagariensis]
Length = 182
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 63/113 (55%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ T A++R+++F+ + G L +++ A F GA+L + ++ A+ TNAVL
Sbjct: 69 KLKLTKANLRQTNFTDANLEGVSLFGSLSESAIFRGANLRNADLESGNYEFADFTNAVLE 128
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + I G+D++D V+ ++ LC A+G NP TGVSTR+SL C
Sbjct: 129 GAFVNNAQFVKVTITGSDWTDVVLRKDVQKELCAIADGVNPTTGVSTRESLLC 181
>gi|255570589|ref|XP_002526251.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
[Ricinus communis]
gi|223534416|gb|EEF36120.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
[Ricinus communis]
Length = 228
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 109/245 (44%), Gaps = 24/245 (9%)
Query: 3 LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGP 61
+++IS PLS++SL SS + + + L P+ + C S+ F + + C
Sbjct: 1 MATISFPLSVRSL----SSERSRFPVPQLHPPIKIICSGSADGSKSKPFKELQSVACG-- 54
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGSADLR 119
L W V +A+ V + S + L+ + N+ E G G + DLR
Sbjct: 55 --LLAAWAV-----TSASPVIAASQRLPPLSTEPNRCEKAFVGNTIGQANGVYDKPIDLR 107
Query: 120 KAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
+ E N + + +A M ++ F G+ + + KA A A+F G D S+ ++DR+
Sbjct: 108 FCDYTNEKSNLKGKSLAAALMSDAKFDGADMSEVVMSKAYAVGASFKGVDFSNAVLDRVN 167
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 236
+ANL AV TVL+ S A + A F D +I Q LCK N + R
Sbjct: 168 FGKANLQGAVFKNTVLSGSTFDEAQLADAVFEDTIIGYIDLQKLCK-----NTSINLEGR 222
Query: 237 KSLGC 241
+ LGC
Sbjct: 223 EILGC 227
>gi|147774410|emb|CAN74472.1| hypothetical protein VITISV_013914 [Vitis vinifera]
Length = 221
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 71/123 (57%), Gaps = 11/123 (8%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 179
K + +F ++ +R+++F + GA + A+ TGADLSD + D + N +
Sbjct: 103 KSLIKQDFKTSILRQANFKXANLLGASF-----FDADLTGADLSDADLRGADFSLANVTK 157
Query: 180 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
ANL+NA L + T + G+II GADF+D + Q++ LCK A+G NP TG +TR++
Sbjct: 158 ANLSNANLEGALATGNTSFRGSIITGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 217
Query: 239 LGC 241
L C
Sbjct: 218 LLC 220
>gi|388510406|gb|AFK43269.1| unknown [Lotus japonicus]
Length = 225
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
+ +F ++ +R+++F G+K GA + ++ TGADLSD + D + N +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFSLANVTKANLS 165
Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224
>gi|428166498|gb|EKX35473.1| hypothetical protein GUITHDRAFT_97823, partial [Guillardia theta
CCMP2712]
Length = 230
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 2/119 (1%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
K+ + +F+ +E+ F G+K G KA A+FTGADLS ++ L+ L N
Sbjct: 112 KDFSKKDFSGCAAKEAKFVGTKLRGTRFFKADLTGADFTGADLSTASLEDAKLDGVVLKN 171
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
A+L + I GADF+DA++ LCK A GTNP+T TR+SLGC
Sbjct: 172 AILSNSYTNLGLDKVKDISGADFTDALVRPDILAKLCKRSDATGTNPVTKADTRESLGC 230
>gi|413968546|gb|AFW90610.1| chloroplast thylakoid lumenal 17.4 kDa protein [Solanum tuberosum]
Length = 228
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 101/231 (43%), Gaps = 26/231 (11%)
Query: 3 LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGP 61
++SIS PL+ KS + S P QLH+ P+ + C S DCSN++ +
Sbjct: 1 MASISIPLAYKSHSLRRSPIYRPSQLHS---PIQIKCSASK---------DCSNSEESS- 47
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISA-------LADLNKYEAETRGE-FGIGSAAQF 113
+ K R LA ++S S I+A D N+ E G G +
Sbjct: 48 -TQFKQLRNVACGFLAVWALSSVSPVIAAGQRLPPLSTDPNRCERAFVGSTIGQANGVYD 106
Query: 114 GSADLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
DLR + E N + + +A M ++ F G+ + KA A A+F D S+
Sbjct: 107 KPLDLRFCDYTNEKTNLKGKSLAAALMSDAKFDGADMTEVIMSKAYAVGASFKAMDFSNA 166
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
++DR+ +ANL A TVL+ S A ++G DF D +I Q +C
Sbjct: 167 VLDRVNFEKANLQGASFKNTVLSGSTFNDAQLDGVDFEDTIIGYIDLQKIC 217
>gi|159467845|ref|XP_001692102.1| hypothetical protein CHLREDRAFT_115715 [Chlamydomonas reinhardtii]
gi|158278829|gb|EDP04592.1| predicted protein [Chlamydomonas reinhardtii]
Length = 124
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 65/110 (59%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
T A++R+++ +G+ G L +++ A F GA+L + ++ +A+ ++A+L
Sbjct: 14 LTKANLRQTNLTGANLEGVSLFGSLSEGAVFKGANLRNADLESGNYEDADFSDAILEGAF 73
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + I+G+D++D V+ ++ALC A+G NP TGVSTR+SL C
Sbjct: 74 VNNAQFVRVNIKGSDWTDVVLRKDIQKALCAIADGVNPTTGVSTRESLMC 123
>gi|298715141|emb|CBJ27829.1| Thylakoid lumenal 15 kDa protein, chloroplast precursor (p15)
[Ectocarpus siliculosus]
Length = 245
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 71/146 (48%), Gaps = 14/146 (9%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
AA G RK V N A++ D+ F S G + A A F ADLS
Sbjct: 99 AASTGDKGARKTVTRGVNIENADYHDKDLSSVSFQQSLVRGTNFKNAKLVAAGFFDADLS 158
Query: 169 D-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGA------IIEGADFSDAVIDLAQK 217
+ M++ L ANL+ A + ++T + + GA IIEGADF+D + Q
Sbjct: 159 NCNFESANMNQANLELANLSGANMKNALVTEAYVSGATKMEPAIIEGADFTDTFLRKDQV 218
Query: 218 QALC--KYANGTNPITGVSTRKSLGC 241
+ LC + A GTNP++GV TR SLGC
Sbjct: 219 RYLCGLETAKGTNPVSGVDTRDSLGC 244
>gi|219116042|ref|XP_002178816.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409583|gb|EEC49514.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 109
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ ++F S G KA +A+F+GADL ++ ++EA L + V V + S +
Sbjct: 3 KSTNFGKSNLKGCRFYKAYLVRADFSGADLRGASLEDTSMDEALLKDTVAVGAYFSASIM 62
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
+E ADF+DA + LC+ A GTNP+TGV TR+SL C
Sbjct: 63 DTLTVENADFTDAQFPIKTLPLLCERSDATGTNPVTGVDTRESLMC 108
>gi|357133836|ref|XP_003568528.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
[Brachypodium distachyon]
Length = 200
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
+ +F ++ +R+++F G+K GA + A+ TGADLSDT + D + N + NLT
Sbjct: 86 KQDFKTSILRQTNFKGAKLLGASF-----FDADLTGADLSDTDLRNADFSLANVTKVNLT 140
Query: 184 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L ++T + G+ I GADF+D + Q+ LCK A+G N TG +T+++L C
Sbjct: 141 NANLEGALVTGNTSFKGSTIYGADFTDVPLRDDQRDYLCKIADGVNTTTGNATKETLFC 199
>gi|298711847|emb|CBJ32870.1| Pentapeptide repeat [Ectocarpus siliculosus]
Length = 238
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 64/132 (48%), Gaps = 16/132 (12%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
DL K + ++F + +E +FSGS G KA KA+FTGA+L
Sbjct: 113 DLSKGKYKSKDFSGSIA----KEVNFSGSDLRGVRFFKADLKKADFTGANLGTA-----S 163
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQKQALCKY--ANGTNP 229
L EA+L ++ V T S G + I GADF+DA+I + LC A GTNP
Sbjct: 164 LEEADLEGTIMTNAVATGSYFGNNMNNVGDISGADFTDALIRKDVAKILCARPDAKGTNP 223
Query: 230 ITGVSTRKSLGC 241
TG TR SL C
Sbjct: 224 TTGTDTRDSLLC 235
>gi|219116308|ref|XP_002178949.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409716|gb|EEC49647.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 131
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 62/116 (53%), Gaps = 4/116 (3%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F + +R DFS S GA A +NF ++L + ++ L + NAV+
Sbjct: 16 FQQSIVRNCDFSNSDLRGASFFDATLTDSNFENSNLENVNLEMAQLTRVSFKNAVVTDAY 75
Query: 192 LTRSDLGGAI--IEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGCGN 243
++ + + + +EG+D+S+ + QK+ LC + A GTNP+TGV TR+SL C N
Sbjct: 76 VSGATIFDGVKDVEGSDWSETYLRADQKKLLCNHPTAKGTNPVTGVDTRESLMCPN 131
>gi|255073547|ref|XP_002500448.1| predicted protein [Micromonas sp. RCC299]
gi|226515711|gb|ACO61706.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 66/128 (51%), Gaps = 6/128 (4%)
Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DLR+ +V ++ + A M ++ F G+ + KA A A+FTGA+ ++ ++DR+
Sbjct: 93 DLRQCNYVDKDLSTKTLSGALMVDATFKGANMTEVVMSKAYAVNADFTGANFTNAVVDRV 152
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
+ ANL+NA V+T + G + GA F +A+I + LC+ NP T
Sbjct: 153 TFDGANLSNANFFNAVITGATFEGTNLAGAQFDEALIGKEDVKKLCE-----NPTLVEET 207
Query: 236 RKSLGCGN 243
R +GC N
Sbjct: 208 RFQVGCRN 215
>gi|217071608|gb|ACJ84164.1| unknown [Medicago truncatula]
Length = 240
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 27/242 (11%)
Query: 13 SLNFCSSSSKGP-YQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVF 71
SL+ + S+K P + AL P + C + + E DG N K+K
Sbjct: 12 SLSIRNFSTKRPCFTTSAL--PFTITCSVVGEAELDGT----ENKPRLLSLNKIKGVACG 65
Query: 72 VSTALAAAVVASCSSNISALA--------DLNKYEAETRGE-FGIGSAAQFGSADLRKA- 121
+ LAA V S S ++A D N+ E G G + + DLRK
Sbjct: 66 I---LAAYAVTSASFPVTAATQRLPPLSTDPNRCERAFVGNTIGQANGVYDKALDLRKCD 122
Query: 122 -VHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
+ K N + ++A M ++ F G+ + KA A +F G D S+ ++DR+ +
Sbjct: 123 FTNEKSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFGK 182
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
A+L AV TVL+ S A +EGA F D +I Q +C+ N G R L
Sbjct: 183 ADLQGAVFRNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKICR-----NTTIGDEGRAEL 237
Query: 240 GC 241
GC
Sbjct: 238 GC 239
>gi|159474024|ref|XP_001695129.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
gi|158276063|gb|EDP01837.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
Length = 185
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 5/110 (4%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
+ ++D S + A L KA A KANF GAD+++ ++DR+ ANL + TV+T +
Sbjct: 81 LADADLSNTNLQEAVLTKAYAVKANFEGADMTNAVVDRVDFTNANLKRVKFINTVVTGAS 140
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 246
GA +EG+ + DA+I LC+ NP +R +GC R+
Sbjct: 141 FAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGESRAQVGCRAVRK 185
>gi|115434488|ref|NP_001042002.1| Os01g0144100 [Oryza sativa Japonica Group]
gi|13486898|dbj|BAB40127.1| unknown protein [Oryza sativa Japonica Group]
gi|113531533|dbj|BAF03916.1| Os01g0144100 [Oryza sativa Japonica Group]
gi|215678959|dbj|BAG96389.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765141|dbj|BAG86838.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 198
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 69/120 (57%), Gaps = 11/120 (9%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
R +F ++ +R+++F G+K GA + A+ TGADLSD + D + N + NL
Sbjct: 83 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDADLRGADFSLANVSKVNL 137
Query: 183 TNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TNA L + T + G+ I GADF+D + Q++ LCK A+G N TG +T+++L C
Sbjct: 138 TNANLEGALATGNTTFKGSNIYGADFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 197
>gi|443326649|ref|ZP_21055296.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442793770|gb|ELS03210.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 920
Score = 63.5 bits (153), Expect = 9e-08, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 56/104 (53%), Gaps = 1/104 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L A V+ N RAN A++ ++ +G+ GA LEKA+ ANF GA+L++
Sbjct: 801 ANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAILEGANFRGANLNE 860
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+ L+EAN A R L R D A +GADF A++D
Sbjct: 861 ANLRGAHLSEANFQEADFDRADLQRVDFDRADFQGADFDRAIMD 904
Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%)
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A L +A Y+AN A+L ++ L ANL A LVR L ++L GAI+EGA+
Sbjct: 786 ANLYRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEK 845
Query: 210 AVIDLA 215
A+++ A
Sbjct: 846 AILEGA 851
Score = 43.9 bits (102), Expect = 0.081, Method: Composition-based stats.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+RAN A++ ++ G+ GA L +A +AN A+L ++ +L ANL A+L
Sbjct: 789 YRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAIL 848
Query: 188 ----VRTV-LTRSDLGGAIIEGADFSDAVIDLAQKQ 218
R L ++L GA + A+F +A D A Q
Sbjct: 849 EGANFRGANLNEANLRGAHLSEANFQEADFDRADLQ 884
>gi|116785879|gb|ABK23895.1| unknown [Picea sitchensis]
gi|116792150|gb|ABK26251.1| unknown [Picea sitchensis]
Length = 239
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 6/118 (5%)
Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
K N R + +A M ++ F G+ + + KA A A+F G D S+ ++DR+ +AN+
Sbjct: 126 KTNLRGKSLAAALMSDAKFDGADMSEVIMSKAYAVGASFKGVDFSNAVIDRVNFGKANMQ 185
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+AV TVL+ S A +EGA F D +I Q LC TN R LGC
Sbjct: 186 DAVFRNTVLSGSTFVDANLEGAKFEDTIIGYIDLQKLC-----TNQTLSDEGRDILGC 238
>gi|307108672|gb|EFN56912.1| hypothetical protein CHLNCDRAFT_51710 [Chlorella variabilis]
Length = 155
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+ A M E+D SG+ L KA A AN GADL++ ++DR+ + +L A LV V
Sbjct: 49 LSGAYMNEADMSGANMREVVLTKAYAVGANLRGADLTNAVIDRVAFDGVDLEGAQLVNAV 108
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+T + GA ++ A+F DA+I + LC NP +R +GC
Sbjct: 109 ITGTTFTGANLKDANFEDALIGSEDAKRLC-----ANPTLVGESRDQVGC 153
>gi|425437827|ref|ZP_18818239.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9432]
gi|389677087|emb|CCH93934.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9432]
Length = 976
Score = 63.2 bits (152), Expect = 1e-07, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A SA+L +A N RAN A++ E++ G+ GAYLE A +AN G
Sbjct: 857 ANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERANLYG 916
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L ++R L ANL A L L R++L GA + GA+F DA
Sbjct: 917 ANLEGANLERANLERANLKGANLEGANLERANLEGAFLRGANFKDA 962
Score = 43.9 bits (102), Expect = 0.081, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 1/102 (0%)
Query: 125 KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
++ +RAN A++ ++ G+ GA LE+A AN A+L + R L A L
Sbjct: 787 RDLYRANLERANLERANLYGAYLYGANLERANLKGANLYMANLERANLYRAYLYRAYLYR 846
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
A L R L R++L A +E A+ A ++ A ++A K AN
Sbjct: 847 AYLERAYLERANLYSANLERANLYMANLERANLERANLKRAN 888
Score = 43.9 bits (102), Expect = 0.083, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 46/102 (45%), Gaps = 1/102 (0%)
Query: 110 AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A A+L+ A N RAN A + + + AYLE+A Y AN A+L
Sbjct: 811 GANLERANLKGANLYMANLERANLYRAYLYRAYLYRAYLERAYLERANLYSANLERANLY 870
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++R L ANL A L L + L GA +EGA+ A
Sbjct: 871 MANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERA 912
Score = 39.3 bits (90), Expect = 1.8, Method: Composition-based stats.
Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 5/93 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+RA A + + + A LE+A Y AN A+L + R L EANL A L
Sbjct: 840 YRAYLYRAYLERAYLERANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYL 899
Query: 188 VRTV-----LTRSDLGGAIIEGADFSDAVIDLA 215
L R++L GA +EGA+ A ++ A
Sbjct: 900 AGAYLEGANLERANLYGANLEGANLERANLERA 932
Score = 38.1 bits (87), Expect = 4.5, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 5/125 (4%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGA 150
A+L + E +G A A+L +A N + AN A++ + + A
Sbjct: 792 ANLERANLERANLYG----AYLYGANLERANLKGANLYMANLERANLYRAYLYRAYLYRA 847
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
YLE+A +AN A+L + L ANL A L R L ++L GA + GA A
Sbjct: 848 YLERAYLERANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGA 907
Query: 211 VIDLA 215
++ A
Sbjct: 908 NLERA 912
>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 189
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 65/123 (52%), Gaps = 13/123 (10%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVA 157
A RG + + F A+L++A N RAN + AD+ +D SG+ GA L A
Sbjct: 30 AHLRGAHLVEADLSF--ANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARL 87
Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGG-----AIIEGADF 207
+AN TGA+L D L++R L E ANL NA V + L R+DLG A+ +GAD
Sbjct: 88 MRANLTGANLRDALVNRADLTEALLVDANLRNAHFVESTLVRADLGDANALKAVFKGADL 147
Query: 208 SDA 210
S A
Sbjct: 148 SGA 150
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ A + E+D S + A L A +AN +GADL + L ANLT A L+R
Sbjct: 30 AHLRGAHLVEADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMR 89
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
LT ++L A++ AD ++A++
Sbjct: 90 ANLTGANLRDALVNRADLTEALL 112
>gi|159903302|ref|YP_001550646.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
gi|159888478|gb|ABX08692.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
Length = 158
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F DLR A F +++ ++ SGS GA L A K + + +L +
Sbjct: 36 ADFSDTDLRGAT---------FYLTNLQNANLSGSNLEGASLFGAKLLKTDLSNTNLKNA 86
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D +L+ A+LTNA L + I G+DF++ +I Q+ LC A+GTN +
Sbjct: 87 TLDSSILDGADLTNAYLEDAFAFNTQFKDVKISGSDFTNVLITNDQRNYLCSIASGTNSV 146
Query: 231 TGVSTRKSLGC 241
+ +TR SL C
Sbjct: 147 STRNTRDSLEC 157
>gi|428219116|ref|YP_007103581.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990898|gb|AFY71153.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 179
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 65/131 (49%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F DLR A F A +R +F+ + +G L + AN +GA+L +
Sbjct: 57 ADFSGKDLRDA---------QFNKAVLRSVNFANANLSGVSLFGSDLTNANLSGANLRYS 107
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D + +L+NA+L + + I GADF+D + ++ LC+ A GTNP
Sbjct: 108 SLDTSRMVGTDLSNAILEGAFVYGAKFKNLKIAGADFTDVDLRETIREELCEVATGTNPT 167
Query: 231 TGVSTRKSLGC 241
TG TR++LGC
Sbjct: 168 TGRDTRETLGC 178
>gi|18406661|ref|NP_566030.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
gi|20141847|sp|O22160.2|TL15A_ARATH RecName: Full=Thylakoid lumenal 15 kDa protein 1, chloroplastic;
AltName: Full=p15; Flags: Precursor
gi|20196925|gb|AAM14836.1| pentapeptide repeat family protein [Arabidopsis thaliana]
gi|330255391|gb|AEC10485.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
Length = 224
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 11/120 (9%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NL
Sbjct: 109 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNL 163
Query: 183 TNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TNA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 164 TNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223
>gi|222423354|dbj|BAH19651.1| AT2G44920 [Arabidopsis thaliana]
Length = 224
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 11/120 (9%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NL
Sbjct: 109 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGGDFSLANVTKVNL 163
Query: 183 TNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TNA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 164 TNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223
>gi|260434702|ref|ZP_05788672.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
gi|260412576|gb|EEX05872.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
Length = 160
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 1/112 (0%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +++RE++ SGS GA L A A+ +G DL + +D V+ NL +AVL
Sbjct: 49 ATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTNLEDAVLEG 108
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ +I GADF+D + L + TN +TG STR+SLGC
Sbjct: 109 AFAFNTRFRDVLITGADFTDVPCAGTNSKPL-RRCRRTNSVTGRSTRESLGC 159
>gi|449456995|ref|XP_004146234.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Cucumis sativus]
gi|449522387|ref|XP_004168208.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Cucumis sativus]
Length = 237
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
+A M ++ F G+ + + KA A A+F G D S+ ++DR+ +ANL A+ TVL+
Sbjct: 134 AALMSDAKFDGADLSEVVMSKAYAVGASFKGVDFSNAVLDRVNFGKANLQGALFKNTVLS 193
Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
S A +E A F D +I Q LC NP R LGC
Sbjct: 194 GSTFDDAQLEDAVFEDTIIGYIDLQKLC-----VNPTISPEGRAELGC 236
>gi|219130181|ref|XP_002185250.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403429|gb|EEC43382.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 235
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 2/107 (1%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M +D S + F AY K + GAD ++ ++DR ++L A+ VLT +
Sbjct: 128 MTNTDASNANFAEAYFSKGYLRDSMLDGADFTNAIVDRATFKGSSLRGAIFANAVLTGTG 187
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 241
GA +E ADF+DA I + LCK G NP TG TR S C
Sbjct: 188 FEGADVENADFTDAYIGDFDIRLLCKNPTLKGENPKTGADTRMSANC 234
>gi|297824527|ref|XP_002880146.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
gi|297325985|gb|EFH56405.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
Length = 226
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 67/120 (55%), Gaps = 11/120 (9%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANL 182
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NL
Sbjct: 111 IRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNL 165
Query: 183 TNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
TNA L T + G+ I GADF+D + Q++ LCK A+G N TG +TR +L C
Sbjct: 166 TNANLEGATATGNTSFKGSNITGADFTDVPLRDDQREYLCKIADGVNATTGNATRDTLLC 225
>gi|126656956|ref|ZP_01728134.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
gi|126621794|gb|EAZ92503.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
Length = 1084
Score = 62.0 bits (149), Expect = 3e-07, Method: Composition-based stats.
Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 20/160 (12%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
A+ RG + G A G ADL A + A+ T AD+R +D +G+ GAYLE A
Sbjct: 931 ADLRGAYLEG--ADLGGADLTGA----DLEGADLTGADLRGADLTGAYLEGAYLEGADLT 984
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLA 215
A+ TGA L ++ L A+LT A L L +DLGGA + GAD + A + DL
Sbjct: 985 GADLTGAYLEGAYLEGADLGGADLTGADLEGADLRGADLGGADLGGADLTGADLRGADLT 1044
Query: 216 Q-----------KQALCKYANGTNPITGVSTRKSLGCGNS 244
+ KQ NG + I K LG G++
Sbjct: 1045 KTDLNEARYLTVKQVQEAKNNGKDAIYDEEMEKKLGLGDN 1084
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 33/86 (38%), Positives = 44/86 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ +D G+ GAYLE A A+ TGADL + L A+LT A L
Sbjct: 916 ADLTGADLTGADLEGADLRGAYLEGADLGGADLTGADLEGADLTGADLRGADLTGAYLEG 975
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
L +DL GA + GA A ++ A
Sbjct: 976 AYLEGADLTGADLTGAYLEGAYLEGA 1001
Score = 45.1 bits (105), Expect = 0.034, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 40/75 (53%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
++ E+ +G+ GAYLE A A+ TGADL+ ++ L A L A L LT +
Sbjct: 892 ELYEAKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGA 951
Query: 196 DLGGAIIEGADFSDA 210
DL GA + GAD A
Sbjct: 952 DLEGADLTGADLRGA 966
Score = 42.7 bits (99), Expect = 0.18, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 43/90 (47%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E + A T AD+ + G+ GA L A A+ GADL ++ L A+LT A
Sbjct: 892 ELYEAKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGA 951
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
L LT +DL GA + GA A ++ A
Sbjct: 952 DLEGADLTGADLRGADLTGAYLEGAYLEGA 981
>gi|116792169|gb|ABK26257.1| unknown [Picea sitchensis]
Length = 237
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 63/117 (53%), Gaps = 6/117 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNA 185
N D + S +KF GA L A + A+ TGADLSD + D + N + NL+NA
Sbjct: 121 NLIQQDFKTSILRQAKFKGAKLIGASFFDADLTGADLSDADLRGADFSLANVTKVNLSNA 180
Query: 186 VLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L ++T + G+ I GADF+D + Q++ LC A+G N TG +TR +L C
Sbjct: 181 NLEGALVTGNTSFKGSNISGADFTDVPLRDDQRRYLCNIADGVNLTTGNATRDTLLC 237
>gi|302143933|emb|CBI23038.3| unnamed protein product [Vitis vinifera]
Length = 232
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 109/249 (43%), Gaps = 26/249 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA SI PLS++ SS + + + L P ++C S D S++Q
Sbjct: 1 MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASW----DSPELKASSSQ--- 48
Query: 61 PYAKLKNWR---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGS 115
+ +LKN + V AA+ V + S + L+ + N+ E G G +
Sbjct: 49 -FKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKP 107
Query: 116 ADLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++
Sbjct: 108 IDLRFCDYTNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVL 167
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ +ANL AV TVL+ S A +E A F D +I Q +C TN
Sbjct: 168 DRVNFGKANLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSIN 222
Query: 233 VSTRKSLGC 241
R LGC
Sbjct: 223 ADGRAELGC 231
>gi|359490718|ref|XP_002275994.2| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic [Vitis
vinifera]
Length = 244
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 109/249 (43%), Gaps = 26/249 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA SI PLS++ SS + + + L P ++C S D S++Q
Sbjct: 13 MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASW----DSPELKASSSQ--- 60
Query: 61 PYAKLKNWR---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGS 115
+ +LKN + V AA+ V + S + L+ + N+ E G G +
Sbjct: 61 -FKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKP 119
Query: 116 ADLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++
Sbjct: 120 IDLRFCDYTNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVL 179
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
DR+ +ANL AV TVL+ S A +E A F D +I Q +C TN
Sbjct: 180 DRVNFGKANLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSIN 234
Query: 233 VSTRKSLGC 241
R LGC
Sbjct: 235 ADGRAELGC 243
>gi|397570889|gb|EJK47511.1| hypothetical protein THAOC_33758, partial [Thalassiosira oceanica]
Length = 122
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 10/112 (8%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F + +R++DF G+ GA A ++F GAD++ ++ ++ E ++ A L V
Sbjct: 17 FQQSIVRDTDFRGTNLFGASFFDATLDGSDFEGADMTLCNVENAIVKEMYVSGATLFEGV 76
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 241
+ IE +D+SD + Q++ LC++ A GTNP+TGV TR+SL C
Sbjct: 77 KS--------IENSDWSDTQLRKDQQKYLCEHPTAKGTNPVTGVDTRESLMC 120
>gi|356509222|ref|XP_003523350.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Glycine max]
Length = 240
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 8/128 (6%)
Query: 117 DLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
DLR+ E N + + ++A M ++ F G+ + KA A A+F G D S+ ++D
Sbjct: 117 DLRQCDFTDEKTNLKGKSLSAALMSDAKFDGADMTEVVMSKAYAVGASFKGVDFSNAVLD 176
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
R+ +A+L AV TVL+ S A ++ A F D +I Q LC TN G
Sbjct: 177 RVNFEKADLEGAVFKNTVLSGSTFDDAKLDNAVFEDTIIGYIDLQKLC-----TNKTIGD 231
Query: 234 STRKSLGC 241
R LGC
Sbjct: 232 EWRVELGC 239
>gi|340707640|pdb|3N90|A Chain A, The 1.7 Angstrom Resolution Crystal Structure Of
At2g44920, A Pentapeptide Repeat Protein From
Arabidopsis Thaliana Thylakoid Lumen
Length = 152
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 68/119 (57%), Gaps = 11/119 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NLT
Sbjct: 30 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNLT 84
Query: 184 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L T++ + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 85 NANLEGATMMGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 143
>gi|428219581|ref|YP_007104046.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427991363|gb|AFY71618.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 508
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 74/140 (52%), Gaps = 9/140 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A F A+L A K + ANF+ AD+R ++ SG+ NGA L +A +AN
Sbjct: 172 SVASFNGANLTGASLAKLDLSGLDLSDANFSGADLRGANLSGANLNGADLSRANLSRANL 231
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALC 221
+ A+LS T R LNEANL+ A L + L+R+DL A + AD A + +++ A
Sbjct: 232 SRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANLSMSKLAGAYL 291
Query: 222 KYAN--GTNPITGVSTRKSL 239
AN GTN I+ TR L
Sbjct: 292 VRANLLGTNLISADLTRAVL 311
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 54/99 (54%), Gaps = 1/99 (1%)
Query: 115 SADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADL +AV ++ + FRAN T A++ +D + + A +A AN G DL+ +
Sbjct: 303 SADLTRAVLIEADLFRANLTEANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLT 362
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ +A + A+L++T L+ + L GA A+ S A++
Sbjct: 363 GVYAIDAEIVGAILIKTNLSEASLAGANFVRANLSRAIL 401
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN AD+ ++ S SK GAYL +A N ADL+ R VL EA+L A L
Sbjct: 268 RANLIKADLHGANLSMSKLAGAYLVRANLLGTNLISADLT-----RAVLIEADLFRANLT 322
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
L+R+DL A + A F +A
Sbjct: 323 EANLSRADLNRANLTEASFIEA 344
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%), Gaps = 6/101 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+ DL +A + N RAN T A + +D A L +A +AN GA+LS
Sbjct: 49 AELSRIDLSRADLSESNLKRANLTEAVLVGADLISINLGRATLTEANLNRANLIGANLSG 108
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+L EA+L L + LT++DL GA + GAD S A
Sbjct: 109 A-----ILVEADLARCDLRVSNLTKADLMGANLSGADLSVA 144
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 6/114 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L +A + NF R A++ E+ SGS + A L +A KA+ GA+L
Sbjct: 222 SRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANL 281
Query: 168 SDTLMDRMVLNEANL--TNAV---LVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
S + + L ANL TN + L R VL +DL A + A+ S A ++ A
Sbjct: 282 SMSKLAGAYLVRANLLGTNLISADLTRAVLIEADLFRANLTEANLSRADLNRAN 335
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 77/184 (41%), Gaps = 35/184 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK-------------- 154
S A DL KA V+ AN A + + F+G+ GA L K
Sbjct: 147 SGANLSQVDLSKATLVE----ANLKDAKLSVASFNGANLTGASLAKLDLSGLDLSDANFS 202
Query: 155 ------AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
A AN GADLS + R L+ ANL+ VRT L ++L A + G++ S
Sbjct: 203 GADLRGANLSGANLNGADLSRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLS 262
Query: 209 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK--LLD 266
A DL++ + +G N +S K G R N G + L+SA + L++
Sbjct: 263 RA--DLSRANLIKADLHGAN----LSMSKLAGAYLVRANLLG---TNLISADLTRAVLIE 313
Query: 267 RDGF 270
D F
Sbjct: 314 ADLF 317
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 16/116 (13%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAY---------------LEK 154
A ADL +A + +F AN SA++ +D + + G Y L +
Sbjct: 324 ANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLTGVYAIDAEIVGAILIKTNLSE 383
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A ANF A+LS ++ L+EANL A L ++ ++L GA +E AD S A
Sbjct: 384 ASLAGANFVRANLSRAILSGASLSEANLGRANLYGANMSEANLSGANLENADLSRA 439
>gi|298243143|ref|ZP_06966950.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297556197|gb|EFH90061.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 338
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 54/90 (60%), Gaps = 10/90 (11%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++RE+DFSG+ +G+ + +GADLS ++ R +L A+L+ A+L
Sbjct: 95 ANLVGANLREADFSGNDLSGS----------DLSGADLSRAILRRAILRRADLSEAILRD 144
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
VL R+DL A + GAD +DA + A++ A
Sbjct: 145 AVLRRADLTDADLRGADLTDADLTGAKRDA 174
>gi|172036979|ref|YP_001803480.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354554778|ref|ZP_08974082.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171698433|gb|ACB51414.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353553587|gb|EHC22979.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 325
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 105/224 (46%), Gaps = 40/224 (17%)
Query: 3 LSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPY 62
L+ IS +++K + P+QL L++ + E+D QF G
Sbjct: 95 LTQISGVTVKQFKLVKTH---PFQLEDLAEQI---------DENDPQFLLIERIMSQGG- 141
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
N + F L+ A++ C++N+ LADL EA G S A ADL A
Sbjct: 142 ----NDQDFREANLSGAIL--CNANL-ILADL--REANLMGTDL--SGANLMGADLSGAD 190
Query: 123 HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVA---------------YKANFTGAD 166
+ N AN A++ E++ +G+ A L++A +AN GA
Sbjct: 191 LLGANLTGANLMGANLTEANLTGADLGDAILQEADLCWADLSEVNLIGADLSQANLKGAI 250
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L+D+L+ LNEANL+ A+L R++L++++L G+I+ D ++A
Sbjct: 251 LTDSLLSHTNLNEANLSEAILNRSILSKTNLSGSILSQTDLTNA 294
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 27/114 (23%)
Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
G+ F A+L A+ AN AD+RE++ G+ +GA L A+ +GADL
Sbjct: 141 GNDQDFREANLSGAILC----NANLILADLREANLMGTDLSGANL-----MGADLSGADL 191
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
ANLT A L+ LT ++L GA D DA++ Q+ LC
Sbjct: 192 LG----------ANLTGANLMGANLTEANLTGA-----DLGDAIL---QEADLC 227
>gi|326523645|dbj|BAJ92993.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524189|dbj|BAJ97105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 200
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 15/131 (11%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---D 173
D +K++F+ + +R+++F G+ GA + A+ TGADLSD + D
Sbjct: 78 DFSGQTLIKQDFKTSI----LRQTNFKGANLLGASF-----FDADLTGADLSDADLRNAD 128
Query: 174 RMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+ N + NLTNA L ++T + G+ I GADF+D + Q+ LCK A+G N
Sbjct: 129 FSLANVTKVNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIADGVNTT 188
Query: 231 TGVSTRKSLGC 241
TG +T+++L C
Sbjct: 189 TGNATKETLFC 199
>gi|449016876|dbj|BAM80278.1| similar to thylakoid lumenal 17.4 kD protein, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 288
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 59/128 (46%), Gaps = 18/128 (14%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLS---------------DTLMDRMVLNEA 180
D+R DFSG +G LE A A +A F LS D ++DR+ A
Sbjct: 161 DLRGRDFSGYDLSGVLLEGATADEARFRSTQLSKAYAPGFKCRRCDFEDAVVDRVNFENA 220
Query: 181 NLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRK 237
+L+ +V VL+ S G + DF+D I + LC+ +G NP+TG TR
Sbjct: 221 DLSGSVFRNAVLSDSMFSDGTNVRDVDFTDVYIGEYGLRRLCRNPTLDGENPLTGAPTRA 280
Query: 238 SLGCGNSR 245
SLGC R
Sbjct: 281 SLGCRAER 288
>gi|413947393|gb|AFW80042.1| putative homeobox DNA-binding domain superfamily protein [Zea mays]
Length = 202
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 66/126 (52%), Gaps = 5/126 (3%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
D +K++F+ + +R+++F G+ GA A A+ + ADL +
Sbjct: 80 DFSGQTLIKQDFKTSI----LRQANFKGANLLGASFFDADLTSADLSDADLRGADLSLAN 135
Query: 177 LNEANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
L +ANL+NA L + T + GA I GADF+D + Q++ LCK A+G N TG T
Sbjct: 136 LTKANLSNANLEGALATGNTSFKGADITGADFTDVPLRDDQREYLCKIADGVNSTTGNPT 195
Query: 236 RKSLGC 241
+++L C
Sbjct: 196 KETLFC 201
>gi|384246084|gb|EIE19575.1| hypothetical protein COCSUDRAFT_31020 [Coccomyxa subellipsoidea
C-169]
Length = 203
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 60/126 (47%), Gaps = 25/126 (19%)
Query: 136 DMRESDFSGSKFNG--------------------AYLEKAVAYKANFTGADLSDTLMDRM 175
D+R DF+G +G L KA A ANF+GAD+++ ++DR+
Sbjct: 80 DLRMCDFTGKDLSGKTLSGALLKDAILPNSTMRETVLTKAYAVGANFSGADMTNAVIDRV 139
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
+ANL+N + V+T + GA ++GA F DA+I + LC NP +
Sbjct: 140 DFRKANLSNVKFINAVITGTAFDGANLDGAIFEDALIGNEDVKRLC-----LNPTLTGES 194
Query: 236 RKSLGC 241
R +GC
Sbjct: 195 RMGVGC 200
>gi|218187501|gb|EEC69928.1| hypothetical protein OsI_00358 [Oryza sativa Indica Group]
Length = 191
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 14/117 (11%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R +F ++ +R+++F G+K GA + A+ TGADLSD L A+ + A +
Sbjct: 84 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDA-----DLRGADFSLANVS 133
Query: 189 RTVLTRSDLGGAIIEG----ADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ LT ++L GA+ G DF+D + Q++ LCK A+G N TG +T+++L C
Sbjct: 134 KVNLTNANLEGALATGNTTFKDFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 190
>gi|363807626|ref|NP_001241901.1| uncharacterized protein LOC100785667 [Glycine max]
gi|255647148|gb|ACU24042.1| unknown [Glycine max]
Length = 239
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 61/128 (47%), Gaps = 8/128 (6%)
Query: 117 DLRKA--VHVKENFRANFTSAD-MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
DLR+ + K N + SA M ++ F G+ + KA A A+F G D S+ ++D
Sbjct: 116 DLRQCDFTNEKTNLKGKSPSAALMSDAKFDGADMTEVVMSKAYAAGASFKGVDFSNAVLD 175
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
R+ +A+L A+ TVL+ S A ++ A F D +I Q LC TN G
Sbjct: 176 RVNFEKADLEGAIFKNTVLSGSPFDDAKLDNAVFEDTIIGYIDFQKLC-----TNKTIGD 230
Query: 234 STRKSLGC 241
R LGC
Sbjct: 231 EWRVELGC 238
>gi|302831317|ref|XP_002947224.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
nagariensis]
gi|300267631|gb|EFJ51814.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
nagariensis]
Length = 244
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 5/114 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
A + ++D S + A L KA A KANF AD+++ ++DR+ + ANL TV
Sbjct: 117 LAGALLADADLSNTNLQEAVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTV 176
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSR 245
+T + GA +EG+ + DA+I LC+ NP +R +GC SR
Sbjct: 177 VTGAQFAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGESRMQVGCRVSR 225
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 41/81 (50%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D+R +SG +G L A+ A+ + +L + ++ + +AN NA + V+ R
Sbjct: 101 DLRLCSYSGKDLHGRVLAGALLADADLSNTNLQEAVLTKAYAVKANFENADMTNAVVDRV 160
Query: 196 DLGGAIIEGADFSDAVIDLAQ 216
D GA + G F++ V+ AQ
Sbjct: 161 DFSGANLRGVRFNNTVVTGAQ 181
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
A L KA VK NF A+ T+A + DFSG+ G V A F GADL ++ +
Sbjct: 135 AVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTVVTGAQFAGADLEGSVWE 193
>gi|168060251|ref|XP_001782111.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666451|gb|EDQ53105.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 158
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++A M E+ F G+ + KA A A+F G+ ++ ++DR+ +++++ + TV
Sbjct: 52 LSAALMSEAKFDGADLTEVIMSKAYAVGASFKGSVFTNAVVDRVAFDKSDMQGVQFINTV 111
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S GA +EGA F +A+I Q LCK NP +R L C
Sbjct: 112 LSGSTFEGANLEGASFENALIGYVDIQKLCK-----NPTLPEESRIDLAC 156
>gi|307109822|gb|EFN58059.1| hypothetical protein CHLNCDRAFT_57123 [Chlorella variabilis]
Length = 608
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 58/101 (57%), Gaps = 2/101 (1%)
Query: 126 ENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
++ R N +T AD+R ++ S + G L A+A ANF+GA+L + ++ + L A+L+N
Sbjct: 49 QDLRKNKYTKADLRGTNLSNANLEGVTLFGALATNANFSGANLRNADLELVELEGADLSN 108
Query: 185 AVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYA 224
AVL +LT + LG I GADF+D V LC+ A
Sbjct: 109 AVLEGAMLTNAQLGRVKSITGADFTDVVFRKDVMMGLCRIA 149
>gi|440804190|gb|ELR25067.1| pentapeptide repeatcontaining protein [Acanthamoeba castellanii
str. Neff]
Length = 293
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 57/108 (52%), Gaps = 6/108 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AQ ADLR+A +AN AD+RE++ SG+ A L A+ +A+ +GA L +
Sbjct: 162 AQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAILRRADLSGAALPN 221
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 212
+ R L ANLT A L LT +D L GA + GAD S++ +
Sbjct: 222 VELQRASLRRANLTGANLTWATLTDADCTQANLSGANLSGADLSNSTL 269
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 15/98 (15%)
Query: 130 ANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AN A+MRE ++ SG+ + A L KA +AN +GA+L + ++ L +
Sbjct: 112 ANLKGANMREVQLASTNLTRANLSGANLHLARLGKAQLRRANLSGANLEEAQLEDADLRQ 171
Query: 180 ANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 212
ANL NA + + L +D L GA++ AD A++
Sbjct: 172 ANLANAKMTKANLMHADLREANLSGAVMLRADLRSAIL 209
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 7/118 (5%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
Q S +L +A N A A +R ++ SG+ A LE A +AN A ++
Sbjct: 123 QLASTNLTRANLSGANLHLARLGKAQLRRANLSGANLEEAQLEDADLRQANLANAKMTKA 182
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGT 227
+ L EANL+ AV++ R+DL AI+ AD S A + ++ ++A + AN T
Sbjct: 183 NLMHADLREANLSGAVML-----RADLRSAILRRADLSGAALPNVELQRASLRRANLT 235
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 44/96 (45%), Gaps = 12/96 (12%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD----------LSDTLMDRMVLNEA 180
+ T A + D G F A L +A N TGA+ L+ T + R L+ A
Sbjct: 78 DLTGARLFRCDLRGVDFQWANLTEATLTDCNLTGANLKGANMREVQLASTNLTRANLSGA 137
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
NL A L + L R++L GA +E A DA DL Q
Sbjct: 138 NLHLARLGKAQLRRANLSGANLEEAQLEDA--DLRQ 171
>gi|303279747|ref|XP_003059166.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459002|gb|EEH56298.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 213
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DLRK + ++ + A M ++ F G+ + KA A A+FTGA+ ++ ++DR+
Sbjct: 91 DLRKCEYDGKDLSTKTLSGALMVDASFKGTNLTEVVMSKAYALNADFTGANFTNAVVDRV 150
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
+ ANL NA V+T + G + GA F +A+I + LC NP T
Sbjct: 151 TFDGANLANADFHNAVITGTTYEGTDLTGATFEEALIGKEDVKRLCD-----NPTVKGPT 205
Query: 236 RKSLGC 241
R +GC
Sbjct: 206 RFEVGC 211
>gi|388504750|gb|AFK40441.1| unknown [Lotus japonicus]
Length = 239
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 6/118 (5%)
Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
K N + ++A M ++ F G+ + KA A +F G D S+ ++DR+ +A+L
Sbjct: 126 KSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFEKADLQ 185
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AV TVL+ S A +EGA F D +I Q LC+ N R LGC
Sbjct: 186 GAVFKNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKLCR-----NKTIADDWRVELGC 238
>gi|302819846|ref|XP_002991592.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
gi|300140625|gb|EFJ07346.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
Length = 157
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++A M ++ F G+ + KA A A+F G D ++ ++DR+V ++A++ AV TV
Sbjct: 51 LSAALMADAKFDGADMTEVVMSKAYAVGASFKGTDFTNAVLDRVVFDKADMKGAVFRNTV 110
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S GA +E ADF +A+I + LC NP + L C
Sbjct: 111 LSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155
>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 351
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 75/148 (50%), Gaps = 8/148 (5%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLVAAIFNEVTLNRINLSGANLSEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-TLMDRMVLNEANLTN 184
AN T A M E+ + +GA L A+ + N TG +L+ +L+ +LN + LT+
Sbjct: 81 ----ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCLLNGSQLTD 136
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+LV LTRS L GA + GA+ + +++
Sbjct: 137 AILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 1/100 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLTGANL 249
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 40/81 (49%), Gaps = 15/81 (18%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
A + S SG+ GA L +++ + + +GA+L+ + R+ LN+ NL+
Sbjct: 139 LVGATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLS-------- 190
Query: 192 LTRSDLGGAIIEGADFSDAVI 212
GA + GAD S++VI
Sbjct: 191 -------GANLTGADLSESVI 204
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 159
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTGANLTGANLNGLTLQSADLRLANLSKADLRG 271
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
Length = 220
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 83/167 (49%), Gaps = 13/167 (7%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
N+ YA+ R F +L AA+ + N L+ N EA IG S +Q
Sbjct: 10 NKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-TL 171
ADL AV + AN T A M E+ + +GA L A+ + N TG +L+ +L
Sbjct: 68 LSYADLSMAVLID----ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASL 123
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLA 215
+ +LN + LT+A+LV +TRS L GA + GA+ + ++ IDL+
Sbjct: 124 IGTCLLNGSQLTDAILVGATMTRSVLSGAHMTGANLNRSILSEIDLS 170
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 42/76 (55%), Gaps = 5/76 (6%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
A M S SG+ GA L +++ + + +GA+L+ + R+ LN+ NL+ A
Sbjct: 139 LVGATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGA-----N 193
Query: 192 LTRSDLGGAIIEGADF 207
LT +DL ++I+ ++F
Sbjct: 194 LTGADLSESVIQNSNF 209
>gi|224120874|ref|XP_002318440.1| predicted protein [Populus trichocarpa]
gi|222859113|gb|EEE96660.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
K N + + +A M ++ F G+ + KA A A+F G D S+ ++DR+ +A+L
Sbjct: 127 KSNLKGKSLAAALMSDAKFDGADMTEVVMSKAYAVGASFRGVDFSNAVLDRVNFGKADLK 186
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
AV TVL+ S A +E A F D +I Q +C+ N G R LGC
Sbjct: 187 GAVFKNTVLSGSTFDEAQLEDAIFEDTIIGYIDLQKICR-----NTSIGPDGRAELGC 239
>gi|33240880|ref|NP_875822.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33238409|gb|AAQ00475.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 184
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 65/117 (55%), Gaps = 12/117 (10%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E + + + D+ ++D SGS F+ + L+KA + GA++ + + + A+L+NA
Sbjct: 59 EYVKYDLSGRDLGDADLSGSYFSVSNLQKA-----DLRGANMQNVIAYATRFDNADLSNA 113
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 242
L +S GA+I+G +F++AV+DL Q ++LC+ A G T +SL CG
Sbjct: 114 NFSGAELLKSRFDGAVIDGTNFTNAVLDLPQVKSLCERATG-------QTAESLECG 163
>gi|242052129|ref|XP_002455210.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
gi|241927185|gb|EES00330.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
Length = 200
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 62/114 (54%), Gaps = 1/114 (0%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ +F ++ +R+++F G+ GA A A+ + ADL L + NL+NA L
Sbjct: 86 KQDFKTSILRQANFKGANLLGASFFDADLTSADLSDADLRGADFSLANLTKTNLSNANLE 145
Query: 189 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++T + GA I GADF+D + Q++ LCK A+G N TG T+++L C
Sbjct: 146 GALVTGNTSFKGANITGADFTDVPLRDDQREYLCKIADGVNSTTGNPTKETLFC 199
>gi|159903945|ref|YP_001551289.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
gi|159889121|gb|ABX09335.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
Length = 184
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 32/126 (25%)
Query: 126 ENFRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKANFTGADLSDTLMDRM 175
E + + + D+ +++ SGS F+ GA L+ +AY F ADLS
Sbjct: 59 EYVKYDLSGRDLGDANLSGSYFSVSSLKNADLRGANLQNVIAYATRFDNADLSG------ 112
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 235
ANL+ A L+++V GA+IEG DF++AV+DL Q ++LC+ A G T
Sbjct: 113 ----ANLSGAELLKSVFN-----GAVIEGTDFTNAVLDLPQVKSLCERATG-------KT 156
Query: 236 RKSLGC 241
+SL C
Sbjct: 157 AESLQC 162
>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
Length = 378
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 6/111 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L+ A ++ N A+ + AD+R +D SG+ A L KA +AN T DL
Sbjct: 43 SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTNTDL 102
Query: 168 SDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
S ++R +L++ANL NA L T L +DLG A +E AD S+A +D
Sbjct: 103 SSANLNRASLDYALLSKANLINADLSGTNLVGADLGRANLENADLSNATLD 153
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 1/104 (0%)
Query: 111 AQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADL R ++ + R N ++AD+ +D G+ +GA LE A KA+ A+L++
Sbjct: 40 ADLSNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTN 99
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
T + LN A+L A+L + L +DL G + GAD A ++
Sbjct: 100 TDLSSANLNRASLDYALLSKANLINADLSGTNLVGADLGRANLE 143
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 63/134 (47%), Gaps = 14/134 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADLR A N +A+ A++ +D S + N A L+ A+ KAN
Sbjct: 63 SNADLSWADLRGADLSGANLENANLSKASLDQANLTNTDLSSANLNRASLDYALLSKANL 122
Query: 163 TGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
ADLS T + R L A+L+NA L ++L ++ G A ++ A +A I+ A
Sbjct: 123 INADLSGTNLVGADLGRANLENADLSNATLDNSILISANFGAANLKKASLCNANIERASL 182
Query: 218 QA---LCKYANGTN 228
+ + NGTN
Sbjct: 183 EGANLISANLNGTN 196
>gi|223995969|ref|XP_002287658.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
[Thalassiosira pseudonana CCMP1335]
gi|220976774|gb|EED95101.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
[Thalassiosira pseudonana CCMP1335]
Length = 245
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 5/110 (4%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M ++D S +F A K +NF GAD ++ ++DR ++L AV VLT +
Sbjct: 128 MTKTDVSNGQFKEAQFSKGYLRDSNFDGADFTNAIVDRASFKGSSLKGAVFKNAVLTATS 187
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 246
GA +E ADF+DA I + LCK NP VS + N++R
Sbjct: 188 FEGADVENADFTDAYIGDFDIRTLCK-----NPTLKVSRFYRMTYRNAQR 232
>gi|254424332|ref|ZP_05038050.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
gi|196191821|gb|EDX86785.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
Length = 411
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 10/77 (12%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ + A+++E DFSG +GA N +GADLSDT M ++ LN ANL A L R
Sbjct: 298 DMSGANLKEKDFSGRNLSGA----------NLSGADLSDTFMHKVNLNRANLRKARLFRA 347
Query: 191 VLTRSDLGGAIIEGADF 207
L ++DL A + GAD
Sbjct: 348 NLLQADLSHADLSGADL 364
>gi|323452967|gb|EGB08840.1| hypothetical protein AURANDRAFT_25565 [Aureococcus anophagefferens]
Length = 176
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 70/139 (50%), Gaps = 11/139 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
SA G D +A ++F +F+ D +++F+ SK GA KA +A+F
Sbjct: 42 SAVSGGGKDYAEATIKGQDFSGKTFNNKDFSGCDAVDTNFAKSKLRGARFFKADLARADF 101
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+GADLS ++ L LT A+ T +++ L + GADF+DAVI ++ LC
Sbjct: 102 SGADLSAASLEGANLEGTKLTGALAEGTAFSQTILDAGDLTGADFTDAVIQPYVQKGLC- 160
Query: 223 YANGTNPITGVSTRKSLGC 241
G +TG +TR SL C
Sbjct: 161 ---GRKDVTG-ATRDSLFC 175
>gi|427713339|ref|YP_007061963.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427377468|gb|AFY61420.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 327
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 59/121 (48%), Gaps = 15/121 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A+ ADLR AV N AD+ +D G+ GA L K KAN TGADL+
Sbjct: 48 SGAKLQRADLRGAVLSA----INLNHADLIGADLRGAMLMGADLRKVNLRKANLTGADLT 103
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 227
ANLT A+L LT +D+ AI+ GAD + + LA+ +Q AN T
Sbjct: 104 ----------RANLTGAILSEANLTAADMSQAILRGADLTLTDLTLAELEQVNLSQANLT 153
Query: 228 N 228
N
Sbjct: 154 N 154
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 16/108 (14%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+ A+LR+A + N R A AD+R + + + GA L +A+ AN G
Sbjct: 205 ARLEGANLREATLTEANLRYACLDEACLIGADLRGASLARAMLRGAQLNEAILTGANLMG 264
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+LS EA L A L+ +LT ++L G + G D S+ V+
Sbjct: 265 ANLS----------EAQLRGANLIEAILTGANLTGVDLTGVDLSETVM 302
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 36/147 (24%)
Query: 111 AQFGSADLRKAVHVKENF----------------RANFTSADM-----RESDFSGSKFNG 149
A ADLRK K N AN T+ADM R +D + +
Sbjct: 80 AMLMGADLRKVNLRKANLTGADLTRANLTGAILSEANLTAADMSQAILRGADLTLTDLTL 139
Query: 150 AYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTR 194
A LE+ +AN T GAD++D ++ L +A NL A L +T L
Sbjct: 140 AELEQVNLSQANLTNAYLRGADMADAILLEATLIQANLRGANLRNCNLQGANLQKTNLRG 199
Query: 195 SDLGGAIIEGADFSDAVIDLAQKQALC 221
++L A +EGA+ +A + A + C
Sbjct: 200 ANLRQARLEGANLREATLTEANLRYAC 226
>gi|427731151|ref|YP_007077388.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427367070|gb|AFY49791.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 572
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 6/104 (5%)
Query: 111 AQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADL A+ N F N T A + +D S +K NGA L A A F G
Sbjct: 391 ADLSGADLSHAILNGTNLSDTILFSTNLTDASLMAADLSYAKLNGAKLIDAKLNGAMFLG 450
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
ADLS + R+VLN+A+L+ ++L L+ +DL AI+ G D S
Sbjct: 451 ADLSGVDLSRVVLNDADLSGSILSEADLSSADLSDAILLGTDLS 494
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 47/81 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ +D S + GA L A Y+ +F+ ADLS ++ + A+L+ A L
Sbjct: 296 ANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGANLRD 355
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T L R++L AI+ GA+ SDA
Sbjct: 356 TQLCRTNLTNAILFGANLSDA 376
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A N AN T A + +DFS + + +L A A+ +GA+L
Sbjct: 294 SGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGANL 353
Query: 168 SDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
DT + R +L ANL++A L L+ +DL A + GAD S A+++
Sbjct: 354 RDTQLCRTNLTNAILFGANLSDANLKHINLSHADLCRADLSGADLSHAILN 404
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF A + +++ +G+ F GA L A AN TGA+ D + L +ANL+ A L
Sbjct: 241 ANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDANLSGANLSG 300
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+ +DL A + GA+ + A +
Sbjct: 301 ADLSSADLSSANLTGANLTGATL 323
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 35/69 (50%)
Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
G+ F GAYL A ANF GA+LS + L AN +A L L ++L GA
Sbjct: 238 LKGANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDANLSGAN 297
Query: 202 IEGADFSDA 210
+ GAD S A
Sbjct: 298 LSGADLSSA 306
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 45/81 (55%), Gaps = 5/81 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F AD+ D S N A L ++ +A+ + ADLSD ++ L+ ANL +A
Sbjct: 446 AMFLGADLSGVDLSRVVLNDADLSGSILSEADLSSADLSDAILLGTDLSFANLNSA---- 501
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ S+L GA++ GAD S+A
Sbjct: 502 -NLSGSNLSGAMLNGADLSEA 521
>gi|302779862|ref|XP_002971706.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
gi|300160838|gb|EFJ27455.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
Length = 157
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++A M ++ F G+ + KA A +F G D ++ ++DR+V ++A++ AV TV
Sbjct: 51 LSAALMADAKFDGADMTEVVMSKAYAVGGSFKGTDFTNAVLDRVVFDKADMKGAVFRNTV 110
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S GA +E ADF +A+I + LC NP + L C
Sbjct: 111 LSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155
>gi|428226754|ref|YP_007110851.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986655|gb|AFY67799.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 330
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 70/148 (47%), Gaps = 24/148 (16%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNG 149
L D N ++A+ G A ADLR A + + AN AD+R ++ SG+ G
Sbjct: 77 LVDANLHDADLHG-------ASLRGADLRGADLSLAVLLDANLMDADLRNANLSGADLTG 129
Query: 150 AYLEKA----------------VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
A L A + YKA+ G +LS + R+ L EANLT A L T L+
Sbjct: 130 ACLRGANLRQEMRSQHTNLRGSILYKADLRGVNLSGADLTRVDLREANLTEASLRETDLS 189
Query: 194 RSDLGGAIIEGADFSDAVIDLAQKQALC 221
+DL GA + GA SDA ++ A + C
Sbjct: 190 GADLSGANLTGALLSDACLEGAILEGAC 217
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 118 LRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
LR A + N +AN A+++ + ++ GA L++ + +A T DLS +
Sbjct: 218 LRNAKLERANLSQANLFRANLQNALLPQARLTGAGLQQTIFAQAKLTDVDLSRADLFEAD 277
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L EANLT A L RT LTR++L A++ A+ S A
Sbjct: 278 LREANLTGAYLARTNLTRANLSDALLVRAELSSA 311
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 21/109 (19%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAV 156
Y+A+ RG ADL + V ++E AN T A +RE+D SG+ +G
Sbjct: 154 YKADLRG-------VNLSGADLTR-VDLRE---ANLTEASLRETDLSGADLSG------- 195
Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
AN TGA LSD ++ +L A L NA L R L++++L A ++ A
Sbjct: 196 ---ANLTGALLSDACLEGAILEGACLRNAKLERANLSQANLFRANLQNA 241
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 4/100 (4%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
ADLR + + F A A++R+++ G++ +GA L +A AN ADL +
Sbjct: 37 LSQADLRSS----DLFFAYLNRANLRQANLLGARLSGANLSQATLVDANLHDADLHGASL 92
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L A+L+ AVL+ L +DL A + GAD + A +
Sbjct: 93 RGADLRGADLSLAVLLDANLMDADLRNANLSGADLTGACL 132
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 42/85 (49%), Gaps = 15/85 (17%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + AD+R SD F AYL +A +AN GA LS ANL+ A LV
Sbjct: 36 NLSQADLRSSDLF---F--AYLNRANLRQANLLGARLSG----------ANLSQATLVDA 80
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
L +DL GA + GAD A + LA
Sbjct: 81 NLHDADLHGASLRGADLRGADLSLA 105
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 47/84 (55%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A T A ++++ F+ +K L +A ++A+ A+L+ + R L ANL++A+LV
Sbjct: 245 QARLTGAGLQQTIFAQAKLTDVDLSRADLFEADLREANLTGAYLARTNLTRANLSDALLV 304
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
R L+ ++L A ++ A D +
Sbjct: 305 RAELSSANLMDANLQRAVLPDGKV 328
>gi|158340319|ref|YP_001521675.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310560|gb|ABW32174.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 284
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 73/141 (51%), Gaps = 16/141 (11%)
Query: 109 SAAQFGSADLRKAVHVK-ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S F ++ L++++ + + + A+F+ AD+R +DFS +K + A L++ +AN GADL
Sbjct: 68 SGVNFKASKLQRSLAIWVQAYWADFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127
Query: 168 SDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
SD++ ANLTNA L + + L GA + +D S + +
Sbjct: 128 SDSVAQDSCFKGANLWGVWAQRANLTNACLSHVDMATAKLTGAQLLDSDLSWSCL----S 183
Query: 218 QALCKYANGTNP-ITGVSTRK 237
QA+CK AN T+ + G RK
Sbjct: 184 QAVCKGANLTSACLEGSDLRK 204
>gi|85860772|ref|YP_462974.1| pentapeptide repeat-containing protein [Syntrophus aciditrophicus
SB]
gi|85723863|gb|ABC78806.1| pentapeptide repeat domain protein [Syntrophus aciditrophicus SB]
Length = 306
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A + DLR+A +H + AN T AD+++S+ S + N L +AN +GADL
Sbjct: 157 SEANLSNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWTRL-----REANLSGADL 211
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S+ + R L +ANL+ A LV L R++L G + GAD +A +
Sbjct: 212 SEAYLKRADLRKANLSRANLVDANLNRANLRGTDLRGADLGNANL 256
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 50/85 (58%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A+ + A+++E+D SG+ + A L A N GADLS+ ++ L+EA+L A L
Sbjct: 53 KADLSEANLQETDLSGANLHKADLNGANLKGVNLVGADLSEACLNGADLSEADLGKADLR 112
Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
RT L++ +L G + A+ S+ +D
Sbjct: 113 RTCLSKVNLRGTKLIEANLSNTDLD 137
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 15/128 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A G ADLR+ K N R AN ++ D+ E + G L A +AN
Sbjct: 102 SEADLGKADLRRTCLSKVNLRGTKLIEANLSNTDLDEVELRGQNLRRTKLIGANLSEANL 161
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDAVIDLAQK 217
+ DL + + L++ANLT A L ++ L++++L A + GAD S+A + K
Sbjct: 162 SNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWTRLREANLSGADLSEAYL----K 217
Query: 218 QALCKYAN 225
+A + AN
Sbjct: 218 RADLRKAN 225
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 6/105 (5%)
Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+ +LR+ + N AN ++ D+RE+D G+ + A L A K+N + A+L+ T
Sbjct: 140 ELRGQNLRRTKLIGANLSEANLSNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWT 199
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
L EANL+ A L L R+DL A + A+ DA ++ A
Sbjct: 200 -----RLREANLSGADLSEAYLKRADLRKANLSRANLVDANLNRA 239
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 50/110 (45%), Gaps = 26/110 (23%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL+K+ K N AN + AD+ E AYL++A KAN
Sbjct: 177 SDANLTGADLQKSNLSKANLNWTRLREANLSGADLSE----------AYLKRADLRKANL 226
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ A+L D ANL A L T L +DLG A + GAD +A +
Sbjct: 227 SRANLVD----------ANLNRANLRGTDLRGADLGNANLAGADLREANL 266
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 11/100 (11%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADLRKA N RAN A++ ++ G+ GA L AN GADL
Sbjct: 212 SEAYLKRADLRKA-----NLSRANLVDANLNRANLRGTDLRGADL-----GNANLAGADL 261
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + + L A L A L T L+ +D G + AD
Sbjct: 262 REANLGKTCLRGARLQGAKLNETDLSDADFTGVDLSEADL 301
>gi|212721648|ref|NP_001132583.1| uncharacterized protein LOC100194054 [Zea mays]
gi|194694818|gb|ACF81493.1| unknown [Zea mays]
gi|413933909|gb|AFW68460.1| hypothetical protein ZEAMMB73_478838 [Zea mays]
Length = 225
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 8/128 (6%)
Query: 117 DLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++D
Sbjct: 102 DLRFCDYTNEKTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVID 161
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
R+ +A+LT A+ TVL+ S A ++ F D +I Q LC TN
Sbjct: 162 RVNFEKADLTGAIFKNTVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISP 216
Query: 234 STRKSLGC 241
R LGC
Sbjct: 217 DARLELGC 224
>gi|323454309|gb|EGB10179.1| hypothetical protein AURANDRAFT_23610 [Aureococcus anophagefferens]
Length = 107
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII---- 202
FN A L A + A+ G D M ++ L A+L+NA L LT + + GA+I
Sbjct: 6 FNKAQLFSASFFDADLAGTTFVDADMKQVNLEMADLSNADLTNADLTEAYMAGAVIKDLK 65
Query: 203 --EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ D++D + Q+ LC A GTNP TG+ TR +L C
Sbjct: 66 KIDNTDWTDVDMRKDQRTYLCSIAKGTNPKTGMDTRDTLMC 106
>gi|312195986|ref|YP_004016047.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
gi|311227322|gb|ADP80177.1| pentapeptide repeat protein [Frankia sp. EuI1c]
Length = 377
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 56/104 (53%), Gaps = 9/104 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AA+ ADL ++ +K A +D +G++ + A L+ A AN TGA L D
Sbjct: 237 AARLTGADLTGSILIKTKLTA---------TDLAGARLSQANLDGADLANANLTGARLDD 287
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
++ + L+E L +AVL R L R+DL GA + GAD + A +D
Sbjct: 288 AILTGVHLSEGRLVDAVLTRANLHRADLVGADLTGADLTGARLD 331
>gi|91070460|gb|ABE11370.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
HOT0M-10G7]
Length = 157
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 64/131 (48%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F +DL+ A F D+++++ SG + A L A N + ++L +
Sbjct: 33 ADFSGSDLKGAT---------FYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNLREV 83
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D VL+ +L+N L + + I+GADF++ + + C+ A+GTNPI
Sbjct: 84 TLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVREFCEIASGTNPI 143
Query: 231 TGVSTRKSLGC 241
T TR++L C
Sbjct: 144 TNRDTRETLEC 154
>gi|358458677|ref|ZP_09168884.1| pentapeptide repeat protein [Frankia sp. CN3]
gi|357077988|gb|EHI87440.1| pentapeptide repeat protein [Frankia sp. CN3]
Length = 377
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 55/109 (50%), Gaps = 9/109 (8%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
F AA+ ADL AV K A +D +G++ + A L+ A AN TG
Sbjct: 232 FATFVAARLTGADLTGAVLAKTKLTA---------TDLAGTRLSRANLDGADLANANLTG 282
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A L D ++ L+EA L A+L R L R+DL GA + GAD + A +D
Sbjct: 283 ARLDDAVLTGAHLSEARLVGAILTRADLHRADLVGADLTGADLTGARLD 331
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
++T A + G++ G L A A TGADL+ ++ + L +L L R
Sbjct: 209 DWTIAHYPGAQLVGARLAGRDLTFATFVAARLTGADLTGAVLAKTKLTATDLAGTRLSRA 268
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L +DL A + GA DAV+
Sbjct: 269 NLDGADLANANLTGARLDDAVL 290
>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 517
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 29/180 (16%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 215
++ + L A+L+ A L+R + +DL GA + GA +D + +DL+
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 216 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 261
+++L K+ N T PI + SL + N Y + P++ PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
AD+RE+ + FNGA L A K N AD S+ + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+NA + DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 67/156 (42%), Gaps = 40/156 (25%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSA---------------DMRESDFSGSKF 147
S A ++DLR+ + N AN T A D+ E+ S
Sbjct: 57 SVANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLL 116
Query: 148 NGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDL 197
A L +A KANFT GADL +T + + N ANL+ A L T T++DL
Sbjct: 117 IRAELIRAKLTKANFTQANLNGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDL 176
Query: 198 GGAI-----IEGADFSDAVIDLAQKQALCKYANGTN 228
GA + ADFS+A + +QA YAN +N
Sbjct: 177 RGADLVKVNLPKADFSNAEL----RQANLTYANLSN 208
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 3/102 (2%)
Query: 112 QFGSADLRKAVHVKENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + R LN A L+NA L + +L ++ + A + D ++A
Sbjct: 68 EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109
>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 517
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 29/180 (16%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 215
++ + L A+L+ A L+R + +DL GA + GA +D + +DL+
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 216 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 261
+++L K+ N T PI + SL + N Y + P++ PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
AD+RE+ + FNGA L A K N AD S+ + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+NA + DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 67/156 (42%), Gaps = 40/156 (25%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSA---------------DMRESDFSGSKF 147
S A ++DLR+ + N AN T A D+ E+ S
Sbjct: 57 SVANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLL 116
Query: 148 NGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDL 197
A L +A KANFT GADL +T + + N ANL+ A L T T++DL
Sbjct: 117 IRAELIRAKLTKANFTQANLNGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDL 176
Query: 198 GGAI-----IEGADFSDAVIDLAQKQALCKYANGTN 228
GA + ADFS+A + +QA YAN +N
Sbjct: 177 RGADLVKVNLPKADFSNAEL----RQANLTYANLSN 208
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 3/102 (2%)
Query: 112 QFGSADLRKAVHVKENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + R LN A L+NA L + +L ++ + A + D ++A
Sbjct: 68 EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109
>gi|427417538|ref|ZP_18907721.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425760251|gb|EKV01104.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 397
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 25/103 (24%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F A+++E DFSG + K+N GADLSDT + ++ LN+ANL A L R
Sbjct: 283 ADFKGANLKEKDFSGRNLS----------KSNLEGADLSDTFLHKVNLNQANLHKAKLFR 332
Query: 190 TVLTR---------------SDLGGAIIEGADFSDAVIDLAQK 217
L + +DL GA + GAD S A+I K
Sbjct: 333 ANLLQANLSHANLREANLIGADLSGADLSGADLSGAIIGYGDK 375
>gi|186682860|ref|YP_001866056.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186465312|gb|ACC81113.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 589
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADL A+ N F N + A + +D S +K NGA L A A F G
Sbjct: 408 ADLSGADLSHAILNGTNLSDTILFSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLG 467
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ADLS + R+ LNEA+L+ +L L+ +DL AI+ G DFS A
Sbjct: 468 ADLSGVDLSRVSLNEADLSGVILSEADLSGADLTDAILFGTDFSYA 513
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 9/102 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A FG A+L A N + A++ +D S + +GA L +A +A+ ADLS
Sbjct: 301 SGANFGDANLSGA---------NLSGANLSGADLSSTNLSGANLSRANLSRADLNRADLS 351
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T ++R L+ NL+ A L T +R+DL AI+ GA+ S+A
Sbjct: 352 STNLNRADLSNTNLSRADLSSTNFSRADLSNAILFGANLSEA 393
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANF 162
S A ADL N RAN + AD+ +D S + N A L +A NF
Sbjct: 316 SGANLSGADLSSTNLSGANLSRANLSRADLNRADLSSTNLNRADLSNTNLSRADLSSTNF 375
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+ ADLS+ ++ L+EANL+N L L R+DL GAD S A+++
Sbjct: 376 SRADLSNAILFGANLSEANLSNVSLNHADLCRADL-----SGADLSHAILN 421
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A G A+L + N ANF A++ ++ SG+ +GA L AN + A+L
Sbjct: 281 SLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGANLSGADLSSTNLSGANLSRANL 340
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S ++R L+ NL A L T L+R+DL AD S+A++
Sbjct: 341 SRADLNRADLSSTNLNRADLSNTNLSRADLSSTNFSRADLSNAIL 385
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 57/115 (49%), Gaps = 9/115 (7%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
AKL N R+ + L A + S +S LN EA+ G I S A ADL A+
Sbjct: 453 AKLNNARLNGAMFLGADLSGVDLSRVS----LN--EADLSGV--ILSEADLSGADLTDAI 504
Query: 123 HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
+F AN SA++ S+ SG+ NGA L + A +GADLSD M++M
Sbjct: 505 LFGTDFSYANLNSANLSGSNLSGAILNGANLSHSNLSYAILSGADLSDANMEKMT 559
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 21/126 (16%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL +A N RA+ ++ ++ +D S + F+ A L A+ + AN + A+L
Sbjct: 336 SRANLSRADLNRADLSSTNLNRADLSNTNLSRADLSSTNFSRADLSNAILFGANLSEANL 395
Query: 168 SDTLMDRM---------------VLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADF 207
S+ ++ +LN NL++ +L T +L +DL A + GA
Sbjct: 396 SNVSLNHADLCRADLSGADLSHAILNGTNLSDTILFSTNLSDAILMAADLSYAKLNGAKL 455
Query: 208 SDAVID 213
++A ++
Sbjct: 456 NNARLN 461
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 127 NFRANFT-SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NFR+ + A++ +DFSG+ + AYL A NF GA+LS L+ ANL+ A
Sbjct: 259 NFRSAYLGDANLTGADFSGADLSLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGA 318
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
L L+ ++L GA + A+ S A ++ A
Sbjct: 319 NLSGADLSSTNLSGANLSRANLSRADLNRA 348
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%), Gaps = 1/101 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S+ F ADL A+ N AN ++ + +D + +GA L A+ N + L
Sbjct: 371 SSTNFSRADLSNAILFGANLSEANLSNVSLNHADLCRADLSGADLSHAILNGTNLSDTIL 430
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
T + +L A+L+ A L L + L GA+ GAD S
Sbjct: 431 FSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLGADLS 471
>gi|428310629|ref|YP_007121606.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
gi|428252241|gb|AFZ18200.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
Length = 542
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 46/87 (52%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R +F S D+ D +G +A K NF GADLS+ R LN +NL +A L
Sbjct: 415 RRDFASQDLSGLDLHKVDLSGGIFHQAKLAKTNFQGADLSNADFGRASLNRSNLRDANLG 474
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
R L+ +DL GA + GAD S A ++ A
Sbjct: 475 RAYLSYADLEGADLRGADLSYAYLNHA 501
>gi|119486763|ref|ZP_01620738.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
gi|119456056|gb|EAW37189.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
Length = 331
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 61/128 (47%), Gaps = 20/128 (15%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR--------ANFTSADMRE 139
++ L D N +A+ RG F ADLR A N R N AD+R
Sbjct: 104 LAILLDANLIQADLRG-------VNFQGADLRGACLRGANLRYERRIYDGVNLRGADLRG 156
Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
+D G GA L +A N GA+L++T++ +L +ANLT A L LT +DL G
Sbjct: 157 ADLQGVNLTGADLTRA-----NLRGANLAETVLRGAILKQANLTQANLQSAFLTEADLSG 211
Query: 200 AIIEGADF 207
A + GA+
Sbjct: 212 ARLIGANL 219
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 60/106 (56%), Gaps = 11/106 (10%)
Query: 118 LRKAVHVKENF-RANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTL 171
LR A+ + N +AN SA + E+D SG++ GA LE+A+ +A G +L D++
Sbjct: 184 LRGAILKQANLTQANLQSAFLTEADLSGARLIGANLRKVKLERAILIEAQLPGVELCDSI 243
Query: 172 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 212
+ + L+ ANL+ A L RT L TR+DL A + AD +DA +
Sbjct: 244 LPDVKLSSANLSGADLSRTNLVRADLTRTDLSNANLTQADLTDASV 289
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 59/132 (44%), Gaps = 25/132 (18%)
Query: 95 NKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRANFTSADMRESDF----------- 142
N+Y+A R I A SADL ANF AD++ S+F
Sbjct: 8 NRYQAGERDFRDIHLRNANLNSADL---------IDANFNHADLQGSEFVFAYLNSVNFV 58
Query: 143 ----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
+K +GAYL KA AN + ADL ++ +ANL+ A+L+ L ++DL
Sbjct: 59 RANLGSAKLSGAYLNKANLSGANLSDADLHGAVLQGADFRKANLSLAILLDANLIQADLR 118
Query: 199 GAIIEGADFSDA 210
G +GAD A
Sbjct: 119 GVNFQGADLRGA 130
>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 1011
Score = 57.4 bits (137), Expect = 8e-06, Method: Composition-based stats.
Identities = 37/102 (36%), Positives = 54/102 (52%), Gaps = 14/102 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A +ADLR A + RAN + A++R ++ SG+ +G YL A +AN A+
Sbjct: 850 SGADLRTADLRSANLI----RANLSDANLRSANLSGANLSGVYLNSADLRRANLNDAN-- 903
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LN+A+L+ A L L+ +DL GA + ADFS A
Sbjct: 904 --------LNDADLSGANLRSADLSGADLSGADLSVADFSSA 937
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 6/90 (6%)
Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD-----RMVLNEA 180
N R ++ + AD+R +D + A L A AN +GA+LS ++ R LN+A
Sbjct: 843 NLRTSDLSGADLRTADLRSANLIRANLSDANLRSANLSGANLSGVYLNSADLRRANLNDA 902
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
NL +A L L +DL GA + GAD S A
Sbjct: 903 NLNDADLSGANLRSADLSGADLSGADLSVA 932
>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 268
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 56/106 (52%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A DL A ++ N AN +AD+ E++ ++ NGAYL KA YKAN
Sbjct: 139 ANLRETDLSTAKLIRANLGFANLIEANLINADLSEANLYEAQLNGAYLYKANFYKANLHQ 198
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A LS + R +EANL+ A L + LT ++L GA ++GA+ A
Sbjct: 199 AHLSGAYLFRANFSEANLSCANLTWSNLTGANLAGANLQGANLRGA 244
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 52/101 (51%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +A+L+ A+ + N + A++RE+D S +K A L A +AN ADLS+
Sbjct: 114 ADLSTANLQGAIIAEANLIGTDLRDANLRETDLSTAKLIRANLGFANLIEANLINADLSE 173
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ LN A L A + L ++ L GA + A+FS+A
Sbjct: 174 ANLYEAQLNGAYLYKANFYKANLHQAHLSGAYLFRANFSEA 214
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD--RMV---LNEANLTN 184
AN ++R ++ G N L A+ +AN + ADLS + +++ L+EANL+
Sbjct: 34 ANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEANLSEANLSV 93
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
A L LT+++L A + GAD S A
Sbjct: 94 ANLSGATLTQANLSYAHLIGADLSTA 119
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
SAA +LR A N + + + A + ++ S + +GA L +A +AN + A+L
Sbjct: 32 SAANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEANLSEANL 91
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 210
S + L +ANL+ A L+ L+ ++L GAII G D DA
Sbjct: 92 SVANLSGATLTQANLSYAHLIGADLSTANLQGAIIAEANLIGTDLRDA 139
>gi|359459933|ref|ZP_09248496.1| hypothetical protein ACCM5_14478 [Acaryochloris sp. CCMEE 5410]
Length = 315
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
++ AD++E DFSG + A L + A +K N GA+L++ + R L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
L LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293
>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 331
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 10/138 (7%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA 130
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 131 N---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+ S ++R +D G+ G L A +AN TGA+L++ ++ +LN+ NL+ L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLTECVLRGAILNQTNLSETNL 199
Query: 188 VRTVLTRSDLGGAIIEGA 205
+LT +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 7/126 (5%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSA-----DMRESDFSGSK 146
LN+Y + + G+ A+ +ADL A +F+ ANF A ++ ++ ++
Sbjct: 7 LNQYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAR 66
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
GA L +A A T AD T++ L +ANLT A LV L ++DL GA ++GAD
Sbjct: 67 LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126
Query: 207 FSDAVI 212
A +
Sbjct: 127 LRGACL 132
>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
Length = 309
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 71/143 (49%), Gaps = 9/143 (6%)
Query: 89 SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGS 145
+AL N A+ RG G S A ADLR + V + R + S +R+++ +G+
Sbjct: 45 AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLR--WVS--LRKANLTGA 100
Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
A L A +AN TGA LS+ ++ L +LT A L R LTR++L A + GA
Sbjct: 101 DLTRANLANADLSEANLTGAQLSEAIVRDANLTLTDLTLAELERANLTRANLTEAYLRGA 160
Query: 206 DFSDAVIDLAQKQALCKYANGTN 228
D +DAV L + Q L G N
Sbjct: 161 DLTDAV--LRESQLLQANLRGAN 181
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
SA A+L +A+ + N R A A++RE F + A L+KA N GADL
Sbjct: 183 SATNLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKA-----NLVGADL 237
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ + +L ANL++A+L+ L ++L GA + GA+ +A++
Sbjct: 238 RGVSLAQALLRGANLSSAILIGANLMGANLSGADLRGANLIEAIL 282
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 16/108 (14%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+ A+LR+ + N R AN AD+R + + GA L A+ AN G
Sbjct: 205 ARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQALLRGANLSSAILIGANLMG 264
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+LS A+L A L+ +LT + L G + D S+A++
Sbjct: 265 ANLSG----------ADLRGANLIEAILTGASLNGVDLSAVDMSEAIL 302
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 62/131 (47%), Gaps = 16/131 (12%)
Query: 91 LADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSK 146
L DL E E TR + A ADL AV ++E + A++R ++ S +
Sbjct: 134 LTDLTLAELERANLTRANL---TEAYLRGADLTDAV-LRE---SQLLQANLRGANLSATN 186
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AI 201
A LE+A+ AN A L + + + EANL +A L + L +DL G A+
Sbjct: 187 LQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQAL 246
Query: 202 IEGADFSDAVI 212
+ GA+ S A++
Sbjct: 247 LRGANLSSAIL 257
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD----RMV-LNEANLT 183
RA+ T A ++ ++ + GA L A +A+ GADL ++ R V L +ANLT
Sbjct: 39 RADLTDAALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVSLRKANLT 98
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L R L +DL A + GA S+A++
Sbjct: 99 GADLTRANLANADLSEANLTGAQLSEAIV 127
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKA------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
+ AQ A +R A + + E RAN T A++ E+ G+ A L ++ +AN
Sbjct: 118 TGAQLSEAIVRDANLTLTDLTLAELERANLTRANLTEAYLRGADLTDAVLRESQLLQANL 177
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
GA+LS T L +ANL A+L+ L R+ L A + F +A
Sbjct: 178 RGANLSAT-----NLQQANLERAILIGANLRRARLEEANLREVAFKEA 220
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 15/72 (20%)
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
+ DF+G A+L + + TG DLS A+LT+A L T L R+DL
Sbjct: 14 DRDFAGIHLRRAHLSRCI-----LTGIDLS----------RADLTDAALQSTNLQRADLR 58
Query: 199 GAIIEGADFSDA 210
GAI+ GA+ S A
Sbjct: 59 GAILTGANLSQA 70
>gi|308813604|ref|XP_003084108.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
gi|116055991|emb|CAL58524.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
Length = 177
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 63/129 (48%), Gaps = 17/129 (13%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT ++ ++F G+ G A A F GA+L + + + L++A+LT+A+L
Sbjct: 48 AFFTKGSLKRANFDGANLEGITFFGADLTGATFRGANLQNANLGQANLSKADLTDAILSG 107
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------ALCKYANGTNPITG 232
+++ + IEG+D+S+ ++ + + LCK A G NP+TG
Sbjct: 108 AIVSSAQFDDVKIEGSDWSEVIVRKREAKDDTTDDLFCVAYQDILTGLCKVAKGENPVTG 167
Query: 233 VSTRKSLGC 241
+ T +L C
Sbjct: 168 LPTELTLMC 176
>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 508
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 11/143 (7%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F DLR+A + N AN + A++R +D SG+ GA L +A AN GA+LS+
Sbjct: 181 ADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANLYGANLSN 240
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
ANLTNA LV LT ++L GA GAD S + + A+ + ++
Sbjct: 241 ----------ANLTNASLVHADLTLANLNGADWVGADLSGSTLSGAKLYDVPRFGIKAEE 290
Query: 230 ITGVSTRKSLGCGNSRRNAYGSP 252
+T S NS+ +GSP
Sbjct: 291 VTCEWVDLSSNGDNSQVYRFGSP 313
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 14/152 (9%)
Query: 73 STALAAAVVASCSSNISAL--ADLNKYE----AETRGEFGIG-------SAAQFGSADLR 119
S+ L A++ + N++ L ADL++ + A RGE S A ADLR
Sbjct: 75 SSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANLTGADLR 134
Query: 120 KAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
+A + NF AN + A++R + + + F A L A KA+ GAD S T + + L
Sbjct: 135 EAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKADLNGADFSGTDLRQANLC 194
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ NL+ A L L +DL GA + GAD ++A
Sbjct: 195 QVNLSGANLSGANLRWADLSGANLRGADLNEA 226
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 45/80 (56%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NFT ++ E++ S + A L +A + N +GA+L++ + LN A L+++ LVR
Sbjct: 22 NFTGINLNEANLSRINLSQANLSEASLFVTNLSGANLNEVNLSNANLNVARLSSSHLVRA 81
Query: 191 VLTRSDLGGAIIEGADFSDA 210
+L + L A + AD S+A
Sbjct: 82 ILQGATLNVANLVRADLSEA 101
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F N + A++ E + S + N A L + +A GA L+ + R L+EA L A L
Sbjct: 49 FVTNLSGANLNEVNLSNANLNVARLSSSHLVRAILQGATLNVANLVRADLSEAQLMGAAL 108
Query: 188 VRTVLTRSDLGGAI-----IEGADFSDA 210
+R L R++L A + GAD +A
Sbjct: 109 IRGELIRAELSKANFSKANLTGADLREA 136
>gi|158336687|ref|YP_001517861.1| hypothetical protein AM1_3555 [Acaryochloris marina MBIC11017]
gi|158306928|gb|ABW28545.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 315
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
++ AD++E DFSG + A L + A +K N GA+L++ + R L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
L LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293
>gi|78779034|ref|YP_397146.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
gi|78712533|gb|ABB49710.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
Length = 157
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 64/131 (48%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F +DL+ A F D+++++ SG + A L A N + ++L +
Sbjct: 33 ADFSGSDLKGAT---------FYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNLREV 83
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D VL+ +L+N L + + I+GADF++ + + C+ A+GTNPI
Sbjct: 84 TLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVRKFCESASGTNPI 143
Query: 231 TGVSTRKSLGC 241
T TR++L C
Sbjct: 144 TNRDTRETLEC 154
>gi|94266259|ref|ZP_01289965.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93453141|gb|EAT03609.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 818
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 54/98 (55%), Gaps = 12/98 (12%)
Query: 127 NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMD------R 174
+FRA N + AD +DFS + F GA L AV + + TG A+L+D +D R
Sbjct: 372 DFRAANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSR 431
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L ANLTNA L LT +DL AI+ GAD +AV+
Sbjct: 432 ATLIRANLTNASLREADLTGADLSNAILTGADLREAVL 469
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N D+RE DF G++ +G ++A A+F+GADL + L A L A L R
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 224
L R+DLG A + AD + A + A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 65/138 (47%), Gaps = 14/138 (10%)
Query: 109 SAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFSG-SKFNGAYLEKAVAYKAN 161
S A F A+L AV + E AN T A + ++D S + A L A +A+
Sbjct: 389 SKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREAD 448
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
TGADLS+ +L A+L AVLVRT LT + L A + + SDA DL+
Sbjct: 449 LTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTLSDA--DLSNADLKE 501
Query: 222 KYANGTNPITGVSTRKSL 239
NG N G S +SL
Sbjct: 502 ASLNGVNLGAGASVLQSL 519
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 6/99 (6%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
LR A+ ++ + F D+R +D G+ A L A A+ ADLS + R L
Sbjct: 519 LRSAI----SWSSRFVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDL 574
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
ANLT+A+L T+L+ + L A A+F++A DL Q
Sbjct: 575 RWANLTDAILQGTILSNASLNDANFNRANFAEA--DLTQ 611
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 111 AQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A FG D RK + +FR NF+ AD+ + F+ + +GA L++ A G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+DLS + + L +ANL A L L +DL A + AD S A DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
LA ++ E + RG G F A+LR A +F A+ AD+ E+D G+K G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A L + +A+ ADLS+ + R L A L A+L R + +D ADF
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255
Query: 210 AVI 212
A
Sbjct: 256 ATF 258
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 53/112 (47%), Gaps = 24/112 (21%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-- 168
A G A LR+A+ RA F D R+ D + F GA + + NF+GADLS
Sbjct: 221 ANLGGARLRQAILR----RALFGETDARKVDARQADFRGATFQ-----RGNFSGADLSRA 271
Query: 169 ---DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 207
DT + +L E +L A L + L+R ++LGGA + GAD
Sbjct: 272 RFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 3/121 (2%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADL A V+ + A + A +++S +G+ +G+ L A A+F A+LS
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 227
++AN A L VL ++DL G + A+ +DA +D A +A AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440
Query: 228 N 228
N
Sbjct: 441 N 441
>gi|94266194|ref|ZP_01289904.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93453242|gb|EAT03697.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 818
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 54/98 (55%), Gaps = 12/98 (12%)
Query: 127 NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMD------R 174
+FRA N + AD +DFS + F GA L AV + + TG A+L+D +D R
Sbjct: 372 DFRAANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSR 431
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L ANLTNA L LT +DL AI+ GAD +AV+
Sbjct: 432 ATLIRANLTNASLREADLTGADLSNAILTGADLREAVL 469
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N D+RE DF G++ +G ++A A+F+GADL + L A L A L R
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 224
L R+DLG A + AD + A + A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 65/138 (47%), Gaps = 14/138 (10%)
Query: 109 SAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFSG-SKFNGAYLEKAVAYKAN 161
S A F A+L AV + E AN T A + ++D S + A L A +A+
Sbjct: 389 SKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREAD 448
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
TGADLS+ +L A+L AVLVRT LT + L A + + SDA DL+
Sbjct: 449 LTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTLSDA--DLSNADLKE 501
Query: 222 KYANGTNPITGVSTRKSL 239
NG N G S +SL
Sbjct: 502 ASLNGVNLGAGASVLQSL 519
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 6/99 (6%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
LR A+ ++ + F D+R +D G+ A L A A+ ADLS + R L
Sbjct: 519 LRSAI----SWSSRFVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDL 574
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
ANLT+A+L T+L+ + L A A+F++A DL Q
Sbjct: 575 RWANLTDAILQGTILSNASLNDANFNRANFAEA--DLTQ 611
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 111 AQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A FG D RK + +FR NF+ AD+ + F+ + +GA L++ A G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+DLS + + L +ANL A L L +DL A + AD S A DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
LA ++ E + RG G F A+LR A +F A+ AD+ E+D G+K G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A L + +A+ ADLS+ + R L A L A+L R + +D ADF
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255
Query: 210 AVI 212
A
Sbjct: 256 ATF 258
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 53/112 (47%), Gaps = 24/112 (21%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-- 168
A G A LR+A+ RA F D R+ D + F GA + + NF+GADLS
Sbjct: 221 ANLGGARLRQAILR----RALFGETDARKVDARQADFRGATFQ-----RGNFSGADLSRA 271
Query: 169 ---DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 207
DT + +L E +L A L + L+R ++LGGA + GAD
Sbjct: 272 RFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 3/121 (2%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADL A V+ + A + A +++S +G+ +G+ L A A+F A+LS
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 227
++AN A L VL ++DL G + A+ +DA +D A +A AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440
Query: 228 N 228
N
Sbjct: 441 N 441
>gi|428222198|ref|YP_007106368.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995538|gb|AFY74233.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 225
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 9/105 (8%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A+ ADL A K A + A++ ++ SG+ + +L +AV AN ADL+
Sbjct: 26 AELNDADLSGANLSK----ARMSGAELNRANMSGANLHSTHLNRAVMKNANLENADLTGA 81
Query: 171 LMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAIIEGADFSDA 210
M + L+EANLTNA L V + LT ++L GAI+ ADFS++
Sbjct: 82 KMMEVNLSEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNS 126
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 58/122 (47%), Gaps = 11/122 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 162
S A +A+L V+ N AN A + +DFS S + GA L+ A+ N
Sbjct: 89 SEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNL 148
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
TGADLS + + L+ ANL+ A L + LGGA I A+F+ + A + +
Sbjct: 149 TGADLSGINLKGVNLSGANLSMANLSGAI-----LGGANITKANFAQTDLSNADLRDVNI 203
Query: 223 YA 224
YA
Sbjct: 204 YA 205
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 53/119 (44%), Gaps = 11/119 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 163
S A S L +AV N A+ T A M E + S + A L ++N T
Sbjct: 54 SGANLHSTHLNRAVMKNANLENADLTGAKMMEVNLSEANLTNANLSNVSGVESNLTMANL 113
Query: 164 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
AD S++ + ++ L A+L A+ T LT +DL G ++G + S A + +A
Sbjct: 114 AGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNLTGADLSGINLKGVNLSGANLSMAN 172
>gi|428308708|ref|YP_007119685.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250320|gb|AFZ16279.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 294
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 44/79 (55%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF A + + G+ GA L A + N GADLS ++R L +ANLT A+L RT
Sbjct: 186 NFRRAKLTAATLEGANLTGANLTDAQLNRVNLQGADLSGANLERACLEDANLTGAILRRT 245
Query: 191 VLTRSDLGGAIIEGADFSD 209
L+ +++ G + G DFSD
Sbjct: 246 QLSEANMSGTKLYGVDFSD 264
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 6/106 (5%)
Query: 109 SAAQFGSADLR--KAVHVK----ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S AQ A+L K VK E RAN + A M +S G+K +GA L A AN
Sbjct: 58 SGAQMNWANLSFVKMNEVKLIETELTRANLSGAFMVKSLLPGAKMSGADLMGANLRGANL 117
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
GA+L + ++R+ L +ANL L+ + L GA++ G+ +
Sbjct: 118 WGANLCGSQLERVNLRDANLMGVNFKWANLSEARLMGAMLYGSSLN 163
>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
Length = 439
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKENFR----------------ANFTSADMRESDFSGSKFNGAYLEK 154
A+ G DLRKA K +F NF ADM+E++ G+ GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A A+ +GA+L ++ +L ANL A+L L ++L A ++GAD + A +
Sbjct: 345 AFLKGADLSGANLKGAILYGAMLYGANLDGAILTNVSLFDANLEKASLKGADLTGATL 402
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L A K N +A+ + A + +++ G+ + YL+KA N A L
Sbjct: 56 SKANLEDAKLNGANLSKANLSKADLSGASLDKANLEGANLSMTYLKKANMKAVNAAHAWL 115
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+D ++ + +A+L A L R L + + GA +E A DAV+
Sbjct: 116 ADANLNGAFMKDASLKAANLARANLRWAKMSGADLEQASLKDAVL 160
>gi|427729960|ref|YP_007076197.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427365879|gb|AFY48600.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 937
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 57/104 (54%), Gaps = 1/104 (0%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A A+L+ A + N + AN A++ ++ G+ GA L++A+ +A GA+L
Sbjct: 812 GANLYGANLQGANLQRANLQGANLQRANLYGANLEGANLYGANLQRAILQRAILEGANLQ 871
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ R L ANL A+L R L ++L GA +EGA+ +A++
Sbjct: 872 RAILQRANLEGANLQRAILQRANLEGANLEGANLEGANLQEAIL 915
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F A+L+ A N + AN A+++ ++ G+ A L A AN GA+L
Sbjct: 798 ANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGANLEGANLYGANLQR 857
Query: 170 TLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEGADFSDA 210
++ R +L ANL A+ L R +L R++L GA +EGA+ A
Sbjct: 858 AILQRAILEGANLQRAILQRANLEGANLQRAILQRANLEGANLEGANLEGA 908
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 107 IGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
I S+ F A+ ++A N + AN A+++ ++ + GA L++A Y AN GA
Sbjct: 789 ILSSKDFYMANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGANLEGA 848
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+L + R +L A L A L R +L R++L EGA+ A++ A
Sbjct: 849 NLYGANLQRAILQRAILEGANLQRAILQRANL-----EGANLQRAILQRA 893
>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 267
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 56/105 (53%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A +A+ +KA + +N T AD+ ++D +G + A L +A + NFTG DL
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTGGDL 191
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S ++ L E NLT L L+R++L G ++ GA+ + ++
Sbjct: 192 SRVMLVGANLAETNLTAVNLSDANLSRAELNGVVLAGANLNRVIL 236
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 6/104 (5%)
Query: 112 QFGSADLRKA--VHVK----ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
F A+L KA VH F A ++A++ +++ S + F A L A + +N T A
Sbjct: 100 NFSEANLIKANLVHAALYCANFFMAMMSAANLSQANMSAANFQKATLISAYLHNSNLTQA 159
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
DLSD + + L++ANL+ A L+RT T DL ++ GA+ ++
Sbjct: 160 DLSDADLTGINLSDANLSQATLIRTNFTGGDLSRVMLVGANLAE 203
>gi|242034055|ref|XP_002464422.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
gi|241918276|gb|EER91420.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
Length = 221
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
K N + + +A M E+ F G+ + + KA A A+F G D ++ ++DR+ +A+LT
Sbjct: 108 KTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLT 167
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A+ VL+ S A ++ F D +I Q LC TN +R LGC
Sbjct: 168 GAIFKNAVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISPDSRLELGC 220
>gi|397645344|gb|EJK76787.1| hypothetical protein THAOC_01435 [Thalassiosira oceanica]
Length = 224
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 2/113 (1%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+FT + + FS S G KA A+F+GAD + ++ ANL N V +
Sbjct: 111 DFTQIIAKGTIFSKSNLQGCRFYKAYLVNADFSGADARGAAFEDTSMDGANLRNIVASGS 170
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 241
+S L +EG DF+DA I + +C + GTNP TG TR SL C
Sbjct: 171 YFGQSLLDVESLEGGDFTDAQIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 223
>gi|422295276|gb|EKU22575.1| hypothetical protein NGA_0469800 [Nannochloropsis gaditana CCMP526]
Length = 90
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
NF GAD S+ ++DR+ + +NL ++ VL+ + GA + +DF+D + + L
Sbjct: 7 NFEGADFSNAVVDRVSFDGSNLKGSIFSNAVLSGTSFVGADLTDSDFTDTYMGEFNLREL 66
Query: 221 CKYAN--GTNPITGVSTRKSLGC 241
CK GTNP+T T++S GC
Sbjct: 67 CKNPTLKGTNPVTQAPTKESAGC 89
>gi|83955651|ref|ZP_00964231.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
gi|83839945|gb|EAP79121.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
Length = 189
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 10/116 (8%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
N EA+ RG + A G ADLR A ++E A+ + A++ +D SG+K GA L +
Sbjct: 12 NLTEADLRGA-DLREADLSGRADLRGA-DLRE---ADLSGAELFYADLSGAKLIGAILSR 66
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+ AN +GADL R+ L+ A+L+ +L+ LT +DL GA + AD S A
Sbjct: 67 AILISANLSGADLR-----RVDLSGADLSGTILIGANLTGADLTGANLSSADLSGA 117
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 11/123 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 164
S A+ A L +A+ + AN + AD+R D SG+ +G L A A+ TG
Sbjct: 55 SGAKLIGAILSRAILIS----ANLSGADLRRVDLSGADLSGTILIGANLTGADLTGANLS 110
Query: 165 -ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
ADLS + M+L ANL+ A L R L+ ++L GA + AD A +L + Y
Sbjct: 111 SADLSGANLSGMILRGANLSGANLSRADLSGANLSGASVTEADLGGA--NLTEANLTRTY 168
Query: 224 ANG 226
NG
Sbjct: 169 LNG 171
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 1/95 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL + + N A+ T A++ +D SG+ +G L A AN + ADLS +
Sbjct: 87 ADLSGTILIGANLTGADLTGANLSSADLSGANLSGMILRGANLSGANLSRADLSGANLSG 146
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ EA+L A L LTR+ L GA + SD
Sbjct: 147 ASVTEADLGGANLTEANLTRTYLNGATLCNTTMSD 181
>gi|150016367|ref|YP_001308621.1| pentapeptide repeat-containing protein [Clostridium beijerinckii
NCIMB 8052]
gi|149902832|gb|ABR33665.1| pentapeptide repeat protein [Clostridium beijerinckii NCIMB 8052]
Length = 1084
Score = 56.2 bits (134), Expect = 2e-05, Method: Composition-based stats.
Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 11/144 (7%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
ADL++ + G S F ADL A+ V+ +A+F+ A + E+ G+ FN +
Sbjct: 914 ADLSRASMDYTGL----SYCNFEKADLSYAILVESGVSKADFSEASLSEAHIEGTFFNKS 969
Query: 151 YLEKAV-----AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
EKA ++++F + + + V+ E+N NA + T L DL A + GA
Sbjct: 970 KFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESNFKNATFINTCLRNVDLEEADLTGA 1029
Query: 206 DFSDAVIDLAQ-KQALCKYANGTN 228
D S+A + A+ +A+ + N TN
Sbjct: 1030 DMSNANLSNAKINKAIFEGTNLTN 1053
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 54/127 (42%), Gaps = 23/127 (18%)
Query: 109 SAAQFGSADLRKAVHVKENF-------RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
S A F A L +A H++ F +A+ M SDF FN A L AV ++N
Sbjct: 947 SKADFSEASLSEA-HIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESN 1005
Query: 162 FTGADLSDTLMDRMVLNEANLT---------------NAVLVRTVLTRSDLGGAIIEGAD 206
F A +T + + L EA+LT A+ T LT DL IE D
Sbjct: 1006 FKNATFINTCLRNVDLEEADLTGADMSNANLSNAKINKAIFEGTNLTNVDLTNVDIENID 1065
Query: 207 FSDAVID 213
FS +ID
Sbjct: 1066 FSKTIID 1072
Score = 40.4 bits (93), Expect = 0.88, Method: Composition-based stats.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 10/112 (8%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRES--DFSG---SKFNGAYLEKAV-----AYKA 160
A FG A+L + + NF AD+ + D++G F A L A+ KA
Sbjct: 890 ANFGYANLNDSHISGTLYNCNFKEADLSRASMDYTGLSYCNFEKADLSYAILVESGVSKA 949
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+F+ A LS+ ++ N++ A L+ T + RSD A+ S AV+
Sbjct: 950 DFSEASLSEAHIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVM 1001
>gi|145355959|ref|XP_001422212.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582452|gb|ABP00529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 125
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 57/110 (51%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
FT ++ ++F+ + G L A A F A+LS+ + + L A+ TNA+L +
Sbjct: 16 FTKGSLKRANFNDANLTGITLFGADLSNATFVNANLSNANLGQANLTGADFTNAILSGAI 75
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
++ + L + +D+SD ++ LCK A+G NP+TG T SL C
Sbjct: 76 VSSAQLDEVKLTNSDWSDVIVRKDVLTGLCKVADGENPVTGNITALSLMC 125
>gi|167771967|ref|ZP_02444020.1| hypothetical protein ANACOL_03340 [Anaerotruncus colihominis DSM
17241]
gi|167665765|gb|EDS09895.1| pentapeptide repeat protein [Anaerotruncus colihominis DSM 17241]
Length = 314
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 67/129 (51%), Gaps = 10/129 (7%)
Query: 94 LNKYEAETRGEF-GIG---SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
L+K+ A RGE G+ + A ADL KA N +AN + A++ ++ S
Sbjct: 7 LDKHAAWLRGEPEGVKADLTGANLPGADLSKANLSGANLFGANLSKANLSGANLFGANLS 66
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
G+ GA L KA AN +GADLS T + L++ANL+ A L L+R+ L GA +
Sbjct: 67 GANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLS 126
Query: 204 GADFSDAVI 212
A+ S A +
Sbjct: 127 KANLSKANL 135
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 92 ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSG 144
ADL+K FG S A A+L A N +AN + A++ +D S
Sbjct: 33 ADLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLSR 92
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGG 199
+ GA L KA AN +GADLS T + + L++ANL+ A L L++++L G
Sbjct: 93 THLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSG 152
Query: 200 AIIEGADFSDAVI 212
A + GA+ S A +
Sbjct: 153 ANLFGANLSGANL 165
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 109 SAAQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL R + + +AN + A++ +D S + GA L KA KAN +GA+L
Sbjct: 81 SGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANL 140
Query: 168 SDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 208
+ + L+ ANL + A L L++++L GA + GAD S
Sbjct: 141 FGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLS 186
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADL + H+ A+ + A++ +++ SG+ GA L KA AN GA+LS
Sbjct: 106 SGANLSGADLSR-THLPG---ADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLS 161
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ L++ANL+ A L L+R+ L GA + A+ S A +
Sbjct: 162 GANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKANL 205
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 109 SAAQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S ADL KA K N F AN + A++ ++ G+ +GA L A KAN
Sbjct: 116 SRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANL 175
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
+GA+LS + R L A+L+ A L + L+ ++L G G
Sbjct: 176 SGANLSGADLSRTHLPGADLSKANLSKANLSGANLSGPTCPG 217
>gi|123965950|ref|YP_001011031.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9515]
gi|123200316|gb|ABM71924.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
Length = 157
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F D+++++ S A L A N + ++L + +D VL+ +LTN L
Sbjct: 43 ATFYLTDLQDANLSDCDLQNASLYGAKLKDTNLSNSNLREVTLDSAVLDGTDLTNTNLED 102
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + I+GADF++ + + CK A+GTNP T TR++L C
Sbjct: 103 SFAYSTQFENVKIQGADFTNVYLPKDIVREFCKEASGTNPFTNRETRETLEC 154
>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
WPP163]
Length = 846
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 45/160 (28%), Positives = 76/160 (47%), Gaps = 12/160 (7%)
Query: 78 AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADM 137
A++ SCS + A+ ++ T + S + SAD +A + N R A +
Sbjct: 687 GALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLR----QASL 741
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S L
Sbjct: 742 IGAVFALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQL 801
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
GGA GA+ A DL+Q + + T + G T++
Sbjct: 802 GGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834
>gi|58613539|gb|AAW79356.1| chloroplast thylakoid 11kDa protein [Heterocapsa triquetra]
Length = 91
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 14/103 (13%)
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
++ +G+ F GA L +A A TGADL++ ++ +N T + V++
Sbjct: 1 DAGLAGADFTGAVLTQANLELAQLTGADLTNAIVTEAYING---TTKLEVKSA------- 50
Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+GADF+D + Q+ LC A GTNP+T V TR+S+ C
Sbjct: 51 ----DGADFTDTPLRKDQQMYLCGIAKGTNPVTKVDTRESMAC 89
>gi|115482792|ref|NP_001064989.1| Os10g0502000 [Oryza sativa Japonica Group]
gi|22165076|gb|AAM93693.1| hypothetical protein [Oryza sativa Japonica Group]
gi|31432906|gb|AAP54482.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor,
putative, expressed [Oryza sativa Japonica Group]
gi|113639598|dbj|BAF26903.1| Os10g0502000 [Oryza sativa Japonica Group]
gi|125532544|gb|EAY79109.1| hypothetical protein OsI_34214 [Oryza sativa Indica Group]
gi|125575308|gb|EAZ16592.1| hypothetical protein OsJ_32066 [Oryza sativa Japonica Group]
gi|215704684|dbj|BAG94312.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 236
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 125 KENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
K N + + +A M +S F G+ + + KA A A+F G D ++ ++DR+ +A+L
Sbjct: 123 KTNLKGKSLAAALMSDSKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLQ 182
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A+ TVL+ S A ++ F D +I Q LC TN +R LGC
Sbjct: 183 GAIFRNTVLSGSTFDDAKMQDVVFEDTIIGYIDLQKLC-----TNTSISADSRLELGC 235
>gi|383763954|ref|YP_005442936.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384222|dbj|BAM01039.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 244
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 66/128 (51%), Gaps = 13/128 (10%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK--- 146
L + N YEA+ S A ADLR A + R A AD+R+++ +G+
Sbjct: 87 LREANLYEADL-------SNAVLDQADLRYATLERAVLRSATLRGADLRDANLAGADLRV 139
Query: 147 --FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
F+GA +E+A+ A+ A+L++ ++ R L ANL NAVL L +DL GA + G
Sbjct: 140 ADFSGAQMERAILTGASLVDANLANAVLRRADLRNANLRNAVLRYADLRGADLSGADLMG 199
Query: 205 ADFSDAVI 212
AD A +
Sbjct: 200 ADLMGARL 207
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 46/90 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R NFT A + +++ S + A L +A +AN ADLS+ ++D+ L A L AVL
Sbjct: 59 RVNFTEASLNQANLSRATLLMAILSRAQLREANLYEADLSNAVLDQADLRYATLERAVLR 118
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
L +DL A + GAD A AQ +
Sbjct: 119 SATLRGADLRDANLAGADLRVADFSGAQME 148
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 9/71 (12%)
Query: 159 KANFTGADLSDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+ N G DLS ++R+ LN+ANL+ A L+ +L+R+ L A + AD S+AV+D
Sbjct: 44 QVNLDGHDLSRADLNRVNFTEASLNQANLSRATLLMAILSRAQLREANLYEADLSNAVLD 103
Query: 214 LAQKQALCKYA 224
QA +YA
Sbjct: 104 ----QADLRYA 110
>gi|17230606|ref|NP_487154.1| hypothetical protein all3114 [Nostoc sp. PCC 7120]
gi|17132208|dbj|BAB74813.1| all3114 [Nostoc sp. PCC 7120]
Length = 576
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 46/81 (56%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F N + A + +D S +K NGA L A A F GADLS + +VLN+A+L+ +L
Sbjct: 418 FSTNLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGIL 477
Query: 188 VRTVLTRSDLGGAIIEGADFS 208
LT +DL AI+ G DFS
Sbjct: 478 SEADLTGADLSDAILLGTDFS 498
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 67/132 (50%), Gaps = 23/132 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A G A+L A NF+ AN T AD +++ S +GA L A AN TGA+L
Sbjct: 268 SGAYLGDANLTGA-----NFQDANLTGADFGDANLSSVNLSGANLSSADLSSANLTGANL 322
Query: 168 SDTLMDRMVLNEANLTNAVL--------------VRTV-LTRSDLGGAIIEGADFSDAVI 212
S + R L+ A+L++++L +R L R++L AI+ GA+ SDA +
Sbjct: 323 SGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAELRRANLSNAILFGANLSDANL 382
Query: 213 DLAQ--KQALCK 222
+ A + LC+
Sbjct: 383 NHADLSRADLCR 394
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F AD+ D +G N A L + +A+ TGADLSD ++ + ANL +A
Sbjct: 450 AMFLGADLSGVDLTGVVLNDADLSGGILSEADLTGADLSDAILLGTDFSFANLNSA---- 505
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
L+ S+L GAI+ GAD S A + A
Sbjct: 506 -NLSGSNLSGAILNGADLSSANLSYA 530
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 51/90 (56%), Gaps = 6/90 (6%)
Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 180
NF+ A +A++ +FSG+ +GAYL A ANF TGAD D + + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANLSSVNLSGA 305
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
NL++A L LT ++L GA ++ AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLQRADLSRA 335
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 53/115 (46%), Gaps = 11/115 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
S A SADL A N RA+ +S+ + + +FS + +G L A
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAEL 362
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+AN + A L + LN A+L+ A L R L+ +DL A + G + SD ++
Sbjct: 363 RRANLSNAILFGANLSDANLNHADLSRADLCRADLSGADLTHATLNGTNLSDTIL 417
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 17/114 (14%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
GEF S A +LR A E RAN ++A + ++ S + N A L +A +A+
Sbjct: 345 GEF---SHANLSGVNLRDA----ELRRANLSNAILFGANLSDANLNHADLSRADLCRADL 397
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+GADL+ LN NL++ +L T +L AI+E AD S A ++ A+
Sbjct: 398 SGADLT-----HATLNGTNLSDTILFST-----NLSDAILEAADLSYAKLNGAK 441
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 1/88 (1%)
Query: 124 VKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
V E R NF A + ++ +G F+GA L A AN TGA+ D + +ANL
Sbjct: 238 VGEFLRGGNFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANL 297
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++ L L+ +DL A + GA+ S A
Sbjct: 298 SSVNLSGANLSSADLSSANLTGANLSGA 325
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLT 183
RA+ AD+ +D + + NG L + + N + ADLS ++ LN A L
Sbjct: 389 RADLCRADLSGADLTHATLNGTNLSDTILFSTNLSDAILEAADLSYAKLNGAKLNYARLN 448
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ + L+ DL G ++ AD S ++
Sbjct: 449 GAMFLGADLSGVDLTGVVLNDADLSGGIL 477
>gi|443476541|ref|ZP_21066442.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018491|gb|ELS32731.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 400
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 89/188 (47%), Gaps = 30/188 (15%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFS-----G 144
L + N EA F I A A L +A V N AN TSA M +D S G
Sbjct: 61 LVEANLAEANLTSAFLI--RADLQRACLNQAYLVAANLNSANLTSASMVNADLSLATLTG 118
Query: 145 SKFNGAYLEKA-----VAYKANFTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTR 194
+ NGA L +A ++N GADLSD+ LM + L+ ANL+ A L+ LT
Sbjct: 119 ACLNGANLSRAKLNGTFFIESNLLGADLSDSDFTGALMIKANLSGANLSQACLMNVDLTE 178
Query: 195 SDLGGAIIEGADFSDAVIDLAQKQAL-CKYANGTNPITGVSTRKS-------LGCGNSRR 246
++L GA ++G D + A+++ A A+ YAN ++GVS ++ LG +
Sbjct: 179 ANLTGAELQGVDLAGAILNAANLNAVDLVYAN----LSGVSLSRANLSWANLLGTNLEKT 234
Query: 247 NAYGSPSS 254
N GS S
Sbjct: 235 NLVGSDLS 242
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 40/73 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ D GS L A+ +AN TGA+L + +++ LN ANL A L R
Sbjct: 299 ANLSGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLNRASLTR 358
Query: 190 TVLTRSDLGGAII 202
LT ++L GA +
Sbjct: 359 ASLTGANLKGAFM 371
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
++N + A++ + S + +GA L A AN +GADLS+ + L NL NA+L
Sbjct: 267 MKSNLSGANLNGVNLSNANLSGANLSGANLMGANLSGADLSNVDLRGSYLIRTNLHNAIL 326
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVID 213
LT ++L A++ GA + A ++
Sbjct: 327 NEANLTGANLDEAVLNGASLNRANLN 352
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N T A + +S+ SG+ NG L A AN +GA+L + L+ +L + L+RT
Sbjct: 260 NLTGAFLMKSNLSGANLNGVNLSNANLSGANLSGANLMGANLSGADLSNVDLRGSYLIRT 319
Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
L + L A + GA+ +AV++
Sbjct: 320 NLHNAILNEANLTGANLDEAVLN 342
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 11/100 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A + DLR + ++ N AN T A++ E+ +G+ N A L +A +A+
Sbjct: 302 SGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLNRASLTRASL 361
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
TGA+L M NL A ++ T L +++ GAI+
Sbjct: 362 TGANLKGAFMLW-----TNLRGAFMLWTNLDGANMTGAIL 396
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 16/104 (15%)
Query: 113 FGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F A+L K++ N R N + A + ++ S + GA+L +A +AN A+
Sbjct: 6 FTKANLTKSILEGINLKGADLKRVNLSEAKLADAKLSKANLTGAFLHRADLNRANLVEAN 65
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L+ EANLT+A L+R L R+ L A + A+ + A
Sbjct: 66 LA----------EANLTSAFLIRADLQRACLNQAYLVAANLNSA 99
Score = 37.4 bits (85), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 44/85 (51%), Gaps = 17/85 (20%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++ E+D S + GA+L K+N +GA+L N NL+NA L
Sbjct: 244 ANLNETNLAEADLSWTNLTGAFL-----MKSNLSGANL----------NGVNLSNANLSG 288
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDL 214
L+ ++L GA + GAD S+ +DL
Sbjct: 289 ANLSGANLMGANLSGADLSN--VDL 311
>gi|307592031|ref|YP_003899622.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985676|gb|ADN17556.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 161
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F +++ + K+ + AD+ E D +G K GA L KA Y AN +GA LS
Sbjct: 28 AYAFVQSNIDTLLSTKDCHNCDLVEADLHEKDLAGVKLYGADLSKAKLYGANLSGASLSG 87
Query: 170 TLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDAVI 212
+ L+ ANL+ + L + L +++L GA + GAD SDAV+
Sbjct: 88 ANLSGASLSGANLSGSYLQKANLKGAYLQKANLEGAALYGADLSDAVL 135
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 4/98 (4%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
ADL KA + + AN + A + ++ SG+ +GA L + KAN GA L ++
Sbjct: 68 ADLSKA----KLYGANLSGASLSGANLSGASLSGANLSGSYLQKANLKGAYLQKANLEGA 123
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
L A+L++AVL L + L GA +EGA A+ D
Sbjct: 124 ALYGADLSDAVLYGANLKGAKLKGANLEGAKTKGAIFD 161
>gi|126696014|ref|YP_001090900.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9301]
gi|126543057|gb|ABO17299.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
Length = 157
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 62/131 (47%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F +DL+ A F D+++++ SG + A L A N + ++L +
Sbjct: 33 ADFSGSDLKGAT---------FYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNLREV 83
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D VL+ +L+N L + + I+GADF++ + + C+ A GTNP
Sbjct: 84 TLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIKKFCESATGTNPF 143
Query: 231 TGVSTRKSLGC 241
T TR++L C
Sbjct: 144 TNRETRETLEC 154
>gi|386828484|ref|ZP_10115591.1| putative low-complexity protein [Beggiatoa alba B18LD]
gi|386429368|gb|EIJ43196.1| putative low-complexity protein [Beggiatoa alba B18LD]
Length = 986
Score = 55.5 bits (132), Expect = 3e-05, Method: Composition-based stats.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 26/109 (23%)
Query: 126 ENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTG 164
+N R +F+ D+R +DFSG+ A + A+ Y ANF+
Sbjct: 645 QNLRGQDFSGQDLRYADFSGADLTDALFKNAILYHVNFSNATLKNADFTKTDLSNANFSD 704
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
ADL+D L +L AN ++A L T++DL A+F+DA+ D
Sbjct: 705 ADLTDALFKNAILQHANFSDATLKNADFTKTDL-----SNANFTDAICD 748
Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats.
Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 14/148 (9%)
Query: 78 AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADM 137
+A+++ +++ K+ + + G+ S Q + + K +K N R S D
Sbjct: 588 SALMSQFFVDLAGREQATKWASRIIKQRGVASIIQNNADSILK--QLKNNQR---NSLDR 642
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R + G F+G L A +F+GADL+D L +L N +NA L T++DL
Sbjct: 643 RGQNLRGQDFSGQDLRYA-----DFSGADLTDALFKNAILYHVNFSNATLKNADFTKTDL 697
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYAN 225
A AD +DA+ K A+ ++AN
Sbjct: 698 SNANFSDADLTDALF----KNAILQHAN 721
>gi|332712234|ref|ZP_08432162.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332349040|gb|EGJ28652.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 280
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL A NF RA+ + A++ ++ +G+ F GA L A AN TGA+LS+T +
Sbjct: 171 ADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNTDLKG 230
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
L ANL L R L RSDL A+ GA+F +
Sbjct: 231 SNLTGANLNGTDLARADLERSDLRDAMTNGANFEN 265
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 2/98 (2%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ + AD+ ++ +G+ F+ A L +A AN TGAD + + L+ ANLT A L T
Sbjct: 167 DLSGADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNT 226
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
L S+L GA + G D + A DL + NG N
Sbjct: 227 DLKGSNLTGANLNGTDLARA--DLERSDLRDAMTNGAN 262
>gi|428296910|ref|YP_007135216.1| RDD domain-containing protein [Calothrix sp. PCC 6303]
gi|428233454|gb|AFY99243.1| RDD domain containing protein [Calothrix sp. PCC 6303]
Length = 718
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY---------- 158
S+AQ ADLR AV EN A+ T AD+ E+ + ++ GA L +A+A
Sbjct: 540 SSAQMVGADLRNAVL--EN--ASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLT 595
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
K ++ ADLS + +DR+ L ANL+ A L +L ++L GA + AD + A
Sbjct: 596 KTDWQAADLSGSYLDRVNLTNANLSTARLTGAILRSANLEGANLRNADLTLA 647
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 14/98 (14%)
Query: 130 ANFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM 175
NF A++ ++ F S+F G A L +A +ANF+ A+LS L+++
Sbjct: 458 VNFKGANLDQASFKNSRFRGPGDDGLWDTFDDAIADLSQAQLKQANFSEANLSRVLLNKS 517
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
L+ + L A L + L ++L A + GAD +AV++
Sbjct: 518 DLSRSTLNKANLAGSRLIGANLSSAQMVGADLRNAVLE 555
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 61/130 (46%), Gaps = 15/130 (11%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
A+ADL++ + + A F A+L + + K + +AN + + ++ S
Sbjct: 490 AIADLSQAQLK---------QANFSEANLSRVLLNKSDLSRSTLNKANLAGSRLIGANLS 540
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
++ GA L AV A+ TGADL + ++ L A L A+ + L+ ++L +
Sbjct: 541 SAQMVGADLRNAVLENASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLTKTDWQ 600
Query: 204 GADFSDAVID 213
AD S + +D
Sbjct: 601 AADLSGSYLD 610
>gi|428216484|ref|YP_007100949.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988266|gb|AFY68521.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 673
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 55/91 (60%), Gaps = 7/91 (7%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLNEANLTNA 185
N +A + E+DFS ++ GA L A+A A+ F+GADL++ + +++E NLT A
Sbjct: 435 NLQNALLSETDFSDARLGGANLTGAIATGADLRGVDFSGADLTEANLTNAIMSEVNLTGA 494
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L+R L ++DL A++ GA+ A DL+Q
Sbjct: 495 RLLRANLKQADLNFAVLRGAELMRA--DLSQ 523
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A+ G A+L A+ T AD+R DFSG+ A L A+ + N TGA L
Sbjct: 447 SDARLGGANLTGAIA---------TGADLRGVDFSGADLTEANLTNAIMSEVNLTGARLL 497
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ + LN A L A L+R L+++DL
Sbjct: 498 RANLKQADLNFAVLRGAELMRADLSQTDL 526
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 28/131 (21%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
A+++ES+ S ++ A LE AV A+ A+L ++ L E +L++A L T+
Sbjct: 549 ANLQESNLSAAELENAQLEAAVLLLADLRSANLKLANLNYADLREVDLSSADL-----TQ 603
Query: 195 SDLGG----------------AIIEGADFSDAVIDLAQ--KQALCKYANGT----NPITG 232
++L G A I+GADF+D V++LA K CK A G +P
Sbjct: 604 ANLIGANLSGANLRGTDVNQLASIDGADFTD-VVNLADTSKTYFCKIAAGQTFAESPEQR 662
Query: 233 VSTRKSLGCGN 243
+TR +L C N
Sbjct: 663 RATRATLDCPN 673
>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 576
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 46/81 (56%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F N + A + +D S +K NGA L A A F GADLS + +VLN+A+L+ +L
Sbjct: 418 FSTNLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGIL 477
Query: 188 VRTVLTRSDLGGAIIEGADFS 208
LT +DL A++ G DFS
Sbjct: 478 SEADLTGADLSDAVLLGTDFS 498
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 67/117 (57%), Gaps = 13/117 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ A FG A+L +V++ AN +SAD+ ++ +G+ +GA LE+A + + ADLS
Sbjct: 288 TGADFGDANL-SSVNLS---GANLSSADLSSANLTGANLSGANLERA-----DLSRADLS 338
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---VIDLAQKQALCK 222
+++ L+ ANL+ L R++L AI+ GA+ SDA +DL++ LC+
Sbjct: 339 SCILNDGELSHANLSGVNFRDAELCRANLSNAILFGANLSDANLNHVDLSRAD-LCR 394
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 47/84 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F AD+ D +G N A L + +A+ TGADLSD ++ + ANL +A L
Sbjct: 450 AMFLGADLSGVDLTGVVLNDADLSGGILSEADLTGADLSDAVLLGTDFSFANLNSANLSG 509
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
+ L+ + L GA + A+FS A++D
Sbjct: 510 SNLSGAILNGADLSSANFSYAILD 533
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 6/90 (6%)
Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 180
NF+ A +A++ +FSG+ +GAYL A ANF TGAD D + + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQGANLTGADFGDANLSSVNLSGA 305
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
NL++A L LT ++L GA +E AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLERADLSRA 335
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 49/106 (46%), Gaps = 1/106 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A SADL A N AN AD+ +D S N L A NF A+L
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLERADLSRADLSSCILNDGELSHANLSGVNFRDAEL 362
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+ +L ANL++A L L+R+DL A + GAD + A ++
Sbjct: 363 CRANLSNAILFGANLSDANLNHVDLSRADLCRADLSGADLTHATLN 408
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLT 183
RA+ AD+ +D + + NG L + + N + ADLS ++ LN A L
Sbjct: 389 RADLCRADLSGADLTHATLNGTNLSDTILFSTNLSDAILEAADLSYAKLNGAKLNYARLN 448
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ + L+ DL G ++ AD S ++
Sbjct: 449 GAMFLGADLSGVDLTGVVLNDADLSGGIL 477
>gi|428225059|ref|YP_007109156.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984960|gb|AFY66104.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 315
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 10/119 (8%)
Query: 131 NFTSADMRESDFSGSKFNGAYL----------EKAVAYKANFTGADLSDTLMDRMVLNEA 180
N D+R + SG+ +GA L AN +GA+LS + R LN A
Sbjct: 181 NLDGVDLRSTKLSGATLHGANLAATNFSDAKMHGGSFTGANLSGANLSRAFLKRANLNWA 240
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
NLT A L LT ++L GA IEGA+F+ + ++ L A G P + TR +L
Sbjct: 241 NLTRADLTDADLTEANLLGARIEGAEFTGVTLSDPTRRYLRLIATGVTPWSQQPTRSTL 299
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 23/125 (18%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSG----------SKFNGAYLEKAVAYKANFTGA- 165
D+ + E NF D+R +D SG + GA L +A +AN +GA
Sbjct: 2 DVNYLLRAYEAGERNFAGVDLRGADLSGVTLIAVDLSDANLMGANLSRAFLTQANLSGAF 61
Query: 166 ---------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
LS+ + + L +ANL+ A +V++ R+ L GA + GA+ + + Q
Sbjct: 62 LNWADLRYVKLSEGCLTHVDLTKANLSGAFMVKSDFNRAKLSGANLNGANLRGSHL---Q 118
Query: 217 KQALC 221
LC
Sbjct: 119 HANLC 123
>gi|78189684|ref|YP_380022.1| pentapeptide repeat-containing protein [Chlorobium chlorochromatii
CaD3]
gi|78171883|gb|ABB28979.1| pentapeptide repeat family protein [Chlorobium chlorochromatii
CaD3]
Length = 389
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 5/88 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 184
ANF ADM+ + G+ GA+ ++A +AN GA+L+ L +D+ L ANLT
Sbjct: 270 ANFYKADMKGAQLQGANLQGAHCDRAFLLQANLQGANLTKALLFGATLDKADLRNANLTE 329
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L +DL GAI+ A+ +DAV+
Sbjct: 330 ASLFGANCEGADLRGAILTRANVTDAVL 357
>gi|220910076|ref|YP_002485387.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866687|gb|ACL47026.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 332
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN D+RE+D SG+ GA L ++AN GADLS++ + + L ANL A L
Sbjct: 181 ANLREVDLREADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQAKLTG 240
Query: 190 TVLTRSDLGGAIIEGADF 207
LT ++L G +++ A
Sbjct: 241 ATLTEANLAGVMMQRAQM 258
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 1/99 (1%)
Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+LR A+ N F+AN AD+ S+ G A L++A A T A+L+
Sbjct: 191 ADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQAKLTGATLTEANLAG 250
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+M R + + L A L R L +DL GA + GA+ +
Sbjct: 251 VMMQRAQMFQVRLNRANLSRANLQGADLRGASLIGANLA 289
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 16/95 (16%)
Query: 131 NFTSADMRESDFSGSKFNGA----------------YLEKAVAYKANFTGADLSDTLMDR 174
N D+RE+D SG+ GA L A+ + GA+LS + R
Sbjct: 111 NLIETDLREADLSGANLTGACLRSANLRTERRGTPVNLRGAILAGVDLRGANLSGASLVR 170
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ L ANL A L L +DL GA + GA +D
Sbjct: 171 VNLQGANLEEANLREVDLREADLSGANLRGALLTD 205
>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 331
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 66/138 (47%), Gaps = 10/138 (7%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA 130
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 131 N---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+ S ++R +D G+ G L A +AN GA+L++ ++ +LN+ NL+ L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLTECVLRGAILNQTNLSETNL 199
Query: 188 VRTVLTRSDLGGAIIEGA 205
+LT +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 7/126 (5%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSA-----DMRESDFSGSK 146
LNKY + + G+ A+ +ADL A +F+ ANF A ++ ++ +K
Sbjct: 7 LNKYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAK 66
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
GA L +A A T AD T++ L +ANLT A LV L ++DL GA ++GAD
Sbjct: 67 LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126
Query: 207 FSDAVI 212
A +
Sbjct: 127 LRGACL 132
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 5/85 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++A++ ++ S + A L + AN T ADL+D + R L ANL+ A L R
Sbjct: 247 ANLSNANLSHANLSRANLVRAELNRTNLSSANLTQADLTDASLGRTNLRNANLSYAYLTR 306
Query: 190 TVLTRS-----DLGGAIIEGADFSD 209
T + + +L GAI+ + D
Sbjct: 307 TEFSSANTIGVNLHGAIMPNGEIHD 331
>gi|428314577|ref|YP_007151024.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256301|gb|AFZ22256.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 281
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 6/111 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL +A + N AN + A + +++ S + + A+L +A AN
Sbjct: 123 SRANLSRADLSEANLSRANLSRADLSDANLSPASLSDANLSRANLSRAFLSRANLSDANL 182
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+ A+LSD + R L+ ANL+ A L R L+ ++LGGA + GA+F ++ ID
Sbjct: 183 SRANLSDANLSRADLSRANLSRANLSRADLSGANLGGANLSGANFRNSEID 233
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++ E++ S + + A L +A +AN + ADLSD + L++ANL+ A L R
Sbjct: 110 ANLREINLSEANLSRANLSRADLSEANLSRANLSRADLSDANLSPASLSDANLSRANLSR 169
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
L+R++L A + A+ SDA + A
Sbjct: 170 AFLSRANLSDANLSRANLSDANLSRA 195
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A +A++RE + S + + A L +A +AN + A+LS R L++ANL+ A L
Sbjct: 105 APLENANLREINLSEANLSRANLSRADLSEANLSRANLS-----RADLSDANLSPASLSD 159
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
L+R++L A + A+ SDA + A
Sbjct: 160 ANLSRANLSRAFLSRANLSDANLSRA 185
>gi|33861206|ref|NP_892767.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
gi|33639938|emb|CAE19108.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
Length = 157
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F D+++++ S A L A N + ++L + +D VL+ +LTN L
Sbjct: 43 ATFYLTDLQDANLSDCDLQNASLYGAKLKDTNLSNSNLREVTLDSAVLDGTDLTNTNLED 102
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
+ + I+GADF++ + + CK A+GTNP T TR++L C
Sbjct: 103 SFAYSTQFENVKIQGADFTNVYLPKDVLREFCKDASGTNPFTNRETRETLEC 154
>gi|218438018|ref|YP_002376347.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218170746|gb|ACK69479.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 333
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 69/135 (51%), Gaps = 6/135 (4%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANF 132
LA A++ ++ L D N A+ RG G S A A++R+ K++F N
Sbjct: 92 LAGAILQETDLTLALLIDANLIGADLRGADLSGANLSGACLKGANMRQE---KKSFNTNL 148
Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
A++ ++D SG+ G L KA AN T A+L D + ++ L ANLTN +L L
Sbjct: 149 QGANLFKADLSGANMKGVDLAKANLSGANLTEANLRDADLRKVDLTNANLTNTILSEANL 208
Query: 193 TRSDLGGAIIEGADF 207
+ ++L GA ++ A+
Sbjct: 209 SEANLTGATLKKANL 223
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 11/115 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYL-----EKAVA 157
S A A L+KA V+ NFT A M ++ + GA L A
Sbjct: 209 SEANLTGATLKKANLVRAKMMHTQLSEVNFTEAIMTHANLKAANLKGANLSLTRMNHADL 268
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+AN +GA L + + + ANLT A L T LTR+DL A + A+ + A++
Sbjct: 269 TRANLSGAILKEAELIEVFFARANLTGADLQGTNLTRADLMSANLSNANLTGAIM 323
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 1/101 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A DL KA N AN AD+R+ D + + L +A +AN TGA L
Sbjct: 159 SGANMKGVDLAKANLSGANLTEANLRDADLRKVDLTNANLTNTILSEANLSEANLTGATL 218
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ R + L+ ++T ++L A ++GA+ S
Sbjct: 219 KKANLVRAKMMHTQLSEVNFTEAIMTHANLKAANLKGANLS 259
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 15/90 (16%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T A++R++D K + T A+L++T++ L+EANLT A L +
Sbjct: 176 ANLTEANLRDADL---------------RKVDLTNANLTNTILSEANLSEANLTGATLKK 220
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
L R+ + + +F++A++ A +A
Sbjct: 221 ANLVRAKMMHTQLSEVNFTEAIMTHANLKA 250
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 15/87 (17%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL-----NEANLT 183
RAN ++ +D SG+ + +A+ TGADL ++ ++L E +LT
Sbjct: 54 RANLAHTNLVTTDLSGANLS----------QADLTGADLRSAILHGIILAGAILQETDLT 103
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L+ L +DL GA + GA+ S A
Sbjct: 104 LALLIDANLIGADLRGADLSGANLSGA 130
>gi|440793397|gb|ELR14582.1| K+ channel tetramerisation subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 381
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 17/108 (15%)
Query: 112 QFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+F DLR A+H++ RANF D+ D +K NGA L + AN +GA
Sbjct: 229 KFNGCDLRGFDFHAMHLR---RANFHRCDLTGVDLRHAKLNGACLVECCLRDANLSGA-- 283
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
VL+ +LT+A R LT +DL GA++ GAD S+A +D A
Sbjct: 284 --------VLSGVDLTDADCRRADLTNADLRGAVLSGADLSEAKLDRA 323
>gi|434384824|ref|YP_007095435.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428015814|gb|AFY91908.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 377
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 61/115 (53%), Gaps = 6/115 (5%)
Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DL + ++ N RAN A++ +D G+ GA L+KA +AN GA+L ++ +
Sbjct: 200 DLAQTNLIRANLKRANLQGANLEGADLEGANLQGANLKKANLKRANLQGANLMIANLEGI 259
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDAVIDLAQKQALCKYAN 225
L ANL A+L+R L ++L GA +EG A+F A + A QA +AN
Sbjct: 260 NLVRANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANLQACHGHAN 314
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN A + ++ G+ GA LE A+ ANF GA LS + + AN A L
Sbjct: 263 RANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANL-QACHGHANFAGAYLS 321
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
+ +DL GA +EGA+ A
Sbjct: 322 KANFEGADLEGANLEGANLQRA 343
>gi|254526129|ref|ZP_05138181.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9202]
gi|221537553|gb|EEE40006.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9202]
Length = 148
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F +DL+ A F D+++++ S + A L A N + ++L +
Sbjct: 24 ADFSGSDLKGAT---------FYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNLREV 74
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D +L+ +L+N L + + I+GADF++ + + C+ A GTNPI
Sbjct: 75 TLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGTNPI 134
Query: 231 TGVSTRKSLGC 241
T TR++L C
Sbjct: 135 TNRDTRETLEC 145
>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 351
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 80/184 (43%), Gaps = 40/184 (21%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 126 ENFR-ANFTSADMRESDFSGSKFNGAYLE------------------------------- 153
N A+ T + ++D SG+ +GA L
Sbjct: 81 ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCLLNGSQLTDAILV 140
Query: 154 -----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
++V A+ TGA+L+ +++ + L+ ANLT A L+R L + +L GA + GAD S
Sbjct: 141 GATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLS 200
Query: 209 DAVI 212
++VI
Sbjct: 201 ESVI 204
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 1/100 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLTGANL 249
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQCADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 45/91 (49%), Gaps = 10/91 (10%)
Query: 130 ANFTSADMRESDFSGSKFNG----------AYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AN T A++ ++ +G+ NG A L KA AN TGA+L+ + L
Sbjct: 232 ANLTGANLTGANLTGANLNGLTLQCADLRLANLSKADLRGANLTGANLAGANLLEADLRL 291
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ANLT+A L L + L GA + GA+ + A
Sbjct: 292 ANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|158340188|ref|YP_001521358.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310429|gb|ABW32044.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 292
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 74/142 (52%), Gaps = 16/142 (11%)
Query: 109 SAAQFGSADLRKAVHVK-ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A F ++ L++++ + + + ++F+ AD+R +DFS +K + A L++ +AN GADL
Sbjct: 68 SGANFKASKLQRSLAIWVQAYWSDFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127
Query: 168 SDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
SD+ A NLTN L + +T SDL A + +D S + +
Sbjct: 128 SDSEAQDACFKGANLWGVWAQRTNLTNVCLSQVDMTTSDLTEAQLSESDLSWSFL----S 183
Query: 218 QALCKYANGTNP-ITGVSTRKS 238
QA+C AN T+ + G +K+
Sbjct: 184 QAVCVGANLTSACLEGSDLKKT 205
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLN 178
+ + T++D+ E+ S S + ++L +AV AN T A D D + R L+
Sbjct: 159 QVDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACLSRADLS 218
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+ NA L ++DL GA + GADF A
Sbjct: 219 AADCENACFFNANLYKADLRGAKLCGADFRGA 250
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 11/113 (9%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F A L A + +F +AN AD+ +S+ + F GA L A + N T LS
Sbjct: 100 ADFSCAKLSAAQLKRTDFSQANLMGADLSDSEAQDACFKGANLWGVWAQRTNLTNVCLSQ 159
Query: 170 TLMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
M L EA L+ AV V LT + L G+ ++ DF DA +
Sbjct: 160 VDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACL 212
>gi|448473532|ref|ZP_21601674.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
gi|445819044|gb|EMA68893.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
Length = 348
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 56/111 (50%), Gaps = 7/111 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +D S + A L KA Y AN +GADL+ L+D+ L A+L
Sbjct: 63 ANLRGANITGADLSSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGVGFTE 122
Query: 190 TVLTRSDLGGAIIEGADFSD------AVIDLAQKQALCKYAN-GTNPITGV 233
LTR+DL A + GA+FSD AV D + A AN G +TGV
Sbjct: 123 AHLTRADLHSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGV 173
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 11/105 (10%)
Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 157
+ A SA+L A+ K N + AN + AD +R +D G F A+L +A
Sbjct: 71 TGADLSSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGVGFTEAHLTRADL 130
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
+ A+ GA+ SD + + +A+L A L L +DL G I+
Sbjct: 131 HSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGVIL 175
>gi|398354158|ref|YP_006399622.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
gi|390129484|gb|AFL52865.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
Length = 249
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 12/124 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A+ +A+L KA V+ + +ANF+ + DFSG GA + +A+F
Sbjct: 85 SGAELTAANLEKATLVRASLAGAKADKANFSRVEAYRGDFSGISAEGALFVSSELQRADF 144
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQ 216
TGA L+ ++ L AN AVL T L+R++L GA+ EG DF A + L +
Sbjct: 145 TGARLTGADFEKAELGRANFGKAVLTGTRFSVANLSRANLSGALFEGPLDFDRAFLFLTR 204
Query: 217 KQAL 220
+ L
Sbjct: 205 IEGL 208
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 40/88 (45%), Gaps = 5/88 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----N 184
++ D +D SG++ A LEKA +A+ GA R+ + +
Sbjct: 72 SHLVDTDFASTDLSGAELTAANLEKATLVRASLAGAKADKANFSRVEAYRGDFSGISAEG 131
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ V + L R+D GA + GADF A +
Sbjct: 132 ALFVSSELQRADFTGARLTGADFEKAEL 159
>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
Length = 275
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 69/152 (45%), Gaps = 35/152 (23%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESD--------- 141
+D + EAE R +F S + F A++R K N +ANF AD+R+ D
Sbjct: 85 SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140
Query: 142 -FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR------ 194
F G+ ++VA KA+F GA + D ++R+ LN AN +A + + L R
Sbjct: 141 IFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGANFQDARMRQAKLDRVKAQNA 200
Query: 195 --------------SDLGGAIIEGADFSDAVI 212
SDL GA + G DF A++
Sbjct: 201 NFSGADFSGVRLVSSDLTGANLTGVDFDGALL 232
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 62/124 (50%), Gaps = 9/124 (7%)
Query: 99 AETRG---EFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY--- 151
AE RG E G + DL++A+ NF+ ++F + +DFSGS F+GA
Sbjct: 50 AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109
Query: 152 --LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
LEKA KANF ADL D ++ + NEA A + + TRS A +GA D
Sbjct: 110 VDLEKANLNKANFQDADLRDGDLNTVEANEAIFDGADMRNVLFTRSVANKASFKGAKMDD 169
Query: 210 AVID 213
A ++
Sbjct: 170 ANLE 173
Score = 43.9 bits (102), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 20/124 (16%)
Query: 93 DLNKYEAETRGEFGIGSAAQFGSADLR-----KAVHVKENFR------ANFTSADMRESD 141
DLN EA + A F AD+R ++V K +F+ AN D+ ++
Sbjct: 131 DLNTVEA---------NEAIFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGAN 181
Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
F ++ A L++ A ANF+GAD S + L ANLT +L R+ L GA
Sbjct: 182 FQDARMRQAKLDRVKAQNANFSGADFSGVRLVSSDLTGANLTGVDFDGALLRRTRLAGAD 241
Query: 202 IEGA 205
+ GA
Sbjct: 242 LSGA 245
>gi|357014784|ref|ZP_09079783.1| hypothetical protein PelgB_35370 [Paenibacillus elgii B69]
Length = 843
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 14/119 (11%)
Query: 114 GSADLR--KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL---- 167
G AD++ KAV + A SAD++ F + + A L Y A FTG DL
Sbjct: 140 GLADIQATKAVVQTDLTWAYMASADLKSVSFEDADLSHADLSGCNLYGALFTGDDLKLSH 199
Query: 168 ----SDTL----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
S TL M+ +V++ A+ TNAV+ LT S+L G + GAD +DA+I+ AQ Q
Sbjct: 200 TVFASATLSYARMNEIVIDSADFTNAVMTNVYLTNSNLQGNSLTGADMTDALINGAQFQ 258
>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
8327]
gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 439
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 71/128 (55%), Gaps = 6/128 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S + G A+L + N + ++F SAD+ +++ +G+ G +A KAN GA+L
Sbjct: 279 SEEKLGDANLEEVDLSNANLKQSDFESADLDKANLAGANLAGGNFSRADMEKANLKGANL 338
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANG 226
++DR + +A+L+NA L L + L GA ++GAD ++A + D ++A K G
Sbjct: 339 EGAVLDRAFMKQADLSNANLRNANLFGAMLSGANLDGADLTNASLFDANLEKASLK---G 395
Query: 227 TNPITGVS 234
TN +TG +
Sbjct: 396 TN-LTGAN 402
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 14/152 (9%)
Query: 109 SAAQFGSADLRKA----VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
S A A+LRKA ++K RA+ AD+ E+ + A+L+ A +AN +G
Sbjct: 81 SGASLDQANLRKANLSMTYLK---RADLKKADLSEAWMVSANLRDAFLKDARLSRANLSG 137
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQALCKY 223
+L + L +ANL +A L T R++L G + A F +AV++ A K
Sbjct: 138 TNLRWAKLWDADLGQANLKDANLFETSFERANLKGTLFTKARFLENAVMNDA------KV 191
Query: 224 ANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 255
+N T +G + ++ R PS+P
Sbjct: 192 SNNTVIPSGEPASRGWAMRHNSRFVQEEPSAP 223
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 16/128 (12%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-T 170
F SADL KA N NF+ ADM +++ G+ GA L++A +A+ + A+L +
Sbjct: 303 FESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLSNANLRNAN 362
Query: 171 LMDRMV--------------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L M+ L +ANL A L T LT ++L G + GA S + + +
Sbjct: 363 LFGAMLSGANLDGADLTNASLFDANLEKASLKGTNLTGANLIGINLTGAAISSSTLTPSG 422
Query: 217 KQALCKYA 224
K A +A
Sbjct: 423 KPATRSWA 430
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 6/90 (6%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
D S + A L+ A +AN + ADLS +D+ L +ANL+ L R L ++DL A
Sbjct: 54 DLSKANLEDANLDGANLSEANLSKADLSGASLDQANLRKANLSMTYLKRADLKKADLSEA 113
Query: 201 IIEGADFSDAVIDLAQKQALCKYAN--GTN 228
+ A+ DA + K A AN GTN
Sbjct: 114 WMVSANLRDAFL----KDARLSRANLSGTN 139
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANFTGADLSDT 170
DL KA N AN + A++ ++D SG+ + A L KA + Y +A+ ADLS+
Sbjct: 54 DLSKANLEDANLDGANLSEANLSKADLSGASLDQANLRKANLSMTYLKRADLKKADLSEA 113
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
M ANL +A L L+R++L G + A DA DL Q
Sbjct: 114 WMV-----SANLRDAFLKDARLSRANLSGTNLRWAKLWDA--DLGQ 152
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 5/113 (4%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
++ D+ DFS + + G + NF GAD+ + ++ ANL + V V +
Sbjct: 2434 DYAGIDISGQDFSNASYKGKDFTQV---NTNFEGADVRGVSFEDTSMDNANLKDIVAVGS 2490
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 241
+S + +E DF+DA I + +C + GTNP TG TR SL C
Sbjct: 2491 YFGQSLVDVKTLENGDFTDATIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 2543
>gi|298245086|ref|ZP_06968892.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297552567|gb|EFH86432.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 394
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 11/120 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
L +N Y+++ R A DLR+A + RAN A++RE+ + A
Sbjct: 247 LYKINLYKSDLR-------EANLSKTDLREA----DISRANLYKANLRETFLLKANLYEA 295
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L +A +AN + A+LS T + R L +ANL+ A L+ L+R DL GA + ADFS A
Sbjct: 296 DLHRANLSEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADLSKADFSGA 355
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 18/117 (15%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE 153
N Y+A+ RG AD KA N R AN A++RE+D S A+L
Sbjct: 206 NLYKADLRG------------ADFSKATLCGANLREANLCEANLREADIS-----RAFLY 248
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
K YK++ A+LS T + ++ ANL A L T L +++L A + A+ S+A
Sbjct: 249 KINLYKSDLREANLSKTDLREADISRANLYKANLRETFLLKANLYEADLHRANLSEA 305
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A S DL+ +F AN AD+R +DFS + GA L +A +AN AD+
Sbjct: 183 SQADMKSMDLKGVKAHNIDFSGANLYKADLRGADFSKATLCGANLREANLCEANLREADI 242
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S + ++ L +++L A L +T L +D+ A + A+ + +
Sbjct: 243 SRAFLYKINLYKSDLREANLSKTDLREADISRANLYKANLRETFL 287
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 95 NKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL 152
N YEA+ R S A A+L K + N +AN + AD+ ++ S +GA L
Sbjct: 291 NLYEADLHRANL---SEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADL 347
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
KA AN +GA+LS ++ +LN+AN+ A+
Sbjct: 348 SKADFSGANLSGANLSGATLNEAILNKANIQQAL 381
>gi|37520785|ref|NP_924162.1| hypothetical protein gll1216 [Gloeobacter violaceus PCC 7421]
gi|35211780|dbj|BAC89157.1| gll1216 [Gloeobacter violaceus PCC 7421]
Length = 287
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 63/130 (48%), Gaps = 9/130 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
F + A ADL ++V++K + R A AD+R + G+ +G+ LE A K
Sbjct: 137 FAVLPFADLSGADLSRSVNLKRADLRGARLVGADLRGAFLHGANLSGSRLEAADLMKVAL 196
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQK 217
GA+LS + R L A+L A L RT L +DL GA +EGAD A ++ A
Sbjct: 197 AGANLSGADLSRANLRAAHLEGADLRRTNLGEADLAGAFLRGARLEGADLRRARLEGADL 256
Query: 218 QALCKYANGT 227
+ C GT
Sbjct: 257 E--CAATEGT 264
>gi|418020640|ref|ZP_12659878.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
gi|347604005|gb|EGY28733.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
Length = 148
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 21/116 (18%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFN------------GAYLEKAVAY 158
A AD+R+ + ADMRE+ G K N GA L +
Sbjct: 9 ATLNDADMREV---------DLVGADMREAKLIGKKTNLEGANLSGADLQGAELYHTILI 59
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
KA + ADLS+ ++R+ L EANL +A+L T L + L A +EG + DAV+++
Sbjct: 60 KAVLSWADLSNAKLERVNLREANLYHAILEETSLYITKLENANLEGVNLKDAVLEV 115
>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
Length = 351
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 84/197 (42%), Gaps = 42/197 (21%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
N+ YA+ R F +L AA+ + N L+ N EA IG S +Q
Sbjct: 10 NKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE------------------ 153
ADL AV + N A T + ++D SG+ +GA L
Sbjct: 68 LSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTC 127
Query: 154 ------------------KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
++V A+ TGA+L+ +++ + L+ ANLT A L+R L +
Sbjct: 128 LLNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQG 187
Query: 196 DLGGAIIEGADFSDAVI 212
+L GA + GAD S++VI
Sbjct: 188 NLSGANLTGADLSESVI 204
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL ++V NF AN T A++ ++ +G+ NGA L +A +AN T A+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLTRANL 249
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 49/101 (48%), Gaps = 13/101 (12%)
Query: 109 SAAQFGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
S A A L + VH+ + N AN T AD+ ES S F A L A AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
LN ANLT A L R LTR++L G ++ AD
Sbjct: 229 ----------LNGANLTRANLTRANLTRANLNGLTLQSADL 259
>gi|158316060|ref|YP_001508568.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
gi|158111465|gb|ABW13662.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
Length = 411
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 6/95 (6%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN T A++ ++D +G++ A L A+ ++A TGA L + L A+LTNAVL
Sbjct: 287 RANLTDAELVDADLTGARLADATLAGALLFRATLTGAQLGRADLTGAQLGGADLTNAVLD 346
Query: 189 RTVLTRSDLGG-----AIIEGADFSDAVIDLAQKQ 218
+L + L G A ++GAD + A LAQKQ
Sbjct: 347 EAILADAVLSGANLTNARLDGADLT-AATGLAQKQ 380
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 60/128 (46%), Gaps = 18/128 (14%)
Query: 114 GSADLRKAVHVKENFRANFTSADMRES--DFSGSKFNGAYLEKAVAYKANFT-------- 163
G ADL A + N T AD R + DF+G + L +A +AN T
Sbjct: 240 GHADLPGAPSLAHLTLTNATLADARLAGVDFTGGSLDDVDLARADLRRANLTDAELVDAD 299
Query: 164 --GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA-QKQAL 220
GA L+D + +L A LT A L R+DL GA + GAD ++AV+D A A+
Sbjct: 300 LTGARLADATLAGALLFRATLTGA-----QLGRADLTGAQLGGADLTNAVLDEAILADAV 354
Query: 221 CKYANGTN 228
AN TN
Sbjct: 355 LSGANLTN 362
>gi|411118568|ref|ZP_11390949.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410712292|gb|EKQ69798.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 321
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 56/105 (53%), Gaps = 6/105 (5%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
AA A+L +A+ N AN T A++ E+ S ++ GA L++A KAN T ADLS
Sbjct: 194 AANLSGANLGRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKANLTEADLS 253
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
L+ A L AV+V V L AI+ GADFSDA ID
Sbjct: 254 WASFRGTNLSAATLHKAVMVDVV-----LDAAILRGADFSDATID 293
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF + D+ + + GA L KAV ++A+ TGA+L D + + L NLT A L
Sbjct: 16 NFDTVDLSGVNLRQADLRGASLRKAVLFEADLTGANLVDVELHGVALRHTNLTAACLAGV 75
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L +DL A + AD S A
Sbjct: 76 KLVGADLSAAQLVRADLSGA 95
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 61/126 (48%), Gaps = 21/126 (16%)
Query: 109 SAAQFGSADLRKA-----------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
SAAQ ADL A +H R N +A++ E+D + ++ + A L +A
Sbjct: 83 SAAQLVRADLSGANLWRSLLRNANLHAANLERTNLHAANLVEADLTTARLSHANLAEANL 142
Query: 158 YKANFTGADL----------SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A+ TGA L S + + + L +A+L AVLV L+R++L A + GA+
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202
Query: 208 SDAVID 213
A+++
Sbjct: 203 GRALLE 208
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 6/93 (6%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTG 164
A+ A++R A + E +AN T AD+ + F G+ + A L KAV A G
Sbjct: 225 ARLSLAEMRGAKLDQAELTKANLTEADLSWASFRGTNLSAATLHKAVMVDVVLDAAILRG 284
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
AD SD +D LN+++LT +L VL S L
Sbjct: 285 ADFSDATIDPACLNQSSLTWVILPSGVLQISSL 317
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A LR V+ F R+ D+ ++D + L +A AN +GA+L
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L++ + L ANLT A L+ L+ +++ GA ++ A+ + A
Sbjct: 203 GRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKA 245
>gi|393766611|ref|ZP_10355166.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
gi|392727929|gb|EIZ85239.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
Length = 448
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 184
A F A MR +D SG+ +GA +A + A+F+GAD DT+ +D L +ANLT+
Sbjct: 133 ARFGQAAMRFADLSGALLDGASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTH 192
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A LT++ L G+ + GA F+ A +D
Sbjct: 193 ADFEGASLTKASLAGSRLRGAKFTGAKLD 221
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 75/182 (41%), Gaps = 16/182 (8%)
Query: 74 TALAAAVVASCSSNISAL----ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
TALAA A + L ADL++ E A A+LR+A R
Sbjct: 41 TALAAGGTAPADAESGGLPLAEADLSRARIEE---------ADLSGANLRRASLTGAVGR 91
Query: 130 AN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ F A + E+D S + +GA VA + F A L D + + A+L+ A+L
Sbjct: 92 STRFVGAILEETDLSEADMSGADFTGIVAGQVKFASAMLEDARFGQAAMRFADLSGALLD 151
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNP-ITGVSTRKSLGCGNSRR 246
+DL GA GAD D V D +A AN T+ G S K+ G+ R
Sbjct: 152 GASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTHADFEGASLTKASLAGSRLR 211
Query: 247 NA 248
A
Sbjct: 212 GA 213
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEANLTN 184
A+F A + ++ +GS+ GA A A+ +GADLSDT + R+ L A
Sbjct: 193 ADFEGASLTKASLAGSRLRGAKFTGAKLDGADLSGADLSDTDLVRLNLATCRLRHARFAG 252
Query: 185 AVLVRTVLTRSDLGGAIIE 203
A L T ++ LGGA+ E
Sbjct: 253 AWLNGTRMSVEQLGGAVGE 271
>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 351
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 84/197 (42%), Gaps = 42/197 (21%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
N+ YA+ R F +L AA+ + N L+ N EA IG S +Q
Sbjct: 10 NKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE------------------ 153
ADL AV + N A T + ++D SG+ +GA L
Sbjct: 68 LSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTC 127
Query: 154 ------------------KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
++V A+ TGA+L+ +++ + L+ ANLT A L+R L +
Sbjct: 128 LLNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQG 187
Query: 196 DLGGAIIEGADFSDAVI 212
+L GA + GAD S++VI
Sbjct: 188 NLSGANLTGADLSESVI 204
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL ++V NF AN T A++ ++ +G+ NGA L A +AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLTGANL 249
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 48/101 (47%), Gaps = 13/101 (12%)
Query: 109 SAAQFGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
S A A L + VH+ + N AN T AD+ ES S F A L A AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
LN ANLT A L R LT ++L G ++ AD
Sbjct: 229 ----------LNGANLTGANLTRANLTGANLNGLTLQSADL 259
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + + E D SG+ GA +L + AN TGADLS++++ ANLT
Sbjct: 157 ANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSESVIQNSNFCIANLTG 216
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
A L LT ++L GA + GA+ + A
Sbjct: 217 ANLTGANLTGANLNGANLTGANLTRA 242
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 20/151 (13%)
Query: 78 AAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSA 135
A+++ +C N S L D A TR S A A+L +++ + + AN T A
Sbjct: 121 ASLIGTCLLNGSQLTDAILVGATLTRSVL---SGAHMTGANLNRSILSEIDLSGANLTGA 177
Query: 136 -----DMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLTNA 185
+ + + SG+ GA L ++V +NF TGA+L+ + LN ANLT A
Sbjct: 178 TLIRVHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGA 237
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L TR++L GA + G A + LA
Sbjct: 238 NL-----TRANLTGANLNGLTLQSADLRLAN 263
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 159
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTRANLTGANLNGLTLQSADLRLANLSKADLRG 271
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|86605838|ref|YP_474601.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554380|gb|ABC99338.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 158
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N AD+R +D S + GA L A ++AN GADLS + L+ A L A L R
Sbjct: 55 NLQEADLRGADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHGAYLWEAKLTRA 114
Query: 191 VLTRSDL-----GGAIIEGADFSDAVI 212
L SDL GGA++ GAD S A++
Sbjct: 115 QLQGSDLSGAKIGGAVLTGADLSGAIL 141
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 8/118 (6%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FR 129
V L A + + + L+ +N EA+ RG A SA+L A N +
Sbjct: 31 LVRATLQGANLRGANLSFGKLSGINLQEADLRG-------ADLSSANLMGANLRGANLWE 83
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
AN AD+ +D + +GAYL +A +A G+DLS + VL A+L+ A+L
Sbjct: 84 ANLIGADLSFADLREANLHGAYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLSGAIL 141
>gi|300023195|ref|YP_003755806.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
51888]
gi|299525016|gb|ADJ23485.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
51888]
Length = 282
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 60/112 (53%), Gaps = 2/112 (1%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
FG+ + + F ADL A+ N + F R +D SG+ +GA L +A + F+
Sbjct: 149 FGVFAGSNFAGADLTDAISAPLN-KTGFIEYIWR-TDLSGANLSGAQLTRANMTQTRFSF 206
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
A L D + +L EA+L+ AVL L+ +DL GA + GAD + A +D A+
Sbjct: 207 AVLRDASLHDTILREADLSGAVLTGADLSGADLTGADLSGADVTGANLDGAK 258
>gi|119487930|ref|ZP_01621427.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455506|gb|EAW36644.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 276
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A SADL A + N R N T++ + E+ G+ AYL +A NFT ADL
Sbjct: 38 SGANLISADLSHANLCQTNLRGINLTNSTLSEARLRGADLCDAYLSEA-----NFTRADL 92
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S+ + L EANLT+A LV T L ++L A ++ A+ S+A
Sbjct: 93 SEAQLLNAYLKEANLTHAQLVNTNLNGANLSNAKLQNANLSNA 135
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
ANFT AD+ E+ + A L A N GA+LS+ L ANL+NA L+
Sbjct: 84 EANFTRADLSEAQLLNAYLKEANLTHAQLVNTNLNGANLSNA-----KLQNANLSNANLL 138
Query: 189 RTVLTRSDLGGAIIEGADFS 208
TVLT +L GA + GA+ +
Sbjct: 139 NTVLTGVNLTGANLNGANLT 158
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 50/114 (43%), Gaps = 10/114 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A F ADL +A + A A++ + + NGA L A AN + A+L
Sbjct: 83 SEANFTRADLSEA----QLLNAYLKEANLTHAQLVNTNLNGANLSNAKLQNANLSNANLL 138
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+T++ + L ANL A L L R +L G I D L+QK L +
Sbjct: 139 NTVLTGVNLTGANLNGANLTGVELCRVNLNGTQI------DENTQLSQKWLLVQ 186
>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 830
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 55/104 (52%), Gaps = 16/104 (15%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 164
A G ADLR+A + NF +A AD+R ESDF + A +A +ANF+G
Sbjct: 227 ADLGGADLRRADLSRANFSQARLRQADLRQVLFSESDFRHADARRADFREATLRQANFSG 286
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
ADLS R + + +LT V +++L GA++EGAD S
Sbjct: 287 ADLS-----RAIFSGTDLTGGVF-----QQANLAGAVLEGADLS 320
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 2/111 (1%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
E R + +G Q + D + + K+ N D+R +F+ S+ +G L+ A
Sbjct: 134 EAREQIAMGQVQQALAGD--RNLQGKDLSTLNLAGLDLRGVNFADSRLHGVNLQGANLRG 191
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+F+ ADL + L EA L +A L R L +DLGGA + AD S A
Sbjct: 192 ADFSRADLMHADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRA 242
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 63/143 (44%), Gaps = 33/143 (23%)
Query: 90 ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFR---------------- 129
ALADL + +R F S A+ ADLR+ + + +FR
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF+ AD+ + FSG+ G ++A A GADLS R+ L +V+
Sbjct: 282 ANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLEGADLS-----RLA-----LAGVKMVK 331
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L S+L GA + G D +DA +
Sbjct: 332 ANLAGSNLYGADLRGVDLTDASL 354
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 11/114 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A ADLR+A V N A+ AD+ ++FS ++ A L + + +
Sbjct: 202 ADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQADLRQVLFSE 261
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
++F AD L +AN + A L R + + +DL G + + A+ + AV++
Sbjct: 262 SDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLE 315
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 53/109 (48%), Gaps = 9/109 (8%)
Query: 109 SAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFSG--SKFNGAYLEKAVAYKA 160
S A F A+L AV + + AN T+A++ +D + S G L A KA
Sbjct: 410 SQADFTGANLTAAVFSEAIMAGAKLLEANLTNANLDGADLTSRVSMIRG-NLTNASLQKA 468
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ GADLS+ ++ VL EANL L L R+DL A I AD S+
Sbjct: 469 DLHGADLSNAIVTGAVLREANLRRVRLSHASLNRADLSWATIVDADLSN 517
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 77/180 (42%), Gaps = 39/180 (21%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
VF LA AV+ + ALA + +A G G A DL A ++ +
Sbjct: 303 VFQQANLAGAVLEGADLSRLALAGVKMVKANLAGSNLYG--ADLRGVDLTDASLLEADLS 360
Query: 130 A-NFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTGADL- 167
A + A + ++ F+G +GA L AVA +A+FTGA+L
Sbjct: 361 AADLAGARLDKAVFAGGTLHGARLLSAVARNADFRAANLTRVAAQQADFSQADFTGANLT 420
Query: 168 ----SDTLMDRMVLNEANLTNAVL-----------VRTVLTRSDLGGAIIEGADFSDAVI 212
S+ +M L EANLTNA L +R LT + L A + GAD S+A++
Sbjct: 421 AAVFSEAIMAGAKLLEANLTNANLDGADLTSRVSMIRGNLTNASLQKADLHGADLSNAIV 480
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+ AD+ E+D +K A L +A A+ GADL + R ++A L A L
Sbjct: 196 RADLMHADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQADLR 255
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
+ + + SD A ADF +A +
Sbjct: 256 QVLFSESDFRHADARRADFREATL 279
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 1/82 (1%)
Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DLR A V N R A+ AD+ +D + A L ++ N T ADLS ++
Sbjct: 554 DLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDLRWVNLTDADLSGAILSGA 613
Query: 176 VLNEANLTNAVLVRTVLTRSDL 197
LN+A+ AV LTR+ L
Sbjct: 614 SLNDADFNRAVFAEANLTRASL 635
>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 740
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+LR N R A+ AD+R +D G+ F GA L +A Y+AN T
Sbjct: 575 ANLAHANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITE 634
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + + R+ N ++L +A L+R L++S L A ++GA+ S +
Sbjct: 635 GNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSG 144
L +N A RG G A ADLR A NF RANF A++ E +F+G
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
+ ++ A DLS + + L ANL+ + L T TR+DL A G
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSNAKFNG 699
Query: 205 ADFSDAVI 212
AD S +I
Sbjct: 700 ADLSFTLI 707
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)
Query: 111 AQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
+QF DLR K V++K +F ADMRE + G L KAN + A
Sbjct: 430 SQFQGQDLRQKNLKGVNLKT---IDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAI 486
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
L+ + + L AN+ A LV+T L R+DL + A + A + A ++ C
Sbjct: 487 LNGSKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 95 NKYEAE-TRGEFGIGSAAQ--FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
N Y+A T G F + + F +DLR A ++ + ++ SA ++ ++ S S G
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
+A A F GADLS TL+ L+ A+LTNA L + L S+ G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736
>gi|303289212|ref|XP_003063894.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454962|gb|EEH52267.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 124
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 22/122 (18%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+T M+ ++FS S +G L ANFTGADLS+ AN+ L T+
Sbjct: 12 YTKGSMKRANFSNSNLSGVTLFGGDLSYANFTGADLSN----------ANIGQCNLTGTI 61
Query: 192 LTRSDLGGAIIEGA-----------DFSDAVIDLAQKQALC-KYANGTNPITGVSTRKSL 239
T ++L GAI+ GA D++D ++ +C K +G NP+TG T +L
Sbjct: 62 FTNANLSGAIVSGANMDELGDITGSDWTDVIVRKDVNDKICAKGVSGENPVTGNPTAMTL 121
Query: 240 GC 241
C
Sbjct: 122 FC 123
>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 740
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+LR N R A+ AD+R +D G+ F GA L +A Y+AN T
Sbjct: 575 ANLAHANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITE 634
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + + R+ N ++L +A L+R L++S L A ++GA+ S +
Sbjct: 635 GNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSG 144
L +N A RG G A ADLR A NF RANF A++ E +F+G
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
+ ++ A DLS + + L ANL+ + L T TR+DL A G
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSNAKFNG 699
Query: 205 ADFSDAVI 212
AD S +I
Sbjct: 700 ADLSFTLI 707
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)
Query: 111 AQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
+QF DLR K V++K +F ADMRE + G L KAN + A
Sbjct: 430 SQFQGQDLRQKNLKGVNLKT---IDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAI 486
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
L+ + + L AN+ A LV+T L R+DL + A + A + A ++ C
Sbjct: 487 LNGSKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 95 NKYEAE-TRGEFGIGSAAQ--FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
N Y+A T G F + + F +DLR A ++ + ++ SA ++ ++ S S G
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
+A A F GADLS TL+ L+ A+LTNA L + L S+ G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736
>gi|334121546|ref|ZP_08495612.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333454932|gb|EGK83604.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 388
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 62/118 (52%), Gaps = 19/118 (16%)
Query: 111 AQFGSADLRKAVHVKENFR-ANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+L KA +K N ANF + A ++E+D + ++ GA L KA AN T
Sbjct: 143 AVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTGAELSKANLAGANLTR 202
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
A+LS +ANL A L RT LT++ L GA + G+D S+A +D A LCK
Sbjct: 203 ANLS----------KANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRAN---LCK 247
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL A N A N A++ ++ +G+ N A L A+ AN
Sbjct: 71 SRANLSKADLSGANLTGANLMAASLSGANLIGANLTGANLAGAHLNWANLTGAILPNANL 130
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
GAD+S + + VL EANL+ A L++ A + GA+F DA + LA
Sbjct: 131 IGADMSAANLTKAVLTEANLSKAYLIK----------ANLNGANFQDAYLSLA 173
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ADM ++ + + A L KA KAN GA+ D + L EA+LT A L
Sbjct: 128 ANLIGADMSAANLTKAVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTG 187
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L++++L GA + A+ S A
Sbjct: 188 AELSKANLAGANLTRANLSKA 208
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 8/108 (7%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AQ A+L KA N RAN + A++ +++ + AYL A G+DLS+
Sbjct: 183 AQLTGAELSKANLAGANLTRANLSKANLLKANLRRTNLTQAYLNGAC-----LIGSDLSE 237
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
+DR L +A+L+ L L S L G GAD S +DL++K
Sbjct: 238 ACLDRANLCKADLSKTYLRNITLNGSHLSGINFSGADLSG--VDLSRK 283
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A DL + + N AN + A + E++ SG+ + A L ++AY N
Sbjct: 271 SGADLSGVDLSRKLLTGINMAEALLNEANLSGAYLMEANLSGANLSKANL--SLAYLIN- 327
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ADLS++ + + L++ANL+ A L + LT ++L GAI+ AD + A
Sbjct: 328 --ADLSNSCLHEINLSKANLSKASLQKADLTGANLRGAILTEADLTGA 373
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN AD+ ++ NG++L NF+GADLS + R +L N+ A+L
Sbjct: 242 RANLCKADLSKTYLRNITLNGSHLS-----GINFSGADLSGVDLSRKLLTGINMAEALLN 296
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
L+ + L A + GA+ S A + LA
Sbjct: 297 EANLSGAYLMEANLSGANLSKANLSLA 323
>gi|347735787|ref|ZP_08868588.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
gi|346920906|gb|EGY01818.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
Length = 451
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 9/93 (9%)
Query: 117 DLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
DLR A+ VK + N AD+ E++ SG+K +GA L +A+ AN + A L
Sbjct: 178 DLRGAIFVKADLSGSDLTGCNLEGADLSEANLSGTKLDGAVLTRALLRSANLSKASLLGA 237
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
L+D + L+ ANLT A LVR + D+ G I E
Sbjct: 238 LLDDVDLSMANLTGADLVRRL---DDIEGTIGE 267
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 69/155 (44%), Gaps = 51/155 (32%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
L+D + EA+ R SA FG ADLR+A R N T AD+R G+ F GA
Sbjct: 69 LSDADLSEADLR------SACLFG-ADLRRATLE----RTNLTRADLR-----GAAFRGA 112
Query: 151 YLEKAVAYKANFT--------------------GADLSDTLMDRMVLNEANLTNAVLVRT 190
+ + V +A+ A+L++ M + L+ A L+NA +V+T
Sbjct: 113 SMRRVVMVEADLRDGHLMRSKNGELTPNVQGNPSAELAEASMTKADLSYAKLSNAFVVQT 172
Query: 191 VLTRSDLGGAI---------------IEGADFSDA 210
L +DL GAI +EGAD S+A
Sbjct: 173 DLRDTDLRGAIFVKADLSGSDLTGCNLEGADLSEA 207
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 64/140 (45%), Gaps = 25/140 (17%)
Query: 82 ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSA----- 135
A+ S + A+ADL+ A+ RG A+ ADLR A + AN +A
Sbjct: 316 ANLSGTVLAMADLSM--ADLRG-------AELAGADLRGACLERATLNEANLANAVACPM 366
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV-----LVRT 190
D+R + F+ KA + N GADL+ ++ L++ NL NAV L
Sbjct: 367 DLRNGHEWPTNFS-----KARMVRVNLAGADLTMARLEDGDLSQGNLRNAVLAGACLTDA 421
Query: 191 VLTRSDLGGAIIEGADFSDA 210
LT +DL GA I ADF A
Sbjct: 422 TLTMADLRGADIRNADFRGA 441
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 44/103 (42%), Gaps = 25/103 (24%)
Query: 131 NFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
N ++A +R++ G+ +G A L A A GADL ++R LNEANL NA
Sbjct: 302 NLSAAILRQTTLKGANLSGTVLAMADLSMADLRGAELAGADLRGACLERATLNEANLANA 361
Query: 186 V--------------------LVRTVLTRSDLGGAIIEGADFS 208
V +VR L +DL A +E D S
Sbjct: 362 VACPMDLRNGHEWPTNFSKARMVRVNLAGADLTMARLEDGDLS 404
>gi|123968240|ref|YP_001009098.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
gi|123198350|gb|ABM69991.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
Length = 157
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F +DL+ A F D+++++ S + A L A N + ++L +
Sbjct: 33 ADFSGSDLKGAT---------FYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNLREV 83
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D +L+ +L+N L + + I+GADF++ + + C+ A GTNPI
Sbjct: 84 TLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIRKFCESATGTNPI 143
Query: 231 TGVSTRKSLGC 241
T TR++L C
Sbjct: 144 TNRETRETLEC 154
>gi|334117749|ref|ZP_08491840.1| stress protein [Microcoleus vaginatus FGP-2]
gi|333460858|gb|EGK89466.1| stress protein [Microcoleus vaginatus FGP-2]
Length = 578
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 57/105 (54%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S+A +A L + + N + AN S +++ +D + +GA L KA+ Y A A+L
Sbjct: 312 SSANLANAKLIQVNLIGSNLQGANLNSTNLQSADLIEANLSGANLTKAILYYARLIHANL 371
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S + L++ANLT A L R LT++ LG A + GAD S + +
Sbjct: 372 SQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADLSQSKV 416
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 1/101 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L A + N +AN + A + +++ + + + A L +A AN TGADL
Sbjct: 352 SGANLTKAILYYARLIHANLSQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADL 411
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
S + + ++ L+ ANL+ L LT +L G + G + S
Sbjct: 412 SQSKVTKVNLSGANLSGVNLTGVSLTGVNLQGVNLSGMNLS 452
>gi|434392029|ref|YP_007126976.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428263870|gb|AFZ29816.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 532
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 11/115 (9%)
Query: 110 AAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAY 158
A Q +A+L + + NF A+ + AD+R++D SG+ GA L A
Sbjct: 310 ATQLNNANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANLT 369
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
GADLS+ + +LN A L NA++ +T LT +D A + GAD +A+ D
Sbjct: 370 GVELDGADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNATLTGADLKEAIGD 424
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 17/117 (14%)
Query: 111 AQFGSADLRKAV-HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 163
A ADL++A+ NF AN A + F GS F A L KANFT
Sbjct: 411 ATLTGADLKEAIGDSLTNFTGANLNGASLEVGSFIGSNFTDAALRDTNLIKANFTDALFI 470
Query: 164 --------GADL-SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
GADL S T +D + +N + NA+LV LT+++ GA + GA+ S A+
Sbjct: 471 DGSDANSVGADLTSSTFIDGIAIN-GDFRNALLVNANLTKANFTGANLAGANLSGAI 526
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 6/117 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F +A++ +S G+ F+ E A+ +GADL D + L ANL+ A L
Sbjct: 309 FATQLNNANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANL 368
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVID--LAQKQAL--CKYANGTNPITGVSTRKSLG 240
L +DL A + GA + AV+D L QK L + N T +TG ++++G
Sbjct: 369 TGVELDGADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNAT--LTGADLKEAIG 423
>gi|163797791|ref|ZP_02191737.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
gi|159176913|gb|EDP61479.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
Length = 427
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 70/157 (44%), Gaps = 38/157 (24%)
Query: 92 ADLNK---YEAETRGEFGIGS---AAQFGSADLRKAVHVKENF-------RANFTSADMR 138
ADLN A+ RG F GS A ADLR + N R+N +DM
Sbjct: 78 ADLNHALLIRADLRGAFMRGSNLAGANLKEADLRGGALISGNLAAPATIIRSNIGQSDMD 137
Query: 139 ESDFSGSKFN----------GAYLEKAV----------AYKANFTGADLSDTLMD--RMV 176
E+D G+ + GA LEK + AN GADLS + R++
Sbjct: 138 EADMGGANLSGTDLSHSSMIGATLEKTLLCGANLSGVNLEGANLQGADLSGANLSSARII 197
Query: 177 ---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L+ ANL+ A++ RT +S+L GAI+E D S A
Sbjct: 198 GANLSGANLSGALIHRTQFQKSELHGAILENVDLSTA 234
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 16/115 (13%)
Query: 116 ADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
ADLR+A+ V R +N + AD+R+S+ +G GA L A A+ T
Sbjct: 294 ADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDSELAGINLAGANLTNARIAGADLTS 353
Query: 165 ADLSD---TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+L R+ + +NL+ AVLV LT + L GA ++GAD + A + AQ
Sbjct: 354 VELKGPDGQATGRLWV--SNLSGAVLVNADLTGARLTGANLKGADLTGAKLARAQ 406
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 3/94 (3%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + ++ ++ G+ +GA L A AN +GA+LS L+ R ++ L A+L
Sbjct: 169 ANLSGVNLEGANLQGADLSGANLSSARIIGANLSGANLSGALIHRTQFQKSELHGAILEN 228
Query: 190 TVLTRSDLGGAII---EGADFSDAVIDLAQKQAL 220
L+ +DL GA + +G S ++ D+ + A+
Sbjct: 229 VDLSTADLSGANLTSGDGRGLSRSLRDILHEHAV 262
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%)
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A K TG D+SD + L EA L +AV+ RT L SDL G+ + GAD D+
Sbjct: 273 RAQLAKTELTGIDVSDVNLSGADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDS 329
>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 517
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F +A+LR+A N A+F+ A++R +D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
++ + L A+L+ A L+R + +DL GA + GA
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKL 286
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
AD+RES + FNGA L A KAN AD ++ + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+NA L +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 23/144 (15%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
TR + A+ +A+L KA+ + AN AD+ E+ + A L +A K
Sbjct: 72 TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128
Query: 160 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 204
ANFT GADL ++ + + N ANL+ A L T T++DL GA +
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188
Query: 205 ADFSDAVIDLAQKQALCKYANGTN 228
ADF++A + +QA YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208
>gi|428218432|ref|YP_007102897.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990214|gb|AFY70469.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 403
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 51/102 (50%), Gaps = 14/102 (13%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A ADL+ VK AN T A + ++D S GAYL A +AN GA
Sbjct: 14 ASLTRADLKGVDLVK----ANLTGASLSDADLSQVNLTGAYLNGADLNRANLAGA----- 64
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+L+EANL A L+R L R+ L AI+ GA+F +A +
Sbjct: 65 -----ILDEANLAAAFLIRANLQRASLNEAILAGANFHEASL 101
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 5/82 (6%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN + A++R +D +G+ + A + A ++ + TGA+L D+ LN A+L NA L
Sbjct: 158 KANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANL-----DQTNLNAADLVNASLD 212
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
L+R++LG A + G +A
Sbjct: 213 GAFLSRANLGWANLIGTTMKEA 234
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVA-----YK 159
A A L +A+ NF AN SAD+ +D +G+ GA L A +
Sbjct: 79 ANLQRASLNEAILAGANFHEASLTGANLRSADLSLADLAGADLAGANLSDACMNSAFFIE 138
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
AN GADLS T + L +ANL+ A L LT +DL A + GA+
Sbjct: 139 ANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSHATMTGAEL 186
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 1/95 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A S +++ A+ V+ N AN +A++ ++ + NGA L +A +AN +GA L
Sbjct: 302 SGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSGASL 361
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
+ + L ANL L+ L ++L GAI+
Sbjct: 362 TGANLKGAFLLWANLKGTFLLWANLDEANLTGAIL 396
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
AN + AD+ ++ G+ +GA L + A+ + N GA+L++ + L +ANL
Sbjct: 283 NANLSGADLSNTNLMGTSLSGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLN 342
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L L R++L GA + GA+ A +
Sbjct: 343 GANLGEANLNRANLSGASLTGANLKGAFL 371
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 10/89 (11%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN----- 184
AN T A + ++ +G+ NGA L AN +GADLS+T + L+ A+L++
Sbjct: 259 ANLTGAFLMGANLNGANLNGANL-----TNANLSGADLSNTNLMGTSLSGADLSSTEMKG 313
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A+LVRT L ++L A + GA+ A ++
Sbjct: 314 AILVRTNLNGANLANANLTGANLEQANLN 342
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 15/90 (16%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN T+A + +D G KAN TGA LSD L++ NLT A L
Sbjct: 8 KANLTNASLTRADLKGVDL----------VKANLTGASLSDA-----DLSQVNLTGAYLN 52
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
L R++L GAI++ A+ + A + A Q
Sbjct: 53 GADLNRANLAGAILDEANLAAAFLIRANLQ 82
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 41/83 (49%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
AN AD+ + G+ A L A A+ TGADLS M L++ +LT A L
Sbjct: 137 IEANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANL 196
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
+T L +DL A ++GA S A
Sbjct: 197 DQTNLNAADLVNASLDGAFLSRA 219
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 118 LRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGADLSDTL 171
LR A K N AN SAD+ +D S + GA L + A + N ADL +
Sbjct: 151 LRGASLAKANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANLDQTNLNAADLVNAS 210
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+D L+ ANL A L+ T + ++L GA + A+ ++
Sbjct: 211 LDGAFLSRANLGWANLIGTTMKEANLVGADLSWANLNE 248
>gi|411117892|ref|ZP_11390273.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711616|gb|EKQ69122.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 577
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 58/128 (45%), Gaps = 26/128 (20%)
Query: 111 AQFGSADLRKAVHVKENF-----------RANFTSADMRE---------------SDFSG 144
AQ A+LR+A V N +AN T AD+ +D S
Sbjct: 165 AQLDEANLREATLVGTNLNEASLIGAYLRQANLTEADLHRVVLSSADLSEAILANADLSR 224
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
+ GAYL KA +KA+ ADL D + R L+EANL A L R L+ + L I+
Sbjct: 225 ANLAGAYLLKASFHKAHLLRADLQDVYLLRADLSEANLRGANLQRADLSGAYLNHTILSE 284
Query: 205 ADFSDAVI 212
AD S+A +
Sbjct: 285 ADLSEAYL 292
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D SD SG+ +G L A +AN T A+LS +++ +L ANL A L L+ +
Sbjct: 16 DFSHSDLSGANLSGFNLRGANFTEANLTEANLSWAFLNQAILTGANLRRADLRNASLSGA 75
Query: 196 DLGGAIIEGADFSDAVIDLAQKQ 218
DL AI+ GA+ S + LAQ Q
Sbjct: 76 DLNHAILHGANLSKIDLRLAQLQ 98
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 41/81 (50%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A ++ + + ++ GA L +A +AN GA+L + L+EANL A LV
Sbjct: 120 AKLDQVNLERAKLNSAQLKGAELMEANLRRANLAGANLDQANLREAQLDEANLREATLVG 179
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T L + L GA + A+ ++A
Sbjct: 180 TNLNEASLIGAYLRQANLTEA 200
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 46/111 (41%), Gaps = 25/111 (22%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLM------- 172
AN AD+R + SG+ N A L A K AN A L D M
Sbjct: 60 ANLRRADLRNASLSGADLNHAILHGANLSKIDLRLAQLQQANLNWATLQDADMGGANLAF 119
Query: 173 --------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+R LN A L A L+ L R++L GA ++ A+ +A +D A
Sbjct: 120 AKLDQVNLERAKLNSAQLKGAELMEANLRRANLAGANLDQANLREAQLDEA 170
>gi|157413067|ref|YP_001483933.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9215]
gi|157387642|gb|ABV50347.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
str. MIT 9215]
Length = 157
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 9/131 (6%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F +DL+ A F D+++++ S + A L A N + ++L +
Sbjct: 33 ADFSGSDLKGAT---------FYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNLREV 83
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+D +L+ +L+N L + + I+GADF++ + + C+ A GTNPI
Sbjct: 84 TLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGTNPI 143
Query: 231 TGVSTRKSLGC 241
T TR++L C
Sbjct: 144 TNRDTRETLEC 154
>gi|119491336|ref|ZP_01623390.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
gi|119453500|gb|EAW34662.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
Length = 122
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 45/75 (60%), Gaps = 6/75 (8%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H + NF AN T AD+R+SD S ++ GA LE AN TGA+LS T + + L +A+L
Sbjct: 47 HAQLNF-ANLTHADLRDSDLSHAQLIGATLE-----GANLTGANLSHTNLSQANLKQADL 100
Query: 183 TNAVLVRTVLTRSDL 197
T A L T+ + S L
Sbjct: 101 TEATLQDTIYSHSTL 115
>gi|428305945|ref|YP_007142770.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247480|gb|AFZ13260.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 273
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 77/167 (46%), Gaps = 23/167 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL + V N A SAD+ +D S + N AYL A AN
Sbjct: 123 SGASLLGADLSRINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLANLSNANL 182
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
TGA+LS + L+ A+L+NA L L ++L A + GAD S+AV +A +
Sbjct: 183 TGANLSGS-----ELHIADLSNANLSEAQLNSAELNNANLLGADLSNAVF----AEANLR 233
Query: 223 YANGT-NPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 268
N T N I+ + ++G G G+ +S +L P L DRD
Sbjct: 234 GTNLTSNQISSANLEGAIGLG------EGASASTVLD-QPTILEDRD 273
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S Q ADL V N + N A++R++D G A L +A A+ GADL
Sbjct: 78 SRVQLSGADL-----VDANLNSSNLIQANLRDTDMLGVDLREANLSEADLSGASLLGADL 132
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
S R+ L ANL+NA L + +DL A + + +DA + LA
Sbjct: 133 S-----RINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLA 175
>gi|428298482|ref|YP_007136788.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235026|gb|AFZ00816.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 567
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 9/100 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A+ SADL RA+F A++R +DFSG+ N A A ANF+ ADL+
Sbjct: 83 SDAKLNSADLS---------RADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADLA 133
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ + L N +NA + T L R +L G + GAD S
Sbjct: 134 NADFSGLDLYGVNFSNAKMRGTRLDRVNLSGVNLSGADLS 173
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 1/96 (1%)
Query: 116 ADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL RK + + + AN +D+R +D S +K N A L +A Y+AN D S ++
Sbjct: 55 ADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGANLNS 114
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L NA L +D G + G +FS+A
Sbjct: 115 ANFRNADLRNANFSNADLANADFSGLDLYGVNFSNA 150
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 63/140 (45%), Gaps = 30/140 (21%)
Query: 92 ADLNK---YEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSG- 144
ADL++ Y+A R G ++A F +ADLR A NF++AD+ +DFSG
Sbjct: 90 ADLSRADFYQANLRNTDFSGANLNSANFRNADLRNA---------NFSNADLANADFSGL 140
Query: 145 ---------SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+K G L++ N +GADLS + L NL L R L+ +
Sbjct: 141 DLYGVNFSNAKMRGTRLDRVNLSGVNLSGADLSG-----IDLRNVNLRGINLTRINLSHA 195
Query: 196 DLGGAIIEGADFSDAVIDLA 215
+L G G D +A + A
Sbjct: 196 NLIGFDFRGTDLRNANLSYA 215
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
D SG+ + L++A Y AN +DL +T + LN A+L+ A + L +D GA
Sbjct: 51 DLSGADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGA 110
Query: 201 IIEGADFSDA 210
+ A+F +A
Sbjct: 111 NLNSANFRNA 120
>gi|170751525|ref|YP_001757785.1| pentapeptide repeat-containing protein [Methylobacterium
radiotolerans JCM 2831]
gi|170658047|gb|ACB27102.1| pentapeptide repeat protein [Methylobacterium radiotolerans JCM
2831]
Length = 456
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 184
A F A MR +D SG+ +G A + A+F+GAD DT+ +D L +ANLT+
Sbjct: 141 ARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTH 200
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A LT++ L G+ + GA F+ A +D
Sbjct: 201 ADFAEASLTKASLAGSRLRGAHFTGAKLD 229
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + A M E+D SG+ GA L AV FTGA +++ L+EA+L+ A
Sbjct: 71 ADLSRARMEEADLSGANLRGASLTGAVGRSTRFTGA-----ILEAADLSEADLSGADFTG 125
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
V + GA++E A F +A + A
Sbjct: 126 IVAGQVKFAGAMLEDARFGEAAMRFA 151
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 47/105 (44%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L AV F A +AD+ E+D SG+ F G VA + F GA L
Sbjct: 84 SGANLRGASLTGAVGRSTRFTGAILEAADLSEADLSGADFTG-----IVAGQVKFAGAML 138
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
D + A+L+ A+L T +DL GA GAD D V
Sbjct: 139 EDARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVF 183
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEANLTN 184
A+F A + ++ +GS+ GA+ A A+ +GADLSDT + R+ L A
Sbjct: 201 ADFAEASLTKASLAGSRLRGAHFTGAKLDGADLSGADLSDTDLVRLNLATCRLRHARFAG 260
Query: 185 AVLVRTVLTRSDLGGAIIE 203
A L T ++ LGGA+ E
Sbjct: 261 AWLNGTRMSVEQLGGAVGE 279
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 15/85 (17%)
Query: 131 NFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
+F AD+ +DFSG+ F GA L++A AN T AD + EA+LT A
Sbjct: 162 DFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTHADFA----------EASLTKA 211
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
L + L + GA ++GAD S A
Sbjct: 212 SLAGSRLRGAHFTGAKLDGADLSGA 236
>gi|428304969|ref|YP_007141794.1| heat shock protein DnaJ domain-containing protein [Crinalium
epipsammum PCC 9333]
gi|428246504|gb|AFZ12284.1| heat shock protein DnaJ domain protein [Crinalium epipsammum PCC
9333]
Length = 242
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 131 NFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
N + AD++E DFSG + + A L A +K N GA+L + R L +ANL+NA
Sbjct: 128 NMSGADLKEKDFSGRNLSDANLSHANLSDAFLHKVNLQGANLYKANLFRANLLQANLSNA 187
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
L L +DL GA + GAD + A I K
Sbjct: 188 CLREANLIGADLSGADLRGADLTGAKIGFNDK 219
>gi|428313200|ref|YP_007124177.1| pentapeptide repeat protein,protein kinase family protein
[Microcoleus sp. PCC 7113]
gi|428254812|gb|AFZ20771.1| pentapeptide repeat protein,protein kinase family protein
[Microcoleus sp. PCC 7113]
Length = 464
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 48/85 (56%), Gaps = 6/85 (7%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
FR N + +++E++ SG F A L K NF GADLSD + LN+ANL NA L
Sbjct: 326 FR-NISGLNLQEANLSGGLFYSAKLAKT-----NFQGADLSDAYFGQANLNQANLRNANL 379
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVI 212
T + +DL GA ++GAD A +
Sbjct: 380 GGTSFSNADLSGADLQGADLRFAYL 404
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 51/109 (46%), Gaps = 26/109 (23%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A FG A+L +A N R AN +D SG+ GA L A KAN GA+L
Sbjct: 360 SDAYFGQANLNQA-----NLRNANLGGTSFSNADLSGADLQGADLRFAYLSKANLKGANL 414
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
EANL+NA ++ GA + GA+ S+A+I AQ
Sbjct: 415 C----------EANLSNA----------NIKGANLCGANLSNAIITEAQ 443
>gi|411117186|ref|ZP_11389673.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410713289|gb|EKQ70790.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 544
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 59/123 (47%), Gaps = 11/123 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A +LR+A + N R AN T A++R +D SG+ + A L A AN TG +L
Sbjct: 173 SGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNL 232
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
S ANL +LV LTR+ L GA G+D S A + A+ + ++ T
Sbjct: 233 S----------YANLLGTILVHADLTRASLIGADWAGSDLSGATLTGAKLHGVLRFGVKT 282
Query: 228 NPI 230
I
Sbjct: 283 EGI 285
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 4/135 (2%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A DL+ A + N AN + A ++ + F+ + + A L ++ +GADL
Sbjct: 118 SFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANLSQANLHGTDLSSSDLSGADL 177
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-CKYAN- 225
S T + + L+ ANL A L L +DL GA + AD S A + A + YAN
Sbjct: 178 SYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNLSYANL 237
Query: 226 -GTNPITGVSTRKSL 239
GT + TR SL
Sbjct: 238 LGTILVHADLTRASL 252
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+N + A++ ++ + +K +GA L KA +AN A+L+ + L +A+L A + R
Sbjct: 50 SNLSEANLSKAKLNVAKLSGANLSKANLEEANLNVANLTLADLSHAELRQASLVRAEMAR 109
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
L+ ++L A + G D DA + +QA +AN
Sbjct: 110 AELSEANLSFANLSGVDLKDAKL----RQANLSHAN 141
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDF 142
C SN+S A+L+K + S A A+L +A ++V A+ + A++R++
Sbjct: 48 CGSNLSE-ANLSK----AKLNVAKLSGANLSKANLEEANLNVANLTLADLSHAELRQASL 102
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
++ A L +A AN +G DL D + + L+ AN++ A L T ++L A +
Sbjct: 103 VRAEMARAELSEANLSFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANLSQANL 162
Query: 203 EGADFS 208
G D S
Sbjct: 163 HGTDLS 168
>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 517
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 1/96 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F +A+LR+A N A+F+ A++R +D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
++ + L A+L+ A L+R + +DL GA + GA
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGA 284
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 28/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTS 134
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 135 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 174
AD+RES + FNGA L A KAN AD ++ + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+NA L +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 23/144 (15%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
TR + A+ +A+L KA+ + AN AD+ E+ + A L +A K
Sbjct: 72 TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128
Query: 160 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 204
ANFT GADL ++ + + N ANL+ A L T T++DL GA +
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188
Query: 205 ADFSDAVIDLAQKQALCKYANGTN 228
ADF++A + +QA YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 50/102 (49%), Gaps = 3/102 (2%)
Query: 112 QFGSADLRKAVHVKENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + R LN A L+NA L + +L ++ + A + AD ++A
Sbjct: 68 EVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEA 109
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 55/128 (42%), Gaps = 22/128 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ++DLR+ N T A++ + S + A L +A AN ADL+
Sbjct: 57 SVANLSASDLREV---------NLTRANLNVARLSNANLTKAILNQATINVANLVRADLT 107
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
EA L N +L+R L R+ L A A+ + A DL + + NG N
Sbjct: 108 ----------EAQLINTLLIRAELVRAKLSKANFTQANLNGA--DLRESKLQQTNFNGAN 155
Query: 229 PITGVSTR 236
++G + R
Sbjct: 156 -LSGANLR 162
>gi|157803630|ref|YP_001492179.1| hypothetical protein A1E_02245 [Rickettsia canadensis str. McKiel]
gi|157784893|gb|ABV73394.1| Uncharacterized low-complexity protein [Rickettsia canadensis str.
McKiel]
Length = 956
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 7/113 (6%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+A++ KA+ K N AN T A + ++ +K + A LEKA A G +++D +
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
M EAN NA++ R LT+++L AI+E AD A +D K+A K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 7/117 (5%)
Query: 121 AVHVKENFRANFTSADMRESDFSGSKFNG------AYLEKAVAYKANFTGADLSDTLMDR 174
A+ K + N S R + FS ++F A L A+ + N A+++ L+D+
Sbjct: 510 ALEAKFKKQCNMKSITARNAYFSDAEFENILSLEEADLRNAIMERVNLVNANMNKALLDK 569
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNPI 230
L ANLT A+L + L A +E A+ + D K K AN N I
Sbjct: 570 ANLEYANLTGAILTDASAQFAKLSNATLEKAEAEGLNIADAIAKNMNAKEANFKNAI 626
Score = 38.5 bits (88), Expect = 3.1, Method: Composition-based stats.
Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 26/147 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
L A++ S+ + L++ +AE G I A + K ++ KE ANF +A
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG-LNIADA-------IAKNMNAKE---ANFKNA 625
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
M+ +D + A LEKA+ A+ A+ D + EANL A L L R
Sbjct: 626 IMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLARI 675
Query: 196 DLGGAIIEGADFSDAVIDLAQKQALCK 222
+ GADF A +D A K K
Sbjct: 676 NKA-----GADFDQAKVDDATKMHYTK 697
>gi|347755497|ref|YP_004863061.1| putative low-complexity protein [Candidatus Chloracidobacterium
thermophilum B]
gi|347588015|gb|AEP12545.1| putative low-complexity protein [Candidatus Chloracidobacterium
thermophilum B]
Length = 419
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 60/123 (48%), Gaps = 9/123 (7%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 167
A SA LR A V+ N AN AD+ ++ G+ GA L +A AN GADL
Sbjct: 57 ANLASASLRDAFLVRANLEGANLRGADLESANLEGANLRGADLSRANLEGANLEGADLTG 116
Query: 168 ----SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCK 222
S L+D L A L NAV L + LGGA + DF +A+++ A ++AL
Sbjct: 117 ARLPSAQLID-AKLGVATLENAVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLT 175
Query: 223 YAN 225
AN
Sbjct: 176 GAN 178
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 1/99 (1%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F +ADLR A N A +F +A + ++F + GA L AV +A GADLS
Sbjct: 137 AVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLTGANLRDAVLRRAVLPGADLSG 196
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
++R VL A+L+ L+ + GA ++GA FS
Sbjct: 197 AKLERAVLEGADLSQVSLLEADCRHATFQGARLKGAKFS 235
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 45/92 (48%), Gaps = 14/92 (15%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGA 165
A+ G A L AV F +AD+R + G+ + A+ ANF TGA
Sbjct: 127 AKLGVATLENAV---------FANADLRNAYLGGANLTAVDFQNAILEAANFEEALLTGA 177
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+L D ++ R VL A+L+ A L R VL +DL
Sbjct: 178 NLRDAVLRRAVLPGADLSGAKLERAVLEGADL 209
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRE-----SDFSGSKFNGAYLEKAVAYK 159
A +A+LR+A N RAN SA +R+ ++ G+ GA LE A
Sbjct: 32 ANLDNANLRRADLEGANLEEASLRRANLASASLRDAFLVRANLEGANLRGADLESANLEG 91
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN GADLS ++ L A+LT A L L + LG A +E A F++A
Sbjct: 92 ANLRGADLSRANLEGANLEGADLTGARLPSAQLIDAKLGVATLENAVFANA 142
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 55/111 (49%), Gaps = 19/111 (17%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
ADLR+ K AN +A++R +D G A LE+A +AN A L D + R
Sbjct: 22 ADLRELDLAK----ANLDNANLRRADLEG-----ANLEEASLRRANLASASLRDAFLVRA 72
Query: 176 VLNEANLTNAVL---------VRTV-LTRSDLGGAIIEGADFSDAVIDLAQ 216
L ANL A L +R L+R++L GA +EGAD + A + AQ
Sbjct: 73 NLEGANLRGADLESANLEGANLRGADLSRANLEGANLEGADLTGARLPSAQ 123
>gi|379022817|ref|YP_005299478.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
gi|376323755|gb|AFB20996.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
Length = 956
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 7/113 (6%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+A++ KA+ K N AN T A + ++ +K + A LEKA A G +++D +
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
M EAN NA++ R LT+++L AI+E AD A +D K+A K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 7/117 (5%)
Query: 121 AVHVKENFRANFTSADMRESDFSGSKFNG------AYLEKAVAYKANFTGADLSDTLMDR 174
A+ K + N S R + FS ++F A L A+ + N A+++ L+D+
Sbjct: 510 ALEAKFKKQCNMKSITARNAYFSDAEFENILSLEEADLRNAIMERVNLVNANMNKALLDK 569
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNPI 230
L ANLT A+L + L A +E A+ + D K K AN N I
Sbjct: 570 ANLEYANLTGAILTDASAQFAKLSNATLEKAEAEGLNIADAIAKNMNAKEANFKNAI 626
Score = 38.5 bits (88), Expect = 3.2, Method: Composition-based stats.
Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 26/147 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
L A++ S+ + L++ +AE G I A + K ++ KE ANF +A
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG-LNIADA-------IAKNMNAKE---ANFKNA 625
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
M+ +D + A LEKA+ A+ A+ D + EANL A L L R
Sbjct: 626 IMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLARI 675
Query: 196 DLGGAIIEGADFSDAVIDLAQKQALCK 222
+ GADF A +D A K K
Sbjct: 676 NKA-----GADFDQAKVDDATKMHYTK 697
>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 332
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 13/121 (10%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA 155
Y A+ RG I ADLR A +K N R AN ++RE+D G+ +GA L A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201
Query: 156 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+ N GA+L + L N L R +L+ +DL G ++GA D + A
Sbjct: 202 FLTEVNLMGANLRGAI----------LKNVKLERAILSEADLTGVNLQGAVMPDVRLSKA 251
Query: 216 Q 216
Q
Sbjct: 252 Q 252
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 57/124 (45%), Gaps = 7/124 (5%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAV------HVKENFRANFTSADMRESDFSGSK 146
L++YEA GI S ADL V H A + A+ R+++ G++
Sbjct: 7 LHRYEAGETKFTGISLSGVNLFGADLIGIVLNGADLHGATLIFAYLSRANFRKANLVGTR 66
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
+GA L +A N + ADL + L ANLT A L+ L +DL GA + GAD
Sbjct: 67 LSGANLNQAWLSGVNLSNADLHGASLQSADLRSANLTLASLLDANLMDADLRGANLSGAD 126
Query: 207 FSDA 210
+ A
Sbjct: 127 LTGA 130
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 59/130 (45%), Gaps = 12/130 (9%)
Query: 91 LADLNKYEAETRG--------EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
L ++N A RG E I S A +L+ AV + A + +
Sbjct: 203 LTEVNLMGANLRGAILKNVKLERAILSEADLTGVNLQGAVMPD----VRLSKAQVSGGNL 258
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
S ++ N A L + +AN + +DL + + R L ANL+NA L R L+ ++L GA +
Sbjct: 259 SFARLNRADLSRTNLREANLSDSDLIEAYLARTNLMGANLSNANLTRAELSTTNLMGANL 318
Query: 203 EGADFSDAVI 212
+GA D I
Sbjct: 319 QGATMPDGRI 328
>gi|436841883|ref|YP_007326261.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
14728]
gi|432170789|emb|CCO24160.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
14728]
Length = 1278
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 7/100 (7%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
AD R A K F+ + AD R++D + FNGA K V K NF GA+L R
Sbjct: 1094 ADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGA---KGV--KVNFAGANLDKLRTGR 1148
Query: 175 MV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
EA+ T A L + +DL GA+ GAD +A++D
Sbjct: 1149 NAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVD 1188
Score = 45.8 bits (107), Expect = 0.022, Method: Composition-based stats.
Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 24/135 (17%)
Query: 86 SNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKA-VH---------VKENFR-AN 131
S +S AD EA+ R F I + AD RKA VH VK NF AN
Sbjct: 1085 SMVSGKAD----EADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGAKGVKVNFAGAN 1140
Query: 132 FT------SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
+A+ E+DF+G+ + + A F GADL + L+D +L +ANL A
Sbjct: 1141 LDKLRTGRNAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVDNCMLVDANLNGA 1200
Query: 186 VLVRTVLTRSDLGGA 200
T+S+L GA
Sbjct: 1201 SAKGARFTKSNLEGA 1215
Score = 42.7 bits (99), Expect = 0.15, Method: Composition-based stats.
Identities = 24/86 (27%), Positives = 41/86 (47%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T ++ +DF + + A L + + A+FT A L +R +L A + L
Sbjct: 980 ANLTGCQLKNTDFKETCLDNAKLIQTMGRSADFTKASLKGVNFERAMLGNAIFEESDLTG 1039
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
++ G+ +GA +DAV D+A
Sbjct: 1040 AQARQASFKGSSFKGATLADAVFDMA 1065
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 7/99 (7%)
Query: 115 SADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SAD KA NF RA +A ESD +G++ A + + +F GA L+D + D
Sbjct: 1009 SADFTKASLKGVNFERAMLGNAIFEESDLTGAQARQASFKGS-----SFKGATLADAVFD 1063
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+L + + + A L + S + G E ADF +A I
Sbjct: 1064 MAILEKTDFSKANLSGARINMSMVSGKADE-ADFRNAFI 1101
>gi|163797895|ref|ZP_02191839.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
gi|159176857|gb|EDP61425.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
Length = 396
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 48/95 (50%), Gaps = 11/95 (11%)
Query: 114 GSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
G+AD + A N F +FT AD+RE DF+G+ GA A +A GADLS
Sbjct: 15 GAADGQPASFANANLFGFDFTGADLREVDFAGASLQGARFVGADLTRAVLVGADLSGVSF 74
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
VL EA+LT A LV GA+ EGAD
Sbjct: 75 RNAVLLEADLTGARLV----------GAVFEGADL 99
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 45/82 (54%), Gaps = 5/82 (6%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F A M ++ +G+KF E V + + TGA+L + R ++ A L NA+L+
Sbjct: 127 FAGARMHRANLTGAKF-----ENVVLAQTDLTGANLERASLRRASMSGAVLRNAILIDAD 181
Query: 192 LTRSDLGGAIIEGADFSDAVID 213
L+ +DL +++ GAD S A +D
Sbjct: 182 LSHADLTDSLVTGADLSGAQLD 203
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ T AD+R + SG+ GA L +AV A ADLS + L NL+ A L
Sbjct: 270 DLTDADLRSLNLSGADLRGAVLRRAVLTDALLVLADLSGADLTLASLARCNLSGANLAGA 329
Query: 191 VLTRSDLGGAIIEGAD-FSDAVIDLAQKQ 218
L+R+DL AI+ A S A D ++Q
Sbjct: 330 NLSRADLTDAILTAAPILSQAGADTGRRQ 358
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 1/108 (0%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN T A + + GA LE+A +A+ +GA L + ++ L+ A+LT++++
Sbjct: 134 RANLTGAKFENVVLAQTDLTGANLERASLRRASMSGAVLRNAILIDADLSHADLTDSLVT 193
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNPITGVST 235
L+ + L GA +E A+F A + D+ + A T P V+T
Sbjct: 194 GADLSGAQLDGATVERANFVGARLRDVDLSRVDTSKARLTPPTDSVTT 241
>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 340
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 13/155 (8%)
Query: 66 KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 116
NWR VF S L+AA ++S + +++ L +N A ++ S A G A
Sbjct: 18 NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74
Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
+L +A + N ANF AD+ + S S + A L AVA ANF A+LS T
Sbjct: 75 NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANFIMANLSGTYFSES 134
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ ANL++A L +L +++L G+ + A+F+ A
Sbjct: 135 DFSRANLSSANLTEAILVKTNLTGSYLSKANFTSA 169
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A SA+L +A+ VK N +ANFTSA++ +D S + + A + A AN
Sbjct: 137 SRANLSSANLTEAILVKTNLTGSYLSKANFTSANLSMTDLSEADLSSANMHLADLSMANL 196
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ A+L ++ + L +ANLT A L LT +DL + + GA+F A
Sbjct: 197 SSANLIGAILTDVDLRQANLTGAYLNTANLTGADLATSTLVGANFYQA 244
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 11/103 (10%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F A+LR+A + N + AN + A + ++ +G+ GA L A AN GA
Sbjct: 239 ANFYQANLREANLDRANAQNANLSEAYLSNANLTGTILEGANLSSAYISNANLVGA---- 294
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
VL A+LT A+L+ LT+++ GA ++GADF+ A++
Sbjct: 295 ------VLKGADLTGAILIGANLTKANFSGAKLDGADFTSAIM 331
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 21/120 (17%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSA----------DMRESDFSGSKFN-----GAYL 152
S ADL A +H+ + AN +SA D+R+++ +G+ N GA L
Sbjct: 172 SMTDLSEADLSSANMHLADLSMANLSSANLIGAILTDVDLRQANLTGAYLNTANLTGADL 231
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ ANF A+L + +DR AN NA L L+ ++L G I+EGA+ S A I
Sbjct: 232 ATSTLVGANFYQANLREANLDR-----ANAQNANLSEAYLSNANLTGTILEGANLSSAYI 286
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 11/103 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S + A+L AV + NF AN + ESDFS + + A L +A+ K N TG+ L
Sbjct: 102 SESNLSRANLGNAVAIAANFIMANLSGTYFSESDFSRANLSSANLTEAILVKTNLTGSYL 161
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S +AN T+A L T L+ +DL A + AD S A
Sbjct: 162 S----------KANFTSANLSMTDLSEADLSSANMHLADLSMA 194
>gi|427735932|ref|YP_007055476.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370973|gb|AFY54929.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 713
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 70/152 (46%), Gaps = 31/152 (20%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY---------- 158
S+A ADLR AV EN A+ T AD+ E+ + + GA L + VA
Sbjct: 534 SSASLAKADLRNAVL--EN--ASLTGADLGEARLNDADLYGARLGRVVAIGTQLSNANLI 589
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL---------------TRSDLGGAIIE 203
K + GADLS +DR L+ ANL+ A L +L + +DL GA +
Sbjct: 590 KTEWQGADLSSAYLDRANLSNANLSAARLTGAILRSTNLQNVNLRNADLSLADLRGANLA 649
Query: 204 GADFSDAVIDLAQKQALCKYANGTNPITGVST 235
GADF ++ Q+ K+ + P TG+ +
Sbjct: 650 GADFQGTILSARQQNPADKFVD--TPTTGIQS 679
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 14/97 (14%)
Query: 131 NFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRMV 176
+F A++ ++ F+GS+F G A L +A +AN +GA+LS LM R
Sbjct: 453 DFKYANLDKASFTGSRFRGPGKDGRWDTYDDWIANLSQAQLKQANLSGANLSRVLMVRTN 512
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
L+ +NL A L L ++L A + AD +AV++
Sbjct: 513 LSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLE 549
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 6/111 (5%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+L + + V+ N +AN ++A + ++ S + A L AV A+ TG
Sbjct: 496 ANLSGANLSRVLMVRTNLSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLENASLTG 555
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
ADL + ++ L A L V + T L+ ++L +GAD S A +D A
Sbjct: 556 ADLGEARLNDADLYGARLGRVVAIGTQLSNANLIKTEWQGADLSSAYLDRA 606
>gi|308205942|gb|ADO19342.1| pentapeptide repeat protein [Nostoc flagelliforme str. Sunitezuoqi]
Length = 146
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 10/106 (9%)
Query: 115 SADLRKAVHVKENFRANFTSA----------DMRESDFSGSKFNGAYLEKAVAYKANFTG 164
SA +R+ + +E F N T A D+R ++ G+ GA LE A AN
Sbjct: 28 SAPVRRLLETRECFGCNLTGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKY 87
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L+ + +LN ANLTN L + L SD+ GA++ D S A
Sbjct: 88 ANLTKAFVSDTILNNANLTNVNLSNSRLYNSDVDGAVMANIDLSGA 133
>gi|427715911|ref|YP_007063905.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348347|gb|AFY31071.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 589
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 6/111 (5%)
Query: 111 AQFGSADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A AD+ KA+ N F N + A + +D S +K NGA L A A F G
Sbjct: 408 ADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLNGAMFLG 467
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
ADLS + ++LN+A+L+ +L L+ +DL AI+ G D S A ++ A
Sbjct: 468 ADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRA 518
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A SA+L A + R N +SAD+ +D S + N A L A AN + ADL
Sbjct: 321 SHADLSSANLSGANLTNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSADL 380
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 212
S T + L++ANL+ L L R+DL G AI+ G + SD ++
Sbjct: 381 SHTHLFGANLSDANLSGVNLSHADLCRADLSGADMSKAILNGTNLSDTIL 430
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 9/110 (8%)
Query: 103 GEFGIGS---AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
GEF G A G A+L A NF AN + A + +++ +G F+GA L A
Sbjct: 252 GEFLRGGNFRGAYLGDANLTGA-----NFSGANLSGAYLGDANLTGVNFSGANLSGANLG 306
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
AN +GA+LS+ + L+ ANL+ A L T L R++L A + AD S
Sbjct: 307 DANLSGANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADLS 356
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 58/103 (56%), Gaps = 6/103 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A G A+L NF AN + A++ +++ SG+ + A L A AN +GA+L
Sbjct: 281 SGAYLGDANLTGV-----NFSGANLSGANLGDANLSGANLSNANLSHADLSSANLSGANL 335
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++T ++R L+ A+L++A L T L +DL A ++ A+ S A
Sbjct: 336 TNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSA 378
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 10/94 (10%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM----------DRMVLNE 179
A F AD+ D SG N A L + +A+ + ADLSD ++ +R L+
Sbjct: 463 AMFLGADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRANLSG 522
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+NL+ A+L L+ ++L AI+ GAD SDA ++
Sbjct: 523 SNLSGALLNGADLSHTNLSCAILGGADVSDANLE 556
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 44/81 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++A++ +D S + +GA L + N + ADLS + LN A+L++A L
Sbjct: 313 ANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKD 372
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ +DL + GA+ SDA
Sbjct: 373 ANLSSADLSHTHLFGANLSDA 393
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S+ SADL A ++K+ AN +SAD+ + G+ + A L A+ ADLS
Sbjct: 356 SSTNLNSADLSSA-NLKD---ANLSSADLSHTHLFGANLSDANLSGVNLSHADLCRADLS 411
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
M + +LN NL++ +L T +L AI+ AD S A ++
Sbjct: 412 GADMSKAILNGTNLSDTILFST-----NLSDAILIAADLSYAKLN 451
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 5/89 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLT 183
A+ AD+ +D S + NG L + + N + ADLS ++ LN A L
Sbjct: 402 HADLCRADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLN 461
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ + L+ DL G I+ AD S ++
Sbjct: 462 GAMFLGADLSGVDLSGVILNDADLSGVLL 490
>gi|425447182|ref|ZP_18827173.1| Genome sequencing data, contig C314 (fragment) [Microcystis
aeruginosa PCC 9443]
gi|389732326|emb|CCI03724.1| Genome sequencing data, contig C314 (fragment) [Microcystis
aeruginosa PCC 9443]
Length = 285
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 46/78 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T A++ E+ +G+ NGA LE+A A+ GA+L + ++ L EANL A L+R
Sbjct: 137 ADLTEANLTEAKLNGADLNGANLEEAKLNGADLNGANLEEAKLNGAFLEEANLKRANLIR 196
Query: 190 TVLTRSDLGGAIIEGADF 207
L S L GA ++GA+
Sbjct: 197 ANLIGSGLWGANLKGANL 214
>gi|428312148|ref|YP_007123125.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253760|gb|AFZ19719.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 223
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 49/83 (59%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + D+R +DF G+ A L +A AN GA LS +++R VLN A L++A+L
Sbjct: 26 NLSDTDLRGADFRGADLFDANLARADLSDANLGGAILSRAVLNRAVLNRAVLSSALLSNA 85
Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
L R+ L GA++ GA + A+++
Sbjct: 86 FLNRAVLCGAVLRGAILNGAILN 108
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 41/83 (49%), Gaps = 3/83 (3%)
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
E F G L A+F GADL D + R L++ANL A+L R VL R+ L
Sbjct: 14 ERSFDFPNLEGINLSDTDLRGADFRGADLFDANLARADLSDANLGGAILSRAVLNRAVLN 73
Query: 199 GAIIEGADFSDAVIDLAQKQALC 221
A++ A S+A ++ + LC
Sbjct: 74 RAVLSSALLSNAFLN---RAVLC 93
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLT 183
RA A +R + +G+ NGA L A Y AN +G ADL ++ +L EA+L
Sbjct: 89 RAVLCGAVLRGAILNGAILNGANLSGADLYHANLSGALLGYADLYHAYLNSALLREADLY 148
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A L L ++L A + GAD + A
Sbjct: 149 HAYLREANLFGANLRSANLSGADLTGA 175
>gi|428216301|ref|YP_007100766.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988083|gb|AFY68338.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 188
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++R + S FN A L A N TGA D MD L+ ANL +A L
Sbjct: 60 ANLADTNLRGASLKNSNFNRANLSWANMSWTNLTGASFMDARMDVTNLSSANLIDADLRG 119
Query: 190 TVLTRSDLGGAIIEG-----------ADFSDAV-IDLAQKQALCKYANGTNPITGVSTRK 237
L ++L G + G ADFS +D + LC A G +P T STR
Sbjct: 120 ANLQGANLRGTNLRGTQIEPLRSIDNADFSRVKNLDQRVRVYLCSIATGAHPFTKNSTRA 179
Query: 238 SLGCGNS 244
+L C NS
Sbjct: 180 TLECNNS 186
>gi|254409695|ref|ZP_05023476.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183692|gb|EDX78675.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 350
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 62/118 (52%), Gaps = 5/118 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A +ADL+ + RA + AD+R ++ G+ F AYL +A AN TGADLS
Sbjct: 144 SRANLKAADLQGVIL----NRAILSQADLRGANLRGACFIRAYLHRADLRDANLTGADLS 199
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA-LCKYAN 225
D + L+ ANL+ A L L+ ++L GA + GA +A + LA L K AN
Sbjct: 200 DADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSGLLLKKAN 257
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 36/136 (26%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL----------------- 152
A ADL+ A+ ++ +A+ T+A +RE+D SG+ GA L
Sbjct: 66 ADLSKADLKNALLIEATLSQADLTAAILREADLSGAILTGATLLDADLRHATLIGTSLID 125
Query: 153 ---EKAVAYKANFTG----------ADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTR 194
++A KAN TG ADL +++R +L++ ANL A +R L R
Sbjct: 126 AKMKRAKLAKANCTGASFSRANLKAADLQGVILNRAILSQADLRGANLRGACFIRAYLHR 185
Query: 195 SDLGGAIIEGADFSDA 210
+DL A + GAD SDA
Sbjct: 186 ADLRDANLTGADLSDA 201
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 47/82 (57%), Gaps = 5/82 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N AD+ E++ S + A+L++A KA GA L D L++A+L NA+L+
Sbjct: 27 NLIRADLTEANLSRINLSAAHLQRANLAKAKLIGAQLKDA-----DLSKADLKNALLIEA 81
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L+++DL AI+ AD S A++
Sbjct: 82 TLSQADLTAAILREADLSGAIL 103
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 55/106 (51%), Gaps = 11/106 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A ADL+ A N RAN + A++ ++ +G+ GA+L+ A AN +G
Sbjct: 194 TGADLSDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSG--- 250
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
++L +ANL +A L + L R++L A + GA+ +A ++
Sbjct: 251 -------LLLKKANLQSAQLSKANLNRANLYKANLSGANLLEANLE 289
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 48/90 (53%), Gaps = 10/90 (11%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + +++++ ++ + A L +A YKAN +GA+L + ++ L E+NL A L+
Sbjct: 246 ANLSGLLLKKANLQSAQLSKANLNRANLYKANLSGANLLEANLEHANLAESNLQRAGLLL 305
Query: 190 TVLTRSDLG----------GAIIEGADFSD 209
LT ++L GA + GAD SD
Sbjct: 306 AYLTDANLSHANLNGANLIGANLMGADLSD 335
>gi|316934318|ref|YP_004109300.1| pentapeptide repeat-containing protein [Rhodopseudomonas palustris
DX-1]
gi|315602032|gb|ADU44567.1| pentapeptide repeat protein [Rhodopseudomonas palustris DX-1]
Length = 273
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A N +RA+ + A++ +D SG+ +GA L +A + AN +GADL
Sbjct: 57 SGANLSGADLSGANLSGANLYRADLSGANLSGADLSGANLSGANLYRAKLFSANLSGADL 116
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S L+ ANL A L L R+DL GA + GAD S A
Sbjct: 117 SGA-----NLSGANLYRADLSGANLYRADLSGANLSGADLSGA 154
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + AD+ ++ SG+ +GA L A Y+A GA+LS + L+ ANL+ A L R
Sbjct: 20 NLSGADLSGANLSGADLSGANLSGANLYRAKLFGANLSGANLSGADLSGANLSGANLYRA 79
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L+ ++L GA + GA+ S A
Sbjct: 80 DLSGANLSGADLSGANLSGA 99
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 6/95 (6%)
Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 162
S A ADL A N +RA SA++ +D SG+ +GA L +A Y+A+
Sbjct: 82 SGANLSGADLSGANLSGANLYRAKLFSANLSGADLSGANLSGANLYRADLSGANLYRADL 141
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+GA+LS + L+ ANL+ A V L R+ +
Sbjct: 142 SGANLSGADLSGANLHRANLSGAKGVDLSLARTRI 176
>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
Length = 367
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL +A + + AN + A++ E+D S + +GA L +A AN +GA+L
Sbjct: 98 SGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSGANL 157
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S+ + R L+ ANL A L L+ +DL GA + GA+ S+A
Sbjct: 158 SEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEA 200
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + AD+ +D SG+ +GA L +A AN +GA+LS+ + R L+ ANL+
Sbjct: 155 ANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANLSR 214
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
A L L+ +DL GA + GA+ S+A
Sbjct: 215 ADLSGANLSEADLSGANLSGANLSEA 240
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + AD+ +D SG+ + GA L +A AN +GA+LS+ + R L+ ANL
Sbjct: 195 ANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRR 254
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
A L L R+DL GA + AD S+A
Sbjct: 255 ADLSGANLRRADLSGANLRRADLSEA 280
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL +A N AN + A++ E+D S + +GA L +A AN
Sbjct: 123 SGANLSEADLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRRANLSGANL 182
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ ADLS + L+EA+L+ A L L+R+DL GA + AD S A
Sbjct: 183 SEADLSGANLSGANLSEADLSRADLSGANLSRADLSGANLSEADLSGA 230
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 16/118 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 165
S A ADL +A N R AN + A++ E+D SG+ +GA L +A +A+ +GA
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 212
Query: 166 -------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
DLS + L+EA+L+ A L L R+DL GA + AD S A
Sbjct: 213 SRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRRADLSGANLRRADLSGA 270
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 1/100 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL +A N RA+ + A++ E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 252
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ L A+L+ A L R L+ ++L A + GAD
Sbjct: 253 RRADLSGANLRRADLSGANLRRADLSEANLSEANLSGADL 292
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 17/128 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN---- 184
RA+ + A++ E+D SG+ +GA L +A +A+ +GA+L R L+ ANL+
Sbjct: 134 RADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLR-----RANLSGANLSEADLS 188
Query: 185 ------AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 238
A L L+R+DL GA + AD S A +L++ +G N +R
Sbjct: 189 GANLSGANLSEADLSRADLSGANLSRADLSGA--NLSEADLSGANLSGANLSEADLSRAD 246
Query: 239 LGCGNSRR 246
L N RR
Sbjct: 247 LSGANLRR 254
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 51/91 (56%), Gaps = 10/91 (10%)
Query: 130 ANFTSAD----------MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AN + A+ + E+D SG+ +GA L +A +A+ +GA+LS+ + L+
Sbjct: 95 ANLSGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSG 154
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ANL+ A L R L+ ++L A + GA+ S+A
Sbjct: 155 ANLSEADLSRADLSGANLRRANLSGANLSEA 185
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 42/70 (60%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
+ SG+ +GA L +A +A+ + ADLS + L+EA+L+ A L L+ +DL GA
Sbjct: 91 NLSGANLSGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGA 150
Query: 201 IIEGADFSDA 210
+ GA+ S+A
Sbjct: 151 NLSGANLSEA 160
>gi|119487545|ref|ZP_01621155.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
gi|119455714|gb|EAW36850.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
Length = 277
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 64/116 (55%), Gaps = 5/116 (4%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+ DL A ++ N AN T+A D +GS G+ + +AN T A+L++
Sbjct: 60 AKLMGVDLSDANLMEANLIGANLTNAKFDRCDLTGSNLRGSSSKLVSLTQANLTDANLTE 119
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+ ANLTNA L+RT L +++L GA++EGA+ ++ ++ ++++ + AN
Sbjct: 120 ANLAEANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVIL----RESILEGAN 171
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 15/86 (17%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T A++ E++F G+ A L + KAN TGA VL ANLTN +L
Sbjct: 115 ANLTEANLAEANFVGANLTNATLIRTNLMKANLTGA----------VLEGANLTNVILRE 164
Query: 190 TV-----LTRSDLGGAIIEGADFSDA 210
++ L + L GA++ A+F+DA
Sbjct: 165 SILEGANLIHATLSGALLISANFTDA 190
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 31/137 (22%)
Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A F A+L A ++ N AN T+ +RES G+ A L A+
Sbjct: 125 ANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVILRESILEGANLIHATLSGALLIS 184
Query: 160 ANFT----------GADLSDTLMDRM----------VLNEANLTNAVLVRTVLTRSDLGG 199
ANFT GADLSD + + L ANL+ A L RT L+ S+L G
Sbjct: 185 ANFTDADMSRVTMIGADLSDANLSGVNLRAANVSWTTLRGANLSRARLYRTKLSWSNLSG 244
Query: 200 AIIEGADFSDAVIDLAQ 216
A + A D +D A
Sbjct: 245 ANLIEAVLLDTRLDHAN 261
>gi|312194409|ref|YP_004014470.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
gi|311225745|gb|ADP78600.1| pentapeptide repeat protein [Frankia sp. EuI1c]
Length = 2027
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/83 (36%), Positives = 45/83 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T D+ ++D +G+ A L+ A AN TGA L+ R+ L ANLT+A L R
Sbjct: 1243 ADLTGLDLSDADLAGANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLTDADLRR 1302
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
LT DL G ++ G+ + A +
Sbjct: 1303 ARLTDPDLTGTVLTGSKWERAAL 1325
Score = 45.8 bits (107), Expect = 0.019, Method: Composition-based stats.
Identities = 25/71 (35%), Positives = 39/71 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T AD+ +++ +G+ GA L A + TGA+L+D + R L + +LT VL
Sbjct: 1258 ANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLTDADLRRARLTDPDLTGTVLTG 1317
Query: 190 TVLTRSDLGGA 200
+ R+ L GA
Sbjct: 1318 SKWERAALLGA 1328
>gi|392410087|ref|YP_006446694.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
gi|390623223|gb|AFM24430.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
Length = 490
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 10/95 (10%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANL 182
FRAN + A + ++F+G+ N A L KANFT ADLS+ + M R+ L+ L
Sbjct: 123 FRANLSKAIIDTANFTGANLNCANLAGNKLSKANFTKADLSEAVLTSSDMSRIQLSGNKL 182
Query: 183 TNA-----VLVRTVLTRSDLGGAIIEGADFSDAVI 212
T A VL + + R+DL GA +E AD SDA +
Sbjct: 183 TKADLSWGVLSKARIERADLTGANLERADLSDAKL 217
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
A+ + D+ E+DFSGS + + LE + K F+ +LS+T M R L++ NLT A L
Sbjct: 64 EADLSEIDLTEADFSGSNLSKSKLEGSCLKKGIFSRCNLSNTDMTRTTLSDCNLTEANLF 123
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
R L++ AII+ A+F+ A ++ A
Sbjct: 124 RANLSK-----AIIDTANFTGANLNCA 145
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 11/112 (9%)
Query: 110 AAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
+A F +A ++ AV + N + A+F+ A + DFSG+ GA L++A +
Sbjct: 314 SANFSNAQMQGAVLTRTNLQEADFQKAAAQNADFSQASGEKVDFSGAVLQGANLQEANFF 373
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
KA ADLS + + + +LT + + T+ +DL + AD S A
Sbjct: 374 KAKLERADLSSANVSKASFRDGDLTRVIALATIFVSADLQNTSFKDADVSAA 425
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 48/95 (50%), Gaps = 10/95 (10%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEA 180
E F + + A +R+SD +G+ F+ + L K++ ANF+ A + ++ R L EA
Sbjct: 276 EGFNCDLSGAAVRDSDLTGANFSSSQLVETDFSKSILVSANFSNAQMQGAVLTRTNLQEA 335
Query: 181 NL-----TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ NA + + D GA+++GA+ +A
Sbjct: 336 DFQKAAAQNADFSQASGEKVDFSGAVLQGANLQEA 370
>gi|330509039|ref|YP_004385467.1| pentapeptide repeat-containing protein [Methanosaeta concilii GP6]
gi|328929847|gb|AEB69649.1| pentapeptide repeat protein [Methanosaeta concilii GP6]
Length = 386
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA-VAY----KANF 162
S + AD +A ++ N AN ADM +D + + GA L+ A + Y KANF
Sbjct: 204 SGSDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVLTGASLKSAKMPYSDLTKANF 263
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
TGADLS+ +D +L A L NA L R L DL G + GA ++V+
Sbjct: 264 TGADLSEAYLDGAILAGATLRNAKLDRVNLREVDLRGLEMGGASLKNSVL 313
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLTN 184
A+ +D++ + +GS +GAYL A ++ G ADL+ ++ L A+LT
Sbjct: 51 AHLNQSDLQGCNLNGSNLDGAYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTG 110
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L+R ++++ L GA I AD ++A I
Sbjct: 111 ANLIRVQMSKAKLNGARIVKADLTEADI 138
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A SA + S +GS A L AV +A+ TGADL+ + R+ +++A L A +V+
Sbjct: 71 AYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTGANLIRVQMSKAKLNGARIVK 130
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
LT +D+ + + AD +DA
Sbjct: 131 ADLTEADISDSDLSDADLTDA 151
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 6/104 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A AD+ A + F RA S ++ SD S + F AYL ++N TGA++
Sbjct: 176 AHISWADMSVAYLSQGQFSRAELYSTNLSGSDLSDADFTRAYL-----MRSNLTGANIDW 230
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
M L EA LT A L + SDL A GAD S+A +D
Sbjct: 231 ADMAYADLTEAVLTGASLKSAKMPYSDLTKANFTGADLSEAYLD 274
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 1/103 (0%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+ADL AV + + A+ T A++ S +K NGA + KA +A+ + +DLSD +
Sbjct: 90 NADLTGAVLTEADLTGADLTGANLIRVQMSKAKLNGARIVKADLTEADISDSDLSDADLT 149
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L +L+ A L LT +++ GA I AD S A + Q
Sbjct: 150 DARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSVAYLSQGQ 192
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 15/98 (15%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA-------------VAY--KANFTGADLSDTLMDR 174
A+ T A + +D SG+K G YL A VAY + F+ A+L T +
Sbjct: 146 ADLTDARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSVAYLSQGQFSRAELYSTNLSG 205
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L++A+ T A L+R+ LT +++ A + AD ++AV+
Sbjct: 206 SDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVL 243
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 19/112 (16%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A ADL +AV A+ SA M SD + + F GA L +A A GA L +
Sbjct: 231 ADMAYADLTEAVLTG----ASLKSAKMPYSDLTKANFTGADLSEAYLDGAILAGATLRNA 286
Query: 171 LMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+DR+ L E A+L N+VL + +DL GAD DA +
Sbjct: 287 KLDRVNLREVDLRGLEMGGASLKNSVLTGVFMAMTDLA-----GADLRDATL 333
>gi|110597243|ref|ZP_01385531.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
gi|110341079|gb|EAT59547.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
Length = 447
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 57/108 (52%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S + F SA L +A N + NF ADM+ + G+ GA L++A A+ + +L
Sbjct: 304 SGSSFKSASLDEANLAGANLSKVNFHKADMKGAHLQGANLQGANLDRAFLKDADLSNTNL 363
Query: 168 SD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S+ T++ L ANL NA L L ++LGGA ++GA+ +DA
Sbjct: 364 SNAVLFGTILTGANLQNANLENASLFEADLEEANLGGANLKGANITDA 411
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 46/83 (55%), Gaps = 5/83 (6%)
Query: 135 ADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ADM+ +D SG + G+++++A+ AN GA+L +++ + +ANL N VL
Sbjct: 103 ADMKGTDLSGACLIKANMKGSFMKEAIFRGANLQGANLRWVMLEEADMEDANLANTVLFE 162
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L ++L GA ++ A F D +
Sbjct: 163 ANLENANLKGANLKDAVFLDQAL 185
>gi|434395496|ref|YP_007130443.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428267337|gb|AFZ33283.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 249
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 57/104 (54%), Gaps = 9/104 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A A+L+ A ++ E AN + AD+ E+D SG+ +GA L + AN + A LS
Sbjct: 128 SGANLAQANLKGA-NLTE---ANLSKADLTEADLSGADLSGATLSGVILSDANLSDAILS 183
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ VL ANL+ AVL LT +L EGA+ S+AV+
Sbjct: 184 RAILTLAVLQGANLSGAVLSGVNLTEVNL-----EGANLSNAVL 222
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 52/82 (63%), Gaps = 5/82 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + A++ +++ G+ A L KA +A+ +GADLS + ++L++ANL++A+L R
Sbjct: 126 NLSGANLAQANLKGANLTEANLSKADLTEADLSGADLSGATLSGVILSDANLSDAILSRA 185
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
+LT A+++GA+ S AV+
Sbjct: 186 ILTL-----AVLQGANLSGAVL 202
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 44/78 (56%)
Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
F+G L A +A ADLS+ ++ +L +A L+ A L RT+LT++DL A++ GA
Sbjct: 16 NFSGENLRSADLTRATLNAADLSEAILSEAILTQAELSEANLSRTILTKADLTEAVLAGA 75
Query: 206 DFSDAVIDLAQKQALCKY 223
+ A++ A+ + Y
Sbjct: 76 KLTGAILTEAELSRVNLY 93
>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
Length = 243
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 15/125 (12%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL 152
LNKY+ R F S LR+ + N + NF SAD+R+S S FNGA L
Sbjct: 7 LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57
Query: 153 EKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
++A + + N +LS ++ L+ A LTNA L L+++ L GA + A+
Sbjct: 58 KQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANLAKANL 117
Query: 208 SDAVI 212
S AV+
Sbjct: 118 SHAVL 122
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A ADL +++ N N + A +R++D SG++ A L A KA+ GA+L
Sbjct: 53 NGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANL 112
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ + VL E +L RT L R++L + A S A++
Sbjct: 113 AKANLSHAVLYEVDLRPLSNRRTNLGRANLSSTDLSYAKLSSALL 157
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 42/91 (46%), Gaps = 6/91 (6%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGAD 166
F SAD+R++ K NF A AD+ ES G+ L KA+ A T AD
Sbjct: 37 FESADIRQSRLGKSNFNGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNAD 96
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
L++ + + L ANL A L VL DL
Sbjct: 97 LTNAYLSKASLCGANLAKANLSHAVLYEVDL 127
>gi|434400818|ref|YP_007134822.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271915|gb|AFZ37856.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 209
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 31/137 (22%)
Query: 127 NF-RANFTSADMRESDFSGS---------------KFNGAYLEKAVAYKANFTGADLSDT 170
NF +AN T AD RE D + + A LE+AV Y+A+ +LS +
Sbjct: 36 NFSQANLTGADFREIDLTQAILCEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQS 95
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAI----------IEGADFSDAVIDLAQKQAL 220
++ L EANLT A+L +T L ++ L GA+ + GA+ S A++ QA
Sbjct: 96 ILTEADLREANLTEALLYKTSLGKAQLQGAVLNRAILQRTFLRGANLSQAIL----SQAN 151
Query: 221 CKYANGTNP-ITGVSTR 236
+ AN T+ +TG + R
Sbjct: 152 LQEANLTDADLTGANLR 168
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 64/131 (48%), Gaps = 1/131 (0%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF 142
C +N+S + + E + A +L +++ + + R AN T A + ++
Sbjct: 58 CEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQSILTEADLREANLTEALLYKTSL 117
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
++ GA L +A+ + GA+LS ++ + L EANLT+A L L ++L GA +
Sbjct: 118 GKAQLQGAVLNRAILQRTFLRGANLSQAILSQANLQEANLTDADLTGANLRGANLQGAFL 177
Query: 203 EGADFSDAVID 213
A+ +A ++
Sbjct: 178 VEANLFEASLE 188
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 2/81 (2%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+ R+ D S G L+ +AN TGAD + + + +L EANL+ +L+ LT++
Sbjct: 16 EFRQVDLSYRVLRGVDLQAINFSQANLTGADFREIDLTQAILCEANLSQTILIEANLTKA 75
Query: 196 DLGGAIIEGADFSDAVIDLAQ 216
+L A++ A +++L+Q
Sbjct: 76 NLERAVLYRASLQ--LVNLSQ 94
>gi|428223553|ref|YP_007107650.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427983454|gb|AFY64598.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 521
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 12/128 (9%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLE 153
E +G G FG DLR+A + N A+ + A++ ++ SG+ GA L
Sbjct: 5 ELLKRYGAGER-NFGGMDLREANLSRANLSHIDLSGADLSVANLSGANLSGADLRGARLN 63
Query: 154 KAVAYKANFTGADLSDTLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
A AN +GA+LS +++ R L ANL A L+R L R+DL A ++ AD
Sbjct: 64 VAKLSGANLSGANLSSCILNVANLVRADLTGANLNQAALIRAELMRADLKQATLDSADLG 123
Query: 209 DAVIDLAQ 216
A + AQ
Sbjct: 124 GAQLQEAQ 131
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 11/124 (8%)
Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 164
A F A+LR A + N RANF +A++R ++ S G+ +GA L A A G
Sbjct: 155 AVFDQANLRGADLNRANATRANFRNAELRLANLSEILLIGADLHGANLRWANLTGARLRG 214
Query: 165 ADLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
ADL++ + L NLT+A L+ L+R++L G GA+ + A + A+
Sbjct: 215 ADLTEAKLSGAAIVGADLRNVNLTHASLIHADLSRANLIGTDWIGAELTGATLTGAKLHG 274
Query: 220 LCKY 223
+ +Y
Sbjct: 275 VSRY 278
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 5/90 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR-----MVLNEANL 182
RA AD++++ + GA L++A ++ANF+ A+LS+ R V ++ANL
Sbjct: 103 IRAELMRADLKQATLDSADLGGAQLQEAQLHQANFSRANLSEVNFHRATLADAVFDQANL 162
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L R TR++ A + A+ S+ ++
Sbjct: 163 RGADLNRANATRANFRNAELRLANLSEILL 192
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 76 LAAAVVASCSSNISAL-------ADLNKYEAETRGEF-------GIGSAAQFGSADLRKA 121
L+ A ++SC N++ L A+LN+ A R E +A G A L++A
Sbjct: 72 LSGANLSSCILNVANLVRADLTGANLNQ-AALIRAELMRADLKQATLDSADLGGAQLQEA 130
Query: 122 VHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
+ NF NF A + ++ F + GA L +A A +ANF A+L + +
Sbjct: 131 QLHQANFSRANLSEVNFHRATLADAVFDQANLRGADLNRANATRANFRNAELRLANLSEI 190
Query: 176 V----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ L ANLT A L LT + L GA I GAD +
Sbjct: 191 LLIGADLHGANLRWANLTGARLRGADLTEAKLSGAAIVGADLRN 234
>gi|220907082|ref|YP_002482393.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863693|gb|ACL44032.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 309
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F A+L ++ N R A T AD+RE+ K N A L ++ +AN TGADL
Sbjct: 185 ADFQGANLSRSTLTGANLRGAYLTGADLREA-----KLNEANLRRSDLSQANLTGADLRG 239
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++R L ANL ++L+ L ++L A ++GA+ +AV+
Sbjct: 240 ANLNRATLRGANLRESILIGASLMGANLSQASLQGANLLEAVL 282
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 54/106 (50%), Gaps = 1/106 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A++ A+ + N A+ T A++ +++ G+ GAYL A TGA+L
Sbjct: 113 SEANLTGAEISAAILREANLTLADLTLAELSQTNLRGANLTGAYLRGAELLGTQLTGAEL 172
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
S L EA+ A L R+ LT ++L GA + GAD +A ++
Sbjct: 173 SQANFRGTNLTEADFQGANLSRSTLTGANLRGAYLTGADLREAKLN 218
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 16/103 (15%)
Query: 116 ADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
ADLR+A + N R AN T AD+R ++ + + GA L +++ A+ GA+LS
Sbjct: 210 ADLREAKLNEANLRRSDLSQANLTGADLRGANLNRATLRGANLRESILIGASLMGANLS- 268
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+A+L A L+ VLT ++L G + G D S V+
Sbjct: 269 ---------QASLQGANLLEAVLTGANLTGVDLTGVDLSATVM 302
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 14/108 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADLR A AN ++AD+R +D G A L K KAN TGADL+
Sbjct: 48 SGANLQGADLRGATLAA----ANLSNADLRGADLRGVLLMEADLRKVNLRKANLTGADLT 103
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
ANLT A L LT +++ AI+ A+ + A + LA+
Sbjct: 104 G----------ANLTGADLSEANLTGAEISAAILREANLTLADLTLAE 141
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 109 SAAQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+AA +ADLR + V + E A+ ++R+++ +G+ GA L A +AN TG
Sbjct: 63 AAANLSNADLRGADLRGVLLME---ADLRKVNLRKANLTGADLTGANLTGADLSEANLTG 119
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A++S ++ L A+LT A L +T L ++L GA + GA+
Sbjct: 120 AEISAAILREANLTLADLTLAELSQTNLRGANLTGAYLRGAEL 162
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 4/105 (3%)
Query: 122 VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
VH+ + + A++R + SG+ GA L A AN + ADL + ++L EA+
Sbjct: 30 VHLSQ---VDLQGANLRGAGLSGANLQGADLRGATLAAANLSNADLRGADLRGVLLMEAD 86
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
L L + LT +DL GA + GAD S+A + A+ A+ + AN
Sbjct: 87 LRKVNLRKANLTGADLTGANLTGADLSEANLTGAEISAAILREAN 131
>gi|428220994|ref|YP_007105164.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994334|gb|AFY73029.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 283
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 164
A +A++ A N R A A++R S +G+ F GA L +AV N T
Sbjct: 169 ANLDTANISDADLTNANLRWATLRDANLRGSILTGANGNLANFTGANLSQAVLRGINLTN 228
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
ADLS+ ++ L+ ANL A LV LT +DL GA I AD S AV+
Sbjct: 229 ADLSNAKLNAADLSNANLVGASLVGANLTSADLTGANITNADLSGAVM 276
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 45/77 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T+A++R + S + A L +A +A+ + A ++ +D +++A+LTNA L
Sbjct: 129 ANLTAANLRSASLYKSNLSLAILTQATLAEADLSDASFTEANLDTANISDADLTNANLRW 188
Query: 190 TVLTRSDLGGAIIEGAD 206
L ++L G+I+ GA+
Sbjct: 189 ATLRDANLRGSILTGAN 205
>gi|378579963|ref|ZP_09828623.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
DC283]
gi|377817422|gb|EHU00518.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
DC283]
Length = 272
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
GS A ADLR A A+ + AD+ +D SG+ GAYL A A+ +GADL
Sbjct: 24 GSRADLRGADLRGAYLRG----ADLSGADLSGADLSGADLRGAYLRDADLRGADLSGADL 79
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
SD + L +A+L A L+ +DL GA + GAD S A +
Sbjct: 80 SDADLRGAYLRDADLRGA-----DLSDADLSGAYLRGADLSGADL 119
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 51/110 (46%), Gaps = 6/110 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADLR A + R A+ + A +R +D SG+ GAYL A A+
Sbjct: 75 SGADLSDADLRGAYLRDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDADLRGADL 134
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ ADLS + L A+L A L L +DL GA + AD S A +
Sbjct: 135 SDADLSGAYLRDADLRGADLRGADLRGAYLRDADLRGADLSDADLSGAYL 184
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 55/115 (47%), Gaps = 3/115 (2%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVA 157
A+ RG + G A ADL A + R A AD+R +D SG+ + A L A
Sbjct: 32 ADLRGAYLRG--ADLSGADLSGADLSGADLRGAYLRDADLRGADLSGADLSDADLRGAYL 89
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ GADLSD + L A+L+ A L L +DL GA + AD S A +
Sbjct: 90 RDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDADLRGADLSDADLSGAYL 144
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 46/91 (50%), Gaps = 10/91 (10%)
Query: 127 NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
+ R N + AD+R G+ GAYL A A+ +GADLS + L +A+L A
Sbjct: 19 SLRQNGSRADLR-----GADLRGAYLRGADLSGADLSGADLSGADLRGAYLRDADLRGAD 73
Query: 187 LVRTVLTRSDLGGAIIE-----GADFSDAVI 212
L L+ +DL GA + GAD SDA +
Sbjct: 74 LSGADLSDADLRGAYLRDADLRGADLSDADL 104
>gi|86606854|ref|YP_475617.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555396|gb|ABD00354.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 248
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 59/128 (46%), Gaps = 15/128 (11%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD---------------LSDTLMDR 174
+NFT+A + +S F G F+ + +A AN T + LS ++
Sbjct: 109 SNFTAAKLDKSSFQGGHFSHSIFREASLVAANLTEGNFFAADFRQANLFRCNLSQAILSS 168
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
L AN A+LV L + + GA GADF+DA + ++ L + A+GTN +T
Sbjct: 169 CQLQNANFDQALLVGANLQEAQIEGASFVGADFTDAKLSDEMRKFLLERASGTNELTQRD 228
Query: 235 TRKSLGCG 242
T +L G
Sbjct: 229 TLNTLLAG 236
>gi|325106774|ref|YP_004267842.1| pentapeptide repeat-containing protein [Planctomyces brasiliensis
DSM 5305]
gi|324967042|gb|ADY57820.1| pentapeptide repeat protein [Planctomyces brasiliensis DSM 5305]
Length = 194
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 16/148 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN + AD+ E+D G+ +GA L +A +A+ GADLS + L+ ANL+ A L
Sbjct: 25 RANLSEADLSEADLRGADLSGANLSEADLSEADLRGADLSGANLSWANLSWANLSEADLS 84
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYANGTNPITGVSTRKSLGCG 242
L+ +DL A + GAD S A + A +A+ + G I S+GC
Sbjct: 85 GANLSEADLSEADLRGADLSGANLRGANLSGANLSEAVARLDFGAWSICVRKDVTSIGCR 144
Query: 243 NSRRNAYGSPSSPLLSAPPQKLLDRDGF 270
R + + L P D DGF
Sbjct: 145 TYRNDRW-------LEWTPD---DVDGF 162
>gi|428222289|ref|YP_007106459.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
gi|427995629|gb|AFY74324.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
Length = 563
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 5/97 (5%)
Query: 119 RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMD 173
RK + + + NF + D+ ++ +G+ +G + ++ + +F +DL+ +M
Sbjct: 396 RKVIVEYGHGKRNFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMT 455
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++ LN ANL A + R +LT++DLGGA + AD +A
Sbjct: 456 QVKLNGANLAQAKMQRAILTKADLGGACLNQADLREA 492
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 17/123 (13%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGS-----KF 147
E+G G F + DL KA N +F +D+ + F+G+ K
Sbjct: 401 EYGHGKR-NFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMTQVKL 459
Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
NGA L +A +A T ADL +++ L EANL +A + + L+ +DL GA ++GA
Sbjct: 460 NGANLAQAKMQRAILTKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYL 519
Query: 208 SDA 210
S A
Sbjct: 520 SQA 522
>gi|378826441|ref|YP_005189173.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
HH103]
gi|365179493|emb|CCE96348.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
HH103]
Length = 250
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 12/122 (9%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A +A+L KA V+ + +ANF+ + DFSG GA + +A+FTG
Sbjct: 88 ADLTAANLEKATLVRASLAGAKADKANFSRVEGYRGDFSGISAEGALFVSSELQRADFTG 147
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQKQ 218
A L+ ++ L AN AV+ T L+R+DL GA+ EG DF A + L + +
Sbjct: 148 ARLTGADFEKAELGRANFGKAVVTGTRFSVANLSRADLSGAVFEGPIDFDRAFLFLTRIE 207
Query: 219 AL 220
L
Sbjct: 208 GL 209
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 5/87 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN-----A 185
N D +D +G+ A LEKA +A+ GA R+ + + A
Sbjct: 74 NLVDTDFASTDLNGADLTAANLEKATLVRASLAGAKADKANFSRVEGYRGDFSGISAEGA 133
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ V + L R+D GA + GADF A +
Sbjct: 134 LFVSSELQRADFTGARLTGADFEKAEL 160
>gi|425458953|ref|ZP_18838439.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9808]
gi|389823440|emb|CCI28334.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9808]
Length = 425
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 5/120 (4%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A L A+ ++ N R A + AD+ E+D SG+ A L KA+ +A A LS+
Sbjct: 285 ANLIKAILSWAILIEANLRGAILSEADLSEADLSGANLRRANLIKAILRRAILIEAILSE 344
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
+ L ANL A+L+ +L +DL GA + A+ S+A I+ A+ A G P
Sbjct: 345 ADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFIDATGITP 400
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ + + AD+ E+D SG+ +GA L +A AN +GA+LS + L ANL A+L
Sbjct: 234 QVDLSGADLSEADLSGAILSGANLSEANLSGANLSGANLSWANLIDANLRRANLIKAILS 293
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
+L ++L GAI+ AD S+A
Sbjct: 294 WAILIEANLRGAILSEADLSEA 315
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 9/104 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADL A+ + A++ E++ SG+ +GA L A AN A+L
Sbjct: 238 SGADLSEADLSGAI---------LSGANLSEANLSGANLSGANLSWANLIDANLRRANLI 288
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ +L EANL A+L L+ +DL GA + A+ A++
Sbjct: 289 KAILSWAILIEANLRGAILSEADLSEADLSGANLRRANLIKAIL 332
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 2/103 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+LR+A +K R A A + E+D SG+ A L KA+ +A ADL
Sbjct: 313 SEADLSGANLRRANLIKAILRRAILIEAILSEADLSGANLRRANLIKAILIEAILIEADL 372
Query: 168 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 209
+ L+EA++ NA+ + T +T I GA F D
Sbjct: 373 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 415
>gi|116754331|ref|YP_843449.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
PT]
gi|116665782|gb|ABK14809.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
Length = 389
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 1/106 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A F A L A + FR + F+ A++ ++ +G+ +G+ ++ +A TGADL
Sbjct: 177 SHANFVGAHLSWADMSRSRFRESQFSRAELYGANLTGTDLSGSDFTRSYMMRARMTGADL 236
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
SD +D L EA L + L + +DL GA + GAD S+ V+D
Sbjct: 237 SDASLDYADLTEAELRDTDLSGCKMRYADLSGANLAGADISEVVLD 282
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 78/176 (44%), Gaps = 34/176 (19%)
Query: 46 SDGQFPDCSNNQCAGP---YAKLKNWRVFVSTALAAAV-VASCSSNISALADLNKYEAET 101
+D D S +G AKL+N R+ ++ + A + +A C+ + + D++ +AE
Sbjct: 99 ADLSMADLSGANLSGTDLSRAKLRNARLSGASLVNANLTMADCTEAL--MDDVSLEDAEM 156
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
G +F DL AV + ANF A + +D S S+F + +A Y A
Sbjct: 157 TG-------TRFFRTDLTGAVFSGASLSHANFVGAHLSWADMSRSRFRESQFSRAELYGA 209
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
N TG DLS + + T + ++R +T GAD SDA +D A
Sbjct: 210 NLTGTDLSGS----------DFTRSYMMRARMT----------GADLSDASLDYAD 245
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 49/111 (44%), Gaps = 16/111 (14%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 159
A A LR A V N A+ AD+ +D SG+ +G A L A
Sbjct: 74 ANLNGAYLRSAWLVNANLEGASLAGADLSMADLSGANLSGTDLSRAKLRNARLSGASLVN 133
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN T AD ++ LMD + L +A +T RT DL GA+ GA S A
Sbjct: 134 ANLTMADCTEALMDDVSLEDAEMTGTRFFRT-----DLTGAVFSGASLSHA 179
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T A++R++D SG K A L A N GAD+S+ ++D + NL+ A+L +
Sbjct: 244 ADLTEAELRDTDLSGCKMRYADLSGA-----NLAGADISEVVLDSVKTTGVNLSGAILYK 298
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
T L DL + G A +D
Sbjct: 299 TSLFNLDLRDIDMHGVQIKKAKMD 322
>gi|113477234|ref|YP_723295.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168282|gb|ABG52822.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 227
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 11/122 (9%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMR-----ESDFSGSKFNGAYLEKAVAYK 159
A+F ADL +A ++ + + N AD+ E D G+ G +A+ K
Sbjct: 90 AKFNKADLTRAKLIRADLSCADFSQVNMVDADLSRAILYEIDLHGANLYGVNFRRAILNK 149
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
A+ GA+L M + L EANLT A L +L+ +DL GA + GA+ SD + A QA
Sbjct: 150 ADLIGANLIRANMTGVDLIEANLTRANLTEAILSGADLNGASLLGANISDVNLVGAALQA 209
Query: 220 LC 221
+
Sbjct: 210 VI 211
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 5/77 (6%)
Query: 139 ESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
E +FSG A+L E A ++AN TGA+LS + R+ L +ANLT A L+ T L+
Sbjct: 19 EKNFSGLYLQEAHLLKANLEGANFFEANLTGANLSQANLSRVNLAKANLTGANLIGTDLS 78
Query: 194 RSDLGGAIIEGADFSDA 210
++L ++ GA F+ A
Sbjct: 79 EANLSDTLLVGAKFNKA 95
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRMVL 177
+AN A+ E++ +G+ + A L + KAN TGA+L SDTL+
Sbjct: 33 LKANLEGANFFEANLTGANLSQANLSRVNLAKANLTGANLIGTDLSEANLSDTLLVGAKF 92
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
N+A+LT A L+R L+ +D + AD S A++
Sbjct: 93 NKADLTRAKLIRADLSCADFSQVNMVDADLSRAIL 127
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 66/154 (42%), Gaps = 26/154 (16%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFS-----GSKFN 148
N +EA G A A+L + K N AN D+ E++ S G+KFN
Sbjct: 41 NFFEANLTG-------ANLSQANLSRVNLAKANLTGANLIGTDLSEANLSDTLLVGAKFN 93
Query: 149 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
A L +A +A+ + AD S + N+ +A L R +L DL GA + G +F
Sbjct: 94 KADLTRAKLIRADLSCADFS----------QVNMVDADLSRAILYEIDLHGANLYGVNFR 143
Query: 209 DAVI---DLAQKQALCKYANGTNPITGVSTRKSL 239
A++ DL + G + I TR +L
Sbjct: 144 RAILNKADLIGANLIRANMTGVDLIEANLTRANL 177
>gi|309792396|ref|ZP_07686863.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
DG-6]
gi|308225551|gb|EFO79312.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
DG6]
Length = 314
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 60/125 (48%), Gaps = 10/125 (8%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLRK N AN T A++R ++ S + F+GA L A N +G DL D
Sbjct: 89 ADLSDADLRKGDLAWANLEFANLTGANLRGANLSAADFSGANLYGANLSLCNLSGVDLRD 148
Query: 170 TLMDRMVLNEANLTNAVLVRTV--------LTRSDLGGAIIEGADFSDA-VIDLAQKQAL 220
T+M L EA L A LV L + LGGA ++G + S A ++ ++A
Sbjct: 149 TIMIGANLTEAQLREAQLVNLSGANLSGANLNKVSLGGASMQGVNLSGASLLSANLREAT 208
Query: 221 CKYAN 225
+ AN
Sbjct: 209 LREAN 213
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 68/146 (46%), Gaps = 13/146 (8%)
Query: 78 AAVVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKEN------FRA 130
A +V +N+S A+LNK G+ S A SA+LR+A + N + A
Sbjct: 164 AQLVNLSGANLSG-ANLNKVSLGGASMQGVNLSGASLLSANLREATLREANLIGANLYEA 222
Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
N + AD+ +D S + +G YL E A+ AN + A+LS + LN NL A
Sbjct: 223 NLSEADLSAADLSMANLSGIYLSGANLEGAILTHANLSRANLSGCNLRGAQLNGCNLREA 282
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAV 211
L LT +DL GA + D S +
Sbjct: 283 SLADADLTGADLTGADLSECDLSGVI 308
>gi|359458687|ref|ZP_09247250.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 203
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 66/146 (45%), Gaps = 23/146 (15%)
Query: 111 AQFGSADLRKAVHVKENFRA-----------NFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A F SADLRKA + + RA N A++ ++ SG+ +GA L A+ Y
Sbjct: 53 ANFASADLRKAKLFRADLRAACLYRADLRGANLKGANLFGANLSGANLSGANLSNAMLYC 112
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF--SDAV-IDLAQ 216
AN GA+L T++D L N ++ L +L + L G EG +D + I+L Q
Sbjct: 113 ANLGGANLRGTILDSANLMRVNFSHGDLRNAMLRNAKLQGTHFEGTRMLQTDLIEINLNQ 172
Query: 217 KQALCKY---------ANGTNPITGV 233
Q Y A G ITG+
Sbjct: 173 AQIDGVYLMDPDANNTAMGNTAITGI 198
>gi|428771470|ref|YP_007163260.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428685749|gb|AFZ55216.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 195
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 49/82 (59%), Gaps = 6/82 (7%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ +AD+R +D G GA L+KA AN +GADLS + L EANL+ A+L T
Sbjct: 113 DLCNADLRGADLRGVNLVGACLQKADLSNANLSGADLS-----QADLEEANLSGAILHGT 167
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
LT+++L AI+EG F D VI
Sbjct: 168 NLTQANLLCAIVEGVSF-DYVI 188
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 21/98 (21%)
Query: 130 ANFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + AD+ +S+F+GS G A LEKA+ + N GADL+ + L A+L
Sbjct: 28 ANLSGADLAQSNFTGSNLTGVNLTGANLEKAI-LRCNLRGADLTGASLQGADLRGADLRG 86
Query: 185 AVLVRT---------------VLTRSDLGGAIIEGADF 207
A+L+ + +LT DL A + GAD
Sbjct: 87 AILLSSQVENISLAGSFLAGAILTNLDLCNADLRGADL 124
>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 331
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 11/111 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 163
S A ++L KA ++ NF RAN T A + ++D SG A L A+ K N T
Sbjct: 65 SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLSG-----AILSSAIGTKTNLTAAIL 119
Query: 164 -GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
G L T + + L EANLT A L +LT S+L AI+ A S+A ++
Sbjct: 120 IGCSLVGTQLLKSKLKEANLTGASLTGAILTGSNLTRAILTRAILSNANLE 170
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 6/89 (6%)
Query: 123 HVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
H + NF +ANF A + D + A +A AN +GADLS + +++
Sbjct: 19 HGQRNFQAIKLIKANFQRASLNNIDLKMAVLKKANFNQAQLINANLSGADLSQSNLEKAQ 78
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L E N + A L L ++DL GAI+ A
Sbjct: 79 LIETNFSRANLTEASLIQADLSGAILSSA 107
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 6/101 (5%)
Query: 116 ADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
ADLR A N + N AD+ E++ S + GA L A + G +L+
Sbjct: 202 ADLRGANLEGANLQGANLEGVNLQDADLTEANLSAANLEGAVLSNANLQQVILKGTNLTG 261
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T + L +ANL+ A L + L +DL GA + GAD + A
Sbjct: 262 TNLLNANLGQANLSQANLCQAGLLFTDLTGANLMGADLTSA 302
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 52/94 (55%), Gaps = 9/94 (9%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A + DL+ AV K ANF A + ++ SG+ + + LEKA + NF+ A+L++
Sbjct: 37 ASLNNIDLKMAVLKK----ANFNQAQLINANLSGADLSQSNLEKAQLIETNFSRANLTEA 92
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
L +A+L+ A+L + T+++L AI+ G
Sbjct: 93 -----SLIQADLSGAILSSAIGTKTNLTAAILIG 121
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 10/91 (10%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
+R +H +AN AD+R +D G+ GA L+ A N ADL+
Sbjct: 180 IRAYLHRVNLKKANLEKADLRFADLRGANLEGANLQGANLEGVNLQDADLT--------- 230
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
EANL+ A L VL+ ++L I++G + +
Sbjct: 231 -EANLSAANLEGAVLSNANLQQVILKGTNLT 260
>gi|298492301|ref|YP_003722478.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298234219|gb|ADI65355.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 264
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTG 164
A ADL A ++ N AN A++ +D SG+ A YL +A YKAN T
Sbjct: 139 ANLKDADLAAAKLIRSNLSFANLVGANLITTDLSGANLYEAELMQTYLYQANLYKANLTN 198
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ L + + R L+EANLTNA L LT ++L GA + GA+ A
Sbjct: 199 SHLGSSYLFRANLSEANLTNADLTCANLTGANLRGANLRGANLRGA 244
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 66/139 (47%), Gaps = 24/139 (17%)
Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTGA 165
DL A ENFR AN ++ + DFS G+ + A L +A +AN + A
Sbjct: 30 DLSTANLQGENFRGANLQGVNLTKVDFSHALLVRTNLSGANLSIANLHQAKLIEANLSEA 89
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----DFSDAVI---DLAQK 217
+LS + L +ANL+ L+ L+ ++L GA I GA DF +A + DLA
Sbjct: 90 NLSIANLRNATLTQANLSQVNLIGADLSEANLIGAAITGANLIGTDFRNANLKDADLAAA 149
Query: 218 QAL---CKYAN--GTNPIT 231
+ + +AN G N IT
Sbjct: 150 KLIRSNLSFANLVGANLIT 168
>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 402
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 53/103 (51%), Gaps = 4/103 (3%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADL KA K NF ANFT A + E+ G+ F AYL +A AN TG +L+
Sbjct: 281 AILAGADLTKA---KANFTGANFTGAILTEAILIGANFEKAYLIRADLTGANLTGTNLTR 337
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ L ANLT A L++ +L + L I+ GA A++
Sbjct: 338 ADLTEADLTGANLTRAYLIKAILEEAILEEVILRGAILRGAIL 380
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 18/149 (12%)
Query: 71 FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
F L A++ A+ I A ADL K +A G A F A L +A+ +
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIG--- 312
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEANLT 183
ANF A + +D +G+ G L +A +A+ TGA+L+ +++ +L E L
Sbjct: 313 -ANFEKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEAILEEVILR 371
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+L +LTR+ L GA ++GA D I
Sbjct: 372 GAILRGAILTRAILRGANLKGATMPDGSI 400
Score = 43.9 bits (102), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + A++ E++F + A L++A+ ANF GA + R L EAN T A+L
Sbjct: 217 NISKANLTEANFKRAILAEANLKRAILIGANFEGA-----IFTRADLAEANFTRAILTEA 271
Query: 191 VLTRSDLGGAIIEGADFSDA 210
+L ++ AI+ GAD + A
Sbjct: 272 ILIGANFEEAILAGADLTKA 291
>gi|86605499|ref|YP_474262.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554041|gb|ABC98999.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 330
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 58/118 (49%), Gaps = 4/118 (3%)
Query: 99 AETRGEFGIG---SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEK 154
A+ RG +G Q G A+L++A+ + N AN + AD+ +D S + A L +
Sbjct: 207 ADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAACLRSAKLAR 266
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+AN GADL + L NL NA L +LTR+DL A + GA+ A +
Sbjct: 267 TDLSRANLAGADLRSASLVDAYLGRTNLENADLREAILTRADLSTANLAGANLRGATL 324
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+ D+ ++D G AYL +A KAN GA+LS + + L+EA+L +A L
Sbjct: 31 RADLIGIDLSQADLHGINLIFAYLGRAKLQKANLVGANLSGANLSQADLSEADLRDAHLH 90
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 232
T L +DL GA + A DA + +A ++AN T+ G
Sbjct: 91 GTTLQGADLHGANLALALLIDANL----LEADLRWANLTSANLG 130
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G A L+KA V N AN + AD+ E+D + +G L+ A + AN A
Sbjct: 52 AYLGRAKLQKANLVGANLSGANLSQADLSEADLRDAHLHGTTLQGADLHGANLALA---- 107
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+L +ANL A L LT ++LGGA + GA+
Sbjct: 108 ------LLIDANLLEADLRWANLTSANLGGACLRGANL 139
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 8/135 (5%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRAN-F 132
T L A + + ++ L D N EA+ R + ++A G A LR A E+ RA
Sbjct: 92 TTLQGADLHGANLALALLIDANLLEADLR--WANLTSANLGGACLRGANLRFESRRAAVL 149
Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
SA++ +D SG+ GA L +A+ GA+L + + L ANL A L +L
Sbjct: 150 RSANLSRADLSGANLAGADL-----TRADLRGANLKEASLIGAHLQGANLQRACLRGALL 204
Query: 193 TRSDLGGAIIEGADF 207
+ +DL GA G D
Sbjct: 205 SNADLRGASFLGGDL 219
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 10/87 (11%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKA----------NFTGADLSDTLMDRMVLN 178
RA+ A+++E+ G+ GA L++A A +F G DL M R L
Sbjct: 171 RADLRGANLKEASLIGAHLQGANLQRACLRGALLSNADLRGASFLGGDLQGVQMGRANLK 230
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGA 205
EA L+ L L+ +DL GA + A
Sbjct: 231 EAMLSQVNLAEANLSEADLAGADLSAA 257
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 132 FTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
++AD+R + F G G A L++A+ + N A+LS+ + L+ A L +A
Sbjct: 204 LSNADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAACLRSAK 263
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVI 212
L RT L+R++L GA + A DA +
Sbjct: 264 LARTDLSRANLAGADLRSASLVDAYL 289
>gi|75910595|ref|YP_324891.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704320|gb|ABA23996.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 521
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 59/116 (50%), Gaps = 7/116 (6%)
Query: 115 SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
SA+LR A + NFR A+ + A++R +D SG + A L A AN GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 223
+ + L ANL A L+R +DL AI+ GA +S + L + +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 9/101 (8%)
Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + +++ E+DFS +K N GA L A+ ++ A+L + + R L A+L
Sbjct: 45 ANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRSDLSRAQLRGASLVR 104
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
A L+R L+R DL A + AD +A + + A ++AN
Sbjct: 105 AELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A SADLR+A A++R ++ +G+ GA L A AN G+DLS
Sbjct: 118 SEANLNSADLREAT---------LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS 168
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
R L ANL +A L + ++L GA + GA+
Sbjct: 169 -----RCDLTSANLRDAELKQVNFRHANLSGADLSGANL 202
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 26/121 (21%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 185
NF+ D+ E++ SG K G +A AN +G++LS+ LN A NLTNA
Sbjct: 16 NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75
Query: 186 V----------LVRTVLTRSDLGGAIIEGA----------DFSDAVIDLAQ-KQALCKYA 224
+ L+R+ L+R+ L GA + A D S+A ++ A ++A ++A
Sbjct: 76 IFNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135
Query: 225 N 225
N
Sbjct: 136 N 136
>gi|153873268|ref|ZP_02001907.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
gi|152070268|gb|EDN68095.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
Length = 159
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 49/83 (59%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
FRAN + D+ +D SG+ +GA L +A ANFT A+LS+ + +ANLT+A L
Sbjct: 47 FRANLSHVDLTNTDLSGANLSGANLNEANLTNANFTKANLSEANLCESYFAKANLTDANL 106
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
LT++ L + + GA+ S+A
Sbjct: 107 SEANLTKAYLIESFLSGANLSEA 129
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANFT A++ E++ S F A L A +AN T A L ++ + L+EANL + L
Sbjct: 79 ANFTKANLSEANLCESYFAKANLTDANLSEANLTKAYLIESFLSGANLSEANLFRSNLFE 138
Query: 190 TVLTRSDLGGAIIEGADF 207
+ L R++L GA + A F
Sbjct: 139 SDLFRANLTGANLYKAKF 156
>gi|194337742|ref|YP_002019536.1| pentapeptide repeat-containing protein [Pelodictyon
phaeoclathratiforme BU-1]
gi|194310219|gb|ACF44919.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 408
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 5/78 (6%)
Query: 135 ADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A++R+SDF+GS GA +++ AV +AN GA+L +++ LN ANLT A L
Sbjct: 111 ANLRKSDFTGSSLTGANLQGSFMKGAVLREANLEGANLRWAMLENGDLNRANLTGATLFE 170
Query: 190 TVLTRSDLGGAIIEGADF 207
L +DL GA ++ A F
Sbjct: 171 ANLAGADLKGANLKNAHF 188
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 46/84 (54%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A+ A+M+ + G+ GA L++A A+ + ++LS+ L+ L ANL+ A L
Sbjct: 285 KADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLSNALLYGAKLGNANLSGANLE 344
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
L +DL GA +EGA+ A I
Sbjct: 345 GASLFEADLEGANLEGANLKGANI 368
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLT 183
R + + ++ +G+ F+ A L KA A GADL +DR L A NL+
Sbjct: 265 RTRVEQSSFQNTNMAGADFHKADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLS 324
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
NA+L L ++L GA +EGA +A ++
Sbjct: 325 NALLYGAKLGNANLSGANLEGASLFEADLE 354
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 5/88 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANL 182
R + + +D G+ A ++KA K++FTG A+L + M VL EANL
Sbjct: 84 IRVKLNGSKLDMADLKGANLTMALIKKANLRKSDFTGSSLTGANLQGSFMKGAVLREANL 143
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A L +L DL A + GA +A
Sbjct: 144 EGANLRWAMLENGDLNRANLTGATLFEA 171
>gi|428219623|ref|YP_007104088.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427991405|gb|AFY71660.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 172
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 54/101 (53%), Gaps = 6/101 (5%)
Query: 117 DLRKAVHVKENFRANF-TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DLR A N R F +A +R SD +G+ A L A AN TGADL+ M+
Sbjct: 69 DLRGA-----NLRGAFLKNARLRGSDLTGADLRDATLTGAYFTGANLTGADLAGAEMEWA 123
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L +ANL +A L L+RSDL GA ++GAD A + A+
Sbjct: 124 NLRDANLQDANLQDANLSRSDLDGANLDGADLRGANLSRAK 164
>gi|443324431|ref|ZP_21053184.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442795950|gb|ELS05284.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 239
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 60/105 (57%), Gaps = 11/105 (10%)
Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGA 165
QF +L++A +K N + T+AD+R++ S F A L A + + +FT A
Sbjct: 16 QFSRINLQEAELIKVNLSNVDLTAADLRQARLGRSNFGHACLRSADLSESILWGTDFTQA 75
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
DLS + V+ EA+L+ A+L + L +++L +I+EGA+FS A
Sbjct: 76 DLS-----QAVMREADLSGAILTQANLEKANLIKSILEGANFSGA 115
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 60/112 (53%), Gaps = 4/112 (3%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
R FG A SADL +++ +F +A+ + A MRE+D SG+ A LEKA K+
Sbjct: 49 RSNFG---HACLRSADLSESILWGTDFTQADLSQAVMREADLSGAILTQANLEKANLIKS 105
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
GA+ S + ++ E +L A RT L+++DL A + A+ S A++
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALL 157
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 19/123 (15%)
Query: 107 IGSAAQFGSADLRKAVHVK------ENFRANFTSADMRESDFS----------GSKFNGA 150
I A F A LR A+ ++ ++R N + AD+ +D S +K +GA
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALLYQAKLDGA 165
Query: 151 YLEKA---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
L +A N DL++ + L+ ANLT A+L R LT +DL G I+ D
Sbjct: 166 RLSRANLSAGRGENALATDLTEASLRDADLSYANLTGAILHRADLTGADLTGTILTNTDL 225
Query: 208 SDA 210
+A
Sbjct: 226 REA 228
>gi|359464087|ref|ZP_09252650.1| hypothetical protein ACCM5_35600 [Acaryochloris sp. CCMEE 5410]
Length = 237
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 60/123 (48%), Gaps = 21/123 (17%)
Query: 111 AQFGSADLRKAVHVKENF------RANF----------TSADMR-----ESDFSGSKFNG 149
A F ADLR++ + NF RAN TSADMR E+D SG+K
Sbjct: 35 ADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSADMRQANLREADLSGAKLIQ 94
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
L +A KA GA+LS MD +L E +L RT L R++L GA + A+ S
Sbjct: 95 TQLTEANLLKACLCGANLSAVQMDGAILIEVDLRPTSDQRTDLGRANLAGADLSYANLSQ 154
Query: 210 AVI 212
A++
Sbjct: 155 ALL 157
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 1/102 (0%)
Query: 113 FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +LR+A + A+F+ AD+R+S F + F+ +A + F GADL+
Sbjct: 17 FHRIELREAELINSELCGADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSAD 76
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
M + L EA+L+ A L++T LT ++L A + GA+ S +D
Sbjct: 77 MRQANLREADLSGAKLIQTQLTEANLLKACLCGANLSAVQMD 118
>gi|359151325|ref|ZP_09184042.1| pentapeptide repeat-containing protein [Streptomyces sp. S4]
Length = 240
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 65/142 (45%), Gaps = 2/142 (1%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
R + A A A+ S A+A+ + ++T + + AA + R +
Sbjct: 14 RSLLYLACPGAPPAAISDTARAIAE--RSGSQTSPTYAVAEAASLTAVPPRNSGRFHNLS 71
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN SAD+ + +G+ GA L + AN TGADL + L NLT A +
Sbjct: 72 RANLISADLARVNLTGANLTGADLARVNLTGANLTGADLIYANLAGADLTRVNLTRARMK 131
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
T LT +DL GA + G D ++A
Sbjct: 132 LTNLTGADLTGADLAGGDLTNA 153
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 61/145 (42%), Gaps = 11/145 (7%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-----------ANFTSAD 136
++ A L G F S A SADL + N AN T AD
Sbjct: 50 VAEAASLTAVPPRNSGRFHNLSRANLISADLARVNLTGANLTGADLARVNLTGANLTGAD 109
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
+ ++ +G+ L +A N TGADL+ + L A+LTNA L LT D
Sbjct: 110 LIYANLAGADLTRVNLTRARMKLTNLTGADLTGADLAGGDLTNADLTNADLTGAHLTNVD 169
Query: 197 LGGAIIEGADFSDAVIDLAQKQALC 221
L GAI+ GA+ A + A++ L
Sbjct: 170 LTGAILTGANLGGANLAAARQLRLV 194
>gi|428314172|ref|YP_007125149.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255784|gb|AFZ21743.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 276
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 11/114 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
SAA A LR+A N + AN + D++ +D G+ GA L++A N +GADL
Sbjct: 104 SAATLKGAKLREA-----NLQGANLRAVDLKNADLCGANLQGADLKRADLINTNLSGADL 158
Query: 168 S-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
S D + +++ L EANL A L L+ +DL GA + A+ + A + AQ
Sbjct: 159 SGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQ 212
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 16/119 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGS-----KFNGAYLEKAVA 157
S A A+L + K N R AN D+ E+D +G+ NGA L++A
Sbjct: 154 SGADLSGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQL 213
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+AN +G D M + L+ ANL A L L+++ L G + GA+ +A++D A+
Sbjct: 214 SQANLSGLD-----MTHLNLSGANLRQANLSEAQLSQAQLYGTDLRGANLDEAILDQAK 267
>gi|254409899|ref|ZP_05023679.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196182935|gb|EDX77919.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 478
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 69/151 (45%), Gaps = 20/151 (13%)
Query: 95 NKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR----------------ANFTSA 135
N EA RG F G+ A +ADL ++ NFR A+ + A
Sbjct: 141 NLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSGA 200
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
++R +D SG+ + A L +A AN TGADL+ + L A+LT A L+ +
Sbjct: 201 NLRWTDLSGANLSWANLSEAKLSGANLTGADLTHANLLNTSLVHADLTQARLIHADWIGA 260
Query: 196 DLGGAIIEGADFSD-AVIDLAQKQALCKYAN 225
DL GA + GA + + L + +C++ +
Sbjct: 261 DLTGATLTGAKLHGVSRVGLKTQGIVCEWVD 291
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 4/129 (3%)
Query: 91 LADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSK 146
L++ N A G + IG S A+ A L A K N +AN A++ +D G++
Sbjct: 37 LSEANLSVANLSGAYLIGTNLSRARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQ 96
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
A + +A +A +GA L++ + L EA L +A L R L+ ++L GA + GA+
Sbjct: 97 LTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRGAFVTGAN 156
Query: 207 FSDAVIDLA 215
A ++ A
Sbjct: 157 LEGANLNAA 165
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 45/80 (56%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF+ A++ E++ SG +GA L +A AN +GA L T + R LN A L+ A L +
Sbjct: 16 NFSGANLAEANLSGINLSGADLSEANLSVANLSGAYLIGTNLSRARLNVARLSGANLTKA 75
Query: 191 VLTRSDLGGAIIEGADFSDA 210
LT+++L A + AD A
Sbjct: 76 NLTKANLNVANLIRADLGGA 95
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 49/101 (48%), Gaps = 6/101 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+LR A N AN +AD+ SD S S F A ++A AN GADLS
Sbjct: 140 ANLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSG 199
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ L+ ANL+ A L+ + L GA + GAD + A
Sbjct: 200 ANLRWTDLSGANLSWA-----NLSEAKLSGANLTGADLTHA 235
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A + ++ S ++ N A L A KAN T A+L+ + R L A LT A ++R
Sbjct: 45 ANLSGAYLIGTNLSRARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQLTQAAMIR 104
Query: 190 TVLTRSDLGGAII-----EGADFSDAVIDLAQKQ 218
L R+ L GA + GAD +A + A+ Q
Sbjct: 105 AELIRAKLSGATLTEANLSGADLREAALRDAKLQ 138
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 47/101 (46%), Gaps = 6/101 (5%)
Query: 111 AQFGSADLRKAVHVK-ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G A L +A ++ E RA + A + E++ SG+ A L A +AN + A+L
Sbjct: 90 ADLGGAQLTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRG 149
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ L ANL A L +RSDL + A+F A
Sbjct: 150 AFVTGANLEGANLNAADL-----SRSDLSNSNFRHAEFKQA 185
>gi|17228637|ref|NP_485185.1| hypothetical protein alr1142 [Nostoc sp. PCC 7120]
gi|17130488|dbj|BAB73099.1| alr1142 [Nostoc sp. PCC 7120]
Length = 521
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 59/116 (50%), Gaps = 7/116 (6%)
Query: 115 SADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
SA+LR A + NFR A+ + A++R +D SG + A L A AN GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 223
+ + L ANL A L+R +DL AI+ GA +S + L + +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 9/101 (8%)
Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + +++ E+DFS +K N GA L A+ ++ A+L + R L A+L
Sbjct: 45 ANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRADLSRAQLRGASLVR 104
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
A L+R L+R DL A + AD +A + + A ++AN
Sbjct: 105 AELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A SADLR+A A++R ++ +G+ GA L A AN G+DLS
Sbjct: 118 SEANLNSADLREAT---------LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS 168
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
R L ANL +A L + ++L GA + GA+
Sbjct: 169 -----RCDLTSANLRDAELKQVNFRHANLSGADLSGANL 202
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 26/121 (21%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 185
NF+ D+ E++ SG K G +A AN +G++LS+ LN A NLTNA
Sbjct: 16 NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75
Query: 186 V----------LVRTVLTRSDLGGAIIEGA----------DFSDAVIDLAQ-KQALCKYA 224
+ L+R L+R+ L GA + A D S+A ++ A ++A ++A
Sbjct: 76 IFNHSSLNVANLIRADLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135
Query: 225 N 225
N
Sbjct: 136 N 136
>gi|90419937|ref|ZP_01227846.1| conserved hypothetical protein with pentapeptide repeats
[Aurantimonas manganoxydans SI85-9A1]
gi|90335978|gb|EAS49726.1| conserved hypothetical protein with pentapeptide repeats
[Aurantimonas manganoxydans SI85-9A1]
Length = 292
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 57/108 (52%), Gaps = 6/108 (5%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAY-LEKAVAYKANFTGADLSD 169
A F ADL A + RA+F A+M+ +DFS N + L + V A+ TGADLS
Sbjct: 168 ATFDGADLSAARIAGDFSRASFVRANMKGADFSADMRNQSMGLMRGVLNSADLTGADLSG 227
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVI 212
+ R A+ T+A L LTR ++ G ++EGADF+DA +
Sbjct: 228 ANLSRAAAEFADFTDADLSGADLTRFEASGANFNGTMVEGADFADAEL 275
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL-- 187
A+ TSA + +D S ++ GA L++A ANFTGADLS + + + +A A L
Sbjct: 118 ADLTSAYLNGTDLSNARLAGAKLDQAWGLGANFTGADLSGASLFQSQMQDATFDGADLSA 177
Query: 188 --VRTVLTRSDLGGAIIEGADFS 208
+ +R+ A ++GADFS
Sbjct: 178 ARIAGDFSRASFVRANMKGADFS 200
>gi|428770507|ref|YP_007162297.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684786|gb|AFZ54253.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 355
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 59/114 (51%), Gaps = 10/114 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S+A F A+LR A + T D+ E++ +K NG L A AN T A+L+
Sbjct: 245 SSANFQDANLRGA---------DLTDVDLSEANLQNTKLNGVDLSGAYLEGANLTNANLT 295
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 221
+ + L ANLTNA L T L + LG I++GA F++ + ++ +KQ L
Sbjct: 296 NASLALSNLIGANLTNANLTNTNLQNTSLGQTIVKGAIFANNLGLNEEKKQELI 349
>gi|392410624|ref|YP_006447231.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
gi|390623760|gb|AFM24967.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
Length = 285
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 1/102 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L+KA +F RA+ + AD+ +D SG+ +GA L A + + + DL
Sbjct: 161 SGADLFGAKLKKAALSAVDFSRADLSGADLSGADLSGAILSGARLNGANLSRVDLSFTDL 220
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
S + L+ ANLT A L + L+ +DL GA ++GAD +D
Sbjct: 221 SGAHLSGANLSAANLTGAYLPGSDLSGADLSGANLQGADITD 262
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 10/83 (12%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + A++ ++D S + +GA L KA+ A+ +GADL A L A L
Sbjct: 128 ADLSKANLSQADLSRAILSGANLSKALLPFADLSGADLF----------GAKLKKAALSA 177
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
+R+DL GA + GAD S A++
Sbjct: 178 VDFSRADLSGADLSGADLSGAIL 200
>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
Length = 427
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 65/147 (44%), Gaps = 25/147 (17%)
Query: 94 LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
LN Y R + G + AQ DLR+A+ +FR A F A++ E+ +GS+ A
Sbjct: 23 LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82
Query: 151 YLEKAVAYKANFTGADL------SDTLMDR----------------MVLNEANLTNAVLV 188
L A K +F GADL S + D L+ A+L + V
Sbjct: 83 DLSGAKLVKTDFRGADLEQAKLTSSDITDADFRATTIGAPAGSDIATKLDGADLDHVKAV 142
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
RT LTR+ L GA GA F A +D A
Sbjct: 143 RTNLTRASLMGATARGAHFDGASLDRA 169
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 9/108 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A + ADL HVK R N T A + + G+ F+GA L++A AN A
Sbjct: 128 ATKLDGADLD---HVKA-VRTNLTRASLMGATARGAHFDGASLDRANFKGANLEHATFVS 183
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVI 212
+ + L E N +A L T LT +DL GA + GAD +D VI
Sbjct: 184 SSLRGANLQEVNFADATLSNTDLTGADLRSCHLDGADMSGADLTDCVI 231
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 6/98 (6%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLM 172
+RK H N+ ADMR +G++ NG L +A+ A+ F GA+LS+ +
Sbjct: 16 IRKHGHFLNNYPGG-QRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATL 74
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L A+L+ A LV+T +DL A + +D +DA
Sbjct: 75 AGSQLRVADLSGAKLVKTDFRGADLEQAKLTSSDITDA 112
>gi|282896932|ref|ZP_06304938.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
gi|281198341|gb|EFA73231.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
Length = 689
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 58/105 (55%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQ ADL A + + + + +++ ++++ G+ + +YL A ANF+ A+L
Sbjct: 536 SGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQGADLSESYLNHANLNSANFSAANL 595
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S +L AN+TNA L ++R+DL GA +EG DF A++
Sbjct: 596 SGA-----ILRSANMTNANLRNADISRADLRGANLEGTDFQGAIL 635
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 53/117 (45%), Gaps = 24/117 (20%)
Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM-- 175
SA++ ++ F S+F A L KA +N + A+LS LM R+
Sbjct: 431 LKSANLNQASFKSSRFRSVGEDGRWDTYDDIIADLSKAQLKGSNLSSANLSRVLMSRVDL 490
Query: 176 ---VLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
VLN ANL N+ L+ R L SDL AI++ A + A I AQ Q YA
Sbjct: 491 SFSVLNRANLANSKLIGANLSRAQLVGSDLQQAILQDAILTGADISGAQLQEADLYA 547
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 65/131 (49%), Gaps = 15/131 (11%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSG 144
+ADL+K A+ +G + SA+L + + + + RAN ++ + ++ S
Sbjct: 462 IADLSK--AQLKG-------SNLSSANLSRVLMSRVDLSFSVLNRANLANSKLIGANLSR 512
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
++ G+ L++A+ A TGAD+S + L A L + + L+ S+L +G
Sbjct: 513 AQLVGSDLQQAILQDAILTGADISGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQG 572
Query: 205 ADFSDAVIDLA 215
AD S++ ++ A
Sbjct: 573 ADLSESYLNHA 583
>gi|220907270|ref|YP_002482581.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863881|gb|ACL44220.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 369
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 44/88 (50%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN D+ + S + + A L A +K NF GA+L + R L +ANLTNA L
Sbjct: 260 ANLAEKDLAGRNLSNANLSSANLSDAFLHKTNFHGANLFRANLFRANLLQANLTNANLRE 319
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQK 217
T L +DL GA + GAD A I K
Sbjct: 320 TNLIGADLSGADLRGADLRGAKIGFDNK 347
>gi|434393337|ref|YP_007128284.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428265178|gb|AFZ31124.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 213
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 6/102 (5%)
Query: 107 IGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
I + F DL +A K N R NFT A + ++D SGS + L +A A
Sbjct: 105 IATQVGFLETDLERANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNA 164
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
N +GA+LS+ + R L ANL A L L+R+ L GAI+
Sbjct: 165 NLSGANLSNADLSRADLRNANLIGANLDGANLSRAKLEGAIM 206
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN ++R+ D S + F A LEKA +N + +LS + L+ ANL+NA L
Sbjct: 118 RANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNANLSGANLSNADLS 177
Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
R L ++L GA ++GA+ S A ++
Sbjct: 178 RADLRNANLIGANLDGANLSRAKLE 202
>gi|75911045|ref|YP_325341.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704770|gb|ABA24446.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 973
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADL A + R A AD+ +D SG+ NGAYL A A + ADLS +
Sbjct: 841 SADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRADLR 900
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
L ANL +A L+ L +DL GA + A+ D
Sbjct: 901 SADLRSANLISADLISADLISADLNGADLSHANLGD 936
Score = 47.4 bits (111), Expect = 0.008, Method: Composition-based stats.
Identities = 32/86 (37%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
A AD+R++ D SG+ +GAYL A A GA LS + R L A+L +
Sbjct: 847 AYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRADLRSADLRS 906
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
A L+ L +DL A + GAD S A
Sbjct: 907 ANLISADLISADLISADLNGADLSHA 932
Score = 46.2 bits (108), Expect = 0.014, Method: Composition-based stats.
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 2/92 (2%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
+R +D SG+ GA L A A+ +GADLS ++ LN A L A L L+R+D
Sbjct: 839 LRSADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRAD 898
Query: 197 LGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
L A + A+ A DL + NG +
Sbjct: 899 LRSADLRSANLISA--DLISADLISADLNGAD 928
Score = 43.1 bits (100), Expect = 0.15, Method: Composition-based stats.
Identities = 27/76 (35%), Positives = 38/76 (50%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
AD+ +D SG+ G +L A A GADL D ++ L+ A+L+ A L L
Sbjct: 822 ADLSGADLSGAFLKGVFLRSADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNG 881
Query: 195 SDLGGAIIEGADFSDA 210
+ L GA + AD S A
Sbjct: 882 AYLNGAYLSHADLSRA 897
>gi|428310592|ref|YP_007121569.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428252204|gb|AFZ18163.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 522
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 3/134 (2%)
Query: 94 LNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY 151
L KY A R G+ A +A+L A + N AN + A++ ++ S +K N A
Sbjct: 7 LKKYAAGDRDFSGLNLAEVNLSAANLSGANLSEVNLSVANLSGANLSGANLSRAKLNVAR 66
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
L A KAN A L+ T + R L ANLT A L+R L R++L GA ++ A+ S A
Sbjct: 67 LSGANISKANLIQASLNVTNLIRADLRRANLTQAALIRAELIRAELSGATLKEANLSGAD 126
Query: 212 I-DLAQKQALCKYA 224
+ + A +QA+ A
Sbjct: 127 LREAALRQAILSRA 140
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 67/147 (45%), Gaps = 17/147 (11%)
Query: 98 EAETRGEF---GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
EA RG F I ADL +A N R AD+R+++ S + +GA L +
Sbjct: 144 EANLRGAFLTASILEGTNLNKADLNRADLSDSNIR----EADLRQANLSFANLSGADLSR 199
Query: 155 AVAYKANFTGADLS-DTLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEG 204
A A+ +GADL L D + L+ ANL NA LV LT++ L G
Sbjct: 200 ANLRWADLSGADLRWANLSDAKLSGANLMGADLSHANLHNASLVHADLTQASLIKVDWIG 259
Query: 205 ADFSDAVIDLAQKQALCKYANGTNPIT 231
AD S A + A+ A+ ++ T IT
Sbjct: 260 ADLSGATMTGAKLYAVSRFGLKTTGIT 286
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 20/116 (17%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----T 170
ADLR RAN T A + ++ ++ +GA L++A N +GADL +
Sbjct: 90 ADLR---------RANLTQAALIRAELIRAELSGATLKEA-----NLSGADLREAALRQA 135
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
++ R L+EANL A L ++L ++L A + AD SD+ I A +QA +AN
Sbjct: 136 ILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFAN 191
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR+A + RA + A++R + + S G L KA +A+ + +++ +
Sbjct: 120 ANLSGADLREAALRQAILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIRE 179
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + L+ ANL+ A L R L +DL GA + A+ SDA
Sbjct: 180 ADLRQANLSFANLSGADLSRANLRWADLSGADLRWANLSDA 220
>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 447
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 11/101 (10%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +DLR A + + + N T AD+RE+D + + GA L A +A+ TGA
Sbjct: 330 ANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS--- 386
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LN+ NL A L LTR+DL GA + GAD +A
Sbjct: 387 -------LNQVNLAEADLRGVDLTRADLRGANLSGADLREA 420
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 50/95 (52%), Gaps = 9/95 (9%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
ADLR A+ +SA++ ++D +G+ + A L KA AN G+DL +
Sbjct: 295 ADLRGAM---------LSSANLSQADMTGTDLSRANLRKAYLADANMKGSDLRGADLIGA 345
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LN+ NLT A L LTR+DL GA + AD +A
Sbjct: 346 SLNKVNLTQADLREADLTRADLRGANLRLADLREA 380
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 25/102 (24%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKA-----------VAYK---------ANFTGADLSDT 170
NF + D+ D G+ G+YL +A + Y AN +GADLSD
Sbjct: 29 NFMTPDLSNKDLIGASLRGSYLREAKLSGANLSEAILCYADLIGADLKGANLSGADLSDA 88
Query: 171 LMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADF 207
++ L+E+NLT A +LV T L+ +DL GA ++GA+
Sbjct: 89 NLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANL 130
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 58/122 (47%), Gaps = 12/122 (9%)
Query: 91 LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF 147
LAD N ++ RG IG++ ADLR+A + T AD+R ++ +
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREA---------DLTRADLRGANLRLADL 377
Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A L A + N ADL + R L ANL+ A L LT+++L A ++GA+
Sbjct: 378 READLTGASLNQVNLAEADLRGVDLTRADLRGANLSGADLREADLTKANLHWANLDGANL 437
Query: 208 SD 209
+D
Sbjct: 438 TD 439
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 10/102 (9%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ ES+ +G+ F G+ L +A+ GA+L + L EANL+ A L
Sbjct: 88 ANLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANLIGAKLAEANLSGANLSG 147
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 231
T L+ +DL G I++ AV DL ++ G +P T
Sbjct: 148 TDLSEADLRGTILQ-----KAVYDLR-----TRFCEGLDPQT 179
>gi|300864770|ref|ZP_07109621.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300337239|emb|CBN54769.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 334
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A DLR ++ N + N T AD+RE+D S + N A L+ A AN GA L
Sbjct: 230 ADLHDTDLRGGNLIQANLMKTNLTEADLREADLSHTNLNLANLKGADLSGANLQGAYLWA 289
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
T +D L A+L A L +++ +DL AI+ GA D I
Sbjct: 290 TNLDGACLKGADLRGASLRNAIISGADLRDAILTGATMPDGKI 332
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 50/110 (45%), Gaps = 6/110 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S AQ A+L V R AN AD+ ++D G A L K +A+
Sbjct: 198 SGAQLSGANLSGTVLSGARMRFTKLEQANLKQADLHDTDLRGGNLIQANLMKTNLTEADL 257
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
ADLS T ++ L A+L+ A L L ++L GA ++GAD A +
Sbjct: 258 READLSHTNLNLANLKGADLSGANLQGAYLWATNLDGACLKGADLRGASL 307
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 11/114 (9%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR-----ESDFSGSKFNGAY 151
EA G F G+ F K H+ A+ T AD+R + D +G++ +GA
Sbjct: 58 LEANLNGAFLYGANLSFAKL---KGSHL---LGADLTKADLRGAQLAKVDLTGAQLSGAI 111
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L ++AN G +L + + L ANL A L LT + L GA ++GA
Sbjct: 112 LSWVSLFQANLPGVNLCGANLSGINLRSANLAGANLNWANLTGARLSGANLKGA 165
>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 184
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 47/80 (58%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F+ ++ + + +K GA L A A+ G DL+ +++ LN+ANL A +++
Sbjct: 16 DFSHVNLVQVCLTNAKLVGARLNGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQA 75
Query: 191 VLTRSDLGGAIIEGADFSDA 210
LTR+DL GA + GAD +DA
Sbjct: 76 CLTRADLSGAYLAGADLTDA 95
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ +D SG+ GA L KA KA+ +GADL + L E +L++A L
Sbjct: 90 ADLTDADLSGADLSGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDG 149
Query: 190 TVLTRSDLGGAIIEGADFS 208
L +DL GA +E F+
Sbjct: 150 AYLGHADLTGADVERTRFN 168
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 36/68 (52%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN AD+R++D S + +GA L A AN DLSD +D L A+LT A + R
Sbjct: 105 ANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDGAYLGHADLTGADVER 164
Query: 190 TVLTRSDL 197
T +S L
Sbjct: 165 TRFNQSQL 172
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN A+M ++ + + +GAYL A A+ +GADLS ANL A L
Sbjct: 64 QANLAGAEMIQACLTRADLSGAYLAGADLTDADLSGADLS----------GANLGGADLR 113
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
+ L+++DL GA + GAD S A
Sbjct: 114 KADLSKADLSGADLRGADLSGA 135
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A A++ +D G A+L +A +AN GA++ + R L+ A L A L
Sbjct: 35 ARLNGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQACLTRADLSGAYLAGADLTD 94
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ +DL GA + GAD A
Sbjct: 95 ADLSGADLSGANLGGADLRKA 115
>gi|239909009|ref|YP_002955751.1| hypothetical protein DMR_43740 [Desulfovibrio magneticus RS-1]
gi|239798876|dbj|BAH77865.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 972
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 34/90 (37%), Positives = 50/90 (55%), Gaps = 10/90 (11%)
Query: 129 RANFTSADMRESDFSGS-----KFNGAYLEKAVAYKA-----NFTGADLSDTLMDRMVLN 178
+ NF SA +RES+F+ + F A +EK+ +KA NF ADL++T L
Sbjct: 828 KTNFESASLRESNFTNAICNNANFKKARMEKSNLHKATLINTNFEKADLTNTNFSEASLE 887
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
ANL+N+ L LTR++L A + GA+ S
Sbjct: 888 GANLSNSKLKEANLTRANLCDANLVGANLS 917
Score = 44.7 bits (104), Expect = 0.045, Method: Composition-based stats.
Identities = 23/82 (28%), Positives = 41/82 (50%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F ANF +++ E +F+G+K + A+ K NF A L ++ + N AN A +
Sbjct: 797 FNANFMFSNLSEVNFNGAKLDDVEFANAILNKTNFESASLRESNFTNAICNNANFKKARM 856
Query: 188 VRTVLTRSDLGGAIIEGADFSD 209
++ L ++ L E AD ++
Sbjct: 857 EKSNLHKATLINTNFEKADLTN 878
Score = 42.4 bits (98), Expect = 0.24, Method: Composition-based stats.
Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 7/119 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+ ++L KA + NF NF+ A + ++ S SK A L +A AN G
Sbjct: 854 ARMEKSNLHKATLINTNFEKADLTNTNFSEASLEGANLSNSKLKEANLTRANLCDANLVG 913
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCK 222
A+LS + + + N+ANL NA L+ S GA ++ A F D V IDL Q C+
Sbjct: 914 ANLSGSDLSKANFNKANLANANLLNCKFNFSKFLGANLDNAKFDDDVDIDLLTNQKRCQ 972
>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris marina MBIC11017]
gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
marina MBIC11017]
Length = 532
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 21/116 (18%)
Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+F + DLR A+ + NF RANFT A++R ++ AY+ A A+ GA+LSD
Sbjct: 429 KFQNTDLRDAILINANFGRANFTGANLRNANLMQ-----AYMSHADLANADLRGANLSDA 483
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L+ ANL A +L GA + GA S++ + AQ L Y NG
Sbjct: 484 -----YLSHANLRGA----------NLCGADLSGAKLSESQLSFAQTNWLTVYPNG 524
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 21/104 (20%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F DLR N R SA+ E F + A L A +ANFTGA
Sbjct: 405 FSGQDLRNL-----NLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGA------ 453
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
NL NA L++ ++ +DL A + GA+ SDA + A
Sbjct: 454 ---------NLRNANLMQAYMSHADLANADLRGANLSDAYLSHA 488
>gi|254489813|ref|ZP_05103008.1| Pentapeptide repeat protein [Methylophaga thiooxidans DMS010]
gi|224464898|gb|EEF81152.1| Pentapeptide repeat protein [Methylophaga thiooxydans DMS010]
Length = 154
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 63/137 (45%), Gaps = 21/137 (15%)
Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSG-----SKFNGAYLEKAV------ 156
GSAA F + + + ++ N + AD+ DFSG S NG L +A
Sbjct: 15 GSAAAFEQIYVDRLLETRQCHHCNLSEADLSGKDFSGADMSESILNGINLSQATLVGVWF 74
Query: 157 ----AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
AN GAD S++LMD +LN ANL A L + L +DL AD + A +
Sbjct: 75 THSKMQGANLEGADASNSLMDYALLNGANLKGANLNGSQLIFADL-----TDADLTGASV 129
Query: 213 DLAQKQALCKYANGTNP 229
D AQ + + Y N T P
Sbjct: 130 DNAQMRGVL-YCNTTMP 145
>gi|452964739|gb|EME69773.1| serine/threonine protein kinase [Magnetospirillum sp. SO-1]
Length = 137
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 46/89 (51%), Gaps = 15/89 (16%)
Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLNEAN-----LTN 184
SDFSGS N A L +AV ANF GA DL++ R VLN AN L
Sbjct: 8 SDFSGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKG 67
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A+L V+ +DL A +EGAD A+I+
Sbjct: 68 AILAGAVMNNADLSCATLEGADLRGAIIN 96
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 59/111 (53%), Gaps = 2/111 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S + +ADLR+AV + NF A A + ++D + ++F + L A + A GA L
Sbjct: 11 SGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKGAIL 70
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
+ +M+ L+ A L A L ++ +DL GA + GAD + A ++L + Q
Sbjct: 71 AGAVMNNADLSCATLEGADLRGAIINNADLSGADLRGADLTGA-LNLTRDQ 120
>gi|16331795|ref|NP_442523.1| hypothetical protein slr0516 [Synechocystis sp. PCC 6803]
gi|383323538|ref|YP_005384392.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326707|ref|YP_005387561.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492591|ref|YP_005410268.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437859|ref|YP_005652584.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
gi|451815947|ref|YP_007452399.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
gi|6226382|sp|Q55837.1|Y516_SYNY3 RecName: Full=Uncharacterized protein slr0516
gi|1001755|dbj|BAA10593.1| slr0516 [Synechocystis sp. PCC 6803]
gi|339274892|dbj|BAK51379.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
gi|359272858|dbj|BAL30377.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276028|dbj|BAL33546.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279198|dbj|BAL36715.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407960570|dbj|BAM53810.1| hypothetical protein BEST7613_4879 [Bacillus subtilis BEST7613]
gi|451781916|gb|AGF52885.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
Length = 166
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 5/83 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN-----LTNA 185
N +A + SD SG+ +G L +A+ +AN TGA+LS+T + L EAN L+ A
Sbjct: 54 NLENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSETDLTEAALTEANLAGADLSGA 113
Query: 186 VLVRTVLTRSDLGGAIIEGADFS 208
L R+ L DL GA ++GA+ +
Sbjct: 114 NLERSFLRDVDLTGANLKGANLA 136
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N AD+RE FN LE A +++ +GA+LS + R +L+ ANLT A L T
Sbjct: 44 NLAGADLRE-------FN---LENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSET 93
Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
LT + L A + GAD S A ++
Sbjct: 94 DLTEAALTEANLAGADLSGANLE 116
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 15/120 (12%)
Query: 92 ADLNKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
ADL ++ E R S A +LR+A+ RAN T A++ E+D +
Sbjct: 48 ADLREFNLENARLNRSDLSGANLSGVNLRRALL----DRANLTGANLSETDLT------- 96
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A +AN GADLS ++R L + +LT A L L ++L A + D +A
Sbjct: 97 ---EAALTEANLAGADLSGANLERSFLRDVDLTGANLKGANLAWANLTAANLTDVDLEEA 153
>gi|381205231|ref|ZP_09912302.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 236
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 6/107 (5%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AQ ADL A + + F AN A+++ ++ +G+ A L A YKAN GADL
Sbjct: 99 AQLVGADLEGADLDRADLFEANLEIANLQWANLAGASLENANLGLANLYKANLQGADLRG 158
Query: 170 TLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAV 211
+ +L EANL+N A L+ L+R++L GA ++GA +A+
Sbjct: 159 ANLTGAMLGEANLSNANLEGARLMVVNLSRANLKGANLKGAKIHEAI 205
>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 739
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 4/111 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S+AQ +AD R+A+ EN A+ T A++ E+ FS S +GA L K A +++F+ ADLS
Sbjct: 561 SSAQLINADFRRAI--LEN--ASLTGANLGEAKFSLSSLHGARLGKVSAVRSDFSSADLS 616
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
+ L+ ANL+NA L + L GA + A +A + A A
Sbjct: 617 QSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKLRYANLSA 667
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 14/111 (12%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+G G FG+ D ++ ++F+ AD+R + +G A L+ + + N G
Sbjct: 497 YGPGEDQHFGTFD---------DWVSDFSGADLRAVNLTG-----AILDNVLMNRTNLIG 542
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
A L+ L ANL++A L+ R+ L A + GA+ +A L+
Sbjct: 543 ATLNRARFYNSSLIGANLSSAQLINADFRRAILENASLTGANLGEAKFSLS 593
>gi|158341150|ref|YP_001522487.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311391|gb|ABW33002.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 150
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN T+A + + F G+ F A L+ A AN +GA+L + + +L ANLT A L
Sbjct: 22 KANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTGADLR 81
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
++L R+ L AI++GA+ DA
Sbjct: 82 SSILCRAVLTDAILQGANLRDA 103
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A F A+L A+ F +AN +A + ++ SG+ A L A+ AN TG
Sbjct: 18 ASFAKANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTG 77
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
ADL +++ R VL +A L A L L +D A + GAD S A ++L
Sbjct: 78 ADLRSSILCRAVLTDAILQGANLRDADLRETDFKNADLTGADLSGAKVNL 127
>gi|154251684|ref|YP_001412508.1| pentapeptide repeat-containing protein [Parvibaculum
lavamentivorans DS-1]
gi|154155634|gb|ABS62851.1| pentapeptide repeat protein [Parvibaculum lavamentivorans DS-1]
Length = 363
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+FT D+ DFS + GA+ +A+ ANF ++ +L A+ +NA+L
Sbjct: 273 RADFTRMDLSRKDFSRAVLAGAHFREAILADANF----------EKAILAAADFSNAILF 322
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
R L +DL GA + GAD +A D +K L
Sbjct: 323 RANLAGADLRGADLRGADLKNARQDDTKKGEL 354
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 37/77 (48%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M+E D SG F +FTG DL D L AN +A L RT +R+D
Sbjct: 62 MKECDLSGLDFRNLNFSHGHFIGCDFTGCDLEDAHFSGANLFSANFDHANLTRTNFSRAD 121
Query: 197 LGGAIIEGADFSDAVID 213
L GA E A+ +DA +D
Sbjct: 122 LRGANFEDAEMADAQLD 138
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 46/104 (44%), Gaps = 17/104 (16%)
Query: 111 AQFGSADLRKAVHVK---------EN--FRA------NFTSADMRESDFSGSKFNGAYLE 153
AQ ADLR+ ++ EN FR N + ++DF G+ +GA L+
Sbjct: 135 AQLDGADLRRGAVIRRGASAPVGRENSSFRGARMYGTNMAECKLLDADFEGASISGASLQ 194
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
A ANF GA+L + L +A+ AV+ + R D+
Sbjct: 195 GADLRGANFAGAELKGVELSGANLADADFRRAVMDEATIARGDM 238
>gi|434407898|ref|YP_007150783.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428262153|gb|AFZ28103.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 182
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F A + +D SG+K GA E + +AN GADLS+ L + LT A LVR
Sbjct: 90 ADFRGAQLNHADLSGAKLCGANFEGCLMVRANLAGADLSNA-----SLAGSALTGANLVR 144
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
+++DL A++ GA+ DAV D
Sbjct: 145 ANFSQADLTNAVLFGAETEDAVFD 168
>gi|434398906|ref|YP_007132910.1| heat shock protein DnaJ domain protein [Stanieria cyanosphaera PCC
7437]
gi|428270003|gb|AFZ35944.1| heat shock protein DnaJ domain protein [Stanieria cyanosphaera PCC
7437]
Length = 272
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 15/85 (17%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ + A+++E DFSG +G AN GADLSD+ + ++ L EANL A L R
Sbjct: 159 DLSRANLKEKDFSGRNLSG----------ANLQGADLSDSFLHKVNLEEANLQEANLFRA 208
Query: 191 VLTRSDLGGAIIE-----GADFSDA 210
L +++L A + GADFS A
Sbjct: 209 NLLKANLRKANLRDTNLIGADFSGA 233
>gi|428200510|ref|YP_007079099.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427977942|gb|AFY75542.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 174
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A ADLR+A + AN + AD++E++ SG+ + A L AV KAN +GA L
Sbjct: 60 ASLDRADLREACLIV----ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRGA 115
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
++ + L E+NL A L L +DL GA + AD S
Sbjct: 116 ILAGVNLAESNLRGANLQGANLYGADLRGADLRNADLS 153
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 128 FRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
RAN + A +R ++ SG+ + A L +A AN +GADL + + L+EA L
Sbjct: 38 IRANLSGALLRGANLSGAFLVVASLDRADLREACLIVANLSGADLKEANLSGANLSEAVL 97
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
T AVL + L+ + L GAI+ G + +++ + A Q Y
Sbjct: 98 TGAVLQKANLSGAKLRGAILAGVNLAESNLRGANLQGANLY 138
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 45/82 (54%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F+ D+ D + +K +GA L +A A GA+LS + L+ A+L A L+
Sbjct: 16 DFSRIDLHGVDLAQAKLSGANLIRANLSGALLRGANLSGAFLVVASLDRADLREACLIVA 75
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L+ +DL A + GA+ S+AV+
Sbjct: 76 NLSGADLKEANLSGANLSEAVL 97
>gi|119485665|ref|ZP_01619940.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
gi|119456990|gb|EAW38117.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
Length = 433
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 60/111 (54%), Gaps = 8/111 (7%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA------VAYKAN 161
S A F A+LR+A K N A+ + A + ++D G K GA L A + Y AN
Sbjct: 116 SGANFRDANLREAYLWKANLSNADLSDAYLEKADLRGVKLEGADLGYAMLKGANLGY-AN 174
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
F A L++T + L +ANL A LV L ++DL GA +EGA+ S+A +
Sbjct: 175 FVRARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKL 225
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 55/103 (53%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+ + DL A + N R A+ A+++++D G+K GA L A +AN A
Sbjct: 178 ARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVG 237
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ L++A+L A L +T +TR+DLG A ++ A DA +
Sbjct: 238 ANLENANLHQASLKGANLAKTQMTRADLGFANLQKASLGDAQL 280
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 62/137 (45%), Gaps = 14/137 (10%)
Query: 91 LADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFR-----------ANFTSAD 136
L D N +A+ RG E S A+ A+L A+ V N AN
Sbjct: 200 LVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVGANLENANLHQASLKGANLAKTQ 259
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
M +D + A L A +AN ADL++ + L +ANL NA+L + L +
Sbjct: 260 MTRADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQ 319
Query: 197 LGGAIIEGADFSDAVID 213
L GA +E A+ +DA+++
Sbjct: 320 LKGANLEDANLTDAILE 336
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 58/111 (52%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYK---------- 159
A G A+L+KA +AN SAD+ E+ +K A L A+ K
Sbjct: 263 ADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQLKG 322
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN A+L+D +++ ++L +ANL +A L L +++L GA ++ A+ ++A
Sbjct: 323 ANLEDANLTDAILEGVILEDANLEDANLEGAKLEQANLIGAYLKDANLTEA 373
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 7/85 (8%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ + + GAYL+ A +AN GADL L +ANL NA L
Sbjct: 343 ANLEDANLEGAKLEQANLIGAYLKDANLTEANLQGADLRGA-----NLTKANLRNAYLQG 397
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDL 214
L ++L GA ++GA+ D +DL
Sbjct: 398 ANLRGANLKGASLKGANLRD--VDL 420
>gi|319791261|ref|YP_004152901.1| hypothetical protein Varpa_0569 [Variovorax paradoxus EPS]
gi|315593724|gb|ADU34790.1| Protein of unknown function DUF2169 [Variovorax paradoxus EPS]
Length = 865
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/80 (41%), Positives = 41/80 (51%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ T AD D G F GA+LE A AN +GA+LS VL ANL A+ V T
Sbjct: 550 DLTGADFSGLDLRGVNFTGAWLESANFENANLSGANLS-----HAVLAHANLRGAIAVET 604
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L ++LGGA + A DA
Sbjct: 605 SLVGANLGGARLASAVLEDA 624
Score = 39.3 bits (90), Expect = 2.0, Method: Composition-based stats.
Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 10/82 (12%)
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT----------VLT 193
G GA L A ANF G DLS + +L+ ANL L R+ +L
Sbjct: 734 GCSLVGADLGHAAMGSANFGGMDLSQVSLVGSMLDGANLIGTRLARSDWRLASAKGVLLC 793
Query: 194 RSDLGGAIIEGADFSDAVIDLA 215
++DL A + GA+FS+AV+ A
Sbjct: 794 KADLAHARMAGANFSNAVLQHA 815
>gi|163797086|ref|ZP_02191041.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
gi|159177602|gb|EDP62155.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
Length = 421
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 14/116 (12%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F ADLR +V + +A F++A + + DF+G+K GA L A A ADL+D
Sbjct: 51 ALFAGADLRGSVFAGGHLEQAQFSTARLEQVDFAGAKLMGANLRGANLKGAKLMAADLTD 110
Query: 170 --------TLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
M+R + L++A+L+NA VRT L+ +++ I G F AV+
Sbjct: 111 ADLRPAKIVDMNRTIEQSANLHKADLSNAQFVRTNLSGANMSAIIAVGTAFQSAVL 166
>gi|86610069|ref|YP_478831.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558611|gb|ABD03568.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 160
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N AD+R +D S + GA L A ++AN GADLS + L+ A L A L R
Sbjct: 67 NLQEADLRGADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHGAYLWEAKLTRA 126
Query: 191 VLTRSDL-----GGAIIEGADFSDAVI 212
L SDL GGA++ GAD A++
Sbjct: 127 QLQGSDLSGAKIGGAVLTGADLRGAIL 153
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 8/98 (8%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNG 149
L+ +N EA+ RG A SA+L A N + AN AD+ +D + +G
Sbjct: 63 LSGINLQEADLRG-------ADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHG 115
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
AYL +A +A G+DLS + VL A+L A+L
Sbjct: 116 AYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLRGAIL 153
>gi|434399306|ref|YP_007133310.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428270403|gb|AFZ36344.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 298
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 68/143 (47%), Gaps = 18/143 (12%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA----VHVKEN--FR 129
L+ A + + N + L + N Y+AE G F F A+L K VH + F
Sbjct: 151 LSEANLVEANLNQAELINANLYDAELIGAF-------FYQANLTKVNAIKVHASKTYCFA 203
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++++SDF S A L A AN GA+LS + L ANL A
Sbjct: 204 ANLSEANLKKSDFRWSNLTYANLRDANLIGANLRGANLS-----QADLKGANLEGANFKG 258
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
LT++DL GA +GA+ DA+
Sbjct: 259 ANLTKADLRGANFKGANLQDAIF 281
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADLR+A + + N +AD RE++ + A L + K N A+L+
Sbjct: 84 ANLSNADLRQAYLIDADLTEINAIAADFREANCRCANLKEANLIGTLMRKVNLQQANLTA 143
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R L+EANL A L + L ++L A + GA F A
Sbjct: 144 VKLHRSNLSEANLVEANLNQAELINANLYDAELIGAFFYQA 184
>gi|434407711|ref|YP_007150596.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428261966|gb|AFZ27916.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 268
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G++ L + N A N +AD+ E+ ++ GAYL K YKAN T A LS
Sbjct: 144 ADLGTSKLHRTNLCFANLIAVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSG 203
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R L EA+L+NA L T L ++L GA + GA+ A
Sbjct: 204 AYLLRANLTEADLSNADLSWTNLRGANLTGANLRGANLRGA 244
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 43/97 (44%), Gaps = 14/97 (14%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A G ADL A + A A++ ++ S + GA L +A AN G+DLS
Sbjct: 64 ANLGGADLTGA----NLYNAKLIEANLSAANLSAANLRGATLTQADMNCANLIGSDLS-- 117
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
EANL AV+ L +DL GA + AD
Sbjct: 118 --------EANLKGAVITDANLIGADLRGANLRDADL 146
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 6/104 (5%)
Query: 110 AAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A +ADL +A +H E A D+ +++ + + +GAYL +AN T ADLS
Sbjct: 163 AVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSGAYL-----LRANLTEADLS 217
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ + L ANLT A L L ++L GA + + + ++
Sbjct: 218 NADLSWTNLRGANLTGANLRGANLRGANLTGANLSSVNLHETIM 261
>gi|443310213|ref|ZP_21039874.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442779757|gb|ELR89989.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 253
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A SA+L +A ++ N AN T A + +D S + A L A+ YKA A+L+D
Sbjct: 139 ANLKSANLSEAKLIRANLNEANLTEAHLNYADLSHANLGSASLVGAILYKAELRQANLND 198
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + L +ANL+ A L+ L ++L GA + GA+ + A
Sbjct: 199 AYLHKAYLFDANLSQARLINADLRWANLRGANLRGANLTGA 239
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN AD++ ++ S + L A AN A+LS+ + R LNEANLT A L
Sbjct: 109 ANLIGADLQGANLSNADLENVNLIGANLQNANLKSANLSEAKLIRANLNEANLTEAHLNY 168
Query: 190 TVLTRSDLGGAIIEGA 205
L+ ++LG A + GA
Sbjct: 169 ADLSHANLGSASLVGA 184
>gi|436670209|ref|YP_007317948.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428262481|gb|AFZ28430.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 309
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVA 157
+T E I S A DL K+ + E RA+ T AD+ E+D + A L +
Sbjct: 162 QTNWEGAILSQASLQRVDLEKS-QLNETILRRADLTEADLVEADLRYADLTEAILCRVAL 220
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 212
AN GADLS + R L A+L AVL T L +DL A + G+DFSD+ +
Sbjct: 221 ELANLVGADLSRATLKRASLFRADLEGAVLQDTNLVETDLRYANFKDTQLMGSDFSDSRV 280
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN ++E+D +G+ F+ A L + N+ GA LS + R+ L ++ L +L
Sbjct: 137 RANLFKVSLKEADCTGANFDEANLR-----QTNWEGAILSQASLQRVDLEKSQLNETILR 191
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
R LT +DL A + AD ++A++
Sbjct: 192 RADLTEADLVEADLRYADLTEAIL 215
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 49/85 (57%), Gaps = 5/85 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R + +++ SDFS + N A L +Y AN +G L T ++R L +ANLT A L+
Sbjct: 27 RVDLKGTNLKSSDFSHANLNSADL----SY-ANLSGTSLIWTDLNRANLRQANLTQACLL 81
Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
R+ L +DL A + A+ S+A+++
Sbjct: 82 RSSLFWADLQEATLVNANLSNALLN 106
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 44/85 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
AN SAD+ ++ SG+ L +A +AN T A L + + L EA L NA L
Sbjct: 42 HANLNSADLSYANLSGTSLIWTDLNRANLRQANLTQACLLRSSLFWADLQEATLVNANLS 101
Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
+L +L A ++GAD S+A ++
Sbjct: 102 NALLNHVNLTSACLKGADLSEASLE 126
>gi|158338487|ref|YP_001519664.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158308728|gb|ABW30345.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 464
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 66/129 (51%), Gaps = 21/129 (16%)
Query: 111 AQFGSADLR----KAVHVKEN------------FRANFTSADMRESDFSGSKFNGAYLEK 154
A+ G ADLR K ++KE RA+ AD+RE++ S ++ + LEK
Sbjct: 36 AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95
Query: 155 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A+ ++AN + A L+ + ++ L +ANL+ A L L R++LG A + A+ +
Sbjct: 96 SQLGAAILFRANLSQAQLTLSNLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155
Query: 210 AVIDLAQKQ 218
A + A+ Q
Sbjct: 156 ANLSQARLQ 164
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT---- 183
FRAN + A + S+ ++ A L +A +AN A+L +++ L ANL+
Sbjct: 104 FRANLSQAQLTLSNLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163
Query: 184 -NAVLVRTVLTRSDLGGAIIEGADF 207
NA LV T L ++L GA ++GA+
Sbjct: 164 QNASLVGTQLINANLEGASLKGANL 188
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A AD+R ++ GA L++A A GADL + + L EANL++A L
Sbjct: 35 KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
+ L +S LG AI+ A+ S A + L+
Sbjct: 90 LSNLEKSQLGAAILFRANLSQAQLTLS 116
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 41/74 (55%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ +F A + +++ + S +GA L +A ++A+ TGA L + L EANL NA +
Sbjct: 382 QVDFFRAQLPQANLAQSILDGANLTEANLFRADLTGASLKAATLKNANLAEANLENANIE 441
Query: 189 RTVLTRSDLGGAII 202
T L + L GAI+
Sbjct: 442 GTNLDDAYLCGAIM 455
>gi|390438023|ref|ZP_10226524.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
gi|389838556|emb|CCI30648.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
Length = 275
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 69/139 (49%), Gaps = 19/139 (13%)
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
A+ K N A +A++R +D SG+ GAYL A AN A LS + R L
Sbjct: 135 AIGPKANLTGAYLNNANLRFADLSGANLRGAYLSGADLTGANLAAAALSGANLQRASLTG 194
Query: 180 ANLTNAVLVRTVLTRSDLGGAI-----------IEGADFS--DAVIDLAQKQALCKYAN- 225
A L +A LV L +DL GA +EGADFS + + DL ++ LC ++
Sbjct: 195 AFLRDARLVGVELQFADLRGADLTGAILEQIQNLEGADFSQVEGLSDL-ERSYLCGRSSR 253
Query: 226 --GT-NPITGVSTRKSLGC 241
GT NP T +T +SLGC
Sbjct: 254 ELGTWNPYTRSNTGQSLGC 272
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 1/111 (0%)
Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
D+RKA ++ A N D+ + D + F GA L A AN TGA+L + R
Sbjct: 17 DVRKARDKGQSLSAANLEGIDLSQMDLKNADFTGAILLGADLAGANLTGANLEAADLRRA 76
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L ++L A L T+L R+ L GA ++GAD + A I L+ + G
Sbjct: 77 NLRGSDLRGANLRDTLLYRAILCGANLQGADLTGAKISLSVYDGTTSWPEG 127
>gi|167921391|ref|ZP_02508482.1| pentapeptide repeat protein [Burkholderia pseudomallei BCC215]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
C+ + A A L+ A R E + SAA G +++ V A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579
Query: 203 EGADFS 208
E DFS
Sbjct: 580 ERTDFS 585
>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 328
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 4/119 (3%)
Query: 98 EAETRGEFGIGS---AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE 153
E + RG +G+ AQ A+L++A+ + N AN + AD+ +D S S A L
Sbjct: 204 ETDLRGVSFLGADLQGAQMARANLKEAILRQVNLTEANLSEADLAGADLSASSLCSAKLA 263
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ +AN GADL + L NL NA L +LTR+DL A + GA+ A +
Sbjct: 264 RTDLSRANLAGADLRCANLVDAYLGRTNLENADLGEAILTRADLSTANLSGANLRGATL 322
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 11/96 (11%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
G A L+KA V N AN + AD+ E+D ++ +GA L+ A + AN T A
Sbjct: 52 LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTLA------ 105
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+L +ANL +A L LT ++LGGA + GA+
Sbjct: 106 ----LLIDANLLDADLRWANLTSANLGGACLRGANL 137
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 61/118 (51%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLE----------K 154
A A+L++A +K N + AN A ++ E+D G F GA L+ +
Sbjct: 170 ADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKE 229
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ + N T A+LS+ + L+ ++L +A L RT L+R++L GA + A+ DA +
Sbjct: 230 AILRQVNLTEANLSEADLAGADLSASSLCSAKLARTDLSRANLAGADLRCANLVDAYL 287
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 84/167 (50%), Gaps = 10/167 (5%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
A L++ ++ +T L A + + ++ L D N +A+ R + ++A G A LR A
Sbjct: 80 ADLRDAQLHGAT-LQGADLHGANLTLALLIDANLLDADLR--WANLTSANLGGACLRGAN 136
Query: 123 HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
++ R A +A++ +D SG+ +GA L +A+ +GA+L + + + L AN
Sbjct: 137 LRFDSRRGAVLRNANLSRADLSGANLSGADL-----TRADLSGANLKEASLIKANLQGAN 191
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 227
L A L +L+ +DL G GAD A + A K+A+ + N T
Sbjct: 192 LQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKEAILRQVNLT 238
Score = 40.4 bits (93), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 6/101 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL +A N + A+ A+++ ++ ++ GA L + +F GADL
Sbjct: 158 SGANLSGADLTRADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADL 217
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
M R ANL A+L + LT ++L A + GAD S
Sbjct: 218 QGAQMAR-----ANLKEAILRQVNLTEANLSEADLAGADLS 253
>gi|119488469|ref|ZP_01621642.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
gi|119455280|gb|EAW36420.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
Length = 463
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 15/123 (12%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY--------- 158
+ A A+L++A H + N SAD+R +D S + ++ +K VA
Sbjct: 107 TGASLNHANLKQANFHNADLDAVNLISADLRGADLSSASL--SWYDKVVANLSRADLTEA 164
Query: 159 ---KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+AN GA+L +T + R LN+ANL +A L+RT+L SDL A + A DA ++ A
Sbjct: 165 NLSEANLCGANLLETNLTRANLNKANLQDANLIRTILLESDLSLAELSNARLQDANLEGA 224
Query: 216 QKQ 218
+ Q
Sbjct: 225 KLQ 227
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
R +D+ ++ S ++ A LE A +AN TG +LS + R+ LN ANL NA L
Sbjct: 197 IRTILLESDLSLAELSNARLQDANLEGAKLQQANLTGINLSRLNLARVNLNRANLKNANL 256
Query: 188 VRTV---------------LTRSDLGGAIIEGADFSDA 210
+ T L R++L A + GAD +DA
Sbjct: 257 LETSFEGANLRIVNLNQANLIRANLSRASLIGADLTDA 294
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 46/84 (54%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN T ++ + + N A L+ A + +F GA+L +++ L ANL+ A L+
Sbjct: 228 QANLTGINLSRLNLARVNLNRANLKNANLLETSFEGANLRIVNLNQANLIRANLSRASLI 287
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
LT ++L GA +E A+F AV+
Sbjct: 288 GADLTDANLYGANLENAEFLGAVM 311
>gi|17227929|ref|NP_484477.1| hypothetical protein alr0433 [Nostoc sp. PCC 7120]
gi|17129778|dbj|BAB72391.1| alr0433 [Nostoc sp. PCC 7120]
Length = 143
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 44/84 (52%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A+ AD+R ++ +G+ A LE A AN GA+LS L+ NLTN L+
Sbjct: 50 QAHLIGADLRNANLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTNVKLI 109
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
L +DL GA++ AD A++
Sbjct: 110 NAELYNADLEGAVLANADLRGAIL 133
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 51/99 (51%), Gaps = 11/99 (11%)
Query: 116 ADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+L++A + + R AN A++ +D +G+ GA L + A A+ + +L++
Sbjct: 46 ANLQQAHLIGADLRNANLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTN 105
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ L A L NA L VL +DL GAI+ GA +S
Sbjct: 106 -----VKLINAELYNADLEGAVLANADLRGAILFGALYS 139
>gi|434394300|ref|YP_007129247.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
gi|428266141|gb|AFZ32087.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
Length = 213
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 46/89 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN D+ DF + F GA L A +K N +GA+L + R L +ANL++A L
Sbjct: 103 RANLKEKDLSGRDFRNANFTGANLSDAFMHKVNLSGANLFQANLFRANLLQANLSHANLR 162
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
L +DL G+ + GAD A I + +
Sbjct: 163 EANLVGADLSGSDLSGADLRGARIGVGDR 191
>gi|126455703|ref|YP_001074295.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
1106a]
gi|167896768|ref|ZP_02484170.1| pentapeptide repeat protein [Burkholderia pseudomallei 7894]
gi|242312992|ref|ZP_04812009.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
gi|254195379|ref|ZP_04901807.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
gi|126229471|gb|ABN92884.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106a]
gi|169652126|gb|EDS84819.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
gi|242136231|gb|EES22634.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
C+ + A A L+ A R E + SAA G +++ V A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579
Query: 203 EGADFS 208
E DFS
Sbjct: 580 ERTDFS 585
>gi|332706458|ref|ZP_08426519.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354342|gb|EGJ33821.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 345
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTN 184
A+F AD++E DFS A L +A ++ N GA+L + R L +ANL+N
Sbjct: 231 ADFRGADLKERDFSNRNLQSANLSQANLKDAFLHRVNLAGANLEGANLFRANLFQANLSN 290
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L L +D+ GA + GAD S A +
Sbjct: 291 ANLREANLIGADMSGADLSGADLSGAKV 318
>gi|53721218|ref|YP_110203.1| hypothetical protein BPSS0182 [Burkholderia pseudomallei K96243]
gi|167818308|ref|ZP_02449988.1| hypothetical protein Bpse9_24431 [Burkholderia pseudomallei 91]
gi|418395056|ref|ZP_12969100.1| type VI secretion system [Burkholderia pseudomallei 354a]
gi|418554994|ref|ZP_13119746.1| type VI secretion system [Burkholderia pseudomallei 354e]
gi|52211632|emb|CAH37627.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
gi|385369399|gb|EIF74730.1| type VI secretion system [Burkholderia pseudomallei 354e]
gi|385374364|gb|EIF79254.1| type VI secretion system [Burkholderia pseudomallei 354a]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
C+ + A A L+ A R E + SAA G +++ V A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579
Query: 203 EGADFS 208
E DFS
Sbjct: 580 ERTDFS 585
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 115 SADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L+D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILID 802
>gi|344171276|emb|CCA83758.1| hypothetical protein, Pentapeptide repeat domains [blood disease
bacterium R229]
Length = 325
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 47/83 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + AD+ + SG+ +GAYL A A+ +GADLS + L+ ANL+ A L
Sbjct: 54 ADLSGADLSGAYLSGAYLSGAYLSDADLSGADLSGADLSGAYLSGAYLSGANLSGADLSG 113
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+ +DL GA + GAD S A +
Sbjct: 114 ANLSGADLSGADLSGADLSGAYL 136
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A + S AD+ +D SG+ +GAYL A AN +GADL
Sbjct: 52 SGADLSGADLSGAYLSGAYLSGAYLSDADLSGADLSGADLSGAYLSGAYLSGANLSGADL 111
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + L+ A+L+ A L L+ + L GA + GAD S A
Sbjct: 112 SGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGA 154
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ +D SG+ +GAYL A A +GADLS + L+ A L+ A L
Sbjct: 114 ANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADLSGAYLSGAYLSS 173
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+ +DL GA + GA+ S A +
Sbjct: 174 ANLSGADLSGANLSGANLSGAYL 196
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 11/105 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A + S AD+ +D SG+ +GAYL A AN +GADL
Sbjct: 122 SGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADLSGAYLSGAYLSSANLSGADL 181
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S ANL+ A L L+ +DL GA + GA+ S A +
Sbjct: 182 SG----------ANLSGANLSGAYLSSADLSGANLSGANLSGAYL 216
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A + A+ + AD+ + SG+ +GAYL A A+ +GADL
Sbjct: 102 SGANLSGADLSGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADL 161
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + L+ ANL+ A L L+ ++L GA + AD S A
Sbjct: 162 SGAYLSGAYLSSANLSGADLSGANLSGANLSGAYLSSADLSGA 204
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L KAV AN + A + ++D S + +GA L A A +GA LS + L
Sbjct: 22 LMKAVEQAVKGSANLSGAYLSDADLSDADLSGADLSGADLSGAYLSGAYLSGAYLSDADL 81
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ A+L+ A L L+ + L GA + GAD S A
Sbjct: 82 SGADLSGADLSGAYLSGAYLSGANLSGADLSGA 114
Score = 37.0 bits (84), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A + A++ +D SG+ +GA L A+ +GADLS + L+ A L+ A L
Sbjct: 99 AYLSGANLSGADLSGANLSGADLS-----GADLSGADLSGAYLSGAYLSGAYLSGADLSG 153
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ +DL GA + GA S A
Sbjct: 154 ADLSGADLSGAYLSGAYLSSA 174
>gi|412993172|emb|CCO16705.1| predicted protein [Bathycoccus prasinos]
Length = 163
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 6/140 (4%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
G +A + DLRK + + + + A M +S F S F+ + K A KA+F
Sbjct: 29 IGQANAVSDKTLDLRKCQYDNVSVKGITLSGALMVDSVFDNSDFSETVMSKVYATKASFK 88
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
+ ++ ++DR + +++T A VLT GA + GA+F +A+I + LC
Sbjct: 89 NVNFTNAVIDRATFDGSDMTGANFQNAVLTGVSYEGANLTGANFEEALIGDQDVKLLC-- 146
Query: 224 ANGTNPITGVSTRKSLGCGN 243
NP +R +GC N
Sbjct: 147 ---LNPTVVDESRMQIGCKN 163
>gi|428214427|ref|YP_007087571.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002808|gb|AFY83651.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 155
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ S D+ ++D SG + A L A +AN TGA+LS + L EANLT+A L T
Sbjct: 61 DLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANLQNT 120
Query: 191 VLTRSDLGGAI-----IEGADFSDAVID 213
T +DL AI + GADF+ A +D
Sbjct: 121 NFTSADLEDAILTNANVTGADFTGADLD 148
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 12/109 (11%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S S DL KA + AN T+AD+ E++ +G+ + A L A +AN T A+L
Sbjct: 58 SGCDLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANL 117
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+T N T+A L +LT +++ GA GAD D+VI L +
Sbjct: 118 QNT----------NFTSADLEDAILTNANVTGADFTGADL-DSVIGLTR 155
>gi|304404631|ref|ZP_07386292.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
gi|304346438|gb|EFM12271.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
Length = 288
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 48/86 (55%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ F + + SDFSG+ G+ + + +ANF GA+L+D + L +A+ ++LV
Sbjct: 100 KGQFKGSALHGSDFSGADLTGSSFKGSDVREANFDGANLTDCSFTALDLTKASFNKSILV 159
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
RT ++S L GA +G +D V+ L
Sbjct: 160 RTNFSKSGLDGAAFKGVKLTDVVLTL 185
>gi|452966664|gb|EME71673.1| putative low-complexity protein [Magnetospirillum sp. SO-1]
Length = 241
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 49/101 (48%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A L+ AV + F A F ADM +D S + GA L A F GA L D
Sbjct: 70 ANLSGASLKGAVFAGADLFHAIFDEADMTGADLSDTYLFGANLIATRLVGAEFKGAFLKD 129
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LM+R L++A + ++R V + L GA + GAD + A
Sbjct: 130 VLMERADLSQAKMAGVYMLRGVFEEAKLAGADLSGADMTGA 170
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 49/103 (47%), Gaps = 6/103 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A + DLR A F +AN A++ + G+ F GA L A+ +A+ TGADL
Sbjct: 43 SGAMLENVDLRGARLDGARFAKANLKWANLSGASLKGAVFAGADLFHAIFDEADMTGADL 102
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
SDT L ANL LV + L ++E AD S A
Sbjct: 103 SDT-----YLFGANLIATRLVGAEFKGAFLKDVLMERADLSQA 140
>gi|381205548|ref|ZP_09912619.1| hypothetical protein SclubJA_07991 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 253
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 2/89 (2%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F A+ ++ ++ SG GA L + AN +G+DLS+ L +A+L+NA+L
Sbjct: 88 FSASMEGCNLENANLSGVDLQGADLSHSYLPGANLSGSDLSNANFSGATLRDADLSNAIL 147
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
T+L +DL GA + GA+ +DA DLA+
Sbjct: 148 KGTLLKEADLSGANLSGANLTDA--DLAK 174
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 5/81 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF+ A +R++D S + G L++A AN +GA+L+D L +ANL+ A L+
Sbjct: 130 ANFSGATLRDADLSNAILKGTLLKEADLSGANLSGANLTDA-----DLAKANLSPATLLG 184
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
LTR++L + A+F +A
Sbjct: 185 ATLTRTNLSDTNLVKANFEEA 205
>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 710
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 60/127 (47%), Gaps = 15/127 (11%)
Query: 91 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFS 143
L + N ++A T F + A GSADL KA + N F +D+RES++
Sbjct: 534 LIETNLHQANLTEATF---TGADLGSADLSKANLYRANLSKVKAEGTTFQLSDLRESNWQ 590
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
G+ +GA +AN ADLS L+ A L NA L T ++ +DL GA +
Sbjct: 591 GANLSGANFS-----RANLKKADLSLALLTNANFRNAQLQNANLRNTDISLADLRGANLS 645
Query: 204 GADFSDA 210
G DF A
Sbjct: 646 GTDFKGA 652
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 14/107 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ A F SA L N A++ E+ F+G+ A L KA Y+AN +
Sbjct: 525 TQANFSSAKL---------IETNLHQANLTEATFTGADLGSADLSKANLYRANLSKVKAE 575
Query: 169 DTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T L E ANL+ A R L ++DL A++ A+F +A
Sbjct: 576 GTTFQLSDLRESNWQGANLSGANFSRANLKKADLSLALLTNANFRNA 622
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 44/93 (47%), Gaps = 10/93 (10%)
Query: 128 FRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
FRA + A M +++FS +K L +A +A FTGADL + + L ANL
Sbjct: 510 FRATLSKAIMPGSTITQANFSSAKLIETNLHQANLTEATFTGADLGSADLSKANLYRANL 569
Query: 183 TNAVLVRTVLTRSDL-----GGAIIEGADFSDA 210
+ T SDL GA + GA+FS A
Sbjct: 570 SKVKAEGTTFQLSDLRESNWQGANLSGANFSRA 602
>gi|86606624|ref|YP_475387.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555166|gb|ABD00124.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 371
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 65/152 (42%), Gaps = 15/152 (9%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGI----GSAAQFGSADLRKAVHVKENFR-- 129
L A V+ S S L++ + E R + + G F DL KA R
Sbjct: 204 LRGAKVSGTSLRGSRLSEETRLEERLRHIWQLQNWGGQGQDFSGQDLSKADLRGLGLRQI 263
Query: 130 ----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 180
AN D+R S+ G+ GA L++A AN GADL + + L A
Sbjct: 264 RLRGANLKRVDLRGSNLEGADLRGANLQRADLRGANLQNADLEGADLGGAELRQAQLQGA 323
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
NL A L R LT+++L GA IEG S + I
Sbjct: 324 NLRRADLSRANLTQANLEGAQIEGLKHSGSQI 355
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 5/100 (5%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A F A+LRKA NF A+ AD+R+++ G+K +GA L+ A A+ GA +S
Sbjct: 151 GANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGAVLQGADLRGADLRGAKVS 210
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
T + L+E L R + + GG +G DFS
Sbjct: 211 GTSLRGSRLSEETRLEERL-RHIWQLQNWGG---QGQDFS 246
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 37/79 (46%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ D++E+ G+ F A L KA NF GA L L +ANL A L
Sbjct: 137 ADLEGVDLQEARLGGANFYEANLRKANLGLCNFNGAHLHQA-----DLRQANLQGAKLSG 191
Query: 190 TVLTRSDLGGAIIEGADFS 208
VL +DL GA + GA S
Sbjct: 192 AVLQGADLRGADLRGAKVS 210
>gi|428319029|ref|YP_007116911.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428242709|gb|AFZ08495.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 520
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 2/121 (1%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY 151
L KY A R GI + A +L A N AN + A++ +++ +G+K N A
Sbjct: 7 LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKTNLTGAKLNIAR 66
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
L A A+ T ADL+ + R+ L +A L A L+R L R++L GA + GA+ S A
Sbjct: 67 LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126
Query: 212 I 212
+
Sbjct: 127 L 127
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 1/95 (1%)
Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DL+KA+ + RA A++ ++ SG+ +GA L +A AN A+L +
Sbjct: 91 DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L EANL A L L+R+DL GA + G + A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L A + R AN A++R + SG+ A LE+A A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 210
S + L +ANLT AVL L+ +L AI+ G AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 10/94 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV----------LN 178
+AN AD+ +D SG+ G L +A +A +GADLS + + L+
Sbjct: 159 QANLQGADLSRADLSGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLS 218
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
EA L+ A L R L ++L A + AD S+A +
Sbjct: 219 EAKLSGADLSRADLCHANLLNASLVHADLSNAYL 252
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 4/123 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADLR E +AN T A + +D SG A L A+ + A LS
Sbjct: 168 SRADLSGADLRGT----ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKLS 223
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+ R L ANL NA LV L+ + L A GAD + A + A+ A+ + T
Sbjct: 224 GADLSRADLCHANLLNASLVHADLSNAYLIRADWIGADLTGATLTGAKLHAVSRLGIKTE 283
Query: 229 PIT 231
+T
Sbjct: 284 GMT 286
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 15/85 (17%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF ++ E++ SG +GA L+ A AN +GA+LS T NLT A L
Sbjct: 16 NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKT----------NLTGAKL--- 62
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
+ L GA + GAD +DA +++A
Sbjct: 63 --NIARLSGAHLGGADLTDADLNVA 85
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A G ADL A ++V A D++++ G+K A L +A AN +GA+L
Sbjct: 68 SGAHLGGADLTDADLNV-----AYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122
Query: 168 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + L +ANL A L LT ++L A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170
>gi|359457996|ref|ZP_09246559.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 464
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 66/129 (51%), Gaps = 21/129 (16%)
Query: 111 AQFGSADLR----KAVHVKEN------------FRANFTSADMRESDFSGSKFNGAYLEK 154
A+ G ADLR K ++KE RA+ AD+RE++ S ++ + LEK
Sbjct: 36 AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95
Query: 155 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A+ ++AN + A L+ + ++ L +ANL+ A L L R++LG A + A+ +
Sbjct: 96 SQLGAAILFRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155
Query: 210 AVIDLAQKQ 218
A + A+ Q
Sbjct: 156 ANLSQARLQ 164
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT---- 183
FRAN + A + SD ++ A L +A +AN A+L +++ L ANL+
Sbjct: 104 FRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163
Query: 184 -NAVLVRTVLTRSDLGGAIIEGADF 207
NA LV T L ++L GA ++GA+
Sbjct: 164 QNASLVGTQLINANLEGASLKGANL 188
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+A AD+R ++ GA L++A A GADL + + L EANL++A L
Sbjct: 35 KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
+ L +S LG AI+ A+ S A + L+
Sbjct: 90 LSNLEKSQLGAAILFRANLSQAQLTLS 116
>gi|119489371|ref|ZP_01622151.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
gi|119454644|gb|EAW35790.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
Length = 166
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 9/109 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A+ +DL V++K AN + A + +FS + +GA L A +ANFT A+LS
Sbjct: 55 SGAKLNGSDLS-GVNLK---GANLSGALLDNVNFSQADLSGANLSSAALTQANFTEANLS 110
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDAVI 212
+ + L A LTNA L L ++DL GA I+GADF +A++
Sbjct: 111 EANLTGAFLRSAILTNAKLTNASLNKADLNTAKLEGAEIKGADFKEAIM 159
>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris sp. CCMEE 5410]
Length = 514
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 21/116 (18%)
Query: 112 QFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+F + DLR A+ + NF RANFT A++R ++ AY+ A A+ GA+LSD
Sbjct: 411 KFQNTDLRDAILINANFGRANFTGANLRNANLMQ-----AYMSHADLANADLRGANLSDA 465
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
L+ ANL A +L GA + GA S++ + AQ L Y NG
Sbjct: 466 -----YLSHANLRGA----------NLCGADLSGAKLSESQLSFAQTNWLTVYPNG 506
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 21/104 (20%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F DLR N R SA+ E F + A L A +ANFTGA
Sbjct: 387 FSGQDLRNL-----NLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGA------ 435
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
NL NA L++ ++ +DL A + GA+ SDA + A
Sbjct: 436 ---------NLRNANLMQAYMSHADLANADLRGANLSDAYLSHA 470
>gi|428203771|ref|YP_007082360.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981203|gb|AFY78803.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 180
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 52/97 (53%), Gaps = 1/97 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADL KA V N N AD+ ++ SG+ GA L A + AN + A+L +
Sbjct: 60 ANLTDADLIKANLVGANLIEINLIGADLTSANLSGADLTGADLRCANLHNANLSQANLRE 119
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
+D L+ ANL+ A+LV T L+ +D GA ++G D
Sbjct: 120 VHLDGADLSGANLSGAILVNTDLSVADTVGAKLDGID 156
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF ++ ++ G KF G L +A+ +GADLS+T + L +ANLT+A L++
Sbjct: 16 NFEEVNLHIANLQGLKFQGINL-----TRADLSGADLSETDLSGACLKQANLTDADLIKA 70
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L ++L + GAD + A
Sbjct: 71 NLVGANLIEINLIGADLTSA 90
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 46/99 (46%), Gaps = 15/99 (15%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT----- 183
RA+ + AD+ E+D SG+ A L A KAN GA+L + + L ANL+
Sbjct: 39 RADLSGADLSETDLSGACLKQANLTDADLIKANLVGANLIEINLIGADLTSANLSGADLT 98
Query: 184 ----------NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
NA L + L L GA + GA+ S A++
Sbjct: 99 GADLRCANLHNANLSQANLREVHLDGADLSGANLSGAIL 137
>gi|254189534|ref|ZP_04896044.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
gi|157937212|gb|EDO92882.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
Length = 825
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 62/126 (49%), Gaps = 16/126 (12%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDF 142
C+ + A A L+ A R E + SAA G +++ V A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----QSLQV-----ADLTGADLSGMDL 524
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 525 RGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAHC 579
Query: 203 EGADFS 208
E DFS
Sbjct: 580 ERTDFS 585
>gi|119487879|ref|ZP_01621376.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
gi|119455455|gb|EAW36593.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
Length = 514
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 72/146 (49%), Gaps = 16/146 (10%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ---FGSADLRKAVHVKE--NFR- 129
L A+++ + + S LAD N +A+ G G+ + A L + H++E N R
Sbjct: 265 LKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLRE 324
Query: 130 -----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN T A++RE + G+ A L++A+ GA+L D + R L EA L +
Sbjct: 325 ANLKGANLTRANLREVNLQGANLQQANLQQAI-----LQGANLKDANLIRANLREAKLQD 379
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
A L R L R++L A + A+ S+A
Sbjct: 380 AKLQRVNLERANLQAANLTDANLSNA 405
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 22/126 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYK--- 159
S A ADL+ A + NF+ AN A+++ +DF G+ ++L++A + +
Sbjct: 185 SGANLQGADLQGANLHETNFQGANLAGANLGGANLKCTDFQGTNLQESHLKQAYSVRKAK 244
Query: 160 -------------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
N GA+L ++ + L+E+NL +A L + L ++L GA ++G +
Sbjct: 245 FAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTN 304
Query: 207 FSDAVI 212
S A +
Sbjct: 305 LSQAYL 310
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L++A+ N + AN A++RE+ +K LE+A AN T A+LS+
Sbjct: 345 ANLQQANLQQAILQGANLKDANLIRANLREAKLQDAKLQRVNLERANLQAANLTDANLSN 404
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
ANLT+A L T L ++ A++ DF
Sbjct: 405 ----------ANLTDASLCDTCLNQTQFYQAVLIRVDF 432
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 58/113 (51%), Gaps = 16/113 (14%)
Query: 113 FGSADLRKAVHVKENF---RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
F +L+++ H+K+ + +A F A++ DF G GA L++A+ + N + ++L+D
Sbjct: 224 FQGTNLQES-HLKQAYSVRKAKFAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLAD 282
Query: 170 TLMDR----------MVLNEANLTNAVLVRTVLTRS--DLGGAIIEGADFSDA 210
+++ L NL+ A LVRT R +L A ++GA+ + A
Sbjct: 283 ANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLREANLKGANLTRA 335
>gi|428216913|ref|YP_007101378.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988695|gb|AFY68950.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 227
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 57/98 (58%), Gaps = 4/98 (4%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ ++ ++ +D S S + A L + ANF+ A LS+ + + LN+ANL++A+L
Sbjct: 43 ADLSAGNLNHADLSNSDLSRANLYRCSLKHANFSAAKLSNANLKDVQLNDANLSDAILSC 102
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 227
L +DL GAI+ GAD S A DL + LC +AN T
Sbjct: 103 ANLAEADLSGAILVGADLSGA--DLTNAE-LC-HANLT 136
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 17/178 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
SA ADL + + N ANF++A + ++ + N A L A+ AN
Sbjct: 46 SAGNLNHADLSNSDLSRANLYRCSLKHANFSAAKLSNANLKDVQLNDANLSDAILSCANL 105
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQK 217
ADLS ++ L+ A+LTNA L LT ++L G ++ GA+F++A ++ AQ
Sbjct: 106 AEADLSGAILVGADLSGADLTNAELCHANLTGANLEGVLLHNANLTGANFTNANMENAQL 165
Query: 218 QALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKL--LDRDGFCDS 273
A+ TN +T ++ NS A ++ L Q L+ CD+
Sbjct: 166 DG----ADLTNANLSGTTLHNVNLANSNLQAVNLTNADLRGVNLQHTHNLETANLCDA 219
>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 521
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 52/99 (52%), Gaps = 1/99 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G ADL A N RANF A ++E+D + + +GA+L A AN +GA LS
Sbjct: 88 AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLSGALLSR 147
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
++ L+ ANL NA L L +DL A ++ AD S
Sbjct: 148 ANLENADLSYANLENADLSYANLENADLSHANLKNADLS 186
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%), Gaps = 5/84 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ SA++R ++ + FN A+L++A AN +GA L +LN ANL+ A+L R
Sbjct: 93 ADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGA----NLLN-ANLSGALLSR 147
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
L +DL A +E AD S A ++
Sbjct: 148 ANLENADLSYANLENADLSYANLE 171
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 7/109 (6%)
Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF A +SA + ++ G N AYL A Y AN GA+L + L EA+LTNA
Sbjct: 64 NFEIAYLSSAKLSCANLEGINLNRAYLGGADLYSANLRGANLIRANFNDAHLKEADLTNA 123
Query: 186 VL----VRTV-LTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTN 228
L +R L ++L GA++ A+ +A + A + A YAN N
Sbjct: 124 NLSGAHLRGANLLNANLSGALLSRANLENADLSYANLENADLSYANLEN 172
>gi|119356056|ref|YP_910700.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
DSM 266]
gi|119353405|gb|ABL64276.1| pentapeptide repeat protein [Chlorobium phaeobacteroides DSM 266]
Length = 446
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 3/99 (3%)
Query: 109 SAAQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
S A +ADLR + ++++ F +A+ AD+RE+ A++EK++ KAN A+
Sbjct: 82 SGANLNNADLRGS-NLQQAFIKKADLKGADLREAYLVKVNLKEAFMEKSMLQKANLQSAN 140
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L T R L +NL +AVL T +DL GA ++GA
Sbjct: 141 LRWTRFHRADLAGSNLQDAVLFETSFVDADLRGANLKGA 179
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTG 164
A F A+L +A+ + +A+F ADM++ G+ +GA ++E A AN +G
Sbjct: 307 ADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDRSFMEGADLRNANLSG 366
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A+L ++ L+ ANL+ A L T L ++L GA ++GA+
Sbjct: 367 ANLFGAMLKDANLSGANLSGASLFETDLEGANLSGANLKGANL 409
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+ +L+KA +F AN A M +D S + F A ++K AN +GA+L
Sbjct: 292 ARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDR 351
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
+ M+ L ANL+ A L +L ++L GA + GA
Sbjct: 352 SFMEGADLRNANLSGANLFGAMLKDANLSGANLSGA 387
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 15/91 (16%)
Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGAD----------LSDTLMDR 174
AN +++ + ++ SG+ N G+ L++A KA+ GAD L + M++
Sbjct: 69 ANLSNSSLVRAELSGANLNNADLRGSNLQQAFIKKADLKGADLREAYLVKVNLKEAFMEK 128
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
+L +ANL +A L T R+DL G+ ++ A
Sbjct: 129 SMLQKANLQSANLRWTRFHRADLAGSNLQDA 159
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 8/90 (8%)
Query: 126 ENFR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
EN R N A M +DF + + A +E A KA+F AD M ++ L ANL
Sbjct: 290 ENARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKAD-----MKKVKLQGANL 344
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ A L R+ + +DL A + GA+ A++
Sbjct: 345 SGANLDRSFMEGADLRNANLSGANLFGAML 374
>gi|428313290|ref|YP_007124267.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254902|gb|AFZ20861.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 283
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++R + G+ A L+ + AN TGA+LS + LNEANL A L+
Sbjct: 34 ANLIGVNLRGAHLQGTNLRKALLDHTLLIAANLTGANLSQANLSHASLNEANLVEACLID 93
Query: 190 TVLTRSDLGGAIIEGADF 207
T L +DL A + GA+
Sbjct: 94 TTLISADLSHAELTGANL 111
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 58/113 (51%), Gaps = 11/113 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQ +L +A V+ N N ++A++ E++ G+ YL KA KAN + A L
Sbjct: 147 SGAQLLRTNLSEAKLVQANLSHTNLSNANLHEAELIGT-----YLYKAELQKANLSEAHL 201
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDAVIDLA 215
S + R L EA+L A L L+RS DL GA + GA+ S A ++ A
Sbjct: 202 SGAYLSRANLREADLERADLRWANLSRSNLCEADLKGANLRGANLSKANLERA 254
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 10/90 (11%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMDRMVLNEANLTNAV 186
SAD+ ++ +G+ GA L Y AN G DLSD T + R+ L A+L+ A
Sbjct: 96 LISADLSHAELTGANLIGADL-----YGANLKGVDLSDANLIGTNLRRVNLQGADLSGAQ 150
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L+RT L+ + L A + + S+A + A+
Sbjct: 151 LLRTNLSEAKLVQANLSHTNLSNANLHEAE 180
>gi|354569053|ref|ZP_08988212.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353539057|gb|EHC08553.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 519
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 2/117 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A +A++R+A NF AN + A++R +D +G+ + A L +A AN GADL
Sbjct: 173 SGANCRNAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSEAKLSGANLIGADL 232
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 223
S+ + L A+LT A L++ +DL GA + GA +S + L + +C++
Sbjct: 233 SNANLTNASLVHADLTQAKLIKAEWVGADLSGATLTGAKLYSTSRFGLKTEGMICEW 289
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 71/147 (48%), Gaps = 27/147 (18%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKA--VHVKENFRANFTSADMR----------ESD 141
L KYEA R F S DL +A VK N ANF+ A++ +D
Sbjct: 7 LAKYEAGER---------DFRSVDLSEANLSGVKLN-EANFSHANLSIVNLSGSHLCGTD 56
Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
FS ++ N A L A ++AN A L+ + R L+ A L +A L+R L R+DL A
Sbjct: 57 FSHAQINVARLSGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRAD 116
Query: 202 IEGADFSDAVIDLAQ---KQALCKYAN 225
+ A+ + A DL + + A+ +YAN
Sbjct: 117 LFAANLNCA--DLREASLRHAILRYAN 141
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+LR ++ + N AN + D+ +D SG+ A + +A +NF+GA+LS
Sbjct: 140 ANLNEANLRDSLLTEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSG 199
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQAL 220
+ LN ANL+ A L L+ ++L GA + A+ ++A + DL Q + +
Sbjct: 200 ANLRWADLNGANLSWADLSEAKLSGANLIGADLSNANLTNASLVHADLTQAKLI 253
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 11/110 (10%)
Query: 109 SAAQFGSADLRKAVHVKEN------FRANFTSADMRESDFSG-----SKFNGAYLEKAVA 157
S AQ SA L +A ++ + F AN AD+RE+ + N A L ++
Sbjct: 93 SRAQLQSASLIRAELIRADLSRADLFAANLNCADLREASLRHAILRYANLNEANLRDSLL 152
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+AN GA+L++T + R + AN NA + + L+ S+ GA + GA+
Sbjct: 153 TEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSGANL 202
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 1/101 (0%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L A ++V RA+ + A ++ + ++ A L +A + AN ADL
Sbjct: 68 SGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRADLFAANLNCADL 127
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ + +L ANL A L ++LT ++L GA + D S
Sbjct: 128 REASLRHAILRYANLNEANLRDSLLTEANLEGANLNNTDLS 168
>gi|119488080|ref|ZP_01621524.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
gi|119455369|gb|EAW36508.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
Length = 351
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
Query: 118 LRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
LR+ + NF A +A++ + S + GA L K AN +GADLS+
Sbjct: 13 LRRYAKGERNFSEINLMAAQLNAANLNRVNLSYANLTGANLSKTRLICANLSGADLSNAN 72
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + +L EA L A L +T+L +++L GA++ G+ S+A
Sbjct: 73 LSQAILIEATLNGASLTQTLLVQANLSGALLSGSILSEA 111
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 48/87 (55%), Gaps = 6/87 (6%)
Query: 130 ANFTSADM-RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANLT 183
AN T A + S +GSK A L A +A + DLS + R +L+E ANL+
Sbjct: 116 ANLTGASLIGTSLLNGSKLIEATLIGATLSRATLSAIDLSGVNLTRAILSESELGGANLS 175
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A L+R L RS+L GA + GAD S+A
Sbjct: 176 SACLIRAYLNRSNLSGANLMGADLSEA 202
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 58/103 (56%), Gaps = 14/103 (13%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 164
AAQ +A+L + V++ AN T A++ ++ + SG+ + A L +A+ +A G
Sbjct: 30 AAQLNAANLNR-VNLS---YANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNG 85
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A L+ TL+ +ANL+ A+L ++L+ +DL GA + GA
Sbjct: 86 ASLTQTLLV-----QANLSGALLSGSILSEADLSGANLTGASL 123
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N T A + ES+ G+ + A L +A ++N +GA+L L+EA+L NA L
Sbjct: 158 NLTRAILSESELGGANLSSACLIRAYLNRSNLSGANLMGA-----DLSEASLCNANLCVA 212
Query: 191 VLTRSDLGGAIIEGADFSDA 210
LTR++L GA +EGA+ + A
Sbjct: 213 NLTRANLQGADLEGANLNGA 232
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL +A N AN T A+++ +D G+ NGA L A N A+L
Sbjct: 190 SGANLMGADLSEASLCNANLCVANLTRANLQGADLEGANLNGAQLSGANLKSTNLKNANL 249
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ ++ L A+L+ A L LT ++L GA + AD A
Sbjct: 250 NGLILHEADLRLADLSQANLRGANLTGANLAGASLLEADLRGA 292
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 56/106 (52%), Gaps = 2/106 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L K + N A+ ++A++ ++ + NGA L + + +AN +GA L
Sbjct: 44 SYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGASLTQTLLVQANLSGALL 103
Query: 168 SDTLMDRMVLNEANLTNAVLVRT-VLTRSDLGGAIIEGADFSDAVI 212
S +++ L+ ANLT A L+ T +L S L A + GA S A +
Sbjct: 104 SGSILSEADLSGANLTGASLIGTSLLNGSKLIEATLIGATLSRATL 149
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 8/111 (7%)
Query: 107 IGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
I S ++ G A+L A ++ R+N + A++ +D S + A L A +AN GA
Sbjct: 163 ILSESELGGANLSSACLIRAYLNRSNLSGANLMGADLSEASLCNANLCVANLTRANLQGA 222
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
DL + LN A L+ A L T L ++L G I+ AD A DL+Q
Sbjct: 223 DL-----EGANLNGAQLSGANLKSTNLKNANLNGLILHEADLRLA--DLSQ 266
>gi|86608529|ref|YP_477291.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557071|gb|ABD02028.1| pentapeptide repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 248
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 15/128 (11%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLNEA---- 180
+NFT+A + +S F G +F+ + +A AN F AD + R L++A
Sbjct: 109 SNFTAAKLDKSSFQGGRFSHSIFREASLVAANLAEGNFFAADFRQANLSRCNLSQAALVS 168
Query: 181 ------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
NL A+LV L + + + GADF+DA + ++ L + A+GTN +T
Sbjct: 169 CQLQFANLEQAILVGANLRDAQIEDTLFSGADFTDAKLSDETRKLLIERASGTNELTQRD 228
Query: 235 TRKSLGCG 242
T +L G
Sbjct: 229 TLNTLLAG 236
>gi|357146891|ref|XP_003574148.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Brachypodium distachyon]
Length = 227
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 8/128 (6%)
Query: 117 DLRKAVHVKE--NFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
DLR + E N + ++A M ++ F G+ + KA A A+F G D ++ ++D
Sbjct: 104 DLRFCDYTNEKNNLKGKTLSAALMSDAKFDGADLTEVVMSKAYAVGASFKGTDFTNAVID 163
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 233
R +A+L A+ TVL+ S A ++ F D +I Q LC+ N
Sbjct: 164 RANFGKADLEGAIFKNTVLSGSTFDDANMKDVVFEDTIIGYIDLQKLCR-----NMSINE 218
Query: 234 STRKSLGC 241
R LGC
Sbjct: 219 DARLDLGC 226
>gi|313681545|ref|YP_004059283.1| pentapeptide repeat-containing protein [Sulfuricurvum kujiense DSM
16994]
gi|313154405|gb|ADR33083.1| pentapeptide repeat protein [Sulfuricurvum kujiense DSM 16994]
Length = 198
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 57/104 (54%), Gaps = 6/104 (5%)
Query: 105 FGIG-SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
F IG S + + +A L+KA+ KE + AD+ ++DFSG F+G+ L +A +++ F
Sbjct: 8 FWIGVSLSAYDAAHLKKALEDKECIGCDLRGADLSQNDFSGGDFHGSDLSEADLHESIFE 67
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
DLSD L+ AN NA+ + + R+DL GA+F
Sbjct: 68 MGDLSDC-----NLSGANAENALFWKGTMERADLTRIHARGANF 106
>gi|158340059|ref|YP_001521229.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310300|gb|ABW31915.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 483
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 54/100 (54%), Gaps = 1/100 (1%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+ RKA + + + +A+ A + ++D SG+ F+GAYL KA A GADLS
Sbjct: 317 AHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANLSSAFLIGADLSR 376
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ ++L ANL +A L L+ +DL AI+ D +
Sbjct: 377 ANLSDVILRGANLLSANLSDASLSSADLNNAILLNTDLRE 416
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
+N A++R + SG+ F A L KA A+ ADLS L +ANL++
Sbjct: 307 SNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANLSS 366
Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
A L+ L+R++L I+ GA+
Sbjct: 367 AFLIGADLSRANLSDVILRGANL 389
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ S+ + A+L A KAN + AD+S + LN+A+L+ A
Sbjct: 297 ANLGGANLSYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSG 356
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L +++L A + GAD S A
Sbjct: 357 AYLYKANLSSAFLIGADLSRA 377
>gi|428221053|ref|YP_007105223.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994393|gb|AFY73088.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 270
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 55/98 (56%), Gaps = 9/98 (9%)
Query: 115 SADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
+ADLR+A N T+AD+ ++ + +GA L A AN + A+L D L+ +
Sbjct: 137 NADLRQA---------NLTNADLIYANLKNANLSGANLSGANLSGANLSDANLEDALLHK 187
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L+ ANL +A T+L R++L GA + GA F +A++
Sbjct: 188 AKLSNANLKSANFSGTILVRANLIGADLTGAIFKEAIL 225
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 11/117 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
A G + IG A DL + V N R S ++ ++D G+ L A
Sbjct: 63 ANLMGAYLIG--ANLSHVDLSGSNLVGANLR----SINLNDTDLKGADLRETILRNARMA 116
Query: 159 KANFTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ N TG++LS+ ++ L +ANLTNA L+ L ++L GA + GA+ S A
Sbjct: 117 RVNLTGSNLSNADLVYVNLENADLRQANLTNADLIYANLKNANLSGANLSGANLSGA 173
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 11/103 (10%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L A + N + AN + A++ ++ SG+ + A LE A+ +KA + A+L
Sbjct: 138 ADLRQANLTNADLIYANLKNANLSGANLSGANLSGANLSDANLEDALLHKAKLSNANLK- 196
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
AN + +LVR L +DL GAI + A A +
Sbjct: 197 ---------SANFSGTILVRANLIGADLTGAIFKEAILVHATM 230
>gi|424513094|emb|CCO66678.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 140
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 63/130 (48%), Gaps = 21/130 (16%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H ++ + FT ++ ++F G+ +G L A +A+FTGA+L + ANL
Sbjct: 20 HDQDLTQTYFTKGSLKRANFRGANLSGISLFGANLEEADFTGANLEN----------ANL 69
Query: 183 TNAVLVRTVLTRSDLGGAIIEGA-----------DFSDAVIDLAQKQALCKYANGTNPIT 231
L++T T ++L AI+ GA D+S +I +C A+G +P++
Sbjct: 70 GQCNLLKTNFTGANLTNAIVSGASNLETVKANDSDWSQVIIRKDVLMGICANADGVSPVS 129
Query: 232 GVSTRKSLGC 241
G T+ +L C
Sbjct: 130 GDPTKMTLEC 139
>gi|186685487|ref|YP_001868683.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186467939|gb|ACC83740.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 146
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 10/106 (9%)
Query: 115 SADLRKAVHVKE---------NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
SA +R+ + +E N + A+ D+R ++ G+ GA LE A AN
Sbjct: 28 SAPVRRLLETRECLGCNLAGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKS 87
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L++ + +LN ANLTN L + L +D+ GA++ D S A
Sbjct: 88 ANLTEAFVSDTILNNANLTNVNLSNSRLYNTDVDGAVLANIDLSGA 133
>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 479
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 11/105 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL ++ N RA+ T A +RE++ G++F GA L++A KAN GA+L
Sbjct: 60 SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANLVGANL 119
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+EANLT A L L S L GAI++ A +++ I
Sbjct: 120 ----------HEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 51/97 (52%), Gaps = 11/97 (11%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADLR + + AN + AD+RE+DF+G A AN +GADL +
Sbjct: 338 SADLRGVDLTRADLSGANLSDADLRETDFTG----------ATLLFANLSGADLRGVDLT 387
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ L+ ANLT A L + L R +L GA + AD SDA
Sbjct: 388 KADLSGANLTEADLRKADLMRVNLEGADLTEADLSDA 424
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ ES + + A L AV +AN GA+ + + + L +ANL A L
Sbjct: 62 ANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANLVGANLHE 121
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
LTR++L GA + G+ S A++D
Sbjct: 122 ANLTRANLSGADLRGSQLSGAILD 145
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+N +S +++ DFS + AYL+ A + + GADLS +L++ NL++A L
Sbjct: 289 SNLSSVNLKNVDFSRASLKKAYLKGANLEQTDLRGADLSGA-----ILHQVNLSSADLRG 343
Query: 190 TVLTRSDLGGAIIEGADFSD 209
LTR+DL GA + AD +
Sbjct: 344 VDLTRADLSGANLSDADLRE 363
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 68/159 (42%), Gaps = 37/159 (23%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAV-------- 156
A+F A+L++A +K N AN T A++ +D GS+ +GA L+KAV
Sbjct: 97 AEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFP 156
Query: 157 ------AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
A A N DL++ + L NL A+L L R++L
Sbjct: 157 EDIDPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLA 216
Query: 199 GAIIEGADFSDAVID--LAQKQALCK---YANGTNPITG 232
GA + AD +A + + +K K ++ G +P G
Sbjct: 217 GANLSAADLREASLSGTILEKAVYSKKTLFSEGIDPALG 255
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 16/103 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 162
S A ADLR+ AN + AD+R ++D SG+ A L KA + N
Sbjct: 352 SGANLSDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGANLTEADLRKADLMRVNL 411
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
GADL+ EA+L++A L R L ++L G ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444
>gi|428307284|ref|YP_007144109.1| serine/threonine protein kinase with pentapeptide repeats
[Crinalium epipsammum PCC 9333]
gi|428248819|gb|AFZ14599.1| serine/threonine protein kinase with pentapeptide repeats
[Crinalium epipsammum PCC 9333]
Length = 564
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 51/85 (60%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N ++ ++++++ SG F+ A L + NF GA+LS+T M + L+ A L +A LVR
Sbjct: 444 NLSNLNLQKANLSGGNFHQANLTQT-----NFQGANLSNTDMGQTSLSGAMLRDANLVRA 498
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
L+ +DL GA + GAD S A + A
Sbjct: 499 YLSYADLEGADLRGADLSFAYFNYA 523
>gi|428215789|ref|YP_007088933.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004170|gb|AFY85013.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 222
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTR 194
++ S + +GA L+ A AN +GA+LS+ M + L+EANLT NA L ++++
Sbjct: 68 ANLSNANLSGALLKDAKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENALMSK 127
Query: 195 SDLGGAIIEGADFSDAVI 212
DL GA + GAD DA+I
Sbjct: 128 VDLTGADLTGADLIDAII 145
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 51/84 (60%), Gaps = 5/84 (5%)
Query: 130 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN ++A+M E++ +G+ + A LE A+ K + TGADL+ + ++++ANL+N
Sbjct: 93 ANLSNAEMSGITLSEANLTGANLSNAELENALMSKVDLTGADLTGADLIDAIISDANLSN 152
Query: 185 AVLVRTVLTRSDLGGAIIEGADFS 208
A + + L ++ L + + GADFS
Sbjct: 153 ASVTQAQLKKAILSRSNLSGADFS 176
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 53/92 (57%), Gaps = 6/92 (6%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
A ++ ++ SG+ + A + +AN TGA+LS+ ++ ++++ +LT A LT
Sbjct: 83 AKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENALMSKVDLTGA-----DLTG 137
Query: 195 SDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
+DL AII A+ S+A + AQ K+A+ +N
Sbjct: 138 ADLIDAIISDANLSNASVTQAQLKKAILSRSN 169
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 5/81 (6%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+ D+ D GS NGA L A N +GA L D + L+ ANL+NA +
Sbjct: 50 LSGVDLSGKDLYGSALNGANLSNA-----NLSGALLKDAKLQTANLSGANLSNAEMSGIT 104
Query: 192 LTRSDLGGAIIEGADFSDAVI 212
L+ ++L GA + A+ +A++
Sbjct: 105 LSEANLTGANLSNAELENALM 125
>gi|376002766|ref|ZP_09780588.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
gi|375328822|emb|CCE16341.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
Length = 529
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 59/107 (55%), Gaps = 14/107 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
+ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 44 QANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 103
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+ANL +A L+R L R++L AI+ GA+ ++A DL ++A ++A+
Sbjct: 104 QANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 146
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 26 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 80
Query: 201 IIEGADFSDAVIDLA 215
I++GA+ ++AV+++A
Sbjct: 81 ILQGANLNEAVLNVA 95
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 64/129 (49%), Gaps = 13/129 (10%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
TR + + S A A+L +AV +V RA+ + A++ ++ ++ A L +A+
Sbjct: 68 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 127
Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 207
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 128 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 187
Query: 208 SDAVIDLAQ 216
+A + A+
Sbjct: 188 RNAELRQAE 196
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN T AD+RE+ D + +GA L +A +N ++L+ + R L NL N
Sbjct: 130 ANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNLRN 189
Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
A L + L +DL GA + GA+
Sbjct: 190 AELRQAELNGADLRGANLSGANL 212
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 1/91 (1%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 155 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 214
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 215 ANLSGANLSGANLEATQLSGASLRGANLSGA 245
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 175 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 234
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+ A L+ +DL A + D++DA
Sbjct: 235 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 270
>gi|354567192|ref|ZP_08986362.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353543493|gb|EHC12951.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 206
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 47/88 (53%), Gaps = 5/88 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R NF AD+R ++ SG+ GA L +A + NF ADLS + L +ANL A L
Sbjct: 42 RINFKGADLRSANLSGAILTGANLREANLQQVNFCDADLS-----QADLTQANLCGACLW 96
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
R L+ S L GA + AD +A + AQ
Sbjct: 97 RVQLSDSQLWGASLCNADLREADLSAAQ 124
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 47/81 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ +AD+RE+D S ++ A L +A +AN T A L +++ N+ANLTNA L
Sbjct: 108 ASLCNADLREADLSAAQLIEASLVEANLVRANLTKAKLCGSVLIEANFNQANLTNADLKW 167
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T L ++ A +E A+F +A
Sbjct: 168 TNLMAANFSEANLENANFKNA 188
>gi|428309179|ref|YP_007120156.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250791|gb|AFZ16750.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 303
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 6/106 (5%)
Query: 113 FGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F + L +AV + +F +F AD+RE+DF+ F+ A L +A AN A
Sbjct: 136 FWRSHLMRAVLRRVDFHEAILQETSFRQADLREADFTRVYFSEASLSEANLRGANLDQAL 195
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ T R L +A+L A L R V ++DL GA +GA AV
Sbjct: 196 VKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 15/100 (15%)
Query: 129 RANFTSADMRESDFSGSKF----------NGAYLEKAVAYK-----ANFTGADLSDTLMD 173
RAN + A++ ++ SG++ N A LE A+ ++ AN GA L +T +
Sbjct: 58 RANLSRANLSHANLSGARLECVSLSRANLNQADLEGAILFQSNLSQANLIGASLPETDLQ 117
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
L +ANLT A L T+ RS L A++ DF +A++
Sbjct: 118 VATLFQANLTGACLRGTIFWRSHLMRAVLRRVDFHEAILQ 157
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 21/116 (18%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMR----------ESDFSGSKFNGAYLEKAVA 157
S A A+L +A+ + +F R N A ++ ++D SG+ F GA L+ AV
Sbjct: 182 SEANLRGANLDQALVKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
AN TGA+ ++R V ANLT ++L GA ++ A F + I+
Sbjct: 242 RGANLTGANFEGANLERAVFRGANLTG----------TNLKGASLQWAVFKEVNIE 287
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 21/103 (20%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A+ + N +AN A + E+D L+ A ++AN TGA L
Sbjct: 82 SRANLNQADLEGAILFQSNLSQANLIGASLPETD----------LQVATLFQANLTGACL 131
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T+ R + L+R VL R D AI++ F A
Sbjct: 132 RGTIFWR----------SHLMRAVLRRVDFHEAILQETSFRQA 164
>gi|209526072|ref|ZP_03274604.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067543|ref|ZP_17056333.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493460|gb|EDZ93783.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711117|gb|EKD06319.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 519
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 59/107 (55%), Gaps = 14/107 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
+ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 34 QANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 93
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+ANL +A L+R L R++L AI+ GA+ ++A DL ++A ++A+
Sbjct: 94 QANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 136
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 201 IIEGADFSDAVIDLA 215
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 64/129 (49%), Gaps = 13/129 (10%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
TR + + S A A+L +AV +V RA+ + A++ ++ ++ A L +A+
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 117
Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 207
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177
Query: 208 SDAVIDLAQ 216
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN T AD+RE+ D + +GA L +A +N ++L+ + R L NL N
Sbjct: 120 ANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNLRN 179
Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
A L + L +DL GA + GA+
Sbjct: 180 AELRQAELNGADLRGANLSGANL 202
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 1/91 (1%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGA 235
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260
>gi|307152584|ref|YP_003887968.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982812|gb|ADN14693.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 333
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 85/171 (49%), Gaps = 13/171 (7%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANF 132
L+ A++ ++ L D N A+ RG G + A A++R+ K+NF N
Sbjct: 92 LSGAILQETDLTLAMLLDANLIGADLRGSDLSGANLTGACLRGANMRQE---KKNFNTNL 148
Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
+A++ ++D G+ G L +A N +GA+L + + L +A+L+ A L T+L
Sbjct: 149 QAANLFKADLQGANMKGVDLARA-----NLSGANLKEANLRDADLRKADLSKANLTGTIL 203
Query: 193 TRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
+ ++L GA + GAD ++A +L + + + A G N + T +L N
Sbjct: 204 SEANLVGANLTGADLNNA--NLVRAKMMQAEAGGANFKGAIMTHINLNATN 252
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF+ A T ++ ++ SG+ + L A +AN +GA L + + + +ANLT A
Sbjct: 237 NFKGAIMTHINLNATNLSGANLSFTRLNHADLTRANLSGAYLKEAELIEVFFAKANLTGA 296
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVI 212
L T LTRSDL A + + S+A++
Sbjct: 297 DLSNTNLTRSDLMSANLSRVNLSEAIM 323
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 10/82 (12%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN A++ SD SG+ + +A+ TGADL ++ ++L+ A L L
Sbjct: 54 RANLAQANLVASDLSGANLS----------QADLTGADLRSAMLHGIILSGAILQETDLT 103
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
+L ++L GA + G+D S A
Sbjct: 104 LAMLLDANLIGADLRGSDLSGA 125
>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 305
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 26/137 (18%)
Query: 90 ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK- 146
A+ DL NKY+A R S + DLR + NF+ A+F+ A++RE DFSG+
Sbjct: 6 AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61
Query: 147 ----FN---------------GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
FN G+YL KA K N A+LS + L+++NLTNA L
Sbjct: 62 SEAFFNEADLTGANLQEANLQGSYLMKAYLMKTNLQSANLSKAYLTGAYLSKSNLTNANL 121
Query: 188 VRTVLTRSDLGGAIIEG 204
L S L GA + G
Sbjct: 122 TGAYLNGSKLNGADLTG 138
>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 390
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 64/122 (52%), Gaps = 21/122 (17%)
Query: 110 AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL- 167
+A ADL +A+ +K NF +A+ +SA++ +S+ + F AYL KAN + ADL
Sbjct: 111 SAHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYL-----IKANLSEADLF 165
Query: 168 -----SDTLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
S L D + + ANL A L LT+++LG A + GA+ +DA ++
Sbjct: 166 QADLSSANLKDVNLSAANLTECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLN 225
Query: 214 LA 215
LA
Sbjct: 226 LA 227
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 26/160 (16%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE---------------- 153
A F ADL A N + NF+ A++ ++ SGS NGA L+
Sbjct: 57 ADFSEADLSGAHLSLANLSKVNFSGANLTGANLSGSSLNGANLQGATLSAVNLESAHLNW 116
Query: 154 ----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+A+ K NF ADLS + + L AN A L++ L+ +DL A + A+ D
Sbjct: 117 ADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLSSANLKD 176
Query: 210 AVIDLAQKQALCKY--AN--GTNPITGVSTRKSLGCGNSR 245
+ A CK AN G N T+ +LG N R
Sbjct: 177 VNLSAANLTE-CKMTRANLMGANLTEADLTKANLGRANLR 215
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 66/153 (43%), Gaps = 23/153 (15%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-------------- 121
L+AA + C + L N EA+ + A G A+LR A
Sbjct: 179 LSAANLTECKMTRANLMGANLTEADL-------TKANLGRANLRGANLTDAYLNLASLVE 231
Query: 122 --VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
+H RAN + A++ ++ NGA+L K A+ G DLS L+ + L
Sbjct: 232 ADLHQANLTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSHKLLTGINLAG 291
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L+ A LV +L ++L A + GA+ +A +
Sbjct: 292 AYLSEATLVGALLMEANLSAANLSGANLQNACL 324
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 6/102 (5%)
Query: 115 SADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
SA+ +A +K N F+A+ +SA++++ + S + + +A AN T ADL+
Sbjct: 146 SANFVRAYLIKANLSEADLFQADLSSANLKDVNLSAANLTECKMTRANLMGANLTEADLT 205
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R L ANLT+A L L +DL A + A+ S A
Sbjct: 206 KANLGRANLRGANLTDAYLNLASLVEADLHQANLTRANLSRA 247
>gi|441147419|ref|ZP_20964505.1| OxyO [Streptomyces rimosus subsp. rimosus ATCC 10970]
gi|440620240|gb|ELQ83273.1| OxyO [Streptomyces rimosus subsp. rimosus ATCC 10970]
Length = 345
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 6/86 (6%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADLR+A + N R AN AD+R +D G G L AV Y+A G
Sbjct: 223 ADLREADLREATPARANLRDADLSDANVRKADLRFADLRGVDLWGTDLRGAVLYRAKLAG 282
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRT 190
+LS+ +D L A+LT+A + R+
Sbjct: 283 LELSEAHLDGADLRGADLTDAAVARS 308
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 1/93 (1%)
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
A H + R A D+R++ SG+ GA L +A A+ ADL + R L +
Sbjct: 183 ADHKRAQLRGAILRDCDLRDARLSGADLRGARLARADLADADLREADLREATPARANLRD 242
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+L++A + + L +DL G + G D AV+
Sbjct: 243 ADLSDANVRKADLRFADLRGVDLWGTDLRGAVL 275
>gi|217423045|ref|ZP_03454547.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
gi|217393953|gb|EEC33973.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
Length = 825
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
Length = 1263
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 56/105 (53%)
Query: 114 GSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
GS + +H + + + ++ ++ +D S S A L +++ Y+AN + A+L + M+
Sbjct: 487 GSKLEKSNLHKSKLSKVDLSNCNLTLTDMSSSDLQKADLSRSLFYRANLSSANLKSSNMN 546
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
L+ NL++A L R L S L GA +EGADFS + A Q
Sbjct: 547 GADLSHCNLSSACLERASLYGSKLEGANLEGADFSHCDLSFAMLQ 591
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 9/99 (9%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
+F + F DLR F ++D R DFSGSK +G L KA +N +
Sbjct: 1020 KFAGATGLNFKDVDLRSC---------KFANSDFRGQDFSGSKLSGVQLSKANLTGSNLS 1070
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
DL+ + M + L ANL AVL + L+++ L GA++
Sbjct: 1071 SCDLTGSDMSKCHLERANLLGAVLKGSDLSQARLKGAVL 1109
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 22/115 (19%)
Query: 120 KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
+++ K+ + + AD+ DF+G+ F+G+ L +A ++ G DLS
Sbjct: 926 RSLKGKDLRNSKLSEADLSHQDFAGADFSGSKLSRANLRQSKLDGCDLS----------- 974
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-------CKYANGT 227
N L R++L + L GA+I G DFS+A ++ A A CK+A T
Sbjct: 975 ----NCDLSRSILEGASLQGAVIRGTDFSNAKLEGAALPAWVEVDFECCKFAGAT 1025
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 6/107 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANF 162
S++ ADL +++ + N AN S++M +D S + A LE+A Y AN
Sbjct: 516 SSSDLQKADLSRSLFYRANLSSANLKSSNMNGADLSHCNLSSACLERASLYGSKLEGANL 575
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
GAD S + +L NL A LT +D G+ +EGA D
Sbjct: 576 EGADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPD 622
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNA 185
N + D+ +++ G+K GA L + + N + A L ++M R LN+ + +A
Sbjct: 274 NLSYNDLSDANLEGAKLEGADLSYSNLSQCNLSQASCSRIMLQFSVMTRARLNDGDFGSA 333
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVID 213
L LT S L + EGADF D+V+D
Sbjct: 334 NLSECDLTHSQLSSSCFEGADFRDSVLD 361
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 51/123 (41%), Gaps = 21/123 (17%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY----------- 158
A F DL A+ N R ANFT A + +DFSGS GA + Y
Sbjct: 578 ADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGYDLQGVCLSGTS 637
Query: 159 ---------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+AN ADL + + L +A+L+ A L L +DL G + G + S
Sbjct: 638 GFFKDKSARRANLCDADLRGQELSGVNLQQADLSFADLTGANLQGADLTGTKLNGTNLSQ 697
Query: 210 AVI 212
+ +
Sbjct: 698 SRL 700
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L +A N +N +S D+ ++ SG+ GA L A + + + L
Sbjct: 191 SRADLSEAKLCRADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKLFNCDLSRTSL 250
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
D + + +L +A L A L L+ +DL A +EGA A
Sbjct: 251 MDVNLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLEGA 293
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 15/82 (18%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF AD+R++DFS + G L A +AN AD + E NLT L
Sbjct: 826 NFVGADLRKADFSQAVLKGHDLSAADLSQANLRNADFT----------ECNLTGCNL--- 872
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
T+S+L G +GA S A+I
Sbjct: 873 --TQSNLSGCNFDGAILSGAII 892
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 1/90 (1%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQF +A+L ++ R+NF+ +++ DFSG+ N + E+A KA F G ++
Sbjct: 386 SDAQFVNANLSNVKLNAARVLRSNFSESNLTACDFSGAVMNDSNFERANLTKARFVGCEM 445
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ A ++ + LT DL
Sbjct: 446 RNASFQHATFASATFSDVKMEGVDLTGCDL 475
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 46/90 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+ T A++ ES+ S + L A A+ +GA L + + R L + NL+ A+L
Sbjct: 202 RADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKLFNCDLSRTSLMDVNLSKAMLQ 261
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
+ L + L G + D SDA ++ A+ +
Sbjct: 262 QARLQGAQLQGCNLSYNDLSDANLEGAKLE 291
Score = 37.0 bits (84), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 16/101 (15%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+ A LR A +F + NF+ D+ +++ S S + A L +A +A+ T
Sbjct: 148 ARLDRATLRMATLRGSSFVSSSCAQTNFSRCDLSDANLSMSTLSRADLSEAKLCRADLTH 207
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
A+L+ E+NL++ L T+L+ ++LGGA + GA
Sbjct: 208 ANLT----------ESNLSSCDLSDTILSGANLGGADLSGA 238
>gi|167905147|ref|ZP_02492352.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
gi|237508538|ref|ZP_04521253.1| pentapeptide repeat family protein [Burkholderia pseudomallei
MSHR346]
gi|235000743|gb|EEP50167.1| pentapeptide repeat family protein [Burkholderia pseudomallei
MSHR346]
Length = 825
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
Length = 567
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 52/106 (49%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 164
A A+L KAV V N R N + A++ ++ + F+GAYL +A +AN G
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLEGANLKK 477
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+LS M L A+L A L L R DL GA + G F DA
Sbjct: 478 ANLSGANMSHASLRGADLRRATLKDANLKRVDLVGANLAGVTFLDA 523
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 43/77 (55%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+ R ++F+ K AYL A ++AN G +L + L +A L ++L++ L ++
Sbjct: 354 NFRRANFAALKLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKA 413
Query: 196 DLGGAIIEGADFSDAVI 212
+L A +EGA+ + AV+
Sbjct: 414 NLYRASLEGANLTKAVL 430
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 6/111 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
+A + A LR A + N R A +A+++++ GS A L+KA Y+A+
Sbjct: 361 AALKLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKANLYRASL 420
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
GA+L+ ++ L NL+ A L T L ++ GA + A S A ++
Sbjct: 421 EGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLE 471
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 11/111 (9%)
Query: 110 AAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
AA F A LR+A + N +AN + A+M + G+ A L+ A + +
Sbjct: 452 AANFSGAYLREAKLSRANLEGANLKKANLSGANMSHASLRGADLRRATLKDANLKRVDLV 511
Query: 164 GADLSD-TLMDRMV----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
GA+L+ T +D + L ANL NA L+ L +L GA ++GA D
Sbjct: 512 GANLAGVTFLDADLQGANLKGANLKNANLLGANLENVNLQGANLQGAIMPD 562
>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 204
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 61/131 (46%), Gaps = 14/131 (10%)
Query: 96 KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR-----------ANFTSADMRESD 141
K A RG G+ A F +ADLR A+ + R A F + D+ D
Sbjct: 63 KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAIFNNLDLSGID 122
Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
F G+ G L KA ++A + A+LS + L EANL+ AVL T L S+L A
Sbjct: 123 FRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEANLSGAVLRGTNLQSSNLLCAS 182
Query: 202 IEGADFSDAVI 212
+E AD + ++
Sbjct: 183 VEQADLTGTLL 193
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 12/109 (11%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F A+L+KA ++ N R A+FT AD+R +DF + GA L A +A+F GA L+ +
Sbjct: 54 FAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAI 112
Query: 172 MDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + L+ NL+ A L R L+ ++L GA + AD +A
Sbjct: 113 FNNLDLSGIDFRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEA 161
>gi|354564725|ref|ZP_08983901.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549851|gb|EHC19290.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 564
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 11/106 (10%)
Query: 109 SAAQFGSADLR-----KAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S ADLR + N R A +AD+ + +G+K NGA L A+ A+
Sbjct: 386 SGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGAKLNGANLRSAILLGADL 445
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
G DL+D ++LNEA+L+ VL L+ +D+ AI+ G D S
Sbjct: 446 GGVDLTD-----VILNEADLSGVVLNEADLSGADISDAILFGTDLS 486
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 57/114 (50%), Gaps = 8/114 (7%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
GEF GS F A L A NF A N TSA + +++ +G F+ A L A AN
Sbjct: 227 GEFLQGS--NFSGAYLGDANLTGVNFSAANLTSAYLGDANLTGVNFSAANLNAANLGDAN 284
Query: 162 FTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+GA+LS T + L+ ANL A L R L+ +DL A + GAD S A
Sbjct: 285 LSGANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHA 338
Score = 43.5 bits (101), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 47/81 (58%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++R +D S + +GA L A Y+A+ + ADLS + L+ ANL++A L
Sbjct: 288 ANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHANLSSANLRD 347
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ S L AI+ A+ SDA
Sbjct: 348 AELSSSYLSHAILFAANLSDA 368
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 25/108 (23%)
Query: 130 ANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF-----TGADLSDTLMDRMVLNE 179
AN + AD+ +D SG+ N G+ L + + N ADLS ++ LN
Sbjct: 373 ANLSYADLCRADLSGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGAKLNG 432
Query: 180 ANLTNAVLV----------RTVLTRSDLGGAIIE-----GADFSDAVI 212
ANL +A+L+ +L +DL G ++ GAD SDA++
Sbjct: 433 ANLRSAILLGADLGGVDLTDVILNEADLSGVVLNEADLSGADISDAIL 480
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 53/109 (48%), Gaps = 14/109 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADL +RA+ + AD+ ++ SG+ + A L A A + + LS
Sbjct: 306 SGANLAGADL---------YRADLSHADLSSANLSGADLSHANLSSANLRDAELSSSYLS 356
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 212
++ L++ANL +A L L R+DL G A + G++ SD ++
Sbjct: 357 HAILFAANLSDANLNSANLSYADLCRADLSGTNLNHADLRGSNLSDTIL 405
>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 403
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 11/117 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
A+ RG F S A ADLR+A + AN + AD+ E++ SG+ GA L A+ +
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAFLSE----ANLSGADLSEANLSGADLRGAILSGAILW 296
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 210
AN GA LS + +L+ ANL A L L+ ++L GAI+ AD +A
Sbjct: 297 GANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 353
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
++KA +K +++ D SG+ GA L A +AN GADL R L
Sbjct: 211 IKKAELIKAIREGTIDKTTLQQVDLSGAILRGADLRGAFLSEANLKGADLR-----RAFL 265
Query: 178 NEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDLA 215
+EANL+ A L L+ +DL GAI+ GA+ A + LA
Sbjct: 266 SEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSLA 308
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 10/78 (12%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
AD+R + S + GA L +A +AN +GADLS EANL+ A L +L+
Sbjct: 243 ADLRGAFLSEANLKGADLRRAFLSEANLSGADLS----------EANLSGADLRGAILSG 292
Query: 195 SDLGGAIIEGADFSDAVI 212
+ L GA ++GA S A +
Sbjct: 293 AILWGANLKGAGLSLAFL 310
>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 405
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 11/117 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
A+ RG F S A ADLR+A + AN + AD+ E++ SG+ GA L A+ +
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAFLSE----ANLSGADLSEANLSGADLRGAILSGAILW 298
Query: 159 KANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 210
AN GA LS + +L+ ANL A L L+ ++L GAI+ AD +A
Sbjct: 299 GANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 355
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
++KA +K +++ D SG+ GA L A +AN GADL R L
Sbjct: 213 IKKAELIKAIREGTIDKTTLQQVDLSGAILRGADLRGAFLSEANLKGADLR-----RAFL 267
Query: 178 NEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDLA 215
+EANL+ A L L+ +DL GAI+ GA+ A + LA
Sbjct: 268 SEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSLA 310
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 10/78 (12%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
AD+R + S + GA L +A +AN +GADLS EANL+ A L +L+
Sbjct: 245 ADLRGAFLSEANLKGADLRRAFLSEANLSGADLS----------EANLSGADLRGAILSG 294
Query: 195 SDLGGAIIEGADFSDAVI 212
+ L GA ++GA S A +
Sbjct: 295 AILWGANLKGAGLSLAFL 312
>gi|152980852|ref|YP_001353914.1| pentapeptide repeat-containing protein [Janthinobacterium sp.
Marseille]
gi|151280929|gb|ABR89339.1| Uncharacterized conserved protein, pentapeptide repeat family
[Janthinobacterium sp. Marseille]
Length = 243
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 6/116 (5%)
Query: 96 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK 154
+++ E AA A+LR A N R AN AD+R++D SG+ A L
Sbjct: 16 EHDIEDNTMLATVKAALAAGANLRDADLSGANLRGANLRDADLRDADLSGANLRDADLSG 75
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A A+ +GA+LSD L+ ANL+ A L L ++LGGA + GAD S A
Sbjct: 76 ANLRDADLSGANLSDA-----DLSGANLSGADLSGANLGGANLGGANLSGADLSGA 126
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 6/105 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 164
A A+LR A + R A+ + A++R++D SG+ +GA L A AN +G
Sbjct: 41 ADLSGANLRGANLRDADLRDADLSGANLRDADLSGANLRDADLSGANLSDADLSGANLSG 100
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
ADLS + L ANL+ A L L+ ++L GA + GA+ D
Sbjct: 101 ADLSGANLGGANLGGANLSGADLSGANLSGANLRGANLSGANLRD 145
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 6/101 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+LR A N R AN + AD+ ++ SG+ +GA L A AN +G
Sbjct: 61 ADLSGANLRDADLSGANLRDADLSGANLSDADLSGANLSGADLSGANLGGANLGGANLSG 120
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
ADLS + L ANL+ A L + D+ A+ E A
Sbjct: 121 ADLSGANLSGANLRGANLSGANLRDYPVKIKDIHKAVYEAA 161
>gi|126442493|ref|YP_001061349.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
668]
gi|126221984|gb|ABN85489.1| pentapeptide repeat protein [Burkholderia pseudomallei 668]
Length = 825
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|113474166|ref|YP_720227.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110165214|gb|ABG49754.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 1033
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 54/184 (29%), Positives = 84/184 (45%), Gaps = 18/184 (9%)
Query: 56 NQCAGPYAKLKNWRVFVSTA------LAAAVVASCSSNISALADLNKYEAETRGEFGIGS 109
+QC G A + F+S A L+ A + + + L+ A+ G + IG+
Sbjct: 816 SQCLGVGAFWETVGQFLSGADLRYADLSGAYLIVANLRYADLSGAYLISADLSGAYLIGA 875
Query: 110 ---AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
A ADLR A + AN + A + ++ S +K +GA L A A+ +GAD
Sbjct: 876 NLIGADLSRADLRYA----DLSGANLSDAKLSGANLSDAKLSGAGLSGADLRYADLSGAD 931
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALC 221
LS + L+ ANL+ A L L +DL GA + GAD SDA + + +
Sbjct: 932 LSRAKLSDAGLSGANLSVAGLSGADLRYADLSGADLRYADLSGADLSDANLSNVRWNSQT 991
Query: 222 KYAN 225
K++N
Sbjct: 992 KWSN 995
>gi|326506328|dbj|BAJ86482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 181
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 61/116 (52%), Gaps = 15/116 (12%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---D 173
D +K++F+ + +R+++F G+ GA + A+ TGADLSD + D
Sbjct: 75 DFSGQTLIKQDFKTSI----LRQTNFKGANLLGASF-----FDADLTGADLSDADLRNAD 125
Query: 174 RMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
+ N + NLTNA L ++T + G+ I GADF+D + Q+ LCK A+G
Sbjct: 126 FSLANVTKVNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIADG 181
>gi|163760882|ref|ZP_02167961.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
gi|162281926|gb|EDQ32218.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
Length = 239
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 4/101 (3%)
Query: 115 SADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
S DLR++ ++ AN A + + +GSK GA ++ AY+A+F+ D +
Sbjct: 71 STDLRESNLIE----ANLEKATLFRASLAGSKATGARFDRIEAYRADFSNLDATGASFGS 126
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+ A L N++L T T++DLG A +GAD S + LA
Sbjct: 127 AEMQRAKLNNSMLANTDFTKADLGRAQFDGADISGSRFSLA 167
>gi|334120837|ref|ZP_08494914.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333455836|gb|EGK84476.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 197
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 59/111 (53%), Gaps = 7/111 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF----- 162
S A A+L++AV ++ N R A+ + AD+R +DF + GA A+ A+F
Sbjct: 42 SGANLAGANLQRAV-LRANLRGADLSGADLRGADFRNADLRGASFANALVRDASFGGAFL 100
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
TGA + + + + L A+L A L R +L +DL A + GAD S A ++
Sbjct: 101 TGASIGNLDLSGVDLRGADLRGAALARAILHSADLSNANLSGADLSGADLE 151
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 54/114 (47%), Gaps = 21/114 (18%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 159
A F +ADLR A R A+F A D+ D G+ GA L +A+ +
Sbjct: 73 ADFRNADLRGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHS 132
Query: 160 ANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
A+ + GADL + +++ VL ANLT A L+ ++ ++ GA+++
Sbjct: 133 ADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCAMIEQTLWDGALLD 186
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 45/84 (53%), Gaps = 9/84 (10%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGADLSDT 170
ADLR A RA SAD+ ++ SG+ +GA LE+ AV AN TGA+L
Sbjct: 118 ADLRGAALA----RAILHSADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCA 173
Query: 171 LMDRMVLNEANLTNAVLVRTVLTR 194
++++ + + A L A L T L+R
Sbjct: 174 MIEQTLWDGALLDRACLQGTPLSR 197
>gi|443475216|ref|ZP_21065173.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443020003|gb|ELS34017.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 352
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A A + ++DF+G NGA + A + GADL+D + R L+ ANL A++VR
Sbjct: 243 AKLERAILIDADFNGVTLNGAIMADIKASRVQMQGADLTDAKLSRADLSRANLKGAIMVR 302
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L + L + AD +DA+++ A+
Sbjct: 303 ANLIEAYLARTNLADADLTDAILNRAE 329
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 60/120 (50%), Gaps = 7/120 (5%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF-----NGAYLE 153
A R E +G + + RK + AN + AD+R ++ SG+ GA L+
Sbjct: 144 ANLRQERAVGDRDEIDVS--RKKRSIASLIGANLSGADLRGANLSGADLYKADLRGANLQ 201
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+A AN + A L++ + + L EANL++A LV VL + L AI+ ADF+ ++
Sbjct: 202 EATLSGANLSEAKLNNAYLQGVFLTEANLSSASLVGAVLNNAKLERAILIDADFNGVTLN 261
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 69/137 (50%), Gaps = 10/137 (7%)
Query: 88 ISALADLNKYEAETR----GEFGIGSAAQFGSADLR--KAVHVKENF---RANFTSADMR 138
++ L D N +A+ R G +G A G A+LR +AV ++ R + A +
Sbjct: 113 LANLMDANLIDADMRTINLGGANLGGACMRG-ANLRQERAVGDRDEIDVSRKKRSIASLI 171
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
++ SG+ GA L A YKA+ GA+L + + L+EA L NA L LT ++L
Sbjct: 172 GANLSGADLRGANLSGADLYKADLRGANLQEATLSGANLSEAKLNNAYLQGVFLTEANLS 231
Query: 199 GAIIEGADFSDAVIDLA 215
A + GA ++A ++ A
Sbjct: 232 SASLVGAVLNNAKLERA 248
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R AD+ ++ S + + A L+ A+ +AN A L+ R L +A+LT+A+L
Sbjct: 272 RVQMQGADLTDAKLSRADLSRANLKGAIMVRANLIEAYLA-----RTNLADADLTDAILN 326
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
R L+ ++L GAI++GA D +
Sbjct: 327 RAELSSANLVGAILKGATLPDGKV 350
>gi|418744036|ref|ZP_13300395.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
CBC379]
gi|418751631|ref|ZP_13307915.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
MOR084]
gi|409968104|gb|EKO35917.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
MOR084]
gi|410795431|gb|EKR93328.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
CBC379]
Length = 263
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ +S + + +F G F+GA L A ++F GA+ S + LN ANL N
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNANLRNTNFRGA 210
Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
L + L GA +EGADF+DA+ D L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|334117106|ref|ZP_08491198.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461926|gb|EGK90531.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 520
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 61/121 (50%), Gaps = 2/121 (1%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY 151
L KY A R GI + A +L A N AN + A++ +++ G+K N A
Sbjct: 7 LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIAR 66
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
L A A+ T ADL+ + R+ L +A L A L+R L R++L GA + GA+ S A
Sbjct: 67 LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126
Query: 212 I 212
+
Sbjct: 127 L 127
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 1/95 (1%)
Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DL+KA+ + RA A++ ++ SG+ +GA L +A AN A+L +
Sbjct: 91 DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L EANL A L L+R+DL GA + G + A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L A + R AN A++R + SG+ A LE+A A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 210
S + L +ANLT AVL L+ +L AI+ G AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 4/123 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADLR E +AN T A + +D SG A L A+ + A LS
Sbjct: 168 SRADLSGADLRGT----ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKLS 223
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+ R L ANL NA LV LT + L A GAD + A + A+ A+ + T
Sbjct: 224 GADLSRADLCHANLLNASLVHADLTNAYLIRADWIGADLTGATLTGAKLHAVSRLGIKTE 283
Query: 229 PIT 231
+T
Sbjct: 284 GMT 286
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A G ADL A ++V A D++++ G+K A L +A AN +GA+L
Sbjct: 68 SGAHLGGADLTDADLNV-----AYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122
Query: 168 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + L +ANL A L LT ++L A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 15/85 (17%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF ++ E++ SG +GA L+ A AN +GA+LS T + LN A L+
Sbjct: 16 NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIARLS------- 68
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
GA + GAD +DA +++A
Sbjct: 69 --------GAHLGGADLTDADLNVA 85
>gi|300866933|ref|ZP_07111605.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335037|emb|CBN56767.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 253
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 9/118 (7%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
N+ + E E G S A+LR A N AD+R ++ G+ GA L
Sbjct: 26 NRRDVEKLKETGQCSRCDLRDANLRNA---------NLQGADLRNANLRGANLRGAALRN 76
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A A+ GADL D + R L ANL++A L L R+++ G +G D A +
Sbjct: 77 ADLSNADLRGADLRDADLSRSNLRNANLSDANLRNADLERAEVRGVNFQGTDLRGANV 134
>gi|226365701|ref|YP_002783484.1| hypothetical protein ROP_62920 [Rhodococcus opacus B4]
gi|226244191|dbj|BAH54539.1| hypothetical protein [Rhodococcus opacus B4]
Length = 201
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 61/131 (46%), Gaps = 16/131 (12%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
+E R E I + F ADL ++ HV FR+ +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
V + +FT GADL EANL L R VL +DL GGA +
Sbjct: 98 RPMVFDECDFTLVSLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKL 157
Query: 203 EGADFSDAVID 213
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|73621284|gb|AAZ78338.1| OxyO [Streptomyces rimosus]
Length = 353
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 6/86 (6%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADLR+A + N R AN AD+R +D G G L AV Y+A G
Sbjct: 231 ADLREADLREATPARANLRDADLSDANVRKADLRFADLRGVDLWGTDLRGAVLYRAKLAG 290
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRT 190
+LS+ +D L A+LT+A + R+
Sbjct: 291 LELSEAHLDGADLRGADLTDAAVARS 316
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 1/93 (1%)
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
A H + R A D+R++ SG+ GA L +A A+ ADL + R L +
Sbjct: 191 ADHKRAQLRGAILRDCDLRDARLSGADLRGARLARADLADADLREADLREATPARANLRD 250
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+L++A + + L +DL G + G D AV+
Sbjct: 251 ADLSDANVRKADLRFADLRGVDLWGTDLRGAVL 283
>gi|162453209|ref|YP_001615576.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
gi|161163791|emb|CAN95096.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
Length = 890
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ T AD R D G +F A+LE A A+ +GA L ++ + L+ ANLT A L
Sbjct: 575 DLTGADFRGVDLRGMRFARAFLEGADLRGADLSGAVLEGAVLAKADLSGANLTGARLRGA 634
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L +++L GA+ + AD ++AV+
Sbjct: 635 NLGKANLEGAVFDDADLTEAVL 656
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 50/107 (46%), Gaps = 9/107 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ A F DLR F A + +D G+ +GA LE AV KA+ +GA+L+
Sbjct: 577 TGADFRGVDLRGM---------RFARAFLEGADLRGADLSGAVLEGAVLAKADLSGANLT 627
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+ L +ANL AV LT + L GA + GA A ++ A
Sbjct: 628 GARLRGANLGKANLEGAVFDDADLTEAVLMGARLAGASLKRAKLERA 674
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 38/84 (45%), Gaps = 5/84 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A A + ++D SG+ GA L A KAN GA D + VL A L A L R
Sbjct: 609 AVLEGAVLAKADLSGANLTGARLRGANLGKANLEGAVFDDADLTEAVLMGARLAGASLKR 668
Query: 190 TVLTRSD-----LGGAIIEGADFS 208
L R+D GG + GAD S
Sbjct: 669 AKLERADALQVSWGGVDLSGADLS 692
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 17/123 (13%)
Query: 108 GSAAQFGSADLRK--AVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G+ A+F A + AVH A+F A + ++ F + GA ++A + + A
Sbjct: 741 GAKARFAGARFSEGVAVHKSGLPEADFRDAVLDKTCFRTTDLRGARFDRAQMTMTDLSEA 800
Query: 166 DLSDTLMDRMVLNEA---------------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
D +D DR V+ A NLT A+L ++ L +D GA + ADFS A
Sbjct: 801 DATDATFDRAVMKNALLIRTNLDRASLRGCNLTEAILSKSRLAGADFTGAQLCRADFSRA 860
Query: 211 VID 213
D
Sbjct: 861 RGD 863
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 8/96 (8%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTL-MDRMVLNEANLT 183
AN A + E G+ F+GA L K KA F GA S+ + + + L EA+
Sbjct: 709 ANLERAMLLECSLDGTDFSGARLHKTSLMSCTGAKARFAGARFSEGVAVHKSGLPEADFR 768
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
+AVL +T +DL GA + A + + DL++ A
Sbjct: 769 DAVLDKTCFRTTDLRGARFDRAQMT--MTDLSEADA 802
>gi|428311554|ref|YP_007122531.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253166|gb|AFZ19125.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 411
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 70/137 (51%), Gaps = 18/137 (13%)
Query: 77 AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSAD 136
A +VAS S+ + L D N ++A G A+ G A+LR A + AN + A+
Sbjct: 65 ADLIVASLSA--ADLRDANLHDANLIG-------AKLGVANLRDA----DLSGANLSGAE 111
Query: 137 MRESDFSGSKFNGAY-----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+ +D + S NGAY L KA +AN GA+LS T M L+ ANL A L
Sbjct: 112 LSCTDLTCSNLNGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGAN 171
Query: 192 LTRSDLGGAIIEGADFS 208
L +DLGGA ++GA S
Sbjct: 172 LIEADLGGANLQGAKLS 188
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 1/102 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A A+L KA + N + AN + +M +D SG+ GA L A +A+ GA+L
Sbjct: 123 NGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGANLIEADLGGANL 182
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ R L NL N+ L L+ S+L G + AD +
Sbjct: 183 QGAKLSRSNLAYVNLANSDLSNADLSDSNLAGTNLTNADLDN 224
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 66/154 (42%), Gaps = 18/154 (11%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNG 149
L +Y A R G A ADLR + + + + AN + D+ ++D + G
Sbjct: 7 LERYAAGERCLRG----ADLHGADLRGVDLRGIDLSD---ANLSDTDLSDADLRDADLIG 59
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A L A A+ + ADL D L++ANL A L L +DL GA + GA+ S
Sbjct: 60 ANLRGADLIVASLSAADLRDA-----NLHDANLIGAKLGVANLRDADLSGANLSGAELS- 113
Query: 210 AVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 243
DL Y +G N I +R +L N
Sbjct: 114 -CTDLTCSNLNGAYISGANLIKAKLSRANLQGAN 146
>gi|428319993|ref|YP_007117875.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428243673|gb|AFZ09459.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 146
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 46/83 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ AD+RE++ G+ + A LE A AN TGA+L+ + LN+A+L A L
Sbjct: 51 AHLIGADLREANLQGANLSSANLEGADLTGANLTGANLTQVFLTNASLNDADLDRANLTA 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
++ +D+ GA ++ +DA I
Sbjct: 111 AIINTADVSGASMQDMTITDAKI 133
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 4/102 (3%)
Query: 114 GSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
G A HVK+ ++ + D SG+ GA+L A +AN GA+LS ++
Sbjct: 19 GPARAENPAHVKQLL----STGQCFQCDLSGADLIGAHLIGADLREANLQGANLSSANLE 74
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
L ANLT A L + LT + L A ++ A+ + A+I+ A
Sbjct: 75 GADLTGANLTGANLTQVFLTNASLNDADLDRANLTAAIINTA 116
Score = 40.4 bits (93), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 47/95 (49%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
A +++ + + F+ + + AD+ + G+ A L+ A AN GADL+ +
Sbjct: 27 AHVKQLLSTGQCFQCDLSGADLIGAHLIGADLREANLQGANLSSANLEGADLTGANLTGA 86
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L + LTNA L L R++L AII AD S A
Sbjct: 87 NLTQVFLTNASLNDADLDRANLTAAIINTADVSGA 121
>gi|428211266|ref|YP_007084410.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999647|gb|AFY80490.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 279
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 68/129 (52%), Gaps = 14/129 (10%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
A+LR+A + AN + A++ E+D S + + A +E+A +A A S+T +
Sbjct: 70 ANLREANLIN----ANLSKANLSEADLSLANISRAIVERANLERAKLVQALASETRLGWA 125
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP------ 229
L EA + A L R L+ +DL GA +EGA+ + A++ QA+ + N TN
Sbjct: 126 NLKEATMNQANLSRANLSEADLTGANLEGANLTIAIL----IQAIMEKVNLTNATLNGAN 181
Query: 230 ITGVSTRKS 238
+TGV+ R S
Sbjct: 182 LTGVNLRDS 190
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 18/113 (15%)
Query: 107 IGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
+ S + G A+L++A + N RAN + AD+ ++ G+ A L +A+ K N T A
Sbjct: 116 LASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAILIQAIMEKVNLTNA 175
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
LN ANLT L SDL A + G++ + A DL + Q
Sbjct: 176 ----------TLNGANLTG-----VNLRDSDLSRANMSGSNLAGA--DLTKSQ 211
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 15/83 (18%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T + +R ++ S + G LE A Y+AN ++LS ANLTNA+L+
Sbjct: 205 ADLTKSQLRGTNVSWTTMRGTNLEGASLYRANLGWSNLSG----------ANLTNAILMD 254
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
T L R++L DF+ A++
Sbjct: 255 TNLYRTNL-----RDVDFTGAIM 272
>gi|16329465|ref|NP_440193.1| hypothetical protein slr0967 [Synechocystis sp. PCC 6803]
gi|383321206|ref|YP_005382059.1| hypothetical protein SYNGTI_0297 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324376|ref|YP_005385229.1| hypothetical protein SYNPCCP_0297 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490260|ref|YP_005407936.1| hypothetical protein SYNPCCN_0297 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435526|ref|YP_005650250.1| hypothetical protein SYNGTS_0297 [Synechocystis sp. PCC 6803]
gi|451813624|ref|YP_007450076.1| hypothetical protein MYO_13000 [Synechocystis sp. PCC 6803]
gi|1651947|dbj|BAA16873.1| slr0967 [Synechocystis sp. PCC 6803]
gi|339272558|dbj|BAK49045.1| hypothetical protein SYNGTS_0297 [Synechocystis sp. PCC 6803]
gi|359270525|dbj|BAL28044.1| hypothetical protein SYNGTI_0297 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359273696|dbj|BAL31214.1| hypothetical protein SYNPCCN_0297 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359276866|dbj|BAL34383.1| hypothetical protein SYNPCCP_0297 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957344|dbj|BAM50584.1| hypothetical protein BEST7613_1653 [Synechocystis sp. PCC 6803]
gi|451779593|gb|AGF50562.1| hypothetical protein MYO_13000 [Synechocystis sp. PCC 6803]
Length = 150
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 43/83 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ AD+R ++ SG+ N A LE A AN ADL ++ LN ANLT+A
Sbjct: 56 AHLIGADLRNANLSGTNLNEANLEGADLTGANLQNADLRGAMVTNATLNRANLTSANFAF 115
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L D+ GA +EG + +A I
Sbjct: 116 AKLYDVDVTGATVEGMNIQNAEI 138
>gi|428307821|ref|YP_007144646.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428249356|gb|AFZ15136.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 263
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 23/107 (21%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKA----VHVKEN--FRANFTSADMRESD--------- 141
YEAE G A F ADL KA H+ E F AN + A+++++D
Sbjct: 157 YEAELIG-------AYFYKADLFKANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKA 209
Query: 142 -FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F+G+ GA L A KANFTGA+L+D +D + L++ANL A++
Sbjct: 210 NFTGANLVGANLRGANLSKANFTGANLTDANLDTVNLHKANLEGAIM 256
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFN----------GAYLEKAVAYK 159
A A+L+ A ++ N R A+F +A++ ++ S S N GAY KA +K
Sbjct: 114 ANLMGANLKGADLIEANMRGADFINANLMSANLSNSFLNYAKFYEAELIGAYFYKADLFK 173
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN + A L + + L++A L A L T L++++ GA + GA+ A
Sbjct: 174 ANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKANFTGANLVGANLRGA 224
>gi|30696344|ref|NP_851183.1| thylakoid lumenal protein [Arabidopsis thaliana]
gi|38503418|sp|P81760.2|TL17_ARATH RecName: Full=Thylakoid lumenal 17.4 kDa protein, chloroplastic;
AltName: Full=P17.4; Flags: Precursor
gi|13899115|gb|AAK48979.1|AF370552_1 thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|9759188|dbj|BAB09725.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|28059599|gb|AAO30073.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|332008985|gb|AED96368.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 236
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL AV TV
Sbjct: 131 LSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTV 190
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S A +E F D +I Q +C+ N R LGC
Sbjct: 191 LSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235
>gi|86608719|ref|YP_477481.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557261|gb|ABD02218.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 207
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 66/132 (50%), Gaps = 15/132 (11%)
Query: 111 AQFGSADLRKAVHVKENFRANFTS------ADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+L++A+ + N +A S AD+R ++ SGS GA+L +A +AN
Sbjct: 68 ADLSGANLKEAILRQANLQAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQ 127
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
DL+ L ANL+ A L R +LTR+DL GA + AD A DL + A + A
Sbjct: 128 TDLTGA-----NLQVANLSGADLRRAILTRADLTGAKLHNADLRGA--DL--RGAFLEGA 178
Query: 225 NGTNPITGVSTR 236
+ T + TR
Sbjct: 179 DLTGALYNAQTR 190
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 55/116 (47%), Gaps = 8/116 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYK 159
GI +AA F +L + + F S D + ++ G+ +GA L++A+ +
Sbjct: 26 LGIPTAAAFAQLELDAQLGRSQIV---FPSKDCPACNLTGAELPGADLSGANLKEAILRQ 82
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
AN ADLS +++ L ANL+ + L +DL A ++ D + A + +A
Sbjct: 83 ANLQAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQTDLTGANLQVA 138
>gi|332704952|ref|ZP_08425038.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
gi|332356304|gb|EGJ35758.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
Length = 544
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + AD +++ SG+ + A L +A +AN +GA+LSD + L ANL+NA
Sbjct: 266 ADLSGADFNDANLSGADLSSANLIRANLIRANLSGANLSDVKVIGGNLGNANLSNANFSS 325
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L R++L GA + GAD S+A
Sbjct: 326 AKLIRANLSGADLSGADLSNA 346
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 55/119 (46%), Gaps = 24/119 (20%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 163
S A F SA L RAN + AD+ +D S + F+GA L A AN +
Sbjct: 319 SNANFSSAKL---------IRANLSGADLSGADLSNANFSGASLYSANLSNANLSSANLR 369
Query: 164 ----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
GADL T + L+ ANL+NA L+ + L ++L GA + GA+ A +
Sbjct: 370 GTELSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASL 428
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 61/145 (42%), Gaps = 22/145 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A A+L A + N R AN + A++R + + +GA L A Y AN
Sbjct: 389 SGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASLYSANLSGANLRGASLYSANL 448
Query: 163 TGADLSD---------------TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+GA+LS T L+ ANL A L R L +DL A + GAD
Sbjct: 449 SGANLSGANLSLANLCPMRVSGTDFSAANLSGANLGGAYLYRADLKDTDLSSANLTGADL 508
Query: 208 SDAVIDLAQ-KQALCKYANGTNPIT 231
S A ++ A K A Y G + T
Sbjct: 509 SSANLNGADVKNARFGYIVGIDEST 533
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 53/108 (49%), Gaps = 11/108 (10%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRE----------SDFSGSKFNGAYLEKAVAYK 159
A ADL A ++ N RAN + A++ + ++ S + F+ A L +A
Sbjct: 276 ANLSGADLSSANLIRANLIRANLSGANLSDVKVIGGNLGNANLSNANFSSAKLIRANLSG 335
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A+ +GADLS+ L ANL+NA L L ++L GA + GAD
Sbjct: 336 ADLSGADLSNANFSGASLYSANLSNANLSSANLRGTELSGANLSGADL 383
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 11/111 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN- 161
S A SA+L A N R AN + AD+R + SG+ +GA L A +N
Sbjct: 349 SGASLYSANLSNANLSSANLRGTELSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNL 408
Query: 162 ----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+GA+LS + L ANL+ A L L ++L GA + GA+ S
Sbjct: 409 RGTELSGANLSGANLRGASLYSANLSGANLRGASLYSANLSGANLSGANLS 459
>gi|304414054|ref|ZP_07395422.1| pentapeptide repeat-containing protein [Candidatus Regiella
insecticola LSR1]
gi|304283268|gb|EFL91664.1| pentapeptide repeat-containing protein [Candidatus Regiella
insecticola LSR1]
Length = 283
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 57/128 (44%), Gaps = 23/128 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF----------------------SGS 145
S A +ADLR A N + A ADMRE D SG+
Sbjct: 122 SNATLSNADLRGAYMSWANLQNATLNDADMREVDLVGADMREAKLIGKKTNLEGANLSGA 181
Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
GA L + KA + ADLS ++R+ L EANL +A+L T L + L A +E
Sbjct: 182 DLRGAELCHTILIKAALSWADLSYAKLERVNLREANLYHAILEETSLYLTKLENANLESV 241
Query: 206 DFSDAVID 213
+ DAV++
Sbjct: 242 NLKDAVLE 249
>gi|30696347|ref|NP_200161.2| thylakoid lumenal protein [Arabidopsis thaliana]
gi|332008984|gb|AED96367.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 235
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL AV TV
Sbjct: 130 LSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTV 189
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S A +E F D +I Q +C+ N R LGC
Sbjct: 190 LSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 234
>gi|282898711|ref|ZP_06306699.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
CS-505]
gi|281196579|gb|EFA71488.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
CS-505]
Length = 682
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQ ADL A + + + + A++ ++++ G+ + +YL A ANF+ A+L
Sbjct: 529 SGAQLQEADLYAAQLARVSAIGSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANL 588
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
S +L AN+TN L ++R+DL GA +EG DF A++
Sbjct: 589 SGA-----ILRYANMTNTNLRSADISRADLRGANLEGTDFQGAIL 628
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 29/108 (26%)
Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM-- 175
F SA++ +S F GS+F A L KA ++N + A+LS LM R+
Sbjct: 424 FKSANLNQSSFKGSRFRSVGEDGRWDTYDDIIADLSKAQLKRSNLSNANLSRVLMSRVDL 483
Query: 176 ---VLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
VLN +ANL++A LV + L ++ L A++ GAD S A
Sbjct: 484 SRSVLNRANLASSKLIDANLSSAQLVGSDLQQATLQDAVLTGADISGA 531
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN S+ + +++ S ++ G+ L++A A TGAD+S + L A L +
Sbjct: 490 RANLASSKLIDANLSSAQLVGSDLQQATLQDAVLTGADISGAQLQEADLYAAQLARVSAI 549
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------ALCKYANGTN 228
+ L+ ++L +GAD S++ ++ A A+ +YAN TN
Sbjct: 550 GSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANLSGAILRYANMTN 600
>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 377
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT- 163
A A+L A+ VK + RAN T AD+RE+D SG++ A L KA KAN +
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSL 199
Query: 164 ----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L + ++ + EANL NA L ++ L ++L A + A+ S A
Sbjct: 200 ANLDSANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKA 250
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 15/102 (14%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM----------DRMVLN 178
R+N A++ E+D SG+ L A+ AN + DLS + + DR L
Sbjct: 84 RSNLVRANLYEADLSGASLVNINLSNAICASANLSHVDLSQSNLSSTNLSLANLDRADLT 143
Query: 179 EANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDAVIDLA 215
+ANL+ A+LV+ +L R++L A + AD S A + LA
Sbjct: 144 QANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLA 185
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 71/154 (46%), Gaps = 18/154 (11%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTS 134
L+AA++ S L N EA+ R E + S AQ A L KA K N AN S
Sbjct: 147 LSAAILVKASLKQVILNRANLTEADLR-EADL-SGAQLYLAVLSKANLAKANLSLANLDS 204
Query: 135 ADMRESDFSGSKFNGAYLE---------------KAVAYKANFTGADLSDTLMDRMVLNE 179
A++ E+ GS F A LE KA KAN + A+L+ ++ + L
Sbjct: 205 ANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKANLTSAILSQANLLG 264
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
ANL A L + L SD GA ++G + S A ++
Sbjct: 265 ANLAGASLAKANLAESDCFGANLQGTNLSQANVE 298
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 11/143 (7%)
Query: 73 STALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANF 132
ST L A + S + S L N YEA+ G A + +L A+ AN
Sbjct: 69 STDLVRANLRSARLDRSNLVRANLYEADLSG-------ASLVNINLSNAICAS----ANL 117
Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
+ D+ +S+ S + + A L++A +AN + A L + +++LN ANLT A L L
Sbjct: 118 SHVDLSQSNLSSTNLSLANLDRADLTQANLSAAILVKASLKQVILNRANLTEADLREADL 177
Query: 193 TRSDLGGAIIEGADFSDAVIDLA 215
+ + L A++ A+ + A + LA
Sbjct: 178 SGAQLYLAVLSKANLAKANLSLA 200
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 31/129 (24%), Positives = 65/129 (50%), Gaps = 12/129 (9%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
AQ +L++A + N + AN +S D+ ++ ++ + + L +A Y+A+ +G
Sbjct: 40 AQLAGINLKQANLFRANLQNAVLAIANLSSTDLVRANLRSARLDRSNLVRANLYEADLSG 99
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA------VIDLAQKQ 218
A L + + + ANL++ L ++ L+ ++L A ++ AD + A ++ + KQ
Sbjct: 100 ASLVNINLSNAICASANLSHVDLSQSNLSSTNLSLANLDRADLTQANLSAAILVKASLKQ 159
Query: 219 ALCKYANGT 227
+ AN T
Sbjct: 160 VILNRANLT 168
Score = 37.0 bits (84), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 14/105 (13%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFTGA 165
A A+LRKA K AN TSA + +++ G+ GA L KA + AN G
Sbjct: 235 ANLTKANLRKANLSK----ANLTSAILSQANLLGANLAGASLAKANLAESDCFGANLQGT 290
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+LS ++ + L E++L A LV ++L GA + GA+ DA
Sbjct: 291 NLSQANVEAVDLRESDLAKANLV-----GANLAGANLFGAELLDA 330
>gi|421082377|ref|ZP_15543263.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
gi|401702907|gb|EJS93144.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
Length = 846
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 43/160 (26%), Positives = 74/160 (46%), Gaps = 12/160 (7%)
Query: 78 AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADM 137
A++ SCS + A+ ++ T + S + AD +A + N R A +
Sbjct: 687 GALLDSCSW-VETQANEARFVGATWLTSAVASGSSMNGADFTQATLRQSNLR----QASL 741
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S L
Sbjct: 742 IGAVFARAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLMGALLQKSQL 801
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
GA GA+ A DL+Q + + T + G T++
Sbjct: 802 SGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834
Score = 40.4 bits (93), Expect = 0.90, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 36/84 (42%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F+ D+R +DFS + A L ANF A LS T + L N NA L
Sbjct: 538 ADFSGMDLRGADFSKALLECADLSNCKLDGANFHSAMLSRTELHNTSLCGCNFENASLAL 597
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
SD GA + +A+ D
Sbjct: 598 AQCCHSDFSGAHFKNTQLQEALFD 621
>gi|254299592|ref|ZP_04967041.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
gi|418542641|ref|ZP_13108060.1| type VI secretion system [Burkholderia pseudomallei 1258a]
gi|418549165|ref|ZP_13114243.1| type VI secretion system [Burkholderia pseudomallei 1258b]
gi|157809489|gb|EDO86659.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
gi|385355180|gb|EIF61399.1| type VI secretion system [Burkholderia pseudomallei 1258a]
gi|385356028|gb|EIF62174.1| type VI secretion system [Burkholderia pseudomallei 1258b]
Length = 825
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|427738633|ref|YP_007058177.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427373674|gb|AFY57630.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 436
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 60/115 (52%), Gaps = 7/115 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANFT+ ++ ++F+ + GA LE A AN T ADLS T + + L ANL N+ L
Sbjct: 259 ANFTNVNLEGANFTNANLEGANLENAKLNNANLTNADLSYTNLRKADLRCANLINSDLSN 318
Query: 190 TVLTRSDLGGAIIEGA-----DFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 239
+R++L AI+ GA +FSDA +L + Y +G N I R +L
Sbjct: 319 ADASRANLSDAIVNGANLIQSNFSDA--NLRGCNLIKTYLSGANLIRADLKRANL 371
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
A+ + D+ +++FS + GA ANFT A+L ++ LN ANLTNA L
Sbjct: 237 INADLSGIDLCDANFSDANLEGANFTNVNLEGANFTNANLEGANLENAKLNNANLTNADL 296
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
T L ++DL A + +D S+A
Sbjct: 297 SYTNLRKADLRCANLINSDLSNA 319
>gi|381207604|ref|ZP_09914675.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 255
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 16/111 (14%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL +A + N + A+ T D+ ++ G+ +GA L A AN GADL+D +
Sbjct: 96 ADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSE 155
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAI---------------IEGADFSDA 210
L+EANL+ A L L +DLG A+ ++GAD +DA
Sbjct: 156 ANLSEANLSEADLSGADLREADLGKAVLSQAKLVGANLHRIRLQGADLTDA 206
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 59/123 (47%), Gaps = 16/123 (13%)
Query: 102 RGE-FGIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEK 154
+GE FG+ ADL KAV + R AN AD++E++ G+ + L
Sbjct: 20 KGELFGV----DLSEADLPKAVLYSSDLREAKLSKANLAKADLQEANLVGAGLHRVDLNG 75
Query: 155 AVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A ++AN ADLS L+ D N EANL NA L L ++LGG + GA S
Sbjct: 76 ANLHQANLAQADLSGALLFFADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSG 135
Query: 210 AVI 212
A +
Sbjct: 136 AKL 138
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 51/110 (46%), Gaps = 11/110 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A+ A LR A V + AN + A++ E+D SG+ A L KAV +A
Sbjct: 129 SGAKLSGAKLRGANLVGADLTDADLSEANLSEANLSEADLSGADLREADLGKAVLSQAKL 188
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
GA+L R+ L A+LT+A L L DL AI E F A +
Sbjct: 189 VGANLH-----RIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKL 233
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 57/119 (47%), Gaps = 13/119 (10%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G DL A R AN AD+ ++D S + + A L +A+ +GADL +
Sbjct: 121 ANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSEANLSEANLS-----EADLSGADLRE 175
Query: 170 TLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVID--LAQKQALC 221
+ + VL++A L A L R LT +DL A + G D +A+ + L +K LC
Sbjct: 176 ADLGKAVLSQAKLVGANLHRIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKLC 234
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKA------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL++A +H + AN A++ ++D SG+ A L +A A +AN
Sbjct: 49 SKANLAKADLQEANLVGAGLHRVDLNGANLHQANLAQADLSGALLFFADLHEANAPEANL 108
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ADL++ + L ANL L L+ + L GA + GAD +DA
Sbjct: 109 KNADLTE-----VDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDA 151
>gi|218440553|ref|YP_002378882.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218173281|gb|ACK72014.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 320
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 14/110 (12%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A +A+L++A+ +F+ SA++ ++ G K NGA L +A KAN +G DL+
Sbjct: 100 ANLSNANLKQAILTNVDFK----SANLSGANLVGVKLNGANLSRADLSKANLSGIDLTGA 155
Query: 171 LMDRMVLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R+ L+ ANL A L R+ L DL GAI++G++ A
Sbjct: 156 NLSRVDLSRANLNGADLSGANLYKADLSRSNLRNGDLQGAILQGSNLHKA 205
>gi|254182800|ref|ZP_04889393.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
gi|184213334|gb|EDU10377.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
Length = 825
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|51246498|ref|YP_066382.1| hypothetical protein DP2646 [Desulfotalea psychrophila LSv54]
gi|50877535|emb|CAG37375.1| hypothetical protein DP2646 [Desulfotalea psychrophila LSv54]
Length = 446
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
+ K V E ++ D+ DFSG GA LE+A AN ADL+D + L
Sbjct: 337 IEKVVEAGECYQC-----DLAGLDFSGESLTGADLEQADLSGANLAEADLADANLRGANL 391
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
ANLT A L R L + DL GA + GA+ D +D
Sbjct: 392 RGANLTGADLRRADLYKGDLRGADLTGANLEDTQMD 427
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGADLSD 169
ADL +A N A+ A++R ++ G+ GA L +A YK A+ TGA+L D
Sbjct: 364 ADLEQADLSGANLAEADLADANLRGANLRGANLTGADLRRADLYKGDLRGADLTGANLED 423
Query: 170 TLMDRMVLNEA 180
T MD ++ +A
Sbjct: 424 TQMDGVLQTDA 434
>gi|334117107|ref|ZP_08491199.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461927|gb|EGK90532.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 520
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 57/95 (60%), Gaps = 9/95 (9%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N ++A+MR++ + ++ +GA L YKAN +GA L+ + R L EA L A ++R+
Sbjct: 51 NLSNANMRKAKLNVARLSGANL-----YKANLSGAILNVANLIRADLREAQLVEATMIRS 105
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
L R++L A + GA+ S+A DL ++A + AN
Sbjct: 106 ELIRANLSSANLTGANLSEA--DL--REATLREAN 136
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 3/138 (2%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-FRANF 132
T L+ A + N++ L+ N Y+A G I + A ADLR+A V+ R+
Sbjct: 50 TNLSNANMRKAKLNVARLSGANLYKANLSG--AILNVANLIRADLREAQLVEATMIRSEL 107
Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 192
A++ ++ +G+ + A L +A +AN ADLS + L ANL A L R L
Sbjct: 108 IRANLSSANLTGANLSEADLREATLREANLEQADLSGAHLRGASLTAANLERANLHRADL 167
Query: 193 TRSDLGGAIIEGADFSDA 210
+R+DL G + A+ A
Sbjct: 168 SRADLRGVNLCNAELRQA 185
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+A+LR+A + N A+ A++R +D SG+ GA L++A AN GA+LS+ +
Sbjct: 179 NAELRQANLSQANLSGADLRGANLRWADLSGANLTGADLDEARLSGANLYGANLSNVNLL 238
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L A+LT A L+ +DL GA + GA
Sbjct: 239 NATLVHADLTQANLIHADWVGADLTGAALTGA 270
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 65/134 (48%), Gaps = 11/134 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
+AA A+L +A + + R A A++ +++ SG+ GA L A AN
Sbjct: 153 TAANLERANLHRADLSRADLRGVNLCNAELRQANLSQANLSGADLRGANLRWADLSGANL 212
Query: 163 TGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
TGADL + + L ANL+ NA LV LT+++L A GAD + A + A+
Sbjct: 213 TGADLDEARLSGANLYGANLSNVNLLNATLVHADLTQANLIHADWVGADLTGAALTGAKI 272
Query: 218 QALCKYANGTNPIT 231
A+ ++ + IT
Sbjct: 273 YAVSRFDVKADDIT 286
>gi|209526319|ref|ZP_03274848.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376001485|ref|ZP_09779353.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423062694|ref|ZP_17051484.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493248|gb|EDZ93574.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375330094|emb|CCE15106.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406715650|gb|EKD10803.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 390
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 64/115 (55%), Gaps = 11/115 (9%)
Query: 110 AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFT 163
+A ADL +A+ +K N +A+ +SA++ +S+ + F AYL KA ++A+ +
Sbjct: 111 SAHLNWADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLS 170
Query: 164 GADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A+L D + L+E ANL A L LT+++LG A + GA+ +DA ++
Sbjct: 171 SANLKDVNLSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLN 225
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 70/146 (47%), Gaps = 9/146 (6%)
Query: 76 LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGS--AAQFGSADLRKAVHVKEN-FRAN 131
L+AA ++ C + L N EA+ T+ G + A A L A V+ + ++AN
Sbjct: 179 LSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLNSASLVEADLYQAN 238
Query: 132 FTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
T A++ ++ S + NGA+L K A+ G DLS L+ + L A L+ A
Sbjct: 239 LTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSQKLLTGINLAGAYLSEAT 298
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVI 212
LV +L ++L A + GA+ A +
Sbjct: 299 LVGALLMEANLSAANLSGANLQSACL 324
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A G DL + + N A A + E++ S + +GA L+ A A+
Sbjct: 270 SGADLGGVDLSQKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQSACLIHADL 329
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
GA +DR+ L +ANLT A L + L ++L AI+ G + A
Sbjct: 330 GGA-----YLDRVDLTDANLTGANLTKADLREANLRAAILAGVELKGA 372
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 115 SADLRKAVHVKEN------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
SA+ +A +K N F+A+ +SA++++ + S + + + +A AN T ADL+
Sbjct: 146 SANFVRAYLIKANLSEADLFQADLSSANLKDVNLSAANLSECKMTRANLMGANLTEADLT 205
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R L ANLT+A L L +DL A + A+ S A
Sbjct: 206 KANLGRANLRGANLTDAYLNSASLVEADLYQANLTRANLSRA 247
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 67/160 (41%), Gaps = 26/160 (16%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE---------------- 153
A F ADL A N + N + A++ ++ SGS NGA L+
Sbjct: 57 ADFSEADLSGAHLSLANLSKVNLSGANLTGANLSGSSLNGANLQGATLSGVNLESAHLNW 116
Query: 154 ----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+A+ K N ADLS + + L AN A L++ L+ +DL A + A+ D
Sbjct: 117 ADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLSSANLKD 176
Query: 210 AVIDLAQKQALCKY--AN--GTNPITGVSTRKSLGCGNSR 245
+ A + CK AN G N T+ +LG N R
Sbjct: 177 VNLS-AANLSECKMTRANLMGANLTEADLTKANLGRANLR 215
>gi|428216569|ref|YP_007101034.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988351|gb|AFY68606.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 330
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 15/164 (9%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR------ 129
L A + S NI++L + N A+ RG S A ADLR A + N
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRG--ADLSEANLAGADLRGADLSEANLANTDLTG 189
Query: 130 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN A MR E++ +G+ AY++ +AN + ADL T +D V++ ANL+
Sbjct: 190 ANLAEAIMRGTGLTEANLTGANLANAYMQNVRTERANLSEADLQGTNLDLAVMSMANLSK 249
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+ L L R++L G + + S A +L + Q + Y TN
Sbjct: 250 SNLSEASLYRANLNGTDLSRTNLSGA--NLREAQLVESYMARTN 291
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 81/177 (45%), Gaps = 19/177 (10%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQ 112
NQ AKL + + AL A + + + L D N A+ R G+ A
Sbjct: 73 NQAHLSEAKLNDVDLH-GAALVGATLVNADLTFAVLIDANLMNADLRSANLSGANLAGAC 131
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGAD 166
A LR+A + R AN AD+R E++ +G+ GA L +A + TGA+
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRGADLSEANLAGADLRGADLSEANLANTDLTGAN 191
Query: 167 LSDTLMDRMVLNEANLTNAVL-------VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L++ +M L EANLT A L VRT R++L A ++G + AV+ +A
Sbjct: 192 LAEAIMRGTGLTEANLTGANLANAYMQNVRT--ERANLSEADLQGTNLDLAVMSMAN 246
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 95 NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAY 151
N EA+ +G + + S A ++L +A +RAN D+ ++ SG+ A
Sbjct: 226 NLSEADLQGTNLDLAVMSMANLSKSNLSEASL----YRANLNGTDLSRTNLSGANLREAQ 281
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
L ++ + N T ADL+D L+ R L+ ANL NA L
Sbjct: 282 LVESYMARTNLTNADLADALLARAELSSANLLNANL 317
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
RAN + AD++ ++ S + + + L +A Y+AN G DLS T + L EA L
Sbjct: 224 RANLSEADLQGTNLDLAVMSMANLSKSNLSEASLYRANLNGTDLSRTNLSGANLREAQLV 283
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + RT LT +DL A++ A+ S A
Sbjct: 284 ESYMARTNLTNADLADALLARAELSSA 310
>gi|167913453|ref|ZP_02500544.1| pentapeptide repeat family protein [Burkholderia pseudomallei 112]
gi|403521532|ref|YP_006657101.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
BPC006]
gi|403076599|gb|AFR18178.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
BPC006]
Length = 825
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|443651776|ref|ZP_21130709.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159027471|emb|CAO89436.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334417|gb|ELS48929.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 931
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 3/117 (2%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
G A+L +A + + AN A++ ++ G+ GA L A +AN GA+L
Sbjct: 789 LGGANLERANLAEADIGGANLEGANLEGANLKGANLEGANLAMAFLKRANLEGANLRGAN 848
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
++ L ANL A L R L ++L GA + GA+ A +D A + Y G N
Sbjct: 849 LEEAYLEGANLAMAFLKRANLEGANLRGANLYGANLKGANLDWANLEG--AYLEGAN 903
Score = 43.1 bits (100), Expect = 0.13, Method: Composition-based stats.
Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 10/123 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
+ A G A+L A N + AN A ++ ++ G+ GA LE+A AN
Sbjct: 800 AEADIGGANLEGANLEGANLKGANLEGANLAMAFLKRANLEGANLRGANLEEAYLEGANL 859
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
A L ++ L ANL A L L ++L GA +EGA+ +D A K
Sbjct: 860 AMAFLKRANLEGANLRGANLYGANLKGANLDWANLEGAYLEGANLRGVFLDGAN----FK 915
Query: 223 YAN 225
YAN
Sbjct: 916 YAN 918
>gi|334188366|ref|NP_001190531.1| thylakoid lumenal protein [Arabidopsis thaliana]
gi|332008986|gb|AED96369.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 250
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL AV TV
Sbjct: 145 LSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTV 204
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
L+ S A +E F D +I Q +C+ N R LGC
Sbjct: 205 LSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 249
>gi|226194659|ref|ZP_03790253.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
gi|386863935|ref|YP_006276883.1| type VI secretion system [Burkholderia pseudomallei 1026b]
gi|418534996|ref|ZP_13100802.1| type VI secretion system [Burkholderia pseudomallei 1026a]
gi|225933225|gb|EEH29218.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
gi|385357281|gb|EIF63347.1| type VI secretion system [Burkholderia pseudomallei 1026a]
gi|385661063|gb|AFI68485.1| type VI secretion system [Burkholderia pseudomallei 1026b]
Length = 825
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
Length = 273
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 76/159 (47%), Gaps = 12/159 (7%)
Query: 79 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR 138
A++ SCS + A+ ++ T + S + SAD +A + N R A +
Sbjct: 115 ALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLR----QASLI 169
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
+ F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S LG
Sbjct: 170 GAVFALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLG 229
Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
GA GA+ A DL+Q + + T + G T++
Sbjct: 230 GANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 261
>gi|6226483|sp|Q52118.1|YMO3_ERWST RecName: Full=Uncharacterized protein in mobD 3'region
gi|886362|gb|AAA69501.1| unknown [Plasmid pSW200]
Length = 295
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 162
S A +ADL++A N A+ T+A++ ++D +GA L A +AY +A+
Sbjct: 170 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 229
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ A+LS+ + R L++ANL++A L L R+DL AI++GA+
Sbjct: 230 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 274
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 11/116 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYK--- 159
S A ADL A N AN T A + E+D S + +GA L A +
Sbjct: 85 SDANLSDADLSDANLSDANLSGANLAHANLTMAYLSEADLSNANLSGADLTNANLNQTDL 144
Query: 160 --ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
N +GA+L+ + L+EA+L+NA L L R+DL A + GAD ++A ++
Sbjct: 145 PNVNLSGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLN 200
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM----------VLN 178
AN T A + E+D S + + A L++A AN +GADL++ +++ L
Sbjct: 156 HANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLA 215
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ANLT A L L+ ++L A ++ AD SDA
Sbjct: 216 HANLTMAYLSEADLSNANLSNADLKRADLSDA 247
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+++ + S + GAYL A N + ADLSD + L+ ANL +A L L+ +
Sbjct: 68 NLKGVNLSDTDLKGAYLSDA-----NLSDADLSDANLSDANLSGANLAHANLTMAYLSEA 122
Query: 196 DLGGAIIEGADFSDAVID 213
DL A + GAD ++A ++
Sbjct: 123 DLSNANLSGADLTNANLN 140
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 51/91 (56%), Gaps = 5/91 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANFTGADLSDTLMDRMVLNEANLTN 184
A+ T+A++ ++D +GA L A +AY +A+ + A+LS+ + R L+ ANL+
Sbjct: 132 ADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSG 191
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
A L L ++DL + GA+ + A + +A
Sbjct: 192 ADLTNANLNQTDLPNVNLSGANLAHANLTMA 222
>gi|443319118|ref|ZP_21048355.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442781316|gb|ELR91419.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 331
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 59/130 (45%), Gaps = 20/130 (15%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A D+R D SG + A L A N TGA+LS + R L +ANLT +L
Sbjct: 191 AYLNGVDLRGMDLSGVNLSQARLNGAKLDLVNLTGANLSQATLRRASLQQANLTGTILTG 250
Query: 190 TV----------LTRSD-----LGGAIIE-----GADFSDAVIDLAQKQALCKYANGTNP 229
V LTR+D L GA+++ GA+F+DA++ + L A G
Sbjct: 251 AVLWHADMQGVNLTRADLSQANLAGALLQATSITGAEFTDAILPEESRNGLYALATGETL 310
Query: 230 ITGVSTRKSL 239
+ TR++L
Sbjct: 311 WSHRLTRETL 320
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 13/117 (11%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLE 153
L YEA R GI S +L + + V + D+ E+ G+ A+L
Sbjct: 28 LEMYEAGYRDFAGI----HLNSVNLSQRILV---------AVDLAEASLVGADLARAFLT 74
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
KA Y+AN A+LS T + + L +++L+ A L T + ++ L GA + GA+ A
Sbjct: 75 KANLYRANLHRANLSFTKLSDVNLRQSDLSKADLRSTFMVKAHLEGANLSGANLGQA 131
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 11/109 (10%)
Query: 116 ADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
ADLR VK + +AN A++ ++ G+ GA L A +AN +
Sbjct: 106 ADLRSTFMVKAHLEGANLSGANLGQANLRGANLEGANLCGANLQGANLRGANLSQANLSW 165
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A+LS + M + L+ L + L L DL G + G + S A ++
Sbjct: 166 ANLSGSRMGGVALDRTQLADVTLEGAYLNGVDLRGMDLSGVNLSQARLN 214
>gi|443476809|ref|ZP_21066696.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018179|gb|ELS32476.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 330
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 68/131 (51%), Gaps = 13/131 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
L+ +N A G IG+ F +L +A + N + AN AD++ ++ + G
Sbjct: 183 LSRVNLQGANLSGAIAIGTI--FTEVNLSQANLTEVNLKGANLMKADLKNANLRLANLFG 240
Query: 150 AYLEKA---VAYKAN-------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
A L KA +A +N TG+DLS +L+DR L++A+L +A LVR L +DL
Sbjct: 241 ANLSKANLSMATLSNAGLIQAILTGSDLSRSLLDRANLSQASLVDAYLVRANLDGADLSN 300
Query: 200 AIIEGADFSDA 210
AI+ A+ S A
Sbjct: 301 AILTRAELSGA 311
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 60/125 (48%), Gaps = 21/125 (16%)
Query: 131 NFTSADMRESDF---------------SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
N T AD+R ++F SG+K GA L +A+ AN T ADL ++ R+
Sbjct: 36 NLTKADLRRTNFVFAYLNKVTFNHANLSGAKLGGATLNQAIMMSANLTEADLHGAMLQRV 95
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP--ITGV 233
L ANL+ A L+ L+ +DL + GA+ A++ AL + G P + G
Sbjct: 96 NLFGANLSLANLMDANLSEADLRSVNLRGANLRCAIL----SAALMREERGYPPTNMVGA 151
Query: 234 STRKS 238
+ RK+
Sbjct: 152 NLRKA 156
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 6/94 (6%)
Query: 124 VKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
V N R A+ A++ SD +G +GA L +A + N GA+LS + + E NL
Sbjct: 149 VGANLRKADLRGANLSGSDLTGVDLSGANLSEATLSRVNLQGANLSGAIAIGTIFTEVNL 208
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+ A LT +L GA + AD +A + LA
Sbjct: 209 SQA-----NLTEVNLKGANLMKADLKNANLRLAN 237
>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 179
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 49/95 (51%), Gaps = 1/95 (1%)
Query: 117 DLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DL+ A N AN +AD+ E++ G+ GA L+ A K N GA+L +
Sbjct: 65 DLQNANLQGANLEGANLQNADLEEANLQGANLAGANLQGADLEKGNLAGANLQTANLINA 124
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L EANL NA L L R+DL A + GA+ ++A
Sbjct: 125 DLEEANLQNANLQGASLQRADLEKANLTGANTNEA 159
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 68/142 (47%), Gaps = 8/142 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A S +LR+ + KE N + D++ ++ G+ GA L+ A +AN GA+L+
Sbjct: 38 STAPEASTELRRLLDTKECAGCNLSGVDLQNANLQGANLEGANLQNADLEEANLQGANLA 97
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
+ L + NL A L L +DL A ++ A+ A + ++A + AN
Sbjct: 98 GANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASL----QRADLEKAN--- 150
Query: 229 PITGVSTRKSLGCGNSRRNAYG 250
+TG +T ++ G + NA G
Sbjct: 151 -LTGANTNEANLQGANLENAIG 171
>gi|424851694|ref|ZP_18276091.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
gi|356666359|gb|EHI46430.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
Length = 194
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 60/131 (45%), Gaps = 16/131 (12%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
+E R E I + F ADL ++ HV FR+ +FT + S+F GS+F+ L
Sbjct: 31 SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 90
Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 91 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 150
Query: 203 EGADFSDAVID 213
+GAD A ID
Sbjct: 151 DGADLRGARID 161
>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
Length = 436
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 9/149 (6%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
T+ EF A A+L KA+ +++ D SG+ GA L A+ A
Sbjct: 203 TKAEFT-TDAKVIEKAELIKAIR-----EGTIDKTTLQQVDLSGAILRGAILIGAILRGA 256
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQK 217
N +GA+LSD ++ +L+ A L+ A L L+ +DL GA + GA+ S+A + DL++
Sbjct: 257 NLSGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSEA 316
Query: 218 QALCKYANGTNPITGVSTRKSLGCGNSRR 246
+G N I R +L N RR
Sbjct: 317 DLSEADLSGANLIDANLRRANLIKANLRR 345
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 10/127 (7%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+LR A + + A+ + AD+ E+D SG+ A L +A KAN A+L
Sbjct: 289 SGADLSGANLRGANLSEADLSEADLSEADLSEADLSGANLIDANLRRANLIKANLRRANL 348
Query: 168 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+ ++ L+ ANL A+L+ +L +DL GA + A+ S+A I+ A+
Sbjct: 349 IEAILSEADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFI 404
Query: 223 YANGTNP 229
A G P
Sbjct: 405 DATGITP 411
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A LR A+ + F S AD+ +D SG+ GA L +A +A+ + ADL
Sbjct: 259 SGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSEADL 318
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S+ + L +ANL A L++ L R++L AI+ AD S A
Sbjct: 319 SEADLSGANLIDANLRRANLIKANLRRANLIEAILSEADLSGA 361
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 2/103 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+LR+A +K N R AN A + E+D SG+ A L KA+ +A ADL
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEAILIEADL 383
Query: 168 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 209
+ L+EA++ NA+ + T +T I GA F D
Sbjct: 384 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 426
>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 300
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 64/121 (52%), Gaps = 3/121 (2%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNG 149
L+D N EA+ G + A A+L R V + AN + E+DF+ S+
Sbjct: 93 LSDANLVEADLSGSMLV--EANLRGANLSRGKVRDVDLTSANLSDGFFIETDFTRSQMVR 150
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ +++A +A TG +LS + ++++ L+ A+L NAVLV +T SDL A GAD D
Sbjct: 151 SKMQRAFLGRATLTGTNLSWSNLEKVNLDNADLQNAVLVDVDITSSDLVAANFSGADLRD 210
Query: 210 A 210
A
Sbjct: 211 A 211
>gi|254417634|ref|ZP_05031369.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196175575|gb|EDX70604.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 470
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 9/110 (8%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A SADLR A A AD+R +D G+K N A L+ A AN +GA+LS
Sbjct: 214 ANLVSADLRNANLTD----AQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLS-- 267
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
+ + A+ A+LVRT L + L G+ + AD + A + AQ + +
Sbjct: 268 ---QAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGI 314
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 70/158 (44%), Gaps = 23/158 (14%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVK 125
N + V T L AV+ + I+ L N A+ +G IG F A+L KA +
Sbjct: 277 NGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKG---IG----FNRANLTKANLEGA 329
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRM 175
+ A AD+ + +G+ + AYL A AN +G DL S+ +
Sbjct: 330 DLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQLREANLSNVTLVGA 389
Query: 176 VLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 208
L +ANL T A L T LTR DL GA + GAD S
Sbjct: 390 TLEDANLIRSTLTGANLTYTNLTRCDLRGANLTGADLS 427
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 6/110 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A +AD A+ V+ R +NF AD+ +++ G++ G +A KAN
Sbjct: 267 SQAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGIGFNRANLTKANL 326
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
GADL++ + L A LT A+L L + L A + G D A +
Sbjct: 327 EGADLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQL 376
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 6/144 (4%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
F++ AA V +N+ ADL Y A G + S A A L A V+ R
Sbjct: 154 FIANWYAAVVTDLRDTNLQG-ADL--YRANLDG--ALLSRANLQDAQLDYANLVRTYLRE 208
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A T+A++ +D + A LE A A+ GA L++ +D + + ANL+ A L +
Sbjct: 209 ATLTNANLVSADLRNANLTDAQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLSQ 268
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
+T +D GAI+ +AV++
Sbjct: 269 AYITNADFNGAILVRTTLREAVLN 292
>gi|428215879|ref|YP_007089023.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004260|gb|AFY85103.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 284
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 19/101 (18%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN + D+ E+D SG AN + ADLSDT + +L ANLT A+L
Sbjct: 159 RANLSGLDLSETDLSG---------------ANLSYADLSDTQLTEAILYGANLTGAILT 203
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
L + + G++++GAD S A + + A K+ + TN
Sbjct: 204 SAQLDGAKMNGSLVDGADLSQANL----QDAEVKWVDLTNA 240
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S +QF SA L+ A V+ N + +AD+R +D S + G+ L +A + N TGA+L
Sbjct: 43 SHSQFCSAILQGATLVEANLEQTKLRAADLRRADLSHANLMGSDLSRADMIETNLTGANL 102
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ ++ +E +A L R L +L G + GA+ +A I
Sbjct: 103 EQANLTEVIFSEVIFADANLSRANLQGLNLSGINLSGANLQEAHI 147
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 59/128 (46%), Gaps = 10/128 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANF 162
S A +DL +A ++ N AN A++ E FS F A L +A N
Sbjct: 78 SHANLMGSDLSRADMIETNLTGANLEQANLTEVIFSEVIFADANLSRANLQGLNLSGINL 137
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+GA+L + + + + ANL+ A L L+ +DL GA + AD SD + +A+
Sbjct: 138 SGANLQEAHIAEVSFHNANLSRANLSGLDLSETDLSGANLSYADLSDTQL----TEAILY 193
Query: 223 YANGTNPI 230
AN T I
Sbjct: 194 GANLTGAI 201
>gi|53715998|ref|YP_106439.1| pentapeptide repeat-containing protein [Burkholderia mallei ATCC
23344]
gi|121597894|ref|YP_990510.1| pentapeptide repeat-containing protein [Burkholderia mallei SAVP1]
gi|124382797|ref|YP_001025000.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
10229]
gi|126447556|ref|YP_001079344.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
10247]
gi|166999172|ref|ZP_02265018.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
gi|238561876|ref|ZP_00441284.2| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
4]
gi|254176522|ref|ZP_04883180.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
gi|254203434|ref|ZP_04909795.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
gi|254205313|ref|ZP_04911666.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
gi|254356120|ref|ZP_04972397.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
gi|52421968|gb|AAU45538.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 23344]
gi|121225692|gb|ABM49223.1| pentapeptide repeat family protein [Burkholderia mallei SAVP1]
gi|126240410|gb|ABO03522.1| pentapeptide repeat family protein [Burkholderia mallei NCTC 10247]
gi|147745673|gb|EDK52752.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
gi|147754899|gb|EDK61963.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
gi|148025103|gb|EDK83272.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
gi|160697564|gb|EDP87534.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
gi|238523698|gb|EEP87135.1| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
4]
gi|243064727|gb|EES46913.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
gi|261826983|gb|ABM99323.2| pentapeptide repeat family protein [Burkholderia mallei NCTC 10229]
Length = 825
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|374300595|ref|YP_005052234.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
gi|332553531|gb|EGJ50575.1| Protein of unknown function DUF2169 [Desulfovibrio africanus str.
Walvis Bay]
Length = 1248
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 10/93 (10%)
Query: 136 DMRESDFSGSKFN----------GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
D+R D SG++ GA L KA+ +A+F+GA LS + VL + +L A
Sbjct: 949 DLRGIDLSGTQLGKTLMCGTNLAGANLSKAMGQEADFSGACLSGANLTGAVLQKTSLVEA 1008
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
+L L ++ L G+ + GAD SDA +D+ Q
Sbjct: 1009 ILSGACLKQAVLNGSDLSGADLSDATLDMVVIQ 1041
Score = 46.6 bits (109), Expect = 0.013, Method: Composition-based stats.
Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 25/135 (18%)
Query: 110 AAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
A+L KA+ + +F AN T A ++++ + +GA L++AV ++ +
Sbjct: 967 GTNLAGANLSKAMGQEADFSGACLSGANLTGAVLQKTSLVEAILSGACLKQAVLNGSDLS 1026
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTR---------SDLGGA----------IIEG 204
GADLSD +D +V+ +A L A + R L +D GA +++G
Sbjct: 1027 GADLSDATLDMVVIQKAKLDGADVRRASLKMCVIEGPAAGADFRGARFTQCVLKRMLLDG 1086
Query: 205 ADFSDAVIDLAQKQA 219
ADFS A ++ QA
Sbjct: 1087 ADFSGAALNSTVLQA 1101
>gi|291570913|dbj|BAI93185.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 484
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 59/107 (55%), Gaps = 14/107 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
+ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 34 QANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 93
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+ANL +A L+R L R++L A++ GA+ ++A DL ++A ++A+
Sbjct: 94 QANLVDASLIRAELMRAELSEAVVNGANLTEA--DL--REATLRHAD 136
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 13/129 (10%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
TR + + S A A+L +AV +V RA+ + A++ ++ ++ A L +AV
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAVV 117
Query: 158 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 207
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177
Query: 208 SDAVIDLAQ 216
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 201 IIEGADFSDAVIDLA 215
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN T AD+RE+ D + +GA L +A +N ++L+ + R L NL N
Sbjct: 120 ANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNLRN 179
Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
A L + L +DL GA + GA+
Sbjct: 180 AELRQAELNGADLRGANLSGANL 202
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 47/93 (50%), Gaps = 1/93 (1%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGASL 237
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260
>gi|307944130|ref|ZP_07659471.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
gi|307772476|gb|EFO31696.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
Length = 534
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 1/113 (0%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
I A+ ADLR A + + R A AD+R + +K GA L++A +A+
Sbjct: 63 LAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKLQQAKLWGADLQEADLQEADLR 122
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
GADL + L A L A L L +DL GA + GAD A ++ A+
Sbjct: 123 GADLRGAKLQEADLRGAKLQEADLRGAKLQEADLRGAKLRGADLRGAKLEWAK 175
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 76/190 (40%), Gaps = 34/190 (17%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
W L A + ++ L + EA+ RG A+ ADLR A +
Sbjct: 42 EWADLWGANLQQAKLQQADLRLAILQEAKLQEADLRG-------AKLQQADLRGAKLQQA 94
Query: 127 NFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
+ R A+ AD++E+D G+ GA L+ +A+ GA L + +
Sbjct: 95 DLRLAKLQQAKLWGADLQEADLQEADLRGADLRGAKLQ-----EADLRGAKLQEADLRGA 149
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ------ALCKYANGTNP 229
L EA+L A L +DL GA +E A A ++ A + A+ +A
Sbjct: 150 KLQEADLRGA-----KLRGADLRGAKLEWAKLEWAKLEWADVRTVKSSLAVSGFARADFT 204
Query: 230 ITGVSTRKSL 239
TG T+K +
Sbjct: 205 HTGYLTQKQV 214
>gi|297796179|ref|XP_002865974.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
gi|297311809|gb|EFH42233.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
Length = 236
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 144 GSKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
G+KF+GA + KA A +A+F G + ++ ++DR+ ++NL AV TVL+ S
Sbjct: 138 GAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLKGAVFRNTVLSGSTFE 197
Query: 199 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
A +E F D +I Q +C+ N R LGC
Sbjct: 198 EANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235
>gi|78187857|ref|YP_375900.1| pentapeptide repeat-containing protein [Chlorobium luteolum DSM
273]
gi|78167759|gb|ABB24857.1| pentapeptide repeat family protein [Chlorobium luteolum DSM 273]
Length = 447
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 59/118 (50%), Gaps = 21/118 (17%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRE----------SDFSGSKFNGAYLEKA---- 155
A+ ADLR+ V ++ + AN A++RE +D G+ GA+L KA
Sbjct: 63 AELAGADLRRTVLIRADLSGANLNGANLREANLAMAFIRKADMKGADMTGAWLVKANLKS 122
Query: 156 -----VAYK-ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+++ AN GA+L + + + L ANL+NAVL L +DL GA + GA F
Sbjct: 123 SFMNGASFRGANLLGANLRWSSLRKADLTGANLSNAVLFEANLAGADLSGANLSGATF 180
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 6/98 (6%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADLR+A +F A +ADMR G+ AY++KA A GA L +DR
Sbjct: 297 ADLRQADLGASSFNGATLDNADMR-----GANLRNAYMKKADLKSAKLGGACLEGANLDR 351
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L +A+L+ A L T+L + L GA +EGAD + A +
Sbjct: 352 AFLKDADLSGANLRGTMLYGATLSGANLEGADLAGASL 389
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 2/106 (1%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A+ F A L A N R A AD++ + G+ GA L++A A+ +GA+L
Sbjct: 306 ASSFNGATLDNADMRGANLRNAYMKKADLKSAKLGGACLEGANLDRAFLKDADLSGANLR 365
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 213
T++ L+ ANL A L L +DL GA ++GAD A V+D
Sbjct: 366 GTMLYGATLSGANLEGADLAGASLFDADLRGANLDGADLEGANVMD 411
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 21/113 (18%)
Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A A+LR A K + + AN A ++++D SG+ G L A
Sbjct: 317 ADMRGANLRNAYMKKADLKSAKLGGACLEGANLDRAFLKDADLSGANLRGTMLYGATLSG 376
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
AN GADL+ A+L +A L L +DL GA + ADF+DAV
Sbjct: 377 ANLEGADLAG----------ASLFDADLRGANLDGADLEGANVMDADFTDAVF 419
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 20/81 (24%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
AD+R++D S FNGA L+ A + ANL NA + +
Sbjct: 294 LEGADLRQADLGASSFNGATLDNAD--------------------MRGANLRNAYMKKAD 333
Query: 192 LTRSDLGGAIIEGADFSDAVI 212
L + LGGA +EGA+ A +
Sbjct: 334 LKSAKLGGACLEGANLDRAFL 354
>gi|386828886|ref|ZP_10115993.1| putative low-complexity protein [Beggiatoa alba B18LD]
gi|386429770|gb|EIJ43598.1| putative low-complexity protein [Beggiatoa alba B18LD]
Length = 199
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 47/83 (56%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
FRAN D+ ++ SG+ +GA L +A KAN + ADLS+ + L NLT+A L
Sbjct: 48 FRANLNKVDLTNANLSGANLSGANLSEANLSKANLSKADLSEANLSESYLARTNLTDANL 107
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
LT++ L + + GA+ S+A
Sbjct: 108 SEANLTKAYLIESYLSGANLSEA 130
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 11/115 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAY---- 158
S A A+L ++ + N AN T A + ES SG+ + A L +A +
Sbjct: 83 SKADLSEANLSESYLARTNLTDANLSEANLTKAYLIESYLSGANLSEANLFRANLFESDL 142
Query: 159 -KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+AN TGA+L T + L EA LT A L + +T +DL GA ++ + I
Sbjct: 143 FRANLTGANLFKTNLTETNLIEAYLTGASLFKATMTEADLTGAKMDDTHLDENAI 197
>gi|134280632|ref|ZP_01767342.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
gi|134247654|gb|EBA47738.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
Length = 825
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|189347104|ref|YP_001943633.1| pentapeptide repeat-containing protein [Chlorobium limicola DSM
245]
gi|189341251|gb|ACD90654.1| pentapeptide repeat protein [Chlorobium limicola DSM 245]
Length = 408
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 6/110 (5%)
Query: 109 SAAQFGSAD---LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
SA+ F +AD L+ V ++RA R +D SG++ G L A A+ +GA
Sbjct: 21 SASAFNTADFNALKTGVKPWNSYRAGLGG---RVADLSGAQLKGMNLRGADLSYADLSGA 77
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
DL+ + + + L+ A L +AVL +L R+ L A + AD DAV++ A
Sbjct: 78 DLASSDLSKARLDHARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAA 127
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 6/108 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTG 164
A+ SA LR A+ V+ + +A +AD+ ++ + F GA+++ AV KA+ F+G
Sbjct: 92 ARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAASFKGAFMQTAVLKKADCTGADFSG 151
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
ADL +T L A LT A L T L R+D+ +++ G+ S + +
Sbjct: 152 ADLRETNFREARLAGALLTGADLRATYLWRADMSRSVLSGSRVSPSTV 199
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 56/120 (46%), Gaps = 2/120 (1%)
Query: 113 FGSADLRKAVHVKE-NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
F D+RK E N R A F ++ +D + ++ GA KA + A+ ADLS
Sbjct: 285 FAWNDMRKRNRAMEVNLRQAKFDQKNLSYADLAHARLQGASFRKADLFDADLRNADLSGC 344
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
M L +A+L A L L R++LG A + G S + + K+A K+A + +
Sbjct: 345 DMREANLEKADLGGADLSGVNLWRANLGRARLNGVKVSASTVLDTGKKADQKWAERHDAV 404
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 52/125 (41%), Gaps = 19/125 (15%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
N Y A G S AQ +LR A + A+ + AD+ SD S ++ + A L+
Sbjct: 41 NSYRAGLGGRVADLSGAQLKGMNLRGA----DLSYADLSGADLASSDLSKARLDHARLDS 96
Query: 155 AVAY----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
AV KA ADL D VL A+ A + VL ++D GA G
Sbjct: 97 AVLRSALLVRASLDKARLHNADLEDA-----VLEAASFKGAFMQTAVLKKADCTGADFSG 151
Query: 205 ADFSD 209
AD +
Sbjct: 152 ADLRE 156
>gi|428314781|ref|YP_007150965.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256164|gb|AFZ22121.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 237
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 6/96 (6%)
Query: 113 FGSADLRKAVHVKENF-RANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F ADL A + N +AN + A ++R++D +G+K A L A A+ TGA+
Sbjct: 130 FQGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATLVGAHMTGAN 189
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
L+ + VL A+LT AVL++T L +DL AI+
Sbjct: 190 LTGANFNNAVLRYADLTKAVLIKTNLKGADLSLAIM 225
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 57/109 (52%), Gaps = 9/109 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFT 163
S ADLR + RAN + ++++ SG++ N A L + A + +F
Sbjct: 76 SGLDLSGADLRNT----DLSRANLKNTKLKDAKMSGARLNQANLTYADLDGADFQECDFQ 131
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
GADLS+ + L +ANL+ A L RT L +DL GA +E A+ S+A +
Sbjct: 132 GADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATL 180
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN T AD+ +DF F GA L A N A+LS ++R L +A+LT A L
Sbjct: 112 QANLTYADLDGADFQECDFQGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLE 171
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
L+ + L GA + GA+ + A + A+ +YA+ T +
Sbjct: 172 SANLSNATLVGAHMTGANLTGANFN----NAVLRYADLTKAV 209
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 12/85 (14%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D+RE + SG +GA L +AN L D M LN+ANLT A
Sbjct: 69 DLREINLSGLDLSGADLRNTDLSRANLKNTKLKDAKMSGARLNQANLTYA---------- 118
Query: 196 DLGGAIIEGADFSDAVIDLAQKQAL 220
DL GA + DF A DL+ Q L
Sbjct: 119 DLDGADFQECDFQGA--DLSNAQLL 141
>gi|451980423|ref|ZP_21928815.1| conserved hypothetical protein, contains pentapeptide repeats
[Nitrospina gracilis 3/211]
gi|451762323|emb|CCQ90046.1| conserved hypothetical protein, contains pentapeptide repeats
[Nitrospina gracilis 3/211]
Length = 289
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 64/136 (47%), Gaps = 31/136 (22%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFN-----GA------------ 150
S A+F A L++A N R+ F A M E++ +G +FN GA
Sbjct: 100 SGAKFHQALLKRAQFEGANLVRSEFLEAQMNEANLAGVRFNKSDLRGAMMIGINLAGAQI 159
Query: 151 ---YLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDL 197
+L K K + TG D L+ + + VL E N NA+L RT LT ++L
Sbjct: 160 PQSHLSKTNISKGDLTGTDVSGCNLTGSDLREAVLRETNFQNAILDRTFLKGADLTGANL 219
Query: 198 GGAIIEGADFSDAVID 213
GA + GADF++ V+D
Sbjct: 220 TGARLRGADFAETVLD 235
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 16/117 (13%)
Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
+ +F +DLR A+ + N + N + D+ +D SG G+ L +AV
Sbjct: 135 AGVRFNKSDLRGAMMIGINLAGAQIPQSHLSKTNISKGDLTGTDVSGCNLTGSDLREAVL 194
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
+ NF A ++DR L A+LT A L L +D +++GA+FS A + L
Sbjct: 195 RETNFQNA-----ILDRTFLKGADLTGANLTGARLRGADFAETVLDGANFSGADLSL 246
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 10/84 (11%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R N + + R +D SG+KF+ A L+ +A F GA+L + +NEANL V
Sbjct: 86 RTNLSGVNFRNTDLSGAKFHQALLK-----RAQFEGANLVRSEFLEAQMNEANLAG---V 137
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
R +SDL GA++ G + + A I
Sbjct: 138 R--FNKSDLRGAMMIGINLAGAQI 159
>gi|428224453|ref|YP_007108550.1| heat shock protein DnaJ domain-containing protein [Geitlerinema sp.
PCC 7407]
gi|427984354|gb|AFY65498.1| heat shock protein DnaJ domain protein [Geitlerinema sp. PCC 7407]
Length = 297
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNA 185
N + A++ E DFSG + A L +A +K N +GA+LS + R L +ANL NA
Sbjct: 183 NLSGANLAEKDFSGRNLSNADLSQADLSDTFLHKVNLSGANLSGAKLFRANLLQANLRNA 242
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
L L +DL GA + GAD + A + A +
Sbjct: 243 NLQNANLVGADLSGADLTGADLTGARVGTADR 274
>gi|440681919|ref|YP_007156714.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428679038|gb|AFZ57804.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 269
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 69/141 (48%), Gaps = 18/141 (12%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFS-----GSKFNGAYLEKAVAYK 159
A +A+L +A ++ N ANF+ AD+ ++D S G+ + A L AV
Sbjct: 69 ANLTNANLSQAKLIEANLSQANLSIANFSGADLTQADLSQVNLIGANLSDANLRNAVITD 128
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
AN G D S+ +LN+A+L A L+R+ L+ ++L GA + AD S+A +L +
Sbjct: 129 ANLIGTDFSNA-----ILNDADLAAAKLIRSNLSFANLIGANLIAADLSEA--NLYDAEV 181
Query: 220 LCKYANGTNPITGVSTRKSLG 240
+ Y N TR LG
Sbjct: 182 MTAYLYKANLSKANLTRVHLG 202
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADL A ++ N AN +AD+ E++ ++ AYL KA KAN
Sbjct: 137 SNAILNDADLAAAKLIRSNLSFANLIGANLIAADLSEANLYDAEVMTAYLYKANLSKANL 196
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T L + + + L+EANLTNA L + L ++L GA ++ A+ A
Sbjct: 197 TRVHLGSSYLFKANLSEANLTNADLSWSNLRYANLAGANLQRANLRGA 244
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 12/118 (10%)
Query: 99 AETRGEFGIGSAAQ---FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKA 155
A +GE G+ Q F DL A+ V R N A++ ++ S +K A L +A
Sbjct: 34 ANLQGENLRGANLQGVNFTKVDLSHALLV----RTNLMFANLTNANLSQAKLIEANLSQA 89
Query: 156 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
ANF+GADL+ + ++ L ANL++A L V+T ++L G DFS+A+++
Sbjct: 90 NLSIANFSGADLTQADLSQVNLIGANLSDANLRNAVITDANL-----IGTDFSNAILN 142
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
E D S + G L A NFT DLS L+ R L ANLTNA L + L ++L
Sbjct: 28 EIDLSTANLQGENLRGANLQGVNFTKVDLSHALLVRTNLMFANLTNANLSQAKLIEANLS 87
Query: 199 GAIIEGADFSDAVIDLAQ 216
A + A+FS A DL Q
Sbjct: 88 QANLSIANFSGA--DLTQ 103
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 53/103 (51%), Gaps = 6/103 (5%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADL +A ++ E A A++ +++ + +YL KA +AN T ADLS
Sbjct: 164 ANLIAADLSEANLYDAEVMTAYLYKANLSKANLTRVHLGSSYLFKANLSEANLTNADLSW 223
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ L ANL A L R L ++L GA ++GA+ D ++
Sbjct: 224 S-----NLRYANLAGANLQRANLRGANLQGANLKGANLQDTIM 261
>gi|75906828|ref|YP_321124.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75700553|gb|ABA20229.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 727
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A + A++ ++D+ S +GA LE+A N + ADLS T M +L A L NA L
Sbjct: 596 AQLSFANLTKTDWQSSDLSGADLERA-----NLSNADLSATRMTGAILRSAQLENANLRN 650
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+ DL GA + GADF D ++
Sbjct: 651 ADLSLVDLRGANVAGADFKDTIL 673
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)
Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 172
F SA++ ++ F GS+F A L +A +ANFT A+LS LM
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRLDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528
Query: 173 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 210
R LN ANL+NA L+ L +DL G ++E GAD DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 25/137 (18%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
A+ADL++ + + A F A+L + + + + RAN ++A + ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 193
++ GA L V A+ TGADL D + R++ A L+ A L +T
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609
Query: 194 RSDLGGAIIEGADFSDA 210
SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626
>gi|425454434|ref|ZP_18834174.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9807]
gi|389804880|emb|CCI15729.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9807]
Length = 962
Score = 50.4 bits (119), Expect = 9e-04, Method: Composition-based stats.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A LR A+ N + AN A+++E++ + F GA L +A +AN GA+L +
Sbjct: 798 ANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGAILAEANLERANLYGANLGE 857
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ L ANL A L R L + L GA +E A+ A +
Sbjct: 858 ANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFL 900
Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats.
Identities = 39/117 (33%), Positives = 58/117 (49%), Gaps = 9/117 (7%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
E I A A+L++A ++KE AN A++ E+ F G+ A LE+A Y AN
Sbjct: 801 EGAILRGAILEGANLKEA-NLKE---ANLKEANLEEAFFEGAILAEANLERANLYGANLG 856
Query: 164 GADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
A+L + ++ L ANL A L+ L R++L GA + GA A I+ A
Sbjct: 857 EANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQWADIERA 913
Score = 46.6 bits (109), Expect = 0.013, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 45/87 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN A++ E++ + GA LE+A +AN GA L ++R L A L A L
Sbjct: 847 RANLYGANLGEANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQ 906
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
+ R++L GA +E A F A ++ A
Sbjct: 907 WADIERANLDGANLETASFYGANLERA 933
Score = 45.4 bits (106), Expect = 0.026, Method: Composition-based stats.
Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 117 DLRKAVHV-KENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DL+ + + ++ ++AN A++ + G+ GA L++A +AN A+L + +
Sbjct: 779 DLKNCLLICRDLYKANLERANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGA 838
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+L EANL A L L ++L A + GA+ +A ++ A
Sbjct: 839 ILAEANLERANLYGANLGEANLEEAFLAGANLEEAFLERA 878
Score = 43.9 bits (102), Expect = 0.070, Method: Composition-based stats.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 4/101 (3%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G A+L +A AN A + ++ G+ GA+LE+A A GA L
Sbjct: 852 GANLGEANLEEAFLAG----ANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQW 907
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++R L+ ANL A L R++L A + GA+F DA
Sbjct: 908 ADIERANLDGANLETASFYGANLERANLERANLVGANFKDA 948
>gi|220906448|ref|YP_002481759.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863059|gb|ACL43398.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 309
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 56/115 (48%), Gaps = 10/115 (8%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYL 152
L +YEA R GI +LR A + N RA N A + +++F G+ GA L
Sbjct: 134 LQRYEAGERNFQGI---------NLRGAQLNQLNLRAINLEQAQLEDANFQGTVLEGANL 184
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+A +AN GA L + +D L A+L A L T L R++L A + G +F
Sbjct: 185 RQANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNF 239
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 53/126 (42%), Gaps = 16/126 (12%)
Query: 116 ADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+LR+A + N + AN TSAD+ + + + A L A NF
Sbjct: 182 ANLRQANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNFWL 241
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
ADL + L ANL + R ++L G + GAD DA+ D Q C +
Sbjct: 242 ADLQSVNFTQANLTGANLGGTDVSRANFKAANLTGVNLSGADRRDAIYD----QFTC-FP 296
Query: 225 NGTNPI 230
G NP+
Sbjct: 297 EGFNPL 302
>gi|17228308|ref|NP_484856.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
7120]
gi|535436|gb|AAB59979.1| HglK [Nostoc sp. PCC 7120]
gi|17130158|dbj|BAB72770.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
7120]
gi|1585247|prf||2124368C hglK gene
Length = 727
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A + A++ ++D+ S +GA LE+A N + ADLS T M +L A L NA L
Sbjct: 596 AQLSFANLTKTDWQSSDLSGADLERA-----NLSNADLSATRMTGAILRSAQLENANLRN 650
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+ DL GA + GADF D ++
Sbjct: 651 ADLSLVDLRGANVAGADFKDTIL 673
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)
Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 172
F SA++ ++ F GS+F A L +A +ANFT A+LS LM
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRWDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528
Query: 173 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 210
R LN ANL+NA L+ L +DL G ++E GAD DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 25/137 (18%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
A+ADL++ + + A F A+L + + + + RAN ++A + ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 193
++ GA L V A+ TGADL D + R++ A L+ A L +T
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609
Query: 194 RSDLGGAIIEGADFSDA 210
SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626
>gi|428211194|ref|YP_007084338.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999575|gb|AFY80418.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 190
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ +D + + GA L A + +FTGA+L R +L +A L +A LVR
Sbjct: 85 ANLSGADLSGADLTDADLGGADLSYATLHYTDFTGANLF-----RAMLVDAKLNHAKLVR 139
Query: 190 TVLTRSDLGGAIIEGADFS 208
L ++L GAI+EGA FS
Sbjct: 140 VRLRSANLNGAIVEGAIFS 158
>gi|381204220|ref|ZP_09911291.1| hypothetical protein SclubJA_01165 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 155
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 45/74 (60%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
FR NF +D+ +++ S + + L++ + AN GADL+ T + + +L EANLT A++
Sbjct: 65 FRTNFYKSDLTDANLSETNLVRSNLKQTILQGANLQGADLTRTDLRKAILFEANLTGALI 124
Query: 188 VRTVLTRSDLGGAI 201
T LT + L GAI
Sbjct: 125 KDTKLTGTVLKGAI 138
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 44/81 (54%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
T A + + + G+ G+ L + YK++ T A+LS+T + R L + L A L
Sbjct: 44 LTKAKLSKMELQGANLRGSNLFRTNFYKSDLTDANLSETNLVRSNLKQTILQGANLQGAD 103
Query: 192 LTRSDLGGAIIEGADFSDAVI 212
LTR+DL AI+ A+ + A+I
Sbjct: 104 LTRTDLRKAILFEANLTGALI 124
>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
Length = 319
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
Q ADLR +FR +F+ A++RE DF+G+ AYL +A N TGA+L T
Sbjct: 25 QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLTGANLEGT 84
Query: 171 LMDRMVLNEAN-----LTNAVLVRTVLTRSD----------LGGAIIEGADFSDAVID 213
+ ++ L +AN + A L LT+SD L G + GA DA D
Sbjct: 85 SLIKIYLIKANCYQTDFSGAYLTGAYLTKSDFKEAKFNGAYLNGTKLSGAKLGDAYYD 142
>gi|451338330|ref|ZP_21908865.1| hypothetical protein C791_5803 [Amycolatopsis azurea DSM 43854]
gi|449419237|gb|EMD24783.1| hypothetical protein C791_5803 [Amycolatopsis azurea DSM 43854]
Length = 424
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 67/138 (48%), Gaps = 17/138 (12%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEANLT 183
R N AD+ SD SG A L +A A+ +GADL+ D +D +VL LT
Sbjct: 269 RINLRGADLAGSDLSGINLTSAILNEANLVGADLSGADLTNADLADAKLDGIVLRRTTLT 328
Query: 184 NAVLVRTVLTRS-----DLGGAIIEGADFSDAVIDLA---QKQALCKYANGTNP-ITGVS 234
VL RT L+ +L GA +EG + S A DLA + A+ + AN T +TG
Sbjct: 329 GVVLDRTDLSEQALPGLNLVGAHLEGTNLSRA--DLAGVILRDAVLRGANLTEADLTGAD 386
Query: 235 TRK-SLGCGNSRRNAYGS 251
R +L ++ R +GS
Sbjct: 387 LRNVTLRTVDTTRTIFGS 404
>gi|428308662|ref|YP_007119639.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250274|gb|AFZ16233.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 360
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 11/125 (8%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
L +N A +G + I A G ADLR A AD+ E+D S +K N A
Sbjct: 165 LGRVNLSHANLKGAYLI--RAYLGGADLRCA---------EIDGADLTEADLSEAKLNCA 213
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L AN + ADLSD + R L+ A+L A L L R++L GA + GAD S A
Sbjct: 214 KLRGTNLKAANLSLADLSDVNLIRANLSSADLMRANLRDADLIRTNLSGADLRGADLSLA 273
Query: 211 VIDLA 215
+ LA
Sbjct: 274 DLSLA 278
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A DL +A ++ +FR A+ AD+R +D G+ + A L A
Sbjct: 43 ADLSGTDLSEADLIEVDFRGCNLRGTHLKGAHLQGADLRGADLRGAHLDNANLRGANLRG 102
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDAVI 212
AN GADL T ++ L++ NL+ +L L+R+DL GA I +G SDA +
Sbjct: 103 ANLRGADLQSTELNSANLSDTNLSETILCSANLSRADLRGADIRDSNLQGVSLSDAKL 160
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 65/144 (45%), Gaps = 28/144 (19%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK----------AVAYK 159
A ADLR A N R AN A++R +D ++ N A L A +
Sbjct: 78 ADLRGADLRGAHLDNANLRGANLRGANLRGADLQSTELNSANLSDTNLSETILCSANLSR 137
Query: 160 ANFTGADLSDTLMD---------------RMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
A+ GAD+ D+ + R+ L+ ANL A L+R L +DL A I+G
Sbjct: 138 ADLRGADIRDSNLQGVSLSDAKLRGANLGRVNLSHANLKGAYLIRAYLGGADLRCAEIDG 197
Query: 205 ADFSDAVIDLAQKQALCKYANGTN 228
AD ++A DL++ + C GTN
Sbjct: 198 ADLTEA--DLSEAKLNCAKLRGTN 219
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 50/125 (40%), Gaps = 29/125 (23%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G+ A F ADL + + AD+ E DF G G +L+ A A+ GA
Sbjct: 33 GLNLAEDFAEADLSGT---------DLSEADLIEVDFRGCNLRGTHLKGAHLQGADLRGA 83
Query: 166 DLSDTLMDR--------------------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
DL +D LN ANL++ L T+L ++L A + GA
Sbjct: 84 DLRGAHLDNANLRGANLRGANLRGADLQSTELNSANLSDTNLSETILCSANLSRADLRGA 143
Query: 206 DFSDA 210
D D+
Sbjct: 144 DIRDS 148
>gi|407782050|ref|ZP_11129265.1| hypothetical protein P24_07514 [Oceanibaculum indicum P24]
gi|407206523|gb|EKE76474.1| hypothetical protein P24_07514 [Oceanibaculum indicum P24]
Length = 422
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 11/105 (10%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADL A V+ N AN + ++R + G+ +GA L A AN TG
Sbjct: 136 ANMSGADLSNATMVEANLESALLCGANLSGVNLRGAQLEGADLSGANLTGANLADANLTG 195
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+L+ ++ R N+ A + R +LT DLGGA + GA+ ++
Sbjct: 196 VNLTGAVISR-----TNMARAEMNRAILTNVDLGGADLTGANMAE 235
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 8/110 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADLR A H++ ANFT A+++E+D G GA + A A ADLS
Sbjct: 73 SNAVLHRADLRGA-HLRN---ANFTGANLKEADLRG----GALISGNPANPATMLRADLS 124
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
MD +L AN++ A L + ++L A++ GA+ S + AQ +
Sbjct: 125 FAEMDAAMLQSANMSGADLSNATMVEANLESALLCGANLSGVNLRGAQLE 174
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 53/115 (46%), Gaps = 17/115 (14%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKF-NGAYLEKAVAYKANFTGAD 166
S A ADL AV N AN T+A +R + + + N + AN GAD
Sbjct: 308 SEANLEGADLEGAVMDGVNLSNANMTAARLRGATLASVEIKNSDGKPTGRLWPANLAGAD 367
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQ 216
LS A+LTNA+L L ++DL GA + GA+ DAVID AQ
Sbjct: 368 LS----------RADLTNAILSGANLAKTDLTGAKLHNTNLIGANLRDAVIDPAQ 412
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 67/148 (45%), Gaps = 6/148 (4%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
N EA+ RG I + LR + E A SA+M +D S + A LE
Sbjct: 96 NLKEADLRGGALISGNPANPATMLRADLSFAEMDAAMLQSANMSGADLSNATMVEANLES 155
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A+ AN +G +L ++ L+ ANLT A L LT +L GA+I + + A ++
Sbjct: 156 ALLCGANLSGVNLRGAQLEGADLSGANLTGANLADANLTGVNLTGAVISRTNMARAEMN- 214
Query: 215 AQKQALCKYANGTNPITGVS---TRKSL 239
+ L G +TG + TR++L
Sbjct: 215 --RAILTNVDLGGADLTGANMAETRRAL 240
>gi|409993775|ref|ZP_11276905.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
Paraca]
gi|291572160|dbj|BAI94432.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935380|gb|EKN76914.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
Paraca]
Length = 741
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+LR N R A+ AD+R +D G+ GA L +A Y+AN T
Sbjct: 576 ANLAHANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITE 635
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + + R+ N ++L +A L+R L++S L A + GA+ S +
Sbjct: 636 GNFNGAKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQS 681
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN------FRANFTSADMRESDFSG 144
L +N A RG G A ADLR A N +RANF A++ E +F+G
Sbjct: 583 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITEGNFNG 640
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
+K ++ A DLS + + L ANL+ + L LTR+DL G
Sbjct: 641 AKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGADLTRADLSNVKFTG 700
Query: 205 ADFSDAVI 212
AD S +I
Sbjct: 701 ADLSCTLI 708
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 95 NKYEAE-TRGEFGIGSAAQ--FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA 150
N Y+A T G F + F +DLR A ++ + ++ SA +R ++ S S GA
Sbjct: 627 NFYQANITEGNFNGAKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGA 686
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
L +A FTGADLS TL+ L+ A+L NA L + L S+ G I
Sbjct: 687 DLTRADLSNVKFTGADLSCTLIRHANLSGADLRNAKLEKANLFGSNTVGCI 737
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)
Query: 111 AQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
+QF DLR K V++K + + T ADMRE + G L KAN + A
Sbjct: 431 SQFQGLDLRQTNLKGVNLK---KMDLTGADMREKNLEGMSLIQLDLRLVNLAKANLSHAI 487
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
L+ + + L ANL A LV+ L R+DL + A + A + A ++ C
Sbjct: 488 LNGSKLAVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSAC 542
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 11/113 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 163
+ A A+L++A VK + R A+ ++ + + +K A L A +AN
Sbjct: 494 AVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSACLIEANLMAASL 553
Query: 164 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 554 EGCDLKGADLSNANLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 606
>gi|428311553|ref|YP_007122530.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253165|gb|AFZ19124.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 234
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 11/103 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVA 157
S A ADLR+A N AN + AD+R+++ G+K + A L
Sbjct: 33 SGANLSEADLREANLSGANLSGADLIGSSLTDANLSDADLRDANLIGAKLSVAILSNVNL 92
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
AN +GA+LS ++ +L ANL A+L+R L ++L GA
Sbjct: 93 VGANLSGAELSGANLNEAMLGAANLIGAILIRAKLHAANLNGA 135
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 9/100 (9%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A G+A+L A+ + RA +A++ ++ S S GA L A AN +GA+L +
Sbjct: 110 AMLGAANLIGAILI----RAKLHAANLNGANLSISNLIGANLSGANLIGANLSGANLIEA 165
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LN ANL A L R L + L GA + ADFSDA
Sbjct: 166 -----NLNGANLNGARLYRANLAHAKLNGANLSNADFSDA 200
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 14/96 (14%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+RE++ SG+ +GA L + AN + ADL D ANL A L
Sbjct: 35 ANLSEADLREANLSGANLSGADLIGSSLTDANLSDADLRD----------ANLIGAKLSV 84
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+L+ +L GA + GA+ S A ++ +A+ AN
Sbjct: 85 AILSNVNLVGANLSGAELSGANLN----EAMLGAAN 116
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 5/117 (4%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
E +G+A G+ +R +H AN + +++ ++ SG+ GA L A +AN
Sbjct: 109 EAMLGAANLIGAILIRAKLHAANLNGANLSISNLIGANLSGANLIGANLSGANLIEANLN 168
Query: 164 GADLSDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
GA+L+ + R LN ANL+NA L ++DL A +E A+ ++++A
Sbjct: 169 GANLNGARLYRANLAHAKLNGANLSNADFSDANLAKTDLTDANLENANLEGTILNVA 225
>gi|428314300|ref|YP_007125277.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255912|gb|AFZ21871.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 355
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 54/112 (48%), Gaps = 8/112 (7%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A +ADLR A K RA+ T A + E+D SG+ +GA L A A G
Sbjct: 61 ANLSNADLRVANFTKAQLIETTLSRADLTQAILSEADLSGAILSGALLSGADLKGATLIG 120
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L L+ L + NLT A L R +L ++DL AI+ A +A DL++
Sbjct: 121 VSLIGALIKGAKLTKVNLTGATLSRAILVQADLKKAILNRAILGEA--DLSE 170
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 41/82 (50%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF+ A + D SGS N L A ANFT L + L AN T A L+ T
Sbjct: 22 NFSGAKLSGVDLSGSNLNRINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIET 81
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L+R+DL AI+ AD S A++
Sbjct: 82 TLSRADLTQAILSEADLSGAIL 103
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 21/126 (16%)
Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
S A ADLR+A + RAN T +++++ G++ + A L KA
Sbjct: 214 SGANLSGADLREANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKANLHKANL 273
Query: 158 YKANFTGADLSDTLMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADF 207
KAN +GA+L + + L++AN LTNA L T L ++L GA +EGA+
Sbjct: 274 SKANLSGANLLEANLLDANLSQANLLRSGLLLTYLTNANLSSTNLNEANLIGANLEGANL 333
Query: 208 SDAVID 213
S+A ++
Sbjct: 334 SEASLE 339
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 60/113 (53%), Gaps = 7/113 (6%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R N +SA + ++F+ +K A L A ANFT A L +T + R A+LT A+L
Sbjct: 40 RINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIETTLSR-----ADLTQAILS 94
Query: 189 RTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNP-ITGVSTRKSL 239
L+ + L GA++ GAD A +I ++ AL K A T +TG + +++
Sbjct: 95 EADLSGAILSGALLSGADLKGATLIGVSLIGALIKGAKLTKVNLTGATLSRAI 147
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 6/103 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A L +A + N R AN AD+ E+D G+ +GA L AN +GADL
Sbjct: 169 SEANLSGASLVRAYLNRVNLRQANLEEADLSEADLKGANLSGANLS-----GANLSGADL 223
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + L+ A+L A L R LT L A + GA+ S A
Sbjct: 224 REANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKA 266
>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 215
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 54/107 (50%), Gaps = 1/107 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L A K N AN + AD+ ESD S + GA L A A+ +GADL
Sbjct: 15 ANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQNADLSGADLRS 74
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+ R L+EANL +A L L +DL GA + GA+ A + +A
Sbjct: 75 ADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIAN 121
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 46/81 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + AD+R ++ S + +GA L+KA AN + ADLS++ + L A L NA L
Sbjct: 5 ADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQN 64
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ +DL A + AD S+A
Sbjct: 65 ADLSGADLRSADLFRADLSEA 85
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
++AD+ +D G+ + A L+ A KAN GA+LS+ + L+ A+L A L
Sbjct: 2 LSNADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNAT 61
Query: 192 LTRSDLGGAIIEGADF 207
L +DL GA + AD
Sbjct: 62 LQNADLSGADLRSADL 77
>gi|427707050|ref|YP_007049427.1| RDD domain-containing protein [Nostoc sp. PCC 7107]
gi|427359555|gb|AFY42277.1| RDD domain containing protein [Nostoc sp. PCC 7107]
Length = 711
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 46/78 (58%), Gaps = 5/78 (6%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
A++ ++D+ G+ +GAYL+ A N + A+LS + M VL A L NA L L+
Sbjct: 585 ANLTKTDWQGADLSGAYLDHA-----NLSNANLSTSRMTGAVLRSAQLENADLRNADLSF 639
Query: 195 SDLGGAIIEGADFSDAVI 212
+DL GA + GADF D ++
Sbjct: 640 ADLRGANVAGADFKDTIL 657
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 47/87 (54%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN SA + ++ S ++ GA L+ AV A+ TGADL D ++ L A L + +
Sbjct: 519 RANLESARLIGANLSSAQLVGADLQGAVLENASLTGADLGDAKLNEANLYAARLGRVIAI 578
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
T L+ ++L +GAD S A +D A
Sbjct: 579 GTQLSFANLTKTDWQGADLSGAYLDHA 605
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 51/113 (45%), Gaps = 29/113 (25%)
Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 172
SAD+ ++ F GS+F A L + +AN T A+LS L+
Sbjct: 453 LKSADLNQASFKGSRFRSVGEDGRWDTYDDAIADLTQVQMKQANLTDANLSRVLLTGSDL 512
Query: 173 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLA 215
R LN ANL +A L+ L +DL GA++E GAD DA ++ A
Sbjct: 513 SRASLNRANLESARLIGANLSSAQLVGADLQGAVLENASLTGADLGDAKLNEA 565
>gi|381206177|ref|ZP_09913248.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 210
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 71/145 (48%), Gaps = 7/145 (4%)
Query: 98 EAETRGEFGIGS---AAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLE 153
EA+ G +G+ + A L++A N AN + A++ E++ G+ G L
Sbjct: 42 EADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANLSEANLSEANLFGANLTGTNLT 101
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+A +A+ + ADLS+ + L+EAN + A L RT L ++L A + GAD A D
Sbjct: 102 EANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSA--D 159
Query: 214 LAQKQALCKYANGTNPITGVSTRKS 238
L + + Y N N + G RK+
Sbjct: 160 LREAVLVAAYLNEAN-LDGADMRKA 183
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
+ DL K + E + + + AD+ S G+ L A +AN T A+LS+ +
Sbjct: 21 YDRKDLDKLLSTSECVKCDLSEADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANL 80
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L+EANL A L T LT ++L A + AD S+A
Sbjct: 81 SEANLSEANLFGANLTGTNLTEANLSEADLSWADLSEA 118
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 1/95 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A ADL A + N AN + A+ +++ S + L+KA A+ ADL
Sbjct: 101 TEANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSADL 160
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
+ ++ LNEANL A + + L R+ +GGAI+
Sbjct: 161 REAVLVAAYLNEANLDGADMRKANLYRASMGGAIL 195
>gi|300867251|ref|ZP_07111911.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334728|emb|CBN57077.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 520
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 10/117 (8%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
ADLN+ A+ RG +A+LR+A + N A+ A++R +D +G+ GA
Sbjct: 165 ADLNR--ADLRG-------VNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLTGA 215
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
L++A AN GA+LS + +L A+LT A L+R +DL GA + GA
Sbjct: 216 DLDEARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKL 272
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 47/80 (58%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + A++ + +F +K N A L A +AN +GA L+ + R LN A+L+ A L+R
Sbjct: 46 NMSGANLSDVNFRKAKLNVARLSGANLSRANLSGAILNVANLIRADLNSADLSEATLIRA 105
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L R+D+ A + GA+ S+A
Sbjct: 106 ELIRADMSNASLSGANLSEA 125
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 4/110 (3%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A LR + V N RAN AD+ +D G + A L +A +AN +GADL
Sbjct: 140 ADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRG 199
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 216
+ LN A+LT A L L+ ++L GA + A+ +A++ DL Q
Sbjct: 200 ANLRWADLNGADLTGADLDEARLSGANLYGANLSSANLLNAILVHADLTQ 249
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 11/133 (8%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFT 163
+A A+L +A + + R N ++A++R+++ S G+ GA L A A+ T
Sbjct: 154 SANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLT 213
Query: 164 GADLSDTLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
GADL + + L+ ANL NA+LV LT+++L A GAD + A + A+
Sbjct: 214 GADLDEARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKLY 273
Query: 219 ALCKYANGTNPIT 231
+ ++ + IT
Sbjct: 274 GVSRFGLKADDIT 286
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 52/118 (44%), Gaps = 16/118 (13%)
Query: 109 SAAQFGSADLRKAV-HVKENFRANFTSADMRE----------SDFSGSKFNGAYLEKA-- 155
S A A+L A+ +V RA+ SAD+ E +D S + +GA L +A
Sbjct: 68 SGANLSRANLSGAILNVANLIRADLNSADLSEATLIRAELIRADMSNASLSGANLSEADL 127
Query: 156 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+AN ADLS + L ANL A L R L R+DL G + A+ A
Sbjct: 128 REGTLRQANLEQADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQA 185
>gi|443475539|ref|ZP_21065485.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019605|gb|ELS33670.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 222
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 64/118 (54%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA----------VAYK 159
A A+L + + + +F RA+ T A++ ++D S + A L KA +A
Sbjct: 88 ANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDSSIATD 147
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVI 212
ANF+ A +++T + +VL+ ANL+NA + + LT SDL GA GAD S++V+
Sbjct: 148 ANFSNAIVTETTLKSIVLSRANLSNADFSNSKMRNSRLTNSDLRGAKFGGADLSNSVM 205
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 64/141 (45%), Gaps = 16/141 (11%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F + D +K + + + AD+ D GS NGA L A N +GA L+D
Sbjct: 28 AHAFVATDYQKLLITNACNNCDLSGADLSYKDLYGSALNGANLSGA-----NLSGALLND 82
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-----------KQ 218
+ + L+ ANL+ +L+ +R+DL A + AD ++++ A
Sbjct: 83 SKLRGANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDS 142
Query: 219 ALCKYANGTNPITGVSTRKSL 239
++ AN +N I +T KS+
Sbjct: 143 SIATDANFSNAIVTETTLKSI 163
>gi|427720942|ref|YP_007068936.1| RDD domain-containing protein [Calothrix sp. PCC 7507]
gi|427353378|gb|AFY36102.1| RDD domain containing protein [Calothrix sp. PCC 7507]
Length = 716
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 5/81 (6%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+ A++ ++D+ G+ +G YL+ A N + A+LS T + V+ ANL NA L
Sbjct: 587 LSYANLTKTDWQGADLSGVYLDHA-----NLSNANLSATRLTGAVMRSANLENANLQNAD 641
Query: 192 LTRSDLGGAIIEGADFSDAVI 212
L+ +DL GA + GADF A++
Sbjct: 642 LSHADLQGANLAGADFRGAIL 662
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 29/113 (25%)
Query: 132 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMD---- 173
F SA++ + F GS+F A L + +AN T A+LS +M+
Sbjct: 458 FKSANLSQGSFKGSRFRSPGEDGRWDTYDDVIADLSQVEMKQANLTDANLSRVVMNRSDL 517
Query: 174 -RMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLA 215
R LN ANL+N L+ T L +DL GA++E GAD SDA ++ A
Sbjct: 518 SRATLNRANLSNTRLIAANLSSTQLVGADLTGAVLENASLTGADLSDAKLNEA 570
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 62/132 (46%), Gaps = 15/132 (11%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF------RANFTSADMRESDFS 143
+ADL++ E + A A+L + V + + RAN ++ + ++ S
Sbjct: 488 VIADLSQVEMKQ---------ANLTDANLSRVVMNRSDLSRATLNRANLSNTRLIAANLS 538
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
++ GA L AV A+ TGADLSD ++ L A+L + T L+ ++L +
Sbjct: 539 STQLVGADLTGAVLENASLTGADLSDAKLNEADLFAAHLGRVTAIGTQLSYANLTKTDWQ 598
Query: 204 GADFSDAVIDLA 215
GAD S +D A
Sbjct: 599 GADLSGVYLDHA 610
>gi|383312720|ref|YP_005365521.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
str. GAT-30V]
gi|378931380|gb|AFC69889.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
str. GAT-30V]
Length = 958
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 12/121 (9%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+ SADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKSADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVI-DLAQKQALCKYA 224
+ + EAN NA++ R LT++D A++E AD ++A+ ++ KQA K A
Sbjct: 610 IAQNINAKEANFKNAIMQRADLTKADFTKAVLENADMQAVAAAEAIFKEVNLKQANLKAA 669
Query: 225 N 225
N
Sbjct: 670 N 670
>gi|440233072|ref|YP_007346865.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
gi|440054777|gb|AGB84680.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
Length = 846
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 13/154 (8%)
Query: 71 FVSTALAAAVVASCS----SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
F+ T L AA + S S + + A+ +++ T S + AD A +
Sbjct: 675 FMKTTLEAASFSGASLESCSWVESHAEQARFDGATLVTCAAASESVLNGADFSNATLKQC 734
Query: 127 NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
N R +R + F+ +K + L +A A+FT A+L +L R +AN ++A
Sbjct: 735 NLR----QTPLRGARFTLAKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFSDAN 790
Query: 187 LVRTVLTRSDLGGAIIEG-----ADFSDAVIDLA 215
L+ +L +S LGGA G AD S A+ D A
Sbjct: 791 LMGAILQKSLLGGARFNGANLFRADLSQAITDDA 824
Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats.
Identities = 38/170 (22%), Positives = 73/170 (42%), Gaps = 8/170 (4%)
Query: 30 LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
L K ++ + + + S CS + A+ + A + +V+ + +
Sbjct: 670 LHKTTFMKTTLEAASFSGASLESCSWVESHAEQARFDGATLVTCAAASESVLNGADFSNA 729
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RAN-----FTSADMRESDFS 143
L N + RG + A+ ++DL +A +F RAN F +D R+++FS
Sbjct: 730 TLKQCNLRQTPLRG--ARFTLAKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFS 787
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
+ GA L+K++ A F GA+L + + + ++A N + V T
Sbjct: 788 DANLMGAILQKSLLGGARFNGANLFRADLSQAITDDATSLNGAWTKRVKT 837
>gi|298249936|ref|ZP_06973740.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297547940|gb|EFH81807.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 170
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 130 ANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN + AD+RES F G+ F+ A L A KAN A LSDT + +L A++++
Sbjct: 54 ANLSEADLRESLFIEADCGGANFHRARLNSANFQKANLRAAILSDTDLRNALLANADVSD 113
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A L T+L ++L AI GA F DA+++
Sbjct: 114 ADLRGTILAGANLEQAIFCGAVFKDAILN 142
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 15/97 (15%)
Query: 124 VKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM----------D 173
+ R N A++R + +G YL ++AN + ADL ++L
Sbjct: 23 LHHEIRPNLAGANLRGWSLAHINLSGVYL-----HEANLSEADLRESLFIEADCGGANFH 77
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
R LN AN A L +L+ +DL A++ AD SDA
Sbjct: 78 RARLNSANFQKANLRAAILSDTDLRNALLANADVSDA 114
>gi|427415392|ref|ZP_18905576.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425756225|gb|EKU97081.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 389
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 3/111 (2%)
Query: 117 DLRKAVHVKE---NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
D++ A+ + E ++ NF+S D+R +FSG K GA A F +L
Sbjct: 189 DVQAALSIFERQLDYAPNFSSLDLRGLNFSGLKLEGAMFNHTRLNMAEFKKTNLKRASFQ 248
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
+LN+A+ +AVL + + L GA++ GA ++ + AQ Q Y+
Sbjct: 249 GAILNDAHFEDAVLTNALFMNAKLKGAVLNGAKLNEVWLTGAQLQGAHLYS 299
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 1/90 (1%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H + N A F +++ + F G+ N A+ E AV A F A L +++ LNE L
Sbjct: 229 HTRLNM-AEFKKTNLKRASFQGAILNDAHFEDAVLTNALFMNAKLKGAVLNGAKLNEVWL 287
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
T A L L ++L A + A+ AV+
Sbjct: 288 TGAQLQGAHLYSTNLHLAKLNSANLETAVL 317
>gi|428773363|ref|YP_007165151.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428687642|gb|AFZ47502.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 319
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 46/83 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANFT AD+ E++ SG A L +A +N G ++ R L EA+L N++L
Sbjct: 135 ANFTRADLTEANLSGLNLMEADLTRANLSASNLQGCSFNEANFSRADLREADLKNSILEG 194
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L R++L A + GA+FS AV+
Sbjct: 195 VFLHRANLSRANLRGANFSGAVL 217
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 75/169 (44%), Gaps = 35/169 (20%)
Query: 45 ESDGQFPDCSNNQCAGP---YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAET 101
SD + + S+ +G +A L N R+ +S+ L A++ C DL
Sbjct: 39 HSDLSWSNLSSTDLSGANFCHADLVNTRI-ISSRLIGALMQHC--------DL------- 82
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
+G S S DL A + AN ++ + ++ +G+ GA L A AN
Sbjct: 83 --SYGDLSWTNLNSVDLSYA----DLSYANLSNTFLSNANLTGANLTGATLTGATLTGAN 136
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
FT ADL+ EANL+ L+ LTR++L + ++G F++A
Sbjct: 137 FTRADLT----------EANLSGLNLMEADLTRANLSASNLQGCSFNEA 175
>gi|254413837|ref|ZP_05027606.1| protein kinase domain [Coleofasciculus chthonoplastes PCC 7420]
gi|196179434|gb|EDX74429.1| protein kinase domain [Coleofasciculus chthonoplastes PCC 7420]
Length = 546
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 44/87 (50%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R +F S D+ D S +G ++ + N GADLS + L ANL +A L
Sbjct: 418 RRDFASHDLSGLDLQKSDLSGGIFYQSKLTRINLQGADLSSADFGQASLTRANLRDANLG 477
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
R L+ SDL GA + GAD S A ++ A
Sbjct: 478 RAYLSNSDLEGADLRGADLSFAYLNHA 504
>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
Length = 251
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 95/227 (41%), Gaps = 38/227 (16%)
Query: 33 PLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV--------------FVSTALAA 78
P W CQ DG P + C+ L N + F S+ +A
Sbjct: 22 PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74
Query: 79 AVVASCSSNISAL--ADLNKYEAET----RGEFGIG--SAAQFGSADLRKA--VHVKENF 128
A ++ + + ADL+K R FG + A FGSAD+ ++ VK
Sbjct: 75 ANLSETEVSRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNFAQVKAA- 133
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKA----VAYK-ANFTGADLSDTLMDRMVLNEANLT 183
ANF+ +++ SDFSG+ +GA + KA V ++ A G D S + + R L+ NL
Sbjct: 134 GANFSKSELNRSDFSGADLSGANISKAELARVLFQSAKIAGVDFSYSNLSRSRLDGLNLQ 193
Query: 184 NAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKYANGTNP 229
+ L + +GGA + GA + ID+A A K NP
Sbjct: 194 GVNFTGSYLYLTQIGGADLSGATGLTQEQIDIACGSAQTKLPPSINP 240
>gi|119485597|ref|ZP_01619872.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
gi|119456922|gb|EAW38049.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
Length = 253
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 59/120 (49%), Gaps = 16/120 (13%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGA 150
LAD N YEA R A ADLR+A + RA+ T AD+R++D +
Sbjct: 93 LADANLYEANLR-------YANLQGADLRQA----DLSRASLTRADLRKADLQDANLFKV 141
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A +ANF ADL + L +AN T+A L SDL A ++GADFS+A
Sbjct: 142 NFSEAYLSEANFENADLRQVTFFKANLADANFTDANLF-----GSDLRLANLKGADFSNA 196
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F A A +R++D + GA L A ++AN A+L + + L A+L A L
Sbjct: 59 FNAKLQGAILRDADLRSANLYGANLYVADLFRANLADANLYEANLRYANLQGADLRQADL 118
Query: 188 VRTVLTRSDLGGAIIEGADF 207
R LTR+DL A ++ A+
Sbjct: 119 SRASLTRADLRKADLQDANL 138
>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
2638]
gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
Length = 1277
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 38/155 (24%), Positives = 68/155 (43%), Gaps = 17/155 (10%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGE-------FGIGSAAQFGSADLRKAV 122
+F AV+ + +++ L + EAE +G G A F ++++K++
Sbjct: 1045 IFKGAQFPKAVLRDTNFDMAILEKTDFSEAELKGARINMCMISGKADKADFSQSNIKKSI 1104
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV-LNEAN 181
F ++ + +DFS + N + A+K NFT A+L R +++
Sbjct: 1105 ---------FKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSD 1155
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+A L L SD G+ GADF + +ID +Q
Sbjct: 1156 FRHATLHGAALRESDFTGSDFRGADFENGLIDNSQ 1190
Score = 42.0 bits (97), Expect = 0.27, Method: Composition-based stats.
Identities = 36/141 (25%), Positives = 57/141 (40%), Gaps = 28/141 (19%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVK 125
+F +++L A + S N S D++ ++ + G + F +D R
Sbjct: 1104 IFKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSDFR------ 1157
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
A A +RESDF+GS +F GAD + L+D L ANL
Sbjct: 1158 ---HATLHGAALRESDFTGS---------------DFRGADFENGLIDNSQLVRANLNGV 1199
Query: 186 VLVRTVLTRSDLGGAIIEGAD 206
T+S+L GA + A+
Sbjct: 1200 SAKGARFTKSNLEGASMRAAN 1220
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 35/129 (27%), Positives = 52/129 (40%), Gaps = 25/129 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFR----------------ANFTSADMRESDFSGSKFNGAYL 152
S ADL K K NF+ A+F+ A +R +D S FN A
Sbjct: 972 SGLDLSGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALF 1031
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII---------E 203
++ +AN A + VL + N A+L +T + ++L GA I +
Sbjct: 1032 VESDLSEANGAQAIFKGAQFPKAVLRDTNFDMAILEKTDFSEAELKGARINMCMISGKAD 1091
Query: 204 GADFSDAVI 212
ADFS + I
Sbjct: 1092 KADFSQSNI 1100
Score = 38.1 bits (87), Expect = 3.8, Method: Composition-based stats.
Identities = 36/123 (29%), Positives = 54/123 (43%), Gaps = 6/123 (4%)
Query: 94 LNKYEAETRGEFGIGSAAQFG-SADLRKAVHVKENFR-----ANFTSADMRESDFSGSKF 147
L K EA+ + A+ G SAD +A+ +E R + A + D SG
Sbjct: 917 LKKLEAKELPDAAKAKLAEHGLSADSLRALTREEVQRYHEQGKSLVGAVLSGVDLSGLDL 976
Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+GA L K K NF GA L + + + A+ + A L R L+R A+ +D
Sbjct: 977 SGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALFVESDL 1036
Query: 208 SDA 210
S+A
Sbjct: 1037 SEA 1039
>gi|428223745|ref|YP_007107842.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427983646|gb|AFY64790.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 183
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 11/114 (9%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMR----------ESDFSGSKFNGAYLEKAVAYKAN 161
F DLR+A N A + ++D+R +++ G+K GA + A Y+AN
Sbjct: 20 FDEIDLREANLFNANLEAVSLQNSDLRSTYLPYTNLNKANLQGAKLQGAEMSDAQLYQAN 79
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
GADL + + R L A+L A L L +DL GA ++GA+ DA + A
Sbjct: 80 LAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYGANLQGANLQDADLQRA 133
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 13/89 (14%)
Query: 128 FRANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
++AN AD+R S+ S + GA L+ A Y AN GA+L D + R L
Sbjct: 76 YQANLAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYGANLQGANLQDADLQRADL 135
Query: 178 NEANLTNAVLVRTVLTRS---DLGGAIIE 203
++A L +L L R+ D GA ++
Sbjct: 136 DQATLKATILANANLFRAQNIDWTGAAVD 164
>gi|227496450|ref|ZP_03926734.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
gi|226834032|gb|EEH66415.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
Length = 222
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 1/104 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR+++ + R AN +DMR +D G+ G +L A+ GADL D
Sbjct: 98 ADMAGADLRRSILPRAELRNANLVDSDMRGADLRGADLRGTWLPYTDMRGADLAGADLRD 157
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
++ L+ A+L ++ L LT ++L A + GAD A ID
Sbjct: 158 ADLEGADLHGASLQSSDLRGADLTDAELTDADLRGADLRGADID 201
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 34/70 (48%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N D+ ++D G+ +GA L + A+ T ADL + R VL A LT A L +
Sbjct: 34 NLRELDLTDADLRGANLDGADLSWSTLSTADLTDADLRGATLRRTVLTRAVLTRAALTQV 93
Query: 191 VLTRSDLGGA 200
+D+ GA
Sbjct: 94 YARDADMAGA 103
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 32/60 (53%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D+R++D S L A AN GADLS + + L +A+L A L RTVLTR+
Sbjct: 24 DLRDTDLSNLNLRELDLTDADLRGANLDGADLSWSTLSTADLTDADLRGATLRRTVLTRA 83
>gi|427707611|ref|YP_007049988.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427360116|gb|AFY42838.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 521
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADLR+A K N R AN + A ++ S +G+ A L A ++ + +GA+L D
Sbjct: 120 ANLSNADLREATLRKANLRRANLSEASLKGSSLAGTNLEMANLNAADLHRTDLSGANLRD 179
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + L ANL+ A L L +DL GA + AD S A
Sbjct: 180 AELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGA 220
Score = 43.9 bits (102), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L++ N A+ + A++R +D SG+ + A L A AN GA+L
Sbjct: 173 SGANLRDAELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGAKLSGANLMGANL 232
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S+ + ANLT A L++ +DL GA + GA A
Sbjct: 233 SNANLTNTSFVHANLTEATLIKAEWIGADLTGATLTGAKLHSA 275
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A + + SG F GA + A AN ADLS R L A+L A L+R
Sbjct: 55 ANLSHAKLNVARLSGVNFVGAIMNYASLNVANLIRADLS-----RAQLRGASLVRAELIR 109
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+R+DL A + AD +A +
Sbjct: 110 AELSRADLFEANLSNADLREATL 132
>gi|428204342|ref|YP_007082931.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981774|gb|AFY79374.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 203
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 6/97 (6%)
Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
DLR+A EN AN AD+R+++ S + GA L A +AN GA+L+
Sbjct: 30 DLREANLAGENLSGASLPWANCIKADLRKTNLSQANLGGADLRWANLEEANLEGANLNRA 89
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + L+ ANLT LV+ L +++L A ++GAD
Sbjct: 90 DLSQANLSRANLTQVKLVKADLRKTNLSEANLQGADL 126
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A G ADLR A + N RA+ + A++ ++ + K A L K +AN
Sbjct: 62 SQANLGGADLRWANLEEANLEGANLNRADLSQANLSRANLTQVKLVKADLRKTNLSEANL 121
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
GADL + L NL+ A L +++L A +E AD + A
Sbjct: 122 QGADLRWANLGEANLERTNLSQANLQWVNFAKANLSEANLEDADLNQA 169
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 6/96 (6%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADLRK N +AN AD+R ++ + GA L +A +AN + A+L+ + +
Sbjct: 54 ADLRKT-----NLSQANLGGADLRWANLEEANLEGANLNRADLSQANLSRANLTQVKLVK 108
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L + NL+ A L L ++LG A +E + S A
Sbjct: 109 ADLRKTNLSEANLQGADLRWANLGEANLERTNLSQA 144
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 6/97 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A A+L + VK + R AN AD+R ++ + L +A NF
Sbjct: 92 SQANLSRANLTQVKLVKADLRKTNLSEANLQGADLRWANLGEANLERTNLSQANLQWVNF 151
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
A+LS+ ++ LN+ANLT A L T ++L G
Sbjct: 152 AKANLSEANLEDADLNQANLTEAKLKGTNFEGANLQG 188
>gi|428770347|ref|YP_007162137.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684626|gb|AFZ54093.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 278
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 11/129 (8%)
Query: 112 QFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
Q DLR A N + + AD+R++D SG+ + YL +A AN TGA+L+
Sbjct: 25 QLRRIDLRNAQLKGVNLGGCDLSYADLRDADLSGADLSKCYLNEANLSGANLTGANLTGA 84
Query: 171 LM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
+ + ++ EA T + L R ++DL GA + GA + + A
Sbjct: 85 YLIKAYLTKVNFQKAIVKEAYFTGSFLTRANFYKADLSGAFLNGAHLNGGIFKDASYDNT 144
Query: 221 CKYANGTNP 229
++ G NP
Sbjct: 145 TRFDKGFNP 153
>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 479
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 11/105 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL ++ N RA+ T A +RE++ G +F GA L++A KAN GA+L
Sbjct: 60 SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVGANL 119
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+EANLT A L L S L GAI++ A +++ I
Sbjct: 120 ----------HEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 16/103 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 162
S A ADLR+ AN + AD+R ++D SG+K N A L KA + N
Sbjct: 352 SGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGAKLNEADLRKADLMRVNL 411
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
GADL+ EA+L++A L R L ++L G ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 49/97 (50%), Gaps = 11/97 (11%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADLR + + AN AD+RE+DF+G+ A L A + T ADLS
Sbjct: 338 SADLRGVDLTRADLSGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGA--- 394
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LNEA+L A L+R +L GA + AD SDA
Sbjct: 395 --KLNEADLRKADLMRV-----NLEGADLTEADLSDA 424
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 45/84 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ ES + + A L AV +AN G + + + + L +ANL A L
Sbjct: 62 ANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVGANLHE 121
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
LTR++L GA + G+ S A++D
Sbjct: 122 ANLTRANLSGADLRGSQLSGAILD 145
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 15/94 (15%)
Query: 131 NFTSADMRESDFSGSKFNG---------------AYLEKAVAYKANFTGADLSDTLMDRM 175
N T AD+ SD SG+ + A L+KA AN G DL +
Sbjct: 270 NLTGADLNGSDLSGANLSASNLTSVNLKNVDLSRASLKKAYLKGANLEGTDLRGADLSGA 329
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+L++ NL++A L LTR+DL GA + AD +
Sbjct: 330 ILHQVNLSSADLRGVDLTRADLSGANLRDADLRE 363
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 37/158 (23%)
Query: 112 QFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAV--------- 156
+F A+L++A +K N AN T A++ +D GS+ +GA L+KAV
Sbjct: 98 EFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFPE 157
Query: 157 -----AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
A A N DL++ + L NL A+L L R++L G
Sbjct: 158 DIDPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLAG 217
Query: 200 AIIEGADFSDA-----VIDLAQKQALCKYANGTNPITG 232
A + AD +A +++ A ++ G +P G
Sbjct: 218 ANLSAADLREASLSGTILEKAVYSNKTLFSEGIDPALG 255
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 14/95 (14%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
ADL A+ + N +SAD+R D + + +GA L A + +FTGA TL+
Sbjct: 324 ADLSGAIL----HQVNLSSADLRGVDLTRADLSGANLRDADLRETDFTGA----TLL--- 372
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ANL+ A L LT++DL GA + AD A
Sbjct: 373 ---FANLSGADLRGVDLTKADLSGAKLNEADLRKA 404
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+ A ++ ++ G+ GA L A+ ++ N + ADL + R L+ ANL +A L
Sbjct: 303 RASLKKAYLKGANLEGTDLRGADLSGAILHQVNLSSADLRGVDLTRADLSGANLRDADLR 362
Query: 189 RT-----VLTRSDLGGAIIEGADFSDA 210
T L ++L GA + G D + A
Sbjct: 363 ETDFTGATLLFANLSGADLRGVDLTKA 389
>gi|304393841|ref|ZP_07375766.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
gi|303294040|gb|EFL88415.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
Length = 247
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G+G + GS R + + +FT A+M SDFSGS + K+ +ANFTGA
Sbjct: 109 GVGLSKVEGS---RTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTGA 165
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
DLS ++ ++ ANL +A L T + S + A + G D S A L Q+Q
Sbjct: 166 DLSGAMITFANISRANLADAKLDGTDFSSSWMYLAKVAGVDMS-ATKGLTQEQV 218
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 51/118 (43%), Gaps = 21/118 (17%)
Query: 119 RKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLM 172
R + NF AN D+ SD KF+GA + K++ +AN + G LS
Sbjct: 58 RNVILSGYNFSLANLNQTDLFGSDLRDVKFDGADMTKSILTRANLSNSSLKGVGLSKVEG 117
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIE---------------GADFSDAVIDLA 215
R VL ++ T+ + + RSD G+I++ GAD S A+I A
Sbjct: 118 SRTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTGADLSGAMITFA 175
>gi|219849225|ref|YP_002463658.1| pentapeptide repeat-containing protein [Chloroflexus aggregans DSM
9485]
gi|219543484|gb|ACL25222.1| pentapeptide repeat protein [Chloroflexus aggregans DSM 9485]
Length = 311
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 56/114 (49%), Gaps = 14/114 (12%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLRKA N A A++R ++ S + F+GA L A N +GADL D
Sbjct: 89 ADLSDADLRKADLSWANLEFATLIGANLRGANLSAADFSGANLYGANLSLCNLSGADLRD 148
Query: 170 TLMDRMVLNE-------------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T+M L+E ANL+ A+L+R L ++L GA + GA+ A
Sbjct: 149 TVMIGANLSEAQLREAQLVNLSGANLSGAILLRVSLNGANLNGANLAGANLMHA 202
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++RE+ GA L + +A+ AD SD + + L+ A+L NA+ R
Sbjct: 197 ANLMHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNAIFTR 256
Query: 190 TVLTRSDLGGAIIEGADF 207
L+R++L GA + GA+
Sbjct: 257 ANLSRANLSGANLRGANL 274
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ ++ SG+ + A L +A AN ADLS + L+ ANL A L
Sbjct: 24 ANLSGANLSAANLSGANLSEAKLSRARLTDANLYRADLSICELGEANLSWANLREAKLNW 83
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
L R+DL A + AD S A ++ A
Sbjct: 84 AQLVRADLSDADLRKADLSWANLEFA 109
>gi|428314592|ref|YP_007151039.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256316|gb|AFZ22271.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 237
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 21/119 (17%)
Query: 113 FGSADLRKAVHVKENF----------------RANFTSADMRESDFSGSKFNGAYLEKAV 156
F +A+LR AV V++N N + D+ +D S + NGA L +A
Sbjct: 105 FANANLRCAVLVEQNLCQCNFSYVKLNFANLSGINLSGVDLTSADLSDACLNGANLSQAS 164
Query: 157 AYK-----ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
Y+ AN + A+L T + + LN+ANLT A L L+ +DL GAI++ A S A
Sbjct: 165 LYRTLLTRANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADLRGAILDEATLSGA 223
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 34/61 (55%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN + A++R ++ + N A L +A AN + ADL ++D L+ ANLT A L
Sbjct: 172 RANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADLRGAILDEATLSGANLTGAKLT 231
Query: 189 R 189
+
Sbjct: 232 Q 232
>gi|440681606|ref|YP_007156401.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678725|gb|AFZ57491.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 943
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 14/108 (12%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
+FG A ADL +A NF R N + A M ++FS + FN A L +A +AN
Sbjct: 808 DFG---GANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANL 864
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
TGADLS ++ L+ A+L+ A +L GA +E A+FS A
Sbjct: 865 TGADLSSADLNYADLSLADLSGA----------NLSGANLEDANFSGA 902
Score = 44.7 bits (104), Expect = 0.040, Method: Composition-based stats.
Identities = 30/85 (35%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-----DRMVLNEANLTNA 185
N AD+ E DF G+ + A L +A ANF+ + S M + N ANL A
Sbjct: 798 NLRGADLSEVDFGGANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEA 857
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
+R LT +DL A + AD S A
Sbjct: 858 NFIRANLTGADLSSADLNYADLSLA 882
Score = 43.5 bits (101), Expect = 0.11, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A F A+L +A ++ N A+ +SAD+ +D S + +GA L A ANF+GA L
Sbjct: 845 SEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSGANLSGANLEDANFSGAKL 904
Query: 168 SDTLMDRMVLNE 179
S+ L+ + +E
Sbjct: 905 SNGLLGDICWDE 916
>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 213
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 60/112 (53%), Gaps = 17/112 (15%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN A++ ++F+GSKF GA+LE AN GA+L +T + ANL A L+
Sbjct: 31 RANLAGANLVGTNFAGSKFEGAHLE-----GANLMGANLKETDL------RANLMGANLM 79
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT----GVSTR 236
+ LT +D+ G+ + GA+ AVI ++ + +GTN I GV R
Sbjct: 80 QADLTGADVRGSNLRGANLMGAVI--SEVSFAGAFLSGTNLINVDLQGVDLR 129
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 19/122 (15%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----------E 179
AN AD+ +D GS GA L AV + +F GA LS T + + L
Sbjct: 76 ANLMQADLTGADVRGSNLRGANLMGAVISEVSFAGAFLSGTNLINVDLQGVDLRGADLRG 135
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DLAQKQALCKYANGTNPIT 231
ANLT A L L+R+DL GA++ A+ +A + +LA LC G N +
Sbjct: 136 ANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGANLAGANLLCAELEGAN-VN 194
Query: 232 GV 233
GV
Sbjct: 195 GV 196
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+ D+R +D G+ GA L+ A +A+ GA LS+ ++ L +ANL+ A L
Sbjct: 119 INVDLQGVDLRGADLRGANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGANL 178
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVI 212
L ++L GA + G DF A +
Sbjct: 179 AGANLLCAELEGANVNGVDFDRACL 203
>gi|428302093|ref|YP_007140399.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238637|gb|AFZ04427.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 146
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 71/143 (49%), Gaps = 18/143 (12%)
Query: 74 TALAAAVVASCSSNISALADLN---KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
TAL A + S ISA AD+ ++ ETR + + +LR A N +
Sbjct: 7 TALTIASTITLSLPISAQADMKSDVQHLLETRECY---------ACNLRGA-----NLKG 52
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ AD+R ++ G+ GA LE A A+ A+LS ++ LN ANLTNA L
Sbjct: 53 AHLIGADLRNANLKGANLAGANLEGADLTGADLEEANLSYAFVNSTSLNYANLTNANLSN 112
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L ++L GA++ GAD + A I
Sbjct: 113 AHLYSAELDGAVMVGADLAGADI 135
>gi|428213326|ref|YP_007086470.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428001707|gb|AFY82550.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 340
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 8/106 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFR--ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
S A SA+L A V++ F A A +R ++ S S GA L +A+ +GAD
Sbjct: 192 SGAVLNSANLSGA-SVRQAFLQGAQMEGASLRNTNMSTSNLRGALL-----TQADLSGAD 245
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L D M +VLNEA L N L L + L G I+ GAD A++
Sbjct: 246 LLDADMQGVVLNEAILINTQLRNVQLQGASLEGTILSGADLEGAIL 291
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 45/83 (54%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F N +AD+ E++ SG+ + Y+E+A A G++L+ + + LN ANL+ AVL
Sbjct: 137 FLINLANADLTEANLSGTDLSRIYIEQANLNGAQLQGSNLTGAELFGVTLNNANLSGAVL 196
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
L+ + + A ++GA A
Sbjct: 197 NSANLSGASVRQAFLQGAQMEGA 219
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 66/149 (44%), Gaps = 29/149 (19%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK-----AVAY------ 158
A F +DLR A N AN ++R ++ +G GA L + AV +
Sbjct: 84 ASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTGANLSRSQLVGAVLFLINLAN 143
Query: 159 ----KANFTGADLSDTLMDRMVLNEA-----NLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+AN +G DLS +++ LN A NLT A L L ++L GA++ A+ S
Sbjct: 144 ADLTEANLSGTDLSRIYIEQANLNGAQLQGSNLTGAELFGVTLNNANLSGAVLNSANLSG 203
Query: 210 AVIDLAQKQALCKYANGTNPITGVSTRKS 238
A + +QA + A + G S R +
Sbjct: 204 ASV----RQAFLQGA----QMEGASLRNT 224
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 40/86 (46%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+N T+A R SD G+ GA L A N GA+L+ + L+ + L AVL
Sbjct: 79 SNLTNASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTGANLSRSQLVGAVLFL 138
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
L +DL A + G D S I+ A
Sbjct: 139 INLANADLTEANLSGTDLSRIYIEQA 164
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++ D+ +D SGS A + AN TGA+L+ + + L ANLT L
Sbjct: 64 ANLSNTDLTGADLSGSNLTNASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTG 123
Query: 190 TVLTRSDLGGAI-----IEGADFSDA 210
L+RS L GA+ + AD ++A
Sbjct: 124 ANLSRSQLVGAVLFLINLANADLTEA 149
>gi|429106957|ref|ZP_19168826.1| FIG01055523: hypothetical protein [Cronobacter malonaticus 681]
gi|426293680|emb|CCJ94939.1| FIG01055523: hypothetical protein [Cronobacter malonaticus 681]
Length = 846
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 38/119 (31%), Positives = 57/119 (47%), Gaps = 17/119 (14%)
Query: 129 RANFTSADMRESD----------FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
RA+FT A +R+S+ F +K L +A ANF A L +L R
Sbjct: 723 RADFTHATLRQSNLRQTALCCARFELAKLENTDLSEANCRGANFQRASLVGSLFIRTDFR 782
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
E + T+A L+ +L +S LGGA GA+ A DL+Q + NG ++G T++
Sbjct: 783 EVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----TFTNGETRMSGAFTKR 834
Score = 38.1 bits (87), Expect = 4.2, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 44/105 (41%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 165
S A ADL NFR A A++ + F GA L A ++F+GA
Sbjct: 551 SKALLECADLSHCQLDGANFRGAMLARAELHHTSLRDCNFEGASLALAQCCHSDFSGARF 610
Query: 166 ---DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
L +TL+D V ++A L + T TR A ++G F
Sbjct: 611 KDTQLQETLLDDCVFDDATLEGLLFRETWFTRCRFHRATLDGCVF 655
Score = 37.0 bits (84), Expect = 8.8, Method: Composition-based stats.
Identities = 28/112 (25%), Positives = 41/112 (36%), Gaps = 9/112 (8%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
R E + F DL +F+ D+R +DFS + A L AN
Sbjct: 519 RAERTLAQGGDFSGMDLTGV---------DFSGMDLRGADFSKALLECADLSHCQLDGAN 569
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
F GA L+ + L + N A L SD GA + + ++D
Sbjct: 570 FRGAMLARAELHHTSLRDCNFEGASLALAQCCHSDFSGARFKDTQLQETLLD 621
>gi|193212588|ref|YP_001998541.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
8327]
gi|193086065|gb|ACF11341.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 430
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + +D+ SDFS + +GA L++A + GADLS +R EA+ + A +
Sbjct: 81 ADLSQSDLGGSDFSDADLHGAMLDEAYLGGSRMAGADLSGASFERASAAEADFSRAKMPS 140
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
+VL RS+L GA GAD A
Sbjct: 141 SVLRRSELTGARFAGADLRGA 161
>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
Length = 831
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F A + E+ FSGS+ GA A KANF A +D + +L ANL A +R
Sbjct: 541 AKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAADAVFRGAILANANLRAATFLR 600
Query: 190 TVLTRSDLGGA-----IIEGADFSDAVI 212
T DL GA + GADF+ A +
Sbjct: 601 TNFQNVDLTGADFAFSDLRGADFTGATL 628
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ + + A++ +S F G F GA L A K +FT A+L+ L N TNA L
Sbjct: 233 KTDLSGAELEQSHFGGCDFTGADLSHAKLQKTDFTAANLAGATCVDADLRGTNFTNADLR 292
Query: 189 R-----TVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
+ L +DL GA + GADF+ A + A+ L
Sbjct: 293 KANFRGANLAGADLTGANVAGADFTGANLTGAKVDGL 329
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 38/62 (61%), Gaps = 1/62 (1%)
Query: 113 FGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L A V + R NFT+AD+R+++F G+ GA L A A+FTGA+L+
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTGANLTGAK 325
Query: 172 MD 173
+D
Sbjct: 326 VD 327
>gi|254486622|ref|ZP_05099827.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
gi|214043491|gb|EEB84129.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
Length = 200
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 53/112 (47%), Gaps = 22/112 (19%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A ADL AV T A++ S+ SG+ GAYLE A A TGADL+
Sbjct: 98 ADLSGADLTGAV---------LTQANLEMSNLSGATLTGAYLELANLAGARVTGADLT-- 146
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
+ANLT+A L VL + L GA++ GAD A + + LCK
Sbjct: 147 --------KANLTSANLRGAVLLEAKLVGAVLLGADLDGASL---EGAILCK 187
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 15/112 (13%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F ADLR+ + K + + + +D++ D +G+ GA L A + A+ + ADLS
Sbjct: 4 AAFDEADLRQLLDTKVCQKCDLSGSDLKGVDLAGANLAGANLSGAKLWAADLSKADLSGV 63
Query: 171 LMD----------RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ L +ANL+ A LT ++LGGA + GAD + AV+
Sbjct: 64 NLEAATLTAANLAGANLADANLSGA-----YLTTTNLGGADLSGADLTGAVL 110
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 20/101 (19%)
Query: 130 ANFTSADMRESDFSG--------------------SKFNGAYLEKAVAYKANFTGADLSD 169
A +AD+ ++D SG + +GAYL A+ +GADL+
Sbjct: 48 AKLWAADLSKADLSGVNLEAATLTAANLAGANLADANLSGAYLTTTNLGGADLSGADLTG 107
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++ + L +NL+ A L L ++L GA + GAD + A
Sbjct: 108 AVLTQANLEMSNLSGATLTGAYLELANLAGARVTGADLTKA 148
>gi|428301995|ref|YP_007140301.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238539|gb|AFZ04329.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 342
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 47/92 (51%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFT-----GADLSDTLMDRMVLN 178
AN D ++++ SGSKF A LE A + AN + G +LSD M + LN
Sbjct: 130 HANLAGTDFQDANLSGSKFVSANLEYAALKNVYLWNANISDACLIGTNLSDAYMHSVKLN 189
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ANLTNA+L R L+ L + AD SDA
Sbjct: 190 GANLTNAILHRVKLSDGKLRDTNLINADLSDA 221
>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
Length = 221
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 16/107 (14%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR + + R AN T AD+R +D G+ GA L +A +AN ADLS
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANLCDADLS- 169
Query: 170 TLMDRMVLNEANLTNAV-----LVRTVLTRSDLGGAIIEGADFSDAV 211
+ANL+ A+ L R L+ DLG A + GA D +
Sbjct: 170 ---------QANLSGAILSQVDLRRVTLSNVDLGQAELSGATVPDQL 207
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 52/100 (52%), Gaps = 4/100 (4%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F DL++ +++ E N T R + + + + A L++ +AN TGA L T +
Sbjct: 28 FRGVDLQQ-INLSE---VNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAKLWRTNL 83
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ L EANL+ A ++R LTR +L AI+ AD ++
Sbjct: 84 KKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTIL 123
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 7/121 (5%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAY 151
L +Y A R G+ +L + + N AN + A ++E + + + GA
Sbjct: 18 LERYSAGERDFRGVDLQQINLSEVNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAK 77
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGAD 206
L + K + A+LS M R L NL A+L R T+L +DL GA + AD
Sbjct: 78 LWRTNLKKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTILQDTDLRGANLTRAD 137
Query: 207 F 207
Sbjct: 138 L 138
>gi|318042736|ref|ZP_07974692.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 164
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ AD+R S+ G+ +GA L A+ + + ADLSD + L +ANL +AVL++
Sbjct: 69 ADLRGADLRGSNLEGADLSGADLRGAMLQDSWLSNADLSD-----VDLRQANLRDAVLIQ 123
Query: 190 TVLTRSDLGGAIIEGADFS 208
+ L GA++ GADF+
Sbjct: 124 ALTPGLQLEGAVLIGADFT 142
>gi|300863681|ref|ZP_07108615.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338313|emb|CBN53761.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 238
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 53/108 (49%), Gaps = 7/108 (6%)
Query: 112 QFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
+F ADLR++ K NFT A +E+D S S G L +A Y+A ADLS
Sbjct: 36 EFDRADLRQSRLGK----TNFTQASFQETDLSESILWGTDLTEANLYRAVLREADLSGAK 91
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---SDAVIDLAQ 216
+ L EANL A L L R+ L AI+ AD SD + DL Q
Sbjct: 92 LTDANLEEANLMKACLSGANLVRAKLLRAILFEADLRSTSDQITDLGQ 139
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 8/81 (9%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYK--------ANFTGADLSDTLMDRMVLNEAN 181
+N + A + +++ G+K A+L + + + A+ GADLS + +L +AN
Sbjct: 150 SNLSGALLYQANLDGAKLCRAHLNETIQQRFLATNLSEASLQGADLSYADLSGAILRKAN 209
Query: 182 LTNAVLVRTVLTRSDLGGAII 202
L A + RT+LT +DL GAI+
Sbjct: 210 LRGADMTRTILTNTDLEGAIM 230
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 40/75 (53%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D+ ++ G +F+ A L ++ K NFT A +T + +L +LT A L R VL +
Sbjct: 26 DLLNAELQGIEFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLTEANLYRAVLREA 85
Query: 196 DLGGAIIEGADFSDA 210
DL GA + A+ +A
Sbjct: 86 DLSGAKLTDANLEEA 100
>gi|428224583|ref|YP_007108680.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984484|gb|AFY65628.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 156
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 70/138 (50%), Gaps = 9/138 (6%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F A L +A + N + AN +SAD+ +D S + +GA L +A A+ T ADL
Sbjct: 17 FQQAALHQADLEEVNLQQANLSSADLSSADLSHANLSGANLSRANLSNADLTNADLRSAD 76
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLAQKQALCKYANGTN 228
+ + L ANL+ A L R L ++DL AI+ ADF+ A +DL+ +GTN
Sbjct: 77 LSEVNLIGANLSGAKLGRANLFQADLRSAILTDADFTGANLEDVDLSGAD-----LSGTN 131
Query: 229 PITGVSTRKSLGCGNSRR 246
T ++ + G SRR
Sbjct: 132 LRTAELSKAASSHGVSRR 149
>gi|427724799|ref|YP_007072076.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427356519|gb|AFY39242.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 276
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 64/138 (46%), Gaps = 17/138 (12%)
Query: 121 AVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDR 174
AV K N A +A++R +D G+ GAYL AN F+GA+L + +
Sbjct: 135 AVGPKANLSGAYLNTANLRGADLQGANLRGAYLSGTDFTGANLTGVAFSGANLKRSFLTG 194
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIE------GADFSDAV-IDLAQKQALC----KY 223
L EA L N L L +DL GA++E GADFSD + +++ LC K
Sbjct: 195 ACLREARLINVELEMADLRGADLTGAMLEQIESLAGADFSDVRGLSDSERSYLCSRSPKE 254
Query: 224 ANGTNPITGVSTRKSLGC 241
N T +TR SL C
Sbjct: 255 LGTWNSFTRKNTRASLNC 272
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 12/98 (12%)
Query: 123 HVKENF-RAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 180
VKE R N +A++ + D +G + A L+ A+ NFTGA L+ + L A
Sbjct: 17 EVKEILERGNSLENANLEDLDLAGYDLSDANLQGAILIGVNFTGATLAGAQLQNADLRRA 76
Query: 181 NLTN----------AVLVRTVLTRSDLGGAIIEGADFS 208
NLTN A L RT+L DL GA+++GA+ +
Sbjct: 77 NLTNASLKGATLSEAYLQRTILNDCDLAGAVLDGANLT 114
>gi|111023196|ref|YP_706168.1| hypothetical protein RHA1_ro06233 [Rhodococcus jostii RHA1]
gi|110822726|gb|ABG98010.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length = 201
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 16/131 (12%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
+E R E I + F ADL ++ HV FR+ +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 203 EGADFSDAVID 213
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|425469207|ref|ZP_18848164.1| Tetratricopeptide repeat protein [Microcystis aeruginosa PCC 9701]
gi|389882794|emb|CCI36776.1| Tetratricopeptide repeat protein [Microcystis aeruginosa PCC 9701]
Length = 262
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L++ + ++ + + + + + +S+ G+K NGA L A +AN +GADLS + L
Sbjct: 29 LQQLLSTRKCPQCDLSGSGLVQSNLVGAKLNGANLVGANLSQANLSGADLSGANLTGASL 88
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 89 FGANLTGANLTGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+L+ ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASLFGANLTGANLTGAILTGADLRGAYLNNANLDN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|22297676|ref|NP_680923.1| hypothetical protein tlr0132 [Thermosynechococcus elongatus BP-1]
gi|22293853|dbj|BAC07685.1| tlr0132 [Thermosynechococcus elongatus BP-1]
Length = 274
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 10/77 (12%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N A++ E DF G + AN + ADLSD + R++L+ ANL +A L R
Sbjct: 161 NLQGANLSEKDFEGHNLS----------HANLSHADLSDAFLHRVILHRANLRHANLFRA 210
Query: 191 VLTRSDLGGAIIEGADF 207
L ++DL A ++GA+
Sbjct: 211 NLLQADLSYADLQGANL 227
>gi|443321008|ref|ZP_21050077.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442789287|gb|ELR98951.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 333
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV--- 186
A+ S+ M S S SK A L AV KAN ADLS ++R +L EANL A+
Sbjct: 116 ASLISSSMIGSCLSKSKLKLANLTSAVLAKANLQYADLSFAGLNRAILTEANLRGAILKQ 175
Query: 187 --LVRTVLTRSDLGGAIIEGADFSDA 210
L+R+ L R DL GA ++G + S A
Sbjct: 176 ATLIRSYLNRVDLSGANLQGCNLSLA 201
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 107 IGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
I + A A L++A ++ R + + A+++ + S + GA L A AN GA
Sbjct: 162 ILTEANLRGAILKQATLIRSYLNRVDLSGANLQGCNLSLADLRGANLTGANLQGANLEGA 221
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+LSD + L+ ANLT A LV T L R++L GA + A+
Sbjct: 222 NLSD-----VNLSGANLTKANLVGTQLVRANLTGAKLSYANL 258
>gi|386721242|ref|YP_006187567.1| hypothetical protein B2K_03510 [Paenibacillus mucilaginosus K02]
gi|384088366|gb|AFH59802.1| hypothetical protein B2K_03510 [Paenibacillus mucilaginosus K02]
Length = 219
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ FT + + SDFSG+ G+ + + +ANF GA+L+D + + L A+ +LV
Sbjct: 31 KGQFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSALDLANASFHKTILV 90
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
RT ++S L GA G +D + +
Sbjct: 91 RTNFSKSGLDGAQFTGVRLTDVTLTM 116
>gi|158341491|ref|YP_001522656.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
MBIC11017]
gi|158311732|gb|ABW33342.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
MBIC11017]
Length = 1037
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 5/108 (4%)
Query: 115 SADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADLR A+ ++ N F N ++ ++ +D S + + A L A +AN +GADL +T +
Sbjct: 884 SADLRNAILIRANLFSTNLSNVNLYSADLSSTDMSSANLSNADLIRANLSGADLHNTDLF 943
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
L+ ANL+NA L +L S+L E + + A ++ A+ +C
Sbjct: 944 YANLSNANLSNANLSNAILLSSNLR----ETKNLTQAQLEGAEHPLIC 987
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 11/109 (10%)
Query: 111 AQFGSADLRKAVHVKENFRA---NFTS--------ADMRESDFSGSKFNGAYLEKAVAYK 159
A+ ADLR A+ ++ N A NFT AD+R +D + + FN A L
Sbjct: 805 AKLRHADLRSAILIRANLFAADLNFTDFSDADLRYADLRRTDLNFTDFNHANLNFTKLGN 864
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
AN G +LSD + L A+L NA+L+R L ++L + AD S
Sbjct: 865 ANLNGTNLSDANLIGTNLYSADLRNAILIRANLFSTNLSNVNLYSADLS 913
>gi|428217414|ref|YP_007101879.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427989196|gb|AFY69451.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 225
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ ++A + SD SG+ + A L A+ N +GA+L D + L +ANLT A LV
Sbjct: 108 DLSAATLNRSDLSGANLSEANLSDALMDSVNLSGANLDDANLSFAALTDANLTAASLV-- 165
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
+DL GA ++GAD +DA + A +A
Sbjct: 166 ---EADLNGAFLKGADLTDANFEGANLEA 191
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
SAA +DL A ++ E AN + A M + SG+ + A L A AN T A L
Sbjct: 110 SAATLNRSDLSGA-NLSE---ANLSDALMDSVNLSGANLDDANLSFAALTDANLTAASLV 165
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ ++ L A+LT+A L ++L A IEGA+ A
Sbjct: 166 EADLNGAFLKGADLTDANFEGANLEAANLSTATIEGANLEQA 207
>gi|300863629|ref|ZP_07108569.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300338371|emb|CBN53713.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 386
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 74/146 (50%), Gaps = 10/146 (6%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
L +VV S S + L N E + R + IG A ADL KA H+ RAN + A
Sbjct: 25 LVLSVVDSHSGDTPTLVLANINEQQNR-PYLIG--ANLSEADLSKA-HLS---RANLSKA 77
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D+ ++ G+ GA L A AN TGA+L+ ++ L+ ANL+ A L T ++ +
Sbjct: 78 DLSGANLCGANLVGASLSGANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDMSAA 137
Query: 196 DLGGAIIEGADFSDAVI---DLAQKQ 218
+ GAI+ A+ A + +L+Q Q
Sbjct: 138 NFSGAILNDANLGKAYLIKSNLSQAQ 163
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 47/82 (57%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN DM ++FSG+ N A L KA K+N + A L+D + + L +A+LT+A L
Sbjct: 126 KANLKGTDMSAANFSGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDADLTDANLS 185
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
L R++L GA + AD + A
Sbjct: 186 GAELARANLAGANLTRADLTKA 207
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQ ADL +A + AN + A++ ++ +G+ A L KA KAN ADL
Sbjct: 160 SQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANLTRADLTKANLLKANLRRADL 219
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+++ ++ L EA+L+ A+L R L+++DL
Sbjct: 220 TESYLNWASLGEADLSEAILTRANLSKADL 249
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T A++ ++ +G+ N A L A KAN G D+S +LN+ANL A L++
Sbjct: 97 ANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDMSAANFSGAILNDANLGKAYLIK 156
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
+ L+++ L A + A+ DA
Sbjct: 157 SNLSQAQLNDADLTQANLKDA 177
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 55/110 (50%), Gaps = 16/110 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A A+L KA +K N A+ T A+++++D + + +GA L +A AN
Sbjct: 140 SGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANL 199
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
T ADL+ +ANL A L R LT S L A + AD S+A++
Sbjct: 200 TRADLT----------KANLLKANLRRADLTESYLNWASLGEADLSEAIL 239
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A G DL K + N AN + A + E++ S + GA L A KANF
Sbjct: 270 SGADLGGLDLSKKLLTGINLASAYLSEANLSGAYLIEANLSDANLCGADLSDACLMKANF 329
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
GA M + L+ ANLT A L + L ++L GAI+ AD A
Sbjct: 330 IGAR-----MGNINLSNANLTGAKLCKADLMGANLRGAILTEADMRGA 372
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L +A K N +AN AD+ ES + + A L +A+ +AN + ADLS
Sbjct: 192 ANLAGANLTRADLTKANLLKANLRRADLTESYLNWASLGEADLSEAILTRANLSKADLSK 251
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
T + ++VL+ +L+ L L DL ++ G + + A +
Sbjct: 252 TYLRKIVLHGCHLSGINLSGADLGGLDLSKKLLTGINLASAYL 294
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A G ADL +A+ RAN + AD+ ++ +G +L A+ G DLS
Sbjct: 227 ASLGEADLSEAILT----RANLSKADLSKTYLRKIVLHGCHLSGINLSGADLGGLDLSKK 282
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L+ + L A L+ A L L ++L A + GAD SDA +
Sbjct: 283 LLTGINLASAYLSEANLSGAYLIEANLSDANLCGADLSDACL 324
>gi|254413321|ref|ZP_05027092.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179941|gb|EDX74934.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 636
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
AA +A+LR+ + K N R A A + E++ + A L +A Y+A T ADLS
Sbjct: 210 AANLTTANLREVLLEKANLRDAILVGATLTEANLRQACLRRANLTQAELYRAILTDADLS 269
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
+ DR+ L+ ANL A L+R L ++L +++
Sbjct: 270 EVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNV 306
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------MVLN 178
+N T A + ++ ++ A L +A AN T A+L + L+++ L
Sbjct: 180 HSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGATLT 239
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
EANL A L R LT+++L AI+ AD S+ D
Sbjct: 240 EANLRQACLRRANLTQAELYRAILTDADLSEVTGD 274
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 68/175 (38%), Gaps = 43/175 (24%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYE-----AETRGEFGIGSAAQFGSADLRKAVHVK 125
+ST L AA + S + L N E A R +G A A+LR+A +
Sbjct: 193 LISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVG--ATLTEANLRQACLRR 250
Query: 126 EN------FRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKAN------------- 161
N +RA T AD+ E + S + GAYL +A AN
Sbjct: 251 ANLTQAELYRAILTDADLSEVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNVYCLQ 310
Query: 162 ------------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
ADLS ++ +L EANLT+A L+ + L R L A + G
Sbjct: 311 TNLTAANLQGADLRQADLSGAYLNETILTEANLTDAYLIGSYLIRPKLEQAQLTG 365
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 47/90 (52%), Gaps = 10/90 (11%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEK----------AVAYKANFTGADLSDTLMDRMVLNEA 180
N + A+++ + + S GA L+K A Y+A+ A+L+ + ++L +A
Sbjct: 167 NLSGANLQAAQLNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKA 226
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
NL +A+LV LT ++L A + A+ + A
Sbjct: 227 NLRDAILVGATLTEANLRQACLRRANLTQA 256
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 50/120 (41%), Gaps = 16/120 (13%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFN----------GAYL 152
S A ADL A+ N R N + D + + + + GA L
Sbjct: 114 SGACLHQADLHNAILKHSNLNQAILTRVNLSKVDGQSASLCQANLSWVEAPYCNLSGANL 173
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ A +N TGA L T + L ANL A L+ LT ++L ++E A+ DA++
Sbjct: 174 QAAQLNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAIL 233
>gi|448449600|ref|ZP_21591825.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
13561]
gi|445813229|gb|EMA63210.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
13561]
Length = 822
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 1/98 (1%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADL AV + A+ + E+D SG+ GA L +A+ T ADLS+
Sbjct: 178 ASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKEADLTNADLSN 237
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ R+ L +A+L AVL +T +DL GA++ AD
Sbjct: 238 ADLYRVDLTDADLEGAVLTDADITDADLEGAVLTDADL 275
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 6/105 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A DL V N R ++ T A +R SD S + GA+LE A+
Sbjct: 378 ADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTDASLRE 437
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
ADL+D ++ + L ANL A L L +DL A + AD +D
Sbjct: 438 ADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTD 482
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 52/114 (45%), Gaps = 19/114 (16%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
E + + A ADL AV T AD+ +D +G+ A L A A+ T
Sbjct: 251 EGAVLTDADITDADLEGAV---------LTDADLEGTDLTGANLKVADLTGANLKVADLT 301
Query: 164 GADLSDTLM-----DRMVLNEA-----NLTNAVLVRTVLTRSDLGGAIIEGADF 207
GADL D ++ +R L EA +LT A L LT DLGGA++ AD
Sbjct: 302 GADLEDAVLTDADLERTDLIEASLLSADLTGASLKEADLTEVDLGGAVLTDADL 355
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 69/147 (46%), Gaps = 11/147 (7%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNG 149
L + N EA+ G G+ A LR+A N + T+A +RE+D +G+ G
Sbjct: 450 LTNANLREADLTGAHLKGT--DLTDASLREADLTDVNLEEIDLTNASLREADLTGAHLEG 507
Query: 150 -----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
A+LE AN ADL+ +++ L ANLT+A L L+ +DL + G
Sbjct: 508 VDLTGAHLEGIDLTSANLNQADLTSANLNQADLRGANLTDASLREANLSGADLTDTELSG 567
Query: 205 ADFSDAVI---DLAQKQALCKYANGTN 228
AD S + DL + ++L +G N
Sbjct: 568 ADLSRTDLEKSDLHKSKSLPTNLSGAN 594
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 12/135 (8%)
Query: 85 SSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF 142
S +I ADL+K + G + A G A+L A V+ + AN AD+ ++D
Sbjct: 16 SEDIEPSADLSKVDLSDADLSGADLTNAYLGGANLSNATLVEADLTGANLRDADLTDADL 75
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDR-----MVLNEANLTNAVLVRTVLTRSDL 197
+ AYLE A ADL+D + R +L EA+LT+A L RT D
Sbjct: 76 YRTDLTDAYLEGVNLSGATPVEADLTDASLKRANLSSTILMEADLTDADLYRT-----DF 130
Query: 198 GGAIIEGADFSDAVI 212
A +EGA+ ++A +
Sbjct: 131 TDAYLEGANLTNAYL 145
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 51/96 (53%), Gaps = 1/96 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
A+L +AV + A+ A + ++D SG+ L +A A+ TGA+L +
Sbjct: 168 AELPRAVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKE 227
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L A+L+NA L R LT +DL GA++ AD +DA
Sbjct: 228 ADLTNADLSNADLYRVDLTDADLEGAVLTDADITDA 263
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A A+LR +KE A+ T+AD+ +D A LE AV A+ T ADL
Sbjct: 211 SGADLTGANLRHG-RLKE---ADLTNADLSNADLYRVDLTDADLEGAVLTDADITDADLE 266
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ L +LT A L LT ++L A + GAD DAV+
Sbjct: 267 GAVLTDADLEGTDLTGANLKVADLTGANLKVADLTGADLEDAVL 310
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
Query: 116 ADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
A LR+A N + T+A++RE+D +G+ G L A +A+ T +L + +
Sbjct: 433 ASLREADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTDVNLEEIDLTN 492
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L EA+LT A L DL GA +EG D + A
Sbjct: 493 ASLREADLTGAHLEGV-----DLTGAHLEGIDLTSA 523
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 129 RANFTS-----ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
RAN +S AD+ ++D + F AYLE A A +G+DL++ ++ L +A+
Sbjct: 107 RANLSSTILMEADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEGANLTDASPI 166
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSD 209
A L R VLT + L GA + GA +D
Sbjct: 167 GAELPRAVLTDASLLGADLPGAVLTD 192
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 55/126 (43%), Gaps = 34/126 (26%)
Query: 128 FRANFTSADMRESDF---------------SGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
A+ T AD+ +DF SGS AYLE A A+ GA+L
Sbjct: 116 MEADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEGANLTDASPIGAELP---- 171
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGA------IIE----GADFSDAVIDLAQKQALCK 222
R VL +A+L A L VLT +DL GA +IE GAD + A + + K
Sbjct: 172 -RAVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANL----RHGRLK 226
Query: 223 YANGTN 228
A+ TN
Sbjct: 227 EADLTN 232
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 60/128 (46%), Gaps = 26/128 (20%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA--------VAY--K 159
A ADL + ++ + A+ T A ++E+D + GA L A AY
Sbjct: 308 AVLTDADLERTDLIEASLLSADLTGASLKEADLTEVDLGGAVLTDADLEGTALTEAYLPS 367
Query: 160 ANFTG-----ADLSDTLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIEG 204
+ TG ADL++ ++ VL +ANL T+A L + L+ +DL GA +EG
Sbjct: 368 PDLTGASLKEADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEG 427
Query: 205 ADFSDAVI 212
D +DA +
Sbjct: 428 IDLTDASL 435
>gi|427710065|ref|YP_007052442.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427362570|gb|AFY45292.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 575
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 48/81 (59%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN ++AD+ ++ + A L +A A++A+ A+LSD + L+ A+L NA L R
Sbjct: 78 ANLSNADLSGANLRNINLSKAKLSRANAFRADLVSANLSDADLSSTNLSGADLRNANLTR 137
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
LT +DL GA + GA+ +DA
Sbjct: 138 ADLTNADLSGANLNGANLTDA 158
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 6/103 (5%)
Query: 109 SAAQFGSADLRKAVHVKEN-FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A + +L KA + N FRA+ SA++ ++D S + +GA L A +A+ T ADL
Sbjct: 86 SGANLRNINLSKAKLSRANAFRADLVSANLSDADLSSTNLSGADLRNANLTRADLTNADL 145
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S LN ANLT+A + +L G + G D S+A
Sbjct: 146 SGA-----NLNGANLTDANMRGVRFDNVNLQGVNLNGVDLSNA 183
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
AN + A+++ +D S ++ N A L+ A Y AN GADL + LN ANL NA L
Sbjct: 218 ANLSYANLQNADLSNARLNNADLQNANLYNANLQGADLIGS-----KLNSANLDNADL 270
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 38/77 (49%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ ++AD+R +F G NG L + N G +L + + L A+L+NA L
Sbjct: 179 DLSNADLRNFNFRGVSLNGVNLSRVNLNGYNLRGVELKNANLSYANLQNADLSNARLNNA 238
Query: 191 VLTRSDLGGAIIEGADF 207
L ++L A ++GAD
Sbjct: 239 DLQNANLYNANLQGADL 255
>gi|298246992|ref|ZP_06970797.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297549651|gb|EFH83517.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 381
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 57/121 (47%), Gaps = 13/121 (10%)
Query: 103 GEFGIGSAAQFGSA---DLRKAVHVKENFRANFTSADMRES-----DFSGSKFNGAYLEK 154
G +GS + GSA DL+ H+ A A MR S D S + GA L K
Sbjct: 236 GHDALGSQGERGSARHPDLQ--AHLSH---AQLAGAKMRGSYLSGVDLSQANLRGADLSK 290
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A Y AN GADLS + L EAN+ A L L+++ L GA + AD S A + L
Sbjct: 291 AYFYGANLQGADLSGANLTETTLTEANIEGANLTEANLSKATLIGANLRQADLSGARLTL 350
Query: 215 A 215
A
Sbjct: 351 A 351
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 20/103 (19%)
Query: 108 GSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
GS A G ADL+K V + N D+R +F +AN GADL
Sbjct: 146 GSKALVG-ADLQKIVLPQ----INLAQMDLRRVNFR---------------EANLQGADL 185
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + R L+ ANL++A L L +DL G + GAD SD+
Sbjct: 186 SGVNLYRADLSGANLSHATLKGADLRGADLRGTDLTGADLSDS 228
>gi|254409513|ref|ZP_05023294.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183510|gb|EDX78493.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 209
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 60/118 (50%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 159
A +A+L +A ++ N RAN T A +RE+ + +GA L +A+ +
Sbjct: 80 ANLTAAELVRATLIECNLKRANLTEAHLREASLMFANLAQACLYQADLHGAMLHQAILHW 139
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADFSDAVI 212
A+ ADL ++ + A+L+ A L+R +L +DL GAI+ GA+F A++
Sbjct: 140 ASLKNADLIGAILQGADMRGADLSQACLIRADVSKAILMVADLRGAIVMGANFKAAIL 197
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 49/95 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
A A++R SD SG+ +GA L+ + +AN + A+LS + + LN+ANLT A LV
Sbjct: 29 EAILNGANLRRSDLSGANLSGASLKGSNLSEANLSQANLSVANLSKAELNDANLTAAELV 88
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
R L +L A + A +A + A C Y
Sbjct: 89 RATLIECNLKRANLTEAHLREASLMFANLAQACLY 123
>gi|443310759|ref|ZP_21040400.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442779202|gb|ELR89454.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 330
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 53/117 (45%), Gaps = 15/117 (12%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
E + +G F A+LR A F A+ S + +D G+ AYL +A YK
Sbjct: 5 ELLERYAVGEI-DFSGANLRGA----NLFAADLISIILIHADLHGANLTFAYLNRAQLYK 59
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
AN GA L ANLT A L L +DL GAI++GAD A + LA
Sbjct: 60 ANLIGAKLC----------GANLTQADLRAAALHDADLHGAILQGADLRSADMSLAN 106
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 62/132 (46%), Gaps = 18/132 (13%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA 155
++A+ RG G A ADLR A + N R A+ + AD+ +D S + N A L+ A
Sbjct: 154 FKADVRGANLAG--ANLSRADLRYANFNEVNLRGADLSCADLSNTDLSYALLNDANLDGA 211
Query: 156 VAYKANFTGA----------DLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGA 200
+ AN + A DL+D + LN ANLT A L + L R DL A
Sbjct: 212 ILTGANLSNARCERASMIDTDLTDVNLSGAAIPDGKLNRANLTGANLSKASLNRIDLSRA 271
Query: 201 IIEGADFSDAVI 212
+ AD SDA +
Sbjct: 272 NLSYADLSDAYL 283
>gi|428215892|ref|YP_007089036.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004273|gb|AFY85116.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 449
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ A+ ADLR A E F A A++ E+D +KF+ A L KA N +G++LS
Sbjct: 173 TGAKLEKADLRNA----ELFSAKLIEANLVEADLRNAKFSEANLSKAKLDGTNLSGSNLS 228
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T + L EANLT A L L +++L G + A+ S A
Sbjct: 229 RTNLSEASLTEANLTEANLSEATLRKANLSGVKLCDANLSRA 270
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 6/108 (5%)
Query: 111 AQFGSADLRKAVHVKEN-FRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 164
A+ ADL +A + + FRA T AD+ + D SG+ A L +A +AN +
Sbjct: 80 AKLSYADLSRADLFRADLFRAELTDADLHRANLTRADLSGANLTRANLNEATLSQANLSD 139
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++LS ++ LN A L A L L SDL GA +E AD +A +
Sbjct: 140 SNLSFASLNNTKLNGAKLNGANLSEARLFDSDLTGAKLEKADLRNAEL 187
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 10/95 (10%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----------ADLSDTLMDRMVL 177
+ A+ + AD+ E+ SG K +GA LE A +A+ + ADLS + R L
Sbjct: 38 YDADLSCADLFEAKLSGIKLSGANLENAHLSRADLSNGKLFGAKLSYADLSRADLFRADL 97
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A LT+A L R LTR+DL GA + A+ ++A +
Sbjct: 98 FRAELTDADLHRANLTRADLSGANLTRANLNEATL 132
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 5/90 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANL 182
F A + AD+ +D + A L A ++AN T ADLS + R LNE ANL
Sbjct: 78 FGAKLSYADLSRADLFRADLFRAELTDADLHRANLTRADLSGANLTRANLNEATLSQANL 137
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+++ L L + L GA + GA+ S+A +
Sbjct: 138 SDSNLSFASLNNTKLNGAKLNGANLSEARL 167
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 43/86 (50%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN +A + +D S K GA L A +A+ ADL + L+ ANLT A L
Sbjct: 60 ANLENAHLSRADLSNGKLFGAKLSYADLSRADLFRADLFRAELTDADLHRANLTRADLSG 119
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLA 215
LTR++L A + A+ SD+ + A
Sbjct: 120 ANLTRANLNEATLSQANLSDSNLSFA 145
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 10/123 (8%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
ADL++ TR + S A A+L +A + N +N + A + + +G+K NGA
Sbjct: 105 ADLHRANL-TRADL---SGANLTRANLNEATLSQANLSDSNLSFASLNNTKLNGAKLNGA 160
Query: 151 YLEKAVAYKANFTGA-----DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L +A + ++ TGA DL + + L EANL A L + ++L A ++G
Sbjct: 161 NLSEARLFDSDLTGAKLEKADLRNAELFSAKLIEANLVEADLRNAKFSEANLSKAKLDGT 220
Query: 206 DFS 208
+ S
Sbjct: 221 NLS 223
>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 229
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 10/116 (8%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR A +H RAN T A + SG+K + A L +A+A KAN G DLS
Sbjct: 120 ANLDRADLRDADLHGTILHRANLTGAIL-----SGAKLDKASLIQAIAQKANLQGVDLSG 174
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+ M L+ + T L + T ++L GAI GA A + QA+ + AN
Sbjct: 175 ADLTDMNLSRVDFTAVNLKGAIFTGTNLTGAIFSGAKLDKASL----IQAIAQKAN 226
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 31/125 (24%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDF-----SGSKFNGAYLEKAVAYK 159
A F A+L+ A N R ANFT AD++ +D G+ F GA LE AV
Sbjct: 60 ANFTEANLKGA-----NLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQH 114
Query: 160 -----ANFTGADLSDTLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEG 204
AN ADL D + +L+ ANLT A+ L++ + +++L G + G
Sbjct: 115 TDLTNANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASLIQAIAQKANLQGVDLSG 174
Query: 205 ADFSD 209
AD +D
Sbjct: 175 ADLTD 179
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 113 FGSADLRK----AVHVKE-NF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
F ADL + +KE NF AN A++R +D G+ F A L+ A A+ GA+
Sbjct: 42 FAGADLEQVRLAGASLKEANFTEANLKGANLRGADCDGANFTRADLKSADLRWADCDGAN 101
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ ++ VL +LTNA L R L +DL G I+ A+ + A++
Sbjct: 102 FTGANLESAVLQHTDLTNANLDRADLRDADLHGTILHRANLTGAIL 147
>gi|189500184|ref|YP_001959654.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189495625|gb|ACE04173.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 412
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 55/110 (50%), Gaps = 8/110 (7%)
Query: 118 LRKAVHVKENFRANFTSA--DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
+RK+V + R N+ A D+ +D G GA L A AN GADLSDT +
Sbjct: 33 IRKSVTSWNSMRENYPEAAIDLSGADLKGRNLKGADLHNANLQGANLHGADLSDTDLRGA 92
Query: 176 VLNEANLTNAVL----VRTVLTR-SDLGGAIIEGADFSDAVIDLA-QKQA 219
+ A+L A+L +R R +DL A EGAD AV+D A KQA
Sbjct: 93 SFDHASLKGALLFDADLREATVREADLEDAAFEGADLRGAVLDGAVMKQA 142
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 20/103 (19%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN---------------FTGADLSDTLMDR 174
AN AD+ ++D G+ F+ A L+ A+ + A+ F GADL ++D
Sbjct: 77 ANLHGADLSDTDLRGASFDHASLKGALLFDADLREATVREADLEDAAFEGADLRGAVLDG 136
Query: 175 MV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
V L E+NL NA L T L ++L A + G D S A +
Sbjct: 137 AVMKQADLGESNLRNASLRGTDLRAANLKMADLAGCDLSGAYL 179
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 53/103 (51%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F A L+ A+ + R A AD+ ++ F G+ GA L+ AV +A+ ++L +
Sbjct: 92 ASFDHASLKGALLFDADLREATVREADLEDAAFEGADLRGAVLDGAVMKQADLGESNLRN 151
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ L ANL A L L+ + L A+++GA+ ++V+
Sbjct: 152 ASLRGTDLRAANLKMADLAGCDLSGAYLWRAVLDGANLENSVV 194
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 39/85 (45%), Gaps = 4/85 (4%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A F ADLR AV A AD+ ES+ + G L A A+ G DLS
Sbjct: 122 AAFEGADLRGAVLDG----AVMKQADLGESNLRNASLRGTDLRAANLKMADLAGCDLSGA 177
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRS 195
+ R VL+ ANL N+V+ + +
Sbjct: 178 YLWRAVLDGANLENSVVTSVTIVET 202
>gi|443329141|ref|ZP_21057730.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791290|gb|ELS00788.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 174
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 51/88 (57%), Gaps = 5/88 (5%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANL 182
RAN + A +R S+ SG+ F A L+KA + NF+GA+L + + + L+EA L
Sbjct: 38 IRANLSQAILRNSNLSGAFFVLADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEAVL 97
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDA 210
T L LT ++L GAI+ GA+ S+A
Sbjct: 98 TGTQLQGANLTEANLQGAILAGANLSEA 125
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 54/113 (47%), Gaps = 11/113 (9%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADL A+ + NF AN T + + E+ +G++ GA L +A A G
Sbjct: 60 ADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEAVLTGTQLQGANLTEANLQGAILAG 119
Query: 165 ADLSDTLMDRMVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+LS+ + L AN L NA L +T ++L GA +EGA + +I
Sbjct: 120 ANLSEANLRGGDLRGANLYGVDLRNADLTDAKITHANLRGANLEGAIMPEQLI 172
>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 293
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 21/124 (16%)
Query: 110 AAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAY 158
+A A+L A+ ++ N + ANFT AD+ E D S ++ NG L +A+
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222
Query: 159 KA----------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
A GA+LS + R L +NLT A+L+ TVL +++ + GA +
Sbjct: 223 GAKLRGVSICWTTLRGANLSKANLYRAKLCWSNLTEAILLETVLLDANMDQVNLRGATLT 282
Query: 209 DAVI 212
A++
Sbjct: 283 GAIL 286
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 64/127 (50%), Gaps = 24/127 (18%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTG 164
ADL +A + N +AN + A++ + S NGA L +A+ +AN
Sbjct: 84 ADLVEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAILSEAIMREANLNQ 143
Query: 165 ADLSDTLMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---V 211
A L D + R L+ +ANLTNA+L+ T L +++L A++ GA+F+ A
Sbjct: 144 AKLIDASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTE 203
Query: 212 IDLAQKQ 218
+DL+Q +
Sbjct: 204 VDLSQAR 210
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 59/118 (50%), Gaps = 13/118 (11%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADL 167
F +LR A + A+ T A++R SD S S GA L+ +AN T GADL
Sbjct: 31 FRRVNLRNASLIG----ADLTHANLRGSDLSQSNLTGASLKLVNFREANLTQITLRGADL 86
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
+ + L +ANL+ A L+ L S L GA + A+ S+A++ +A+ + AN
Sbjct: 87 VEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAIL----SEAIMREAN 140
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 6/101 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +L A + N +AN T+A + E++ + N KA+ + ANFT ADL++
Sbjct: 149 ASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLN-----KALLHGANFTQADLTE 203
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + LN NLT A+LV L + + GA+ S A
Sbjct: 204 VDLSQARLNGVNLTRAILVGAKLRGVSICWTTLRGANLSKA 244
>gi|390438685|ref|ZP_10227130.1| Tetratricopeptide repeat protein [Microcystis sp. T1-4]
gi|389837879|emb|CCI31254.1| Tetratricopeptide repeat protein [Microcystis sp. T1-4]
Length = 262
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L++ + ++ + + + + + +S+ G+K NGA L A +AN +GADLS + L
Sbjct: 29 LQQLLSTRKCPQCDLSGSGLVQSNLVGAKLNGANLVGANLSQANLSGADLSGANLTGASL 88
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 89 FGANLTGANLTGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+L+ ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASLFGANLTGANLTGAILTGADLRGAYLNNANLDN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|428201752|ref|YP_007080341.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427979184|gb|AFY76784.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 187
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E ++ + A++ E+D SG+ NGAYL KA AN A L + + + L ANL A
Sbjct: 83 EMWKIDLGQANLEETDLSGANLNGAYLWKAKLCIANLERAYLKEVNLVQCDLWRANLRGA 142
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVI 212
L+ LT + L GA +E A + + I
Sbjct: 143 YLIGANLTGASLKGACLERAKYDEKTI 169
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R N +AD++E + A LE A Y+AN G+ L T + R L EANL+ A +
Sbjct: 26 RINLHAADLKEVCLIDADLEEANLEGANLYRANLKGSCLYRTNLARSNLREANLSGAEMW 85
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA--QKQALC 221
+ DLG A +E D S A ++ A K LC
Sbjct: 86 KI-----DLGQANLEETDLSGANLNGAYLWKAKLC 115
>gi|76819210|ref|YP_336861.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
1710b]
gi|76583683|gb|ABA53157.1| pentapeptide repeat family protein [Burkholderia pseudomallei
1710b]
Length = 862
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 34/79 (43%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T D+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 549 ADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 603
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 604 ARLTAANLSLAHCERTDFS 622
>gi|56751008|ref|YP_171709.1| hypothetical protein syc0999_c [Synechococcus elongatus PCC 6301]
gi|81299332|ref|YP_399540.1| hypothetical protein Synpcc7942_0521 [Synechococcus elongatus PCC
7942]
gi|56685967|dbj|BAD79189.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168213|gb|ABB56553.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 195
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL A+ V + R A A +RE+D SG+ GA L ++ +A G++L +++
Sbjct: 49 ADLTGAILVGADLRRAWLRGAILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLND 108
Query: 175 MVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+L EAN L A LVR +L R+D A + AD S A I
Sbjct: 109 SILAEANCRQACLQQADLVRAILYRTDFTAADLHEADLSHAFI 151
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 3/110 (2%)
Query: 118 LRKAVHVKENFRANFTSA--DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
LR+ V +R+ + D+R++D S LE+A A GADL +
Sbjct: 10 LRRGTAVWSRWRSQNPTVIPDLRQADLSFVDLVNVDLERADLTGAILVGADLRRAWLRGA 69
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYA 224
+L EA+ + A L+ L RSDL A + G++ A++ D +A C+ A
Sbjct: 70 ILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLNDSILAEANCRQA 119
>gi|448661888|ref|ZP_21683780.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
gi|445758247|gb|EMA09568.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
Length = 480
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 17/128 (13%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A LR+A N + AN T A +R++D + + GA L A +A+ T A L +
Sbjct: 168 ANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLRE 227
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRS---------------DLGGAIIEGADFSDA-VID 213
+ LN ANLT +L + LT + DL GA + GADFS+A +I+
Sbjct: 228 VNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRYADLTGATLPGADFSEANLIN 287
Query: 214 LAQKQALC 221
++ L
Sbjct: 288 TTLREVLL 295
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF AD+ +++ GS F A L +A A ADL+D + L +A+L A L
Sbjct: 113 ANFLRADLHDANLKGSDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTD 172
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T L ++DL A ++GA+ +DA
Sbjct: 173 TSLRQADLTDANLKGANLTDA 193
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 5/119 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AA ADL+ A + R AD+ +++ G+ A L +A AN GADL
Sbjct: 157 AAALPDADLKGANLTDTSLR----QADLTDANLKGANLTDASLRQADLTDANLKGADLPG 212
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 227
+ R L +A L L L R++L G I+ AD +D + +A A +YA+ T
Sbjct: 213 ASLLRADLTDAFLREVNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRYADLT 271
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 56/108 (51%), Gaps = 25/108 (23%)
Query: 130 ANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDT------LMDRMV-- 176
AN + A ++E+D +G + GA L+ AV NF GADL + L D ++
Sbjct: 28 ANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADLVNANIKEAELTDAILRQ 87
Query: 177 -------LNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 212
L +ANLT + L+RT L R+DL A ++G+DF+DA +
Sbjct: 88 ADLTDAALWDANLTGSNLLRTDLPGANFLRADLHDANLKGSDFTDAAL 135
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 6/112 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 164
+ F A LR+A R A+ T AD+ ++D G+ L +A AN G
Sbjct: 128 SDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTDTSLRQADLTDANLKG 187
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
A+L+D + + L +ANL A L L R+DL A + + +DA ++ A
Sbjct: 188 ANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLREVNLTDAALNRAN 239
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 39/73 (53%)
Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 199
+D + + +GA+L++A AN T DL+ + VL + N A LV + ++L
Sbjct: 23 ADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADLVNANIKEAELTD 82
Query: 200 AIIEGADFSDAVI 212
AI+ AD +DA +
Sbjct: 83 AILRQADLTDAAL 95
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 6/82 (7%)
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADF 207
+ +V+ K + GADL+D + L EA+LT A L RT LT ++L GA++ GAD
Sbjct: 11 DDSVSDKDIYPGADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADL 70
Query: 208 SDAVIDLAQ-KQALCKYANGTN 228
+A I A+ A+ + A+ T+
Sbjct: 71 VNANIKEAELTDAILRQADLTD 92
>gi|392382587|ref|YP_005031784.1| conserved protein of unknown function; pentapeptide repeats
[Azospirillum brasilense Sp245]
gi|356877552|emb|CCC98392.1| conserved protein of unknown function; pentapeptide repeats
[Azospirillum brasilense Sp245]
Length = 493
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 61/124 (49%), Gaps = 27/124 (21%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDF-SGSKFNG---------------AY 151
+A+ ADLR A +H RA T A++R +DF +GS NG A
Sbjct: 84 TASTLIGADLRGANLH-----RAILTDANLRGADFRAGSLMNGTDDKPRSDGVTRLTEAK 138
Query: 152 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
+E+++ ANFTG DLS LN+A+LT A + VL +D GA ++G F
Sbjct: 139 MERSILAGANFTGCDLSGA-----DLNDADLTGADMTAAVLVGADFWGATLDGVTFDGTT 193
Query: 212 IDLA 215
ID A
Sbjct: 194 IDEA 197
>gi|332707026|ref|ZP_08427086.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354291|gb|EGJ33771.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 239
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S F AD +A N AN A + + F +K A L+ A N GADL
Sbjct: 78 SGVDFSRADFSQANLSDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADL 137
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
SD + + L A+L+ A+L RT L +DL GA +E AD S A I
Sbjct: 138 SDANLLNIRLANADLSTAILNRTELREADLTGANMEHADLSHASI 182
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 59/118 (50%), Gaps = 5/118 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S + +A+L+ A + F A TSAD+ +DF + GA L A ADL
Sbjct: 93 SDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADLSDANLLNIRLANADL 152
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 225
S +++R L EA+LT A + L+ + + GAI+ A+ + A + +A +YAN
Sbjct: 153 STAILNRTELREADLTGANMEHADLSHASIYGAILREANLTGANL----YKANLRYAN 206
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 6/111 (5%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+ SADL A N + AN + + +D S + N L +A AN
Sbjct: 115 AKLTSADLDGADFKDTNLKGADLSDANLLNIRLANADLSTAILNRTELREADLTGANMEH 174
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
ADLS + +L EANLT A L + L ++L A+++G + A + A
Sbjct: 175 ADLSHASIYGAILREANLTGANLYKANLRYANLQDAVLKGTNLKGADLQFA 225
>gi|86607938|ref|YP_476700.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86556480|gb|ABD01437.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 154
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 63/127 (49%), Gaps = 15/127 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 164
S AQ A+L K + +++ A+ + AD+RE+D SG+ +GA L A + N G
Sbjct: 32 SGAQLSGANL-KGIILRD---ADLSGADLREADLSGADLSGADLRGAKLRRVNLIGAKLV 87
Query: 165 -ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
ADL + R L A+L+ A L R L +DL GAII F A+ D K
Sbjct: 88 KADLRGANLYRAKLLRADLSEAELNRADLRIGADLRGAIITNTHFRGALYD-----EYTK 142
Query: 223 YANGTNP 229
+ +G NP
Sbjct: 143 FPDGFNP 149
>gi|383482351|ref|YP_005391265.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
85-930]
gi|378934705|gb|AFC73206.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
85-930]
Length = 959
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 11/118 (9%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ---KQALCKYAN 225
+ + EAN NA++ R LT++D A++E AD ++ A+ K+A+ K AN
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKADFTKALLENADMQ--AVEAAEAIFKEAILKQAN 665
Score = 41.2 bits (95), Expect = 0.54, Method: Composition-based stats.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A+ +A L KA E N + A + + + F A +++A KA+FT A L +
Sbjct: 589 AKLSNATLEKA----EAEGLNISDAIAKNINAKEANFKNAIMQRADLTKADFTKALLENA 644
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
M + EA A+L + L ++L G EGADF A I+ A K
Sbjct: 645 DMQAVEAAEAIFKEAILKQANLKAANLAGINKEGADFDKAKINDATK 691
>gi|427712429|ref|YP_007061053.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427376558|gb|AFY60510.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 316
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 44/88 (50%), Gaps = 5/88 (5%)
Query: 130 ANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN AD+RE DF G + GA L A ++ +F A+LS + R L ANL+N
Sbjct: 202 ANLRGADLREKDFEGRNLSYADLTGADLSDAFLHRVSFYRANLSQATLFRANLLNANLSN 261
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L L +D GA + GAD A +
Sbjct: 262 ANLRDANLIGADFSGADLRGADLRGAKV 289
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 43/84 (51%), Gaps = 12/84 (14%)
Query: 138 RESDFSGSKFNGAYLE------KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
R D SG+ GA L + ++Y A+ TGADLSD + R+ ANL+ A L R
Sbjct: 195 RGKDLSGANLRGADLREKDFEGRNLSY-ADLTGADLSDAFLHRVSFYRANLSQATLFRAN 253
Query: 192 LTRSDLGGAIIE-----GADFSDA 210
L ++L A + GADFS A
Sbjct: 254 LLNANLSNANLRDANLIGADFSGA 277
>gi|397736621|ref|ZP_10503302.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
gi|396927531|gb|EJI94759.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
Length = 201
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 16/131 (12%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
+E R E I + F ADL ++ HV FR+ +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESNHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 203 EGADFSDAVID 213
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|390442549|ref|ZP_10230537.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|389834137|emb|CCI34663.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
Length = 179
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 56/127 (44%), Gaps = 21/127 (16%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFS 143
C N +L DLN A G A ADL R N A++R +D
Sbjct: 61 CDFNGISLKDLNLSSANLEG-------ANLSQADLE---------RTNLQGANLRGTDLR 104
Query: 144 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
G+ L A KAN GADL ++ L ANLTNA L + L +++L A ++
Sbjct: 105 GADLGKTLLAGADLSKANLLGADL-----EKANLQGANLTNANLQKADLEKANLTNARLD 159
Query: 204 GADFSDA 210
GA+ DA
Sbjct: 160 GANLQDA 166
>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 418
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 162
S A F +DL A+ ++ + R AN + A++ E +D SG F+G+ L +A +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
G + S R L EAN +N L+ SDL GA + A+F++A
Sbjct: 203 LGTNFS-----RTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEA 245
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 70/139 (50%), Gaps = 19/139 (13%)
Query: 95 NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAY 151
N E + G IG S A F ADLR+A V ANF +A+++E+D SG+ GA
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRANLVG----ANFNNANLKEADLSGAYLIGAT 276
Query: 152 LEKAVAYKANF----------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
L A +A+F TGADL+ + L+ ANL++ L LT +DL A
Sbjct: 277 LVNANIVRADFRRANLIGADLTGADLTGADLVGANLSGANLSDCNLTSVSLTSADLSMAN 336
Query: 202 IEGADFSDAVIDLAQKQAL 220
D ++A +L++ QAL
Sbjct: 337 FANCDLTNA--NLSRVQAL 353
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F ADL +A + F NF++ ++ E+D +GA L A A+ GADL
Sbjct: 55 ANFSGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTADLIGADLRR 114
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ +L EA+L+ LV T +T ++L A G+D S A++
Sbjct: 115 ATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIM 157
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 9/102 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADLR RA A + E+D S + G + A ANFTG+DLS
Sbjct: 103 STADLIGADLR---------RATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLS 153
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+M R L AN++ A L ++R+DL G G++ S A
Sbjct: 154 GAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQA 195
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 10/80 (12%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+FT A++ E DF+G+ KANF+GADLS + R E N +N L
Sbjct: 36 DFTGANLSEVDFAGTDL----------QKANFSGADLSRAKLRRATFGETNFSNTNLSEA 85
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L R +L GA + GA+ S A
Sbjct: 86 DLRRVNLSGADLRGANLSTA 105
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 16/118 (13%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYL---------- 152
S + A+ +A + NF ANF++ + RE D SGS GA L
Sbjct: 188 SGSNLSQANFEEANFLGTNFSRTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEADL 247
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A ANF A+L + + L A L NA +VR R++L GA + GAD + A
Sbjct: 248 RRANLVGANFNNANLKEADLSGAYLIGATLVNANIVRADFRRANLIGADLTGADLTGA 305
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 5/83 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN-----EANLTNA 185
+F D+++++FSG+ + A L +A + NF+ +LS+ + R+ L+ ANL+ A
Sbjct: 46 DFAGTDLQKANFSGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTA 105
Query: 186 VLVRTVLTRSDLGGAIIEGADFS 208
L+ L R+ L GAI+ AD S
Sbjct: 106 DLIGADLRRATLEGAILAEADLS 128
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 16/120 (13%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGA---------------YL 152
S A A LR+A + NF N + AD+R + SG+ GA L
Sbjct: 58 SGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTADLIGADLRRATL 117
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
E A+ +A+ + +L T M L+ AN T + L ++ R+DL A I A+ ++A I
Sbjct: 118 EGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIMIRADLRRANISRANLNEADI 177
>gi|389842816|ref|YP_006344900.1| hypothetical protein ES15_3816 [Cronobacter sakazakii ES15]
gi|387853292|gb|AFK01390.1| hypothetical protein ES15_3816 [Cronobacter sakazakii ES15]
Length = 846
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 17/119 (14%)
Query: 129 RANFTSADMRESDFS-----GSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLN 178
RA+FT A +R+S+ G++F A LE +AN F A L +L R
Sbjct: 723 RADFTHATLRQSNLRQTALCGARFELAKLENTDLSEANCRGASFQRASLVGSLFIRTDFR 782
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
E + T+A L+ +L +S LGGA GA+ A DL+Q + NG ++G T++
Sbjct: 783 EVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----SFTNGETRMSGAFTKR 834
>gi|359462953|ref|ZP_09251516.1| hypothetical protein ACCM5_29760 [Acaryochloris sp. CCMEE 5410]
Length = 435
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 65/128 (50%), Gaps = 11/128 (8%)
Query: 99 AETRGEFGIGSA----AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
A RG + +GSA A SADL V++ + AN + A + ++ +K GA L
Sbjct: 287 ANLRGAY-LGSANLLGANLNSADL-IGVYLSD---ANLSQAKLVGANLRTAKLIGAKLTD 341
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
+ANFTGADLSD ++ +ANL RT +DL GA + GA F + +D
Sbjct: 342 TDLSEANFTGADLSDANLEGADFTDANLREVSFQRTQFREADLSGADLRGAIFLE--VDQ 399
Query: 215 AQKQALCK 222
++ LC+
Sbjct: 400 LEECKLCR 407
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 61/143 (42%), Gaps = 32/143 (22%)
Query: 99 AETRGEFGIGS---AAQFGSADLRKAVHVKENFRANFTSADMRE---------------- 139
A +G + IG+ A A+LR A + AN + AD+ +
Sbjct: 222 ANFQGTYLIGTNLREANLREANLRNA----DLLSANLSEADLTQANLSSANLLGTNLNSA 277
Query: 140 ----SDFSGSKFNGAYLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRT 190
+D +G+ GAYL A AN AD LSD + + L ANL A L+
Sbjct: 278 NFQNADLTGANLRGAYLGSANLLGANLNSADLIGVYLSDANLSQAKLVGANLRTAKLIGA 337
Query: 191 VLTRSDLGGAIIEGADFSDAVID 213
LT +DL A GAD SDA ++
Sbjct: 338 KLTDTDLSEANFTGADLSDANLE 360
>gi|443665875|ref|ZP_21133688.1| tetratricopeptide repeat family protein [Microcystis aeruginosa
DIANCHI905]
gi|159027171|emb|CAO86803.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331319|gb|ELS45983.1| tetratricopeptide repeat family protein [Microcystis aeruginosa
DIANCHI905]
Length = 262
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 55/101 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L++ + ++ + + + + + +S+ +G+K NGA L A +AN +GADLS +
Sbjct: 29 LQQLLSTRKCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 89 FGANLTGANLTGAILTGADLRGAYLNNANLENTKLDTAYVQ 129
Score = 40.4 bits (93), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+L+ ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLTGAILTGADLRGAYLNNANLEN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|451981569|ref|ZP_21929921.1| hypothetical protein NITGR_590064 [Nitrospina gracilis 3/211]
gi|451761242|emb|CCQ91185.1| hypothetical protein NITGR_590064 [Nitrospina gracilis 3/211]
Length = 241
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 135 ADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AD+R S+F+ + F+ GAYLE A ANF A+L + + V ANL A L
Sbjct: 91 ADLRHSNFTNANFSEANLTGAYLEGANLEGANFQRAELKAGALKQAVFRNANLFEADLRY 150
Query: 190 TVLTRSDLGGAIIEGADFS 208
T + +D GA +EGADF+
Sbjct: 151 TRVDEADFTGANLEGADFT 169
>gi|427715910|ref|YP_007063904.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348346|gb|AFY31070.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 1031
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN + + ++ SG+ +GA L A +AN G LS ++R L+ AN + A L
Sbjct: 864 RANLSGTNFSRANLSGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLS 923
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLA 215
R L+ +DL GA + GAD SDA ++ A
Sbjct: 924 RANLSGADLSGADLSGADLSDANLNRA 950
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 46/80 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF+ AD+ ++ SG+ +GA L A AN A+LS + R L++ANL++A L
Sbjct: 915 ANFSRADLSRANLSGADLSGADLSGADLSDANLNRANLSRANLKRANLSDANLSSANLSG 974
Query: 190 TVLTRSDLGGAIIEGADFSD 209
L+R++L A + A+ D
Sbjct: 975 DNLSRANLSRANLSDANLGD 994
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A + N + S A++ ++ SG+ F+ A L +A A+ +GADL
Sbjct: 878 SGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADLSGADL 937
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + LN ANL+ A L R L+ ++L A + G + S A
Sbjct: 938 SGADLSDANLNRANLSRANLKRANLSDANLSSANLSGDNLSRA 980
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A F ADL RAN + AD+ +D SG+ + A L +A +AN A+LS
Sbjct: 913 SGANFSRADLS---------RANLSGADLSGADLSGADLSDANLNRANLSRANLKRANLS 963
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
D + L+ NL+ A L R L+ ++LG + + +
Sbjct: 964 DANLSSANLSGDNLSRANLSRANLSDANLGDEFFKAIHWDE 1004
>gi|418939072|ref|ZP_13492497.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054219|gb|EHS50602.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 202
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 11/102 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A ADLR A NF AN SAD++ +D + + GA L A +AN TGA
Sbjct: 63 TGANLTGADLRWADCDGANFTGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGA-- 120
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+L EA L A L++ + +++L G + GAD +D
Sbjct: 121 --------ILKEARLDKASLIQAIKQKANLQGVDLSGADLTD 154
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 50/106 (47%), Gaps = 9/106 (8%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
F ADL E R A + +DF+G+ GA L A ANFTGA+L +
Sbjct: 42 FAGADL-------EQVR--LAGASLEGADFTGANLTGADLRWADCDGANFTGANLKSADL 92
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
L ANLT A L LT ++L GAI++ A A + A KQ
Sbjct: 93 QHTDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQ 138
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 16/110 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 162
+ A SADL+ N AN T A++ E++ +G+ A L+K A+ KAN
Sbjct: 83 TGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQKANL 142
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
G DLS A+LT+ L R T +L GAI++GA + A++
Sbjct: 143 QGVDLSG----------ADLTDMNLSRVDFTGVNLKGAILKGAILTGAIL 182
>gi|254513085|ref|ZP_05125151.1| Pentapeptide repeat protein [Rhodobacteraceae bacterium KLH11]
gi|221533084|gb|EEE36079.1| Pentapeptide repeat protein [Rhodobacteraceae bacterium KLH11]
Length = 353
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 47/84 (55%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN SA + +++F S F+ A L A+ +F+GA L+ R +++ +NA L
Sbjct: 74 RANLISATLSKANFKHSNFDSADLTCAICKDTDFSGASLTTVNAPRADFEKSDFSNAFLF 133
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
+L RS+L GA GA+ SDA +
Sbjct: 134 GALLQRSNLSGASFFGANLSDAYL 157
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 58/113 (51%), Gaps = 1/113 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S + A+L A K NF+ +NF SAD+ + + F+GA L A +A+F +D
Sbjct: 68 SNSSLARANLISATLSKANFKHSNFDSADLTCAICKDTDFSGASLTTVNAPRADFEKSDF 127
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
S+ + +L +NL+ A L+ + L G+I++ F +++ Q ++L
Sbjct: 128 SNAFLFGALLQRSNLSGASFFGANLSDAYLAGSIMKETIFERTIMNGIQAKSL 180
>gi|220906761|ref|YP_002482072.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863372|gb|ACL43711.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 190
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/93 (40%), Positives = 48/93 (51%), Gaps = 6/93 (6%)
Query: 138 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
R D SG+ GA L +A AN +GA+L D L+ L ANLTNA L + R+DL
Sbjct: 39 RGCDLSGADLRGAILTRADLRGANLSGANLQDALLLLTDLRGANLTNANLTAAYMNRTDL 98
Query: 198 GGAIIEGADFSDAVIDLAQKQALCKYAN--GTN 228
A + GA DA + + L K AN GTN
Sbjct: 99 REANLSGATLVDAGL----RNTLFKGANLQGTN 127
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 9/93 (9%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
ADLR A+ T AD+R ++ SG+ A L AN T A+L+ M+R
Sbjct: 46 ADLRGAI---------LTRADLRGANLSGANLQDALLLLTDLRGANLTNANLTAAYMNRT 96
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
L EANL+ A LV L + GA ++G +F+
Sbjct: 97 DLREANLSGATLVDAGLRNTLFKGANLQGTNFA 129
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 6/95 (6%)
Query: 117 DLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DLR A N A + + D+RE++ SG+ A L + AN G + + + +
Sbjct: 77 DLRGANLTNANLTAAYMNRTDLREANLSGATLVDAGLRNTLFKGANLQGTNFAGSDLSYA 136
Query: 176 VLNEANLTNA-----VLVRTVLTRSDLGGAIIEGA 205
L + NLTNA L+ T L +++L GA +EGA
Sbjct: 137 DLRDTNLTNANLTATNLLFTRLNKTNLQGANLEGA 171
>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 371
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 67/136 (49%), Gaps = 11/136 (8%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMR 138
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 139 ES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 193
E+ FSG+ +GAYL A KA+F A L+ + L EANL A L+ T
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADFHRASLAVANLIGANLTEANLREANLIDT--- 334
Query: 194 RSDLGGAIIEGADFSD 209
+L GA ++ A F +
Sbjct: 335 --NLSGATVKNAKFGE 348
>gi|443317576|ref|ZP_21046968.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442782825|gb|ELR92773.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 303
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 55/103 (53%), Gaps = 11/103 (10%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G DL +A+ V+ N R++ + ++ +++ + + G L +A +ANFT A+L
Sbjct: 99 ADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFTEANLRR 158
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ R L +ANL TR++L A + ADFSDA++
Sbjct: 159 VELQRAQLGKANL----------TRANLADARMLHADFSDAIL 191
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 68/144 (47%), Gaps = 16/144 (11%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFT 133
T L+ A++ + N S L+ +N ++A IG + A+LR+A NFT
Sbjct: 104 TDLSQAILVEANLNRSDLSGVNLHQANLTKASLIG--VELNRANLREA---------NFT 152
Query: 134 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEANLTNAVLV 188
A++R + ++ A L +A A AD SD ++ LN ANLT L
Sbjct: 153 EANLRRVELQRAQLGKANLTRANLADARMLHADFSDAILQETNLSGARLNRANLTRTDLT 212
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
L ++L GA + A+F++A++
Sbjct: 213 AANLKETNLLGADLSYANFTEALL 236
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 2/119 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G+ DL+ A + RAN + E+D SG+ L A KAN +GA+L+
Sbjct: 34 ANLGNFDLKGANLSGADLTRANCIGVILSEADLSGATLVRTDLSGADINKANLSGANLTK 93
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGT 227
+ L E +L+ A+LV L RSDL G + A+ + A +I + +A + AN T
Sbjct: 94 ANLLGADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFT 152
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 49/84 (58%), Gaps = 5/84 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R + T+A+++E++ G+ + A +A+ +AN +GADLS + + +LT L
Sbjct: 208 RTDLTAANLKETNLLGADLSYANFTEALLAEANLSGADLSYANLAGL-----DLTGLNLA 262
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
T LT+++L GA + A+ +AV+
Sbjct: 263 GTNLTQANLAGANLTEANLEEAVL 286
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 32/59 (54%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
AN + AD+ ++ +G G L +AN GA+L++ ++ VL EANLT A +
Sbjct: 238 EANLSGADLSYANLAGLDLTGLNLAGTNLTQANLAGANLTEANLEEAVLTEANLTQATM 296
>gi|298246994|ref|ZP_06970799.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297549653|gb|EFH83519.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 285
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 45/82 (54%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ NF + +R SDF+G+ G+ + +ANF GA+L+D + L +AN ++LV
Sbjct: 100 KGNFKGSVLRGSDFTGADVTGSSFRGSDVREANFAGANLTDCDFSTLDLVDANFRESILV 159
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
RT ++S L GA G +D
Sbjct: 160 RTNFSKSGLVGAQFIGVTLTDV 181
>gi|291570908|dbj|BAI93180.1| TPR domain protein [Arthrospira platensis NIES-39]
Length = 256
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 51/100 (51%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
D+R+ + KE N T+A + +D SG+ GA L A +AN TGA+L+ +
Sbjct: 28 DIRQLLSTKECENCNLTNAGLVLADLSGANLTGANLTGANLSRANLTGANLTGANLTGAS 87
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L ANLT A L L SDL GA + A DA I AQ
Sbjct: 88 LFGANLTGANLTGANLAGSDLRGAYLANAIAVDANITEAQ 127
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T A++ ++ +G+ GA L A + AN TGA+L+ + L A L NA+ V
Sbjct: 61 ANLTGANLSRANLTGANLTGANLTGASLFGANLTGANLTGANLAGSDLRGAYLANAIAVD 120
Query: 190 TVLTRSDLGGAI 201
+T + L G +
Sbjct: 121 ANITEAQLIGVV 132
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
F AN T A++ ++ +GS GAYL A+A AN T A L
Sbjct: 89 FGANLTGANLTGANLAGSDLRGAYLANAIAVDANITEAQL 128
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 38/72 (52%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN T A++ ++ +G+ GA L A AN G+DL + + +AN+T A L+
Sbjct: 70 RANLTGANLTGANLTGASLFGANLTGANLTGANLAGSDLRGAYLANAIAVDANITEAQLI 129
Query: 189 RTVLTRSDLGGA 200
V +++G A
Sbjct: 130 GVVGLPTNIGNA 141
>gi|428210339|ref|YP_007094692.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428012260|gb|AFY90823.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 164
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 57/115 (49%), Gaps = 12/115 (10%)
Query: 107 IGSAAQFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
I S A+ A+L K V + E AN T A + +D S + A L +A+ KAN
Sbjct: 36 ILSKAELAGANLNGANLKGVKLSE---ANLTGATLWRTDLSNATLYKAILSRAILIKANL 92
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA---IIE--GADFSDAVI 212
+ DL DTL++R L NLT A L LT +DL A ++E G D S A I
Sbjct: 93 SSVDLRDTLLNRADLRLTNLTGANLSGANLTGTDLRYAQLKLVELTGVDLSQACI 147
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 61/122 (50%), Gaps = 14/122 (11%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAY 158
+T E F DLR+ V + E + + A + +++ +G+ NGA L+
Sbjct: 3 VDTLLELYTAGKRDFSCFDLRR-VDLSE---IDLSGAILSKAELAGANLNGANLKGVKLS 58
Query: 159 KANFTGA-----DLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+AN TGA DLS+ ++ R +L +ANL++ L T+L R+DL + GA+ S
Sbjct: 59 EANLTGATLWRTDLSNATLYKAILSRAILIKANLSSVDLRDTLLNRADLRLTNLTGANLS 118
Query: 209 DA 210
A
Sbjct: 119 GA 120
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 15/97 (15%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT----- 183
+ +F+ D+R D S +GA L KA AN GA+L + L+EANLT
Sbjct: 14 KRDFSCFDLRRVDLSEIDLSGAILSKAELAGANLNGANLKG-----VKLSEANLTGATLW 68
Query: 184 -----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
NA L + +L+R+ L A + D D +++ A
Sbjct: 69 RTDLSNATLYKAILSRAILIKANLSSVDLRDTLLNRA 105
>gi|67459256|ref|YP_246880.1| hypothetical protein RF_0864 [Rickettsia felis URRWXCal2]
gi|67004789|gb|AAY61715.1| Uncharacterized low-complexity protein [Rickettsia felis URRWXCal2]
Length = 959
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 12/121 (9%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 224
+ + EAN NA++ R LT++D A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAQEANFKNAIMQRADLTKADFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 225 N 225
N
Sbjct: 670 N 670
>gi|218439263|ref|YP_002377592.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218171991|gb|ACK70724.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 294
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 76/157 (48%), Gaps = 18/157 (11%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLE 153
NKY+A R + + DLR NF+ A+ + A++RE + SG+ A+L
Sbjct: 12 NKYDAGERDFCNL----ELRRIDLRGLNLSHANFKGADLSYANLREINLSGADLREAFLN 67
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIE 203
+A AN GA+L T + + L + NL T A L ++ LT+++L GA +
Sbjct: 68 EADLTGANLQGANLEGTYLIKAYLMKTNLQEANLSKAYLTGAYLSKSNLTKANLSGAYLN 127
Query: 204 GADFSDA-VIDLAQKQALCKYANGTNPITGVSTRKSL 239
GA S A + D++ + + + P+ V T+K L
Sbjct: 128 GAKLSGADLTDISYDE--TTHFDVNFPLNKVETKKEL 162
>gi|443477350|ref|ZP_21067204.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443017546|gb|ELS31963.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 670
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 23/151 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A SA+L+ A V N R AN A++ +S+ + N A LE A A+ A+L
Sbjct: 521 SEADLNSANLKGANLVLTNLRKANLVKANLSDSNLGAANLNDAILEGADLSAADLRSAEL 580
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------------- 212
+ T + L+ ANLT A LV + GA + GA+F +A++
Sbjct: 581 NLTNLSNANLSSANLTAAKLVLI-----EFAGANLNGANFRNAIVENIGSIESADFTNAV 635
Query: 213 --DLAQKQALCKYANGTNPITGVSTRKSLGC 241
D ++ C A+G +G ST+ +L C
Sbjct: 636 NLDPIVRKYFCSLASGNVADSGNSTKSTLNC 666
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ + +K A L+KA K N + ADL+ + L NL A LV+
Sbjct: 488 ANLSQANLLRVNLFQAKLGSANLQKAELMKTNLSEADLNSANLKGANLVLTNLRKANLVK 547
Query: 190 TVLTRSDLGGA-----IIEGADFSDA 210
L+ S+LG A I+EGAD S A
Sbjct: 548 ANLSDSNLGAANLNDAILEGADLSAA 573
>gi|334121293|ref|ZP_08495365.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333455228|gb|EGK83883.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 299
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 10/120 (8%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYL 152
LN YE R +F + A A+L A+ + N RAN + A++ + + + GA L
Sbjct: 7 LNNYEKGHR-DF---TGADLSGANLSGAILIGVNLSRANLSGANLSRAFLTKATLQGAVL 62
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ N + A + +T + L +ANL+ A LV+ L R+ L GA + GA+ AV+
Sbjct: 63 QRT-----NLSFAKMGETQLSGADLTKANLSGAFLVKAKLPRAKLSGATLTGANLRGAVL 117
>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 443
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 59/124 (47%), Gaps = 16/124 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVA 157
++ +F ADLR+A V N N + AD+ +D SG+ +GAY A
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378
Query: 158 YKANFTGADLS-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
AN GADLS D + L A+L+ A L+ ++L GA + GAD +D I
Sbjct: 379 SDANLQGADLSGAYFYDADLSGANLQGADLSGAYFYDADLSGANLQGANLNGADLTDTYI 438
Query: 213 DLAQ 216
D A+
Sbjct: 439 DRAK 442
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 5/91 (5%)
Query: 130 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN AD+R +SDF+ + GA L A AN GADLS+ + LN L
Sbjct: 171 ANLARADLRGTKLNQSDFTNANLAGADLRDADLTNANLAGADLSNADLTNANLNSVQLVK 230
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
A L+ L +DL A + GA DA I+ A
Sbjct: 231 AQLINARLVDTDLRKANLNGAYLIDANINRA 261
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 56/122 (45%), Gaps = 13/122 (10%)
Query: 109 SAAQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN----- 161
S +ADL A ++E F NF A++ DFSG NG L A AN
Sbjct: 264 SGTNLSNADLTSA-KLRETFPSNTNFCGANLSGIDFSGFILNGINLRWAKLIGANLTSTK 322
Query: 162 FTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
F GADL + +D + + ANL+ L L+ +DL GA + GA F DA + A
Sbjct: 323 FIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADLSDAN 382
Query: 217 KQ 218
Q
Sbjct: 383 LQ 384
>gi|424801888|ref|ZP_18227430.1| FIG01055523: hypothetical protein [Cronobacter sakazakii 696]
gi|423237609|emb|CCK09300.1| FIG01055523: hypothetical protein [Cronobacter sakazakii 696]
Length = 846
Score = 49.7 bits (117), Expect = 0.002, Method: Composition-based stats.
Identities = 54/188 (28%), Positives = 85/188 (45%), Gaps = 32/188 (17%)
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
+A+L N FV + L AA + + + + + N EA +F SA
Sbjct: 667 HARL-NKTTFVKSTLEAADFSDATLDSCSFVETNADEA------------RFISATWITC 713
Query: 122 VHVKENF--RANFTSADMRESDFS-----GSKFNGAYLE-----KAVAYKANFTGADLSD 169
E+ RA+FT A +R+S+ G++F A LE +A A+F A L
Sbjct: 714 AAASESTLNRADFTHATLRQSNLRQTALCGARFELAKLENTDLSEADCRGASFQRASLVG 773
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 229
+L R E + T+A L+ +L +S LGGA GA+ A DL+Q + NG
Sbjct: 774 SLFIRTDFREVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----SFTNGETR 826
Query: 230 ITGVSTRK 237
++G T++
Sbjct: 827 MSGAFTKR 834
Score = 37.7 bits (86), Expect = 5.5, Method: Composition-based stats.
Identities = 30/105 (28%), Positives = 44/105 (41%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTS-ADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 165
S A ADL NFR + A++ + F GA L A ++F+GA
Sbjct: 551 SKALLECADLSHCQLDGANFRGTMLARAELHHTSLRDCNFEGASLSLAQCCHSDFSGARF 610
Query: 166 ---DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
L +TL+D V ++A L + T TR A ++G F
Sbjct: 611 KDTQLQETLLDDCVFDDATLEGLLFRETWFTRCRFHRATLDGCVF 655
>gi|218442709|ref|YP_002381029.1| hypothetical protein PCC7424_5734 [Cyanothece sp. PCC 7424]
gi|218175067|gb|ACK73799.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 266
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 66/143 (46%), Gaps = 27/143 (18%)
Query: 121 AVHVKENFRANF-TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 179
AV K N F +A+++ +D G+ GAYL A + TGA+L D + L
Sbjct: 125 AVGPKANLNGAFLNTANLKNADLKGANLRGAYLSGA-----DLTGANLEDAALSGANLQG 179
Query: 180 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID-----------LAQ------KQALCK 222
A LT A L + L ++L GA + AD +DA ++ LAQ K LC
Sbjct: 180 ALLTGAYLRKARLIGAELQGADLRAADLTDANLEQLQNLAGADFTLAQGLTEDTKAMLCS 239
Query: 223 YAN---GT-NPITGVSTRKSLGC 241
GT NP T +T +SLGC
Sbjct: 240 RPAQELGTWNPFTRSNTAQSLGC 262
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 1/113 (0%)
Query: 117 DLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
+L KA+ +N +AN ++ + D S + + A L A + N GA+L + +
Sbjct: 7 ELTKALSEGKNLAKANLQGINLAQMDLSNADLSAANLIGANLSETNLKGANLEGADLRGV 66
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 228
L++ANL A L + L RS+L G ++ A A I LA+ + + G N
Sbjct: 67 NLSKANLEGANLQNSYLFRSNLEGCCLKEAQLQGAKIQLARYDSYTVWPEGYN 119
>gi|186683195|ref|YP_001866391.1| pentapeptide repeat-containing serine/threonine kinase [Nostoc
punctiforme PCC 73102]
gi|186465647|gb|ACC81448.1| serine/threonine protein kinase with pentapeptide repeats [Nostoc
punctiforme PCC 73102]
Length = 534
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 14/107 (13%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G+ +A Q G D A+H N + +++ +D SG+ F+ L+K N GA
Sbjct: 398 GLLTAYQKGRRDF--ALH-------NLSLLNLQGADLSGTNFHSTQLQKT-----NLQGA 443
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+L ++ R L++ANL +A L + +DL GA + GAD S+A +
Sbjct: 444 NLHNSDFGRASLSKANLKDANLTKAYFNHADLEGADLRGADLSNAYL 490
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 5/104 (4%)
Query: 123 HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 182
H + + N A++ SDF + + A L+ A KA F ADL L A+L
Sbjct: 431 HSTQLQKTNLQGANLHNSDFGRASLSKANLKDANLTKAYFNHADLEGA-----DLRGADL 485
Query: 183 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
+NA L L ++L GA + A SD + LA+ + NG
Sbjct: 486 SNAYLSNANLRGANLCGANLTSAKISDEQLALAKTNWMTIRPNG 529
>gi|189499236|ref|YP_001958706.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189494677|gb|ACE03225.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 442
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+ R ++ G+ FN A+++KA A+ TGA L +T + L ++NL+ L
Sbjct: 326 RASLVETVFRNANLQGADFNRAFMKKADLSGADLTGAQLRETRLQEADLKKSNLSKTNLY 385
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
T LT +DL GA + GA+ ++D A A +G TG + K
Sbjct: 386 DTDLTCADLRGADLTGANLLYTILDNALISAETITPSGEKATTGWAVLK 434
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A+ AD R A + F A+ D++++D SG+ GA L+ + A +A F ADL+ T
Sbjct: 83 AKLNGADFRNA----KLFSASLKRTDLKQTDLSGANLRGADLKNSYAKEAKFINADLTGT 138
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L A+LT AVL + ++L A + G + + A +
Sbjct: 139 DFRYANLEGADLTGAVLENALFFDANLSSADLRGVNLTGAKM 180
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 10/85 (11%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANLTN 184
A ADM++ D + S NGA L+ A +F+ +DLS T R L E ANL
Sbjct: 287 AGLKGADMKKLDMTSSTMNGAKLDHA-----DFSESDLSSTSWKRASLVETVFRNANLQG 341
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSD 209
A R + ++DL GA + GA +
Sbjct: 342 ADFNRAFMKKADLSGADLTGAQLRE 366
>gi|428313439|ref|YP_007124416.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255051|gb|AFZ21010.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 167
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 51/93 (54%), Gaps = 10/93 (10%)
Query: 128 FRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKA-----NFTGADLSDTLMDRMVL 177
++AN + AD+R++ F+ G++ GA L +A KA N GA L+ T + L
Sbjct: 58 YQANLSKADLRQTIFNEAILHGAELTGANLHRASLIKADLCEANLKGASLTHTNLGAAKL 117
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ ANL NA L L ++DL A +EGAD S A
Sbjct: 118 SGANLNNANLTWANLRKADLKNANLEGADLSGA 150
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 11/101 (10%)
Query: 119 RKAVHVKENF-RANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADL 167
R+ + + NF RAN +D+R+ ++ S +GA L + Y+AN + ADL
Sbjct: 8 RRYLAGERNFHRANLNGSDLRKIPLMRADLLKANLHNSNLSGANLTRVNLYQANLSKADL 67
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
T+ + +L+ A LT A L R L ++DL A ++GA +
Sbjct: 68 RQTIFNEAILHGAELTGANLHRASLIKADLCEANLKGASLT 108
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 7/112 (6%)
Query: 116 ADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL KA +H AN T ++ +++ S + +A+ + A TGA+L + +
Sbjct: 35 ADLLKANLHNSNLSGANLTRVNLYQANLSKADLRQTIFNEAILHGAELTGANLHRASLIK 94
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
L EANL A LT ++LG A + GA+ ++A + A ++A K AN
Sbjct: 95 ADLCEANLKGA-----SLTHTNLGAAKLSGANLNNANLTWANLRKADLKNAN 141
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF A++ SD A L KA + +N +GA+L+ R+ L +ANL+ A L +T
Sbjct: 16 NFHRANLNGSDLRKIPLMRADLLKANLHNSNLSGANLT-----RVNLYQANLSKADLRQT 70
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
+ + L GA + GA+ A +
Sbjct: 71 IFNEAILHGAELTGANLHRASL 92
>gi|428299412|ref|YP_007137718.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235956|gb|AFZ01746.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 677
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 47/85 (55%), Gaps = 11/85 (12%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-DRMVLNEANLTNAVLV 188
ANFT A++ E+ FSG+K GA +F GA L T M + L+E+NL NA L
Sbjct: 561 ANFTHANLTEAQFSGAKLVGA----------DFHGAILIATKMKNDTNLDESNLYNANLH 610
Query: 189 RTVLTRSDLGGAIIEGADFSDAVID 213
R + T + GA + GAD S A +D
Sbjct: 611 RAIFTNVTMRGADLFGADLSRATLD 635
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 15/96 (15%)
Query: 131 NFTSADMRESDFSGSKFNG-----AYLEKAVAYKA----------NFTGADLSDTLMDRM 175
NF+ ++ DFSG+ NG A L KA KA N GA+LS +
Sbjct: 456 NFSGQNLIGQDFSGNNLNGRNFSNANLSKANLNKASLINADLSNANLEGANLSHADLSGA 515
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
L+ NL A+L+ +L R DL A + GA+ + A+
Sbjct: 516 NLSNVNLVGAILIEAILNRVDLCNANLNGANLTLAL 551
Score = 37.4 bits (85), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 54/117 (46%), Gaps = 22/117 (18%)
Query: 113 FGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F +A+L KA N +A+ +AD+ ++ G+ + A L A N GA L + +
Sbjct: 477 FSNANLSKA-----NLNKASLINADLSNANLEGANLSHADLSGANLSNVNLVGAILIEAI 531
Query: 172 MDRM-----VLNEANLT-----------NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++R+ LN ANLT NA LT + GA + GADF A++
Sbjct: 532 LNRVDLCNANLNGANLTLALFRDEPDLCNANFTHANLTEAQFSGAKLVGADFHGAIL 588
>gi|298491495|ref|YP_003721672.1| serine/threonine protein kinase ['Nostoc azollae' 0708]
gi|298233413|gb|ADI64549.1| serine/threonine protein kinase ['Nostoc azollae' 0708]
Length = 533
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + ++ +D SG+ F+ A L K N GA+L +T R L + NL +A L +
Sbjct: 413 NISLLNLEGADLSGTNFHSAQLRKT-----NLQGANLENTDFGRASLMQTNLRDANLTKA 467
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
L+ +DL GA + GAD S A I A
Sbjct: 468 YLSHADLEGADLRGADLSYAYISQA 492
>gi|220909908|ref|YP_002485219.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanothece sp. PCC 7425]
gi|219866519|gb|ACL46858.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanothece sp. PCC 7425]
Length = 526
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 60/116 (51%), Gaps = 13/116 (11%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA- 160
+G+ G+ S + +A L + R +FT+ D+R AYL +AV ++A
Sbjct: 383 KGQTGVASKTKLDAAKL---IEAYRKGRRDFTNQDLRSL-----VLRKAYLAEAVFHQAQ 434
Query: 161 ----NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ GA+L + + R L +ANL +A L + L+ +DL GA + GA+ SDA +
Sbjct: 435 LNNTDLQGANLFNANLGRASLTKANLRDANLQKAYLSYADLAGADLRGANLSDAYL 490
>gi|414077930|ref|YP_006997248.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413971346|gb|AFW95435.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 189
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 43/89 (48%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N D +D G+ A L A KAN GA L+ + ++L A+LT A L
Sbjct: 31 NLGGVDFGRADLRGANLTAASLSGANLSKANLQGAILARAHLSEVILCGADLTQATLTTA 90
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
L SDL GA++ GA+ DA + +A A
Sbjct: 91 HLNESDLSGALLSGANLCDANLHMASISA 119
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 55/113 (48%), Gaps = 21/113 (18%)
Query: 109 SAAQFGSADLRKAV----HVKENF-------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
S A A+L+ A+ H+ E +A T+A + ESD SG+ +GA L A
Sbjct: 53 SGANLSKANLQGAILARAHLSEVILCGADLTQATLTTAHLNESDLSGALLSGANLCDANL 112
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ A+ + A+L ANL+ A + + ++DL GA + GAD S+A
Sbjct: 113 HMASISAANLQG----------ANLSGAKMGGVRMWKADLQGADLSGADLSEA 155
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 1/95 (1%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A +DL A+ N AN A + ++ G+ +GA + +KA+ GADL
Sbjct: 88 TTAHLNESDLSGALLSGANLCDANLHMASISAANLQGANLSGAKMGGVRMWKADLQGADL 147
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
S + L E NLT A L T ++ + L GAI+
Sbjct: 148 SGADLSEANLCEVNLTGANLDDTDMSETFLTGAIM 182
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 47/98 (47%), Gaps = 9/98 (9%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
FG ADLR A N T+A + ++ S + GA L +A + GADL+ +
Sbjct: 37 FGRADLRGA---------NLTAASLSGANLSKANLQGAILARAHLSEVILCGADLTQATL 87
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LNE++L+ A+L L ++L A I A+ A
Sbjct: 88 TTAHLNESDLSGALLSGANLCDANLHMASISAANLQGA 125
>gi|419963472|ref|ZP_14479445.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
gi|432333027|ref|ZP_19584842.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
IFP 2016]
gi|414571123|gb|EKT81843.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
gi|430780078|gb|ELB95186.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
IFP 2016]
Length = 201
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 16/131 (12%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRA-NFTSADMRESDFS-----GSKFNGAYL 152
+E R E I + F ADL ++ HV FR+ +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 153 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 202
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 203 EGADFSDAVID 213
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|300863652|ref|ZP_07108591.1| conserved exported hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338360|emb|CBN53735.1| conserved exported hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 329
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 47/93 (50%), Gaps = 6/93 (6%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADLR A K N AN A+++++D G+ GAYL ++ +AN +G
Sbjct: 149 ANLQGADLRGANLYKTNLTTTNLTEANLLYANLQQADLRGTNLQGAYLVRSHLQRANLSG 208
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
ADLS + L EANLT A L+ L DL
Sbjct: 209 ADLSGADLGGAYLTEANLTRANLIGAKLNLIDL 241
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 12/115 (10%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR + R++ A++ +D SG+ GAYL +A +AN GA L+
Sbjct: 179 ANLQQADLRGTNLQGAYLVRSHLQRANLSGADLSGADLGGAYLTEANLTRANLIGAKLNL 238
Query: 170 TLMDR-----------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+D+ L A L+ A L+ LT ++L GA + GAD A +D
Sbjct: 239 IDLDKPSCINVCEVYPTQLQGAILSQASLIGADLTGANLSGADLRGADLRSANLD 293
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
D RK H N +AD+R ++ G+ GA L K N T A+L + +
Sbjct: 132 DRRKPNHT------NLQNADLRYANLQGADLRGANLYKTNLTTTNLTEANLLYANLQQAD 185
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L NL A LVR+ L R++L GA + GAD A +
Sbjct: 186 LRGTNLQGAYLVRSHLQRANLSGADLSGADLGGAYL 221
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 49/122 (40%), Gaps = 12/122 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSG-----------SKFNGAYLEKAV 156
S A ADL A + N RAN A + D ++ GA L +A
Sbjct: 207 SGADLSGADLGGAYLTEANLTRANLIGAKLNLIDLDKPSCINVCEVYPTQLQGAILSQAS 266
Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
A+ TGA+LS + L ANL AVL L+ + L G + G D A +
Sbjct: 267 LIGADLTGANLSGADLRGADLRSANLDGAVLTNADLSFAALAGTSLSGTDLKGATLTNGM 326
Query: 217 KQ 218
+Q
Sbjct: 327 RQ 328
>gi|434387412|ref|YP_007098023.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428018402|gb|AFY94496.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 263
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 11/120 (9%)
Query: 105 FGIGSAA-QFGSADLRKAVHVKENF----------RANFTSADMRESDFSGSKFNGAYLE 153
FGI A + ADL+K + + R +AD++ + G+ +GA L
Sbjct: 31 FGIAPVALAYNPADLKKLIATNKCIGCDLSGADLSRQQLVNADLQAATLVGANLSGANLA 90
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A AN TGA+L+ T + VL A+L + T LTR+DL A + +F A+++
Sbjct: 91 SAKLGGANLTGANLTRTNLTGAVLQAASLIDVNFANTNLTRTDLSYANLVNTNFRSAILN 150
>gi|254264016|ref|ZP_04954881.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
gi|254215018|gb|EET04403.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
Length = 825
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T D+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 512 ADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 566
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 567 ARLTAANLSLAHCERTDFS 585
>gi|158334009|ref|YP_001515181.1| hypothetical protein AM1_0823 [Acaryochloris marina MBIC11017]
gi|158304250|gb|ABW25867.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 421
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 65/128 (50%), Gaps = 11/128 (8%)
Query: 99 AETRGEFGIGSA----AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
A RG + +GSA A SADL V++ + AN + A + ++ +K GA L
Sbjct: 273 ANLRGAY-LGSANLLGANLNSADL-IGVYLSD---ANLSHAKLVGANLRTAKLIGAQLAD 327
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
+ANFTGADLSD ++ +ANL RT +DL GA + GA F + +D
Sbjct: 328 TDLSEANFTGADLSDANLEGADFTDANLREVSFQRTQFREADLSGADLRGAIFLE--VDQ 385
Query: 215 AQKQALCK 222
++ LC+
Sbjct: 386 LEECKLCR 393
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 60/141 (42%), Gaps = 28/141 (19%)
Query: 99 AETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRE------------------ 139
A +G + IG+ ADLR+A + + AN + AD+ +
Sbjct: 208 ANFQGTYLIGT--NLREADLREANLRNADLLSANLSEADLTQANLSSANLLGTNLNSANF 265
Query: 140 --SDFSGSKFNGAYLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVL 192
+D +G+ GAYL A AN AD LSD + L ANL A L+ L
Sbjct: 266 QNADLTGANLRGAYLGSANLLGANLNSADLIGVYLSDANLSHAKLVGANLRTAKLIGAQL 325
Query: 193 TRSDLGGAIIEGADFSDAVID 213
+DL A GAD SDA ++
Sbjct: 326 ADTDLSEANFTGADLSDANLE 346
>gi|434394477|ref|YP_007129424.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428266318|gb|AFZ32264.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 132
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 5/88 (5%)
Query: 115 SADLRKAVHVKE----NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
S++L++ ++ K+ N R AN +A++ E++ SG+ GA L+ A KAN GA+L
Sbjct: 40 SSELQRLLNTKQCPGCNLRGANLRNANLEEANLSGANLQGANLQNADLEKANLQGANLQQ 99
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ L EANL NA L L +DL
Sbjct: 100 ANLSDADLQEANLQNANLQNANLRSADL 127
>gi|119509719|ref|ZP_01628864.1| hypothetical protein N9414_00180 [Nodularia spumigena CCY9414]
gi|119465585|gb|EAW46477.1| hypothetical protein N9414_00180 [Nodularia spumigena CCY9414]
Length = 212
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 15/100 (15%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
++ T+ D+ DFSGS + L A+ +GADLSDT M +++L ANL+ A L
Sbjct: 86 YKPPKTNPDLSGKDFSGSNLSNKDLSGRNLSYADLSGADLSDTFMHKVILRGANLSEANL 145
Query: 188 VR---------------TVLTRSDLGGAIIEGADFSDAVI 212
R + L +DL GA + GAD + A I
Sbjct: 146 FRANLLLADMREANLRSSYLIGADLSGADLRGADLTGARI 185
>gi|440756225|ref|ZP_20935426.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|440173447|gb|ELP52905.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 433
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 12/111 (10%)
Query: 109 SAAQFGSADLRKAV---------HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
S A ADL +A+ H+ + A+ + A++ E+D S + + A L +A+
Sbjct: 261 SEAILSEADLSEAILWTAKLSWAHL---WGADLSGANLSEADLSEADLSEADLSEAILRG 317
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
AN + ADLS + L +ANL A+L +L+ +DL GAI+ GAD S A
Sbjct: 318 ANLSEADLSWANLRGANLIQANLRGAILSWAILSGADLSGAILRGADLSGA 368
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 59/121 (48%), Gaps = 12/121 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL +A+ R AN + AD+ ++ G+ A L A+ A +GADL
Sbjct: 301 SEADLSEADLSEAI-----LRGANLSEADLSWANLRGANLIQANLRGAILSWAILSGADL 355
Query: 168 SDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 221
S + + L+EA+L A L +L+ + L GA +E A F DA I QKQ L
Sbjct: 356 SGAILRGADLSGADLSEADLRGAFLSEAILSGAILSGAKVENAIFIDATGITPEQKQDLI 415
Query: 222 K 222
+
Sbjct: 416 R 416
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A + E+D S + A L A + A+ +GA+LS+ + L+EA+L+ A+L
Sbjct: 258 ANLSEAILSEADLSEAILWTAKLSWAHLWGADLSGANLSEADLSEADLSEADLSEAILRG 317
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ +DL A + GA+ A
Sbjct: 318 ANLSEADLSWANLRGANLIQA 338
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 11/110 (10%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
T+ EF A A+L KA+ +R D SG+ A L A +A
Sbjct: 200 TKAEFTT-DAKVIEKAELIKAIR-----EGTIDETTLRFVDLSGAILIEADLSWANLSEA 253
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ +GA+LS+ +L+EA+L+ A+L L+ + L GA + GA+ S+A
Sbjct: 254 DLSGANLSEA-----ILSEADLSEAILWTAKLSWAHLWGADLSGANLSEA 298
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
A+ + A++ E+D SG+ + A L +A +A A LS + L+ ANL+ A L
Sbjct: 241 IEADLSWANLSEADLSGANLSEAILSEADLSEAILWTAKLSWAHLWGADLSGANLSEADL 300
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
L+ +DL AI+ GA+ S+A
Sbjct: 301 SEADLSEADLSEAILRGANLSEA 323
>gi|428311473|ref|YP_007122450.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253085|gb|AFZ19044.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 580
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 45/79 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T A++R+ + +G+ +GA L ANFTGA+L + +L+ AN T ++L R
Sbjct: 25 ANLTGANLRKINLTGANLSGANLSWCCFSHANFTGANLHQANLHSAILDNANFTQSILSR 84
Query: 190 TVLTRSDLGGAIIEGADFS 208
L++ DL A + AD +
Sbjct: 85 AKLSKVDLRLANLREADLN 103
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
AN ++ ++F+G+ A LE+A +A G +L++ ++ + L ANL A L
Sbjct: 143 MEANLCRTNLIATNFTGANLREANLEQANLQEATLVGVNLTEANLNNVYLRGANLRQADL 202
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
R +LT +D+ A EGAD S A
Sbjct: 203 HRAILTGADMSEANCEGADLSRA 225
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A F A+LR+A + N + A ++ E++ + GA L +A ++A TGAD+S
Sbjct: 154 ATNFTGANLREANLEQANLQEATLVGVNLTEANLNNVYLRGANLRQADLHRAILTGADMS 213
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
+ + L+ ANLT A L+R L ++DL A+++
Sbjct: 214 EANCEGADLSRANLTGAYLLRASLRKADLLRAVLQ 248
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 16/101 (15%)
Query: 116 ADLRKA-VHVKENFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+LR+A +H RA T ADM E+ D S + GAYL +A KA+ A L +
Sbjct: 195 ANLRQADLH-----RAILTGADMSEANCEGADLSRANLTGAYLLRASLRKADLLRAVLQE 249
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R L+EANL A L ++DL GA ++ S+A
Sbjct: 250 VYLLRTDLSEANLRGA-----DLRKADLSGAYLKDTLLSEA 285
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F + L +A K + R AN AD+ +D S S +GA L+ + N A+L+
Sbjct: 75 ANFTQSILSRAKLSKVDLRLANLREADLNWADLSASNLSGADLQNTQLDQINLEHANLNH 134
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L+ L EANL L+ T T ++L A +E A+ +A +
Sbjct: 135 ALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATL 177
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 54/104 (51%), Gaps = 7/104 (6%)
Query: 111 AQFGSADLRKAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
A ADL +AV ++E + R + + A++R +D + +GAYL+ + +AN +GA L
Sbjct: 235 ASLRKADLLRAV-LQEVYLLRTDLSEANLRGADLRKADLSGAYLKDTLLSEANLSGAYLL 293
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA----IIEGADFS 208
++ + R L+ A LT + + L DL + G D+S
Sbjct: 294 ESYLIRTKLDRAELTGCCIHQWHLEEVDLSYVECRYVFTGFDYS 337
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------MVLNEA 180
N A++ + G++ A L + NFTGA+L + +++ + L EA
Sbjct: 126 NLEHANLNHALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATLVGVNLTEA 185
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
NL N L L ++DL AI+ GAD S+A
Sbjct: 186 NLNNVYLRGANLRQADLHRAILTGADMSEA 215
>gi|298250682|ref|ZP_06974486.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297548686|gb|EFH82553.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 287
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ NF + +R SDF+G+ G+ + +A F GA+L+D + L +AN A+LV
Sbjct: 100 KGNFKGSALRGSDFTGADLTGSSFRGSDVREATFAGANLTDCDFSTLDLTDANFREAILV 159
Query: 189 RTVLTRSDLGGAIIEGADFSDA 210
RT +S L GA G +D
Sbjct: 160 RTNFNKSGLVGAKFIGVTLTDV 181
>gi|409994207|ref|ZP_11277325.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
Paraca]
gi|409934955|gb|EKN76501.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
Paraca]
Length = 519
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 50/92 (54%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN---------- 178
+ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 34 QANFTEAILSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLS 93
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ANL +A L+R L R++L A++ GA+ ++A
Sbjct: 94 QANLIDASLIRAELMRAELSEAVVNGANLTEA 125
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 63/129 (48%), Gaps = 13/129 (10%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
TR + + S A A+L +AV +V RA+ + A++ ++ ++ A L +AV
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLIDASLIRAELMRAELSEAVV 117
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNA----------VLVRTVLTRSDLGGAIIEGADF 207
AN T ADL + + L +ANL+ A L R+ LTRSDL A + G +
Sbjct: 118 NGANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNL 177
Query: 208 SDAVIDLAQ 216
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAILSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 201 IIEGADFSDAVIDLA 215
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 1/96 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L +A + N R+N T +D+ +D G A L +A A+ GA+LS
Sbjct: 140 ANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNLRNAELRQAELSGADLRGANLSG 199
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
+ L+ ANL+ A L T L+ + L GA + GA
Sbjct: 200 ANLRWANLSGANLSGANLEATQLSGASLRGANLSGA 235
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
AN T AD+RE ++ + +GA L +A +N ++L+ + + R L NL N
Sbjct: 120 ANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNLRN 179
Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
A L + L+ +DL GA + GA+
Sbjct: 180 AELRQAELSGADLRGANLSGANL 202
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 115 SADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+A+LR+A + R AN + A++R ++ SG+ +GA LE A+ GA+LS +
Sbjct: 179 NAELRQAELSGADLRGANLSGANLRWANLSGANLSGANLEATQLSGASLRGANLSGASLL 238
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
A+LT A L+ T +DL G+ + G
Sbjct: 239 NCTAIHADLTQANLIECDWTDADLRGSALTG 269
>gi|409912856|ref|YP_006891321.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
KN400]
gi|298506440|gb|ADI85163.1| pentapeptide repeat domain protein [Geobacter sulfurreducens KN400]
Length = 259
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A AD+RK V+V+ + NF+ A++ ++FSG+K A L AV NF+ ADLS T
Sbjct: 122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ + L AN A T+L + L GA GAD
Sbjct: 178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADFTGADL 214
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 14/100 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ AQ A L +A+ F +ADMR + SG AY+ A AN +GAD+
Sbjct: 85 TGAQMDGASLDEAI---------FDTADMRSAHCSG-----AYIHHAKFVGANLSGADMR 130
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+++ ++ANLTNA L ++LGGA++ G +FS
Sbjct: 131 KVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFS 170
Score = 37.0 bits (84), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 52/117 (44%), Gaps = 22/117 (18%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
G I +A F + +A + E +F SAD++ D G K + ++NF
Sbjct: 12 GLLSIATAHAFDPLVIERAKSLGECEHCDFVSADLKGVDLKGIKLD----------ESNF 61
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
TGADLS +D E N T A +L GA ++GA +A+ D A ++
Sbjct: 62 TGADLSAAAIDD--CGECNFTGA----------NLTGAQMDGASLDEAIFDTADMRS 106
>gi|119490886|ref|ZP_01623169.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119453704|gb|EAW34863.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 517
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
I +AA ADLR+A + + R AN SA++R++ S G L A +A+ G
Sbjct: 115 AILTAANLSEADLREATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRADLRG 174
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L + + + L++ANL+ A L L +DL GA + GA+ A
Sbjct: 175 ANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGANLEQA 220
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 10/91 (10%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN-------- 181
ANF+ A + ++ SG+ +G L +A A +GA+LS + +LN AN
Sbjct: 35 ANFSQAVLSITNLSGANLSGTNLSQAKLNVAKLSGANLSGANLTGAILNVANLIRADLSH 94
Query: 182 --LTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L NA +R+ L R+DL AI+ A+ S+A
Sbjct: 95 ATLINASAIRSELIRADLSHAILTAANLSEA 125
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 21/121 (17%)
Query: 111 AQFGSADLRKAVHVKENFR----------------ANFTSADMRESDFSGSKFNGAYLEK 154
A SA+LR AV + N AN +A++R+++ S + +GA L+
Sbjct: 140 ANLKSANLRDAVLIASNLEGTNLHGADLTRADLRGANLVNAELRQANLSQANLSGANLKG 199
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEA-----NLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A A+ GADL +++ L+ A +L++A L+ T L +DL A + GAD++
Sbjct: 200 ANLRWADLNGADLRGANLEQARLSGASLYGADLSHASLLYTHLIHADLTQANLTGADWTG 259
Query: 210 A 210
A
Sbjct: 260 A 260
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 60/120 (50%), Gaps = 15/120 (12%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
ADL + A+ RG A +A+LR+A + N AN A++R +D +G+ GA
Sbjct: 165 ADLTR--ADLRG-------ANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGA 215
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LE+A A+ GADLS + L A+LT A L T +D GA + GA + A
Sbjct: 216 NLEQARLSGASLYGADLSHASLLYTHLIHADLTQANL-----TGADWTGAELTGAALTGA 270
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAV-HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L A+ +V RA+ + A + + S+ A L A+ AN + ADL
Sbjct: 68 SGANLSGANLTGAILNVANLIRADLSHATLINASAIRSELIRADLSHAILTAANLSEADL 127
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + ++ L +ANL +A L VL S+L G + GAD + A
Sbjct: 128 REATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRA 170
>gi|428227093|ref|YP_007111190.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986994|gb|AFY68138.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 225
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 15/147 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A ADLR A + NFR AN AD+R +D ++ G L A+ + N
Sbjct: 70 SYANLKRADLRGATLLGANFRGVNLEQANLCGADLRGADLRCAQMQGVQLRGALMHGVNL 129
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DL 214
GA+L+ + + LN A ++L R L + L A + G + +DA + DL
Sbjct: 130 VGANLAASELAGSNLNHARCMGSLLGRANLRGATLVKADLRGVELTDASLRSADLANADL 189
Query: 215 AQKQALCKYANGTNPITGVSTRKSLGC 241
+ + + N +TG + R++ C
Sbjct: 190 ERANLIGADLDRAN-LTGTNLRRAFVC 215
>gi|428301952|ref|YP_007140258.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238496|gb|AFZ04286.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 267
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 46/92 (50%), Gaps = 15/92 (16%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN A++ +D SG+ A LEKA N GA LS + + L ANL+NA L
Sbjct: 78 RANLEGANLSNADLSGTFLGEANLEKA-----NLQGAKLSQAFLYKANLEGANLSNAYLS 132
Query: 189 RTVLTRSDLGGA----------IIEGADFSDA 210
T LTR++L GA I+ AD DA
Sbjct: 133 GTALTRANLRGANLRKSVIFVSILSEADLQDA 164
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 56/105 (53%), Gaps = 9/105 (8%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A A+LRK+V F + + AD+++++ +K + LE+A +AN T A L +
Sbjct: 139 ANLRGANLRKSVI----FVSILSEADLQDANLMEAKLLSSNLERANLARANLTKAQLHNA 194
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
+L +ANLT A LV+ L ++ L A + AD + A++ A
Sbjct: 195 -----ILQDANLTQAKLVKAELNQASLARANLLNADLTGAILQQA 234
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 46/83 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ A + +++ G+ GA L A+ +AN GA+LS+ + L EANL A L
Sbjct: 49 ADLYGAKLSKANLQGANLQGAILNYALLGRANLEGANLSNADLSGTFLGEANLEKANLQG 108
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L+++ L A +EGA+ S+A +
Sbjct: 109 AKLSQAFLYKANLEGANLSNAYL 131
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 61/122 (50%), Gaps = 8/122 (6%)
Query: 95 NKYEAETRGEFGIGSA----AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNG 149
N A+ G F +G A A A L +A K N AN ++A + + + + G
Sbjct: 85 NLSNADLSGTF-LGEANLEKANLQGAKLSQAFLYKANLEGANLSNAYLSGTALTRANLRG 143
Query: 150 AYLEKAVAYKANFTGADLSD-TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
A L K+V + + + ADL D LM+ +L +NL A L R LT++ L AI++ A+ +
Sbjct: 144 ANLRKSVIFVSILSEADLQDANLMEAKLL-SSNLERANLARANLTKAQLHNAILQDANLT 202
Query: 209 DA 210
A
Sbjct: 203 QA 204
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 6/91 (6%)
Query: 123 HVKENFRANF-TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 181
HVK+ N S D+ +D G+K + A L+ A N GA L+ L+ R L AN
Sbjct: 31 HVKQLLNTNSCPSCDLSNADLYGAKLSKANLQGA-----NLQGAILNYALLGRANLEGAN 85
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L+NA L T L ++L A ++GA S A +
Sbjct: 86 LSNADLSGTFLGEANLEKANLQGAKLSQAFL 116
>gi|359459044|ref|ZP_09247607.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 256
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 46/78 (58%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
AD+R ++ +G+ A L K +AN +GA LS + V+ +A+L A+L++T + +
Sbjct: 63 ADLRGTNLAGANLQAANLMKTDFCQANLSGAILSGASLQDAVMTQADLNGAILIKTSMIQ 122
Query: 195 SDLGGAIIEGADFSDAVI 212
+ L GAI+ GA+ A I
Sbjct: 123 TRLRGAILRGANLKQARI 140
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 116 ADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADLR N +A N D +++ SG+ +GA L+ AV +A+ GA L T M +
Sbjct: 63 ADLRGTNLAGANLQAANLMKTDFCQANLSGAILSGASLQDAVMTQADLNGAILIKTSMIQ 122
Query: 175 M-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+L ANL A ++ + L +L ++E AD
Sbjct: 123 TRLRGAILRGANLKQARILGSFLEDVNLKKGVLEKADL 160
>gi|239816752|ref|YP_002945662.1| pentapeptide repeat-containing protein [Variovorax paradoxus S110]
gi|239803329|gb|ACS20396.1| pentapeptide repeat protein [Variovorax paradoxus S110]
Length = 866
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 44/129 (34%), Positives = 59/129 (45%), Gaps = 19/129 (14%)
Query: 116 ADLRKAVHVKENFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A L A +FRA + E+ +FSG + GA L A+F+GA L D
Sbjct: 517 AHLSDAAPPMPSFRAAKIRRRLAEAAPGARNFSGMRLVGADLSDMDLRGADFSGAALEDA 576
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRS----------DLGGAIIEGADFSDAVIDLAQ---- 216
+D L++AN AVL R L+R+ +LGGA E ADFS A + A
Sbjct: 577 NLDNAQLSDANFNGAVLARARLSRTSLASATFRNANLGGAHCEFADFSGADLSSANCEKT 636
Query: 217 KQALCKYAN 225
+ A C AN
Sbjct: 637 RFASCSMAN 645
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 22/144 (15%)
Query: 111 AQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A F ADL A K F + FT+++M DF GS ++ +L K
Sbjct: 621 ADFSGADLSSANCEKTRFASCSMANTVLDQTRFTASEMSHCDFRGSDWHQVFLTKLRMSG 680
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
F GA + L + NA LVR SD ++ DFSDA +D
Sbjct: 681 MAFDGASFQQVVWLECTLADVRFANASLVRCSFVTSDCSRSV----DFSDARLD------ 730
Query: 220 LCKYANGTNPITGVSTRKSLG-CG 242
C +A+G+ V R +L CG
Sbjct: 731 ACSFAHGSTLAGAVLRRAALKQCG 754
Score = 44.7 bits (104), Expect = 0.044, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 11/110 (10%)
Query: 110 AAQFGSADLRKAVHVKENFRAN-FTSADMRES-----DFSGSKFNGAYLEKAVAYKANFT 163
+ A LR+A + R AD+RE+ DFS GA LE+ VA ++ F
Sbjct: 737 GSTLAGAVLRRAALKQCGLRTTPLQQADLREARLDNCDFSECALQGAKLERLVAGESLFV 796
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
ADL+ L ANL +A + V ++DL GA + D S ++ID
Sbjct: 797 RADLTGA-----SLRGANLIDANFSKAVFVQADLSGANLFRTDVSQSLID 841
Score = 37.4 bits (85), Expect = 8.3, Method: Composition-based stats.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 1/100 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L A NF A A + + + + F A L A A+F+GADL
Sbjct: 569 SGAALEDANLDNAQLSDANFNGAVLARARLSRTSLASATFRNANLGGAHCEFADFSGADL 628
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
S ++ ++ N VL +T T S++ G+D+
Sbjct: 629 SSANCEKTRFASCSMANTVLDQTRFTASEMSHCDFRGSDW 668
>gi|359727541|ref|ZP_09266237.1| hypothetical protein Lwei2_11644 [Leptospira weilii str.
2006001855]
Length = 263
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 47/92 (51%), Gaps = 4/92 (4%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N +S + + F G F+GA L A ++F GA+ + LN A+L NA
Sbjct: 151 NLSSIILEKLKFDGVNFSGANLGHAFLQNSSFVGANFEGAKLRGSFLNNADLRNANFRGA 210
Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
L + L GA +EGADF+DA+ D L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|378582929|ref|ZP_09831540.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
DC283]
gi|377814439|gb|EHT97579.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
DC283]
Length = 375
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 6/105 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 162
S A +ADL++A N A+ T+A++ ++D +GA L A +AY +A+
Sbjct: 250 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 309
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+ A+LS+ + R L++ANL++A L L R+DL AI++GA+
Sbjct: 310 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 354
Score = 43.5 bits (101), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGADLSDTLMDRMVLNEANLT 183
AN T A + E+D S + +GA L A + N +GA+L+ + L+EA+L+
Sbjct: 191 HANLTMAYLSEADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLS 250
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
NA L L R+DL A + GAD ++A
Sbjct: 251 NANLSNADLKRADLSNANLSGADLTNA 277
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM----------VLN 178
AN T A + E+D S + + A L++A AN +GADL++ +++ L
Sbjct: 236 HANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLA 295
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ANLT A L L+ ++L A ++ AD SDA
Sbjct: 296 HANLTMAYLSEADLSNANLSNADLKRADLSDA 327
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+++ + S + GAYL A N + ADLSD + L+ ANL +A L L+ +
Sbjct: 148 NLKGVNLSDTDLKGAYLSDA-----NLSDADLSDANLSDANLSGANLAHANLTMAYLSEA 202
Query: 196 DLGGAIIEGADFSDA 210
DL A + GAD ++A
Sbjct: 203 DLSNANLSGADLTNA 217
>gi|300869593|ref|ZP_07114173.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300332371|emb|CBN59373.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 214
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
E AN +D+ ++ S +K N A L KA N +G DL M +L EANL A
Sbjct: 31 ELIGANLCESDITGANLSKAKLNRANLSKANLSNTNLSGTDLGGADMTEAILTEANLCRA 90
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY-ANGTNPITGVSTRKSL 239
L+ T L+++DL A + A+F A I + LC+ +G N + GV+ R+++
Sbjct: 91 DLIGTNLSKADLSRAFLTQANFIGANI---SRAILCQTDLHGVN-LYGVNLRRAI 141
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S G AD+ +A+ T A++ +D G+ + A L +A +ANF GA++S
Sbjct: 68 SGTDLGGADMTEAI---------LTEANLCRADLIGTNLSKADLSRAFLTQANFIGANIS 118
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++ + L+ NL L R +LT +DL GA + D S A
Sbjct: 119 RAILCQTDLHGVNLYGVNLRRAILTEADLIGANLTKVDLSGA 160
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLT 183
RAN + A++ ++ SG+ GA + +A+ +AN G +LS + R L +AN
Sbjct: 54 RANLSKANLSNTNLSGTDLGGADMTEAILTEANLCRADLIGTNLSKADLSRAFLTQANFI 113
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A + R +L ++DL G + G + A++
Sbjct: 114 GANISRAILCQTDLHGVNLYGVNLRRAIL 142
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 1/94 (1%)
Query: 116 ADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 174
ADL +A + NF AN + A + ++D G G L +A+ +A+ GA+L+ +
Sbjct: 100 ADLSRAFLTQANFIGANISRAILCQTDLHGVNLYGVNLRRAILTEADLIGANLTKVDLSG 159
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
L A+L A L +L+ +DL GA + GA+ +
Sbjct: 160 ADLMGASLIRADLTEAILSAADLTGANLLGANLT 193
>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 319
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 61/133 (45%), Gaps = 11/133 (8%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
Q ADLR +FR + + A++RE DF+G+ AYL +A NFT A+L
Sbjct: 25 QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFTDANLEGA 84
Query: 171 LMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
+ ++ L +AN LT A L +T + GA + GA S A ++ A
Sbjct: 85 SLIKIYLIKANCYQTNFSGAYLTGAYLTKTNFKEAKFHGAYLNGAKLSGAKLEDAYYDHQ 144
Query: 221 CKYANGTNPITGV 233
++ +P T +
Sbjct: 145 TRFDTSFDPKTAL 157
>gi|94263119|ref|ZP_01286937.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93456490|gb|EAT06604.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 355
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ T D+R+ + G+ F GA L K + AN G D S + +L EA+L+ A +
Sbjct: 70 DLTMVDLRQLELPGASFKGARLHKTLLGGANLAGCDFSQARIFWSLLQEADLSRASFRQA 129
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
RS L A E ADFS+AV+
Sbjct: 130 EFERSILQDANCEEADFSEAVL 151
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 11/104 (10%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTG 164
ADL +A + F R+ A+ E+DFS S+ G L +A +K +G
Sbjct: 119 ADLSRASFRQAEFERSILQDANCEEADFSEAVLFKTILLNSRLKGINLRQAKMHKVLLSG 178
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
DL+ M E N NA L +R+D+ G + GAD S
Sbjct: 179 CDLAGQDFSDMRFREVNFANAKLGGADFSRADISGCVFTGADLS 222
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 10/96 (10%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV----------LNEA 180
+F+ RE +F+ +K GA +A FTGADLS + + ++ L A
Sbjct: 185 DFSDMRFREVNFANAKLGGADFSRADISGCVFTGADLSASRLSGVIARQSMFAGTNLQGA 244
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+L A LV+ L S+L GA + GA+ A ++ A+
Sbjct: 245 DLEGAGLVQAYLGESNLEGASLVGANLESASLEKAR 280
>gi|451979948|ref|ZP_21928350.1| hypothetical protein NITGR_130030 [Nitrospina gracilis 3/211]
gi|451762820|emb|CCQ89564.1| hypothetical protein NITGR_130030 [Nitrospina gracilis 3/211]
Length = 360
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 50/114 (43%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAV 156
+E T E + A S + F+A F A + +DFSG A +A
Sbjct: 86 FEGSTLKETNLSEALLHNSNFTNTKFQNTDLFQAQFHDAILTNADFSGETIPNALFFRAN 145
Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+NFT + L D D L A LTNA+L RT+ S G A E A+F ++
Sbjct: 146 LKHSNFTNSYLEDCQFDDADLTNAVLTNAILTRTIENLSSPGKAKFENANFKNS 199
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 18/143 (12%)
Query: 96 KYEAETRGEFG---------IGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSK 146
+Y++ T+ EF + ++ Q L A +V + +F+ D+R+ D
Sbjct: 19 EYKSITQEEFDRLYEKHHNWLEASKQIKDTQLESANNV---LKPDFSYHDLRDIDLKDKN 75
Query: 147 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
L+KA NF G+ L +T + +L+ +N TN T L ++ AI+ AD
Sbjct: 76 -----LQKANLKNCNFEGSTLKETNLSEALLHNSNFTNTKFQNTDLFQAQFHDAILTNAD 130
Query: 207 FSDAVIDLAQ-KQALCKYANGTN 228
FS I A +A K++N TN
Sbjct: 131 FSGETIPNALFFRANLKHSNFTN 153
Score = 37.4 bits (85), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 32/155 (20%), Positives = 68/155 (43%), Gaps = 18/155 (11%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANF 132
T L+ A++ + + + + + ++A+ I + A F + A+ + N + +NF
Sbjct: 94 TNLSEALLHNSNFTNTKFQNTDLFQAQFHD--AILTNADFSGETIPNALFFRANLKHSNF 151
Query: 133 TSADMRESDFSGSKFNGAYLEKAVAYK---------------ANFTGADLSDTLMDRMVL 177
T++ + + F + A L A+ + ANF ++L++ + L
Sbjct: 152 TNSYLEDCQFDDADLTNAVLTNAILTRTIENLSSPGKAKFENANFKNSNLNNATLSSSDL 211
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
AN N+ +VR L ++ G GAD ++A+
Sbjct: 212 TNANFQNSTMVRVKLENTNTAGTHFGGADITNALF 246
>gi|77404498|ref|YP_345074.1| hypothetical protein pREC1_0013 [Rhodococcus erythropolis PR4]
gi|77019879|dbj|BAE46254.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length = 589
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A G ADLR A N AD++ ++ SG+ A L+ A+ +A+ TGA+L+D
Sbjct: 228 ASLGFADLRAA---------NLQGADLQTAELSGATLRLANLKGAILREADLTGANLTDA 278
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ L EA L A+LV L DL +E A+ S A +
Sbjct: 279 TLTEADLAEAKLQGAILVNVNLQNFDLSRLDLEKANLSGATL 320
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 10/141 (7%)
Query: 95 NKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGA 150
N EA G + G+A A A L KA K A +AD++E+ G+ A
Sbjct: 349 NLAEANLTGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATLEGADLEDA 408
Query: 151 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
LE A KAN A LS L EA+LT AVL+ LT + GA + GAD +DA
Sbjct: 409 DLESAKLSKANLRLAILSGA-----TLPEADLTGAVLIGANLTNTTFSGANLSGADLTDA 463
Query: 211 VIDLAQ-KQALCKYANGTNPI 230
+ +A ++A AN T +
Sbjct: 464 DLSVADLEEADLTEANLTGAV 484
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 16/121 (13%)
Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKA-- 155
+ A A L+ A+ V N +AN + A + E+D + GA LE+A
Sbjct: 281 TEADLAEAKLQGAILVNVNLQNFDLSRLDLEKANLSGATLFEADLRSATLTGANLERANL 340
Query: 156 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++AN A+L+ M L EA LT+A L + L ++ L GA++ AD +A +
Sbjct: 341 AHAKLFEANLAEANLTGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATL 400
Query: 213 D 213
+
Sbjct: 401 E 401
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 16/98 (16%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A A+L AV + N AN T AD+ +++ S + Y AN T A+LSD
Sbjct: 473 ADLTEANLTGAVLIGANLAHANLTDADLSKANLSDADL----------YSANLTDANLSD 522
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
L+ A LT A L+ T+LTR DL GA++ G D
Sbjct: 523 A-----DLSGATLTRAGLMGTILTRVDLTGAVLTGLDL 555
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 9/144 (6%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQFGSADLRKA-VHVK 125
F L+ A + +++ L + + EA G IG+ A ADL KA +
Sbjct: 449 TFSGANLSGADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
+ + AN T A++ ++D SG+ A L + + + TGA L+ + L NLT+
Sbjct: 509 DLYSANLTDANLSDADLSGATLTRAGLMGTILTRVDLTGAVLTG-----LDLVGVNLTDV 563
Query: 186 VLVRTVLTRSDLGGAIIEGADFSD 209
L + DL GAI+ G D S+
Sbjct: 564 NLDNVNMDDVDLSGAILPGTDTSE 587
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 51/111 (45%), Gaps = 16/111 (14%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 164
A ADL A K N R A + A + E+D +G+ GA L F+G
Sbjct: 403 ADLEDADLESAKLSKANLRLAILSGATLPEADLTGAVLIGANLTNTT-----FSGANLSG 457
Query: 165 -----ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ADLS ++ L EANLT AVL+ L ++L A + A+ SDA
Sbjct: 458 ADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508
>gi|428211575|ref|YP_007084719.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999956|gb|AFY80799.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 514
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 13/120 (10%)
Query: 109 SAAQFGSADLRKAVHVKEN-------FRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
S A GSA L +A H+++ FRAN A++ +++ G+ N A LE A +AN
Sbjct: 101 SNATLGSATLEQA-HLEKAIFNGATLFRANLHQANLEKAELLGANLNSANLELANLKEAN 159
Query: 162 FTGADLSD-TL----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
ADL D TL +++ L ANL NA L L R +L A +E A+ S ++ A+
Sbjct: 160 LENADLQDATLPLANLEKANLKNANLKNANLSGANLKRVNLENANLESANLSSTNLEEAK 219
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 82/184 (44%), Gaps = 21/184 (11%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADL------NKYEAETRGEFGIGSAAQFGSADLRKA 121
W +F L V A+ S+++ L + N EA G AQ AD+ +
Sbjct: 21 WSLFCLIFLPNPVFAARGSDVAKLEETGQCTRCNLQEANLMG-------AQLQGADMSDS 73
Query: 122 VHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
N R ANF+ + M ++D S + A LE+A KA F GA L + +
Sbjct: 74 NLRLANLRGAKLDGANFSRSRMFQADLSNATLGSATLEQAHLEKAIFNGATLFRANLHQA 133
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTNP-ITGV 233
L +A L A L L ++L A +E AD DA + LA ++A K AN N ++G
Sbjct: 134 NLEKAELLGANLNSANLELANLKEANLENADLQDATLPLANLEKANLKNANLKNANLSGA 193
Query: 234 STRK 237
+ ++
Sbjct: 194 NLKR 197
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 7/85 (8%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
T A++ E++ SG GA L AN GADLS ++ L ANL NA L
Sbjct: 412 LTEANLVEANLSGINLKGARL-----ANANLQGADLSLANLETAHLFGANLQNANLSGAN 466
Query: 192 LTRSDLGGAIIEGADFSDAVIDLAQ 216
LT ++LGGA + GA+ ++L+Q
Sbjct: 467 LTGANLGGANLTGANLEG--VNLSQ 489
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 15/94 (15%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTGADLSDTLMDRMVLNEANLTNA- 185
+ A++ + +G GA LE + +AN TG +LS + + L ANL A
Sbjct: 261 LSGANLEAMNLTGINLEGANLEGTSLFMSNLERANLTGVNLSQSYLHYTDLTSANLVGAN 320
Query: 186 ---------VLVRTVLTRSDLGGAIIEGADFSDA 210
+L+ T LTR+DL A +GA+ D+
Sbjct: 321 LHRADLRHSILLGTDLTRADLSHANFKGANLQDS 354
>gi|167848210|ref|ZP_02473718.1| pentapeptide repeat protein [Burkholderia pseudomallei B7210]
Length = 333
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 20 ADLTGADLSGMDLRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVD 74
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 75 ARLTAANLSLAHCERTDFS 93
>gi|434392917|ref|YP_007127864.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428264758|gb|AFZ30704.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 313
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 57/108 (52%), Gaps = 6/108 (5%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A+L KA+ + N AN T AD+ +++ SG + + A L +AV A+
Sbjct: 93 SGVNLWRANLNKAILCEANLSRANLDEANLTGADLSKANLSGIQLSKANLTEAVIVDAHL 152
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L++T + R L L A L+ + LT +DL A +EGA+ S+A
Sbjct: 153 NRANLTETKLMRSHLCGTQLERAELIASDLTAADLSRANLEGANLSEA 200
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 52/100 (52%), Gaps = 9/100 (9%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A + DL +++ V A+ ++++ + ++FNG++L AN TGADLS
Sbjct: 40 AILEATDLSRSILVG----ADLNGVILKQATMTATRFNGSHLVGVDLTAANLTGADLSGV 95
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ R ANL A+L L+R++L A + GAD S A
Sbjct: 96 NLWR-----ANLNKAILCEANLSRANLDEANLTGADLSKA 130
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 16/116 (13%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA--VAYKA---NFTG 164
A+ ++DL A + N AN + A++ +++ SG+ G L +A +A KA N G
Sbjct: 175 AELIASDLTAADLSRANLEGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLRG 234
Query: 165 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A+L + L EA NL+ A L R +LT +L AI+ GA+ DA
Sbjct: 235 ANLEQAELITTNLTEADLSWANLSKTNLSGADLHRAILTDVNLNSAILRGANLIDA 290
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 16/113 (14%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR--MV--------LN 178
RA ++D+ +D S + GA L +A +AN +GA+L+ + R ++ L
Sbjct: 174 RAELIASDLTAADLSRANLEGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLR 233
Query: 179 EANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI-DLAQKQALCKYAN 225
ANL A L+ T LT +DL A + GAD A++ D+ A+ + AN
Sbjct: 234 GANLEQAELITTNLTEADLSWANLSKTNLSGADLHRAILTDVNLNSAILRGAN 286
>gi|162456757|ref|YP_001619124.1| pentapeptide repeat-containing protein [Sorangium cellulosum So ce56]
gi|161167339|emb|CAN98644.1| pentapeptide repeats hypothetical protein [Sorangium cellulosum So ce56]
Length = 895
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 12/111 (10%)
Query: 108 GSAAQFGSADLRKAV--HVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G F SA L++ V H A+F+ ADM ++ G+ GA L++A A+ +G
Sbjct: 747 GERVSFRSACLQQGVVVHGSSFPEADFSDADMERANLRGTVLAGARLDRANLRGADLSGC 806
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
D S EA+L AVL +L R+DL A ++GA+ DA+ A+
Sbjct: 807 DAS----------EASLERAVLQGGLLIRTDLVNASLQGANLMDALASKAR 847
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 12/118 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGS-----KFNGAYLEKAVA-YKAN 161
S +F ADL +A V+ A+F+SA +R++ F F A L++ V + ++
Sbjct: 708 SGVRFTGADLSEANLVESTLDGADFSSATLRKTTFVACHGERVSFRSACLQQGVVVHGSS 767
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
F AD SD M+R ANL VL L R++L GA + G D S+A ++ A Q
Sbjct: 768 FPEADFSDADMER-----ANLRGTVLAGARLDRANLRGADLSGCDASEASLERAVLQG 820
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 10/91 (10%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+FT A++ SG +GA+LE A + +G DLS T ++ VL ANL A L
Sbjct: 581 DFTGANLAGMCLSGVDLSGAFLESA-----DLSGCDLSRTNLEGAVLARANLAGANLADA 635
Query: 191 VLTRSDLGGAIIEG-----ADFSDAVIDLAQ 216
L ++LGGA + G AD +AV+ A+
Sbjct: 636 RLRGANLGGAALRGASLDRADLKEAVLSRAE 666
Score = 45.4 bits (106), Expect = 0.024, Method: Composition-based stats.
Identities = 55/173 (31%), Positives = 81/173 (46%), Gaps = 37/173 (21%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGS----ADLRKAVHVKENF- 128
T L AV+A + LA N +A RG +G AA G+ ADL++AV +
Sbjct: 615 TNLEGAVLARAN-----LAGANLADARLRGA-NLGGAALRGASLDRADLKEAVLSRAELE 668
Query: 129 RANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLN 178
RA F+ AD+ +D+ G+ F GA L + K + FTGADLS+ + L+
Sbjct: 669 RARFSGADLTGADWFETKPGGADFTGATLGQCNLLKVDLSGVRFTGADLSEANLVESTLD 728
Query: 179 EANLTNAVLVRTVLT---------RSDL--GGAIIEG-----ADFSDAVIDLA 215
A+ ++A L +T RS G ++ G ADFSDA ++ A
Sbjct: 729 GADFSSATLRKTTFVACHGERVSFRSACLQQGVVVHGSSFPEADFSDADMERA 781
Score = 42.0 bits (97), Expect = 0.33, Method: Composition-based stats.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT----------LMDRMVLNEA 180
+ + A + +D SG + LE AV +AN GA+L+D + L+ A
Sbjct: 596 DLSGAFLESADLSGCDLSRTNLEGAVLARANLAGANLADARLRGANLGGAALRGASLDRA 655
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADF 207
+L AVL R L R+ GA + GAD+
Sbjct: 656 DLKEAVLSRAELERARFSGADLTGADW 682
>gi|427723149|ref|YP_007070426.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354869|gb|AFY37592.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 508
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 6/106 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTS------ADMRESDFSGSKFNGAYLEKAVAYKANF 162
S A+ LR+A N R S AD+ +++ G+ GAYL A Y AN
Sbjct: 67 SGAKLSKVHLRQAYLYGTNLRRTHLSEAFLFKADLSKTNLYGAYLYGAYLYGANLYGANL 126
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ ADLS+ + L+EA+L+ A L L+ +DL G + G + S
Sbjct: 127 SKADLSEADLSEADLSEADLSEADLSGVSLSEADLSGVNLSGVNLS 172
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 43/81 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + AD+ +D SG+K + +L +A Y N LS+ + + L++ NL A L
Sbjct: 54 ADLSGADLSGADLSGAKLSKVHLRQAYLYGTNLRRTHLSEAFLFKADLSKTNLYGAYLYG 113
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L ++L GA + AD S+A
Sbjct: 114 AYLYGANLYGANLSKADLSEA 134
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 44/84 (52%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
A+ + AD+ E+D S + +G L +A N +G +LS + + L+ NL+ A L
Sbjct: 133 EADLSEADLSEADLSEADLSGVSLSEADLSGVNLSGVNLSGVNLSGVNLSGVNLSGAKLC 192
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
T+ S L GA ++ AD + A I
Sbjct: 193 HTLCKLSTLVGASLKSADLTGACI 216
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 37/73 (50%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
D+R + FSG+ +G L A+ +GADLS + L++ +L A L T L R+
Sbjct: 30 DLRRAQFSGAHLSGVNLSGVNLSGADLSGADLSGADLSGAKLSKVHLRQAYLYGTNLRRT 89
Query: 196 DLGGAIIEGADFS 208
L A + AD S
Sbjct: 90 HLSEAFLFKADLS 102
>gi|337746223|ref|YP_004640385.1| hypothetical protein KNP414_01954 [Paenibacillus mucilaginosus KNP414]
gi|336297412|gb|AEI40515.1| Uncharacterized low-complexity protein-like protein [Paenibacillus mucilaginosus KNP414]
Length = 289
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ FT + + SDFSG+ G+ + + +ANF GA+L+D + + L A+ +LV
Sbjct: 101 KGQFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSTLDLANASFHKTILV 160
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
RT ++S L GA G +D + +
Sbjct: 161 RTNFSKSGLDGAQFTGVRLTDVTLTM 186
>gi|425454308|ref|ZP_18834054.1| Pentapeptide repeat protein [Microcystis aeruginosa PCC 9807]
gi|389805079|emb|CCI15409.1| Pentapeptide repeat protein [Microcystis aeruginosa PCC 9807]
Length = 222
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 16/102 (15%)
Query: 140 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT---------------N 184
++ SG+ +GA LE+A KAN TGA+LS + + L+EA+LT
Sbjct: 68 ANLSGANLSGALLEEAKLGKANLTGANLSKADLSAITLSEADLTEADLSEAVLSNALMDQ 127
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
A+LV L +DL AII A+ S+AV + AQ K A+ +N
Sbjct: 128 AILVDATLIGADLESAIISKANLSNAVANKAQFKNAILSESN 169
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A+ G A+L A K + A + AD+ E+D S + + A +++A+ A GADL
Sbjct: 83 AKLGKANLTGANLSKADLSAITLSEADLTEADLSEAVLSNALMDQAILVDATLIGADL-- 140
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ ++++ANL+NAV + + L + + G DFS A +
Sbjct: 141 ---ESAIISKANLSNAVANKAQFKNAILSESNLSGTDFSQATM 180
>gi|440754482|ref|ZP_20933684.1| pentapeptide repeats family protein [Microcystis aeruginosa TAIHU98]
gi|440174688|gb|ELP54057.1| pentapeptide repeats family protein [Microcystis aeruginosa TAIHU98]
Length = 469
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 54/101 (53%)
Query: 113 FGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 172
+G+ R ++ +RAN A+++ ++ G+ NGA L A +A GA L+ +
Sbjct: 294 YGAYLYRANLYRANLYRANLKGANLKGANLKGANLNGANLILANLNRAYLNGAILNRANL 353
Query: 173 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+ +LN ANL A L L +++L GA + GAD + A ++
Sbjct: 354 NGAILNRANLNGAYLNGAYLIQANLNGADLNGADLNRANLN 394
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 131 NFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTGADLSDTLMDRMVLNEA 180
+ + ++RE++ +G+ NGA YL +A Y+AN A+L + L A
Sbjct: 267 DLSRTNLREANLNGANLNGAQLYRANLYGAYLYRANLYRANLYRANLKGANLKGANLKGA 326
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
NL A L+ L R+ L GAI+ A+ + A+++ A
Sbjct: 327 NLNGANLILANLNRAYLNGAILNRANLNGAILNRA 361
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
RAN A + ++ +G+ NGAYL +AN GADL+ ++R LN ANL A L
Sbjct: 350 RANLNGAILNRANLNGAYLNGAYL-----IQANLNGADLNGADLNRANLNGANLNGANL 403
>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 266
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 54/109 (49%), Gaps = 4/109 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A ADLR+A + RAN + D+ +D + G L A KA+ + A+LS+
Sbjct: 153 ADLNDADLREA----QLIRANLSEVDLSGADLRAANLKGVNLRGADLNKADLSRANLSEA 208
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
+ LNEANL+ A L L ++L + GA F + ++D K++
Sbjct: 209 YLYLANLNEANLSRADLSEANLHEANLSRVDLRGAIFCETIMDDGHKES 257
Score = 40.4 bits (93), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 42/77 (54%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + D+ ++ SG+ +GA L +A + N + +L+ ++ LN+A+L A L+R
Sbjct: 109 NLSGVDLSGANLSGADLSGADLSEADLSRVNLSRVNLNGANLNDADLNDADLREAQLIRA 168
Query: 191 VLTRSDLGGAIIEGADF 207
L+ DL GA + A+
Sbjct: 169 NLSEVDLSGADLRAANL 185
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
++E D S S +G L AN +GADLS A+L+ A L R L+R +
Sbjct: 95 LKEFDLSQSNLSGVNLSGVDLSGANLSGADLSG----------ADLSEADLSRVNLSRVN 144
Query: 197 LGGAIIEGADFSDAVIDLAQ 216
L GA + AD +DA + AQ
Sbjct: 145 LNGANLNDADLNDADLREAQ 164
>gi|158313419|ref|YP_001505927.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
gi|158108824|gb|ABW11021.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
Length = 299
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 53/118 (44%), Gaps = 10/118 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A DLR A AD+R++D S + GA L A+ A TGADL
Sbjct: 106 SGADLRGTDLRDAC---------LRGADLRDADLSQAALGGADLAGALLAGAFLTGADLH 156
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
T + L+ A+L A L R L +D G I+ GAD A D +QA + A+
Sbjct: 157 GTDLHGAFLHNADLRKAFLARADLRGADADGIIMRGADLRAADATDAVLRQADLRAAD 214
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 50/107 (46%), Gaps = 9/107 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A G ADL A+ A T AD+ +D G+ + A L KA +A+ GAD
Sbjct: 131 SQAALGGADLAGALLAG----AFLTGADLHGTDLHGAFLHNADLRKAFLARADLRGADAD 186
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDA 210
+M L A+ T+AVL + L +D L GAI+ G D A
Sbjct: 187 GIIMRGADLRAADATDAVLRQADLRAADLRGIRLAGAILRGVDLRGA 233
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
Query: 108 GSAAQFGS-ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 166
G+ A+ S DL A+ + AD+ +D +G G L A + A +GAD
Sbjct: 50 GAPARLSSLGDLLAALRGRPRTGGYAAGADLTGADLAGVCLTGRILRGAQLHGAYLSGAD 109
Query: 167 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
L T + L A+L +A L + L +DL GA++ GA
Sbjct: 110 LRGTDLRDACLRGADLRDADLSQAALGGADLAGALLAGA 148
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 36/78 (46%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ +G GA L A A+ G DL D + L +A+L+ A L
Sbjct: 78 ADLTGADLAGVCLTGRILRGAQLHGAYLSGADLRGTDLRDACLRGADLRDADLSQAALGG 137
Query: 190 TVLTRSDLGGAIIEGADF 207
L + L GA + GAD
Sbjct: 138 ADLAGALLAGAFLTGADL 155
>gi|448242763|ref|YP_007406816.1| hypothetical protein SMWW4_v1c30030 [Serratia marcescens WW4]
gi|445213127|gb|AGE18797.1| hypothetical protein SMWW4_v1c30030 [Serratia marcescens WW4]
Length = 850
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 5/130 (3%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNG 149
A +D + E+E +AQ S LR + + R +A +R+ D SG G
Sbjct: 482 AFSDKQRGESERALHQMYLMSAQAQSPALRLRGDLAQIIRQRVAAAMLRDKDLSGLDLTG 541
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A L +AN GA L++ L + L L +L R+DL GA+++ AD S
Sbjct: 542 ADLSGMDLCQANLRGA-----LLENANLRQTQLVGCDLREAMLARADLSGAVLQQADLSH 596
Query: 210 AVIDLAQKQA 219
A + LA+ +A
Sbjct: 597 ASLALAKCEA 606
Score = 42.0 bits (97), Expect = 0.33, Method: Composition-based stats.
Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 12/90 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RANF+ A + D S ++ N +A+F A+ S +L R L++ANL +A +
Sbjct: 747 RANFSRARLDNCDLSEARLN----------EADFRQANGSGSLFIRCDLSKANLRDANFI 796
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
+L + L GA ++G + A DL+Q Q
Sbjct: 797 AAILQKCVLSGADLQGTNLFRA--DLSQSQ 824
Score = 39.3 bits (90), Expect = 2.1, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ T AD+ D + GA LE A + G DL + ++ R A+L+ AVL +
Sbjct: 538 DLTGADLSGMDLCQANLRGALLENANLRQTQLVGCDLREAMLAR-----ADLSGAVLQQA 592
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L+ + L A E DF A
Sbjct: 593 DLSHASLALAKCEATDFGGA 612
>gi|432333149|ref|ZP_19584958.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis IFP 2016]
gi|430779982|gb|ELB95096.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis IFP 2016]
Length = 220
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 51/108 (47%), Gaps = 6/108 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A +ADLR R A+ TS +M E D SG+ A L A AN ADL+D
Sbjct: 34 ANLRNADLRLGFLRDATLRNADLTSCNMYEVDLSGANLYLAQLSGAHMTGANLNNADLTD 93
Query: 170 TLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
T + + +L E L A L R L +DL GA + G D SDA +
Sbjct: 94 TKLIKTQLSGAMLIEVELDGADLSRAFLQNADLTGAHLRGTDLSDATL 141
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 73/162 (45%), Gaps = 19/162 (11%)
Query: 79 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMR 138
A + SC+ L+ N Y A+ G G A +ADL +K + A +
Sbjct: 54 ADLTSCNMYEVDLSGANLYLAQLSGAHMTG--ANLNNADLTDTKLIK----TQLSGAMLI 107
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLT 193
E + G+ + A+L+ A A+ G DLSD + + M N EA L +A L LT
Sbjct: 108 EVELDGADLSRAFLQNADLTGAHLRGTDLSDATLVGAELMATNLAEAELVDADLTDADLT 167
Query: 194 RSDLGGAIIEGA-----DFSDAVI---DLAQKQALCKYANGT 227
+DL GA + GA DF+DA + DL Q +Y + T
Sbjct: 168 FADLTGADLRGANLTRTDFTDADLTGADLGTTQDKARYDDTT 209
>gi|20090742|ref|NP_616817.1| hypothetical protein MA1892 [Methanosarcina acetivorans C2A]
gi|19915798|gb|AAM05297.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A]
Length = 560
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 62/122 (50%), Gaps = 12/122 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 164
A A+LR K N R + + AD+RE+D SG +GA L A +AN G
Sbjct: 389 ANLSGANLRGTNLSKANLREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNG 448
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKY 223
ADL+ + R LNEANL+ +T L +DL A + GA S+A + A+ K A +
Sbjct: 449 ADLNGIDLRRANLNEANLS-----KTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRK 503
Query: 224 AN 225
AN
Sbjct: 504 AN 505
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 44/79 (55%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N AD+ ESD + + A+L +A KAN + A+LS+ + + ANL+ A L +
Sbjct: 279 NLIGADLSESDLRDAFLHEAHLNEADLSKANLSKANLSEADLKGAYMRRANLSEANLSKA 338
Query: 191 VLTRSDLGGAIIEGADFSD 209
L+ DL GA + GAD ++
Sbjct: 339 KLSGVDLSGANLSGADLNE 357
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 12/114 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR--ANFTSADMRESD----------FSGSKFNGAYLEKAV 156
S A ADL + K + AN + AD+ E+D SG+ G L KA
Sbjct: 346 SGANLSGADLNEFYLNKATYTRGANLSEADLSEADLSEANLKGANLSGANLRGTNLSKAN 405
Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + +GADL + + + L+ ANL+ A L L+R++L GA + G D A
Sbjct: 406 LREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNGADLNGIDLRRA 459
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----A 165
A DLR+A ++ E AN + ++ E+D S +K +GAYL +A A G A
Sbjct: 449 ADLNGIDLRRA-NLNE---ANLSKTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRKA 504
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+LS+ ++ L EANL+ A L L+ DL GA + G +
Sbjct: 505 NLSEADLNGADLREANLSEANLNGVDLSVIDLRGANLNGVNI 546
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + AD+ D S + NGA L +AN A+LS T ++ L++A L+ A L
Sbjct: 429 ANLSGADLSGVDLSRANLNGADLNGIDLRRANLNEANLSKTNLNEADLSKAKLSGAYLSE 488
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L + L GA + A+ S+A
Sbjct: 489 AKLKGAKLKGAYMRKANLSEA 509
>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 346
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 74/151 (49%), Gaps = 3/151 (1%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVK 125
NW L+ A +A+ + + L+ N A+ + IG+ S DLR+A + +
Sbjct: 95 NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
+A+ T A++R++D +G+K + L A AN TGA+L + + LN ANLT A
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTGANLKQANLSQAHLNWANLTKA 212
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L L ++L A + D ++ + AQ
Sbjct: 213 DLREANLCGANLSKANLSQTDLTEVCLKDAQ 243
Score = 44.7 bits (104), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 44/84 (52%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
R + AD+ E++ SG GA L KA AN + A+LS + L ANLT A L
Sbjct: 31 RLSLAKADLSEANLSGVYLGGASLTKANLSGANLSRANLSGASLSGANLTGANLTGANLA 90
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
L +DL GA + GA+ ++A +
Sbjct: 91 GAHLNWADLSGANLSGANLANADV 114
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRES----------DFSGSKFNGAYLEKAVAYK 159
A ADLR+A N +AN + D+ E +FSG+ G L +
Sbjct: 207 ANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSGINFSGANLTGVDLSNKLLTG 266
Query: 160 ANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
AN +GA+ LS + + L EANL+ A L+ + L +DL A + GA+ S A +
Sbjct: 267 ANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMDADLTKANLSGANLSQANV 324
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 21/121 (17%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKA----VAYK------ 159
A A+L++A + + AN T AD+RE++ G+ + A L + V K
Sbjct: 187 ANLTGANLKQANLSQAHLNWANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSG 246
Query: 160 -----ANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
AN TG DLS+ L + L+ ANL+ A L++T L ++L A + G+ D
Sbjct: 247 INFSGANLTGVDLSNKLLTGANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMD 306
Query: 210 A 210
A
Sbjct: 307 A 307
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
F + ++D S + +G YL A KAN +GA+LS R L+ A+L+ A L
Sbjct: 29 FNRLSLAKADLSEANLSGVYLGGASLTKANLSGANLS-----RANLSGASLSGANLTGAN 83
Query: 192 LTRSDLGGAIIEGADFSDA 210
LT ++L GA + AD S A
Sbjct: 84 LTGANLAGAHLNWADLSGA 102
>gi|443314210|ref|ZP_21043788.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786182|gb|ELR95944.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 516
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 2/124 (1%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
E + S A A+LR A + + AN A++R +D SG+ A L A A
Sbjct: 174 EDTVLSGAVLQRAELRHATLMGADLSGANLRGANLRWADLSGANLQEADLTDAKLSGATL 233
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 221
GADLS + +L +L+ L R SDL GA + GA + AV DL + C
Sbjct: 234 VGADLSGATLVNTILVHTDLSRTRLQRVYCVDSDLSGATLNGAFLAGAVCYDLVTAETTC 293
Query: 222 KYAN 225
+ +
Sbjct: 294 DWVD 297
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 50/92 (54%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRMVLN 178
+AN + AD+RE+ ++ +GA L ++ K+NF GA+L DT++ VL
Sbjct: 125 QANLSEADLREARLRWARLSGANLSQSDLRKSNFLGANLEGAQLYAAQMEDTVLSGAVLQ 184
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A L +A L+ L+ ++L GA + AD S A
Sbjct: 185 RAELRHATLMGADLSGANLRGANLRWADLSGA 216
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR+A + AN + +D+R+S+F G+ GA L A +GA L
Sbjct: 126 ANLSEADLREARLRWARLSGANLSQSDLRKSNFLGANLEGAQLYAAQMEDTVLSGAVLQR 185
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ L A+L+ A L L +DL GA ++ AD +DA
Sbjct: 186 AELRHATLMGADLSGANLRGANLRWADLSGANLQEADLTDA 226
>gi|337745078|ref|YP_004639240.1| hypothetical protein KNP414_00780 [Paenibacillus mucilaginosus KNP414]
gi|336296267|gb|AEI39370.1| Uncharacterized low-complexity protein-like protein [Paenibacillus mucilaginosus KNP414]
Length = 289
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ FT + + SDFSG+ G+ + + +ANF GA+L+D + + L A+ +LV
Sbjct: 101 KGQFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSTLDLANASFHKTILV 160
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
RT ++S L GA G +D + +
Sbjct: 161 RTNFSKSGLDGAQFTGVRLTDVTLTM 186
>gi|425452817|ref|ZP_18832632.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 7941]
gi|425459927|ref|ZP_18839413.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9808]
gi|440756386|ref|ZP_20935587.1| tetratricopeptide repeat family protein [Microcystis aeruginosa TAIHU98]
gi|389765245|emb|CCI08832.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 7941]
gi|389827515|emb|CCI21150.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9808]
gi|440173608|gb|ELP53066.1| tetratricopeptide repeat family protein [Microcystis aeruginosa TAIHU98]
Length = 262
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 55/101 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L++ + ++ + + + + + +S+ +G+K NGA L A +AN +GADLS +
Sbjct: 29 LQQLLSTRQCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 89 FGANLTGANLSGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+LS ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLDN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|46205596|ref|ZP_00048308.2| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum magnetotacticum MS-1]
Length = 195
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTN 184
A+F+ A MR + + +GA E A + +FTGAD D+L R L+EA NLT
Sbjct: 16 ADFSGATMRFARLDKALLDGARFEGADLWGTDFTGADADDSLFRRARLDEANLSDCNLTG 75
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A L ++ L GA + GA+F+ A +D
Sbjct: 76 ADFEGASLKKARLVGARLRGANFTGARLD 104
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 42/85 (49%), Gaps = 5/85 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA A++ + + +G+ F GA L+KA A GA+ + +D L+EA+ + LV
Sbjct: 60 RARLDEANLSDCNLTGADFEGASLKKARLVGARLRGANFTGARLDGADLSEADFSRTSLV 119
Query: 189 RTVLT-----RSDLGGAIIEGADFS 208
R LT + GA +EG S
Sbjct: 120 RLDLTACKLRHARFAGAWLEGVRLS 144
>gi|453064141|gb|EMF05113.1| putative low-complexity protein [Serratia marcescens VGH107]
Length = 850
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 5/130 (3%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNG 149
A +D + E+E +AQ S LR + + R +A +R+ D SG G
Sbjct: 482 AFSDKQRGESERALHQMYLMSAQAQSPALRLRGDLAQIIRQRVAAAMLRDKDLSGLDLTG 541
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
A L +AN GA L++ L + L L +L R+DL GA+++ AD S
Sbjct: 542 ADLSGMDLCQANLRGA-----LLESANLRQTQLVGCDLREAMLARADLSGAVLQQADLSH 596
Query: 210 AVIDLAQKQA 219
A + LA+ +A
Sbjct: 597 ASLALAKCEA 606
Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 11/108 (10%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A FG A L + N ++ ++FS ++ + L +A +A+F A+ + +
Sbjct: 728 ADFGDATLNQC---------NLRQMPLQRANFSRARLDNCDLSEARLNEADFRQANGNGS 778
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
L R L++ANL +A + +L + L GA ++G + A DL+Q Q
Sbjct: 779 LFIRCDLSQANLRDANFIAAILQKCVLSGADLQGTNLFRA--DLSQSQ 824
Score = 39.3 bits (90), Expect = 2.1, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ T AD+ D + GA LE A + G DL + ++ R A+L+ AVL +
Sbjct: 538 DLTGADLSGMDLCQANLRGALLESANLRQTQLVGCDLREAMLAR-----ADLSGAVLQQA 592
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L+ + L A E DF A
Sbjct: 593 DLSHASLALAKCEATDFGGA 612
>gi|428316016|ref|YP_007113898.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428239696|gb|AFZ05482.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 168
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 9/109 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
++A A L AV EN + N A + + G+ GA LEKA + A+ T ADLS
Sbjct: 55 TSANLNGAKLEGAVL--ENVKLN--EALLDSVNLKGANLKGASLEKAGLFSADLTKADLS 110
Query: 169 DT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ + LN ANL+NA L T L +DL GA ++GA+ A++
Sbjct: 111 NANLKGAFLRGAKLNNANLSNADLSETDLNIADLTGANLKGANLKGAIM 159
>gi|427734496|ref|YP_007054040.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369537|gb|AFY53493.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 116
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-- 189
+ AD+ E D SG+K A L A Y AN +GA LS + L+ ANL+ A L +
Sbjct: 1 MSGADLHEKDLSGAKLYRANLSGAKLYGANLSGASLSGADLSGSSLSAANLSGAYLQKAN 60
Query: 190 ---TVLTRSDLGGAIIEGADFSDAVI 212
L ++DL A + GAD +AV+
Sbjct: 61 LSGAYLQKADLSKATLYGADLQNAVL 86
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 46/90 (51%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
+ AN + A + +D SGS + A L A KAN +GA L + + L A+L NAVL
Sbjct: 27 YGANLSGASLSGADLSGSSLSAANLSGAYLQKANLSGAYLQKADLSKATLYGADLQNAVL 86
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
L + L GA +EGA A I+ A K
Sbjct: 87 FGANLEGAKLKGANLEGAKLKGANIEEAIK 116
>gi|425437233|ref|ZP_18817656.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9432]
gi|389677805|emb|CCH93269.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9432]
Length = 262
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 55/101 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L++ + ++ + + + + + +S+ +G+K NGA L A +AN +GADLS +
Sbjct: 29 LQQLLSTRQCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 89 FGANLTGANLSGAILTGADLRGAYLNNANLDNTKLDTAYVQ 129
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+LS ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLDN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|448677922|ref|ZP_21689112.1| pentapeptide repeat-containing protein [Haloarcula argentinensis DSM 12282]
gi|445773597|gb|EMA24630.1| pentapeptide repeat-containing protein [Haloarcula argentinensis DSM 12282]
Length = 428
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 5/86 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN +SAD+RE+D SG+ A L A KA+ +GADLS + L A+L++A L
Sbjct: 70 KANLSSADLREADLSGADLGSADLSGANLQKADLSGADLSYANLSGADLENADLSSADLR 129
Query: 189 RTVLT-----RSDLGGAIIEGADFSD 209
RT L+ +DL A + DFSD
Sbjct: 130 RTNLSGVKFVETDLADADLRNIDFSD 155
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 6/106 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A SADLR+ F + AD+R DFS ++ G L A + + +GADL
Sbjct: 121 ADLSSADLRRTNLSGVKFVETDLADADLRNIDFSDTELVGTDLSGADFFATDLSGADLRV 180
Query: 170 TLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 210
M + L EA+L+ A L T L+ +DL GA + G D SDA
Sbjct: 181 ADMSNVNLREADLSGADLGGTDLSDANLREADLSGADLGGVDLSDA 226
>gi|427724651|ref|YP_007071928.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427356371|gb|AFY39094.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 281
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 66/116 (56%), Gaps = 10/116 (8%)
Query: 107 IGSAAQFGSADLRKA----VHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
I + A A+LR+A ++ N +AN S+++ E++ + +K + + A +A
Sbjct: 46 IFTGATLDQANLREADLSYASLQGNLSQANLISSNLTEANLTAAKMAYSGMRAANLTRAK 105
Query: 162 FTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 212
T ADLS +++ ++ EANL+ A LV R LT+++L GA ++GA+ + A++
Sbjct: 106 LTSADLSYCILNEAIMREANLSKATLVDAFIGRANLTQANLEGANLQGANLTSAIL 161
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 39/81 (48%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ + G+ GA L A + N TG+ D + + LN ANLTN L
Sbjct: 149 ANLQGANLTSAILIGANLRGANLANATLHGINATGSTADDADLSKSKLNSANLTNVKLRG 208
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T L + L + GAD ++A
Sbjct: 209 TNLREAQLAWTTMRGADLTEA 229
>gi|386016243|ref|YP_005934529.1| hypothetical protein PAJ_1653 [Pantoea ananatis AJ13355]
gi|327394311|dbj|BAK11733.1| hypothetical protein PAJ_1653 [Pantoea ananatis AJ13355]
Length = 846
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 45/172 (26%), Positives = 75/172 (43%), Gaps = 17/172 (9%)
Query: 71 FVSTALAAAV-----VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
F+ + L AV + SCS + AD + + S + AD A +
Sbjct: 675 FIKSTLEQAVFNRAELESCSW-VETQADHATFSGSIWLTCAVASGSSLNDADFTHATLRQ 733
Query: 126 ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
N R +A + F+ +K + + L +A ANF A+L+ +L R +A+ T+A
Sbjct: 734 SNLRQTPLNAAV----FTQAKLDNSDLSEASCKGANFQQANLAGSLFVRTDFRDADFTDA 789
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 237
L+ +L +S LGGA G A DL+Q + + T + G T++
Sbjct: 790 NLMGAILQKSQLGGACFRGTTLFRA--DLSQ-----AFTSETTELDGAFTKR 834
Score = 37.7 bits (86), Expect = 5.3, Method: Composition-based stats.
Identities = 34/107 (31%), Positives = 45/107 (42%), Gaps = 11/107 (10%)
Query: 110 AAQFGSADL-RKAVHVKE----NF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
A F SA L R +H NF RA+F A +SDFSGS F L++ + F
Sbjct: 567 GANFNSAMLARTELHHSSLRNCNFERASFALAQCCQSDFSGSYFKDTQLQETLFDNCTFN 626
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
A S+ L + A+L V DL G DF++A
Sbjct: 627 EATFSELLFRETWFTQCRFQRAILQACVFMELDL-----PGLDFTEA 668
>gi|428219102|ref|YP_007103567.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990884|gb|AFY71139.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 698
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 1/101 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A A+L A NF +AN A++R + SG +GA L A AN +GA+L
Sbjct: 67 TGANLTGANLTGANLTGANFSKANLRGANLRGVNLSGVNLSGANLSGANLSGANLSGANL 126
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
S + R+ L+ AN +NA L L+ DL GA + GA+FS
Sbjct: 127 SGVNLSRVNLSGANFSNANLNNFDLSGFDLTGANLTGANFS 167
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 46/81 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF++A++ D SG +G L A AN +GA+LS+ + + L + NL+ A L R
Sbjct: 214 ANFSNANLNNFDLSGFDLSGVNLSGANLSGANLSGANLSEANLSEVDLYQINLSGANLSR 273
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
LT ++L GA GA+ S A
Sbjct: 274 IDLTGANLSGANFSGANLSGA 294
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 43/81 (53%), Gaps = 5/81 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
ANF++A++ D SG GA L A NF+G +LS + R L+ AN +NA L
Sbjct: 139 ANFSNANLNNFDLSGFDLTGANLTGA-----NFSGVNLSGVNLSRANLSGANFSNANLNN 193
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ DL G + GA+ S A
Sbjct: 194 FDLSGFDLSGVNLSGANLSGA 214
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 4/100 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A+L +A +VK RA AD+R +D +G+ GA L A AN TGA+ S
Sbjct: 32 SYTNLNEANLSEA-YVK---RAYLRGADLRGADLTGANLTGANLTGANLTGANLTGANFS 87
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ L NL+ L L+ ++L GA + GA+ S
Sbjct: 88 KANLRGANLRGVNLSGVNLSGANLSGANLSGANLSGANLS 127
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ D SG G L A N +GA+LS+ + + L + NL+ A L R
Sbjct: 324 ANLSGANLNNFDLSGFDLRGINLSGADLGGTNLSGANLSEANLSEVDLYQINLSGANLSR 383
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ 216
LT ++L GA + A+ ++ +DL Q
Sbjct: 384 IDLTGANLTGANLSEANLNE--VDLYQ 408
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 2/91 (2%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ E D +GA L + AN TGA+LS+ ++ + L + NL+ A L +
Sbjct: 359 ANLSEANLSEVDLYQINLSGANLSRIDLTGANLTGANLSEANLNEVDLYQINLSGANLSK 418
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 220
DLGG ++ + + A +L + +AL
Sbjct: 419 VNFQGFDLGGFDLKNVNLTGA--NLREVKAL 447
>gi|300866166|ref|ZP_07110885.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335845|emb|CBN56045.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 351
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 42/81 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ ++ S + GA L + AN +GADLS + + L E NL A L
Sbjct: 31 ANLGEANLNRTNLSNANLRGANLTRTKLIGANLSGADLSGANLSKAKLIEINLGGASLTG 90
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T+L DL GA + GA FS A
Sbjct: 91 TILLGVDLSGANLSGAIFSQA 111
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
+ A++V +C N S L D N A S A DL +A+ A
Sbjct: 119 IGASLVGACLLNGSKLVDANLSGATL-------SRATANGVDLSRAI---------LNRA 162
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+ E D SG+ +GA L +A A + N +GA+L + + L EANL A L L +
Sbjct: 163 ILSEVDLSGANLSGATLIRAYANRGNLSGANLHSSNLSEASLREANLCVANLSGAELQGT 222
Query: 196 DLGGAIIEGADFSDA 210
DL GA + GA+ S A
Sbjct: 223 DLSGANLNGANLSGA 237
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
AA G A+L + N R AN T + ++ SG+ +GA L KA + N GA L+
Sbjct: 30 AANLGEANLNRTNLSNANLRGANLTRTKLIGANLSGADLSGANLSKAKLIEINLGGASLT 89
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
T++ + L+ ANL+ A+ + L+++ L GA + GA
Sbjct: 90 GTILLGVDLSGANLSGAIFSQADLSKAVLIGASLVGA 126
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 1/114 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L A N R AN A + ++D ++ N A L A AN +GA L
Sbjct: 225 SGANLNGANLSGADLQGANLRGANLNGASLHKADLRTAELNKANLRGANLSGANLSGASL 284
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
+ + LN ANL+ A L+ T L +DL G + A+ A +++A C
Sbjct: 285 LEADLRGANLNGANLSGAGLLLTSLAGADLTGTNLSEANLIGATLNVANLNEAC 338
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 41/93 (44%), Gaps = 11/93 (11%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A ADLR A K N R AN + A + E+D G+ NGA L A + G
Sbjct: 252 ASLHKADLRTAELNKANLRGANLSGANLSGASLLEADLRGANLNGANLSGAGLLLTSLAG 311
Query: 165 ADLSDTLMDRM-----VLNEANLTNAVLVRTVL 192
ADL+ T + LN ANL A L +L
Sbjct: 312 ADLTGTNLSEANLIGATLNVANLNEACLGGAIL 344
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 52/116 (44%), Gaps = 17/116 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAY-----LEKAVAYKANF 162
S A A+L KA ++ N A+ T + D SG+ +GA L KAV A+
Sbjct: 64 SGADLSGANLSKAKLIEINLGGASLTGTILLGVDLSGANLSGAIFSQADLSKAVLIGASL 123
Query: 163 TG-----------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
G A+LS + R N +L+ A+L R +L+ DL GA + GA
Sbjct: 124 VGACLLNGSKLVDANLSGATLSRATANGVDLSRAILNRAILSEVDLSGANLSGATL 179
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 14/112 (12%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
RG S A S++L +A + N AN + A+++ +D SG+ NGA
Sbjct: 186 RGNL---SGANLHSSNLSEASLREANLCVANLSGAELQGTDLSGANLNGA---------- 232
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
N +GADL + LN A+L A L L +++L GA + GA+ S A +
Sbjct: 233 NLSGADLQGANLRGANLNGASLHKADLRTAELNKANLRGANLSGANLSGASL 284
>gi|163798116|ref|ZP_02192053.1| hypothetical protein BAL199_09395 [alpha proteobacterium BAL199]
gi|159176607|gb|EDP61184.1| hypothetical protein BAL199_09395 [alpha proteobacterium BAL199]
Length = 1025
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 46/123 (37%), Positives = 59/123 (47%), Gaps = 21/123 (17%)
Query: 110 AAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
A F +ADL A NFR A FT A + +DFS GA L K A A FT
Sbjct: 733 GALFENADLTNA-----NFRGATLEDAVFTGAVLTGADFSDCAMRGANLSKVEAKGARFT 787
Query: 164 GADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGA-----IIEGADFSDAVID 213
++L+D + L EA+LT NAV + LTR+DL A I A +AV+D
Sbjct: 788 RSELTDAKLVAAKLVEADLTATTMENAVALNADLTRADLSKARFTKVIFMTATMDEAVLD 847
Query: 214 LAQ 216
A+
Sbjct: 848 SAE 850
Score = 42.0 bits (97), Expect = 0.29, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 39/80 (48%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
++ A + +DFSG GA E A ANF GA L D + VL A+ ++ +
Sbjct: 715 DWAGAVLANTDFSGRDLRGALFENADLTNANFRGATLEDAVFTGAVLTGADFSDCAMRGA 774
Query: 191 VLTRSDLGGAIIEGADFSDA 210
L++ + GA ++ +DA
Sbjct: 775 NLSKVEAKGARFTRSELTDA 794
Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats.
Identities = 27/83 (32%), Positives = 35/83 (42%), Gaps = 5/83 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A + +RE G G AV +F+G DL L + L AN A L
Sbjct: 694 ARYLGQVVRECLAGGGDLTGRDWAGAVLANTDFSGRDLRGALFENADLTNANFRGATLED 753
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
V T GA++ GADFSD +
Sbjct: 754 AVFT-----GAVLTGADFSDCAM 771
>gi|186680850|ref|YP_001864046.1| RDD domain-containing protein [Nostoc punctiforme PCC 73102]
gi|186463302|gb|ACC79103.1| RDD domain containing protein [Nostoc punctiforme PCC 73102]
Length = 717
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 65/137 (47%), Gaps = 34/137 (24%)
Query: 132 FTSADMRESDFSGSKFNGA--------Y------LEKAVAYKANFTGADLSDTLMDRM-- 175
F SA++ ++ F GS+F GA Y L +A +AN T A+LS LM+R+
Sbjct: 459 FKSANLNQASFKGSRFRGAGEDGRWDTYDDVIADLSQAQLQQANLTDANLSRVLMNRIDL 518
Query: 176 ---VLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALCK 222
LN ANL+NA L T L +DL A++E GAD DA ++ A
Sbjct: 519 SRATLNRANLSNARLYDAKLNSTQLVGADLRNAVLERASLTGADLGDAKLNEAN-----L 573
Query: 223 YANGTNPITGVSTRKSL 239
YA +T + T+ S
Sbjct: 574 YAARLGRVTAIGTQLSF 590
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
A++ +D+ G+ +GAYL+ +AN + A+LS T + VL A + N L L+
Sbjct: 591 ANLTNTDWQGADLSGAYLD-----RANLSNANLSATRLAGAVLRSAQMENVNLQNADLSL 645
Query: 195 SDLGGAIIEGADFSDAVI 212
+DL GA + GADF A++
Sbjct: 646 ADLRGANVAGADFKGAIL 663
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 10/92 (10%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN ++A + ++ + ++ GA L AV +A+ TGADL D LNEANL A L
Sbjct: 525 RANLSNARLYDAKLNSTQLVGADLRNAVLERASLTGADLGDA-----KLNEANLYAARLG 579
Query: 189 R-----TVLTRSDLGGAIIEGADFSDAVIDLA 215
R T L+ ++L +GAD S A +D A
Sbjct: 580 RVTAIGTQLSFANLTNTDWQGADLSGAYLDRA 611
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 9/79 (11%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGA------DLSDTL---MDRMVLNEANLTNAVLVRTV 191
D SG KF A L +A + F GA D D + + + L +ANLT+A L R +
Sbjct: 453 DLSGVKFKSANLNQASFKGSRFRGAGEDGRWDTYDDVIADLSQAQLQQANLTDANLSRVL 512
Query: 192 LTRSDLGGAIIEGADFSDA 210
+ R DL A + A+ S+A
Sbjct: 513 MNRIDLSRATLNRANLSNA 531
>gi|434405486|ref|YP_007148371.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428259741|gb|AFZ25691.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 808
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A ADL A+ N +AN + A++R ++ G+ +GAY A A+ +GA L
Sbjct: 103 SGANLSGADLSGAILFGANLSQANLSQANLRGANLRGADLSGAYPSGADLRGADLSGAYL 162
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S+ + + L++ANL+ A L + L+ + L GA + GAD S A
Sbjct: 163 SEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYLSGADLSGA 205
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 50/81 (61%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ E+ G+K + A L +A AN +GA+LS+ ++ L++ANL+ A L
Sbjct: 45 ANLSQANLSEAILFGAKLSQANLSQANLSGANLSGANLSEAILFGAKLSQANLSQANLSG 104
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ +DL GAI+ GA+ S A
Sbjct: 105 ANLSGADLSGAILFGANLSQA 125
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+AN + A++ ++D SG+ GAYL A+ +GADLS + R L+ A+L+ A L
Sbjct: 174 QANLSQANLSQADLSGAYLTGAYLS-----GADLSGADLSGARLSRADLSRADLSAADLR 228
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
L+ +DL A + GA S A +
Sbjct: 229 GAYLSAADLSAAYLSGAYLSAAYL 252
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ E+ G+K + A L +A AN +GADLS ++ L++ANL+ A L
Sbjct: 75 ANLSGANLSEAILFGAKLSQANLSQANLSGANLSGADLSGAILFGANLSQANLSQANLRG 134
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L +DL GA GAD A
Sbjct: 135 ANLRGADLSGAYPSGADLRGA 155
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 58/132 (43%), Gaps = 34/132 (25%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 163
S A A+L +A+ F A + A++ +++ SG+ +GA L A+ + AN +
Sbjct: 73 SGANLSGANLSEAIL----FGAKLSQANLSQANLSGANLSGADLSGAILFGANLSQANLS 128
Query: 164 -------------------------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
GADLS + L++A L+ A L + L+++DL
Sbjct: 129 QANLRGANLRGADLSGAYPSGADLRGADLSGAYLSEAKLSQAKLSQANLSQANLSQADLS 188
Query: 199 GAIIEGADFSDA 210
GA + GA S A
Sbjct: 189 GAYLTGAYLSGA 200
>gi|416374431|ref|ZP_11683193.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
0003]
gi|357266721|gb|EHJ15312.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
0003]
Length = 279
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 11/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A SADLR A + N A+ TSA++ ++ +G+ NGA L + AN +G DLS
Sbjct: 54 ATLASADLRGANLKQVNLSYADLTSANLSGANLTGAILNGAKLNRVDLSYANLSGVDLSG 113
Query: 170 TLMDR-----MVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDA 210
+ R + L EA+LTNA L + +++S D A ++GA+FS A
Sbjct: 114 ANLSRSDLSYVDLREADLTNANLYKADISQSKLHNTDFQEAFLQGANFSRA 164
>gi|443310610|ref|ZP_21040256.1| serine/threonine protein kinase [Synechocystis sp. PCC 7509]
gi|442779315|gb|ELR89562.1| serine/threonine protein kinase [Synechocystis sp. PCC 7509]
Length = 533
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + D+ +D + F+G L K +KA +DL + LN+A+L +A L R
Sbjct: 413 NLSMLDLERADLTEVNFHGCNLHKTNLHKAILFNSDLG-----QASLNQASLKDANLSRA 467
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
L+ +DL GA + GAD SDA ++ A
Sbjct: 468 YLSHADLEGADLRGADLSDAYLNHA 492
>gi|37522461|ref|NP_925838.1| hypothetical protein gll2892 [Gloeobacter violaceus PCC 7421]
gi|35213462|dbj|BAC90833.1| gll2892 [Gloeobacter violaceus PCC 7421]
Length = 457
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 1/101 (0%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADLR A N AN AD+ +D +G+ N A+L A +AN GA+L+
Sbjct: 79 ANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNR 138
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ L ANL+ L R ++ +DL GA + GA+ S+A
Sbjct: 139 ANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEA 179
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 6/100 (6%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A+ A+LR+A N RA + AD+R +D G+ + A L A AN G
Sbjct: 134 AELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEADLGGANLGGANLKG 193
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
ADL ++R L A+L A L RT LT L GA++EG
Sbjct: 194 ADLGGANLERTSLRGADLRGADLRRTRLTGCSLEGAVLEG 233
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 45/84 (53%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ AD+RE++ G++ N A L +A AN +G LS M L A+L A L
Sbjct: 119 AHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSE 178
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
L ++LGGA ++GAD A ++
Sbjct: 179 ADLGGANLGGANLKGADLGGANLE 202
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 50/107 (46%), Gaps = 14/107 (13%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A G ADL A N A++ E+D G+ N A L A A+ +GADL+
Sbjct: 64 ADLGGADLEGA---------NLGGANLSEADLRGANLNWANLNWANLNWADLSGADLNGA 114
Query: 171 LMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
++ LN EANL A L R L ++LGGA + G S A +
Sbjct: 115 NLNWAHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFM 161
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 58/121 (47%), Gaps = 2/121 (1%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN AD+ +D G+ GA LE A AN + ADL ++ LN ANL A L
Sbjct: 49 ANLGGADLDGADLGGADLGGADLEGANLGGANLSEADLRGANLNWANLNWANLNWADLSG 108
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN-GTNPITGVSTRKSLGCGNSRRN 247
L ++L A + AD +A + A+ +A + AN G ++GVS ++ G R
Sbjct: 109 ADLNGANLNWAHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFMSGADLRG 168
Query: 248 A 248
A
Sbjct: 169 A 169
>gi|425454784|ref|ZP_18834510.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9807]
gi|389804455|emb|CCI16535.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9807]
Length = 262
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 55/102 (53%)
Query: 117 DLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 176
L++ + ++ + + + + + +S+ +G+K NGA L A +AN +GADLS +
Sbjct: 28 HLQQLLSTRKCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGAS 87
Query: 177 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 88 FFGANLTGANLSGAILTGADLRGAYLNNANLENTKLDTAYVQ 129
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+LS ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLEN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|158338433|ref|YP_001519610.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158308674|gb|ABW30291.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 219
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 11/105 (10%)
Query: 111 AQFGSADLRKAVHVKENFRA-----------NFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A F SAD RKA + + RA N A++ ++ SG+ +GA L A+ Y
Sbjct: 71 ANFASADFRKAKLFRADLRATCLYRADLRGANLRGANLFGANLSGANLSGANLSNAMLYC 130
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
AN GA+L T++D L AN ++ L +L + L G +G
Sbjct: 131 ANLGGANLRGTILDSANLMRANFSHGDLRNAILRNAKLQGTHFDG 175
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 22/84 (26%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A G A+LR + N RANF+ D+R + +K G + + + + +LS
Sbjct: 131 ANLGGANLRGTILDSANLMRANFSHGDLRNAILRNAKLQGTHFDGTRMLRTDLDEINLSK 190
Query: 170 TLMDRMVLNEANLTNAVLVRTVLT 193
T +D + L + +L N+ + +T
Sbjct: 191 TQIDGVHLMDIDLNNSAMENAAIT 214
>gi|149179551|ref|ZP_01858089.1| pentapeptide repeat domain protein [Planctomyces maris DSM 8797]
gi|148841608|gb|EDL56033.1| pentapeptide repeat domain protein [Planctomyces maris DSM 8797]
Length = 343
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 51/88 (57%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN AD+ + +G+ N A LE A ++NF+ ADL++T + L EAN NA L
Sbjct: 83 RANLQKADLTGGNLTGAILNEANLEAAYLNQSNFSHADLNETKLAHTKLMEANFFNADLR 142
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
+ L+ +DL GA ++ ++ S A + A+
Sbjct: 143 KADLSGADLRGANLKWSNLSGARLSAAE 170
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 11/92 (11%)
Query: 117 DLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTGA 165
DL KA ++N A+ +AD+R+++ GS +GAYL +A +AN A
Sbjct: 30 DLFKADLRRDNLSDLDLSEADLRNADLRDANLEGSDLSGAYLGQARLCQTNLCRANLQKA 89
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
DL+ + +LNEANL A L ++ + +DL
Sbjct: 90 DLTGGNLTGAILNEANLEAAYLNQSNFSHADL 121
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 8/89 (8%)
Query: 111 AQFGSADLR--KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ F ADL K H K ANF +AD+R++D SG+ GA L+ +N +GA LS
Sbjct: 114 SNFSHADLNETKLAHTKL-MEANFFNADLRKADLSGADLRGANLK-----WSNLSGARLS 167
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
+ + L E +L++A L + T + L
Sbjct: 168 AAELSKANLIETDLSDADLTEAIFTDAKL 196
>gi|39997499|ref|NP_953450.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
PCA]
gi|39984390|gb|AAR35777.1| pentapeptide repeat domain protein [Geobacter sulfurreducens PCA]
Length = 254
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 14/100 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
+ AQ A L +A+ F +ADMR + SG AY+ A AN +GAD+
Sbjct: 85 TGAQMDGASLDEAI---------FDTADMRSAHCSG-----AYIHHAKFVGANLSGADMR 130
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+++ ++ANLTNA L ++LGGA++ G +FS
Sbjct: 131 KVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFS 170
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A AD+RK V+V+ + NF+ A++ ++FSG+K A L AV NF+ ADLS T
Sbjct: 122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ + L AN A T+L + L GA + + F I
Sbjct: 178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSI 219
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 50/108 (46%), Gaps = 11/108 (10%)
Query: 111 AQFGSADLRKA------VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A F +AD+R A +H + AN + ADMR+ + F+ A L A NF+G
Sbjct: 97 AIFDTADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNA-----NFSG 151
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A L + VL N + A L T L DL GA GA F+ ++
Sbjct: 152 AKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLL 199
>gi|288920260|ref|ZP_06414574.1| pentapeptide repeat protein [Frankia sp. EUN1f]
gi|288348364|gb|EFC82627.1| pentapeptide repeat protein [Frankia sp. EUN1f]
Length = 287
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 59/132 (44%), Gaps = 11/132 (8%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
G+G A ADL N R A + AD+R +D G+ GA L A
Sbjct: 74 GVGRA----GADLAGRTFTGRNLRGADLRGAFLSGADLRGADLRGACLRGADLRDADLSS 129
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQ 218
A +GADL L+ L+ A+L A L R L R+DL GA + AD A ++ +
Sbjct: 130 AALSGADLHGALLVGTYLSRADLRGADLGRVYLRRADLRGAFLGRADLRGADAAEIVLRG 189
Query: 219 ALCKYANGTNPI 230
A+ + A T +
Sbjct: 190 AVLRGAEATGAV 201
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 16/123 (13%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
N A+ RG F S A ADLR A AD+R++D S + +GA L
Sbjct: 91 NLRGADLRGAFL--SGADLRGADLRGAC---------LRGADLRDADLSSAALSGADLHG 139
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSD 209
A+ + ADL + R+ L A+L A L R L +D L GA++ GA+ +
Sbjct: 140 ALLVGTYLSRADLRGADLGRVYLRRADLRGAFLGRADLRGADAAEIVLRGAVLRGAEATG 199
Query: 210 AVI 212
AV+
Sbjct: 200 AVL 202
>gi|307352983|ref|YP_003894034.1| pentapeptide repeat-containing protein [Methanoplanus petrolearius
DSM 11571]
gi|307156216|gb|ADN35596.1| pentapeptide repeat protein [Methanoplanus petrolearius DSM 11571]
Length = 165
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 10/86 (11%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
++A F D+R +D G F+ A+FTGADL+D + + A+L+ AVL
Sbjct: 77 YKAVFQGTDLRNADLHGGIFS----------LADFTGADLTDADLVGAAFDYADLSGAVL 126
Query: 188 VRTVLTRSDLGGAIIEGADFSDAVID 213
+ + +DL GA + GAD +DA+I+
Sbjct: 127 IGADMRYADLRGADLSGADLTDALIE 152
>gi|119486130|ref|ZP_01620190.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
gi|119456621|gb|EAW37750.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
Length = 207
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 24/137 (17%)
Query: 96 KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRA----------------NFTSAD 136
K A RG G+ A +ADLR A+ + + R + T D
Sbjct: 62 KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGALDLTGVD 121
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
+R +D G + A L++A N +GADLS + L EANL+ AVL T L R++
Sbjct: 122 LRGADLRGVSLSQAILQQADLRNTNLSGADLS-----QADLEEANLSGAVLRGTNLERAN 176
Query: 197 LGGAIIEGADFSDAVID 213
L AI+E + ++D
Sbjct: 177 LLCAIVEQTQWFGTILD 193
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 60/131 (45%), Gaps = 20/131 (15%)
Query: 116 ADLRKAVHVKENFRA------NFTSADMRESDFSG----------SKFNGAYLEKAVAYK 159
A+L++A ++ N R N AD+R +D G + F GA+L A
Sbjct: 56 ANLQRA-KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGA 114
Query: 160 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 216
+ TG DL + + L++A L A L T L+ +DL A +E A+ S AV+ +L +
Sbjct: 115 LDLTGVDLRGADLRGVSLSQAILQQADLRNTNLSGADLSQADLEEANLSGAVLRGTNLER 174
Query: 217 KQALCKYANGT 227
LC T
Sbjct: 175 ANLLCAIVEQT 185
>gi|119493532|ref|ZP_01624198.1| hypothetical protein L8106_18192 [Lyngbya sp. PCC 8106]
gi|119452649|gb|EAW33830.1| hypothetical protein L8106_18192 [Lyngbya sp. PCC 8106]
Length = 192
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 53/95 (55%), Gaps = 6/95 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN T AD+ +D SG+ +GA L A+ AN + ADLS + + R L A LT+A L
Sbjct: 80 ANLTRADLTGADLSGADLHGADLSGAILSGANLSYADLSKSTLFRAELLNATLTHANLKG 139
Query: 190 TVLTRSDLGGAIIEGADFSDA------VIDLAQKQ 218
L +++L GA+++ A F A V+ L ++Q
Sbjct: 140 ANLKQTNLEGAVVQDAVFVKAMGLAFEVVSLLKQQ 174
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 10/92 (10%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-SDTLMDRMV---------LNEA 180
NF D+ ++ S + +GA L +A ++AN A+L +L + + L+ A
Sbjct: 16 NFRDTDLFRAELSNANLSGANLFRANLFRANLFRANLLGVSLFNANLIGANLYCANLSGA 75
Query: 181 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
NL+ A L R LT +DL GA + GAD S A++
Sbjct: 76 NLSGANLTRADLTGADLSGADLHGADLSGAIL 107
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
F AN A++ ++ SG+ +GA L +A A+ +GADL + +L+ ANL+ A L
Sbjct: 58 FNANLIGANLYCANLSGANLSGANLTRADLTGADLSGADLHGADLSGAILSGANLSYADL 117
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
++ L R++L A + A+ A
Sbjct: 118 SKSTLFRAELLNATLTHANLKGA 140
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 5/83 (6%)
Query: 128 FRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 187
FRAN A++ ++ G A L A Y AN +GA+LS + R A+LT A L
Sbjct: 38 FRANLFRANLFRANLLGVSLFNANLIGANLYCANLSGANLSGANLTR-----ADLTGADL 92
Query: 188 VRTVLTRSDLGGAIIEGADFSDA 210
L +DL GAI+ GA+ S A
Sbjct: 93 SGADLHGADLSGAILSGANLSYA 115
>gi|418719603|ref|ZP_13278802.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
UI 09149]
gi|418737331|ref|ZP_13293728.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii
serovar Castellonis str. 200801910]
gi|421093686|ref|ZP_15554410.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
200801926]
gi|410363669|gb|EKP14698.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
200801926]
gi|410743646|gb|EKQ92388.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
UI 09149]
gi|410746525|gb|EKQ99431.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii
serovar Castellonis str. 200801910]
gi|456889646|gb|EMG00529.1| NifU-like N-terminal domain protein [Leptospira borgpetersenii str.
200701203]
Length = 263
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 141 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
DFSG+ A+L+ + ANF GA L + LN A+L NA L + L GA
Sbjct: 166 DFSGANLGHAFLQNSSFVGANFEGAKLRGSF-----LNNADLRNANFRGADLRWAKLAGA 220
Query: 201 IIEGADFSDAVID----LAQKQ 218
+EGADF+DA+ D L QKQ
Sbjct: 221 NVEGADFTDAIYDIGTRLDQKQ 242
>gi|381395251|ref|ZP_09920956.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
611]
gi|379329152|dbj|GAB56089.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
611]
Length = 258
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 60/116 (51%), Gaps = 10/116 (8%)
Query: 107 IGSAAQFGSADLR----KAVHVKENF--RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
IGS F AD+R K V + R+ T+ADMR DF G F+ A LE A A
Sbjct: 139 IGST--FIDADMRDSSLKNVRARSAMFTRSVLTNADMRWGDFEGVDFSNANLEGADLTMA 196
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
N GA+L+ + +L NL A+L T++ + + GA ++ DF+ +DL+Q
Sbjct: 197 NLRGANLTAANLKNAMLLYTNLEGAILNGTIMDGAQIVGANMKRVDFTK--VDLSQ 250
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 39/84 (46%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A AD+ ES+ + FN A L+ A G+ D M L +A+ R
Sbjct: 106 AQLLGADLSESNLRNANFNKAVLQYTGFIDATLIGSTFIDADMRDSSLKNVRARSAMFTR 165
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
+VLT +D+ EG DFS+A ++
Sbjct: 166 SVLTNADMRWGDFEGVDFSNANLE 189
>gi|159045175|ref|YP_001533969.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
gi|157912935|gb|ABV94368.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
Length = 245
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 53/110 (48%), Gaps = 11/110 (10%)
Query: 111 AQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
AQF A +R + + N R F AD+R + G L +A +A+ GADLS
Sbjct: 122 AQFSGARMRGILFDRTNARDTVFAGADLRAA-----SMVGVALPRATLTEADLGGADLSG 176
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 219
L AN NA LV VL +DL GA + GAD S+A + A QA
Sbjct: 177 AF-----LEGANFGNARLVGAVLREADLTGARLTGADLSEADLTGAVTQA 221
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 44/105 (41%), Gaps = 30/105 (28%)
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---------------DRM----- 175
D+ ++ +G+ AYL AV AN GADL D M DR
Sbjct: 83 DLAGAELAGADLRDAYLTYAVFDGANLEGADLRDAFMPFAQFSGARMRGILFDRTNARDT 142
Query: 176 -----VLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDA 210
L A++ L R LT +DLG GA +EGA+F +A
Sbjct: 143 VFAGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNA 187
>gi|315497235|ref|YP_004086039.1| pentapeptide repeat protein [Asticcacaulis excentricus CB 48]
gi|315415247|gb|ADU11888.1| pentapeptide repeat protein [Asticcacaulis excentricus CB 48]
Length = 224
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 48/95 (50%), Gaps = 10/95 (10%)
Query: 129 RANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLN 178
+A+FTSAD+ E+ DF+ + F G+ L + + TGAD S D + LN
Sbjct: 73 QADFTSADLTEAQFTACDFNNTPFKGSGLAQVRFLRCKLTGADFSHSRNMDVSFEDCRLN 132
Query: 179 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
+A L N V+ L DL A ++G DF AV +
Sbjct: 133 DARLKNFAFVKQTLKSLDLTNADLQGCDFRQAVFE 167
>gi|122920845|pdb|2J8K|A Chain A, Structure Of The Fusion Of Np275 And Np276, Pentapeptide
Repeat Proteins From Nostoc Punctiforme
Length = 201
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 65/147 (44%), Gaps = 37/147 (25%)
Query: 113 FGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 161
F DLR AV N AN A++ +D SG+ NGA L AN
Sbjct: 37 FSIVDLRGAVLENINLSGAILHGAMLDEANLQQANLSRADLSGATLNGADL-----RGAN 91
Query: 162 FTGADLSD----------TLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGAD 206
+ ADLSD ++D VLN+ANL A L + +L+ +DL A +E AD
Sbjct: 92 LSKADLSDAILDNAILEGAILDEAVLNQANLKAANLEQAILSHANIREADLSEANLEAAD 151
Query: 207 FSD---AVIDLAQ---KQALCKYANGT 227
S A+ DL Q QA + AN T
Sbjct: 152 LSGADLAIADLHQANLHQAALERANLT 178
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 57/106 (53%), Gaps = 16/106 (15%)
Query: 131 NFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
+F+ D+R + + SG+ +GA L++A +AN + ADLS LN A+L A
Sbjct: 36 DFSIVDLRGAVLENINLSGAILHGAMLDEANLQQANLSRADLSGA-----TLNGADLRGA 90
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYAN 225
L + L+ + L AI+EGA +AV++ A +QA+ +AN
Sbjct: 91 NLSKADLSDAILDNAILEGAILDEAVLNQANLKAANLEQAILSHAN 136
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 6/77 (7%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A +A+L +A+ N R AN +AD+ +D + + + A L +A +AN TG
Sbjct: 120 ANLKAANLEQAILSHANIREADLSEANLEAADLSGADLAIADLHQANLHQAALERANLTG 179
Query: 165 ADLSDTLMDRMVLNEAN 181
A+L D ++ +L N
Sbjct: 180 ANLEDANLEGTILEGGN 196
>gi|86606920|ref|YP_475683.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555462|gb|ABD00420.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 154
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 164
S AQ A+LR V A+ + AD+RE D SG+ +GA L A + N G
Sbjct: 32 SGAQLSGANLRGIVLRD----ADLSGADLREGDLSGADLSGADLRGAKLRRVNLIGAKLV 87
Query: 165 -ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
ADL + R L A+L+ A L R L +DL GAII F A+ D K
Sbjct: 88 KADLRGANLYRAKLLRADLSEADLSRADLRIGADLRGAIITNTRFRGALYD-----EYTK 142
Query: 223 YANGTNP 229
+ G NP
Sbjct: 143 FPEGFNP 149
>gi|398337534|ref|ZP_10522239.1| hypothetical protein LkmesMB_19432 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 263
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 4/92 (4%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ +S + + +F G F+GA L A ++F GA+ S + LN A+L N+
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSGAKLRGSFLNNADLRNSNFRGA 210
Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
L + L GA +EGADF+DA+ D L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|162450992|ref|YP_001613359.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
gi|161161574|emb|CAN92879.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
Length = 579
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 6/102 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRAN-FTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 162
+ A+ A+LR+A+ R AD+ ++D G+ GA LE+A+ AN
Sbjct: 286 TGAELTGANLRRALLQGAILRGQRLAGADLEMTLLVDADLEGADLQGARLERAILDGANL 345
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
GADL+ L+ + +L A L +L + + R DL G ++G
Sbjct: 346 RGADLTRALLLQTLLRGAALDGVILDKAIFDRVDLTGTDLQG 387
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A ++ + G + GA LE + A+ GADL ++R +L+ ANL A L R
Sbjct: 293 ANLRRALLQGAILRGQRLAGADLEMTLLVDADLEGADLQGARLERAILDGANLRGADLTR 352
Query: 190 TVLTRSDLGGAIIEGADFSDAVID 213
+L ++ L GA ++G A+ D
Sbjct: 353 ALLLQTLLRGAALDGVILDKAIFD 376
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A FT +D+R + G+ +GA L +A A+ GADL+ TL+ L A LT A L R
Sbjct: 79 ATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGAKLDR 138
Query: 190 TVLT-----RSDLGGAIIEGADFSDA 210
L ++L GA+++GA + A
Sbjct: 139 IRLDFAKLPGAELAGAVLQGASLNKA 164
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 17/110 (15%)
Query: 112 QFGSADLR----KAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
Q G A LR K +H+ E A+ +D++++ + GA L++ A FTG+DL
Sbjct: 30 QLGGARLRGAKLKDIHLDE---ADLAGSDLQDTQWFRCPLRGASLDRCDLRGATFTGSDL 86
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 212
L ANL+ A L+R L +DL GA ++ GAD + A +
Sbjct: 87 RGA-----RLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARL 131
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 1/97 (1%)
Query: 117 DLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
DL + + E F T D+R G++ GA L+ +A+ G+DL DT R
Sbjct: 5 DLARRLRAGEPFAGKTITRFDLRGKQLGGARLRGAKLKDIHLDEADLAGSDLQDTQWFRC 64
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L A+L L T SDL GA + GA+ S A +
Sbjct: 65 PLRGASLDRCDLRGATFTGSDLRGARLRGANLSGAKL 101
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 16/112 (14%)
Query: 117 DLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
DLR A + R AN + A + ++ +G+ GA L + A+ TGA L+
Sbjct: 75 DLRGATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGA 134
Query: 171 LMDRMVLNEANLTNAVLVRTV----------LTRSDLGGAIIEGADFSDAVI 212
+DR+ L+ A L A L V LTR+ L A I G+ F DA +
Sbjct: 135 KLDRIRLDFAKLPGAELAGAVLQGASLNKADLTRALLRDARITGSTFYDARL 186
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 16/119 (13%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFS----------GSKFNGAYLEK 154
A F +DLR A N RAN AD+ +D + G++ GA L++
Sbjct: 79 ATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGAKLDR 138
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
A GA+L+ ++ LN+A+LT A+L +T S A + GAD A ++
Sbjct: 139 IRLDFAKLPGAELAGAVLQGASLNKADLTRALLRDARITGSTFYDARLGGADLGGATLE 197
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 14/89 (15%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL- 187
+A+ T A +R++ +GS F Y A GADL ++++VL A+L A+L
Sbjct: 163 KADLTRALLRDARITGSTF----------YDARLGGADLGGATLEKVVLVRADLRGAILP 212
Query: 188 ---VRTVLTRSDLGGAIIEGADFSDAVID 213
R+VL + L + GAD + + +D
Sbjct: 213 KSMTRSVLDEARLDRPDLSGADLAASELD 241
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 48/102 (47%), Gaps = 4/102 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A+ DLR+A +NFT AD+R +D S A L +A +A+ +GA +
Sbjct: 403 AKLAGMDLREADFTG----SNFTRADLRGADLRSSVLTRATLMEADLARADLSGATAKEA 458
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
L A +A L R TR+DL A + GAD D V+
Sbjct: 459 FFGDAALAGARARDARLRRATFTRADLDHADLSGADLGDVVM 500
>gi|448412419|ref|ZP_21576534.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
gi|445668180|gb|ELZ20811.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
Length = 561
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+A S D A +FR A +A++R++D G+ F GA L A A+ TGA+
Sbjct: 251 TAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTGANF 310
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 215
D A+LT+A L L+ +DL A + GAD DA + A
Sbjct: 311 RD----------ADLTDAHLRGADLSEADLKDATLCGADLKDATLTRA 348
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 75/161 (46%), Gaps = 20/161 (12%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADM-----RESDFSGSKFNGAYLEKAVAYK 159
A +A+LR A V +F+ A+ T+AD+ R++D + + GA L +A
Sbjct: 273 AGLQNAELRDADLVGADFQGADLRNASLTNADLTGANFRDADLTDAHLRGADLSEADLKD 332
Query: 160 ANFTGADLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
A GADL D + R L EA L NA L L R DL A + AD + DL
Sbjct: 333 ATLCGADLKDATLTRASLWNSDLTEAYLRNADLSDGYLRRVDLTDADLPAADLTG---DL 389
Query: 215 AQKQALCK-YANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 254
+ +L + ++ I+ + R+SL C ++ G P++
Sbjct: 390 NARCSLGRTFSMPRCAISDHTGRRSLTCRSTSARPSGRPTT 430
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
A LR K++ A +RE+D SG+ G+ L+ A+ A+ DL+ M
Sbjct: 127 AQLRGVALPKQSL---LERAVLREADLSGANLAGSTLKGAILTDASLREVDLTGADMMGA 183
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYAN 225
VL EA+LT+ L + ++ + GAI++ A+ A + DL +A+ K A
Sbjct: 184 VLVEADLTSGTLAQLSGDKAVMRGAILKDANLERAHLWDLTAPEAVFKRAT 234
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 40/94 (42%), Gaps = 15/94 (15%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA A MR++ G+ F LE +F GA L+D R L A L +A LV
Sbjct: 232 RATLCEATMRDAVLPGASFTAGTLESV-----DFGGATLTDASFRRAGLQNAELRDADLV 286
Query: 189 ----------RTVLTRSDLGGAIIEGADFSDAVI 212
LT +DL GA AD +DA +
Sbjct: 287 GADFQGADLRNASLTNADLTGANFRDADLTDAHL 320
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 36/81 (44%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+FT+ + DF G+ A +A A ADL L A+LTNA L
Sbjct: 248 ASFTAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTG 307
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
+DL A + GAD S+A
Sbjct: 308 ANFRDADLTDAHLRGADLSEA 328
>gi|410449702|ref|ZP_11303755.1| NifU-like N-terminal domain protein [Leptospira sp. Fiocruz LV3954]
gi|421111700|ref|ZP_15572173.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
JET]
gi|410016459|gb|EKO78538.1| NifU-like N-terminal domain protein [Leptospira sp. Fiocruz LV3954]
gi|410802896|gb|EKS09041.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
JET]
gi|456874476|gb|EMF89769.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
ST188]
Length = 263
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ +S + + +F G F+GA L A ++F GA+ S + LN A+L N
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNADLRNTNFRGA 210
Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
L + L GA +EGADF+DA+ D L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|307152112|ref|YP_003887496.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982340|gb|ADN14221.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 180
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 1/87 (1%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADL KA V N N AD+RE++ SG+ A L A AN TGA+L +
Sbjct: 60 ANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGANLRE 119
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSD 196
+D L ANLT+A ++ T L +D
Sbjct: 120 VNLDGANLMGANLTDAQIINTDLNMAD 146
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 66/128 (51%), Gaps = 7/128 (5%)
Query: 87 NISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGS 145
NI L L +Y+A+ R +F + A+L A + N RA+ + AD+ E+D SG+
Sbjct: 2 NIQEL--LKRYKAKER-DF---QGSNLHQANLEGANLQRINLTRADLSGADLSEADLSGA 55
Query: 146 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
A L A KA+ GA+L + + L EANL+ A L + L ++L GA + GA
Sbjct: 56 CLMQANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGA 115
Query: 206 DFSDAVID 213
+ + +D
Sbjct: 116 NLREVNLD 123
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 53/112 (47%), Gaps = 12/112 (10%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 160
TR + S A ADL A ++ AN T AD+ ++ G+ L A +A
Sbjct: 38 TRADL---SGADLSEADLSGACLMQ----ANLTDADLLKAHLVGANLVEINLIGADLREA 90
Query: 161 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
N +GADL+ + L ANLT A L L +L GA + GA+ +DA I
Sbjct: 91 NLSGADLT-----KADLRCANLTGANLTGANLREVNLDGANLMGANLTDAQI 137
>gi|148972698|ref|ZP_01811409.1| hypothetical protein LVAL_00031 [Leptolyngbya valderiana BDU 20041]
gi|148872721|gb|EDL71121.1| hypothetical protein LVAL_00031 [Leptolyngbya valderiana BDU 20041]
Length = 170
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 55/98 (56%), Gaps = 7/98 (7%)
Query: 132 FTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
FT AD+ R++D SG+ G LE+A KA +GAD S +++R +L EA+L +
Sbjct: 5 FTDADLYGALLRDADLSGAHLVGVRLERANLIKAILSGADFSRAVLERALLIEADLRSTA 64
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA--LCK 222
RT L ++L A + A ++A+++ A + LC+
Sbjct: 65 DQRTTLREANLREADLSYAHLNEAILEGANLEGAKLCR 102
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 54/125 (43%), Gaps = 29/125 (23%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRAN------FTSADMRESDFSGSKFNGAYLEKAVAY-- 158
I S A F A L +A+ ++ + R+ A++RE+D S + N A LE A
Sbjct: 39 ILSGADFSRAVLERALLIEADLRSTADQRTTLREANLREADLSYAHLNEAILEGANLEGA 98
Query: 159 ---------------------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 197
AN GADLS +L ANL +A L RTV R+DL
Sbjct: 99 KLCRANLSSEAGTDALPTDLSNANLRGADLSYADFSGAILRNANLRDADLTRTVFDRTDL 158
Query: 198 GGAII 202
GAI+
Sbjct: 159 TGAIL 163
>gi|114799805|ref|YP_760951.1| pentapeptide repeat-containing protein [Hyphomonas neptunium ATCC
15444]
gi|114739979|gb|ABI78104.1| pentapeptide repeat domain protein [Hyphomonas neptunium ATCC
15444]
Length = 245
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 54/101 (53%), Gaps = 11/101 (10%)
Query: 116 ADLRKAVHVKENF-RANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
ADLR A F A F +A M++ +DFS ++ GA LEKA NF GA L
Sbjct: 88 ADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFEGASL-- 145
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
L R L A+L+ A T+L R++L G I +GA+ S+A
Sbjct: 146 -LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183
>gi|428225932|ref|YP_007110029.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427985833|gb|AFY66977.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 180
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 56/118 (47%), Gaps = 26/118 (22%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEK 154
N Y A+ RG + G A+LR+A N AD+++S+ G+ A L
Sbjct: 69 NLYSAKLRG-------SDLGLANLREA---------NLGDADLKQSNLRGADLRNANLLG 112
Query: 155 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A +A+ GADL D ANLTNA L L +++L GA++ G F AV+
Sbjct: 113 ASLIEADLRGADLRD----------ANLTNANLDGADLRQTNLQGAVLTGVSFRGAVL 160
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 62/136 (45%), Gaps = 19/136 (13%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD----------LSDTLMDRMVLNEAN 181
AD+R +G K A L K+ Y A G+D L D + + L A+
Sbjct: 45 LRKADLRYFQLNGVKLLAANLSKSNLYSAKLRGSDLGLANLREANLGDADLKQSNLRGAD 104
Query: 182 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTNPITGVSTRKSLG 240
L NA L+ L +DL GA + A+ ++A +D A +Q + A +TGVS R ++
Sbjct: 105 LRNANLLGASLIEADLRGADLRDANLTNANLDGADLRQTNLQGA----VLTGVSFRGAVL 160
Query: 241 CGNSRRNA----YGSP 252
CG + N YG P
Sbjct: 161 CGATMPNGLAARYGCP 176
>gi|15965782|ref|NP_386135.1| signal peptide protein [Sinorhizobium meliloti 1021]
gi|334316724|ref|YP_004549343.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
AK83]
gi|384529911|ref|YP_005713999.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
BL225C]
gi|384535747|ref|YP_005719832.1| hypothetical protein SM11_chr1295 [Sinorhizobium meliloti SM11]
gi|407720970|ref|YP_006840632.1| signal peptide protein [Sinorhizobium meliloti Rm41]
gi|418401673|ref|ZP_12975198.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
CCNWSX0020]
gi|433613810|ref|YP_007190608.1| putative low-complexity protein [Sinorhizobium meliloti GR4]
gi|15075051|emb|CAC46608.1| Hypothetical protein signal peptide [Sinorhizobium meliloti 1021]
gi|333812087|gb|AEG04756.1| pentapeptide repeat protein [Sinorhizobium meliloti BL225C]
gi|334095718|gb|AEG53729.1| pentapeptide repeat protein [Sinorhizobium meliloti AK83]
gi|336032639|gb|AEH78571.1| hypothetical protein signal peptide [Sinorhizobium meliloti SM11]
gi|359504345|gb|EHK76882.1| pentapeptide repeat-containing protein [Sinorhizobium meliloti
CCNWSX0020]
gi|407319202|emb|CCM67806.1| signal peptide protein [Sinorhizobium meliloti Rm41]
gi|429552000|gb|AGA07009.1| putative low-complexity protein [Sinorhizobium meliloti GR4]
Length = 241
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 2/90 (2%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+F SA+++ +DF+G++ GA EKA +ANF A L+ T L+ A L+ AVL
Sbjct: 124 ASFASAELQRTDFTGARLTGADFEKAELGRANFDKAVLTGTRFSMANLSRAKLSGAVLEG 183
Query: 190 TV-LTRSDLGGAIIEGADFSDAVIDLAQKQ 218
+ L R+ L IEG D S A L Q+Q
Sbjct: 184 PIDLDRAFLFLTRIEGVDLSSAS-GLTQEQ 212
>gi|443324425|ref|ZP_21053179.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442795970|gb|ELS05303.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 305
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 71/136 (52%), Gaps = 10/136 (7%)
Query: 111 AQFGSADLRKAVHVKE-NFRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 164
A +A+L+ AV + AN ++AD+ ++ D S + GA L A ANF+
Sbjct: 71 ADLATANLQAAVLIGICLIEANLSNADLSDAYLMDGDLSNANLIGADLRDANCDHANFSN 130
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 224
A+L TLM ++ L ANLT A L RT L+ ++L A + AD S+A +L + + L +
Sbjct: 131 ANLIGTLMRKVRLRHANLTGAKLQRTNLSEAELIEAHLSEADLSNA--NLYEAELLNIFG 188
Query: 225 NGTN--PITGVSTRKS 238
TN + ++T S
Sbjct: 189 YKTNFCRVQAIATHMS 204
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 60/125 (48%), Gaps = 12/125 (9%)
Query: 91 LADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF 147
L++ N YEAE FG + Q + + +A F+ANF+ A++ + D
Sbjct: 173 LSNANLYEAELLNIFGYKTNFCRVQAIATHMSRAYL----FQANFSEAELIKIDLRW--- 225
Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A ++A AN ADL T +++ L +ANLT A L L +DL GA + A+
Sbjct: 226 --ANCDRANFRNANLQQADLRGTNLNQADLKQANLTRANLRGANLNHADLRGANLTDANI 283
Query: 208 SDAVI 212
DA+
Sbjct: 284 QDAIF 288
>gi|428218533|ref|YP_007102998.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990315|gb|AFY70570.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 348
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 16/116 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
SA A+L A ++ N AN A+ E++ S + N AYL KA + AN T A+L
Sbjct: 47 SAVNLRGANLSMANLIRANLSGANLIEANFDEANLSMAYLNCAYLNKAYLHGANLTWANL 106
Query: 168 SDTLMDRMVLNEANLTNAVLVRT---------------VLTRSDLGGAIIEGADFS 208
S + + +EANL+ AVL T L+ +DLGGA + GA+ S
Sbjct: 107 SQSCLIDTDASEANLSGAVLSGTDAYGSNFSGANLSEAYLSVADLGGANLHGANLS 162
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 21/114 (18%)
Query: 115 SADLRKAVHVKENF-RANFTS-----ADMRESDFSGSKFNGA---------------YLE 153
+AD+R A ++ + RA+ T AD+ ++ G++ +GA +LE
Sbjct: 208 AADIRGASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLE 267
Query: 154 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
A Y A+ ADLS ++ LNEA L A+L T L +DL GA + GA+
Sbjct: 268 AANLYGASLYSADLSQANLNAAYLNEAFLFGAILKWTNLADADLSGAHLGGANL 321
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 111 AQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 164
A A+L + NF RAN ++A+M +++ + SKF A L++A Y A+ G
Sbjct: 154 ANLHGANLSSVYAIATNFERANLSNANMSKANCAKSKFGSAILDRANLSMSYLYAADIRG 213
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
A L +T + R L + +L A L L ++L GA + A+ A + L+ +A Y
Sbjct: 214 ASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLEAANLY 272
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 9/109 (8%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 165
G+ + Q+ SA+ R V + A+ TS D+ ++D S GA L A +AN +GA
Sbjct: 13 GVSTWNQWRSANSRIQVDLT---GADLTSVDLLDADLSAVNLRGANLSMANLIRANLSGA 69
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 213
+L + D EANL+ A L L ++ L GA + A+ S + +ID
Sbjct: 70 NLIEANFD-----EANLSMAYLNCAYLNKAYLHGANLTWANLSQSCLID 113
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 38/77 (49%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 191
+ D S+FSG+ + AYL A AN GA+LS ANL+NA + +
Sbjct: 126 LSGTDAYGSNFSGANLSEAYLSVADLGGANLHGANLSSVYAIATNFERANLSNANMSKAN 185
Query: 192 LTRSDLGGAIIEGADFS 208
+S G AI++ A+ S
Sbjct: 186 CAKSKFGSAILDRANLS 202
>gi|159030580|emb|CAO88243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 354
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 1/133 (0%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMR 138
+ A+ S + LA L + + T AA+ L A + N R AN T AD+
Sbjct: 204 IYAAVSDDFLELAQLAELDPLTDFTGANLLAAELSGISLGMANLYQANLRGANLTDADLS 263
Query: 139 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 198
E + S + F GA L A+ A+ + AD + + L +NLT A LV +T+++L
Sbjct: 264 EINGSHASFKGADLSGALLANADLSYADFYRSSLALANLIGSNLTGANLVEVNITQANLS 323
Query: 199 GAIIEGADFSDAV 211
GA ++GA F+D V
Sbjct: 324 GAKVQGAKFADNV 336
>gi|381204843|ref|ZP_09911914.1| hypothetical protein SclubJA_04390 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 214
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 50/106 (47%), Gaps = 11/106 (10%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A LRK K + R A+ AD+R D SG+ + A L A KAN TG
Sbjct: 40 ANLSGATLRKVNLNKSSLRQATLKEASLVGADLRRVDLSGANLSNANLVGANLRKANLTG 99
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ADLS L+ ANLT AVL LT ++L G + GA A
Sbjct: 100 ADLSGA-----KLSNANLTGAVLSSANLTGTNLLGVELIGAKLERA 140
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 55/125 (44%), Gaps = 15/125 (12%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
A+L A V N R AN T AD+ + S + GA L A N G +L
Sbjct: 77 LSGANLSNANLVGANLRKANLTGADLSGAKLSNANLTGAVLSSANLTGTNLLGVELIGAK 136
Query: 172 MDR-----MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DLAQKQ 218
++R +L ANL+ LV + T +DL A + GA D + +L++KQ
Sbjct: 137 LERANARGAILKNANLSMTNLVLSNFTEADLSNANLSGAKLIDTDLTRATLRNANLSRKQ 196
Query: 219 ALCKY 223
LC+
Sbjct: 197 -LCRV 200
>gi|282898833|ref|ZP_06306820.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281196360|gb|EFA71270.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 189
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 7/86 (8%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+F AD+ SD +G +GA L +AN GA L + + ++L A+LT A+L+
Sbjct: 36 DFARADLSWSDLTGISLSGANLS-----QANLRGAKLENAHLSEVILCGADLTQAILINA 90
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLAQ 216
L SDL GA++ A+ DA DL Q
Sbjct: 91 HLNESDLSGALLVDANLCDA--DLHQ 114
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 11/98 (11%)
Query: 111 AQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A +DL A+ V N +A+ T+A+++ + +G+K G + KA A+ TG
Sbjct: 90 AHLNESDLSGALLVDANLCDADLHQASITAANLQSAKLNGAKMGGVRMWKADLQGADLTG 149
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
ADLS+ M + L+ ANL+ + T LT GAI+
Sbjct: 150 ADLSEANMCGVNLSMANLSATDMSETFLT-----GAIM 182
>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 231
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 60/107 (56%), Gaps = 6/107 (5%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGA 165
QF + ++A +K N A+F+ AD R S +F+ + F GA L +A+ + +FTGA
Sbjct: 16 QFKTCKFQEAELIKVNLSGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGA 75
Query: 166 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+L ++ + L+ A L+ A L L ++ LGGA + A+ +A++
Sbjct: 76 NLEKAILREVELSGAILSQANLTGVNLMKATLGGANLSLANLREAIL 122
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 54/109 (49%), Gaps = 14/109 (12%)
Query: 111 AQFGSADLRKAVHVKENFR------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
A A+LR+A+ + +FR N AD+ E+D S +K NG L +A A
Sbjct: 110 ANLSLANLREAILYEADFRPTSEHITNLQQADLSEADLSYAKLNGVNLRQAKLMGAKLCR 169
Query: 165 ADLSDTLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
ADLS + + L EANL NA L+ +DL GAI+ AD + A
Sbjct: 170 ADLSKGIWQNSLPTDLCEANLRNA-----DLSYADLSGAILSYADLTGA 213
>gi|425455658|ref|ZP_18835373.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
9807]
gi|389803408|emb|CCI17656.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
9807]
Length = 354
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 55/103 (53%), Gaps = 1/103 (0%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
AA+ L A + N R AN T AD+ E + S + F GA L A+ A+ + AD
Sbjct: 234 AAELSGISLGMANLYQANLRGANLTDADLSEINGSHASFKGADLSGALLANADLSYADFY 293
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
+ + L +NLT A LV +T+++L GA ++GA F+D V
Sbjct: 294 RSSLALANLIGSNLTGANLVEVNITQANLSGAKVQGAKFADNV 336
>gi|383501588|ref|YP_005414947.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
gi|378932599|gb|AFC71104.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
Length = 960
Score = 48.9 bits (115), Expect = 0.003, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 7/116 (6%)
Query: 112 QFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
+ +ADL KA K N A+ T+A + + K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFVKLSNATLEKAEA-----EGLNISDV 609
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 225
+ + EAN N ++ R LT++D A++E AD +D K+A K AN
Sbjct: 610 IAKNINAKEANFKNVIMQRADLTKADFTKAVLENADMQAVEALDAIFKEATLKQAN 665
Score = 37.7 bits (86), Expect = 5.8, Method: Composition-based stats.
Identities = 43/169 (25%), Positives = 72/169 (42%), Gaps = 17/169 (10%)
Query: 51 PDCSNNQCAGPYA---KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGI 107
PD S+ +G LKN +F S L +++C+ + + N A +
Sbjct: 342 PDLSDINLSGKTLTNLNLKN-TLFASANLENINISNCNLDFTNFEGANLQNAVFQDVTAR 400
Query: 108 GSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYK------- 159
+ F ADL+K+ + + RA D+ E++ + SKFN + A A K
Sbjct: 401 NTGFLF--ADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIMQDSE 458
Query: 160 ---ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 205
+N TG L+ M R+ + L NA+L + + +DL A + A
Sbjct: 459 WKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIISTDLENAFMNNA 507
>gi|379720162|ref|YP_005312293.1| hypothetical protein PM3016_2256 [Paenibacillus mucilaginosus 3016]
gi|378568834|gb|AFC29144.1| hypothetical protein PM3016_2256 [Paenibacillus mucilaginosus 3016]
Length = 288
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
+ FT + + SDFSG+ G+ + + +ANF GA+L+D + + L A+ +LV
Sbjct: 100 KGRFTGSALHGSDFSGADLTGSSFKSSDVREANFDGANLTDCSLSTLDLANASFHKTILV 159
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDL 214
RT ++S L GA G +D + +
Sbjct: 160 RTNFSKSGLDGAQFTGVRLTDVTLTM 185
>gi|218247899|ref|YP_002373270.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218168377|gb|ACK67114.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 222
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 44/74 (59%)
Query: 137 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 196
+ +++ S + +G L A +AN TGADLS+ +M + L+EANLT+A L L +
Sbjct: 65 LLDANLSNANLSGTLLNDAKLTRANLTGADLSNAIMMGITLSEANLTDANLTHADLYNAL 124
Query: 197 LGGAIIEGADFSDA 210
+ AI+ GA +DA
Sbjct: 125 MSKAILSGATLTDA 138
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 46/83 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A T A++ +D S + G L +A AN T ADL + LM + +L+ A LT+A L
Sbjct: 83 AKLTRANLTGADLSNAIMMGITLSEANLTDANLTHADLYNALMSKAILSGATLTDADLES 142
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
V++ +DL AI + A + A++
Sbjct: 143 AVISDADLTHAIAQNAILNQAIL 165
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 15/92 (16%)
Query: 129 RANFTSADMR----------ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
RAN T AD+ E++ + + A L A+ KA +GA L+D ++ V++
Sbjct: 87 RANLTGADLSNAIMMGITLSEANLTDANLTHADLYNALMSKAILSGATLTDADLESAVIS 146
Query: 179 EANLT-----NAVLVRTVLTRSDLGGAIIEGA 205
+A+LT NA+L + +L+RS+L GA
Sbjct: 147 DADLTHAIAQNAILNQAILSRSNLSDGDFSGA 178
>gi|359685228|ref|ZP_09255229.1| hypothetical protein Lsan2_11384 [Leptospira santarosai str.
2000030832]
Length = 263
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
+ +S + + +F G F+GA L A ++F GA+ S + LN A+L N
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNADLRNTNFRGA 210
Query: 191 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 218
L + L GA +EGADF+DA+ D L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|194336315|ref|YP_002018109.1| pentapeptide repeat-containing protein [Pelodictyon
phaeoclathratiforme BU-1]
gi|194308792|gb|ACF43492.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 441
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 1/109 (0%)
Query: 109 SAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQ ADL +A ++ + ANF A++ +++ S + +GA L A A+ +GA L
Sbjct: 59 SGAQLNMADLNRADLNGAHLYNANFGKANLIKTNLSKANLSGATLWDANLSGADLSGAQL 118
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
++ L ANLT A L LTR++L G A FS A +D Q
Sbjct: 119 ICAILTNATLTGANLTEACLNSADLTRANLIGGDFTRASFSGATLDEVQ 167
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 50/89 (56%), Gaps = 6/89 (6%)
Query: 115 SADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
SADL +A + +F RA+F+ A + E +G+ A+L +A Y+++ +GA+L ++
Sbjct: 140 SADLTRANLIGGDFTRASFSGATLDEVQLAGADLTMAFLGQAKLYRSDLSGANLCGAKLN 199
Query: 174 RMVLNEANLTNA-----VLVRTVLTRSDL 197
R L EANL+ A ++ T+ DL
Sbjct: 200 RATLIEANLSKADMHGVIIWHTIFVNVDL 228
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 68/132 (51%), Gaps = 20/132 (15%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNG 149
+ADLN+ A+ G A +A+ KA +K N +AN + A + +++ SG+ +G
Sbjct: 65 MADLNR--ADLNG-------AHLYNANFGKANLIKTNLSKANLSGATLWDANLSGADLSG 115
Query: 150 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----G 204
A L A+ A TGA+L++ LN A+LT A L+ TR+ GA ++ G
Sbjct: 116 AQLICAILTNATLTGANLTEA-----CLNSADLTRANLIGGDFTRASFSGATLDEVQLAG 170
Query: 205 ADFSDAVIDLAQ 216
AD + A + A+
Sbjct: 171 ADLTMAFLGQAK 182
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 63/128 (49%), Gaps = 22/128 (17%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 163
E +GS ++ +A RKA + R N AD+ SG++ N A L + AN
Sbjct: 5 ELLLGSVTEWNAA--RKA---HQKGRPNLKGADL-----SGAQLNKADLSRTDLVGANLR 54
Query: 164 GADLSDTLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEGADFSDAVID 213
GADLS ++ LN A+L A L++T L++++L GA + A+ S A D
Sbjct: 55 GADLSGAQLNMADLNRADLNGAHLYNANFGKANLIKTNLSKANLSGATLWDANLSGA--D 112
Query: 214 LAQKQALC 221
L+ Q +C
Sbjct: 113 LSGAQLIC 120
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 11/114 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S AQ ADL + V N R A+ + A + +D + + NGA+L Y ANF A+L
Sbjct: 34 SGAQLNKADLSRTDLVGANLRGADLSGAQLNMADLNRADLNGAHL-----YNANFGKANL 88
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 221
T L++ANL+ A L L+ +DL GA + A ++A + A C
Sbjct: 89 IKT-----NLSKANLSGATLWDANLSGADLSGAQLICAILTNATLTGANLTEAC 137
>gi|239947676|ref|ZP_04699429.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
scapularis]
gi|239921952|gb|EER21976.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
scapularis]
Length = 953
Score = 48.9 bits (115), Expect = 0.003, Method: Composition-based stats.
Identities = 48/178 (26%), Positives = 77/178 (43%), Gaps = 23/178 (12%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L+N + + AL A C N+ + N Y ++T E + ADLR+A+
Sbjct: 494 LENAFMNKTHALEAKFKEQC--NMQGITARNAYFSDTEFE----NILSLKEADLREAIMQ 547
Query: 125 KENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 173
+ + A AD+ + + + A L A KA G ++SD +
Sbjct: 548 RVKLKNADLTKAKLDKAKLEYADLTNATLTNATAQFAKLSNATLEKAEAEGLNISDAIAK 607
Query: 174 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYAN 225
+ EAN NA++ R LT++D A++E AD ++A+ A KQA K AN
Sbjct: 608 NINAKEANFKNAIMQRADLTKADFTKAVLENADMQAMEAAEAIFKEANLKQANLKVAN 665
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 4/107 (3%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A+ +A L KA E N + A + + + F A +++A KA+FT A L +
Sbjct: 584 AKLSNATLEKA----EAEGLNISDAIAKNINAKEANFKNAIMQRADLTKADFTKAVLENA 639
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 217
M M EA A L + L ++L G EGADF A ID A K
Sbjct: 640 DMQAMEAAEAIFKEANLKQANLKVANLAGINKEGADFDKAKIDDATK 686
>gi|218441428|ref|YP_002379757.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218174156|gb|ACK72889.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 362
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S Q G A+L H+ N R A T AD+ E+D + K +GA L A AN + +DL
Sbjct: 245 SGVQLGGANL---YHI--NLRGAVLTDADLGEADLNHGKLSGADLSGAYLGNANLSYSDL 299
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 209
+ L A+L A L L++++L GAI+EG F+D
Sbjct: 300 HKASLALTNLIGADLRGANLTEVNLSQANLSGAIVEGTRFAD 341
>gi|119486749|ref|ZP_01620724.1| hypothetical protein L8106_10882 [Lyngbya sp. PCC 8106]
gi|119456042|gb|EAW37175.1| hypothetical protein L8106_10882 [Lyngbya sp. PCC 8106]
Length = 160
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 52/98 (53%)
Query: 116 ADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 175
AD+R+ + K + + + AD+ E++ +G+ GA L+K + A GADLS +
Sbjct: 26 ADMRRLLDTKRCQQCDLSEADLSEAELTGADLLGANLQKTILRGAKLKGADLSSANLIEA 85
Query: 176 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
L A+L +A L T L +++L A + AD A ++
Sbjct: 86 DLTGADLRDAKLHSTTLRKANLSAANLTWADLYRAFLE 123
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
+ V T L+ A +++ L D + + E + S A+ ADL A K R
Sbjct: 10 LLVLTGLSIPASAELQADMRRLLDTKRCQQCDLSEADL-SEAELTGADLLGANLQKTILR 68
Query: 130 ------ANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
A+ +SA++ E+D +G+ K + L KA AN T ADL ++ + N
Sbjct: 69 GAKLKGADLSSANLIEADLTGADLRDAKLHSTTLRKANLSAANLTWADLYRAFLEEAIFN 128
Query: 179 EANLTNAVLVRTVLTRSDLGGAII 202
+ANL NA L L ++ GA +
Sbjct: 129 DANLENANLNDAKLDGTNFCGATM 152
>gi|428299465|ref|YP_007137771.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428236009|gb|AFZ01799.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 731
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
N TSA + +SDF+GS F+GA L K ANFTGAD+S L + + AN TNA
Sbjct: 253 NLTSAKLVDSDFTGSNFSGAKLINTDLSKTNLTNANFTGADMSGVLTTDAIASGANFTNA 312
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
L L++ + A GA+ + A
Sbjct: 313 NLSNANLSKGNFTDATFFGANLTGA 337
>gi|307155293|ref|YP_003890677.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985521|gb|ADN17402.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 145
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ AD+R ++ SG+ A LE A AN GADL+ ++ LN +NL +
Sbjct: 51 AHLIGADLRNANLSGANLVEANLEGADLTGANLQGADLTGAMVTNASLNNSNLKDVNFTN 110
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
+L +D+ GA++EG + +A I
Sbjct: 111 AMLYDADVTGALMEGLNLKNAQI 133
>gi|193083812|gb|ACF09494.1| pentapeptide repeat protein [uncultured marine crenarchaeote
SAT1000-23-F7]
Length = 741
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 61/110 (55%), Gaps = 14/110 (12%)
Query: 127 NFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NFR +NFTS ++ ++F+ +GA L + TGADL + L+ A+L+N
Sbjct: 495 NFRESNFTSTNIANANFTSVNLSGADLSMKDLTENILTGADLRNA-----NLSGADLSNN 549
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA----VID---LAQKQALCKYANGTN 228
LV T+LT +DL AI+ GAD S A +ID + QK L K AN TN
Sbjct: 550 QLVNTILTGADLTDAILSGADLSTANIFGIIDGINILQKTKL-KGANFTN 598
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N D+ E+ G+ G LEKA +N DLS + ++ L ++NL+ RT
Sbjct: 605 NLIGVDISETILKGADLTGVKLEKAKVNNSNLEDLDLSFKNLSKIRLVDSNLS-----RT 659
Query: 191 VLTRSDLGGAIIEGADFSDA 210
+L+ +DL A + GA+ SDA
Sbjct: 660 ILSGADLSNAELMGANLSDA 679
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 41/82 (50%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NF ++ S+F S F + A N +GADLS + +L A+L NA L
Sbjct: 485 NFEHINLSYSNFRESNFTSTNIANANFTSVNLSGADLSMKDLTENILTGADLRNANLSGA 544
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L+ + L I+ GAD +DA++
Sbjct: 545 DLSNNQLVNTILTGADLTDAIL 566
>gi|434391142|ref|YP_007126089.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262983|gb|AFZ28929.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 516
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 5/92 (5%)
Query: 132 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV-----LNEANLTNAV 186
T AD+RE++F + GA L +A +ANF+ A+L+ +M + L+EANL A
Sbjct: 325 LTKADLRETNFYTTNLTGANLSEANCDRANFSAANLNGAIMLQTSFRAANLSEANLKYAN 384
Query: 187 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
L+ LT ++L A +EGA+ + A + A Q
Sbjct: 385 LIAANLTEANLSRASLEGANLTAANLSHANLQ 416
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 5/84 (5%)
Query: 129 RANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
RANF++A+ M ++ F + + A L+ A AN T A+LS ++ L ANL+
Sbjct: 352 RANFSAANLNGAIMLQTSFRAANLSEANLKYANLIAANLTEANLSRASLEGANLTAANLS 411
Query: 184 NAVLVRTVLTRSDLGGAIIEGADF 207
+A L T L + +L GA + A+
Sbjct: 412 HANLQNTYLNKINLSGATLIQANL 435
>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
Length = 268
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 68/145 (46%), Gaps = 19/145 (13%)
Query: 109 SAAQFGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S+A ADL A ++ N ANF + D E++ ++ GAYL KA YKAN
Sbjct: 137 SSADLRDADLAGAKLIRSNLCFANLIAANFIAVDFSEANLYQAEVMGAYLYKANFYKANL 196
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 222
A L + R L A+L A L LT ++L GA + GA+ A +
Sbjct: 197 HQAHLGGAYLFRANLTAADLRGADLAWANLTSANLAGANLSGANLRGANL---------- 246
Query: 223 YANGTNPITGVSTRKSLGCGNSRRN 247
NG N + GV+ ++++ +SR +
Sbjct: 247 --NGAN-LNGVNLQETIMPDSSRHD 268
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 16/115 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
SAA F A+L ++V T AD+ + F G+ F+GA L A+ +AN G D S
Sbjct: 87 SAANFSVANLSQSV---------LTHADLSHAHFIGADFSGANLRGAIVTEANLIGTDFS 137
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 223
L +A+L A L+R+ L ++L A DFS+A +L Q + + Y
Sbjct: 138 SA-----DLRDADLAGAKLIRSNLCFANLIAANFIAVDFSEA--NLYQAEVMGAY 185
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN-----A 185
N ++R ++ +G+ + L +A+ +AN +GADLS + L+EANL+ A
Sbjct: 35 NLQENNLRGANLAGANLSRVDLSRALLIRANLSGADLSSANLHHAKLSEANLSAANFSVA 94
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
L ++VLT +DL A GADFS A
Sbjct: 95 NLSQSVLTHADLSHAHFIGADFSGA 119
>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
Length = 261
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 61/139 (43%), Gaps = 36/139 (25%)
Query: 111 AQFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A F S L A K + A NFT AD++ +DFSG++ N A L A+ A F ADLS+
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATFADADLSN 174
Query: 170 T---------------LMD---------------RMVLNEAN-----LTNAVLVRTVLTR 194
L+D R L +AN LT A L VLT
Sbjct: 175 ARIIGGGKGVNFRNAKLIDADLGADPANQGMAPVRAELPDANFDGADLTRANLTHAVLTG 234
Query: 195 SDLGGAIIEGADFSDAVID 213
++ AI+ GA F AV+D
Sbjct: 235 ANFTAAIVSGARFDYAVLD 253
>gi|425441123|ref|ZP_18821410.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9717]
gi|389718260|emb|CCH97767.1| Similar to tr|Q55773|Q55773 [Microcystis aeruginosa PCC 9717]
Length = 262
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 55/101 (54%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 177
L++ + ++ + + + + + +S+ +G+K NGA L A +AN +GADLS +
Sbjct: 29 LQQLLSTRKCPQCDLSGSGLVQSNLTGAKLNGANLVGANLSQANLSGADLSGANLTGASF 88
Query: 178 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
ANLT A L +LT +DL GA + A+ + +D A Q
Sbjct: 89 FGANLTGANLSGAILTGADLRGAYLNNANLENTKLDTAYVQ 129
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN A++ +++ SG+ +GA L A + AN TGA+LS ++ L A L NA L
Sbjct: 61 ANLVGANLSQANLSGADLSGANLTGASFFGANLTGANLSGAILTGADLRGAYLNNANLEN 120
Query: 190 TVLTRSDLGGAI 201
T L + + GA+
Sbjct: 121 TKLDTAYVQGAV 132
>gi|300869620|ref|ZP_07114200.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300332398|emb|CBN59400.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 580
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 68/144 (47%), Gaps = 21/144 (14%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
NW +F L+ A + S+ +N +A+ G +G A+ A+L +A
Sbjct: 103 NWAIFQEADLSGADLQRAKSD-----QINLEKAKLDGARLMG--AELMEANLNRASLAG- 154
Query: 127 NFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 186
AN T A++RE+ + + A L+ A +A+ GA +L ANLT A
Sbjct: 155 ---ANLTGANLREAHLAEANLREAILKGANLIEADLNGA----------ILRSANLTEAD 201
Query: 187 LVRTVLTRSDLGGAIIEGADFSDA 210
+ R VLT +DL A++ GAD S A
Sbjct: 202 MHRVVLTGADLTEAVLNGADLSRA 225
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 6/137 (4%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRANFTSA 135
L A +A + + L N EA+ G I +A AD+ + V A+ T A
Sbjct: 162 LREAHLAEANLREAILKGANLIEADLNG--AILRSANLTEADMHRVVLTG----ADLTEA 215
Query: 136 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 195
+ +D S + GAYL KA KA+ ++L + + L EANL A L + L+ +
Sbjct: 216 VLNGADLSRANLTGAYLLKASFKKAHLLRSNLQEVYLLWADLTEANLRGADLRKADLSGA 275
Query: 196 DLGGAIIEGADFSDAVI 212
L AI+ AD DA++
Sbjct: 276 YLSDAILSEADLRDALL 292
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 42/82 (51%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + A++RE D +G+ GA L A A GA L + +L ANL ++L
Sbjct: 25 ASLSGANLREIDLTGANLTGANLSWAFLSHAKLVGACLRRADLRSAMLTSANLNQSILSG 84
Query: 190 TVLTRSDLGGAIIEGADFSDAV 211
LT+ DL A ++ AD + A+
Sbjct: 85 ANLTKVDLRLAYLQEADLNWAI 106
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 53/100 (53%), Gaps = 6/100 (6%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 164
+ + A A+L A +K +F+ A+ ++++E + A L A KA+ +G
Sbjct: 215 AVLNGADLSRANLTGAYLLKASFKKAHLLRSNLQEVYLLWADLTEANLRGADLRKADLSG 274
Query: 165 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
A LSD +L+EA+L +A+L+ L R++L GA + G
Sbjct: 275 AYLSDA-----ILSEADLRDALLIEAHLIRTNLEGAQLTG 309
>gi|427735760|ref|YP_007055304.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370801|gb|AFY54757.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 263
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 42/84 (50%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RAN D+ + S +K NGA L A +K GADLSD + R L A++ A L
Sbjct: 153 RANLKGRDLSGRNLSYAKLNGANLSDAFMHKVVLRGADLSDANLFRANLLLADMKEANLQ 212
Query: 189 RTVLTRSDLGGAIIEGADFSDAVI 212
L +DL GA + GAD A I
Sbjct: 213 GADLIGADLSGADLRGADLRGARI 236
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 131 NFTSADMRESDFSGSKFNGAYLE------KAVAYKANFTGADLSDTLMDRMVLNEANLTN 184
N T ++SD SG F A L+ + ++Y A GA+LSD M ++VL A+L++
Sbjct: 135 NKTYQPPQQSDLSGQDFRRANLKGRDLSGRNLSY-AKLNGANLSDAFMHKVVLRGADLSD 193
Query: 185 AVLVRTVLTRSDLGGAIIEGADF 207
A L R L +D+ A ++GAD
Sbjct: 194 ANLFRANLLLADMKEANLQGADL 216
>gi|332708407|ref|ZP_08428384.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332352810|gb|EGJ32373.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 309
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 109 SAAQFGSADLRKAV-----HVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 162
S AQ ADLR+A + N + AN + + E++FSG+ + A LE A +
Sbjct: 115 SLAQLQKADLREATGKGITFINANLKMANLGAVNFPEANFSGASLDIASLEAANLMDTKW 174
Query: 163 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
GADL + R L A+LT+A L+ L +DL I+ GA ++ ++
Sbjct: 175 VGADLERANLSRASLVRADLTSANLIVANLRAADLTEVILRGAQLLESSLE 225
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 15/102 (14%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEK----AVAY-KANFTGADL------SDTLMD-RMV- 176
A AD+RE+ G F A L+ AV + +ANF+GA L + LMD + V
Sbjct: 117 AQLQKADLREATGKGITFINANLKMANLGAVNFPEANFSGASLDIASLEAANLMDTKWVG 176
Query: 177 --LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
L ANL+ A LVR LT ++L A + AD ++ ++ AQ
Sbjct: 177 ADLERANLSRASLVRADLTSANLIVANLRAADLTEVILRGAQ 218
>gi|167826694|ref|ZP_02458165.1| pentapeptide repeat family protein [Burkholderia pseudomallei 9]
Length = 326
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 13 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 67
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 68 ARLTAANLSLAHCERTDFS 86
>gi|428317459|ref|YP_007115341.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428241139|gb|AFZ06925.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 197
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 19/136 (13%)
Query: 89 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKF 147
S LAD N +A G A A+L++AV ++ N R A+ + AD+R +DF +
Sbjct: 29 SDLADANLSQANLSG-------ANLVGANLQRAV-LRANLRGADLSGADLRGADFRNADL 80
Query: 148 NGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG---- 198
GA A+ A+F TGA + + + + L A+L A L R +L +DL
Sbjct: 81 RGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHSADLSHANL 140
Query: 199 -GAIIEGADFSDAVID 213
GA + GAD +A+++
Sbjct: 141 SGADLSGADLEEAILN 156
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 53/114 (46%), Gaps = 21/114 (18%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 159
A F +ADLR A R A+F A D+ D G+ GA L +A+ +
Sbjct: 73 ADFRNADLRGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHS 132
Query: 160 ANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 203
A+ + GADL + +++ VL ANLT A L+ + ++ GA+++
Sbjct: 133 ADLSHANLSGADLSGADLEEAILNGAVLRGANLTGANLLCATIEQTLWDGALLD 186
>gi|443476936|ref|ZP_21066816.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018029|gb|ELS32353.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 180
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 66/148 (44%), Gaps = 21/148 (14%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A ADL A + AN T A++ E++ G+ GA A AN T ++ S++
Sbjct: 35 ATLNKADLSSANLID----ANLTGANLIETNLRGAMLRGANFADADLSWANLTWSNSSNS 90
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-----------------ID 213
R L+ AN + A L+ T + L GA + G + A+ +D
Sbjct: 91 RFVRSNLSVANFSGANLIEADFTGAILKGANLRGTNLRGAMLKNLRTCADTDFTGVRNLD 150
Query: 214 LAQKQALCKYANGTNPITGVSTRKSLGC 241
+ LC A+GT+P T +R++LGC
Sbjct: 151 ERMRLYLCTVASGTHPFTKNDSRQTLGC 178
>gi|167907368|ref|ZP_02494573.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
Length = 269
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 53/97 (54%), Gaps = 6/97 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + AD+R +D SG+ GA L A AN +GADLSD L A+L++A L
Sbjct: 39 ADLSDADLRGADLSGADLCGANLSGADLCGANLSGADLSDA-----DLRGADLSDADLRG 93
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 225
L+ ++L GA + GAD SDA + A A YAN
Sbjct: 94 ADLSVANLSGANLSGADLSDADLSGANLSGAYLSYAN 130
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 61/120 (50%), Gaps = 26/120 (21%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTSADMRESDF 142
C +N+S ADL+ +A+ RG A ADLR A N AN + AD+ ++D
Sbjct: 67 CGANLSG-ADLS--DADLRG-------ADLSDADLRGADLSVANLSGANLSGADLSDADL 116
Query: 143 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
SG+ +GAYL AN +GA+LSD ANL+ A L L+ +DL GA +
Sbjct: 117 SGANLSGAYLS-----YANLSGANLSD----------ANLSGANLRGADLSGADLSGAYL 161
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + AD+R +D S + GA L A AN +GADLSD + L+ A L+ A L
Sbjct: 74 ADLSDADLRGADLSDADLRGADLSVANLSGANLSGADLSDADLSGANLSGAYLSYANLSG 133
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ ++L GA + GAD S A
Sbjct: 134 ANLSDANLSGANLRGADLSGA 154
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 52/112 (46%), Gaps = 19/112 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 168
S A ADLR A + + AD+ ++ SG+ GA L A A+ GADLS
Sbjct: 37 SGADLSDADLRGA---------DLSGADLCGANLSGADLCGANLSGADLSDADLRGADLS 87
Query: 169 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA----------DFSDA 210
D + L+ ANL+ A L L+ +DL GA + GA + SDA
Sbjct: 88 DADLRGADLSVANLSGANLSGADLSDADLSGANLSGAYLSYANLSGANLSDA 139
>gi|220910596|ref|YP_002485907.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219867207|gb|ACL47546.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 449
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 53/100 (53%), Gaps = 4/100 (4%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A A+LR+ E F AN + AD+ E++ S ++ GA L ++ +AN + A LS
Sbjct: 320 ANLSGANLREV----ELFEANLSRADLLEANLSRARLTGANLSRSTLSEANLSRATLSGA 375
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
++R L+ L L L+R+DLG A + GA+ S A
Sbjct: 376 HLNRATLSGGTLYKVDLSGVNLSRADLGDANLSGANLSRA 415
Score = 43.9 bits (102), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 55/107 (51%), Gaps = 8/107 (7%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A A+L +A N R+ + A++ + SG+ N A L YK + +G +L
Sbjct: 338 SRADLLEANLSRARLTGANLSRSTLSEANLSRATLSGAHLNRATLSGGTLYKVDLSGVNL 397
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 214
S R L +ANL+ A L R L+R++L A + GA+ S+ +DL
Sbjct: 398 S-----RADLGDANLSGANLSRADLSRANLTAADLSGANLSE--VDL 437
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 45/81 (55%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ D S K + A L KA A+ ADLS+ + + L+ ANL A L
Sbjct: 190 ANLSGANLSRVDLSEVKLSQANLTKANLSGADLDKADLSNLELIEVDLSGANLAGANLSS 249
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
T L+R+DL GA + GA+ + A
Sbjct: 250 TNLSRADLSGANLRGANLARA 270
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 53/104 (50%), Gaps = 15/104 (14%)
Query: 119 RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
R A ++ RA T+A++ +D G + A LE A N +GA L+D L++ L
Sbjct: 9 RYAAGERDFHRAELTNAELITADLKGINLSRADLEWA-----NLSGAKLNDALLNGAELV 63
Query: 179 EANLTN-----AVLVRTVLTRSD-----LGGAIIEGADFSDAVI 212
ANL N A L+ L+RSD LGGA + AD S+A +
Sbjct: 64 NANLINVDLSGASLIGINLSRSDLSWANLGGANLSRADLSEATL 107
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 6/103 (5%)
Query: 112 QFGSADLRKAVHVKENFRA-NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
F A+L A + + + N + AD+ ++ SG+K N A L A AN DLS
Sbjct: 16 DFHRAELTNAELITADLKGINLSRADLEWANLSGAKLNDALLNGAELVNANLINVDLSGA 75
Query: 171 LM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 208
+ R L+ ANL A L R L+ + L GA + GAD S
Sbjct: 76 SLIGINLSRSDLSWANLGGANLSRADLSEATLRGADLRGADLS 118
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 11/128 (8%)
Query: 92 ADLNK---YEAETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKF 147
ADL++ EA+ RG I + A+LR A + E AN D+ E+D SG+
Sbjct: 115 ADLSRVEMIEADLRGL--ILNGVNLRGANLRGANLSGTELTYANLGRVDLIEADLSGANL 172
Query: 148 NGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 202
+GA L + AN +GA+LS + + L++ANLT A L L ++DL +
Sbjct: 173 SGATLCGANLSRVNLSNANLSGANLSRVDLSEVKLSQANLTKANLSGADLDKADLSNLEL 232
Query: 203 EGADFSDA 210
D S A
Sbjct: 233 IEVDLSGA 240
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN +S ++ +D SG+ GA L +A N GA+L+ ++ L +L+ A L
Sbjct: 245 ANLSSTNLSRADLSGANLRGANLARAKLIGTNLRGANLTGAILTGANLEGTDLSQADLRS 304
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L+ L G I+ GA+ S A
Sbjct: 305 ANLSGLILNGTILRGANLSGA 325
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 28/133 (21%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRA------NFTSADMRESDFSGSKFNGAY 151
EA RG A ADL + ++ + R N A++R ++ SG++ A
Sbjct: 104 EATLRG-------ADLRGADLSRVEMIEADLRGLILNGVNLRGANLRGANLSGTELTYAN 156
Query: 152 LEKAVAYKANFTGADLSD-TL----MDRMVLNEANLTNAVLVRT----------VLTRSD 196
L + +A+ +GA+LS TL + R+ L+ ANL+ A L R LT+++
Sbjct: 157 LGRVDLIEADLSGANLSGATLCGANLSRVNLSNANLSGANLSRVDLSEVKLSQANLTKAN 216
Query: 197 LGGAIIEGADFSD 209
L GA ++ AD S+
Sbjct: 217 LSGADLDKADLSN 229
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 54/125 (43%), Gaps = 21/125 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFT---------------SADMRESDFSGSKFNGAYL 152
S A A+L +A + N R AN T AD+R ++ SG NG L
Sbjct: 258 SGANLRGANLARAKLIGTNLRGANLTGAILTGANLEGTDLSQADLRSANLSGLILNGTIL 317
Query: 153 EKAVAYKANFTGADLSDTLMDRMVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADF 207
A AN +L + + R L EAN LT A L R+ L+ ++L A + GA
Sbjct: 318 RGANLSGANLREVELFEANLSRADLLEANLSRARLTGANLSRSTLSEANLSRATLSGAHL 377
Query: 208 SDAVI 212
+ A +
Sbjct: 378 NRATL 382
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 9/112 (8%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A DL + + N +AN + AD+ ++D S N +E ++ AN GA+L
Sbjct: 193 SGANLSRVDLSEVKLSQANLTKANLSGADLDKADLS----NLELIEVDLS-GANLAGANL 247
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 216
S T + R L+ ANL A L R L ++L GA + GA + A + DL+Q
Sbjct: 248 SSTNLSRADLSGANLRGANLARAKLIGTNLRGANLTGAILTGANLEGTDLSQ 299
>gi|300867247|ref|ZP_07111907.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334724|emb|CBN57073.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 520
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 48/83 (57%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
AN + A++ ++ +G+K N A L A +AN T ADL+ + R+ L A L A L+R
Sbjct: 45 ANLSGANLCGANLTGAKLNIARLSGAHLGEANLTDADLNVAYLVRVDLKGAILIRAKLIR 104
Query: 190 TVLTRSDLGGAIIEGADFSDAVI 212
L R++L GA + GA+ S A +
Sbjct: 105 AELIRAELSGANLSGANLSGATL 127
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 117 DLRKAVHVK------ENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
DL+ A+ ++ E RA + A++ ++ SG+ A L KA +AN GA LS
Sbjct: 91 DLKGAILIRAKLIRAELIRAELSGANLSGANLSGATLTEATLRKADLTQANLRGAHLSGA 150
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
+ +L EAN A L R L+ +DL G+ + A+ + A++
Sbjct: 151 SLTEALLVEANFQGADLSRADLSHADLRGSELRQANLTQAIL 192
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 21/142 (14%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 164
A A L +A+ V+ NF+ A+ + AD+ +D GS+ A L +A+ A+ +G
Sbjct: 145 AHLSGASLTEALLVEANFQGADLSRADLSHADLRGSELRQANLTQAILSGADLSGVNLRW 204
Query: 165 ----------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSD 209
ADLS+ + L+ A+L NA L+ T L +DL A + GAD +
Sbjct: 205 AILSGCNLRWADLSEAKLSGADLSRADLCNANLLNTSLVHADLSNAYLIKADWVGADLTG 264
Query: 210 AVIDLAQKQALCKYANGTNPIT 231
A + A+ A+ + T +T
Sbjct: 265 ATLTGAKLHAVSRLGIKTEGMT 286
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 10/80 (12%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
A AD+ +++ G+ +GA L +A+ +ANF GADLS A+L++A L
Sbjct: 129 EATLRKADLTQANLRGAHLSGASLTEALLVEANFQGADLS----------RADLSHADLR 178
Query: 189 RTVLTRSDLGGAIIEGADFS 208
+ L +++L AI+ GAD S
Sbjct: 179 GSELRQANLTQAILSGADLS 198
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
AN + ++ + SG+ + A L A AN TGA L+ + L EANLT+A L
Sbjct: 24 EANLSGVNLSGINLSGANLSVANLSGANLCGANLTGAKLNIARLSGAHLGEANLTDADLN 83
Query: 189 RTVLTRSDLGGAII 202
L R DL GAI+
Sbjct: 84 VAYLVRVDLKGAIL 97
>gi|113475153|ref|YP_721214.1| periplasmic binding protein/LacI transcriptional regulator
[Trichodesmium erythraeum IMS101]
gi|110166201|gb|ABG50741.1| periplasmic binding protein/LacI transcriptional regulator
[Trichodesmium erythraeum IMS101]
Length = 525
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 56/101 (55%), Gaps = 4/101 (3%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
+N T A++ ++ +G G+ L+ A AN GA+L D ++ L ANL A+L
Sbjct: 31 SNLTGANLSGANLAGINLQGSNLQGANLVNANLEGANLKDVNLEGANLARANLKKAILQN 90
Query: 190 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 230
+ L S+L G+ ++ ADFS+A +L +AL +AN N I
Sbjct: 91 SNLDNSNLYGSDLQAADFSEA--NLVNMKAL--WANFHNAI 127
>gi|399073585|ref|ZP_10750574.1| putative low-complexity protein [Caulobacter sp. AP07]
gi|398041367|gb|EJL34432.1| putative low-complexity protein [Caulobacter sp. AP07]
Length = 313
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 6/97 (6%)
Query: 116 ADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
ADLR A NF RA+ A+MR ++F G+ F A L + ANF GADL+
Sbjct: 69 ADLRGASFFGSNFTGADLSRADLRGAEMRGANFVGAIFTDAKLSGIESSGANFQGADLAR 128
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 206
+ L+ AN A L + L+ S+L GA ++G +
Sbjct: 129 VDLSSSELHGANFIGANLEKANLSSSELVGANLQGVN 165
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 43/87 (49%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGADLSDTLMDRMVLNEANLT 183
+AN AD+R + F GS F GA L + A ANF GA +D + + + AN
Sbjct: 63 KANLMGADLRGASFFGSNFTGADLSRADLRGAEMRGANFVGAIFTDAKLSGIESSGANFQ 122
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
A L R L+ S+L GA GA+ A
Sbjct: 123 GADLARVDLSSSELHGANFIGANLEKA 149
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 50/103 (48%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADL-RKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A F ADL R + E ANF A++ +++ S S+ GA L+ A A+F A+L
Sbjct: 117 SGANFQGADLARVDLSSSELHGANFIGANLEKANLSSSELVGANLQGVNARYASFQSAEL 176
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
+ + M AN NA + L + GGA GAD S A
Sbjct: 177 NASNMIGGNFERANFRNAEFPGSTLRGAIFGGADFHGADLSGA 219
>gi|302522367|ref|ZP_07274709.1| OxyO [Streptomyces sp. SPB78]
gi|302431262|gb|EFL03078.1| OxyO [Streptomyces sp. SPB78]
Length = 233
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLE---------------KAVAYKANFTGADLSDTLMDR 174
AN T A+++ S S + N A+L KA ++A+ T AD+S +
Sbjct: 93 ANLTDANLKYSSLSSTHLNEAWLSHSVLSHASLSLADLSKANLHEADLTKADVSGANLSE 152
Query: 175 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 234
L A +TNA RT L+ ++L GA + GAD S V +L QKQ N T +
Sbjct: 153 ADLAGAKMTNANFFRTNLSGAELTGADLSGADLS-TVKNLTQKQVSSARTNRTTRLPSGL 211
Query: 235 TRKS 238
TR S
Sbjct: 212 TRAS 215
>gi|307150734|ref|YP_003886118.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306980962|gb|ADN12843.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 231
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 16/105 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A F +D R + K NF A F AD E+ G+ F+ A LEKA+ + + +GA
Sbjct: 33 SRADFSYSDFRSSRLGKTNFSAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGA-- 90
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADF 207
+L EANLT L++ L + + L GAI+ ADF
Sbjct: 91 --------ILTEANLTQVNLIKATLGGANLSLAQLPGAIVYEADF 127
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 13/97 (13%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV---LNEANLTNA 185
R N T A++ ++ S +K NGA L +A A ADLS + + L+EANL NA
Sbjct: 134 RTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCRADLSKGIWQNCLPTDLSEANLQNA 193
Query: 186 ----------VLVRTVLTRSDLGGAIIEGADFSDAVI 212
+L LT +DL G I+ D + A++
Sbjct: 194 DLSYADLSGAILCYADLTGADLTGTILTNVDLTGAIL 230
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 3/117 (2%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
SAA F AD +A+ +F +AN A +RE D SG+ A L + KA GA+L
Sbjct: 53 SAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGAILTEANLTQVNLIKATLGGANL 112
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCK 222
S + ++ EA+ RT LT+++L A + A + A + AQ LC+
Sbjct: 113 SLAQLPGAIVYEADFRPTSEQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCR 169
Score = 37.0 bits (84), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)
Query: 113 FGSADLRKAVHVKENF------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF---- 162
F A+L KA+ + + AN T ++ ++ G+ + A L A+ Y+A+F
Sbjct: 72 FSKANLEKAILREVDLSGAILTEANLTQVNLIKATLGGANLSLAQLPGAIVYEADFRPTS 131
Query: 163 ------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG---ADFSDA 210
T A+LS + LN ANL A L+ L R+DL I + D S+A
Sbjct: 132 EQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCRADLSKGIWQNCLPTDLSEA 188
>gi|443327376|ref|ZP_21056002.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442792998|gb|ELS02459.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 187
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 55/102 (53%), Gaps = 1/102 (0%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F A+L KAV + NF+ +D+ E+D + F+ + KA +K+ A+L+ +
Sbjct: 47 FTGANLGKAVFYRTVVELGNFSQSDLGEADLREANFSQSLFYKASLFKSQLQKANLNQVI 106
Query: 172 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 213
R +ANL +AVL L +++L A + GAD S+A ++
Sbjct: 107 AIRAFFRDANLNHAVLTSANLQQANLTNADLRGADLSNANLE 148
>gi|427416432|ref|ZP_18906615.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425759145|gb|EKU99997.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 237
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 54/106 (50%), Gaps = 16/106 (15%)
Query: 113 FGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 171
F + +L +++ + R AN A +RESD S + A LEKA KA+ GA+LSD
Sbjct: 57 FENCNLSESILWGSDLRNANLKQAQLRESDLSSALLTQANLEKANLIKASLCGANLSD-- 114
Query: 172 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 212
ANL NA L+ L R+DLG + + GAD S A +
Sbjct: 115 --------ANLANACLLDADLRSNSDQRTDLGQSNLSGADLSYAFL 152
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 9/112 (8%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A F +DLR++ + +F R NF + ++ ES GS A L++A +DL
Sbjct: 33 SDANFSQSDLRQSRLGRTHFCRVNFENCNLSESILWGSDLRNANLKQA-----QLRESDL 87
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---SDAVIDLAQ 216
S L+ + L +ANL A L L+ ++L A + AD SD DL Q
Sbjct: 88 SSALLTQANLEKANLIKASLCGANLSDANLANACLLDADLRSNSDQRTDLGQ 139
>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 256
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
Query: 118 LRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-SDTL----- 171
+R+ + K+ +A + +D SG+ GA LE A +AN TGADL S L
Sbjct: 29 IRQLLATKKCQNCQLINAGLALADLSGADLRGANLEGANLSRANLTGADLRSANLAGASL 88
Query: 172 ----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
+ R LNEANLT A L T L +L A + GA+F AV
Sbjct: 89 FGVNLSRAKLNEANLTGADLRNTYLMNIELTNANLNGANFQGAV 132
>gi|440683010|ref|YP_007157805.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
cylindrica PCC 7122]
gi|428680129|gb|AFZ58895.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
cylindrica PCC 7122]
Length = 535
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
N + ++ +D SG+ F+ A L++ N GA+L +T R L +ANL +A L +
Sbjct: 415 NISMLSLQGADLSGTNFHHAQLKQT-----NLQGANLQNTDFGRASLMQANLRDANLTKA 469
Query: 191 VLTRSDLGGAIIEGADFSDAVIDLA 215
L+ +DL GA + GAD S A + A
Sbjct: 470 YLSNADLEGADLRGADLSYAYMSQA 494
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 131 NFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 185
NF A +++++ G + F A L +A AN T A LS+ ++ L A+L+ A
Sbjct: 430 NFHHAQLKQTNLQGANLQNTDFGRASLMQANLRDANLTKAYLSNADLEGADLRGADLSYA 489
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 226
+ + L ++L GA + GA +D I LA+ L NG
Sbjct: 490 YMSQANLRGANLCGANLTGAKVTDEQIALAKTNWLTVRPNG 530
>gi|387129013|ref|YP_006291903.1| Pentapeptide repeat protein [Methylophaga sp. JAM7]
gi|386270302|gb|AFJ01216.1| Pentapeptide repeat protein [Methylophaga sp. JAM7]
Length = 153
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 43/79 (54%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ + + +++ + S+ GA+ + ++AN GADL L+D +LN ANL N L
Sbjct: 51 ADLSESKLQKINLQNSQLQGAWFTHSKMHEANLEGADLQGALLDYTLLNHANLKNTNLDN 110
Query: 190 TVLTRSDLGGAIIEGADFS 208
+ S+L GA + GA +
Sbjct: 111 AQMIFSNLTGADLSGASMN 129
>gi|302039057|ref|YP_003799379.1| hypothetical protein NIDE3778 [Candidatus Nitrospira defluvii]
gi|300607121|emb|CBK43454.1| conserved exported protein of unknown function [Candidatus
Nitrospira defluvii]
Length = 476
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 26/129 (20%)
Query: 111 AQFGSADLRKAVHVKENF--------------------------RANFTSADMRESDFSG 144
A ADLRKA+ VK + RA+F AD++ +D S
Sbjct: 133 ANLEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSN 192
Query: 145 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 204
+ Y A K N T ADL+ T + R L +ANL A L +L ++L GA +
Sbjct: 193 ATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLDSANLDGASLIE 252
Query: 205 ADFSDAVID 213
AD A +D
Sbjct: 253 ADLESAYLD 261
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 33/138 (23%)
Query: 89 SALADLNKYEAETRG-EFGIGSAAQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKF 147
++LA+ + +EA RG +F G A+L+ R N +A+M + S+
Sbjct: 263 ASLANADLHEASLRGADFRF---THLGGANLQ---------RVNLENANMEGATLVKSRL 310
Query: 148 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG--------- 198
+ A L V YKAN + A+ L+ ANL +AVL+ T L R+DL
Sbjct: 311 DSATLTMTVLYKANLSAAN----------LHGANLHHAVLIGTQLARADLRKADLTEIYG 360
Query: 199 -GAIIEGADFSDAVIDLA 215
A ++ A S+A ++LA
Sbjct: 361 PNAHLQQARLSEANLELA 378
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 46/103 (44%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
SAA A+L AV + RA+ AD+ E + A L +A AN ADL
Sbjct: 326 SAANLHGANLHHAVLIGTQLARADLRKADLTEIYGPNAHLQQARLSEANLELANLVAADL 385
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S + V+ + NL L L+ SDL GA++ AD A
Sbjct: 386 SQADISHAVVVQTNLQETNLRGANLSASDLTGALLNNADLGQA 428
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 1/98 (1%)
Query: 111 AQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 169
A ADL A + F AN + ++ ++D +G+ L +A +AN GA L
Sbjct: 183 ADLQGADLSNATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLDS 242
Query: 170 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 207
+D L EA+L +A L L +DL A + GADF
Sbjct: 243 ANLDGASLIEADLESAYLDDASLANADLHEASLRGADF 280
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 16/121 (13%)
Query: 108 GSAAQFGSADLRKAVHVKENF-RANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 161
G A DLR+ V N R N A ++R + + GA +AV AN
Sbjct: 75 GRRANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDAN 134
Query: 162 FTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 211
GADL L+ + LN ANL A+ +L R+ A ++GAD S+A
Sbjct: 135 LEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSNAT 194
Query: 212 I 212
+
Sbjct: 195 L 195
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 129 RANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 178
RAN D+R+ + G+ G+ L A +A+ GAD S ++D L
Sbjct: 77 RANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDANLE 136
Query: 179 EANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVIDLAQ 216
A+L A+LV+ L R + GA ++GA F +A+++ A
Sbjct: 137 GADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAH 179
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 54/128 (42%), Gaps = 26/128 (20%)
Query: 111 AQFGSADLRKAVHVKENFR-----------ANFTSADMRESDFSGSKFNGAYLEKAVAYK 159
A DLR+ + N R AN A + E+D + + A L A ++
Sbjct: 213 ADLAGTDLRRTNLRQANLRRANLQGALLDSANLDGASLIEADLESAYLDDASLANADLHE 272
Query: 160 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVR----------TVLTRSDLGGAIIEG 204
A+ GAD T + R+ L AN+ A LV+ TVL +++L A + G
Sbjct: 273 ASLRGADFRFTHLGGANLQRVNLENANMEGATLVKSRLDSATLTMTVLYKANLSAANLHG 332
Query: 205 ADFSDAVI 212
A+ AV+
Sbjct: 333 ANLHHAVL 340
>gi|167722130|ref|ZP_02405366.1| pentapeptide repeat family protein [Burkholderia pseudomallei DM98]
Length = 323
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A+ T AD+ D G++ GA LE A A+ TGADLS R VL A+LT A LV
Sbjct: 10 ADLTGADLSGMDLRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVD 64
Query: 190 TVLTRSDLGGAIIEGADFS 208
LT ++L A E DFS
Sbjct: 65 ARLTAANLSLAHCERTDFS 83
>gi|119490887|ref|ZP_01623170.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119453705|gb|EAW34864.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 517
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 11/116 (9%)
Query: 108 GSAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAV 156
G++ ADLR+A VK N + N T AD+R+++ SG+ A L A
Sbjct: 157 GASTNLQRADLRRANLVKANLPKADFSHAEMRQTNLTYADLRQANLSGANLRWADLRGAN 216
Query: 157 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 212
A+ +GA+LS + L+ A L A LV LT+++L A GAD S A +
Sbjct: 217 LLGADLSGANLSGANLSGANLSRATLAKASLVHVDLTQANLIKADWMGADISGATL 272
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENF-RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
+ A ADLR+A + NF +AN + A++R + + A L +A KAN AD
Sbjct: 123 TKANLNGADLREARVGQANFSQANLSGANLRGVSGASTNLQRADLRRANLVKANLPKADF 182
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
S M + L A+L A L L +DL GA + GAD S A
Sbjct: 183 SHAEMRQTNLTYADLRQANLSGANLRWADLRGANLLGADLSGA 225
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 49/87 (56%), Gaps = 5/87 (5%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLT 183
+AN + A + ++ SG+ +G L +A +AN TGA+LS ++ L A+L+
Sbjct: 39 QANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGANLSRATLNVANLVRADLS 98
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSDA 210
+A+LV T+ RS+L A + A+ + A
Sbjct: 99 DAILVETLAIRSELIRARLNNANLTKA 125
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 190
NFT ++ E++ S + A L A N +GA+LS + R LN + L+ A L
Sbjct: 21 NFTGINLNEANLSRINLSQANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGA 80
Query: 191 VLTRSDLGGAIIEGADFSDAVI 212
L+R+ L A + AD SDA++
Sbjct: 81 NLSRATLNVANLVRADLSDAIL 102
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 41/85 (48%), Gaps = 5/85 (5%)
Query: 131 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMDRMVLNEANLTNA 185
N + A++ S S + GA L +A AN ADLSD TL R L A L NA
Sbjct: 61 NLSRANLNVSRLSQANLTGANLSRATLNVANLVRADLSDAILVETLAIRSELIRARLNNA 120
Query: 186 VLVRTVLTRSDLGGAIIEGADFSDA 210
L + L +DL A + A+FS A
Sbjct: 121 NLTKANLNGADLREARVGQANFSQA 145
>gi|158335891|ref|YP_001517065.1| hypothetical protein AM1_2749 [Acaryochloris marina MBIC11017]
gi|158306132|gb|ABW27749.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 1055
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 17/110 (15%)
Query: 109 SAAQFGSADLRKAVHVKENF-----------RANFTSADMRESDFSGSKFNGAYLEKAVA 157
S+A SA+L +A ++ N RAN SAD+R ++ S + + A L +A
Sbjct: 899 SSANLSSANLIRANLIRANLSSADLSSANLIRANLRSADLRSANLSSANLSSANLIRANL 958
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS-DLGGAIIEGAD 206
+AN + ADLS + R ANL+N L+RTVL+ + +L +EG D
Sbjct: 959 IRANLSSADLSSANLIR-----ANLSNTFLIRTVLSDAQNLTSDQLEGVD 1003
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 3/103 (2%)
Query: 99 AETRGEFGIGSAAQFGSADLRKA-VHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVA 157
A+T G + I A SADLR A + + RA+ +SAD+ ++ S + + A L A
Sbjct: 826 AKTVGPYLI--RADLRSADLRSADLSSADLIRADLSSADLSSANLSSANLSSANLSSANL 883
Query: 158 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 200
AN A+LS + L+ ANL A L+R L+ +DL A
Sbjct: 884 SSANLIRANLSSADLSSANLSSANLIRANLIRANLSSADLSSA 926
>gi|411120639|ref|ZP_11393011.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410709308|gb|EKQ66823.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 181
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 129 RANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 183
AN T+AD+ ++D SGS + GA L +A AN GADL + R L ANL
Sbjct: 59 HANLTNADLSQADLSGSNLSDVNLIGADLSQASLVGANLVGADLRSADLHRADLRGANLQ 118
Query: 184 NAVLVRTVLTRSDLGGAIIEGADFSD 209
+A L L+++ L GA + G D +D
Sbjct: 119 DADLNGANLSQTALAGANLAGVDLTD 144
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 50/109 (45%), Gaps = 13/109 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 167
S A +ADL +A N N AD+ ++ G+ GA L A ++A+ GA+L
Sbjct: 58 SHANLTNADLSQADLSGSNLSDVNLIGADLSQASLVGANLVGADLRSADLHRADLRGANL 117
Query: 168 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 216
D A+L A L +T L ++L G + D D +DL++
Sbjct: 118 QD----------ADLNGANLSQTALAGANLAGVDLTDVDMQD--VDLSE 154
>gi|358461868|ref|ZP_09172018.1| pentapeptide repeat protein [Frankia sp. CN3]
gi|357072553|gb|EHI82089.1| pentapeptide repeat protein [Frankia sp. CN3]
Length = 376
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 15/86 (17%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN----- 184
AN T+AD+ ++D S ++ +GA L A +A+ + A+L NEANLTN
Sbjct: 252 ANLTNADLYQADLSFARLHGANLTSARLERADLSTAEL----------NEANLTNGQLHE 301
Query: 185 AVLVRTVLTRSDLGGAIIEGADFSDA 210
AVL VL ++L GA + GA+ +DA
Sbjct: 302 AVLYSAVLHGANLTGARLHGANLTDA 327
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 16/90 (17%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 188
RA+ ++A++ E++ + + + A L AV + AN TGA L+ ANLT+A
Sbjct: 281 RADLSTAELNEANLTNGQLHEAVLYSAVLHGANLTGAR----------LHGANLTDAQPY 330
Query: 189 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 218
R LT GA + G D S V++L Q+Q
Sbjct: 331 RANLT-----GAQLHGVDLSR-VVNLTQEQ 354
>gi|354565480|ref|ZP_08984655.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549439|gb|EHC18881.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 182
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 73/161 (45%), Gaps = 20/161 (12%)
Query: 94 LNKYEAETRGEFGIG-----------SAAQFGSADLRKAVHVKENFRA-NFTSADMRESD 141
L++YE R G+ S A F ADL A + N NF+ A++ ++D
Sbjct: 7 LSRYETGERDFVGVNLHKVNLREVDLSGANFCGADLSGADLSQANLSGCNFSRANLTDAD 66
Query: 142 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 201
+ + NGA L + N GADL + ++ L+ A+L A LVR LT+++L A
Sbjct: 67 LTRADLNGANLS-----EINLIGADLINANLEGTNLSRADLRGANLVRANLTKANLSEAE 121
Query: 202 IEGADFSDAVI---DLAQKQALCKYANGTNPITGVSTRKSL 239
+ GAD S A + +L + NG N T K +
Sbjct: 122 LSGADLSGANLNQANLIETNLNEAELNGVNITGATVTEKEM 162
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.128 0.372
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,277,805,160
Number of Sequences: 23463169
Number of extensions: 166556190
Number of successful extensions: 463396
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3648
Number of HSP's successfully gapped in prelim test: 727
Number of HSP's that attempted gapping in prelim test: 397437
Number of HSP's gapped (non-prelim): 36834
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)