BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023440
(282 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
Length = 280
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 224/282 (79%), Positives = 240/282 (85%), Gaps = 2/282 (0%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA +SISPLSIKS+N SSS+ PY L + SKP + CQ++ TE + + DCS +
Sbjct: 1 MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLA--TEREDRILDCSTTRYKV 58
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
++K KNWR VSTALAAA + + A ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 59 HHSKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRK 118
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 119 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 178
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTN ITGVSTRKSL
Sbjct: 179 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNSITGVSTRKSL 238
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCD TGLCDAK
Sbjct: 239 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280
>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
Length = 275
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 215/282 (76%), Positives = 237/282 (84%), Gaps = 7/282 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA +SIS +SIKS N + P+++ +LSKP +A Q+ TE QF DCS N
Sbjct: 1 MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQL--DTERGNQFADCSKNGYEV 53
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
AK KNW VST L AA ++ S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54 ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVH+ ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 173
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
+NLTNAVLVR+VLTRSDLGGA+I GADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 174 SNLTNAVLVRSVLTRSDLGGALIAGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 233
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGNSRRNAYG+PSSPLLSAPPQKLLDRDGFCD GTGLCDAK
Sbjct: 234 GCGNSRRNAYGTPSSPLLSAPPQKLLDRDGFCDQGTGLCDAK 275
>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
Length = 261
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 218/282 (77%), Positives = 231/282 (81%), Gaps = 21/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L + SKP V C+I + G + C N
Sbjct: 1 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 43 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 159
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNAVL RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 160 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 219
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGNSRR+AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 220 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 261
>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
vinifera]
Length = 596
Score = 412 bits (1059), Expect = e-113, Method: Compositional matrix adjust.
Identities = 218/282 (77%), Positives = 231/282 (81%), Gaps = 21/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L + SKP V C+I + G + C N
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 377
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 378 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 435 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 494
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNAVL RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 495 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 554
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGNSRR+AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 555 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 596
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 110/179 (61%), Positives = 122/179 (68%), Gaps = 19/179 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L +LSKP V C+I + E NN
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43 ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ TG D +MVL+
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTGPDAPHARPYKMVLH 160
>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
Length = 279
Score = 405 bits (1042), Expect = e-111, Method: Compositional matrix adjust.
Identities = 206/281 (73%), Positives = 230/281 (81%), Gaps = 4/281 (1%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSSIS LS+K L SS S+ P L K + + QI+ + + Q DCS + G
Sbjct: 1 MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKD---QTQDCSERKHIG 56
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ K W+ VSTALAAA V SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRK
Sbjct: 57 KITEPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRK 116
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVH+ ENFRRANFTSADMRESDFSG FNGAYLEKAVAYK NF+GADLSDTLMDRMVLNE
Sbjct: 117 AVHINENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNE 176
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
AN TNAVLVR+VLTRSDLGGAII GADFSDAVIDL QKQALCKYA+GTNP+TGVSTR SL
Sbjct: 177 ANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASL 236
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 281
GCGNSRRNAYG+PSSPLLSAPPQ+LLDRDGFCD TGLC+A
Sbjct: 237 GCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQDTGLCEA 277
>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
Length = 273
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 208/287 (72%), Positives = 231/287 (80%), Gaps = 24/287 (8%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPDCS 54
MAL+S+SPLSI ++N SS+ +L H S P+ V CQ++S + P S
Sbjct: 2 MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRD----HPQES 55
Query: 55 NNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
K W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFG
Sbjct: 56 -----------KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFG 103
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADLRKAVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMD
Sbjct: 104 SADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMD 163
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
RMVLNEANLTNA+LVRTVLTRSDLGG+IIEGADFSDAV+DL QK ALCKYA+GTNP+TGV
Sbjct: 164 RMVLNEANLTNAILVRTVLTRSDLGGSIIEGADFSDAVLDLTQKLALCKYASGTNPVTGV 223
Query: 235 STRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 281
STR SLGCGN RRNAYG+PSSPLLSAPPQKLL+RDGFCD TGLCD+
Sbjct: 224 STRVSLGCGNKRRNAYGTPSSPLLSAPPQKLLNRDGFCDEATGLCDS 270
>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Glycine max]
Length = 260
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 207/282 (73%), Positives = 228/282 (80%), Gaps = 22/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V ++++
Sbjct: 1 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES---------------- 44
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45 -----TKWGKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 98
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNE
Sbjct: 99 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 158
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNA+L+RTVLTRSDLGGAIIEGADFSDAV+DL QKQALCKYA+GTNP+TGVSTR SL
Sbjct: 159 ANLTNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKQALCKYASGTNPVTGVSTRVSL 218
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGN RRNAYGSPSSPLLSAPPQKLLDRDGFCD TGLCDAK
Sbjct: 219 GCGNKRRNAYGSPSSPLLSAPPQKLLDRDGFCDDATGLCDAK 260
>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 262
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 199/282 (70%), Positives = 221/282 (78%), Gaps = 20/282 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S +PLSI S + + S + + Q+ K + P SN
Sbjct: 1 MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47 -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNE
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNE 160
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNA+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SL
Sbjct: 161 ANLTNAILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSL 220
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGN RRNAYG+PSSPLLSAPPQKLLDRDGFCD +GLCD+K
Sbjct: 221 GCGNKRRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 262
>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 232
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 184/217 (84%), Positives = 198/217 (91%), Gaps = 1/217 (0%)
Query: 66 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV
Sbjct: 17 KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNEANLTN
Sbjct: 76 ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNEANLTN 135
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 245
A+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SLGCGN
Sbjct: 136 AILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSLGCGNK 195
Query: 246 RRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
RRNAYG+PSSPLLSAPPQKLLDRDGFCD +GLCD+K
Sbjct: 196 RRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 232
>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
Length = 291
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/242 (77%), Positives = 206/242 (85%), Gaps = 6/242 (2%)
Query: 40 ISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA 99
I+ K +D D Q A + KNW+ ++ ALA V+ + ++A ADLNKYEA
Sbjct: 52 ITGKISTDQHKKDA---QPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEA 105
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
ETRGEFGIGSAAQFGSA+LRK VH ENFRRANFTSAD+RESDFSGS FNGAYLEKAVAY
Sbjct: 106 ETRGEFGIGSAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAY 165
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
K NFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVID QKQ
Sbjct: 166 KTNFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDFTQKQ 225
Query: 220 ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLC 279
ALCKYA+GTNPITG+STRKSLGCGNSRRNAYG+PS+PLLSAPP+KLLD+DGFCDS TGLC
Sbjct: 226 ALCKYASGTNPITGISTRKSLGCGNSRRNAYGTPSAPLLSAPPEKLLDKDGFCDSSTGLC 285
Query: 280 DA 281
DA
Sbjct: 286 DA 287
>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
gi|194694816|gb|ACF81492.1| unknown [Zea mays]
gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
Length = 268
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/194 (91%), Positives = 187/194 (96%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 134 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 193
Query: 208 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 267
FSDAVIDL+QKQALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK+LD
Sbjct: 194 FSDAVIDLSQKQALCKYASGTNPMTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKILD 253
Query: 268 RDGFCDSGTGLCDA 281
RDGFCD TG+CDA
Sbjct: 254 RDGFCDPATGMCDA 267
>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
Flags: Precursor
gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 280
Score = 368 bits (945), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 204/285 (71%), Positives = 231/285 (81%), Gaps = 8/285 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
MA SS+SPL +KSL+ SSS + + L Q+SS+ S+ + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58
Query: 58 CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
C+ A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59 CSS--AESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L K VH ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMV
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 175
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
LNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TR
Sbjct: 176 LNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTR 235
Query: 238 KSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
KSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 236 KSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
Length = 280
Score = 368 bits (945), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 204/285 (71%), Positives = 231/285 (81%), Gaps = 8/285 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
MA SS+SPL +KSL+ SSS + + L Q+SS+ S+ + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58
Query: 58 CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
C+ A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59 CSS--AESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L K VH ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMV
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 175
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
LNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TR
Sbjct: 176 LNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTR 235
Query: 238 KSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
KSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 236 KSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
Length = 280
Score = 368 bits (944), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 208/288 (72%), Positives = 236/288 (81%), Gaps = 14/288 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ 57
MA SS+SPL +KSL+ SSS PY H PL Q+SS++ S + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNS--EIKDSSNAR 55
Query: 58 ---CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
C+ ++ W+ +S A+AAAV+AS SS++ A+A+LN++EA+TRGEFGIGSAAQ+G
Sbjct: 56 EGCCS--RSESNTWKRILSAAMAAAVIAS-SSSVPAMAELNRFEADTRGEFGIGSAAQYG 112
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADL K +H ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMD
Sbjct: 113 SADLSKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMD 172
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
RMVLNEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYANGTNP+TGV
Sbjct: 173 RMVLNEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYANGTNPLTGV 232
Query: 235 STRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
TRKSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCDAK
Sbjct: 233 DTRKSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDAK 280
>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
Length = 276
Score = 366 bits (940), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 202/282 (71%), Positives = 224/282 (79%), Gaps = 6/282 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL + SPL+ + C+ + + L + V+CQ + DG S + A
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGN--SLSTSAAAA 58
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 59 AASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 114
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNE
Sbjct: 115 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNE 174
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSL
Sbjct: 175 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSL 234
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 235 GCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 276
>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
Length = 270
Score = 366 bits (940), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 178/194 (91%), Positives = 185/194 (95%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS
Sbjct: 76 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 136 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 195
Query: 208 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 267
FSDAVIDL QKQALCKYA+GTN ITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD
Sbjct: 196 FSDAVIDLPQKQALCKYASGTNSITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 255
Query: 268 RDGFCDSGTGLCDA 281
RDGFCD TG+C+A
Sbjct: 256 RDGFCDPATGMCEA 269
>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 206
Score = 364 bits (935), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 178/207 (85%), Positives = 191/207 (92%), Gaps = 1/207 (0%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRRANFTS
Sbjct: 1 MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct: 60 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 119
Query: 196 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 255
SDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSS
Sbjct: 120 SDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSS 179
Query: 256 PLLSAPPQKLLDRDGFCDSGTGLCDAK 282
PLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 180 PLLSAPPQRLLGRDGFCDEKTGLCDVK 206
>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 277
Score = 364 bits (934), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 202/282 (71%), Positives = 223/282 (79%), Gaps = 5/282 (1%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL + SPL+ + C+ + + L + V+CQ + DG S A
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGNSLSTSAAAAAA 60
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 61 ASPPPR-WRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 115
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNE
Sbjct: 116 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNE 175
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSL
Sbjct: 176 ANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSL 235
Query: 241 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
GCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 236 GCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 277
>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Brachypodium distachyon]
Length = 268
Score = 360 bits (924), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 173/195 (88%), Positives = 182/195 (93%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
+ A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV ENFRRANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNENFRRANFTSADMRESDFSGST 133
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
FNGAY+EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL RTVLTRSDLGGA IEGAD
Sbjct: 134 FNGAYMEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLARTVLTRSDLGGATIEGAD 193
Query: 208 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 267
FSDAV+DL QK ALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPP KLLD
Sbjct: 194 FSDAVLDLQQKLALCKYASGTNPVTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPPKLLD 253
Query: 268 RDGFCDSGTGLCDAK 282
RDGFCD TG+CDAK
Sbjct: 254 RDGFCDEATGMCDAK 268
>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 354 bits (909), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 203/284 (71%), Positives = 222/284 (78%), Gaps = 19/284 (6%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPDCSNNQC 58
MAL+S SPL+ + K P L S+ L ++CQ ++ G + SN
Sbjct: 1 MALASTSPLAA-----TVARPKAPASLTRCRSRRLQRISCQATTDRSGGG---NASNTSP 52
Query: 59 AGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
A P WRV VS ALAAAVV + + A ADLNKYEA+ RGEFGIGSAAQFG+ADL
Sbjct: 53 APP-----RWRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADL 103
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
+ VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA++ANFTGADLSDTLMDRMVL
Sbjct: 104 KNTVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLMDRMVL 163
Query: 179 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 238
NEANLTNAVL RTVLTRSDLGGA IEGADFSDAVIDL QK ALCKYA+GTNPITGVSTRK
Sbjct: 164 NEANLTNAVLSRTVLTRSDLGGATIEGADFSDAVIDLPQKLALCKYASGTNPITGVSTRK 223
Query: 239 SLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282
SLGCGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD +GLCDAK
Sbjct: 224 SLGCGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEASGLCDAK 267
>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
Length = 293
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 193/301 (64%), Positives = 216/301 (71%), Gaps = 37/301 (12%)
Query: 11 IKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV 70
+KSL+ SSS + + L Q+SS+ S+ + D SN A+ W+
Sbjct: 1 MKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTS-----AESNTWKR 53
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
+S A AA V + SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRR
Sbjct: 54 ILSAA-MAAAVIASSSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRR 112
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR
Sbjct: 113 ANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVR 172
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------------------AL 221
+VLTRSDLGGA IEGADFSDAVIDL QKQ AL
Sbjct: 173 SVLTRSDLGGAKIEGADFSDAVIDLLQKQVTTTHHYIYPSFRSTIKKYFTNGFHNVLKAL 232
Query: 222 CKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 281
CKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD
Sbjct: 233 CKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDV 292
Query: 282 K 282
K
Sbjct: 293 K 293
>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
Length = 196
Score = 335 bits (859), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 156/192 (81%), Positives = 177/192 (92%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENFRRANFTSADMRE+DFSGS
Sbjct: 1 MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLTRSDLGGA IEGAD
Sbjct: 61 FNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLTRSDLGGAKIEGAD 120
Query: 208 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 267
FSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS+P+LSAPP++LLD
Sbjct: 121 FSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPSAPILSAPPERLLD 180
Query: 268 RDGFCDSGTGLC 279
+DGFCD TG C
Sbjct: 181 KDGFCDDATGKC 192
>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
Length = 219
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 161/205 (78%), Positives = 184/205 (89%), Gaps = 4/205 (1%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RRANFT 134
LAA V+A+ ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENF RRANFT
Sbjct: 14 LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70
Query: 135 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 194
SADMRE+DFSGS FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLT
Sbjct: 71 SADMREADFSGSTFNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLT 130
Query: 195 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPS 254
RSDLGGA IEGADFSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS
Sbjct: 131 RSDLGGAKIEGADFSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPS 190
Query: 255 SPLLSAPPQKLLDRDGFCDSGTGLC 279
+P+LSAPP++LLD+DGFCD TG C
Sbjct: 191 APILSAPPERLLDKDGFCDDATGKC 215
>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 225
Score = 319 bits (817), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 154/193 (79%), Positives = 171/193 (88%), Gaps = 2/193 (1%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
+LADLN EA TRGEFGIGSA QFGSADL+K H ENFRR NFTSADM+E++FS S FN
Sbjct: 28 SLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSNSTFN 87
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
GAYLEKAVAY+ NF+GADLSDTLMDRMVLNEANL+NA+LVR VLTRSDLG AIIEGADFS
Sbjct: 88 GAYLEKAVAYRTNFSGADLSDTLMDRMVLNEANLSNALLVRAVLTRSDLGSAIIEGADFS 147
Query: 210 DAVIDLAQKQ--ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 267
DAV+DL QKQ ALCKYA+GTNP+TG+STRKSLGCGN+RRNAYGSPSSP LSAPP LLD
Sbjct: 148 DAVLDLTQKQAFALCKYASGTNPVTGMSTRKSLGCGNARRNAYGSPSSPELSAPPPILLD 207
Query: 268 RDGFCDSGTGLCD 280
++GFCD+ TG CD
Sbjct: 208 KNGFCDNSTGKCD 220
>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
At1g12250, chloroplastic-like [Glycine max]
Length = 222
Score = 293 bits (751), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 166/251 (66%), Positives = 186/251 (74%), Gaps = 29/251 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S SPLS+ SL+ S SS + + S P V CQ +S +
Sbjct: 1 MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-------------- 44
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ V VS LAAA++A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45 -----RQGNV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 97
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AVHV ENFR +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF G DLSDTL DRMVLNE
Sbjct: 98 AVHVNENFRXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANFPGVDLSDTLTDRMVLNE 157
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
ANL+NA+L+RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKY +T VSTR SL
Sbjct: 158 ANLSNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKHALCKY------VTRVSTRVSL 211
Query: 241 GCGNSRRNAYG 251
GCGN RRNAYG
Sbjct: 212 GCGNKRRNAYG 222
>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
Length = 239
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 101/167 (60%), Positives = 120/167 (71%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
ALADLN YEA T GEFGIGSA Q+G AD++ ++ RR+NFTSAD R + F GS
Sbjct: 51 ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
GAY KAV Y+ NF A+LSD LMDR + EANL NA+L RTV TRSDL A+IEGADF+
Sbjct: 111 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVIEGADFT 170
Query: 210 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 256
+A++D Q ALCKYA+GTNP+TG TRKSLGCG RR PS+P
Sbjct: 171 NALLDKTQVMALCKYASGTNPVTGADTRKSLGCGGKRRYQASYPSNP 217
>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
Length = 214
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 100/167 (59%), Positives = 121/167 (72%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
A ADLN YEAE GEFGIGSA Q+G AD++ ++ RR+NFTSAD R ++F GS
Sbjct: 26 AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
GAY KAV Y+ NF A+LSD LMDR + EANL NAVL R V TRSDL A++EGADF+
Sbjct: 86 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLRNAVLQRAVFTRSDLKDAVVEGADFT 145
Query: 210 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 256
+A++D Q ALCKYA+G NP+TGVSTRKSLGCG+ RR PS+P
Sbjct: 146 NALLDKTQVMALCKYADGVNPVTGVSTRKSLGCGSQRRYKASYPSNP 192
>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
Length = 199
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/176 (65%), Positives = 134/176 (76%), Gaps = 17/176 (9%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V CQI+S + + Q +
Sbjct: 2 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRD---------HRQEST 51
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ K+ VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 52 KWGKV------VSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 104
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRM
Sbjct: 105 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRM 160
>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
Length = 217
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 104/185 (56%), Positives = 125/185 (67%), Gaps = 2/185 (1%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
A+ADLNKYEA GEFG G+A Q+G ADL+ E+ RR+NFT+AD R +F S
Sbjct: 29 AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
GAY K+V KANF A+LSD LMDR VLNEANL NA R VLTRSDLGGA I G DF+
Sbjct: 89 GAYFIKSVVPKANFENANLSDVLMDRAVLNEANLRNANFQRAVLTRSDLGGADINGTDFT 148
Query: 210 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 269
+A++D Q+ ALC+YA+GTN TGV TRKSLGCG+ RR SPS+P P +D+
Sbjct: 149 NALLDKTQQIALCRYADGTNTETGVETRKSLGCGSRRRFRESSPSNP--EGPQVADVDKK 206
Query: 270 GFCDS 274
F S
Sbjct: 207 AFVKS 211
>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
Length = 201
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 107/169 (63%), Positives = 117/169 (69%), Gaps = 19/169 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L +LSKP V C+I + E NN
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43 ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T A S
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTDAQSS 150
>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
Length = 259
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 90/167 (53%), Positives = 119/167 (71%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
A A+LNKYE GEF +G+A Q+G AD++ ++ +R+NFT+AD R+++F SK
Sbjct: 71 ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
AY K+V +AN ADLSD LMDR V+ +ANL AVL R +LTRSDL + I GADF+
Sbjct: 131 AAYFMKSVLARANLENADLSDALMDRAVIVDANLRGAVLQRAILTRSDLDRSDIYGADFT 190
Query: 210 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 256
+A++D Q+ ALCKYA+G NP+TGVSTRKSL CG+SRR SPS+P
Sbjct: 191 NALVDKTQQMALCKYADGVNPMTGVSTRKSLNCGSSRRFKASSPSNP 237
>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 277
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 104/198 (52%), Positives = 130/198 (65%), Gaps = 8/198 (4%)
Query: 86 SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFRRANFTSADMRESD 142
S+ +A A+LN EA GEF GSA QFG DLR V + + R +NFT A+MR +
Sbjct: 81 SSPAAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAK 140
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G+ GAYL KAVA++A+F GA+LSD LMDR VLN AN +A+L R VLT SDLG A
Sbjct: 141 LRGANLTGAYLMKAVAFEADFEGANLSDALMDRAVLNSANFRDAILTRVVLTSSDLGDAK 200
Query: 203 IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPL---LS 259
I+GADFSDA+ID +Q+Q LC+YA+GTN +TGVSTR+SL CG R + SPS + S
Sbjct: 201 IDGADFSDALIDKSQQQKLCQYASGTNSVTGVSTRRSLNCGGGVRTS--SPSRYMTDETS 258
Query: 260 APPQKLLDRDGFCDSGTG 277
A P+ D F GTG
Sbjct: 259 AKPEAAFDASRFSAYGTG 276
>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 231
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/197 (53%), Positives = 122/197 (61%), Gaps = 6/197 (3%)
Query: 72 VSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF--- 128
+S A A V S A+A+LN EA GEF GSA QFG DLR A +V E +
Sbjct: 21 LSVATAMIVSGIIPSPPFAVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTD 79
Query: 129 -RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
R +NFT A+MR+S G+K NGAYL KAVA A+FT ADLSD LMDR V AN TNA+
Sbjct: 80 LRLSNFTGAEMRDSKLVGAKLNGAYLMKAVAANADFTDADLSDALMDRGVFVNANFTNAI 139
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 247
L R VLT SDL GA I ADFSDA++D + LCK A GTNP TGV+TRKSL C R
Sbjct: 140 LARVVLTSSDLNGANITNADFSDALLDNTMQMKLCKIATGTNPTTGVNTRKSLNCTGGRG 199
Query: 248 NAYGSPSSPLLSAPPQK 264
N GSPS + QK
Sbjct: 200 NV-GSPSRYMTEEDAQK 215
>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 147
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 96/165 (58%), Positives = 111/165 (67%), Gaps = 20/165 (12%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S +PLSI S + + S + + Q+ K + P SN
Sbjct: 1 MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47 -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 145
>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
Length = 247
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/184 (52%), Positives = 116/184 (63%), Gaps = 6/184 (3%)
Query: 66 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
K V S ALA A S + A A+LN+ EA GEF GSA QFG DL K K
Sbjct: 34 KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90
Query: 126 E---NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
E + R +NFT ADMR + G+ GAY+ K VA + +FTGAD+SD LMDR VL AN
Sbjct: 91 EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRSVLVGAN 150
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
T+AVL R VLT SD+ AIIE ADF+DA++D +QALCK A+G NP TGV+TR SLGC
Sbjct: 151 FTDAVLNRVVLTSSDMKDAIIENADFTDALLDPKTQQALCKTASGKNPETGVATRVSLGC 210
Query: 243 GNSR 246
R
Sbjct: 211 SGGR 214
>gi|255087366|ref|XP_002505606.1| predicted protein [Micromonas sp. RCC299]
gi|226520876|gb|ACO66864.1| predicted protein [Micromonas sp. RCC299]
Length = 146
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 71/108 (65%), Positives = 85/108 (78%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
MR++ G+ GAYL KAVA+ A+F GA+LSD LMDR VLN AN +A++ R VLT SD
Sbjct: 1 MRKAKLRGANLTGAYLMKAVAFAADFEGANLSDALMDRAVLNNANFKDAIMTRVVLTSSD 60
Query: 198 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 245
LG A+IEGADFSDA+ID+ Q+QALCKYANG N +TGVSTRKSL CG S
Sbjct: 61 LGDAVIEGADFSDALIDVKQQQALCKYANGVNSVTGVSTRKSLNCGGS 108
>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 114
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/111 (61%), Positives = 84/111 (75%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NFT AD+R + G+ GAY+ K VA + +FTGAD+SD LMDR VL +AN TNA+L R
Sbjct: 4 NFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFTNAILNRV 63
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
VLT SDL GAI+E ADF+DA++D+ +QALCK A+G NP TGVSTR SLGC
Sbjct: 64 VLTSSDLEGAIVENADFTDALLDVKTQQALCKTASGKNPETGVSTRVSLGC 114
>gi|224125144|ref|XP_002329904.1| predicted protein [Populus trichocarpa]
gi|222871141|gb|EEF08272.1| predicted protein [Populus trichocarpa]
Length = 108
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/81 (77%), Positives = 68/81 (83%), Gaps = 4/81 (4%)
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 235
MV+NEANLTNAVLVR+ LTR DLGGA I GAD SD+VIDL QKQ YA+GTNP TGVS
Sbjct: 1 MVINEANLTNAVLVRSALTRCDLGGAQIAGADSSDSVIDLPQKQ----YASGTNPTTGVS 56
Query: 236 TRKSLGCGNSRRNAYGSPSSP 256
R SLGCGNSRRNAYG+PSSP
Sbjct: 57 NRASLGCGNSRRNAYGTPSSP 77
>gi|434390855|ref|YP_007125802.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262696|gb|AFZ28642.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 176
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 59/110 (53%), Positives = 77/110 (70%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MRE++F G+ A L K V +AN GA+L+ L+DR+ L+EANL NA+L +
Sbjct: 66 FVAAEMREANFQGADLTNAILTKGVLLRANLEGANLTGALVDRVTLDEANLKNAILQEAI 125
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS L A I GADF+DA+ID Q LC A+G NP+TGVSTR+SLGC
Sbjct: 126 LTRSRLFDADITGADFTDALIDRYQVSLLCDRADGVNPVTGVSTRESLGC 175
>gi|254421873|ref|ZP_05035591.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196189362|gb|EDX84326.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 187
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 77/117 (65%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F +AD R+++F G+ +G L KA + N GAD + T DR++ + A+LTN
Sbjct: 69 KNLSGTSFAAADARDANFEGADMSGTILTKATFLRTNLKGADFTKTFADRVLFDGADLTN 128
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ V + T S G II GADFSDA+ID Q + +CK A+G NP+TG+STR+SLGC
Sbjct: 129 AIFVEAIATSSSFGDTIITGADFSDAIIDRFQVKKMCKRADGINPVTGISTRESLGC 185
>gi|67921246|ref|ZP_00514765.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67857363|gb|EAM52603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 172
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 82/130 (63%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL K V F +ADMRE++F GS + A +A KAN GA+L+ +L
Sbjct: 52 FSHKDLEKGV----------FAAADMREANFEGSNLSYAIFTEATLLKANLKGANLTSSL 101
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+DR+ L+ A+LT+A+L+ + TR+ A+I GADF+DAVID Q +C+ A G NP+T
Sbjct: 102 LDRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCERAEGVNPVT 161
Query: 233 GVSTRKSLGC 242
GVSTR SLGC
Sbjct: 162 GVSTRDSLGC 171
>gi|416382245|ref|ZP_11684306.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
0003]
gi|357265427|gb|EHJ14194.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
0003]
Length = 171
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 82/130 (63%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL K V F +ADMRE++F GS + A +A KAN GA+L+ +L
Sbjct: 51 FSHKDLEKGV----------FAAADMREANFEGSNLSYAIFTEATLLKANLKGANLTSSL 100
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+DR+ L+ A+LT+A+L+ + TR+ A+I GADF+DAVID Q +C+ A G NP+T
Sbjct: 101 LDRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCERAEGVNPVT 160
Query: 233 GVSTRKSLGC 242
GVSTR SLGC
Sbjct: 161 GVSTRDSLGC 170
>gi|218247318|ref|YP_002372689.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167796|gb|ACK66533.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 172
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 83/130 (63%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL KAV F +A+MRE++F GS + A L + V KAN A+L+ +L
Sbjct: 52 FSHRDLEKAV----------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDANLTGSL 101
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+DR+ L+ A+LTNA+LV + TR+ II GADF+DAVID Q +C+ A+G NP+T
Sbjct: 102 LDRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVT 161
Query: 233 GVSTRKSLGC 242
GV+TR SLGC
Sbjct: 162 GVATRDSLGC 171
>gi|434384986|ref|YP_007095597.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428015976|gb|AFY92070.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 165
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 83/144 (57%), Gaps = 15/144 (10%)
Query: 114 GSADLRKAVHVKENFRRAN---------------FTSADMRESDFSGSKFNGAYLEKAVA 158
G D+ AV + NF R N F S+++R + SG+ A L AV
Sbjct: 21 GINDVTLAVSSQTNFSRINLTDRDFGGQDLTGGVFVSSELRGVNMSGANLTNAMLTMAVL 80
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
K N +GA+L+ L DR +EA+LTNA+L LTRS GA I GADF+DA+ID AQ
Sbjct: 81 LKTNLSGANLTGALADRATFDEADLTNAILTEATLTRSRFYGAKITGADFTDALIDRAQA 140
Query: 219 QALCKYANGTNPITGVSTRKSLGC 242
+ LC A+G NP+TGVSTR SLGC
Sbjct: 141 KLLCDRADGINPVTGVSTRDSLGC 164
>gi|257061347|ref|YP_003139235.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591513|gb|ACV02400.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 172
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 82/130 (63%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL KAV F +A+MRE++F GS + A L + V KAN +L+ +L
Sbjct: 52 FSHRDLEKAV----------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDVNLTGSL 101
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+DR+ L+ A+LTNA+LV + TR+ II GADF+DAVID Q +C+ A+G NP+T
Sbjct: 102 LDRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVT 161
Query: 233 GVSTRKSLGC 242
GV+TR SLGC
Sbjct: 162 GVATRDSLGC 171
>gi|354555882|ref|ZP_08975181.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|353552206|gb|EHC21603.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 182
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 84/134 (62%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ + +L++ +N + F +ADMRE++F GS + + + + AN GA+L
Sbjct: 48 NTVNYTYGELQQQDFSHKNLEKGVFAAADMREANFEGSNLSYSIFTEGILLGANLKGANL 107
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S +L+DR+ L+ A+LTNA+LV + TR+ A I GADF++AVID Q +C+ A G
Sbjct: 108 SSSLLDRVTLDFADLTNAILVDAIATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGV 167
Query: 229 NPITGVSTRKSLGC 242
NP+TGVSTR SLGC
Sbjct: 168 NPVTGVSTRDSLGC 181
>gi|172037118|ref|YP_001803619.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|171698572|gb|ACB51553.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
Length = 184
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 84/134 (62%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ + +L++ +N + F +ADMRE++F GS + + + + AN GA+L
Sbjct: 50 NTVNYTYGELQQQDFSHKNLEKGVFAAADMREANFEGSNLSYSIFTEGILLGANLKGANL 109
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S +L+DR+ L+ A+LTNA+LV + TR+ A I GADF++AVID Q +C+ A G
Sbjct: 110 SSSLLDRVTLDFADLTNAILVDAIATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGV 169
Query: 229 NPITGVSTRKSLGC 242
NP+TGVSTR SLGC
Sbjct: 170 NPVTGVSTRDSLGC 183
>gi|254412921|ref|ZP_05026693.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196180085|gb|EDX75077.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 180
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 76/117 (64%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F AD+R + F G+ G+ L KA ++A+ TGA+LS+TL DR+V + ANLTN
Sbjct: 63 QNLEGTSFAGADLRGASFRGASLQGSILTKAAFFEADLTGANLSETLADRVVFDGANLTN 122
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ + +RS I GADFS A++D Q +C+ A+G NP+TGVSTR SLGC
Sbjct: 123 AIFTNAIASRSRFFDTTITGADFSGAILDTYQISLMCQRADGVNPVTGVSTRDSLGC 179
>gi|126658078|ref|ZP_01729230.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
gi|126620716|gb|EAZ91433.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
Length = 181
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 83/134 (61%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ + +L++ +N + F +ADMRE++F GS + + + + AN G DL
Sbjct: 47 NTVNYTYGELQQEDFSHKNLQGGVFAAADMREANFEGSNLSYSIFTEGILLGANLKGVDL 106
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S +L+DR+ L+ A+LTNA+LV + TR+ A I GADF++AVID Q +C+ A G
Sbjct: 107 SSSLLDRVTLDFADLTNAILVDAIATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGV 166
Query: 229 NPITGVSTRKSLGC 242
NP+TGVSTR SLGC
Sbjct: 167 NPVTGVSTRDSLGC 180
>gi|75908890|ref|YP_323186.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75702615|gb|ABA22291.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 168
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 83/143 (58%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
+T F + + +A+L + NF +A+MR ++F G+ A L K V
Sbjct: 25 DTHPAFAQINTINYNNANLENRDFANADLVGVNFVAAEMRGTNFQGANLTNAILTKGVLL 84
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
KAN + A+L+ L+DR+ L+ ANL NA+ LTRS A I GADF+DA+ID Q
Sbjct: 85 KANLSEANLTGALVDRVTLDNANLKNAIFTEATLTRSRFYDADITGADFTDAIIDRYQVS 144
Query: 220 ALCKYANGTNPITGVSTRKSLGC 242
LC+ A+G NP+TGV+TR SLGC
Sbjct: 145 LLCERADGVNPVTGVATRDSLGC 167
>gi|428316344|ref|YP_007114226.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240024|gb|AFZ05810.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 169
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 72/110 (65%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A L K V AN +GA+LS L DR+ + ANLTNA +
Sbjct: 59 FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFTEAI 118
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+TR+ A I GADFSDA+ID Q LC+ A+G NP+TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFSDAIIDAYQVSILCEKADGVNPVTGVSTRESLGC 168
>gi|332712340|ref|ZP_08432267.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332348814|gb|EGJ28427.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 169
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 76/116 (65%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N R F A+MR ++F G+ +G+ K KAN GA+L+D+L DR++L++ANLTNA
Sbjct: 53 NLVRGVFAGAEMRGTNFQGADLSGSIFTKGNLLKANLEGANLTDSLADRVILDQANLTNA 112
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L ++ + A I GADF+DA+ID Q + +C A G NP+TG+STR SLGC
Sbjct: 113 ILTDAIMNSTRFYDAEITGADFTDALIDRYQAKLMCGRATGVNPVTGISTRDSLGC 168
>gi|434405844|ref|YP_007148729.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428260099|gb|AFZ26049.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 168
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 72/110 (65%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A L K V KAN GA+L+ L+DR+ L+ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLAGALVDRVTLDGANLKNAIFTEAT 117
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS A + GADF+DA+ID Q LCK A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDADVTGADFTDALIDRYQVALLCKSADGVNPVTGISTRDSLGC 167
>gi|427720966|ref|YP_007068960.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427353402|gb|AFY36126.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 168
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 81/141 (57%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
R F + + + + +L E+ A F +A+MR ++F G+ A L K V KA
Sbjct: 27 RPAFALTNVINYNNINLENRDFAHEDLTGATFVAAEMRGANFQGANLTNAVLTKGVLLKA 86
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
+ + A+L+ L+DR+ L+ ANL NA+ LTRS A I GADF+DA+ID Q +
Sbjct: 87 DLSDANLTGALVDRVTLDGANLKNAIFTEATLTRSRFYDAEITGADFTDALIDRYQVSLM 146
Query: 222 CKYANGTNPITGVSTRKSLGC 242
C A G NP+TGVSTR SLGC
Sbjct: 147 CDRAAGINPVTGVSTRDSLGC 167
>gi|295293762|gb|ADF88289.1| pentapeptide repeat-containing protein [Aphanizomenon sp. 10E6]
Length = 168
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A K V KAN A+L+ L+DR+ L+ ANL NA+ +
Sbjct: 58 FVAAEMRGTNFQGANLTNAIFTKGVLLKANLEAANLTGALVDRVTLDSANLRNAIFTKAT 117
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGVSTR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPVTGVSTRDSLGC 167
>gi|186685193|ref|YP_001868389.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186467645|gb|ACC83446.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 168
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 72/110 (65%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A L K V KAN GA+LS L+DR+ ++ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLSGALVDRVTMDGANLKNAIFTEAT 117
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS A I GADF+DA+ID Q +C+ A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDAEITGADFTDALIDRYQVSLMCERADGVNPVTGMSTRDSLGC 167
>gi|428224803|ref|YP_007108900.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984704|gb|AFY65848.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 176
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 71/110 (64%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F SA+MR ++F G+ + A L K V AN GA+L+ L DR+ +ANL NA+LV
Sbjct: 67 FVSAEMRNANFEGANLSNAILTKGVLLNANLEGANLTGALADRVFWLDANLRNAILVDVT 126
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
TR+ G + GADFSDA++D + + LCK A G NP+TGV+TR SLGC
Sbjct: 127 ATRTSFEGVDVTGADFSDAILDRYELKELCKRAEGVNPVTGVATRDSLGC 176
>gi|158337601|ref|YP_001518776.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307842|gb|ABW29459.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 172
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 79/133 (59%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F ADLR +NF A+ A + +++ + + G L A ++N T ADL+
Sbjct: 39 AQNFTFADLRYEDFENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLT 98
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+T DR++ NEA+LTNA+ +LT S A I GADFS A +D Q +C+YA+G N
Sbjct: 99 ETFADRVLFNEADLTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVN 158
Query: 230 PITGVSTRKSLGC 242
P+TGVSTR+SL C
Sbjct: 159 PVTGVSTRESLEC 171
>gi|359460928|ref|ZP_09249491.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 172
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 94/180 (52%), Gaps = 13/180 (7%)
Query: 64 KLKNWRVFVSTALAAA-VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
++K W +S A +V SC + ++ + A F ADLR
Sbjct: 4 RIKPWLRTISVVFAVVWLVGSC------------FVLNSQPTWADDGAQNFTFADLRYED 51
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
+NF A+ A + +++ + + G L A ++N T ADL++T DR++ NEA+
Sbjct: 52 FENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLTETFADRVLFNEAD 111
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTNA+ +LT S A I GADFS A +D Q +C+YA+G NP+TGVSTR+SL C
Sbjct: 112 LTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVNPVTGVSTRESLEC 171
>gi|427728139|ref|YP_007074376.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364058|gb|AFY46779.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 168
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 71/110 (64%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A K V AN +GA+L+ L+DR L+ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAIFTKGVLLNANLSGANLTGALVDRATLDSANLKNAIFTEAT 117
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGV+TR+SLGC
Sbjct: 118 LTRSRFYDADITGADFTDAIIDRYQVSLLCERADGINPVTGVATRESLGC 167
>gi|443314355|ref|ZP_21043921.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786047|gb|ELR95821.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 173
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 98/182 (53%), Gaps = 13/182 (7%)
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ-FGSADLRK 120
+ + WR + L A+ I+A A IG Q F +DL +
Sbjct: 3 WQRSGEWRQILRGGLLFAIAIVLWGGIAARA------------IAIGEITQDFTYSDLNR 50
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
EN A+ +AD RE++FSG+ + L K YKA GA+L+ + DR++ +
Sbjct: 51 QDFAGENLAGASLAAADAREANFSGADLSQTILTKGNFYKAKLVGANLTQSFADRVIFDG 110
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
A+L+NA++V ++T + G A I+GADFS ++D Q +C+YA+G NP+TGV+TR SL
Sbjct: 111 ADLSNALVVDAIMTSTSFGEATIQGADFSGTILDRYQVAQMCEYADGVNPVTGVATRDSL 170
Query: 241 GC 242
GC
Sbjct: 171 GC 172
>gi|428778133|ref|YP_007169920.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428692412|gb|AFZ45706.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 174
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 62/138 (44%), Positives = 82/138 (59%), Gaps = 10/138 (7%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ I S F + DL AV F +A+MR+++FSGS A K A+ +
Sbjct: 46 YTIVSERDFSNKDLVGAV----------FAAAEMRKTNFSGSNLENAMFTKGTLINADLS 95
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
+LS LMDR+ L+ A+L NAVL T LTRS L G IEGADF+DA+++ Q + LC+
Sbjct: 96 NTNLSGALMDRVSLDGADLRNAVLQGTFLTRSTLEGTKIEGADFTDAILNRYQVKLLCER 155
Query: 225 ANGTNPITGVSTRKSLGC 242
A G NP TGV+TR SLGC
Sbjct: 156 AEGVNPKTGVATRDSLGC 173
>gi|334119379|ref|ZP_08493465.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333458167|gb|EGK86786.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 169
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A L K V AN +GA+LS L DR+ + ANLTNA +
Sbjct: 59 FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFSEAI 118
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+TR+ A I GADF+DA+ID Q LC+ A+G NP TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFTDAIIDAYQVSILCEKADGVNPATGVSTRESLGC 168
>gi|16331228|ref|NP_441956.1| hypothetical protein sll0301 [Synechocystis sp. PCC 6803]
gi|383322971|ref|YP_005383824.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326140|ref|YP_005386993.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492024|ref|YP_005409700.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437292|ref|YP_005652016.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
gi|451815384|ref|YP_007451836.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
gi|1001404|dbj|BAA10026.1| sll0301 [Synechocystis sp. PCC 6803]
gi|339274324|dbj|BAK50811.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
gi|359272290|dbj|BAL29809.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275460|dbj|BAL32978.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278630|dbj|BAL36147.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781353|gb|AGF52322.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
Length = 169
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 80/126 (63%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL ++ ++ +A F +AD+RES+F GS + + L AV A+ GA+LS +L+DR+
Sbjct: 43 DLARSDFSHQDLNKAVFAAADLRESNFEGSDLSFSILTDAVFLHASLRGANLSGSLVDRV 102
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
L+ A+L + + + TR+ I GADFSDAVID Q + +C+ A G NP+TGV+T
Sbjct: 103 TLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERAEGVNPVTGVAT 162
Query: 237 RKSLGC 242
R SLGC
Sbjct: 163 RDSLGC 168
>gi|17227682|ref|NP_484230.1| hypothetical protein all0186 [Nostoc sp. PCC 7120]
gi|17135164|dbj|BAB77710.1| all0186 [Nostoc sp. PCC 7120]
Length = 168
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 71/112 (63%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
NF +A+MR ++F G+ A L K V KAN + A+L+ L+DR L+ ANL NA+
Sbjct: 56 VNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANLTGALVDRATLDNANLKNAIFTE 115
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS A I GADF+DA+ID Q LC+ ANG N +TG++TR SLGC
Sbjct: 116 ATLTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNRVTGIATRDSLGC 167
>gi|428299988|ref|YP_007138294.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428236532|gb|AFZ02322.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 193
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 80/138 (57%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
G +A + +A+L + NF +A+MR +F G+ A L K V KAN
Sbjct: 55 IGQLNAMNYNNANLENRDFSHADLVGINFVAAEMRGINFEGANLTNAMLTKGVMLKANLE 114
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
GA+L+ L+DR+ L+ ANL NA LTRS L A I GADFS+A+ID Q + LC
Sbjct: 115 GANLTAALVDRVALDGANLKNANFTDATLTRSRLFDADITGADFSNALIDTYQMKLLCDR 174
Query: 225 ANGTNPITGVSTRKSLGC 242
A+GTNP+TGV TR SL C
Sbjct: 175 ASGTNPVTGVDTRDSLEC 192
>gi|440681954|ref|YP_007156749.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428679073|gb|AFZ57839.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 168
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ + A L K V KAN A+L+ L+DR+ L+ ANL NA+
Sbjct: 58 FVAAEMRGANFQGANLSNAILTKGVLLKANLEDANLTGALVDRVTLDSANLKNAIFTEAT 117
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTRS A I GADF+DA+ID Q LC+ ANG N +TG+STR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNSVTGISTRDSLGC 167
>gi|407961395|dbj|BAM54635.1| hypothetical protein BEST7613_5704 [Synechocystis sp. PCC 6803]
Length = 147
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 80/126 (63%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL ++ ++ +A F +AD+RES+F GS + + L AV A+ GA+LS +L+DR+
Sbjct: 21 DLARSDFSHQDLNKAVFAAADLRESNFEGSDLSFSILTDAVFLHASLRGANLSGSLVDRV 80
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
L+ A+L + + + TR+ I GADFSDAVID Q + +C+ A G NP+TGV+T
Sbjct: 81 TLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERAEGVNPVTGVAT 140
Query: 237 RKSLGC 242
R SLGC
Sbjct: 141 RDSLGC 146
>gi|428779391|ref|YP_007171177.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428693670|gb|AFZ49820.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 171
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 82/138 (59%), Gaps = 10/138 (7%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ + S F + DL AV F +A+MR ++FSGS A K A+ +
Sbjct: 43 YTVVSERDFSNKDLVGAV----------FAAAEMRRTNFSGSNLENAMFTKGTLINADLS 92
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
+LS LMDR+ L+ A+L+NAVL T LTRS L G I GADF+DA+++ Q + LC+
Sbjct: 93 NTNLSGALMDRVNLDGADLSNAVLNGTFLTRSTLEGTKITGADFTDAILNRYQVKLLCEK 152
Query: 225 ANGTNPITGVSTRKSLGC 242
A G NP TGVSTR+SLGC
Sbjct: 153 AEGVNPKTGVSTRESLGC 170
>gi|255083653|ref|XP_002508401.1| predicted protein [Micromonas sp. RCC299]
gi|226523678|gb|ACO69659.1| predicted protein [Micromonas sp. RCC299]
Length = 187
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 75/133 (56%), Gaps = 5/133 (3%)
Query: 120 KAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
KA H+ E+F + +T D+R SDFSGS A +AV N GAD+S++ +D
Sbjct: 30 KAEHINEDFSHEDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAVMPGVNLEGADMSNSFLD 89
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
+VL +N+ + RSDLG + ADF++AVID Q LC A+GTNP TGV
Sbjct: 90 YVVLRGSNMRGVIAREANFVRSDLGDCDVTDADFTEAVIDRYQAIGLCDSASGTNPFTGV 149
Query: 235 STRKSLGCGNSRR 247
TR SLGC +R
Sbjct: 150 DTRDSLGCERLKR 162
>gi|170077406|ref|YP_001734044.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169885075|gb|ACA98788.1| Pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 169
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 77/120 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
EN + A+F AD+R SDF+GS + A L + +AN T A+LS+ MD++ + ANLT
Sbjct: 50 HENLQAASFARADVRGSDFTGSDLSRAILTEGKFMEANLTEANLSEAFMDQVNMEGANLT 109
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 244
NA+ V V ++ AII+GADFS A++D Q LCK A+GTN ITG+ TR SL C N
Sbjct: 110 NALFVDAVAPGTNFAEAIIDGADFSGALLDRYQLSELCKRASGTNTITGIDTRYSLNCKN 169
>gi|428308896|ref|YP_007119873.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250508|gb|AFZ16467.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 176
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 77/134 (57%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+ + S DL +N A F +A+MR ++F S A L K V AN A+L
Sbjct: 42 SSINYSSTDLTNRDFSHKNLVGAVFVAAEMRGTNFQESDLTNAILTKGVMLGANLQDANL 101
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ L+DR+ L+ ANL NA+ + RS A I GADF+DA+ID Q LC+ A+G
Sbjct: 102 TGALVDRVTLDNANLKNAIFQEATMIRSRFYDADITGADFTDAIIDRYQVSLLCEKASGV 161
Query: 229 NPITGVSTRKSLGC 242
NPITGV+TR SLGC
Sbjct: 162 NPITGVATRDSLGC 175
>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
Length = 203
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 75/111 (67%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+F R++DFSG+ +G+ L +A +++F+GADLSD LMDR + +L+ A+L
Sbjct: 92 SFAGVMARDADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSGALLRGV 151
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ S GA+I+ ADFSDA++D + ++ALC+ A GTNP TGVSTR SL C
Sbjct: 152 IAAGSSFSGAVIDDADFSDALLDRSDQRALCRRAQGTNPTTGVSTRLSLDC 202
>gi|411119939|ref|ZP_11392315.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410710095|gb|EKQ67606.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 169
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 81/135 (60%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G +G+A+L ++ F SA+MR ++FSG+ A K AN +GA+
Sbjct: 34 GKFLNYGNANLTNQDFSNQDLGGGVFVSAEMRGTNFSGAILTNAMFTKGNLLGANLSGAN 93
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L L+DR L +A+L+NA+L+ L+ S L A ++GADF++A++D LCK A G
Sbjct: 94 LEGALLDRTTLYKADLSNAILIDATLSNSILDEATVDGADFTNAIVDRYAVSQLCKRAQG 153
Query: 228 TNPITGVSTRKSLGC 242
TNP TG+STR+SLGC
Sbjct: 154 TNPTTGISTRESLGC 168
>gi|434407744|ref|YP_007150629.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428261999|gb|AFZ27949.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 162
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 85/130 (65%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + ++ + A F++A++ ++F+G+ GA L +V KAN GADL++ +
Sbjct: 32 FSNAELGRQDFSGQSLQAAEFSNANLELTNFTGADLRGAVLSASVMTKANLHGADLTNAM 91
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L A+L++AV + +L R+ IEGADF+DA++D AQ + LC+ A+G N T
Sbjct: 92 VDQVNLTRADLSDAVFIEALLLRAIFTDVNIEGADFTDAILDRAQVKELCEKASGVNSQT 151
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 152 GVQTRDSLGC 161
>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 166
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 92/179 (51%), Gaps = 27/179 (15%)
Query: 72 VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 123
++T L A +V C + ALA KY AE +G+ F LR A
Sbjct: 6 LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
N R NFT AD+R + FS S V AN GADLS+ ++D++ A+L
Sbjct: 57 SNANLERTNFTDADLRGTIFSAS----------VMTHANLHGADLSNAMIDQVSFTNADL 106
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++AVL +++ RS I GADFSDA++D AQ + LC A G N TGVSTR SLGC
Sbjct: 107 SDAVLTESIMLRSTFDNVDITGADFSDAILDGAQIKELCTKATGVNSQTGVSTRDSLGC 165
>gi|443313318|ref|ZP_21042930.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442776723|gb|ELR87004.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 182
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 77/131 (58%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +A+L+ N F +A+MR ++F G+ A + K V AN GA+LS
Sbjct: 51 NYNNANLQNRDFSHTNLIGGVFVAAEMRGANFQGADLTNAIMTKGVLLGANLEGANLSGA 110
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
L+DR+ L+ ANL NA+ LTRS A I GADFS+A+ID Q LC A GTNP+
Sbjct: 111 LVDRVTLDNANLKNAIFTDATLTRSRFFDADITGADFSNALIDRYQINLLCDRATGTNPV 170
Query: 232 TGVSTRKSLGC 242
TG++T +SLGC
Sbjct: 171 TGITTTESLGC 181
>gi|443328655|ref|ZP_21057250.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791786|gb|ELS01278.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 222
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 81/134 (60%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
++ + ++LR +F F +AD+R S+F GS + + L KA+ N +G DL
Sbjct: 88 NSVNYTYSELRNEDLSHRDFSGGVFAAADVRGSNFEGSDLSNSILTKAIFTDTNLSGVDL 147
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+++ MDR+ L+ +NL+NA+L + T ++ I GADFS A+ID Q LC+ A G
Sbjct: 148 TNSFMDRVDLSNSNLSNAILQDIIATSTNFYNTDITGADFSGAIIDRYQTYVLCQRAAGV 207
Query: 229 NPITGVSTRKSLGC 242
NP+TGVSTR SLGC
Sbjct: 208 NPVTGVSTRYSLGC 221
>gi|427706655|ref|YP_007049032.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359160|gb|AFY41882.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 168
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 79/141 (56%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
R F + + +A+L + F +A+MR ++F + A K V KA
Sbjct: 27 RPAFAQINTINYSNANLENRDFANADLAGVTFVAAEMRGTNFQAANLTNAIFTKGVLLKA 86
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
N GA+L+ L+DR+ L+ ANL NA LTRS A I GADF+DA+ID Q L
Sbjct: 87 NLEGANLTGALVDRVTLDGANLKNANFTEATLTRSRFYDADITGADFTDALIDRYQISLL 146
Query: 222 CKYANGTNPITGVSTRKSLGC 242
C+ A+G NP+TGV+TR+SLGC
Sbjct: 147 CERADGVNPVTGVATRESLGC 167
>gi|440684176|ref|YP_007158971.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428681295|gb|AFZ60061.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 162
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 86/130 (66%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + ++ + A F++A++ ++F+G+ GA L +V +AN GADL++ +
Sbjct: 33 FSNAELGRQDFSGQSLQAAEFSNANLEMANFTGADLRGAVLSASVMTQANLHGADLTNAM 92
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ LN A+L++A+L+ +L RS I GADF+DA++D AQ + LC+ A+G N T
Sbjct: 93 IDQVKLNGADLSDAILLEALLLRSIFTDVNIAGADFTDAILDKAQIKELCQKASGVNSRT 152
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 153 GVETRDSLGC 162
>gi|443322626|ref|ZP_21051645.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442787675|gb|ELR97389.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 164
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 76/130 (58%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+ S +L+ ++ A F AD+R ++F + + L + V AN T A+L+D L
Sbjct: 33 YTSTELQNRDFSGQDLEGAVFADADLRGANFQAANLANSILTQGVFLNANLTKANLTDAL 92
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
DR EANLT+A+LV + +RS AII GADFS A++D Q LC A GTNP+T
Sbjct: 93 ADRATFAEANLTDAILVNIIASRSSFVDAIITGADFSGAILDKYQVALLCDRAQGTNPVT 152
Query: 233 GVSTRKSLGC 242
GVSTR SL C
Sbjct: 153 GVSTRASLNC 162
>gi|298489879|ref|YP_003720056.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231797|gb|ADI62933.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 163
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 85/130 (65%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + ++ + A F++A++ ++F+G+ G +V KAN GA+L++ +
Sbjct: 33 FSNAELGRQDFSGQSLQAAEFSNANLEMANFAGADLRGTVFSASVMTKANLHGANLTNAM 92
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ + LN A+L++A+L+ +L RS IEGADFSDA++D +Q Q LCK A+G N T
Sbjct: 93 VNEVKLNGADLSDAILLEALLLRSIFTDVNIEGADFSDAILDRSQIQELCKKASGVNSQT 152
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 153 GVETRESLGC 162
>gi|425438309|ref|ZP_18818714.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
gi|425452591|ref|ZP_18832408.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
gi|440756403|ref|ZP_20935604.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|443646807|ref|ZP_21129485.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159025958|emb|CAO87888.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|389676535|emb|CCH94452.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
gi|389765527|emb|CCI08587.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
gi|440173625|gb|ELP53083.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|443335636|gb|ELS50100.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 166
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + GS + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGSDLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|425465439|ref|ZP_18844748.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389832325|emb|CCI24153.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 166
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 77/118 (65%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR ++ G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGANLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|428203864|ref|YP_007082453.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981296|gb|AFY78896.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 170
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 72/110 (65%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +ADMR +F S + L + V AN GA+L+++LMDR+ L+ A+LTNA+ V +
Sbjct: 60 FAAADMRGINFEDSDLSNTILTEGVLLGANLKGANLTNSLMDRVTLDFADLTNAIFVDAI 119
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
TR+ I GADFS AV+D Q + LC A+G NP+TG+STR+SLGC
Sbjct: 120 ATRTRFYDTTITGADFSGAVLDRYQVKLLCDRADGVNPVTGISTRESLGC 169
>gi|425469693|ref|ZP_18848608.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
gi|389880432|emb|CCI38813.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
Length = 166
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|17230233|ref|NP_486781.1| hypothetical protein alr2741 [Nostoc sp. PCC 7120]
gi|17131834|dbj|BAB74440.1| alr2741 [Nostoc sp. PCC 7120]
Length = 182
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 82/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + E+ + A F++A++ ++F G+ GA L +V +AN GADL++ +
Sbjct: 52 FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L ANL++ VL +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 172 GVETRDSLGC 181
>gi|422303610|ref|ZP_16390961.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
gi|389791366|emb|CCI12792.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
Length = 166
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|75910505|ref|YP_324801.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704230|gb|ABA23906.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 182
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 82/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + E+ + A F++A++ ++F G+ GA L +V +AN GADL++ +
Sbjct: 52 FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L ANL++ VL +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 172 GVKTRDSLGC 181
>gi|86609913|ref|YP_478675.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558455|gb|ABD03412.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 173
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 83/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +ADL+ +++R ++F SA+++ +D G+ GA KA AN +GADLS++L
Sbjct: 43 FNNADLQGQDLSGQDWRGSSFVSANLQGADLHGANLAGAAFTKANLAGANLSGADLSNSL 102
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D L A+L A L + R+ GA I GADFS+A +D A K+ LC+ A G++PIT
Sbjct: 103 LDLANLAGADLRGAKLTGAIAARAVWQGAQIAGADFSEAYVDRAAKRQLCERAEGSHPIT 162
Query: 233 GVSTRKSLGC 242
GV+TR+SLGC
Sbjct: 163 GVTTRESLGC 172
>gi|166365075|ref|YP_001657348.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
gi|166087448|dbj|BAG02156.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
Length = 166
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|414075538|ref|YP_006994856.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413968954|gb|AFW93043.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 168
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 74/131 (56%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +A+L N F +A+MR ++F + A K V KAN A+L+
Sbjct: 37 NYNNANLENRDFSHTNLVGGTFVAAEMRGTNFQDANLTNAIFTKGVLLKANLESANLTGA 96
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
L+DR+ + ANL NA+ LTRS A I GADF+DA+ID Q LC+ A+G NP+
Sbjct: 97 LVDRVTFDSANLRNAIFAEATLTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPV 156
Query: 232 TGVSTRKSLGC 242
TG+STR SLGC
Sbjct: 157 TGISTRDSLGC 167
>gi|300868096|ref|ZP_07112733.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300333934|emb|CBN57911.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 174
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 68/110 (61%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A L K V AN + A+LS L DR+ + ANLTNA +
Sbjct: 64 FVAAEMRNTNFEGADLTNAILTKGVLLNANLSNANLSGALADRVTFDGANLTNANFTEAI 123
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LTR+ I GADF+DA+ID Q LC+ A G N +TGVSTR+SLGC
Sbjct: 124 LTRTRFYDTAISGADFTDAIIDSYQVNLLCEKAEGVNSVTGVSTRESLGC 173
>gi|425446471|ref|ZP_18826474.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
gi|389733275|emb|CCI02926.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
Length = 166
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|425439807|ref|ZP_18820122.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
gi|425456970|ref|ZP_18836676.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
gi|389719892|emb|CCH96344.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
gi|389801790|emb|CCI19079.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
Length = 166
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|390440134|ref|ZP_10228485.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
gi|389836418|emb|CCI32609.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
Length = 166
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 76/118 (64%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N + + +RS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIASRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|427716094|ref|YP_007064088.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348530|gb|AFY31254.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 163
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 83/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L++ E + A F++A++ +++F+G+ GA L +V + N GADL+D L
Sbjct: 33 FSNAELKRHDFSGETLQGAEFSNANLEQANFAGADLRGAVLSASVMTQTNLHGADLTDAL 92
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L +A+L++AVL +L R+ I ADF+DAV+D AQ + LC A+G N T
Sbjct: 93 VDQVNLTKADLSDAVLKEALLLRAIFTDVNINSADFTDAVLDRAQIKELCGKASGVNSKT 152
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 153 GVQTRDSLGC 162
>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
Length = 167
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 72/111 (64%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+F A R +DFSG+ +GA + +A+F+ ADLSD+LMDR + NLTNA+L
Sbjct: 57 SFAGAVGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGV 116
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + S GA IEGADFSDA++D LC+ A G NPITG++TR SLGC
Sbjct: 117 IASGSSFAGASIEGADFSDALLDRDDVVRLCRDAEGVNPITGMATRDSLGC 167
>gi|428316951|ref|YP_007114833.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240631|gb|AFZ06417.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 165
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 82/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + + R A F++A+M ++FS + GA + +V +AN GA+L++ +
Sbjct: 35 FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLRGAIMSASVMTQANLHGANLTNAM 94
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ A+L++A+L T+L RS G I GADF+DA++D +Q + LC A G N T
Sbjct: 95 IDQVKFTNADLSDAILAETILLRSTFDGVDITGADFTDAIMDGSQVKELCTKATGINSQT 154
Query: 233 GVSTRKSLGC 242
G+STR SLGC
Sbjct: 155 GISTRDSLGC 164
>gi|318040416|ref|ZP_07972372.1| hypothetical protein SCB01_01865 [Synechococcus sp. CB0101]
Length = 174
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+ +F A + ++F+G+ +GA + +A+F+GADLSD LMDR ++ NL N
Sbjct: 58 QQLVNTSFAGAVGKGANFAGANLHGAIFTQGAFPEADFSGADLSDVLMDRTDMSHTNLRN 117
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVLV + + GA + GADFSDA+ID A ++ LC A+GTNP TG TR SLGC
Sbjct: 118 AVLVGVIAAGASFSGADVTGADFSDALIDRADQRQLCAKASGTNPSTGADTRASLGC 174
>gi|428206519|ref|YP_007090872.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428008440|gb|AFY87003.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 192
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 84/130 (64%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+ +A+L ++ R A F++A+M + +F+ + GA + +V +AN GADLS +
Sbjct: 62 YSNAELTGKDFSRQILRAAEFSNANMEQVNFTDADLRGAIMSASVMTQANLHGADLSIAM 121
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ + A+L++AVL +L R+ G I GADFSDA++D AQ + LC+ A+G N T
Sbjct: 122 VDQVKMTGADLSDAVLQEALLLRTIFTGVDITGADFSDAILDGAQVKELCQRASGINSKT 181
Query: 233 GVSTRKSLGC 242
G++TR+SLGC
Sbjct: 182 GIATRESLGC 191
>gi|434400337|ref|YP_007134341.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271434|gb|AFZ37375.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 169
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 82/134 (61%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA + +++R ++ A F +AD R ++F GS + + L KAV AN +L
Sbjct: 35 SAVNYTYSEIRDQDFSHKDLAGAVFAAADARGANFEGSDLSNSILTKAVFSNANLAEINL 94
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ +LMDR+ L+ +NLTNA++ V T ++ GA I GADFSD+++D Q LCK A G
Sbjct: 95 TKSLMDRVALDNSNLTNAIIREAVATSTNFDGATITGADFSDSILDRYQIYLLCKRAEGV 154
Query: 229 NPITGVSTRKSLGC 242
NP TGVSTR SLGC
Sbjct: 155 NPTTGVSTRDSLGC 168
>gi|425462969|ref|ZP_18842432.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
gi|389823905|emb|CCI27601.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
Length = 166
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 75/130 (57%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DLR V R AN AD+ S L +AV KAN GADL+ +L
Sbjct: 46 FSHQDLRGGVFAAAAMRGANLEEADLSYS----------ILTEAVLLKANLKGADLTASL 95
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+T
Sbjct: 96 VDRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVT 155
Query: 233 GVSTRKSLGC 242
GV+TR SLGC
Sbjct: 156 GVATRDSLGC 165
>gi|434390929|ref|YP_007125876.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262770|gb|AFZ28716.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 163
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 80/130 (61%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L+ + R + F++A+M +++F+ + GA +V KAN GA+L++ +
Sbjct: 33 FSNAELKGRDFSGQMLRASEFSNANMEQTNFTDADLRGAIFSASVMTKANLHGANLTNAM 92
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
D++ A+L+ AVL T+L RS I ADFSDA++D Q + LC+ A+G NP T
Sbjct: 93 ADQVNFTNADLSAAVLAETILLRSVFDNTDITAADFSDAILDGVQIKELCQRASGVNPTT 152
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 153 GVDTRESLGC 162
>gi|428209239|ref|YP_007093592.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428011160|gb|AFY89723.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 165
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N RA F + + E++FS + GA AV KAN G D S + L+ A+L++
Sbjct: 48 QNLVRAEFNNTKLAEANFSSADLRGAVFNSAVLRKANLHGVDFSYGIAYLSDLSAADLSD 107
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L ++ RS+ GA + GADFS+AV+D Q LC+YA+G NP+TGV TR+SLGC
Sbjct: 108 AILTSAMMLRSNFKGAKVTGADFSEAVLDREQVVQLCEYASGVNPVTGVDTRESLGC 164
>gi|307153777|ref|YP_003889161.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984005|gb|ADN15886.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 173
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 75/118 (63%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++N R A F +ADMR + F S + A L + + AN GA+L+ TL+DR+ L+ A+L
Sbjct: 54 EKNLRGAVFAAADMRGASFENSDLSYAILTEGILLNANLKGANLTGTLLDRVTLDFADLR 113
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+A+L + TR+ + I GADF+ AVID Q +C+ A+G N ITGVSTR SLGC
Sbjct: 114 DAILTDAIATRTRFYDSDITGADFTGAVIDTYQISLMCERADGVNSITGVSTRDSLGC 171
>gi|334116781|ref|ZP_08490873.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461601|gb|EGK90206.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 165
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 82/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + + R A F++A+M ++FS + GA + +V +AN GA+L++ +
Sbjct: 35 FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLQGAIMSASVMTQANLHGANLTNAM 94
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ A+L++A+L T+L RS G I GADF+DA++D +Q + LC A+G N T
Sbjct: 95 IDQVKFTNADLSDAILAETILLRSTFEGVDITGADFTDAIMDGSQIKELCTKASGINSQT 154
Query: 233 GVSTRKSLGC 242
G+ TR SLGC
Sbjct: 155 GIYTRDSLGC 164
>gi|186684198|ref|YP_001867394.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466650|gb|ACC82451.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 174
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 85/130 (65%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + ++ + A F++A+M ++FS + GA + +V KAN GADL++ +
Sbjct: 44 FSNAELSRRDFSGDSLQAAEFSNANMELANFSNADLRGAVMSASVMTKANLHGADLTNAM 103
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L +A+L++A+ +L R+ I+GADF+DA++D AQ + LC+ A+G N T
Sbjct: 104 VDQVNLTKADLSDAIFKEALLLRAIFNDVNIDGADFTDAILDRAQIKELCRKASGVNSKT 163
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 164 GVQTRESLGC 173
>gi|86605651|ref|YP_474414.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554193|gb|ABC99151.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 165
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 79/130 (60%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +ADL+ +++R ++F SA+++ +D G+ G KA AN GADLS++L
Sbjct: 35 FSNADLQGQDLSGQDWRGSSFVSANLQGADLQGANLAGVAFTKANLAGANLAGADLSNSL 94
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D L A+L A L + R+ GA I GADFSDA +D A + LC+ A G++PIT
Sbjct: 95 LDLANLAGADLRGANLRGAIAARAVWDGAQIAGADFSDAYVDRAALRQLCQRAEGSHPIT 154
Query: 233 GVSTRKSLGC 242
GVSTR SLGC
Sbjct: 155 GVSTRASLGC 164
>gi|354568879|ref|ZP_08988040.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353539391|gb|EHC08878.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 172
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
+DLR ++ +F A+M+ ++F G+ +G L K +A+ + A+L++ DR
Sbjct: 45 SDLRYRDFSHQDLHGTSFAGAEMQGANFQGANLSGTILTKGSFLQADLSNANLAEAFADR 104
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 235
++ N+ANLTNA+ +L S A I GADFS A++D Q + +C A+G NP+TGVS
Sbjct: 105 VIFNKANLTNAIFRDAMLASSRFFEAEITGADFSGAIVDPYQVKLMCDRADGINPVTGVS 164
Query: 236 TRKSLGC 242
TR+SLGC
Sbjct: 165 TRESLGC 171
>gi|414079521|ref|YP_007000945.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413972800|gb|AFW96888.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 162
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 83/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+ +A+L + ++ + A F++A++ ++F+G+ G +V KAN GADL++ +
Sbjct: 32 YSNAELSRQDFSGQSLQAAEFSNANLEMANFTGADLRGTVFSASVMTKANLHGADLTNAM 91
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ + L A+L+NAVL+ +L R+ I GADF+DA++D AQ + LC+ A+G N T
Sbjct: 92 VNEVKLAGADLSNAVLIEALLLRTVFTDVNITGADFTDAILDKAQIKELCQKASGVNSQT 151
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 152 GVETRESLGC 161
>gi|427722287|ref|YP_007069564.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354007|gb|AFY36730.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 175
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 73/120 (60%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ + A+F AD+R SDFSGS + A L + N +GADL++ MD++ L+ ANLT
Sbjct: 56 HQQLQAASFARADVRSSDFSGSDLSRAILSEGKFMDTNLSGADLTEAFMDQVNLSGANLT 115
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 244
NA+ V ++ A I GADFS A++D Q LCK A+GTN ITG+ TR SL C N
Sbjct: 116 NAIFTDAVAPGTNFTDANIAGADFSGALLDRYQLSQLCKRASGTNAITGIETRYSLNCEN 175
>gi|427729477|ref|YP_007075714.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427365396|gb|AFY48117.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 170
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 83/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + ++ + A F++A++ +DF+G+ GA L +V +AN ADL++ +
Sbjct: 41 FSNAELARHDFAGDSLQAAEFSNANLEMTDFTGADLRGAVLSASVMTQANLHKADLTNAM 100
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L A+L++AV +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 101 VDQVNLTGADLSDAVFKEALLLRAIFNDVNIEGADFTDALLDKAQIKELCTKASGVNSQT 160
Query: 233 GVSTRKSLGC 242
GV+TR SLGC
Sbjct: 161 GVATRDSLGC 170
>gi|428777417|ref|YP_007169204.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691696|gb|AFZ44990.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 165
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 77/133 (57%), Gaps = 5/133 (3%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A Q + D ++ F N +AD +++ G+ FNGA L + AN+ G + S
Sbjct: 37 AVQAETQDFSGQTLIEAEFYDENLEAADFHDANLEGAVFNGATL-----HNANWRGVNFS 91
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+ + +LTNAVL ++ RS GAI+EGADF++AV+D Q + LC+ A+G N
Sbjct: 92 NGIAYLTDFTGVDLTNAVLTEAMMLRSKFEGAIVEGADFTNAVVDRLQVKKLCERASGVN 151
Query: 230 PITGVSTRKSLGC 242
P TGVSTR+SLGC
Sbjct: 152 PTTGVSTRESLGC 164
>gi|33862602|ref|NP_894162.1| hypothetical protein PMT0329 [Prochlorococcus marinus str. MIT
9313]
gi|33634518|emb|CAE20504.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 179
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 77/123 (62%), Gaps = 1/123 (0%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
K H ++ ++F A R +DFS S +GA L + ++NF+GADLSD LMDR+
Sbjct: 58 KDFHAQD-LSNSSFAGAVARAADFSNSNLHGAILTQGTFTQSNFSGADLSDALMDRVDFV 116
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
+ +L N VL + + S GA I+GADFSDA++DL ++ LC A+G N ITG++T +S
Sbjct: 117 DTDLRNCVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFES 176
Query: 240 LGC 242
L C
Sbjct: 177 LNC 179
>gi|428770661|ref|YP_007162451.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684940|gb|AFZ54407.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 165
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 76/126 (60%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL + +N + F++ + ++FS S GA A +ANF GADL++ +
Sbjct: 39 DLSQQDFSSQNLQSMEFSNVKLNGANFSNSDLRGAVFNAARLEEANFHGADLTNGFIYVT 98
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
LN A+LT+A+L ++ R+ L GA ++GADF+ AV+D Q LCK A G NP+TG ST
Sbjct: 99 SLNRADLTDAILREAIMKRTTLKGANVDGADFTFAVLDNEQVIELCKNAQGINPVTGAST 158
Query: 237 RKSLGC 242
R+SLGC
Sbjct: 159 RQSLGC 164
>gi|428224653|ref|YP_007108750.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984554|gb|AFY65698.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 187
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 70/110 (63%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F++A++ ++F G+ G +V AN GA+L++ LMD+ L A+L A+L +
Sbjct: 77 FSNANLERANFEGADVRGGVFSASVLTDANLQGANLTNALMDQANLTRADLRGAILSEAI 136
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L S I GADFSDA++D AQ +ALC+ A G NP+TG+STR+SLGC
Sbjct: 137 LLGSTFAETAIAGADFSDAILDGAQIKALCQRAEGVNPVTGLSTRESLGC 186
>gi|56750202|ref|YP_170903.1| hypothetical protein syc0193_c [Synechococcus elongatus PCC 6301]
gi|81300170|ref|YP_400378.1| hypothetical protein Synpcc7942_1361 [Synechococcus elongatus PCC
7942]
gi|56685161|dbj|BAD78383.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169051|gb|ABB57391.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 167
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 69/110 (62%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F S +MR+++ + A L V ANF GADLS L+DR+ L A+LT+A+LV
Sbjct: 57 FVSTEMRKANLEEANLRNAILTLGVFLDANFHGADLSGALLDRVFLVGADLTDALLVDVT 116
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
TR+ I GADF+DA+ID +++ LC A+G NP TGV+TR SLGC
Sbjct: 117 ATRTSFQDVKITGADFTDAIIDRYEQKQLCLRADGVNPKTGVATRDSLGC 166
>gi|124023686|ref|YP_001017993.1| hypothetical protein P9303_19861 [Prochlorococcus marinus str. MIT
9303]
gi|123963972|gb|ABM78728.1| Uncharacterized low-complexity proteins [Prochlorococcus marinus
str. MIT 9303]
Length = 179
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 76/123 (61%), Gaps = 1/123 (0%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
K H ++ +F A R +DFS S GA L + ++NF+GADLSD LMDR+
Sbjct: 58 KDFHAQD-LSNTSFAGAVARAADFSNSNLRGAILTQGTFTQSNFSGADLSDALMDRVDFV 116
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
+ +L N+VL + + S GA I+GADFSDA++DL ++ LC A+G N ITG++T +S
Sbjct: 117 DTDLRNSVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFES 176
Query: 240 LGC 242
L C
Sbjct: 177 LNC 179
>gi|427708609|ref|YP_007050986.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427361114|gb|AFY43836.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 189
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 82/130 (63%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + + + A F++A+M ++F+G+ GA L +V KAN ADL++ +
Sbjct: 59 FSNAELARRDFSGQTLQAAEFSNANMEMANFTGADLRGAVLSASVMTKANLHQADLTNAM 118
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L A+L++AV +L R+ I+GADF+DAV+D AQ + LC A+G N T
Sbjct: 119 VDQVNLTGADLSDAVFKEALLLRALFTDVNIQGADFTDAVLDKAQIKELCSKASGVNSKT 178
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 179 GVETRESLGC 188
>gi|282902031|ref|ZP_06309929.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281193118|gb|EFA68117.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 162
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 81/130 (62%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + +N + A F++A++ ++F+ + GA +V +AN GADL++ +
Sbjct: 32 FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L +A+L++A+ + +L RS+ I+GADFS A++D Q + LCK A G N T
Sbjct: 92 LDQVKLTDADLSDAIFIEAILLRSNFAKTNIDGADFSKAILDRGQIRDLCKSARGINSRT 151
Query: 233 GVSTRKSLGC 242
V TR SLGC
Sbjct: 152 HVQTRDSLGC 161
>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 171
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ +F A R ++FSG+ +GA + +A+F+GADLSD LMDR NL +
Sbjct: 55 QHLANTSFAGAVGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNLRD 114
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL + + S A I GADFSDA++DL ++ LC+ A+G NP+TGV+T SLGC
Sbjct: 115 AVLTGIIASGSSFSDAQIAGADFSDALLDLDDQRRLCRDADGVNPVTGVATLDSLGC 171
>gi|443312247|ref|ZP_21041866.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442777717|gb|ELR87991.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 162
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 80/130 (61%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L+ + R A F++A+M ++FS + GA +V A+ GADLS+ +
Sbjct: 32 FSNAELKSRDFSGQTLRAAEFSNANMELANFSNADLRGAVFSASVMTGASLHGADLSNAM 91
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L +A+L++AVL +L R+ I ADF+DA++D AQ + LC A+G NP T
Sbjct: 92 VDQVNLTKADLSDAVLTEALLLRAIFDDVSIVNADFTDAILDRAQIKELCAKASGVNPKT 151
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 152 GVETRYSLGC 161
>gi|411116478|ref|ZP_11388965.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410712581|gb|EKQ70082.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 165
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 72/117 (61%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N R+ F +D++ ++F+G+ GA A AN G D SD + A+L++
Sbjct: 48 QNLVRSEFGDSDLQGANFAGADLRGAVFNGAKLTNANLHGVDFSDGIAYITDFANADLSD 107
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L +L +S GA I GADFSDA ID AQ ALC+ A+GTNP+TGV TR+SLGC
Sbjct: 108 AILNSAMLLKSSFKGANITGADFSDAAIDRAQVLALCQTASGTNPVTGVDTRESLGC 164
>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 175
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+ +F A + ++FSG+ +GA L + ANF GADLSD L+DR ++ +L N
Sbjct: 59 QQLANTSFAGAVGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRN 118
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVLV + + S GA +E ADF+DA++D A ++ C A+GTNP TG +TR SLGC
Sbjct: 119 AVLVGVIASGSTFTGAQVENADFTDALLDRADQRNFCISASGTNPTTGANTRASLGC 175
>gi|78212794|ref|YP_381573.1| hypothetical protein Syncc9605_1263 [Synechococcus sp. CC9605]
gi|78197253|gb|ABB35018.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 169
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 87/149 (58%), Gaps = 21/149 (14%)
Query: 100 ETRGEFGIGS-AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-- 156
E RG+F + +A DL++ +K + R N + D+R + + S+ GA L A
Sbjct: 34 ELRGQFAVQEISADMHGLDLKEKEFLKADLREVNLSGTDLRGAVINTSQLQGADLRDANL 93
Query: 157 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
V + ++F GADL AN TNA+++++ T A I+GADF++AVI
Sbjct: 94 SDVVGFASHFEGADLRG----------ANFTNAMMMQSRFT-----DAQIDGADFTNAVI 138
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
DL Q++ALC A+G+NPI+GVSTR+SLGC
Sbjct: 139 DLPQQRALCARADGSNPISGVSTRESLGC 167
>gi|428306100|ref|YP_007142925.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247635|gb|AFZ13415.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 174
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 79/130 (60%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F + DL AV F +A M+ ++F GS + A L + ANF A+L++ L
Sbjct: 54 FSNTDLTGAV----------FAAAQMKGANFQGSNLSNAILSQGTLSNANFADANLTNAL 103
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L+ A+LTNA+ + + ++ + I GADF+DA+ID Q + LC+ A+G NP+T
Sbjct: 104 VDQVTLDGADLTNAIFRQATMVGTNFNDSAIAGADFTDAIIDRYQLKQLCQRASGVNPVT 163
Query: 233 GVSTRKSLGC 242
VSTR+SLGC
Sbjct: 164 AVSTRESLGC 173
>gi|282900610|ref|ZP_06308552.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281194410|gb|EFA69365.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 167
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 72/118 (61%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ + R ++FT A++R+SDFSGS G A ANFTGADL++ +D ANLT
Sbjct: 49 QRDLRDSSFTKANLRQSDFSGSNLTGVSFFAANLESANFTGADLTNATLDSARFIGANLT 108
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA+L + + GAII GADF+D ++ ++ LC+ ANG NP TG TR++L C
Sbjct: 109 NAILEGSFAASAKFDGAIIAGADFTDVLLRRDEQNKLCQVANGINPTTGRHTRETLFC 166
>gi|116072323|ref|ZP_01469590.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
gi|116064845|gb|EAU70604.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
Length = 186
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F A R +DF + +GA L + +A+F GADLSD LMDR ++L +
Sbjct: 70 QNLANTSFAGATGRGADFRDAILHGAILTQGAFAEADFRGADLSDALMDRADFVASDLRD 129
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL+ + + S A+IEGADF+DA++D ++ LC+ A+G NP TGVST SLGC
Sbjct: 130 AVLIGVIASGSSFSKALIEGADFTDALLDRDDQRRLCRDADGINPTTGVSTFDSLGC 186
>gi|412992118|emb|CCO19831.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 293
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 79/144 (54%), Gaps = 10/144 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F + DLR + V+ R NF+ +DMR GA + +++ A+ G+D+
Sbjct: 145 SNEDFSNLDLRGTIWVEAELRNTNFSKSDMR----------GAVMTRSIMPNADVHGSDV 194
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S+ L D ++L AN +AV V RSD+G I+ ADF++AVID Q LC+ A G
Sbjct: 195 SNVLFDYVLLRGANFEDAVAVGANFIRSDMGEMKIKNADFTEAVIDRYQVLGLCETAEGV 254
Query: 229 NPITGVSTRKSLGCGNSRRNAYGS 252
NP TGV TR SLGC + + GS
Sbjct: 255 NPYTGVDTRMSLGCDSFVKKYEGS 278
>gi|87303664|ref|ZP_01086439.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
gi|87281769|gb|EAQ73734.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
Length = 153
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 59/146 (40%), Positives = 81/146 (55%), Gaps = 8/146 (5%)
Query: 105 FGIGSAAQFGSADLRKAVHVKE--------NFRRANFTSADMRESDFSGSKFNGAYLEKA 156
G+GSAA + +LR +++ + R+ F A M D SGS GA +
Sbjct: 7 MGVGSAAAITAPELRGQRALQDLQPDMHGRDLRQQEFLKASMGGFDLSGSDLRGAVFNSS 66
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
N + A+L D + + A+L+ AVL +L +S GA IEGADFSDAV+DL+
Sbjct: 67 DLTNTNLSAANLEDAVAFATRFDGADLSGAVLRNAMLMQSRFTGAQIEGADFSDAVLDLS 126
Query: 217 QKQALCKYANGTNPITGVSTRKSLGC 242
Q +ALC A+G NP TGVST +SLGC
Sbjct: 127 QVKALCSRADGVNPSTGVSTVESLGC 152
>gi|428222027|ref|YP_007106197.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995367|gb|AFY74062.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 161
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 74/117 (63%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+ R + F +A+M ++F + GA ++ AN A+ + ++D++ A+LT+
Sbjct: 44 QELRGSGFANANMENANFERADLRGAVFSASILRNANLRAANFTTGMLDQIDFANADLTD 103
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+LV T+L RS A I+GADF+DA++D AQ + LC A GTNP TGVSTR+SLGC
Sbjct: 104 AILVDTLLLRSTFDFAKIDGADFTDALLDGAQIKWLCSKAKGTNPFTGVSTRESLGC 160
>gi|427735661|ref|YP_007055205.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370702|gb|AFY54658.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 168
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 76/138 (55%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
F + + S +L ++ NF SA+MR ++F G+ A K AN
Sbjct: 30 FAQTNTINYSSTNLENRDFSNQDLTAVNFISAEMRGTNFQGADLTNAMFTKGNLLGANLE 89
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
GA+ ++ L+D++ L+ ANL NA + ++RS A I GADF+DA+ID Q + +C
Sbjct: 90 GANFTNALVDQVTLDNANLKNANFTQATMSRSRFFDADITGADFTDAIIDRYQVKLMCDR 149
Query: 225 ANGTNPITGVSTRKSLGC 242
A+G NP TGV TR SLGC
Sbjct: 150 ASGVNPETGVETRYSLGC 167
>gi|87125517|ref|ZP_01081362.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
gi|86166817|gb|EAQ68079.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
Length = 180
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 71/117 (60%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R +F A R +DFS + +GA + A+F GADLSD LMDR + +L
Sbjct: 64 QDLRNTSFAGAVGRGADFSDANLHGAIFTQGAFANADFHGADLSDALMDRADFSGTDLRG 123
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L + + S GA IEGADFSDA++D + LC+ A G++P TGVSTR+SLGC
Sbjct: 124 TLLSGVIASGSSFAGAQIEGADFSDALLDRDDVRRLCRDAEGSHPHTGVSTRESLGC 180
>gi|260435516|ref|ZP_05789486.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
gi|260413390|gb|EEX06686.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
Length = 163
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 87/149 (58%), Gaps = 21/149 (14%)
Query: 100 ETRGEFGIGS-AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-- 156
E RG+F + +A DL++ +K + R N + D+R + + S+ GA L A
Sbjct: 28 ELRGQFAVQEISADMHGLDLKEKEFLKADLREVNLSGTDLRGAVINTSQLQGADLRDADL 87
Query: 157 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
V + ++F GADL AN TNA+++++ T A I+GADF++AVI
Sbjct: 88 SDVVGFASHFEGADL----------RGANFTNAMMMQSRFT-----DAQIDGADFTNAVI 132
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
DL Q++ALC A+G+NPI+GVSTR+SLGC
Sbjct: 133 DLPQQRALCVRADGSNPISGVSTRESLGC 161
>gi|78185103|ref|YP_377538.1| hypothetical protein Syncc9902_1536 [Synechococcus sp. CC9902]
gi|78169397|gb|ABB26494.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 182
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F A R +DF + +GA L + +A+F GADLSD LMDR +L +
Sbjct: 66 QNLANTSFAGATGRGADFRDANLHGAILTQGAFAEADFRGADLSDALMDRADFVATDLRD 125
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL+ + + S A+IEGADF+DA++D ++ LC+ A+G NP TG+ST SLGC
Sbjct: 126 AVLIGVIASGSSFSKALIEGADFTDALLDRDDQRLLCRDADGINPTTGISTFDSLGC 182
>gi|427420479|ref|ZP_18910662.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425756356|gb|EKU97210.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 169
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 75/122 (61%), Gaps = 5/122 (4%)
Query: 126 ENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
NF AN +A++R ++F G+ + L KA + + TGA+LS+T DR+
Sbjct: 46 RNFENANLAGTSLAAAEVRNANFRGADLSATILTKAKFIRTDLTGANLSETFADRVEFTG 105
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
++LTNAV+ ++T S A I GADFS ++D Q + LC+ A+G NP+TGVSTR+SL
Sbjct: 106 SDLTNAVVTDALMTSSTFADATITGADFSYTILDRFQVKYLCERADGMNPVTGVSTRESL 165
Query: 241 GC 242
GC
Sbjct: 166 GC 167
>gi|411119374|ref|ZP_11391754.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711237|gb|EKQ68744.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 182
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 76/130 (58%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F LR A N R NFT+AD+R GA + + + GADL+ +
Sbjct: 63 FSGQILRVAEFSNANLNRVNFTNADLR----------GAVMSASTMVDTSLHGADLTQAM 112
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ + +L++A+L T+L R+ +EGADF+DA++D AQ +ALC++A+G N T
Sbjct: 113 LDQVKMIRTDLSDAILANTILLRTTFENINLEGADFTDAILDGAQVKALCQFASGANSKT 172
Query: 233 GVSTRKSLGC 242
GVSTR SLGC
Sbjct: 173 GVSTRDSLGC 182
>gi|428302010|ref|YP_007140316.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238554|gb|AFZ04344.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 162
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 80/130 (61%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A L + ++ + A F++A+M +DF G+ GA + + KAN GA+L++ L
Sbjct: 32 FSNAQLARQDFSGQSLQAAEFSNANMELADFRGADLRGAVMSASTMTKANLHGANLANAL 91
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L A+L++AVL +L R+ I GADF+DA++D AQ + LC A+G N T
Sbjct: 92 VDQVNLTGADLSDAVLQEALLLRAIFTDVKINGADFTDAILDGAQIRELCNIASGVNSQT 151
Query: 233 GVSTRKSLGC 242
GV TR SLGC
Sbjct: 152 GVETRYSLGC 161
>gi|298492040|ref|YP_003722217.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298233958|gb|ADI65094.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 167
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/132 (43%), Positives = 76/132 (57%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F DLR + K N R++NFT A++R G F A LE A N GADL++
Sbjct: 45 ADFSRRDLRDSSFTKANLRQSNFTGANLR-----GVSFFAANLESA-----NLEGADLTN 94
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D L ANLTNAVL + GAI++GADF+DA++ +++ LC A GTNP
Sbjct: 95 ATLDSARLIRANLTNAVLEGAFAASAKFDGAIVDGADFTDALLRQDEQKKLCNLAKGTNP 154
Query: 231 ITGVSTRKSLGC 242
ITG TR++L C
Sbjct: 155 ITGRDTRETLFC 166
>gi|148240085|ref|YP_001225472.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147848624|emb|CAK24175.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
Length = 174
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 67/111 (60%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+F A + +DFSG+ GA + ANF GADLSD LMDR +L +AVL+
Sbjct: 64 SFAGAAGKGADFSGANLQGAIFTQGAFADANFHGADLSDALMDRADFTGTDLRDAVLIGV 123
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + S GA ++GADFSDA++D ++ LC+ A G NP TGV TR SL C
Sbjct: 124 IASGSSFAGAQVDGADFSDALLDRDDQRRLCQEAEGVNPTTGVLTRDSLSC 174
>gi|428772631|ref|YP_007164419.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428686910|gb|AFZ46770.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 166
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 73/130 (56%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F S L+ +N + A FT ++ ++ F+ S GA A ANF+G D+SD L
Sbjct: 36 FESKSLKGEDFTNQNLQLAEFTKVNLEDAKFNDSDLRGAVFNGVNAEGANFSGVDMSDGL 95
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+ N +L+NA+ ++ R+ A +EGADF+ AV+D Q LCK A+G NP+T
Sbjct: 96 VYVTSFNNTDLSNAIFRDAIMLRTTFKNANVEGADFTFAVLDSEQVNQLCKNASGVNPVT 155
Query: 233 GVSTRKSLGC 242
STR+SLGC
Sbjct: 156 NASTRQSLGC 165
>gi|119509637|ref|ZP_01628783.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
gi|119465656|gb|EAW46547.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
Length = 221
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 66/110 (60%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+MR ++F G+ A L K V AN A+L L+DR+ ++ ANL NA+
Sbjct: 111 FVAAEMRGANFQGANLKNAILTKGVLLNANLENANLEGALVDRVTMDGANLKNAIFTEAT 170
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+TRS A I GADF+DA+ID Q +C A G N +TGV+TR SLGC
Sbjct: 171 MTRSRFFDADITGADFTDALIDRYQVALMCDRAAGINSVTGVATRDSLGC 220
>gi|317969830|ref|ZP_07971220.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 178
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 78/135 (57%), Gaps = 9/135 (6%)
Query: 112 QFGSADLRKAVH----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
Q + DL+ +H ++ F +A+ D+ E+D G+ FN A L+ A N + AD
Sbjct: 48 QRSAQDLQPDMHGRNLQQQEFLKASMEGFDLSETDLRGAVFNTANLQNA-----NLSAAD 102
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L D + + A+L+ AV +L S GA+IEG DF+DAV+DL Q++ALC A+G
Sbjct: 103 LEDAVAFATRFDNADLSGAVFRNAMLMNSKFTGAVIEGTDFTDAVLDLPQQKALCARASG 162
Query: 228 TNPITGVSTRKSLGC 242
NP TGV TR+SL C
Sbjct: 163 VNPRTGVDTRESLAC 177
>gi|220905675|ref|YP_002480986.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219862286|gb|ACL42625.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 162
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 68/110 (61%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A M E++F G+ A L KA +ANF GA+L+D L D + ++L+NA+L
Sbjct: 52 FAAAVMPEANFEGANLRNAILSKAELSQANFRGANLTDVLADGVSWANSDLSNAILAGAT 111
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L + G I GADFSDA+ID LC+ A G NP+TG++TR+SLGC
Sbjct: 112 LIGTTFTGVTITGADFSDALIDRYDVSLLCQRAEGINPVTGIATRESLGC 161
>gi|282895655|ref|ZP_06303780.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281199349|gb|EFA74214.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 171
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 89/175 (50%), Gaps = 16/175 (9%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
+ V ++ +L + +C +++ A +Y E I A F DLR +
Sbjct: 12 FLVILNLSLLVIIPLTCLVGLTSTALALEYNKE------ILIGADFSQRDLRDS------ 59
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
+FT A++R+SDFSGS G A ANFTGADL++ +D ANLTNA+
Sbjct: 60 ----SFTKANLRQSDFSGSNLTGVSFFAANLESANFTGADLTNATLDSARFIGANLTNAI 115
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L + GAII GADF+D ++ ++ LC+ A G NP TG TRK+L C
Sbjct: 116 LEGAFAASAKFDGAIITGADFTDVLLRRDEQNKLCQLAKGINPTTGRHTRKTLFC 170
>gi|308814214|ref|XP_003084412.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
gi|116056297|emb|CAL56680.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
Length = 186
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 76/140 (54%), Gaps = 10/140 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A S DL A++ + + R AN ++ D R GA +A+ D S+
Sbjct: 34 ADLASNDLTGAIYAESDLRNANISNTDAR----------GAVFSRAIMPGVKLNATDASN 83
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+ D VL A++ + V R+D+G A+IEGADFS+AVID + LC+ A+GTNP
Sbjct: 84 AMFDYAVLRGADMRDGVFANANFVRADMGEAMIEGADFSEAVIDRYEAIRLCERASGTNP 143
Query: 231 ITGVSTRKSLGCGNSRRNAY 250
TG+ TR +LGC +SR + Y
Sbjct: 144 WTGIETRATLGCDDSRVSKY 163
>gi|218439896|ref|YP_002378225.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172624|gb|ACK71357.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 170
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 74/117 (63%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N A F +A+MR + S + + L +AV AN GA+L+ +L+DR+ L+ A+LTN
Sbjct: 52 KNLYGAVFAAANMRGASLENSDLSYSILTEAVLLNANLKGANLTGSLVDRVTLDFADLTN 111
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ + +R+ I GADFS A++D Q +C+ A+G NP+TGVSTR+SLGC
Sbjct: 112 AIFTDAIASRTRFYDTTITGADFSGAILDQYQVYLMCERASGVNPVTGVSTRESLGC 168
>gi|16332305|ref|NP_443033.1| hypothetical protein sll0577 [Synechocystis sp. PCC 6803]
gi|383324046|ref|YP_005384900.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383327215|ref|YP_005388069.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383493099|ref|YP_005410776.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384438367|ref|YP_005653092.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
gi|451816456|ref|YP_007452908.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
gi|1653935|dbj|BAA18845.1| sll0577 [Synechocystis sp. PCC 6803]
gi|339275400|dbj|BAK51887.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
gi|359273366|dbj|BAL30885.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276536|dbj|BAL34054.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279706|dbj|BAL37223.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407960039|dbj|BAM53279.1| hypothetical protein BEST7613_4348 [Synechocystis sp. PCC 6803]
gi|451782425|gb|AGF53394.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
Length = 169
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 77/140 (55%), Gaps = 10/140 (7%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKAN 162
G A+ F + L + ++ A FT+ D+ S D GS FNGA L A N
Sbjct: 34 GGASAFENMVLAETDFRDQDLLTAQFTNVDLTSSIFEAMDLRGSVFNGANLTDA-----N 88
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
G DL++ L N ANL NA+L ++ R+ A I+GADFS AV+D Q ALC
Sbjct: 89 LKGVDLTNGLTYLTSFNGANLENAILAEAIMLRTSFKNAKIQGADFSLAVLDTEQIAALC 148
Query: 223 KYANGTNPITGVSTRKSLGC 242
K A+G NP TG+STR+SLGC
Sbjct: 149 KVADGVNPKTGISTRESLGC 168
>gi|428227020|ref|YP_007111117.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986921|gb|AFY68065.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 166
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 74/120 (61%)
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
+ ++ +A F++A+++ +DFSG+ GA + AN G D SD + ++AN
Sbjct: 46 YAGQSLLQAEFSNANLKNADFSGADLRGAVFNGSTLVHANLRGVDFSDGIAYISDFSDAN 105
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L++AVL +L +S GA + GADF+DAV+D AQ LCK A+G N ITG TR+SLGC
Sbjct: 106 LSDAVLSSAMLLKSRFTGADVTGADFTDAVLDRAQVLQLCKTASGVNSITGADTRESLGC 165
>gi|318041364|ref|ZP_07973320.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 170
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 77/130 (59%), Gaps = 9/130 (6%)
Query: 117 DLRKAVH----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
DL+ +H ++ F +AN D ESD G+ FN A L+ A N ADL D +
Sbjct: 45 DLQPDMHGRNLQQQEFLKANLEGFDFSESDLRGAVFNTANLQGA-----NLHAADLEDAV 99
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+ A+L++AVL +L S G++I+GADF+DAV+DL Q++ALC+ A GTN T
Sbjct: 100 AFASRFDNADLSDAVLRNAMLMNSKFAGSVIDGADFTDAVLDLPQQKALCERAGGTNART 159
Query: 233 GVSTRKSLGC 242
GV+TR SL C
Sbjct: 160 GVNTRDSLNC 169
>gi|443478408|ref|ZP_21068166.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443016315|gb|ELS31005.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 150
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 76/130 (58%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A L+ N + F +A+M ++F + GA ++ KAN G D S L
Sbjct: 20 FSHAQLKNRDFSGRNLVGSGFANANMEGANFENADVRGAVFSASILRKANLKGTDFSGGL 79
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D+ +A+L+NA+LV T+L RS I+GADF+DA++D AQ++ LC A GTN T
Sbjct: 80 LDQADFAKADLSNALLVETILLRSTFDFVNIDGADFTDAIMDGAQRKWLCSKAKGTNAKT 139
Query: 233 GVSTRKSLGC 242
G++TR+SL C
Sbjct: 140 GINTRESLEC 149
>gi|282897737|ref|ZP_06305736.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281197416|gb|EFA72313.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 162
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 80/130 (61%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L + +N + A F++A++ ++F+ + GA +V +AN GADL++ +
Sbjct: 32 FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D++ L A+L++A+ + +L RS A I+GADF++A++D Q LCK A G N T
Sbjct: 92 LDQVKLTGADLSDAIFLEAILLRSIFTEANIDGADFTEAILDRGQVGELCKSARGVNSQT 151
Query: 233 GVSTRKSLGC 242
V TR SLGC
Sbjct: 152 HVQTRDSLGC 161
>gi|260435480|ref|ZP_05789450.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
gi|260413354|gb|EEX06650.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
Length = 173
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 70/117 (59%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F A R ++F G+ +GA L + +A+F GADLSD LMDR +L N
Sbjct: 57 QNLANTSFAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVATDLRN 116
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL + + S A IEGADF+DA++D ++ LC A+G NP TGVST SLGC
Sbjct: 117 AVLTGIIASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVSTFDSLGC 173
>gi|88808683|ref|ZP_01124193.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
gi|88787671|gb|EAR18828.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
Length = 176
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 98/190 (51%), Gaps = 29/190 (15%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA----ETRGEFGIGS-AAQFGSAD 117
A L N R ++TAL AA+V L D EA E RG+ + + D
Sbjct: 5 ALLCNLRRHLTTALLAALVVFTG----VLIDGPSVEAITAPELRGQRAVQDITSDMHGRD 60
Query: 118 LRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
L++ +K + R + AD+R S G+ GA LE VA+ + F GADL +
Sbjct: 61 LKEKEFLKADLREVDLGEADLRGAVINTSQLQGADLRGADLEDVVAFSSRFDGADLRN-- 118
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
AN TNA+L++ S A IEG DF++AVIDL+Q +ALC A+G N ++
Sbjct: 119 --------ANFTNAMLMQ-----SRFNDAEIEGTDFTNAVIDLSQLKALCGRASGVNSLS 165
Query: 233 GVSTRKSLGC 242
GVST++SLGC
Sbjct: 166 GVSTKESLGC 175
>gi|254415547|ref|ZP_05029307.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196177728|gb|EDX72732.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 165
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 68/112 (60%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+F AD+RES+FS ++ G A ANF GA+LS + +D+ LN ANL NAVL
Sbjct: 53 ASFNQADLRESNFSHAELQGVSFFGANLKLANFEGANLSYSTLDKARLNGANLKNAVLEG 112
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ GA IEGADF+DA +D ++ LC+ A GTNP TG TR +L C
Sbjct: 113 AYAFNAQFDGATIEGADFTDAFLDPKAEEKLCQMATGTNPTTGRQTRDTLFC 164
>gi|443321745|ref|ZP_21050787.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442788515|gb|ELR98206.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 149
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ A+F A +RES+FS + G A NF GA+L++ +D LN+ANL N
Sbjct: 32 QDLTDASFDLASLRESNFSHANLTGVRFFSANLESVNFEGANLTNATLDSARLNDANLKN 91
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L+ ++ + + G IEGADF+DA+I +++ LCK A GTNP+TG TR++L C
Sbjct: 92 AILIGAFVSNAKVQGVNIEGADFTDALILPYEQKLLCKVAQGTNPVTGRDTRETLFC 148
>gi|88809155|ref|ZP_01124664.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
gi|88787097|gb|EAR18255.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
Length = 180
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 66/111 (59%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+F A + +DFSG+ GA + ANF GADLSD LMDR +L +AVL+
Sbjct: 70 SFAGAAAKGADFSGANLQGAIFTQGAFADANFRGADLSDALMDRADFTGTDLRDAVLIGV 129
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + S A ++GADFSDA++D ++ LC+ A G NP TGV TR SL C
Sbjct: 130 IASGSSFARAQVDGADFSDALLDRDDQRKLCQEAEGLNPTTGVLTRDSLSC 180
>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 182
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 72/112 (64%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
++F A R++ F + +GA L +A +A+F GADLSD LMD++ ++ +LT AVL
Sbjct: 70 SSFAGATGRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRG 129
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + S+ GA + ADF+DA++D ++ LC+ A GTNP+TG TR SL C
Sbjct: 130 AIASGSNFTGATVTDADFTDALLDRVDQRNLCREARGTNPVTGADTRLSLDC 181
>gi|428211433|ref|YP_007084577.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999814|gb|AFY80657.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 166
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 76/114 (66%)
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
R A F++AD++ ++FS + GA ++ +ANF GADL++++++ L A+ T+AVL
Sbjct: 52 RTAEFSNADLQFTNFSNVQAEGAIFSLSMMKEANFHGADLTNSMLEWTNLTNADFTDAVL 111
Query: 189 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
V + +++ + GADF+DA++D AQ + LC+ A+G N TGV TR+SLGC
Sbjct: 112 VEALFLGANVKKMKVTGADFTDAILDGAQVKQLCENASGVNSKTGVDTRESLGC 165
>gi|157413511|ref|YP_001484377.1| hypothetical protein P9215_11761 [Prochlorococcus marinus str. MIT
9215]
gi|254526043|ref|ZP_05138095.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
gi|157388086|gb|ABV50791.1| conserved hpothetical protein [Prochlorococcus marinus str. MIT
9215]
gi|221537467|gb|EEE39920.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
Length = 172
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 73/130 (56%), Gaps = 10/130 (7%)
Query: 123 HVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+V+ N NF D+ R++DFS +G L + +N G DL+DTL
Sbjct: 42 YVRSNITGFNFHGEDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTL 101
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
DR+ + +L NAVL+ + + S GA IEGADFS A++D ++ LC+ A+G NP T
Sbjct: 102 SDRVNFQKTDLRNAVLINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCEIADGINPTT 161
Query: 233 GVSTRKSLGC 242
GVSTR SL C
Sbjct: 162 GVSTRDSLEC 171
>gi|87124337|ref|ZP_01080186.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
gi|86167909|gb|EAQ69167.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
Length = 178
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 67/116 (57%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ + F AD++ D SGS GA + + A+ GADLSD + + A+L NA
Sbjct: 62 DLKEKEFLKADLQGVDLSGSDLRGAVINTSSLQGADLQGADLSDVVAFASRFDGADLRNA 121
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
V +L +S G A I+GADF+DAVIDL Q +ALC A G N TGV TR SLGC
Sbjct: 122 VFTNAMLMQSRFGDAQIDGADFTDAVIDLPQLKALCARAAGENSRTGVLTRDSLGC 177
>gi|126657693|ref|ZP_01728847.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
gi|126620910|gb|EAZ91625.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
Length = 167
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
GS A + L +++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 32 GSTASYEDVKLIGEDFSEKSLTYAQFTNADLTDSNFSKADLRGAVFNGSALIGADLHGAD 91
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L A+LT+AVL ++ R+ A I GADFS AV+D+ + + LC A+G
Sbjct: 92 LTNGLAYLTSFKGADLTDAVLTEAIMMRTKFDDAKITGADFSLAVLDIYEVEKLCDRADG 151
Query: 228 TNPITGVSTRKSLGC 242
NP TG+STR+SLGC
Sbjct: 152 VNPKTGISTRESLGC 166
>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
Length = 180
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 72/117 (61%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ +F A R +DFSG+ +GA L +A +A+F GADLS LMD++ + A+ T
Sbjct: 64 QDLANTSFAGAAGRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTG 123
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A L + + S+ GA + ADF+ A+ID ++ LC+ A GT+P+TG TR SLGC
Sbjct: 124 ADLSDVIASGSNFSGATVTNADFTGALIDRVDQRLLCRDAEGTHPLTGADTRLSLGC 180
>gi|123968679|ref|YP_001009537.1| hypothetical protein A9601_11461 [Prochlorococcus marinus str.
AS9601]
gi|126696485|ref|YP_001091371.1| hypothetical protein P9301_11471 [Prochlorococcus marinus str. MIT
9301]
gi|123198789|gb|ABM70430.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
gi|126543528|gb|ABO17770.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 172
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRN 114
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL+ + + S GA IEGADFS A++D ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 115 AVLINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171
>gi|254409676|ref|ZP_05023457.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183673|gb|EDX78656.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 163
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 74/117 (63%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N + A F +A+++ S+F+ + GA +V AN GADLS ++D+ L A+L++
Sbjct: 46 QNLQTAEFANANLQLSNFAYADLRGAIFSGSVMTHANLHGADLSYGMLDQADLTGADLSD 105
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+LV T+L S +I GADF+DA++D AQ + LC+ A+G N TGV+T SLGC
Sbjct: 106 VILVETLLLGSVFDNTLITGADFTDALLDGAQLKHLCQQASGINSKTGVATSDSLGC 162
>gi|78779436|ref|YP_397548.1| hypothetical protein PMT9312_1053 [Prochlorococcus marinus str. MIT
9312]
gi|78712935|gb|ABB50112.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 172
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRN 114
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL+ + + S GA IEGADFS A++D ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 115 AVLINMIASGSSFAGAKIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171
>gi|352096257|ref|ZP_08957137.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351676951|gb|EHA60102.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 177
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F A R +DFS + G +A +ANF GA+LSD LMDR ++ +L +
Sbjct: 61 QNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDLRD 120
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L + S GA IEGADF+DA++D ++ LC+ A+G NP +GV+TR SL C
Sbjct: 121 ALLQGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNPSSGVATRDSLDC 177
>gi|78212400|ref|YP_381179.1| hypothetical protein Syncc9605_0856 [Synechococcus sp. CC9605]
gi|78196859|gb|ABB34624.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 181
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 70/117 (59%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F A R ++F G+ +GA L + +A+F GADLSD LMDR +L N
Sbjct: 65 QNLANTSFAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVGTDLRN 124
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL + + S A IEGADF+DA++D ++ LC A+G NP TGV+T SLGC
Sbjct: 125 AVLNGIIASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVATFDSLGC 181
>gi|72382551|ref|YP_291906.1| hypothetical protein PMN2A_0712 [Prochlorococcus marinus str.
NATL2A]
gi|72002401|gb|AAZ58203.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 184
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 74/131 (56%), Gaps = 6/131 (4%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 58 EDLHLSSIAGAMARDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRN 117
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 245
++LV + + S GA IEGADF+ A++D ++ LCK A+G NP TGVSTR SL C
Sbjct: 118 SILVNMIASGSSFAGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGD 177
Query: 246 RRNAYGSPSSP 256
+ PS P
Sbjct: 178 K------PSMP 182
>gi|172036187|ref|YP_001802688.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354552985|ref|ZP_08972292.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171697641|gb|ACB50622.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353554815|gb|EHC24204.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 179
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 76/135 (56%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
GS+A + L ++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 44 GSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGAD 103
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L A+LTNAVL ++ R+ A I GADFS AV+D+ + LC A+G
Sbjct: 104 LTNGLAYLTSFKGADLTNAVLTEAIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADG 163
Query: 228 TNPITGVSTRKSLGC 242
NP TGVSTR+SLGC
Sbjct: 164 VNPKTGVSTRESLGC 178
>gi|303287274|ref|XP_003062926.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455562|gb|EEH52865.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 182
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 74/142 (52%), Gaps = 14/142 (9%)
Query: 120 KAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
KA HV E+F ++ +T D+R SDFSGS A +A+ N G+D+ + +D
Sbjct: 17 KAEHVNEDFSHSDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAIMPGVNLEGSDMQNAFLD 76
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA---------LCKYA 225
+VL AN+ + RSDLG + ADF++AVID Q ++ LC A
Sbjct: 77 YVVLRGANMRGVIASGANFVRSDLGDVDVTNADFTEAVIDRYQARSISHWSPYDPLCDGA 136
Query: 226 NGTNPITGVSTRKSLGCGNSRR 247
+G N TGV TR SLGC +R
Sbjct: 137 SGVNEFTGVDTRDSLGCDRLKR 158
>gi|220907989|ref|YP_002483300.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864600|gb|ACL44939.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 171
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 69/117 (58%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N + F A + ++ SG+ GA AV AN G + SD + ++A+L N
Sbjct: 54 QNLEQVEFGDARLSGANLSGANLRGAVFNAAVLTGANLQGVNFSDGIGYLCDFSDADLEN 113
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL +L +S+ GA I GADFS A++D Q LC+YA+G NP TGVSTR+SLGC
Sbjct: 114 AVLDSAMLLKSEFKGAKINGADFSFALLDRPQVLQLCEYASGVNPTTGVSTRESLGC 170
>gi|124026254|ref|YP_001015370.1| hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
NATL1A]
gi|123961322|gb|ABM76105.1| Hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
NATL1A]
Length = 184
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 74/131 (56%), Gaps = 6/131 (4%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 58 EDLHLSSIAGAMARDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRN 117
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 245
++LV + + S GA IEGADF+ A++D ++ LCK A+G NP TGVSTR SL C
Sbjct: 118 SILVNMIASGSSFAGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGD 177
Query: 246 RRNAYGSPSSP 256
+ PS P
Sbjct: 178 K------PSIP 182
>gi|352094392|ref|ZP_08955563.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351680732|gb|EHA63864.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 172
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 81/149 (54%), Gaps = 21/149 (14%)
Query: 100 ETRGEFGIGSAA-QFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYL 153
E RG+F + + DL++ +K + R + + D+R S G+ +GA L
Sbjct: 38 ELRGQFAVQDISNDMHGRDLKEKEFLKADLRGVDLSDTDLRGAVINTSQLQGADLHGANL 97
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
E VA+ + F DLSD AN TNA+L+++ A IEG DF++AVI
Sbjct: 98 EDVVAFSSRFDETDLSD----------ANFTNAMLMQSRFV-----DARIEGTDFTNAVI 142
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
DL Q +ALC A+G N ++GVSTR+SLGC
Sbjct: 143 DLTQMKALCGRASGVNSVSGVSTRESLGC 171
>gi|113953693|ref|YP_729958.1| hypothetical protein sync_0742 [Synechococcus sp. CC9311]
gi|113881044|gb|ABI46002.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 190
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 71/117 (60%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N +F A R +DFS + G +A +ANF GA+LSD LMDR ++ +L +
Sbjct: 74 QNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDLRD 133
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+LV + S GA IEGADF+DA++D ++ LC+ A+G N +GVSTR SL C
Sbjct: 134 ALLVGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNSSSGVSTRDSLDC 190
>gi|116070665|ref|ZP_01467934.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
gi|116066070|gb|EAU71827.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
Length = 169
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 77/131 (58%), Gaps = 20/131 (15%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDT 171
DL++ +K N R N + AD+R + + ++ GA L A V + + F GADL
Sbjct: 52 DLKEKEFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANLSDVVGFASRFDGADL--- 108
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
A LTNA+L+++ T A IEGADF+DAVIDL Q++ALC A+G NP
Sbjct: 109 -------RGAVLTNAMLMQSRFT-----DAQIEGADFTDAVIDLPQQRALCSSADGVNPQ 156
Query: 232 TGVSTRKSLGC 242
+GVSTR+SLGC
Sbjct: 157 SGVSTRESLGC 167
>gi|123966365|ref|YP_001011446.1| hypothetical protein P9515_11321 [Prochlorococcus marinus str. MIT
9515]
gi|123200731|gb|ABM72339.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 172
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 69/117 (58%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRN 114
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++L+ + + S GA IEGADFS A++D ++ LCK A G NP TGVSTR SL C
Sbjct: 115 SILINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171
>gi|254430802|ref|ZP_05044505.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
gi|197625255|gb|EDY37814.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
Length = 173
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 74/126 (58%), Gaps = 1/126 (0%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ +H + N R+ F A + DFS + GA + +A+ + A+L D +
Sbjct: 48 DLQPDMHGR-NLRQQEFLKASLEGFDFSEADLRGAVFNGSSLREADLSAANLEDVVAYAT 106
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+++NL A+L +L +S G+ I GADFSDAV+DL +++ALC A G NP TGVST
Sbjct: 107 RFDDSNLEGAILRNAMLMQSRFKGSSITGADFSDAVLDLPEQKALCARATGVNPSTGVST 166
Query: 237 RKSLGC 242
R+SL C
Sbjct: 167 RESLAC 172
>gi|78184792|ref|YP_377227.1| hypothetical protein Syncc9902_1219 [Synechococcus sp. CC9902]
gi|78169086|gb|ABB26183.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 169
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 77/131 (58%), Gaps = 20/131 (15%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDT 171
DL++ +K N R N + AD+R + + ++ GA L A V + + F GADL
Sbjct: 52 DLKEKEFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANLSDVVGFASRFDGADL--- 108
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
A LTNA+L+++ T A IEGADF+DAVIDL Q++ALC A+G NP
Sbjct: 109 -------RGAVLTNAMLMQSRFT-----DAQIEGADFTDAVIDLPQQRALCSSADGVNPQ 156
Query: 232 TGVSTRKSLGC 242
+GVSTR+SLGC
Sbjct: 157 SGVSTRESLGC 167
>gi|33865660|ref|NP_897219.1| hypothetical protein SYNW1126 [Synechococcus sp. WH 8102]
gi|33632830|emb|CAE07641.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 190
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 5/164 (3%)
Query: 79 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 138
++VA+ +S L N +A T E A Q +AD+ + +KE F AD+
Sbjct: 30 SLVAAILVVVSTLLWTNSAQAITAPELRGQRAVQEITADM-HGLDLKEK----EFLKADL 84
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
RE + S + GA + + A+ GADLS+ + + A+L A +L +S
Sbjct: 85 REVNLSDTDLRGAVINTSQLQGADLRGADLSNVVGFASRFDGADLRGATFTNAMLMQSRF 144
Query: 199 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A IEGADF+DAV+DL Q++ LC A G +P++GVSTR+SLGC
Sbjct: 145 ADARIEGADFTDAVLDLPQQKLLCATAAGEHPVSGVSTRESLGC 188
>gi|113953830|ref|YP_730899.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
gi|113881181|gb|ABI46139.1| Secreted pentapeptide repeats protein [Synechococcus sp. CC9311]
Length = 172
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 82/149 (55%), Gaps = 21/149 (14%)
Query: 100 ETRGEFGIGSAAQ-FGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYL 153
E RG+F + ++ DL++ +K + R + + D+R S G+ +GA L
Sbjct: 38 ELRGQFALQDISEDMHGRDLKEKEFLKADLRGIDLSDTDLRGAVINTSQLQGADLHGANL 97
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
E VA+ + F DLSD AN TNA+L+++ A IEG DF++AVI
Sbjct: 98 EDVVAFSSRFDETDLSD----------ANFTNAMLMQSRFV-----DARIEGTDFTNAVI 142
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
DL Q +ALC A+G N ++GVSTR+SLGC
Sbjct: 143 DLTQLKALCGRASGVNSVSGVSTRESLGC 171
>gi|33861598|ref|NP_893159.1| hypothetical protein PMM1042 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33634175|emb|CAE19501.1| conserved hpothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 172
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 69/117 (58%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSEVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRN 114
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++L+ + + S GA IEGADFS A++D ++ LCK A G NP TGVSTR SL C
Sbjct: 115 SILINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171
>gi|414079727|ref|YP_007001151.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413973006|gb|AFW97094.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 167
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 16/179 (8%)
Query: 64 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 123
K +NW +S L A + + ++ A +Y E I +A F DL +
Sbjct: 4 KHRNWISILSLLLWAIISTTALASFVPTAVALEYNKE------ILISADFSGRDLTDSSF 57
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
K N R +NF+ +++R G F A LE A N GADL++T +D L +A+L
Sbjct: 58 TKANLRYSNFSHSNLR-----GVSFFAANLESA-----NLQGADLTNTTLDSARLIKADL 107
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
TNA+L + GAII+GADF+D ++ +++ LCK A GTNP+T TR +L C
Sbjct: 108 TNAILEGAFAANARFDGAIIDGADFTDVLLRQDEQKKLCKLAKGTNPVTKRDTRDTLYC 166
>gi|33240611|ref|NP_875553.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33238139|gb|AAQ00206.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 183
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 75/131 (57%), Gaps = 2/131 (1%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ +++ A R+S+ S +G + A +N G +L+DTL DR+ + +L N
Sbjct: 53 QDLSKSSIAGATARDSNLSDVDLHGTVVTLADLKGSNLNGINLTDTLSDRVNFQKTDLRN 112
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 245
AVLV + + S GA IEGADFS AV+D ++ LC+ A GTNP TG+STR+SL C S
Sbjct: 113 AVLVNMIASGSSFAGAQIEGADFSYAVLDSDDQRNLCEIAEGTNPQTGISTRESLEC--S 170
Query: 246 RRNAYGSPSSP 256
R P P
Sbjct: 171 ERGVGYKPPMP 181
>gi|145356305|ref|XP_001422373.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582615|gb|ABP00690.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 123
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 68/118 (57%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+E+ R A + AD+R SD S GA +AV + AD SD + D +L ++ T
Sbjct: 6 REDLRGAIYAEADLRRSDLRESDARGAVFSRAVMPGVDARDADFSDAMFDYALLRGSDFT 65
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
N+V V R+DLG + ADF++AVID Q +LC+ A+GTNP TG +TR SL C
Sbjct: 66 NSVFVGANFVRADLGEVVATNADFTEAVIDRYQTLSLCERASGTNPYTGANTRDSLLC 123
>gi|427701840|ref|YP_007045062.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345008|gb|AFY27721.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 184
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 75/126 (59%), Gaps = 5/126 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR ++ F +A+ D+R++D G+ FN L +A + GADL D +
Sbjct: 63 DLRGRNLQQQEFLKASMEGFDLRDADLRGAVFNSTDLRQA-----DLRGADLEDVVAFAT 117
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+ A+L A +L +S A I+GADFSDAV+DL +++ALC A+G++P+TGV T
Sbjct: 118 RFDGADLRGAQFRNAMLMQSRFRDARIDGADFSDAVLDLPEQKALCARASGSHPLTGVDT 177
Query: 237 RKSLGC 242
R+SLGC
Sbjct: 178 RESLGC 183
>gi|148239470|ref|YP_001224857.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147848009|emb|CAK23560.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
Length = 176
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 81/149 (54%), Gaps = 21/149 (14%)
Query: 100 ETRGEFGIGS-AAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYL 153
E RG+ + ++ DL++ +K + R + AD+R S G+ GA L
Sbjct: 42 ELRGQRAVQDISSNMHGRDLKEKEFLKADLREVDLGDADLRGAVINTSQLQGADLRGADL 101
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
E VA+ + F GADL D AN TNA+L++ S A IEG DF++AVI
Sbjct: 102 EDVVAFSSRFDGADLRD----------ANFTNAMLMQ-----SRFNDAQIEGTDFTNAVI 146
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
DL Q +ALC A+G N ++GVST++SLGC
Sbjct: 147 DLPQLKALCGRASGVNSLSGVSTKESLGC 175
>gi|33240300|ref|NP_875242.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33237827|gb|AAP99894.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 170
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 76/134 (56%), Gaps = 20/134 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S +F DLR NFR +N T A S +G+ +GA L+ A+AY ++F ADL
Sbjct: 57 SGYEFVKFDLRGI-----NFRDSNLTGAVFNNSKLNGADLHGANLKDALAYASDFEDADL 111
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+D+ NL+NA+L+ S AIIEGADF+DAV+ Q++ LC A+GT
Sbjct: 112 TDS----------NLSNALLME-----SSFNNAIIEGADFTDAVLSRIQQKQLCSIADGT 156
Query: 229 NPITGVSTRKSLGC 242
N TG+ST SLGC
Sbjct: 157 NSSTGISTSYSLGC 170
>gi|428202122|ref|YP_007080711.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427979554|gb|AFY77154.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 168
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 74/135 (54%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+AA F +L +N + A FT+ D+ ++FS + GA + + N GAD
Sbjct: 33 GAAASFEDKNLSGQDFSGQNLQTAQFTNVDLTSANFSNTDLRGAVFNGSALKETNLHGAD 92
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L N A+L++AVL ++ R+ GA I GADF+ AV+D Q LC A+G
Sbjct: 93 LTNGLAYLSSFNGADLSDAVLTEAIMLRTTFDGANITGADFTLAVLDGDQVAKLCTIASG 152
Query: 228 TNPITGVSTRKSLGC 242
N TGV TR SLGC
Sbjct: 153 VNSKTGVETRASLGC 167
>gi|123968372|ref|YP_001009230.1| hypothetical protein A9601_08391 [Prochlorococcus marinus str.
AS9601]
gi|123198482|gb|ABM70123.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 170
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 65/116 (56%), Gaps = 15/116 (12%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +N A S SKFNGA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSESNLEGAVFNNSKLQNSKFNGANLRDALAYATDFTDADLSDV----------NFTNA 119
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|170078800|ref|YP_001735438.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169886469|gb|ACB00183.1| secreted pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 165
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 69/128 (53%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
S DL + +N + F+ ++ +D SGS GA + N GAD ++ +
Sbjct: 38 SEDLAGSNFAGQNLQGVEFSQVNLTNADLSGSDLRGAVFNSTLLETTNLHGADFTNGIAY 97
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
A+LT+A+ V +L RS A I+GADFS AV+D Q++ LC A G NP+TG+
Sbjct: 98 LSKFTGADLTDAIFVEAILLRSTFENAKIDGADFSFAVLDGPQQKKLCAVATGVNPVTGI 157
Query: 235 STRKSLGC 242
+T SLGC
Sbjct: 158 ATADSLGC 165
>gi|428773304|ref|YP_007165092.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428687583|gb|AFZ47443.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 164
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 75/130 (57%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F + DL AV + RR + MR SD + + + L A + NF+GA L
Sbjct: 43 FSNQDLVGAVFAASSMRRVS-----MRNSDLTNAMMTESVLLDADLHGVNFSGA-----L 92
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+DR+ + ++L++A+L+ + TR+ I GADF+DAVID Q +C+ A+G NP+T
Sbjct: 93 IDRVTFDFSDLSDAILIGAIATRTRFYDTDITGADFTDAVIDRYQVSLMCERADGVNPVT 152
Query: 233 GVSTRKSLGC 242
GV+TR SLGC
Sbjct: 153 GVATRDSLGC 162
>gi|302768839|ref|XP_002967839.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
gi|300164577|gb|EFJ31186.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
Length = 126
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 71/130 (54%), Gaps = 10/130 (7%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-----L 172
+R A ++ R A F + D R+ + GS L+ + A F G DL DT L
Sbjct: 1 MRGADLSGQDLRGAVFAACDCRKINLRGSN-----LDSSTDTFAGFEGGDLQDTSWVQAL 55
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
DR+V NL NA+ +LT S GA I GADF++A++D Q+ LCK A GTN IT
Sbjct: 56 ADRVVFRMTNLQNAIFTNAILTGSQFDGADITGADFTEAILDNYQRLKLCKRATGTNSIT 115
Query: 233 GVSTRKSLGC 242
GV TR+SL C
Sbjct: 116 GVETRESLAC 125
>gi|220907029|ref|YP_002482340.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863640|gb|ACL43979.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 174
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 75/137 (54%), Gaps = 4/137 (2%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G AA + L ++ R + FT A++ SDFS S G A AN +GAD
Sbjct: 38 GWAADYTKESLVGVDFSGKDLRDSEFTQANLSRSDFSQSDLRGVSFFAANLESANLSGAD 97
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYA 225
L T +D L ANLTNA+L + GA I GADF+D +DL Q + LC+ A
Sbjct: 98 LRLTTLDNARLTHANLTNAILEGAFAFNARFQGATITGADFTD--VDLRQDAQTILCQGA 155
Query: 226 NGTNPITGVSTRKSLGC 242
+GTNP+TG +TR++LGC
Sbjct: 156 SGTNPVTGRNTRETLGC 172
>gi|440684721|ref|YP_007159516.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428681840|gb|AFZ60606.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 167
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 74/132 (56%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F DL + K N R++NF+ A++R G F A LE A N GADL++
Sbjct: 45 ADFSGRDLTDSSFTKANLRQSNFSHANLR-----GVSFFAANLESA-----NLEGADLTN 94
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D L ANLTN +L + GAII+GADF+DA++ +++ LCK A G NP
Sbjct: 95 ATLDSARLIRANLTNTILEGAFAASARFDGAIIDGADFTDALLRGDEQKKLCKVAKGNNP 154
Query: 231 ITGVSTRKSLGC 242
+TG TR++L C
Sbjct: 155 VTGRDTRETLFC 166
>gi|124023397|ref|YP_001017704.1| hypothetical protein P9303_16951 [Prochlorococcus marinus str. MIT
9303]
gi|123963683|gb|ABM78439.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 198
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 79/149 (53%), Gaps = 25/149 (16%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYL 153
EF G A + S D+ ++NF +A+ D+ E+D G+ FN GA L
Sbjct: 63 EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
E VA+ + F GADL AN TNA+L++ S A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
D Q+ LC ANGTN ++G +T SLGC
Sbjct: 168 DRRQQNELCSRANGTNAVSGSNTIDSLGC 196
>gi|159903694|ref|YP_001551038.1| hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
9211]
gi|159888870|gb|ABX09084.1| Hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
9211]
Length = 183
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 70/117 (59%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ +++ A R+++ S +G + A +N G DL+DTL DR+ + +L N
Sbjct: 53 QDLSKSSIAGATARDANLSDVDLHGTVVTLADLKGSNLNGIDLTDTLSDRVNFQKTDLRN 112
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVLV + + S GA+I GADFSD+V+D ++ LC+ A G NP TG++TR SL C
Sbjct: 113 AVLVNMIASGSSFAGALIAGADFSDSVLDRDDQRNLCEIAEGVNPKTGIATRDSLEC 169
>gi|434386546|ref|YP_007097157.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428017536|gb|AFY93630.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 212
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 72/117 (61%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N + A FT+A + +++F+G+ G + + + N GA+L+ L+D++ A+L++
Sbjct: 95 KNLQTAVFTTAKLDDTNFAGADLTGVVISSSTLNRTNLHGANLTQGLLDQVRFVGADLSD 154
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV V ++ RS I GADF+DA++ Q++ LC+ A G N TGV+TR SLGC
Sbjct: 155 AVFVEAMMLRSTFTDVNIAGADFTDAILGKLQQKELCQIATGVNSKTGVATRDSLGC 211
>gi|119389531|pdb|2G0Y|A Chain A, Crystal Structure Of A Lumenal Pentapeptide Repeat Protein
From Cyanothece Sp 51142 At 2.3 Angstrom Resolution.
Tetragonal Crystal Form
Length = 184
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 75/135 (55%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
GS+A + L ++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 49 GSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGAD 108
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L A+LTNAVL ++ R+ A I GADFS AV+D+ + LC A+G
Sbjct: 109 LTNGLAYLTSFKGADLTNAVLTEAIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADG 168
Query: 228 TNPITGVSTRKSLGC 242
NP TGVSTR+SL C
Sbjct: 169 VNPKTGVSTRESLRC 183
>gi|428298761|ref|YP_007137067.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235305|gb|AFZ01095.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 169
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 93/181 (51%), Gaps = 18/181 (9%)
Query: 64 KLKN--WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
KL N WR+ +S L + + ++ +A +Y E I + F DL +
Sbjct: 4 KLSNNFWRIVLSALLGTVIWMISTWGLTPIAFALEYNKE------ILIQSDFSGRDLSDS 57
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
K N +++NF++ ++R G F A LE + TGADLS++ +D L +A
Sbjct: 58 SFTKANLKQSNFSNTNLR-----GVSFFAANLESV-----DLTGADLSNSTLDSARLVKA 107
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
NLTNA+L + GAII+GADF+D ++ ++ LCK A GTNP T +TR +L
Sbjct: 108 NLTNAILEGAFAISAKFEGAIIDGADFTDILLRDDEQARLCKIATGTNPTTKRNTRDTLM 167
Query: 242 C 242
C
Sbjct: 168 C 168
>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 164
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 88/178 (49%), Gaps = 15/178 (8%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
+K+WRVF LA V+ + L+ + +R Q +AD +
Sbjct: 1 MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+E F + A+ +D G+ FN AYLEKA N GAD ++ + + +A+L+
Sbjct: 51 REEFTKVKLDKANFSNADLRGAVFNNAYLEKA-----NLHGADFTNGIAYLVDFRDADLS 105
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+A+ T+L S I G DF++AV+D + + LC ANG N TGVSTR+SL C
Sbjct: 106 DAIFTDTMLLYSTFDNVEITGTDFTNAVLDGPELKKLCARANGVNSKTGVSTRESLEC 163
>gi|443314247|ref|ZP_21043822.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786146|gb|ELR95911.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 166
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 69/136 (50%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
+ A F +LR ++ R ++T ADM E + +G+ G L KAN GA
Sbjct: 30 VAQAESFDRQNLRMRDFSGQDLRGNDYTRADMAEVNLTGANLQGVRLFDTNLTKANLEGA 89
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
DL +D ANLTNA+L + +D AII+GADF+D +D LC A
Sbjct: 90 DLRGATLDGARFLAANLTNAILAGSYAFNTDFRKAIIDGADFTDVFLDPKTNDLLCAVAQ 149
Query: 227 GTNPITGVSTRKSLGC 242
GTNP+TG TR +L C
Sbjct: 150 GTNPVTGRDTRDTLYC 165
>gi|17232102|ref|NP_488650.1| hypothetical protein alr4610 [Nostoc sp. PCC 7120]
gi|17133747|dbj|BAB76309.1| alr4610 [Nostoc sp. PCC 7120]
Length = 164
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 101/190 (53%), Gaps = 39/190 (20%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K+WRV VS LA + A+ SS+I+ A + G+ IGS +F + D
Sbjct: 1 MKDWRVVVSFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTL 172
L EN ANF++AD+R F+G+ G L + +AY ANF ADLSD +
Sbjct: 59 L-------EN---ANFSNADLRGGVFNGTVLEGVNLHGVDFSEGIAYLANFKNADLSDAI 108
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
LTNA+++R++ + + GADF++AV+D+ Q + LC ANG N T
Sbjct: 109 ----------LTNAMMLRSIFDNVN-----VTGADFTNAVLDITQVKKLCLKANGVNSKT 153
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 154 GVDTRESLGC 163
>gi|67922694|ref|ZP_00516198.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|416392485|ref|ZP_11685875.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
gi|67855476|gb|EAM50731.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|357263639|gb|EHJ12621.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
Length = 170
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 74/135 (54%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G++A + L ++ A FT+AD+ +S+FS + GA + + AD
Sbjct: 35 GASASYEDVQLIGEDFSGKSLTYAQFTNADLTDSNFSDADLRGAVFNGSALIGTDLHQAD 94
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L A+LTNAVL ++ R+ A I GADFS AV+DL Q LCK A+G
Sbjct: 95 LTNGLAYLTSFEGADLTNAVLTEAIMMRTTFKNANITGADFSLAVLDLQQVAELCKRADG 154
Query: 228 TNPITGVSTRKSLGC 242
N TG+STR+SLGC
Sbjct: 155 VNSKTGISTRESLGC 169
>gi|119389418|pdb|2F3L|A Chain A, Crystal Structure Of A Lumenal Rfr-Domain Protein
(Contig83.1_1_243_746) From Cyanothece Sp. 51142 At 2.1
Angstrom Resolution
Length = 184
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 74/135 (54%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
GS+A + L ++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 49 GSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGAD 108
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L A+LTNAVL + R+ A I GADFS AV+D+ + LC A+G
Sbjct: 109 LTNGLAYLTSFKGADLTNAVLTEAIXXRTKFDDAKITGADFSLAVLDVYEVDKLCDRADG 168
Query: 228 TNPITGVSTRKSLGC 242
NP TGVSTR+SL C
Sbjct: 169 VNPKTGVSTRESLRC 183
>gi|428203139|ref|YP_007081728.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980571|gb|AFY78171.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 177
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F A++R S+FS +K G A ANF GADL+ ++ L AN TNA+LV
Sbjct: 67 FDHANLRGSNFSNAKLQGVRFFAANLESANFEGADLTGADLESARLVRANFTNAILVGAF 126
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
T + GAII+GADF+D ++ ++ LC+ A GTNP+TG +TR +L C
Sbjct: 127 ATNTLFNGAIIDGADFTDVLLRPDTEKKLCEIARGTNPVTGRNTRDTLNC 176
>gi|427723591|ref|YP_007070868.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427355311|gb|AFY38034.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 165
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 67/128 (52%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
S D +NF+ A FT + R +D S + GA + N GAD+S+ +
Sbjct: 38 SEDFANENFAGQNFQGAEFTQVNFRNADMSNTDLRGAVFNSSQLQNTNLHGADMSNGIAY 97
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
A+L+ A+ +L RS A I+GADFS AV+D +Q++ LC A G NP+TG+
Sbjct: 98 LSAFTGADLSGAIFEEAILLRSTFDDANIDGADFSFAVLDGSQQKKLCAAATGVNPVTGI 157
Query: 235 STRKSLGC 242
T SLGC
Sbjct: 158 ETADSLGC 165
>gi|186686067|ref|YP_001869263.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186468519|gb|ACC84320.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 191
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
++FT A++R+S+FS + NG A AN G+DL + +D L ANLTNA+L
Sbjct: 79 SSFTKANLRQSNFSRANLNGVSFFAANLESANLEGSDLRNATLDSARLVRANLTNALLEG 138
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ GAII+GADF+D ++ +++ LCK A GTNP TG TR +L C
Sbjct: 139 AFAANARFDGAIIDGADFTDTLLRPDEQKKLCKLAKGTNPTTGRDTRDTLFC 190
>gi|434400099|ref|YP_007134103.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271196|gb|AFZ37137.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 167
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 72/118 (61%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
++ R A F A +R SDFS S +G L + + NFTGA+LS+ ++ L AN T
Sbjct: 49 HQDLRDAIFDHASLRGSDFSYSDLSGVRLFGSNLSRVNFTGANLSNADLESCRLTRANFT 108
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA+L +T + L AIIEGADF++ ++ ++ LC+ A+GTNP TG +T+ +L C
Sbjct: 109 NAILTGAFMTNTLLDEAIIEGADFTNVLLSPTTEKMLCENASGTNPTTGRNTKDTLFC 166
>gi|126696175|ref|YP_001091061.1| hypothetical protein P9301_08371 [Prochlorococcus marinus str. MIT
9301]
gi|91070292|gb|ABE11210.1| conserved hypothetical protein [uncultured Prochlorococcus marinus
clone HF10-88D1]
gi|126543218|gb|ABO17460.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 170
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 64/116 (55%), Gaps = 15/116 (12%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +N A S SKF GA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNA 119
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|427706684|ref|YP_007049061.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359189|gb|AFY41911.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 169
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 74/132 (56%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F DL + K N R++NF+++++R G F A LE A N G DL++
Sbjct: 47 ADFSGRDLTDSSFTKANLRQSNFSNSNLR-----GVSFFAANLESA-----NLQGTDLTN 96
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D L +A+LTNAVL + GAII+GADF+D ++ +++ LCK A GTNP
Sbjct: 97 ATLDSARLMKADLTNAVLEGAFAANAKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTNP 156
Query: 231 ITGVSTRKSLGC 242
TG TR +L C
Sbjct: 157 TTGRDTRDTLFC 168
>gi|33865584|ref|NP_897143.1| hypothetical protein SYNW1050 [Synechococcus sp. WH 8102]
gi|33632753|emb|CAE07565.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 162
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R A F +++RE++ SGS GA L A A+ +G DL + +D V+ NL+N
Sbjct: 45 KDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTNLSN 104
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 243
AVL + I GADF+D + Q ++LC A+GTNP+TG STR SLGCG
Sbjct: 105 AVLEGAFAFNTRFVDVTISGADFTDVPMRGDQLKSLCAVADGTNPVTGRSTRDSLGCG 162
>gi|157413206|ref|YP_001484072.1| hypothetical protein P9215_08711 [Prochlorococcus marinus str. MIT
9215]
gi|254525828|ref|ZP_05137880.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
MIT 9202]
gi|157387781|gb|ABV50486.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
gi|221537252|gb|EEE39705.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
MIT 9202]
Length = 170
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 64/116 (55%), Gaps = 15/116 (12%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +N A S SKF GA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNA 119
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|75909862|ref|YP_324158.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75703587|gb|ABA23263.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 194
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 74/133 (55%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A ++ L +A + ++FT A++R+S+FS S G A AN G +L+
Sbjct: 61 ALEYNKEILVEADFSGRDLTDSSFTKANLRQSNFSKSNLTGVSFFAANLESANLEGTNLT 120
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+ +D L +ANLTNAVL + GAII+GADF+D ++ +++ LCK A GTN
Sbjct: 121 NATLDSARLIKANLTNAVLEGAFAASTKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTN 180
Query: 230 PITGVSTRKSLGC 242
P TG TR +L C
Sbjct: 181 PTTGRETRDTLFC 193
>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 171
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 72/118 (61%), Gaps = 10/118 (8%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N R +NFT+AD+R G F A +E+A ANFTGA L + RM+ +ANLT
Sbjct: 62 KANLRNSNFTNADLR-----GVSFFAANMEEANLEGANFTGATLD---LARMM--KANLT 111
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA+L + L GA+I+GADF+D ++ + LCK A GTNP+TG TR++L C
Sbjct: 112 NAILEGAFAYNTRLEGAVIDGADFTDTLLRDDMIEKLCKVAKGTNPVTGRDTRETLFC 169
>gi|166362955|ref|YP_001655228.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
gi|166085328|dbj|BAG00036.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
Length = 186
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 50 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 109
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 110 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 169
Query: 228 TNPITGVSTRKSLGC 242
N TG+ST +SLGC
Sbjct: 170 VNSKTGISTPESLGC 184
>gi|427731475|ref|YP_007077712.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427367394|gb|AFY50115.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 185
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 9/149 (6%)
Query: 103 GEFGIGSAAQFG----SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYL 153
G GI + A F + D K + ++ +F ++FT A++R+S+FS S G
Sbjct: 36 GILGITTIAGFAPTALALDYNKEILIEADFSGRDLTDSSFTKANLRQSNFSNSNLQGVSF 95
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A AN G +LS+ +D L +A+LTNAVL + GAII+GADF+D ++
Sbjct: 96 FAANLESANLQGVNLSNATLDSARLIKADLTNAVLEGAFAANAKFDGAIIDGADFTDVLL 155
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
+++ LCK A GTNP TG T +L C
Sbjct: 156 RPDEQKKLCKVAKGTNPTTGRDTHDTLYC 184
>gi|119490210|ref|ZP_01622723.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
gi|119454096|gb|EAW35249.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
Length = 177
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 64/112 (57%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
+ F A++R+S+FS S G L A + NF ADLS +D LN ANLTNA+L
Sbjct: 65 SEFDFANLRDSNFSHSNLRGVSLFGAKLQRTNFEAADLSYATLDTARLNRANLTNAILEG 124
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+D A+I GADF+D ++ ++ LC A GTNP+TG TR +L C
Sbjct: 125 AFAYNTDFSDAMIAGADFTDVLLRRDMQEKLCALAEGTNPVTGRDTRDTLYC 176
>gi|390438199|ref|ZP_10226689.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|425441109|ref|ZP_18821396.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|425454770|ref|ZP_18834496.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|425466166|ref|ZP_18845469.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|425468563|ref|ZP_18847571.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389718271|emb|CCH97753.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389804467|emb|CCI16499.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|389831470|emb|CCI25816.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389838386|emb|CCI30813.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|389884775|emb|CCI34954.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 228 TNPITGVSTRKSLGC 242
N TGVST +SLGC
Sbjct: 153 VNSKTGVSTPESLGC 167
>gi|194476536|ref|YP_002048715.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
gi|171191543|gb|ACB42505.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
Length = 167
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 74/130 (56%), Gaps = 20/130 (15%)
Query: 118 LRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
L++ +K + ++ +F+ +D+R SD + N A L+ VA+ + F GADL T
Sbjct: 53 LQQQEFLKADLQKIDFSESDLRGTVFNNSDLRNANLNAADLQDVVAFASRFDGADLRQT- 111
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
NL N +L++ S +IEGADF+DA++DL Q++ LC +ANGTN T
Sbjct: 112 ---------NLRNGMLIQ-----SKFKDTLIEGADFTDAILDLKQQKILCSFANGTNLKT 157
Query: 233 GVSTRKSLGC 242
GV T++SL C
Sbjct: 158 GVDTKESLRC 167
>gi|425445790|ref|ZP_18825810.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389734131|emb|CCI02174.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 169
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAEQIKNLCERAEG 152
Query: 228 TNPITGVSTRKSLGC 242
N TGVST +SLGC
Sbjct: 153 VNSKTGVSTPESLGC 167
>gi|119511413|ref|ZP_01630525.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
gi|119463958|gb|EAW44883.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
Length = 126
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 73/117 (62%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + A F++A+M ++F+ + GA + +V +AN GADL++ ++D++ A+L++
Sbjct: 9 QSLQAAEFSNANMELANFADADLRGAVMSASVMTQANLHGADLTNAMVDQVKFAGADLSD 68
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV +L RS I+ ADF+DA++D Q + LC A+G N TGV TR SLGC
Sbjct: 69 AVFKEALLLRSTFTDVNIDSADFTDAILDGVQIKELCSKASGVNSKTGVETRYSLGC 125
>gi|443666115|ref|ZP_21133744.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159030126|emb|CAO91018.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331286|gb|ELS45952.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 169
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 228 TNPITGVSTRKSLGC 242
N TGVST +SLGC
Sbjct: 153 VNSKTGVSTPESLGC 167
>gi|428300991|ref|YP_007139297.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428237535|gb|AFZ03325.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 166
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 92/185 (49%), Gaps = 28/185 (15%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS--AAQFGSADLRKAV 122
+K W+ V L + AS + +A + G GS F L
Sbjct: 1 MKFWQFLVGLVLTFVIFASSTPAYAA------SSSAVTGSIVAGSLKGKDFSGQSLIAEE 54
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMV 177
N +ANF++AD+R + F+GS + A L+ + +AY ++F GA+LSD
Sbjct: 55 FTSVNLEKANFSAADLRGAVFNGSMLHDANLQGIDFSEGIAYLSDFKGANLSD------- 107
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
A TNA+++R+ + D + GADF++AV+D + Q LC A+G NP TGV TR
Sbjct: 108 ---AVFTNAMMLRSAFSDVD-----VTGADFTNAVLDRTEVQKLCVNASGVNPKTGVETR 159
Query: 238 KSLGC 242
+SLGC
Sbjct: 160 QSLGC 164
>gi|168067322|ref|XP_001785569.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662809|gb|EDQ49618.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 545
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 80/173 (46%), Gaps = 13/173 (7%)
Query: 83 SCSSNISALADLNKYEAETRGEF---GIGSAA----------QFGSADLRKAVHVKENFR 129
S NI LA K G GIG+++ F ADLR +N R
Sbjct: 373 SLGKNIENLAWWEKVNTAAVGVILGVGIGASSLALPAFADFLSFDHADLRGRDMSNQNLR 432
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
F + D R+ + GS +G+ A N + DR+V ANL NA
Sbjct: 433 GVVFAACDCRKINLEGSTMDGSTDTFAGFEGGNLKNSSWIRAFADRVVFRGANLENANFT 492
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
VL+ S GA I GADF+DA++D Q+ +C+ A G NP TGV+TR+SL C
Sbjct: 493 DAVLSGSQFDGADITGADFTDALVDNYQRLQMCRRAKGVNPTTGVATRESLFC 545
>gi|116074723|ref|ZP_01471984.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
gi|116067945|gb|EAU73698.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
Length = 173
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/189 (34%), Positives = 92/189 (48%), Gaps = 28/189 (14%)
Query: 64 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 123
+L N R S LA V C AL + +A T E A Q SAD+
Sbjct: 2 RLLNPRALCSGLLATLV---CCVISVALLPSSPAQAITAPELRGQKAVQDISADMHGRDL 58
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLM 173
++ F +A+ D+ E+D G+ N + L+ VA+ + F GADL D
Sbjct: 59 KEKEFLKADLQGVDLSEADLRGAVINTSLLQGSDLRSADLGDVVAFASRFDGADLRD--- 115
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 233
A NA+L+++ T ++ IEGADF++AVIDL Q +A+C A G N TG
Sbjct: 116 -------ARFVNAMLMQSRFTEAN-----IEGADFTNAVIDLPQLKAMCARAEGVNSATG 163
Query: 234 VSTRKSLGC 242
+STR+SLGC
Sbjct: 164 ISTRESLGC 172
>gi|428781463|ref|YP_007173249.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428695742|gb|AFZ51892.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 165
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTGAD 167
F L +A E RA+F +A++ + F+G+ + G +AY +FTG D
Sbjct: 45 FSGESLIEAEFYDEELERADFHNANLEAAVFNGANLTNANWQGVNFTNGIAYLTDFTGVD 104
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
TNA+L ++ RS A +EG DF++AV+D Q + LC+ A+G
Sbjct: 105 F---------------TNAILTEAMMLRSTFNDATVEGVDFTNAVVDRLQVKRLCERASG 149
Query: 228 TNPITGVSTRKSLGC 242
NP TGVSTR+SLGC
Sbjct: 150 VNPTTGVSTRESLGC 164
>gi|422301609|ref|ZP_16388976.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
gi|389789327|emb|CCI14609.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
Length = 169
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 69/117 (58%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N + A FT+ ++++S+FS + GA A + NF GADL++ L ++L++
Sbjct: 51 QNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSDLSD 110
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ ++ R+ G I GADFS AV+D Q + LC+ A G N TG+ST +SLGC
Sbjct: 111 AIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEGVNSKTGISTLESLGC 167
>gi|218438105|ref|YP_002376434.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218170833|gb|ACK69566.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 168
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 72/135 (53%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ A F L A + A FT+ D+ E+DFS + GA + + GAD
Sbjct: 33 GATATFEDKKLVGADFSGQTLTLAQFTNVDLSEADFSNADLRGAVFNGSALIEGKLRGAD 92
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L A+L++A+L ++ R+ A + GADFS AV+D Q LC+ A+G
Sbjct: 93 LTNALGYLSSFERADLSDAILAEVIMKRTSFKNADVTGADFSYAVLDGEQIANLCRTASG 152
Query: 228 TNPITGVSTRKSLGC 242
N TGVSTR+SLGC
Sbjct: 153 VNSKTGVSTRESLGC 167
>gi|78779169|ref|YP_397281.1| hypothetical protein PMT9312_0785 [Prochlorococcus marinus str. MIT
9312]
gi|78712668|gb|ABB49845.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 170
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 64/116 (55%), Gaps = 15/116 (12%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +N A S SKF GA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSDSNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNA 119
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|116070732|ref|ZP_01468001.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
gi|116066137|gb|EAU71894.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
Length = 165
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 74/135 (54%), Gaps = 5/135 (3%)
Query: 113 FGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F + D+ K V + ++ A F +++RE+D SGS GA L A A+ + D
Sbjct: 30 FAAVDVAKQVLIGADYANKDLVGATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTD 89
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L + +D V+ NL+NAV+ + +I GADF+D + Q ++LC A+G
Sbjct: 90 LREATLDSAVMTGTNLSNAVMEGAFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADG 149
Query: 228 TNPITGVSTRKSLGC 242
TNP+TG STR+SLGC
Sbjct: 150 TNPVTGRSTRESLGC 164
>gi|354567943|ref|ZP_08987110.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353541617|gb|EHC11084.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 169
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 73/133 (54%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A + + L A K++ ++F A++R S+FS S G + +FTGADLS
Sbjct: 36 AINYNNRTLEAADFSKQDLTDSSFDHANLRNSNFSNSNLRGVRFFSSNLASVDFTGADLS 95
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
++ + +ANLTNA+L T + GAII+GADF+D I LC+ A GTN
Sbjct: 96 YADLESARMTKANLTNAILEGAFTTGTMFDGAIIDGADFTDTYIREDTLNKLCQVAKGTN 155
Query: 230 PITGVSTRKSLGC 242
P+TG +TR +L C
Sbjct: 156 PVTGRNTRDTLAC 168
>gi|17230824|ref|NP_487372.1| hypothetical protein all3332 [Nostoc sp. PCC 7120]
gi|17132427|dbj|BAB75031.1| all3332 [Nostoc sp. PCC 7120]
Length = 206
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 81/147 (55%), Gaps = 9/147 (6%)
Query: 105 FGIGSAAQFG----SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEK 155
FG+ + A F + + K + V+ +F ++FT A++R+S+FS S G
Sbjct: 59 FGMITIANFTPPAFALEYNKEILVEADFSGRDLTDSSFTKANLRQSNFSKSNLTGVSFFA 118
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
A AN G++L++ +D L +ANL NAVL + GAII+GADF+D ++
Sbjct: 119 ANLESANLEGSNLTNATLDSARLIKANLKNAVLEGAFAASTKFDGAIIDGADFTDVLLRP 178
Query: 216 AQKQALCKYANGTNPITGVSTRKSLGC 242
+++ LCK A GTNP TG TR +L C
Sbjct: 179 DEQKKLCKVAKGTNPTTGRETRDTLFC 205
>gi|428770110|ref|YP_007161900.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684389|gb|AFZ53856.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 193
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/131 (40%), Positives = 69/131 (52%), Gaps = 15/131 (11%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----------GADLSDT----- 171
NF A D D +GS F + L A Y++N T GADL +T
Sbjct: 60 NFTYAQLEGEDFSHRDLTGSVFAASNLRNASFYQSNLTNSVMTEGILFGADLRETNFTGS 119
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
L+DR+ L+ A+L NA+ + TR+ IEGADF+ AVID Q +C A+G N I
Sbjct: 120 LIDRVTLDFADLRNAIFTDAIATRTRFYDTNIEGADFTGAVIDRYQVALMCDRASGVNSI 179
Query: 232 TGVSTRKSLGC 242
TGV+TR SLGC
Sbjct: 180 TGVATRDSLGC 190
>gi|119512324|ref|ZP_01631410.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119463037|gb|EAW43988.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 170
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 67/110 (60%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
FT A++R+SDF+ + G A AN ADLS +D L +ANLTNA+L
Sbjct: 60 FTKANLRQSDFNHANLRGVSFFAANLESANLESADLSFATLDSARLIKANLTNAILEGAF 119
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + GAII+GADF+D ++ +++ LC+ A GTNP+TG +TR +L C
Sbjct: 120 ASNARFDGAIIDGADFTDILLRQDEEKKLCQLAKGTNPVTGRNTRDTLFC 169
>gi|33862830|ref|NP_894390.1| hypothetical protein PMT0557 [Prochlorococcus marinus str. MIT
9313]
gi|33634746|emb|CAE20732.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 198
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 77/149 (51%), Gaps = 25/149 (16%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYL 153
EF G A + S D+ ++NF +A+ D+ E+D G+ FN GA L
Sbjct: 63 EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
E VA+ + F GADL AN TNA+L++ S A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGC 242
D Q+ LC A+GTN +G T SLGC
Sbjct: 168 DRRQQNELCARADGTNAASGSQTLDSLGC 196
>gi|425436672|ref|ZP_18817106.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|425449430|ref|ZP_18829270.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|425458879|ref|ZP_18838365.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440755734|ref|ZP_20934936.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389678572|emb|CCH92580.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|389763888|emb|CCI09674.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|389823689|emb|CCI27950.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440175940|gb|ELP55309.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 169
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFSEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 228 TNPITGVSTRKSLGC 242
N TG+ST +SLGC
Sbjct: 153 VNSKTGISTPESLGC 167
>gi|443326265|ref|ZP_21054925.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442794122|gb|ELS03549.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 172
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 67/118 (56%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+N A F ++ SDFS S G A +ANF A+L ++ L++ANLT
Sbjct: 54 HQNLTDATFDHTNLIGSDFSDSNLFGVRFFAANLREANFANANLKFADLEAARLSDANLT 113
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NAVL LT + L G IIEGADFS A++D ++ LC A GTNP TG +TR +L C
Sbjct: 114 NAVLAGAYLTNALLDGVIIEGADFSGALLDRNDEKMLCDIATGTNPTTGRNTRDTLFC 171
>gi|443312459|ref|ZP_21042076.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442777437|gb|ELR87713.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 167
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 70/126 (55%), Gaps = 5/126 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
D V +F +AN ++++ SD +G F A LE A N GA+L++ +D
Sbjct: 46 DFSGQVLTDASFTKANLRNSNLSHSDLTGVSFFAANLESA-----NLEGANLTNATLDAA 100
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+ + NLTNAVL + GAII+GADF+D ++ ++ LCK A GTNP TG T
Sbjct: 101 RIIKTNLTNAVLTGAFAANAKFDGAIIDGADFTDVLLRQDEQDKLCKVAQGTNPTTGKQT 160
Query: 237 RKSLGC 242
R++L C
Sbjct: 161 RETLMC 166
>gi|427715923|ref|YP_007063917.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348359|gb|AFY31083.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 169
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 73/132 (55%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F DL + K N R++NF++A++ SG F A LE A N GA+L++
Sbjct: 47 ADFSGRDLTDSSFTKANLRQSNFSNANL-----SGVSFFAANLESA-----NLQGANLTN 96
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D + NLTNAVL + GAII+GADF+D ++ +++ LCK A GTNP
Sbjct: 97 ATLDSARFIKTNLTNAVLEGAFAANAKFDGAIIDGADFTDVLLRQDEQKKLCKVAKGTNP 156
Query: 231 ITGVSTRKSLGC 242
TG TR +L C
Sbjct: 157 TTGRDTRDTLFC 168
>gi|119486074|ref|ZP_01620136.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
gi|119456849|gb|EAW37977.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
Length = 161
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 68/112 (60%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A F ++++ ++F S+ G+ KA+ AN GADL+ ++D++ + A+L+N++
Sbjct: 49 AEFANSNLESANFDHSQLVGSVFSKAMMKNANMRGADLTYAMLDQVDFSNADLSNSIFTE 108
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ S I GADF+DA++D Q + LC A+G NP TGVSTR SLGC
Sbjct: 109 VLFFGSTFKDTKITGADFTDALLDGEQLRQLCITASGVNPKTGVSTRYSLGC 160
>gi|159903526|ref|YP_001550870.1| hypothetical protein P9211_09851 [Prochlorococcus marinus str. MIT
9211]
gi|159888702|gb|ABX08916.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 169
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 20/152 (13%)
Query: 96 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNG 150
K E R + + + + DL VK + R NF +D+ S+ + ++FNG
Sbjct: 33 KRPPEIRNQDDLNISQDMHAQDLSGREFVKFDLRGINFKDSDLSGAVFNNSNLTNAQFNG 92
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A + ++AY NF DLSD ANLTNA+L+ + + I+GADF+D
Sbjct: 93 ADMHDSLAYATNFENTDLSD----------ANLTNALLMESTFVNTK-----IDGADFTD 137
Query: 211 AVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV+ Q++ LC A+GTN TG+ T SLGC
Sbjct: 138 AVLSRIQQKQLCSIASGTNSNTGIDTEYSLGC 169
>gi|428225171|ref|YP_007109268.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427985072|gb|AFY66216.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 170
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +AN S+++ ++ G F GA LE A N GA L+ +D L +ANLTNA
Sbjct: 59 NFTKANMRSSNLSRANLQGVSFFGANLESA-----NLEGAQLNYATLDSARLVKANLTNA 113
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L T + GA IEGADF+DA++ + + LC+ A+G NP TG +TR+SL C
Sbjct: 114 ILEGTYAFNAKFAGATIEGADFTDALLRDDEIEHLCEVASGVNPTTGRATRESLMC 169
>gi|425455123|ref|ZP_18834848.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9807]
gi|389804043|emb|CCI17099.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9807]
Length = 161
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 69/128 (53%), Gaps = 10/128 (7%)
Query: 125 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 176
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 177 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG
Sbjct: 93 SARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGR 152
Query: 235 STRKSLGC 242
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|425440692|ref|ZP_18820990.1| Pentapeptide repeat family protein (modular protein) [Microcystis
aeruginosa PCC 9717]
gi|389718807|emb|CCH97279.1| Pentapeptide repeat family protein (modular protein) [Microcystis
aeruginosa PCC 9717]
Length = 213
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 69/126 (54%), Gaps = 10/126 (7%)
Query: 127 NFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM----- 176
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 87 NLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESA 146
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +T
Sbjct: 147 RLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNT 206
Query: 237 RKSLGC 242
R +L C
Sbjct: 207 RDTLFC 212
>gi|425470227|ref|ZP_18849097.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9701]
gi|389884202|emb|CCI35462.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9701]
Length = 161
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 132 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 181
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPITGRNTRDTLF 159
Query: 242 C 242
C
Sbjct: 160 C 160
>gi|443663881|ref|ZP_21133269.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443331763|gb|ELS46407.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 150
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 70/131 (53%), Gaps = 10/131 (7%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
FG DLR + N R +NF+ A++ G +F A LE A AN DL
Sbjct: 29 DFGGQDLRDSTFDHSNLRASNFSHANLE-----GVRFFSANLEGADFSDANMRNVDLESA 83
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+ R AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPI
Sbjct: 84 RLTR-----ANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIAKGTNPI 138
Query: 232 TGVSTRKSLGC 242
TG +TR +L C
Sbjct: 139 TGRNTRDTLFC 149
>gi|148242344|ref|YP_001227501.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850654|emb|CAK28148.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
Length = 164
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 79/149 (53%), Gaps = 18/149 (12%)
Query: 108 GSAAQFGSADLRKAVHVKE--------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
G+AA + +LR A +++ N ++ F D+ +DFS S G
Sbjct: 21 GAAAAITAPELRGAKSMQDLSSDMHGRNLQQKEFLKMDLEGTDFSDSDLRGTVFNTTQLQ 80
Query: 160 KANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+NF+GADL D + DR L++A L N +L+++ T A I+GADF++AV+D
Sbjct: 81 DSNFSGADLRDVVAFSSRFDRADLSQARLDNGMLLQSKFT-----DATIDGADFTNAVLD 135
Query: 215 LAQKQALCKYANGTNPITGVSTRKSLGCG 243
L Q + LC A G N +G+ST SLGCG
Sbjct: 136 LPQIKQLCARATGVNERSGLSTADSLGCG 164
>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 171
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 71/118 (60%), Gaps = 10/118 (8%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N R +NFT+AD+R G F A +E+A NF GA+L+ +D + +ANLT
Sbjct: 62 KANLRNSNFTNADLR-----GVSFFAANMEEA-----NFEGANLTGATLDLARMMKANLT 111
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA+L + L GA+I+GADF++ ++ + LCK A GTNP+TG TR++L C
Sbjct: 112 NAILEGAFAYNTRLEGAVIDGADFTETLLRDDMIEKLCKVAKGTNPVTGRDTRETLFC 169
>gi|78184858|ref|YP_377293.1| hypothetical protein Syncc9902_1285 [Synechococcus sp. CC9902]
gi|78169152|gb|ABB26249.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 162
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 65/112 (58%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A F +++RE+D SGS GA L A A+ + DL + +D V+ NL+NAV+
Sbjct: 50 ATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTDLREATLDSAVMTGTNLSNAVMEG 109
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ +I GADF+D + Q ++LC A+GTNP+TG STR+SLGC
Sbjct: 110 AFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADGTNPVTGRSTRESLGC 161
>gi|425434011|ref|ZP_18814483.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9432]
gi|425451971|ref|ZP_18831790.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
7941]
gi|440753099|ref|ZP_20932302.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389678210|emb|CCH92885.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9432]
gi|389766463|emb|CCI07918.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
7941]
gi|440177592|gb|ELP56865.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 161
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
FG DLR + N R +NF+ A++ G +F A LE A AN DL
Sbjct: 41 FGGQDLRDSTFDHSNLRGSNFSHANL-----EGVRFFSANLEGADFSDANMRNVDLESAR 95
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+ R AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPIT
Sbjct: 96 LTR-----ANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPIT 150
Query: 233 GVSTRKSLGC 242
G +TR +L C
Sbjct: 151 GRNTRDTLFC 160
>gi|172037018|ref|YP_001803519.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354555787|ref|ZP_08975086.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171698472|gb|ACB51453.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353552111|gb|EHC21508.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 167
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 73/130 (56%), Gaps = 10/130 (7%)
Query: 123 HVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 172
+ K+N +F+S D+R+SDF G F+ A L+ + ANF GADL
Sbjct: 37 YAKQNLVERDFSSQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ L N TNA+L T + GA+I+GADF+D ++ L ++ LC+ A GTNPIT
Sbjct: 97 LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCEIAKGTNPIT 156
Query: 233 GVSTRKSLGC 242
G +T+ +L C
Sbjct: 157 GRNTKDTLFC 166
>gi|422302957|ref|ZP_16390315.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9806]
gi|389792132|emb|CCI12113.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9806]
Length = 161
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 132 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 181
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 242 C 242
C
Sbjct: 160 C 160
>gi|254414183|ref|ZP_05027950.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196178858|gb|EDX73855.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 178
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 70/139 (50%), Gaps = 20/139 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 163
S F +L + K N + N ++ D+R +S + + GA ++AYK NF
Sbjct: 54 STMDFSGQNLAELEISKMNLTQTNLSNTDLRSVVISDSTMTDANLQGADFSYSIAYKVNF 113
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
GADLSD AVL +L S L I GADFS+AV+D Q Q+LC
Sbjct: 114 KGADLSD---------------AVLEEAILLGSRLDDVNITGADFSNAVLDRVQVQSLCT 158
Query: 224 YANGTNPITGVSTRKSLGC 242
A+G N TGV TR+SLGC
Sbjct: 159 KASGVNSKTGVETRESLGC 177
>gi|409992571|ref|ZP_11275753.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409936565|gb|EKN78047.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 149
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 32 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTN 91
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 92 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 148
>gi|390440388|ref|ZP_10228721.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
gi|389836192|emb|CCI32847.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
Length = 161
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 132 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 181
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 242 C 242
C
Sbjct: 160 C 160
>gi|67922307|ref|ZP_00515820.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67855883|gb|EAM51129.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 164
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
F DLRK A F A++R+S+FS + G A ANF GADL
Sbjct: 41 VDFSGQDLRK---------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRY 91
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
++ L + N TNA+L T + GAII+GADF+D ++D ++ LC A GTNP
Sbjct: 92 ADLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNP 151
Query: 231 ITGVSTRKSLGC 242
ITG +T+ +L C
Sbjct: 152 ITGRNTKDTLYC 163
>gi|291566844|dbj|BAI89116.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 174
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 57 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRFATLDTARLVRANLTN 116
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 117 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 173
>gi|425463375|ref|ZP_18842714.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9809]
gi|389833543|emb|CCI21857.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9809]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 132 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 181
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSRANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 242 C 242
C
Sbjct: 160 C 160
>gi|416389980|ref|ZP_11685429.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
gi|357264135|gb|EHJ13061.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
Length = 164
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
F DLRK A F A++R+S+FS + G A ANF GADL
Sbjct: 41 VDFSGQDLRK---------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRY 91
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
++ L + N TNA+L T + GAII+GADF+D ++D ++ LC A GTNP
Sbjct: 92 ADLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNP 151
Query: 231 ITGVSTRKSLGC 242
ITG +T+ +L C
Sbjct: 152 ITGRNTKDTLYC 163
>gi|427420100|ref|ZP_18910283.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425762813|gb|EKV03666.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 165
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 66/133 (49%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A + LR+ ++ R N+TS DM E+D S + G L KAN AD+S
Sbjct: 32 AKNYDRQSLRQQSFAGQDLRGNNYTSTDMAEADLSNTDLRGVRLFDTNLTKANLESADMS 91
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+D ANL NA+ +D A IEGADF+D +D+ LC+ A G N
Sbjct: 92 GATLDGARFIRANLKNAIFEGAYAFSTDFRKANIEGADFTDVDLDVKTNDMLCEVATGVN 151
Query: 230 PITGVSTRKSLGC 242
P+TG +T+ +L C
Sbjct: 152 PVTGRATKDTLYC 164
>gi|428310976|ref|YP_007121953.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428252588|gb|AFZ18547.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 167
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 65/121 (53%), Gaps = 20/121 (16%)
Query: 127 NFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
N + NF +AD+R FS S +GA +AY ++FTGADLSD
Sbjct: 61 NLEQTNFNNADLRNVVFSSSTLKQASLHGADFTSGIAYLSDFTGADLSD----------- 109
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
AVL ++ RS A I GADF+DAV+D Q + LC A G N TG++TR+SLG
Sbjct: 110 ----AVLTEAIMLRSRFDEADITGADFTDAVLDGVQIKKLCARATGVNSKTGMATRESLG 165
Query: 242 C 242
C
Sbjct: 166 C 166
>gi|428305184|ref|YP_007142009.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428246719|gb|AFZ12499.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 169
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 67/130 (51%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F L A K N R NF+ AD+R G GA LE N GA+LS+
Sbjct: 49 FSGRVLTDATFTKANLRNCNFSHADLR-----GVSLFGANLELV-----NLEGANLSNAT 98
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D +ANLTNAVL + GAII+GADF+D ++ ++ LCK A GTNP T
Sbjct: 99 LDTAKFTKANLTNAVLEGAFAFNAKFDGAIIDGADFTDVLVRQDVQKQLCKIATGTNPTT 158
Query: 233 GVSTRKSLGC 242
G TR +L C
Sbjct: 159 GRETRDTLLC 168
>gi|425447360|ref|ZP_18827349.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9443]
gi|389732098|emb|CCI03919.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9443]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 68/128 (53%), Gaps = 10/128 (7%)
Query: 125 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 176
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 177 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
L AN TNAVL T + GAII+GADF+DA+I + LC+ A GTNPITG
Sbjct: 93 SARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEIYLCEIAKGTNPITGR 152
Query: 235 STRKSLGC 242
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|434386960|ref|YP_007097571.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428017950|gb|AFY94044.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 168
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 69/133 (51%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F A L A ++ FT A +R F + G L A+ TGA+L+
Sbjct: 35 ADDFTKATLENADFSGKDLTSYEFTQASVRNGKFINANLTGVSLIGGNFDSADMTGANLT 94
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+ L+D N TNA+LV + ++ GAII+GADF+D ++ ++ LCK A GTN
Sbjct: 95 NALLDTARFTRTNFTNAILVGAFTSVTNFDGAIIDGADFTDVLLRKDIQKKLCKVAKGTN 154
Query: 230 PITGVSTRKSLGC 242
P TG TR+SL C
Sbjct: 155 PTTGRDTRESLEC 167
>gi|423066922|ref|ZP_17055712.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711687|gb|EKD06887.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 137
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 20 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTN 79
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 80 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136
>gi|434404813|ref|YP_007147698.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428259068|gb|AFZ25018.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 172
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 64/110 (58%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F A++R+S+ S + NG A AN GADL ++ +D L ANLTNA+L
Sbjct: 62 FAKANLRQSNLSHTNLNGVSFFAANLESANLEGADLRNSTLDSARLVRANLTNALLEGAF 121
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ GAII+GADF+D ++ +++ LCK A GTNP+T TR +L C
Sbjct: 122 AANARFDGAIIDGADFTDMLLRQDEQKKLCKLAKGTNPVTLRDTRDTLFC 171
>gi|425458741|ref|ZP_18838229.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9808]
gi|389824728|emb|CCI26060.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9808]
Length = 161
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
FG DLR + N R +NF+ A++ G +F A LE A AN DL
Sbjct: 41 FGGQDLRDSTFDHSNLRGSNFSHANL-----EGVRFFSANLEGADFSDANMRNVDLESAR 95
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+ R AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+T
Sbjct: 96 LTR-----ANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVT 150
Query: 233 GVSTRKSLGC 242
G +TR +L C
Sbjct: 151 GRNTRDTLFC 160
>gi|318041291|ref|ZP_07973247.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 161
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D+ K V + +F R A F ++RE+DF GS GA L A AN +G DL+D
Sbjct: 30 DVAKQVLIGHDFAGMDLRGATFNLTNLREADFHGSDLRGASLFGAKLQDANLSGTDLTDA 89
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D VL+ +L NAVL + +IEGADF++ + LC A+GTNP+
Sbjct: 90 TLDSAVLDGTDLRNAVLENAFAFNTRFNNVLIEGADFTNVPFRGDVLKTLCASASGTNPV 149
Query: 232 TGVSTRKSLGC 242
TG +TR +L C
Sbjct: 150 TGRNTRDTLEC 160
>gi|218245449|ref|YP_002370820.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|257058486|ref|YP_003136374.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|218165927|gb|ACK64664.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
gi|256588652|gb|ACU99538.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 168
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 71/118 (60%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+++ + F +AD+ E++FS S GA +ANF GA+L++ L +A+L+
Sbjct: 50 RQDLKEVKFANADLTEANFSDSDLRGAVFNGVELKQANFHGANLTNGLAYLSSFRDADLS 109
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+A+L ++ R+ A I GADF+ AV+D + LC+ A+G N TG+STR+SLGC
Sbjct: 110 DAILSEVIMLRTVFDNANITGADFTLAVLDGEEVAKLCQRADGVNSKTGMSTRESLGC 167
>gi|209527449|ref|ZP_03275954.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376003366|ref|ZP_09781178.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
gi|209492122|gb|EDZ92472.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375328288|emb|CCE16931.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
Length = 137
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 20 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDINLESADLRLATLDTARLVRANLTN 79
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 80 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136
>gi|116074641|ref|ZP_01471902.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
gi|116067863|gb|EAU73616.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
Length = 158
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 66/131 (50%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D K V + +F + F ++RE+DFSGS GA L A AN T +L D
Sbjct: 28 DYAKQVLIGSDFTNREMQGVTFNLTNLREADFSGSDLQGASLYGAKLQDANLTDTNLRDA 87
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D VL+ NLTNAVL + II GADF++ + LC A GTNP+
Sbjct: 88 TLDSAVLDGTNLTNAVLEDAFAFNTRFSNVIITGADFTNVPFRGDALKTLCAAAEGTNPV 147
Query: 232 TGVSTRKSLGC 242
TG TR +LGC
Sbjct: 148 TGRDTRDTLGC 158
>gi|443475471|ref|ZP_21065420.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019714|gb|ELS33767.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 164
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 90/189 (47%), Gaps = 37/189 (19%)
Query: 65 LKNWRVFVSTALAAAV------VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
+K W++ S L+ V V + SS+ + LN E F +L
Sbjct: 1 MKYWQLITSIVLSIFVFLMPLPVQAASSSSVTRSILNAVGGE-----------DFSGKNL 49
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLM 173
+A + ANFT+AD+R + F+G +GA L +AY + F DLSD
Sbjct: 50 IRAEFTSVTLKNANFTNADLRGAIFNGVLLDGANLHGSDFSSGIAYISRFKNVDLSDA-- 107
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 233
VLN+ N+ RS + GADF++A++D+ Q + LC A+GTN TG
Sbjct: 108 ---VLNDTNML----------RSTFDNVEVTGADFTNALLDIQQLKKLCINASGTNSKTG 154
Query: 234 VSTRKSLGC 242
VSTR+SLGC
Sbjct: 155 VSTRESLGC 163
>gi|166364098|ref|YP_001656371.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
NIES-843]
gi|166086471|dbj|BAG01179.1| pentapeptide repeat family protein [Microcystis aeruginosa
NIES-843]
Length = 161
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 74/130 (56%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DLR + N R +NF+ A++ G +F A LE A NF+ A++ +
Sbjct: 41 FAGQDLRDSTFDHSNLRGSNFSRANL-----EGVRFFSANLEGA-----NFSDANMRNVD 90
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ L +AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+T
Sbjct: 91 LESARLTKANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVT 150
Query: 233 GVSTRKSLGC 242
G +TR +L C
Sbjct: 151 GRNTRDTLFC 160
>gi|443320013|ref|ZP_21049146.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442790267|gb|ELR99867.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 164
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 65/131 (49%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F + DLR +N + FT ++ +F+ + G +AN G D S
Sbjct: 33 RFDNRDLRGESFANQNLQTVEFTKVKLQGVNFANADLIGVVFNSTALDQANLQGVDFSQG 92
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+ + +L +A+LV +L RS I GADFS AV+D Q LC YA+G N
Sbjct: 93 IAYLTSFDGVDLRDALLVEALLLRSTFKDTKISGADFSSAVLDQDQLDKLCSYADGVNSK 152
Query: 232 TGVSTRKSLGC 242
TGV TR+SLGC
Sbjct: 153 TGVKTRESLGC 163
>gi|434397761|ref|YP_007131765.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428268858|gb|AFZ34799.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 166
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 67/130 (51%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F D R +N + +F D+ ++FS + GA + AN G D S
Sbjct: 36 FSEVDFRSKDFSGKNLQSIDFAKVDLESANFSNADLRGAVFNASNLANANLQGVDFSYGF 95
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+ A+LT+A+ T+L+ S GA I+ ADF+ AV++ Q + LC A+G NP T
Sbjct: 96 AYLTNFDGADLTDAIFQETILSFSTFEGAKIKNADFTFAVLEKWQVKQLCANASGVNPKT 155
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 156 GVDTRESLGC 165
>gi|126696874|ref|YP_001091760.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9301]
gi|126543917|gb|ABO18159.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
Length = 186
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 71/128 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
S D+ K + + D+ D + GAY+ A ++F GA+++D +
Sbjct: 46 SVDVLKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAY 105
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
+ A+ T+A L L +S GAII+GADF+DA +DL+Q+++LC+ A+GTN TGV
Sbjct: 106 ATRFDNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNSQTGV 165
Query: 235 STRKSLGC 242
+T SL C
Sbjct: 166 NTIDSLEC 173
>gi|123969083|ref|YP_001009941.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
gi|123199193|gb|ABM70834.1| Pentapeptide repeats [Prochlorococcus marinus str. AS9601]
Length = 186
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 71/128 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
S D+ K + + D+ D + GAY+ A ++F GA+++D +
Sbjct: 46 SVDVLKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAY 105
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
+ A+ T+A L L +S GAII+GADF+DA +DL+Q+++LC+ A+GTN TGV
Sbjct: 106 ATRFDNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNTKTGV 165
Query: 235 STRKSLGC 242
+T SL C
Sbjct: 166 NTIDSLEC 173
>gi|428220990|ref|YP_007105160.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994330|gb|AFY73025.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 165
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 62/110 (56%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F D+ ++F + G L A A+FTGADL + +D +N ANLTNAVL
Sbjct: 54 FNKTDLHNANFRNANLAGVSLFGANMTAADFTGADLRYSTLDTARMNGANLTNAVLEGAF 113
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + G +I+GADFSD + + LCK A GTNP+TG TR++L C
Sbjct: 114 VYGTSFVGTVIDGADFSDVDLRNTTRSLLCKVAKGTNPVTGRDTRETLEC 163
>gi|359460819|ref|ZP_09249382.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 164
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 70/127 (55%), Gaps = 10/127 (7%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN------ 179
E+ +F+ D+RE++FS ++ GA +A F G DL+ + + +
Sbjct: 37 EDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTGGMAYL 96
Query: 180 ----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 235
EA+L+ A+L +L +S L A + ADFS AVID Q + LC+ A+G NP+TGV
Sbjct: 97 SSFAEADLSGAILTEAMLLQSSLRNATVTDADFSFAVIDKDQVKILCETASGVNPVTGVD 156
Query: 236 TRKSLGC 242
TR SLGC
Sbjct: 157 TRDSLGC 163
>gi|254431831|ref|ZP_05045534.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
gi|197626284|gb|EDY38843.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
Length = 174
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 64/117 (54%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R F ++R++D SGS GA L A A+ + +L +T +D V N +LTN
Sbjct: 57 QDLRGGTFNLTNLRDADLSGSDLQGASLFGAKLQDADLSNTNLRETTLDSAVFNGTDLTN 116
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL + II+GADF++ + +ALC A GTNP+TG TR +LGC
Sbjct: 117 AVLEDAFAFNTKFSDVIIDGADFTNVPLRGDALKALCAVARGTNPVTGRQTRDTLGC 173
>gi|33861334|ref|NP_892895.1| hypothetical protein PMM0777 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33633911|emb|CAE19236.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 170
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 74/140 (52%), Gaps = 29/140 (20%)
Query: 117 DLRKAVHVKE----NFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKAN 162
DL + +H ++ F + N D +S+ G+ FN GA L A+AY +
Sbjct: 46 DLEEDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATMTGANLSDALAYATD 105
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
FT ADLSD N TNA+L+ S+ GA I+GADF++AV+ Q++ LC
Sbjct: 106 FTDADLSD----------VNFTNALLME-----SNFEGAKIDGADFTNAVLSRIQQKELC 150
Query: 223 KYANGTNPITGVSTRKSLGC 242
+ ANGTN TG ST SLGC
Sbjct: 151 EIANGTNSSTGESTEYSLGC 170
>gi|158337467|ref|YP_001518642.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307708|gb|ABW29325.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 164
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 70/127 (55%), Gaps = 10/127 (7%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN------ 179
E+ +F+ D+RE++FS ++ GA +A F G DL+ + + +
Sbjct: 37 EDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTGGMAYL 96
Query: 180 ----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 235
EA+L+ A+L +L +S L A + ADFS AVID Q + LC+ A+G NP+TGV
Sbjct: 97 SSFAEADLSGAILTEAMLLQSSLRDATVTDADFSFAVIDKDQVKILCETASGVNPVTGVD 156
Query: 236 TRKSLGC 242
TR SLGC
Sbjct: 157 TRDSLGC 163
>gi|56752263|ref|YP_172964.1| hypothetical protein syc2254_d [Synechococcus elongatus PCC 6301]
gi|24251237|gb|AAN46157.1| unknown protein [Synechococcus elongatus PCC 7942]
gi|56687222|dbj|BAD80444.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 171
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 70/130 (53%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A++ + + ++ +A F S ++ F G+ GA +ANF AD +D +
Sbjct: 40 FDDAEVTRQDYSGQSLIQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGI 99
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
L N NA L +L +S+L G+ + GADFS AV+ Q ALC+ A+GTNP T
Sbjct: 100 AYVSDLRNVNFRNANLTSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKT 159
Query: 233 GVSTRKSLGC 242
G TR+SLGC
Sbjct: 160 GADTRESLGC 169
>gi|113475775|ref|YP_721836.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110166823|gb|ABG51363.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 165
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 63/110 (57%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F A M +++F G+ L K +AN T AD + T DR+ ++++LTNA+ +
Sbjct: 55 FAGATMWKANFQGANLQNTILTKGDFLRANLTEADFTGTFADRVSFDKSDLTNAIFTDAM 114
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L S A + G DFS A++D Q + +C+ A+G N TGV TR+SLGC
Sbjct: 115 LMSSTFRDATVIGTDFSGAMVDRYQIKLMCETASGKNKTTGVETRESLGC 164
>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 169
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 64/112 (57%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A FT A++R S+FS + G A ANF GA+L +D + + NLTNA+L
Sbjct: 57 AQFTKANLRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLTNAILEG 116
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ AI++GADF+D +I + LCK A GTNP+TG +TR++L C
Sbjct: 117 AFAYNTKFERAIVDGADFTDILIRDDMVEKLCKVARGTNPVTGRNTRETLFC 168
>gi|428313239|ref|YP_007124216.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254851|gb|AFZ20810.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 169
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 63/112 (56%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
++FT A++R S+FS S G A ANF GA+L + +D L A+L NAVL
Sbjct: 57 SSFTKANLRSSNFSHSNLEGVSFFSANLESANFEGANLRNATLDTARLTRASLKNAVLEG 116
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ GA IEGADF++ + ++ LC A+GTNP TG STR +L C
Sbjct: 117 AFAFNTKFDGATIEGADFTEVLFRQDVQKQLCHVASGTNPTTGRSTRDTLFC 168
>gi|113474577|ref|YP_720638.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110165625|gb|ABG50165.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 144
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL K R++NF++A++ SG GA+LE A N GA+LS +
Sbjct: 24 FSGKDLTNDSFTKSILRKSNFSNANL-----SGVSLFGAHLEGA-----NLEGANLSYST 73
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D V N+ANLTNA+L + AII+GADF+DA + + LCK A G N IT
Sbjct: 74 LDDAVFNKANLTNAILEGAFAFHTQFRDAIIDGADFTDAFLRKDTTKDLCKIAQGKNSIT 133
Query: 233 GVSTRKSLGC 242
G TR +L C
Sbjct: 134 GKETRDTLFC 143
>gi|123966041|ref|YP_001011122.1| hypothetical protein P9515_08061 [Prochlorococcus marinus str. MIT
9515]
gi|123200407|gb|ABM72015.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 170
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 56/148 (37%), Positives = 76/148 (51%), Gaps = 20/148 (13%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLE 154
E R + + DL VK N +F+ +++ + F+ SK NGA L
Sbjct: 38 EIRNQQDLDLEQDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATLNGANLT 97
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A+AY +FT ADLSD N TNA+L+ S+ GA I+GADF++AV+
Sbjct: 98 DALAYATDFTDADLSD----------VNFTNALLME-----SNFEGAKIDGADFTNAVLS 142
Query: 215 LAQKQALCKYANGTNPITGVSTRKSLGC 242
Q++ LC ANGTN TG ST SLGC
Sbjct: 143 RIQQKELCAIANGTNSSTGESTEYSLGC 170
>gi|78212716|ref|YP_381495.1| hypothetical protein Syncc9605_1185 [Synechococcus sp. CC9605]
gi|78197175|gb|ABB34940.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 165
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 68/120 (56%)
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
+ ++ R A F +++RE++ SGS GA L A A+ +G DL + +D V+ N
Sbjct: 45 YSNKDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTN 104
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L +AVL + +I GADF+D + Q ++LC A+GTN +TG STR+SLGC
Sbjct: 105 LEDAVLEGAFAFNTRFSDVLITGADFTDVPMRGDQLKSLCAVADGTNSVTGRSTRESLGC 164
>gi|81300649|ref|YP_400857.1| hypothetical protein Synpcc7942_1840 [Synechococcus elongatus PCC
7942]
gi|81169530|gb|ABB57870.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 168
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 70/130 (53%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A++ + + ++ +A F S ++ F G+ GA +ANF AD +D +
Sbjct: 37 FDDAEVTRQDYSGQSLIQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGI 96
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
L N NA L +L +S+L G+ + GADFS AV+ Q ALC+ A+GTNP T
Sbjct: 97 AYVSDLRNVNFRNANLTSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKT 156
Query: 233 GVSTRKSLGC 242
G TR+SLGC
Sbjct: 157 GADTRESLGC 166
>gi|428771687|ref|YP_007163477.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428685966|gb|AFZ55433.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 159
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 65/115 (56%), Gaps = 5/115 (4%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F + N SA++ +SD G F GA ++ N GA+L+++++D L ANL NAV
Sbjct: 49 FNKTNLRSANLSQSDLQGVSFFGANMDSI-----NLEGANLTNSILDSARLTRANLRNAV 103
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L T + GA IEGADF+D ++ ++ LC+ A G NP TG TR +L C
Sbjct: 104 LEGAFATNTKFEGANIEGADFTDVILRPDVEEMLCEKAKGVNPTTGRKTRDTLYC 158
>gi|428215647|ref|YP_007088791.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004028|gb|AFY84871.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 183
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 71/133 (53%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A + +L A + +A+F A++R+S+ S + GA L A AN GA+LS
Sbjct: 50 AQNYNKENLLGADFSGRDLTQASFNHANLRKSNLSHANLQGASLFAAHLEDANLEGANLS 109
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+T +D NL NA+L + + GA IEGADF+D + + LC+ A GTN
Sbjct: 110 NTTLDTARFIRTNLKNAILEGSFAFSAKFNGANIEGADFTDVFLRDDANEILCELATGTN 169
Query: 230 PITGVSTRKSLGC 242
P+TG +TR +L C
Sbjct: 170 PVTGRNTRDTLYC 182
>gi|427734374|ref|YP_007053918.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369415|gb|AFY53371.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 167
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 71/131 (54%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D K + ++ +F ++FT A++R+S+FS S G AN A+L
Sbjct: 36 DYNKEILIEADFSGQDLTDSSFTKANLRDSNFSNSNLQGVRFFATNLESANLRNANLRYA 95
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D L +A+LTNAVL + + GAII+GADF+D ++ ++ LCK A GTNP
Sbjct: 96 TLDSARLVKADLTNAVLEGAFASNARFDGAIIDGADFTDVLLRADEQDKLCKLAKGTNPT 155
Query: 232 TGVSTRKSLGC 242
TG TR +L C
Sbjct: 156 TGRDTRDTLFC 166
>gi|428780675|ref|YP_007172461.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428694954|gb|AFZ51104.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 167
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 77/147 (52%), Gaps = 7/147 (4%)
Query: 98 EAETRGEFGIGS--AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
EA+T F S +A DL AN T AD+ +D GS F + ++
Sbjct: 25 EAQTSTRFQRQSLISADLSEEDLSGETLQLREISDANLTGADLSNADLRGSIFTASVMKN 84
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
A + ANFT T+++ + A+L+ A+L +L+R+ L I GADF++AV+D
Sbjct: 85 ANLHGANFTF-----TVLNGVDFTNADLSQAILEDAILSRAILKDVDITGADFTNAVLDN 139
Query: 216 AQKQALCKYANGTNPITGVSTRKSLGC 242
Q LC+ A G N TGV+TR+SLGC
Sbjct: 140 QQYNQLCEMATGVNEETGVATRESLGC 166
>gi|218438527|ref|YP_002376856.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218171255|gb|ACK69988.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 172
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 64/117 (54%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R A F A++R S+FS G A ANF GA+L ++ L N TN
Sbjct: 54 QDLRDAKFDHANLRSSNFSNVNAEGVRFFAANLESANFEGANLRYADLESARLTRVNFTN 113
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL T + GAII+GADF+D ++ +Q LC A GTNP+TG +T+ +L C
Sbjct: 114 AVLEGAFATNTLFKGAIIDGADFTDVLLRPDTEQYLCTIAKGTNPVTGRNTKDTLYC 170
>gi|78779832|ref|YP_397944.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
gi|78713331|gb|ABB50508.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
Length = 186
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 75/153 (49%), Gaps = 20/153 (13%)
Query: 108 GSAAQFGSADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
G ADL+ VK ++ AN A M + + S F A ++ +AY
Sbjct: 49 GLKEDLHGADLQNNEFVKYDLSNQDLGEANLQGAYMSVTTAANSSFKSANMKDLIAYAVR 108
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
F ADLSD ANLTN L+++V GA I+GADF+DA +DL Q+++LC
Sbjct: 109 FDNADLSD----------ANLTNGELMKSVF-----DGATIDGADFTDATLDLPQRKSLC 153
Query: 223 KYANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 255
+ A GTN TGV T SL C R +P +
Sbjct: 154 ERATGTNSKTGVDTVDSLECSGLRGYIPATPEA 186
>gi|434395414|ref|YP_007130361.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428267255|gb|AFZ33201.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 168
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 63/110 (57%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F A++R S+FS + G L A AN GA+L++ +D L+ ANL +AVL
Sbjct: 58 FNHANLRNSNFSHANLEGVSLFAANLESANLEGANLTNATLDSARLSNANLKDAVLEGAF 117
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ AII+GADF+D ++ ++ LCK A GTNP TG TR++L C
Sbjct: 118 AANAKFDKAIIDGADFTDVLLRRDEQDKLCKVAKGTNPTTGRETRETLMC 167
>gi|123966744|ref|YP_001011825.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9515]
gi|123201110|gb|ABM72718.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
Length = 192
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 71/132 (53%), Gaps = 20/132 (15%)
Query: 116 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADL+ +VK ++ AN A M + S F GA ++ +AY F AD SD
Sbjct: 63 ADLQNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 123 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 167
Query: 231 ITGVSTRKSLGC 242
TGV T +SL C
Sbjct: 168 RTGVDTFESLEC 179
>gi|449018152|dbj|BAM81554.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 321
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 75/129 (58%), Gaps = 1/129 (0%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+L A K++ +F + +R+ DFSGS A A ANF A+LS ++
Sbjct: 193 NLEGANFAKQDLHGVSFQQSIVRDVDFSGSNLQDASFFDADCSGANFQNANLSRANLELA 252
Query: 177 VLNEANLTNAVLVRT-VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 235
L +A+L NA+L V+ ++ L G IEG+D++D ++ Q++ LCK A+G NP+T ++
Sbjct: 253 NLRKADLRNAILTNAYVVGQTKLEGIQIEGSDWTDVLLRPDQRRLLCKRASGENPVTHIA 312
Query: 236 TRKSLGCGN 244
T+ SLGC +
Sbjct: 313 TKDSLGCAD 321
>gi|124025420|ref|YP_001014536.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL1A]
gi|123960488|gb|ABM75271.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
Length = 156
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 68/134 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA +G L + + + A F +D+++SDFSGS GA A AN + ++
Sbjct: 23 SALDYGKQTLIGSDFSNIDLKGATFYLSDLQDSDFSGSDLQGASFFDAKLENANLSNTNM 82
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
D MD +LN ANL+N+VL + IIEGADF+D +I + LC ANG
Sbjct: 83 RDVTMDAAILNGANLSNSVLEGAFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGI 142
Query: 229 NPITGVSTRKSLGC 242
N +T T +L C
Sbjct: 143 NSVTNKKTSDTLDC 156
>gi|422295781|gb|EKU23080.1| pentapeptide repeat protein [Nannochloropsis gaditana CCMP526]
Length = 217
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 70/117 (59%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++F + +F+ A + ++F G+K GA K+ +A+FTGADL+ + + +A L +
Sbjct: 100 KDFSKKDFSGAFAQRANFKGAKLMGARFYKSALTEADFTGADLTSASFEGANMVDAILKD 159
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A++ T + L IEGADFSD ++D ++ LC+ A GTNP T V TR+SL C
Sbjct: 160 AIVNNAYFTETVLKVGSIEGADFSDTLLDRFVQKKLCEKATGTNPKTKVDTRESLLC 216
>gi|113954335|ref|YP_730803.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
gi|113881686|gb|ABI46644.1| pentapeptide repeat protein [Synechococcus sp. CC9311]
Length = 157
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 68/135 (50%), Gaps = 5/135 (3%)
Query: 113 FGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F + D K V + +F + F ++RE+D SGS GA L A AN + ++
Sbjct: 22 FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNSN 81
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L D +D V + NLTNAVL + +EGADF++ + + LC A G
Sbjct: 82 LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRTDALKVLCANAEG 141
Query: 228 TNPITGVSTRKSLGC 242
NP+TG TR++LGC
Sbjct: 142 VNPVTGRDTRETLGC 156
>gi|72382023|ref|YP_291378.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|124025522|ref|YP_001014638.1| hypothetical protein NATL1_08151 [Prochlorococcus marinus str.
NATL1A]
gi|72001873|gb|AAZ57675.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
gi|123960590|gb|ABM75373.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 170
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 63/116 (54%), Gaps = 15/116 (12%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +N T A S +G+ +GA L A+AY ++F GADL D + +L E+N T+A
Sbjct: 70 NFSESNLTGAVFNNSKLNGADLHGAQLNDALAYASDFEGADLRDVDFNGALLMESNFTDA 129
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+IEGADF+DAVI Q++ LC A+GTN T T SLGC
Sbjct: 130 ---------------LIEGADFTDAVISRIQQKELCNMASGTNSKTDEDTSYSLGC 170
>gi|428205702|ref|YP_007090055.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428007623|gb|AFY86186.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 169
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 62/110 (56%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F+ A++R S+FS S G L A ANF GA+L+ +D L ANL +A+L
Sbjct: 59 FSHANLRSSNFSHSNLEGVSLFAANLDSANFEGANLASATLDSARLTRANLKDAILEGAF 118
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ GA+I+GADF+D ++ + LC+ A G NP TG +TR +L C
Sbjct: 119 AANTKFDGAVIDGADFTDVLMRRDVQDKLCQVAKGVNPTTGRATRDTLFC 168
>gi|254526458|ref|ZP_05138510.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
gi|221537882|gb|EEE40335.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
Length = 179
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 20/132 (15%)
Query: 116 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADL +VK ++ AN A M + S F GA ++ +AY F AD +D
Sbjct: 50 ADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFTD 109
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 110 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 154
Query: 231 ITGVSTRKSLGC 242
TGV+T SL C
Sbjct: 155 QTGVNTADSLEC 166
>gi|157413912|ref|YP_001484778.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9215]
gi|157388487|gb|ABV51192.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
str. MIT 9215]
Length = 186
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 20/132 (15%)
Query: 116 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADL +VK ++ AN A M + S F GA ++ +AY F AD +D
Sbjct: 57 ADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFTD 116
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 117 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 161
Query: 231 ITGVSTRKSLGC 242
TGV+T SL C
Sbjct: 162 QTGVNTADSLEC 173
>gi|22298403|ref|NP_681650.1| hypothetical protein tll0860 [Thermosynechococcus elongatus BP-1]
gi|22294582|dbj|BAC08412.1| tll0860 [Thermosynechococcus elongatus BP-1]
Length = 178
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 69/130 (53%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DLR + F +AN +++ ++ G F GA LE A N GADL
Sbjct: 54 FSGRDLRGS-----EFTKANLFHSNLSHTNLQGVSFFGANLETA-----NLEGADLRYAT 103
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D L +ANLTNA+L ++ AII GADF+D + ++ LCK A+GTNP+T
Sbjct: 104 LDTARLTKANLTNAILEGAFAFNTNFDDAIITGADFTDVELREDAQRKLCKVASGTNPVT 163
Query: 233 GVSTRKSLGC 242
G T ++L C
Sbjct: 164 GRKTWETLHC 173
>gi|434392213|ref|YP_007127160.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428264054|gb|AFZ30000.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 165
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N + A F +AD+ ++FS + G A KAN GAD ++ + + ANL++
Sbjct: 49 QNLQTAEFANADLEAANFSNADLRGVVFNGAKLIKANLHGADFTNGIAYIVDFTGANLSD 108
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV+ ++ RS I GADF++AV+D + LC A+G N TGV+TR SLGC
Sbjct: 109 AVMEEAMMLRSIFNDVDITGADFTNAVLDRTVVKKLCAQASGVNSKTGVATRDSLGC 165
>gi|91070378|gb|ABE11292.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
HF10-88H9]
Length = 186
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 20/132 (15%)
Query: 116 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADL +VK ++ AN A M + S F GA ++ +AY F AD +D
Sbjct: 57 ADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFTD 116
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 117 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 161
Query: 231 ITGVSTRKSLGC 242
TGV+T SL C
Sbjct: 162 QTGVNTADSLEC 173
>gi|72381929|ref|YP_291284.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|72001779|gb|AAZ57581.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
Length = 156
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 68/134 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA +G L + + + A F +D++ SDFSGS GA A AN + ++
Sbjct: 23 SALDYGKQTLIGSDFSNIDLKGATFYLSDLQNSDFSGSDLQGASFFDAKLENANLSNTNM 82
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
D MD +LN ANL+N++L + IIEGADF+D +I + LC ANG
Sbjct: 83 RDVTMDAAILNGANLSNSILEGAFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGI 142
Query: 229 NPITGVSTRKSLGC 242
N +T T ++L C
Sbjct: 143 NSVTNKKTSETLDC 156
>gi|218248608|ref|YP_002373979.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218169086|gb|ACK67823.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 152
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DLR A+ N R +NF+ A+++ G +F A LE A NF GADL
Sbjct: 28 FSGQDLRDALFDHANLRGSNFSHANLQ-----GVRFFSANLEGA-----NFEGADLRGAD 77
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ L N TNA+L T + G II+GADF+D ++ ++ LC A GTNP+T
Sbjct: 78 LESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVT 137
Query: 233 GVSTRKSLGC 242
G +T+ +L C
Sbjct: 138 GRNTKDTLFC 147
>gi|56751209|ref|YP_171910.1| hypothetical protein syc1200_c [Synechococcus elongatus PCC 6301]
gi|81299124|ref|YP_399332.1| hypothetical protein Synpcc7942_0313 [Synechococcus elongatus PCC
7942]
gi|56686168|dbj|BAD79390.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168005|gb|ABB56345.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 170
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 68/131 (51%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D K + ++ NF ANFT A++R SDFS S G A + GADLS+T
Sbjct: 39 DFTKEILIESNFSNRDLSDANFTKANLRSSDFSNSVLVGVRFYGANLESVDLHGADLSNT 98
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
++D+ + +LT+A+L + GA I GADF+D ++ + LC A G N
Sbjct: 99 ILDQARMTNTDLTDAILEGAYAFNALFQGAKITGADFTDVLMRQDAQDLLCSVAEGVNSK 158
Query: 232 TGVSTRKSLGC 242
TG +TR +L C
Sbjct: 159 TGRATRDTLDC 169
>gi|257061674|ref|YP_003139562.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591840|gb|ACV02727.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 167
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DLR A+ N R +NF+ A+++ G +F A LE A NF GADL
Sbjct: 47 FSGQDLRDALFDHANLRGSNFSHANLQ-----GVRFFSANLEGA-----NFEGADLRGAD 96
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ L N TNA+L T + G II+GADF+D ++ ++ LC A GTNP+T
Sbjct: 97 LESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVT 156
Query: 233 GVSTRKSLGC 242
G +T+ +L C
Sbjct: 157 GRNTKDTLFC 166
>gi|307151213|ref|YP_003886597.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306981441|gb|ADN13322.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 174
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 70/137 (51%), Gaps = 20/137 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 165
A F L A + +ANF+ AD+R + F+GS K +GA L A+AY ++F G
Sbjct: 46 ADFSGQRLTLAQFTNVDLTQANFSDADLRGAVFNGSALKEVKLHGADLTNALAYLSSFEG 105
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
ADLSD A+ +L R+ A + G DFS AV+D + LCK A
Sbjct: 106 ADLSD---------------AIFAEAILKRTSFKNADVTGTDFSFAVLDGEEIANLCKSA 150
Query: 226 NGTNPITGVSTRKSLGC 242
+G N TGVSTR SL C
Sbjct: 151 SGVNSKTGVSTRDSLRC 167
>gi|434406341|ref|YP_007149226.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428260596|gb|AFZ26546.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 165
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 68/121 (56%), Gaps = 20/121 (16%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEA 181
N ANF+ AD+R F+G+ G L + +AY NF GAD +D A
Sbjct: 59 NLENANFSDADLRGVVFNGTLLKGVNLHGVDFSQGIAYLVNFKGADFTD----------A 108
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 241
T+A+++R++ + + GADF++AV+D+ Q + LC A+G N TGV+TR+SLG
Sbjct: 109 VFTDAMMLRSLFDDVN-----VTGADFTNAVLDMQQVKKLCLKASGVNSQTGVNTRESLG 163
Query: 242 C 242
C
Sbjct: 164 C 164
>gi|428774426|ref|YP_007166214.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428688705|gb|AFZ48565.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 158
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 72/132 (54%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+ F DL + K N R ++FT+A++ F G+ + A LE GA+L++
Sbjct: 36 SDFSGQDLSGSTFNKTNLRSSDFTNANLSNVSFFGANLDSANLE----------GANLTN 85
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
++D + ANL NAVL T + A IEGADF+D ++ ++ LC+ A+G NP
Sbjct: 86 AVLDSARVTRANLHNAVLEGAFATNTKFEKANIEGADFTDVLLRPDVEEMLCEVASGINP 145
Query: 231 ITGVSTRKSLGC 242
+TG +TR +L C
Sbjct: 146 VTGRNTRDTLYC 157
>gi|126659509|ref|ZP_01730642.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
gi|126619243|gb|EAZ89979.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
Length = 167
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 123 HVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 172
+ K+N +F+ D+R+SDF G F+ A L+ + ANF GADL
Sbjct: 37 YAKQNLVERDFSGQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
++ L N TNA+L T + GA+I+GADF+D ++ L ++ LC A GTNP+T
Sbjct: 97 LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCDIAKGTNPVT 156
Query: 233 GVSTRKSLGC 242
+T+ +L C
Sbjct: 157 RRNTKDTLFC 166
>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
Length = 196
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 16/178 (8%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 34 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
N R A+FT A+++ + F + +GA LE A A +F A L+ ANL
Sbjct: 88 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 137
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G IEGAD +D ++ + LC A GTNP+TG T+++L C
Sbjct: 138 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 195
>gi|452821017|gb|EME28052.1| thylakoid lumenal protein [Galdieria sulphuraria]
Length = 217
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 68/122 (55%), Gaps = 1/122 (0%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+++ +F + +RE+DF G+K A A AN ADL++ ++ L A L
Sbjct: 96 EQDLSGVSFQQSLLRETDFHGAKLVSASFFGAELSYANLEDADLTEANLELANLRSAKLK 155
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 243
NAVL R + + L I+GADFS+ ++ QK+ LC ANGTN TGV T+ SLGC
Sbjct: 156 NAVLRRAYFSGNTRLENVDIDGADFSEVILRKDQKKYLCNIANGTNSHTGVETKTSLGCN 215
Query: 244 NS 245
+S
Sbjct: 216 SS 217
>gi|427728200|ref|YP_007074437.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364119|gb|AFY46840.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 164
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 60/112 (53%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A FT+AD+ ++FS + G V N G D S+ + ANL++AVL
Sbjct: 52 AEFTNADLENANFSDADLRGGVFNGTVLEGVNLHGVDFSNGIAYLAKFKNANLSDAVLTD 111
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++ RS I G DF++AV+D Q + LC A+G N TGV TR+SLGC
Sbjct: 112 AMMLRSTFDNVDITGTDFTNAVLDGPQVKKLCTKASGVNSKTGVDTRESLGC 163
>gi|443328810|ref|ZP_21057403.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791546|gb|ELS01040.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 170
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F L++ K + ANF+++D+R SD S + +GA AY +F GAD
Sbjct: 49 FSGKTLQRLDFAKVDLSEANFSNSDLRGAVFNASDLSNANLHGADFTYGFAYLTDFQGAD 108
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
LSD A+ T+L+ S A+I+GADF+ A+++ Q LC+ A G
Sbjct: 109 LSD---------------AIFRETILSFSSFEDAMIDGADFTLAILEKWQVNQLCENATG 153
Query: 228 TNPITGVSTRKSLGC 242
N TGV TR+SLGC
Sbjct: 154 VNSQTGVDTRRSLGC 168
>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
Length = 194
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 16/178 (8%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 32 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
N R A+FT A+++ + F + +GA LE A A +F A L+ ANL
Sbjct: 86 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 135
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G IEGAD +D ++ + LC A GTNP+TG T+++L C
Sbjct: 136 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 193
>gi|317969761|ref|ZP_07971151.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 160
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 66/131 (50%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D K V + +F A F ++RE+DF G+ GA L A AN GADLSD
Sbjct: 29 DYAKQVLIGHDFAGVDLHGATFNLTNLREADFHGADLRGASLYGAKLQDANLAGADLSDA 88
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D VL +L NAVL + +I+GADF++ + LC A+GTNP+
Sbjct: 89 TLDSAVLEGTDLRNAVLENAFAFNTRFKDVLIDGADFTNVPFRGDVLKTLCASASGTNPV 148
Query: 232 TGVSTRKSLGC 242
TG T+ +L C
Sbjct: 149 TGRVTKDTLEC 159
>gi|37523524|ref|NP_926901.1| hypothetical protein gll3955 [Gloeobacter violaceus PCC 7421]
gi|35214528|dbj|BAC91896.1| gll3955 [Gloeobacter violaceus PCC 7421]
Length = 159
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 79/175 (45%), Gaps = 17/175 (9%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
WR V LAA +V +SA AD+ + A L ++N
Sbjct: 2 WRSGVLAGLAAGLV--LPGLVSAQADIQN---------------NYNGAYLEGRSVAEQN 44
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
++A F A++R DFS S GA L A ANF A L D + L A L AV
Sbjct: 45 LKQAQFYKANLRGVDFSSSDLRGASLFAASLRGANFNKARLDDAELSNADLQGAKLDQAV 104
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L +T + L ++GADF+ +I+ QK C A GTN +T TR++LGC
Sbjct: 105 LAGAYMTAARLKDVSVDGADFTGTIINNQQKTYQCGRATGTNGLTKRQTRRTLGC 159
>gi|88770664|gb|ABD51935.1| chloroplast thylakoid 11 kDa protein [Guillardia theta]
Length = 242
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 2/139 (1%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
G +AA+ DLR+ ++ + A M++ DFS KF A + K A +A F G
Sbjct: 102 GQANAARDKLYDLRECPMAGKDATGFDLAGALMQKGDFSKVKFKDAVMSKVFADEATFDG 161
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK-- 223
AD S+ +MDR +++ A+ VL+ S+ G+ + +DFSD + + +CK
Sbjct: 162 ADFSNAVMDRGTWRKSSFKGAIFANAVLSGSEFEGSDLTDSDFSDTYMGDFDNKKICKNP 221
Query: 224 YANGTNPITGVSTRKSLGC 242
GTNP+TGV TR S C
Sbjct: 222 TLQGTNPVTGVDTRASASC 240
>gi|443477206|ref|ZP_21067069.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443017715|gb|ELS32099.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 167
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 68/131 (51%), Gaps = 20/131 (15%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
QF +DLR A V + + +F +A+M+E AN TGA+LS +
Sbjct: 56 QFNESDLRNASFVNADAQGVSFFAANMKE--------------------ANLTGANLSYS 95
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D L++ANLTNAV+ + + II+GADF+D + +Q LCK A G NP
Sbjct: 96 TLDNARLDKANLTNAVIEGSFAYGTSFNNVIIDGADFTDVDLRTPIRQKLCKSAKGQNPT 155
Query: 232 TGVSTRKSLGC 242
TG TR +L C
Sbjct: 156 TGRLTRDTLEC 166
>gi|352094203|ref|ZP_08955374.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351680543|gb|EHA63675.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 159
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 66/135 (48%), Gaps = 5/135 (3%)
Query: 113 FGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F + D K V + +F + F ++RE+D SGS GA L A AN + +
Sbjct: 24 FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNTN 83
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L D +D V + NLTNAVL + +EGADF++ + + LC A G
Sbjct: 84 LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRADALKVLCANAEG 143
Query: 228 TNPITGVSTRKSLGC 242
NP+TG T ++LGC
Sbjct: 144 VNPVTGRDTSETLGC 158
>gi|307154028|ref|YP_003889412.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984256|gb|ADN16137.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 172
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 64/117 (54%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R + F A++R S+FS + G + ANF GA+L ++ L N TN
Sbjct: 54 QDLRDSKFDHANLRSSNFSNANLEGVRFFASNLESANFEGANLRYADLESARLIRVNFTN 113
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL T + GAII+GADF+D ++ ++ LC A GTNP+TG T+ +L C
Sbjct: 114 AVLEGAFATNTLFKGAIIDGADFTDVLLRPDVEKYLCTIAKGTNPVTGRDTKDTLYC 170
>gi|428164857|gb|EKX33868.1| hypothetical protein GUITHDRAFT_155908 [Guillardia theta CCMP2712]
Length = 237
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 2/139 (1%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
G +AA+ DLR+ ++ + A M++ DFS KF A + K A +A F G
Sbjct: 97 GQANAARDKLYDLRECPMAGKDATGFDLAGALMQKGDFSKVKFKDAVMSKVFADEATFDG 156
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK-- 223
AD S+ +MDR +++ A+ VL+ S+ G+ + +DFSD + + +CK
Sbjct: 157 ADFSNAVMDRGTWRKSSFKGAIFANAVLSGSEFEGSDLTDSDFSDTYMGDFDNKKICKNP 216
Query: 224 YANGTNPITGVSTRKSLGC 242
GTNP+TGV TR S C
Sbjct: 217 TLQGTNPVTGVDTRASASC 235
>gi|86605126|ref|YP_473889.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86553668|gb|ABC98626.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 176
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 70/133 (52%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A + DL+ A ++ F A++R+SD S K GA L A KAN GADL
Sbjct: 43 AEDYSKRDLQGANFAGQDLSGWKFLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLR 102
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+D L A+L A L +++ + + G I+GADF++A+I LC+ A G N
Sbjct: 103 GATLDMANLQGADLREAQLQDSMMWLARVEGIQIDGADFTNALIRQDALSILCERATGVN 162
Query: 230 PITGVSTRKSLGC 242
P+TG +TR +L C
Sbjct: 163 PVTGRATRDTLEC 175
>gi|33861906|ref|NP_893467.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
gi|33640274|emb|CAE19809.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
Length = 192
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 72/136 (52%), Gaps = 20/136 (14%)
Query: 116 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADL+ +VK ++ AN A M + S F GA ++ +AY F AD SD
Sbjct: 63 ADLQNNEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
ANLTN L+++V GA I+GADF++A +DL +++LC+ A+GTN
Sbjct: 123 ----------ANLTNGELMKSVFD-----GATIDGADFTNANLDLKTRKSLCERASGTNS 167
Query: 231 ITGVSTRKSLGCGNSR 246
TGV T +SL C R
Sbjct: 168 QTGVDTFESLECSGLR 183
>gi|254423673|ref|ZP_05037391.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196191162|gb|EDX86126.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 190
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 67/133 (50%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F +LR+ ++ ++T AD+ E+D S + L +AN GA+L+
Sbjct: 57 ADNFDRMNLRQQDFSGQDLTDNDYTRADLTEADLSHTNLERVRLFTTRLNRANLEGANLT 116
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+D L ANL +AVL D G IEGADF+D ++D LC+ A GTN
Sbjct: 117 GATLDGASLVGANLKDAVLEGAYAINIDFRGIDIEGADFTDVLLDPKDNDKLCEIATGTN 176
Query: 230 PITGVSTRKSLGC 242
P TG T+++L C
Sbjct: 177 PTTGRKTKETLYC 189
>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 471
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 62/99 (62%), Gaps = 5/99 (5%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTL 172
L +A + + R+AN + A M +D SG+ GA LE AVA+KANFTGA+LSD L
Sbjct: 126 LHEANLCQADLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFTGANLSDGL 185
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+D+ L+E++L+NA L ++L +DL AI+ G S A
Sbjct: 186 LDQANLSESDLSNANLHNSILDETDLSKAILRGTTLSKA 224
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 10/112 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 165
A A L AV + + +A+ + A +RE++ +G+ +GA L KA + Y+A G
Sbjct: 59 ASLQGARLENAVLYRTSLFKADLSEASIREANMTGANLSGATLHKADLQRVILYRATLAG 118
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAV 212
A+L DT + L +A+L A L + +DL GA I+EG D DAV
Sbjct: 119 ANLFDTTLHEANLCQADLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAV 170
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 8/97 (8%)
Query: 132 NFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ AD+ +++FSG+ GA LE AV Y+ + ADLS+ + + ANL+ A
Sbjct: 40 DLMGADLSQTNFSGANLVRASLQGARLENAVLYRTSLFKADLSEASIREANMTGANLSGA 99
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
L + L R L A + GA+ D + A LC+
Sbjct: 100 TLHKADLQRVILYRATLAGANLFDTTLHEAN---LCQ 133
>gi|158337082|ref|YP_001518257.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307323|gb|ABW28940.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 175
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 60/112 (53%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+FT AD+R SDFS S G A N GA+LS +D ANLTNA L
Sbjct: 63 ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++ AII+GADF+D + + LC A GTNP+TG +TR +L C
Sbjct: 123 AFAFNTEFRRAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174
>gi|359460626|ref|ZP_09249189.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 175
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 60/112 (53%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+FT AD+R SDFS S G A N GA+LS +D ANLTNA L
Sbjct: 63 ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++ AII+GADF+D + + LC A GTNP+TG +TR +L C
Sbjct: 123 AFAFNAEFRKAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174
>gi|384252144|gb|EIE25621.1| hypothetical protein COCSUDRAFT_83628, partial [Coccomyxa
subellipsoidea C-169]
Length = 122
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 66/117 (56%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++FR AD+R ++FS + GA L A A F GA L++ ++ + A+L+
Sbjct: 5 KDFRGQKLYKADLRGTNFSKANMEGASLFGAFCKDAKFVGAHLNNADLESVDFENADLSE 64
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L +T + I G+D++D V+ +Q LCK A+GTNPITG TR++L C
Sbjct: 65 AILEGAQVTNAKFKNVNIAGSDWTDVVLRRDVQQQLCKIASGTNPITGQDTRETLIC 121
>gi|75908971|ref|YP_323267.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75702696|gb|ABA22372.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 164
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 96/190 (50%), Gaps = 39/190 (20%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K WRV S LA + A+ SS+I+ A + G+ IGS +F + D
Sbjct: 1 MKYWRVVASFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTL 172
L EN ANF++AD+R F+G+ G L +AY A F ADLSD
Sbjct: 59 L-------EN---ANFSNADLRGGVFNGTVLEGVNLHGVDFSNGIAYLARFKNADLSD-- 106
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
A LT+A+++R+V D + GADF++AV+D + + LC A+G N T
Sbjct: 107 --------AVLTDAMMLRSVFDNVD-----VSGADFTNAVLDGTEVKKLCVKASGVNSKT 153
Query: 233 GVSTRKSLGC 242
GV TR+SLGC
Sbjct: 154 GVDTRESLGC 163
>gi|427736970|ref|YP_007056514.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427372011|gb|AFY55967.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 164
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 20/116 (17%)
Query: 132 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF++ADMR + F+GS +G +AY +NF +DLSD + TNA
Sbjct: 63 NFSNADMRGAVFNGSLLENSNLHGVDFTDGIAYLSNFKDSDLSDAI----------FTNA 112
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+++RT+ D + GADFS A++D + + LC+ A+G N TGVSTR SL C
Sbjct: 113 MMLRTIFRNVD-----VTGADFSGAILDRVEVKKLCETASGVNSKTGVSTRASLEC 163
>gi|124023314|ref|YP_001017621.1| hypothetical protein P9303_16121 [Prochlorococcus marinus str. MIT
9303]
gi|123963600|gb|ABM78356.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 158
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 65/125 (52%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L A ++ R F A++RE++ SGS G+ L A + AN + +L D+ +D +
Sbjct: 33 LVNADFSNQDLRGDTFNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDSTLDSAI 92
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+ +LTNAVL + I GADF++ + LC+ A GTNPITG +T
Sbjct: 93 FDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPITGRNTA 152
Query: 238 KSLGC 242
SLGC
Sbjct: 153 DSLGC 157
>gi|428776639|ref|YP_007168426.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428690918|gb|AFZ44212.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 167
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 67/112 (59%), Gaps = 5/112 (4%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN ++AD+ ++D GS F + ++ A + ANFT T+++ + A+L+ +L
Sbjct: 60 ANLSAADLSDTDMRGSIFTASVMKDANLHGANFTF-----TVLNGVDFTNADLSQTILED 114
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+L+R+ I GADF++AV+D Q LC+ A+G N TG++TR SLGC
Sbjct: 115 AILSRATFENTDITGADFTNAVLDSRQIDQLCETASGVNEETGMATRDSLGC 166
>gi|72382760|ref|YP_292115.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|72002610|gb|AAZ58412.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
Length = 182
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 63/117 (53%), Gaps = 7/117 (5%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + + D+ D G+ F GAY + ++ TGA++++ + + ANLTN
Sbjct: 53 KDLQNTEYVKYDLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATRFDNANLTN 112
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L L +S G I+GADF+DAV+D +Q++ LCK A G ST +SLGC
Sbjct: 113 VNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STAESLGC 162
>gi|124026482|ref|YP_001015597.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL1A]
gi|123961550|gb|ABM76333.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
Length = 182
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 63/117 (53%), Gaps = 7/117 (5%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ + + D+ D G+ F GAY + ++ TGA++++ + + ANLTN
Sbjct: 53 KDLQNTEYVKYDLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATRFDNANLTN 112
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L L +S G I+GADF+DAV+D +Q++ LCK A G ST +SLGC
Sbjct: 113 VNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STAESLGC 162
>gi|86609869|ref|YP_478631.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558411|gb|ABD03368.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 176
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 69/133 (51%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A + DL+ ++ F A++R+SD S K GA L A KAN GADL
Sbjct: 43 AEDYTKRDLQGVSFAGQDLSGWKFLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLR 102
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+D L A+L A L +++ + + G I+GADF++A+I LC+ A G N
Sbjct: 103 GATLDMANLQGADLREAQLQDSMMWLARVEGIQIDGADFTNALIRQDALSILCERATGVN 162
Query: 230 PITGVSTRKSLGC 242
P+TG +TR +L C
Sbjct: 163 PVTGRATRDTLEC 175
>gi|186683889|ref|YP_001867085.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466341|gb|ACC82142.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 165
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 63/112 (56%), Gaps = 10/112 (8%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
ANF++AD+R G FNG LE N G D S+ + +A+L++AVL
Sbjct: 63 ANFSNADLR-----GGVFNGTLLEGV-----NLHGVDFSEGIAYLTRFKDADLSDAVLTD 112
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
++ RS + GADF++A++D Q + LC A+G N TGV TR+SLGC
Sbjct: 113 AMMLRSTFDDVNVTGADFTNAILDGTQVKKLCVKASGVNSKTGVDTRQSLGC 164
>gi|33862899|ref|NP_894459.1| hypothetical protein PMT0626 [Prochlorococcus marinus str. MIT
9313]
gi|33634815|emb|CAE20801.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 158
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 65/125 (52%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L A ++ R F A++RE++ SGS G+ L A + AN + +L D+ +D +
Sbjct: 33 LVNADFSNQDLRGDTFNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDSTLDSAI 92
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+ +LTNAVL + I GADF++ + LC+ A GTNPITG +T
Sbjct: 93 FDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPITGRNTA 152
Query: 238 KSLGC 242
+LGC
Sbjct: 153 DTLGC 157
>gi|282897571|ref|ZP_06305571.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281197494|gb|EFA72390.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 164
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 88/185 (47%), Gaps = 29/185 (15%)
Query: 65 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K W++FV L A A+ SS+I+ A + G+ +G
Sbjct: 1 MKYWQIFVGLVLTAVFFVSNLPAQAASSSSITRSAGSEIEIQDYSGKSLVG--------- 51
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
++ ++K ANF++AD+R G FNG L AN G + SD +
Sbjct: 52 -KEFTNIK--LENANFSNADLR-----GVVFNGTLL-----IDANLHGVNFSDGISYLSN 98
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+NL++A+ ++ RS + GADF++A++D + + LC A+G N TGV TR
Sbjct: 99 FKNSNLSDAIFTNAMMLRSTFNNVDVTGADFTNAILDGVEVKKLCANASGVNSQTGVDTR 158
Query: 238 KSLGC 242
KSLGC
Sbjct: 159 KSLGC 163
>gi|148241708|ref|YP_001226865.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850018|emb|CAK27512.1| Secreted pentapeptide repeats protein [Synechococcus sp. RCC307]
Length = 156
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 61/117 (52%), Gaps = 7/117 (5%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ ++F A +R +DFSG+K +GA + +NF GADLSD LMDR NL+
Sbjct: 46 QDLEGSSFAGAVVRNADFSGAKLHGAIFTQGAFAGSNFAGADLSDVLMDRADFTGTNLSG 105
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L V S A IEGADF+ A++D + LC+ A G TR SL C
Sbjct: 106 TNLSGVVANGSSFAKAEIEGADFTGALLDRDDQITLCRKAKG-------ETRLSLDC 155
>gi|427719897|ref|YP_007067891.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427352333|gb|AFY35057.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 165
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 63/117 (53%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ +A FTS +++ ++FS + G + N GAD S+ + +L++
Sbjct: 48 QSLIQAEFTSVNLKNTNFSNADLRGGVFNSTLLEGVNLHGADFSEGIAYLARFKNTDLSD 107
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+L ++ RS I GADF++AV+D Q + LC A+G N TG TR+SLGC
Sbjct: 108 AILTDAMMLRSTFDDVDITGADFTNAVLDGVQIKKLCVNASGVNSKTGTDTRESLGC 164
>gi|428306980|ref|YP_007143805.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428248515|gb|AFZ14295.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 160
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 66/139 (47%), Gaps = 20/139 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 163
S F L + V+ N NF +AD+R F+GS G+ L A +AY A+F
Sbjct: 36 SGKDFSGQTLISSEFVEANLDNTNFNNADIRGVVFNGSTLKGSSLHSADFTNGLAYAADF 95
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ ADLSD AV ++L +S I G DFS V+D + LC
Sbjct: 96 SNADLSD---------------AVFSESILLKSRFDEVNINGTDFSGVVLDGTNVKKLCD 140
Query: 224 YANGTNPITGVSTRKSLGC 242
A+G N TGV+TR SLGC
Sbjct: 141 VADGVNSKTGVATRASLGC 159
>gi|332706397|ref|ZP_08426459.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354834|gb|EGJ34312.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 126
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 66/118 (55%), Gaps = 1/118 (0%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E+ ++ T ++ +++ + +K A ++ AN GADL+ + + N+A+LT+
Sbjct: 8 EDLQKVKITYCNLDQANLADAKLIQASIKHTTLNNANLHGADLTKSDTYNISFNDADLTD 67
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVID-LAQKQALCKYANGTNPITGVSTRKSLGC 242
+ +L R+ GA I GADF+ +I + ++ LC A+G NP TGV TR SLGC
Sbjct: 68 VIFTGALLQRASFDGADITGADFTSTLIQPVRERLKLCDVASGVNPTTGVVTRDSLGC 125
>gi|87124267|ref|ZP_01080116.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
gi|86167839|gb|EAQ69097.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
Length = 183
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 66/131 (50%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D K V + +F + F ++RE+D SGS GA L A A+ + +L D
Sbjct: 53 DYAKQVLIGADFSGREMQGVTFNLTNLREADLSGSDLQGASLFGAKLQDADLSNTNLRDA 112
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D VL+ NL+NAVL + I GADF++ + + LC A GTNP+
Sbjct: 113 TLDSAVLDGTNLSNAVLEDAFAFNTRFINVTISGADFTNVPLRGDVLKTLCAVAEGTNPV 172
Query: 232 TGVSTRKSLGC 242
TG +TR +LGC
Sbjct: 173 TGRNTRDTLGC 183
>gi|440680470|ref|YP_007155265.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428677589|gb|AFZ56355.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 168
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 66/132 (50%), Gaps = 30/132 (22%)
Query: 126 ENFRRANFTSADMRESDFS-----GSKFNGAYLE----------KAVAYKANFTGADLSD 170
+N FTS + ++FS G FNGA LE + +AY A F D SD
Sbjct: 51 QNLAGTEFTSVKLENTNFSNADLRGGVFNGALLEGVNLHGVDFRQGIAYLARFKNTDFSD 110
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
A LT+A+++RT D + GADF++A++D+ Q + LC A G N
Sbjct: 111 ----------AVLTDAMMLRTTFDDVD-----VTGADFTNAILDMTQVKKLCVNARGVNS 155
Query: 231 ITGVSTRKSLGC 242
TGV TR+SLGC
Sbjct: 156 QTGVDTRESLGC 167
>gi|148239424|ref|YP_001224811.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147847963|emb|CAK23514.1| Secreted pentapeptide repeat protein [Synechococcus sp. WH 7803]
Length = 158
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 5/143 (3%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
FG+ + + D K V + +F + F ++RE+D SGS GA L A
Sbjct: 16 FGLLLPSAEAAMDYAKQVLIGADFSNRDMQGVTFNLTNLREADLSGSDLQGASLYGAKLQ 75
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
AN + +L D +D VLN +LT+AVL + I GADF++ + +
Sbjct: 76 DANLSRTNLRDATLDSAVLNGTDLTDAVLEDAFAFNTRFIDVTISGADFTNVPLRGDVLK 135
Query: 220 ALCKYANGTNPITGVSTRKSLGC 242
LC A GTNP+TG TR +LGC
Sbjct: 136 TLCAAAEGTNPVTGRDTRDTLGC 158
>gi|33863821|ref|NP_895381.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9313]
gi|33635404|emb|CAE21729.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9313]
Length = 209
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 67/121 (55%), Gaps = 6/121 (4%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F + + D+ E+D GS F+ L+ A N G +L D L + A+L+ ++
Sbjct: 88 FVKYDLAGYDLSEADLRGSTFSVTSLKNA-----NLHGTNLEDVLAYATRFDNADLSESI 142
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNSR 246
L L +S+ GA+I+GADF++A++D +++ALC A G N TGV T SL C G S
Sbjct: 143 LRNANLRKSEFAGALIDGADFTNALLDKQEQKALCARATGKNSKTGVDTYSSLDCSGISE 202
Query: 247 R 247
R
Sbjct: 203 R 203
>gi|428180855|gb|EKX49721.1| hypothetical protein GUITHDRAFT_135885 [Guillardia theta CCMP2712]
Length = 244
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 106/235 (45%), Gaps = 34/235 (14%)
Query: 20 SSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAA 79
S KGP+ L + KP+ + + + E+D D + VS AL +A
Sbjct: 28 SLKGPHALSGM-KPVTRSHPAAVRMEADADAFDAK--------------KFAVSLALGSA 72
Query: 80 VVASCSSNISALADLNKYEAETRGEFGI--GSAAQFGSAD----LRKAVHVKENFRRAN- 132
++ S I A A + G F + G+A+ S R A+ NF N
Sbjct: 73 LLFSSGMPIPAFA-------QQGGSFKVLKGAASTQDSGSRRTITRGALLEGSNFDGQNL 125
Query: 133 ----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
F + R+ F G+ GA AN GAD+S+ + + +ANL NA++
Sbjct: 126 PGISFQQSLCRDCSFVGTNLKGASFFDGDLTNANMEGADVSNVNFELTCMKDANLKNAIV 185
Query: 189 VRT-VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ + + L G IEGADF+D + Q++ LCK A+GTNP TGV T+ SL C
Sbjct: 186 NNAYIQSTTKLDGINIEGADFTDTELRKDQQRYLCKRASGTNPKTGVDTKDSLRC 240
>gi|224098455|ref|XP_002311180.1| predicted protein [Populus trichocarpa]
gi|222851000|gb|EEE88547.1| predicted protein [Populus trichocarpa]
Length = 218
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/215 (31%), Positives = 105/215 (48%), Gaps = 29/215 (13%)
Query: 40 ISSKTESDGQFPDCSNNQCAGPYAKLKNWRV---FVSTALAAAVVASCSSNISALA--DL 94
I+ + S P S + C P A + N ++ F T A + S ALA
Sbjct: 20 ITKPSLSIPHLPSLSFSHCDKPQALIPNKQLVEDFAKTGFLAILSVSLFFTDPALAFKGG 79
Query: 95 NKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
Y +E TRG+ G D +K++F+ ++ +R+++F G+K GA
Sbjct: 80 GPYGSEVTRGQDLTGK-------DFSGRTLIKQDFK-----TSILRQANFKGAKLLGASF 127
Query: 154 EKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGAD 207
+ A+ TGADLSD + D + N +ANL+NA L + T + G+ I GAD
Sbjct: 128 -----FDADLTGADLSDADLRSADFSLTNVTKANLSNANLEGALATGNTSFRGSNITGAD 182
Query: 208 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
F+D + Q++ LCK+A+G NP TG +TR +L C
Sbjct: 183 FTDVPLREDQREYLCKFADGVNPTTGNATRDTLLC 217
>gi|409993003|ref|ZP_11276163.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409936150|gb|EKN77654.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 162
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 50 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 109
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 110 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 161
>gi|88808450|ref|ZP_01123960.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
gi|88787438|gb|EAR18595.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
Length = 159
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 58/110 (52%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F ++RE+D SGS GA L A AN + +L D +D VLN +LT+AVL
Sbjct: 50 FNLTNLREADLSGSDLQGASLYGAKLQDANLSRTNLRDATLDSAVLNGTDLTDAVLEDAF 109
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ I GADF++ + + LC A GTNP+TG TR +LGC
Sbjct: 110 AFNTRFIDVTISGADFTNVPLRGDVLKTLCAAAEGTNPVTGRDTRDTLGC 159
>gi|87302765|ref|ZP_01085576.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
gi|87282648|gb|EAQ74606.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
Length = 168
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADLR N R AN + ADMR + G+K A+ G DL +
Sbjct: 46 ADFHDADLRGVTFNLTNLRDANLSGADMRNASLFGAKLQ----------DADMHGVDLRE 95
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D VL +L AVL + IEGADF++ + +LC A+GTNP
Sbjct: 96 ATLDSAVLEGTDLREAVLEDAFAFNTKFVDVAIEGADFTNVPLRGDVLTSLCAIASGTNP 155
Query: 231 ITGVSTRKSLGC 242
+TG TR +LGC
Sbjct: 156 VTGRVTRDTLGC 167
>gi|291569983|dbj|BAI92255.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 170
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 58 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 117
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 118 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 169
>gi|124022089|ref|YP_001016396.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9303]
gi|123962375|gb|ABM77131.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9303]
Length = 202
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 67/121 (55%), Gaps = 6/121 (4%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F + + D+ E+D GS F+ L+ A N G +L D L + A+L+ ++
Sbjct: 81 FVKYDLAGYDLSEADLRGSTFSVTTLKNA-----NLHGTNLEDVLAYATRFDNADLSESI 135
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNSR 246
L L +S+ GA+I+GADF++A++D +++ALC A G N TGV T SL C G S
Sbjct: 136 LRNANLRKSEFAGALIDGADFTNALLDRQEQKALCARATGKNSKTGVDTYTSLDCSGISE 195
Query: 247 R 247
R
Sbjct: 196 R 196
>gi|411119230|ref|ZP_11391610.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711093|gb|EKQ68600.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 192
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 62/112 (55%), Gaps = 4/112 (3%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
FT A++RES+F G+ +G A AN GADL + +D L+ +NL NA L
Sbjct: 82 FTKANLRESNFRGADLHGVSFFGANLEGANLEGADLRNATLDTARLSRSNLKNANLEGAF 141
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGTNPITGVSTRKSLGC 242
+ GA I+GADF+ +D+ Q + ALC A GTNP T +TR +L C
Sbjct: 142 AFNAKFDGATIDGADFTG--VDMRQDVQHALCDRAAGTNPTTKRNTRDTLNC 191
>gi|209525582|ref|ZP_03274120.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423065234|ref|ZP_17054024.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493915|gb|EDZ94232.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406713366|gb|EKD08537.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 177
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 62/112 (55%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A F ++++ ++F S+ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 65 AEFANSNLEYANFDESELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176
>gi|119511352|ref|ZP_01630465.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119463974|gb|EAW44898.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 164
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 20/116 (17%)
Query: 132 NFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF++AD R F+GS+ G L +AY F GADL+D A TNA
Sbjct: 63 NFSNADFRGGVFNGSRLEGVNLHGVDFSDGIAYLTQFKGADLTD----------AVFTNA 112
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+++R+V D I GADF++A++D Q + LC A+G N TG TR+SL C
Sbjct: 113 MMLRSVFDDVD-----ITGADFTNAILDGTQIKKLCTQASGVNSQTGADTRESLEC 163
>gi|224112717|ref|XP_002316270.1| predicted protein [Populus trichocarpa]
gi|222865310|gb|EEF02441.1| predicted protein [Populus trichocarpa]
Length = 219
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 92/208 (44%), Gaps = 33/208 (15%)
Query: 49 QFPDCSNNQCAGPYAKLKNWRV---FVSTALAAAVVASCSSNISALADLNKYEAETRGEF 105
+F S+++C P A + N ++ F T L A + S ALA
Sbjct: 30 RFLSLSHSRCPNPQALILNKQLLEDFAKTGLLALLSVSLFFTDPALA------------- 76
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
+GS R ++F T D + S + F GA L A + A+ TG
Sbjct: 77 -FKGGGPYGSEVTRGQDLTGKDFSGRTLTKQDFKTSILRQANFKGAKLLGASFFDADLTG 135
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----------IEGADFSDAVID 214
ADLSD L A+L+ A + + L+ ++L GA+ I GADF+D +
Sbjct: 136 ADLSDA-----DLRSADLSLANVAKVNLSNANLEGALATGNTSFRGSNITGADFTDVPLR 190
Query: 215 LAQKQALCKYANGTNPITGVSTRKSLGC 242
Q++ LCK A+G NP TG +TR +L C
Sbjct: 191 EDQREYLCKVADGVNPTTGNATRDTLLC 218
>gi|33240260|ref|NP_875202.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33237787|gb|AAP99854.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 158
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 69/139 (49%), Gaps = 5/139 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ + F S D K V E+F + A F +++++D SGS GA L A +N
Sbjct: 19 TQSSFASIDYGKQTLVGEDFSKLDLKGATFYLTNLQDADLSGSDLEGASLFGAKLLNSNL 78
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ A+L + +D V NL NAVL + + I+G+DF++ ++ LC
Sbjct: 79 SNANLHNATLDSAVFEGTNLENAVLEDAFVFNARFSDVNIQGSDFTNVILRNQDLSYLCS 138
Query: 224 YANGTNPITGVSTRKSLGC 242
ANGTNP+T T+ +L C
Sbjct: 139 IANGTNPVTKRKTKDTLQC 157
>gi|427710138|ref|YP_007052515.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427362643|gb|AFY45365.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 164
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 10/116 (8%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ ANF++AD+R G FNG LE N G D S+ + A+L++A
Sbjct: 58 DLENANFSNADLR-----GGVFNGIVLEGV-----NMHGVDFSNGIAYLARFKNADLSDA 107
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
VL ++ RS I GADF++AV+D Q + LC A+G N T V TR+SLGC
Sbjct: 108 VLTDAMMLRSTFDNVEITGADFTNAVLDGTQVKKLCAKASGVNSKTSVDTRESLGC 163
>gi|224006618|ref|XP_002292269.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971911|gb|EED90244.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 255
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 69/121 (57%), Gaps = 4/121 (3%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
+N + F + +R+SDFS S GA A +NF AD++ ++ N ANL N
Sbjct: 133 QNLKGVAFQQSIVRDSDFSNSNLYGASFFDATLDGSNFENADMTLCNVEMAQFNRANLKN 192
Query: 186 AVLVRTVLTRSDL--GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLG 241
A++ ++ + L G IEG+D+S+ + Q++ LC + A GTNP+TGV+TR+SL
Sbjct: 193 AIVKDMYVSGATLFEGVKDIEGSDWSETQLRKDQQKYLCNHPTAKGTNPVTGVNTRESLM 252
Query: 242 C 242
C
Sbjct: 253 C 253
>gi|332705869|ref|ZP_08425945.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332355661|gb|EGJ35125.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 150
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 65/129 (50%), Gaps = 7/129 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A + A +K F + + D+ DF + F+ L+ A NF GA+++ +
Sbjct: 26 AQAQSAATIKATFANTDLSGQDLSGQDFHNAVFSSVNLQSANLSNVNFKGANIT-----K 80
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI--TG 233
+ ANL A + + GA I GADF+ A++D Q + LCK A+ TNPI TG
Sbjct: 81 VNFTNANLQGADFSYAFINVCNFKGANITGADFTFAILDSKQYRELCKNASATNPITDTG 140
Query: 234 VSTRKSLGC 242
V TR SLGC
Sbjct: 141 VDTRYSLGC 149
>gi|376005445|ref|ZP_09782948.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
gi|375326159|emb|CCE18701.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
Length = 177
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 65 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176
>gi|427714384|ref|YP_007063008.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427378513|gb|AFY62465.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 177
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 64/131 (48%), Gaps = 10/131 (7%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
F DLR + K N +N + D+R G F A LE A + TGADL
Sbjct: 54 DFSGKDLRDSEFTKANLFHSNLSHTDLR-----GVSFFAANLETA-----DLTGADLRVA 103
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D +ANLT+A L + GAII+GADF+D + ++ LC A G NP+
Sbjct: 104 TLDTARFTKANLTDANLEGAFAFNTIFDGAIIDGADFTDVDLRPDARKMLCSVAKGVNPV 163
Query: 232 TGVSTRKSLGC 242
TG +T +L C
Sbjct: 164 TGRATHDTLEC 174
>gi|414077638|ref|YP_006996956.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413971054|gb|AFW95143.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 165
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 20/116 (17%)
Query: 132 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF +AD+R + F+G+ F+G + +AY + F +DLSD A T A
Sbjct: 64 NFNNADLRGAVFNGTLLDTVNFHGVDFSQGIAYLSRFKNSDLSD----------AVFTEA 113
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+++R+ + D + GADF++A++D+ Q + +C A+G N TGV TR SLGC
Sbjct: 114 MMLRSTFDQVD-----VTGADFTNAILDMIQIKKICINASGVNSKTGVDTRASLGC 164
>gi|148242416|ref|YP_001227573.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850726|emb|CAK28220.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
Length = 162
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 63/131 (48%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
D K V + +F + F ++RE+D SGS A L A AN +G+DL +
Sbjct: 31 DYAKQVLIGADFSSRDLKGVTFNLTNLREADLSGSDLRAASLFGAKLQDANLSGSDLREA 90
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+D V N +L++A L + G I GADFSD + LC A GTN +
Sbjct: 91 TLDSAVFNGTDLSDARLEGAFAFNTRFSGVTITGADFSDVPLRGDALSTLCAVAEGTNSV 150
Query: 232 TGVSTRKSLGC 242
TG TR +LGC
Sbjct: 151 TGRDTRDTLGC 161
>gi|427701765|ref|YP_007044987.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427344933|gb|AFY27646.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 175
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 65/132 (49%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADLR F ++R++D SG+ GA L A A+ +G+DL D
Sbjct: 53 ADFHGADLRGVT----------FNLTNLRDADLSGADLRGASLFGAKLQDADLSGSDLRD 102
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D V +L NA L + G +I+GADF++ + +LC A+GTNP
Sbjct: 103 ATLDSAVFEGTDLRNARLDDAFAFNTKFRGVLIDGADFTNVPLRGDALTSLCAAASGTNP 162
Query: 231 ITGVSTRKSLGC 242
+TG TR +L C
Sbjct: 163 VTGRLTRDTLNC 174
>gi|282900932|ref|ZP_06308865.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281194023|gb|EFA68987.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 164
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 53/183 (28%), Positives = 85/183 (46%), Gaps = 25/183 (13%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
+K W++FV L A S S +A + A ++ E + L
Sbjct: 1 MKYWQIFVGLVLTAVFFVSNLSAQAASSSSITRSAGSKIEI-----QDYSGKSLVGKEFT 55
Query: 125 KENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
ANF++AD+R F+G+ +G ++Y +NF ++LSD +
Sbjct: 56 NIKLENANFSNADLRGVVFNGTLLIDTNLHGVNFSDGISYLSNFKNSNLSDAI------- 108
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
TNA+++R+ D I GADF++A++D + + LC A+G N TGV TR+S
Sbjct: 109 ---FTNAMMLRSTFNNVD-----ITGADFTNAILDGVEVKKLCADASGVNSQTGVDTRES 160
Query: 240 LGC 242
LGC
Sbjct: 161 LGC 163
>gi|449018747|dbj|BAM82149.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 269
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 64/115 (55%), Gaps = 2/115 (1%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
+ +F+ + R+++FSGS +GA KA +ANF A L +++ VL +N NAVL
Sbjct: 153 QKDFSGSTCRKTNFSGSDLSGARFFKADLTEANFENAQLIGASLEQTVLRGSNFQNAVLR 212
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 242
T T S L A IE D++DA+++ + LC A G N +T TR+SL C
Sbjct: 213 STYWTESVLTIANIENTDWTDALLEPTWQMKLCSRSDAKGMNTLTNTDTRESLMC 267
>gi|427723472|ref|YP_007070749.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427355192|gb|AFY37915.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 170
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 66/134 (49%), Gaps = 6/134 (4%)
Query: 115 SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
+ D K ++E+F R ++ + +R SDFS G A NF GAD+
Sbjct: 36 AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGSDFSYCDLRGVRFFSANLEFVNFEGADMR 95
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 228
++D + AN TNA L L + +I+GADF+DA+I + LC A GT
Sbjct: 96 GAVLDSARIGHANFTNANLEGAYLASVKITPSTVIDGADFTDALILKNENDKLCDLATGT 155
Query: 229 NPITGVSTRKSLGC 242
NP TGV T +SL C
Sbjct: 156 NPDTGVDTAESLYC 169
>gi|302756827|ref|XP_002961837.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
gi|300170496|gb|EFJ37097.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
Length = 180
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 71/136 (52%), Gaps = 6/136 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+GS R ++F + T D + S + F GA L A + AN TGAD SD
Sbjct: 44 YGSEVTRGQDLSGKDFSGRDLTKQDFKTSILRQANFKGAKLFGASFFDANLTGADFSDAD 103
Query: 173 MDRMVLN-----EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ L+ +AN TNA L ++T + L GA I GADF+D + Q+ LC+ A+
Sbjct: 104 LRGADLSLADATKANFTNANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIAD 163
Query: 227 GTNPITGVSTRKSLGC 242
G NP+T STR++L C
Sbjct: 164 GINPVTSNSTRETLLC 179
>gi|302837694|ref|XP_002950406.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
nagariensis]
gi|300264411|gb|EFJ48607.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
nagariensis]
Length = 182
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 66/117 (56%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R+ T A++R+++F+ + G L +++ A F GA+L + ++ A+ TN
Sbjct: 65 KDLRKLKLTKANLRQTNFTDANLEGVSLFGSLSESAIFRGANLRNADLESGNYEFADFTN 124
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AVL + + I G+D++D V+ ++ LC A+G NP TGVSTR+SL C
Sbjct: 125 AVLEGAFVNNAQFVKVTITGSDWTDVVLRKDVQKELCAIADGVNPTTGVSTRESLLC 181
>gi|159467845|ref|XP_001692102.1| hypothetical protein CHLREDRAFT_115715 [Chlamydomonas reinhardtii]
gi|158278829|gb|EDP04592.1| predicted protein [Chlamydomonas reinhardtii]
Length = 124
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 65/131 (49%), Gaps = 20/131 (15%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDT 171
DLRK K N R+ N T A++ GS F GA L A N+ AD SD
Sbjct: 8 DLRKLKLTKANLRQTNLTGANLEGVSLFGSLSEGAVFKGANLRNADLESGNYEDADFSDA 67
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+++ +N NA VR I+G+D++D V+ ++ALC A+G NP
Sbjct: 68 ILEGAFVN-----NAQFVRVN----------IKGSDWTDVVLRKDIQKALCAIADGVNPT 112
Query: 232 TGVSTRKSLGC 242
TGVSTR+SL C
Sbjct: 113 TGVSTRESLMC 123
>gi|255570589|ref|XP_002526251.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
[Ricinus communis]
gi|223534416|gb|EEF36120.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
[Ricinus communis]
Length = 228
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 109/245 (44%), Gaps = 23/245 (9%)
Query: 3 LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGP 61
+++IS PLS++SL SS + + + L P+ + C S+ F + + C
Sbjct: 1 MATISFPLSVRSL----SSERSRFPVPQLHPPIKIICSGSADGSKSKPFKELQSVACG-- 54
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGSADLR 119
L W V +A+ V + S + L+ + N+ E G G + DLR
Sbjct: 55 --LLAAWAV-----TSASPVIAASQRLPPLSTEPNRCEKAFVGNTIGQANGVYDKPIDLR 107
Query: 120 KAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
+ E N + + +A M ++ F G+ + + KA A A+F G D S+ ++DR+
Sbjct: 108 FCDYTNEKSNLKGKSLAAALMSDAKFDGADMSEVVMSKAYAVGASFKGVDFSNAVLDRVN 167
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+ANL AV TVL+ S A + A F D +I Q LCK N + R
Sbjct: 168 FGKANLQGAVFKNTVLSGSTFDEAQLADAVFEDTIIGYIDLQKLCK-----NTSINLEGR 222
Query: 238 KSLGC 242
+ LGC
Sbjct: 223 EILGC 227
>gi|302798106|ref|XP_002980813.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
gi|300151352|gb|EFJ17998.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
Length = 180
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 71/136 (52%), Gaps = 6/136 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+GS R ++F + T D + S + F GA L A + AN TGAD SD
Sbjct: 44 YGSEVTRGQDLSGKDFSGRDLTKQDFKTSILRQANFKGAKLFGASFFDANLTGADFSDAD 103
Query: 173 MDRMVLN-----EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ L+ +AN TNA L ++T + L GA I GADF+D + Q+ LC+ A+
Sbjct: 104 LRGADLSLADATKANFTNANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIAD 163
Query: 227 GTNPITGVSTRKSLGC 242
G NP+T STR++L C
Sbjct: 164 GINPVTSNSTRETLLC 179
>gi|298715141|emb|CBJ27829.1| Thylakoid lumenal 15 kDa protein, chloroplast precursor (p15)
[Ectocarpus siliculosus]
Length = 245
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 71/146 (48%), Gaps = 13/146 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA G RK V N A++ D+ F S G + A A F ADLS
Sbjct: 99 AASTGDKGARKTVTRGVNIENADYHDKDLSSVSFQQSLVRGTNFKNAKLVAAGFFDADLS 158
Query: 170 D-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGG------AIIEGADFSDAVIDLAQK 218
+ M++ L ANL+ A + ++T + + G AIIEGADF+D + Q
Sbjct: 159 NCNFESANMNQANLELANLSGANMKNALVTEAYVSGATKMEPAIIEGADFTDTFLRKDQV 218
Query: 219 QALC--KYANGTNPITGVSTRKSLGC 242
+ LC + A GTNP++GV TR SLGC
Sbjct: 219 RYLCGLETAKGTNPVSGVDTRDSLGC 244
>gi|449016903|dbj|BAM80305.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 341
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 68/128 (53%), Gaps = 11/128 (8%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA---------- 181
+ +S D+ + +G+ +GA L A ++ +GA+L D +L+EA
Sbjct: 211 DLSSVDLSTAALAGADLHGAALSHANLFQVQLSGANLRGAKFDASILDEAALDGADLSGA 270
Query: 182 NLTNAVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
+L A++ RT+L + L I I+GADFS A+ID ++ LC+ A G N TGV+T SL
Sbjct: 271 DLRQALVRRTLLLGARLDANISIDGADFSGALIDRTNQRLLCELAQGVNSRTGVATATSL 330
Query: 241 GCGNSRRN 248
C + N
Sbjct: 331 ACPEPKTN 338
>gi|170079322|ref|YP_001735960.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169886991|gb|ACB00705.1| Pentapeptide repeat containing protein [Synechococcus sp. PCC 7002]
Length = 166
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 65/134 (48%), Gaps = 6/134 (4%)
Query: 115 SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
+ D K ++E+F R ++ + +R DFS S G A NF GADL
Sbjct: 32 AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGCDFSYSDLRGVRFFSANLEFVNFEGADLR 91
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 228
++D + AN NA L L + +IEGADF+DA+I + LC+ A+GT
Sbjct: 92 GAVLDSARIGHANFKNANLEGAFLASVKITPSTVIEGADFTDALILARENDKLCELASGT 151
Query: 229 NPITGVSTRKSLGC 242
NP TG T +L C
Sbjct: 152 NPTTGRDTAATLYC 165
>gi|225449424|ref|XP_002282933.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic [Vitis
vinifera]
gi|296086195|emb|CBI31636.3| unnamed protein product [Vitis vinifera]
Length = 221
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 69/123 (56%), Gaps = 6/123 (4%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 180
++F + D + S + F GA L A + A+ TGADLSD + D + N +
Sbjct: 98 KDFSGKSLIKQDFKTSILRQANFKGANLLGASFFDADLTGADLSDADLRGADFSLANVTK 157
Query: 181 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
ANL+NA L + T + G+II GADF+D + Q++ LCK A+G NP TG +TR++
Sbjct: 158 ANLSNANLEGALATGNTSFRGSIITGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 217
Query: 240 LGC 242
L C
Sbjct: 218 LLC 220
>gi|397595313|gb|EJK56448.1| hypothetical protein THAOC_23663 [Thalassiosira oceanica]
Length = 238
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 69/145 (47%), Gaps = 2/145 (1%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ G +AA+ DLR+ N + + + M ++D S + F A K +NF
Sbjct: 94 KMGQANAARDKLYDLRECKLSGVNGQEFDLSGVIMSKTDLSKANFREAQFSKGYLRDSNF 153
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
AD ++ ++DR ++L + VLT + GA +E ADF+DA I + LCK
Sbjct: 154 EEADFTNAIVDRATFKGSSLKGTIFSNAVLTATSFEGADVENADFTDAYIGDFDIRNLCK 213
Query: 224 YAN--GTNPITGVSTRKSLGCGNSR 246
G NP+TG TR S CG R
Sbjct: 214 NPTLKGENPLTGADTRLSANCGPGR 238
>gi|413968546|gb|AFW90610.1| chloroplast thylakoid lumenal 17.4 kDa protein [Solanum tuberosum]
Length = 228
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 101/231 (43%), Gaps = 25/231 (10%)
Query: 3 LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGP 61
++SIS PL+ KS + S P QLH+ P+ + C S DCSN++ +
Sbjct: 1 MASISIPLAYKSHSLRRSPIYRPSQLHS---PIQIKCSASK---------DCSNSEESS- 47
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISA-------LADLNKYEAETRGE-FGIGSAAQF 113
+ K R LA ++S S I+A D N+ E G G +
Sbjct: 48 -TQFKQLRNVACGFLAVWALSSVSPVIAAGQRLPPLSTDPNRCERAFVGSTIGQANGVYD 106
Query: 114 GSADLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
DLR + E N + + +A M ++ F G+ + KA A A+F D S+
Sbjct: 107 KPLDLRFCDYTNEKTNLKGKSLAAALMSDAKFDGADMTEVIMSKAYAVGASFKAMDFSNA 166
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
++DR+ +ANL A TVL+ S A ++G DF D +I Q +C
Sbjct: 167 VLDRVNFEKANLQGASFKNTVLSGSTFNDAQLDGVDFEDTIIGYIDLQKIC 217
>gi|255073547|ref|XP_002500448.1| predicted protein [Micromonas sp. RCC299]
gi|226515711|gb|ACO61706.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 66/128 (51%), Gaps = 5/128 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR+ +V ++ + A M ++ F G+ + KA A A+FTGA+ ++ ++DR+
Sbjct: 93 DLRQCNYVDKDLSTKTLSGALMVDATFKGANMTEVVMSKAYAVNADFTGANFTNAVVDRV 152
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+ ANL+NA V+T + G + GA F +A+I + LC+ NP T
Sbjct: 153 TFDGANLSNANFFNAVITGATFEGTNLAGAQFDEALIGKEDVKKLCE-----NPTLVEET 207
Query: 237 RKSLGCGN 244
R +GC N
Sbjct: 208 RFQVGCRN 215
>gi|428166498|gb|EKX35473.1| hypothetical protein GUITHDRAFT_97823, partial [Guillardia theta
CCMP2712]
Length = 230
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 63/119 (52%), Gaps = 2/119 (1%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++F + +F+ +E+ F G+K G KA A+FTGADLS ++ L+ L N
Sbjct: 112 KDFSKKDFSGCAAKEAKFVGTKLRGTRFFKADLTGADFTGADLSTASLEDAKLDGVVLKN 171
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 242
A+L + I GADF+DA++ LCK A GTNP+T TR+SLGC
Sbjct: 172 AILSNSYTNLGLDKVKDISGADFTDALVRPDILAKLCKRSDATGTNPVTKADTRESLGC 230
>gi|298492954|ref|YP_003723131.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298234872|gb|ADI66008.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 164
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 64/120 (53%), Gaps = 20/120 (16%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEAN 182
+ NF+++D+R F+G+ G L + +AY F AD SD +
Sbjct: 59 LQNTNFSNSDLRGGVFNGTLLEGVNLHGVDFSQGIAYLVKFNNADFSDAI---------- 108
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
LT+A+++R+V D + GADF++A++D + + LC A+G N T V TR+SLGC
Sbjct: 109 LTDAMMLRSVFDNVD-----VTGADFTNAILDGVEIKKLCLKASGVNSKTAVDTRESLGC 163
>gi|219116308|ref|XP_002178949.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409716|gb|EEC49647.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 131
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 66/124 (53%), Gaps = 4/124 (3%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K+N + F + +R DFS S GA A +NF ++L + ++ L +
Sbjct: 8 KQNLKGVAFQQSIVRNCDFSNSDLRGASFFDATLTDSNFENSNLENVNLEMAQLTRVSFK 67
Query: 185 NAVLVRTVLTRSDL--GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSL 240
NAV+ ++ + + G +EG+D+S+ + QK+ LC + A GTNP+TGV TR+SL
Sbjct: 68 NAVVTDAYVSGATIFDGVKDVEGSDWSETYLRADQKKLLCNHPTAKGTNPVTGVDTRESL 127
Query: 241 GCGN 244
C N
Sbjct: 128 MCPN 131
>gi|255645177|gb|ACU23086.1| unknown [Glycine max]
Length = 222
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 108 KQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTKANLS 162
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L ++T + G+ + GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 163 NANLEGALVTGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDTLFC 221
>gi|217071608|gb|ACJ84164.1| unknown [Medicago truncatula]
Length = 240
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 26/242 (10%)
Query: 13 SLNFCSSSSKGP-YQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVF 71
SL+ + S+K P + AL P + C + + E DG N K+K
Sbjct: 12 SLSIRNFSTKRPCFTTSAL--PFTITCSVVGEAELDGT----ENKPRLLSLNKIKGVACG 65
Query: 72 VSTALAAAVVASCSSNISALA--------DLNKYEAETRGE-FGIGSAAQFGSADLRKA- 121
+ LAA V S S ++A D N+ E G G + + DLRK
Sbjct: 66 I---LAAYAVTSASFPVTAATQRLPPLSTDPNRCERAFVGNTIGQANGVYDKALDLRKCD 122
Query: 122 -VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
+ K N + ++A M ++ F G+ + KA A +F G D S+ ++DR+ +
Sbjct: 123 FTNEKSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFGK 182
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
A+L AV TVL+ S A +EGA F D +I Q +C+ N G R L
Sbjct: 183 ADLQGAVFRNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKICR-----NTTIGDEGRAEL 237
Query: 241 GC 242
GC
Sbjct: 238 GC 239
>gi|298705858|emb|CBJ29003.1| thylakoid lumenal protein [Ectocarpus siliculosus]
Length = 199
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 59/105 (56%), Gaps = 2/105 (1%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
E++FS F + KA A +N+ AD ++ ++DR+ + +++ A+ VLT +
Sbjct: 92 EANFSKGDFKEVVMSKAYARSSNWEEADFTNAVVDRVSFDGSSMKGAIFQNAVLTSTSFT 151
Query: 200 GAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 242
GA +E ADF++A + ++ LCK GTNP+T TR S GC
Sbjct: 152 GADVENADFTEAYMGDFDQKNLCKNPTLKGTNPVTNADTRASAGC 196
>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 189
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 65/123 (52%), Gaps = 12/123 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A RG + + F A+L++A N RAN + AD+ +D SG+ GA L A
Sbjct: 30 AHLRGAHLVEADLSF--ANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARL 87
Query: 159 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGG-----AIIEGADF 208
+AN TGA+L D L++R L E ANL NA V + L R+DLG A+ +GAD
Sbjct: 88 MRANLTGANLRDALVNRADLTEALLVDANLRNAHFVESTLVRADLGDANALKAVFKGADL 147
Query: 209 SDA 211
S A
Sbjct: 148 SGA 150
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ A + E+D S + A L A +AN +GADL + L ANLT A L+R
Sbjct: 30 AHLRGAHLVEADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMR 89
Query: 191 TVLTRSDLGGAIIEGADFSDAVI 213
LT ++L A++ AD ++A++
Sbjct: 90 ANLTGANLRDALVNRADLTEALL 112
>gi|351722845|ref|NP_001236746.1| uncharacterized protein LOC100500352 [Glycine max]
gi|255630103|gb|ACU15405.1| unknown [Glycine max]
Length = 224
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 110 KQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTKANLS 164
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 165 NANLEGALATGNTSFKGSNITGADFTDVPLREDQREYLCKVADGVNPTTGNATRDALFC 223
>gi|147774410|emb|CAN74472.1| hypothetical protein VITISV_013914 [Vitis vinifera]
Length = 221
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 75/150 (50%), Gaps = 18/150 (12%)
Query: 97 YEAE-TRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
Y AE TRG+ G S D + ++ + NF+ AN A ++D +G+ + A
Sbjct: 85 YGAEVTRGQDLTGKDFSGKSLIKQDFKTSILRQANFKXANLLGASFFDADLTGADLSDAD 144
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
L A AN T A+LS+ ANL A+ R G+II GADF+D
Sbjct: 145 LRGADFSLANVTKANLSN----------ANLEGALATGNTSFR----GSIITGADFTDVP 190
Query: 213 IDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ Q++ LCK A+G NP TG +TR++L C
Sbjct: 191 LREDQREYLCKVADGVNPTTGNATRETLLC 220
>gi|449441422|ref|XP_004138481.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
[Cucumis sativus]
Length = 214
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 67/123 (54%), Gaps = 6/123 (4%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 180
++F D + S + F GA L A + A+ TGADLSD + D + N +
Sbjct: 91 KDFSGKTLIKQDFKTSILRQANFKGANLLGASFFDADLTGADLSDADLRGADFSLANVTK 150
Query: 181 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
ANL+NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++
Sbjct: 151 ANLSNANLEGALATGNTSFRGSTINGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 210
Query: 240 LGC 242
L C
Sbjct: 211 LLC 213
>gi|168022043|ref|XP_001763550.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685343|gb|EDQ71739.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 68/114 (59%), Gaps = 1/114 (0%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
+ +F ++ +R+++F G+K GA + A+ T ADL + L++ANLTNA L
Sbjct: 51 KQDFKTSILRQANFKGAKLLGASFFDSDLTGADLTDADLRGADLSLARLSKANLTNANLE 110
Query: 190 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+T + L G+II GADF++ Q++ LC A+G NP+TG +TR++L C
Sbjct: 111 GASVTGNTYLKGSIITGADFTEVNWRDDQRKELCLIADGVNPVTGNATRETLLC 164
>gi|388521435|gb|AFK48779.1| unknown [Lotus japonicus]
Length = 225
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
+ +F ++ +R+++F G+K GA + ++ TGADLSD + D + N +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFFLANVTKANLS 165
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224
>gi|425437827|ref|ZP_18818239.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9432]
gi|389677087|emb|CCH93934.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9432]
Length = 976
Score = 67.4 bits (163), Expect = 7e-09, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 55/102 (53%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
+A A+L A + N RAN A++ E++ G+ GAYLE A +AN GA+L
Sbjct: 861 SANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERANLYGANLE 920
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++R L ANL A L L R++L GA + GA+F DA
Sbjct: 921 GANLERANLERANLKGANLEGANLERANLEGAFLRGANFKDA 962
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 46/102 (45%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A A+L+ A N RAN A + + + AYLE+A Y AN A+L
Sbjct: 811 GANLERANLKGANLYMANLERANLYRAYLYRAYLYRAYLERAYLERANLYSANLERANLY 870
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++R L ANL A L L + L GA +EGA+ A
Sbjct: 871 MANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERA 912
Score = 46.6 bits (109), Expect = 0.013, Method: Composition-based stats.
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 1/111 (0%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ + + + RAN A++ ++ G+ GA LE+A AN A+L + R
Sbjct: 778 DLQNCLLIHRDLYRANLERANLERANLYGAYLYGANLERANLKGANLYMANLERANLYRA 837
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
L A L A L R L R++L A +E A+ A ++ A ++A K AN
Sbjct: 838 YLYRAYLYRAYLERAYLERANLYSANLERANLYMANLERANLERANLKRAN 888
Score = 42.0 bits (97), Expect = 0.30, Method: Composition-based stats.
Identities = 29/93 (31%), Positives = 42/93 (45%), Gaps = 3/93 (3%)
Query: 91 LADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
L N YEA G + G+ A A+L A N RAN A+++ ++ G+
Sbjct: 884 LKRANLYEANLYGAYLAGAYLEGANLERANLYGANLEGANLERANLERANLKGANLEGAN 943
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
A LE A ANF A++ T++D V E
Sbjct: 944 LERANLEGAFLRGANFKDANVKGTILDTEVKTE 976
Score = 40.8 bits (94), Expect = 0.67, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 54/125 (43%), Gaps = 4/125 (3%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
A+L + E +G A A+L +A N AN A++ + + A
Sbjct: 792 ANLERANLERANLYG----AYLYGANLERANLKGANLYMANLERANLYRAYLYRAYLYRA 847
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
YLE+A +AN A+L + L ANL A L R L ++L GA + GA A
Sbjct: 848 YLERAYLERANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGA 907
Query: 212 VIDLA 216
++ A
Sbjct: 908 NLERA 912
Score = 40.4 bits (93), Expect = 0.79, Method: Composition-based stats.
Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 5/106 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L +A + RA A + + + A LE+A Y AN A+L + R
Sbjct: 827 ANLERANLYRAYLYRAYLYRAYLERAYLERANLYSANLERANLYMANLERANLERANLKR 886
Query: 176 MVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVIDLA 216
L EANL A L L R++L GA +EGA+ A ++ A
Sbjct: 887 ANLYEANLYGAYLAGAYLEGANLERANLYGANLEGANLERANLERA 932
>gi|443326649|ref|ZP_21055296.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442793770|gb|ELS03210.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 920
Score = 67.4 bits (163), Expect = 8e-09, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 56/104 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A V+ N RAN A++ ++ +G+ GA LEKA+ ANF GA+L++
Sbjct: 801 ANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAILEGANFRGANLNE 860
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ L+EAN A R L R D A +GADF A++D
Sbjct: 861 ANLRGAHLSEANFQEADFDRADLQRVDFDRADFQGADFDRAIMD 904
Score = 44.7 bits (104), Expect = 0.049, Method: Composition-based stats.
Identities = 30/103 (29%), Positives = 51/103 (49%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+L +A + N RAN A++ ++ G+ A L +A +AN GA+L+ +++
Sbjct: 782 NLYRANLYRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGA 841
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
L +A L A L ++L GA + A+F +A D A Q
Sbjct: 842 NLEKAILEGANFRGANLNEANLRGAHLSEANFQEADFDRADLQ 884
Score = 41.6 bits (96), Expect = 0.44, Method: Composition-based stats.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 5/74 (6%)
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAI 202
N L +A Y+AN A+L +D L ANL A LVR L R ++L GAI
Sbjct: 778 LNWLNLYRANLYRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAI 837
Query: 203 IEGADFSDAVIDLA 216
+EGA+ A+++ A
Sbjct: 838 LEGANLEKAILEGA 851
>gi|298711847|emb|CBJ32870.1| Pentapeptide repeat [Ectocarpus siliculosus]
Length = 238
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 63/125 (50%), Gaps = 12/125 (9%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K ++ +F+ + +E +FSGS G KA KA+FTGA+L L EA+L
Sbjct: 116 KGKYKSKDFSGSIAKEVNFSGSDLRGVRFFKADLKKADFTGANLGTA-----SLEEADLE 170
Query: 185 NAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTR 237
++ V T S G + I GADF+DA+I + LC A GTNP TG TR
Sbjct: 171 GTIMTNAVATGSYFGNNMNNVGDISGADFTDALIRKDVAKILCARPDAKGTNPTTGTDTR 230
Query: 238 KSLGC 242
SL C
Sbjct: 231 DSLLC 235
>gi|116785879|gb|ABK23895.1| unknown [Picea sitchensis]
gi|116792150|gb|ABK26251.1| unknown [Picea sitchensis]
Length = 239
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N R + +A M ++ F G+ + + KA A A+F G D S+ ++DR+ +AN+
Sbjct: 126 KTNLRGKSLAAALMSDAKFDGADMSEVIMSKAYAVGASFKGVDFSNAVIDRVNFGKANMQ 185
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+AV TVL+ S A +EGA F D +I Q LC TN R LGC
Sbjct: 186 DAVFRNTVLSGSTFVDANLEGAKFEDTIIGYIDLQKLC-----TNQTLSDEGRDILGC 238
>gi|359806262|ref|NP_001240959.1| uncharacterized protein LOC100806792 [Glycine max]
gi|255626639|gb|ACU13664.1| unknown [Glycine max]
Length = 222
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 108 KQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTKANLS 162
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G+ + GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 163 NANLEGALATGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDTLFC 221
>gi|388510406|gb|AFK43269.1| unknown [Lotus japonicus]
Length = 225
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
+ +F ++ +R+++F G+K GA + ++ TGADLSD + D + N +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFSLANVTKANLS 165
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224
>gi|307108672|gb|EFN56912.1| hypothetical protein CHLNCDRAFT_51710 [Chlorella variabilis]
Length = 155
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 61/126 (48%), Gaps = 5/126 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR N + A M E+D SG+ L KA A AN GADL++ ++DR+
Sbjct: 33 DLRFCKFAGANLSGKTLSGAYMNEADMSGANMREVVLTKAYAVGANLRGADLTNAVIDRV 92
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+ +L A LV V+T + GA ++ A+F DA+I + LC NP +
Sbjct: 93 AFDGVDLEGAQLVNAVITGTTFTGANLKDANFEDALIGSEDAKRLC-----ANPTLVGES 147
Query: 237 RKSLGC 242
R +GC
Sbjct: 148 RDQVGC 153
>gi|159474024|ref|XP_001695129.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
gi|158276063|gb|EDP01837.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
Length = 185
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 65/131 (49%), Gaps = 5/131 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
D+R + ++ A + ++D S + A L KA A KANF GAD+++ ++DR+
Sbjct: 60 DMRLCSYAGKDLHGRVLAGALLADADLSNTNLQEAVLTKAYAVKANFEGADMTNAVVDRV 119
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
ANL + TV+T + GA +EG+ + DA+I LC+ NP +
Sbjct: 120 DFTNANLKRVKFINTVVTGASFAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGES 174
Query: 237 RKSLGCGNSRR 247
R +GC R+
Sbjct: 175 RAQVGCRAVRK 185
>gi|219116042|ref|XP_002178816.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409583|gb|EEC49514.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 109
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ ++F S G KA +A+F+GADL ++ ++EA L + V V + S +
Sbjct: 3 KSTNFGKSNLKGCRFYKAYLVRADFSGADLRGASLEDTSMDEALLKDTVAVGAYFSASIM 62
Query: 199 GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 242
+E ADF+DA + LC+ A GTNP+TGV TR+SL C
Sbjct: 63 DTLTVENADFTDAQFPIKTLPLLCERSDATGTNPVTGVDTRESLMC 108
>gi|357133836|ref|XP_003568528.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
[Brachypodium distachyon]
Length = 200
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
+ +F ++ +R+++F G+K GA + A+ TGADLSDT + D + N + NLT
Sbjct: 86 KQDFKTSILRQTNFKGAKLLGASF-----FDADLTGADLSDTDLRNADFSLANVTKVNLT 140
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L ++T + G+ I GADF+D + Q+ LCK A+G N TG +T+++L C
Sbjct: 141 NANLEGALVTGNTSFKGSTIYGADFTDVPLRDDQRDYLCKIADGVNTTTGNATKETLFC 199
>gi|428219116|ref|YP_007103581.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990898|gb|AFY71153.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 179
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 69/133 (51%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA++ L A ++ R A F A +R +F+ + +G L + AN +GA+L
Sbjct: 46 AAKYDRRVLEGADFSGKDLRDAQFNKAVLRSVNFANANLSGVSLFGSDLTNANLSGANLR 105
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+ +D + +L+NA+L + + I GADF+D + ++ LC+ A GTN
Sbjct: 106 YSSLDTSRMVGTDLSNAILEGAFVYGAKFKNLKIAGADFTDVDLRETIREELCEVATGTN 165
Query: 230 PITGVSTRKSLGC 242
P TG TR++LGC
Sbjct: 166 PTTGRDTRETLGC 178
>gi|302143933|emb|CBI23038.3| unnamed protein product [Vitis vinifera]
Length = 232
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 109/249 (43%), Gaps = 25/249 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA SI PLS++ SS + + + L P ++C S D S++Q
Sbjct: 1 MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASW----DSPELKASSSQ--- 48
Query: 61 PYAKLKNWR---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGS 115
+ +LKN + V AA+ V + S + L+ + N+ E G G +
Sbjct: 49 -FKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKP 107
Query: 116 ADLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 173
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++
Sbjct: 108 IDLRFCDYTNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVL 167
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 233
DR+ +ANL AV TVL+ S A +E A F D +I Q +C TN
Sbjct: 168 DRVNFGKANLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSIN 222
Query: 234 VSTRKSLGC 242
R LGC
Sbjct: 223 ADGRAELGC 231
>gi|359490718|ref|XP_002275994.2| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic [Vitis
vinifera]
Length = 244
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 109/249 (43%), Gaps = 25/249 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA SI PLS++ SS + + + L P ++C S D S++Q
Sbjct: 13 MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASW----DSPELKASSSQ--- 60
Query: 61 PYAKLKNWR---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGS 115
+ +LKN + V AA+ V + S + L+ + N+ E G G +
Sbjct: 61 -FKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKP 119
Query: 116 ADLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 173
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++
Sbjct: 120 IDLRFCDYTNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVL 179
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 233
DR+ +ANL AV TVL+ S A +E A F D +I Q +C TN
Sbjct: 180 DRVNFGKANLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSIN 234
Query: 234 VSTRKSLGC 242
R LGC
Sbjct: 235 ADGRAELGC 243
>gi|116792169|gb|ABK26257.1| unknown [Picea sitchensis]
Length = 237
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 71/136 (52%), Gaps = 6/136 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+G+ R A ++F N D + S +KF GA L A + A+ TGADLSD
Sbjct: 102 YGAEVTRGADLSGKDFSGKNLIQQDFKTSILRQAKFKGAKLIGASFFDADLTGADLSDAD 161
Query: 173 M---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ D + N + NL+NA L ++T + G+ I GADF+D + Q++ LC A+
Sbjct: 162 LRGADFSLANVTKVNLSNANLEGALVTGNTSFKGSNISGADFTDVPLRDDQRRYLCNIAD 221
Query: 227 GTNPITGVSTRKSLGC 242
G N TG +TR +L C
Sbjct: 222 GVNLTTGNATRDTLLC 237
>gi|260434702|ref|ZP_05788672.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
gi|260412576|gb|EEX05872.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
Length = 160
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 62/120 (51%), Gaps = 1/120 (0%)
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
+ ++ R A F +++RE++ SGS GA L A A+ +G DL + +D V+ N
Sbjct: 41 YSNKDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTN 100
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
L +AVL + +I GADF+D + L + TN +TG STR+SLGC
Sbjct: 101 LEDAVLEGAFAFNTRFRDVLITGADFTDVPCAGTNSKPL-RRCRRTNSVTGRSTRESLGC 159
>gi|126656956|ref|ZP_01728134.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
gi|126621794|gb|EAZ92503.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
Length = 1084
Score = 64.3 bits (155), Expect = 5e-08, Method: Composition-based stats.
Identities = 55/161 (34%), Positives = 75/161 (46%), Gaps = 21/161 (13%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ RG + G A G ADL A + A+ T AD+R +D +G+ GAYLE A
Sbjct: 931 ADLRGAYLEG--ADLGGADLTGA-----DLEGADLTGADLRGADLTGAYLEGAYLEGADL 983
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DL 215
A+ TGA L ++ L A+LT A L L +DLGGA + GAD + A + DL
Sbjct: 984 TGADLTGAYLEGAYLEGADLGGADLTGADLEGADLRGADLGGADLGGADLTGADLRGADL 1043
Query: 216 AQ-----------KQALCKYANGTNPITGVSTRKSLGCGNS 245
+ KQ NG + I K LG G++
Sbjct: 1044 TKTDLNEARYLTVKQVQEAKNNGKDAIYDEEMEKKLGLGDN 1084
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 51/106 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ ADL A + A+ T AD+ +D G+ GAYLE A A+ TGADL
Sbjct: 896 AKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGADLEG 955
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ L A+LT A L L +DL GA + GA A ++ A
Sbjct: 956 ADLTGADLRGADLTGAYLEGAYLEGADLTGADLTGAYLEGAYLEGA 1001
Score = 45.1 bits (105), Expect = 0.035, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 40/75 (53%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
++ E+ +G+ GAYLE A A+ TGADL+ ++ L A L A L LT +
Sbjct: 892 ELYEAKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGA 951
Query: 197 DLGGAIIEGADFSDA 211
DL GA + GAD A
Sbjct: 952 DLEGADLTGADLRGA 966
>gi|356509222|ref|XP_003523350.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Glycine max]
Length = 240
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 7/128 (5%)
Query: 117 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
DLR+ E N + + ++A M ++ F G+ + KA A A+F G D S+ ++D
Sbjct: 117 DLRQCDFTDEKTNLKGKSLSAALMSDAKFDGADMTEVVMSKAYAVGASFKGVDFSNAVLD 176
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
R+ +A+L AV TVL+ S A ++ A F D +I Q LC TN G
Sbjct: 177 RVNFEKADLEGAVFKNTVLSGSTFDDAKLDNAVFEDTIIGYIDLQKLC-----TNKTIGD 231
Query: 235 STRKSLGC 242
R LGC
Sbjct: 232 EWRVELGC 239
>gi|172036979|ref|YP_001803480.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354554778|ref|ZP_08974082.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171698433|gb|ACB51414.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353553587|gb|EHC22979.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 325
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 105/224 (46%), Gaps = 39/224 (17%)
Query: 3 LSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPY 62
L+ IS +++K + P+QL L++ + E+D QF G
Sbjct: 95 LTQISGVTVKQFKLVKTH---PFQLEDLAEQI---------DENDPQFLLIERIMSQGG- 141
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
N + F L+ A++ C++N+ LADL EA G S A ADL A
Sbjct: 142 ----NDQDFREANLSGAIL--CNANL-ILADL--REANLMGTDL--SGANLMGADLSGAD 190
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA---------------YKANFTGAD 167
+ N AN A++ E++ +G+ A L++A +AN GA
Sbjct: 191 LLGANLTGANLMGANLTEANLTGADLGDAILQEADLCWADLSEVNLIGADLSQANLKGAI 250
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+D+L+ LNEANL+ A+L R++L++++L G+I+ D ++A
Sbjct: 251 LTDSLLSHTNLNEANLSEAILNRSILSKTNLSGSILSQTDLTNA 294
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 28/115 (24%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ F A+L A+ AN AD+RE++ G+ +GA L A+ +GAD
Sbjct: 141 GNDQDFREANLSGAI-----LCNANLILADLREANLMGTDLSGANL-----MGADLSGAD 190
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
L ANLT A L+ LT ++L GA D DA++ Q+ LC
Sbjct: 191 LLG----------ANLTGANLMGANLTEANLTGA-----DLGDAIL---QEADLC 227
>gi|159903302|ref|YP_001550646.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
gi|159888478|gb|ABX08692.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
Length = 158
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F DLR A N + AN SGS GA L A K + + +L +
Sbjct: 36 ADFSDTDLRGATFYLTNLQNANL----------SGSNLEGASLFGAKLLKTDLSNTNLKN 85
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+D +L+ A+LTNA L + I G+DF++ +I Q+ LC A+GTN
Sbjct: 86 ATLDSSILDGADLTNAYLEDAFAFNTQFKDVKISGSDFTNVLITNDQRNYLCSIASGTNS 145
Query: 231 ITGVSTRKSLGC 242
++ +TR SL C
Sbjct: 146 VSTRNTRDSLEC 157
>gi|115434488|ref|NP_001042002.1| Os01g0144100 [Oryza sativa Japonica Group]
gi|13486898|dbj|BAB40127.1| unknown protein [Oryza sativa Japonica Group]
gi|113531533|dbj|BAF03916.1| Os01g0144100 [Oryza sativa Japonica Group]
gi|215678959|dbj|BAG96389.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765141|dbj|BAG86838.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 198
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
R +F ++ +R+++F G+K GA + A+ TGADLSD + D + N + NLT
Sbjct: 84 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDADLRGADFSLANVSKVNLT 138
Query: 185 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L + T + G+ I GADF+D + Q++ LCK A+G N TG +T+++L C
Sbjct: 139 NANLEGALATGNTTFKGSNIYGADFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 197
>gi|440804190|gb|ELR25067.1| pentapeptide repeatcontaining protein [Acanthamoeba castellanii
str. Neff]
Length = 293
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ ADLR+A +AN AD+RE++ SG+ A L A+ +A+ +GA L +
Sbjct: 162 AQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAILRRADLSGAALPN 221
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 213
+ R L ANLT A L LT +D L GA + GAD S++ +
Sbjct: 222 VELQRASLRRANLTGANLTWATLTDADCTQANLSGANLSGADLSNSTL 269
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 15/116 (12%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKAN 162
F A+L +A N AN A+MRE ++ SG+ + A L KA +AN
Sbjct: 94 FQWANLTEATLTDCNLTGANLKGANMREVQLASTNLTRANLSGANLHLARLGKAQLRRAN 153
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 213
+GA+L + ++ L +ANL NA + + L +D L GA++ AD A++
Sbjct: 154 LSGANLEEAQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAIL 209
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 6/118 (5%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
Q S +L +A N A A +R ++ SG+ A LE A +AN A ++
Sbjct: 123 QLASTNLTRANLSGANLHLARLGKAQLRRANLSGANLEEAQLEDADLRQANLANAKMTKA 182
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGT 228
+ L EANL+ AV++ R+DL AI+ AD S A + ++ ++A + AN T
Sbjct: 183 NLMHADLREANLSGAVML-----RADLRSAILRRADLSGAALPNVELQRASLRRANLT 235
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 44/96 (45%), Gaps = 12/96 (12%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD----------LSDTLMDRMVLNEA 181
+ T A + D G F A L +A N TGA+ L+ T + R L+ A
Sbjct: 78 DLTGARLFRCDLRGVDFQWANLTEATLTDCNLTGANLKGANMREVQLASTNLTRANLSGA 137
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
NL A L + L R++L GA +E A DA DL Q
Sbjct: 138 NLHLARLGKAQLRRANLSGANLEEAQLEDA--DLRQ 171
>gi|449456995|ref|XP_004146234.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Cucumis sativus]
gi|449522387|ref|XP_004168208.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Cucumis sativus]
Length = 237
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K + + +A M ++ F G+ + + KA A A+F G D S+ ++DR+ +ANL
Sbjct: 124 KNQLKGKSLAAALMSDAKFDGADLSEVVMSKAYAVGASFKGVDFSNAVLDRVNFGKANLQ 183
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ TVL+ S A +E A F D +I Q LC NP R LGC
Sbjct: 184 GALFKNTVLSGSTFDDAQLEDAVFEDTIIGYIDLQKLC-----VNPTISPEGRAELGC 236
>gi|303279747|ref|XP_003059166.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459002|gb|EEH56298.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 213
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 5/126 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLRK + ++ + A M ++ F G+ + KA A A+FTGA+ ++ ++DR+
Sbjct: 91 DLRKCEYDGKDLSTKTLSGALMVDASFKGTNLTEVVMSKAYALNADFTGANFTNAVVDRV 150
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+ ANL NA V+T + G + GA F +A+I + LC NP T
Sbjct: 151 TFDGANLANADFHNAVITGTTYEGTDLTGATFEEALIGKEDVKRLCD-----NPTVKGPT 205
Query: 237 RKSLGC 242
R +GC
Sbjct: 206 RFEVGC 211
>gi|168060251|ref|XP_001782111.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666451|gb|EDQ53105.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 158
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 60/118 (50%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N + ++A M E+ F G+ + KA A A+F G+ ++ ++DR+ +++++
Sbjct: 44 KTNLKGKTLSAALMSEAKFDGADLTEVIMSKAYAVGASFKGSVFTNAVVDRVAFDKSDMQ 103
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ TVL+ S GA +EGA F +A+I Q LCK NP +R L C
Sbjct: 104 GVQFINTVLSGSTFEGANLEGASFENALIGYVDIQKLCK-----NPTLPEESRIDLAC 156
>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
Length = 378
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L+ A ++ N A+ + AD+R +D SG+ A L KA +AN T DL
Sbjct: 43 SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTNTDL 102
Query: 169 SDTLMDR-----MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
S ++R +L++ANL NA L T L +DLG A +E AD S+A +D
Sbjct: 103 SSANLNRASLDYALLSKANLINADLSGTNLVGADLGRANLENADLSNATLD 153
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADL +A R N ++AD+ +D G+ +GA LE A KA+ A+L++
Sbjct: 40 ADLSNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTN 99
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
T + LN A+L A+L + L +DL G + GAD
Sbjct: 100 TDLSSANLNRASLDYALLSKANLINADLSGTNLVGADL 137
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 8/129 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N +A+ A++ +D S + N A L+ A+ KAN ADL
Sbjct: 68 SWADLRGADLSGANLENANLSKASLDQANLTNTDLSSANLNRASLDYALLSKANLINADL 127
Query: 169 SDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA--- 220
S T + R L A+L+NA L ++L ++ G A ++ A +A I+ A +
Sbjct: 128 SGTNLVGADLGRANLENADLSNATLDNSILISANFGAANLKKASLCNANIERASLEGANL 187
Query: 221 LCKYANGTN 229
+ NGTN
Sbjct: 188 ISANLNGTN 196
>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 351
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 72/148 (48%), Gaps = 7/148 (4%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLVAAIFNEVTLNRINLSGANLSEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
N A T + ++D SG+ +GA L + N TGA L T + LN + LT+
Sbjct: 81 ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCL----LNGSQLTD 136
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+LV LTRS L GA + GA+ + +++
Sbjct: 137 AILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLTGANL 249
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSD 170
A L ++V + AN + + E D SG+ GA +L + AN TGADLS+
Sbjct: 142 ATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSE 201
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+++ ANLT A L LT ++L GA + GA+ + A
Sbjct: 202 SVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGA 242
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD-----RMVLN 179
+ NF + +A E + +GA L +A+ GA+LS + + VL
Sbjct: 20 ERNFSDISLVAAIFNEVTLNRINLSGANLSEALMVHTRLIGANLSRSQLSYADLSMAVLI 79
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ANLT A + TVL ++DL GA + GA S
Sbjct: 80 DANLTGATMTETVLHQADLSGASLSGAILS 109
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 160
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTGANLTGANLNGLTLQSADLRLANLSKADLRG 271
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|388504750|gb|AFK40441.1| unknown [Lotus japonicus]
Length = 239
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N + ++A M ++ F G+ + KA A +F G D S+ ++DR+ +A+L
Sbjct: 126 KSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFEKADLQ 185
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S A +EGA F D +I Q LC+ N R LGC
Sbjct: 186 GAVFKNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKLCR-----NKTIADDWRVELGC 238
>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
Length = 220
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 81/167 (48%), Gaps = 12/167 (7%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
N+ YA+ R F +L AA+ + N L+ N EA IG S +Q
Sbjct: 10 NKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
ADL AV + N A+ T + ++D SG+ +GA L + N TGA L T
Sbjct: 68 LSYADLSMAVLIDANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTC 127
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLA 216
+ LN + LT+A+LV +TRS L GA + GA+ + ++ IDL+
Sbjct: 128 L----LNGSQLTDAILVGATMTRSVLSGAHMTGANLNRSILSEIDLS 170
>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 351
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 72/148 (48%), Gaps = 7/148 (4%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
N A T + ++D SG+ +GA L + N TGA L T + LN + LT+
Sbjct: 81 ANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCL----LNGSQLTD 136
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+LV LTRS L GA + GA+ + +++
Sbjct: 137 AILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL ++V NF AN T A++ ++ +G+ NGA L A +AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLTGANL 249
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 48/101 (47%), Gaps = 12/101 (11%)
Query: 109 SAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
S A A L + VH+ + N AN T AD+ ES S F A L A AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
LN ANLT A L R LT ++L G ++ AD
Sbjct: 229 ----------LNGANLTGANLTRANLTGANLNGLTLQSADL 259
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSD 170
A L ++V + AN + + E D SG+ GA +L + AN TGADLS+
Sbjct: 142 ATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSE 201
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+++ ANLT A L LT ++L GA + GA+ + A
Sbjct: 202 SVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRA 242
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 19/151 (12%)
Query: 78 AAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA 136
A+++ +C N S L D A TR S A A+L +++ + + AN T A
Sbjct: 121 ASLIGTCLLNGSQLTDAILVGATLTRSVL---SGAHMTGANLNRSILSEIDLSGANLTGA 177
Query: 137 -----DMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLTNA 186
+ + + SG+ GA L ++V +NF TGA+L+ + LN ANLT A
Sbjct: 178 TLIRVHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGA 237
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L TR++L GA + G A + LA
Sbjct: 238 NL-----TRANLTGANLNGLTLQSADLRLAN 263
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 160
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTRANLTGANLNGLTLQSADLRLANLSKADLRG 271
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
Length = 351
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 77/161 (47%), Gaps = 9/161 (5%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
N+ YA+ R F +L AA+ + N L+ N EA IG S +Q
Sbjct: 10 NKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
ADL AV + N A T + ++D SG+ +GA L + N TGA L T
Sbjct: 68 LSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTC 127
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ LN + LT+A+LV LTRS L GA + GA+ + +++
Sbjct: 128 L----LNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 49/101 (48%), Gaps = 12/101 (11%)
Query: 109 SAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
S A A L + VH+ + N AN T AD+ ES S F A L A AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
LN ANLT A L R LTR++L G ++ AD
Sbjct: 229 ----------LNGANLTRANLTRANLTRANLNGLTLQSADL 259
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL ++V NF AN T A++ ++ +G+ NGA L +A +AN T A+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLTRANL 249
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 48/103 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A A+L A + N RAN T A++ + A L KA AN TGA+L
Sbjct: 220 TGANLTGANLNGANLTRANLTRANLTRANLNGLTLQSADLRLANLSKADLRGANLTGANL 279
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 280 AGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ A A+L +A + N RAN SAD+R ++ S + GA L A N
Sbjct: 225 TGANLNGANLTRANLTRANLTRANLNGLTLQSADLRLANLSKADLRGANLTGA-----NL 279
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
GA+L + + L +ANL A L+ T L ++L GA + A+ A + +A
Sbjct: 280 AGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQANLIGASLSVAN 333
>gi|18406661|ref|NP_566030.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
gi|20141847|sp|O22160.2|TL15A_ARATH RecName: Full=Thylakoid lumenal 15 kDa protein 1, chloroplastic;
AltName: Full=p15; Flags: Precursor
gi|20196925|gb|AAM14836.1| pentapeptide repeat family protein [Arabidopsis thaliana]
gi|330255391|gb|AEC10485.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
Length = 224
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 180
+ R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N +
Sbjct: 106 QTLIRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTK 160
Query: 181 ANLTNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
NLTNA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +
Sbjct: 161 VNLTNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDT 220
Query: 240 LGC 242
L C
Sbjct: 221 LLC 223
>gi|224120874|ref|XP_002318440.1| predicted protein [Populus trichocarpa]
gi|222859113|gb|EEE96660.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N + + +A M ++ F G+ + KA A A+F G D S+ ++DR+ +A+L
Sbjct: 127 KSNLKGKSLAAALMSDAKFDGADMTEVVMSKAYAVGASFRGVDFSNAVLDRVNFGKADLK 186
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S A +E A F D +I Q +C+ N G R LGC
Sbjct: 187 GAVFKNTVLSGSTFDEAQLEDAIFEDTIIGYIDLQKICR-----NTSIGPDGRAELGC 239
>gi|222423354|dbj|BAH19651.1| AT2G44920 [Arabidopsis thaliana]
Length = 224
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 180
+ R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N +
Sbjct: 106 QTLIRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGGDFSLANVTK 160
Query: 181 ANLTNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
NLTNA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +
Sbjct: 161 VNLTNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDT 220
Query: 240 LGC 242
L C
Sbjct: 221 LLC 223
>gi|363807626|ref|NP_001241901.1| uncharacterized protein LOC100785667 [Glycine max]
gi|255647148|gb|ACU24042.1| unknown [Glycine max]
Length = 239
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 63/128 (49%), Gaps = 7/128 (5%)
Query: 117 DLRKA--VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
DLR+ + K N + + ++A M ++ F G+ + KA A A+F G D S+ ++D
Sbjct: 116 DLRQCDFTNEKTNLKGKSPSAALMSDAKFDGADMTEVVMSKAYAAGASFKGVDFSNAVLD 175
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
R+ +A+L A+ TVL+ S A ++ A F D +I Q LC TN G
Sbjct: 176 RVNFEKADLEGAIFKNTVLSGSPFDDAKLDNAVFEDTIIGYIDFQKLC-----TNKTIGD 230
Query: 235 STRKSLGC 242
R LGC
Sbjct: 231 EWRVELGC 238
>gi|397570889|gb|EJK47511.1| hypothetical protein THAOC_33758, partial [Thalassiosira oceanica]
Length = 122
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 64/118 (54%), Gaps = 10/118 (8%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N + F + +R++DF G+ GA A ++F GAD++ ++ ++ E ++ A
Sbjct: 11 NLKGVAFQQSIVRDTDFRGTNLFGASFFDATLDGSDFEGADMTLCNVENAIVKEMYVSGA 70
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 242
L V + IE +D+SD + Q++ LC++ A GTNP+TGV TR+SL C
Sbjct: 71 TLFEGVKS--------IENSDWSDTQLRKDQQKYLCEHPTAKGTNPVTGVDTRESLMC 120
>gi|307109822|gb|EFN58059.1| hypothetical protein CHLNCDRAFT_57123 [Chlorella variabilis]
Length = 608
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R+ +T AD+R ++ S + G L A+A ANF+GA+L + ++ + L A+L+N
Sbjct: 49 QDLRKNKYTKADLRGTNLSNANLEGVTLFGALATNANFSGANLRNADLELVELEGADLSN 108
Query: 186 AVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYA 225
AVL +LT + LG I GADF+D V LC+ A
Sbjct: 109 AVLEGAMLTNAQLGRVKSITGADFTDVVFRKDVMMGLCRIA 149
>gi|384246084|gb|EIE19575.1| hypothetical protein COCSUDRAFT_31020 [Coccomyxa subellipsoidea
C-169]
Length = 203
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 68/145 (46%), Gaps = 5/145 (3%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
A T G +A DLR ++ + A ++++ S L KA
Sbjct: 61 RAYTGNTIGQANAVSDKVLDLRMCDFTGKDLSGKTLSGALLKDAILPNSTMRETVLTKAY 120
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
A ANF+GAD+++ ++DR+ +ANL+N + V+T + GA ++GA F DA+I
Sbjct: 121 AVGANFSGADMTNAVIDRVDFRKANLSNVKFINAVITGTAFDGANLDGAIFEDALIGNED 180
Query: 218 KQALCKYANGTNPITGVSTRKSLGC 242
+ LC NP +R +GC
Sbjct: 181 VKRLC-----LNPTLTGESRMGVGC 200
>gi|219130181|ref|XP_002185250.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403429|gb|EEC43382.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 235
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 2/107 (1%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
M +D S + F AY K + GAD ++ ++DR ++L A+ VLT +
Sbjct: 128 MTNTDASNANFAEAYFSKGYLRDSMLDGADFTNAIVDRATFKGSSLRGAIFANAVLTGTG 187
Query: 198 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 242
GA +E ADF+DA I + LCK G NP TG TR S C
Sbjct: 188 FEGADVENADFTDAYIGDFDIRLLCKNPTLKGENPKTGADTRMSANC 234
>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 351
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 73/148 (49%), Gaps = 7/148 (4%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 125
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
N A+ T + ++D SG+ +GA L + N TGA L T + LN + LT+
Sbjct: 81 ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCL----LNGSQLTD 136
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+LV +TRS L GA + GA+ + +++
Sbjct: 137 AILVGATMTRSVLSGAHMTGANLNRSIL 164
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLTGANL 249
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQCADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSD 170
A + ++V + AN + + E D SG+ GA +L + AN TGADLS+
Sbjct: 142 ATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSE 201
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+++ ANLT A L L ++L GA + GA+ + A
Sbjct: 202 SVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGA 242
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 160
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLAGANLAGANLNGANLTGANLTGANLTGANLNGLTLQCADLRLANLSKADLRG 271
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|302831317|ref|XP_002947224.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
nagariensis]
gi|300267631|gb|EFJ51814.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
nagariensis]
Length = 244
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 64/130 (49%), Gaps = 5/130 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR + ++ A + ++D S + A L KA A KANF AD+++ ++DR+
Sbjct: 101 DLRLCSYSGKDLHGRVLAGALLADADLSNTNLQEAVLTKAYAVKANFENADMTNAVVDRV 160
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
+ ANL TV+T + GA +EG+ + DA+I LC+ NP +
Sbjct: 161 DFSGANLRGVRFNNTVVTGAQFAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGES 215
Query: 237 RKSLGCGNSR 246
R +GC SR
Sbjct: 216 RMQVGCRVSR 225
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 30/59 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
A L KA VK NF A+ T+A + DFSG+ G V A F GADL ++ +
Sbjct: 135 AVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTVVTGAQFAGADLEGSVWE 193
>gi|443477350|ref|ZP_21067204.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443017546|gb|ELS31963.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 670
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 73/146 (50%), Gaps = 12/146 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SA+L+ A V N R+AN A++ +S+ + N A LE A A+ A+L
Sbjct: 521 SEADLNSANLKGANLVLTNLRKANLVKANLSDSNLGAANLNDAILEGADLSAADLRSAEL 580
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----------IIEGADFSDAV-IDLA 216
+ T + L+ ANLT A LV ++L GA IE ADF++AV +D
Sbjct: 581 NLTNLSNANLSSANLTAAKLVLIEFAGANLNGANFRNAIVENIGSIESADFTNAVNLDPI 640
Query: 217 QKQALCKYANGTNPITGVSTRKSLGC 242
++ C A+G +G ST+ +L C
Sbjct: 641 VRKYFCSLASGNVADSGNSTKSTLNC 666
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN + A++ + +K A L+KA K N + ADL+ + L NL A
Sbjct: 484 NLTEANLSQANLLRVNLFQAKLGSANLQKAELMKTNLSEADLNSANLKGANLVLTNLRKA 543
Query: 187 VLVRTVLTRSDLGG-----AIIEGADFSDA 211
LV+ L+ S+LG AI+EGAD S A
Sbjct: 544 NLVKANLSDSNLGAANLNDAILEGADLSAA 573
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + DLR+ + N D+R D S + A L + +AN + A+L
Sbjct: 436 SGSVLERVDLRQVILKNANLNGVKIVKVDLRGGDLSNASAIDANLSFSNLTEANLSQANL 495
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + L ANL A L++T L+ +DL A ++GA+
Sbjct: 496 LRVNLFQAKLGSANLQKAELMKTNLSEADLNSANLKGANL 535
>gi|297824527|ref|XP_002880146.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
gi|297325985|gb|EFH56405.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
Length = 226
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 180
+ R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N +
Sbjct: 108 QTLIRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTK 162
Query: 181 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
NLTNA L T + G+ I GADF+D + Q++ LCK A+G N TG +TR +
Sbjct: 163 VNLTNANLEGATATGNTSFKGSNITGADFTDVPLRDDQREYLCKIADGVNATTGNATRDT 222
Query: 240 LGC 242
L C
Sbjct: 223 LLC 225
>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 517
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 28/180 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 216
++ + L A+L+ A L+R + +DL GA + GA +D + +DL+
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 217 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 262
+++L K+ N T PI + SL + N Y + P++ PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 66/136 (48%), Gaps = 19/136 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 164
+ A+ +A+L KA+ + AN D+ E+ S A L +A KANFT
Sbjct: 77 NVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIRAELIRAKLTKANFTQANL 136
Query: 165 -GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEGADFSDAVI 213
GADL +T + + N ANL+ A L T T++DL GA + ADFS+A +
Sbjct: 137 NGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAEL 196
Query: 214 DLAQKQALCKYANGTN 229
+QA YAN +N
Sbjct: 197 ----RQANLTYANLSN 208
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 175
AD+RE+ + FNGA L A K N AD S+ + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+NA + DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN +++D+RE + S + N A L A KA A ++ + R+ L EA L N++L+R
Sbjct: 59 ANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIR 118
Query: 191 T-----VLTRSDLGGAIIEGADFSD 210
LT+++ A + GAD +
Sbjct: 119 AELIRAKLTKANFTQANLNGADLRE 143
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 112 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + R LN A L+NA L + +L ++ + A + D ++A
Sbjct: 68 EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109
>gi|212721648|ref|NP_001132583.1| uncharacterized protein LOC100194054 [Zea mays]
gi|194694818|gb|ACF81493.1| unknown [Zea mays]
gi|413933909|gb|AFW68460.1| hypothetical protein ZEAMMB73_478838 [Zea mays]
Length = 225
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 7/128 (5%)
Query: 117 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++D
Sbjct: 102 DLRFCDYTNEKTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVID 161
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
R+ +A+LT A+ TVL+ S A ++ F D +I Q LC TN
Sbjct: 162 RVNFEKADLTGAIFKNTVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISP 216
Query: 235 STRKSLGC 242
R LGC
Sbjct: 217 DARLELGC 224
>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 390
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 10/117 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
+A ADL +A+ +K NF +A+ +SA++ +S+ + F AYL KA +A+ ADLS
Sbjct: 111 SAHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLS 170
Query: 170 DTLMDRMVLNEANLTN----------AVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ + L+ ANLT A L LT+++LG A + GA+ +DA ++LA
Sbjct: 171 SANLKDVNLSAANLTECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLNLA 227
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 25/160 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE---------------- 154
A F ADL A N + NF+ A++ ++ SGS NGA L+
Sbjct: 57 ADFSEADLSGAHLSLANLSKVNFSGANLTGANLSGSSLNGANLQGATLSAVNLESAHLNW 116
Query: 155 ----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+A+ K NF ADLS + + L AN A L++ L+ +DL A + A+ D
Sbjct: 117 ADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLSSANLKD 176
Query: 211 AVIDLAQKQALCKY--AN--GTNPITGVSTRKSLGCGNSR 246
+ A CK AN G N T+ +LG N R
Sbjct: 177 VNLSAANLTE-CKMTRANLMGANLTEADLTKANLGRANLR 215
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL +A N RAN + A++ ++ NGA+L K A+ G DLS
Sbjct: 227 ASLVEADLHQA-----NLTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSH 281
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L+ + L A L+ A LV +L ++L A + GA+ +A +
Sbjct: 282 KLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQNACL 324
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 44/90 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N AN T M ++ G+ A L KA +AN GA+L
Sbjct: 160 SEADLFQADLSSANLKDVNLSAANLTECKMTRANLMGANLTEADLTKANLGRANLRGANL 219
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+D ++ L EA+L A L R L+R++L
Sbjct: 220 TDAYLNLASLVEADLHQANLTRANLSRANL 249
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 48/108 (44%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A G DL + N A A + E++ S + +GA L+ A A+
Sbjct: 270 SGADLGGVDLSHKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQNACLINADL 329
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA +DR+ L +ANLT A L + L ++L AI+ G + A
Sbjct: 330 RGA-----YLDRVDLTDANLTGANLTKADLREANLRAAILAGVELKGA 372
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 57/123 (46%), Gaps = 11/123 (8%)
Query: 92 ADLNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
ADL + + GI A A A L A+ ++ N AN + A+++ + + G
Sbjct: 272 ADLGGVDLSHKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQNACLINADLRG 331
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
AYL++ AN TGA+L+ + L EANL A+L +L GA + GA +
Sbjct: 332 AYLDRVDLTDANLTGANLT-----KADLREANLRAAILAGV-----ELKGAQLAGATLPN 381
Query: 211 AVI 213
I
Sbjct: 382 GKI 384
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 5/90 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L +A K N RAN A++ ++ + + A L +A +AN + A+LS
Sbjct: 192 ANLMGANLTEADLTKANLGRANLRGANLTDAYLNLASLVEADLHQANLTRANLSRANLSK 251
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
T + + LN A+LT + L+ +DLGG
Sbjct: 252 TYLRDICLNGAHLT-----KVNLSGADLGG 276
>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 517
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 28/180 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 216
++ + L A+L+ A L+R + +DL GA + GA +D + +DL+
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 217 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 262
+++L K+ N T PI + SL + N Y + P++ PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 66/136 (48%), Gaps = 19/136 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 164
+ A+ +A+L KA+ + AN D+ E+ S A L +A KANFT
Sbjct: 77 NVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIRAELIRAKLTKANFTQANL 136
Query: 165 -GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEGADFSDAVI 213
GADL +T + + N ANL+ A L T T++DL GA + ADFS+A +
Sbjct: 137 NGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAEL 196
Query: 214 DLAQKQALCKYANGTN 229
+QA YAN +N
Sbjct: 197 ----RQANLTYANLSN 208
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 175
AD+RE+ + FNGA L A K N AD S+ + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+NA + DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN +++D+RE + S + N A L A KA A ++ + R+ L EA L N++L+R
Sbjct: 59 ANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIR 118
Query: 191 T-----VLTRSDLGGAIIEGADFSD 210
LT+++ A + GAD +
Sbjct: 119 AELIRAKLTKANFTQANLNGADLRE 143
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 112 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + R LN A L+NA L + +L ++ + A + D ++A
Sbjct: 68 EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109
>gi|326523645|dbj|BAJ92993.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524189|dbj|BAJ97105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 200
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 65/123 (52%), Gaps = 6/123 (4%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 180
++F D + S + F GA L A + A+ TGADLSD + D + N +
Sbjct: 77 KDFSGQTLIKQDFKTSILRQTNFKGANLLGASFFDADLTGADLSDADLRNADFSLANVTK 136
Query: 181 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 239
NLTNA L ++T + G+ I GADF+D + Q+ LCK A+G N TG +T+++
Sbjct: 137 VNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIADGVNTTTGNATKET 196
Query: 240 LGC 242
L C
Sbjct: 197 LFC 199
>gi|302779862|ref|XP_002971706.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
gi|300160838|gb|EFJ27455.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
Length = 157
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 59/118 (50%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
KE + ++A M ++ F G+ + KA A +F G D ++ ++DR+V ++A++
Sbjct: 43 KEGLKGKTLSAALMADAKFDGADMTEVVMSKAYAVGGSFKGTDFTNAVLDRVVFDKADMK 102
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S GA +E ADF +A+I + LC NP + L C
Sbjct: 103 GAVFRNTVLSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155
>gi|323452967|gb|EGB08840.1| hypothetical protein AURANDRAFT_25565 [Aureococcus anophagefferens]
Length = 176
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 68/135 (50%), Gaps = 5/135 (3%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G + A ++ + F +F+ D +++F+ SK GA KA +A+F+GAD
Sbjct: 46 GGGKDYAEATIKGQDFSGKTFNNKDFSGCDAVDTNFAKSKLRGARFFKADLARADFSGAD 105
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
LS ++ L LT A+ T +++ L + GADF+DAVI ++ LC G
Sbjct: 106 LSAASLEGANLEGTKLTGALAEGTAFSQTILDAGDLTGADFTDAVIQPYVQKGLC----G 161
Query: 228 TNPITGVSTRKSLGC 242
+TG +TR SL C
Sbjct: 162 RKDVTG-ATRDSLFC 175
>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
Length = 439
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 15/118 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRA---------------NFTSADMRESDFSGSKFNGAYLEK 155
A+ G DLRKA K +F RA NF ADM+E++ G+ GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A A+ +GA+L ++ +L ANL A+L L ++L A ++GAD + A +
Sbjct: 345 AFLKGADLSGANLKGAILYGAMLYGANLDGAILTNVSLFDANLEKASLKGADLTGATL 402
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L A K N +A+ + A + +++ G+ + YL+KA N A L
Sbjct: 56 SKANLEDAKLNGANLSKANLSKADLSGASLDKANLEGANLSMTYLKKANMKAVNAAHAWL 115
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+D ++ + +A+L A L R L + + GA +E A DAV+
Sbjct: 116 ADANLNGAFMKDASLKAANLARANLRWAKMSGADLEQASLKDAVL 160
>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
Length = 275
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 14/128 (10%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD--------- 142
+D + EAE R +F S + F A++R K N +ANF AD+R+ D
Sbjct: 85 SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140
Query: 143 -FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
F G+ ++VA KA+F GA + D ++R+ LN AN +A + + L R A
Sbjct: 141 IFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGANFQDARMRQAKLDRVKAQNA 200
Query: 202 IIEGADFS 209
GADFS
Sbjct: 201 NFSGADFS 208
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 62/124 (50%), Gaps = 8/124 (6%)
Query: 99 AETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY--- 152
AE RG E G + DL++A+ NF+ ++F + +DFSGS F+GA
Sbjct: 50 AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109
Query: 153 --LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
LEKA KANF ADL D ++ + NEA A + + TRS A +GA D
Sbjct: 110 VDLEKANLNKANFQDADLRDGDLNTVEANEAIFDGADMRNVLFTRSVANKASFKGAKMDD 169
Query: 211 AVID 214
A ++
Sbjct: 170 ANLE 173
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 93 DLNKYEAETRGEFGIGSAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSK 147
DLN EA + A F AD+R ++V K +F+ A A++ D +G+
Sbjct: 131 DLNTVEA---------NEAIFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGAN 181
Query: 148 FNGAY-----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
F A L++ A ANF+GAD S + L ANLT +L R+ L GA
Sbjct: 182 FQDARMRQAKLDRVKAQNANFSGADFSGVRLVSSDLTGANLTGVDFDGALLRRTRLAGAD 241
Query: 203 IEGA 206
+ GA
Sbjct: 242 LSGA 245
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
G A+LR V +F N D++E+ + F + + A +A+F+G+D S
Sbjct: 47 LGLAELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGAN 106
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 213
M + L +ANL A L DL AI +GAD + +
Sbjct: 107 MRSVDLEKANLNKANFQDADLRDGDLNTVEANEAIFDGADMRNVLF 152
>gi|302819846|ref|XP_002991592.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
gi|300140625|gb|EFJ07346.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
Length = 157
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 60/118 (50%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K+ + ++A M ++ F G+ + KA A A+F G D ++ ++DR+V ++A++
Sbjct: 43 KDGLKGKTLSAALMADAKFDGADMTEVVMSKAYAVGASFKGTDFTNAVLDRVVFDKADMK 102
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S GA +E ADF +A+I + LC NP + L C
Sbjct: 103 GAVFRNTVLSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155
>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 267
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 56/105 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +A+ +KA + +N T AD+ ++D +G + A L +A + NFTG DL
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTGGDL 191
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S ++ L E NLT L L+R++L G ++ GA+ + ++
Sbjct: 192 SRVMLVGANLAETNLTAVNLSDANLSRAELNGVVLAGANLNRVIL 236
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 58/104 (55%), Gaps = 5/104 (4%)
Query: 112 QFGSADLRKA--VHVK---ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
F A+L KA VH NF A ++A++ +++ S + F A L A + +N T A
Sbjct: 100 NFSEANLIKANLVHAALYCANFFMAMMSAANLSQANMSAANFQKATLISAYLHNSNLTQA 159
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
DLSD + + L++ANL+ A L+RT T DL ++ GA+ ++
Sbjct: 160 DLSDADLTGINLSDANLSQATLIRTNFTGGDLSRVMLVGANLAE 203
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 15/118 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVA 158
S A A+L +A N RAN + A ++ E +FS + A L A
Sbjct: 57 SGANLSGANLIRANLTGANLSRANLSGATLAEVNLSRTNLTEVNFSEANLIKANLVHAAL 116
Query: 159 YKANF-----TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
Y ANF + A+LS M +A L +A L + LT++DL A + G + SDA
Sbjct: 117 YCANFFMAMMSAANLSQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDA 174
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 165
++ DLR N + D+RE++ SG+ +GA L +A +AN +G
Sbjct: 24 SELSQMDLRGMSLCGANLAGMDLRGKDLREANLSGANLSGANLIRANLTGANLSRANLSG 83
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
A L++ + R L E N + A L++ L + L
Sbjct: 84 ATLAEVNLSRTNLTEVNFSEANLIKANLVHAAL 116
>gi|340707640|pdb|3N90|A Chain A, The 1.7 Angstrom Resolution Crystal Structure Of
At2g44920, A Pentapeptide Repeat Protein From
Arabidopsis Thaliana Thylakoid Lumen
Length = 152
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 68/119 (57%), Gaps = 11/119 (9%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 184
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NLT
Sbjct: 30 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNLT 84
Query: 185 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
NA L T++ + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 85 NANLEGATMMGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 143
>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
Length = 367
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 59/108 (54%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
S A ADL +A N RRAN + A++ E+D SG+ +GA L +A +A+ +G
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 212
Query: 166 --ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS + L+ ANL+ A L L+R+DL GA + AD S A
Sbjct: 213 SRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRRADLSGA 260
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL +A + + AN + A++ E+D S + +GA L +A AN +GA+L
Sbjct: 98 SGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSGANL 157
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ + R L+ ANL A L L+ +DL GA + GA+ S+A
Sbjct: 158 SEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEA 200
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL +A N RA+ + A++ E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 252
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ L A+L+ A L R L+ ++L A + GAD
Sbjct: 253 RRADLSGANLRRADLSGANLRRADLSEANLSEANLSGADL 292
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 7/144 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A + + RA+ + A++ E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 113 SEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 172
Query: 169 SDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ L+E ANL+ A L L+R+DL GA + AD S A +L++
Sbjct: 173 RRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANLSRADLSGA--NLSEADLSGA 230
Query: 224 YANGTNPITGVSTRKSLGCGNSRR 247
+G N +R L N RR
Sbjct: 231 NLSGANLSEADLSRADLSGANLRR 254
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 119 RKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+KA+ + N AN + A+ + E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 84 KKAI-LDYNLSGANLSGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANL 142
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ + L+ ANL+ A L R L+ ++L A + GA+ S+A
Sbjct: 143 SEADLSGANLSGANLSEADLSRADLSGANLRRANLSGANLSEA 185
>gi|298243143|ref|ZP_06966950.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297556197|gb|EFH90061.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 338
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 54/90 (60%), Gaps = 10/90 (11%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN A++RE+DFSG+ +G+ + +GADLS ++ R +L A+L+ A+L
Sbjct: 95 ANLVGANLREADFSGNDLSGS----------DLSGADLSRAILRRAILRRADLSEAILRD 144
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
VL R+DL A + GAD +DA + A++ A
Sbjct: 145 AVLRRADLTDADLRGADLTDADLTGAKRDA 174
>gi|428222198|ref|YP_007106368.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995538|gb|AFY74233.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 225
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 10/106 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ ADL A N +A + A++ ++ SG+ + +L +AV AN ADL+
Sbjct: 26 AELNDADLSGA-----NLSKARMSGAELNRANMSGANLHSTHLNRAVMKNANLENADLTG 80
Query: 171 TLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAIIEGADFSDA 211
M + L+EANLTNA L V + LT ++L GAI+ ADFS++
Sbjct: 81 AKMMEVNLSEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNS 126
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 58/122 (47%), Gaps = 10/122 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
S A +A+L V+ N AN A + +DFS S + GA L+ A+ N
Sbjct: 89 SEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNL 148
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
TGADLS + + L+ ANL+ A L +L GGA I A+F+ + A + +
Sbjct: 149 TGADLSGINLKGVNLSGANLSMANLSGAIL-----GGANITKANFAQTDLSNADLRDVNI 203
Query: 224 YA 225
YA
Sbjct: 204 YA 205
>gi|449016876|dbj|BAM80278.1| similar to thylakoid lumenal 17.4 kD protein, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 288
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 59/128 (46%), Gaps = 18/128 (14%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLS---------------DTLMDRMVLNEA 181
D+R DFSG +G LE A A +A F LS D ++DR+ A
Sbjct: 161 DLRGRDFSGYDLSGVLLEGATADEARFRSTQLSKAYAPGFKCRRCDFEDAVVDRVNFENA 220
Query: 182 NLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYA--NGTNPITGVSTRK 238
+L+ +V VL+ S G + DF+D I + LC+ +G NP+TG TR
Sbjct: 221 DLSGSVFRNAVLSDSMFSDGTNVRDVDFTDVYIGEYGLRRLCRNPTLDGENPLTGAPTRA 280
Query: 239 SLGCGNSR 246
SLGC R
Sbjct: 281 SLGCRAER 288
>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 508
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 10/143 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F DLR+A + N AN + A++R +D SG+ GA L +A AN GA+LS+
Sbjct: 181 ADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANLYGANLSN 240
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
ANLTNA LV LT ++L GA GAD S + + A+ + ++
Sbjct: 241 ----------ANLTNASLVHADLTLANLNGADWVGADLSGSTLSGAKLYDVPRFGIKAEE 290
Query: 231 ITGVSTRKSLGCGNSRRNAYGSP 253
+T S NS+ +GSP
Sbjct: 291 VTCEWVDLSSNGDNSQVYRFGSP 313
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 13/152 (8%)
Query: 73 STALAAAVVASCSSNISAL--ADLNKYE----AETRGEFGIG-------SAAQFGSADLR 119
S+ L A++ + N++ L ADL++ + A RGE S A ADLR
Sbjct: 75 SSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANLTGADLR 134
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
+A + NF AN + A++R + + + F A L A KA+ GAD S T + + L
Sbjct: 135 EAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKADLNGADFSGTDLRQANLC 194
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ NL+ A L L +DL GA + GAD ++A
Sbjct: 195 QVNLSGANLSGANLRWADLSGANLRGADLNEA 226
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 163
+ A+ S+ L +A+ AN AD+ E+ G+ + A L KA KAN
Sbjct: 69 NVARLSSSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANL 128
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
TGADL + + + +EANL+ A L T ++ A + GAD S A
Sbjct: 129 TGADLREAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKA 176
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 45/80 (56%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NFT ++ E++ S + A L +A + N +GA+L++ + LN A L+++ LVR
Sbjct: 22 NFTGINLNEANLSRINLSQANLSEASLFVTNLSGANLNEVNLSNANLNVARLSSSHLVRA 81
Query: 192 VLTRSDLGGAIIEGADFSDA 211
+L + L A + AD S+A
Sbjct: 82 ILQGATLNVANLVRADLSEA 101
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N R N + A++ E+ + +GA L + AN A LS + + R +L A L A
Sbjct: 32 NLSRINLSQANLSEASLFVTNLSGANLNEVNLSNANLNVARLSSSHLVRAILQGATLNVA 91
Query: 187 VLVRTVLTRSDL-GGAIIEG 205
LVR L+ + L G A+I G
Sbjct: 92 NLVRADLSEAQLMGAALIRG 111
>gi|427713339|ref|YP_007061963.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427377468|gb|AFY61420.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 327
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 60/122 (49%), Gaps = 16/122 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ ADLR AV N A+ AD+R G+ GA L K KAN TGADL
Sbjct: 48 SGAKLQRADLRGAVLSAINLNHADLIGADLR-----GAMLMGADLRKVNLRKANLTGADL 102
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANG 227
+ ANLT A+L LT +D+ AI+ GAD + + LA+ +Q AN
Sbjct: 103 T----------RANLTGAILSEANLTAADMSQAILRGADLTLTDLTLAELEQVNLSQANL 152
Query: 228 TN 229
TN
Sbjct: 153 TN 154
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 66/152 (43%), Gaps = 45/152 (29%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLRK N R+AN T AD+ ++ +G+ + A L A +A GADL+
Sbjct: 80 AMLMGADLRKV-----NLRKANLTGADLTRANLTGAILSEANLTAADMSQAILRGADLTL 134
Query: 171 T-----LMDRMVLNEANLTNA----------VLVRTVLTRSDLGGA-------------- 201
T ++++ L++ANLTNA +L+ L +++L GA
Sbjct: 135 TDLTLAELEQVNLSQANLTNAYLRGADMADAILLEATLIQANLRGANLRNCNLQGANLQK 194
Query: 202 -----------IIEGADFSDAVIDLAQKQALC 222
+EGA+ +A + A + C
Sbjct: 195 TNLRGANLRQARLEGANLREATLTEANLRYAC 226
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 10/108 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A AD+ A+ ++ +AN A++R + G+ L A +A GA+L +
Sbjct: 155 AYLRGADMADAILLEATLIQANLRGANLRNCNLQGANLQKTNLRGANLRQARLEGANLRE 214
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 213
L EANL A L L +DL GA ++ GA ++A++
Sbjct: 215 A-----TLTEANLRYACLDEACLIGADLRGASLARAMLRGAQLNEAIL 257
>gi|33240880|ref|NP_875822.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33238409|gb|AAQ00475.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 184
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 65/126 (51%), Gaps = 8/126 (6%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L ++H +N + + D+ D + +G+Y + KA+ GA++ + +
Sbjct: 46 LDTSLH-GQNLQNTEYVKYDLSGRDLGDADLSGSYFSVSNLQKADLRGANMQNVIAYATR 104
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+ A+L+NA L +S GA+I+G +F++AV+DL Q ++LC+ A G T
Sbjct: 105 FDNADLSNANFSGAELLKSRFDGAVIDGTNFTNAVLDLPQVKSLCERATG-------QTA 157
Query: 238 KSLGCG 243
+SL CG
Sbjct: 158 ESLECG 163
>gi|150016367|ref|YP_001308621.1| pentapeptide repeat-containing protein [Clostridium beijerinckii
NCIMB 8052]
gi|149902832|gb|ABR33665.1| pentapeptide repeat protein [Clostridium beijerinckii NCIMB 8052]
Length = 1084
Score = 60.5 bits (145), Expect = 8e-07, Method: Composition-based stats.
Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 10/144 (6%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADL++ + G S F ADL A+ V+ +A+F+ A + E+ G+ FN +
Sbjct: 914 ADLSRASMDYTGL----SYCNFEKADLSYAILVESGVSKADFSEASLSEAHIEGTFFNKS 969
Query: 152 YLEKAV-----AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
EKA ++++F + + + V+ E+N NA + T L DL A + GA
Sbjct: 970 KFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESNFKNATFINTCLRNVDLEEADLTGA 1029
Query: 207 DFSDAVIDLAQ-KQALCKYANGTN 229
D S+A + A+ +A+ + N TN
Sbjct: 1030 DMSNANLSNAKINKAIFEGTNLTN 1053
Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats.
Identities = 42/127 (33%), Positives = 54/127 (42%), Gaps = 22/127 (17%)
Query: 109 SAAQFGSADLRKAVHV------KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
S A F A L +A H+ K F +A+ M SDF FN A L AV ++N
Sbjct: 947 SKADFSEASLSEA-HIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESN 1005
Query: 163 FTGADLSDTLMDRMVLNEANLT---------------NAVLVRTVLTRSDLGGAIIEGAD 207
F A +T + + L EA+LT A+ T LT DL IE D
Sbjct: 1006 FKNATFINTCLRNVDLEEADLTGADMSNANLSNAKINKAIFEGTNLTNVDLTNVDIENID 1065
Query: 208 FSDAVID 214
FS +ID
Sbjct: 1066 FSKTIID 1072
Score = 43.1 bits (100), Expect = 0.15, Method: Composition-based stats.
Identities = 33/113 (29%), Positives = 52/113 (46%), Gaps = 11/113 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES--DFSG---SKFNGAYLEKAV-----AYK 160
A FG A+L + H+ NF AD+ + D++G F A L A+ K
Sbjct: 890 ANFGYANLNDS-HISGTLYNCNFKEADLSRASMDYTGLSYCNFEKADLSYAILVESGVSK 948
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+F+ A LS+ ++ N++ A L+ T + RSD A+ S AV+
Sbjct: 949 ADFSEASLSEAHIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVM 1001
Score = 40.4 bits (93), Expect = 0.92, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 43/104 (41%), Gaps = 11/104 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A L K N ANF A++ +S SG+ +N NF ADLS
Sbjct: 870 ADFSYAKLDNLEIGKLNAENANFGYANLNDSHISGTLYN-----------CNFKEADLSR 918
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
MD L+ N A L +L S + A A S+A I+
Sbjct: 919 ASMDYTGLSYCNFEKADLSYAILVESGVSKADFSEASLSEAHIE 962
>gi|428219581|ref|YP_007104046.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427991363|gb|AFY71618.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 508
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 74/140 (52%), Gaps = 8/140 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A F A+L A K + ANF+ AD+R ++ SG+ NGA L +A +AN
Sbjct: 172 SVASFNGANLTGASLAKLDLSGLDLSDANFSGADLRGANLSGANLNGADLSRANLSRANL 231
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALC 222
+ A+LS T R LNEANL+ A L + L+R+DL A + AD A + +++ A
Sbjct: 232 SRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANLSMSKLAGAYL 291
Query: 223 KYAN--GTNPITGVSTRKSL 240
AN GTN I+ TR L
Sbjct: 292 VRANLLGTNLISADLTRAVL 311
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 81/180 (45%), Gaps = 26/180 (14%)
Query: 109 SAAQFGSADLRKAVHVKEN----------FRRANFTSADMRESDFSG-----SKFNGAYL 153
S A DL KA V+ N F AN T A + + D SG + F+GA L
Sbjct: 147 SGANLSQVDLSKATLVEANLKDAKLSVASFNGANLTGASLAKLDLSGLDLSDANFSGADL 206
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A AN GADLS + R L+ ANL+ VRT L ++L A + G++ S A
Sbjct: 207 RGANLSGANLNGADLSRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLSRA-- 264
Query: 214 DLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK--LLDRDGF 271
DL++ + +G N +S K G R N G + L+SA + L++ D F
Sbjct: 265 DLSRANLIKADLHGAN----LSMSKLAGAYLVRANLLG---TNLISADLTRAVLIEADLF 317
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADL +AV ++ + RAN T A++ +D + + A +A AN G DL+ +
Sbjct: 303 SADLTRAVLIEADLFRANLTEANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLT 362
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ +A + A+L++T L+ + L GA A+ S A++
Sbjct: 363 GVYAIDAEIVGAILIKTNLSEASLAGANFVRANLSRAIL 401
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ + RAN AD+ ++ S SK GAYL +A N ADL+ R VL EA+L
Sbjct: 263 RADLSRANLIKADLHGANLSMSKLAGAYLVRANLLGTNLISADLT-----RAVLIEADLF 317
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDA 211
A L L+R+DL A + A F +A
Sbjct: 318 RANLTEANLSRADLNRANLTEASFIEA 344
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ DL +A + N +RAN T A + +D A L +A +AN GA+LS
Sbjct: 49 AELSRIDLSRADLSESNLKRANLTEAVLVGADLISINLGRATLTEANLNRANLIGANLSG 108
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+L EA+L L + LT++DL GA + GAD S A
Sbjct: 109 A-----ILVEADLARCDLRVSNLTKADLMGANLSGADLSVA 144
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L +A + NF R A++ E+ SGS + A L +A KA+ GA+L
Sbjct: 222 SRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANL 281
Query: 169 SDTLMDRMVLNEANL--TNAV---LVRTVLTRSDLGGAIIEGADFSDA 211
S + + L ANL TN + L R VL +DL A + A+ S A
Sbjct: 282 SMSKLAGAYLVRANLLGTNLISADLTRAVLIEADLFRANLTEANLSRA 329
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 15/116 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY---------------LEK 155
A ADL +A + +F AN SA++ +D + + G Y L +
Sbjct: 324 ANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLTGVYAIDAEIVGAILIKTNLSE 383
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A ANF A+LS ++ L+EANL A L ++ ++L GA +E AD S A
Sbjct: 384 ASLAGANFVRANLSRAILSGASLSEANLGRANLYGANMSEANLSGANLENADLSRA 439
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 45/101 (44%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A DLR + K + AN + AD+ ++ SG+ + L KA +AN A LS
Sbjct: 114 ADLARCDLRVSNLTKADLMGANLSGADLSVANLSGANLSQVDLSKATLVEANLKDAKLSV 173
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
N ANLT A L + L+ DL A GAD A
Sbjct: 174 A-----SFNGANLTGASLAKLDLSGLDLSDANFSGADLRGA 209
>gi|428308708|ref|YP_007119685.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250320|gb|AFZ16279.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 294
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 48/84 (57%), Gaps = 5/84 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NFRRA T+A + G+ GA L A + N GADLS ++R L +ANLT A
Sbjct: 186 NFRRAKLTAATL-----EGANLTGANLTDAQLNRVNLQGADLSGANLERACLEDANLTGA 240
Query: 187 VLVRTVLTRSDLGGAIIEGADFSD 210
+L RT L+ +++ G + G DFSD
Sbjct: 241 ILRRTQLSEANMSGTKLYGVDFSD 264
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
++ RAN + A M +S G+K +GA L A AN GA+L + ++R+ L +ANL
Sbjct: 78 IETELTRANLSGAFMVKSLLPGAKMSGADLMGANLRGANLWGANLCGSQLERVNLRDANL 137
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFS 209
L+ + L GA++ G+ +
Sbjct: 138 MGVNFKWANLSEARLMGAMLYGSSLN 163
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 50/111 (45%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA----------YK 160
+Q +LR A + NF+ AN + A + + GS N A + +A
Sbjct: 125 SQLERVNLRDANLMGVNFKWANLSEARLMGAMLYGSSLNFANMSRAWLKGVDLGGFNLEG 184
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NF A L+ ++ L ANLT+A L R L +DL GA +E A DA
Sbjct: 185 VNFRRAKLTAATLEGANLTGANLTDAQLNRVNLQGADLSGANLERACLEDA 235
>gi|413947393|gb|AFW80042.1| putative homeobox DNA-binding domain superfamily protein [Zea mays]
Length = 202
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 1/114 (0%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
+ +F ++ +R+++F G+ GA A A+ + ADL + L +ANL+NA L
Sbjct: 88 KQDFKTSILRQANFKGANLLGASFFDADLTSADLSDADLRGADLSLANLTKANLSNANLE 147
Query: 190 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ T + GA I GADF+D + Q++ LCK A+G N TG T+++L C
Sbjct: 148 GALATGNTSFKGADITGADFTDVPLRDDQREYLCKIADGVNSTTGNPTKETLFC 201
>gi|427729960|ref|YP_007076197.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427365879|gb|AFY48600.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 937
Score = 60.1 bits (144), Expect = 1e-06, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 57/104 (54%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A A+L+ A + N + AN A++ ++ G+ GA L++A+ +A GA+L
Sbjct: 812 GANLYGANLQGANLQRANLQGANLQRANLYGANLEGANLYGANLQRAILQRAILEGANLQ 871
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ R L ANL A+L R L ++L GA +EGA+ +A++
Sbjct: 872 RAILQRANLEGANLQRAILQRANLEGANLEGANLEGANLQEAIL 915
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S+ F A+ ++A N + AN A+++ ++ + GA L++A Y AN GA
Sbjct: 789 ILSSKDFYMANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGANLEGA 848
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+L + R +L A L A L R +L R++L EGA+ A++ A
Sbjct: 849 NLYGANLQRAILQRAILEGANLQRAILQRANL-----EGANLQRAILQRA 893
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 33/100 (33%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ + ++F ANF A+++ ++ G+ GA L+ A +AN GA+L +
Sbjct: 784 DLQNCILSSKDFYMANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGA 843
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
L ANL A L R +L R AI+EGA+ A++ A
Sbjct: 844 NLEGANLYGANLQRAILQR-----AILEGANLQRAILQRA 878
>gi|218187501|gb|EEC69928.1| hypothetical protein OsI_00358 [Oryza sativa Indica Group]
Length = 191
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 14/117 (11%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
R +F ++ +R+++F G+K GA + A+ TGADLSD L A+ + A +
Sbjct: 84 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDA-----DLRGADFSLANVS 133
Query: 190 RTVLTRSDLGGAIIEG----ADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ LT ++L GA+ G DF+D + Q++ LCK A+G N TG +T+++L C
Sbjct: 134 KVNLTNANLEGALATGNTTFKDFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 190
>gi|434384824|ref|YP_007095435.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428015814|gb|AFY91908.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 377
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 62/115 (53%), Gaps = 5/115 (4%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL + ++ N +RAN A++ +D G+ GA L+KA +AN GA+L ++ +
Sbjct: 200 DLAQTNLIRANLKRANLQGANLEGADLEGANLQGANLKKANLKRANLQGANLMIANLEGI 259
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDAVIDLAQKQALCKYAN 226
L ANL A+L+R L ++L GA +EG A+F A + A QA +AN
Sbjct: 260 NLVRANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANLQACHGHAN 314
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N RAN A + ++ G+ GA LE A+ ANF GA LS + + AN A
Sbjct: 260 NLVRANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANL-QACHGHANFAGA 318
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L + +DL GA +EGA+ A
Sbjct: 319 YLSKANFEGADLEGANLEGANLQRA 343
>gi|242034055|ref|XP_002464422.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
gi|241918276|gb|EER91420.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
Length = 221
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N + + +A M E+ F G+ + + KA A A+F G D ++ ++DR+ +A+LT
Sbjct: 108 KTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLT 167
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ VL+ S A ++ F D +I Q LC TN +R LGC
Sbjct: 168 GAIFKNAVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISPDSRLELGC 220
>gi|91070460|gb|ABE11370.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
HOT0M-10G7]
Length = 157
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 66/134 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A +G L A + + A F D+++++ SG + A L A N + ++L
Sbjct: 21 AALDYGKQSLVGADFSGSDLKGATFYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNL 80
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ +D VL+ +L+N L + + I+GADF++ + + C+ A+GT
Sbjct: 81 REVTLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVREFCEIASGT 140
Query: 229 NPITGVSTRKSLGC 242
NPIT TR++L C
Sbjct: 141 NPITNRDTRETLEC 154
>gi|428224166|ref|YP_007108263.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984067|gb|AFY65211.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 583
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 61/107 (57%), Gaps = 7/107 (6%)
Query: 111 AQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A G A+LR+ VH++E N R A A + E++ SG A L + ++A GADLS
Sbjct: 155 ADLGGANLRE-VHLEEANLREAKLVEASLIEANLSGCYLRQANLSGSDLHRAILAGADLS 213
Query: 170 DTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADFSDA 211
+ ++ L+ ANLT A L++T L R+DL A++ ADFS+A
Sbjct: 214 EAVLHGADLSRANLTGAYLLKTSLRNARLLRADLQDALLLRADFSEA 260
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL A N + + + AD+ + S + N A L +A + AN A L+ L
Sbjct: 17 FAQVDLTGANLSGANLQDIDLSGADLTGVNLSWAYLNRANLTEASLHHANLRNASLNSAL 76
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+DR VL+ A+LT A L +L +D AI++ AD S A + AQ
Sbjct: 77 LDRAVLSGADLTKAELCLALLRGADCNWAILQEADLSGANLHGAQ 121
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ S L +A ++ N +RA+ +AD+ G+ +LE+A +A A L +
Sbjct: 130 AKLNSTLLNEAKLMEANLKRASLVNADL-----GGANLREVHLEEANLREAKLVEASLIE 184
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L +ANL+ + L R +L +DL A++ GAD S A
Sbjct: 185 ANLSGCYLRQANLSGSDLHRAILAGADLSEAVLHGADLSRA 225
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 118 LRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
LR+A + RA AD+ E+ D S + GAYL K A ADL D L
Sbjct: 192 LRQANLSGSDLHRAILAGADLSEAVLHGADLSRANLTGAYLLKTSLRNARLLRADLQDAL 251
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ R +EANL A L R L+ + L +I+ AD +A +
Sbjct: 252 LLRADFSEANLRGADLRRADLSGAYLSHSILCEADLGEAYL 292
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 4/114 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A +K + R A AD++++ + F+ A L A +A+ +GA LS
Sbjct: 220 ADLSRANLTGAYLLKTSLRNARLLRADLQDALLLRADFSEANLRGADLRRADLSGAYLSH 279
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
+++ L EA L + +RT L + L G I+ D +DL+ Q C+Y
Sbjct: 280 SILCEADLGEAYLLQSHFIRTNLDNACLTGCCIDNWQLED--VDLSNVQ--CQY 329
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA R A+ A ++E+D SG+ +GA L++ +A L+
Sbjct: 80 AVLSGADLTKAELCLALLRGADCNWAILQEADLSGANLHGAQLDQVTLERAK-----LNS 134
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
TL++ L EANL A LV +DLGGA + +A
Sbjct: 135 TLLNEAKLMEANLKRASLV-----NADLGGANLREVHLEEA 170
>gi|443476541|ref|ZP_21066442.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018491|gb|ELS32731.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 400
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 89/188 (47%), Gaps = 29/188 (15%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----G 145
L + N EA F I A A L +A V N AN TSA M +D S G
Sbjct: 61 LVEANLAEANLTSAFLI--RADLQRACLNQAYLVAANLNSANLTSASMVNADLSLATLTG 118
Query: 146 SKFNGAYLEKA-----VAYKANFTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTR 195
+ NGA L +A ++N GADLSD+ LM + L+ ANL+ A L+ LT
Sbjct: 119 ACLNGANLSRAKLNGTFFIESNLLGADLSDSDFTGALMIKANLSGANLSQACLMNVDLTE 178
Query: 196 SDLGGAIIEGADFSDAVIDLAQKQAL-CKYANGTNPITGVSTRKS-------LGCGNSRR 247
++L GA ++G D + A+++ A A+ YAN ++GVS ++ LG +
Sbjct: 179 ANLTGAELQGVDLAGAILNAANLNAVDLVYAN----LSGVSLSRANLSWANLLGTNLEKT 234
Query: 248 NAYGSPSS 255
N GS S
Sbjct: 235 NLVGSDLS 242
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 46/92 (50%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+A+L A N AN + AD+ D GS L A+ +AN TGA+L +
Sbjct: 280 NLSNANLSGANLSGANLMGANLSGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEA 339
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
+++ LN ANL A L R LT ++L GA +
Sbjct: 340 VLNGASLNRANLNRASLTRASLTGANLKGAFM 371
Score = 43.9 bits (102), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +L A +K N AN ++ ++ SG+ +GA L AN +GADLS+
Sbjct: 254 ADLSWTNLTGAFLMKSNLSGANLNGVNLSNANLSGANLSGANL-----MGANLSGADLSN 308
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ L NL NA+L LT ++L A++ GA + A ++
Sbjct: 309 VDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLN 352
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 50/99 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L + + + N T A + +S+ SG+ NG L A AN +GA+L +
Sbjct: 244 ANLNETNLAEADLSWTNLTGAFLMKSNLSGANLNGVNLSNANLSGANLSGANLMGANLSG 303
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L+ +L + L+RT L + L A + GA+ +AV++
Sbjct: 304 ADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLN 342
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 15/104 (14%)
Query: 113 FGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F A+L K++ N + R N + A + ++ S + GA+L +A +AN A+
Sbjct: 6 FTKANLTKSILEGINLKGADLKRVNLSEAKLADAKLSKANLTGAFLHRADLNRANLVEAN 65
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+ EANLT+A L+R L R+ L A + A+ + A
Sbjct: 66 LA----------EANLTSAFLIRADLQRACLNQAYLVAANLNSA 99
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 10/100 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A + DLR + ++ N AN T A++ E+ +G+ N A L +A +A+
Sbjct: 302 SGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLNRASLTRASL 361
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
TGA+L M NL A ++ T L +++ GAI+
Sbjct: 362 TGANLKGAFMLW-----TNLRGAFMLWTNLDGANMTGAIL 396
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 17/107 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +L K V + AN ++ E+D S + GA+L K+N +GA+L
Sbjct: 222 SWANLLGTNLEKTNLVGSDLSWANLNETNLAEADLSWTNLTGAFL-----MKSNLSGANL 276
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
N NL+NA L L+ ++L GA + GAD S+ +DL
Sbjct: 277 ----------NGVNLSNANLSGANLSGANLMGANLSGADLSN--VDL 311
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 53/124 (42%), Gaps = 27/124 (21%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
LAD +A G F ADL +A V+ N AN TSA + +D + N
Sbjct: 36 LADAKLSKANLTGAF-------LHRADLNRANLVEANLAEANLTSAFLIRADLQRACLNQ 88
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
AYL A LN ANLT+A +V L+ + L GA + GA+ S
Sbjct: 89 AYLVAAN--------------------LNSANLTSASMVNADLSLATLTGACLNGANLSR 128
Query: 211 AVID 214
A ++
Sbjct: 129 AKLN 132
>gi|159903945|ref|YP_001551289.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
gi|159889121|gb|ABX09335.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
Length = 184
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 63/125 (50%), Gaps = 8/125 (6%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L ++H +N + + D+ D + +G+Y + A+ GA+L + +
Sbjct: 46 LDTSLH-GQNLQNTEYVKYDLSGRDLGDANLSGSYFSVSSLKNADLRGANLQNVIAYATR 104
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+ A+L+ A L L +S GA+IEG DF++AV+DL Q ++LC+ A G T
Sbjct: 105 FDNADLSGANLSGAELLKSVFNGAVIEGTDFTNAVLDLPQVKSLCERATG-------KTA 157
Query: 238 KSLGC 242
+SL C
Sbjct: 158 ESLQC 162
>gi|397645344|gb|EJK76787.1| hypothetical protein THAOC_01435 [Thalassiosira oceanica]
Length = 224
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 57/118 (48%), Gaps = 2/118 (1%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N++ +FT + + FS S G KA A+F+GAD + ++ ANL N
Sbjct: 106 NYKGKDFTQIIAKGTIFSKSNLQGCRFYKAYLVNADFSGADARGAAFEDTSMDGANLRNI 165
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 242
V + +S L +EG DF+DA I + +C + GTNP TG TR SL C
Sbjct: 166 VASGSYFGQSLLDVESLEGGDFTDAQIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 223
>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
8327]
gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 439
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 72/128 (56%), Gaps = 5/128 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + G A+L + N ++++F SAD+ +++ +G+ G +A KAN GA+L
Sbjct: 279 SEEKLGDANLEEVDLSNANLKQSDFESADLDKANLAGANLAGGNFSRADMEKANLKGANL 338
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANG 227
++DR + +A+L+NA L L + L GA ++GAD ++A + D ++A K G
Sbjct: 339 EGAVLDRAFMKQADLSNANLRNANLFGAMLSGANLDGADLTNASLFDANLEKASLK---G 395
Query: 228 TNPITGVS 235
TN +TG +
Sbjct: 396 TN-LTGAN 402
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 68/149 (45%), Gaps = 7/149 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LRKA +RA+ AD+ E+ + A+L+ A +AN +G +L
Sbjct: 81 SGASLDQANLRKANLSMTYLKRADLKKADLSEAWMVSANLRDAFLKDARLSRANLSGTNL 140
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQALCKYANG 227
+ L +ANL +A L T R++L G + A F +AV++ A K +N
Sbjct: 141 RWAKLWDADLGQANLKDANLFETSFERANLKGTLFTKARFLENAVMNDA------KVSNN 194
Query: 228 TNPITGVSTRKSLGCGNSRRNAYGSPSSP 256
T +G + ++ R PS+P
Sbjct: 195 TVIPSGEPASRGWAMRHNSRFVQEEPSAP 223
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 15/128 (11%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-T 171
F SADL KA N NF+ ADM +++ G+ GA L++A +A+ + A+L +
Sbjct: 303 FESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLSNANLRNAN 362
Query: 172 LMDRMV--------------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L M+ L +ANL A L T LT ++L G + GA S + + +
Sbjct: 363 LFGAMLSGANLDGADLTNASLFDANLEKASLKGTNLTGANLIGINLTGAAISSSTLTPSG 422
Query: 218 KQALCKYA 225
K A +A
Sbjct: 423 KPATRSWA 430
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 12/106 (11%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANFTGADLSDT 171
DL KA N AN + A++ ++D SG+ + A L KA + Y +A+ ADLS+
Sbjct: 54 DLSKANLEDANLDGANLSEANLSKADLSGASLDQANLRKANLSMTYLKRADLKKADLSEA 113
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
M ANL +A L L+R++L G + A DA DL Q
Sbjct: 114 WMV-----SANLRDAFLKDARLSRANLSGTNLRWAKLWDA--DLGQ 152
>gi|83955651|ref|ZP_00964231.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
gi|83839945|gb|EAP79121.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
Length = 189
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 64/117 (54%), Gaps = 11/117 (9%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N EA+ RG + A G ADLR A + R A+ + A++ +D SG+K GA L
Sbjct: 12 NLTEADLRGA-DLREADLSGRADLRGA-----DLREADLSGAELFYADLSGAKLIGAILS 65
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+A+ AN +GADL R+ L+ A+L+ +L+ LT +DL GA + AD S A
Sbjct: 66 RAILISANLSGADLR-----RVDLSGADLSGTILIGANLTGADLTGANLSSADLSGA 117
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 2/119 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ A L +A+ + N A+ D+ +D SG+ GA L A AN + ADL
Sbjct: 55 SGAKLIGAILSRAILISANLSGADLRRVDLSGADLSGTILIGANLTGADLTGANLSSADL 114
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
S + M+L ANL+ A L R L+ ++L GA + AD A +L + Y NG
Sbjct: 115 SGANLSGMILRGANLSGANLSRADLSGANLSGASVTEADLGGA--NLTEANLTRTYLNG 171
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 46/95 (48%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + + N A+ T A++ +D SG+ +G L A AN + ADLS +
Sbjct: 87 ADLSGTILIGANLTGADLTGANLSSADLSGANLSGMILRGANLSGANLSRADLSGANLSG 146
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ EA+L A L LTR+ L GA + SD
Sbjct: 147 ASVTEADLGGANLTEANLTRTYLNGATLCNTTMSD 181
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 20/138 (14%)
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYE---AETRGEFGIG--------SA 110
YA L + + L+ A++ S+N+S ADL + + A+ G IG +
Sbjct: 51 YADLSGAK-LIGAILSRAIL--ISANLSG-ADLRRVDLSGADLSGTILIGANLTGADLTG 106
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADL A R AN + A++ +D SG+ +GA + +A AN T A+L+
Sbjct: 107 ANLSSADLSGANLSGMILRGANLSGANLSRADLSGANLSGASVTEADLGGANLTEANLT- 165
Query: 171 TLMDRMVLNEANLTNAVL 188
R LN A L N +
Sbjct: 166 ----RTYLNGATLCNTTM 179
>gi|242052129|ref|XP_002455210.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
gi|241927185|gb|EES00330.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
Length = 200
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 64/126 (50%), Gaps = 4/126 (3%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
D +K++F+ + A+ + ++ G+ F A L A A+ GAD S + +
Sbjct: 78 DFSGLTLIKQDFKTSILRQANFKGANLLGASFFDADLTSADLSDADLRGADFSLANLTKT 137
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
L+ ANL A+ V + GA I GADF+D + Q++ LCK A+G N TG T
Sbjct: 138 NLSNANLEGAL----VTGNTSFKGANITGADFTDVPLRDDQREYLCKIADGVNSTTGNPT 193
Query: 237 RKSLGC 242
+++L C
Sbjct: 194 KETLFC 199
>gi|332712234|ref|ZP_08432162.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332349040|gb|EGJ28652.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 280
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 52/95 (54%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A NF RA+ + A++ ++ +G+ F GA L A AN TGA+LS+T +
Sbjct: 171 ADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNTDLKG 230
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L ANL L R L RSDL A+ GA+F +
Sbjct: 231 SNLTGANLNGTDLARADLERSDLRDAMTNGANFEN 265
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 2/98 (2%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ + AD+ ++ +G+ F+ A L +A AN TGAD + + L+ ANLT A L T
Sbjct: 167 DLSGADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNT 226
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
L S+L GA + G D + A DL + NG N
Sbjct: 227 DLKGSNLTGANLNGTDLARA--DLERSDLRDAMTNGAN 262
>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 1011
Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 15/103 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +ADLR A N RAN + A++R ++ SG+ +G YL A +AN A+
Sbjct: 850 SGADLRTADLRSA-----NLIRANLSDANLRSANLSGANLSGVYLNSADLRRANLNDAN- 903
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LN+A+L+ A L L+ +DL GA + ADFS A
Sbjct: 904 ---------LNDADLSGANLRSADLSGADLSGADLSVADFSSA 937
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD-----RMVLNEA 181
N R ++ + AD+R +D + A L A AN +GA+LS ++ R LN+A
Sbjct: 843 NLRTSDLSGADLRTADLRSANLIRANLSDANLRSANLSGANLSGVYLNSADLRRANLNDA 902
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NL +A L L +DL GA + GAD S A
Sbjct: 903 NLNDADLSGANLRSADLSGADLSGADLSVA 932
Score = 43.1 bits (100), Expect = 0.14, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 37/77 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S SADLR+A N A+ + A++R +D SG+ +GA L A AN A+L
Sbjct: 885 SGVYLNSADLRRANLNDANLNDADLSGANLRSADLSGADLSGADLSVADFSSANLGAANL 944
Query: 169 SDTLMDRMVLNEANLTN 185
+ L+ NL N
Sbjct: 945 GAANLSGANLSGVNLNN 961
>gi|383763954|ref|YP_005442936.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384222|dbj|BAM01039.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 244
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 66/128 (51%), Gaps = 12/128 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK--- 147
L + N YEA+ S A ADLR A + R A AD+R+++ +G+
Sbjct: 87 LREANLYEADL-------SNAVLDQADLRYATLERAVLRSATLRGADLRDANLAGADLRV 139
Query: 148 --FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
F+GA +E+A+ A+ A+L++ ++ R L ANL NAVL L +DL GA + G
Sbjct: 140 ADFSGAQMERAILTGASLVDANLANAVLRRADLRNANLRNAVLRYADLRGADLSGADLMG 199
Query: 206 ADFSDAVI 213
AD A +
Sbjct: 200 ADLMGARL 207
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ + R NFT A + +++ S + A L +A +AN ADLS+ ++D+ L A L
Sbjct: 54 RADLNRVNFTEASLNQANLSRATLLMAILSRAQLREANLYEADLSNAVLDQADLRYATLE 113
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
AVL L +DL A + GAD A AQ +
Sbjct: 114 RAVLRSATLRGADLRDANLAGADLRVADFSGAQME 148
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 50/121 (41%), Gaps = 20/121 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L A+ + R AN AD+ + + A LE+AV A GADL D
Sbjct: 70 ANLSRATLLMAILSRAQLREANLYEADLSNAVLDQADLRYATLERAVLRSATLRGADLRD 129
Query: 171 ---------------TLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
M+R +L +ANL NAVL R L ++L A++ AD
Sbjct: 130 ANLAGADLRVADFSGAQMERAILTGASLVDANLANAVLRRADLRNANLRNAVLRYADLRG 189
Query: 211 A 211
A
Sbjct: 190 A 190
>gi|78779034|ref|YP_397146.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
gi|78712533|gb|ABB49710.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
Length = 157
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 69/136 (50%), Gaps = 4/136 (2%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A +G L A + + A F D+++++ SG + A L A N + ++L
Sbjct: 21 AALDYGKQSLVGADFSGSDLKGATFYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNL 80
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--DLAQKQALCKYAN 226
+ +D VL+ +L+N L + + I+GADF++ + D+ +K C+ A+
Sbjct: 81 REVTLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVRK--FCESAS 138
Query: 227 GTNPITGVSTRKSLGC 242
GTNPIT TR++L C
Sbjct: 139 GTNPITNRDTRETLEC 154
>gi|115482792|ref|NP_001064989.1| Os10g0502000 [Oryza sativa Japonica Group]
gi|22165076|gb|AAM93693.1| hypothetical protein [Oryza sativa Japonica Group]
gi|31432906|gb|AAP54482.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor,
putative, expressed [Oryza sativa Japonica Group]
gi|113639598|dbj|BAF26903.1| Os10g0502000 [Oryza sativa Japonica Group]
gi|125532544|gb|EAY79109.1| hypothetical protein OsI_34214 [Oryza sativa Indica Group]
gi|125575308|gb|EAZ16592.1| hypothetical protein OsJ_32066 [Oryza sativa Japonica Group]
gi|215704684|dbj|BAG94312.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 236
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K N + + +A M +S F G+ + + KA A A+F G D ++ ++DR+ +A+L
Sbjct: 123 KTNLKGKSLAAALMSDSKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLQ 182
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
A+ TVL+ S A ++ F D +I Q LC TN +R LGC
Sbjct: 183 GAIFRNTVLSGSTFDDAKMQDVVFEDTIIGYIDLQKLC-----TNTSISADSRLELGC 235
>gi|158340319|ref|YP_001521675.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310560|gb|ABW32174.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 284
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 15/141 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F ++ L++++ + A+F+ AD+R +DFS +K + A L++ +AN GADL
Sbjct: 68 SGVNFKASKLQRSLAIWVQAYWADFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127
Query: 169 SDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
SD++ ANLTNA L + + L GA + +D S + +
Sbjct: 128 SDSVAQDSCFKGANLWGVWAQRANLTNACLSHVDMATAKLTGAQLLDSDLSWSCL----S 183
Query: 219 QALCKYANGTNP-ITGVSTRK 238
QA+CK AN T+ + G RK
Sbjct: 184 QAVCKGANLTSACLEGSDLRK 204
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 50/119 (42%), Gaps = 20/119 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA---- 166
A F A L A + +F +AN AD+ +S S F GA L A +AN T A
Sbjct: 100 ADFSCAKLSAAQLKRTDFSQANLMGADLSDSVAQDSCFKGANLWGVWAQRANLTNACLSH 159
Query: 167 ----------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
DLS + + + V ANLT+A L + L + D A + AD S
Sbjct: 160 VDMATAKLTGAQLLDSDLSWSCLSQAVCKGANLTSACLEGSDLRKIDFRDACLSRADLS 218
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 60/139 (43%), Gaps = 31/139 (22%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE--- 140
CS ++ + KY A R QF +L A + N +F+ + + E
Sbjct: 14 CSKDLQKFWE--KYHASER---------QFAGTNLPGANFYQMNLSGFDFSHSRLSEVNL 62
Query: 141 --SDFSGSKFNGAYLEKAVA-----YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 193
+D SG F + L++++A Y A+F+ AD L A+ + A L L
Sbjct: 63 IWADISGVNFKASKLQRSLAIWVQAYWADFSDAD----------LRHADFSCAKLSAAQL 112
Query: 194 TRSDLGGAIIEGADFSDAV 212
R+D A + GAD SD+V
Sbjct: 113 KRTDFSQANLMGADLSDSV 131
>gi|223995969|ref|XP_002287658.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
[Thalassiosira pseudonana CCMP1335]
gi|220976774|gb|EED95101.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
[Thalassiosira pseudonana CCMP1335]
Length = 245
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 5/110 (4%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
M ++D S +F A K +NF GAD ++ ++DR ++L AV VLT +
Sbjct: 128 MTKTDVSNGQFKEAQFSKGYLRDSNFDGADFTNAIVDRASFKGSSLKGAVFKNAVLTATS 187
Query: 198 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 247
GA +E ADF+DA I + LCK NP VS + N++R
Sbjct: 188 FEGADVENADFTDAYIGDFDIRTLCK-----NPTLKVSRFYRMTYRNAQR 232
>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 331
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 130 -RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
R + S ++R +D G+ G L A +AN TGA+L++ ++ +LN+ NL+ L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLTECVLRGAILNQTNLSETNL 199
Query: 189 VRTVLTRSDLGGAIIEGA 206
+LT +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 6/126 (4%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSK 147
LN+Y + + G+ A+ +ADL A +F+ ANF A ++ ++ ++
Sbjct: 7 LNQYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAR 66
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
GA L +A A T AD T++ L +ANLT A LV L ++DL GA ++GAD
Sbjct: 67 LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126
Query: 208 FSDAVI 213
A +
Sbjct: 127 LRGACL 132
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 51/119 (42%), Gaps = 17/119 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYL----- 153
S AQ AD + + R+AN T AD+R ++ G+ GA L
Sbjct: 78 SGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGADLRGACLRGANM 137
Query: 154 --EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
E+ + N GADL T + + L A+LT A L LT L GAI+ + S+
Sbjct: 138 RYERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLTECVLRGAILNQTNLSE 196
>gi|119486763|ref|ZP_01620738.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
gi|119456056|gb|EAW37189.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
Length = 331
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 62/123 (50%), Gaps = 9/123 (7%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR--RANFTSADMRESDFSG 145
++ L D N +A+ RG F ADLR A N R R + ++R +D G
Sbjct: 104 LAILLDANLIQADLRG-------VNFQGADLRGACLRGANLRYERRIYDGVNLRGADLRG 156
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
+ G L A +AN GA+L++T++ +L +ANLT A L LT +DL GA + G
Sbjct: 157 ADLQGVNLTGADLTRANLRGANLAETVLRGAILKQANLTQANLQSAFLTEADLSGARLIG 216
Query: 206 ADF 208
A+
Sbjct: 217 ANL 219
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 60/106 (56%), Gaps = 10/106 (9%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTL 172
LR A+ + N +AN SA + E+D SG++ GA LE+A+ +A G +L D++
Sbjct: 184 LRGAILKQANLTQANLQSAFLTEADLSGARLIGANLRKVKLERAILIEAQLPGVELCDSI 243
Query: 173 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 213
+ + L+ ANL+ A L RT L TR+DL A + AD +DA +
Sbjct: 244 LPDVKLSSANLSGADLSRTNLVRADLTRTDLSNANLTQADLTDASV 289
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 12/106 (11%)
Query: 112 QFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
Q G D R +H++ N A+ A+ +D GS+F AYL +AN A LS
Sbjct: 11 QAGERDFRD-IHLRNANLNSADLIDANFNHADLQGSEFVFAYLNSVNFVRANLGSAKLSG 69
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+++ L+ ANL++A DL GA+++GADF A + LA
Sbjct: 70 AYLNKANLSGANLSDA----------DLHGAVLQGADFRKANLSLA 105
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 24/132 (18%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF----------- 143
N+Y+A R F LR A + ANF AD++ S+F
Sbjct: 8 NRYQAGER---------DFRDIHLRNANLNSADLIDANFNHADLQGSEFVFAYLNSVNFV 58
Query: 144 ----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
+K +GAYL KA AN + ADL ++ +ANL+ A+L+ L ++DL
Sbjct: 59 RANLGSAKLSGAYLNKANLSGANLSDADLHGAVLQGADFRKANLSLAILLDANLIQADLR 118
Query: 200 GAIIEGADFSDA 211
G +GAD A
Sbjct: 119 GVNFQGADLRGA 130
>gi|428226754|ref|YP_007110851.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986655|gb|AFY67799.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 330
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 72/143 (50%), Gaps = 13/143 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L D N ++A+ G G A ADL AV + N A+ +A++ +D +G+ G
Sbjct: 77 LVDANLHDADLHGASLRG--ADLRGADLSLAVLLDANLMDADLRNANLSGADLTGACLRG 134
Query: 151 AYLEK-----------AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
A L + ++ YKA+ G +LS + R+ L EANLT A L T L+ +DL
Sbjct: 135 ANLRQEMRSQHTNLRGSILYKADLRGVNLSGADLTRVDLREANLTEASLRETDLSGADLS 194
Query: 200 GAIIEGADFSDAVIDLAQKQALC 222
GA + GA SDA ++ A + C
Sbjct: 195 GANLTGALLSDACLEGAILEGAC 217
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR A + N +AN A+++ + ++ GA L++ + +A T DLS +
Sbjct: 218 LRNAKLERANLSQANLFRANLQNALLPQARLTGAGLQQTIFAQAKLTDVDLSRADLFEAD 277
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L EANLT A L RT LTR++L A++ A+ S A
Sbjct: 278 LREANLTGAYLARTNLTRANLSDALLVRAELSSA 311
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 22/110 (20%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
Y+A+ RG ADL + + R AN T A +RE+D SG+ +G
Sbjct: 154 YKADLRG-------VNLSGADL-----TRVDLREANLTEASLRETDLSGADLSG------ 195
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
AN TGA LSD ++ +L A L NA L R L++++L A ++ A
Sbjct: 196 ----ANLTGALLSDACLEGAILEGACLRNAKLERANLSQANLFRANLQNA 241
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A S+DL A + N R+AN A + ++ S + A L A + A+ GADL
Sbjct: 38 SQADLRSSDLFFAYLNRANLRQANLLGARLSGANLSQATLVDANLHDADLHGASLRGADL 97
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ VL +ANL +A L L+ +DL GA + GA+
Sbjct: 98 RGADLSLAVLLDANLMDADLRNANLSGADLTGACLRGANL 137
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 42/85 (49%), Gaps = 15/85 (17%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + AD+R SD F AYL +A +AN GA LS ANL+ A LV
Sbjct: 36 NLSQADLRSSDLF---F--AYLNRANLRQANLLGARLSG----------ANLSQATLVDA 80
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLA 216
L +DL GA + GAD A + LA
Sbjct: 81 NLHDADLHGASLRGADLRGADLSLA 105
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 158
S +ADL + N +A+ S+D +R+++ G++ +GA L +A
Sbjct: 18 SGRNLSNADLTNVDLIGINLSQADLRSSDLFFAYLNRANLRQANLLGARLSGANLSQATL 77
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AN ADL + L A+L+ AVL+ L +DL A + GAD + A +
Sbjct: 78 VDANLHDADLHGASLRGADLRGADLSLAVLLDANLMDADLRNANLSGADLTGACL 132
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A+ A+L +A + N + +A T A ++++ F+ +K L +A ++A+
Sbjct: 221 AKLERANLSQANLFRANLQNALLPQARLTGAGLQQTIFAQAKLTDVDLSRADLFEADLRE 280
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L+ + R L ANL++A+LVR L+ ++L A ++ A D +
Sbjct: 281 ANLTGAYLARTNLTRANLSDALLVRAELSSANLMDANLQRAVLPDGKV 328
>gi|158316060|ref|YP_001508568.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
gi|158111465|gb|ABW13662.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
Length = 411
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ + RRAN T A++ ++D +G++ A L A+ ++A TGA L + L A+LT
Sbjct: 282 RADLRRANLTDAELVDADLTGARLADATLAGALLFRATLTGAQLGRADLTGAQLGGADLT 341
Query: 185 NAVLVRTVLTRSDLGGA-----IIEGADFSDAVIDLAQKQ 219
NAVL +L + L GA ++GAD + A LAQKQ
Sbjct: 342 NAVLDEAILADAVLSGANLTNARLDGADLT-AATGLAQKQ 380
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 6/99 (6%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+FT + + D + + A L A A+ TGA L+D + +L A LT A
Sbjct: 269 DFTGGSLDDVDLARADLRRANLTDAELVDADLTGARLADATLAGALLFRATLTGA----- 323
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLA-QKQALCKYANGTN 229
L R+DL GA + GAD ++AV+D A A+ AN TN
Sbjct: 324 QLGRADLTGAQLGGADLTNAVLDEAILADAVLSGANLTN 362
>gi|254424332|ref|ZP_05038050.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
gi|196191821|gb|EDX86785.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
Length = 411
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 10/77 (12%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ + A+++E DFSG +GA N +GADLSDT M ++ LN ANL A L R
Sbjct: 298 DMSGANLKEKDFSGRNLSGA----------NLSGADLSDTFMHKVNLNRANLRKARLFRA 347
Query: 192 VLTRSDLGGAIIEGADF 208
L ++DL A + GAD
Sbjct: 348 NLLQADLSHADLSGADL 364
>gi|386828484|ref|ZP_10115591.1| putative low-complexity protein [Beggiatoa alba B18LD]
gi|386429368|gb|EIJ43196.1| putative low-complexity protein [Beggiatoa alba B18LD]
Length = 986
Score = 58.5 bits (140), Expect = 3e-06, Method: Composition-based stats.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 25/109 (22%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTG 165
+N R +F+ D+R +DFSG+ A + A+ Y ANF+
Sbjct: 645 QNLRGQDFSGQDLRYADFSGADLTDALFKNAILYHVNFSNATLKNADFTKTDLSNANFSD 704
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
ADL+D L +L AN ++A L T++DL A+F+DA+ D
Sbjct: 705 ADLTDALFKNAILQHANFSDATLKNADFTKTDL-----SNANFTDAICD 748
Score = 40.4 bits (93), Expect = 0.81, Method: Composition-based stats.
Identities = 22/76 (28%), Positives = 36/76 (47%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A L+ A K + ANF+ AD+ ++ F + A A A+FT DLS+
Sbjct: 682 FSNATLKNADFTKTDLSNANFSDADLTDALFKNAILQHANFSDATLKNADFTKTDLSNAN 741
Query: 173 MDRMVLNEANLTNAVL 188
+ +E +L A +
Sbjct: 742 FTDAICDEVSLLGATV 757
>gi|428216484|ref|YP_007100949.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988266|gb|AFY68521.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 673
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 7/110 (6%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGAD 167
F + DL + N +A + E+DFS ++ GA L A+A A+ F+GAD
Sbjct: 416 FANVDLSGEILSGAELNEINLQNALLSETDFSDARLGGANLTGAIATGADLRGVDFSGAD 475
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L++ + +++E NLT A L+R L ++DL A++ GA+ A DL+Q
Sbjct: 476 LTEANLTNAIMSEVNLTGARLLRANLKQADLNFAVLRGAELMRA--DLSQ 523
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 10/102 (9%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKA 156
I S A+ +L+ A+ + +F A T AD+R DFSG+ A L A
Sbjct: 425 ILSGAELNEINLQNALLSETDFSDARLGGANLTGAIATGADLRGVDFSGADLTEANLTNA 484
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ + N TGA L + + LN A L A L+R L+++DL
Sbjct: 485 IMSEVNLTGARLLRANLKQADLNFAVLRGAELMRADLSQTDL 526
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 28/131 (21%)
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
A+++ES+ S ++ A LE AV A+ A+L ++ L E +L++A L T+
Sbjct: 549 ANLQESNLSAAELENAQLEAAVLLLADLRSANLKLANLNYADLREVDLSSADL-----TQ 603
Query: 196 SDLGG----------------AIIEGADFSDAVIDLAQ--KQALCKYANGT----NPITG 233
++L G A I+GADF+D V++LA K CK A G +P
Sbjct: 604 ANLIGANLSGANLRGTDVNQLASIDGADFTD-VVNLADTSKTYFCKIAAGQTFAESPEQR 662
Query: 234 VSTRKSLGCGN 244
+TR +L C N
Sbjct: 663 RATRATLDCPN 673
>gi|73669894|ref|YP_305909.1| hypothetical protein Mbar_A2409 [Methanosarcina barkeri str.
Fusaro]
gi|72397056|gb|AAZ71329.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 234
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 62/112 (55%), Gaps = 3/112 (2%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+L++A +K N +A+ A++ +D G+ GA ++A +AN GADLS+
Sbjct: 27 ANFQDANLQEAYLIKANLTQADLQGANLYRADLRGADLRGANFQEANLQEANLQGADLSN 86
Query: 171 TLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+ + + L ANL A L LTR++L GA ++GA+ + I LA Q
Sbjct: 87 SYLLEGIGTNLQGANLQGANLQGANLTRANLKGANLKGANLQLSNIHLANLQ 138
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 64/147 (43%), Gaps = 25/147 (17%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
W F L A + + + L N Y A+ RG ADLR A N
Sbjct: 26 WANFQDANLQEAYLIKANLTQADLQGANLYRADLRG------------ADLRGA-----N 68
Query: 128 FRRANFTSADMRESDFSGSKF---NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
F+ AN A+++ +D S S G L+ A AN GA+L+ R L ANL
Sbjct: 69 FQEANLQEANLQGADLSNSYLLEGIGTNLQGANLQGANLQGANLT-----RANLKGANLK 123
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDA 211
A L + + ++L GA ++GA+F A
Sbjct: 124 GANLQLSNIHLANLQGANLQGANFQGA 150
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
+ + ++ G GA+LEK ANF A+L + + + ANLT A L L R+D
Sbjct: 4 IEKKNYRGVNLPGAHLEKNNLIWANFQDANLQEAYLIK-----ANLTQADLQGANLYRAD 58
Query: 198 LGGAIIEGADFSDAVIDLAQKQ 219
L GA + GA+F +A + A Q
Sbjct: 59 LRGADLRGANFQEANLQEANLQ 80
>gi|85860772|ref|YP_462974.1| pentapeptide repeat-containing protein [Syntrophus aciditrophicus
SB]
gi|85723863|gb|ABC78806.1| pentapeptide repeat domain protein [Syntrophus aciditrophicus SB]
Length = 306
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A + DLR+A + AN T AD+++S+ S + N L +AN +GADL
Sbjct: 157 SEANLSNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWTRL-----REANLSGADL 211
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S+ + R L +ANL+ A LV L R++L G + GAD +A +
Sbjct: 212 SEAYLKRADLRKANLSRANLVDANLNRANLRGTDLRGADLGNANL 256
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A K N +AN +RE++ SG+ + AYL++A KAN + A+L D
Sbjct: 174 ADLSDANLTGADLQKSNLSKANLNWTRLREANLSGADLSEAYLKRADLRKANLSRANLVD 233
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
ANL A L T L +DLG A + GAD +A +
Sbjct: 234 ----------ANLNRANLRGTDLRGADLGNANLAGADLREANL 266
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 5/104 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G DL A + +A+ + A+++E+D SG+ + A L A N GADLS+
Sbjct: 39 ADLGGMDLCNA-----DLGKADLSEANLQETDLSGANLHKADLNGANLKGVNLVGADLSE 93
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++ L+EA+L A L RT L++ +L G + A+ S+ +D
Sbjct: 94 ACLNGADLSEADLGKADLRRTCLSKVNLRGTKLIEANLSNTDLD 137
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A + DL + +N RR A++ E++ S + A L A AN TGADL
Sbjct: 129 ANLSNTDLDEVELRGQNLRRTKLIGANLSEANLSNTDLREADLHGADLSDANLTGADLQK 188
Query: 171 TLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ + + LN EANL+ A L L R+DL A + A+ DA ++ A
Sbjct: 189 SNLSKANLNWTRLREANLSGADLSEAYLKRADLRKANLSRANLVDANLNRA 239
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 10/100 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLRKA N RAN A++ ++ G+ GA L AN GADL
Sbjct: 212 SEAYLKRADLRKA-----NLSRANLVDANLNRANLRGTDLRGADL-----GNANLAGADL 261
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + + L A L A L T L+ +D G + AD
Sbjct: 262 REANLGKTCLRGARLQGAKLNETDLSDADFTGVDLSEADL 301
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 49/118 (41%), Gaps = 29/118 (24%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G ADLR+ K N R A++ +D + G L + GA+L
Sbjct: 102 SEADLGKADLRRTCLSKVNLRGTKLIEANLSNTDLDEVELRGQNL-----RRTKLIGANL 156
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQALCK 223
S EANL+N +DL A + GAD SDA + DL QK L K
Sbjct: 157 S----------EANLSN----------TDLREADLHGADLSDANLTGADL-QKSNLSK 193
>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 268
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 50/85 (58%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN +AD+ E++ ++ NGAYL KA YKAN A LS + R +EANL+ A
Sbjct: 160 NLIEANLINADLSEANLYEAQLNGAYLYKANFYKANLHQAHLSGAYLFRANFSEANLSCA 219
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L + LT ++L GA ++GA+ A
Sbjct: 220 NLTWSNLTGANLAGANLQGANLRGA 244
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 52/101 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A+L+ A+ + N + A++RE+D S +K A L A +AN ADLS+
Sbjct: 114 ADLSTANLQGAIIAEANLIGTDLRDANLRETDLSTAKLIRANLGFANLIEANLINADLSE 173
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ LN A L A + L ++ L GA + A+FS+A
Sbjct: 174 ANLYEAQLNGAYLYKANFYKANLHQAHLSGAYLFRANFSEA 214
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA +LR A N + + + A + ++ S + +GA L +A +AN + A+L
Sbjct: 32 SAANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEANLSEANL 91
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 211
S + L +ANL+ A L+ L+ ++L GAII G D DA
Sbjct: 92 SVANLSGATLTQANLSYAHLIGADLSTANLQGAIIAEANLIGTDLRDA 139
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD--RMV--- 177
+K + AN ++R ++ G N L A+ +AN + ADLS + +++
Sbjct: 26 QMKLDISAANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEAN 85
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+EANL+ A L LT+++L A + GAD S A
Sbjct: 86 LSEANLSVANLSGATLTQANLSYAHLIGADLSTA 119
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 55/119 (46%), Gaps = 15/119 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 158
S A A+L +A ++ N AN T A++ + G+ + A L+ A+
Sbjct: 67 SNADLSGANLHQAKLIEANLSEANLSVANLSGATLTQANLSYAHLIGADLSTANLQGAII 126
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+AN G DL D L E +L+ A L+R L ++L A + AD S+A + AQ
Sbjct: 127 AEANLIGTDLRDA-----NLRETDLSTAKLIRANLGFANLIEANLINADLSEANLYEAQ 180
>gi|123965950|ref|YP_001011031.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9515]
gi|123200316|gb|ABM71924.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
Length = 157
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +DL+ A + + AN + D++ + G+K N + ++L +
Sbjct: 35 FSGSDLKGATFYLTDLQDANLSDCDLQNASLYGAKLK----------DTNLSNSNLREVT 84
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D VL+ +LTN L + + I+GADF++ + + CK A+GTNP T
Sbjct: 85 LDSAVLDGTDLTNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIVREFCKEASGTNPFT 144
Query: 233 GVSTRKSLGC 242
TR++L C
Sbjct: 145 NRETRETLEC 154
>gi|323454309|gb|EGB10179.1| hypothetical protein AURANDRAFT_23610 [Aureococcus anophagefferens]
Length = 107
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII---- 203
FN A L A + A+ G D M ++ L A+L+NA L LT + + GA+I
Sbjct: 6 FNKAQLFSASFFDADLAGTTFVDADMKQVNLEMADLSNADLTNADLTEAYMAGAVIKDLK 65
Query: 204 --EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
+ D++D + Q+ LC A GTNP TG+ TR +L C
Sbjct: 66 KIDNTDWTDVDMRKDQRTYLCSIAKGTNPKTGMDTRDTLMC 106
>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 517
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 57/98 (58%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +A+LR+A N A+F+ A++R +D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
++ + L A+L+ A L+R + +DL GA + GA
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKL 286
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 73/135 (54%), Gaps = 10/135 (7%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN 162
G+ +F DLR A +K N +A+FT+A++R+++ S + F+GA L A+
Sbjct: 166 GALTKFTKTDLRGADLLKANLPKADFTNAELRQANLTYANLSNADFSGANLRWTDLQGAD 225
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDAVIDLAQ 217
+GA+L++ + L+ ANL++AVLV+ L +DL A + GAD S A + A+
Sbjct: 226 LSGANLTEANLSGANLSGANLSSAVLVKASLVHADLSQANLIRANWSGADLSGATLTGAK 285
Query: 218 KQALCKYANGTNPIT 232
+ ++ + IT
Sbjct: 286 LYQVSRFNLKADEIT 300
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 175
AD+RES + FNGA L A KAN AD ++ + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+NA L +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 22/144 (15%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
TR + A+ +A+L KA+ + AN AD+ E+ + A L +A K
Sbjct: 72 TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128
Query: 161 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 205
ANFT GADL ++ + + N ANL+ A L T T++DL GA +
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188
Query: 206 ADFSDAVIDLAQKQALCKYANGTN 229
ADF++A + +QA YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 112 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYAIVKRYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R LN A L+NA L + +L ++ + A + AD ++A
Sbjct: 68 GVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEA 109
>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
Length = 567
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 165
A A+L KAV V N RR N + A++ ++ + F+GAYL +A +AN G
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLEGANLKK 477
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+LS M L A+L A L L R DL GA + G F DA
Sbjct: 478 ANLSGANMSHASLRGADLRRATLKDANLKRVDLVGANLAGVTFLDA 523
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 10/87 (11%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NFRRANF + K AYL A ++AN G +L + L +A L +
Sbjct: 354 NFRRANFAAL----------KLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGS 403
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L++ L +++L A +EGA+ + AV+
Sbjct: 404 ILIKAKLQKANLYRASLEGANLTKAVL 430
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 52/104 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADL +A R A +A+++++ GS A L+KA Y+A+ GA+L+
Sbjct: 368 AYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKANLYRASLEGANLTK 427
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++ L NL+ A L T L ++ GA + A S A ++
Sbjct: 428 AVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLE 471
Score = 37.4 bits (85), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 48/110 (43%), Gaps = 15/110 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A + A LR A + N R A ++ ++ ++ G+ L KA KAN A L
Sbjct: 361 AALKLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKANLYRASL 420
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 213
ANLT AVLV L R +L GA + A+FS A +
Sbjct: 421 EG----------ANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYL 460
>gi|334117749|ref|ZP_08491840.1| stress protein [Microcoleus vaginatus FGP-2]
gi|333460858|gb|EGK89466.1| stress protein [Microcoleus vaginatus FGP-2]
Length = 578
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 57/105 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+A +A L + + N + AN S +++ +D + +GA L KA+ Y A A+L
Sbjct: 312 SSANLANAKLIQVNLIGSNLQGANLNSTNLQSADLIEANLSGANLTKAILYYARLIHANL 371
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + L++ANLT A L R LT++ LG A + GAD S + +
Sbjct: 372 SQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADLSQSKV 416
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L A + N +AN + A + +++ + + + A L +A AN TGADL
Sbjct: 352 SGANLTKAILYYARLIHANLSQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADL 411
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
S + + ++ L+ ANL+ L LT +L G + G + S
Sbjct: 412 SQSKVTKVNLSGANLSGVNLTGVSLTGVNLQGVNLSGMNLS 452
>gi|308813604|ref|XP_003084108.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
gi|116055991|emb|CAL58524.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
Length = 177
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 66/139 (47%), Gaps = 27/139 (19%)
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
A K + +RANF A++ F G+ GA F GA+L + + + L++
Sbjct: 48 AFFTKGSLKRANFDGANLEGITFFGADLTGA----------TFRGANLQNANLGQANLSK 97
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------ALCK 223
A+LT+A+L +++ + IEG+D+S+ ++ + + LCK
Sbjct: 98 ADLTDAILSGAIVSSAQFDDVKIEGSDWSEVIVRKREAKDDTTDDLFCVAYQDILTGLCK 157
Query: 224 YANGTNPITGVSTRKSLGC 242
A G NP+TG+ T +L C
Sbjct: 158 VAKGENPVTGLPTELTLMC 176
>gi|411118568|ref|ZP_11390949.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410712292|gb|EKQ69798.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 321
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 56/105 (53%), Gaps = 5/105 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA A+L +A+ N AN T A++ E+ S ++ GA L++A KAN T ADLS
Sbjct: 194 AANLSGANLGRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKANLTEADLS 253
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L+ A L AV+V VL AI+ GADFSDA ID
Sbjct: 254 WASFRGTNLSAATLHKAVMVDVVL-----DAAILRGADFSDATID 293
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NF + D+ + + GA L KAV ++A+ TGA+L D + + L NLT A L
Sbjct: 16 NFDTVDLSGVNLRQADLRGASLRKAVLFEADLTGANLVDVELHGVALRHTNLTAACLAGV 75
Query: 192 VLTRSDLGGAIIEGADFSDA 211
L +DL A + AD S A
Sbjct: 76 KLVGADLSAAQLVRADLSGA 95
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 61/126 (48%), Gaps = 20/126 (15%)
Query: 109 SAAQFGSADL----------RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
SAAQ ADL R A N R N +A++ E+D + ++ + A L +A
Sbjct: 83 SAAQLVRADLSGANLWRSLLRNANLHAANLERTNLHAANLVEADLTTARLSHANLAEANL 142
Query: 159 YKANFTGADL----------SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+ TGA L S + + + L +A+L AVLV L+R++L A + GA+
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202
Query: 209 SDAVID 214
A+++
Sbjct: 203 GRALLE 208
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 5/93 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTG 165
A+ A++R A + +AN T AD+ + F G+ + A L KAV A G
Sbjct: 225 ARLSLAEMRGAKLDQAELTKANLTEADLSWASFRGTNLSAATLHKAVMVDVVLDAAILRG 284
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
AD SD +D LN+++LT +L VL S L
Sbjct: 285 ADFSDATIDPACLNQSSLTWVILPSGVLQISSL 317
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 51/103 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A LR V+ F R+ D+ ++D + L +A AN +GA+L
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L++ + L ANLT A L+ L+ +++ GA ++ A+ + A
Sbjct: 203 GRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKA 245
>gi|37520785|ref|NP_924162.1| hypothetical protein gll1216 [Gloeobacter violaceus PCC 7421]
gi|35211780|dbj|BAC89157.1| gll1216 [Gloeobacter violaceus PCC 7421]
Length = 287
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 63/130 (48%), Gaps = 8/130 (6%)
Query: 105 FGIGSAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
F + A ADL ++V++K + R A AD+R + G+ +G+ LE A K
Sbjct: 137 FAVLPFADLSGADLSRSVNLKRADLRGARLVGADLRGAFLHGANLSGSRLEAADLMKVAL 196
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQK 218
GA+LS + R L A+L A L RT L +DL GA +EGAD A ++ A
Sbjct: 197 AGANLSGADLSRANLRAAHLEGADLRRTNLGEADLAGAFLRGARLEGADLRRARLEGADL 256
Query: 219 QALCKYANGT 228
+ C GT
Sbjct: 257 E--CAATEGT 264
>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 340
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 12/155 (7%)
Query: 66 KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 116
NWR VF S L+AA ++S + +++ L +N A ++ S A G A
Sbjct: 18 NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+L +A + N ANF AD+ + S S + A L AVA ANF A+LS T
Sbjct: 75 NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANFIMANLSGTYFSES 134
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ ANL++A L +L +++L G+ + A+F+ A
Sbjct: 135 DFSRANLSSANLTEAILVKTNLTGSYLSKANFTSA 169
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+LR+A + N + AN + A + ++ +G+ GA L A AN GA
Sbjct: 239 ANFYQANLREANLDRANAQNANLSEAYLSNANLTGTILEGANLSSAYISNANLVGA---- 294
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
VL A+LT A+L+ LT+++ GA ++GADF+ A++
Sbjct: 295 ------VLKGADLTGAILIGANLTKANFSGAKLDGADFTSAIM 331
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A SA+L +A+ VK N +ANFTSA++ +D S + + A + A AN
Sbjct: 137 SRANLSSANLTEAILVKTNLTGSYLSKANFTSANLSMTDLSEADLSSANMHLADLSMANL 196
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ A+L ++ + L +ANLT A L LT +DL + + GA+F A
Sbjct: 197 SSANLIGAILTDVDLRQANLTGAYLNTANLTGADLATSTLVGANFYQA 244
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + A+L AV + NF AN + ESDFS + + A L +A+ K N TG+ L
Sbjct: 102 SESNLSRANLGNAVAIAANFIMANLSGTYFSESDFSRANLSSANLTEAILVKTNLTGSYL 161
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S +AN T+A L T L+ +DL A + AD S A
Sbjct: 162 S----------KANFTSANLSMTDLSEADLSSANMHLADLSMA 194
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 10/110 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
S+A ADL A N A T D+R+++ +G+ N GA L + ANF
Sbjct: 182 SSANMHLADLSMANLSSANLIGAILTDVDLRQANLTGAYLNTANLTGADLATSTLVGANF 241
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L + +DR AN NA L L+ ++L G I+EGA+ S A I
Sbjct: 242 YQANLREANLDR-----ANAQNANLSEAYLSNANLTGTILEGANLSSAYI 286
>gi|220910076|ref|YP_002485387.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866687|gb|ACL47026.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 332
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN D+RE+D SG+ GA L ++AN GADLS++ + + L ANL A
Sbjct: 177 NLEEANLREVDLREADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQA 236
Query: 187 VLVRTVLTRSDLGGAIIEGADF 208
L LT ++L G +++ A
Sbjct: 237 KLTGATLTEANLAGVMMQRAQM 258
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR A+ N +AN AD+ S+ G A L++A A T A+L+
Sbjct: 191 ADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQAKLTGATLTEANLAG 250
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+M R + + L A L R L +DL GA + GA+ +
Sbjct: 251 VMMQRAQMFQVRLNRANLSRANLQGADLRGASLIGANLA 289
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 16/95 (16%)
Query: 132 NFTSADMRESDFSGSKFNGA----------------YLEKAVAYKANFTGADLSDTLMDR 175
N D+RE+D SG+ GA L A+ + GA+LS + R
Sbjct: 111 NLIETDLREADLSGANLTGACLRSANLRTERRGTPVNLRGAILAGVDLRGANLSGASLVR 170
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ L ANL A L L +DL GA + GA +D
Sbjct: 171 VNLQGANLEEANLREVDLREADLSGANLRGALLTD 205
>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 517
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 57/96 (59%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +A+LR+A N A+F+ A++R +D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
++ + L A+L+ A L+R + +DL GA + GA
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGA 284
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 73/135 (54%), Gaps = 10/135 (7%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN 162
G+ +F DLR A +K N +A+FT+A++R+++ S + F+GA L A+
Sbjct: 166 GALTKFTKTDLRGADLLKANLPKADFTNAELRQANLTYANLSNADFSGANLRWTDLQGAD 225
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDAVIDLAQ 217
+GA+L++ + L+ ANL++AVLV+ L +DL A + GAD S A + A+
Sbjct: 226 LSGANLTEANLSGANLSGANLSSAVLVKASLVHADLSQANLIRANWSGADLSGATLTGAK 285
Query: 218 KQALCKYANGTNPIT 232
+ ++ + IT
Sbjct: 286 LYQVSRFNLKADEIT 300
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 175
AD+RES + FNGA L A KAN AD ++ + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+NA L +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 22/144 (15%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
TR + A+ +A+L KA+ + AN AD+ E+ + A L +A K
Sbjct: 72 TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128
Query: 161 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 205
ANFT GADL ++ + + N ANL+ A L T T++DL GA +
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188
Query: 206 ADFSDAVIDLAQKQALCKYANGTN 229
ADF++A + +QA YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 50/102 (49%), Gaps = 2/102 (1%)
Query: 112 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + R LN A L+NA L + +L ++ + A + AD ++A
Sbjct: 68 EVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEA 109
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ++DLR+ N RAN A + ++ + + N A + A +A+ T A L
Sbjct: 57 SVANLSASDLREV-----NLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEAQL 111
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+TL+ R A LVR L++++ A + GAD ++
Sbjct: 112 INTLLIR----------AELVRAKLSKANFTQANLNGADLRES 144
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R N T A++ + S + A L +A AN ADL+ EA L N
Sbjct: 65 DLREVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLT----------EAQLINT 114
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+L+R L R+ L A A+ + A DL + + NG N ++G + R
Sbjct: 115 LLIRAELVRAKLSKANFTQANLNGA--DLRESKLQQTNFNGAN-LSGANLR 162
>gi|157803630|ref|YP_001492179.1| hypothetical protein A1E_02245 [Rickettsia canadensis str. McKiel]
gi|157784893|gb|ABV73394.1| Uncharacterized low-complexity protein [Rickettsia canadensis str.
McKiel]
Length = 956
Score = 57.4 bits (137), Expect = 7e-06, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 6/113 (5%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+A++ KA+ K N AN T A + ++ +K + A LEKA A G +++D +
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 226
M EAN NA++ R LT+++L AI+E AD A +D K+A K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666
Score = 40.4 bits (93), Expect = 0.98, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 27/148 (18%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ S+ + L++ +AE G ++ A+ N + ANF +
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG------------LNIADAIAKNMNAKEANFKN 624
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
A M+ +D + A LEKA+ A+ A+ D + EANL A L L R
Sbjct: 625 AIMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLAR 674
Query: 196 SDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ GADF A +D A K K
Sbjct: 675 INKA-----GADFDQAKVDDATKMHYTK 697
>gi|119487930|ref|ZP_01621427.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455506|gb|EAW36644.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 276
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SADL A + N R N T++ + E+ G+ AYL +A NFT ADL
Sbjct: 38 SGANLISADLSHANLCQTNLRGINLTNSTLSEARLRGADLCDAYLSEA-----NFTRADL 92
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ + L EANLT+A LV T L ++L A ++ A+ S+A
Sbjct: 93 SEAQLLNAYLKEANLTHAQLVNTNLNGANLSNAKLQNANLSNA 135
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ ADL A ANFT AD+ E+ + A L A N GA+L
Sbjct: 68 SEARLRGADLCDAY-----LSEANFTRADLSEAQLLNAYLKEANLTHAQLVNTNLNGANL 122
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
S+ L ANL+NA L+ TVLT +L GA + GA+ +
Sbjct: 123 SNA-----KLQNANLSNANLLNTVLTGVNLTGANLNGANLT 158
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 52/115 (45%), Gaps = 11/115 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F ADL +A + + AN T A + ++ NGA L A AN + A+L
Sbjct: 83 SEANFTRADLSEAQLLNAYLKEANLTHAQLVNTNL-----NGANLSNAKLQNANLSNANL 137
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+T++ + L ANL A L L R +L G I D L+QK L +
Sbjct: 138 LNTVLTGVNLTGANLNGANLTGVELCRVNLNGTQI------DENTQLSQKWLLVQ 186
>gi|427417538|ref|ZP_18907721.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425760251|gb|EKV01104.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 397
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 25/103 (24%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+F A+++E DFSG + K+N GADLSDT + ++ LN+ANL A L R
Sbjct: 283 ADFKGANLKEKDFSGRNLS----------KSNLEGADLSDTFLHKVNLNQANLHKAKLFR 332
Query: 191 TVLTR---------------SDLGGAIIEGADFSDAVIDLAQK 218
L + +DL GA + GAD S A+I K
Sbjct: 333 ANLLQANLSHANLREANLIGADLSGADLSGADLSGAIIGYGDK 375
>gi|126696014|ref|YP_001090900.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9301]
gi|126543057|gb|ABO17299.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
Length = 157
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 64/134 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A +G L A + + A F D+++++ SG + A L A N + ++L
Sbjct: 21 AALDYGKQSLVGADFSGSDLKGATFYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNL 80
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ +D VL+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 81 REVTLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIKKFCESATGT 140
Query: 229 NPITGVSTRKSLGC 242
NP T TR++L C
Sbjct: 141 NPFTNRETRETLEC 154
>gi|428314577|ref|YP_007151024.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256301|gb|AFZ22256.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 281
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL +A + N RA N + A + +++ S + + A+L +A AN
Sbjct: 123 SRANLSRADLSEANLSRANLSRADLSDANLSPASLSDANLSRANLSRAFLSRANLSDANL 182
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ A+LSD + R L+ ANL+ A L R L+ ++LGGA + GA+F ++ ID
Sbjct: 183 SRANLSDANLSRADLSRANLSRANLSRADLSGANLGGANLSGANFRNSEID 233
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 52/95 (54%), Gaps = 5/95 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 181
N R A +A++RE + S + + A L +A +AN + ADLSD + L++A
Sbjct: 101 NVRNAPLENANLREINLSEANLSRANLSRADLSEANLSRANLSRADLSDANLSPASLSDA 160
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
NL+ A L R L+R++L A + A+ SDA + A
Sbjct: 161 NLSRANLSRAFLSRANLSDANLSRANLSDANLSRA 195
>gi|428310629|ref|YP_007121606.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
gi|428252241|gb|AFZ18200.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
Length = 542
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 46/87 (52%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
R +F S D+ D +G +A K NF GADLS+ R LN +NL +A L
Sbjct: 415 RRDFASQDLSGLDLHKVDLSGGIFHQAKLAKTNFQGADLSNADFGRASLNRSNLRDANLG 474
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
R L+ +DL GA + GAD S A ++ A
Sbjct: 475 RAYLSYADLEGADLRGADLSYAYLNHA 501
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A FG A L ++ N RA + AD+ +D G+ + AYL A AN GA+L
Sbjct: 454 SNADFGRASLNRSNLRDANLGRAYLSYADLEGADLRGADLSYAYLNHANLKGANLCGANL 513
Query: 169 SDTLMDRMVLNEANLTNAVLV 189
S+ + L +A A ++
Sbjct: 514 SNAKISEEQLTQAKTNWATVL 534
>gi|379022817|ref|YP_005299478.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
gi|376323755|gb|AFB20996.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
Length = 956
Score = 57.4 bits (137), Expect = 7e-06, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 6/113 (5%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+A++ KA+ K N AN T A + ++ +K + A LEKA A G +++D +
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 226
M EAN NA++ R LT+++L AI+E AD A +D K+A K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 27/148 (18%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ S+ + L++ +AE G ++ A+ N + ANF +
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG------------LNIADAIAKNMNAKEANFKN 624
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
A M+ +D + A LEKA+ A+ A+ D + EANL A L L R
Sbjct: 625 AIMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLAR 674
Query: 196 SDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ GADF A +D A K K
Sbjct: 675 INKA-----GADFDQAKVDDATKMHYTK 697
>gi|359459933|ref|ZP_09248496.1| hypothetical protein ACCM5_14478 [Acaryochloris sp. CCMEE 5410]
Length = 315
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
++ AD++E DFSG + A L + A +K N GA+L++ + R L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
L LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293
>gi|418020640|ref|ZP_12659878.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
gi|347604005|gb|EGY28733.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
Length = 148
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 12/112 (10%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFN------------GAYLEKAVAYKANF 163
A+L+ A + R + ADMRE+ G K N GA L + KA
Sbjct: 4 ANLQNATLNDADMREVDLVGADMREAKLIGKKTNLEGANLSGADLQGAELYHTILIKAVL 63
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
+ ADLS+ ++R+ L EANL +A+L T L + L A +EG + DAV+++
Sbjct: 64 SWADLSNAKLERVNLREANLYHAILEETSLYITKLENANLEGVNLKDAVLEV 115
>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
Length = 309
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA A+L +A+ + N RRA A++RE F + A L+KA N GADL
Sbjct: 183 SATNLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKA-----NLVGADL 237
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + +L ANL++A+L+ L ++L GA + GA+ +A++
Sbjct: 238 RGVSLAQALLRGANLSSAILIGANLMGANLSGADLRGANLIEAIL 282
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 70/144 (48%), Gaps = 10/144 (6%)
Query: 89 SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
+AL N A+ RG G S A ADLR + V + R + +R+++ +G
Sbjct: 45 AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVS-----LRKANLTG 99
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
+ A L A +AN TGA LS+ ++ L +LT A L R LTR++L A + G
Sbjct: 100 ADLTRANLANADLSEANLTGAQLSEAIVRDANLTLTDLTLAELERANLTRANLTEAYLRG 159
Query: 206 ADFSDAVIDLAQKQALCKYANGTN 229
AD +DAV L + Q L G N
Sbjct: 160 ADLTDAV--LRESQLLQANLRGAN 181
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 15/108 (13%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A+ A+LR+ + N R +AN AD+R + + GA L A+ AN G
Sbjct: 205 ARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQALLRGANLSSAILIGANLMG 264
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+LS A+L A L+ +LT + L G + D S+A++
Sbjct: 265 ANLSG----------ADLRGANLIEAILTGASLNGVDLSAVDMSEAIL 302
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 17/132 (12%)
Query: 91 LADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 146
L DL E E TR + A ADL AV R + A++R ++ S +
Sbjct: 134 LTDLTLAELERANLTRANL---TEAYLRGADLTDAV-----LRESQLLQANLRGANLSAT 185
Query: 147 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----A 201
A LE+A+ AN A L + + + EANL +A L + L +DL G A
Sbjct: 186 NLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQA 245
Query: 202 IIEGADFSDAVI 213
++ GA+ S A++
Sbjct: 246 LLRGANLSSAIL 257
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 9/125 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
L +Y R GI A L + + + RA+ T A ++ ++ + GA L
Sbjct: 7 LKRYSVGDRDFAGI----HLRRAHLSRCILTGIDLSRADLTDAALQSTNLQRADLRGAIL 62
Query: 154 EKAVAYKANFTGADLSDTLMD----RMV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A +A+ GADL ++ R V L +ANLT A L R L +DL A + GA
Sbjct: 63 TGANLSQADLRGADLRGVILVSADLRWVSLRKANLTGADLTRANLANADLSEANLTGAQL 122
Query: 209 SDAVI 213
S+A++
Sbjct: 123 SEAIV 127
>gi|427707611|ref|YP_007049988.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427360116|gb|AFY42838.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 521
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 54/101 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADLR+A K N RRAN + A ++ S +G+ A L A ++ + +GA+L D
Sbjct: 120 ANLSNADLREATLRKANLRRANLSEASLKGSSLAGTNLEMANLNAADLHRTDLSGANLRD 179
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L ANL+ A L L +DL GA + AD S A
Sbjct: 180 AELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGA 220
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L++ N A+ + A++R +D SG+ + A L A AN GA+L
Sbjct: 173 SGANLRDAELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGAKLSGANLMGANL 232
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ + ANLT A L++ +DL GA + GA A
Sbjct: 233 SNANLTNTSFVHANLTEATLIKAEWIGADLTGATLTGAKLHSA 275
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 47/105 (44%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F A+L N AN + A + + SG F GA + A AN ADL
Sbjct: 33 SGINFSEANLSVVNLSGANLSDANLSHAKLNVARLSGVNFVGAIMNYASLNVANLIRADL 92
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S R L A+L A L+R L+R+DL A + AD +A +
Sbjct: 93 S-----RAQLRGASLVRAELIRAELSRADLFEANLSNADLREATL 132
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A LR A V+ RA + AD+ E++ S + A L KA +AN + A L
Sbjct: 90 ADLSRAQLRGASLVRAELIRAELSRADLFEANLSNADLREATLRKANLRRANLSEASLKG 149
Query: 171 TLMDRMVLNEANLTNAVLVRTVLT----------RSDLGGAIIEGADFSDA 211
+ + L ANL A L RT L+ +++L A + GAD S A
Sbjct: 150 SSLAGTNLEMANLNAADLHRTDLSGANLRDAELKQTNLTHANLSGADLSGA 200
Score = 37.0 bits (84), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 41/80 (51%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NF+ ++ E++ SG+K +G +A N +GA+LSD + LN A L+ V
Sbjct: 16 NFSGIELCEANLSGAKLSGINFSEANLSVVNLSGANLSDANLSHAKLNVARLSGVNFVGA 75
Query: 192 VLTRSDLGGAIIEGADFSDA 211
++ + L A + AD S A
Sbjct: 76 IMNYASLNVANLIRADLSRA 95
>gi|145355959|ref|XP_001422212.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582452|gb|ABP00529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 125
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 61/135 (45%), Gaps = 20/135 (14%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+ F L++A NF AN T + +D S + F A L A +AN TGAD
Sbjct: 11 GTGEYFTKGSLKRA-----NFNDANLTGITLFGADLSNATFVNANLSNANLGQANLTGAD 65
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
TNA+L +++ + L + +D+SD ++ LCK A+G
Sbjct: 66 F---------------TNAILSGAIVSSAQLDEVKLTNSDWSDVIVRKDVLTGLCKVADG 110
Query: 228 TNPITGVSTRKSLGC 242
NP+TG T SL C
Sbjct: 111 ENPVTGNITALSLMC 125
>gi|158336687|ref|YP_001517861.1| hypothetical protein AM1_3555 [Acaryochloris marina MBIC11017]
gi|158306928|gb|ABW28545.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 315
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
++ AD++E DFSG + A L + A +K N GA+L++ + R L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
L LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293
>gi|110597243|ref|ZP_01385531.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
gi|110341079|gb|EAT59547.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
Length = 447
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + F SA L +A N + NF ADM+ + G+ GA L++A A+ + +L
Sbjct: 304 SGSSFKSASLDEANLAGANLSKVNFHKADMKGAHLQGANLQGANLDRAFLKDADLSNTNL 363
Query: 169 SD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ T++ L ANL NA L L ++LGGA ++GA+ +DA
Sbjct: 364 SNAVLFGTILTGANLQNANLENASLFEADLEEANLGGANLKGANITDA 411
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 17/145 (11%)
Query: 81 VASCSSNI----SALADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF 133
+AS ++NI + L D + EA G + S ++ A+L+ A N A
Sbjct: 46 LASPAANIDLYKAVLEDADLSEANLGGALLVRSDLSGSKLNRANLKGA-----NLMMAFI 100
Query: 134 TSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
ADM+ +D SG + G+++++A+ AN GA+L +++ + +ANL N VL
Sbjct: 101 KKADMKGTDLSGACLIKANMKGSFMKEAIFRGANLQGANLRWVMLEEADMEDANLANTVL 160
Query: 189 VRTVLTRSDLGGAIIEGADFSDAVI 213
L ++L GA ++ A F D +
Sbjct: 161 FEANLENANLKGANLKDAVFLDQAL 185
>gi|357014784|ref|ZP_09079783.1| hypothetical protein PelgB_35370 [Paenibacillus elgii B69]
Length = 843
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL--EKAVAYKANFTGADLSDTLMD 174
DL A + + +F AD+ +D SG GA + F A LS M+
Sbjct: 154 DLTWAYMASADLKSVSFEDADLSHADLSGCNLYGALFTGDDLKLSHTVFASATLSYARMN 213
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+V++ A+ TNAV+ LT S+L G + GAD +DA+I+ AQ Q
Sbjct: 214 EIVIDSADFTNAVMTNVYLTNSNLQGNSLTGADMTDALINGAQFQ 258
Score = 37.0 bits (84), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 111 AQFGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
AQF +ADL A + F +AN T AD+ + + GA L T
Sbjct: 255 AQFQNADLTGAKLYGATATETRFDKANLTKADLTRAMITDFHIPGAMLAYTKLDNQTLTT 314
Query: 166 ADL-SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A++ +DT + L+N +L +DL GA+++G D +
Sbjct: 315 AEIDADTDFTGASMQNVFLSNCMLQGVTFAHADLTGAVLDGTDLT 359
>gi|436841883|ref|YP_007326261.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
14728]
gi|432170789|emb|CCO24160.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
14728]
Length = 1278
Score = 57.0 bits (136), Expect = 1e-05, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 6/100 (6%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
AD R A K F+ + AD R++D + FNGA K V K NF GA+L R
Sbjct: 1094 ADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGA---KGV--KVNFAGANLDKLRTGR 1148
Query: 176 MV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
EA+ T A L + +DL GA+ GAD +A++D
Sbjct: 1149 NAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVD 1188
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 23/135 (17%)
Query: 86 SNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKA-VH---------VKENFRRAN 132
S +S AD EA+ R F I + AD RKA VH VK NF AN
Sbjct: 1085 SMVSGKAD----EADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGAKGVKVNFAGAN 1140
Query: 133 FT------SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+A+ E+DF+G+ + + A F GADL + L+D +L +ANL A
Sbjct: 1141 LDKLRTGRNAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVDNCMLVDANLNGA 1200
Query: 187 VLVRTVLTRSDLGGA 201
T+S+L GA
Sbjct: 1201 SAKGARFTKSNLEGA 1215
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 40/138 (28%), Positives = 69/138 (50%), Gaps = 16/138 (11%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEK----- 155
G A F +A ++K++ A+F AD+ E+ F+G+K F GA L+K
Sbjct: 1089 GKADEADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGAKGVKVNFAGANLDKLRTGR 1148
Query: 156 -AVAYKANFTGADLS-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A +A+FTGA L +T + + A+L NA++ +L ++L GA +GA F+
Sbjct: 1149 NAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVDNCMLVDANLNGASAKGARFT 1208
Query: 210 DAVIDLAQKQALCKYANG 227
+ ++ A +A + G
Sbjct: 1209 KSNLEGASMRAFNLFMGG 1226
Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats.
Identities = 34/125 (27%), Positives = 62/125 (49%), Gaps = 14/125 (11%)
Query: 107 IGSAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKA 156
+G +A F A L+ +A+ F ++ T A R++ F GS F GA L + A
Sbjct: 1006 MGRSADFTKASLKGVNFERAMLGNAIFEESDLTGAQARQASFKGSSFKGATLADAVFDMA 1065
Query: 157 VAYKANFTGADLSDTLMDRMVL----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+ K +F+ A+LS ++ ++ +EA+ NA + +++ S L GA AD + +
Sbjct: 1066 ILEKTDFSKANLSGARINMSMVSGKADEADFRNAFIKKSIFKGSTLDGADFRKADVHETL 1125
Query: 213 IDLAQ 217
+ A+
Sbjct: 1126 FNGAK 1130
Score = 42.7 bits (99), Expect = 0.16, Method: Composition-based stats.
Identities = 24/86 (27%), Positives = 41/86 (47%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN T ++ +DF + + A L + + A+FT A L +R +L A + L
Sbjct: 980 ANLTGCQLKNTDFKETCLDNAKLIQTMGRSADFTKASLKGVNFERAMLGNAIFEESDLTG 1039
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLA 216
++ G+ +GA +DAV D+A
Sbjct: 1040 AQARQASFKGSSFKGATLADAVFDMA 1065
>gi|116754331|ref|YP_843449.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
PT]
gi|116665782|gb|ABK14809.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
Length = 389
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F A L A + FR + F+ A++ ++ +G+ +G+ ++ +A TGADL
Sbjct: 177 SHANFVGAHLSWADMSRSRFRESQFSRAELYGANLTGTDLSGSDFTRSYMMRARMTGADL 236
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
SD +D L EA L + L + +DL GA + GAD S+ V+D
Sbjct: 237 SDASLDYADLTEAELRDTDLSGCKMRYADLSGANLAGADISEVVLD 282
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 78/176 (44%), Gaps = 33/176 (18%)
Query: 46 SDGQFPDCSNNQCAGP---YAKLKNWRVFVSTALAAAV-VASCSSNISALADLNKYEAET 101
+D D S +G AKL+N R+ ++ + A + +A C+ + + D++ +AE
Sbjct: 99 ADLSMADLSGANLSGTDLSRAKLRNARLSGASLVNANLTMADCTEAL--MDDVSLEDAEM 156
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
G +F DL AV + ANF A + +D S S+F + +A Y A
Sbjct: 157 TG-------TRFFRTDLTGAVFSGASLSHANFVGAHLSWADMSRSRFRESQFSRAELYGA 209
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
N TG DLS + + T + ++R +T GAD SDA +D A
Sbjct: 210 NLTGTDLSGS----------DFTRSYMMRARMT----------GADLSDASLDYAD 245
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 49/111 (44%), Gaps = 15/111 (13%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 160
A A LR A V N A+ AD+ +D SG+ +G A L A
Sbjct: 74 ANLNGAYLRSAWLVNANLEGASLAGADLSMADLSGANLSGTDLSRAKLRNARLSGASLVN 133
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN T AD ++ LMD + L +A +T RT DL GA+ GA S A
Sbjct: 134 ANLTMADCTEALMDDVSLEDAEMTGTRFFRT-----DLTGAVFSGASLSHA 179
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ T A++R++D SG K A L A N GAD+S+ ++D + NL+ A+L +
Sbjct: 244 ADLTEAELRDTDLSGCKMRYADLSGA-----NLAGADISEVVLDSVKTTGVNLSGAILYK 298
Query: 191 TVLTRSDLGGAIIEGADFSDAVID 214
T L DL + G A +D
Sbjct: 299 TSLFNLDLRDIDMHGVQIKKAKMD 322
>gi|347755497|ref|YP_004863061.1| putative low-complexity protein [Candidatus Chloracidobacterium
thermophilum B]
gi|347588015|gb|AEP12545.1| putative low-complexity protein [Candidatus Chloracidobacterium
thermophilum B]
Length = 419
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 60/123 (48%), Gaps = 8/123 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 168
A SA LR A V+ N AN AD+ ++ G+ GA L +A AN GADL
Sbjct: 57 ANLASASLRDAFLVRANLEGANLRGADLESANLEGANLRGADLSRANLEGANLEGADLTG 116
Query: 169 ----SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCK 223
S L+D L A L NAV L + LGGA + DF +A+++ A ++AL
Sbjct: 117 ARLPSAQLID-AKLGVATLENAVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLT 175
Query: 224 YAN 226
AN
Sbjct: 176 GAN 178
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 50/99 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +ADLR A N +F +A + ++F + GA L AV +A GADLS
Sbjct: 137 AVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLTGANLRDAVLRRAVLPGADLSG 196
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++R VL A+L+ L+ + GA ++GA FS
Sbjct: 197 AKLERAVLEGADLSQVSLLEADCRHATFQGARLKGAKFS 235
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTG 165
A ADL A + + RRAN SA +R+ ++ G+ GA LE A AN G
Sbjct: 37 ANLRRADLEGANLEEASLRRANLASASLRDAFLVRANLEGANLRGADLESANLEGANLRG 96
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS ++ L A+LT A L L + LG A +E A F++A
Sbjct: 97 ADLSRANLEGANLEGADLTGARLPSAQLIDAKLGVATLENAVFANA 142
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 5/88 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ G A L AV + R A A++ DF + A E+A+ TGA+L D
Sbjct: 127 AKLGVATLENAVFANADLRNAYLGGANLTAVDFQNAILEAANFEEAL-----LTGANLRD 181
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDL 198
++ R VL A+L+ A L R VL +DL
Sbjct: 182 AVLRRAVLPGADLSGAKLERAVLEGADL 209
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL KA N RRA+ A++ E+ + A L A +AN GA+L ++
Sbjct: 28 DLAKANLDNANLRRADLEGANLEEASLRRANLASASLRDAFLVRANLEGANLRGADLESA 87
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L ANL A L+R++L GA +EGAD + A + AQ
Sbjct: 88 NLEGANLRGA-----DLSRANLEGANLEGADLTGARLPSAQ 123
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 72/169 (42%), Gaps = 36/169 (21%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAV------ 157
+A A+LR A + N AN AD+ + +K A LE AV
Sbjct: 85 ESANLEGANLRGADLSRANLEGANLEGADLTGARLPSAQLIDAKLGVATLENAVFANADL 144
Query: 158 --AY--KANFTGADLSDTLM-----DRMVLNEANLTNAVLVRTVLTRSDLGG-----AII 203
AY AN T D + ++ + +L ANL +AVL R VL +DL G A++
Sbjct: 145 RNAYLGGANLTAVDFQNAILEAANFEEALLTGANLRDAVLRRAVLPGADLSGAKLERAVL 204
Query: 204 EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGS 252
EGAD S + +A C++A G + G SR + YGS
Sbjct: 205 EGADLSQVSL----LEADCRHAT----FQGARLK---GAKFSRTHLYGS 242
>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 331
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 9/138 (6%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 130 -RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
R + S ++R +D G+ G L A +AN GA+L++ ++ +LN+ NL+ L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLTECVLRGAILNQTNLSETNL 199
Query: 189 VRTVLTRSDLGGAIIEGA 206
+LT +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 6/126 (4%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSK 147
LNKY + + G+ A+ +ADL A +F+ ANF A ++ ++ +K
Sbjct: 7 LNKYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAK 66
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
GA L +A A T AD T++ L +ANLT A LV L ++DL GA ++GAD
Sbjct: 67 LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126
Query: 208 FSDAVI 213
A +
Sbjct: 127 LRGACL 132
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYL----- 153
S AQ AD + + R+AN T AD+R ++ G+ GA L
Sbjct: 78 SGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGADLRGACLRGANM 137
Query: 154 --EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
E+ + N GADL T + + L A+LT A L+ LT L GAI+ + S+
Sbjct: 138 RYERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLTECVLRGAILNQTNLSE 196
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 48/104 (46%), Gaps = 10/104 (9%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR A+ + N N A + E + SG+ G+ + K +A T A + + +
Sbjct: 184 LRGAILNQTNLSETNLQGAILTEVNLSGANLIGSRMVKVKLERAILTNAQMPRVELCDSI 243
Query: 178 LNEANLTN----------AVLVRTVLTRSDLGGAIIEGADFSDA 211
L +ANL+N A LVR L R++L A + AD +DA
Sbjct: 244 LPDANLSNANLSHANLSRANLVRAELNRTNLSSANLTQADLTDA 287
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 5/85 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN ++A++ ++ S + A L + AN T ADL+D + R L ANL+ A L R
Sbjct: 247 ANLSNANLSHANLSRANLVRAELNRTNLSSANLTQADLTDASLGRTNLRNANLSYAYLTR 306
Query: 191 TVLTRS-----DLGGAIIEGADFSD 210
T + + +L GAI+ + D
Sbjct: 307 TEFSSANTIGVNLHGAIMPNGEIHD 331
>gi|300863681|ref|ZP_07108615.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338313|emb|CBN53761.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 238
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 8/109 (7%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F ADLR++ K NF +A+F D+ ES G+ A L +AV +A+ +GA L+D
Sbjct: 36 EFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLTEANLYRAVLREADLSGAKLTDA 95
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---SDAVIDLAQ 217
L EANL A L L R+ L AI+ AD SD + DL Q
Sbjct: 96 -----NLEEANLMKACLSGANLVRAKLLRAILFEADLRSTSDQITDLGQ 139
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F DL +++ + AN A +RE+D SG+K A LE+A KA +GA+L
Sbjct: 53 TQASFQETDLSESILWGTDLTEANLYRAVLREADLSGAKLTDANLEEANLMKACLSGANL 112
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ R +L EA+L + T +DLG AI+ AD S
Sbjct: 113 VRAKLLRAILFEADLRS-----TSDQITDLGQAILTNADLS 148
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 8/95 (8%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK--------ANFTGADL 168
DL +A+ + +N + A + +++ G+K A+L + + + A+ GADL
Sbjct: 136 DLGQAILTNADLSYSNLSGALLYQANLDGAKLCRAHLNETIQQRFLATNLSEASLQGADL 195
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S + +L +ANL A + RT+LT +DL GAI+
Sbjct: 196 SYADLSGAILRKANLRGADMTRTILTNTDLEGAIM 230
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K +F + D+ ++ G +F+ A L ++ K NFT A +T + +L +LT
Sbjct: 14 KRSFHQVKLQEIDLLNAELQGIEFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLT 73
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDA 211
A L R VL +DL GA + A+ +A
Sbjct: 74 EANLYRAVLREADLSGAKLTDANLEEA 100
>gi|75910595|ref|YP_324891.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704320|gb|ABA23996.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 521
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 6/116 (5%)
Query: 115 SADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
SA+LR A + NFR AN + AD+ R +D SG + A L A AN GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 224
+ + L ANL A L+R +DL AI+ GA +S + L + +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SADLR+A N R AN A ++ + G+ A L + + + T A+L
Sbjct: 118 SEANLNSADLREATLRHANLRHANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANL 177
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
D + ++ ANL+ A L L +DL G + AD S+A
Sbjct: 178 RDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNA 220
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 9/107 (8%)
Query: 125 KENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
+ N AN + +++ E+DFS +K N GA L A+ ++ A+L + + R L
Sbjct: 39 QANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRSDLSRAQLR 98
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
A+L A L+R L+R DL A + AD +A + + A ++AN
Sbjct: 99 GASLVRAELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +DL + N R A + R ++ SG+ +GA L A N + ADLS+
Sbjct: 160 ANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSN 219
Query: 171 TLMD--RMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ +V L+ ANLTNA LV L ++ L A GAD + A++
Sbjct: 220 AKLSGANLVGADLSNANLTNASLVHANLIQAKLIRAEWVGADLTSAIL 267
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 62/136 (45%), Gaps = 8/136 (5%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRAN 132
L A+ S N++ L + A+ RG + + A+ DL +A + R A
Sbjct: 72 LTNAIFNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREAT 131
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
A++R ++ +G+ GA L A AN G+DLS R L ANL +A L +
Sbjct: 132 LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS-----RCDLTSANLRDAELKQVN 186
Query: 193 LTRSDLGGAIIEGADF 208
++L GA + GA+
Sbjct: 187 FRHANLSGADLSGANL 202
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 59/121 (48%), Gaps = 26/121 (21%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 186
NF+ D+ E++ SG K G +A AN +G++LS+ LN A NLTNA
Sbjct: 16 NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75
Query: 187 V----------LVRTVLTRSDLGGAIIEGADFSDAV---IDLAQ--------KQALCKYA 225
+ L+R+ L+R+ L GA + A+ A +DL++ ++A ++A
Sbjct: 76 IFNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135
Query: 226 N 226
N
Sbjct: 136 N 136
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A+ A+L A+ F ++ A++ SD S ++ GA L +A +A + DL
Sbjct: 63 NVARLSGANLTNAI-----FNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDL 117
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S+ LN A+L A L L ++L GA ++GA A +++A
Sbjct: 118 SEA-----NLNSADLREATLRHANLRHANLNGASLKGASLVGANLEMA 160
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L+ A V N AN +D+ D + + A L++ AN +GADLS
Sbjct: 140 ANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L A+L+ L L+ + L GA + GAD S+A
Sbjct: 200 A-----NLRWADLSGVNLSWADLSNAKLSGANLVGADLSNA 235
>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 418
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
S A F +DL A+ ++ + RRAN + A++ E +D SG F+G+ L +A +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
G + S R L EAN +N L+ SDL GA + A+F++A
Sbjct: 203 LGTNFS-----RTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEA 245
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A LR+A + NF N + AD+R + SG+ GA L A+ GADL
Sbjct: 58 SGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLS-----TADLIGADL 112
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ +L EA+L+ LV T +T ++L A G+D S A++
Sbjct: 113 RRATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIM 157
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A + + RRA A + E+D S + G + A ANFTG+DL
Sbjct: 93 SGADLRGANLSTADLIGADLRRATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDL 152
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S +M R L AN++ A L ++R+DL G G++ S A
Sbjct: 153 SGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQA 195
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 72/146 (49%), Gaps = 20/146 (13%)
Query: 95 NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N E + G IG S A F ADLR+A N ANF +A+++E+D SG+ GA
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRA-----NLVGANFNNANLKEADLSGAYLIGA 275
Query: 152 YLEKAVAYKANF----------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
L A +A+F TGADL+ + L+ ANL++ L LT +DL A
Sbjct: 276 TLVNANIVRADFRRANLIGADLTGADLTGADLVGANLSGANLSDCNLTSVSLTSADLSMA 335
Query: 202 IIEGADFSDAVIDLAQKQALCKYANG 227
D ++A +L++ QAL +G
Sbjct: 336 NFANCDLTNA--NLSRVQALSTNFSG 359
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 50/102 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F DL + + + ANF AD+R ++ G+ FN A L++A A GA L
Sbjct: 218 SNTNFREVDLSGSDLIGADLSNANFAEADLRRANLVGANFNNANLKEADLSGAYLIGATL 277
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ + R ANL A L LT +DL GA + GA+ SD
Sbjct: 278 VNANIVRADFRRANLIGADLTGADLTGADLVGANLSGANLSD 319
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 50/100 (50%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
F +L +A NFR + + +D+ +D S + F A L +A ANF A+L +
Sbjct: 206 NFSRTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEADLRRANLVGANFNNANLKEA 265
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L A L NA +VR R++L GA + GAD + A
Sbjct: 266 DLSGAYLIGATLVNANIVRADFRRANLIGADLTGADLTGA 305
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 61/134 (45%), Gaps = 13/134 (9%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N EA+ G + IG A L A V+ +FRRAN AD+ +D +G+ GA L
Sbjct: 261 NLKEADLSGAYLIG-------ATLVNANIVRADFRRANLIGADLTGADLTGADLVGANLS 313
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEAN--LTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
A N T L T D + N AN LTNA L R ++ GA++ GA+ D
Sbjct: 314 GANLSDCNLTSVSL--TSADLSMANFANCDLTNANLSRVQALSTNFSGAMLTGANLEDWS 371
Query: 213 IDLAQK--QALCKY 224
++ K C Y
Sbjct: 372 VNSKTKLDDVECDY 385
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 11/90 (12%)
Query: 123 HVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
H+++ + +FT A++ E DF+G+ KANF+GADLS + R E
Sbjct: 26 HIQDLDLSDCDFTGANLSEVDFAGTDL----------QKANFSGADLSRAKLRRATFGET 75
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
N +N L L R +L GA + GA+ S A
Sbjct: 76 NFSNTNLSEADLRRVNLSGADLRGANLSTA 105
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 15/101 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F DL+KA NF A+ + A +R + F + F+ L +A + N +GADL
Sbjct: 43 SEVDFAGTDLQKA-----NFSGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADL 97
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
ANL+ A L+ L R+ L GAI+ AD S
Sbjct: 98 RG----------ANLSTADLIGADLRRATLEGAILAEADLS 128
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR+A + + + R N +M +++ S + F G+ L A+ +A+
Sbjct: 103 STADLIGADLRRATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIMIRADL 162
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A++S R LNEA+++ A L + S+L A E A+F
Sbjct: 163 RRANIS-----RANLNEADISRADLSGVDFSGSNLSQANFEEANF 202
>gi|17228637|ref|NP_485185.1| hypothetical protein alr1142 [Nostoc sp. PCC 7120]
gi|17130488|dbj|BAB73099.1| alr1142 [Nostoc sp. PCC 7120]
Length = 521
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 6/116 (5%)
Query: 115 SADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
SA+LR A + NFR AN + AD+ R +D SG + A L A AN GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 224
+ + L ANL A L+R +DL AI+ GA +S + L + +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SADLR+A N R AN A ++ + G+ A L + + + T A+L
Sbjct: 118 SEANLNSADLREATLRHANLRHANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANL 177
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
D + ++ ANL+ A L L +DL G + AD S+A
Sbjct: 178 RDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNA 220
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 9/107 (8%)
Query: 125 KENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
+ N AN + +++ E+DFS +K N GA L A+ ++ A+L + R L
Sbjct: 39 QANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRADLSRAQLR 98
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
A+L A L+R L+R DL A + AD +A + + A ++AN
Sbjct: 99 GASLVRAELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +DL + N R A + R ++ SG+ +GA L A N + ADLS+
Sbjct: 160 ANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSN 219
Query: 171 TLMD--RMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ +V L+ ANLTNA LV L ++ L A GAD + A++
Sbjct: 220 AKLSGANLVGADLSNANLTNASLVHANLIQAKLIRAEWVGADLTSAIL 267
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 62/136 (45%), Gaps = 8/136 (5%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRAN 132
L A+ S N++ L + A+ RG + + A+ DL +A + R A
Sbjct: 72 LTNAIFNHSSLNVANLIRADLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREAT 131
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
A++R ++ +G+ GA L A AN G+DLS R L ANL +A L +
Sbjct: 132 LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS-----RCDLTSANLRDAELKQVN 186
Query: 193 LTRSDLGGAIIEGADF 208
++L GA + GA+
Sbjct: 187 FRHANLSGADLSGANL 202
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 26/121 (21%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 186
NF+ D+ E++ SG K G +A AN +G++LS+ LN A NLTNA
Sbjct: 16 NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75
Query: 187 V----------LVRTVLTRSDLGGAIIEGADFSDAV---IDLAQ--------KQALCKYA 225
+ L+R L+R+ L GA + A+ A +DL++ ++A ++A
Sbjct: 76 IFNHSSLNVANLIRADLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135
Query: 226 N 226
N
Sbjct: 136 N 136
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L+ A V N AN +D+ D + + A L++ AN +GADLS
Sbjct: 140 ANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L A+L+ L L+ + L GA + GAD S+A
Sbjct: 200 A-----NLRWADLSGVNLSWADLSNAKLSGANLVGADLSNA 235
>gi|254409695|ref|ZP_05023476.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183692|gb|EDX78675.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 350
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 35/136 (25%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL----------------- 153
A ADL+ A+ ++ +A+ T+A +RE+D SG+ GA L
Sbjct: 66 ADLSKADLKNALLIEATLSQADLTAAILREADLSGAILTGATLLDADLRHATLIGTSLID 125
Query: 154 ---EKAVAYKANFTG----------ADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTR 195
++A KAN TG ADL +++R +L++ ANL A +R L R
Sbjct: 126 AKMKRAKLAKANCTGASFSRANLKAADLQGVILNRAILSQADLRGANLRGACFIRAYLHR 185
Query: 196 SDLGGAIIEGADFSDA 211
+DL A + GAD SDA
Sbjct: 186 ADLRDANLTGADLSDA 201
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 6/119 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +ADL+ + RA + AD+R ++ G+ F AYL +A AN TGADL
Sbjct: 144 SRANLKAADLQGVI-----LNRAILSQADLRGANLRGACFIRAYLHRADLRDANLTGADL 198
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA-LCKYAN 226
SD + L+ ANL+ A L L+ ++L GA + GA +A + LA L K AN
Sbjct: 199 SDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSGLLLKKAN 257
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 47/92 (51%), Gaps = 5/92 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSDTLMDRMVLN 179
+ NF N AD+ E++ S + A+L++A KA GA DLS + +L
Sbjct: 20 ERNFPGVNLIRADLTEANLSRINLSAAHLQRANLAKAKLIGAQLKDADLSKADLKNALLI 79
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
EA L+ A L +L +DL GAI+ GA DA
Sbjct: 80 EATLSQADLTAAILREADLSGAILTGATLLDA 111
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL+ A N RAN + A++ ++ +G+ GA+L+ A AN +G
Sbjct: 194 TGADLSDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSG--- 250
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++L +ANL +A L + L R++L A + GA+ +A ++
Sbjct: 251 -------LLLKKANLQSAQLSKANLNRANLYKANLSGANLLEANLE 289
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR A F RA AD+R+++ +G+ + A L+ A AN + A+LS
Sbjct: 161 AILSQADLRGANLRGACFIRAYLHRADLRDANLTGADLSDADLKGADLSHANLSRANLSC 220
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L ANLT A L L+ ++L G +++ A+ A +
Sbjct: 221 ANLSHANLTGANLTGAHLQNANLSLANLSGLLLKKANLQSAQL 263
>gi|358458677|ref|ZP_09168884.1| pentapeptide repeat protein [Frankia sp. CN3]
gi|357077988|gb|EHI87440.1| pentapeptide repeat protein [Frankia sp. CN3]
Length = 377
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 55/109 (50%), Gaps = 5/109 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A+ DL A V A+ T A + +D +G++ + A L+ A AN TG
Sbjct: 223 ARLAGRDLTFATFVAARLTGADLTGAVLAKTKLTATDLAGTRLSRANLDGADLANANLTG 282
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A L D ++ L+EA L A+L R L R+DL GA + GAD + A +D
Sbjct: 283 ARLDDAVLTGAHLSEARLVGAILTRADLHRADLVGADLTGADLTGARLD 331
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
++T A + G++ G L A A TGADL+ ++ + L +L L R
Sbjct: 209 DWTIAHYPGAQLVGARLAGRDLTFATFVAARLTGADLTGAVLAKTKLTATDLAGTRLSRA 268
Query: 192 VLTRSDLGGAIIEGADFSDAVI 213
L +DL A + GA DAV+
Sbjct: 269 NLDGADLANANLTGARLDDAVL 290
>gi|33861206|ref|NP_892767.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
gi|33639938|emb|CAE19108.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
Length = 157
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 10/130 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +DL+ A + + AN + D++ + G+K N + ++L +
Sbjct: 35 FSGSDLQGATFYLTDLQDANLSDCDLQNASLYGAKLK----------DTNLSNSNLREVT 84
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
+D VL+ +LTN L + + I+GADF++ + + CK A+GTNP T
Sbjct: 85 LDSAVLDGTDLTNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDVLREFCKDASGTNPFT 144
Query: 233 GVSTRKSLGC 242
TR++L C
Sbjct: 145 NRETRETLEC 154
>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 331
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 10/111 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 164
S A ++L KA ++ NF RAN T A + ++D S GA L A+ K N T
Sbjct: 65 SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLS-----GAILSSAIGTKTNLTAAIL 119
Query: 165 -GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
G L T + + L EANLT A L +LT S+L AI+ A S+A ++
Sbjct: 120 IGCSLVGTQLLKSKLKEANLTGASLTGAILTGSNLTRAILTRAILSNANLE 170
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 5/83 (6%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
+K NF+RA+ + D++ + + FN A L AN +GADLS + +++ L E N
Sbjct: 30 IKANFQRASLNNIDLKMAVLKKANFNQAQL-----INANLSGADLSQSNLEKAQLIETNF 84
Query: 184 TNAVLVRTVLTRSDLGGAIIEGA 206
+ A L L ++DL GAI+ A
Sbjct: 85 SRANLTEASLIQADLSGAILSSA 107
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSD 170
ADLR A N + AN +++++D + + + A LE AV AN G +L+
Sbjct: 202 ADLRGANLEGANLQGANLEGVNLQDADLTEANLSAANLEGAVLSNANLQQVILKGTNLTG 261
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T + L +ANL+ A L + L +DL GA + GAD + A
Sbjct: 262 TNLLNANLGQANLSQANLCQAGLLFTDLTGANLMGADLTSA 302
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 1/94 (1%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
+R +H + N ++AN AD+R +D G+ GA L+ A N ADL++ +
Sbjct: 180 IRAYLH-RVNLKKANLEKADLRFADLRGANLEGANLQGANLEGVNLQDADLTEANLSAAN 238
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L A L+NA L + +L ++L G + A+ A
Sbjct: 239 LEGAVLSNANLQQVILKGTNLTGTNLLNANLGQA 272
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 53/95 (55%), Gaps = 10/95 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A + DL+ AV ++ANF A + ++ SG+ + + LEKA + NF+ A+L++
Sbjct: 37 ASLNNIDLKMAV-----LKKANFNQAQLINANLSGADLSQSNLEKAQLIETNFSRANLTE 91
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
L +A+L+ A+L + T+++L AI+ G
Sbjct: 92 A-----SLIQADLSGAILSSAIGTKTNLTAAILIG 121
>gi|56751008|ref|YP_171709.1| hypothetical protein syc0999_c [Synechococcus elongatus PCC 6301]
gi|81299332|ref|YP_399540.1| hypothetical protein Synpcc7942_0521 [Synechococcus elongatus PCC
7942]
gi|56685967|dbj|BAD79189.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168213|gb|ABB56553.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 195
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A+ V + RRA A +RE+D SG+ GA L ++ +A G++L +++
Sbjct: 49 ADLTGAILVGADLRRAWLRGAILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLND 108
Query: 176 MVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L EAN L A LVR +L R+D A + AD S A I
Sbjct: 109 SILAEANCRQACLQQADLVRAILYRTDFTAADLHEADLSHAFI 151
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 118 LRKAVHVKENFRRANFTSA-DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
LR+ V +R N T D+R++D S LE+A A GADL +
Sbjct: 10 LRRGTAVWSRWRSQNPTVIPDLRQADLSFVDLVNVDLERADLTGAILVGADLRRAWLRGA 69
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYA 225
+L EA+ + A L+ L RSDL A + G++ A++ D +A C+ A
Sbjct: 70 ILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLNDSILAEANCRQA 119
>gi|167771967|ref|ZP_02444020.1| hypothetical protein ANACOL_03340 [Anaerotruncus colihominis DSM
17241]
gi|167665765|gb|EDS09895.1| pentapeptide repeat protein [Anaerotruncus colihominis DSM 17241]
Length = 314
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 67/129 (51%), Gaps = 9/129 (6%)
Query: 94 LNKYEAETRGEF-GIG---SAAQFGSADLRKAVH-----VKENFRRANFTSADMRESDFS 144
L+K+ A RGE G+ + A ADL KA N +AN + A++ ++ S
Sbjct: 7 LDKHAAWLRGEPEGVKADLTGANLPGADLSKANLSGANLFGANLSKANLSGANLFGANLS 66
Query: 145 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
G+ GA L KA AN +GADLS T + L++ANL+ A L L+R+ L GA +
Sbjct: 67 GANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLS 126
Query: 205 GADFSDAVI 213
A+ S A +
Sbjct: 127 KANLSKANL 135
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 65/133 (48%), Gaps = 11/133 (8%)
Query: 92 ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
ADL+K FG S A A+L A N +AN + A++ +D S
Sbjct: 33 ADLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLSR 92
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGG 200
+ GA L KA AN +GADLS T + + L++ANL+ A L L++++L G
Sbjct: 93 THLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSG 152
Query: 201 AIIEGADFSDAVI 213
A + GA+ S A +
Sbjct: 153 ANLFGANLSGANL 165
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 5/106 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL + + +AN + A++ +D S + GA L KA KAN +GA+L
Sbjct: 81 SGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANL 140
Query: 169 SDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 209
+ + L+ ANL + A L L++++L GA + GAD S
Sbjct: 141 FGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLS 186
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL + + +AN + A++ ++ G+ + A L A + AN +GA+L
Sbjct: 106 SGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLSGANL 165
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + L+ ANL+ A L RT L +DL A + A+ S A +
Sbjct: 166 FGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANL 210
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL KA K N AN A++ +++ SG+ GA L A + AN + A+LS +
Sbjct: 123 ADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSG 182
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
L+ +L A L + L++++L GA + G
Sbjct: 183 ADLSRTHLPGADLSKANLSKANLSGANLSG 212
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 55/105 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A + + A+ + A++ +++ SG+ GA L KA AN GA+L
Sbjct: 101 SKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLFGANL 160
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + L++ANL+ A L L+R+ L GA + A+ S A +
Sbjct: 161 SGANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKANL 205
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 50/108 (46%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A A+L A + R + AD+ +++ S + +GA L A KAN +GA+L
Sbjct: 97 GADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLF 156
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ L ANL+ A L L+ +DL + GAD S A + A
Sbjct: 157 GANLSGANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKAN 204
>gi|254526129|ref|ZP_05138181.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9202]
gi|221537553|gb|EEE40006.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9202]
Length = 148
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A +G L A + + A F D+++++ S + A L A N + ++L
Sbjct: 12 AALDYGKQSLIGADFSGSDLKGATFYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNL 71
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ +D +L+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 72 REVTLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGT 131
Query: 229 NPITGVSTRKSLGC 242
NPIT TR++L C
Sbjct: 132 NPITNRDTRETLEC 145
>gi|436670209|ref|YP_007317948.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428262481|gb|AFZ28430.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 309
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
+T E I S A DL K+ + RRA+ T AD+ E+D + A L +
Sbjct: 162 QTNWEGAILSQASLQRVDLEKSQLNETILRRADLTEADLVEADLRYADLTEAILCRVALE 221
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 213
AN GADLS + R L A+L AVL T L +DL A + G+DFSD+ +
Sbjct: 222 LANLVGADLSRATLKRASLFRADLEGAVLQDTNLVETDLRYANFKDTQLMGSDFSDSRV 280
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L A V+ RAN ++E+D +G+ F+ A L + N+ GA LS
Sbjct: 118 ADLSEASLESACLVQAVLSRANLFKVSLKEADCTGANFDEANLR-----QTNWEGAILSQ 172
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ R+ L ++ L +L R LT +DL A + AD ++A++
Sbjct: 173 ASLQRVDLEKSQLNETILRRADLTEADLVEADLRYADLTEAIL 215
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 51/90 (56%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ NF+ + + D++ ++ S F+ A L A AN +G L T ++R L +ANLT
Sbjct: 17 ERNFQNLDLSRVDLKGTNLKSSDFSHANLNSADLSYANLSGTSLIWTDLNRANLRQANLT 76
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A L+R+ L +DL A + A+ S+A+++
Sbjct: 77 QACLLRSSLFWADLQEATLVNANLSNALLN 106
Score = 44.7 bits (104), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 59/128 (46%), Gaps = 3/128 (2%)
Query: 87 NISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 146
NI +D+ K+ A+ F DL+ +F AN SAD+ ++ SG+
Sbjct: 2 NIINASDIVKHYADQERNF---QNLDLSRVDLKGTNLKSSDFSHANLNSADLSYANLSGT 58
Query: 147 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
L +A +AN T A L + + L EA L NA L +L +L A ++GA
Sbjct: 59 SLIWTDLNRANLRQANLTQACLLRSSLFWADLQEATLVNANLSNALLNHVNLTSACLKGA 118
Query: 207 DFSDAVID 214
D S+A ++
Sbjct: 119 DLSEASLE 126
>gi|428220994|ref|YP_007105164.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994334|gb|AFY73029.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 283
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 165
A +A++ A N R A A++R S +G+ F GA L +AV N T
Sbjct: 169 ANLDTANISDADLTNANLRWATLRDANLRGSILTGANGNLANFTGANLSQAVLRGINLTN 228
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
ADLS+ ++ L+ ANL A LV LT +DL GA I AD S AV+
Sbjct: 229 ADLSNAKLNAADLSNANLVGASLVGANLTSADLTGANITNADLSGAVM 276
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 15/95 (15%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----------GADLSDTL----- 172
+ +N +A + SD SG+ A L A YK+N + ADLSD
Sbjct: 111 LKESNLGNAYISTSDLSGANLTAANLRSASLYKSNLSLAILTQATLAEADLSDASFTEAN 170
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
+D +++A+LTNA L L ++L G+I+ GA+
Sbjct: 171 LDTANISDADLTNANLRWATLRDANLRGSILTGAN 205
>gi|434400818|ref|YP_007134822.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271915|gb|AFZ37856.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 209
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 30/137 (21%)
Query: 127 NFRRANFTSADMRESDFSGS---------------KFNGAYLEKAVAYKANFTGADLSDT 171
NF +AN T AD RE D + + A LE+AV Y+A+ +LS +
Sbjct: 36 NFSQANLTGADFREIDLTQAILCEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQS 95
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAI----------IEGADFSDAVIDLAQKQAL 221
++ L EANLT A+L +T L ++ L GA+ + GA+ S A++ QA
Sbjct: 96 ILTEADLREANLTEALLYKTSLGKAQLQGAVLNRAILQRTFLRGANLSQAIL----SQAN 151
Query: 222 CKYANGTNP-ITGVSTR 237
+ AN T+ +TG + R
Sbjct: 152 LQEANLTDADLTGANLR 168
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 64/131 (48%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
C +N+S + + E + A +L +++ + + R AN T A + ++
Sbjct: 58 CEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQSILTEADLREANLTEALLYKTSL 117
Query: 144 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
++ GA L +A+ + GA+LS ++ + L EANLT+A L L ++L GA +
Sbjct: 118 GKAQLQGAVLNRAILQRTFLRGANLSQAILSQANLQEANLTDADLTGANLRGANLQGAFL 177
Query: 204 EGADFSDAVID 214
A+ +A ++
Sbjct: 178 VEANLFEASLE 188
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 49/96 (51%), Gaps = 2/96 (2%)
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
++ +E R + R+ D S G L+ +AN TGAD + + + +L EA
Sbjct: 1 MNTEELLRLYAMGEREFRQVDLSYRVLRGVDLQAINFSQANLTGADFREIDLTQAILCEA 60
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
NL+ +L+ LT+++L A++ A +++L+Q
Sbjct: 61 NLSQTILIEANLTKANLERAVLYRASLQ--LVNLSQ 94
>gi|428216301|ref|YP_007100766.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988083|gb|AFY68338.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 188
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 63/147 (42%), Gaps = 12/147 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A DL A + + + AN ++R + S FN A L A N TGA D
Sbjct: 40 ADLHGCDLSGAYIIASDLQGANLADTNLRGASLKNSNFNRANLSWANMSWTNLTGASFMD 99
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----------ADFSDAV-IDLAQK 218
MD L+ ANL +A L L ++L G + G ADFS +D +
Sbjct: 100 ARMDVTNLSSANLIDADLRGANLQGANLRGTNLRGTQIEPLRSIDNADFSRVKNLDQRVR 159
Query: 219 QALCKYANGTNPITGVSTRKSLGCGNS 245
LC A G +P T STR +L C NS
Sbjct: 160 VYLCSIATGAHPFTKNSTRATLECNNS 186
>gi|392410624|ref|YP_006447231.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
gi|390623760|gb|AFM24967.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
Length = 285
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L+KA +F RA+ + AD+ +D SG+ +GA L A + + + DL
Sbjct: 161 SGADLFGAKLKKAALSAVDFSRADLSGADLSGADLSGAILSGARLNGANLSRVDLSFTDL 220
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
S + L+ ANLT A L + L+ +DL GA ++GAD +D
Sbjct: 221 SGAHLSGANLSAANLTGAYLPGSDLSGADLSGANLQGADITD 262
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 55/103 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+AA+ +L +A N A+ + A++ ++D S + +GA L KA+ A+ +GADL
Sbjct: 106 AAAKLVEINLTQANLCGANLCGADLSKANLSQADLSRAILSGANLSKALLPFADLSGADL 165
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L+ + + A L L+ +DL GAI+ GA + A
Sbjct: 166 FGAKLKKAALSAVDFSRADLSGADLSGADLSGAILSGARLNGA 208
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 20/118 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT------ 164
AQ DL A N A +SAD+ E++ SG+ + A+L A +AN +
Sbjct: 43 AQLSGEDLSFA-----NLSNAKLSSADLSEANLSGASLDRAHLTVAKLDRANLSNANASC 97
Query: 165 ----GADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 213
GA L+ + + L +ANL A L + L+++DL AI+ GA+ S A++
Sbjct: 98 AGLLGARLAAAKLVEINLTQANLCGANLCGADLSKANLSQADLSRAILSGANLSKALL 155
>gi|78033474|emb|CAJ30090.1| hypothetical acidic protein, pentapeptide repeat [Magnetospirillum
gryphiswaldense MSR-1]
gi|144901135|emb|CAM77999.1| pentapeptide repeat containing protein [Magnetospirillum
gryphiswaldense MSR-1]
Length = 503
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 57/100 (57%), Gaps = 8/100 (8%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+LRKAV N R +N A + ++D SG+K GA L A +ANF+GA++ R
Sbjct: 28 ANLRKAVLSGANLRDSNLPRASLEDADLSGAKLQGANLAGATLLRANFSGANM------R 81
Query: 176 MV-LNEANLTNAVLVRTV-LTRSDLGGAIIEGADFSDAVI 213
M L ANL + + V LT ++L GA + GA+FS A +
Sbjct: 82 MANLAGANLAGRMDLSGVDLTGANLAGAKLMGANFSGATL 121
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN A + ++FSG+ GA L A A ANF+GADL+D + +L+ AN++ AV+ R
Sbjct: 104 ANLAGAKLMGANFSGATLTGANLAGADARNANFSGADLTDAVTAGTLLDGANMSGAVIRR 163
Query: 191 T 191
+
Sbjct: 164 S 164
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
R + TSA++R ++ +G +G+ L A KA +GA+L D+ + R L +A+L+ A
Sbjct: 2 RPDLTSANLRGANLAGMDLSGSLLSLANLRKAVLSGANLRDSNLPRASLEDADLSGA--- 58
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
L ++L GA + A+FS A + +A
Sbjct: 59 --KLQGANLAGATLLRANFSGANMRMA 83
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES-----------DFSGSKFNGAYLEKAVAY 159
A A L+ A RANF+ A+MR + D SG GA L A
Sbjct: 53 ADLSGAKLQGANLAGATLLRANFSGANMRMANLAGANLAGRMDLSGVDLTGANLAGAKLM 112
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
ANF+GA L+ + AN + A L V G +++GA+ S AVI
Sbjct: 113 GANFSGATLTGANLAGADARNANFSGADLTDAV-----TAGTLLDGANMSGAVI 161
>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
Length = 243
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 14/125 (11%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
LNKY+ R F S LR+ + N + NF SAD+R+S S FNGA L
Sbjct: 7 LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57
Query: 154 EKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
++A + + N +LS ++ L+ A LTNA L L+++ L GA + A+
Sbjct: 58 KQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANLAKANL 117
Query: 209 SDAVI 213
S AV+
Sbjct: 118 SHAVL 122
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL +++ N N + A +R++D SG++ A L A KA+ GA+L
Sbjct: 53 NGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANL 112
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + VL E +L RT L R++L + A S A++
Sbjct: 113 AKANLSHAVLYEVDLRPLSNRRTNLGRANLSSTDLSYAKLSSALL 157
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 42/91 (46%), Gaps = 5/91 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGAD 167
F SAD+R++ K NF A AD+ ES G+ L KA+ A T AD
Sbjct: 37 FESADIRQSRLGKSNFNGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNAD 96
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
L++ + + L ANL A L VL DL
Sbjct: 97 LTNAYLSKASLCGANLAKANLSHAVLYEVDL 127
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 28/123 (22%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 166
S A DLR + + N RAN +S D+ A L A+ ++AN +GA
Sbjct: 118 SHAVLYEVDLRPLSNRRTNLGRANLSSTDLSY----------AKLSSALLFRANLSGAKL 167
Query: 167 ----------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
DL++ + L+ A+L NA+LV L +DL G ++GA+
Sbjct: 168 CRAELNQDPYKFPFLTDLTEANLQGADLSYADLGNAILVNANLKNADLTGTNLKGANLQG 227
Query: 211 AVI 213
A++
Sbjct: 228 AIM 230
>gi|119487545|ref|ZP_01621155.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
gi|119455714|gb|EAW36850.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
Length = 277
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 64/116 (55%), Gaps = 4/116 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ DL A ++ N AN T+A D +GS G+ + +AN T A+L++
Sbjct: 60 AKLMGVDLSDANLMEANLIGANLTNAKFDRCDLTGSNLRGSSSKLVSLTQANLTDANLTE 119
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ ANLTNA L+RT L +++L GA++EGA+ ++ ++ ++++ + AN
Sbjct: 120 ANLAEANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVIL----RESILEGAN 171
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 50/97 (51%), Gaps = 10/97 (10%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSDTLMDRMVLN 179
+ N AN T A++ E++F G+ A L + KAN TGA +L++ ++ +L
Sbjct: 109 QANLTDANLTEANLAEANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVILRESILE 168
Query: 180 EANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDA 211
ANL +A L +L T +D+ + GAD SDA
Sbjct: 169 GANLIHATLSGALLISANFTDADMSRVTMIGADLSDA 205
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 58/137 (42%), Gaps = 30/137 (21%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 160
A F A+L A ++ N +AN T A +RES G+ A L A+
Sbjct: 125 ANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVILRESILEGANLIHATLSGALLIS 184
Query: 161 ANFT----------GADLSDTLMDRM----------VLNEANLTNAVLVRTVLTRSDLGG 200
ANFT GADLSD + + L ANL+ A L RT L+ S+L G
Sbjct: 185 ANFTDADMSRVTMIGADLSDANLSGVNLRAANVSWTTLRGANLSRARLYRTKLSWSNLSG 244
Query: 201 AIIEGADFSDAVIDLAQ 217
A + A D +D A
Sbjct: 245 ANLIEAVLLDTRLDHAN 261
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 10/101 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
+A F AD+ + + + AN + ++R ++ S + GA L +A Y+ + ++LS
Sbjct: 184 SANFTDADMSRVTMIGADLSDANLSGVNLRAANVSWTTLRGANLSRARLYRTKLSWSNLS 243
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
ANL AVL+ T L ++L I GA D
Sbjct: 244 G----------ANLIEAVLLDTRLDHANLRDVDIRGAILPD 274
>gi|330509039|ref|YP_004385467.1| pentapeptide repeat-containing protein [Methanosaeta concilii GP6]
gi|328929847|gb|AEB69649.1| pentapeptide repeat protein [Methanosaeta concilii GP6]
Length = 386
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-VAY----KANF 163
S + AD +A ++ N AN ADM +D + + GA L+ A + Y KANF
Sbjct: 204 SGSDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVLTGASLKSAKMPYSDLTKANF 263
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
TGADLS+ +D +L A L NA L R L DL G + GA ++V+
Sbjct: 264 TGADLSEAYLDGAILAGATLRNAKLDRVNLREVDLRGLEMGGASLKNSVL 313
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 5/104 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A AD+ A + F RA S ++ SD S + F AYL ++N TGA++
Sbjct: 176 AHISWADMSVAYLSQGQFSRAELYSTNLSGSDLSDADFTRAYL-----MRSNLTGANIDW 230
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
M L EA LT A L + SDL A GAD S+A +D
Sbjct: 231 ADMAYADLTEAVLTGASLKSAKMPYSDLTKANFTGADLSEAYLD 274
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +DL+ N A SA + S +GS A L AV +A+ TGADL+
Sbjct: 51 AHLNQSDLQGCNLNGSNLDGAYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTG 110
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R+ +++A L A +V+ LT +D+ + + AD +DA
Sbjct: 111 ANLIRVQMSKAKLNGARIVKADLTEADISDSDLSDADLTDA 151
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLTN 185
A+ +D++ + +GS +GAYL A ++ G ADL+ ++ L A+LT
Sbjct: 51 AHLNQSDLQGCNLNGSNLDGAYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTG 110
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A L+R ++++ L GA I AD ++A I
Sbjct: 111 ANLIRVQMSKAKLNGARIVKADLTEADI 138
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADL AV + + A+ T A++ S +K NGA + KA +A+ + +DLSD +
Sbjct: 90 NADLTGAVLTEADLTGADLTGANLIRVQMSKAKLNGARIVKADLTEADISDSDLSDADLT 149
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L +L+ A L LT +++ GA I AD S A + Q
Sbjct: 150 DARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSVAYLSQGQ 192
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 59/118 (50%), Gaps = 15/118 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-------------V 157
A+ ADL +A + A+ T A + +D SG+K G YL A V
Sbjct: 126 ARIVKADLTEADISDSDLSDADLTDARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSV 185
Query: 158 AY--KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AY + F+ A+L T + L++A+ T A L+R+ LT +++ A + AD ++AV+
Sbjct: 186 AYLSQGQFSRAELYSTNLSGSDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVL 243
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 45/103 (43%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L+ A + +ANFT AD+ E+ G+ GA L A + N DL
Sbjct: 241 AVLTGASLKSAKMPYSDLTKANFTGADLSEAYLDGAILAGATLRNAKLDRVNLREVDLRG 300
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + A+L N+VL + +DL GAD DA +
Sbjct: 301 -----LEMGGASLKNSVLTGVFMAMTDLA-----GADLRDATL 333
>gi|434395496|ref|YP_007130443.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428267337|gb|AFZ33283.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 249
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L+ A N AN + AD+ E+D SG+ +GA L + AN + A L
Sbjct: 128 SGANLAQANLKGA-----NLTEANLSKADLTEADLSGADLSGATLSGVILSDANLSDAIL 182
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S ++ VL ANL+ AVL LT +L EGA+ S+AV+
Sbjct: 183 SRAILTLAVLQGANLSGAVLSGVNLTEVNL-----EGANLSNAVL 222
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 58/101 (57%), Gaps = 10/101 (9%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
G A+L + N N + A++ +++ G+ A L KA +A+ +GADLS
Sbjct: 112 LGGANLSQG-----NLSGVNLSGANLAQANLKGANLTEANLSKADLTEADLSGADLSGAT 166
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ ++L++ANL++A+L R +LT A+++GA+ S AV+
Sbjct: 167 LSGVILSDANLSDAILSRAILTL-----AVLQGANLSGAVL 202
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 44/78 (56%)
Query: 147 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
F+G L A +A ADLS+ ++ +L +A L+ A L RT+LT++DL A++ GA
Sbjct: 16 NFSGENLRSADLTRATLNAADLSEAILSEAILTQAELSEANLSRTILTKADLTEAVLAGA 75
Query: 207 DFSDAVIDLAQKQALCKY 224
+ A++ A+ + Y
Sbjct: 76 KLTGAILTEAELSRVNLY 93
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L A+ + R N A M + +G+ L A + N +G +LS
Sbjct: 70 AVLAGAKLTGAILTEAELSRVNLYDAFMLGVNLTGANVTEGNLGGANLSQGNLSGVNLSG 129
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + L ANLT A L + LT +DL GA + GA S ++
Sbjct: 130 ANLAQANLKGANLTEANLSKADLTEADLSGADLSGATLSGVIL 172
>gi|422295276|gb|EKU22575.1| hypothetical protein NGA_0469800 [Nannochloropsis gaditana CCMP526]
Length = 90
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
NF GAD S+ ++DR+ + +NL ++ VL+ + GA + +DF+D + + L
Sbjct: 7 NFEGADFSNAVVDRVSFDGSNLKGSIFSNAVLSGTSFVGADLTDSDFTDTYMGEFNLREL 66
Query: 222 CKYAN--GTNPITGVSTRKSLGC 242
CK GTNP+T T++S GC
Sbjct: 67 CKNPTLKGTNPVTQAPTKESAGC 89
>gi|163797086|ref|ZP_02191041.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
gi|159177602|gb|EDP62155.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
Length = 421
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 13/116 (11%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADLR +V + +A F++A + + DF+G+K GA L A A ADL+D
Sbjct: 51 ALFAGADLRGSVFAGGHLEQAQFSTARLEQVDFAGAKLMGANLRGANLKGAKLMAADLTD 110
Query: 171 --------TLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
M+R + L++A+L+NA VRT L+ +++ I G F AV+
Sbjct: 111 ADLRPAKIVDMNRTIEQSANLHKADLSNAQFVRTNLSGANMSAIIAVGTAFQSAVL 166
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 48/102 (47%), Gaps = 10/102 (9%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLS 169
SA+L KA F R N + A+M G+ F A L +A K++F G+DL
Sbjct: 128 SANLHKADLSNAQFVRTNLSGANMSAIIAVGTAFQSAVLRNVNLSRADLSKSSFKGSDLR 187
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L N +AVL T LT SDL ++GAD S A
Sbjct: 188 GS-----NLRGVNFADAVLTDTDLTGSDLRSCNLDGADLSGA 224
>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 402
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 55/103 (53%), Gaps = 8/103 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA K NF ANFT A + E+ G+ F AYL +A+ TGA+L+
Sbjct: 281 AILAGADLTKA---KANFTGANFTGAILTEAILIGANFEKAYL-----IRADLTGANLTG 332
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T + R L EA+LT A L R L ++ L AI+E A++
Sbjct: 333 TNLTRADLTEADLTGANLTRAYLIKAILEEAILEEVILRGAIL 375
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 60/113 (53%), Gaps = 12/113 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAV-------A 158
A A+L++A+ + NF A FT AD+ E++F+ + GA E+A+
Sbjct: 231 AILAEANLKRAILIGANFEGAIFTRADLAEANFTRAILTEAILIGANFEEAILAGADLTK 290
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
KANFTGA+ + ++ +L AN A L+R LT ++L G + AD ++A
Sbjct: 291 AKANFTGANFTGAILTEAILIGANFEKAYLIRADLTGANLTGTNLTRADLTEA 343
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 69/145 (47%), Gaps = 9/145 (6%)
Query: 71 FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
F L A++ A+ I A ADL K +A G A F A L +A+ + NF
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIGANF 315
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
+A AD+ ++ +G+ A L +A AN T A L +++ +L E L A+L
Sbjct: 316 EKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEAILEEVILRGAIL 375
Query: 189 VRTVLTRSDLGGAIIEGADFSDAVI 213
+LTR+ L GA ++GA D I
Sbjct: 376 RGAILTRAILRGANLKGATMPDGSI 400
Score = 43.9 bits (102), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + A++ E++F + A L++A+ ANF GA + R L EAN T A+L
Sbjct: 217 NISKANLTEANFKRAILAEANLKRAILIGANFEGA-----IFTRADLAEANFTRAILTEA 271
Query: 192 VLTRSDLGGAIIEGADFSDA 211
+L ++ AI+ GAD + A
Sbjct: 272 ILIGANFEEAILAGADLTKA 291
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
++ KA + NF+RA A+++ + G+ F GA +A +ANFT A L++
Sbjct: 217 NISKANLTEANFKRAILAEANLKRAILIGANFEGAIFTRADLAEANFTRAILTEA----- 271
Query: 177 VLNEANLTNAVLVRTVLT--RSDLGGAIIEGADFSDAVI 213
+L AN A+L LT +++ GA GA ++A++
Sbjct: 272 ILIGANFEEAILAGADLTKAKANFTGANFTGAILTEAIL 310
>gi|440793397|gb|ELR14582.1| K+ channel tetramerisation subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 381
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F DLR + RRANF D+ D +K NGA L + AN +GA
Sbjct: 229 KFNGCDLRGFDFHAMHLRRANFHRCDLTGVDLRHAKLNGACLVECCLRDANLSGA----- 283
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
VL+ +LT+A R LT +DL GA++ GAD S+A +D A
Sbjct: 284 -----VLSGVDLTDADCRRADLTNADLRGAVLSGADLSEAKLDRA 323
>gi|312195986|ref|YP_004016047.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
gi|311227322|gb|ADP80177.1| pentapeptide repeat protein [Frankia sp. EuI1c]
Length = 377
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 10/105 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA+ ADL ++ +K + +D +G++ + A L+ A AN TGA L
Sbjct: 237 AARLTGADLTGSILIKTK----------LTATDLAGARLSQANLDGADLANANLTGARLD 286
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
D ++ + L+E L +AVL R L R+DL GA + GAD + A +D
Sbjct: 287 DAILTGVHLSEGRLVDAVLTRANLHRADLVGADLTGADLTGARLD 331
>gi|411117892|ref|ZP_11390273.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711616|gb|EKQ69122.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 577
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 59/128 (46%), Gaps = 25/128 (19%)
Query: 111 AQFGSADLRKAVHVKENF----------RRANFTSADMRE---------------SDFSG 145
AQ A+LR+A V N R+AN T AD+ +D S
Sbjct: 165 AQLDEANLREATLVGTNLNEASLIGAYLRQANLTEADLHRVVLSSADLSEAILANADLSR 224
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
+ GAYL KA +KA+ ADL D + R L+EANL A L R L+ + L I+
Sbjct: 225 ANLAGAYLLKASFHKAHLLRADLQDVYLLRADLSEANLRGANLQRADLSGAYLNHTILSE 284
Query: 206 ADFSDAVI 213
AD S+A +
Sbjct: 285 ADLSEAYL 292
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 20/116 (17%)
Query: 111 AQFGSADLRKAVHVKENFRRANFT---------------SADMRESDFSGSKFNGAYLEK 155
A+ SA L+ A ++ N RRAN A++RE+ G+ N A L
Sbjct: 130 AKLNSAQLKGAELMEANLRRANLAGANLDQANLREAQLDEANLREATLVGTNLNEASLIG 189
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A +AN T ADL R+VL+ A+L+ A+L L+R++L GA + A F A
Sbjct: 190 AYLRQANLTEADLH-----RVVLSSADLSEAILANADLSRANLAGAYLLKASFHKA 240
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
D SD SG+ +G L A +AN T A+LS +++ +L ANL A L L+ +
Sbjct: 16 DFSHSDLSGANLSGFNLRGANFTEANLTEANLSWAFLNQAILTGANLRRADLRNASLSGA 75
Query: 197 DLGGAIIEGADFSDAVIDLAQKQ 219
DL AI+ GA+ S + LAQ Q
Sbjct: 76 DLNHAILHGANLSKIDLRLAQLQ 98
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +DL A N R ANFT A++ E++ S + N A L A +A+ A LS
Sbjct: 17 FSHSDLSGANLSGFNLRGANFTEANLTEANLSWAFLNQAILTGANLRRADLRNASLSGAD 76
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
++ +L+ ANL+ L L +++L A ++ AD A + A+
Sbjct: 77 LNHAILHGANLSKIDLRLAQLQQANLNWATLQDADMGGANLAFAK 121
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 50/121 (41%), Gaps = 15/121 (12%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADM---------------RESDFSGSKFNG 150
I A DLR A + N A ADM + + ++ G
Sbjct: 80 AILHGANLSKIDLRLAQLQQANLNWATLQDADMGGANLAFAKLDQVNLERAKLNSAQLKG 139
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A L +A +AN GA+L + L+EANL A LV T L + L GA + A+ ++
Sbjct: 140 AELMEANLRRANLAGANLDQANLREAQLDEANLREATLVGTNLNEASLIGAYLRQANLTE 199
Query: 211 A 211
A
Sbjct: 200 A 200
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 57/128 (44%), Gaps = 20/128 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 163
+ A A L +A+ N RRA+ +A + +D + + +GA L K A +AN
Sbjct: 43 TEANLSWAFLNQAILTGANLRRADLRNASLSGADLNHAILHGANLSKIDLRLAQLQQANL 102
Query: 164 TGADLSDTLM---------------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A L D M +R LN A L A L+ L R++L GA ++ A+
Sbjct: 103 NWATLQDADMGGANLAFAKLDQVNLERAKLNSAQLKGAELMEANLRRANLAGANLDQANL 162
Query: 209 SDAVIDLA 216
+A +D A
Sbjct: 163 REAQLDEA 170
>gi|300864770|ref|ZP_07109621.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300337239|emb|CBN54769.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 334
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A DLR ++ N + N T AD+RE+D S + N A L+ A AN GA L
Sbjct: 230 ADLHDTDLRGGNLIQANLMKTNLTEADLREADLSHTNLNLANLKGADLSGANLQGAYLWA 289
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T +D L A+L A L +++ +DL AI+ GA D I
Sbjct: 290 TNLDGACLKGADLRGASLRNAIISGADLRDAILTGATMPDGKI 332
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 5/110 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S AQ A+L V R +AN AD+ ++D G A L K +A+
Sbjct: 198 SGAQLSGANLSGTVLSGARMRFTKLEQANLKQADLHDTDLRGGNLIQANLMKTNLTEADL 257
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
ADLS T ++ L A+L+ A L L ++L GA ++GAD A +
Sbjct: 258 READLSHTNLNLANLKGADLSGANLQGAYLWATNLDGACLKGADLRGASL 307
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
EA G F G+ F A L+ + + + +A+ A + + D +G++ +GA L
Sbjct: 58 LEANLNGAFLYGANLSF--AKLKGSHLLGADLTKADLRGAQLAKVDLTGAQLSGAILSWV 115
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
++AN G +L + + L ANL A L LT + L GA ++GA
Sbjct: 116 SLFQANLPGVNLCGANLSGINLRSANLAGANLNWANLTGARLSGANLKGA 165
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 45/101 (44%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +LR A N AN T A + ++ G+ NG L KA N G D S
Sbjct: 130 ANLSGINLRSANLAGANLNWANLTGARLSGANLKGALLNGVKLNKAFLNGLNLAGIDFSG 189
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ + L+ A L+ A L TVL+ + + +E A+ A
Sbjct: 190 LELEDVKLSGAQLSGANLSGTVLSGARMRFTKLEQANLKQA 230
>gi|443324431|ref|ZP_21053184.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442795950|gb|ELS05284.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 239
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 60/105 (57%), Gaps = 10/105 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGA 166
QF +L++A +K N + T+AD+R++ S F A L A + + +FT A
Sbjct: 16 QFSRINLQEAELIKVNLSNVDLTAADLRQARLGRSNFGHACLRSADLSESILWGTDFTQA 75
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
DLS + V+ EA+L+ A+L + L +++L +I+EGA+FS A
Sbjct: 76 DLS-----QAVMREADLSGAILTQANLEKANLIKSILEGANFSGA 115
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
R FG A SADL +++ +F +A+ + A MRE+D SG+ A LEKA K+
Sbjct: 49 RSNFG---HACLRSADLSESILWGTDFTQADLSQAVMREADLSGAILTQANLEKANLIKS 105
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GA+ S + ++ E +L A RT L+++DL A + A+ S A++
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALL 157
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 57/123 (46%), Gaps = 18/123 (14%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I A F A LR A+ ++ + R A+ ++ ++D S + + A L A+ Y+A GA
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALLYQAKLDGA 165
Query: 167 ------------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
DL++ + L+ ANLT A+L R LT +DL G I+ D
Sbjct: 166 RLSRANLSAGRGENALATDLTEASLRDADLSYANLTGAILHRADLTGADLTGTILTNTDL 225
Query: 209 SDA 211
+A
Sbjct: 226 REA 228
Score = 37.4 bits (85), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 59/128 (46%), Gaps = 6/128 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY-----LEKAVAYKANF 163
S A ADL A+ + N +AN + + ++FSG+K A L A Y+ N
Sbjct: 78 SQAVMREADLSGAILTQANLEKANLIKSILEGANFSGAKLRHALMIEVDLRPASDYRTNL 137
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ ADLS + L+ A L A L L+R++L E A +D + + + + A
Sbjct: 138 SQADLSYADLSYANLSMALLYQAKLDGARLSRANLSAGRGENALATD-LTEASLRDADLS 196
Query: 224 YANGTNPI 231
YAN T I
Sbjct: 197 YANLTGAI 204
>gi|443310213|ref|ZP_21039874.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442779757|gb|ELR89989.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 253
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 54/101 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SA+L +A ++ N AN T A + +D S + A L A+ YKA A+L+D
Sbjct: 139 ANLKSANLSEAKLIRANLNEANLTEAHLNYADLSHANLGSASLVGAILYKAELRQANLND 198
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L +ANL+ A L+ L ++L GA + GA+ + A
Sbjct: 199 AYLHKAYLFDANLSQARLINADLRWANLRGANLRGANLTGA 239
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 20/116 (17%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES---------------DFSGSKFNGAYLEK 155
A F A+L ++ +K N AN + A+++++ D G+ + A LE
Sbjct: 69 ANFTLANLSHSLLMKANLSNANLSIANLQDANLKGAFLGAANLIGADLQGANLSNADLEN 128
Query: 156 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
A AN A+LS+ + R LNEANLT A L L+ ++LG A + GA
Sbjct: 129 VNLIGANLQNANLKSANLSEAKLIRANLNEANLTEAHLNYADLSHANLGSASLVGA 184
>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 447
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +DLR A + + + N T AD+RE+D + + GA L A +A+ TGA
Sbjct: 330 ANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS--- 386
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LN+ NL A L LTR+DL GA + GAD +A
Sbjct: 387 -------LNQVNLAEADLRGVDLTRADLRGANLSGADLREA 420
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 62/123 (50%), Gaps = 13/123 (10%)
Query: 91 LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
LAD N ++ RG IG++ ADLR+A + + R AN AD+RE+D +G+
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS 386
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
N + N ADL + R L ANL+ A L LT+++L A ++GA+
Sbjct: 387 LN----------QVNLAEADLRGVDLTRADLRGANLSGADLREADLTKANLHWANLDGAN 436
Query: 208 FSD 210
+D
Sbjct: 437 LTD 439
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 47/85 (55%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R A +SA++ ++D +G+ + A L KA AN G+DL + LN+ NLT A
Sbjct: 296 DLRGAMLSSANLSQADMTGTDLSRANLRKAYLADANMKGSDLRGADLIGASLNKVNLTQA 355
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L LTR+DL GA + AD +A
Sbjct: 356 DLREADLTRADLRGANLRLADLREA 380
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 25/102 (24%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKA-----------VAYK---------ANFTGADLSDT 171
NF + D+ D G+ G+YL +A + Y AN +GADLSD
Sbjct: 29 NFMTPDLSNKDLIGASLRGSYLREAKLSGANLSEAILCYADLIGADLKGANLSGADLSDA 88
Query: 172 LMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADF 208
++ L+E+NLT A +LV T L+ +DL GA ++GA+
Sbjct: 89 NLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANL 130
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 59/127 (46%), Gaps = 5/127 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L +A+ + A+ A++ +D S + N A L ++ ANF G+ L
Sbjct: 53 AKLSGANLSEAILCYADLIGADLKGANLSGADLSDANLNLANLSESNLTGANFKGSLLVG 112
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-----VIDLAQKQALCKYA 225
T + L ANL A L+ L ++L GA + G D S+A ++ A ++
Sbjct: 113 TDLSEADLRGANLKGANLIGAKLAEANLSGANLSGTDLSEADLRGTILQKAVYDLRTRFC 172
Query: 226 NGTNPIT 232
G +P T
Sbjct: 173 EGLDPQT 179
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 74/179 (41%), Gaps = 39/179 (21%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRAN 132
L+ A ++ + N++ L++ N A +G +G S A A+L+ A + AN
Sbjct: 80 LSGADLSDANLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANLIGAKLAEAN 139
Query: 133 FTSADMRESDFSGSKFNGAYLEKAV-----------------AY---------KANFTGA 166
+ A++ +D S + G L+KAV AY AN +G
Sbjct: 140 LSGANLSGTDLSEADLRGTILQKAVYDLRTRFCEGLDPQTSGAYLIGADVALPAANLSGV 199
Query: 167 DLSDTLMDRMVLNEANLTNAVLV----------RTVLTRSDLGGAIIEGADFSDAVIDL 215
DL+ + R L ANL A L+ R L+ ++L G +G + AV DL
Sbjct: 200 DLTGFNLKRADLRGANLRYAKLIGANLEGANLFRANLSGANLTGVNFKGTNLQKAVYDL 258
>gi|316934318|ref|YP_004109300.1| pentapeptide repeat-containing protein [Rhodopseudomonas palustris
DX-1]
gi|315602032|gb|ADU44567.1| pentapeptide repeat protein [Rhodopseudomonas palustris DX-1]
Length = 273
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N RA+ + A++ +D SG+ +GA L +A + AN +GADL
Sbjct: 57 SGANLSGADLSGANLSGANLYRADLSGANLSGADLSGANLSGANLYRAKLFSANLSGADL 116
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S L+ ANL A L L R+DL GA + GAD S A
Sbjct: 117 SGA-----NLSGANLYRADLSGANLYRADLSGANLSGADLSGA 154
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 53/103 (51%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N RA A++ ++ SG+ +GA L A Y+A+ +GA+L
Sbjct: 27 SGANLSGADLSGANLSGANLYRAKLFGANLSGANLSGADLSGANLSGANLYRADLSGANL 86
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S L+ ANL+ A L R L ++L GA + GA+ S A
Sbjct: 87 SGA-----DLSGANLSGANLYRAKLFSANLSGADLSGANLSGA 124
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + AD+ ++ SG+ +GA L A Y+A GA+LS + L+ ANL+ A L R
Sbjct: 20 NLSGADLSGANLSGADLSGANLSGANLYRAKLFGANLSGANLSGADLSGANLSGANLYRA 79
Query: 192 VLTRSDLGGAIIEGADFSDA 211
L+ ++L GA + GA+ S A
Sbjct: 80 DLSGANLSGADLSGANLSGA 99
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 5/95 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 163
S A ADL A N RA SA++ +D SG+ +GA L +A Y+A+
Sbjct: 82 SGANLSGADLSGANLSGANLYRAKLFSANLSGADLSGANLSGANLYRADLSGANLYRADL 141
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+GA+LS + L+ ANL+ A V L R+ +
Sbjct: 142 SGANLSGADLSGANLHRANLSGAKGVDLSLARTRI 176
>gi|220907082|ref|YP_002482393.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863693|gb|ACL44032.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 309
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+L ++ N R A T AD+RE+ K N A L ++ +AN TGADL
Sbjct: 185 ADFQGANLSRSTLTGANLRGAYLTGADLREA-----KLNEANLRRSDLSQANLTGADLRG 239
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++R L ANL ++L+ L ++L A ++GA+ +AV+
Sbjct: 240 ANLNRATLRGANLRESILIGASLMGANLSQASLQGANLLEAVL 282
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 54/106 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A++ A+ + N A+ T A++ +++ G+ GAYL A TGA+L
Sbjct: 113 SEANLTGAEISAAILREANLTLADLTLAELSQTNLRGANLTGAYLRGAELLGTQLTGAEL 172
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
S L EA+ A L R+ LT ++L GA + GAD +A ++
Sbjct: 173 SQANFRGTNLTEADFQGANLSRSTLTGANLRGAYLTGADLREAKLN 218
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 15/103 (14%)
Query: 116 ADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADLR+A + N RR AN T AD+R ++ + + GA L +++ A+ GA+LS
Sbjct: 210 ADLREAKLNEANLRRSDLSQANLTGADLRGANLNRATLRGANLRESILIGASLMGANLS- 268
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+A+L A L+ VLT ++L G + G D S V+
Sbjct: 269 ---------QASLQGANLLEAVLTGANLTGVDLTGVDLSATVM 302
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 48/93 (51%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLRK K N A+ T A++ +D S + GA + A+ +AN T ADL+ + +
Sbjct: 85 ADLRKVNLRKANLTGADLTGANLTGADLSEANLTGAEISAAILREANLTLADLTLAELSQ 144
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L ANLT A L L + L GA + A+F
Sbjct: 145 TNLRGANLTGAYLRGAELLGTQLTGAELSQANF 177
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 59/115 (51%), Gaps = 1/115 (0%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A L +A + + + + A++R + SG+ GA L A AN + ADL
Sbjct: 17 FTGASLYQANLNRVHLSQVDLQGANLRGAGLSGANLQGADLRGATLAAANLSNADLRGAD 76
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
+ ++L EA+L L + LT +DL GA + GAD S+A + A+ A+ + AN
Sbjct: 77 LRGVLLMEADLRKVNLRKANLTGADLTGANLTGADLSEANLTGAEISAAILREAN 131
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 52/109 (47%), Gaps = 15/109 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLR A AN ++AD+R +D G A L K KAN TGADL
Sbjct: 48 SGANLQGADLRGAT-----LAAANLSNADLRGADLRGVLLMEADLRKVNLRKANLTGADL 102
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ ANLT A L LT +++ AI+ A+ + A + LA+
Sbjct: 103 TG----------ANLTGADLSEANLTGAEISAAILREANLTLADLTLAE 141
>gi|427731151|ref|YP_007077388.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427367070|gb|AFY49791.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 572
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 53/104 (50%), Gaps = 5/104 (4%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A ADL A+ N N T A + +D S +K NGA L A A F G
Sbjct: 391 ADLSGADLSHAILNGTNLSDTILFSTNLTDASLMAADLSYAKLNGAKLIDAKLNGAMFLG 450
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
ADLS + R+VLN+A+L+ ++L L+ +DL AI+ G D S
Sbjct: 451 ADLSGVDLSRVVLNDADLSGSILSEADLSSADLSDAILLGTDLS 494
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 55/103 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F A+L A N AN + AD+ +D S + GA L A Y+ +F+ ADL
Sbjct: 274 TGANFQDANLAGANLGDANLSGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADL 333
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S ++ + A+L+ A L T L R++L AI+ GA+ SDA
Sbjct: 334 SSCHLNDAEMGHADLSGANLRDTQLCRTNLTNAILFGANLSDA 376
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N AN T A + +DFS + + +L A A+ +GA+L
Sbjct: 294 SGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGANL 353
Query: 169 SDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
DT + R +L ANL++A L L+ +DL A + GAD S A+++
Sbjct: 354 RDTQLCRTNLTNAILFGANLSDANLKHINLSHADLCRADLSGADLSHAILN 404
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 65/136 (47%), Gaps = 10/136 (7%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAV 157
G+F G A F A L A NF+ AN + A + +++ +G+ F GA L A
Sbjct: 235 GQFLKG--ANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDAN 292
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA- 216
AN +GADLS + L ANLT A L RT +R+DL + A+ A + A
Sbjct: 293 LSGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGAN 352
Query: 217 -QKQALCKYANGTNPI 231
+ LC+ N TN I
Sbjct: 353 LRDTQLCR-TNLTNAI 367
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
V + + ANF A + +++ +G+ F GA L A AN TGA+ D + L +ANL
Sbjct: 234 VGQFLKGANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDANL 293
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ A L L+ +DL A + GA+ + A +
Sbjct: 294 SGANLSGADLSSADLSSANLTGANLTGATL 323
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA A L A + A F AD+ D S N A L ++ +A+ + ADLS
Sbjct: 425 AADLSYAKLNGAKLIDAKLNGAMFLGADLSGVDLSRVVLNDADLSGSILSEADLSSADLS 484
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
D ++ L+ ANL +A L+ S+L GA++ GAD S+A
Sbjct: 485 DAILLGTDLSFANLNSA-----NLSGSNLSGAMLNGADLSEA 521
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S A SADL A+ + + AN SA++ S+ SG+ NGA L +A A
Sbjct: 472 ILSEADLSSADLSDAILLGTDLSFANLNSANLSGSNLSGAMLNGADLSEANLSDAILEDT 531
Query: 167 DLSDTLMDRMVLNE 180
DLS+ +++M E
Sbjct: 532 DLSEANLEQMTWGE 545
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 7/99 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F ++ L + ++ + NF+ ++ G+ F GAYL A ANF GA+LS
Sbjct: 210 FFTSQLLRVIYYSDAIEIGNFS--NIVGQFLKGANFRGAYLGDANLTGANFQGANLSGA- 266
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L +ANLT A L ++LG A + GA+ S A
Sbjct: 267 ----YLGDANLTGANFQDANLAGANLGDANLSGANLSGA 301
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ G ADL A R N T+A + ++ S + L A +A+ +GADLS
Sbjct: 341 AEMGHADLSGANLRDTQLCRTNLTNAILFGANLSDANLKHINLSHADLCRADLSGADLS- 399
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVID 214
+LN NL++ +L T LT +DL A + GA DA ++
Sbjct: 400 ----HAILNGTNLSDTILFSTNLTDASLMAADLSYAKLNGAKLIDAKLN 444
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 50/120 (41%), Gaps = 16/120 (13%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
+ +A FG A+L A N A+ AD+ +D S + NG L + + N T A
Sbjct: 363 LTNAILFG-ANLSDANLKHINLSHADLCRADLSGADLSHAILNGTNLSDTILFSTNLTDA 421
Query: 167 DLSDTLMDRMVLNEANLTNAV---------------LVRTVLTRSDLGGAIIEGADFSDA 211
L + LN A L +A L R VL +DL G+I+ AD S A
Sbjct: 422 SLMAADLSYAKLNGAKLIDAKLNGAMFLGADLSGVDLSRVVLNDADLSGSILSEADLSSA 481
>gi|309792396|ref|ZP_07686863.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
DG-6]
gi|308225551|gb|EFO79312.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
DG6]
Length = 314
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 60/125 (48%), Gaps = 9/125 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLRK N AN T A++R ++ S + F+GA L A N +G DL D
Sbjct: 89 ADLSDADLRKGDLAWANLEFANLTGANLRGANLSAADFSGANLYGANLSLCNLSGVDLRD 148
Query: 171 TLMDRMVLNEANLTNAVLVRTV--------LTRSDLGGAIIEGADFSDA-VIDLAQKQAL 221
T+M L EA L A LV L + LGGA ++G + S A ++ ++A
Sbjct: 149 TIMIGANLTEAQLREAQLVNLSGANLSGANLNKVSLGGASMQGVNLSGASLLSANLREAT 208
Query: 222 CKYAN 226
+ AN
Sbjct: 209 LREAN 213
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTG 165
A A LR+A + N AN + AD+ +D S + +G YL E A+ AN +
Sbjct: 202 ANLREATLREANLIGANLYEANLSEADLSAADLSMANLSGIYLSGANLEGAILTHANLSR 261
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
A+LS + LN NL A L LT +DL GA + D S +
Sbjct: 262 ANLSGCNLRGAQLNGCNLREASLADADLTGADLTGADLSECDLSGVI 308
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A LR+A V N AN + A++ + G+ G L A AN A L +
Sbjct: 154 ANLTEAQLREAQLV--NLSGANLSGANLNKVSLGGASMQGVNLSGASLLSANLREATLRE 211
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L EANL+ A L L+ ++L G + GA+ A++
Sbjct: 212 ANLIGANLYEANLSEADLSAADLSMANLSGIYLSGANLEGAIL 254
Score = 37.0 bits (84), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 10/106 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A L A N A+ SA++RE+ + GA L Y+AN + ADL
Sbjct: 175 SGANLNKVSLGGASMQGVNLSGASLLSANLREATLREANLIGANL-----YEANLSEADL 229
Query: 169 S--DTLMDRM---VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
S D M + L+ ANL A+L L+R++L G + GA +
Sbjct: 230 SAADLSMANLSGIYLSGANLEGAILTHANLSRANLSGCNLRGAQLN 275
>gi|428313200|ref|YP_007124177.1| pentapeptide repeat protein,protein kinase family protein
[Microcoleus sp. PCC 7113]
gi|428254812|gb|AFZ20771.1| pentapeptide repeat protein,protein kinase family protein
[Microcoleus sp. PCC 7113]
Length = 464
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ NF N + +++E++ SG F A L K NF GADLSD + LN+ANL
Sbjct: 321 ERNFAFRNISGLNLQEANLSGGLFYSAKLAKT-----NFQGADLSDAYFGQANLNQANLR 375
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
NA L T + +DL GA ++GAD A +
Sbjct: 376 NANLGGTSFSNADLSGADLQGADLRFAYL 404
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 51/109 (46%), Gaps = 25/109 (22%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A FG A+L +A N R AN +D SG+ GA L A KAN GA+L
Sbjct: 360 SDAYFGQANLNQA-----NLRNANLGGTSFSNADLSGADLQGADLRFAYLSKANLKGANL 414
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
EANL+NA ++ GA + GA+ S+A+I AQ
Sbjct: 415 C----------EANLSNA----------NIKGANLCGANLSNAIITEAQ 443
>gi|86605499|ref|YP_474262.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554041|gb|ABC98999.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 330
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 58/118 (49%), Gaps = 3/118 (2%)
Query: 99 AETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
A+ RG +G Q G A+L++A+ + N AN + AD+ +D S + A L +
Sbjct: 207 ADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAACLRSAKLAR 266
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+AN GADL + L NL NA L +LTR+DL A + GA+ A +
Sbjct: 267 TDLSRANLAGADLRSASLVDAYLGRTNLENADLREAILTRADLSTANLAGANLRGATL 324
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 54/121 (44%), Gaps = 20/121 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G A L+KA V N AN + AD+ E+D + +G L+ A + AN A L D
Sbjct: 52 AYLGRAKLQKANLVGANLSGANLSQADLSEADLRDAHLHGTTLQGADLHGANLALALLID 111
Query: 171 TLMDRMVLNEANLTNA--------------------VLVRTVLTRSDLGGAIIEGADFSD 210
+ L ANLT+A VL L+R+DL GA + GAD +
Sbjct: 112 ANLLEADLRWANLTSANLGGACLRGANLRFESRRAAVLRSANLSRADLSGANLAGADLTR 171
Query: 211 A 211
A
Sbjct: 172 A 172
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 55/107 (51%), Gaps = 4/107 (3%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ RA+ D+ ++D G AYL +A KAN GA+LS + + L+EA+L +A
Sbjct: 28 DLSRADLIGIDLSQADLHGINLIFAYLGRAKLQKANLVGANLSGANLSQADLSEADLRDA 87
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 233
L T L +DL GA + A DA + +A ++AN T+ G
Sbjct: 88 HLHGTTLQGADLHGANLALALLIDANL----LEADLRWANLTSANLG 130
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 44/98 (44%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL +A N + A+ A ++ ++ + GA L A A+F G DL
Sbjct: 160 SGANLAGADLTRADLRGANLKEASLIGAHLQGANLQRACLRGALLSNADLRGASFLGGDL 219
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
M R L EA L+ L L+ +DL GA + A
Sbjct: 220 QGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAA 257
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 7/135 (5%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANF 133
T L A + + ++ L D N EA+ R + ++A G A LR A E+ R A
Sbjct: 92 TTLQGADLHGANLALALLIDANLLEADLR--WANLTSANLGGACLRGANLRFESRRAAVL 149
Query: 134 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 193
SA++ +D SG+ GA L +A+ GA+L + + L ANL A L +L
Sbjct: 150 RSANLSRADLSGANLAGADL-----TRADLRGANLKEASLIGAHLQGANLQRACLRGALL 204
Query: 194 TRSDLGGAIIEGADF 208
+ +DL GA G D
Sbjct: 205 SNADLRGASFLGGDL 219
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 10/108 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTG 165
A A+L++A R A ++AD+R + F G G A L++A+ + N
Sbjct: 187 AHLQGANLQRAC-----LRGALLSNADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAE 241
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+LS+ + L+ A L +A L RT L+R++L GA + A DA +
Sbjct: 242 ANLSEADLAGADLSAACLRSAKLARTDLSRANLAGADLRSASLVDAYL 289
>gi|75911045|ref|YP_325341.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704770|gb|ABA24446.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 973
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 47/96 (48%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADL A + R A AD+ +D SG+ NGAYL A A + ADLS +
Sbjct: 841 SADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRADLR 900
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L ANL +A L+ L +DL GA + A+ D
Sbjct: 901 SADLRSANLISADLISADLISADLNGADLSHANLGD 936
Score = 48.9 bits (115), Expect = 0.003, Method: Composition-based stats.
Identities = 36/108 (33%), Positives = 49/108 (45%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 163
S A A L+ + A AD+R++ D SG+ +GAYL A A
Sbjct: 825 SGADLSGAFLKGVFLRSADLSGAYLRGADLRDAYLNGADLSGADLSGAYLNGAYLNGAYL 884
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA LS + R L A+L +A L+ L +DL A + GAD S A
Sbjct: 885 NGAYLSHADLSRADLRSADLRSANLISADLISADLISADLNGADLSHA 932
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 36/116 (31%), Positives = 52/116 (44%), Gaps = 12/116 (10%)
Query: 124 VKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 173
V + R A+ + AD+ R +D SG+ GA L A A+ +GADLS +
Sbjct: 815 VGKFLRGADLSGADLSGAFLKGVFLRSADLSGAYLRGADLRDAYLNGADLSGADLSGAYL 874
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+ LN A L A L L+R+DL A + A+ A DL + NG +
Sbjct: 875 NGAYLNGAYLNGAYLSHADLSRADLRSADLRSANLISA--DLISADLISADLNGAD 928
Score = 43.5 bits (101), Expect = 0.12, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 44/95 (46%), Gaps = 4/95 (4%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL + + F R AD+ +D SG+ G +L A A GADL D ++
Sbjct: 807 DLGNFIRIVGKFLRG----ADLSGADLSGAFLKGVFLRSADLSGAYLRGADLRDAYLNGA 862
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+ A+L+ A L L + L GA + AD S A
Sbjct: 863 DLSGADLSGAYLNGAYLNGAYLNGAYLSHADLSRA 897
>gi|123968240|ref|YP_001009098.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
gi|123198350|gb|ABM69991.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
Length = 157
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A +G L A + + A F D+++++ S + A L A N + ++L
Sbjct: 21 AALDYGKQSLIGADFSGSDLKGATFYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNL 80
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ +D +L+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 81 REVTLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIRKFCESATGT 140
Query: 229 NPITGVSTRKSLGC 242
NPIT TR++L C
Sbjct: 141 NPITNRETRETLEC 154
>gi|298245086|ref|ZP_06968892.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297552567|gb|EFH86432.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 394
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 62/121 (51%), Gaps = 12/121 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L +N Y+++ R A DLR+A + RAN A++RE+ +
Sbjct: 247 LYKINLYKSDLR-------EANLSKTDLREA-----DISRANLYKANLRETFLLKANLYE 294
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A L +A +AN + A+LS T + R L +ANL+ A L+ L+R DL GA + ADFS
Sbjct: 295 ADLHRANLSEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADLSKADFSG 354
Query: 211 A 211
A
Sbjct: 355 A 355
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 17/117 (14%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N Y+A+ RG AD KA N R AN A++RE+D S A+L
Sbjct: 206 NLYKADLRG------------ADFSKATLCGANLREANLCEANLREADIS-----RAFLY 248
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
K YK++ A+LS T + ++ ANL A L T L +++L A + A+ S+A
Sbjct: 249 KINLYKSDLREANLSKTDLREADISRANLYKANLRETFLLKANLYEADLHRANLSEA 305
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 49/94 (52%), Gaps = 4/94 (4%)
Query: 95 NKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
N YEA+ R S A A+L K + N +AN + AD+ ++ S +GA L
Sbjct: 291 NLYEADLHRANL---SEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADL 347
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
KA AN +GA+LS ++ +LN+AN+ A+
Sbjct: 348 SKADFSGANLSGANLSGATLNEAILNKANIQQAL 381
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A S DL+ +F AN AD+R +DFS + GA L +A +AN AD+
Sbjct: 183 SQADMKSMDLKGVKAHNIDFSGANLYKADLRGADFSKATLCGANLREANLCEANLREADI 242
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + ++ L +++L A L +T L +D+ A + A+ + +
Sbjct: 243 SRAFLYKINLYKSDLREANLSKTDLREADISRANLYKANLRETFL 287
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 12/113 (10%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N Y+A R F + A ADL +A N AN + A++ ++D S + A L
Sbjct: 276 NLYKANLRETFLLK--ANLYEADLHRA-----NLSEANLSEANLSKTDLSRTNLTKANLS 328
Query: 155 KAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
KA AN + GADLS L+ ANL+ A L +L ++++ A+
Sbjct: 329 KADLISANLSRGDLSGADLSKADFSGANLSGANLSGATLNEAILNKANIQQAL 381
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 52/138 (37%), Gaps = 22/138 (15%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS------------ADMRESDFSG 145
+AE R + + G D + HV R A S ADM+ D G
Sbjct: 135 DAEVRKVARVRTLTVLGQLDAPRINHVFSFLREAQLVSSKPGESIVSLSQADMKSMDLKG 194
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL----------TNAVLVRTVLTR 195
K + A YKA+ GAD S + L EANL + A L + L +
Sbjct: 195 VKAHNIDFSGANLYKADLRGADFSKATLCGANLREANLCEANLREADISRAFLYKINLYK 254
Query: 196 SDLGGAIIEGADFSDAVI 213
SDL A + D +A I
Sbjct: 255 SDLREANLSKTDLREADI 272
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR+ +K N A+ A++ E++ S + + L + KAN + ADL
Sbjct: 273 SRANLYKANLRETFLLKANLYEADLHRANLSEANLSEANLSKTDLSRTNLTKANLSKADL 332
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+ R +L+ A L + + ++L GA + GA ++A+++ A Q
Sbjct: 333 ISANLSR-----GDLSGADLSKADFSGANLSGANLSGATLNEAILNKANIQ 378
>gi|448412419|ref|ZP_21576534.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
gi|445668180|gb|ELZ20811.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
Length = 561
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 54/120 (45%), Gaps = 15/120 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 166
+A S D A +FRRA +A++R++D G+ F GA L A A+ TGA
Sbjct: 251 TAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTGANF 310
Query: 167 -------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
DLS+ + L A+L +A L R L SDL A + AD SD +
Sbjct: 311 RDADLTDAHLRGADLSEADLKDATLCGADLKDATLTRASLWNSDLTEAYLRNADLSDGYL 370
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 70/151 (46%), Gaps = 14/151 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADLR A + A+ T A+ R++D + + GA L +A A GADL D
Sbjct: 288 ADFQGADLRNA-----SLTNADLTGANFRDADLTDAHLRGADLSEADLKDATLCGADLKD 342
Query: 171 TLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK-Y 224
+ R L EA L NA L L R DL A + AD + DL + +L + +
Sbjct: 343 ATLTRASLWNSDLTEAYLRNADLSDGYLRRVDLTDADLPAADLTG---DLNARCSLGRTF 399
Query: 225 ANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 255
+ I+ + R+SL C ++ G P++
Sbjct: 400 SMPRCAISDHTGRRSLTCRSTSARPSGRPTT 430
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
+RE+D SG+ G+ L+ A+ A+ DL+ M VL EA+LT+ L + ++
Sbjct: 145 LREADLSGANLAGSTLKGAILTDASLREVDLTGADMMGAVLVEADLTSGTLAQLSGDKAV 204
Query: 198 LGGAIIEGADFSDAVI-DLAQKQALCKYAN 226
+ GAI++ A+ A + DL +A+ K A
Sbjct: 205 MRGAILKDANLERAHLWDLTAPEAVFKRAT 234
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 59/148 (39%), Gaps = 17/148 (11%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
+ AV+ LA L+ +A RG I A A L + F+RA
Sbjct: 180 MMGAVLVEADLTSGTLAQLSGDKAVMRG--AILKDANLERAHLWDLTAPEAVFKRATLCE 237
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV------ 189
A MR++ G+ F LE +F GA L+D R L A L +A LV
Sbjct: 238 ATMRDAVLPGASFTAGTLESV-----DFGGATLTDASFRRAGLQNAELRDADLVGADFQG 292
Query: 190 ----RTVLTRSDLGGAIIEGADFSDAVI 213
LT +DL GA AD +DA +
Sbjct: 293 ADLRNASLTNADLTGANFRDADLTDAHL 320
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 47/114 (41%), Gaps = 10/114 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VAYK 160
A ADL A + A T A +RE D +G+ GA L +A K
Sbjct: 143 AVLREADLSGANLAGSTLKGAILTDASLREVDLTGADMMGAVLVEADLTSGTLAQLSGDK 202
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A GA L D ++R L + AV R L + + A++ GA F+ ++
Sbjct: 203 AVMRGAILKDANLERAHLWDLTAPEAVFKRATLCEATMRDAVLPGASFTAGTLE 256
>gi|427715911|ref|YP_007063905.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348347|gb|AFY31071.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 589
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 5/110 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SA+L A + R N +SAD+ +D S + N A L A AN + ADL
Sbjct: 321 SHADLSSANLSGANLTNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSADL 380
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 213
S T + L++ANL+ L L R+DL G AI+ G + SD ++
Sbjct: 381 SHTHLFGANLSDANLSGVNLSHADLCRADLSGADMSKAILNGTNLSDTIL 430
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A AD+ KA+ N N + A + +D S +K NGA L A A F G
Sbjct: 408 ADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLNGAMFLG 467
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
ADLS + ++LN+A+L+ +L L+ +DL AI+ G D S A ++ A
Sbjct: 468 ADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRA 518
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 8/110 (7%)
Query: 103 GEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
GEF G A G A+L A NF AN + A + +++ +G F+GA L A
Sbjct: 252 GEFLRGGNFRGAYLGDANLTGA-----NFSGANLSGAYLGDANLTGVNFSGANLSGANLG 306
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
AN +GA+LS+ + L+ ANL+ A L T L R++L A + AD S
Sbjct: 307 DANLSGANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADLS 356
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 58/103 (56%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G A+L NF AN + A++ +++ SG+ + A L A AN +GA+L
Sbjct: 281 SGAYLGDANLTGV-----NFSGANLSGANLGDANLSGANLSNANLSHADLSSANLSGANL 335
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++T ++R L+ A+L++A L T L +DL A ++ A+ S A
Sbjct: 336 TNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSA 378
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 13/103 (12%)
Query: 125 KENFRRAN---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-------- 173
K N+ R N F AD+ D SG N A L + +A+ + ADLSD ++
Sbjct: 454 KLNYARLNGAMFLGADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYA 513
Query: 174 --DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+R L+ +NL+ A+L L+ ++L AI+ GAD SDA ++
Sbjct: 514 NLNRANLSGSNLSGALLNGADLSHTNLSCAILGGADVSDANLE 556
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
V E R NF A + +++ +G+ F+GA L A AN TG + S + L +ANL
Sbjct: 251 VGEFLRGGNFRGAYLGDANLTGANFSGANLSGAYLGDANLTGVNFSGANLSGANLGDANL 310
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ A L L+ +DL A + GA+ ++
Sbjct: 311 SGANLSNANLSHADLSSANLSGANLTN 337
Score = 44.3 bits (103), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 51/103 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ F A+L A N AN ++A++ +D S + +GA L + N + ADL
Sbjct: 291 TGVNFSGANLSGANLGDANLSGANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADL 350
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + LN A+L++A L L+ +DL + GA+ SDA
Sbjct: 351 SSADLSSTNLNSADLSSANLKDANLSSADLSHTHLFGANLSDA 393
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+A S +L A N + AN +SAD+ + G+ + A L A+ ADL
Sbjct: 351 SSADLSSTNLNSADLSSANLKDANLSSADLSHTHLFGANLSDANLSGVNLSHADLCRADL 410
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
S M + +LN NL++ +L T +L AI+ AD S A ++
Sbjct: 411 SGADMSKAILNGTNLSDTILFST-----NLSDAILIAADLSYAKLN 451
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 13/134 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
S A +L + N A +AD+ + +G+K N A L A+ A+ +G
Sbjct: 416 SKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDL 475
Query: 166 -------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DL 215
ADLS L+ L++A+L++A+L T L+ ++L A + G++ S A++ DL
Sbjct: 476 SGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRANLSGSNLSGALLNGADL 535
Query: 216 AQKQALCKYANGTN 229
+ C G +
Sbjct: 536 SHTNLSCAILGGAD 549
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 7/99 (7%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F ++ L + +H + NF+S + G F GAYL A N TGA+ S
Sbjct: 227 FFTSQLLRVIHYSDAIEIGNFSS--IVGEFLRGGNFRGAYLGDA-----NLTGANFSGAN 279
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L +ANLT L+ ++LG A + GA+ S+A
Sbjct: 280 LSGAYLGDANLTGVNFSGANLSGANLGDANLSGANLSNA 318
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 181
N A+ AD+ +D S + NG L + + N + ADLS ++ LN A
Sbjct: 399 NLSHADLCRADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYA 458
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L A+ + L+ DL G I+ AD S ++
Sbjct: 459 RLNGAMFLGADLSGVDLSGVILNDADLSGVLL 490
>gi|425458953|ref|ZP_18838439.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9808]
gi|389823440|emb|CCI28334.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9808]
Length = 425
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 4/120 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L A+ ++ N R A + AD+ E+D SG+ A L KA+ +A A LS+
Sbjct: 285 ANLIKAILSWAILIEANLRGAILSEADLSEADLSGANLRRANLIKAILRRAILIEAILSE 344
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+ L ANL A+L+ +L +DL GA + A+ S+A I+ A+ A G P
Sbjct: 345 ADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFIDATGITP 400
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 61/114 (53%), Gaps = 5/114 (4%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHV----KENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
+ EF A A+L KA+ K+ ++ + + AD+ E+D SG+ +GA L +A
Sbjct: 203 KAEFT-TDAKVIQKAELIKAIREGTINKKTLQQVDLSGADLSEADLSGAILSGANLSEAN 261
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN +GA+LS + L ANL A+L +L ++L GAI+ AD S+A
Sbjct: 262 LSGANLSGANLSWANLIDANLRRANLIKAILSWAILIEANLRGAILSEADLSEA 315
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR+A +K RRA A + E+D SG+ A L KA+ +A ADL
Sbjct: 313 SEADLSGANLRRANLIKAILRRAILIEAILSEADLSGANLRRANLIKAILIEAILIEADL 372
Query: 169 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 210
+ L+EA++ NA+ + T +T I GA F D
Sbjct: 373 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 415
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VA 158
S A ADL A+ N AN + A++ ++ S + A L +A +
Sbjct: 238 SGADLSEADLSGAILSGANLSEANLSGANLSGANLSWANLIDANLRRANLIKAILSWAIL 297
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+AN GA LS+ + L+ ANL A L++ +L R+ L AI+ AD S A
Sbjct: 298 IEANLRGAILSEADLSEADLSGANLRRANLIKAILRRAILIEAILSEADLSGA 350
>gi|378579963|ref|ZP_09828623.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
DC283]
gi|377817422|gb|EHU00518.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
DC283]
Length = 272
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
GS A ADLR A R A+ + AD+ +D SG+ GAYL A A+ +GAD
Sbjct: 24 GSRADLRGADLRGAY-----LRGADLSGADLSGADLSGADLRGAYLRDADLRGADLSGAD 78
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
LSD + L +A+L A L+ +DL GA + GAD S A +
Sbjct: 79 LSDADLRGAYLRDADLRGA-----DLSDADLSGAYLRGADLSGADL 119
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A + R A+ + AD+ R +D SG+ GAYL A +
Sbjct: 75 SGADLSDADLRGAYLRDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDA-----DL 129
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GADLSD + L +A+L A L L + L A + GAD SDA +
Sbjct: 130 RGADLSDADLSGAYLRDADLRGADLRGADLRGAYLRDADLRGADLSDADL 179
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 55/115 (47%), Gaps = 2/115 (1%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ RG + G A ADL A + R A AD+R +D SG+ + A L A
Sbjct: 32 ADLRGAYLRG--ADLSGADLSGADLSGADLRGAYLRDADLRGADLSGADLSDADLRGAYL 89
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+ GADLSD + L A+L+ A L L +DL GA + AD S A +
Sbjct: 90 RDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDADLRGADLSDADLSGAYL 144
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 41/90 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A LR A + R A AD+R +D S + +GAYL A A+ GADL
Sbjct: 100 SDADLSGAYLRGADLSGADLRGAYLRDADLRGADLSDADLSGAYLRDADLRGADLRGADL 159
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ L A+L++A L L DL
Sbjct: 160 RGAYLRDADLRGADLSDADLSGAYLRDGDL 189
>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
Length = 427
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 65/147 (44%), Gaps = 24/147 (16%)
Query: 94 LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
LN Y R + G + AQ DLR+A+ +FR A F A++ E+ +GS+ A
Sbjct: 23 LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82
Query: 152 YLEKAVAYKANFTGADL------SDTLMD----------------RMVLNEANLTNAVLV 189
L A K +F GADL S + D L+ A+L + V
Sbjct: 83 DLSGAKLVKTDFRGADLEQAKLTSSDITDADFRATTIGAPAGSDIATKLDGADLDHVKAV 142
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
RT LTR+ L GA GA F A +D A
Sbjct: 143 RTNLTRASLMGATARGAHFDGASLDRA 169
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A + ADL V+ N RA+ A R G+ F+GA L++A AN A
Sbjct: 128 ATKLDGADLDHVKAVRTNLTRASLMGATAR-----GAHFDGASLDRANFKGANLEHATFV 182
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVI 213
+ + L E N +A L T LT +DL GA + GAD +D VI
Sbjct: 183 SSSLRGANLQEVNFADATLSNTDLTGADLRSCHLDGADMSGADLTDCVI 231
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 7/99 (7%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTL 172
+RK H N+ ADMR +G++ NG L +A+ A+ F GA+LS+
Sbjct: 16 IRKHGHFLNNYPGGQ--RADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEAT 73
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L A+L+ A LV+T +DL A + +D +DA
Sbjct: 74 LAGSQLRVADLSGAKLVKTDFRGADLEQAKLTSSDITDA 112
>gi|157413067|ref|YP_001483933.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9215]
gi|157387642|gb|ABV50347.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
str. MIT 9215]
Length = 157
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A +G L A + + A F D+++++ S + A L A N + ++L
Sbjct: 21 AALDYGKQSLIGADFSGSDLKGATFYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNL 80
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ +D +L+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 81 REVTLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGT 140
Query: 229 NPITGVSTRKSLGC 242
NPIT TR++L C
Sbjct: 141 NPITNRDTRETLEC 154
>gi|428314172|ref|YP_007125149.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255784|gb|AFZ21743.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 276
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 10/114 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA A LR+A N + AN + D++ +D G+ GA L++A N +GADL
Sbjct: 104 SAATLKGAKLREA-----NLQGANLRAVDLKNADLCGANLQGADLKRADLINTNLSGADL 158
Query: 169 S-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
S D + +++ L EANL A L L+ +DL GA + A+ + A + AQ
Sbjct: 159 SGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQ 212
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 15/119 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGS-----KFNGAYLEKAVA 158
S A A+L + K N R AN A D+ E+D +G+ NGA L++A
Sbjct: 154 SGADLSGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQL 213
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+AN +G D M + L+ ANL A L L+++ L G + GA+ +A++D A+
Sbjct: 214 SQANLSGLD-----MTHLNLSGANLRQANLSEAQLSQAQLYGTDLRGANLDEAILDQAK 267
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+EN + + ++ +++ +K + A L+ A +AN GA+L + L ANL
Sbjct: 80 QENLVWMDLSGVNLSQANLQQAKLSAATLKGAKLREANLQGANLRAVDLKNADLCGANLQ 139
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A L R L ++L GA + GA+ +D + +
Sbjct: 140 GADLKRADLINTNLSGADLSGANLTDVIFE 169
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 57/127 (44%), Gaps = 20/127 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTG 165
A + DL+ A N + A+ AD+ ++ SG+ +GA L EK +AN G
Sbjct: 121 ANLRAVDLKNADLCGANLQGADLKRADLINTNLSGADLSGANLTDVIFEKVNLREANLRG 180
Query: 166 ---------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
ADLS+ ++ L EA L+ A L +T +L GA + A+ S+
Sbjct: 181 ANLQGLDLSEADLTGADLSEANLNGARLQEAQLSQANLSGLDMTHLNLSGANLRQANLSE 240
Query: 211 AVIDLAQ 217
A + AQ
Sbjct: 241 AQLSQAQ 247
>gi|334119992|ref|ZP_08494075.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333457174|gb|EGK85799.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 566
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR+A V+ N R A ++ ESD +G++ A L A ++A GADL++
Sbjct: 155 ANLTGANLREAHLVEANLRSAILIGVNLIESDLNGAQMRSANLTGADLHRAVLAGADLTE 214
Query: 171 TLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDA 211
++D L+ ANL + L+ + +L R++L G + AD S+A
Sbjct: 215 AVLDNADLSRANLAGSYLLKASFKKALLLRANLQGVYLLRADLSEA 260
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 30/136 (22%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVA 158
S A ADLR A A F AD+ + +F+G+K +GA L A
Sbjct: 83 SGANLAKADLRLACLAAAELNWAAFPEADLGGANLQGVKSDQINFAGAKLDGAKLMAAEL 142
Query: 159 YKAN-----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG------------- 200
+AN GA+L+ + L EANL +A+L+ L SDL G
Sbjct: 143 MEANLNRASLVGANLTGANLREAHLVEANLRSAILIGVNLIESDLNGAQMRSANLTGADL 202
Query: 201 --AIIEGADFSDAVID 214
A++ GAD ++AV+D
Sbjct: 203 HRAVLAGADLTEAVLD 218
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 51/113 (45%), Gaps = 10/113 (8%)
Query: 111 AQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
A A+LR A+ + N R AN T AD+ + +G+ A L+ A +
Sbjct: 165 AHLVEANLRSAILIGVNLIESDLNGAQMRSANLTGADLHRAVLAGADLTEAVLDNADLSR 224
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AN G+ L + +L ANL L+R L+ ++L A + AD S A +
Sbjct: 225 ANLAGSYLLKASFKKALLLRANLQGVYLLRADLSEANLRSADLRKADLSGAYL 277
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 22/117 (18%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 165
+A ADL +AV + A +AD+ ++ +GS A +KA+ +AN G
Sbjct: 194 SANLTGADLHRAVLAGADLTEAVLDNADLSRANLAGSYLLKASFKKALLLRANLQGVYLL 253
Query: 166 -ADLSDT----------------LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
ADLS+ LMD M L EA+L A L+ L R++L A + G
Sbjct: 254 RADLSEANLRSADLRKADLSGAYLMDAM-LGEADLREACLIECRLIRTNLEAAQLTG 309
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A+ A L A ++ N RA+ A++ +G+ A+L +A A G +L
Sbjct: 128 AGAKLDGAKLMAAELMEANLNRASLVGANL-----TGANLREAHLVEANLRSAILIGVNL 182
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ ++ + ANLT A L R VL +DL A+++ AD S A
Sbjct: 183 IESDLNGAQMRSANLTGADLHRAVLAGADLTEAVLDNADLSRA 225
>gi|90419937|ref|ZP_01227846.1| conserved hypothetical protein with pentapeptide repeats
[Aurantimonas manganoxydans SI85-9A1]
gi|90335978|gb|EAS49726.1| conserved hypothetical protein with pentapeptide repeats
[Aurantimonas manganoxydans SI85-9A1]
Length = 292
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 59/109 (54%), Gaps = 7/109 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY-LEKAVAYKANFTGADLS 169
A F ADL A + +F RA+F A+M+ +DFS N + L + V A+ TGADLS
Sbjct: 168 ATFDGADL-SAARIAGDFSRASFVRANMKGADFSADMRNQSMGLMRGVLNSADLTGADLS 226
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVI 213
+ R A+ T+A L LTR ++ G ++EGADF+DA +
Sbjct: 227 GANLSRAAAEFADFTDADLSGADLTRFEASGANFNGTMVEGADFADAEL 275
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL-- 188
A+ TSA + +D S ++ GA L++A ANFTGADLS + + + +A A L
Sbjct: 118 ADLTSAYLNGTDLSNARLAGAKLDQAWGLGANFTGADLSGASLFQSQMQDATFDGADLSA 177
Query: 189 --VRTVLTRSDLGGAIIEGADFS 209
+ +R+ A ++GADFS
Sbjct: 178 ARIAGDFSRASFVRANMKGADFS 200
>gi|119485665|ref|ZP_01619940.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
gi|119456990|gb|EAW38117.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
Length = 433
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 60/111 (54%), Gaps = 7/111 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA------VAYKAN 162
S A F A+LR+A K N A+ + A + ++D G K GA L A + Y AN
Sbjct: 116 SGANFRDANLREAYLWKANLSNADLSDAYLEKADLRGVKLEGADLGYAMLKGANLGY-AN 174
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
F A L++T + L +ANL A LV L ++DL GA +EGA+ S+A +
Sbjct: 175 FVRARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKL 225
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 55/103 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ + DL A + N R A+ A+++++D G+K GA L A +AN A
Sbjct: 178 ARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVG 237
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ L++A+L A L +T +TR+DLG A ++ A DA +
Sbjct: 238 ANLENANLHQASLKGANLAKTQMTRADLGFANLQKASLGDAQL 280
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 63/137 (45%), Gaps = 13/137 (9%)
Query: 91 LADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSA----------D 137
L D N +A+ RG E S A+ A+L A+ V N AN A
Sbjct: 200 LVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVGANLENANLHQASLKGANLAKTQ 259
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
M +D + A L A +AN ADL++ + L +ANL NA+L + L +
Sbjct: 260 MTRADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQ 319
Query: 198 LGGAIIEGADFSDAVID 214
L GA +E A+ +DA+++
Sbjct: 320 LKGANLEDANLTDAILE 336
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK---------- 160
A G A+L+KA +AN SAD+ E+ +K A L A+ K
Sbjct: 263 ADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQLKG 322
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN A+L+D +++ ++L +ANL +A L L +++L GA ++ A+ ++A
Sbjct: 323 ANLEDANLTDAILEGVILEDANLEDANLEGAKLEQANLIGAYLKDANLTEA 373
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 57/124 (45%), Gaps = 27/124 (21%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFN-----GA 151
I A+ G A L+ A N AN T A ++ +++ G+K GA
Sbjct: 309 ILEKAKLGFAQLKGA-----NLEDANLTDAILEGVILEDANLEDANLEGAKLEQANLIGA 363
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
YL+ A +AN GADL L +ANL NA L L ++L GA ++GA+ D
Sbjct: 364 YLKDANLTEANLQGADLRGA-----NLTKANLRNAYLQGANLRGANLKGASLKGANLRD- 417
Query: 212 VIDL 215
+DL
Sbjct: 418 -VDL 420
>gi|428225059|ref|YP_007109156.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984960|gb|AFY66104.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 315
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 67/152 (44%), Gaps = 20/152 (13%)
Query: 109 SAAQFGSADLR----KAVHVKENFRRA------NFTSADMRESDFSGSKFNGAYL----- 153
S A+ A LR V +K+ F N D+R + SG+ +GA L
Sbjct: 148 SGARLSGATLRGSFLNGVKLKDAFLNGVDLNGINLDGVDLRSTKLSGATLHGANLAATNF 207
Query: 154 -----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
AN +GA+LS + R LN ANLT A L LT ++L GA IEGA+F
Sbjct: 208 SDAKMHGGSFTGANLSGANLSRAFLKRANLNWANLTRADLTDADLTEANLLGARIEGAEF 267
Query: 209 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
+ + ++ L A G P + TR +L
Sbjct: 268 TGVTLSDPTRRYLRLIATGVTPWSQQPTRSTL 299
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
S A A+LR + N AN + ++R +D +G+ N GA L A +
Sbjct: 103 SGANLNGANLRGSHLQHANLCGANLNAINLRGADLTGANLNWANLSGARLSGATLRGSFL 162
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
G L D ++ + LN NL L T L+ + L GA + +FSDA
Sbjct: 163 NGVKLKDAFLNGVDLNGINLDGVDLRSTKLSGATLHGANLAATNFSDA 210
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 23/111 (20%)
Query: 132 NFTSADMRESDFSG----------SKFNGAYLEKAVAYKANFTGA----------DLSDT 171
NF D+R +D SG + GA L +A +AN +GA LS+
Sbjct: 16 NFAGVDLRGADLSGVTLIAVDLSDANLMGANLSRAFLTQANLSGAFLNWADLRYVKLSEG 75
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
+ + L +ANL+ A +V++ R+ L GA + GA+ + + Q LC
Sbjct: 76 CLTHVDLTKANLSGAFMVKSDFNRAKLSGANLNGANLRGSHL---QHANLC 123
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L A VK +F RA + A++ ++ GS A L A N GADL+ ++
Sbjct: 85 ANLSGAFMVKSDFNRAKLSGANLNGANLRGSHLQHANLCGANLNAINLRGADLTGANLNW 144
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L+ A L+ A L + L L A + G D + +D
Sbjct: 145 ANLSGARLSGATLRGSFLNGVKLKDAFLNGVDLNGINLD 183
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 53/118 (44%), Gaps = 14/118 (11%)
Query: 109 SAAQFGSADLRKA-------VHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKA 156
S A ADLR HV + +AN + A M +SDF SG+ NGA L +
Sbjct: 58 SGAFLNWADLRYVKLSEGCLTHV--DLTKANLSGAFMVKSDFNRAKLSGANLNGANLRGS 115
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
AN GA+L+ + L ANL A L L+ + L G+ + G DA ++
Sbjct: 116 HLQHANLCGANLNAINLRGADLTGANLNWANLSGARLSGATLRGSFLNGVKLKDAFLN 173
>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 328
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 3/119 (2%)
Query: 98 EAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
E + RG +G+ AQ A+L++A+ + N AN + AD+ +D S S A L
Sbjct: 204 ETDLRGVSFLGADLQGAQMARANLKEAILRQVNLTEANLSEADLAGADLSASSLCSAKLA 263
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ +AN GADL + L NL NA L +LTR+DL A + GA+ A +
Sbjct: 264 RTDLSRANLAGADLRCANLVDAYLGRTNLENADLGEAILTRADLSTANLSGANLRGATL 322
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 10/96 (10%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
G A L+KA V N AN + AD+ E+D ++ +GA L+ A + AN T A
Sbjct: 52 LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTLA------ 105
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+L +ANL +A L LT ++LGGA + GA+
Sbjct: 106 ----LLIDANLLDADLRWANLTSANLGGACLRGANL 137
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL +A N + A+ A+++ ++ ++ GA L + +F GADL
Sbjct: 158 SGANLSGADLTRADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADL 217
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
M R ANL A+L + LT ++L A + GAD S
Sbjct: 218 QGAQMAR-----ANLKEAILRQVNLTEANLSEADLAGADLS 253
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 61/118 (51%), Gaps = 15/118 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLE----------K 155
A A+L++A +K N + AN A ++ E+D G F GA L+ +
Sbjct: 170 ADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKE 229
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+ + N T A+LS+ + L+ ++L +A L RT L+R++L GA + A+ DA +
Sbjct: 230 AILRQVNLTEANLSEADLAGADLSASSLCSAKLARTDLSRANLAGADLRCANLVDAYL 287
Score = 43.9 bits (102), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 84/167 (50%), Gaps = 9/167 (5%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
A L++ ++ +T L A + + ++ L D N +A+ R + ++A G A LR A
Sbjct: 80 ADLRDAQLHGAT-LQGADLHGANLTLALLIDANLLDADLR--WANLTSANLGGACLRGAN 136
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
++ R A +A++ +D SG+ +GA L +A+ +GA+L + + + L AN
Sbjct: 137 LRFDSRRGAVLRNANLSRADLSGANLSGADL-----TRADLSGANLKEASLIKANLQGAN 191
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 228
L A L +L+ +DL G GAD A + A K+A+ + N T
Sbjct: 192 LQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKEAILRQVNLT 238
>gi|428310592|ref|YP_007121569.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428252204|gb|AFZ18163.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 522
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 2/134 (1%)
Query: 94 LNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
L KY A R G+ A +A+L A + N AN + A++ ++ S +K N A
Sbjct: 7 LKKYAAGDRDFSGLNLAEVNLSAANLSGANLSEVNLSVANLSGANLSGANLSRAKLNVAR 66
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
L A KAN A L+ T + R L ANLT A L+R L R++L GA ++ A+ S A
Sbjct: 67 LSGANISKANLIQASLNVTNLIRADLRRANLTQAALIRAELIRAELSGATLKEANLSGAD 126
Query: 213 I-DLAQKQALCKYA 225
+ + A +QA+ A
Sbjct: 127 LREAALRQAILSRA 140
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 1/119 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L ++ + RRAN T A + ++ ++ +GA L++A A+ A L
Sbjct: 73 SKANLIQASLNVTNLIRADLRRANLTQAALIRAELIRAELSGATLKEANLSGADLREAAL 132
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
++ R L+EANL A L ++L ++L A + AD SD+ I A +QA +AN
Sbjct: 133 RQAILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFAN 191
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 56/117 (47%), Gaps = 10/117 (8%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR+A N A+ + A++R +D SG+ A L A AN GADLS
Sbjct: 180 ADLRQANLSFANLSGADLSRANLRWADLSGADLRWANLSDAKLSGANLMGADLS------ 233
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
ANL NA LV LT++ L GAD S A + A+ A+ ++ T IT
Sbjct: 234 ----HANLHNASLVHADLTQASLIKVDWIGADLSGATMTGAKLYAVSRFGLKTTGIT 286
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+A + RA + A++R + + S G L KA +A+ + +++ +
Sbjct: 120 ANLSGADLREAALRQAILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIRE 179
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L+ ANL+ A L R L +DL GA + A+ SDA
Sbjct: 180 ADLRQANLSFANLSGADLSRANLRWADLSGADLRWANLSDA 220
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 48/106 (45%), Gaps = 5/106 (4%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
I S A A+LR A N AD+ +D S S A L +A AN +G
Sbjct: 135 AILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFANLSG 194
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS R L A+L+ A L L+ + L GA + GAD S A
Sbjct: 195 ADLS-----RANLRWADLSGADLRWANLSDAKLSGANLMGADLSHA 235
Score = 40.4 bits (93), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NFTG 165
A A+L +A ++ RA + A ++E++ SG+ A L +A+ +A N G
Sbjct: 90 ADLRRANLTQAALIRAELIRAELSGATLKEANLSGADLREAALRQAILSRATLSEANLRG 149
Query: 166 ADLSDTLMDRMVLNEANLTNAVL----VRTV-LTRSDLGGAIIEGADFSDA 211
A L+ ++++ LN+A+L A L +R L +++L A + GAD S A
Sbjct: 150 AFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFANLSGADLSRA 200
>gi|354564725|ref|ZP_08983901.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549851|gb|EHC19290.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 564
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 10/106 (9%)
Query: 109 SAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S ADLR + N R A +AD+ + +G+K NGA L A+ A+
Sbjct: 386 SGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGAKLNGANLRSAILLGADL 445
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
G DL+D ++LNEA+L+ VL L+ +D+ AI+ G D S
Sbjct: 446 GGVDLTD-----VILNEADLSGVVLNEADLSGADISDAILFGTDLS 486
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 57/114 (50%), Gaps = 7/114 (6%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
GEF GS F A L A NF AN TSA + +++ +G F+ A L A AN
Sbjct: 227 GEFLQGS--NFSGAYLGDANLTGVNFSAANLTSAYLGDANLTGVNFSAANLNAANLGDAN 284
Query: 163 FTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+GA+LS T + L+ ANL A L R L+ +DL A + GAD S A
Sbjct: 285 LSGANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHA 338
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ F +A+L A N AN + A++R +D S + +GA L A Y+A+ + ADL
Sbjct: 266 TGVNFSAANLNAANLGDANLSGANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADL 325
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L+ ANL++A L L+ S L AI+ A+ SDA
Sbjct: 326 SSANLSGADLSHANLSSANLRDAELSSSYLSHAILFAANLSDA 368
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 15/99 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+LR A+ + + + T + E+D SG N +A+ +GAD+SD
Sbjct: 428 AKLNGANLRSAILLGADLGGVDLTDVILNEADLSGVVLN----------EADLSGADISD 477
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++ L+ ANL++A L+ S+L GAI+ GAD S
Sbjct: 478 AILFGTDLSYANLSSA-----NLSGSNLSGAILSGADLS 511
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ S+ L A+ N AN SA++ +D + +G L A + G++LSD
Sbjct: 348 AELSSSYLSHAILFAANLSDANLNSANLSYADLCRADLSGTNLNHA-----DLRGSNLSD 402
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T +L NL NA+L+ L+ + L GA + GA+ A++
Sbjct: 403 T-----ILFSTNLRNAILIAADLSYAKLNGAKLNGANLRSAIL 440
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 25/112 (22%)
Query: 127 NFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF-----TGADLSDTLMDRM 176
N AN + AD+ +D SG+ N G+ L + + N ADLS ++
Sbjct: 369 NLNSANLSYADLCRADLSGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGA 428
Query: 177 VLNEANLTNAVLV----------RTVLTRSDLGGAIIE-----GADFSDAVI 213
LN ANL +A+L+ +L +DL G ++ GAD SDA++
Sbjct: 429 KLNGANLRSAILLGADLGGVDLTDVILNEADLSGVVLNEADLSGADISDAIL 480
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 10/115 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A DL A N A+ AD+ +D S + +GA L A AN A+L
Sbjct: 291 SGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHANLSSANLRDAEL 350
Query: 169 SDTLMDRMV----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + + + LN ANL+ A L R L+ ++L A + G++ SD ++
Sbjct: 351 SSSYLSHAILFAANLSDANLNSANLSYADLCRADLSGTNLNHADLRGSNLSDTIL 405
>gi|158340059|ref|YP_001521229.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310300|gb|ABW31915.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 483
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 158
S + A+LR A NFR+AN + AD + ++D SG+ F+GAYL KA
Sbjct: 305 SYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANL 364
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A GADLS + ++L ANL +A L L+ +DL AI+ D +
Sbjct: 365 SSAFLIGADLSRANLSDVILRGANLLSANLSDASLSSADLNNAILLNTDLRE 416
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 50/100 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F A+L A N R+AN A + ++F + + A + KA A+ ADL
Sbjct: 290 SIANFIGANLGGANLSYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADL 349
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
S L +ANL++A L+ L+R++L I+ GA+
Sbjct: 350 SGAYFSGAYLYKANLSSAFLIGADLSRANLSDVILRGANL 389
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 3/124 (2%)
Query: 91 LADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
L D N A RG I + A A+L A NF AN A++ S+ +
Sbjct: 254 LIDANLSGANLRGANLIDANLRGANLIDANLSDAYLSIANFIGANLGGANLSYSNLRKAN 313
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
A+L A KAN + AD+S + LN+A+L+ A L +++L A + GAD
Sbjct: 314 LRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANLSSAFLIGAD 373
Query: 208 FSDA 211
S A
Sbjct: 374 LSRA 377
>gi|209526319|ref|ZP_03274848.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376001485|ref|ZP_09779353.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423062694|ref|ZP_17051484.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493248|gb|EDZ93574.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375330094|emb|CCE15106.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406715650|gb|EKD10803.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 390
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 64/115 (55%), Gaps = 10/115 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV-----AYKANFT 164
+A ADL +A+ +K N +A+ +SA++ +S+ + F AYL KA ++A+ +
Sbjct: 111 SAHLNWADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLS 170
Query: 165 GADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A+L D + L+E ANL A L LT+++LG A + GA+ +DA ++
Sbjct: 171 SANLKDVNLSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLN 225
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADL A N + N + A++ ++ SGS NGA N GA LS
Sbjct: 57 ADFSEADLSGAHLSLANLSKVNLSGANLTGANLSGSSLNGA----------NLQGATLSG 106
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ LN A+LT A+ ++T L ++DL A + ++ A
Sbjct: 107 VNLESAHLNWADLTEAIFIKTNLHKADLSSANLTKSNLQSA 147
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SA L +A + N RAN + A++ ++ NGA+L K A+ G DLS
Sbjct: 222 AYLNSASLVEADLYQANLTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSQ 281
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L+ + L A L+ A LV +L ++L A + GA+ A +
Sbjct: 282 KLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQSACL 324
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 44/90 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N AN + M ++ G+ A L KA +AN GA+L
Sbjct: 160 SEADLFQADLSSANLKDVNLSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANL 219
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+D ++ L EA+L A L R L+R++L
Sbjct: 220 TDAYLNSASLVEADLYQANLTRANLSRANL 249
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A G DL + + N A A + E++ S + +GA L+ A A+
Sbjct: 270 SGADLGGVDLSQKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQSACLIHADL 329
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA +DR+ L +ANLT A L + L ++L AI+ G + A
Sbjct: 330 GGA-----YLDRVDLTDANLTGANLTKADLREANLRAAILAGVELKGA 372
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 17/108 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L +A K N RAN A++ ++ + + A L +A +AN + A+LS
Sbjct: 192 ANLMGANLTEADLTKANLGRANLRGANLTDAYLNSASLVEADLYQANLTRANLSRANLSK 251
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
T + + LN A+LT + L+ +DLGG +DL+QK
Sbjct: 252 TYLRDICLNGAHLT-----KVNLSGADLGG------------VDLSQK 282
>gi|448473532|ref|ZP_21601674.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
gi|445819044|gb|EMA68893.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
Length = 348
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 57/115 (49%), Gaps = 12/115 (10%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N R AN T AD+ S + A L KA Y AN +GADL+ L+D+ L A+L
Sbjct: 64 NLRGANITGADL-----SSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGV 118
Query: 187 VLVRTVLTRSDLGGAIIEGADFSD------AVIDLAQKQALCKYAN-GTNPITGV 234
LTR+DL A + GA+FSD AV D + A AN G +TGV
Sbjct: 119 GFTEAHLTRADLHSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGV 173
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVA 158
+ A SA+L A+ K N AN + AD+ R +D G F A+L +A
Sbjct: 71 TGADLSSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGVGFTEAHLTRADL 130
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
+ A+ GA+ SD + + +A+L A L L +DL G I+
Sbjct: 131 HSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGVIL 175
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 10/80 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADLR F A+ T AD+ +D G+ F+ A L A A+ GADL+D
Sbjct: 108 ANLRSADLRGV-----GFTEAHLTRADLHSADLRGANFSDADLFGAAVTDADLRGADLTD 162
Query: 171 TLMDRMVLNEANLTNAVLVR 190
L + +LT +L R
Sbjct: 163 A-----NLGDTDLTGVILAR 177
>gi|78187857|ref|YP_375900.1| pentapeptide repeat-containing protein [Chlorobium luteolum DSM
273]
gi|78167759|gb|ABB24857.1| pentapeptide repeat family protein [Chlorobium luteolum DSM 273]
Length = 447
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 59/118 (50%), Gaps = 20/118 (16%)
Query: 111 AQFGSADLRKAVHVKE----------NFRRANFTSADMRESDFSGSKFNGAYLEKA---- 156
A+ ADLR+ V ++ N R AN A +R++D G+ GA+L KA
Sbjct: 63 AELAGADLRRTVLIRADLSGANLNGANLREANLAMAFIRKADMKGADMTGAWLVKANLKS 122
Query: 157 -----VAYK-ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+++ AN GA+L + + + L ANL+NAVL L +DL GA + GA F
Sbjct: 123 SFMNGASFRGANLLGANLRWSSLRKADLTGANLSNAVLFEANLAGADLSGANLSGATF 180
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 1/106 (0%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A+ F A L A N R A AD++ + G+ GA L++A A+ +GA+L
Sbjct: 306 ASSFNGATLDNADMRGANLRNAYMKKADLKSAKLGGACLEGANLDRAFLKDADLSGANLR 365
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 214
T++ L+ ANL A L L +DL GA ++GAD A V+D
Sbjct: 366 GTMLYGATLSGANLEGADLAGASLFDADLRGANLDGADLEGANVMD 411
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 5/98 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR+A +F A +ADMR G+ AY++KA A GA L +DR
Sbjct: 297 ADLRQADLGASSFNGATLDNADMR-----GANLRNAYMKKADLKSAKLGGACLEGANLDR 351
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L +A+L+ A L T+L + L GA +EGAD + A +
Sbjct: 352 AFLKDADLSGANLRGTMLYGATLSGANLEGADLAGASL 389
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 20/89 (22%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
KE ++ AD+R++D S FNGA L+ A + ANL
Sbjct: 286 KEKLESSSLEGADLRQADLGASSFNGATLDNAD--------------------MRGANLR 325
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
NA + + L + LGGA +EGA+ A +
Sbjct: 326 NAYMKKADLKSAKLGGACLEGANLDRAFL 354
>gi|158340188|ref|YP_001521358.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310429|gb|ABW32044.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 292
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 72/142 (50%), Gaps = 15/142 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F ++ L++++ + ++F+ AD+R +DFS +K + A L++ +AN GADL
Sbjct: 68 SGANFKASKLQRSLAIWVQAYWSDFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127
Query: 169 SDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
SD+ A NLTN L + +T SDL A + +D S + +
Sbjct: 128 SDSEAQDACFKGANLWGVWAQRTNLTNVCLSQVDMTTSDLTEAQLSESDLSWSFL----S 183
Query: 219 QALCKYANGTNP-ITGVSTRKS 239
QA+C AN T+ + G +K+
Sbjct: 184 QAVCVGANLTSACLEGSDLKKT 205
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 10/89 (11%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLN 179
+ + T++D+ E+ S S + ++L +AV AN T A D D + R L+
Sbjct: 159 QVDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACLSRADLS 218
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+ NA L ++DL GA + GADF
Sbjct: 219 AADCENACFFNANLYKADLRGAKLCGADF 247
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 10/113 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A L A + +F +AN AD+ +S+ + F GA L A + N T LS
Sbjct: 100 ADFSCAKLSAAQLKRTDFSQANLMGADLSDSEAQDACFKGANLWGVWAQRTNLTNVCLSQ 159
Query: 171 TLMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
M L EA L+ AV V LT + L G+ ++ DF DA +
Sbjct: 160 VDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACL 212
>gi|254409513|ref|ZP_05023294.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183510|gb|EDX78493.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 209
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 61/118 (51%), Gaps = 15/118 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 160
A +A+L +A ++ N +RAN T A +RE+ + +GA L +A+ +
Sbjct: 80 ANLTAAELVRATLIECNLKRANLTEAHLREASLMFANLAQACLYQADLHGAMLHQAILHW 139
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADFSDAVI 213
A+ ADL ++ + A+L+ A L+R +L +DL GAI+ GA+F A++
Sbjct: 140 ASLKNADLIGAILQGADMRGADLSQACLIRADVSKAILMVADLRGAIVMGANFKAAIL 197
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
A A++R SD SG+ +GA L+ + +AN + A+LS + + LN+ANLT A
Sbjct: 27 LTEAILNGANLRRSDLSGANLSGASLKGSNLSEANLSQANLSVANLSKAELNDANLTAAE 86
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
LVR L +L A + A +A + A C Y
Sbjct: 87 LVRATLIECNLKRANLTEAHLREASLMFANLAQACLY 123
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 194
E DF G K GAYL +A+ AN +GA+LS + L+EANL+ A L L+
Sbjct: 14 ERDFEGVKLRGAYLTEAILNGANLRRSDLSGANLSGASLKGSNLSEANLSQANLSVANLS 73
Query: 195 RSDLGGAIIEGADFSDAVI 213
+++L A + A+ A +
Sbjct: 74 KAELNDANLTAAELVRATL 92
>gi|390438023|ref|ZP_10226524.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
gi|389838556|emb|CCI30648.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
Length = 275
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 69/139 (49%), Gaps = 18/139 (12%)
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
A+ K N A +A++R +D SG+ GAYL A AN A LS + R L
Sbjct: 135 AIGPKANLTGAYLNNANLRFADLSGANLRGAYLSGADLTGANLAAAALSGANLQRASLTG 194
Query: 181 ANLTNAVLVRTVLTRSDLGGAI-----------IEGADFS--DAVIDLAQKQALCKYAN- 226
A L +A LV L +DL GA +EGADFS + + DL ++ LC ++
Sbjct: 195 AFLRDARLVGVELQFADLRGADLTGAILEQIQNLEGADFSQVEGLSDL-ERSYLCGRSSR 253
Query: 227 --GT-NPITGVSTRKSLGC 242
GT NP T +T +SLGC
Sbjct: 254 ELGTWNPYTRSNTGQSLGC 272
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 54/111 (48%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
D+RKA ++ AN D+ + D + F GA L A AN TGA+L + R
Sbjct: 17 DVRKARDKGQSLSAANLEGIDLSQMDLKNADFTGAILLGADLAGANLTGANLEAADLRRA 76
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L ++L A L T+L R+ L GA ++GAD + A I L+ + G
Sbjct: 77 NLRGSDLRGANLRDTLLYRAILCGANLQGADLTGAKISLSVYDGTTSWPEG 127
>gi|304414054|ref|ZP_07395422.1| pentapeptide repeat-containing protein [Candidatus Regiella
insecticola LSR1]
gi|304283268|gb|EFL91664.1| pentapeptide repeat-containing protein [Candidatus Regiella
insecticola LSR1]
Length = 283
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 57/128 (44%), Gaps = 22/128 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF----------------------SGS 146
S A +ADLR A N + A ADMRE D SG+
Sbjct: 122 SNATLSNADLRGAYMSWANLQNATLNDADMREVDLVGADMREAKLIGKKTNLEGANLSGA 181
Query: 147 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
GA L + KA + ADLS ++R+ L EANL +A+L T L + L A +E
Sbjct: 182 DLRGAELCHTILIKAALSWADLSYAKLERVNLREANLYHAILEETSLYLTKLENANLESV 241
Query: 207 DFSDAVID 214
+ DAV++
Sbjct: 242 NLKDAVLE 249
>gi|86606854|ref|YP_475617.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555396|gb|ABD00354.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 248
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 69/148 (46%), Gaps = 15/148 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----G 165
A F ++DLR + + +NFT+A + +S F G F+ + +A AN T
Sbjct: 89 ANFAASDLRGSSFSQALGDYSNFTAAKLDKSSFQGGHFSHSIFREASLVAANLTEGNFFA 148
Query: 166 AD----------LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
AD LS ++ L AN A+LV L + + GA GADF+DA +
Sbjct: 149 ADFRQANLFRCNLSQAILSSCQLQNANFDQALLVGANLQEAQIEGASFVGADFTDAKLSD 208
Query: 216 AQKQALCKYANGTNPITGVSTRKSLGCG 243
++ L + A+GTN +T T +L G
Sbjct: 209 EMRKFLLERASGTNELTQRDTLNTLLAG 236
>gi|239909009|ref|YP_002955751.1| hypothetical protein DMR_43740 [Desulfovibrio magneticus RS-1]
gi|239798876|dbj|BAH77865.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 972
Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats.
Identities = 33/98 (33%), Positives = 51/98 (52%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F +A L K + R +NFT+A ++F ++ + L KA NF ADL++T
Sbjct: 820 EFANAILNKTNFESASLRESNFTNAICNNANFKKARMEKSNLHKATLINTNFEKADLTNT 879
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
L ANL+N+ L LTR++L A + GA+ S
Sbjct: 880 NFSEASLEGANLSNSKLKEANLTRANLCDANLVGANLS 917
Score = 43.5 bits (101), Expect = 0.094, Method: Composition-based stats.
Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 165
A+ ++L KA + NF +A+ T+ + E+ G SK A L +A AN G
Sbjct: 854 ARMEKSNLHKATLINTNFEKADLTNTNFSEASLEGANLSNSKLKEANLTRANLCDANLVG 913
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCK 223
A+LS + + + N+ANL NA L+ S GA ++ A F D V IDL Q C+
Sbjct: 914 ANLSGSDLSKANFNKANLANANLLNCKFNFSKFLGANLDNAKFDDDVDIDLLTNQKRCQ 972
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 25/96 (26%), Positives = 46/96 (47%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
S++L K+ ANF +++ E +F+G+K + A+ K NF A L ++
Sbjct: 783 SSNLAKSSLKNCQLFNANFMFSNLSEVNFNGAKLDDVEFANAILNKTNFESASLRESNFT 842
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ N AN A + ++ L ++ L E AD ++
Sbjct: 843 NAICNNANFKKARMEKSNLHKATLINTNFEKADLTN 878
>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 740
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 54/101 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +LR A N A+ AD+R +D G+ F GA L +A Y+AN T + +
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R+ N ++L +A L+R L++S L A ++GA+ S +
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L +N A RG G A ADLR A NF+ AN A+ +++ + FNG
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGG 200
A L + NF +DL D + R+ L++ ANL+ + L T TR+DL
Sbjct: 640 ANLR-----RVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSN 694
Query: 201 AIIEGADFSDAVI 213
A GAD S +I
Sbjct: 695 AKFNGADLSFTLI 707
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+QF DLR+ N + +F ADMRE + G L KAN + A L+
Sbjct: 430 SQFQGQDLRQKNLKGVNLKTIDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNG 489
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
+ + L AN+ A LV+T L R+DL + A + A + A ++ C
Sbjct: 490 SKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 164
+ A A++++A VK + RRA+ ++ + + ++ A L A KAN
Sbjct: 493 AVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASL 552
Query: 165 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 553 EGCDLQGADLSNGNLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 605
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 95 NKYEAE-TRGEFGIGSA--AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N Y+A T G F + F +DLR A ++ + ++ SA ++ ++ S S G
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
+A A F GADLS TL+ L+ A+LTNA L + L S+ G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 46/103 (44%), Gaps = 10/103 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ A+LR A +K N A+ D++ +D S A L +A AN G +L
Sbjct: 528 TTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAKLNQANLAHANLRGVNL 587
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ ANL L L +DL GA ++GA+F A
Sbjct: 588 RN----------ANLRGGNLEGAHLEGADLRGADLQGANFKGA 620
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 20/141 (14%)
Query: 113 FGSADLRK-----AVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKAN 162
F AD+R+ +K + R N A++ + +GSK GA +++A K +
Sbjct: 452 FKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNGSKLAVANLKGANMQEASLVKTD 511
Query: 163 FTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
ADL D + L ANL +A L++ L + L G ++GAD S+ ++ A+
Sbjct: 512 LRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAK 571
Query: 218 -KQALCKYANGTNPITGVSTR 237
QA +AN + GV+ R
Sbjct: 572 LNQANLAHAN----LRGVNLR 588
>gi|428216913|ref|YP_007101378.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988695|gb|AFY68950.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 227
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 65/120 (54%), Gaps = 4/120 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+ + A+L V N A+ ++ ++ +D S S + A L + ANF+ A L
Sbjct: 21 SSVKLPGAELDGEVLHHANLADADLSAGNLNHADLSNSDLSRANLYRCSLKHANFSAAKL 80
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S+ + + LN+ANL++A+L L +DL GAI+ GAD S A DL + LC +AN T
Sbjct: 81 SNANLKDVQLNDANLSDAILSCANLAEADLSGAILVGADLSGA--DLTNAE-LC-HANLT 136
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 81/171 (47%), Gaps = 11/171 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ++DL +A + + + ANF++A + ++ + N A L A+ AN ADLS
Sbjct: 53 ADLSNSDLSRANLYRCSLKHANFSAAKLSNANLKDVQLNDANLSDAILSCANLAEADLSG 112
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALCKYA 225
++ L+ A+LTNA L LT ++L G ++ GA+F++A ++ AQ A
Sbjct: 113 AILVGADLSGADLTNAELCHANLTGANLEGVLLHNANLTGANFTNANMENAQLDG----A 168
Query: 226 NGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKL--LDRDGFCDS 274
+ TN +T ++ NS A ++ L Q L+ CD+
Sbjct: 169 DLTNANLSGTTLHNVNLANSNLQAVNLTNADLRGVNLQHTHNLETANLCDA 219
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S A ADL A+ V + A+ T+A++ ++ +G+ G L A ANFT A
Sbjct: 99 ILSCANLAEADLSGAILVGADLSGADLTNAELCHANLTGANLEGVLLHNANLTGANFTNA 158
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
++ + +D L ANL+ L L S+L + AD
Sbjct: 159 NMENAQLDGADLTNANLSGTTLHNVNLANSNLQAVNLTNADL 200
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA+ +A+L+ N A + A++ E+D SG+ GA L A A A+L
Sbjct: 76 SAAKLSNANLKDVQLNDANLSDAILSCANLAEADLSGAILVGADLSGADLTNAELCHANL 135
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ ++ ++L+ ANLT A T +++ A ++GAD ++A
Sbjct: 136 TGANLEGVLLHNANLTGA-----NFTNANMENAQLDGADLTNA 173
>gi|381207604|ref|ZP_09914675.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 255
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 15/111 (13%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL +A + N + A+ T D+ ++ G+ +GA L A AN GADL+D +
Sbjct: 96 ADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSE 155
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAI---------------IEGADFSDA 211
L+EANL+ A L L +DLG A+ ++GAD +DA
Sbjct: 156 ANLSEANLSEADLSGADLREADLGKAVLSQAKLVGANLHRIRLQGADLTDA 206
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 60/123 (48%), Gaps = 15/123 (12%)
Query: 102 RGE-FGIGSAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEK 155
+GE FG+ ADL KAV + R +AN AD++E++ G+ + L
Sbjct: 20 KGELFGV----DLSEADLPKAVLYSSDLREAKLSKANLAKADLQEANLVGAGLHRVDLNG 75
Query: 156 AVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A ++AN ADLS L+ D N EANL NA L L ++LGG + GA S
Sbjct: 76 ANLHQANLAQADLSGALLFFADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSG 135
Query: 211 AVI 213
A +
Sbjct: 136 AKL 138
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 57/119 (47%), Gaps = 12/119 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G DL A R AN AD+ ++D S + + A L +A+ +GADL +
Sbjct: 121 ANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSEANLSEANLS-----EADLSGADLRE 175
Query: 171 TLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVID--LAQKQALC 222
+ + VL++A L A L R LT +DL A + G D +A+ + L +K LC
Sbjct: 176 ADLGKAVLSQAKLVGANLHRIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKLC 234
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ A+L A + AN + A++ E+D SG+ A L KAV +A GA+L
Sbjct: 134 SGAKLRGANLVGADLTDADLSEANLSEANLSEADLSGADLREADLGKAVLSQAKLVGANL 193
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
R+ L A+LT+A L L DL AI E F A +
Sbjct: 194 H-----RIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKL 233
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL++A V R AN A++ ++D SG+ A L +A A +AN
Sbjct: 49 SKANLAKADLQEANLVGAGLHRVDLNGANLHQANLAQADLSGALLFFADLHEANAPEANL 108
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADL++ + L ANL L L+ + L GA + GAD +DA
Sbjct: 109 KNADLTE-----VDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDA 151
>gi|428319029|ref|YP_007116911.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428242709|gb|AFZ08495.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 520
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 1/121 (0%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
L KY A R GI + A +L A N AN + A++ +++ +G+K N A
Sbjct: 7 LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKTNLTGAKLNIAR 66
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
L A A+ T ADL+ + R+ L +A L A L+R L R++L GA + GA+ S A
Sbjct: 67 LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126
Query: 213 I 213
+
Sbjct: 127 L 127
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+KA+ + RA A++ ++ SG+ +GA L +A AN A+L +
Sbjct: 91 DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L EANL A L L+R+DL GA + G + A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A + R AN A++R + SG+ A LE+A A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 211
S + L +ANLT AVL L+ +L AI+ G AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 24/132 (18%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHV------KENFRRANFTSADMRESDFSGSKFNGA 151
EA RG A A+LR A H+ + N +AN AD+ +D SG+ G
Sbjct: 129 EATLRG-------ANLAQANLRGA-HLSGACLTEANLEQANLQGADLSRADLSGADLRGT 180
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMV----------LNEANLTNAVLVRTVLTRSDLGGA 201
L +A +A +GADLS + + L+EA L+ A L R L ++L A
Sbjct: 181 ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKLSGADLSRADLCHANLLNA 240
Query: 202 IIEGADFSDAVI 213
+ AD S+A +
Sbjct: 241 SLVHADLSNAYL 252
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G ADL A + A D++++ G+K A L +A AN +GA+L
Sbjct: 68 SGAHLGGADLTDA-----DLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122
Query: 169 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L +ANL A L LT ++L A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 15/85 (17%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NF ++ E++ SG +GA L+ A AN +GA+LS T NLT A L
Sbjct: 16 NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKT----------NLTGAKL--- 62
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLA 216
+ L GA + GAD +DA +++A
Sbjct: 63 --NIARLSGAHLGGADLTDADLNVA 85
>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 740
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 54/101 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +LR A N A+ AD+R +D G+ F GA L +A Y+AN T + +
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R+ N ++L +A L+R L++S L A ++GA+ S +
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L +N A RG G A ADLR A NF+ AN A+ +++ + FNG
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGG 200
A L + NF +DL D + R+ L++ ANL+ + L T TR+DL
Sbjct: 640 ANLR-----RVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSN 694
Query: 201 AIIEGADFSDAVI 213
A GAD S +I
Sbjct: 695 AKFNGADLSFTLI 707
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+QF DLR+ N + +F ADMRE + G L KAN + A L+
Sbjct: 430 SQFQGQDLRQKNLKGVNLKTIDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNG 489
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
+ + L AN+ A LV+T L R+DL + A + A + A ++ C
Sbjct: 490 SKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 164
+ A A++++A VK + RRA+ ++ + + ++ A L A KAN
Sbjct: 493 AVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASL 552
Query: 165 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 553 EGCDLQGADLSNGNLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 605
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 95 NKYEAE-TRGEFGIGSA--AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N Y+A T G F + F +DLR A ++ + ++ SA ++ ++ S S G
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
+A A F GADLS TL+ L+ A+LTNA L + L S+ G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 46/103 (44%), Gaps = 10/103 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ A+LR A +K N A+ D++ +D S A L +A AN G +L
Sbjct: 528 TTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAKLNQANLAHANLRGVNL 587
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ ANL L L +DL GA ++GA+F A
Sbjct: 588 RN----------ANLRGGNLEGAHLEGADLRGADLQGANFKGA 620
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 20/141 (14%)
Query: 113 FGSADLRK-----AVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKAN 162
F AD+R+ +K + R N A++ + +GSK GA +++A K +
Sbjct: 452 FKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNGSKLAVANLKGANMQEASLVKTD 511
Query: 163 FTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
ADL D + L ANL +A L++ L + L G ++GAD S+ ++ A+
Sbjct: 512 LRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAK 571
Query: 218 -KQALCKYANGTNPITGVSTR 237
QA +AN + GV+ R
Sbjct: 572 LNQANLAHAN----LRGVNLR 588
>gi|434394300|ref|YP_007129247.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
gi|428266141|gb|AFZ32087.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
Length = 213
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+F+RAN D+ DF + F GA L A +K N +GA+L + R L +ANL++A
Sbjct: 100 DFKRANLKEKDLSGRDFRNANFTGANLSDAFMHKVNLSGANLFQANLFRANLLQANLSHA 159
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
L L +DL G+ + GAD A I + +
Sbjct: 160 NLREANLVGADLSGSDLSGADLRGARIGVGDR 191
>gi|428224583|ref|YP_007108680.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984484|gb|AFY65628.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 156
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 71/138 (51%), Gaps = 8/138 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A L +A + N ++AN +SAD+ +D S + +GA L +A A+ T ADL
Sbjct: 17 FQQAALHQADLEEVNLQQANLSSADLSSADLSHANLSGANLSRANLSNADLTNADLRSAD 76
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLAQKQALCKYANGTN 229
+ + L ANL+ A L R L ++DL AI+ ADF+ A +DL+ +GTN
Sbjct: 77 LSEVNLIGANLSGAKLGRANLFQADLRSAILTDADFTGANLEDVDLSGAD-----LSGTN 131
Query: 230 PITGVSTRKSLGCGNSRR 247
T ++ + G SRR
Sbjct: 132 LRTAELSKAASSHGVSRR 149
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S+A SADL A N RAN T+AD+R +D S GA L A +AN
Sbjct: 38 SSADLSSADLSHANLSGANLSRANLSNADLTNADLRSADLSEVNLIGANLSGAKLGRANL 97
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADL +L +A+ T A L L+ +DL G + A+ S A
Sbjct: 98 FQADLRSA-----ILTDADFTGANLEDVDLSGADLSGTNLRTAELSKA 140
>gi|78189684|ref|YP_380022.1| pentapeptide repeat-containing protein [Chlorobium chlorochromatii
CaD3]
gi|78171883|gb|ABB28979.1| pentapeptide repeat family protein [Chlorobium chlorochromatii
CaD3]
Length = 389
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 5/88 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-----DRMVLNEANLTN 185
ANF ADM+ + G+ GA+ ++A +AN GA+L+ L+ D+ L ANLT
Sbjct: 270 ANFYKADMKGAQLQGANLQGAHCDRAFLLQANLQGANLTKALLFGATLDKADLRNANLTE 329
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A L +DL GAI+ A+ +DAV+
Sbjct: 330 ASLFGANCEGADLRGAILTRANVTDAVL 357
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 61/129 (47%), Gaps = 16/129 (12%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSG 145
LA N Y+A+ +G AQ A+L+ +A ++ N + AN T A + +
Sbjct: 267 LAGANFYKADMKG-------AQLQGANLQGAHCDRAFLLQANLQGANLTKALLFGATLDK 319
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG----A 201
+ A L +A + AN GADL ++ R + +A LTNA++ T + S A
Sbjct: 320 ADLRNANLTEASLFGANCEGADLRGAILTRANVTDAVLTNALISSTTVLPSGKAATRQWA 379
Query: 202 IIEGADFSD 210
+++ A FS
Sbjct: 380 LMQQAIFSQ 388
>gi|428203771|ref|YP_007082360.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981203|gb|AFY78803.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 180
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 52/97 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA V N N AD+ ++ SG+ GA L A + AN + A+L +
Sbjct: 60 ANLTDADLIKANLVGANLIEINLIGADLTSANLSGADLTGADLRCANLHNANLSQANLRE 119
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
+D L+ ANL+ A+LV T L+ +D GA ++G D
Sbjct: 120 VHLDGADLSGANLSGAILVNTDLSVADTVGAKLDGID 156
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 50/95 (52%), Gaps = 10/95 (10%)
Query: 122 VHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+ V+E F R NF ++ ++ G KF G L +A+ +GADLS+T +
Sbjct: 1 MKVRELFIRYLKNQRNFEEVNLHIANLQGLKFQGINL-----TRADLSGADLSETDLSGA 55
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L +ANLT+A L++ L ++L + GAD + A
Sbjct: 56 CLKQANLTDADLIKANLVGANLIEINLIGADLTSA 90
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 47/102 (46%), Gaps = 15/102 (14%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-- 184
N RA+ + AD+ E+D SG+ A L A KAN GA+L + + L ANL+
Sbjct: 36 NLTRADLSGADLSETDLSGACLKQANLTDADLIKANLVGANLIEINLIGADLTSANLSGA 95
Query: 185 -------------NAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
NA L + L L GA + GA+ S A++
Sbjct: 96 DLTGADLRCANLHNANLSQANLREVHLDGADLSGANLSGAIL 137
>gi|186682860|ref|YP_001866056.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186465312|gb|ACC81113.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 589
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A ADL A+ N N + A + +D S +K NGA L A A F G
Sbjct: 408 ADLSGADLSHAILNGTNLSDTILFSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLG 467
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS + R+ LNEA+L+ +L L+ +DL AI+ G DFS A
Sbjct: 468 ADLSGVDLSRVSLNEADLSGVILSEADLSGADLTDAILFGTDFSYA 513
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 56/115 (48%), Gaps = 10/115 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANF 163
S A ADL N RAN + AD+ +D S + N A L +A NF
Sbjct: 316 SGANLSGADLSSTNLSGANLSRANLSRADLNRADLSSTNLNRADLSNTNLSRADLSSTNF 375
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 213
+ ADLS+ ++ L+EANL+N L L R+DL G AI+ G + SD ++
Sbjct: 376 SRADLSNAILFGANLSEANLSNVSLNHADLCRADLSGADLSHAILNGTNLSDTIL 430
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ F A+L A N AN + A++ +D S + +GA L +A +A+ ADL
Sbjct: 291 TGVNFIGANLSGANFGDANLSGANLSGANLSGADLSSTNLSGANLSRANLSRADLNRADL 350
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S T ++R L+ NL+ A L T +R+DL AI+ GA+ S+A
Sbjct: 351 SSTNLNRADLSNTNLSRADLSSTNFSRADLSNAILFGANLSEA 393
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G A+L + N ANF A++ ++ SG+ +GA L AN + A+L
Sbjct: 281 SLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGANLSGADLSSTNLSGANLSRANL 340
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S ++R L+ NL A L T L+R+DL AD S+A++
Sbjct: 341 SRADLNRADLSSTNLNRADLSNTNLSRADLSSTNFSRADLSNAIL 385
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 57/115 (49%), Gaps = 8/115 (6%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
AKL N R+ + L A + S +S LN EA+ G I S A ADL A+
Sbjct: 453 AKLNNARLNGAMFLGADLSGVDLSRVS----LN--EADLSGV--ILSEADLSGADLTDAI 504
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
+F AN SA++ S+ SG+ NGA L + A +GADLSD M++M
Sbjct: 505 LFGTDFSYANLNSANLSGSNLSGAILNGANLSHSNLSYAILSGADLSDANMEKMT 559
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA A L A A F AD+ D S N A L + +A+ +GADL+
Sbjct: 442 AADLSYAKLNGAKLNNARLNGAMFLGADLSGVDLSRVSLNEADLSGVILSEADLSGADLT 501
Query: 170 DTLM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
D ++ + L+ +NL+ A+L L+ S+L AI+ GAD SDA ++
Sbjct: 502 DAILFGTDFSYANLNSANLSGSNLSGAILNGANLSHSNLSYAILSGADLSDANME 556
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 48/90 (53%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NFR A A++ +DFSG+ + AYL A NF GA+LS L+ ANL+ A
Sbjct: 259 NFRSAYLGDANLTGADFSGADLSLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGA 318
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
L L+ ++L GA + A+ S A ++ A
Sbjct: 319 NLSGADLSSTNLSGANLSRANLSRADLNRA 348
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+ F ADL A+ N AN ++ + +D + +GA L A+ N + L
Sbjct: 371 SSTNFSRADLSNAILFGANLSEANLSNVSLNHADLCRADLSGADLSHAILNGTNLSDTIL 430
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
T + +L A+L+ A L L + L GA+ GAD S
Sbjct: 431 FSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLGADLS 471
>gi|412993172|emb|CCO16705.1| predicted protein [Bathycoccus prasinos]
Length = 163
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 5/140 (3%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
G +A + DLRK + + + + A M +S F S F+ + K A KA+F
Sbjct: 29 IGQANAVSDKTLDLRKCQYDNVSVKGITLSGALMVDSVFDNSDFSETVMSKVYATKASFK 88
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
+ ++ ++DR + +++T A VLT GA + GA+F +A+I + LC
Sbjct: 89 NVNFTNAVIDRATFDGSDMTGANFQNAVLTGVSYEGANLTGANFEEALIGDQDVKLLC-- 146
Query: 225 ANGTNPITGVSTRKSLGCGN 244
NP +R +GC N
Sbjct: 147 ---LNPTVVDESRMQIGCKN 163
>gi|86605838|ref|YP_474601.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554380|gb|ABC99338.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 158
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 15/112 (13%)
Query: 117 DLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
DL +A N R AN + AD+R +D S + GA L A ++AN GA
Sbjct: 30 DLVRATLQGANLRGANLSFGKLSGINLQEADLRGADLSSANLMGANLRGANLWEANLIGA 89
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVI 213
DLS + L+ A L A L R L SDL GGA++ GAD S A++
Sbjct: 90 DLSFADLREANLHGAYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLSGAIL 141
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 54/118 (45%), Gaps = 7/118 (5%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
V L A + + + L+ +N EA+ RG A SA+L A N
Sbjct: 31 LVRATLQGANLRGANLSFGKLSGINLQEADLRG-------ADLSSANLMGANLRGANLWE 83
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
AN AD+ +D + +GAYL +A +A G+DLS + VL A+L+ A+L
Sbjct: 84 ANLIGADLSFADLREANLHGAYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLSGAIL 141
>gi|425454434|ref|ZP_18834174.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9807]
gi|389804880|emb|CCI15729.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9807]
Length = 962
Score = 55.1 bits (131), Expect = 4e-05, Method: Composition-based stats.
Identities = 34/103 (33%), Positives = 52/103 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A LR A+ N + AN A+++E++ + F GA L +A +AN GA+L +
Sbjct: 798 ANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGAILAEANLERANLYGANLGE 857
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ L ANL A L R L + L GA +E A+ A +
Sbjct: 858 ANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFL 900
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 34/106 (32%), Positives = 51/106 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A A+ + N RAN A++ E++ + GA LE+A +AN GA L
Sbjct: 828 ANLEEAFFEGAILAEANLERANLYGANLGEANLEEAFLAGANLEEAFLERANLKGAFLMG 887
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
++R L A L A L + R++L GA +E A F A ++ A
Sbjct: 888 AFLERANLKGAFLMGAFLQWADIERANLDGANLETASFYGANLERA 933
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 28/100 (28%), Positives = 51/100 (51%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ + + + +AN A++ + G+ GA L++A +AN A+L + +
Sbjct: 779 DLKNCLLICRDLYKANLERANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGA 838
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+L EANL A L L ++L A + GA+ +A ++ A
Sbjct: 839 ILAEANLERANLYGANLGEANLEEAFLAGANLEEAFLERA 878
Score = 37.7 bits (86), Expect = 5.4, Method: Composition-based stats.
Identities = 26/94 (27%), Positives = 43/94 (45%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L +A + N + A A + ++ G+ GA+L+ A +AN GA+L
Sbjct: 863 AFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQWADIERANLDGANLET 922
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
L ANL A LV +++ G I++
Sbjct: 923 ASFYGANLERANLERANLVGANFKDANVKGTILD 956
>gi|17227929|ref|NP_484477.1| hypothetical protein alr0433 [Nostoc sp. PCC 7120]
gi|17129778|dbj|BAB72391.1| alr0433 [Nostoc sp. PCC 7120]
Length = 143
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N ++A+ AD+R ++ +G+ A LE A AN GA+LS L+ NLTN
Sbjct: 47 NLQQAHLIGADLRNANLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTNV 106
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
L+ L +DL GA++ AD A++
Sbjct: 107 KLINAELYNADLEGAVLANADLRGAIL 133
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 49/99 (49%), Gaps = 10/99 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR A N AN A++ +D +G+ GA L + A A+ + +L++
Sbjct: 51 AHLIGADLRNA-----NLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTN 105
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L A L NA L VL +DL GAI+ GA +S
Sbjct: 106 -----VKLINAELYNADLEGAVLANADLRGAILFGALYS 139
>gi|186685487|ref|YP_001868683.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186467939|gb|ACC83740.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 146
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 115 SADLRKAVHVKE---------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
SA +R+ + +E N + A+ D+R ++ G+ GA LE A AN
Sbjct: 28 SAPVRRLLETRECLGCNLAGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKS 87
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L++ + +LN ANLTN L + L +D+ GA++ D S A
Sbjct: 88 ANLTEAFVSDTILNNANLTNVNLSNSRLYNTDVDGAVLANIDLSGA 133
>gi|427735760|ref|YP_007055304.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370801|gb|AFY54757.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 263
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 46/88 (52%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++FRRAN D+ + S +K NGA L A +K GADLSD + R L A++
Sbjct: 149 QDFRRANLKGRDLSGRNLSYAKLNGANLSDAFMHKVVLRGADLSDANLFRANLLLADMKE 208
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A L L +DL GA + GAD A I
Sbjct: 209 ANLQGADLIGADLSGADLRGADLRGARI 236
>gi|113477234|ref|YP_723295.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168282|gb|ABG52822.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 227
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 63/122 (51%), Gaps = 10/122 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 160
A+F ADL +A ++ + A+F+ +M ++D S G+ G +A+ K
Sbjct: 90 AKFNKADLTRAKLIRADLSCADFSQVNMVDADLSRAILYEIDLHGANLYGVNFRRAILNK 149
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
A+ GA+L M + L EANLT A L +L+ +DL GA + GA+ SD + A QA
Sbjct: 150 ADLIGANLIRANMTGVDLIEANLTRANLTEAILSGADLNGASLLGANISDVNLVGAALQA 209
Query: 221 LC 222
+
Sbjct: 210 VI 211
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 66/154 (42%), Gaps = 25/154 (16%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFN 149
N +EA G A A+L + K N AN D+ E++ S G+KFN
Sbjct: 41 NFFEANLTG-------ANLSQANLSRVNLAKANLTGANLIGTDLSEANLSDTLLVGAKFN 93
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A L +A +A+ + AD S + N+ +A L R +L DL GA + G +F
Sbjct: 94 KADLTRAKLIRADLSCADFS----------QVNMVDADLSRAILYEIDLHGANLYGVNFR 143
Query: 210 DAVI---DLAQKQALCKYANGTNPITGVSTRKSL 240
A++ DL + G + I TR +L
Sbjct: 144 RAILNKADLIGANLIRANMTGVDLIEANLTRANL 177
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 58/119 (48%), Gaps = 6/119 (5%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
E ++ IG F L++A +K N ANF A++ ++ S + + L KA
Sbjct: 10 ELLRQYAIGEK-NFSGLYLQEAHLLKANLEGANFFEANLTGANLSQANLSRVNLAKANLT 68
Query: 160 KANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AN G D LSDTL+ N+A+LT A L+R L+ +D + AD S A++
Sbjct: 69 GANLIGTDLSEANLSDTLLVGAKFNKADLTRAKLIRADLSCADFSQVNMVDADLSRAIL 127
>gi|428221053|ref|YP_007105223.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994393|gb|AFY73088.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 270
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 58/105 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ + +ADL + R+AN T+AD+ ++ + +GA L A AN + A+L
Sbjct: 121 TGSNLSNADLVYVNLENADLRQANLTNADLIYANLKNANLSGANLSGANLSGANLSDANL 180
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
D L+ + L+ ANL +A T+L R++L GA + GA F +A++
Sbjct: 181 EDALLHKAKLSNANLKSANFSGTILVRANLIGADLTGAIFKEAIL 225
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 2/113 (1%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A G + IG A DL + V N R N D++ +D + A + +
Sbjct: 63 ANLMGAYLIG--ANLSHVDLSGSNLVGANLRSINLNDTDLKGADLRETILRNARMARVNL 120
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+N + ADL ++ L +ANLTNA L+ L ++L GA + GA+ S A
Sbjct: 121 TGSNLSNADLVYVNLENADLRQANLTNADLIYANLKNANLSGANLSGANLSGA 173
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A + N + AN + A++ ++ SG+ + A LE A+ +KA + A+L
Sbjct: 138 ADLRQANLTNADLIYANLKNANLSGANLSGANLSGANLSDANLEDALLHKAKLSNANLK- 196
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AN + +LVR L +DL GAI + A A +
Sbjct: 197 ---------SANFSGTILVRANLIGADLTGAIFKEAILVHATM 230
>gi|359464087|ref|ZP_09252650.1| hypothetical protein ACCM5_35600 [Acaryochloris sp. CCMEE 5410]
Length = 237
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 61/123 (49%), Gaps = 20/123 (16%)
Query: 111 AQFGSADLRKAVHVKENF-----RRANF----------TSADMR-----ESDFSGSKFNG 150
A F ADLR++ + NF RRAN TSADMR E+D SG+K
Sbjct: 35 ADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSADMRQANLREADLSGAKLIQ 94
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L +A KA GA+LS MD +L E +L RT L R++L GA + A+ S
Sbjct: 95 TQLTEANLLKACLCGANLSAVQMDGAILIEVDLRPTSDQRTDLGRANLAGADLSYANLSQ 154
Query: 211 AVI 213
A++
Sbjct: 155 ALL 157
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +LR+A + A+F+ AD+R+S F + F+ +A + F GADL+
Sbjct: 17 FHRIELREAELINSELCGADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSAD 76
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
M + L EA+L+ A L++T LT ++L A + GA+ S +D
Sbjct: 77 MRQANLREADLSGAKLIQTQLTEANLLKACLCGANLSAVQMD 118
>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 521
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 52/99 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G ADL A N RANF A ++E+D + + +GA+L A AN +GA LS
Sbjct: 88 AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLSGALLSR 147
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++ L+ ANL NA L L +DL A ++ AD S
Sbjct: 148 ANLENADLSYANLENADLSYANLENADLSHANLKNADLS 186
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 59/136 (43%), Gaps = 21/136 (15%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHV-----KEN------FRRANFTSADMRESDFSG 145
Y ETRG+ Q L++ + V +EN FRR + + + E DFS
Sbjct: 3 YSGETRGKDYAAMTNQVLVELLKRGIGVWNSWREENLYENLDFRRVQLSGSYLSEVDFSH 62
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV----------LTR 195
+ F AYL A AN G +L+ + L ANL A L+R LT
Sbjct: 63 ANFEIAYLSSAKLSCANLEGINLNRAYLGGADLYSANLRGANLIRANFNDAHLKEADLTN 122
Query: 196 SDLGGAIIEGADFSDA 211
++L GA + GA+ +A
Sbjct: 123 ANLSGAHLRGANLLNA 138
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 6/109 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF A +SA + ++ G N AYL A Y AN GA+L + L EA+LTNA
Sbjct: 64 NFEIAYLSSAKLSCANLEGINLNRAYLGGADLYSANLRGANLIRANFNDAHLKEADLTNA 123
Query: 187 VL----VRTV-LTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTN 229
L +R L ++L GA++ A+ +A + A + A YAN N
Sbjct: 124 NLSGAHLRGANLLNANLSGALLSRANLENADLSYANLENADLSYANLEN 172
>gi|428305945|ref|YP_007142770.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247480|gb|AFZ13260.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 273
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 78/167 (46%), Gaps = 22/167 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL + V N A+ SAD+ +D S + N AYL A AN
Sbjct: 123 SGASLLGADLSRINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLANLSNANL 182
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
TGA+LS + L+ A+L+NA L L ++L A + GAD S+AV +A +
Sbjct: 183 TGANLSGS-----ELHIADLSNANLSEAQLNSAELNNANLLGADLSNAVF----AEANLR 233
Query: 224 YANGT-NPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 269
N T N I+ + ++G G G+ +S +L P L DRD
Sbjct: 234 GTNLTSNQISSANLEGAIGLG------EGASASTVLD-QPTILEDRD 273
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S Q ADL V N +N A++R++D G A L +A A+ GADL
Sbjct: 78 SRVQLSGADL-----VDANLNSSNLIQANLRDTDMLGVDLREANLSEADLSGASLLGADL 132
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S R+ L ANL+NA L + +DL A + + +DA + LA
Sbjct: 133 S-----RINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLA 175
>gi|334121546|ref|ZP_08495612.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333454932|gb|EGK83604.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 388
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 65/118 (55%), Gaps = 8/118 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A+L KAV + N +A A++ ++F + + A L++A +A TGA+LS
Sbjct: 133 ADMSAANLTKAVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTGAELSK 192
Query: 171 -----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ R L++ANL A L RT LT++ L GA + G+D S+A +D A LCK
Sbjct: 193 ANLAGANLTRANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRAN---LCK 247
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A + + AN A++ ++ +G+ N A L A+ AN GAD+
Sbjct: 76 SKADLSGANLTGANLMAASLSGANLIGANLTGANLAGAHLNWANLTGAILPNANLIGADM 135
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S + + VL EANL+ A L++ A + GA+F DA + LA
Sbjct: 136 SAANLTKAVLTEANLSKAYLIK----------ANLNGANFQDAYLSLA 173
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN ADM ++ + + A L KA KAN GA+ D + L EA+LT A L
Sbjct: 128 ANLIGADMSAANLTKAVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTG 187
Query: 191 TVLTRSDLGGAIIEGADFSDA 211
L++++L GA + A+ S A
Sbjct: 188 AELSKANLAGANLTRANLSKA 208
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 20/126 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA--------- 161
A A+L KA +K N RR N T A + + GS + A L++A KA
Sbjct: 198 ANLTRANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRANLCKADLSKTYLRN 257
Query: 162 -----------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
NF+GADLS + R +L N+ A+L L+ + L A + GA+ S
Sbjct: 258 ITLNGSHLSGINFSGADLSGVDLSRKLLTGINMAEALLNEANLSGAYLMEANLSGANLSK 317
Query: 211 AVIDLA 216
A + LA
Sbjct: 318 ANLSLA 323
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 163
S F ADL ++ N A + E++ SG+ +GA L KA A
Sbjct: 266 SGINFSGADLSGVDLSRKLLTGINMAEALLNEANLSGAYLMEANLSGANLSKANLSLAYL 325
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS++ + + L++ANL+ A L + LT ++L GAI+ AD + A
Sbjct: 326 INADLSNSCLHEINLSKANLSKASLQKADLTGANLRGAILTEADLTGA 373
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 59/143 (41%), Gaps = 37/143 (25%)
Query: 111 AQFGSADLRKAVHVKENFRRANF--------------------TSADMRESDFSGSKFNG 150
A A+L KA +K N ANF T A++ +++ +G+
Sbjct: 143 AVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTGAELSKANLAGANLTR 202
Query: 151 AYLEKAVAYKAN---------------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
A L KA KAN G+DLS+ +DR L +A+L+ L L
Sbjct: 203 ANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRANLCKADLSKTYLRNITLNG 262
Query: 196 SDLGGAIIEGADFSDAVIDLAQK 218
S L G GAD S +DL++K
Sbjct: 263 SHLSGINFSGADLSG--VDLSRK 283
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANF 163
+ A+ A+L A + N +AN A++R ++ + + NGA L +A +AN
Sbjct: 186 TGAELSKANLAGANLTRANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRANL 245
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
ADLS T + + LN ++L+ L+ DL ++ G + ++A+++
Sbjct: 246 CKADLSKTYLRNITLNGSHLSGINFSGADLSGVDLSRKLLTGINMAEALLN 296
>gi|354569053|ref|ZP_08988212.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353539057|gb|EHC08553.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 519
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 1/117 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +A++R+A NF AN + A++R +D +G+ + A L +A AN GADL
Sbjct: 173 SGANCRNAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSEAKLSGANLIGADL 232
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 224
S+ + L A+LT A L++ +DL GA + GA +S + L + +C++
Sbjct: 233 SNANLTNASLVHADLTQAKLIKAEWVGADLSGATLTGAKLYSTSRFGLKTEGMICEW 289
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 70/144 (48%), Gaps = 20/144 (13%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKA----VHVKE-NFRRANFTSADMRES-----DF 143
L KYEA R F S DL +A V + E NF AN + ++ S DF
Sbjct: 7 LAKYEAGER---------DFRSVDLSEANLSGVKLNEANFSHANLSIVNLSGSHLCGTDF 57
Query: 144 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S ++ N A L A ++AN A L+ + R L+ A L +A L+R L R+DL A +
Sbjct: 58 SHAQINVARLSGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRADL 117
Query: 204 EGADFSDAVIDLAQ-KQALCKYAN 226
A+ + A + A + A+ +YAN
Sbjct: 118 FAANLNCADLREASLRHAILRYAN 141
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A + DL + N R A A++ S+FSG+ +GA L A AN + ADLS+
Sbjct: 160 ANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSE 219
Query: 171 TLMD--RMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ ++ L+ ANLTNA LV LT++ L A GAD S A +
Sbjct: 220 AKLSGANLIGADLSNANLTNASLVHADLTQAKLIKAEWVGADLSGATL 267
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 3/114 (2%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR ++ + N AN + D+ +D SG+ A + +A +NF+GA+LS
Sbjct: 140 ANLNEANLRDSLLTEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQAL 221
+ LN ANL+ A L L+ ++L GA + A+ ++A + DL Q + +
Sbjct: 200 ANLRWADLNGANLSWADLSEAKLSGANLIGADLSNANLTNASLVHADLTQAKLI 253
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 10/110 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGS-----KFNGAYLEKAVA 158
S AQ SA L +A ++ + RA N AD+RE+ + N A L ++
Sbjct: 93 SRAQLQSASLIRAELIRADLSRADLFAANLNCADLREASLRHAILRYANLNEANLRDSLL 152
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+AN GA+L++T + R + AN NA + + L+ S+ GA + GA+
Sbjct: 153 TEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSGANL 202
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 50/102 (49%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA ADLR+A R AN A++R+S + + GA L + + +GA+
Sbjct: 119 AANLNCADLREASLRHAILRYANLNEANLRDSLLTEANLEGANLNNTDLSRTDCSGANCR 178
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ M + L+ +N + A L L +DL GA + AD S+A
Sbjct: 179 NAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSEA 220
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L+ A ++ RA+ + AD+ ++ + + A L A+ AN A+L D
Sbjct: 90 ADLSRAQLQSASLIRAELIRADLSRADLFAANLNCADLREASLRHAILRYANLNEANLRD 149
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+L L EANL A L T L+R+D GA A+ A
Sbjct: 150 SL-----LTEANLEGANLNNTDLSRTDCSGANCRNAEMRQA 185
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 48/101 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A N RA+ + A ++ + ++ A L +A + AN ADL
Sbjct: 68 SGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRADLFAANLNCADL 127
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ + +L ANL A L ++LT ++L GA + D S
Sbjct: 128 REASLRHAILRYANLNEANLRDSLLTEANLEGANLNNTDLS 168
>gi|303289212|ref|XP_003063894.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454962|gb|EEH52267.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 124
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 65/138 (47%), Gaps = 32/138 (23%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL K + K + +RANF++ S+ SG G L A NFTGADLS+
Sbjct: 6 DLTKEFYTKGSMKRANFSN-----SNLSGVTLFGGDLSYA-----NFTGADLSN------ 49
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----------DFSDAVIDLAQKQALC-KY 224
AN+ L T+ T ++L GAI+ GA D++D ++ +C K
Sbjct: 50 ----ANIGQCNLTGTIFTNANLSGAIVSGANMDELGDITGSDWTDVIVRKDVNDKICAKG 105
Query: 225 ANGTNPITGVSTRKSLGC 242
+G NP+TG T +L C
Sbjct: 106 VSGENPVTGNPTAMTLFC 123
>gi|357146891|ref|XP_003574148.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Brachypodium distachyon]
Length = 227
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 7/128 (5%)
Query: 117 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
DLR + E N + ++A M ++ F G+ + KA A A+F G D ++ ++D
Sbjct: 104 DLRFCDYTNEKNNLKGKTLSAALMSDAKFDGADLTEVVMSKAYAVGASFKGTDFTNAVID 163
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 234
R +A+L A+ TVL+ S A ++ F D +I Q LC+ N
Sbjct: 164 RANFGKADLEGAIFKNTVLSGSTFDDANMKDVVFEDTIIGYIDLQKLCR-----NMSINE 218
Query: 235 STRKSLGC 242
R LGC
Sbjct: 219 DARLDLGC 226
>gi|58613539|gb|AAW79356.1| chloroplast thylakoid 11kDa protein [Heterocapsa triquetra]
Length = 91
Score = 54.7 bits (130), Expect = 5e-05, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQ 219
A+FTGA L+ ++ L A+LTNA++ + + L +GADF+D + Q+
Sbjct: 7 ADFTGAVLTQANLELAQLTGADLTNAIVTEAYINGTTKLEVKSADGADFTDTPLRKDQQM 66
Query: 220 ALCKYANGTNPITGVSTRKSLGC 242
LC A GTNP+T V TR+S+ C
Sbjct: 67 YLCGIAKGTNPVTKVDTRESMAC 89
>gi|152980852|ref|YP_001353914.1| pentapeptide repeat-containing protein [Janthinobacterium sp.
Marseille]
gi|151280929|gb|ABR89339.1| Uncharacterized conserved protein, pentapeptide repeat family
[Janthinobacterium sp. Marseille]
Length = 243
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 5/116 (4%)
Query: 96 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
+++ E AA A+LR A N R AN AD+R++D SG+ A L
Sbjct: 16 EHDIEDNTMLATVKAALAAGANLRDADLSGANLRGANLRDADLRDADLSGANLRDADLSG 75
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A A+ +GA+LSD L+ ANL+ A L L ++LGGA + GAD S A
Sbjct: 76 ANLRDADLSGANLSDA-----DLSGANLSGADLSGANLGGANLGGANLSGADLSGA 126
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR A N R A+ + A++R++D SG+ + A L A N +GADLS
Sbjct: 51 ANLRDADLRDADLSGANLRDADLSGANLRDADLSGANLSDADLSGA-----NLSGADLSG 105
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ L ANL+ A L L+ ++L GA + GA+ D
Sbjct: 106 ANLGGANLGGANLSGADLSGANLSGANLRGANLSGANLRD 145
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 46/98 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A + AN + AD+ ++ SG+ +GA L A AN +GADL
Sbjct: 64 SGANLRDADLSGANLRDADLSGANLSDADLSGANLSGADLSGANLGGANLGGANLSGADL 123
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
S + L ANL+ A L + D+ A+ E A
Sbjct: 124 SGANLSGANLRGANLSGANLRDYPVKIKDIHKAVYEAA 161
>gi|427728370|ref|YP_007074607.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364289|gb|AFY47010.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 238
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 15/116 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F ADLR+ K N +A SAD+ ES G + A L +A+ +A+ TGA L
Sbjct: 34 TGADFSYADLRQTRLGKTNLSQACLQSADLSESILWGIDLSAADLYRAILREADLTGAKL 93
Query: 169 SDTLMD--RMV--------LNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFS 209
T ++ ++ LN ANL+N++L + L R+DLG A++ GAD S
Sbjct: 94 VKTRLESANLIKASLCGANLNGANLSNSLLFQADLRPSSNQRTDLGYAVLSGADLS 149
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +L++A N A+F+ AD+R++ + + A L+ A ++ G DLS
Sbjct: 18 FQRVNLQEAELTNVNLTGADFSYADLRQTRLGKTNLSQACLQSADLSESILWGIDLSAAD 77
Query: 173 MDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVI 213
+ R +L EA+LT A LV+T L ++ L GA + GA+ S++++
Sbjct: 78 LYRAILREADLTGAKLVKTRLESANLIKASLCGANLNGANLSNSLL 123
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 61/134 (45%), Gaps = 24/134 (17%)
Query: 105 FGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN- 162
+GI SAA A LR+A + SA++ ++ G+ NGA L ++ ++A+
Sbjct: 69 WGIDLSAADLYRAILREADLTGAKLVKTRLESANLIKASLCGANLNGANLSNSLLFQADL 128
Query: 163 --------------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS--------DLGG 200
+GADLS + L+ ANL A L R L ++ DL G
Sbjct: 129 RPSSNQRTDLGYAVLSGADLSYADLRATSLHHANLDRAKLCRANLGKTIQWGNLAADLTG 188
Query: 201 AIIEGADFSDAVID 214
A ++GAD S A +D
Sbjct: 189 ASLQGADLSYANLD 202
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 8/106 (7%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF--------TGAD 167
ADLR + + + + A + AD+ +D + + A L++A +AN AD
Sbjct: 126 ADLRPSSNQRTDLGYAVLSGADLSYADLRATSLHHANLDRAKLCRANLGKTIQWGNLAAD 185
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L+ + L+ ANL +A+L + L +DL GAI+ A+ A++
Sbjct: 186 LTGASLQGADLSYANLDSAILRKANLRGADLTGAILTDAELEGAIM 231
>gi|434392029|ref|YP_007126976.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428263870|gb|AFZ29816.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 532
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 10/115 (8%)
Query: 110 AAQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
A Q +A+L + + NF A+ + AD+R++D SG+ GA L A
Sbjct: 310 ATQLNNANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANLT 369
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
GADLS+ + +LN A L NA++ +T LT +D A + GAD +A+ D
Sbjct: 370 GVELDGADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNATLTGADLKEAIGD 424
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 16/117 (13%)
Query: 111 AQFGSADLRKAV-HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 164
A ADL++A+ NF AN A + F GS F A L KANFT
Sbjct: 411 ATLTGADLKEAIGDSLTNFTGANLNGASLEVGSFIGSNFTDAALRDTNLIKANFTDALFI 470
Query: 165 --------GADL-SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
GADL S T +D + +N + NA+LV LT+++ GA + GA+ S A+
Sbjct: 471 DGSDANSVGADLTSSTFIDGIAIN-GDFRNALLVNANLTKANFTGANLAGANLSGAI 526
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 73/170 (42%), Gaps = 28/170 (16%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS--------------- 135
LA LN +A+ G A+F A L + N NF+S
Sbjct: 263 LAGLNLADADLTG-------ARFNGAILNNFIGGDLNLSGVNFSSFVASNGQVFATQLNN 315
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
A++ +S G+ F+ E A+ +GADL D + L ANL+ A L L
Sbjct: 316 ANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANLTGVELDG 375
Query: 196 SDLGGAIIEGADFSDAVID--LAQKQAL--CKYANGTNPITGVSTRKSLG 241
+DL A + GA + AV+D L QK L + N T +TG ++++G
Sbjct: 376 ADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNAT--LTGADLKEAIG 423
>gi|254409899|ref|ZP_05023679.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196182935|gb|EDX77919.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 478
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 19/151 (12%)
Query: 95 NKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF---------------TSA 136
N EA RG F G+ A +ADL ++ NFR A F + A
Sbjct: 141 NLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSGA 200
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
++R +D SG+ + A L +A AN TGADL+ + L A+LT A L+ +
Sbjct: 201 NLRWTDLSGANLSWANLSEAKLSGANLTGADLTHANLLNTSLVHADLTQARLIHADWIGA 260
Query: 197 DLGGAIIEGADFSD-AVIDLAQKQALCKYAN 226
DL GA + GA + + L + +C++ +
Sbjct: 261 DLTGATLTGAKLHGVSRVGLKTQGIVCEWVD 291
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 3/129 (2%)
Query: 91 LADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
L++ N A G + IG S A+ A L A K N +AN A++ +D G++
Sbjct: 37 LSEANLSVANLSGAYLIGTNLSRARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQ 96
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
A + +A +A +GA L++ + L EA L +A L R L+ ++L GA + GA+
Sbjct: 97 LTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRGAFVTGAN 156
Query: 208 FSDAVIDLA 216
A ++ A
Sbjct: 157 LEGANLNAA 165
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 66/139 (47%), Gaps = 8/139 (5%)
Query: 87 NISALADLNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
N++A+ L KY A + G A A +L A + N AN + A + ++ S
Sbjct: 2 NVTAI--LKKYAAGVKNFSGANLAEANLSGINLSGADLSEANLSVANLSGAYLIGTNLSR 59
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-- 203
++ N A L A KAN T A+L+ + R L A LT A ++R L R+ L GA +
Sbjct: 60 ARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQLTQAAMIRAELIRAKLSGATLTE 119
Query: 204 ---EGADFSDAVIDLAQKQ 219
GAD +A + A+ Q
Sbjct: 120 ANLSGADLREAALRDAKLQ 138
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G A L +A ++ RA + A + E++ SG+ A L A +AN + A+L
Sbjct: 90 ADLGGAQLTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRG 149
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L ANL A L R+ L+ S+ A + A+ S
Sbjct: 150 AFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLS 188
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+A +RAN + A++R + +G+ GA L A +++ + ++
Sbjct: 120 ANLSGADLREAALRDAKLQRANLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRH 179
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L+ ANL A L L +DL GA + A+ S+A
Sbjct: 180 AEFKQANLSCANLAGADLSGANLRWTDLSGANLSWANLSEA 220
>gi|334117106|ref|ZP_08491198.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461926|gb|EGK90531.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 520
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
L KY A R GI + A +L A N AN + A++ +++ G+K N A
Sbjct: 7 LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIAR 66
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
L A A+ T ADL+ + R+ L +A L A L+R L R++L GA + GA+ S A
Sbjct: 67 LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126
Query: 213 I 213
+
Sbjct: 127 L 127
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+KA+ + RA A++ ++ SG+ +GA L +A AN A+L +
Sbjct: 91 DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L EANL A L L+R+DL GA + G + A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A + R AN A++R + SG+ A LE+A A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 211
S + L +ANLT AVL L+ +L AI+ G AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 53/120 (44%), Gaps = 24/120 (20%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHV------KENFRRANFTSADMRESDFSGSKFNGA 151
EA RG A A+LR A H+ + N +AN AD+ +D SG+ G
Sbjct: 129 EATLRG-------ANLAQANLRGA-HLSGACLTEANLEQANLQGADLSRADLSGADLRGT 180
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L +A +A +GADLS NL A+L L +DL A + GAD S A
Sbjct: 181 ELRQANLTQAVLSGADLSGV----------NLRWAILSGCNLRWADLSEAKLSGADLSRA 230
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G ADL A + A D++++ G+K A L +A AN +GA+L
Sbjct: 68 SGAHLGGADLTDA-----DLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122
Query: 169 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L +ANL A L LT ++L A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 15/85 (17%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NF ++ E++ SG +GA L+ A AN +GA+LS T + LN A L+
Sbjct: 16 NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIARLS------- 68
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLA 216
GA + GAD +DA +++A
Sbjct: 69 --------GAHLGGADLTDADLNVA 85
>gi|428215879|ref|YP_007089023.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004260|gb|AFY85103.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 284
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 62/128 (48%), Gaps = 26/128 (20%)
Query: 109 SAAQFGSADLRKAVHVKE------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
S A+L++A H+ E N RAN + D+ E+D SG AN
Sbjct: 133 SGINLSGANLQEA-HIAEVSFHNANLSRANLSGLDLSETDLSG---------------AN 176
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
+ ADLSDT + +L ANLT A+L L + + G++++GAD S A + + A
Sbjct: 177 LSYADLSDTQLTEAILYGANLTGAILTSAQLDGAKMNGSLVDGADLSQANL----QDAEV 232
Query: 223 KYANGTNP 230
K+ + TN
Sbjct: 233 KWVDLTNA 240
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 54/105 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S +QF SA L+ A V+ N + +AD+R +D S + G+ L +A + N TGA+L
Sbjct: 43 SHSQFCSAILQGATLVEANLEQTKLRAADLRRADLSHANLMGSDLSRADMIETNLTGANL 102
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ ++ +E +A L R L +L G + GA+ +A I
Sbjct: 103 EQANLTEVIFSEVIFADANLSRANLQGLNLSGINLSGANLQEAHI 147
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 5/113 (4%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
++ D+ DFS + + G + NF GAD+ + ++ ANL + V V +
Sbjct: 2434 DYAGIDISGQDFSNASYKGKDFTQV---NTNFEGADVRGVSFEDTSMDNANLKDIVAVGS 2490
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 242
+S + +E DF+DA I + +C + GTNP TG TR SL C
Sbjct: 2491 YFGQSLVDVKTLENGDFTDATIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 2543
>gi|381206177|ref|ZP_09913248.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 210
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 71/145 (48%), Gaps = 6/145 (4%)
Query: 98 EAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
EA+ G +G+ + A L++A N AN + A++ E++ G+ G L
Sbjct: 42 EADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANLSEANLSEANLFGANLTGTNLT 101
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+A +A+ + ADLS+ + L+EAN + A L RT L ++L A + GAD A D
Sbjct: 102 EANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSA--D 159
Query: 215 LAQKQALCKYANGTNPITGVSTRKS 239
L + + Y N N + G RK+
Sbjct: 160 LREAVLVAAYLNEAN-LDGADMRKA 183
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 48/95 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL A + N AN + A+ +++ S + L+KA A+ ADL
Sbjct: 101 TEANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSADL 160
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
+ ++ LNEANL A + + L R+ +GGAI+
Sbjct: 161 REAVLVAAYLNEANLDGADMRKANLYRASMGGAIL 195
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 137 DMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
D+ E+D GS GA L A +AN T A+LS+ + L+EANL A L T
Sbjct: 39 DLSEADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANLSEANLSEANLFGANLTGT 98
Query: 192 VLTRSDLGGAIIEGADFSDA 211
LT ++L A + AD S+A
Sbjct: 99 NLTEANLSEADLSWADLSEA 118
>gi|428311554|ref|YP_007122531.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253166|gb|AFZ19125.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 411
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 68/133 (51%), Gaps = 9/133 (6%)
Query: 77 AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA 136
A +VAS S+ + L D N ++A G A+ G A+LR A N A +
Sbjct: 65 ADLIVASLSA--ADLRDANLHDANLIG-------AKLGVANLRDADLSGANLSGAELSCT 115
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
D+ S+ +G+ +GA L KA +AN GA+LS T M L+ ANL A L L +
Sbjct: 116 DLTCSNLNGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGANLIEA 175
Query: 197 DLGGAIIEGADFS 209
DLGGA ++GA S
Sbjct: 176 DLGGANLQGAKLS 188
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A A+L KA + N + AN + +M +D SG+ GA L A +A+ GA+L
Sbjct: 123 NGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGANLIEADLGGANL 182
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ R L NL N+ L L+ S+L G + AD +
Sbjct: 183 QGAKLSRSNLAYVNLANSDLSNADLSDSNLAGTNLTNADLDN 224
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 51/119 (42%), Gaps = 10/119 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL----------EKAVAYK 160
A DLR N + + AD+R++D G+ GA L A +
Sbjct: 25 ADLRGVDLRGIDLSDANLSDTDLSDADLRDADLIGANLRGADLIVASLSAADLRDANLHD 84
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
AN GA L + L+ ANL+ A L T LT S+L GA I GA+ A + A Q
Sbjct: 85 ANLIGAKLGVANLRDADLSGANLSGAELSCTDLTCSNLNGAYISGANLIKAKLSRANLQ 143
>gi|443475539|ref|ZP_21065485.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019605|gb|ELS33670.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 222
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 64/118 (54%), Gaps = 15/118 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VAYK 160
A A+L + + + +F RA+ T A++ ++D S + A L KA +A
Sbjct: 88 ANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDSSIATD 147
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVI 213
ANF+ A +++T + +VL+ ANL+NA + + LT SDL GA GAD S++V+
Sbjct: 148 ANFSNAIVTETTLKSIVLSRANLSNADFSNSKMRNSRLTNSDLRGAKFGGADLSNSVM 205
Score = 43.5 bits (101), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYK----- 160
A A+L A+ R AN + A++ E DFS + A L KA Y
Sbjct: 68 ANLSGANLSGALLNDSKLRGANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSS 127
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN T A+L + +D + +AN +NA++ T L L A + ADFS++
Sbjct: 128 ANLTKANLKSSTLDSSIATDANFSNAIVTETTLKSIVLSRANLSNADFSNS 178
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 16/120 (13%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ + AD+ D GS NGA L A N +GA L+D+ + L+ ANL+ +L+
Sbjct: 49 DLSGADLSYKDLYGSALNGANLSGA-----NLSGALLNDSKLRGANLSRANLSEGILMGV 103
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQ-----------KQALCKYANGTNPITGVSTRKSL 240
+R+DL A + AD ++++ A ++ AN +N I +T KS+
Sbjct: 104 DFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDSSIATDANFSNAIVTETTLKSI 163
>gi|334120837|ref|ZP_08494914.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333455836|gb|EGK84476.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 197
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 59/111 (53%), Gaps = 6/111 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF----- 163
S A A+L++AV ++ N R A+ + AD+R +DF + GA A+ A+F
Sbjct: 42 SGANLAGANLQRAV-LRANLRGADLSGADLRGADFRNADLRGASFANALVRDASFGGAFL 100
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
TGA + + + + L A+L A L R +L +DL A + GAD S A ++
Sbjct: 101 TGASIGNLDLSGVDLRGADLRGAALARAILHSADLSNANLSGADLSGADLE 151
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 54/114 (47%), Gaps = 20/114 (17%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 160
A F +ADLR A R A+F A D+ D G+ GA L +A+ +
Sbjct: 73 ADFRNADLRGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHS 132
Query: 161 ANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
A+ + GADL + +++ VL ANLT A L+ ++ ++ GA+++
Sbjct: 133 ADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCAMIEQTLWDGALLD 186
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR A + A+ ++A++ +D SG+ A L AV AN TGA+L ++++
Sbjct: 118 ADLRGAALARAILHSADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCAMIEQ 177
Query: 176 MVLNEANLTNAVLVRTVLTR 195
+ + A L A L T L+R
Sbjct: 178 TLWDGALLDRACLQGTPLSR 197
>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 332
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 7/125 (5%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
Y A+ RG I ADLR A +K N R AN ++RE+D G+ +GA L A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201
Query: 157 VAYKANFTGADLSDTLM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ N GA+L ++ +R +L+EA+LT L V+ L A + G + S A
Sbjct: 202 FLTEVNLMGANLRGAILKNVKLERAILSEADLTGVNLQGAVMPDVRLSKAQVSGGNLSFA 261
Query: 212 VIDLA 216
++ A
Sbjct: 262 RLNRA 266
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A + RANF R+++ G++ +GA L +A N + ADL +
Sbjct: 40 ADLHGATLIFAYLSRANF-----RKANLVGTRLSGANLNQAWLSGVNLSNADLHGASLQS 94
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANLT A L+ L +DL GA + GAD + A
Sbjct: 95 ADLRSANLTLASLLDANLMDADLRGANLSGADLTGA 130
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 52/118 (44%), Gaps = 20/118 (16%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGS--------------------KFNGAYLEK 155
A+LR A+ RA + AD+ + G+ + N A L +
Sbjct: 211 ANLRGAILKNVKLERAILSEADLTGVNLQGAVMPDVRLSKAQVSGGNLSFARLNRADLSR 270
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+AN + +DL + + R L ANL+NA L R L+ ++L GA ++GA D I
Sbjct: 271 TNLREANLSDSDLIEAYLARTNLMGANLSNANLTRAELSTTNLMGANLQGATMPDGRI 328
>gi|307592031|ref|YP_003899622.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985676|gb|ADN17556.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 161
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ AD+ E D +G K GA L KA Y AN +GA LS + L+ ANL+ + L +
Sbjct: 49 DLVEADLHEKDLAGVKLYGADLSKAKLYGANLSGASLSGANLSGASLSGANLSGSYLQKA 108
Query: 192 -----VLTRSDLGGAIIEGADFSDAVI 213
L +++L GA + GAD SDAV+
Sbjct: 109 NLKGAYLQKANLEGAALYGADLSDAVL 135
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 5/99 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL KA N A+ + A++ + SG+ +G+YL+KA N GA L ++
Sbjct: 68 ADLSKAKLYGANLSGASLSGANLSGASLSGANLSGSYLQKA-----NLKGAYLQKANLEG 122
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L A+L++AVL L + L GA +EGA A+ D
Sbjct: 123 AALYGADLSDAVLYGANLKGAKLKGANLEGAKTKGAIFD 161
>gi|119490886|ref|ZP_01623169.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119453704|gb|EAW34863.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 517
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
I +AA ADLR+A + + R+AN SA++R++ S G L A +A+ G
Sbjct: 115 AILTAANLSEADLREATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRADLRG 174
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L + + + L++ANL+ A L L +DL GA + GA+ A
Sbjct: 175 ANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGANLEQA 220
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 10/110 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTS----------ADMRESDFSGSKFNGAYLEKAVAYK 160
A ADLR A V R+AN + A++R +D +G+ GA LE+A
Sbjct: 165 ADLTRADLRGANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGANLEQARLSG 224
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A+ GADLS + L A+LT A L T ++L GA + GA D
Sbjct: 225 ASLYGADLSHASLLYTHLIHADLTQANLTGADWTGAELTGAALTGAKLYD 274
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 45/80 (56%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + A++ ++ S +K N A L A AN TGA L+ + R L+ A L NA +R+
Sbjct: 46 NLSGANLSGTNLSQAKLNVAKLSGANLSGANLTGAILNVANLIRADLSHATLINASAIRS 105
Query: 192 VLTRSDLGGAIIEGADFSDA 211
L R+DL AI+ A+ S+A
Sbjct: 106 ELIRADLSHAILTAANLSEA 125
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SA+LR AV + N N AD+ +D G+ A L +A +AN +GA+L
Sbjct: 140 ANLKSANLRDAVLIASNLEGTNLHGADLTRADLRGANLVNAELRQANLSQANLSGANLKG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ LN A+L A L ++ L GA + GAD S A
Sbjct: 200 ANLRWADLNGADLRGA-----NLEQARLSGASLYGADLSHA 235
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A+ N RA+ + A + + S+ A L A+ AN + ADL
Sbjct: 68 SGANLSGANLTGAILNVANLIRADLSHATLINASAIRSELIRADLSHAILTAANLSEADL 127
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + ++ L +ANL +A L VL S+L G + GAD + A
Sbjct: 128 REATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRA 170
>gi|440681606|ref|YP_007156401.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678725|gb|AFZ57491.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 943
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 13/108 (12%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+FG A ADL +A NF R N + A M ++FS + FN A L +A +AN
Sbjct: 808 DFG---GANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANL 864
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
TGADLS ++ L+ A+L+ A +L GA +E A+FS A
Sbjct: 865 TGADLSSADLNYADLSLADLSGA----------NLSGANLEDANFSGA 902
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 41/72 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F A+L +A ++ N A+ +SAD+ +D S + +GA L A ANF+GA L
Sbjct: 845 SEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSGANLSGANLEDANFSGAKL 904
Query: 169 SDTLMDRMVLNE 180
S+ L+ + +E
Sbjct: 905 SNGLLGDICWDE 916
Score = 45.1 bits (105), Expect = 0.032, Method: Composition-based stats.
Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 87 NISALADLNKYEAETRGE-FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE----- 140
+ + L D+ Y GE F + +LR A + +F AN + AD+
Sbjct: 767 DTTQLRDIINYSNCLSGENFNQIVGSFLSGTNLRGADLSEVDFGGANLSHADLSRANLNC 826
Query: 141 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
++FS + +GAY+ A +A F A+L + R L A+L++A L L+ +DL G
Sbjct: 827 ANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSG 886
Query: 201 AIIEGADFSDA 211
A + GA+ DA
Sbjct: 887 ANLSGANLEDA 897
>gi|428223553|ref|YP_007107650.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427983454|gb|AFY64598.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 521
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 60/128 (46%), Gaps = 11/128 (8%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
E +G G FG DLR+A + N + + AD+ ++ SG+ +GA L A
Sbjct: 5 ELLKRYGAGER-NFGGMDLREANLSRANLSHIDLSGADLSVANLSGANLSGADLRGARLN 63
Query: 160 KANFTGADLSDTLMDRMVLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFS 209
A +GA+LS + +LN ANL A L+R L R+DL A ++ AD
Sbjct: 64 VAKLSGANLSGANLSSCILNVANLVRADLTGANLNQAALIRAELMRADLKQATLDSADLG 123
Query: 210 DAVIDLAQ 217
A + AQ
Sbjct: 124 GAQLQEAQ 131
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 165
A F A+LR A + N RANF +A++R ++ S G+ +GA L A A G
Sbjct: 155 AVFDQANLRGADLNRANATRANFRNAELRLANLSEILLIGADLHGANLRWANLTGARLRG 214
Query: 166 ADLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
ADL++ + L NLT+A L+ L+R++L G GA+ + A + A+
Sbjct: 215 ADLTEAKLSGAAIVGADLRNVNLTHASLIHADLSRANLIGTDWIGAELTGATLTGAKLHG 274
Query: 221 LCKY 224
+ +Y
Sbjct: 275 VSRY 278
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 74/164 (45%), Gaps = 30/164 (18%)
Query: 76 LAAAVVASCSSNISAL-------ADLNKYEAETRGEF-------GIGSAAQFGSADLRKA 121
L+ A ++SC N++ L A+LN+ A R E +A G A L++A
Sbjct: 72 LSGANLSSCILNVANLVRADLTGANLNQ-AALIRAELMRADLKQATLDSADLGGAQLQEA 130
Query: 122 VHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+ NF RAN F A + ++ F + GA L +A A +ANF A+L + +
Sbjct: 131 QLHQANFSRANLSEVNFHRATLADAVFDQANLRGADLNRANATRANFRNAELRLANLSEI 190
Query: 177 V----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ L ANLT A L LT + L GA I GAD +
Sbjct: 191 LLIGADLHGANLRWANLTGARLRGADLTEAKLSGAAIVGADLRN 234
Score = 44.3 bits (103), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 5/112 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I + A ADL A + RA AD++++ + GA L++A ++ANF+ A
Sbjct: 81 ILNVANLVRADLTGANLNQAALIRAELMRADLKQATLDSADLGGAQLQEAQLHQANFSRA 140
Query: 167 DLSDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+LS+ R V ++ANL A L R TR++ A + A+ S+ ++
Sbjct: 141 NLSEVNFHRATLADAVFDQANLRGADLNRANATRANFRNAELRLANLSEILL 192
>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 305
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 25/137 (18%)
Query: 90 ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK- 147
A+ DL NKY+A R S + DLR + NF+ A+F+ A++RE DFSG+
Sbjct: 6 AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61
Query: 148 ----FN---------------GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
FN G+YL KA K N A+LS + L+++NLTNA L
Sbjct: 62 SEAFFNEADLTGANLQEANLQGSYLMKAYLMKTNLQSANLSKAYLTGAYLSKSNLTNANL 121
Query: 189 VRTVLTRSDLGGAIIEG 205
L S L GA + G
Sbjct: 122 TGAYLNGSKLNGADLTG 138
>gi|194337742|ref|YP_002019536.1| pentapeptide repeat-containing protein [Pelodictyon
phaeoclathratiforme BU-1]
gi|194310219|gb|ACF44919.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 408
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 5/98 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A+ K N R+++FT S +G+ G++++ AV +AN GA+L
Sbjct: 96 ADLKGANLTMALIKKANLRKSDFTG-----SSLTGANLQGSFMKGAVLREANLEGANLRW 150
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+++ LN ANLT A L L +DL GA ++ A F
Sbjct: 151 AMLENGDLNRANLTGATLFEANLAGADLKGANLKNAHF 188
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 48/87 (55%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+F +A+ A+M+ + G+ GA L++A A+ + ++LS+ L+ L ANL+ A
Sbjct: 282 DFHKADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLSNALLYGAKLGNANLSGA 341
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
L L +DL GA +EGA+ A I
Sbjct: 342 NLEGASLFEADLEGANLEGANLKGANI 368
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLT 184
R + + ++ +G+ F+ A L KA A GADL +DR L A NL+
Sbjct: 265 RTRVEQSSFQNTNMAGADFHKADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLS 324
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
NA+L L ++L GA +EGA +A ++
Sbjct: 325 NALLYGAKLGNANLSGANLEGASLFEADLE 354
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 44/96 (45%), Gaps = 20/96 (20%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL+ A N A A++R+SDF+GS + TGA+L + M
Sbjct: 96 ADLKGA-----NLTMALIKKANLRKSDFTGS---------------SLTGANLQGSFMKG 135
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
VL EANL A L +L DL A + GA +A
Sbjct: 136 AVLREANLEGANLRWAMLENGDLNRANLTGATLFEA 171
>gi|300866933|ref|ZP_07111605.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335037|emb|CBN56767.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 253
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 10/119 (8%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N+ + E E G Q DLR A N R AN AD+R ++ G+ GA L
Sbjct: 26 NRRDVEKLKETG-----QCSRCDLRDA-----NLRNANLQGADLRNANLRGANLRGAALR 75
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A A+ GADL D + R L ANL++A L L R+++ G +G D A +
Sbjct: 76 NADLSNADLRGADLRDADLSRSNLRNANLSDANLRNADLERAEVRGVNFQGTDLRGANV 134
>gi|428219623|ref|YP_007104088.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427991405|gb|AFY71660.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 172
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR A N R A +A +R SD +G+ A L A AN TGADL+ M+
Sbjct: 69 DLRGA-----NLRGAFLKNARLRGSDLTGADLRDATLTGAYFTGANLTGADLAGAEMEWA 123
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L +ANL +A L L+RSDL GA ++GAD A
Sbjct: 124 NLRDANLQDANLQDANLSRSDLDGANLDGADLRGA 158
>gi|428318916|ref|YP_007116798.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428242596|gb|AFZ08382.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 568
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 76/158 (48%), Gaps = 13/158 (8%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALA----DLNKYEAETRGEFGIGSAAQFGS----ADL 118
NW F L A + S+ LA D K A E + A+ G+ A+L
Sbjct: 103 NWAAFPEADLGGANLQRVKSDQINLAGAKLDGAKLMAAELMEANLNRASLVGANLTGANL 162
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
R+A V+ N R A ++ E+D +G++ A L A ++A GADL++ ++D L
Sbjct: 163 REAHLVEANLRSAILLGVNLIEADLNGAQMRSANLAGADLHRAVLAGADLTEAVLDNADL 222
Query: 179 NEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDA 211
+ ANL + L+ + +L R++L G + AD S+A
Sbjct: 223 SRANLAGSYLLKASFQKALLLRANLQGVYLLRADLSEA 260
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 59/136 (43%), Gaps = 30/136 (22%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVA 158
S A ADLR A A F AD+ + + +G+K +GA L A
Sbjct: 83 SGANLAKADLRLACLEAAELNWAAFPEADLGGANLQRVKSDQINLAGAKLDGAKLMAAEL 142
Query: 159 YKANF-----TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA------------ 201
+AN GA+L+ + L EANL +A+L+ L +DL GA
Sbjct: 143 MEANLNRASLVGANLTGANLREAHLVEANLRSAILLGVNLIEADLNGAQMRSANLAGADL 202
Query: 202 ---IIEGADFSDAVID 214
++ GAD ++AV+D
Sbjct: 203 HRAVLAGADLTEAVLD 218
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 56/127 (44%), Gaps = 15/127 (11%)
Query: 111 AQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
A A+LR A+ + N R AN AD+ + +G+ A L+ A +
Sbjct: 165 AHLVEANLRSAILLGVNLIEADLNGAQMRSANLAGADLHRAVLAGADLTEAVLDNADLSR 224
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS-----DAVIDL 215
AN G+ L + +L ANL L+R L+ ++L A + AD S DA++
Sbjct: 225 ANLAGSYLLKASFQKALLLRANLQGVYLLRADLSEANLRSADLRKADLSGAYLMDAMLGE 284
Query: 216 AQKQALC 222
A +A C
Sbjct: 285 ADLRAAC 291
>gi|344171276|emb|CCA83758.1| hypothetical protein, Pentapeptide repeat domains [blood disease
bacterium R229]
Length = 325
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 51/103 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A A + AD+ +D SG+ +GAYL A AN +GADL
Sbjct: 52 SGADLSGADLSGAYLSGAYLSGAYLSDADLSGADLSGADLSGAYLSGAYLSGANLSGADL 111
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L+ A+L+ A L L+ + L GA + GAD S A
Sbjct: 112 SGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGA 154
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A A + AD+ +D SG+ +GAYL A AN +GADL
Sbjct: 122 SGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADLSGAYLSGAYLSSANLSGADL 181
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S ANL+ A L L+ +DL GA + GA+ S A +
Sbjct: 182 SG----------ANLSGANLSGAYLSSADLSGANLSGANLSGAYL 216
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A + A+ + AD+ + SG+ +GAYL A A+ +GADL
Sbjct: 102 SGANLSGADLSGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADL 161
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L+ ANL+ A L L+ ++L GA + AD S A
Sbjct: 162 SGAYLSGAYLSSANLSGADLSGANLSGANLSGAYLSSADLSGA 204
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L A + AN + AD+ +D SG+ +GAYL A A +GADL
Sbjct: 92 SGAYLSGAYLSGANLSGADLSGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADL 151
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + L+ A L+ A L L+ +DL GA + GA+ S A +
Sbjct: 152 SGADLSGADLSGAYLSGAYLSSANLSGADLSGANLSGANLSGAYL 196
>gi|428314781|ref|YP_007150965.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256164|gb|AFZ22121.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 237
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 5/96 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F ADL A + N +AN + A ++R++D +G+K A L A A+ TGA+
Sbjct: 130 FQGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATLVGAHMTGAN 189
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
L+ + VL A+LT AVL++T L +DL AI+
Sbjct: 190 LTGANFNNAVLRYADLTKAVLIKTNLKGADLSLAIM 225
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 57/110 (51%), Gaps = 10/110 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANF 163
S ADLR + RAN + ++++ SG++ N A L + A + +F
Sbjct: 76 SGLDLSGADLRNT-----DLSRANLKNTKLKDAKMSGARLNQANLTYADLDGADFQECDF 130
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GADLS+ + L +ANL+ A L RT L +DL GA +E A+ S+A +
Sbjct: 131 QGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATL 180
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 58/123 (47%), Gaps = 4/123 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A + L+ A +AN T AD+ +DF F GA L A N A+L
Sbjct: 91 SRANLKNTKLKDAKMSGARLNQANLTYADLDGADFQECDFQGADLSNAQLLNTNLAKANL 150
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S ++R L +A+LT A L L+ + L GA + GA+ + A + A+ +YA+ T
Sbjct: 151 SMATLNRTELRDADLTGAKLESANLSNATLVGAHMTGANLTGANFN----NAVLRYADLT 206
Query: 229 NPI 231
+
Sbjct: 207 KAV 209
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 12/85 (14%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
D+RE + SG +GA L +AN L D M LN+ANLT A
Sbjct: 69 DLREINLSGLDLSGADLRNTDLSRANLKNTKLKDAKMSGARLNQANLTYA---------- 118
Query: 197 DLGGAIIEGADFSDAVIDLAQKQAL 221
DL GA + DF A DL+ Q L
Sbjct: 119 DLDGADFQECDFQGA--DLSNAQLL 141
>gi|383312720|ref|YP_005365521.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
str. GAT-30V]
gi|378931380|gb|AFC69889.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
str. GAT-30V]
Length = 958
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ SADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKSADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVI-DLAQKQALCKYA 225
+ + EAN NA++ R LT++D A++E AD ++A+ ++ KQA K A
Sbjct: 610 IAQNINAKEANFKNAIMQRADLTKADFTKAVLENADMQAVAAAEAIFKEVNLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 37.7 bits (86), Expect = 6.3, Method: Composition-based stats.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ +A L KA N A + + +E++ F A +++A KA+FT A L +
Sbjct: 589 AKLSNATLEKAEAEGLNISDAIAQNINAKEAN-----FKNAIMQRADLTKADFTKAVLEN 643
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
M + EA L + L ++L G EGADF A I+ A K
Sbjct: 644 ADMQAVAAAEAIFKEVNLKQANLKAANLAGINQEGADFDKAKINDATK 691
>gi|282896932|ref|ZP_06304938.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
gi|281198341|gb|EFA73231.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
Length = 689
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 58/105 (55%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQ ADL A + + + + +++ ++++ G+ + +YL A ANF+ A+L
Sbjct: 536 SGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQGADLSESYLNHANLNSANFSAANL 595
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S +L AN+TNA L ++R+DL GA +EG DF A++
Sbjct: 596 SGA-----ILRSANMTNANLRNADISRADLRGANLEGTDFQGAIL 635
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 52/117 (44%), Gaps = 19/117 (16%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAY---------LEKAVAYKANFTGADLSDTLMDRM-- 176
+ AN A + S F +G + L KA +N + A+LS LM R+
Sbjct: 431 LKSANLNQASFKSSRFRSVGEDGRWDTYDDIIADLSKAQLKGSNLSSANLSRVLMSRVDL 490
Query: 177 ---VLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
VLN ANL N+ L+ R L SDL AI++ A + A I AQ Q YA
Sbjct: 491 SFSVLNRANLANSKLIGANLSRAQLVGSDLQQAILQDAILTGADISGAQLQEADLYA 547
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 65/131 (49%), Gaps = 14/131 (10%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSG 145
+ADL+K A+ +G + SA+L + + + + RAN ++ + ++ S
Sbjct: 462 IADLSK--AQLKG-------SNLSSANLSRVLMSRVDLSFSVLNRANLANSKLIGANLSR 512
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
++ G+ L++A+ A TGAD+S + L A L + + L+ S+L +G
Sbjct: 513 AQLVGSDLQQAILQDAILTGADISGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQG 572
Query: 206 ADFSDAVIDLA 216
AD S++ ++ A
Sbjct: 573 ADLSESYLNHA 583
>gi|443651776|ref|ZP_21130709.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159027471|emb|CAO89436.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334417|gb|ELS48929.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 931
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 2/117 (1%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
G A+L +A + + AN A++ ++ G+ GA L A +AN GA+L
Sbjct: 789 LGGANLERANLAEADIGGANLEGANLEGANLKGANLEGANLAMAFLKRANLEGANLRGAN 848
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
++ L ANL A L R L ++L GA + GA+ A +D A + Y G N
Sbjct: 849 LEEAYLEGANLAMAFLKRANLEGANLRGANLYGANLKGANLDWANLEG--AYLEGAN 903
Score = 45.4 bits (106), Expect = 0.029, Method: Composition-based stats.
Identities = 42/129 (32%), Positives = 56/129 (43%), Gaps = 5/129 (3%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
E E IG A G A+L A N AN A ++ ++ G+ GA LE+A
Sbjct: 795 ERANLAEADIGGANLEG-ANLEGANLKGANLEGANLAMAFLKRANLEGANLRGANLEEAY 853
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
AN A L ++ L ANL A L L ++L GA +EGA+ +D A
Sbjct: 854 LEGANLAMAFLKRANLEGANLRGANLYGANLKGANLDWANLEGAYLEGANLRGVFLDGAN 913
Query: 218 KQALCKYAN 226
KYAN
Sbjct: 914 ----FKYAN 918
>gi|393766611|ref|ZP_10355166.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
gi|392727929|gb|EIZ85239.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
Length = 448
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 185
A F A MR +D SG+ +GA +A + A+F+GAD DT+ +D L +ANLT+
Sbjct: 133 ARFGQAAMRFADLSGALLDGASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTH 192
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A LT++ L G+ + GA F+ A +D
Sbjct: 193 ADFEGASLTKASLAGSRLRGAKFTGAKLD 221
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 74/182 (40%), Gaps = 15/182 (8%)
Query: 74 TALAAAVVASCSSNISAL----ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
TALAA A + L ADL++ E A A+LR+A R
Sbjct: 41 TALAAGGTAPADAESGGLPLAEADLSRARIEE---------ADLSGANLRRASLTGAVGR 91
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
F A + E+D S + +GA VA + F A L D + + A+L+ A+L
Sbjct: 92 STRFVGAILEETDLSEADMSGADFTGIVAGQVKFASAMLEDARFGQAAMRFADLSGALLD 151
Query: 190 RTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNP-ITGVSTRKSLGCGNSRR 247
+DL GA GAD D V D +A AN T+ G S K+ G+ R
Sbjct: 152 GASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTHADFEGASLTKASLAGSRLR 211
Query: 248 NA 249
A
Sbjct: 212 GA 213
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 5/97 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A L +A N A+F A + ++ +GS+ GA A A+ +GADLSDT
Sbjct: 175 FRDARLDEAKLADANLTHADFEGASLTKASLAGSRLRGAKFTGAKLDGADLSGADLSDTD 234
Query: 173 MDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIE 204
+ R+ L A A L T ++ LGGA+ E
Sbjct: 235 LVRLNLATCRLRHARFAGAWLNGTRMSVEQLGGAVGE 271
>gi|119488469|ref|ZP_01621642.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
gi|119455280|gb|EAW36420.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
Length = 463
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 60/109 (55%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +ADL + + R A+ +SA + D + + A L +A +AN GA+L +
Sbjct: 119 ANFHNADLDAVNLISADLRGADLSSASLSWYDKVVANLSRADLTEANLSEANLCGANLLE 178
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
T + R LN+ANL +A L+RT+L SDL A + A DA ++ A+ Q
Sbjct: 179 TNLTRANLNKANLQDANLIRTILLESDLSLAELSNARLQDANLEGAKLQ 227
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 53/111 (47%), Gaps = 15/111 (13%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L KA N R +D+ ++ S ++ A LE A +AN TG +LS + R
Sbjct: 184 ANLNKANLQDANLIRTILLESDLSLAELSNARLQDANLEGAKLQQANLTGINLSRLNLAR 243
Query: 176 MVLNEANLTNAVLVRTV---------------LTRSDLGGAIIEGADFSDA 211
+ LN ANL NA L+ T L R++L A + GAD +DA
Sbjct: 244 VNLNRANLKNANLLETSFEGANLRIVNLNQANLIRANLSRASLIGADLTDA 294
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 55/105 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ +A L+ A ++AN T ++ + + N A L+ A + +F GA+L
Sbjct: 207 SLAELSNARLQDANLEGAKLQQANLTGINLSRLNLARVNLNRANLKNANLLETSFEGANL 266
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+++ L ANL+ A L+ LT ++L GA +E A+F AV+
Sbjct: 267 RIVNLNQANLIRANLSRASLIGADLTDANLYGANLENAEFLGAVM 311
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSA----DMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
A A+L + ++ N AN TSA D+R++ S + + A L +A + + TGA
Sbjct: 40 ADLTDANLNETKLMRANLSHANLTSANLRGDLRQATLSYATLSEADLGRAKLHGVDLTGA 99
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+L+ + LN ANL A L +L A + GAD S A
Sbjct: 100 NLTGANLTGASLNHANLKQANFHNADLDAVNLISADLRGADLSSA 144
>gi|411117186|ref|ZP_11389673.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410713289|gb|EKQ70790.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 544
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +LR+A + N R AN T A++R +D SG+ + A L A AN TG +L
Sbjct: 173 SGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNL 232
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCKYAN 226
S + +L A+LT A L+ SDL GA + GA + + + LC++ +
Sbjct: 233 SYANLLGTILVHADLTRASLIGADWAGSDLSGATLTGAKLHGVLRFGVKTEGILCEWVD 291
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
S+DL A R+AN + A++R ++ +G+ A L A A+ +GA LS
Sbjct: 167 LSSSDLSGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGAN 226
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
+ + L+ ANL +LV LTR+ L GA G+D S A + A+ + ++ T I
Sbjct: 227 LTGVNLSYANLLGTILVHADLTRASLIGADWAGSDLSGATLTGAKLHGVLRFGVKTEGI 285
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 3/135 (2%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A DL+ A + N AN + A ++ + F+ + + A L ++ +GADL
Sbjct: 118 SFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANLSQANLHGTDLSSSDLSGADL 177
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-CKYAN- 226
S T + + L+ ANL A L L +DL GA + AD S A + A + YAN
Sbjct: 178 SYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNLSYANL 237
Query: 227 -GTNPITGVSTRKSL 240
GT + TR SL
Sbjct: 238 LGTILVHADLTRASL 252
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 9/123 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
S A +L KA N AN +++ E++ S +K N GA L KA +AN
Sbjct: 23 SEANLSGVNLSKANLNGANLSVANLCGSNLSEANLSKAKLNVAKLSGANLSKANLEEANL 82
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
A+L+ + L +A+L A + R L+ ++L A + G D DA + +QA
Sbjct: 83 NVANLTLADLSHAELRQASLVRAEMARAELSEANLSFANLSGVDLKDAKL----RQANLS 138
Query: 224 YAN 226
+AN
Sbjct: 139 HAN 141
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 62/131 (47%), Gaps = 15/131 (11%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM----- 138
C SN+S A+L+K + + A+ A+L KA + N AN T AD+
Sbjct: 48 CGSNLSE-ANLSKAKL---------NVAKLSGANLSKANLEEANLNVANLTLADLSHAEL 97
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
R++ ++ A L +A AN +G DL D + + L+ AN++ A L T ++L
Sbjct: 98 RQASLVRAEMARAELSEANLSFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANL 157
Query: 199 GGAIIEGADFS 209
A + G D S
Sbjct: 158 SQANLHGTDLS 168
>gi|428216610|ref|YP_007101075.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988392|gb|AFY68647.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 373
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 56/111 (50%), Gaps = 14/111 (12%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A L KA V+ N + AN + A++ E++ SG+ A L A+ AN +GA+LS
Sbjct: 189 AQLNKAYFVRANLQNANLSDANLTEANLSGADLREADLSGAILCGANLSGANLS------ 242
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
EANL A VL SDL GA + GAD +A + QA K AN
Sbjct: 243 ----EANLRTANFKAAVLIGSDLSGADLSGADLYNANL----SQADLKIAN 285
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 59/120 (49%), Gaps = 9/120 (7%)
Query: 84 CSSNISA--LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES 141
C +N+S L++ N A + IGS ADL A + AN + AD++ +
Sbjct: 232 CGANLSGANLSEANLRTANFKAAVLIGS--DLSGADLSGA-----DLYNANLSQADLKIA 284
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
+ S + + L A AY AN + ADLS ++ L ANL+ A LV+T ++LG A
Sbjct: 285 NLSLADLGASNLFGANAYGANLSLADLSMANLNDAFLYGANLSWANLVQTNFAGANLGAA 344
>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 300
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 67/126 (53%), Gaps = 12/126 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSG 145
L+D N EA+ G + A+LR A + R + TSA++ E+DF+
Sbjct: 93 LSDANLVEADLSGSMLV-------EANLRGANLSRGKVRDVDLTSANLSDGFFIETDFTR 145
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
S+ + +++A +A TG +LS + ++++ L+ A+L NAVLV +T SDL A G
Sbjct: 146 SQMVRSKMQRAFLGRATLTGTNLSWSNLEKVNLDNADLQNAVLVDVDITSSDLVAANFSG 205
Query: 206 ADFSDA 211
AD DA
Sbjct: 206 ADLRDA 211
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 46/104 (44%), Gaps = 29/104 (27%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA F ADLR A + N R A+ T AD+R GA L ++ N TG
Sbjct: 200 AANFSGADLRDADLSEVNLRNADLTGADLR----------GARL----SFSQNMTG---- 241
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L NA+L L ++L GA IE ADFS A I
Sbjct: 242 -----------STLNNAILHSANLIGTNLNGADIEQADFSGAKI 274
>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 179
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 49/95 (51%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ A N AN +AD+ E++ G+ GA L+ A K N GA+L +
Sbjct: 65 DLQNANLQGANLEGANLQNADLEEANLQGANLAGANLQGADLEKGNLAGANLQTANLINA 124
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L EANL NA L L R+DL A + GA+ ++A
Sbjct: 125 DLEEANLQNANLQGASLQRADLEKANLTGANTNEA 159
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 5/79 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL K N + AN +AD+ E++ + GA L++A KAN TGA+
Sbjct: 97 AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASLQRADLEKANLTGANT 156
Query: 169 SDTLMDRMVLNEANLTNAV 187
++ L ANL NA+
Sbjct: 157 NEA-----NLQGANLENAI 170
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 68/143 (47%), Gaps = 9/143 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A S +LR+ + KE N + D++ ++ G+ GA L+ A +AN GA+L
Sbjct: 38 STAPEASTELRRLLDTKE-CAGCNLSGVDLQNANLQGANLEGANLQNADLEEANLQGANL 96
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ + L + NL A L L +DL A ++ A+ A + ++A + AN
Sbjct: 97 AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASL----QRADLEKAN-- 150
Query: 229 NPITGVSTRKSLGCGNSRRNAYG 251
+TG +T ++ G + NA G
Sbjct: 151 --LTGANTNEANLQGANLENAIG 171
>gi|332707026|ref|ZP_08427086.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354291|gb|EGJ33771.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 239
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 51/105 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F AD +A N AN A + + F +K A L+ A N GADL
Sbjct: 78 SGVDFSRADFSQANLSDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADL 137
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
SD + + L A+L+ A+L RT L +DL GA +E AD S A I
Sbjct: 138 SDANLLNIRLANADLSTAILNRTELREADLTGANMEHADLSHASI 182
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 59/118 (50%), Gaps = 4/118 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + +A+L+ A + F A TSAD+ +DF + GA L A ADL
Sbjct: 93 SDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADLSDANLLNIRLANADL 152
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
S +++R L EA+LT A + L+ + + GAI+ A+ + A + +A +YAN
Sbjct: 153 STAILNRTELREADLTGANMEHADLSHASIYGAILREANLTGANL----YKANLRYAN 206
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFTGA 166
+ +ADL A+ + R A+ T A+M +D S + GA L +A YKAN A
Sbjct: 146 RLANADLSTAILNRTELREADLTGANMEHADLSHASIYGAILREANLTGANLYKANLRYA 205
Query: 167 DLSDTLMDRMVLNEANLTNAVL 188
+L D ++ L A+L AV+
Sbjct: 206 NLQDAVLKGTNLKGADLQFAVM 227
>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 215
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 54/107 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A K N AN + AD+ ESD S + GA L A A+ +GADL
Sbjct: 15 ANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQNADLSGADLRS 74
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ R L+EANL +A L L +DL GA + GA+ A + +A
Sbjct: 75 ADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIAN 121
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 46/81 (56%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ + AD+R ++ S + +GA L+KA AN + ADLS++ + L A L NA L
Sbjct: 5 ADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQN 64
Query: 191 TVLTRSDLGGAIIEGADFSDA 211
L+ +DL A + AD S+A
Sbjct: 65 ADLSGADLRSADLFRADLSEA 85
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
++AD+ +D G+ + A L+ A KAN GA+LS+ + L+ A+L A L
Sbjct: 2 LSNADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNAT 61
Query: 193 LTRSDLGGAIIEGADF 208
L +DL GA + AD
Sbjct: 62 LQNADLSGADLRSADL 77
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 32/57 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
S A SADL +A + N R A+ +SAD+R +D G+K GA L A AN TG
Sbjct: 68 SGADLRSADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIANVTG 124
>gi|163797895|ref|ZP_02191839.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
gi|159176857|gb|EDP61425.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
Length = 396
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 47/95 (49%), Gaps = 10/95 (10%)
Query: 114 GSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 173
G+AD + A N +FT AD+RE DF+G+ GA A +A GADLS
Sbjct: 15 GAADGQPASFANANLFGFDFTGADLREVDFAGASLQGARFVGADLTRAVLVGADLSGVSF 74
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
VL EA+LT A LV GA+ EGAD
Sbjct: 75 RNAVLLEADLTGARLV----------GAVFEGADL 99
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 76/163 (46%), Gaps = 17/163 (10%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLR 119
A L+ R FV L AV+ + + + EA+ G +G+ A S LR
Sbjct: 47 ASLQGAR-FVGADLTRAVLVGADLSGVSFRNAVLLEADLTGARLVGAVFEGADLRSVSLR 105
Query: 120 KAVHV------KENFRR--ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
A V +E+ RR F A M ++ +G+KF E V + + TGA+L
Sbjct: 106 GASGVSAEPVTEESPRREAVTFAGARMHRANLTGAKF-----ENVVLAQTDLTGANLERA 160
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ R ++ A L NA+L+ L+ +DL +++ GAD S A +D
Sbjct: 161 SLRRASMSGAVLRNAILIDADLSHADLTDSLVTGADLSGAQLD 203
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ T AD+R + SG+ GA L +AV A ADLS + L NL+ A L
Sbjct: 270 DLTDADLRSLNLSGADLRGAVLRRAVLTDALLVLADLSGADLTLASLARCNLSGANLAGA 329
Query: 192 VLTRSDLGGAIIEGAD-FSDAVIDLAQKQ 219
L+R+DL AI+ A S A D ++Q
Sbjct: 330 NLSRADLTDAILTAAPILSQAGADTGRRQ 358
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
RAN T A + + GA LE+A +A+ +GA L + ++ L+ A+LT+++
Sbjct: 132 MHRANLTGAKFENVVLAQTDLTGANLERASLRRASMSGAVLRNAILIDADLSHADLTDSL 191
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNPITGVST 236
+ L+ + L GA +E A+F A + D+ + A T P V+T
Sbjct: 192 VTGADLSGAQLDGATVERANFVGARLRDVDLSRVDTSKARLTPPTDSVTT 241
>gi|226365701|ref|YP_002783484.1| hypothetical protein ROP_62920 [Rhodococcus opacus B4]
gi|226244191|dbj|BAH54539.1| hypothetical protein [Rhodococcus opacus B4]
Length = 201
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 15/131 (11%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYL 153
+E R E I + F ADL ++ HV FR +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 154 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 203
V + +FT GADL EANL L R VL +DL GGA +
Sbjct: 98 RPMVFDECDFTLVSLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKL 157
Query: 204 EGADFSDAVID 214
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|398354158|ref|YP_006399622.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
gi|390129484|gb|AFL52865.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
Length = 249
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 11/124 (8%)
Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A+ +A+L KA V+ + +ANF+ + DFSG GA + +A+F
Sbjct: 85 SGAELTAANLEKATLVRASLAGAKADKANFSRVEAYRGDFSGISAEGALFVSSELQRADF 144
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQ 217
TGA L+ ++ L AN AVL T L+R++L GA+ EG DF A + L +
Sbjct: 145 TGARLTGADFEKAELGRANFGKAVLTGTRFSVANLSRANLSGALFEGPLDFDRAFLFLTR 204
Query: 218 KQAL 221
+ L
Sbjct: 205 IEGL 208
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 5/111 (4%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G A + + R+ + + + ++ D +D SG++ A LEKA +A+ GA
Sbjct: 49 GPGADWRECNKRQLMLGGSDLKGSHLVDTDFASTDLSGAELTAANLEKATLVRASLAGAK 108
Query: 168 LSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
R+ + + A+ V + L R+D GA + GADF A +
Sbjct: 109 ADKANFSRVEAYRGDFSGISAEGALFVSSELQRADFTGARLTGADFEKAEL 159
>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 479
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL ++ N RA+ T A +RE++ G +F GA L++A KAN GA+L
Sbjct: 60 SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVGANL 119
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+EANLT A L L S L GAI++ A +++ I
Sbjct: 120 ----------HEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 10/97 (10%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADLR + + AN AD+RE+DF+G A AN +GADL +
Sbjct: 338 SADLRGVDLTRADLSGANLRDADLRETDFTG----------ATLLFANLSGADLRGVDLT 387
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L+ A L A L + L R +L GA + AD SDA
Sbjct: 388 KADLSGAKLNEADLRKADLMRVNLEGADLTEADLSDA 424
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 15/103 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR+ AN + AD+R ++D SG+K N A L KA + N
Sbjct: 352 SGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGAKLNEADLRKADLMRVNL 411
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
GADL+ EA+L++A L R L ++L G ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 51/97 (52%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR A ++ + AN + AD+ ES + + A L AV +AN G + + + +
Sbjct: 49 LRYADLIEADLSGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQAS 108
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L +ANL A L LTR++L GA + G+ S A++D
Sbjct: 109 LIKANLVGANLHEANLTRANLSGADLRGSQLSGAILD 145
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 44/84 (52%), Gaps = 5/84 (5%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
R AN D+RE++ SG+ A L +A AN +GADL+++ LN ANLT A
Sbjct: 29 LRGANLRGTDLRETNLSGAMLRYADLIEADLSGANLSGADLAESF-----LNLANLTRAD 83
Query: 188 LVRTVLTRSDLGGAIIEGADFSDA 211
L VL ++L G GA+ A
Sbjct: 84 LTGAVLREANLVGVEFTGANLKQA 107
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + N +N TS +++ D S + AYL+ A + GADLS
Sbjct: 274 ADLNGSDLSGANLSASNLTSVNLKNVDLSRASLKKAYLKGANLEGTDLRGADLSGA---- 329
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+L++ NL++A L LTR+DL GA + AD +
Sbjct: 330 -ILHQVNLSSADLRGVDLTRADLSGANLRDADLRE 363
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 165
A DLR A + N +SAD+R D + + +GA L A + +FTG
Sbjct: 314 ANLEGTDLRGADLSGAILHQVNLSSADLRGVDLTRADLSGANLRDADLRETDFTGATLLF 373
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+LS + + L +A+L+ A L L ++DL +EGAD ++A
Sbjct: 374 ANLSGADLRGVDLTKADLSGAKLNEADLRKADLMRVNLEGADLTEA 419
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 63/156 (40%), Gaps = 31/156 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV----------- 157
+ A A L KA V N AN T A++ +D GS+ +GA L+KAV
Sbjct: 100 TGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFPEDI 159
Query: 158 ---AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
A A N DL++ + L NL A+L L R++L GA
Sbjct: 160 DPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLAGAN 219
Query: 203 IEGADFSDA-----VIDLAQKQALCKYANGTNPITG 233
+ AD +A +++ A ++ G +P G
Sbjct: 220 LSAADLREASLSGTILEKAVYSNKTLFSEGIDPALG 255
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 49/106 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
++ + DL +A K + AN D+R +D SG+ + L A + T ADL
Sbjct: 292 TSVNLKNVDLSRASLKKAYLKGANLEGTDLRGADLSGAILHQVNLSSADLRGVDLTRADL 351
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
S + L E + T A L+ L+ +DL G + AD S A ++
Sbjct: 352 SGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGAKLN 397
>gi|428302093|ref|YP_007140399.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238637|gb|AFZ04427.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 146
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 71/143 (49%), Gaps = 17/143 (11%)
Query: 74 TALAAAVVASCSSNISALADLN---KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
TAL A + S ISA AD+ ++ ETR + + +LR A N +
Sbjct: 7 TALTIASTITLSLPISAQADMKSDVQHLLETRECY---------ACNLRGA-----NLKG 52
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ AD+R ++ G+ GA LE A A+ A+LS ++ LN ANLTNA L
Sbjct: 53 AHLIGADLRNANLKGANLAGANLEGADLTGADLEEANLSYAFVNSTSLNYANLTNANLSN 112
Query: 191 TVLTRSDLGGAIIEGADFSDAVI 213
L ++L GA++ GAD + A I
Sbjct: 113 AHLYSAELDGAVMVGADLAGADI 135
>gi|6226483|sp|Q52118.1|YMO3_ERWST RecName: Full=Uncharacterized protein in mobD 3'region
gi|886362|gb|AAA69501.1| unknown [Plasmid pSW200]
Length = 295
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 163
S A +ADL++A N A+ T+A++ ++D +GA L A +AY +A+
Sbjct: 170 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 229
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ A+LS+ + R L++ANL++A L L R+DL AI++GA+
Sbjct: 230 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 274
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANF 163
S A A+L A N AN T A + E+D S + +GA L A + N
Sbjct: 90 SDADLSDANLSDANLSGANLAHANLTMAYLSEADLSNANLSGADLTNANLNQTDLPNVNL 149
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+GA+L+ + L+EA+L+NA L L R+DL A + GAD ++A ++
Sbjct: 150 SGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLN 200
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM---------- 176
N AN T A + E+D S + + A L++A AN +GADL++ +++
Sbjct: 153 NLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGA 212
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANLT A L L+ ++L A ++ AD SDA
Sbjct: 213 NLAHANLTMAYLSEADLSNANLSNADLKRADLSDA 247
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 50/106 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A + AN D+ + SG+ A L A +A+ + A+LS+
Sbjct: 117 AYLSEADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANLSN 176
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ R L+ ANL+ A L L ++DL + GA+ + A + +A
Sbjct: 177 ADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMA 222
>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 204
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 13/131 (9%)
Query: 96 KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSA----------DMRESD 142
K A RG G+ A F +ADLR A+ + R A+F A D+ D
Sbjct: 63 KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAIFNNLDLSGID 122
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
F G+ G L KA ++A + A+LS + L EANL+ AVL T L S+L A
Sbjct: 123 FRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEANLSGAVLRGTNLQSSNLLCAS 182
Query: 203 IEGADFSDAVI 213
+E AD + ++
Sbjct: 183 VEQADLTGTLL 193
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 11/109 (10%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+KA ++ N R A+FT AD+R +DF + GA L A +A+F GA L+ +
Sbjct: 54 FAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAI 112
Query: 173 MDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L+ NL+ A L R L+ ++L GA + AD +A
Sbjct: 113 FNNLDLSGIDFRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEA 161
>gi|434407711|ref|YP_007150596.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428261966|gb|AFZ27916.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 268
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 52/101 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G++ L + N N +AD+ E+ ++ GAYL K YKAN T A LS
Sbjct: 144 ADLGTSKLHRTNLCFANLIAVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSG 203
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R L EA+L+NA L T L ++L GA + GA+ A
Sbjct: 204 AYLLRANLTEADLSNADLSWTNLRGANLTGANLRGANLRGA 244
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 43/98 (43%), Gaps = 15/98 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G ADL A N A A++ ++ S + GA L +A AN G+DLS
Sbjct: 64 ANLGGADLTGA-----NLYNAKLIEANLSAANLSAANLRGATLTQADMNCANLIGSDLS- 117
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
EANL AV+ L +DL GA + AD
Sbjct: 118 ---------EANLKGAVITDANLIGADLRGANLRDADL 146
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 56/112 (50%), Gaps = 10/112 (8%)
Query: 84 CSSNISAL----ADLNK---YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF 133
C +N+ A+ ADL++ +EAE G + + A A L A ++ N A+
Sbjct: 157 CFANLIAVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSGAYLLRANLTEADL 216
Query: 134 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++AD+ ++ G+ GA L A AN TGA+LS + ++ ++++ N
Sbjct: 217 SNADLSWTNLRGANLTGANLRGANLRGANLTGANLSSVNLHETIMPDSSMHN 268
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 5/98 (5%)
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
+K + + R AN ++R ++ G + L A+ +AN GADL+ + L
Sbjct: 22 KKNPQIAPDLRGANLQGDNLRGANLQGVNLSKVDLSNALLVRANLGGADLTGANLYNAKL 81
Query: 179 NEANLTNAVL----VR-TVLTRSDLGGAIIEGADFSDA 211
EANL+ A L +R LT++D+ A + G+D S+A
Sbjct: 82 IEANLSAANLSAANLRGATLTQADMNCANLIGSDLSEA 119
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 51/109 (46%), Gaps = 15/109 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA +A+LR A T ADM ++ GS + A L+ AV AN GADL
Sbjct: 87 SAANLSAANLRGAT----------LTQADMNCANLIGSDLSEANLKGAVITDANLIGADL 136
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L +A+L + L RT L ++L + AD S+A + A+
Sbjct: 137 RGA-----NLRDADLGTSKLHRTNLCFANLIAVNLIAADLSEATLHEAE 180
>gi|443476809|ref|ZP_21066696.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018179|gb|ELS32476.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 330
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 68/131 (51%), Gaps = 12/131 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L+ +N A G IG+ F +L +A + N + AN AD++ ++ + G
Sbjct: 183 LSRVNLQGANLSGAIAIGTI--FTEVNLSQANLTEVNLKGANLMKADLKNANLRLANLFG 240
Query: 151 AYLEKA---VAYKAN-------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
A L KA +A +N TG+DLS +L+DR L++A+L +A LVR L +DL
Sbjct: 241 ANLSKANLSMATLSNAGLIQAILTGSDLSRSLLDRANLSQASLVDAYLVRANLDGADLSN 300
Query: 201 AIIEGADFSDA 211
AI+ A+ S A
Sbjct: 301 AILTRAELSGA 311
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 11/136 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 165
A ++DL K + RR NF A + + F SG+K GA L +A+ AN T
Sbjct: 25 ANLFNSDLIGINLTKADLRRTNFVFAYLNKVTFNHANLSGAKLGGATLNQAIMMSANLTE 84
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
ADL ++ R+ L ANL+ A L+ L+ +DL + GA+ A++ AL +
Sbjct: 85 ADLHGAMLQRVNLFGANLSLANLMDANLSEADLRSVNLRGANLRCAIL----SAALMREE 140
Query: 226 NGTNP--ITGVSTRKS 239
G P + G + RK+
Sbjct: 141 RGYPPTNMVGANLRKA 156
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
V N R+A+ A++ SD +G +GA L +A + N GA+LS + + E NL
Sbjct: 149 VGANLRKADLRGANLSGSDLTGVDLSGANLSEATLSRVNLQGANLSGAIAIGTIFTEVNL 208
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ A LT +L GA + AD +A + LA
Sbjct: 209 SQA-----NLTEVNLKGANLMKADLKNANLRLAN 237
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F S +L A + N T AD+R ++F F AYL K AN +GA L
Sbjct: 17 FASLNLANANLFNSDLIGINLTKADLRRTNFV---F--AYLNKVTFNHANLSGAKLGGAT 71
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+++ ++ ANLT A L +L R +L GA + A+ DA
Sbjct: 72 LNQAIMMSANLTEADLHGAMLQRVNLFGANLSLANLMDA 110
>gi|227496450|ref|ZP_03926734.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
gi|226834032|gb|EEH66415.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
Length = 222
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 53/104 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+++ + R AN +DMR +D G+ G +L A+ GADL D
Sbjct: 98 ADMAGADLRRSILPRAELRNANLVDSDMRGADLRGADLRGTWLPYTDMRGADLAGADLRD 157
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++ L+ A+L ++ L LT ++L A + GAD A ID
Sbjct: 158 ADLEGADLHGASLQSSDLRGADLTDAELTDADLRGADLRGADID 201
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 43/93 (46%), Gaps = 6/93 (6%)
Query: 110 AAQFGSA-DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
A + G A DLR N R + T AD+R G+ +GA L + A+ T ADL
Sbjct: 16 AHRLGQAPDLRDTDLSNLNLRELDLTDADLR-----GANLDGADLSWSTLSTADLTDADL 70
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
+ R VL A LT A L + +D+ GA
Sbjct: 71 RGATLRRTVLTRAVLTRAALTQVYARDADMAGA 103
>gi|428218432|ref|YP_007102897.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990214|gb|AFY70469.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 403
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A L +A + +AN T A + ++D S GAYL A +AN GA
Sbjct: 9 ANLTNASLTRADLKGVDLVKANLTGASLSDADLSQVNLTGAYLNGADLNRANLAGA---- 64
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L+EANL A L+R L R+ L AI+ GA+F +A +
Sbjct: 65 ------ILDEANLAAAFLIRANLQRASLNEAILAGANFHEASL 101
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 58/119 (48%), Gaps = 15/119 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-------------- 156
A ADL+ VK N A+ + AD+ + + +G+ NGA L +A
Sbjct: 14 ASLTRADLKGVDLVKANLTGASLSDADLSQVNLTGAYLNGADLNRANLAGAILDEANLAA 73
Query: 157 -VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+AN A L++ ++ +EA+LT A L L+ +DL GA + GA+ SDA ++
Sbjct: 74 AFLIRANLQRASLNEAILAGANFHEASLTGANLRSADLSLADLAGADLAGANLSDACMN 132
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 10/108 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 160
A A L +A+ NF A+ T A++R +D S G+ + A + A +
Sbjct: 79 ANLQRASLNEAILAGANFHEASLTGANLRSADLSLADLAGADLAGANLSDACMNSAFFIE 138
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
AN GADLS T + L +ANL+ A L LT +DL A + GA+
Sbjct: 139 ANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSHATMTGAEL 186
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A S +++ A+ V+ N AN +A++ ++ + NGA L +A +AN +GA L
Sbjct: 302 SGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSGASL 361
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
+ + L ANL L+ L ++L GAI+
Sbjct: 362 TGANLKGAFLLWANLKGTFLLWANLDEANLTGAIL 396
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 48/94 (51%), Gaps = 10/94 (10%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR A K N AN SAD+ +D S + GA L ++ + TGA+L D+
Sbjct: 151 LRGASLAKANLSGANLRSADLTGADLSHATMTGAEL-----HQVDLTGANL-----DQTN 200
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LN A+L NA L L+R++LG A + G +A
Sbjct: 201 LNAADLVNASLDGAFLSRANLGWANLIGTTMKEA 234
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 12/120 (10%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL----- 153
A G F +G A A+L A N AN + AD+ ++ G+ +GA L
Sbjct: 259 ANLTGAFLMG--ANLNGANLNGA-----NLTNANLSGADLSNTNLMGTSLSGADLSSTEM 311
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ A+ + N GA+L++ + L +ANL A L L R++L GA + GA+ A +
Sbjct: 312 KGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSGASLTGANLKGAFL 371
Score = 43.9 bits (102), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A + N A+ + A++ + G+ NGA L A AN +GADLS+
Sbjct: 234 ANLVGADLSWANLNEVNLAGADLSWANLTGAFLMGANLNGANLNGANLTNANLSGADLSN 293
Query: 171 TLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVID 214
T + L+ A+L++ A+LVRT L ++L A + GA+ A ++
Sbjct: 294 TNLMGTSLSGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLN 342
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 15/92 (16%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
++AN T+A + +D G KAN TGA LSD L++ NLT A
Sbjct: 6 LKKANLTNASLTRADLKGVDL----------VKANLTGASLSDA-----DLSQVNLTGAY 50
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
L L R++L GAI++ A+ + A + A Q
Sbjct: 51 LNGADLNRANLAGAILDEANLAAAFLIRANLQ 82
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 47/101 (46%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A F AN AD+ + G+ A L A A+ TGADLS
Sbjct: 119 ADLAGANLSDACMNSAFFIEANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSH 178
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
M L++ +LT A L +T L +DL A ++GA S A
Sbjct: 179 ATMTGAELHQVDLTGANLDQTNLNAADLVNASLDGAFLSRA 219
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A L KA N R A+ T AD+ + +G++ + L A + N ADL + +D
Sbjct: 154 ASLAKANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANLDQTNLNAADLVNASLDG 213
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L+ ANL A L+ T + ++L GA + A+ ++
Sbjct: 214 AFLSRANLGWANLIGTTMKEANLVGADLSWANLNE 248
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 5/107 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A + N AN A++ ++ SG+ + L + +GADLS
Sbjct: 254 ADLSWANLTGAFLMGANLNGANLNGANLTNANLSGADLSNTNL-----MGTSLSGADLSS 308
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
T M +L NL A L LT ++L A + GA+ +A ++ A
Sbjct: 309 TEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRAN 355
>gi|443317576|ref|ZP_21046968.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442782825|gb|ELR92773.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 303
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G DL +A+ V+ N R++ + ++ +++ + + G L +A +ANFT A+L
Sbjct: 99 ADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFTEANLRR 158
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ R L +ANL TR++L A + ADFSDA++
Sbjct: 159 VELQRAQLGKANL----------TRANLADARMLHADFSDAIL 191
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 66/141 (46%), Gaps = 22/141 (15%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANF 133
T L+ A++ + N S L+ +N ++A IG +L +A N R ANF
Sbjct: 104 TDLSQAILVEANLNRSDLSGVNLHQANLTKASLIG-------VELNRA-----NLREANF 151
Query: 134 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 193
T A++R + ++ A L +A A AD SD +L E NL+ A L R L
Sbjct: 152 TEANLRRVELQRAQLGKANLTRANLADARMLHADFSDA-----ILQETNLSGARLNRANL 206
Query: 194 TRSDLGGAIIE-----GADFS 209
TR+DL A ++ GAD S
Sbjct: 207 TRTDLTAANLKETNLLGADLS 227
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G+ DL+ A + RAN + E+D SG+ L A KAN +GA+L+
Sbjct: 34 ANLGNFDLKGANLSGADLTRANCIGVILSEADLSGATLVRTDLSGADINKANLSGANLTK 93
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGT 228
+ L E +L+ A+LV L RSDL G + A+ + A +I + +A + AN T
Sbjct: 94 ANLLGADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFT 152
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 20/123 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 165
AQ G A+L +A A+F+ A ++E++ SG++ N A L + A + N G
Sbjct: 164 AQLGKANLTRANLADARMLHADFSDAILQETNLSGARLNRANLTRTDLTAANLKETNLLG 223
Query: 166 ADLSDTLMDRMVLNEANLTNA---------------VLVRTVLTRSDLGGAIIEGADFSD 210
ADLS +L EANL+ A L T LT+++L GA + A+ +
Sbjct: 224 ADLSYANFTEALLAEANLSGADLSYANLAGLDLTGLNLAGTNLTQANLAGANLTEANLEE 283
Query: 211 AVI 213
AV+
Sbjct: 284 AVL 286
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A + AN + AD+ ++ +G G L +AN GA+L++ ++
Sbjct: 224 ADLSYANFTEALLAEANLSGADLSYANLAGLDLTGLNLAGTNLTQANLAGANLTEANLEE 283
Query: 176 MVLNEANLTNAVL 188
VL EANLT A +
Sbjct: 284 AVLTEANLTQATM 296
>gi|427723149|ref|YP_007070426.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354869|gb|AFY37592.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 508
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ LR+A N RR + + A + ++D S + GAYL A Y AN GA+L
Sbjct: 67 SGAKLSKVHLRQAYLYGTNLRRTHLSEAFLFKADLSKTNLYGAYLYGAYLYGANLYGANL 126
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
S + L+EA+L+ A L L+ +DL G + AD S
Sbjct: 127 S-----KADLSEADLSEADLSEADLSEADLSGVSLSEADLS 162
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQF A L N A+ + AD+ +D SG+K + +L +A Y N LS+
Sbjct: 34 AQFSGAHLSGVNLSGVNLSGADLSGADLSGADLSGAKLSKVHLRQAYLYGTNLRRTHLSE 93
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L++ NL A L L ++L GA + AD S+A
Sbjct: 94 AFLFKADLSKTNLYGAYLYGAYLYGANLYGANLSKADLSEA 134
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 7/123 (5%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L+ N Y A G + G A A+L KA + A+ + AD+ E+D S + +G
Sbjct: 101 LSKTNLYGAYLYGAYLYG--ANLYGANLSKA-----DLSEADLSEADLSEADLSEADLSG 153
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L +A N +G +LS + + L+ NL+ A L T+ S L GA ++ AD +
Sbjct: 154 VSLSEADLSGVNLSGVNLSGVNLSGVNLSGVNLSGAKLCHTLCKLSTLVGASLKSADLTG 213
Query: 211 AVI 213
A I
Sbjct: 214 ACI 216
>gi|158341150|ref|YP_001522487.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311391|gb|ABW33002.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 150
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 49/84 (58%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F +AN T+A + + F G+ F A L+ A AN +GA+L + + +L ANLT A
Sbjct: 20 FAKANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTGAD 79
Query: 188 LVRTVLTRSDLGGAIIEGADFSDA 211
L ++L R+ L AI++GA+ DA
Sbjct: 80 LRSSILCRAVLTDAILQGANLRDA 103
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 55/105 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A L A + +F++AN +A + ++ SG+ A L A+ AN TGADL
Sbjct: 23 ANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTGADLRS 82
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
+++ R VL +A L A L L +D A + GAD S A ++L
Sbjct: 83 SILCRAVLTDAILQGANLRDADLRETDFKNADLTGADLSGAKVNL 127
>gi|254417634|ref|ZP_05031369.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196175575|gb|EDX70604.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 470
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADLR A N A AD+R +D G+K N A L+ A AN +GA+LS
Sbjct: 214 ANLVSADLRNA-----NLTDAQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLS- 267
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
+ + A+ A+LVRT L + L G+ + AD + A + AQ + +
Sbjct: 268 ----QAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGI 314
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 69/158 (43%), Gaps = 22/158 (13%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
N + V T L AV+ + I+ L N A+ +G IG F A+L KA
Sbjct: 277 NGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKG---IG----FNRANLTKANLEGA 329
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRM 176
+ A AD+ + +G+ + AYL A AN +G DL S+ +
Sbjct: 330 DLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQLREANLSNVTLVGA 389
Query: 177 VLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 209
L +ANL T A L T LTR DL GA + GAD S
Sbjct: 390 TLEDANLIRSTLTGANLTYTNLTRCDLRGANLTGADLS 427
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 5/110 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A +AD A+ V+ R A NF AD+ +++ G++ G +A KAN
Sbjct: 267 SQAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGIGFNRANLTKANL 326
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GADL++ + L A LT A+L L + L A + G D A +
Sbjct: 327 EGADLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQL 376
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 5/144 (3%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
F++ AA V +N+ ADL Y A G + S A A L A V+ R
Sbjct: 154 FIANWYAAVVTDLRDTNLQG-ADL--YRANLDG--ALLSRANLQDAQLDYANLVRTYLRE 208
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A T+A++ +D + A LE A A+ GA L++ +D + + ANL+ A L +
Sbjct: 209 ATLTNANLVSADLRNANLTDAQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLSQ 268
Query: 191 TVLTRSDLGGAIIEGADFSDAVID 214
+T +D GAI+ +AV++
Sbjct: 269 AYITNADFNGAILVRTTLREAVLN 292
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A SA L A N + A +RE++ S GA LE A ++ TGA+L
Sbjct: 347 TGAILHSAYLHSATLANANLSGVDLQGAQLREANLSNVTLVGATLEDANLIRSTLTGANL 406
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDA 211
+ T + R L ANLT A L LT+++ A++ +GA+ SDA
Sbjct: 407 TYTNLTRCDLRGANLTGADLSYANLTQANFSQAVLMDASFQGANLSDA 454
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 23/122 (18%)
Query: 111 AQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYL-----EKAVAYK 160
A SADLR A N AN + A++ ++ + + FNGA L +AV
Sbjct: 234 ADIRSADLRGAKLNNANLDTVNADSANLSGANLSQAYITNADFNGAILVRTTLREAVLNG 293
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD---AVIDLAQ 217
+NF ADL+ +ANL A L R++L A +EGAD ++ A+ DL
Sbjct: 294 SNFQIADLT----------QANLQGAQLKGIGFNRANLTKANLEGADLTNAKLAIADLTN 343
Query: 218 KQ 219
Q
Sbjct: 344 AQ 345
>gi|334121293|ref|ZP_08495365.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333455228|gb|EGK83883.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 299
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 9/120 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
LN YE R +F + A A+L A+ + N RAN + A++ + + + GA L
Sbjct: 7 LNNYEKGHR-DF---TGADLSGANLSGAILIGVNLSRANLSGANLSRAFLTKATLQGAVL 62
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ N + A + +T + L +ANL+ A LV+ L R+ L GA + GA+ AV+
Sbjct: 63 QRT-----NLSFAKMGETQLSGADLTKANLSGAFLVKAKLPRAKLSGATLTGANLRGAVL 117
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 38/85 (44%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF+ AN A + + G++ G L +A N GADLS + L ANL +
Sbjct: 141 NFKWANLYGAKLNSAKLFGAQLTGVSLRRAQLTGVNLCGADLSGVNVSEAKLMGANLEGS 200
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L + + L G + GA+ + A
Sbjct: 201 NLTGANFSAAQLRGVKLAGANLTGA 225
>gi|16331795|ref|NP_442523.1| hypothetical protein slr0516 [Synechocystis sp. PCC 6803]
gi|383323538|ref|YP_005384392.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326707|ref|YP_005387561.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492591|ref|YP_005410268.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437859|ref|YP_005652584.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
gi|451815947|ref|YP_007452399.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
gi|6226382|sp|Q55837.1|Y516_SYNY3 RecName: Full=Uncharacterized protein slr0516
gi|1001755|dbj|BAA10593.1| slr0516 [Synechocystis sp. PCC 6803]
gi|339274892|dbj|BAK51379.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
gi|359272858|dbj|BAL30377.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276028|dbj|BAL33546.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279198|dbj|BAL36715.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407960570|dbj|BAM53810.1| hypothetical protein BEST7613_4879 [Bacillus subtilis BEST7613]
gi|451781916|gb|AGF52885.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
Length = 166
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 5/88 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN---- 182
+ R N +A + SD SG+ +G L +A+ +AN TGA+LS+T + L EAN
Sbjct: 49 DLREFNLENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSETDLTEAALTEANLAGA 108
Query: 183 -LTNAVLVRTVLTRSDLGGAIIEGADFS 209
L+ A L R+ L DL GA ++GA+ +
Sbjct: 109 DLSGANLERSFLRDVDLTGANLKGANLA 136
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N AD+RE FN LE A +++ +GA+LS + R +L+ ANLT A L T
Sbjct: 44 NLAGADLRE-------FN---LENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSET 93
Query: 192 VLTRSDLGGAIIEGADFSDAVID 214
LT + L A + GAD S A ++
Sbjct: 94 DLTEAALTEANLAGADLSGANLE 116
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 16/121 (13%)
Query: 92 ADLNKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
ADL ++ E R S A +LR+A+ RAN T A++ E+D +
Sbjct: 48 ADLREFNLENARLNRSDLSGANLSGVNLRRAL-----LDRANLTGANLSETDLT------ 96
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+A +AN GADLS ++R L + +LT A L L ++L A + D +
Sbjct: 97 ----EAALTEANLAGADLSGANLERSFLRDVDLTGANLKGANLAWANLTAANLTDVDLEE 152
Query: 211 A 211
A
Sbjct: 153 A 153
>gi|325106774|ref|YP_004267842.1| pentapeptide repeat-containing protein [Planctomyces brasiliensis
DSM 5305]
gi|324967042|gb|ADY57820.1| pentapeptide repeat protein [Planctomyces brasiliensis DSM 5305]
Length = 194
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 72/159 (45%), Gaps = 16/159 (10%)
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
+K + E RAN + AD+ E+D G+ +GA L +A +A+ GADLS + L
Sbjct: 14 QKWLKGDEGGERANLSEADLSEADLRGADLSGANLSEADLSEADLRGADLSGANLSWANL 73
Query: 179 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYANGTNPIT 232
+ ANL+ A L L+ +DL A + GAD S A + A +A+ + G I
Sbjct: 74 SWANLSEADLSGANLSEADLSEADLRGADLSGANLRGANLSGANLSEAVARLDFGAWSIC 133
Query: 233 GVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGF 271
S+GC R + + L P D DGF
Sbjct: 134 VRKDVTSIGCRTYRNDRW-------LEWTPD---DVDGF 162
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 96 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
K++ +G+ G G A ADL +A + AN + AD+ E+D G+ +GA L
Sbjct: 12 KHQKWLKGDEG-GERANLSEADLSEADLRGADLSGANLSEADLSEADLRGADLSGANLSW 70
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
A AN + ADLS L+EA+L+ A L L+ ++L GA + GA+ S+AV L
Sbjct: 71 ANLSWANLSEADLSGA-----NLSEADLSEADLRGADLSGANLRGANLSGANLSEAVARL 125
>gi|94266259|ref|ZP_01289965.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93453141|gb|EAT03609.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 818
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N D+RE DF G++ +G ++A A+F+GADL + L A L A L R
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 225
L R+DLG A + AD + A + A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 67/152 (44%), Gaps = 28/152 (18%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----------EKAVA 158
AA AD A K NF AN T+A +R++D +G + A L +A
Sbjct: 375 AANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATL 434
Query: 159 YKANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+AN T GADLS+ +L A+L AVLVRT LT + L A + +
Sbjct: 435 IRANLTNASLREADLTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTL 489
Query: 209 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
SDA DL+ NG N G S +SL
Sbjct: 490 SDA--DLSNADLKEASLNGVNLGAGASVLQSL 519
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
SAA A L+K++ + S R +DF + + A A KANF G
Sbjct: 339 SAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSRADATGADFSKANFAGANL 398
Query: 166 -----------------ADLSDTLMD------RMVLNEANLTNAVLVRTVLTRSDLGGAI 202
A+L+D +D R L ANLTNA L LT +DL AI
Sbjct: 399 TAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREADLTGADLSNAI 458
Query: 203 IEGADFSDAVI 213
+ GAD +AV+
Sbjct: 459 LTGADLREAVL 469
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 25/113 (22%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS- 169
A G A LR+A+ RRA F D R+ D + F GA ++ NF+GADLS
Sbjct: 221 ANLGGARLRQAI-----LRRALFGETDARKVDARQADFRGATFQRG-----NFSGADLSR 270
Query: 170 ----DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 208
DT + +L E +L A L + L+R ++LGGA + GAD
Sbjct: 271 ARFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 7/123 (5%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
LA ++ E + RG G F A+LR A +F A+ AD+ E+D G+K G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A L + +A+ ADLS+ + R L A L A+L R + +D ADF
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255
Query: 211 AVI 213
A
Sbjct: 256 ATF 258
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A FG D RK + +FR R NF+ AD+ + F+ + +GA L++ A G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+DLS + + L +ANL A L L +DL A + AD S A DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344
Score = 43.9 bits (102), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F D+R +D G+ A L A A+ ADLS + R L ANLT+A+L T+
Sbjct: 529 FVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDLRWANLTDAILQGTI 588
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQ 217
L+ + L A A+F++A DL Q
Sbjct: 589 LSNASLNDANFNRANFAEA--DLTQ 611
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 2/121 (1%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADL A V+ + A+ A +++S +G+ +G+ L A A+F A+LS
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 228
++AN A L VL ++DL G + A+ +DA +D A +A AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440
Query: 229 N 229
N
Sbjct: 441 N 441
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 49/119 (41%), Gaps = 20/119 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAV--- 157
A ADLR A V N R N AD+ E+D S G++ A L +A+
Sbjct: 181 ADLSEADLRGAKLVGANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGE 240
Query: 158 -------AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A +A+F GA L+ A + L +L DL GA +EG+D S
Sbjct: 241 TDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEGSDLS 299
>gi|300023195|ref|YP_003755806.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
51888]
gi|299525016|gb|ADJ23485.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
51888]
Length = 282
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 3/113 (2%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
FG+ + + F ADL A+ N + F R +D SG+ +GA L +A + F+
Sbjct: 149 FGVFAGSNFAGADLTDAISAPLN--KTGFIEYIWR-TDLSGANLSGAQLTRANMTQTRFS 205
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
A L D + +L EA+L+ AVL L+ +DL GA + GAD + A +D A+
Sbjct: 206 FAVLRDASLHDTILREADLSGAVLTGADLSGADLTGADLSGADVTGANLDGAK 258
>gi|94266194|ref|ZP_01289904.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93453242|gb|EAT03697.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 818
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N D+RE DF G++ +G ++A A+F+GADL + L A L A L R
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 225
L R+DLG A + AD + A + A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 67/152 (44%), Gaps = 28/152 (18%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----------EKAVA 158
AA AD A K NF AN T+A +R++D +G + A L +A
Sbjct: 375 AANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATL 434
Query: 159 YKANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+AN T GADLS+ +L A+L AVLVRT LT + L A + +
Sbjct: 435 IRANLTNASLREADLTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTL 489
Query: 209 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
SDA DL+ NG N G S +SL
Sbjct: 490 SDA--DLSNADLKEASLNGVNLGAGASVLQSL 519
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
SAA A L+K++ + S R +DF + + A A KANF G
Sbjct: 339 SAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSRADATGADFSKANFAGANL 398
Query: 166 -----------------ADLSDTLMD------RMVLNEANLTNAVLVRTVLTRSDLGGAI 202
A+L+D +D R L ANLTNA L LT +DL AI
Sbjct: 399 TAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREADLTGADLSNAI 458
Query: 203 IEGADFSDAVI 213
+ GAD +AV+
Sbjct: 459 LTGADLREAVL 469
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 25/113 (22%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS- 169
A G A LR+A+ RRA F D R+ D + F GA ++ NF+GADLS
Sbjct: 221 ANLGGARLRQAI-----LRRALFGETDARKVDARQADFRGATFQRG-----NFSGADLSR 270
Query: 170 ----DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 208
DT + +L E +L A L + L+R ++LGGA + GAD
Sbjct: 271 ARFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 7/123 (5%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
LA ++ E + RG G F A+LR A +F A+ AD+ E+D G+K G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A L + +A+ ADLS+ + R L A L A+L R + +D ADF
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255
Query: 211 AVI 213
A
Sbjct: 256 ATF 258
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A FG D RK + +FR R NF+ AD+ + F+ + +GA L++ A G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+DLS + + L +ANL A L L +DL A + AD S A DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F D+R +D G+ A L A A+ ADLS + R L ANLT+A+L T+
Sbjct: 529 FVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDLRWANLTDAILQGTI 588
Query: 193 LTRSDLGGAIIEGADFSDAVIDLAQ 217
L+ + L A A+F++A DL Q
Sbjct: 589 LSNASLNDANFNRANFAEA--DLTQ 611
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 2/121 (1%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADL A V+ + A+ A +++S +G+ +G+ L A A+F A+LS
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 228
++AN A L VL ++DL G + A+ +DA +D A +A AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440
Query: 229 N 229
N
Sbjct: 441 N 441
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 49/119 (41%), Gaps = 20/119 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAV--- 157
A ADLR A V N R N AD+ E+D S G++ A L +A+
Sbjct: 181 ADLSEADLRGAKLVGANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGE 240
Query: 158 -------AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A +A+F GA L+ A + L +L DL GA +EG+D S
Sbjct: 241 TDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEGSDLS 299
>gi|451980423|ref|ZP_21928815.1| conserved hypothetical protein, contains pentapeptide repeats
[Nitrospina gracilis 3/211]
gi|451762323|emb|CCQ90046.1| conserved hypothetical protein, contains pentapeptide repeats
[Nitrospina gracilis 3/211]
Length = 289
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 64/136 (47%), Gaps = 30/136 (22%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GA------------ 151
S A+F A L++A N R+ F A M E++ +G +FN GA
Sbjct: 100 SGAKFHQALLKRAQFEGANLVRSEFLEAQMNEANLAGVRFNKSDLRGAMMIGINLAGAQI 159
Query: 152 ---YLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDL 198
+L K K + TG D L+ + + VL E N NA+L RT LT ++L
Sbjct: 160 PQSHLSKTNISKGDLTGTDVSGCNLTGSDLREAVLRETNFQNAILDRTFLKGADLTGANL 219
Query: 199 GGAIIEGADFSDAVID 214
GA + GADF++ V+D
Sbjct: 220 TGARLRGADFAETVLD 235
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 10/90 (11%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
++ +F R N + + R +D SG+KF+ A L+ +A F GA+L + +NEANL
Sbjct: 80 IRADFTRTNLSGVNFRNTDLSGAKFHQALLK-----RAQFEGANLVRSEFLEAQMNEANL 134
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
VR +SDL GA++ G + + A I
Sbjct: 135 AG---VR--FNKSDLRGAMMIGINLAGAQI 159
>gi|428312148|ref|YP_007123125.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253760|gb|AFZ19719.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 223
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 50/88 (56%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N N + D+R +DF G+ A L +A AN GA LS +++R VLN A L++A
Sbjct: 21 NLEGINLSDTDLRGADFRGADLFDANLARADLSDANLGGAILSRAVLNRAVLNRAVLSSA 80
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVID 214
+L L R+ L GA++ GA + A+++
Sbjct: 81 LLSNAFLNRAVLCGAVLRGAILNGAILN 108
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 58/125 (46%), Gaps = 6/125 (4%)
Query: 88 ISALADLNKYEAETRG-EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 146
++A+ L +Y A R +F DLR A +FR A+ A++ +D S +
Sbjct: 1 MNAIELLERYAAGERSFDFPNLEGINLSDTDLRGA-----DFRGADLFDANLARADLSDA 55
Query: 147 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
GA L +AV +A A LS L+ LN A L AVL +L + L GA + GA
Sbjct: 56 NLGGAILSRAVLNRAVLNRAVLSSALLSNAFLNRAVLCGAVLRGAILNGAILNGANLSGA 115
Query: 207 DFSDA 211
D A
Sbjct: 116 DLYHA 120
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 58/131 (44%), Gaps = 10/131 (7%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA----AQFGSADLRKAV 122
N V L A++ N + L+ + Y A G +G A A SA LR+A
Sbjct: 88 NRAVLCGAVLRGAILNGAILNGANLSGADLYHANLSGAL-LGYADLYHAYLNSALLREAD 146
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
R AN A++R ++ SG+ GA L AN GA+LSD L AN
Sbjct: 147 LYHAYLREANLFGANLRSANLSGADLTGANLMATNLRSANLFGANLSDA-----NLGGAN 201
Query: 183 LTNAVLVRTVL 193
+ A++ +T++
Sbjct: 202 MRCALICQTIM 212
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 5/89 (5%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEAN 182
RA A +R + +G+ NGA L A Y AN +G ADL ++ +L EA+
Sbjct: 87 LNRAVLCGAVLRGAILNGAILNGANLSGADLYHANLSGALLGYADLYHAYLNSALLREAD 146
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L +A L L ++L A + GAD + A
Sbjct: 147 LYHAYLREANLFGANLRSANLSGADLTGA 175
>gi|86608719|ref|YP_477481.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557261|gb|ABD02218.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 207
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 65/127 (51%), Gaps = 4/127 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L++A+ + N + A+ + A + +D G+ +G+ A ++A+ A+L
Sbjct: 68 ADLSGANLKEAILRQANLQAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQ 127
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
T + L ANL+ A L R +LTR+DL GA + AD A DL + A + A+ T
Sbjct: 128 TDLTGANLQVANLSGADLRRAILTRADLTGAKLHNADLRGA--DL--RGAFLEGADLTGA 183
Query: 231 ITGVSTR 237
+ TR
Sbjct: 184 LYNAQTR 190
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 56/113 (49%), Gaps = 1/113 (0%)
Query: 105 FGIGSAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
GI +AA F +L + + F + + ++ ++ G+ +GA L++A+ +AN
Sbjct: 26 LGIPTAAAFAQLELDAQLGRSQIVFPSKDCPACNLTGAELPGADLSGANLKEAILRQANL 85
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
ADLS +++ L ANL+ + L +DL A ++ D + A + +A
Sbjct: 86 QAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQTDLTGANLQVA 138
>gi|20090742|ref|NP_616817.1| hypothetical protein MA1892 [Methanosarcina acetivorans C2A]
gi|19915798|gb|AAM05297.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans
C2A]
Length = 560
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 62/122 (50%), Gaps = 11/122 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 165
A A+LR K N R + + AD+RE+D SG +GA L A +AN G
Sbjct: 389 ANLSGANLRGTNLSKANLREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNG 448
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKY 224
ADL+ + R LNEANL+ +T L +DL A + GA S+A + A+ K A +
Sbjct: 449 ADLNGIDLRRANLNEANLS-----KTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRK 503
Query: 225 AN 226
AN
Sbjct: 504 AN 505
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 5/91 (5%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
+A+ V N A+ + +D+R++ + N A L KA KAN + ADL M R L+
Sbjct: 272 QALLVINNLIGADLSESDLRDAFLHEAHLNEADLSKANLSKANLSEADLKGAYMRRANLS 331
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
EANL+ A L + DL GA + GAD ++
Sbjct: 332 EANLSKAKL-----SGVDLSGANLSGADLNE 357
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 165
A DLR+A N AN + ++ E+D S +K +GAYL +A A G
Sbjct: 449 ADLNGIDLRRA-----NLNEANLSKTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRK 503
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+LS+ ++ L EANL+ A L L+ DL GA + G +
Sbjct: 504 ANLSEADLNGADLREANLSEANLNGVDLSVIDLRGANLNGVNI 546
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 55/114 (48%), Gaps = 11/114 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRR-ANFTSADMRESD----------FSGSKFNGAYLEKAV 157
S A ADL + K + R AN + AD+ E+D SG+ G L KA
Sbjct: 346 SGANLSGADLNEFYLNKATYTRGANLSEADLSEADLSEANLKGANLSGANLRGTNLSKAN 405
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + +GADL + + + L+ ANL+ A L L+R++L GA + G D A
Sbjct: 406 LREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNGADLNGIDLRRA 459
>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
2638]
gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
Length = 1277
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 69/149 (46%), Gaps = 4/149 (2%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
+F AV+ + +++ L + EAE +G I G AD KA + N +
Sbjct: 1045 IFKGAQFPKAVLRDTNFDMAILEKTDFSEAELKGA-RINMCMISGKAD--KADFSQSNIK 1101
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV-LNEANLTNAVL 188
++ F ++ + +DFS + N + A+K NFT A+L R +++ +A L
Sbjct: 1102 KSIFKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSDFRHATL 1161
Query: 189 VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L SD G+ GADF + +ID +Q
Sbjct: 1162 HGAALRESDFTGSDFRGADFENGLIDNSQ 1190
Score = 47.0 bits (110), Expect = 0.008, Method: Composition-based stats.
Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 9/118 (7%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
IG +A F A LR+A + F +A F +D+ E++ + + F GA KAV NF
Sbjct: 1004 AIGMSADFSKASLRRADLSRGLFNKALFVESDLSEANGAQAIFKGAQFPKAVLRDTNFDM 1063
Query: 166 ADLSDTLMDRMVLNEANLT---------NAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A L T L A + A ++ + +S + + GADFS+A ++
Sbjct: 1064 AILEKTDFSEAELKGARINMCMISGKADKADFSQSNIKKSIFKASSLTGADFSEASVN 1121
Score = 45.1 bits (105), Expect = 0.035, Method: Composition-based stats.
Identities = 32/148 (21%), Positives = 65/148 (43%), Gaps = 4/148 (2%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVK 125
+F +++L A + S N S D++ ++ + G + F +D R A
Sbjct: 1104 IFKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSDFRHATLHG 1163
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
R ++FT +D R +DF + + L +A + GA + + ++ + AN+
Sbjct: 1164 AALRESDFTGSDFRGADFENGLIDNSQLVRANLNGVSAKGARFTKSNLEGASMRAANVHM 1223
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + L +DL G+ + DF +V+
Sbjct: 1224 GGMRKARLVDTDLRGSNLFAVDFYKSVL 1251
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 34/108 (31%), Positives = 46/108 (42%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S ADL K K NF+ A F A +DFS + A L + + KA F
Sbjct: 972 SGLDLSGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALF 1031
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+DLS+ + + A AVL T + AI+E DFS+A
Sbjct: 1032 VESDLSEANGAQAIFKGAQFPKAVLRDT-----NFDMAILEKTDFSEA 1074
Score = 37.7 bits (86), Expect = 6.0, Method: Composition-based stats.
Identities = 39/141 (27%), Positives = 58/141 (41%), Gaps = 22/141 (15%)
Query: 94 LNKYEAETRGEFGIGSAAQFG-SADLRKAVHVKENFRR-----ANFTSADMRESDFSGSK 147
L K EA+ + A+ G SAD +A+ +E +R + A + D SG
Sbjct: 917 LKKLEAKELPDAAKAKLAEHGLSADSLRAL-TREEVQRYHEQGKSLVGAVLSGVDLSGLD 975
Query: 148 FNGAYLEKAVAYKANFTGA---------------DLSDTLMDRMVLNEANLTNAVLVRTV 192
+GA L K K NF GA D S + R L+ A+ V +
Sbjct: 976 LSGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALFVESD 1035
Query: 193 LTRSDLGGAIIEGADFSDAVI 213
L+ ++ AI +GA F AV+
Sbjct: 1036 LSEANGAQAIFKGAQFPKAVL 1056
>gi|193083812|gb|ACF09494.1| pentapeptide repeat protein [uncultured marine crenarchaeote
SAT1000-23-F7]
Length = 741
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 61/110 (55%), Gaps = 13/110 (11%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NFR +NFTS ++ ++F+ +GA L + TGADL + L+ A+L+N
Sbjct: 495 NFRESNFTSTNIANANFTSVNLSGADLSMKDLTENILTGADLRNA-----NLSGADLSNN 549
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA----VID---LAQKQALCKYANGTN 229
LV T+LT +DL AI+ GAD S A +ID + QK L K AN TN
Sbjct: 550 QLVNTILTGADLTDAILSGADLSTANIFGIIDGINILQKTKL-KGANFTN 598
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 14/114 (12%)
Query: 107 IGSAAQFGSAD----LRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLEKAV 157
+ +A FG D L+K NF AN T+ D+ E+ G+ G LEKA
Sbjct: 571 LSTANIFGIIDGINILQKTKLKGANFTNANLTNINLIGVDISETILKGADLTGVKLEKAK 630
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+N DLS + ++ L ++NL+ RT+L+ +DL A + GA+ SDA
Sbjct: 631 VNNSNLEDLDLSFKNLSKIRLVDSNLS-----RTILSGADLSNAELMGANLSDA 679
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 15/119 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A DL + + + R AN + AD+ + + GA L A+ +GADL
Sbjct: 517 SGADLSMKDLTENILTGADLRNANLSGADLSNNQLVNTILTGADLTDAI-----LSGADL 571
Query: 169 SD----------TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
S ++ + L AN TNA L L D+ I++GAD + ++ A+
Sbjct: 572 STANIFGIIDGINILQKTKLKGANFTNANLTNINLIGVDISETILKGADLTGVKLEKAK 630
>gi|300867251|ref|ZP_07111911.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334728|emb|CBN57077.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 520
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 9/117 (7%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADLN+ A+ RG +A+LR+A + N A+ A++R +D +G+ GA
Sbjct: 165 ADLNR--ADLRG-------VNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLTGA 215
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L++A AN GA+LS + +L A+LT A L+R +DL GA + GA
Sbjct: 216 DLDEARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKL 272
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A LR + V N RAN AD+ +D G + A L +A +AN +GADL
Sbjct: 140 ADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 217
+ LN A+LT A L L+ ++L GA + A+ +A++ DL Q
Sbjct: 200 ANLRWADLNGADLTGADLDEARLSGANLYGANLSSANLLNAILVHADLTQ 249
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 47/80 (58%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + A++ + +F +K N A L A +AN +GA L+ + R LN A+L+ A L+R
Sbjct: 46 NMSGANLSDVNFRKAKLNVARLSGANLSRANLSGAILNVANLIRADLNSADLSEATLIRA 105
Query: 192 VLTRSDLGGAIIEGADFSDA 211
L R+D+ A + GA+ S+A
Sbjct: 106 ELIRADMSNASLSGANLSEA 125
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 5/127 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL +A N A A++ +++ SG+ GA L A A+ TGADL +
Sbjct: 160 ANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLTGADLDE 219
Query: 171 TLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
+ L+ ANL NA+LV LT+++L A GAD + A + A+ + ++
Sbjct: 220 ARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKLYGVSRFG 279
Query: 226 NGTNPIT 232
+ IT
Sbjct: 280 LKADDIT 286
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADL +A + RA ADM + SG+ + A L + +AN ADLS
Sbjct: 90 ADLNSADLSEATLI-----RAELIRADMSNASLSGANLSEADLREGTLRQANLEQADLSG 144
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L ANL A L R L R+DL G + A+ A
Sbjct: 145 AHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQA 185
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 50/108 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F A L A N RAN + A + ++ + N A L +A +A AD+
Sbjct: 53 SDVNFRKAKLNVARLSGANLSRANLSGAILNVANLIRADLNSADLSEATLIRAELIRADM 112
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S+ + L+EA+L L + L ++DL GA + G+ A ++ A
Sbjct: 113 SNASLSGANLSEADLREGTLRQANLEQADLSGAHLRGSSLVSANLERA 160
>gi|383482351|ref|YP_005391265.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
85-930]
gi|378934705|gb|AFC73206.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
85-930]
Length = 959
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 10/118 (8%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ---KQALCKYAN 226
+ + EAN NA++ R LT++D A++E AD ++ A+ K+A+ K AN
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKADFTKALLENADMQ--AVEAAEAIFKEAILKQAN 665
Score = 38.1 bits (87), Expect = 4.5, Method: Composition-based stats.
Identities = 28/96 (29%), Positives = 44/96 (45%), Gaps = 10/96 (10%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------AN 162
F ADL+K+ + RA D+ E++ + SKFN + A A K +N
Sbjct: 404 FLFADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIIKNSEWKNSN 463
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
TG L+ M R+ + L NA+L + + +DL
Sbjct: 464 LTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDL 499
Score = 37.7 bits (86), Expect = 5.7, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R F AD+++S S + AY+ K +A T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNVGFLFADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVM 443
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M +++++ + N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKLIIKNSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|424851694|ref|ZP_18276091.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
gi|356666359|gb|EHI46430.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
Length = 194
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYL 153
+E R E I + F ADL ++ HV FR +FT + S+F GS+F+ L
Sbjct: 31 SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 90
Query: 154 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 203
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 91 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 150
Query: 204 EGADFSDAVID 214
+GAD A ID
Sbjct: 151 DGADLRGARID 161
>gi|432333149|ref|ZP_19584958.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis
IFP 2016]
gi|430779982|gb|ELB95096.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis
IFP 2016]
Length = 220
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 51/108 (47%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADLR R A+ TS +M E D SG+ A L A AN ADL+D
Sbjct: 34 ANLRNADLRLGFLRDATLRNADLTSCNMYEVDLSGANLYLAQLSGAHMTGANLNNADLTD 93
Query: 171 TLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T + + +L E L A L R L +DL GA + G D SDA +
Sbjct: 94 TKLIKTQLSGAMLIEVELDGADLSRAFLQNADLTGAHLRGTDLSDATL 141
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 65/150 (43%), Gaps = 16/150 (10%)
Query: 95 NKYEAETRGE---FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N YE + G S A A+L A + + A + E + G+ + A
Sbjct: 60 NMYEVDLSGANLYLAQLSGAHMTGANLNNADLTDTKLIKTQLSGAMLIEVELDGADLSRA 119
Query: 152 YLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLTRSDLGGAIIEGA 206
+L+ A A+ G DLSD + + M N EA L +A L LT +DL GA + GA
Sbjct: 120 FLQNADLTGAHLRGTDLSDATLVGAELMATNLAEAELVDADLTDADLTFADLTGADLRGA 179
Query: 207 -----DFSDAVI---DLAQKQALCKYANGT 228
DF+DA + DL Q +Y + T
Sbjct: 180 NLTRTDFTDADLTGADLGTTQDKARYDDTT 209
>gi|17230606|ref|NP_487154.1| hypothetical protein all3114 [Nostoc sp. PCC 7120]
gi|17132208|dbj|BAB74813.1| all3114 [Nostoc sp. PCC 7120]
Length = 576
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 45/78 (57%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + A + +D S +K NGA L A A F GADLS + +VLN+A+L+ +L
Sbjct: 421 NLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEA 480
Query: 192 VLTRSDLGGAIIEGADFS 209
LT +DL AI+ G DFS
Sbjct: 481 DLTGADLSDAILLGTDFS 498
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 64/117 (54%), Gaps = 12/117 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A FG A+L N AN +SAD+ ++ +G+ +GA L++A + + ADL
Sbjct: 288 TGADFGDANLSSV-----NLSGANLSSADLSSANLTGANLSGANLQRA-----DLSRADL 337
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCK 223
S ++++ + ANL+ L L R++L AI+ GA+ SDA ++ A + LC+
Sbjct: 338 SSSILNDGEFSHANLSGVNLRDAELRRANLSNAILFGANLSDANLNHADLSRADLCR 394
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 51/90 (56%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 181
NF+ A +A++ +FSG+ +GAYL A ANF TGAD D + + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANLSSVNLSGA 305
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NL++A L LT ++L GA ++ AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLQRADLSRA 335
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 125 KENFRRAN---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
K N+ R N F AD+ D +G N A L + +A+ TGADLSD ++ + A
Sbjct: 441 KLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEADLTGADLSDAILLGTDFSFA 500
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
NL +A L + S+L GAI+ GAD S A + A
Sbjct: 501 NLNSANL-----SGSNLSGAILNGADLSSANLSYA 530
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 48/106 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SADL A N AN AD+ +D S S N A N A+L
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAEL 362
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ +L ANL++A L L+R+DL A + GAD + A ++
Sbjct: 363 RRANLSNAILFGANLSDANLNHADLSRADLCRADLSGADLTHATLN 408
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 55/110 (50%), Gaps = 5/110 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 163
S+A SA+L A N +RA+ + AD+ S +FS + +G L A +AN
Sbjct: 308 SSADLSSANLTGANLSGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAELRRANL 367
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ A L + LN A+L+ A L R L+ +DL A + G + SD ++
Sbjct: 368 SNAILFGANLSDANLNHADLSRADLCRADLSGADLTHATLNGTNLSDTIL 417
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 18/115 (15%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
GEF S A +LR A RRAN ++A + ++ S + N A L +A +A+
Sbjct: 345 GEF---SHANLSGVNLRDA-----ELRRANLSNAILFGANLSDANLNHADLSRADLCRAD 396
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+GADL+ LN NL++ +L T +L AI+E AD S A ++ A+
Sbjct: 397 LSGADLT-----HATLNGTNLSDTILFST-----NLSDAILEAADLSYAKLNGAK 441
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
V E R NF A + ++ +G F+GA L A AN TGA+ D + +ANL
Sbjct: 238 VGEFLRGGNFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANL 297
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ L L+ +DL A + GA+ S A
Sbjct: 298 SSVNLSGANLSSADLSSANLTGANLSGA 325
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L+ +N +AE R + +A FG A+L A + RA+ AD+ +D + + NG
Sbjct: 352 LSGVNLRDAELR-RANLSNAILFG-ANLSDANLNHADLSRADLCRADLSGADLTHATLNG 409
Query: 151 AYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
L + + N + ADLS ++ LN A L A+ + L+ DL G ++
Sbjct: 410 TNLSDTILFSTNLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLND 469
Query: 206 ADFSDAVI 213
AD S ++
Sbjct: 470 ADLSGGIL 477
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
GI S A ADL A+ + +F AN SA++ S+ SG+ NGA L A A
Sbjct: 475 GILSEADLTGADLSDAILLGTDFSFANLNSANLSGSNLSGAILNGADLSSANLSYAILDD 534
Query: 166 ADLSDTLMDRMVLNE 180
D+S+ ++ M E
Sbjct: 535 TDISEANLEEMTWGE 549
>gi|67459256|ref|YP_246880.1| hypothetical protein RF_0864 [Rickettsia felis URRWXCal2]
gi|67004789|gb|AAY61715.1| Uncharacterized low-complexity protein [Rickettsia felis URRWXCal2]
Length = 959
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT++D A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAQEANFKNAIMQRADLTKADFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 43.5 bits (101), Expect = 0.097, Method: Composition-based stats.
Identities = 28/111 (25%), Positives = 54/111 (48%), Gaps = 5/111 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+L+ AV N R A F AD++ S S + AY+ K +A T + +
Sbjct: 382 ANFEGANLQNAVFQNVNARNAGFLFADLKNSKIENSDMSRAYMPKVDLSEAEVTNSKFNA 441
Query: 171 TLM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+M +++++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 442 VMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
Score = 39.3 bits (90), Expect = 1.8, Method: Composition-based stats.
Identities = 30/106 (28%), Positives = 47/106 (44%), Gaps = 10/106 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK---------- 160
A F ADL+ + + RA D+ E++ + SKFN + A A K
Sbjct: 402 AGFLFADLKNSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIIKDSEWKN 461
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+N TG L+ M R+ + L NA+L + + +DL A + A
Sbjct: 462 SNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDLENAFMNNA 507
>gi|428304969|ref|YP_007141794.1| heat shock protein DnaJ domain-containing protein [Crinalium
epipsammum PCC 9333]
gi|428246504|gb|AFZ12284.1| heat shock protein DnaJ domain protein [Crinalium epipsammum PCC
9333]
Length = 242
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 132 NFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N + AD++E DFSG + + A L A +K N GA+L + R L +ANL+NA
Sbjct: 128 NMSGADLKEKDFSGRNLSDANLSHANLSDAFLHKVNLQGANLYKANLFRANLLQANLSNA 187
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
L L +DL GA + GAD + A I K
Sbjct: 188 CLREANLIGADLSGADLRGADLTGAKIGFNDK 219
>gi|374723788|gb|EHR75868.1| Pentapeptide repeats containing protein [uncultured marine group II
euryarchaeote]
Length = 148
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 49/106 (46%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LRK H NFRR AD+ E DFS F A + + K+ F GAD + +
Sbjct: 35 LRKGRHAGSNFRRGILDGADLTEGDFSNCDFRKASMYEVDLMKSAFDGADFRGADLRKAR 94
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
LN +N N L AI +G+D+ +A +D ++AL K
Sbjct: 95 LNLSNFRNCKFAGADLRGIRGKYAIWQGSDWWNATMDEGLEKALAK 140
>gi|428307284|ref|YP_007144109.1| serine/threonine protein kinase with pentapeptide repeats
[Crinalium epipsammum PCC 9333]
gi|428248819|gb|AFZ14599.1| serine/threonine protein kinase with pentapeptide repeats
[Crinalium epipsammum PCC 9333]
Length = 564
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 54/92 (58%), Gaps = 5/92 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ +F N ++ ++++++ SG F+ A L + NF GA+LS+T M + L+ A L
Sbjct: 437 RRDFGEQNLSNLNLQKANLSGGNFHQANLTQT-----NFQGANLSNTDMGQTSLSGAMLR 491
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+A LVR L+ +DL GA + GAD S A + A
Sbjct: 492 DANLVRAYLSYADLEGADLRGADLSFAYFNYA 523
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 49/117 (41%), Gaps = 10/117 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A + +A + NF+ AN ++ DM ++ SG+ A L +A A+ GADL
Sbjct: 453 ANLSGGNFHQANLTQTNFQGANLSNTDMGQTSLSGAMLRDANLVRAYLSYADLEGADLRG 512
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
+ N ANL A +L GA + GA +D I A+ NG
Sbjct: 513 ADLSFAYFNYANLRGA----------NLCGANLTGAKINDEQIAQAKTNWATVLPNG 559
>gi|318042736|ref|ZP_07974692.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 164
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + + R A+ AD+R S+ G+ +GA L A+ + + ADLSD
Sbjct: 54 ADLSGLLLNGIDLRDADLRGADLRGSNLEGADLSGADLRGAMLQDSWLSNADLSD----- 108
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L +ANL +AVL++ + L GA++ GADF+
Sbjct: 109 VDLRQANLRDAVLIQALTPGLQLEGAVLIGADFT 142
>gi|170751525|ref|YP_001757785.1| pentapeptide repeat-containing protein [Methylobacterium
radiotolerans JCM 2831]
gi|170658047|gb|ACB27102.1| pentapeptide repeat protein [Methylobacterium radiotolerans JCM
2831]
Length = 456
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 185
A F A MR +D SG+ +G A + A+F+GAD DT+ +D L +ANLT+
Sbjct: 141 ARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTH 200
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A LT++ L G+ + GA F+ A +D
Sbjct: 201 ADFAEASLTKASLAGSRLRGAHFTGAKLD 229
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 47/105 (44%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L AV F A +AD+ E+D SG+ F G VA + F GA L
Sbjct: 84 SGANLRGASLTGAVGRSTRFTGAILEAADLSEADLSGADFTG-----IVAGQVKFAGAML 138
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
D + A+L+ A+L T +DL GA GAD D V
Sbjct: 139 EDARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVF 183
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A L +A N A+F A + ++ +GS+ GA+ A A+ +GADLSDT
Sbjct: 183 FRGARLDEAKLADANLTHADFAEASLTKASLAGSRLRGAHFTGAKLDGADLSGADLSDTD 242
Query: 173 MDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIE 204
+ R+ L A A L T ++ LGGA+ E
Sbjct: 243 LVRLNLATCRLRHARFAGAWLNGTRMSVEQLGGAVGE 279
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ + A M E+D SG+ GA L AV FTGA +++ L+EA+L+ A
Sbjct: 71 ADLSRARMEEADLSGANLRGASLTGAVGRSTRFTGA-----ILEAADLSEADLSGADFTG 125
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLA 216
V + GA++E A F +A + A
Sbjct: 126 IVAGQVKFAGAMLEDARFGEAAMRFA 151
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 51/106 (48%), Gaps = 15/106 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 165
A+FG A +R A +F AD+ +DFSG+ F GA L++A AN T
Sbjct: 141 ARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTH 200
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AD + EA+LT A L + L + GA ++GAD S A
Sbjct: 201 ADFA----------EASLTKASLAGSRLRGAHFTGAKLDGADLSGA 236
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 10/98 (10%)
Query: 124 VKENFRRANFTSADMRESDFSG----------SKFNGAYLEKAVAYKANFTGADLSDTLM 173
V + RA AD+ ++ G ++F GA LE A +A+ +GAD + +
Sbjct: 69 VGADLSRARMEEADLSGANLRGASLTGAVGRSTRFTGAILEAADLSEADLSGADFTGIVA 128
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ A L +A + +DL GA+++G DF+ A
Sbjct: 129 GQVKFAGAMLEDARFGEAAMRFADLSGALLDGTDFAGA 166
>gi|162450958|ref|YP_001613325.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
gi|161161540|emb|CAN92845.1| Hypothetical WD-repeat protein [Sorangium cellulosum So ce56]
Length = 2305
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 40/130 (30%), Positives = 60/130 (46%), Gaps = 18/130 (13%)
Query: 97 YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDF---------- 143
+ ET G G+ Q DLR A N R AN + AD+ +D
Sbjct: 1111 WAEETAGWISEGADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAML 1170
Query: 144 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
SG+K +G L +A+A++A+FT A+ +++ A L+ AVL ++ T GA
Sbjct: 1171 SGAKLHGTILRRAIAHRADFTQAEAKGAIVEL-----AKLSGAVLRQSTWTGCRWNGAQA 1225
Query: 204 EGADFSDAVI 213
EG D S +I
Sbjct: 1226 EGTDLSACLI 1235
Score = 41.2 bits (95), Expect = 0.54, Method: Composition-based stats.
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 16/142 (11%)
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEANLTNAVLVR 190
AD+ +G GA L A AN +GADLS D + +L+ A L +L R
Sbjct: 1123 ADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAMLSGAKLHGTILRR 1182
Query: 191 TVLTRSDL-----GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 245
+ R+D GAI+E A S AV+ + C++ T +S C +
Sbjct: 1183 AIAHRADFTQAEAKGAIVELAKLSGAVLRQSTWTG-CRWNGAQAEGTDLS-----ACLIA 1236
Query: 246 RRNAYGSPSSPLLSAPPQKLLD 267
R A+ + L + PP +D
Sbjct: 1237 GRGAHPERARRLAATPPLAHVD 1258
>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
Length = 319
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
Q ADLR +FR +F+ A++RE DF+G+ AYL +A N TGA+L T
Sbjct: 25 QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLTGANLEGT 84
Query: 172 LMDRMVLNEAN-----LTNAVLVRTVLTRSD----------LGGAIIEGADFSDAVID 214
+ ++ L +AN + A L LT+SD L G + GA DA D
Sbjct: 85 SLIKIYLIKANCYQTDFSGAYLTGAYLTKSDFKEAKFNGAYLNGTKLSGAKLGDAYYD 142
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 115 SADLRKAVHV-KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 173
S DL++ + NF+ AD+R + S + F G A + +FTGADL D +
Sbjct: 7 SIDLKERYEKGQRNFQEFQLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYL 66
Query: 174 DRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ L NLT A L T L + +D GA + GA + + D + +
Sbjct: 67 NEADLTGVNLTGANLEGTSLIKIYLIKANCYQTDFSGAYLTGAYLTKS--DFKEAKFNGA 124
Query: 224 YANGT 228
Y NGT
Sbjct: 125 YLNGT 129
>gi|119489371|ref|ZP_01622151.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
gi|119454644|gb|EAW35790.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
Length = 166
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 5/92 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N + AN + A + +FS + +GA L A +ANFT A+LS+ + L A LTNA
Sbjct: 68 NLKGANLSGALLDNVNFSQADLSGANLSSAALTQANFTEANLSEANLTGAFLRSAILTNA 127
Query: 187 VLVRTVLTRSDLG-----GAIIEGADFSDAVI 213
L L ++DL GA I+GADF +A++
Sbjct: 128 KLTNASLNKADLNTAKLEGAEIKGADFKEAIM 159
>gi|448449600|ref|ZP_21591825.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
13561]
gi|445813229|gb|EMA63210.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
13561]
Length = 822
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL AV + A+ + E+D SG+ GA L +A+ T ADLS+
Sbjct: 178 ASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKEADLTNADLSN 237
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ R+ L +A+L AVL +T +DL GA++ AD
Sbjct: 238 ADLYRVDLTDADLEGAVLTDADITDADLEGAVLTDADL 275
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 21/148 (14%)
Query: 85 SSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKE-----NFRRANFTSADM 138
S +I ADL+K + G + A G A+L A V+ N R A+ T AD+
Sbjct: 16 SEDIEPSADLSKVDLSDADLSGADLTNAYLGGANLSNATLVEADLTGANLRDADLTDADL 75
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT------- 191
+D + + G L A +A+ T A L + +L EA+LT+A L RT
Sbjct: 76 YRTDLTDAYLEGVNLSGATPVEADLTDASLKRANLSSTILMEADLTDADLYRTDFTDAYL 135
Query: 192 --------VLTRSDLGGAIIEGADFSDA 211
L+ SDL A +EGA+ +DA
Sbjct: 136 EGANLTNAYLSGSDLTNAYLEGANLTDA 163
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 69/147 (46%), Gaps = 10/147 (6%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L + N EA+ G G+ A LR+A N + T+A +RE+D +G+ G
Sbjct: 450 LTNANLREADLTGAHLKGT--DLTDASLREADLTDVNLEEIDLTNASLREADLTGAHLEG 507
Query: 151 -----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
A+LE AN ADL+ +++ L ANLT+A L L+ +DL + G
Sbjct: 508 VDLTGAHLEGIDLTSANLNQADLTSANLNQADLRGANLTDASLREANLSGADLTDTELSG 567
Query: 206 ADFSDAVI---DLAQKQALCKYANGTN 229
AD S + DL + ++L +G N
Sbjct: 568 ADLSRTDLEKSDLHKSKSLPTNLSGAN 594
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 49/105 (46%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
E + + A ADL AV + + T A+++ +D +G+ A L A A
Sbjct: 251 EGAVLTDADITDADLEGAVLTDADLEGTDLTGANLKVADLTGANLKVADLTGADLEDAVL 310
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
T ADL T + L A+LT A L LT DLGGA++ AD
Sbjct: 311 TDADLERTDLIEASLLSADLTGASLKEADLTEVDLGGAVLTDADL 355
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 5/105 (4%)
Query: 111 AQFGSADLRKAVHVKEN-----FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A DL V N FR ++ T A +R SD S + GA+LE A+
Sbjct: 378 ADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTDASLRE 437
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
ADL+D ++ + L ANL A L L +DL A + AD +D
Sbjct: 438 ADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTD 482
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A LR+A N + T+A++RE+D +G+ G L A +A+ T +L + +
Sbjct: 433 ASLREADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTDVNLEEIDLTN 492
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L EA+LT A L DL GA +EG D + A
Sbjct: 493 ASLREADLTGAHLEGV-----DLTGAHLEGIDLTSA 523
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 51/96 (53%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L +AV + A+ A + ++D SG+ L +A A+ TGA+L +
Sbjct: 168 AELPRAVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKE 227
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L A+L+NA L R LT +DL GA++ AD +DA
Sbjct: 228 ADLTNADLSNADLYRVDLTDADLEGAVLTDADITDA 263
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 2/100 (2%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL AV + A+ A + ++D G+ GA L+ A AN ADL+ ++
Sbjct: 248 ADLEGAVLTDADITDADLEGAVLTDADLEGTDLTGANLKVADLTGANLKVADLTGADLED 307
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
VL +A+L L+ L +DL GA ++ AD ++ +DL
Sbjct: 308 AVLTDADLERTDLIEASLLSADLTGASLKEADLTE--VDL 345
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A + N AD+ ++D + F AYLE A A +G+DL++ ++
Sbjct: 98 ADLTDASLKRANLSSTILMEADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEG 157
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L +A+ A L R VLT + L GA + GA +D
Sbjct: 158 ANLTDASPIGAELPRAVLTDASLLGADLPGAVLTD 192
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADL A + + + A + ++D G+ AYL A+ ADL++
Sbjct: 323 ASLLSADLTGASLKEADLTEVDLGGAVLTDADLEGTALTEAYLPSPDLTGASLKEADLTE 382
Query: 171 TLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ VL +ANL T+A L + L+ +DL GA +EG D +DA +
Sbjct: 383 VDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTDASL 435
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 5/93 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL K ++ + A+ T A++R + A L A Y+ + T ADL +
Sbjct: 198 ADLIKTGLIEADLSGADLTGANLRHGRLKEADLTNADLSNADLYRVDLTDADL-----EG 252
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
VL +A++T+A L VLT +DL G + GA+
Sbjct: 253 AVLTDADITDADLEGAVLTDADLEGTDLTGANL 285
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 3/131 (2%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR + + A+ ++AD+ D + + GA L A A+ GA L
Sbjct: 211 SGADLTGANLRHGRLKEADLTNADLSNADLYRVDLTDADLEGAVLTDADITDADLEGAVL 270
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+D ++ L ANL A L L +DL GA +E A +DA + ++ L + + +
Sbjct: 271 TDADLEGTDLTGANLKVADLTGANLKVADLTGADLEDAVLTDADL---ERTDLIEASLLS 327
Query: 229 NPITGVSTRKS 239
+TG S +++
Sbjct: 328 ADLTGASLKEA 338
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A + +F A A++ + SGS AYLE A A+ GA+L R
Sbjct: 118 ADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEGANLTDASPIGAELP-----R 172
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGA------IIE----GADFSDAVIDLAQKQALCKYA 225
VL +A+L A L VLT +DL GA +IE GAD + A + + K A
Sbjct: 173 AVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANL----RHGRLKEA 228
Query: 226 NGTN 229
+ TN
Sbjct: 229 DLTN 232
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 7/131 (5%)
Query: 89 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
+ L D N +E RG I A+ GS DL + + T A +RE+D +
Sbjct: 388 TVLTDANLRFSEFRGS-DITDASLRGS-DLSNTDLTGAHLEGIDLTDASLREADLTDVNL 445
Query: 149 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAII 203
L A +A+ TGA L T + L EA+LT+ L LT +DL GA +
Sbjct: 446 EEIDLTNANLREADLTGAHLKGTDLTDASLREADLTDVNLEEIDLTNASLREADLTGAHL 505
Query: 204 EGADFSDAVID 214
EG D + A ++
Sbjct: 506 EGVDLTGAHLE 516
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 5/103 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSD 170
A L++A + + T A++R S+F GS A L + + TGA DL+D
Sbjct: 373 ASLKEADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTD 432
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L + NL L L +DL GA ++G D +DA +
Sbjct: 433 ASLREADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASL 475
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
++A ADLR A + R AN + AD+ +++ SG+ + LEK+ +K+ +L
Sbjct: 531 TSANLNQADLRGANLTDASLREANLSGADLTDTELSGADLSRTDLEKSDLHKSKSLPTNL 590
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
S + + L+E NL++ L R L +L G
Sbjct: 591 SGANLRGLNLSEQNLSSVNLSRADLRDVNLIG 622
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 20/93 (21%)
Query: 131 ANFTSADMRESDFSGSKFNGAY-----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
A+ + D+ ++D SG+ AY L A +A+ TGA+L D A+LT+
Sbjct: 23 ADLSKVDLSDADLSGADLTNAYLGGANLSNATLVEADLTGANLRD----------ADLTD 72
Query: 186 AVLVRTVLTRSDLGGAIIEG-----ADFSDAVI 213
A L RT LT + L G + G AD +DA +
Sbjct: 73 ADLYRTDLTDAYLEGVNLSGATPVEADLTDASL 105
>gi|427724799|ref|YP_007072076.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427356519|gb|AFY39242.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 276
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 64/138 (46%), Gaps = 16/138 (11%)
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDR 175
AV K N A +A++R +D G+ GAYL AN F+GA+L + +
Sbjct: 135 AVGPKANLSGAYLNTANLRGADLQGANLRGAYLSGTDFTGANLTGVAFSGANLKRSFLTG 194
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIE------GADFSDAV-IDLAQKQALC----KY 224
L EA L N L L +DL GA++E GADFSD + +++ LC K
Sbjct: 195 ACLREARLINVELEMADLRGADLTGAMLEQIESLAGADFSDVRGLSDSERSYLCSRSPKE 254
Query: 225 ANGTNPITGVSTRKSLGC 242
N T +TR SL C
Sbjct: 255 LGTWNSFTRKNTRASLNC 272
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 11/98 (11%)
Query: 123 HVKENFRRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
VKE R N +A++ + D +G + A L+ A+ NFTGA L+ + L A
Sbjct: 17 EVKEILERGNSLENANLEDLDLAGYDLSDANLQGAILIGVNFTGATLAGAQLQNADLRRA 76
Query: 182 NLTN----------AVLVRTVLTRSDLGGAIIEGADFS 209
NLTN A L RT+L DL GA+++GA+ +
Sbjct: 77 NLTNASLKGATLSEAYLQRTILNDCDLAGAVLDGANLT 114
>gi|307944130|ref|ZP_07659471.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
gi|307772476|gb|EFO31696.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
Length = 534
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 53/113 (46%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+ ADLR A + + R A AD+R + +K GA L++A +A+
Sbjct: 63 LAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKLQQAKLWGADLQEADLQEADLR 122
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
GADL + L A L A L L +DL GA + GAD A ++ A+
Sbjct: 123 GADLRGAKLQEADLRGAKLQEADLRGAKLQEADLRGAKLRGADLRGAKLEWAK 175
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ ADLR A+ + + A+ A ++++D G+K A L A +A GADL +
Sbjct: 54 AKLQQADLRLAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKLQQAKLWGADLQE 113
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L EA+L A L L +DL GA ++ AD A +
Sbjct: 114 A-----DLQEADLRGADLRGAKLQEADLRGAKLQEADLRGAKL 151
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 75/183 (40%), Gaps = 19/183 (10%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVH 123
W L A + ++ L + EA+ RG + A+ ADLR A
Sbjct: 42 EWADLWGANLQQAKLQQADLRLAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKL 101
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
+ A+ AD++E+D G+ GA L+ +A+ GA L + + L EA+L
Sbjct: 102 QQAKLWGADLQEADLQEADLRGADLRGAKLQ-----EADLRGAKLQEADLRGAKLQEADL 156
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ------ALCKYANGTNPITGVSTR 237
A L +DL GA +E A A ++ A + A+ +A TG T+
Sbjct: 157 RGA-----KLRGADLRGAKLEWAKLEWAKLEWADVRTVKSSLAVSGFARADFTHTGYLTQ 211
Query: 238 KSL 240
K +
Sbjct: 212 KQV 214
>gi|119356056|ref|YP_910700.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
DSM 266]
gi|119353405|gb|ABL64276.1| pentapeptide repeat protein [Chlorobium phaeobacteroides DSM 266]
Length = 446
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +ADLR + + ++A+ AD+RE+ A++EK++ KAN A+L
Sbjct: 82 SGANLNNADLRGSNLQQAFIKKADLKGADLREAYLVKVNLKEAFMEKSMLQKANLQSANL 141
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
T R L +NL +AVL T +DL GA ++GA
Sbjct: 142 RWTRFHRADLAGSNLQDAVLFETSFVDADLRGANLKGA 179
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTG 165
A F A+L +A+ + +A+F ADM++ G+ +GA ++E A AN +G
Sbjct: 307 ADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDRSFMEGADLRNANLSG 366
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+L ++ L+ ANL+ A L T L ++L GA ++GA+
Sbjct: 367 ANLFGAMLKDANLSGANLSGASLFETDLEGANLSGANLKGANL 409
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 49/96 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ +L+KA +F AN A M +D S + F A ++K AN +GA+L
Sbjct: 292 ARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDR 351
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+ M+ L ANL+ A L +L ++L GA + GA
Sbjct: 352 SFMEGADLRNANLSGANLFGAMLKDANLSGANLSGA 387
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGAD---- 167
DL KA + AN +++ + ++ SG+ N G+ L++A KA+ GAD
Sbjct: 55 DLDKAKLEDADLEGANLSNSSLVRAELSGANLNNADLRGSNLQQAFIKKADLKGADLREA 114
Query: 168 ------LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
L + M++ +L +ANL +A L T R+DL G+ ++ A
Sbjct: 115 YLVKVNLKEAFMEKSMLQKANLQSANLRWTRFHRADLAGSNLQDA 159
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 46/89 (51%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+E A +++++ G+ F A L++A+ A+ + AD M ++ L ANL+
Sbjct: 286 EEKLENARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLS 345
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A L R+ + +DL A + GA+ A++
Sbjct: 346 GANLDRSFMEGADLRNANLSGANLFGAML 374
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 44/95 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F AD++K N AN + M +D + +GA L A+ AN +GA+L
Sbjct: 325 SKADFQKADMKKVKLQGANLSGANLDRSFMEGADLRNANLSGANLFGAMLKDANLSGANL 384
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S + L ANL+ A L L +L AII
Sbjct: 385 SGASLFETDLEGANLSGANLKGANLVEPNLKNAII 419
>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 479
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 51/97 (52%), Gaps = 10/97 (10%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADLR + + AN + AD+RE+DF+G A AN +GADL +
Sbjct: 338 SADLRGVDLTRADLSGANLSDADLRETDFTG----------ATLLFANLSGADLRGVDLT 387
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L+ ANLT A L + L R +L GA + AD SDA
Sbjct: 388 KADLSGANLTEADLRKADLMRVNLEGADLTEADLSDA 424
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 52/97 (53%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR A ++ + AN + AD+ ES + + A L AV +AN GA+ + + +
Sbjct: 49 LRYADLIEADLSGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQAS 108
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L +ANL A L LTR++L GA + G+ S A++D
Sbjct: 109 LIKANLVGANLHEANLTRANLSGADLRGSQLSGAILD 145
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
R AN D+RE++ SG+ A L +A AN +GADL+++ LN ANLT A
Sbjct: 29 LRGANLRGTDLRETNLSGAMLRYADLIEADLSGANLSGADLAESF-----LNLANLTRAD 83
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVI 213
L VL ++L GA GA+ A +
Sbjct: 84 LTGAVLREANLVGAEFTGANLKQASL 109
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 20/110 (18%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL ++ AN T AD +RE++ G++F GA L++A KAN
Sbjct: 60 SGANLSGADLAESF-----LNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANL 114
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GA+ L+EANLT A L L S L GAI++ A +++ I
Sbjct: 115 VGAN----------LHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + N +N +S +++ DFS + AYL+ A + + GADLS
Sbjct: 274 ADLNGSDLSGANLSGSNLSSVNLKNVDFSRASLKKAYLKGANLEQTDLRGADLSGA---- 329
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+L++ NL++A L LTR+DL GA + AD +
Sbjct: 330 -ILHQVNLSSADLRGVDLTRADLSGANLSDADLRE 363
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR+ AN + AD+R ++D SG+ A L KA + N
Sbjct: 352 SGANLSDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGANLTEADLRKADLMRVNL 411
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
GADL+ EA+L++A L R L ++L G ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 68/159 (42%), Gaps = 36/159 (22%)
Query: 111 AQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAV-------- 157
A+F A+L++A +K N AN T A++ +D GS+ +GA L+KAV
Sbjct: 97 AEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFP 156
Query: 158 ------AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
A A N DL++ + L NL A+L L R++L
Sbjct: 157 EDIDPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLA 216
Query: 200 GAIIEGADFSDAVID--LAQKQALCK---YANGTNPITG 233
GA + AD +A + + +K K ++ G +P G
Sbjct: 217 GANLSAADLREASLSGTILEKAVYSKKTLFSEGIDPALG 255
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 43/99 (43%), Gaps = 5/99 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 165
A ADLR K + AN T AD+R++D G+ A L A ++ N G
Sbjct: 374 ANLSGADLRGVDLTKADLSGANLTEADLRKADLMRVNLEGADLTEADLSDAHLFRVNLRG 433
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
A+L T + L LT+A L T L DL + E
Sbjct: 434 ANLKGTNLKGASLKGVFLTDAYLSETDLADIDLSPSFFE 472
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+ + D +A K + AN D+R +D SG+ + L A + T ADL
Sbjct: 292 SSVNLKNVDFSRASLKKAYLKGANLEQTDLRGADLSGAILHQVNLSSADLRGVDLTRADL 351
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L E + T A L+ L+ +DL G + AD S A
Sbjct: 352 SGANLSDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGA 394
>gi|428770507|ref|YP_007162297.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684786|gb|AFZ54253.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 355
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N R A+ T D+ E++ +K NG L A AN T A+L++ + L ANLTNA
Sbjct: 253 NLRGADLTDVDLSEANLQNTKLNGVDLSGAYLEGANLTNANLTNASLALSNLIGANLTNA 312
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 222
L T L + LG I++GA F++ + ++ +KQ L
Sbjct: 313 NLTNTNLQNTSLGQTIVKGAIFANNLGLNEEKKQELI 349
>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 371
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 67/136 (49%), Gaps = 10/136 (7%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 194
E+ FSG+ +GAYL A KA+F A L+ + L EANL A L+ T
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADFHRASLAVANLIGANLTEANLREANLIDT--- 334
Query: 195 RSDLGGAIIEGADFSD 210
+L GA ++ A F +
Sbjct: 335 --NLSGATVKNAKFGE 348
>gi|254413321|ref|ZP_05027092.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179941|gb|EDX74934.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 636
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 51/97 (52%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA +A+LR+ + K N R A A + E++ + A L +A Y+A T ADLS
Sbjct: 210 AANLTTANLREVLLEKANLRDAILVGATLTEANLRQACLRRANLTQAELYRAILTDADLS 269
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+ DR+ L+ ANL A L+R L ++L +++
Sbjct: 270 EVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNV 306
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 10/97 (10%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------MV 177
+N T A + ++ ++ A L +A AN T A+L + L+++
Sbjct: 178 LNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGAT 237
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L EANL A L R LT+++L AI+ AD S+ D
Sbjct: 238 LTEANLRQACLRRANLTQAELYRAILTDADLSEVTGD 274
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 165
A ++L +A+ + N + + SA + +++ S + +GA L+ A +N TG
Sbjct: 126 AILKHSNLNQAILTRVNLSKVDGQSASLCQANLSWVEAPYCNLSGANLQAAQLNHSNLTG 185
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A L T + L ANL A L+ LT ++L ++E A+ DA++
Sbjct: 186 ATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAIL 233
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 66/173 (38%), Gaps = 38/173 (21%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYE-----AETRGEFGIG---SAAQFGSADLRKAV 122
+ST L AA + S + L N E A R +G + A A LR+A
Sbjct: 193 LISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGATLTEANLRQACLRRAN 252
Query: 123 HVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKAN--------------- 162
+ RA T AD+ E + S + GAYL +A AN
Sbjct: 253 LTQAELYRAILTDADLSEVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNVYCLQTN 312
Query: 163 ----------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
ADLS ++ +L EANLT+A L+ + L R L A + G
Sbjct: 313 LTAANLQGADLRQADLSGAYLNETILTEANLTDAYLIGSYLIRPKLEQAQLTG 365
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
N AN +A + S+ +G+ + L A Y+A+ A+L+ + ++L +A
Sbjct: 167 NLSGANLQAAQLNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKA 226
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NL +A+LV LT ++L A + A+ + A
Sbjct: 227 NLRDAILVGATLTEANLRQACLRRANLTQA 256
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 42/96 (43%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A L K + AN A + ++ + + LEKA A GA L++ + +
Sbjct: 186 ATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGATLTEANLRQ 245
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANLT A L R +LT +DL + + S A
Sbjct: 246 ACLRRANLTQAELYRAILTDADLSEVTGDRVNLSRA 281
>gi|443329141|ref|ZP_21057730.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791290|gb|ELS00788.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 174
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 53/90 (58%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEA 181
NF RAN + A +R S+ SG+ F A L+KA + NF+GA+L + + + L+EA
Sbjct: 36 NFIRANLSQAILRNSNLSGAFFVLADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEA 95
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LT L LT ++L GAI+ GA+ S+A
Sbjct: 96 VLTGTQLQGANLTEANLQGAILAGANLSEA 125
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 165
A ADL A+ + NF AN A++ +S S G++ GA L +A A G
Sbjct: 60 ADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEAVLTGTQLQGANLTEANLQGAILAG 119
Query: 166 ADLSDTLMDRMVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+LS+ + L AN L NA L +T ++L GA +EGA + +I
Sbjct: 120 ANLSEANLRGGDLRGANLYGVDLRNADLTDAKITHANLRGANLEGAIMPEQLI 172
>gi|163797791|ref|ZP_02191737.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
gi|159176913|gb|EDP61479.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
Length = 427
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 70/157 (44%), Gaps = 37/157 (23%)
Query: 92 ADLNK---YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR------RANFTSADMR 139
ADLN A+ RG F GS A ADLR + N R+N +DM
Sbjct: 78 ADLNHALLIRADLRGAFMRGSNLAGANLKEADLRGGALISGNLAAPATIIRSNIGQSDMD 137
Query: 140 ESDFSGSKFNG----------AYLEKAVA----------YKANFTGADLSDTLMD--RMV 177
E+D G+ +G A LEK + AN GADLS + R++
Sbjct: 138 EADMGGANLSGTDLSHSSMIGATLEKTLLCGANLSGVNLEGANLQGADLSGANLSSARII 197
Query: 178 ---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+ ANL+ A++ RT +S+L GAI+E D S A
Sbjct: 198 GANLSGANLSGALIHRTQFQKSELHGAILENVDLSTA 234
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 15/115 (13%)
Query: 116 ADLRKAVHVKENFRR----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
ADLR+A+ V RR +N + AD+R+S+ +G GA L A A+ T
Sbjct: 294 ADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDSELAGINLAGANLTNARIAGADLTS 353
Query: 166 ADL---SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+L R+ + +NL+ AVLV LT + L GA ++GAD + A + AQ
Sbjct: 354 VELKGPDGQATGRLWV--SNLSGAVLVNADLTGARLTGANLKGADLTGAKLARAQ 406
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 3/94 (3%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN + ++ ++ G+ +GA L A AN +GA+LS L+ R ++ L A+L
Sbjct: 169 ANLSGVNLEGANLQGADLSGANLSSARIIGANLSGANLSGALIHRTQFQKSELHGAILEN 228
Query: 191 TVLTRSDLGGAII---EGADFSDAVIDLAQKQAL 221
L+ +DL GA + +G S ++ D+ + A+
Sbjct: 229 VDLSTADLSGANLTSGDGRGLSRSLRDILHEHAV 262
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 15/115 (13%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSG---------------SKFNGAYLEKA 156
QF ++L A+ + A+ + A++ D G + G +A
Sbjct: 215 QFQKSELHGAILENVDLSTADLSGANLTSGDGRGLSRSLRDILHEHAVWIREQGRGGSRA 274
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
K TG D+SD + L EA L +AV+ RT L SDL G+ + GAD D+
Sbjct: 275 QLAKTELTGIDVSDVNLSGADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDS 329
>gi|334117701|ref|ZP_08491792.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333460810|gb|EGK89418.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 214
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 165
A G ADL +A+ V+ N RA A++ ++D SG + GA + +A +A+ G
Sbjct: 70 ADLGGADLTEALLVEANLNRAELMGANLSKADLSGASLIQATLIGANVSRATLSRADLHG 129
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L + R VL E +L A L + L+ +DL GA + AD ++A++
Sbjct: 130 VNLYGVNLRRAVLTECDLIGANLSKVDLSGADLMGASLIRADLTEAIL 177
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 2/117 (1%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
YEAE G + A G A+L KA + NF +AN ++ +D G+ A L +A
Sbjct: 28 YEAELIGA-NLYEADLIG-ANLSKAKLNRVNFGKANLCKINLMRADLGGADLTEALLVEA 85
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+A GA+LS + L +A L A + R L+R+DL G + G + AV+
Sbjct: 86 NLNRAELMGANLSKADLSGASLIQATLIGANVSRATLSRADLHGVNLYGVNLRRAVL 142
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 55/127 (43%), Gaps = 12/127 (9%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE----------SDFS 144
N YEA+ G S A+ + KA K N RA+ AD+ E ++
Sbjct: 36 NLYEADLIGANL--SKAKLNRVNFGKANLCKINLMRADLGGADLTEALLVEANLNRAELM 93
Query: 145 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
G+ + A L A +A GA++S + R L+ NL L R VLT DL GA +
Sbjct: 94 GANLSKADLSGASLIQATLIGANVSRATLSRADLHGVNLYGVNLRRAVLTECDLIGANLS 153
Query: 205 GADFSDA 211
D S A
Sbjct: 154 KVDLSGA 160
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 5/74 (6%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
E +FSG + YL +A AN ADL + + LN N A L + L R+DLG
Sbjct: 14 ERNFSGVYLHEVYLYEAELIGANLYEADLIGANLSKAKLNRVNFGKANLCKINLMRADLG 73
Query: 200 GAIIEGADFSDAVI 213
GAD ++A++
Sbjct: 74 -----GADLTEALL 82
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 55/115 (47%), Gaps = 13/115 (11%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN AD+ ++ S +K N KA K N ADL + +L EANL A L+
Sbjct: 35 ANLYEADLIGANLSKAKLNRVNFGKANLCKINLMRADLGGADLTEALLVEANLNRAELMG 94
Query: 191 TVLTRSDLGGA-IIE----GADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
L+++DL GA +I+ GA+ S A + A + Y GV+ R+++
Sbjct: 95 ANLSKADLSGASLIQATLIGANVSRATLSRADLHGVNLY--------GVNLRRAV 141
>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
Length = 251
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 94/226 (41%), Gaps = 35/226 (15%)
Query: 33 PLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV--------------FVSTALAA 78
P W CQ DG P + C+ L N + F S+ +A
Sbjct: 22 PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74
Query: 79 AVVASCSSNISAL--ADLNKYEAET----RGEFGIG--SAAQFGSADLRKAVHVKENFRR 130
A ++ + + ADL+K R FG + A FGSAD+ ++ +
Sbjct: 75 ANLSETEVSRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNFAQVKAAG 134
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKA----VAYK-ANFTGADLSDTLMDRMVLNEANLTN 185
ANF+ +++ SDFSG+ +GA + KA V ++ A G D S + + R L+ NL
Sbjct: 135 ANFSKSELNRSDFSGADLSGANISKAELARVLFQSAKIAGVDFSYSNLSRSRLDGLNLQG 194
Query: 186 AVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKYANGTNP 230
+ L + +GGA + GA + ID+A A K NP
Sbjct: 195 VNFTGSYLYLTQIGGADLSGATGLTQEQIDIACGSAQTKLPPSINP 240
>gi|219849225|ref|YP_002463658.1| pentapeptide repeat-containing protein [Chloroflexus aggregans DSM
9485]
gi|219543484|gb|ACL25222.1| pentapeptide repeat protein [Chloroflexus aggregans DSM 9485]
Length = 311
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 56/114 (49%), Gaps = 13/114 (11%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLRKA N A A++R ++ S + F+GA L A N +GADL D
Sbjct: 89 ADLSDADLRKADLSWANLEFATLIGANLRGANLSAADFSGANLYGANLSLCNLSGADLRD 148
Query: 171 TLMDRMVLNE-------------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T+M L+E ANL+ A+L+R L ++L GA + GA+ A
Sbjct: 149 TVMIGANLSEAQLREAQLVNLSGANLSGAILLRVSLNGANLNGANLAGANLMHA 202
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN A++RE+ GA L + +A+ AD SD + + L+ A+L NA
Sbjct: 193 NLAGANLMHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNA 252
Query: 187 VLVRTVLTRSDLGGAIIEGADF 208
+ R L+R++L GA + GA+
Sbjct: 253 IFTRANLSRANLSGANLRGANL 274
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 30/62 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A LR A+ + N RAN + A++R ++ G A L A A+ T ADL
Sbjct: 240 SGIYLSGAHLRNAIFTRANLSRANLSGANLRGANLRGVNLREASLADADLTDADLTDADL 299
Query: 169 SD 170
+D
Sbjct: 300 TD 301
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N A+ +AD +++ SG +GA+L A+ +AN + A+LS + L NL
Sbjct: 221 ETNLSEASLCNADFSDANLSGIYLSGAHLRNAIFTRANLSRANLSGANLRGANLRGVNLR 280
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDA 211
A L LT +DL A + D S A
Sbjct: 281 EASLADADLTDADLTDADLTDCDLSGA 307
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 47/107 (43%), Gaps = 5/107 (4%)
Query: 109 SAAQFGSADLRKAVH-----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ A A+LR+A + N N + A + +DFS + +G YL A A F
Sbjct: 195 AGANLMHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNAIF 254
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
T A+LS + L ANL L L +DL A + AD +D
Sbjct: 255 TRANLSRANLSGANLRGANLRGVNLREASLADADLTDADLTDADLTD 301
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN + A++ ++ SG+ + A L +A AN ADLS + L+ ANL A L
Sbjct: 24 ANLSGANLSAANLSGANLSEAKLSRARLTDANLYRADLSICELGEANLSWANLREAKLNW 83
Query: 191 TVLTRSDLGGAIIEGADFSDAVIDLA 216
L R+DL A + AD S A ++ A
Sbjct: 84 AQLVRADLSDADLRKADLSWANLEFA 109
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 50/114 (43%), Gaps = 4/114 (3%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ R IG A A LR+A V N AN + A + +G+ NGA L A
Sbjct: 144 ADLRDTVMIG--ANLSEAQLREAQLV--NLSGANLSGAILLRVSLNGANLNGANLAGANL 199
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
AN A L + L+E NL+ A L + ++L G + GA +A+
Sbjct: 200 MHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNAI 253
>gi|418939072|ref|ZP_13492497.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054219|gb|EHS50602.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 202
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 10/102 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADLR A NF AN SAD++ +D + + GA L A +AN TGA
Sbjct: 63 TGANLTGADLRWADCDGANFTGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGA-- 120
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+L EA L A L++ + +++L G + GAD +D
Sbjct: 121 --------ILKEARLDKASLIQAIKQKANLQGVDLSGADLTD 154
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 50/104 (48%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L V + + A + +DF+G+ GA L A ANFTGA+L +
Sbjct: 35 ANLSNGVFAGADLEQVRLAGASLEGADFTGANLTGADLRWADCDGANFTGANLKSADLQH 94
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
L ANLT A L LT ++L GAI++ A A + A KQ
Sbjct: 95 TDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQ 138
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 15/110 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 163
+ A SADL+ N AN T A++ E++ +G+ A L+K A+ KAN
Sbjct: 83 TGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQKANL 142
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
G DLS A+LT+ L R T +L GAI++GA + A++
Sbjct: 143 QGVDLSG----------ADLTDMNLSRVDFTGVNLKGAILKGAILTGAIL 182
>gi|410672126|ref|YP_006924497.1| pentapeptide repeat protein [Methanolobus psychrophilus R15]
gi|409171254|gb|AFV25129.1| pentapeptide repeat protein [Methanolobus psychrophilus R15]
Length = 418
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 67/131 (51%), Gaps = 4/131 (3%)
Query: 81 VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
++S + I A+L + + E G A A+L++A + N RRAN AD+
Sbjct: 197 LSSSMAEIKPQANLQRIDMEKTDLLG----ANLMEANLKEANLREANLRRANLEGADLMG 252
Query: 141 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
++ G+ A L A A+ GA+L D + ++ LN+ANL A L+ T L R++L
Sbjct: 253 ANLMGADMREANLMLANLEGASLMGANLMDANLKKINLNKANLVGANLIGTNLLRAELTE 312
Query: 201 AIIEGADFSDA 211
A++ A+ DA
Sbjct: 313 ALLMNAEIIDA 323
>gi|113474166|ref|YP_720227.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110165214|gb|ABG49754.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 1033
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 51/180 (28%), Positives = 82/180 (45%), Gaps = 9/180 (5%)
Query: 56 NQCAGPYAKLKNWRVFVSTA------LAAAVVASCSSNISALADLNKYEAETRGEFGIGS 109
+QC G A + F+S A L+ A + + + L+ A+ G + IG+
Sbjct: 816 SQCLGVGAFWETVGQFLSGADLRYADLSGAYLIVANLRYADLSGAYLISADLSGAYLIGA 875
Query: 110 ---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
A ADLR A N A + A++ ++ SG+ +GA L A A+ + A
Sbjct: 876 NLIGADLSRADLRYADLSGANLSDAKLSGANLSDAKLSGAGLSGADLRYADLSGADLSRA 935
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
LSD + L+ A L+ A L L+ +DL A + GAD SDA + + + K++N
Sbjct: 936 KLSDAGLSGANLSVAGLSGADLRYADLSGADLRYADLSGADLSDANLSNVRWNSQTKWSN 995
>gi|448661888|ref|ZP_21683780.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
gi|445758247|gb|EMA09568.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
Length = 480
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 16/128 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A LR+A N + AN T A +R++D + + GA L A +A+ T A L +
Sbjct: 168 ANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLRE 227
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRS---------------DLGGAIIEGADFSDA-VID 214
+ LN ANLT +L + LT + DL GA + GADFS+A +I+
Sbjct: 228 VNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRYADLTGATLPGADFSEANLIN 287
Query: 215 LAQKQALC 222
++ L
Sbjct: 288 TTLREVLL 295
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 6/124 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----G 165
A ADL A + + AN T +R++D + + GA L A +A+ T G
Sbjct: 148 ADLTDADLWAAALPDADLKGANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKG 207
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKY 224
ADL + R L +A L L L R++L G I+ AD +D + +A A +Y
Sbjct: 208 ADLPGASLLRADLTDAFLREVNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRY 267
Query: 225 ANGT 228
A+ T
Sbjct: 268 ADLT 271
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 65/141 (46%), Gaps = 7/141 (4%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR----- 130
L AV+A + + L + N EAE I A A L A N R
Sbjct: 55 LKGAVLADVNFAGADLVNANIKEAELTD--AILRQADLTDAALWDANLTGSNLLRTDLPG 112
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
ANF AD+ +++ GS F A L +A A ADL+D + L +A+L A L
Sbjct: 113 ANFLRADLHDANLKGSDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTD 172
Query: 191 TVLTRSDLGGAIIEGADFSDA 211
T L ++DL A ++GA+ +DA
Sbjct: 173 TSLRQADLTDANLKGANLTDA 193
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 165
+ F A LR+A R+A+ T AD+ ++D G+ L +A AN G
Sbjct: 128 SDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTDTSLRQADLTDANLKG 187
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
A+L+D + + L +ANL A L L R+DL A + + +DA ++ A
Sbjct: 188 ANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLREVNLTDAALNRAN 239
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 25/119 (21%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDT--- 171
K ++ + AN + A ++E+D +G + GA L+ AV NF GADL +
Sbjct: 17 KDIYPGADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADLVNANIK 76
Query: 172 ---LMDRMV---------LNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 213
L D ++ L +ANLT + L+RT L R+DL A ++G+DF+DA +
Sbjct: 77 EAELTDAILRQADLTDAALWDANLTGSNLLRTDLPGANFLRADLHDANLKGSDFTDAAL 135
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 165
A F ADL A N + ++FT A +R++D + + A L A + A+ G
Sbjct: 113 ANFLRADLHDA-----NLKGSDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKG 167
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+L+DT + + L +ANL A L L ++DL A ++GAD
Sbjct: 168 ANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKGADL 210
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD- 174
A+L+ AV NF A+ +A+++E++ + + A L A + AN TG++L T +
Sbjct: 53 ANLKGAVLADVNFAGADLVNANIKEAELTDAILRQADLTDAALWDANLTGSNLLRTDLPG 112
Query: 175 ----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
R L++ANL + L ++DL A + AD +DA
Sbjct: 113 ANFLRADLHDANLKGSDFTDAALRQADLTDATLRQADLTDA 153
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 6/82 (7%)
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADF 208
+ +V+ K + GADL+D + L EA+LT A L RT LT ++L GA++ GAD
Sbjct: 11 DDSVSDKDIYPGADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADL 70
Query: 209 SDAVIDLAQ-KQALCKYANGTN 229
+A I A+ A+ + A+ T+
Sbjct: 71 VNANIKEAELTDAILRQADLTD 92
>gi|220907270|ref|YP_002482581.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863881|gb|ACL44220.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 369
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 46/92 (50%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R AN D+ + S + + A L A +K NF GA+L + R L +ANLTNA
Sbjct: 256 DLRGANLAEKDLAGRNLSNANLSSANLSDAFLHKTNFHGANLFRANLFRANLLQANLTNA 315
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
L T L +DL GA + GAD A I K
Sbjct: 316 NLRETNLIGADLSGADLRGADLRGAKIGFDNK 347
>gi|428771470|ref|YP_007163260.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428685749|gb|AFZ55216.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 195
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 60/118 (50%), Gaps = 21/118 (17%)
Query: 111 AQFGSADLRKAVHVK---ENFRRA------------NFTSADMRESDFSGSKFNGAYLEK 155
A ADLR A+ + EN A + +AD+R +D G GA L+K
Sbjct: 77 ADLRGADLRGAILLSSQVENISLAGSFLAGAILTNLDLCNADLRGADLRGVNLVGACLQK 136
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A AN +GADLS + L EANL+ A+L T LT+++L AI+EG F D VI
Sbjct: 137 ADLSNANLSGADLS-----QADLEEANLSGAILHGTNLTQANLLCAIVEGVSF-DYVI 188
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 58/119 (48%), Gaps = 8/119 (6%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L KA+ ++ N R A+ T A ++ +D G+ GA L + + G+ L+ ++
Sbjct: 53 ANLEKAI-LRCNLRGADLTGASLQGADLRGADLRGAILLSSQVENISLAGSFLAGAILTN 111
Query: 176 MVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
+ L A+L A LV L ++DL A + GAD S A DL + +GTN
Sbjct: 112 LDLCNADLRGADLRGVNLVGACLQKADLSNANLSGADLSQA--DLEEANLSGAILHGTN 168
>gi|427735932|ref|YP_007055476.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370973|gb|AFY54929.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 713
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 68/153 (44%), Gaps = 32/153 (20%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY--------- 159
S+A ADLR AV A+ T AD+ E+ + + GA L + VA
Sbjct: 534 SSASLAKADLRNAV-----LENASLTGADLGEARLNDADLYGARLGRVVAIGTQLSNANL 588
Query: 160 -KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL---------------TRSDLGGAII 203
K + GADLS +DR L+ ANL+ A L +L + +DL GA +
Sbjct: 589 IKTEWQGADLSSAYLDRANLSNANLSAARLTGAILRSTNLQNVNLRNADLSLADLRGANL 648
Query: 204 EGADFSDAVIDLAQKQALCKYANGTNPITGVST 236
GADF ++ Q+ K+ + P TG+ +
Sbjct: 649 AGADFQGTILSARQQNPADKFVD--TPTTGIQS 679
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 54/111 (48%), Gaps = 5/111 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 165
A A+L + + V+ N R+N A++ + G+ + A L KA V A+ TG
Sbjct: 496 ANLSGANLSRVLMVRTNLSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLENASLTG 555
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
ADL + ++ L A L V + T L+ ++L +GAD S A +D A
Sbjct: 556 ADLGEARLNDADLYGARLGRVVAIGTQLSNANLIKTEWQGADLSSAYLDRA 606
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 14/97 (14%)
Query: 132 NFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRMV 177
+F A++ ++ F+GS+F G A L +A +AN +GA+LS LM R
Sbjct: 453 DFKYANLDKASFTGSRFRGPGKDGRWDTYDDWIANLSQAQLKQANLSGANLSRVLMVRTN 512
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L+ +NL A L L ++L A + AD +AV++
Sbjct: 513 LSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLE 549
>gi|86608529|ref|YP_477291.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557071|gb|ABD02028.1| pentapeptide repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 248
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTG 165
A F ++DLR + + +NFT+A + +S F G +F+ + +A AN F
Sbjct: 89 ANFTASDLRGSSFSQALGDYSNFTAAKLDKSSFQGGRFSHSIFREASLVAANLAEGNFFA 148
Query: 166 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
AD + R L++A NL A+LV L + + + GADF+DA +
Sbjct: 149 ADFRQANLSRCNLSQAALVSCQLQFANLEQAILVGANLRDAQIEDTLFSGADFTDAKLSD 208
Query: 216 AQKQALCKYANGTNPITGVSTRKSLGCG 243
++ L + A+GTN +T T +L G
Sbjct: 209 ETRKLLIERASGTNELTQRDTLNTLLAG 236
>gi|428296910|ref|YP_007135216.1| RDD domain-containing protein [Calothrix sp. PCC 6303]
gi|428233454|gb|AFY99243.1| RDD domain containing protein [Calothrix sp. PCC 6303]
Length = 718
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 10/116 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
S+AQ ADLR AV + A+ A + E++ G++ N GA L A K ++
Sbjct: 540 SSAQMVGADLRNAVLENASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLTKTDW 599
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
ADLS + +DR+ NLTNA L LT + L A +EGA+ +A + LA Q
Sbjct: 600 QAADLSGSYLDRV-----NLTNANLSTARLTGAILRSANLEGANLRNADLTLADFQ 650
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 11/110 (10%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
GE + A +G A L +A+ + AN T D + +D SGS YL++ N
Sbjct: 565 GEAKLNEAELYG-ARLNRAIAIGAQLSYANLTKTDWQAADLSGS-----YLDRV-----N 613
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
T A+LS + +L ANL A L LT +D GA + DF A+
Sbjct: 614 LTNANLSTARLTGAILRSANLEGANLRNADLTLADFQGANVANVDFQGAI 663
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 14/98 (14%)
Query: 131 ANFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM 176
NF A++ ++ F S+F G A L +A +ANF+ A+LS L+++
Sbjct: 458 VNFKGANLDQASFKNSRFRGPGDDGLWDTFDDAIADLSQAQLKQANFSEANLSRVLLNKS 517
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L+ + L A L + L ++L A + GAD +AV++
Sbjct: 518 DLSRSTLNKANLAGSRLIGANLSSAQMVGADLRNAVLE 555
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 51/119 (42%), Gaps = 20/119 (16%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFN--------------------GAYLEK 155
ADL +A + NF AN + + +SD S S N GA L
Sbjct: 492 ADLSQAQLKQANFSEANLSRVLLNKSDLSRSTLNKANLAGSRLIGANLSSAQMVGADLRN 551
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
AV A+ TGADL + ++ L A L A+ + L+ ++L + AD S + +D
Sbjct: 552 AVLENASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLTKTDWQAADLSGSYLD 610
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 20/119 (16%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG------ 165
Q DLR+ V + + N S + D G F GA L++A + F G
Sbjct: 425 QMKKVDLRR-VRLGQTIDGQNTFSLSLDRVDLWGVNFKGANLDQASFKNSRFRGPGDDGL 483
Query: 166 --------ADLSDTLMDRMVLNEANLTNAV-----LVRTVLTRSDLGGAIIEGADFSDA 211
ADLS + + +EANL+ + L R+ L +++L G+ + GA+ S A
Sbjct: 484 WDTFDDAIADLSQAQLKQANFSEANLSRVLLNKSDLSRSTLNKANLAGSRLIGANLSSA 542
>gi|86606624|ref|YP_475387.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555166|gb|ABD00124.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 371
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 69/152 (45%), Gaps = 14/152 (9%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGI----GSAAQFGSADLRKA----VHVKE- 126
L A V+ S S L++ + E R + + G F DL KA + +++
Sbjct: 204 LRGAKVSGTSLRGSRLSEETRLEERLRHIWQLQNWGGQGQDFSGQDLSKADLRGLGLRQI 263
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 181
R AN D+R S+ G+ GA L++A AN GADL + + L A
Sbjct: 264 RLRGANLKRVDLRGSNLEGADLRGANLQRADLRGANLQNADLEGADLGGAELRQAQLQGA 323
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
NL A L R LT+++L GA IEG S + I
Sbjct: 324 NLRRADLSRANLTQANLEGAQIEGLKHSGSQI 355
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 4/100 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F A+LRKA NF A+ AD+R+++ G+K +GA L+ A A+ GA +S
Sbjct: 151 GANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGAVLQGADLRGADLRGAKVS 210
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
T + L+E L R + + GG +G DFS
Sbjct: 211 GTSLRGSRLSEETRLEERL-RHIWQLQNWGG---QGQDFS 246
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ NF A+ D++E+ G+ F A L KA NF GA L L +ANL
Sbjct: 131 ERNFAYADLEGVDLQEARLGGANFYEANLRKANLGLCNFNGAHLHQA-----DLRQANLQ 185
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFS 209
A L VL +DL GA + GA S
Sbjct: 186 GAKLSGAVLQGADLRGADLRGAKVS 210
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 113 FGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
F ADL + V ++E ANF A++R+++ FNGA+L +A +AN GA LS
Sbjct: 134 FAYADL-EGVDLQEARLGGANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGA 192
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDL 198
++ L A+L A + T L S L
Sbjct: 193 VLQGADLRGADLRGAKVSGTSLRGSRL 219
>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 229
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 19/131 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDF---------------SGSKFNGAYLEK 155
A F A+L AV + AN AD+R++D SG+K + A L +
Sbjct: 100 ANFTGANLESAVLQHTDLTNANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASLIQ 159
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
A+A KAN G DLS + M L+ + T L + T ++L GAI GA A +
Sbjct: 160 AIAQKANLQGVDLSGADLTDMNLSRVDFTAVNLKGAIFTGTNLTGAIFSGAKLDKASL-- 217
Query: 216 AQKQALCKYAN 226
QA+ + AN
Sbjct: 218 --IQAIAQKAN 226
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 20/120 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYK----- 160
A F A+L+ A + ANFT AD++ +D G+ F GA LE AV
Sbjct: 60 ANFTEANLKGANLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQHTDLTN 119
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEGADFSD 210
AN ADL D + +L+ ANLT A+ L++ + +++L G + GAD +D
Sbjct: 120 ANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASLIQAIAQKANLQGVDLSGADLTD 179
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 63/129 (48%), Gaps = 19/129 (14%)
Query: 113 FGSADLRK----AVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KAN 162
F ADL + +KE NF AN A++R +D G+ F A L+ A AN
Sbjct: 42 FAGADLEQVRLAGASLKEANFTEANLKGANLRGADCDGANFTRADLKSADLRWADCDGAN 101
Query: 163 FTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
FTGA+L ++ L ANL +A L T+L R++L GAI+ GA A +
Sbjct: 102 FTGANLESAVLQHTDLTNANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASL---- 157
Query: 218 KQALCKYAN 226
QA+ + AN
Sbjct: 158 IQAIAQKAN 166
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 45/96 (46%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+LR + + A ++E++F+ + GA L A ANFT ADL +
Sbjct: 35 ANLRNGDFAGADLEQVRLAGASLKEANFTEANLKGANLRGADCDGANFTRADLKSADLRW 94
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ AN T A L VL +DL A ++ AD DA
Sbjct: 95 ADCDGANFTGANLESAVLQHTDLTNANLDRADLRDA 130
>gi|440681919|ref|YP_007156714.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428679038|gb|AFZ57804.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 269
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 7/133 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ A+L +A NF A+ T AD+ + + G+ + A L AV AN G D
Sbjct: 77 SQAKLIEANLSQANLSIANFSGADLTQADLSQVNLIGANLSDANLRNAVITDANLIGTDF 136
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
S+ +LN+A+L A L+R+ L+ ++L GA + AD S+A +L + + Y
Sbjct: 137 SNA-----ILNDADLAAAKLIRSNLSFANLIGANLIAADLSEA--NLYDAEVMTAYLYKA 189
Query: 229 NPITGVSTRKSLG 241
N TR LG
Sbjct: 190 NLSKANLTRVHLG 202
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 23/126 (18%)
Query: 99 AETRGEFGIGSAAQ---FGSADLRKAVHVKENFRRANFTSADMRES----------DFSG 145
A +GE G+ Q F DL A+ V+ N AN T+A++ ++ + S
Sbjct: 34 ANLQGENLRGANLQGVNFTKVDLSHALLVRTNLMFANLTNANLSQAKLIEANLSQANLSI 93
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
+ F+GA L +A + N GA+LSD ANL NAV+ L +D AI+
Sbjct: 94 ANFSGADLTQADLSQVNLIGANLSD----------ANLRNAVITDANLIGTDFSNAILND 143
Query: 206 ADFSDA 211
AD + A
Sbjct: 144 ADLAAA 149
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL A ++ N AN +AD+ E++ ++ AYL KA KAN
Sbjct: 137 SNAILNDADLAAAKLIRSNLSFANLIGANLIAADLSEANLYDAEVMTAYLYKANLSKANL 196
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T L + + + L+EANLTNA L + L ++L GA ++ A+ A
Sbjct: 197 TRVHLGSSYLFKANLSEANLTNADLSWSNLRYANLAGANLQRANLRGA 244
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
E D S + G L A NFT DLS L+ R L ANLTNA L + L ++L
Sbjct: 28 EIDLSTANLQGENLRGANLQGVNFTKVDLSHALLVRTNLMFANLTNANLSQAKLIEANLS 87
Query: 200 GAIIEGADFSDAVIDLAQ 217
A + A+FS A DL Q
Sbjct: 88 QANLSIANFSGA--DLTQ 103
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 2/102 (1%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR AV N +F++A + ++D + +K + L A AN ADLS+
Sbjct: 114 ANLSDANLRNAVITDANLIGTDFSNAILNDADLAAAKLIRSNLSFANLIGANLIAADLSE 173
Query: 171 -TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L D V+ A L A L + LTR LG + + A+ S+A
Sbjct: 174 ANLYDAEVM-TAYLYKANLSKANLTRVHLGSSYLFKANLSEA 214
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 15/105 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A++ A K N +AN T + S YL KA +AN T ADL
Sbjct: 172 SEANLYDAEVMTAYLYKANLSKANLTRVHLGSS----------YLFKANLSEANLTNADL 221
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + L ANL A L R L ++L GA ++GA+ D ++
Sbjct: 222 SWS-----NLRYANLAGANLQRANLRGANLQGANLKGANLQDTIM 261
>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
Length = 436
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 9/127 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR A + + A+ + AD+ E+D SG+ A L +A KAN A+L
Sbjct: 289 SGADLSGANLRGANLSEADLSEADLSEADLSEADLSGANLIDANLRRANLIKANLRRANL 348
Query: 169 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ ++ L+ ANL A+L+ +L +DL GA + A+ S+A I+ A+
Sbjct: 349 IEAILSEADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFI 404
Query: 224 YANGTNP 230
A G P
Sbjct: 405 DATGITP 411
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 10/150 (6%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
T+ EF A A+L KA+ R +++ D SG+ GA L A+
Sbjct: 203 TKAEFT-TDAKVIEKAELIKAI------REGTIDKTTLQQVDLSGAILRGAILIGAILRG 255
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 217
AN +GA+LSD ++ +L+ A L+ A L L+ +DL GA + GA+ S+A + DL++
Sbjct: 256 ANLSGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSE 315
Query: 218 KQALCKYANGTNPITGVSTRKSLGCGNSRR 247
+G N I R +L N RR
Sbjct: 316 ADLSEADLSGANLIDANLRRANLIKANLRR 345
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR+A +K N RRAN A + E+D SG+ A L KA+ +A ADL
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEAILIEADL 383
Query: 169 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 210
+ L+EA++ NA+ + T +T I GA F D
Sbjct: 384 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 426
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A LR A+ + A + AD+ +D SG+ GA L +A +A+ + ADL
Sbjct: 259 SGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSEADL 318
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ + L +ANL A L++ L R++L AI+ AD S A
Sbjct: 319 SEADLSGANLIDANLRRANLIKANLRRANLIEAILSEADLSGA 361
>gi|158338487|ref|YP_001519664.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158308728|gb|ABW30345.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 464
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 68/129 (52%), Gaps = 20/129 (15%)
Query: 111 AQFGSADLR----KAVHVKE-NFR----------RANFTSADMRESDFSGSKFNGAYLEK 155
A+ G ADLR K ++KE N R RA+ AD+RE++ S ++ + LEK
Sbjct: 36 AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95
Query: 156 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A+ ++AN + A L+ + ++ L +ANL+ A L L R++LG A + A+ +
Sbjct: 96 SQLGAAILFRANLSQAQLTLSNLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155
Query: 211 AVIDLAQKQ 219
A + A+ Q
Sbjct: 156 ANLSQARLQ 164
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ A + + + +F A + +++ + S +GA L +A ++A+ TGA L +
Sbjct: 369 DLKTADLAQADLNQVDFFRAQLPQANLAQSILDGANLTEANLFRADLTGASLKAATLKNA 428
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAII 203
L EANL NA + T L + L GAI+
Sbjct: 429 NLAEANLENANIEGTNLDDAYLCGAIM 455
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
+A AD+R ++ GA L++A A GADL + + L EANL++A L
Sbjct: 35 KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ L +S LG AI+ A+ S A + L+
Sbjct: 90 LSNLEKSQLGAAILFRANLSQAQLTLS 116
>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 576
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 45/78 (57%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + A + +D S +K NGA L A A F GADLS + +VLN+A+L+ +L
Sbjct: 421 NLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEA 480
Query: 192 VLTRSDLGGAIIEGADFS 209
LT +DL A++ G DFS
Sbjct: 481 DLTGADLSDAVLLGTDFS 498
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 181
NF+ A +A++ +FSG+ +GAYL A ANF TGAD D + + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQGANLTGADFGDANLSSVNLSGA 305
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NL++A L LT ++L GA +E AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLERADLSRA 335
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 67/133 (50%), Gaps = 24/133 (18%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G A+L A NF+ AN T AD +++ S +GA L A AN TGA+L
Sbjct: 268 SGAYLGDANLTGA-----NFQGANLTGADFGDANLSSVNLSGANLSSADLSSANLTGANL 322
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG---------------AIIEGADFSDA-- 211
S ++R L+ A+L++ +L L+ ++L G AI+ GA+ SDA
Sbjct: 323 SGANLERADLSRADLSSCILNDGELSHANLSGVNFRDAELCRANLSNAILFGANLSDANL 382
Query: 212 -VIDLAQKQALCK 223
+DL++ LC+
Sbjct: 383 NHVDLSRAD-LCR 394
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 57/111 (51%), Gaps = 6/111 (5%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRAN---FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
I AA A L A K N+ R N F AD+ D +G N A L + +A+
Sbjct: 426 ILEAADLSYAKLNGA---KLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEADL 482
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
TGADLSD ++ + ANL +A L + L+ + L GA + A+FS A++D
Sbjct: 483 TGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLSSANFSYAILD 533
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 49/106 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SADL A N AN AD+ +D S N L A NF A+L
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLERADLSRADLSSCILNDGELSHANLSGVNFRDAEL 362
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ +L ANL++A L L+R+DL A + GAD + A ++
Sbjct: 363 CRANLSNAILFGANLSDANLNHVDLSRADLCRADLSGADLTHATLN 408
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 56/117 (47%), Gaps = 13/117 (11%)
Query: 103 GEF---GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
GEF G A G+A+L NF AN + A + +++ +G+ F GA L A
Sbjct: 239 GEFLRDGNFQGAYLGNANLTGV-----NFSGANLSGAYLGDANLTGANFQGANLTGADFG 293
Query: 160 KANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN + GA+LS + L ANL+ A L R L+R+DL I+ + S A
Sbjct: 294 DANLSSVNLSGANLSSADLSSANLTGANLSGANLERADLSRADLSSCILNDGELSHA 350
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 42/80 (52%), Gaps = 10/80 (12%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
GI S A ADL AV + +F AN SA++ S+ SG+ NGA L ANF+
Sbjct: 475 GILSEADLTGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLS-----SANFSY 529
Query: 166 ADLSDTLMDRMVLNEANLTN 185
A L DT L+EANL +
Sbjct: 530 AILDDT-----DLSEANLED 544
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 6/112 (5%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-- 164
+ +A FG A+L A + RA+ AD+ +D + + NG L + + N +
Sbjct: 367 LSNAILFG-ANLSDANLNHVDLSRADLCRADLSGADLTHATLNGTNLSDTILFSTNLSDA 425
Query: 165 ---GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
ADLS ++ LN A L A+ + L+ DL G ++ AD S ++
Sbjct: 426 ILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGIL 477
>gi|428219102|ref|YP_007103567.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990884|gb|AFY71139.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 698
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 54/101 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A A+L A NF +AN A++R + SG +GA L A AN +GA+L
Sbjct: 67 TGANLTGANLTGANLTGANFSKANLRGANLRGVNLSGVNLSGANLSGANLSGANLSGANL 126
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
S + R+ L+ AN +NA L L+ DL GA + GA+FS
Sbjct: 127 SGVNLSRVNLSGANFSNANLNNFDLSGFDLTGANLTGANFS 167
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 47/85 (55%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N ANF++A++ D SG +G L A AN +GA+LS+ + + L + NL+ A
Sbjct: 210 NLSGANFSNANLNNFDLSGFDLSGVNLSGANLSGANLSGANLSEANLSEVDLYQINLSGA 269
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L R LT ++L GA GA+ S A
Sbjct: 270 NLSRIDLTGANLSGANFSGANLSGA 294
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L + N ANF++A++ D SG GA L A NF+G +L
Sbjct: 117 SGANLSGANLSGVNLSRVNLSGANFSNANLNNFDLSGFDLTGANLTGA-----NFSGVNL 171
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + R L+ AN +NA L L+ DL G + GA+ S A
Sbjct: 172 SGVNLSRANLSGANFSNANLNNFDLSGFDLSGVNLSGANLSGA 214
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 13/122 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A DL + N R + T A++ ++FSG+ +GA A + +G DL
Sbjct: 252 SEANLSEVDLYQINLSGANLSRIDLTGANLSGANFSGANLSGANFSNANLNNFDLSGFDL 311
Query: 169 SDTLMDRMVLNEANLTNAVL---------VRTV-LTRSDLGGAIIEGADFSDA---VIDL 215
S + L+ ANL+ A L +R + L+ +DLGG + GA+ S+A +DL
Sbjct: 312 SGVNLSGANLSGANLSGANLNNFDLSGFDLRGINLSGADLGGTNLSGANLSEANLSEVDL 371
Query: 216 AQ 217
Q
Sbjct: 372 YQ 373
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%)
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
+RA AD+R +D +G+ GA L A AN TGA+ S + L NL+ L
Sbjct: 47 KRAYLRGADLRGADLTGANLTGANLTGANLTGANLTGANFSKANLRGANLRGVNLSGVNL 106
Query: 189 VRTVLTRSDLGGAIIEGADFS 209
L+ ++L GA + GA+ S
Sbjct: 107 SGANLSGANLSGANLSGANLS 127
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 47/91 (51%), Gaps = 2/91 (2%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN + A++ D SG G L A N +GA+LS+ + + L + NL+ A
Sbjct: 320 NLSGANLSGANLNNFDLSGFDLRGINLSGADLGGTNLSGANLSEANLSEVDLYQINLSGA 379
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L R LT ++L GA + A+ ++ +DL Q
Sbjct: 380 NLSRIDLTGANLTGANLSEANLNE--VDLYQ 408
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 2/95 (2%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN + A++ E D +GA L + AN TGA+LS+ ++ + L + NL+ A
Sbjct: 355 NLSGANLSEANLSEVDLYQINLSGANLSRIDLTGANLTGANLSEANLNEVDLYQINLSGA 414
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
L + DLGG ++ + + A +L + +AL
Sbjct: 415 NLSKVNFQGFDLGGFDLKNVNLTGA--NLREVKAL 447
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 55/128 (42%), Gaps = 25/128 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF----- 163
+ A F +L + N ANF++A++ D SG +G L A ANF
Sbjct: 162 TGANFSGVNLSGVNLSRANLSGANFSNANLNNFDLSGFDLSGVNLSGANLSGANFSNANL 221
Query: 164 ---------------TGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAII 203
+GA+LS + L+EANL+ L + L+R DL GA +
Sbjct: 222 NNFDLSGFDLSGVNLSGANLSGANLSGANLSEANLSEVDLYQINLSGANLSRIDLTGANL 281
Query: 204 EGADFSDA 211
GA+FS A
Sbjct: 282 SGANFSGA 289
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
R N + AD+ ++ SG+ + A L + Y+ N +GA+LS R+ L ANLT A
Sbjct: 341 LRGINLSGADLGGTNLSGANLSEANLSEVDLYQINLSGANLS-----RIDLTGANLTGAN 395
Query: 188 LVRTVLTRSDLGGAIIEGADFS 209
L L DL + GA+ S
Sbjct: 396 LSEANLNEVDLYQINLSGANLS 417
>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 213
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 17/134 (12%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G Q A+L + + RAN A++ ++F+GSKF GA+LE AN GA+
Sbjct: 9 GELKQLAGANLEDENLSQTDLSRANLAGANLVGTNFAGSKFEGAHLE-----GANLMGAN 63
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
L +T + ANL A L++ LT +D+ G+ + GA+ AVI ++ + +G
Sbjct: 64 LKETDL------RANLMGANLMQADLTGADVRGSNLRGANLMGAVI--SEVSFAGAFLSG 115
Query: 228 TNPIT----GVSTR 237
TN I GV R
Sbjct: 116 TNLINVDLQGVDLR 129
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
K ++ N AN AD+ +D GS GA L AV + +F GA LS T + + L
Sbjct: 65 KETDLRANLMGANLMQADLTGADVRGSNLRGANLMGAVISEVSFAGAFLSGTNLINVDLQ 124
Query: 180 ----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DLAQKQAL 221
ANLT A L L+R+DL GA++ A+ +A + +LA L
Sbjct: 125 GVDLRGADLRGANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGANLAGANLL 184
Query: 222 CKYANGTNPITGV 234
C G N + GV
Sbjct: 185 CAELEGAN-VNGV 196
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N + D+R +D G+ GA L+ A +A+ GA LS+ ++ L +ANL+ A
Sbjct: 117 NLINVDLQGVDLRGADLRGANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGA 176
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
L L ++L GA + G DF A +
Sbjct: 177 NLAGANLLCAELEGANVNGVDFDRACL 203
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 43/86 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L AV + +F A + ++ D G GA L A AN GADLS +
Sbjct: 96 ANLMGAVISEVSFAGAFLSGTNLINVDLQGVDLRGADLRGANLTGANLKGADLSRADLQG 155
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGA 201
+L+EANL A L + L+ ++L GA
Sbjct: 156 ALLSEANLEEADLRKANLSGANLAGA 181
>gi|119491336|ref|ZP_01623390.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
gi|119453500|gb|EAW34662.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
Length = 122
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 7/76 (9%)
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
H + NF AN T AD+R+SD S ++ GA LE A N TGA+LS T + + L +A+
Sbjct: 47 HAQLNF--ANLTHADLRDSDLSHAQLIGATLEGA-----NLTGANLSHTNLSQANLKQAD 99
Query: 183 LTNAVLVRTVLTRSDL 198
LT A L T+ + S L
Sbjct: 100 LTEATLQDTIYSHSTL 115
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 49/102 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L A V N + T A+++++ G+ F+ A L A A+ +DLS
Sbjct: 8 AKLTDANLESAKLVVANLSQTVITRANLQQAKCVGANFSHAQLNFANLTHADLRDSDLSH 67
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+ L ANLT A L T L++++L A + A D +
Sbjct: 68 AQLIGATLEGANLTGANLSHTNLSQANLKQADLTEATLQDTI 109
>gi|111023196|ref|YP_706168.1| hypothetical protein RHA1_ro06233 [Rhodococcus jostii RHA1]
gi|110822726|gb|ABG98010.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length = 201
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYL 153
+E R E I + F ADL ++ HV FR +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 154 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 203
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 204 EGADFSDAVID 214
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|376002766|ref|ZP_09780588.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
gi|375328822|emb|CCE16341.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
Length = 529
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 179
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 39 RVNLSQANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 98
Query: 180 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ANL +A L+R L R++L AI+ GA+ ++A DL ++A ++A+
Sbjct: 99 RADLSQANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 146
Score = 43.9 bits (102), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 155 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 214
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 215 ANLSGANLSGANLEATQLSGASLRGANLSGA 245
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +A+
Sbjct: 68 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 127
Query: 159 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 208
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 128 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 187
Query: 209 SDAVIDLAQ 217
+A + A+
Sbjct: 188 RNAELRQAE 196
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 48/98 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L +A+ N A+ A +R +D + +GA L +A +N ++L+
Sbjct: 115 AELMRAELSEAIVNGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTR 174
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ R L NL NA L + L +DL GA + GA+
Sbjct: 175 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANL 212
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 175 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 234
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+ A L+ +DL A + D++DA
Sbjct: 235 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 270
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 26 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 80
Query: 202 IIEGADFSDAVIDLA 216
I++GA+ ++AV+++A
Sbjct: 81 ILQGANLNEAVLNVA 95
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADL + A+ RG +A+LR+A + R AN + A++R ++ SG+ +GA
Sbjct: 175 ADLTR--ADLRG-------VNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGA 225
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
LE A+ GA+LS + A+LT A L+ T ++L G+ + G
Sbjct: 226 NLEATQLSGASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDANLRGSALTG 279
>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 256
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 41/76 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR A N RAN T AD+R ++ +G+ G L +A +AN TGADL +
Sbjct: 51 ADLSGADLRGANLEGANLSRANLTGADLRSANLAGASLFGVNLSRAKLNEANLTGADLRN 110
Query: 171 TLMDRMVLNEANLTNA 186
T + + L ANL A
Sbjct: 111 TYLMNIELTNANLNGA 126
>gi|378582929|ref|ZP_09831540.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
DC283]
gi|377814439|gb|EHT97579.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
DC283]
Length = 375
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 163
S A +ADL++A N A+ T+A++ ++D +GA L A +AY +A+
Sbjct: 250 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 309
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ A+LS+ + R L++ANL++A L L R+DL AI++GA+
Sbjct: 310 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 354
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANF 163
S A A+L A N AN T A + E+D S + +GA L A + N
Sbjct: 170 SDADLSDANLSDANLSGANLAHANLTMAYLSEADLSNANLSGADLTNANLNQTDLPNVNL 229
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+GA+L+ + L+EA+L+NA L L R+DL A + GAD ++A
Sbjct: 230 SGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNA 277
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM---------- 176
N AN T A + E+D S + + A L++A AN +GADL++ +++
Sbjct: 233 NLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGA 292
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANLT A L L+ ++L A ++ AD SDA
Sbjct: 293 NLAHANLTMAYLSEADLSNANLSNADLKRADLSDA 327
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 51/108 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL A + AN D+ + SG+ A L A +A+ + A+L
Sbjct: 195 TMAYLSEADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANL 254
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S+ + R L+ ANL+ A L L ++DL + GA+ + A + +A
Sbjct: 255 SNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMA 302
>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
WPP163]
Length = 846
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 77/160 (48%), Gaps = 13/160 (8%)
Query: 79 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 138
A++ SCS + A+ ++ T + S + SAD +A + N R+A+ A
Sbjct: 688 ALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAV- 745
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S L
Sbjct: 746 ----FALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQL 801
Query: 199 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 238
GGA GA+ A DL+Q + + T + G T++
Sbjct: 802 GGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 10/96 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F L++A+ F A FT RE+ F+ F+ A L + + + G D
Sbjct: 606 SRAHFKDTQLQEALFDHCTFAEATFTELLFRETWFTQCGFHRATLNACIFMELSLPGLDF 665
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
SD A LT +++ L R+ GA+++
Sbjct: 666 SD----------AKLTKTTFLKSTLERATFNGALLD 691
>gi|407781954|ref|ZP_11129170.1| hypothetical protein P24_07031 [Oceanibaculum indicum P24]
gi|407206993|gb|EKE76937.1| hypothetical protein P24_07031 [Oceanibaculum indicum P24]
Length = 392
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 64/134 (47%), Gaps = 20/134 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL--------------EKA 156
A ADL A + + R+A A++ + F GS NGA L E A
Sbjct: 72 ADLTGADLTAATLDEASLRKAKLVDANLSGASFRGSDLNGADLRGAHGTVSMSSPGFEGA 131
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ N +GADLS +D +A+LT A+LV TVL + L GA + D +DA + A
Sbjct: 132 MLRLTNLSGADLSGANLD-----QADLTGAMLVGTVLRNASLAGANMRNTDLTDADLGAA 186
Query: 217 Q-KQALCKYANGTN 229
++AL AN +N
Sbjct: 187 NLREALLNGANLSN 200
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 47/101 (46%), Gaps = 10/101 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A+ V R A+ A+MR +D + + A L +A+ AN + A L
Sbjct: 144 SGANLDQADLTGAMLVGTVLRNASLAGANMRNTDLTDADLGAANLREALLNGANLSNAHL 203
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
N ANL A LV LT L GA EGA+F+
Sbjct: 204 ----------NGANLQRARLVGVTLTEGVLDGADTEGANFA 234
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 26/133 (19%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G A+ ADL A + R N + A +R ++ G+ NGA L + GAD
Sbjct: 262 GERAELDGADLTDA-----DLRGFNLSGASLRAANLRGALLNGALL-----VLTDLAGAD 311
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA----------------IIEGADFSDA 211
LS + R L+ ANL A L L+ + LG A ++EGAD ++A
Sbjct: 312 LSQASLVRANLSGANLRGAKLHSADLSGAKLGPAPLIGADGRPTGRSRATVLEGADLTEA 371
Query: 212 VIDLAQKQALCKY 224
V+D QK L +
Sbjct: 372 VLDDEQKSVLPDF 384
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ + D+R+ SG+ G L A A+ TGADL+ +D L +A L +A L
Sbjct: 43 DLSGRDLRKCQLSGAGLQGIRLTGANLEGADLTGADLTAATLDEASLRKAKLVDANLSGA 102
Query: 192 VLTRSDLGGAIIEGA 206
SDL GA + GA
Sbjct: 103 SFRGSDLNGADLRGA 117
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 74/173 (42%), Gaps = 27/173 (15%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L AA + S + L D N A RG + ADLR A H + F
Sbjct: 79 LTAATLDEASLRKAKLVDANLSGASFRG-------SDLNGADLRGA-HGTVSMSSPGFEG 130
Query: 136 ADMRESDFSGSKFNGAYLEKA----------VAYKANFTGADLSDTLMDRMVLNEANLTN 185
A +R ++ SG+ +GA L++A V A+ GA++ +T + L ANL
Sbjct: 131 AMLRLTNLSGADLSGANLDQADLTGAMLVGTVLRNASLAGANMRNTDLTDADLGAANLRE 190
Query: 186 AVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALCKYANGTNPITG 233
A+L L+ + L GA ++ G ++ V+D A + AN P+ G
Sbjct: 191 ALLNGANLSNAHLNGANLQRARLVGVTLTEGVLDGADTEG----ANFAPPLDG 239
>gi|425447182|ref|ZP_18827173.1| Genome sequencing data, contig C314 (fragment) [Microcystis
aeruginosa PCC 9443]
gi|389732326|emb|CCI03724.1| Genome sequencing data, contig C314 (fragment) [Microcystis
aeruginosa PCC 9443]
Length = 285
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 46/78 (58%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ T A++ E+ +G+ NGA LE+A A+ GA+L + ++ L EANL A L+R
Sbjct: 137 ADLTEANLTEAKLNGADLNGANLEEAKLNGADLNGANLEEAKLNGAFLEEANLKRANLIR 196
Query: 191 TVLTRSDLGGAIIEGADF 208
L S L GA ++GA+
Sbjct: 197 ANLIGSGLWGANLKGANL 214
>gi|359457996|ref|ZP_09246559.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 464
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 68/129 (52%), Gaps = 20/129 (15%)
Query: 111 AQFGSADLR----KAVHVKE-NFR----------RANFTSADMRESDFSGSKFNGAYLEK 155
A+ G ADLR K ++KE N R RA+ AD+RE++ S ++ + LEK
Sbjct: 36 AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95
Query: 156 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A+ ++AN + A L+ + ++ L +ANL+ A L L R++LG A + A+ +
Sbjct: 96 SQLGAAILFRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155
Query: 211 AVIDLAQKQ 219
A + A+ Q
Sbjct: 156 ANLSQARLQ 164
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 58/119 (48%), Gaps = 7/119 (5%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N EA RG A+ ADL +A + + R AN +SA + S+ S+ A L
Sbjct: 52 NLKEANLRG-------AKLDGADLLRADLKQADLREANLSSAQLTLSNLEKSQLGAAILF 104
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+A +A T +DL + + L++ANLT A L R L ++ L A + A+ S A +
Sbjct: 105 RANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+AQ ++L K+ RAN + A + SD ++ A L +A +AN A+L
Sbjct: 84 SSAQLTLSNLEKSQLGAAILFRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANL 143
Query: 169 SDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADF 208
+++ L ANL+ NA LV T L ++L GA ++GA+
Sbjct: 144 GKAQLNQANLTTANLSQARLQNASLVGTQLINANLEGASLKGANL 188
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
+A AD+R ++ GA L++A A GADL + + L EANL++A L
Sbjct: 35 KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ L +S LG AI+ A+ S A + L+
Sbjct: 90 LSNLEKSQLGAAILFRANLSQAQLTLS 116
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL+ A + + + +F + +++ + S +GA L +A ++A+ TGA L +
Sbjct: 369 DLKTADLAQADLSQVDFFRVQLPQANLTQSILDGANLTEANLFRADLTGASLKAATLKNA 428
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAII 203
L EANL NA + T L + L GAI+
Sbjct: 429 NLAEANLENANIEGTNLDDAYLCGAIM 455
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 5/106 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQ +DL A R AN + A++ E++ + + A L +A AN + A L
Sbjct: 109 SQAQLTLSDLENA-----QLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ + L ANL A L L +DL GA + AD +A +D
Sbjct: 164 QNASLVGTQLINANLEGASLKGANLIGADLTGANLVNADLREAKLD 209
>gi|428770347|ref|YP_007162137.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684626|gb|AFZ54093.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 278
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 10/129 (7%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
Q DLR A N + + AD+R++D SG+ + YL +A AN TGA+L+
Sbjct: 25 QLRRIDLRNAQLKGVNLGGCDLSYADLRDADLSGADLSKCYLNEANLSGANLTGANLTGA 84
Query: 172 LM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
+ + ++ EA T + L R ++DL GA + GA + + A
Sbjct: 85 YLIKAYLTKVNFQKAIVKEAYFTGSFLTRANFYKADLSGAFLNGAHLNGGIFKDASYDNT 144
Query: 222 CKYANGTNP 230
++ G NP
Sbjct: 145 TRFDKGFNP 153
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 15/99 (15%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL- 183
+ NF + D+R + G G L A A+ +GADLS + LNEANL
Sbjct: 18 ERNFPKLQLRRIDLRNAQLKGVNLGGCDLSYADLRDADLSGADLS-----KCYLNEANLS 72
Query: 184 ---------TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T A L++ LT+ + AI++ A F+ + +
Sbjct: 73 GANLTGANLTGAYLIKAYLTKVNFQKAIVKEAYFTGSFL 111
>gi|434405486|ref|YP_007148371.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428259741|gb|AFZ25691.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 808
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 57/103 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A+ N +AN + A++R ++ G+ +GAY A A+ +GA L
Sbjct: 103 SGANLSGADLSGAILFGANLSQANLSQANLRGANLRGADLSGAYPSGADLRGADLSGAYL 162
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S+ + + L++ANL+ A L + L+ + L GA + GAD S A
Sbjct: 163 SEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYLSGADLSGA 205
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 56/96 (58%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A + N AN + A++ E+ G+K + A L +A AN +GA+LS+ ++
Sbjct: 30 ADLLGADLLGANLSGANLSQANLSEAILFGAKLSQANLSQANLSGANLSGANLSEAILFG 89
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L++ANL+ A L L+ +DL GAI+ GA+ S A
Sbjct: 90 AKLSQANLSQANLSGANLSGADLSGAILFGANLSQA 125
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 54/105 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A + +A + A++ +++ S + +GAYL A A+ +GADL
Sbjct: 148 SGADLRGADLSGAYLSEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYLSGADLSGADL 207
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + R L+ A+L+ A L L+ +DL A + GA S A +
Sbjct: 208 SGARLSRADLSRADLSAADLRGAYLSAADLSAAYLSGAYLSAAYL 252
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 1/105 (0%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
+ A FG A L +A + N AN + A++ E+ G+K + A L +A AN +GA
Sbjct: 52 LSEAILFG-AKLSQANLSQANLSGANLSGANLSEAILFGAKLSQANLSQANLSGANLSGA 110
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
DLS ++ L++ANL+ A L L +DL GA GAD A
Sbjct: 111 DLSGAILFGANLSQANLSQANLRGANLRGADLSGAYPSGADLRGA 155
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 55/123 (44%), Gaps = 20/123 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
S A A L A + N +AN + A++ +D SG+ GA L +A +AN G
Sbjct: 78 SGANLSEAILFGAKLSQANLSQANLSGANLSGADLSGAILFGANLSQANLSQANLRGANL 137
Query: 166 -----------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
ADLS + L++A L+ A L + L+++DL GA + GA
Sbjct: 138 RGADLSGAYPSGADLRGADLSGAYLSEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYL 197
Query: 209 SDA 211
S A
Sbjct: 198 SGA 200
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 59/124 (47%), Gaps = 3/124 (2%)
Query: 91 LADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
L+ N +A+ G + G S A ADL A + + RA+ ++AD+R + S +
Sbjct: 177 LSQANLSQADLSGAYLTGAYLSGADLSGADLSGARLSRADLSRADLSAADLRGAYLSAAD 236
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
+ AYL A A +GA L+ + L+ +L+ L L+ DL GA + GA+
Sbjct: 237 LSAAYLSGAYLSAAYLSGAYLNAAYLSGAYLSGFDLSGVNLSGVNLSGFDLSGANLSGAN 296
Query: 208 FSDA 211
S A
Sbjct: 297 LSGA 300
>gi|416374431|ref|ZP_11683193.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
0003]
gi|357266721|gb|EHJ15312.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
0003]
Length = 279
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADLR A + N A+ TSA++ ++ +G+ NGA L + AN +G DLS
Sbjct: 54 ATLASADLRGANLKQVNLSYADLTSANLSGANLTGAILNGAKLNRVDLSYANLSGVDLSG 113
Query: 171 TLMDR-----MVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDA 211
+ R + L EA+LTNA L + +++S D A ++GA+FS A
Sbjct: 114 ANLSRSDLSYVDLREADLTNANLYKADISQSKLHNTDFQEAFLQGANFSRA 164
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 6/98 (6%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR+A N +A+ + + + +DF + GA +A AN GA L + + +
Sbjct: 125 DLREADLTNANLYKADISQSKLHNTDFQEAFLQGANFSRANLKGANLGGASLREVNLSLV 184
Query: 177 VLNEANLTNAVLVRTV------LTRSDLGGAIIEGADF 208
L+E NL V + L +++L GAI+ A+
Sbjct: 185 NLSEFNLQRVTRVGEIDLSSANLQKANLQGAILRHANL 222
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 45/103 (43%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL A + A+ T A+++ +D S A L A AN +L
Sbjct: 12 TGADLNRADLIYARLLSAKLIDADLTGANLQNADLSWVDLENATLASADLRGANLKQVNL 71
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L+ ANLT A+L L R DL A + G D S A
Sbjct: 72 SYADLTSANLSGANLTGAILNGAKLNRVDLSYANLSGVDLSGA 114
>gi|392382587|ref|YP_005031784.1| conserved protein of unknown function; pentapeptide repeats
[Azospirillum brasilense Sp245]
gi|356877552|emb|CCC98392.1| conserved protein of unknown function; pentapeptide repeats
[Azospirillum brasilense Sp245]
Length = 493
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 60/124 (48%), Gaps = 26/124 (20%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-SGSKFNG---------------AY 152
+A+ ADLR A N RA T A++R +DF +GS NG A
Sbjct: 84 TASTLIGADLRGA-----NLHRAILTDANLRGADFRAGSLMNGTDDKPRSDGVTRLTEAK 138
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+E+++ ANFTG DLS LN+A+LT A + VL +D GA ++G F
Sbjct: 139 MERSILAGANFTGCDLSGA-----DLNDADLTGADMTAAVLVGADFWGATLDGVTFDGTT 193
Query: 213 IDLA 216
ID A
Sbjct: 194 IDEA 197
>gi|308205942|gb|ADO19342.1| pentapeptide repeat protein [Nostoc flagelliforme str. Sunitezuoqi]
Length = 146
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 9/106 (8%)
Query: 115 SADLRKAVHVKE---------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
SA +R+ + +E N + A+ D+R ++ G+ GA LE A AN
Sbjct: 28 SAPVRRLLETRECFGCNLTGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKY 87
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L+ + +LN ANLTN L + L SD+ GA++ D S A
Sbjct: 88 ANLTKAFVSDTILNNANLTNVNLSNSRLYNSDVDGAVMANIDLSGA 133
>gi|428298482|ref|YP_007136788.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235026|gb|AFZ00816.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 567
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ SADL RA+F A++R +DFSG+ N A A ANF+ ADL
Sbjct: 83 SDAKLNSADLS----------RADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADL 132
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++ + L N +NA + T L R +L G + GAD S
Sbjct: 133 ANADFSGLDLYGVNFSNAKMRGTRLDRVNLSGVNLSGADLS 173
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F A+LR N ANF +AD+R ++FS + A Y NF+ A +
Sbjct: 93 SRADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADLANADFSGLDLYGVNFSNAKM 152
Query: 169 SDTLMDRMVLNEANLTNAVL----VRTV------LTRSDLGGAIIEGADF 208
T +DR+ L+ NL+ A L +R V LTR +L A + G DF
Sbjct: 153 RGTRLDRVNLSGVNLSGADLSGIDLRNVNLRGINLTRINLSHANLIGFDF 202
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + + + AN +D+R +D S +K N A L +A Y+AN D S ++
Sbjct: 55 ADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGANLNS 114
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L NA L +D G + G +FS+A
Sbjct: 115 ANFRNADLRNANFSNADLANADFSGLDLYGVNFSNA 150
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 11/131 (8%)
Query: 92 ADLNK---YEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
ADL++ Y+A R G ++A F +ADLR A + A+F+ D+ +FS
Sbjct: 90 ADLSRADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADLANADFSGLDLYGVNFSN 149
Query: 146 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
+K G L++ N +GADLS + L NL L R L+ ++L G G
Sbjct: 150 AKMRGTRLDRVNLSGVNLSGADLSG-----IDLRNVNLRGINLTRINLSHANLIGFDFRG 204
Query: 206 ADFSDAVIDLA 216
D +A + A
Sbjct: 205 TDLRNANLSYA 215
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R AN + AD+R SD S +K A L A NF ADL +D L A+L A
Sbjct: 206 DLRNANLSYADLRNSDLSNAKLESADLRNANLSGVNFRNADLIGVNLDGASLQNADLRGA 265
Query: 187 VLVRTVLTRSDLGGAIIEGADFS 209
L T L G + E D++
Sbjct: 266 NLNFTSLP----SGIVAEAEDYT 284
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
D SG+ + L++A Y AN +DL +T + LN A+L+ A + L +D GA
Sbjct: 51 DLSGADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGA 110
Query: 202 IIEGADFSDA 211
+ A+F +A
Sbjct: 111 NLNSANFRNA 120
>gi|209526072|ref|ZP_03274604.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067543|ref|ZP_17056333.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493460|gb|EDZ93783.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711117|gb|EKD06319.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 519
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 179
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 29 RVNLSQANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 88
Query: 180 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ANL +A L+R L R++L AI+ GA+ ++A DL ++A ++A+
Sbjct: 89 RADLSQANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 136
Score = 43.5 bits (101), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGA 235
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +A+
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 117
Query: 159 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 208
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177
Query: 209 SDAVIDLAQ 217
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 48/98 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L +A+ N A+ A +R +D + +GA L +A +N ++L+
Sbjct: 105 AELMRAELSEAIVNGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTR 164
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ R L NL NA L + L +DL GA + GA+
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANL 202
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 202 IIEGADFSDAVIDLA 216
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADL + A+ RG +A+LR+A + R AN + A++R ++ SG+ +GA
Sbjct: 165 ADLTR--ADLRG-------VNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGA 215
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
LE A+ GA+LS + A+LT A L+ T ++L G+ + G
Sbjct: 216 NLEATQLSGASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDANLRGSALTG 269
>gi|427724651|ref|YP_007071928.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427356371|gb|AFY39094.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 281
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 66/116 (56%), Gaps = 9/116 (7%)
Query: 107 IGSAAQFGSADLRKA----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
I + A A+LR+A ++ N +AN S+++ E++ + +K + + A +A
Sbjct: 46 IFTGATLDQANLREADLSYASLQGNLSQANLISSNLTEANLTAAKMAYSGMRAANLTRAK 105
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 213
T ADLS +++ ++ EANL+ A LV R LT+++L GA ++GA+ + A++
Sbjct: 106 LTSADLSYCILNEAIMREANLSKATLVDAFIGRANLTQANLEGANLQGANLTSAIL 161
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 10/106 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKANFTG 165
A+L KA V RAN T A++ ++ G+ GA L A + N TG
Sbjct: 124 ANLSKATLVDAFIGRANLTQANLEGANLQGANLTSAILIGANLRGANLANATLHGINATG 183
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ D + + LN ANLTN L T L + L + GAD ++A
Sbjct: 184 STADDADLSKSKLNSANLTNVKLRGTNLREAQLAWTTMRGADLTEA 229
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 117 DLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGA-----YLEKAVAYKANFTGA 166
+L +A + N AN T+A M R ++ + +K A L +A+ +AN + A
Sbjct: 70 NLSQANLISSNLTEANLTAAKMAYSGMRAANLTRAKLTSADLSYCILNEAIMREANLSKA 129
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L D + R L +ANL A L LT + L GA + GA+ ++A +
Sbjct: 130 TLVDAFIGRANLTQANLEGANLQGANLTSAILIGANLRGANLANATL 176
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S ++ SA+L N R A MR +D + +K L +A Y +NFTGA+L
Sbjct: 192 SKSKLNSANLTNVKLRGTNLREAQLAWTTMRGADLTEAK-----LFRAKLYWSNFTGANL 246
Query: 169 SDTLMDRMVLNEANLTNAVLVRTV 192
+ T++ +++ N NA+L T+
Sbjct: 247 TRTMLMDATMDQVNFRNAILDGTI 270
>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
Length = 221
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 15/107 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR + + R AN T AD+R +D G+ GA L +A +AN ADLS
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANLCDADLS- 169
Query: 171 TLMDRMVLNEANLTNAV-----LVRTVLTRSDLGGAIIEGADFSDAV 212
+ANL+ A+ L R L+ DLG A + GA D +
Sbjct: 170 ---------QANLSGAILSQVDLRRVTLSNVDLGQAELSGATVPDQL 207
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 48/100 (48%)
Query: 114 GSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 173
G D R + N N T R + + + + A L++ +AN TGA L T +
Sbjct: 24 GERDFRGVDLQQINLSEVNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAKLWRTNL 83
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L EANL+ A ++R LTR +L AI+ AD ++
Sbjct: 84 KKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTIL 123
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 6/121 (4%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
L +Y A R G+ +L + + N AN + A ++E + + + GA
Sbjct: 18 LERYSAGERDFRGVDLQQINLSEVNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAK 77
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGAD 207
L + K + A+LS M R L NL A+L R T+L +DL GA + AD
Sbjct: 78 LWRTNLKKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTILQDTDLRGANLTRAD 137
Query: 208 F 208
Sbjct: 138 L 138
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
A+L AV + N +AN T A + ++ + A L +A +AN T +L +
Sbjct: 53 LADANLSLAVLQEVNLNQANLTGAKLWRTNLKKTSLVEANLSQAFMIRANLTRVNLRQAI 112
Query: 173 MDR-----MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+ R +L + +L A L R L +DL GA + GA
Sbjct: 113 LTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGA 151
>gi|189499236|ref|YP_001958706.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189494677|gb|ACE03225.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 442
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 64/128 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ AD ++ +++RA+ R ++ G+ FN A+++KA A+ TGA L +
Sbjct: 307 AKLDHADFSESDLSSTSWKRASLVETVFRNANLQGADFNRAFMKKADLSGADLTGAQLRE 366
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
T + L ++NL+ L T LT +DL GA + GA+ ++D A A +G
Sbjct: 367 TRLQEADLKKSNLSKTNLYDTDLTCADLRGADLTGANLLYTILDNALISAETITPSGEKA 426
Query: 231 ITGVSTRK 238
TG + K
Sbjct: 427 TTGWAVLK 434
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ AD R A + +R D++++D SG+ GA L+ + A +A F ADL+
Sbjct: 83 AKLNGADFRNAKLFSASLKRT-----DLKQTDLSGANLRGADLKNSYAKEAKFINADLTG 137
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T L A+LT AVL + ++L A + G + + A
Sbjct: 138 TDFRYANLEGADLTGAVLENALFFDANLSSADLRGVNLTGA 178
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE----- 180
E+ A ADM++ D + S NGA L+ A +F+ +DLS T R L E
Sbjct: 282 EDLDDAGLKGADMKKLDMTSSTMNGAKLDHA-----DFSESDLSSTSWKRASLVETVFRN 336
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
ANL A R + ++DL GA + GA +
Sbjct: 337 ANLQGADFNRAFMKKADLSGADLTGAQLRE 366
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 5/103 (4%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+L KA A+ +A M + +G+K NGA A + A+ DL T +
Sbjct: 54 NLDKATLEDATLVNADLHNASMVNTRLNGAKLNGADFRNAKLFSASLKRTDLKQTDLSGA 113
Query: 177 VLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L A+L N A + LT +D A +EGAD + AV++
Sbjct: 114 NLRGADLKNSYAKEAKFINADLTGTDFRYANLEGADLTGAVLE 156
>gi|319791261|ref|YP_004152901.1| hypothetical protein Varpa_0569 [Variovorax paradoxus EPS]
gi|315593724|gb|ADU34790.1| Protein of unknown function DUF2169 [Variovorax paradoxus EPS]
Length = 865
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 34/86 (39%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++F + T AD D G F GA+LE A AN +GA+LS VL ANL
Sbjct: 544 KHFSGMDLTGADFSGLDLRGVNFTGAWLESANFENANLSGANLS-----HAVLAHANLRG 598
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDA 211
A+ V T L ++LGGA + A DA
Sbjct: 599 AIAVETSLVGANLGGARLASAVLEDA 624
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 35/115 (30%), Positives = 54/115 (46%), Gaps = 3/115 (2%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F DLR ANF +A++ ++ S + A L A+A + + GA+L
Sbjct: 552 TGADFSGLDLRGVNFTGAWLESANFENANLSGANLSHAVLAHANLRGAIAVETSLVGANL 611
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLAQKQA 220
+ VL +A+ NA T + L GA +EGA + D V +DL + QA
Sbjct: 612 GGARLASAVLEDADCRNARFDGCDWTGARLRGARLEGASWLDVVWGGVDLQRAQA 666
Score = 40.4 bits (93), Expect = 0.94, Method: Composition-based stats.
Identities = 40/145 (27%), Positives = 53/145 (36%), Gaps = 41/145 (28%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDF----------------------------- 143
F DLR V + ANF D+R D
Sbjct: 671 FYKQDLRGTVFTEAVLDDANFIECDLRGCDLRAAHMARATFVQCRLDGVHASGVQAEGVV 730
Query: 144 --SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT---------- 191
G GA L A ANF G DLS + +L+ ANL L R+
Sbjct: 731 FVEGCSLVGADLGHAAMGSANFGGMDLSQVSLVGSMLDGANLIGTRLARSDWRLASAKGV 790
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLA 216
+L ++DL A + GA+FS+AV+ A
Sbjct: 791 LLCKADLAHARMAGANFSNAVLQHA 815
>gi|427416432|ref|ZP_18906615.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425759145|gb|EKU99997.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 237
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 54/106 (50%), Gaps = 15/106 (14%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F + +L +++ + R AN A +RESD S + A LEKA KA+ GA+LSD
Sbjct: 57 FENCNLSESILWGSDLRNANLKQAQLRESDLSSALLTQANLEKANLIKASLCGANLSD-- 114
Query: 173 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 213
ANL NA L+ L R+DLG + + GAD S A +
Sbjct: 115 --------ANLANACLLDADLRSNSDQRTDLGQSNLSGADLSYAFL 152
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 47/83 (56%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
ANF+ +D+R+S + F E ++ G+DL + + + L E++L++A+L +
Sbjct: 35 ANFSQSDLRQSRLGRTHFCRVNFENCNLSESILWGSDLRNANLKQAQLRESDLSSALLTQ 94
Query: 191 TVLTRSDLGGAIIEGADFSDAVI 213
L +++L A + GA+ SDA +
Sbjct: 95 ANLEKANLIKASLCGANLSDANL 117
>gi|428314300|ref|YP_007125277.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255912|gb|AFZ21871.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 355
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 56/112 (50%), Gaps = 7/112 (6%)
Query: 111 AQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A +ADLR KA ++ RA+ T A + E+D SG+ +GA L A A G
Sbjct: 61 ANLSNADLRVANFTKAQLIETTLSRADLTQAILSEADLSGAILSGALLSGADLKGATLIG 120
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L L+ L + NLT A L R +L ++DL AI+ A +A DL++
Sbjct: 121 VSLIGALIKGAKLTKVNLTGATLSRAILVQADLKKAILNRAILGEA--DLSE 170
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 10/114 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A N RAN T +++++ G++ + A L KA KAN +GA+L +
Sbjct: 226 ANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKANLHKANLSKANLSGANLLE 285
Query: 171 TLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ L++ANL TNA L T L ++L GA +EGA+ S+A ++
Sbjct: 286 ANLLDANLSQANLLRSGLLLTYLTNANLSSTNLNEANLIGANLEGANLSEASLE 339
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L +A + N R+AN AD+ E+D G+ +GA L AN +GADL
Sbjct: 169 SEANLSGASLVRAYLNRVNLRQANLEEADLSEADLKGANLSGANLS-----GANLSGADL 223
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L+ A+L A L R LT L A + GA+ S A
Sbjct: 224 REANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKA 266
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 41/82 (50%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NF+ A + D SGS N L A ANFT L + L AN T A L+ T
Sbjct: 22 NFSGAKLSGVDLSGSNLNRINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIET 81
Query: 192 VLTRSDLGGAIIEGADFSDAVI 213
L+R+DL AI+ AD S A++
Sbjct: 82 TLSRADLTQAILSEADLSGAIL 103
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 181
N R N +SA + ++F+ +K A L A ANFT A L +T + R +L+EA
Sbjct: 37 NLNRINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIETTLSRADLTQAILSEA 96
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L+ A+L +L+ +DL GA + G A+I
Sbjct: 97 DLSGAILSGALLSGADLKGATLIGVSLIGALI 128
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL+KA+ RA AD+ E++ SG+ AYL + +AN ADLS+ +
Sbjct: 151 ADLKKAI-----LNRAILGEADLSEANLSGASLVRAYLNRVNLRQANLEEADLSEADLKG 205
Query: 176 MVLNEANLTNAVL----VRTV-LTRSDLGGAIIEGADFSDA 211
L+ ANL+ A L +R L+ +DL GA ++GA+ + A
Sbjct: 206 ANLSGANLSGANLSGADLREANLSHADLSGADLQGANLTRA 246
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 48/94 (51%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
++ A K N A + A + ++D + N A L +A +AN +GA L ++R+
Sbjct: 128 IKGAKLTKVNLTGATLSRAILVQADLKKAILNRAILGEADLSEANLSGASLVRAYLNRVN 187
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L +ANL A L L ++L GA + GA+ S A
Sbjct: 188 LRQANLEEADLSEADLKGANLSGANLSGANLSGA 221
>gi|332708407|ref|ZP_08428384.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332352810|gb|EGJ32373.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 309
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAV-----HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S AQ ADLR+A + N + AN + + E++FSG+ + A LE A +
Sbjct: 115 SLAQLQKADLREATGKGITFINANLKMANLGAVNFPEANFSGASLDIASLEAANLMDTKW 174
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
GADL + R L A+LT+A L+ L +DL I+ GA ++ ++
Sbjct: 175 VGADLERANLSRASLVRADLTSANLIVANLRAADLTEVILRGAQLLESSLE 225
>gi|434394477|ref|YP_007129424.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428266318|gb|AFZ32264.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 132
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 4/88 (4%)
Query: 115 SADLRKAVHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
S++L++ ++ K+ N R AN +A++ E++ SG+ GA L+ A KAN GA+L
Sbjct: 40 SSELQRLLNTKQCPGCNLRGANLRNANLEEANLSGANLQGANLQNADLEKANLQGANLQQ 99
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ L EANL NA L L +DL
Sbjct: 100 ANLSDADLQEANLQNANLQNANLRSADL 127
>gi|77404498|ref|YP_345074.1| hypothetical protein pREC1_0013 [Rhodococcus erythropolis PR4]
gi|77019879|dbj|BAE46254.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length = 589
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A K N R A + A + E+D +G+ GA L AN +GADL+D
Sbjct: 403 ADLEDADLESAKLSKANLRLAILSGATLPEADLTGAVLIGANLTNTTFSGANLSGADLTD 462
Query: 171 -----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ L EANLT AVL+ L ++L A + A+ SDA
Sbjct: 463 ADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 9/141 (6%)
Query: 95 NKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N EA G + G+A A A L KA K A +AD++E+ G+ A
Sbjct: 349 NLAEANLTGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATLEGADLEDA 408
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LE A KAN A LS L EA+LT AVL+ LT + GA + GAD +DA
Sbjct: 409 DLESAKLSKANLRLAILSGA-----TLPEADLTGAVLIGANLTNTTFSGANLSGADLTDA 463
Query: 212 VIDLAQ-KQALCKYANGTNPI 231
+ +A ++A AN T +
Sbjct: 464 DLSVADLEEADLTEANLTGAV 484
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 15/98 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L AV + N AN T AD+ +++ S + Y AN T A+LSD
Sbjct: 473 ADLTEANLTGAVLIGANLAHANLTDADLSKANLSDADL----------YSANLTDANLSD 522
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L+ A LT A L+ T+LTR DL GA++ G D
Sbjct: 523 A-----DLSGATLTRAGLMGTILTRVDLTGAVLTGLDL 555
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 10/109 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLR A N RAN A + E++ + + GAY+ GA L
Sbjct: 316 SGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANLTGAYM----------FGAAL 365
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
++ ++ L +A+L L +L +DL A +EGAD DA ++ A+
Sbjct: 366 TEAVLTDATLTKAHLAKTTLAGALLINADLQEATLEGADLEDADLESAK 414
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A LR A + R AN AD++ ++ SG+ A L+ A+ +A+ TGA+L+D +
Sbjct: 223 ARLRGASLGFADLRAANLQGADLQTAELSGATLRLANLKGAILREADLTGANLTDATLTE 282
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L EA L A+LV L DL +E A+ S A +
Sbjct: 283 ADLAEAKLQGAILVNVNLQNFDLSRLDLEKANLSGATL 320
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 5/166 (3%)
Query: 49 QFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG 108
Q D + +G +L N + + L A + + + L + + EA+ +G +
Sbjct: 241 QGADLQTAELSGATLRLANLKGAI---LREADLTGANLTDATLTEADLAEAKLQGAILVN 297
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
Q + DL + K N A AD+R + +G+ A L A ++AN A+L
Sbjct: 298 VNLQ--NFDLSRLDLEKANLSGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANL 355
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ M L EA LT+A L + L ++ L GA++ AD +A ++
Sbjct: 356 TGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATLE 401
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 63/139 (45%), Gaps = 8/139 (5%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA--- 166
AA ADL+ A R AN A +RE+D +G+ A L +A +A GA
Sbjct: 237 AANLQGADLQTAELSGATLRLANLKGAILREADLTGANLTDATLTEADLAEAKLQGAILV 296
Query: 167 --DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQAL 221
+L + + R+ L +ANL+ A L L + L GA +E A+ + A + +LA+
Sbjct: 297 NVNLQNFDLSRLDLEKANLSGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANLT 356
Query: 222 CKYANGTNPITGVSTRKSL 240
Y G V T +L
Sbjct: 357 GAYMFGAALTEAVLTDATL 375
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 64/144 (44%), Gaps = 8/144 (5%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKE 126
F L+ A + +++ L + + EA G IG+ A ADL KA
Sbjct: 449 TFSGANLSGADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ AN T A++ ++D SG+ A L + + + TGA L+ + L NLT+
Sbjct: 509 DLYSANLTDANLSDADLSGATLTRAGLMGTILTRVDLTGAVLTG-----LDLVGVNLTDV 563
Query: 187 VLVRTVLTRSDLGGAIIEGADFSD 210
L + DL GAI+ G D S+
Sbjct: 564 NLDNVNMDDVDLSGAILPGTDTSE 587
>gi|397736621|ref|ZP_10503302.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
gi|396927531|gb|EJI94759.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
Length = 201
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYL 153
+E R E I + F ADL ++ HV FR +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESNHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 154 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 203
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 204 EGADFSDAVID 214
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|312194409|ref|YP_004014470.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
gi|311225745|gb|ADP78600.1| pentapeptide repeat protein [Frankia sp. EuI1c]
Length = 2027
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/83 (36%), Positives = 45/83 (54%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ T D+ ++D +G+ A L+ A AN TGA L+ R+ L ANLT+A L R
Sbjct: 1243 ADLTGLDLSDADLAGANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLTDADLRR 1302
Query: 191 TVLTRSDLGGAIIEGADFSDAVI 213
LT DL G ++ G+ + A +
Sbjct: 1303 ARLTDPDLTGTVLTGSKWERAAL 1325
Score = 46.2 bits (108), Expect = 0.015, Method: Composition-based stats.
Identities = 29/92 (31%), Positives = 44/92 (47%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A ADL + AN T AD+ +++ +G+ GA L A + TGA+L+
Sbjct: 1237 GAHLEGADLTGLDLSDADLAGANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLT 1296
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
D + R L + +LT VL + R+ L GA
Sbjct: 1297 DADLRRARLTDPDLTGTVLTGSKWERAALLGA 1328
>gi|282898711|ref|ZP_06306699.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
CS-505]
gi|281196579|gb|EFA71488.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
CS-505]
Length = 682
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQ ADL A + + + + A++ ++++ G+ + +YL A ANF+ A+L
Sbjct: 529 SGAQLQEADLYAAQLARVSAIGSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANL 588
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S +L AN+TN L ++R+DL GA +EG DF A++
Sbjct: 589 SGA-----ILRYANMTNTNLRSADISRADLRGANLEGTDFQGAIL 628
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 59/125 (47%), Gaps = 16/125 (12%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
DL ++V RAN S+ + +++ S ++ G+ L++A A TGAD+S +
Sbjct: 481 VDLSRSV-----LNRANLASSKLIDANLSSAQLVGSDLQQATLQDAVLTGADISGAQLQE 535
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------ALCKY 224
L A L + + L+ ++L +GAD S++ ++ A A+ +Y
Sbjct: 536 ADLYAAQLARVSAIGSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANLSGAILRY 595
Query: 225 ANGTN 229
AN TN
Sbjct: 596 ANMTN 600
>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
Length = 1263
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 5/113 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S ++ ++L K+ K + N T DM SD A L +++ Y+AN + A
Sbjct: 484 ILSGSKLEKSNLHKSKLSKVDLSNCNLTLTDMSSSDL-----QKADLSRSLFYRANLSSA 538
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+L + M+ L+ NL++A L R L S L GA +EGADFS + A Q
Sbjct: 539 NLKSSNMNGADLSHCNLSSACLERASLYGSKLEGANLEGADFSHCDLSFAMLQ 591
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R F ++D R DFSGSK +G L KA +N + DL+ + M + L ANL A
Sbjct: 1033 DLRSCKFANSDFRGQDFSGSKLSGVQLSKANLTGSNLSSCDLTGSDMSKCHLERANLLGA 1092
Query: 187 VLVRTVLTRSDLGGAII 203
VL + L+++ L GA++
Sbjct: 1093 VLKGSDLSQARLKGAVL 1109
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 22/110 (20%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
++ R + + AD+ DF+G+ F+G+ L +A ++ G DLS N
Sbjct: 931 KDLRNSKLSEADLSHQDFAGADFSGSKLSRANLRQSKLDGCDLS---------------N 975
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-------CKYANGT 228
L R++L + L GA+I G DFS+A ++ A A CK+A T
Sbjct: 976 CDLSRSILEGASLQGAVIRGTDFSNAKLEGAALPAWVEVDFECCKFAGAT 1025
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANF 163
S++ ADL +++ + N AN S++M +D S + A LE+A Y AN
Sbjct: 516 SSSDLQKADLSRSLFYRANLSSANLKSSNMNGADLSHCNLSSACLERASLYGSKLEGANL 575
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
GAD S + +L NL A LT +D G+ +EGA D
Sbjct: 576 EGADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPD 622
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD------ 170
+L KA+ + + A ++ +D S + GA LE A +N + +LS
Sbjct: 254 NLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLEGADLSYSNLSQCNLSQASCSRI 313
Query: 171 ----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++M R LN+ + +A L LT S L + EGADF D+V+D
Sbjct: 314 MLQFSVMTRARLNDGDFGSANLSECDLTHSQLSSSCFEGADFRDSVLD 361
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 51/123 (41%), Gaps = 20/123 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----------- 159
A F DL A+ N R ANFT A + +DFSGS GA + Y
Sbjct: 578 ADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGYDLQGVCLSGTS 637
Query: 160 ---------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+AN ADL + + L +A+L+ A L L +DL G + G + S
Sbjct: 638 GFFKDKSARRANLCDADLRGQELSGVNLQQADLSFADLTGANLQGADLTGTKLNGTNLSQ 697
Query: 211 AVI 213
+ +
Sbjct: 698 SRL 700
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 47/103 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L +A N +N +S D+ ++ SG+ GA L A + + + L
Sbjct: 191 SRADLSEAKLCRADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKLFNCDLSRTSL 250
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
D + + +L +A L A L L+ +DL A +EGA A
Sbjct: 251 MDVNLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLEGA 293
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANF 163
S + + D AV NF RAN T A +MR + F + F A +
Sbjct: 411 SESNLTACDFSGAVMNDSNFERANLTKARFVGCEMRNASFQHATFASATFSDVKMEGVDL 470
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
TG DLS + +++L+ + L + L ++ L++ DL + D S
Sbjct: 471 TGCDLSSCDLSKLILSGSKLEKSNLHKSKLSKVDLSNCNLTLTDMS 516
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 50/91 (54%), Gaps = 10/91 (10%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A LR + V + + NF+ D+ +++ S S + A L +A +A+ T A+L+
Sbjct: 158 ATLRGSSFVSSSCAQTNFSRCDLSDANLSMSTLSRADLSEAKLCRADLTHANLT------ 211
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
E+NL++ L T+L+ ++LGGA + GA
Sbjct: 212 ----ESNLSSCDLSDTILSGANLGGADLSGA 238
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 16/118 (13%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ D+ + + R N +++ E +FS FNGA L Y N T DL
Sbjct: 725 RLSGVDMNNSSWTGADLRGVNMAGSNLNECNFSEVSFNGADLTGCSIYNTNLTNCDLKGV 784
Query: 172 LMDRMVLNEANLTNAVL----------------VRTVLTRSDLGGAIIEGADFSDAVI 213
+ R L ++L+++ + V T T + GA + ADFS AV+
Sbjct: 785 NLSRANLQYSDLSHSAMDGATLPEWSSGSFEGVVLTGATGINFVGADLRKADFSQAVL 842
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 15/82 (18%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NF AD+R++DFS + G L A +AN AD + E NLT L
Sbjct: 826 NFVGADLRKADFSQAVLKGHDLSAADLSQANLRNADFT----------ECNLTGCNL--- 872
Query: 192 VLTRSDLGGAIIEGADFSDAVI 213
T+S+L G +GA S A+I
Sbjct: 873 --TQSNLSGCNFDGAILSGAII 892
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 41/90 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQF +A+L R+NF+ +++ DFSG+ N + E+A KA F G ++
Sbjct: 386 SDAQFVNANLSNVKLNAARVLRSNFSESNLTACDFSGAVMNDSNFERANLTKARFVGCEM 445
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ A ++ + LT DL
Sbjct: 446 RNASFQHATFASATFSDVKMEGVDLTGCDL 475
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 53/111 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A + L +A + RA+ T A++ ES+ S + L A A+ +GA L
Sbjct: 181 SDANLSMSTLSRADLSEAKLCRADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKL 240
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+ + R L + NL+ A+L + L + L G + D SDA ++ A+ +
Sbjct: 241 FNCDLSRTSLMDVNLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLE 291
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
RRAN AD+R + SG A L A AN GADL+ T ++ L+++ L A
Sbjct: 646 RRANLCDADLRGQELSGVNLQQADLSFADLTGANLQGADLTGTKLNGTNLSQSRLAGACF 705
Query: 189 VRTVLTRSDLGGAIIEGAD 207
+ D+ G + GA+
Sbjct: 706 --SCWAERDVSGIKLAGAE 722
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 5/85 (5%)
Query: 113 FGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F ADLRKA V + A+ + A++R +DF+ G L ++ NF GA
Sbjct: 827 FVGADLRKADFSQAVLKGHDLSAADLSQANLRNADFTECNLTGCNLTQSNLSGCNFDGAI 886
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTV 192
LS ++ ++ L+ L A+L +
Sbjct: 887 LSGAIIKQVDLSTTRLNGAILPELI 911
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 22/80 (27%), Positives = 43/80 (53%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
V + R+A+F+ A ++ D S + + A L A + N TG +L+ + + + A L
Sbjct: 828 VGADLRKADFSQAVLKGHDLSAADLSQANLRNADFTECNLTGCNLTQSNLSGCNFDGAIL 887
Query: 184 TNAVLVRTVLTRSDLGGAII 203
+ A++ + L+ + L GAI+
Sbjct: 888 SGAIIKQVDLSTTRLNGAIL 907
>gi|239947676|ref|ZP_04699429.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
scapularis]
gi|239921952|gb|EER21976.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
scapularis]
Length = 953
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 22/178 (12%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L+N + + AL A C N+ + N Y ++T E + ADLR+A+
Sbjct: 494 LENAFMNKTHALEAKFKEQC--NMQGITARNAYFSDTEFE----NILSLKEADLREAIMQ 547
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----------KANFTGADLSDTLMD 174
+ + A+ T A + ++ + A L A A KA G ++SD +
Sbjct: 548 RVKLKNADLTKAKLDKAKLEYADLTNATLTNATAQFAKLSNATLEKAEAEGLNISDAIAK 607
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYAN 226
+ EAN NA++ R LT++D A++E AD ++A+ A KQA K AN
Sbjct: 608 NINAKEANFKNAIMQRADLTKADFTKAVLENADMQAMEAAEAIFKEANLKQANLKVAN 665
Score = 39.7 bits (91), Expect = 1.5, Method: Composition-based stats.
Identities = 38/149 (25%), Positives = 64/149 (42%), Gaps = 13/149 (8%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
LKN +F S L +++C+ + + N A + + F ADL+K+
Sbjct: 354 LKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNVTARNTGFLF--ADLKKSKIE 410
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMD 174
+ RA D+ E++ + SKFN + A A K +N TG L+ M
Sbjct: 411 NSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQ 470
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAII 203
R+ + L NA+L + + +DL A +
Sbjct: 471 RVQMQGVVLNNALLDQANIVSTDLENAFM 499
Score = 39.3 bits (90), Expect = 2.0, Method: Composition-based stats.
Identities = 39/137 (28%), Positives = 56/137 (40%), Gaps = 30/137 (21%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VAYKA 161
+ +ADL KA K A+ T+A + + +K + A LEKA +A
Sbjct: 550 KLKNADLTKAKLDKAKLEYADLTNATLTNATAQFAKLSNATLEKAEAEGLNISDAIAKNI 609
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAV--------------------LVRTVLTRSDLGGA 201
N A+ + +M R L +A+ T AV L + L ++L G
Sbjct: 610 NAKEANFKNAIMQRADLTKADFTKAVLENADMQAMEAAEAIFKEANLKQANLKVANLAGI 669
Query: 202 IIEGADFSDAVIDLAQK 218
EGADF A ID A K
Sbjct: 670 NKEGADFDKAKIDDATK 686
>gi|383501588|ref|YP_005414947.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
gi|378932599|gb|AFC71104.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
Length = 960
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 6/116 (5%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFVKLSNATLEKAEA-----EGLNISDV 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 226
+ + EAN N ++ R LT++D A++E AD +D K+A K AN
Sbjct: 610 IAKNINAKEANFKNVIMQRADLTKADFTKAVLENADMQAVEALDAIFKEATLKQAN 665
Score = 40.8 bits (94), Expect = 0.73, Method: Composition-based stats.
Identities = 43/169 (25%), Positives = 71/169 (42%), Gaps = 16/169 (9%)
Query: 51 PDCSNNQCAGPYA---KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGI 107
PD S+ +G LKN +F S L +++C+ + + N A +
Sbjct: 342 PDLSDINLSGKTLTNLNLKN-TLFASANLENINISNCNLDFTNFEGANLQNAVFQDVTAR 400
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK------- 160
+ F ADL+K+ + RA D+ E++ + SKFN + A A K
Sbjct: 401 NTGFLF--ADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIMQDSE 458
Query: 161 ---ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+N TG L+ M R+ + L NA+L + + +DL A + A
Sbjct: 459 WKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIISTDLENAFMNNA 507
>gi|304393841|ref|ZP_07375766.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
gi|303294040|gb|EFL88415.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
Length = 247
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 59/115 (51%), Gaps = 5/115 (4%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
G+G + GS + V +F +FT A+M SDFSGS + K+ +ANFTG
Sbjct: 109 GVGLSKVEGS----RTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTG 164
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
ADLS ++ ++ ANL +A L T + S + A + G D S A L Q+Q
Sbjct: 165 ADLSGAMITFANISRANLADAKLDGTDFSSSWMYLAKVAGVDMS-ATKGLTQEQV 218
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 51/118 (43%), Gaps = 20/118 (16%)
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLM 173
R + NF AN D+ SD KF+GA + K++ +AN + G LS
Sbjct: 58 RNVILSGYNFSLANLNQTDLFGSDLRDVKFDGADMTKSILTRANLSNSSLKGVGLSKVEG 117
Query: 174 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIE---------------GADFSDAVIDLA 216
R VL ++ T+ + + RSD G+I++ GAD S A+I A
Sbjct: 118 SRTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTGADLSGAMITFA 175
>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 710
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 91 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFS 144
L + N ++A T F + A GSADL KA + N + F +D+RES++
Sbjct: 534 LIETNLHQANLTEATF---TGADLGSADLSKANLYRANLSKVKAEGTTFQLSDLRESNWQ 590
Query: 145 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
G+ +GA +AN ADLS L+ A L NA L T ++ +DL GA +
Sbjct: 591 GANLSGANFS-----RANLKKADLSLALLTNANFRNAQLQNANLRNTDISLADLRGANLS 645
Query: 205 GADFSDA 211
G DF A
Sbjct: 646 GTDFKGA 652
Score = 40.4 bits (93), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 53/120 (44%), Gaps = 15/120 (12%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKA 156
I A A L KA+ +ANF+SA ++ E+ F+G+ A L KA
Sbjct: 503 IMKRADLFRATLSKAIMPGSTITQANFSSAKLIETNLHQANLTEATFTGADLGSADLSKA 562
Query: 157 VAYKANFTGADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
Y+AN + T L E ANL+ A R L ++DL A++ A+F +A
Sbjct: 563 NLYRANLSKVKAEGTTFQLSDLRESNWQGANLSGANFSRANLKKADLSLALLTNANFRNA 622
>gi|359458687|ref|ZP_09247250.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 203
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 66/146 (45%), Gaps = 22/146 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDF----------SGSKFNGAYLEKAVAYK 160
A F SADLRKA + + R A AD+R ++ SG+ +GA L A+ Y
Sbjct: 53 ANFASADLRKAKLFRADLRAACLYRADLRGANLKGANLFGANLSGANLSGANLSNAMLYC 112
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF--SDAV-IDLAQ 217
AN GA+L T++D L N ++ L +L + L G EG +D + I+L Q
Sbjct: 113 ANLGGANLRGTILDSANLMRVNFSHGDLRNAMLRNAKLQGTHFEGTRMLQTDLIEINLNQ 172
Query: 218 KQALCKY---------ANGTNPITGV 234
Q Y A G ITG+
Sbjct: 173 AQIDGVYLMDPDANNTAMGNTAITGI 198
>gi|428777412|ref|YP_007169199.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691691|gb|AFZ44985.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 333
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLN 179
+ N RRA+ S ++ E DF+ + A L +A KAN GADLS+ + + L
Sbjct: 154 RTNLRRADLESLNLDELDFTQANLTEANLVRATLTKANLQGADLSEANLFNADLSKANLK 213
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ANL A L+R L R+DL GA + GA ++A
Sbjct: 214 GANLRGANLIRANLERADLSGADLRGAYLNEA 245
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NF 163
S A +ADL KA N R AN A++ +D SG+ GAYL +A ++A N
Sbjct: 198 SEANLFNADLSKANLKGANLRGANLIRANLERADLSGADLRGAYLNEAKMFEASLDNVNL 257
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ A+L T M R AN +NA L + ++DL
Sbjct: 258 SQANLHRTRMIRASFKHANFSNANLTEANMRQADL 292
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 68/160 (42%), Gaps = 23/160 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 165
+ F A L +A V N A+ +++ E++ +G+ +GA L A FTG
Sbjct: 85 SDFHGAILHRANLVDTNLTLASLLDSNLMEANLAGADLSGADLSGVCLLGAVFTGSEQRG 144
Query: 166 ---------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
ADL +D + +ANLT A LVR LT+++L GA + A+ +
Sbjct: 145 SRKSTTKLKRTNLRRADLESLNLDELDFTQANLTEANLVRATLTKANLQGADLSEANLFN 204
Query: 211 AVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAY 250
A DL++ G N I R L G R AY
Sbjct: 205 A--DLSKANLKGANLRGANLIRANLERADL-SGADLRGAY 241
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ A A+L +A K N + AN +AD+ +++ G+ GA L +A +A+
Sbjct: 173 TQANLTEANLVRATLTKANLQGADLSEANLFNADLSKANLKGANLRGANLIRANLERADL 232
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+GADL ++ + EA+L N L + L R+ + A + A+FS+A
Sbjct: 233 SGADLRGAYLNEAKMFEASLDNVNLSQANLHRTRMIRASFKHANFSNA 280
>gi|37522461|ref|NP_925838.1| hypothetical protein gll2892 [Gloeobacter violaceus PCC 7421]
gi|35213462|dbj|BAC90833.1| gll2892 [Gloeobacter violaceus PCC 7421]
Length = 457
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR A N AN AD+ +D +G+ N A+L A +AN GA+L+
Sbjct: 79 ANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNR 138
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L ANL+ L R ++ +DL GA + GA+ S+A
Sbjct: 139 ANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEA 179
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 50/103 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
A G A+L +A N AN A++ +D SG+ NGA L A A+ A+L
Sbjct: 72 EGANLGGANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANL 131
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++R L EANL A L L+R+ + GA + GAD A
Sbjct: 132 GGAELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGA 174
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 51/104 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A N A+ AD+RE++ G++ N A L +A AN +G LS
Sbjct: 99 ANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSR 158
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
M L A+L A L L ++LGGA ++GAD A ++
Sbjct: 159 AFMSGADLRGADLGGANLSEADLGGANLGGANLKGADLGGANLE 202
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G ADL A N AN + AD+R ++ + + N A L A A+ GA+L+
Sbjct: 59 ADLGGADLGGADLEGANLGGANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNW 118
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ L EANL A L R L ++LGGA + G S A +
Sbjct: 119 AHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFM 161
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 5/100 (5%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A+ A+LR+A N RA + AD+R +D G+ + A L A AN G
Sbjct: 134 AELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEADLGGANLGGANLKG 193
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
ADL ++R L A+L A L RT LT L GA++EG
Sbjct: 194 ADLGGANLERTSLRGADLRGADLRRTRLTGCSLEGAVLEG 233
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 47/98 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+A RAN A++ ++ SG + A++ A A+ GA+LS+
Sbjct: 119 AHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSE 178
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ L ANL A L L R+ L GA + GAD
Sbjct: 179 ADLGGANLGGANLKGADLGGANLERTSLRGADLRGADL 216
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 66/143 (46%), Gaps = 7/143 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA G A+L + AN AD+ +D G+ GA LE A AN + ADL
Sbjct: 32 SAADLGGANLGGV-----DLGGANLGGADLDGADLGGADLGGADLEGANLGGANLSEADL 86
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN- 226
++ LN ANL A L L ++L A + AD +A + A+ +A + AN
Sbjct: 87 RGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNRANLREANL 146
Query: 227 GTNPITGVSTRKSLGCGNSRRNA 249
G ++GVS ++ G R A
Sbjct: 147 GGANLSGVSLSRAFMSGADLRGA 169
>gi|114799805|ref|YP_760951.1| pentapeptide repeat-containing protein [Hyphomonas neptunium ATCC
15444]
gi|114739979|gb|ABI78104.1| pentapeptide repeat domain protein [Hyphomonas neptunium ATCC
15444]
Length = 245
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
ADLR A F A F +A M++ +DFS ++ GA LEKA NF GA L
Sbjct: 88 ADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFEGASL-- 145
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L R L A+L+ A T+L R++L G I +GA+ S+A
Sbjct: 146 -LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183
>gi|119486130|ref|ZP_01620190.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
gi|119456621|gb|EAW37750.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
Length = 207
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 64/137 (46%), Gaps = 23/137 (16%)
Query: 96 KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF---------------TSAD 137
K A RG G+ A +ADLR A+ + + R A+F T D
Sbjct: 62 KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGALDLTGVD 121
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
+R +D G + A L++A N +GADLS + L EANL+ AVL T L R++
Sbjct: 122 LRGADLRGVSLSQAILQQADLRNTNLSGADLS-----QADLEEANLSGAVLRGTNLERAN 176
Query: 198 LGGAIIEGADFSDAVID 214
L AI+E + ++D
Sbjct: 177 LLCAIVEQTQWFGTILD 193
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 62/131 (47%), Gaps = 19/131 (14%)
Query: 116 ADLRKAVHVKENFRRANFTS-----ADMRESDFSG----------SKFNGAYLEKAVAYK 160
A+L++A ++ N R A+ T AD+R +D G + F GA+L A
Sbjct: 56 ANLQRA-KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGA 114
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 217
+ TG DL + + L++A L A L T L+ +DL A +E A+ S AV+ +L +
Sbjct: 115 LDLTGVDLRGADLRGVSLSQAILQQADLRNTNLSGADLSQADLEEANLSGAVLRGTNLER 174
Query: 218 KQALCKYANGT 228
LC T
Sbjct: 175 ANLLCAIVEQT 185
>gi|443321008|ref|ZP_21050077.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442789287|gb|ELR98951.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 333
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 61/138 (44%), Gaps = 35/138 (25%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG----------------------- 145
S A +A+L ++ + N +AN A++ ++D SG
Sbjct: 64 SEADLEAANLTRSTLIDINLSKANLNHANLTDADLSGANLSNSNLTGADLSNASLISSSM 123
Query: 146 -------SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV-----LVRTVL 193
SK A L AV KAN ADLS ++R +L EANL A+ L+R+ L
Sbjct: 124 IGSCLSKSKLKLANLTSAVLAKANLQYADLSFAGLNRAILTEANLRGAILKQATLIRSYL 183
Query: 194 TRSDLGGAIIEGADFSDA 211
R DL GA ++G + S A
Sbjct: 184 NRVDLSGANLQGCNLSLA 201
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I + A A L++A ++ R + + A+++ + S + GA L A AN GA
Sbjct: 162 ILTEANLRGAILKQATLIRSYLNRVDLSGANLQGCNLSLADLRGANLTGANLQGANLEGA 221
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+LSD + L+ ANLT A LV T L R++L GA + A+
Sbjct: 222 NLSD-----VNLSGANLTKANLVGTQLVRANLTGAKLSYANL 258
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 10/93 (10%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR A N + AN A++ + + SG+ A L +AN TGA LS
Sbjct: 201 ADLRGANLTGANLQGANLEGANLSDVNLSGANLTKANLVGTQLVRANLTGAKLS------ 254
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
ANL + L++ L++++L A + GA
Sbjct: 255 ----YANLKGSNLLKANLSQANLAAANLSGAGL 283
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 48/96 (50%), Gaps = 10/96 (10%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L +A N +RA SA + E+ G+ + A LE A N T + L D + +
Sbjct: 31 ANLNQANLNLINLKRAMLKSAQIIEAKLIGANLSEADLEAA-----NLTRSTLIDINLSK 85
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LN ANLT+A L L+ S+L GAD S+A
Sbjct: 86 ANLNHANLTDADLSGANLSNSNL-----TGADLSNA 116
>gi|428309179|ref|YP_007120156.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250791|gb|AFZ16750.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 303
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 113 FGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
F + L +AV + +F A +F AD+RE+DF+ F+ A L +A AN A
Sbjct: 136 FWRSHLMRAVLRRVDFHEAILQETSFRQADLREADFTRVYFSEASLSEANLRGANLDQAL 195
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ T R L +A+L A L R V ++DL GA +GA AV
Sbjct: 196 VKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 125 KENFRRANFTSADMRESDFSGSKF----------NGAYLEKAVAYK-----ANFTGADLS 169
+ N RAN + A++ ++ SG++ N A LE A+ ++ AN GA L
Sbjct: 53 RTNLSRANLSRANLSHANLSGARLECVSLSRANLNQADLEGAILFQSNLSQANLIGASLP 112
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+T + L +ANLT A L T+ RS L A++ DF +A++
Sbjct: 113 ETDLQVATLFQANLTGACLRGTIFWRSHLMRAVLRRVDFHEAILQ 157
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 20/116 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVA 158
S A A+L +A+ + +F R N A ++ ++D SG+ F GA L+ AV
Sbjct: 182 SEANLRGANLDQALVKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
AN TGA+ ++R V ANLT ++L GA ++ A F + I+
Sbjct: 242 RGANLTGANFEGANLERAVFRGANLTG----------TNLKGASLQWAVFKEVNIE 287
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 20/103 (19%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A+ + N +AN A + E+D L+ A ++AN TGA L
Sbjct: 82 SRANLNQADLEGAILFQSNLSQANLIGASLPETD----------LQVATLFQANLTGACL 131
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T+ R + L+R VL R D AI++ F A
Sbjct: 132 RGTIFWR----------SHLMRAVLRRVDFHEAILQETSFRQA 164
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 51/121 (42%), Gaps = 20/121 (16%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMR---------------ESDFSGSKFNGAYLEKAV 157
F ADLR+A + F A+ + A++R ++ + GAYL++ V
Sbjct: 161 FRQADLREADFTRVYFSEASLSEANLRGANLDQALVKRTSFWRTNLQQASLKGAYLKRIV 220
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+ + +GA + V ANLT A L R V ++L G ++GA AV
Sbjct: 221 FNQTDLSGASFQGAQLQGAVFRGANLTGANFEGANLERAVFRGANLTGTNLKGASLQWAV 280
Query: 213 I 213
Sbjct: 281 F 281
>gi|428218533|ref|YP_007102998.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990315|gb|AFY70570.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 348
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 15/116 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA A+L A ++ N AN A+ E++ S + N AYL KA + AN T A+L
Sbjct: 47 SAVNLRGANLSMANLIRANLSGANLIEANFDEANLSMAYLNCAYLNKAYLHGANLTWANL 106
Query: 169 SDTLMDRMVLNEANLTNAVLVRT---------------VLTRSDLGGAIIEGADFS 209
S + + +EANL+ AVL T L+ +DLGGA + GA+ S
Sbjct: 107 SQSCLIDTDASEANLSGAVLSGTDAYGSNFSGANLSEAYLSVADLGGANLHGANLS 162
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 20/114 (17%)
Query: 115 SADLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGA---------------YLE 154
+AD+R A ++ + RA+ T AD+ ++ G++ +GA +LE
Sbjct: 208 AADIRGASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLE 267
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A Y A+ ADLS ++ LNEA L A+L T L +DL GA + GA+
Sbjct: 268 AANLYGASLYSADLSQANLNAAYLNEAFLFGAILKWTNLADADLSGAHLGGANL 321
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 165
A A+L + NF RAN ++A+M +++ + SKF A L++A Y A+ G
Sbjct: 154 ANLHGANLSSVYAIATNFERANLSNANMSKANCAKSKFGSAILDRANLSMSYLYAADIRG 213
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 224
A L +T + R L + +L A L L ++L GA + A+ A + L+ +A Y
Sbjct: 214 ASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLEAANLY 272
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 57/110 (51%), Gaps = 10/110 (9%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
G+ + Q+ SA+ R ++ + A+ TS D+ ++D S GA L A +AN +G
Sbjct: 13 GVSTWNQWRSANSR----IQVDLTGADLTSVDLLDADLSAVNLRGANLSMANLIRANLSG 68
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 214
A+L + D EANL+ A L L ++ L GA + A+ S + +ID
Sbjct: 69 ANLIEANFD-----EANLSMAYLNCAYLNKAYLHGANLTWANLSQSCLID 113
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 48/105 (45%), Gaps = 20/105 (19%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------M 176
NF AN + A + +D G+ +GA L A NF A+LS+ M +
Sbjct: 135 NFSGANLSEAYLSVADLGGANLHGANLSSVYAIATNFERANLSNANMSKANCAKSKFGSA 194
Query: 177 VLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFSDA 211
+L+ ANL+ A L+ T L+R+DL + AD SDA
Sbjct: 195 ILDRANLSMSYLYAADIRGASLIETDLSRADLTKVSLICADLSDA 239
>gi|419963472|ref|ZP_14479445.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
gi|432333027|ref|ZP_19584842.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
IFP 2016]
gi|414571123|gb|EKT81843.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
gi|430780078|gb|ELB95186.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
IFP 2016]
Length = 201
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYL 153
+E R E I + F ADL ++ HV FR +FT + S+F GS+F+ L
Sbjct: 38 SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 154 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 203
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 204 EGADFSDAVID 214
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|159030580|emb|CAO88243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 354
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 66/133 (49%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
+ A+ S + LA L + + T AA+ L A + N R AN T AD+
Sbjct: 204 IYAAVSDDFLELAQLAELDPLTDFTGANLLAAELSGISLGMANLYQANLRGANLTDADLS 263
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
E + S + F GA L A+ A+ + AD + + L +NLT A LV +T+++L
Sbjct: 264 EINGSHASFKGADLSGALLANADLSYADFYRSSLALANLIGSNLTGANLVEVNITQANLS 323
Query: 200 GAIIEGADFSDAV 212
GA ++GA F+D V
Sbjct: 324 GAKVQGAKFADNV 336
>gi|153871558|ref|ZP_02000700.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
gi|152071976|gb|EDN69300.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
Length = 179
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 64/123 (52%), Gaps = 14/123 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL + + R A+ + AD+ E+D SG+ +G AN +GADL
Sbjct: 59 SGADLSGADLSNSDIRAGDLRVADLSEADLSEADLSGADLSG----------ANLSGADL 108
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+ R +LN+ANL+ A L L+ +DL GA + GA+ S +DL++ + AN T
Sbjct: 109 RWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLS--RVDLSEAN--LEGANLT 164
Query: 229 NPI 231
+ I
Sbjct: 165 DAI 167
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL + + N AN SAD+ E+D SG+ +GA L + +AN GA+L
Sbjct: 104 SGADLRWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLSRVDLSEANLEGANL 163
Query: 169 SDTLMDRMVLNE 180
+D ++ + NE
Sbjct: 164 TDAILTGAIFNE 175
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 45/85 (52%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ AN AD++ ++ G+ N AYL +A+ + N +GADLS + + +L A
Sbjct: 22 DLSEANLNGADLKNANLRGADLNHAYLFRAILTQINLSGADLSGADLSNSDIRAGDLRVA 81
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L L+ +DL GA + GA+ S A
Sbjct: 82 DLSEADLSEADLSGADLSGANLSGA 106
>gi|428320632|ref|YP_007118514.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244312|gb|AFZ10098.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 280
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 51/91 (56%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR+ + NFR N AD++ + S + F A + A AN TGA+L + + +
Sbjct: 7 LRQYAAGERNFREINLAGADLKGVNLSEANFTRANFQDANLKGANLTGANLREVKLAGVD 66
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L EANL+ A L+ T L+R++L GA + GA+
Sbjct: 67 LTEANLSEANLIGTDLSRANLSGANLMGANL 97
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 60/109 (55%), Gaps = 2/109 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR ++ + N +AN T A++ E++F+ + A L A + N A+L
Sbjct: 88 SGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEANLFAANLTDASMIRINLMKANL 147
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
S + + + L A L+ ++L R LT++ L GA++ GA+ + A DL Q
Sbjct: 148 SWSTLKAVNLTNAILSESLLERANLTQAILSGAMVSGANLTGA--DLRQ 194
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 14/131 (10%)
Query: 111 AQFGSADLRKA----VHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A A+LR+ V + E N AN D+ ++ SG+ GA L ++A + N T
Sbjct: 50 ANLTGANLREVKLAGVDLTEANLSEANLIGTDLSRANLSGANLMGANLRGSMAREVNMTK 109
Query: 166 ADLSDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
A+L++ + EANL T+A ++R L +++L + ++ + ++A++ ++
Sbjct: 110 ANLTEANLTEANFTEANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAIL----SES 165
Query: 221 LCKYANGTNPI 231
L + AN T I
Sbjct: 166 LLERANLTQAI 176
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 58/116 (50%), Gaps = 20/116 (17%)
Query: 116 ADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVAYKANFTG 165
A+L +A + + RAN + A++ RE + + + A L +A +AN
Sbjct: 70 ANLSEANLIGTDLSRANLSGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEANLFA 129
Query: 166 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L+D M R+ L +A NLTNA+L ++L R++L AI+ GA S A
Sbjct: 130 ANLTDASMIRINLMKANLSWSTLKAVNLTNAILSESLLERANLTQAILSGAMVSGA 185
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NF 163
S A ADLR+ V N AN ++A++R ++ S S A L A Y+A NF
Sbjct: 183 SGANLTGADLRQVTMVGANLTEANLSNANLRVANVSWSTLARANLSGANLYRAKLCWSNF 242
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVL 193
+GA L + ++ LN N +A L R ++
Sbjct: 243 SGAVLVEAVLIDANLNRTNFRDADLRRAIM 272
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 56/106 (52%), Gaps = 7/106 (6%)
Query: 112 QFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 165
ADL K V++ E NF RANF A+++ ++ +G+ K G L +A +AN G
Sbjct: 21 NLAGADL-KGVNLSEANFTRANFQDANLKGANLTGANLREVKLAGVDLTEANLSEANLIG 79
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
DLS + L ANL ++ +T+++L A + A+F++A
Sbjct: 80 TDLSRANLSGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEA 125
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 20/123 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYK 160
A +A+L A ++ N +AN T+A + ES + A L A+
Sbjct: 125 ANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAILSESLLERANLTQAILSGAMVSG 184
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVL-VRTV----LTRSDLGGAIIEGA-----DFSD 210
AN TGADL M L EANL+NA L V V L R++L GA + A +FS
Sbjct: 185 ANLTGADLRQVTMVGANLTEANLSNANLRVANVSWSTLARANLSGANLYRAKLCWSNFSG 244
Query: 211 AVI 213
AV+
Sbjct: 245 AVL 247
>gi|307152112|ref|YP_003887496.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982340|gb|ADN14221.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 180
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 45/87 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA V N N AD+RE++ SG+ A L A AN TGA+L +
Sbjct: 60 ANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGANLRE 119
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSD 197
+D L ANLT+A ++ T L +D
Sbjct: 120 VNLDGANLMGANLTDAQIINTDLNMAD 146
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 66/128 (51%), Gaps = 6/128 (4%)
Query: 87 NISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 146
NI L L +Y+A+ R +F + A+L A + N RA+ + AD+ E+D SG+
Sbjct: 2 NIQEL--LKRYKAKER-DF---QGSNLHQANLEGANLQRINLTRADLSGADLSEADLSGA 55
Query: 147 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
A L A KA+ GA+L + + L EANL+ A L + L ++L GA + GA
Sbjct: 56 CLMQANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGA 115
Query: 207 DFSDAVID 214
+ + +D
Sbjct: 116 NLREVNLD 123
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 62/120 (51%), Gaps = 4/120 (3%)
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
++++E +R D + S+ + GA L++ +A+ +GADLS+ + L +A
Sbjct: 1 MNIQELLKRYKAKERDFQGSNLHQANLEGANLQRINLTRADLSGADLSEADLSGACLMQA 60
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQALCKYANGTNPITGVSTRK 238
NLT+A L++ L ++L + GAD +A + DL + C G N +TG + R+
Sbjct: 61 NLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGAN-LTGANLRE 119
>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris marina MBIC11017]
gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
marina MBIC11017]
Length = 532
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 20/116 (17%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F + DLR A+ + NF RANFT A++R ++ L +A A+ ADL
Sbjct: 429 KFQNTDLRDAILINANFGRANFTGANLRNAN----------LMQAYMSHADLANADLRG- 477
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
ANL++A L L ++L GA + GA S++ + AQ L Y NG
Sbjct: 478 ---------ANLSDAYLSHANLRGANLCGADLSGAKLSESQLSFAQTNWLTVYPNG 524
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-LMDRMVLNE---------ANLTNAVLV 189
+ DFSG L K ANF +T L D +++N ANL NA L+
Sbjct: 402 QRDFSGQDLRNLNLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGANLRNANLM 461
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ ++ +DL A + GA+ SDA + A
Sbjct: 462 QAYMSHADLANADLRGANLSDAYLSHA 488
>gi|428314592|ref|YP_007151039.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256316|gb|AFZ22271.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 237
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 20/119 (16%)
Query: 113 FGSADLRKAVHVKENFRRANF---------------TSADMRESDFSGSKFNGAYLEKAV 157
F +A+LR AV V++N + NF + D+ +D S + NGA L +A
Sbjct: 105 FANANLRCAVLVEQNLCQCNFSYVKLNFANLSGINLSGVDLTSADLSDACLNGANLSQAS 164
Query: 158 AY-----KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
Y +AN + A+L T + + LN+ANLT A L L+ +DL GAI++ A S A
Sbjct: 165 LYRTLLTRANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADLRGAILDEATLSGA 223
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 42/82 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L +A + RAN + A++R ++ + N A L +A AN + ADL
Sbjct: 151 SDACLNGANLSQASLYRTLLTRANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADL 210
Query: 169 SDTLMDRMVLNEANLTNAVLVR 190
++D L+ ANLT A L +
Sbjct: 211 RGAILDEATLSGANLTGAKLTQ 232
>gi|428313439|ref|YP_007124416.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255051|gb|AFZ21010.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 167
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 119 RKAVHVKENFRRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADL 168
R+ + + NF RAN +D+R+ ++ S +GA L + Y+AN + ADL
Sbjct: 8 RRYLAGERNFHRANLNGSDLRKIPLMRADLLKANLHNSNLSGANLTRVNLYQANLSKADL 67
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
T+ + +L+ A LT A L R L ++DL A ++GA +
Sbjct: 68 RQTIFNEAILHGAELTGANLHRASLIKADLCEANLKGASLT 108
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYK----- 160
A +++L A + N +AN + AD+R++ F+ G++ GA L +A K
Sbjct: 40 ANLHNSNLSGANLTRVNLYQANLSKADLRQTIFNEAILHGAELTGANLHRASLIKADLCE 99
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN GA L+ T + L+ ANL NA L L ++DL A +EGAD S A
Sbjct: 100 ANLKGASLTHTNLGAAKLSGANLNNANLTWANLRKADLKNANLEGADLSGA 150
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 56/112 (50%), Gaps = 6/112 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL KA N AN T ++ +++ S + +A+ + A TGA+L + +
Sbjct: 35 ADLLKANLHNSNLSGANLTRVNLYQANLSKADLRQTIFNEAILHGAELTGANLHRASLIK 94
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
L EANL A LT ++LG A + GA+ ++A + A ++A K AN
Sbjct: 95 ADLCEANLKGA-----SLTHTNLGAAKLSGANLNNANLTWANLRKADLKNAN 141
>gi|425455658|ref|ZP_18835373.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
9807]
gi|389803408|emb|CCI17656.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
9807]
Length = 354
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 55/103 (53%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA+ L A + N R AN T AD+ E + S + F GA L A+ A+ + AD
Sbjct: 234 AAELSGISLGMANLYQANLRGANLTDADLSEINGSHASFKGADLSGALLANADLSYADFY 293
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+ + L +NLT A LV +T+++L GA ++GA F+D V
Sbjct: 294 RSSLALANLIGSNLTGANLVEVNITQANLSGAKVQGAKFADNV 336
>gi|218440553|ref|YP_002378882.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218173281|gb|ACK72014.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 320
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 58/111 (52%), Gaps = 15/111 (13%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A+L++A+ +F+ AN + A++ G K NGA L +A KAN +G DL+
Sbjct: 100 ANLSNANLKQAILTNVDFKSANLSGANL-----VGVKLNGANLSRADLSKANLSGIDLTG 154
Query: 171 TLMDRMVLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R+ L+ ANL A L R+ L DL GAI++G++ A
Sbjct: 155 ANLSRVDLSRANLNGADLSGANLYKADLSRSNLRNGDLQGAILQGSNLHKA 205
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 55/124 (44%), Gaps = 16/124 (12%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA--- 156
E ++G G+ F DL+ ++ AN + + ++ SG+ N A L +A
Sbjct: 5 EILWQYGQGNR-DFSRLDLQNINIIQAELMEANLSRTALDWANLSGTNLNRANLNRADLM 63
Query: 157 -------VAYKANFTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
+A+ GADLSD ++ L ANL+NA L + +LT D A +
Sbjct: 64 HAKLISAQLIEADLIGADLSDADLSWVNLEGAKLTYANLSNANLKQAILTNVDFKSANLS 123
Query: 205 GADF 208
GA+
Sbjct: 124 GANL 127
>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 231
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 70/142 (49%), Gaps = 25/142 (17%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F AD R + K NF A F AD+ E+ G+ F GA LEKA+ + +GA
Sbjct: 33 SGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGANLEKAILREVELSGA-- 90
Query: 169 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADF---SDAVIDLAQ--- 217
+L++ANLT L++ L+ ++L AI+ ADF S+ + +L Q
Sbjct: 91 --------ILSQANLTGVNLMKATLGGANLSLANLREAILYEADFRPTSEHITNLQQADL 142
Query: 218 KQALCKYANGTNPITGVSTRKS 239
+A YA + GV+ R++
Sbjct: 143 SEADLSYA----KLNGVNLRQA 160
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 60/107 (56%), Gaps = 5/107 (4%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGA 166
QF + ++A +K N A+F+ AD R S +F+ + F GA L +A+ + +FTGA
Sbjct: 16 QFKTCKFQEAELIKVNLSGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGA 75
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L ++ + L+ A L+ A L L ++ LGGA + A+ +A++
Sbjct: 76 NLEKAILREVELSGAILSQANLTGVNLMKATLGGANLSLANLREAIL 122
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 54/109 (49%), Gaps = 13/109 (11%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A A+LR+A+ + +FR N AD+ E+D S +K NG L +A A
Sbjct: 110 ANLSLANLREAILYEADFRPTSEHITNLQQADLSEADLSYAKLNGVNLRQAKLMGAKLCR 169
Query: 166 ADLSDTLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS + + L EANL NA L+ +DL GAI+ AD + A
Sbjct: 170 ADLSKGIWQNSLPTDLCEANLRNA-----DLSYADLSGAILSYADLTGA 213
>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 266
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+A ++ N + + AD+R ++ G GA L KA + + A+LS+
Sbjct: 153 ADLNDADLREAQLIRANLSEVDLSGADLRAANLKGVNLRGADLNKA-----DLSRANLSE 207
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
+ LNEANL+ A L L ++L + GA F + ++D K++
Sbjct: 208 AYLYLANLNEANLSRADLSEANLHEANLSRVDLRGAIFCETIMDDGHKES 257
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 44/84 (52%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N N + D+ ++ SG+ +GA L +A + N + +L+ ++ LN+A+L
Sbjct: 102 QSNLSGVNLSGVDLSGANLSGADLSGADLSEADLSRVNLSRVNLNGANLNDADLNDADLR 161
Query: 185 NAVLVRTVLTRSDLGGAIIEGADF 208
A L+R L+ DL GA + A+
Sbjct: 162 EAQLIRANLSEVDLSGADLRAANL 185
Score = 37.4 bits (85), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 1/91 (1%)
Query: 119 RKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
R + +KE + ++N + ++ D SG+ +GA L A +A+ + +LS ++
Sbjct: 90 RSLLSLKEFDLSQSNLSGVNLSGVDLSGANLSGADLSGADLSEADLSRVNLSRVNLNGAN 149
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
LN+A+L +A L L R++L + GAD
Sbjct: 150 LNDADLNDADLREAQLIRANLSEVDLSGADL 180
>gi|119487879|ref|ZP_01621376.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
gi|119455455|gb|EAW36593.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
Length = 514
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 70/141 (49%), Gaps = 5/141 (3%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ---FGSADLRKAVHVKE--NFRR 130
L A+++ + + S LAD N +A+ G G+ + A L + H++E N R
Sbjct: 265 LKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLRE 324
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN A++ ++ GA L++A +A GA+L D + R L EA L +A L R
Sbjct: 325 ANLKGANLTRANLREVNLQGANLQQANLQQAILQGANLKDANLIRANLREAKLQDAKLQR 384
Query: 191 TVLTRSDLGGAIIEGADFSDA 211
L R++L A + A+ S+A
Sbjct: 385 VNLERANLQAANLTDANLSNA 405
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L++A+ N + AN A++RE+ +K LE+A AN T A+LS+
Sbjct: 345 ANLQQANLQQAILQGANLKDANLIRANLREAKLQDAKLQRVNLERANLQAANLTDANLSN 404
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
ANLT+A L T L ++ A++ DF
Sbjct: 405 ----------ANLTDASLCDTCLNQTQFYQAVLIRVDF 432
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 21/126 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYK--- 160
S A ADL+ A + NF+ AN A++ + +DF G+ ++L++A + +
Sbjct: 185 SGANLQGADLQGANLHETNFQGANLAGANLGGANLKCTDFQGTNLQESHLKQAYSVRKAK 244
Query: 161 -------------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 207
N GA+L ++ + L+E+NL +A L + L ++L GA ++G +
Sbjct: 245 FAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTN 304
Query: 208 FSDAVI 213
S A +
Sbjct: 305 LSQAYL 310
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 15/113 (13%)
Query: 113 FGSADLRKAVHVKENF--RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
F +L+++ H+K+ + R+A F A++ DF G GA L++A+ + N + ++L+D
Sbjct: 224 FQGTNLQES-HLKQAYSVRKAKFAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLAD 282
Query: 171 TLMDR----------MVLNEANLTNAVLVRTVLTRS--DLGGAIIEGADFSDA 211
+++ L NL+ A LVRT R +L A ++GA+ + A
Sbjct: 283 ANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLREANLKGANLTRA 335
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 69/162 (42%), Gaps = 20/162 (12%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 127
W+V STA + V + + + AL DLN G I S A +L A V+ N
Sbjct: 113 WQVVDSTATSG--VFASRARLKALQDLNNEGVSLDG-LDI-SQAYLKEINLSGANLVEAN 168
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD----------TLMDRMV 177
AN A + ++ SG+ GA L+ A ++ NF GA+L+ T
Sbjct: 169 LEGANLQGASLSHANLSGANLQGADLQGANLHETNFQGANLAGANLGGANLKCTDFQGTN 228
Query: 178 LNEANLTNAVLVRTV------LTRSDLGGAIIEGADFSDAVI 213
L E++L A VR L+ D G + GA+ A++
Sbjct: 229 LQESHLKQAYSVRKAKFAQANLSGVDFQGVNLRGANLKQAIL 270
>gi|443324425|ref|ZP_21053179.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442795970|gb|ELS05303.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 305
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 68/134 (50%), Gaps = 5/134 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 165
A +A+L+ AV + AN ++AD+ ++ D S + GA L A ANF+
Sbjct: 71 ADLATANLQAAVLIGICLIEANLSNADLSDAYLMDGDLSNANLIGADLRDANCDHANFSN 130
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
A+L TLM ++ L ANLT A L RT L+ ++L A + AD S+A + A+ + Y
Sbjct: 131 ANLIGTLMRKVRLRHANLTGAKLQRTNLSEAELIEAHLSEADLSNANLYEAELLNIFGYK 190
Query: 226 NGTNPITGVSTRKS 239
+ ++T S
Sbjct: 191 TNFCRVQAIATHMS 204
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 18/121 (14%)
Query: 91 LADLNKYEAETRGEFGIGS------------------AAQFGSADLRKAVHVKENFRRAN 132
L++ N YEAE FG + A F A+L K N RAN
Sbjct: 173 LSNANLYEAELLNIFGYKTNFCRVQAIATHMSRAYLFQANFSEAELIKIDLRWANCDRAN 232
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
F +A+++++D G+ N A L++A +AN GA+L+ + L +AN+ +A+ +
Sbjct: 233 FRNANLQQADLRGTNLNQADLKQANLTRANLRGANLNHADLRGANLTDANIQDAIFKSAI 292
Query: 193 L 193
L
Sbjct: 293 L 293
>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 293
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 64/127 (50%), Gaps = 23/127 (18%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTG 165
ADL +A + N +AN + A++ + S NGA L +A+ +AN
Sbjct: 84 ADLVEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAILSEAIMREANLNQ 143
Query: 166 ADLSDTLMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---V 212
A L D + R L+ +ANLTNA+L+ T L +++L A++ GA+F+ A
Sbjct: 144 AKLIDASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTE 203
Query: 213 IDLAQKQ 219
+DL+Q +
Sbjct: 204 VDLSQAR 210
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 20/124 (16%)
Query: 110 AAQFGSADLRKAVHVKENFRRAN----------FTSADMRESDFSGSKFNGAYLEKAVAY 159
+A A+L A+ ++ N ++AN FT AD+ E D S ++ NG L +A+
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222
Query: 160 KA----------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A GA+LS + R L +NLT A+L+ TVL +++ + GA +
Sbjct: 223 GAKLRGVSICWTTLRGANLSKANLYRAKLCWSNLTEAILLETVLLDANMDQVNLRGATLT 282
Query: 210 DAVI 213
A++
Sbjct: 283 GAIL 286
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +L A + N +AN T+A + E++ + N KA+ + ANFT ADL++
Sbjct: 149 ASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLN-----KALLHGANFTQADLTE 203
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + LN NLT A+LV L + + GA+ S A
Sbjct: 204 VDLSQARLNGVNLTRAILVGAKLRGVSICWTTLRGANLSKA 244
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 60/124 (48%), Gaps = 19/124 (15%)
Query: 118 LRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYKANFT--- 164
LR+ + +FRR N T A++R SD S S GA L+ +AN T
Sbjct: 21 LRRYAVGERDFRRVNLRNASLIGADLTHANLRGSDLSQSNLTGASLKLVNFREANLTQIT 80
Query: 165 --GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
GADL + + L +ANL+ A L+ L S L GA + A+ S+A++ +A+
Sbjct: 81 LRGADLVEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAIL----SEAIM 136
Query: 223 KYAN 226
+ AN
Sbjct: 137 REAN 140
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 81/177 (45%), Gaps = 9/177 (5%)
Query: 44 TESDGQFPDCSNNQCAGPYAKLKNWR---VFVSTALAAAVVAS--CSSNISA--LADLNK 96
T ++ + D S + G KL N+R + T A +V + SSN++ L++ N
Sbjct: 47 THANLRGSDLSQSNLTGASLKLVNFREANLTQITLRGADLVEANLISSNLTQANLSEANL 106
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
A R G A A+L +A+ + R AN A + ++ S + + A L A
Sbjct: 107 INASLRASTLNG--ANLSRANLSEAILSEAIMREANLNQAKLIDASLSRTNLSYATLISA 164
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
KAN T A L +T + + LN+A L A + LT DL A + G + + A++
Sbjct: 165 NLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAIL 221
>gi|291570913|dbj|BAI93185.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 484
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 179
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 29 RVNLSQANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 88
Query: 180 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ANL +A L+R L R++L A++ GA+ ++A DL ++A ++A+
Sbjct: 89 RADLSQANLVDASLIRAELMRAELSEAVVNGANLTEA--DL--REATLRHAD 136
Score = 43.9 bits (102), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 47/93 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGASL 237
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +AV
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAVV 117
Query: 159 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 208
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177
Query: 209 SDAVIDLAQ 217
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L +AV N A+ A +R +D + +GA L +A +N ++L+
Sbjct: 105 AELMRAELSEAVVNGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTR 164
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ R L NL NA L + L +DL GA + GA+
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANL 202
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 202 IIEGADFSDAVIDLA 216
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
>gi|114569789|ref|YP_756469.1| pentapeptide repeat-containing protein [Maricaulis maris MCS10]
gi|114340251|gb|ABI65531.1| pentapeptide repeat protein [Maricaulis maris MCS10]
Length = 493
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 77/166 (46%), Gaps = 30/166 (18%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F S +L A V+ N +RA A++ +D SG GA L A AN GADL
Sbjct: 339 TGANFTSVELSNARIVESNMQRAILAGANLSYADLSGIDLAGADLTGADLSGANLIGADL 398
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRS---------------DLGGAIIEGADFSDAVI 213
+ + R ANLT A+L T LTR+ L GA ++ AD +DA +
Sbjct: 399 TGANLTR-----ANLTGAILFGTDLTRAILANARLNSAQLVGAQLSGARLDSADLTDANL 453
Query: 214 DLAQKQALCKYANGTNPITGVST--RKSLGCGNSRRNAYGSP-SSP 256
AQ A + P++G T R + G+ R ++ G+ SSP
Sbjct: 454 FGAQNAA-------SIPVSGTMTFCRTRMADGSDRSSSCGAAVSSP 492
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 5/94 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANF 163
S A A+L ++ N RA+ ++ D+ E+D F G+ F GA L AN
Sbjct: 65 SGANMSGANLSRSRFPDANLDRADLSNTDLTEADLSTGRFVGANFRGALLRNTSLTGANL 124
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
TGADL+ +N+A L N L TV+ D
Sbjct: 125 TGADLTGARELGYEINQARLCNTRLSATVVLNRD 158
Score = 37.4 bits (85), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 15/113 (13%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN------------- 162
A L +A+ +FR N T + +G+ F L A ++N
Sbjct: 311 ASLSQAIFPGNDFRTINLTGVQIYGMVLTGANFTSVELSNARIVESNMQRAILAGANLSY 370
Query: 163 --FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+G DL+ + L+ ANL A L LTR++L GAI+ G D + A++
Sbjct: 371 ADLSGIDLAGADLTGADLSGANLIGADLTGANLTRANLTGAILFGTDLTRAIL 423
>gi|443314210|ref|ZP_21043788.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786182|gb|ELR95944.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 516
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 1/124 (0%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
E + S A A+LR A + + AN A++R +D SG+ A L A A
Sbjct: 174 EDTVLSGAVLQRAELRHATLMGADLSGANLRGANLRWADLSGANLQEADLTDAKLSGATL 233
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 222
GADLS + +L +L+ L R SDL GA + GA + AV DL + C
Sbjct: 234 VGADLSGATLVNTILVHTDLSRTRLQRVYCVDSDLSGATLNGAFLAGAVCYDLVTAETTC 293
Query: 223 KYAN 226
+ +
Sbjct: 294 DWVD 297
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 5/114 (4%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
E R + S A +DLRK+ NF AN A + + + +GA L++A
Sbjct: 135 EARLRWARLSGANLSQSDLRKS-----NFLGANLEGAQLYAAQMEDTVLSGAVLQRAELR 189
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A GADLS + L A+L+ A L LT + L GA + GAD S A +
Sbjct: 190 HATLMGADLSGANLRGANLRWADLSGANLQEADLTDAKLSGATLVGADLSGATL 243
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L +A +AN + AD+RE+ ++ +GA L ++ K+NF GA+L
Sbjct: 104 SEASLIRAELLRADLSNATLNQANLSEADLREARLRWARLSGANLSQSDLRKSNFLGANL 163
Query: 169 ----------SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
DT++ VL A L +A L+ L+ ++L GA + AD S A
Sbjct: 164 EGAQLYAAQMEDTVLSGAVLQRAELRHATLMGADLSGANLRGANLRWADLSGA 216
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ LR+A N AN + +D++++ + S+F+GA L +A +A A+L
Sbjct: 34 AGAQIPHIVLRQANLNIVNLSTANLSFSDLQQASLNVSRFSGANLSQACLRQAQLNVANL 93
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
R VL A+L+ A L+R L R+DL A + A+ S+A
Sbjct: 94 I-----RAVLVGADLSEASLIRAELLRADLSNATLNQANLSEA 131
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 49/105 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A LR+A N RA AD+ E+ ++ A L A +AN + ADL
Sbjct: 74 SGANLSQACLRQAQLNVANLIRAVLVGADLSEASLIRAELLRADLSNATLNQANLSEADL 133
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + L+ ANL+ + L ++ ++L GA + A D V+
Sbjct: 134 REARLRWARLSGANLSQSDLRKSNFLGANLEGAQLYAAQMEDTVL 178
>gi|406937704|gb|EKD71085.1| hypothetical protein ACD_46C00278G0012 [uncultured bacterium]
Length = 585
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 74/145 (51%), Gaps = 14/145 (9%)
Query: 79 AVVASCSSNISALADLN-KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSAD 137
AV+ + + + + D N +Y T+ F S + +++L KA+ NF AN + A
Sbjct: 327 AVLICTNMSDTTITDTNLQYANLTKTNF---SKSNLSNSNLSKAIFQGTNFSEANLSHAI 383
Query: 138 MRESDFSGSKFNGAYLEKA----VAYKA-NFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
M+ESD S F+ L A +K+ NF+GADL ++ L+ A+L+NA L+
Sbjct: 384 MKESDCSNIDFSNLCLYHANLANTKFKSTNFSGADLQKAILTDCDLSNADLSNANLIHAN 443
Query: 193 LTR-----SDLGGAIIEGADFSDAV 212
LTR +DL +E A +DA+
Sbjct: 444 LTRAYLGETDLSTTNLEHATLTDAM 468
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 10/100 (10%)
Query: 120 KAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+A V+ NF +NF DM+ + + F+ A L NF ADLS+T
Sbjct: 206 EACFVEANFTNSNFVKTRFFLCDMQRINAMNTDFSSAIL-----MGTNFANADLSNTNFT 260
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L++A L ++L T+L +L GA ++G + + +D
Sbjct: 261 NANLSQAKLDRSILTNTILKNVNLSGASLQGVSYPNKKLD 300
>gi|425473009|ref|ZP_18851753.1| Genome sequencing data, contig C314 [Microcystis aeruginosa PCC
9701]
gi|389880711|emb|CCI38594.1| Genome sequencing data, contig C314 [Microcystis aeruginosa PCC
9701]
Length = 453
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 59/117 (50%), Gaps = 5/117 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 165
A A+L +A + N RAN A++ + +G+ GA L +A +AN G
Sbjct: 286 ANLNGANLNRANLNRANLNRANLNGAELYRAYLNGANLKGANLNEANLIGANLNEANLIG 345
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
A+L+ ++ R LN ANL A L +L ++L GAI+ GA A +D Q ++ C
Sbjct: 346 ANLNGAILYRANLNGANLNGAYLNGAILYGANLYGAILYGAILWGAEVDPKQIKSAC 402
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 55/102 (53%), Gaps = 5/102 (4%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NFTGADLSDT 171
+LR+A + N RAN A++ ++ + + N A L A Y+A N GA+L++
Sbjct: 272 NLREANLILANLNRANLNGANLNRANLNRANLNRANLNGAELYRAYLNGANLKGANLNEA 331
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ LNEANL A L +L R++L GA + GA + A++
Sbjct: 332 NLIGANLNEANLIGANLNGAILYRANLNGANLNGAYLNGAIL 373
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN A++ E++ G+ NGA L Y+AN GA+L+ ++ +L ANL A
Sbjct: 327 NLNEANLIGANLNEANLIGANLNGAIL-----YRANLNGANLNGAYLNGAILYGANLYGA 381
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
+L +L +++ I+ A F + I
Sbjct: 382 ILYGAILWGAEVDPKQIKSACFWERAI 408
>gi|359151325|ref|ZP_09184042.1| pentapeptide repeat-containing protein [Streptomyces sp. S4]
Length = 240
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 44/86 (51%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
N RAN SAD+ + +G+ GA L + AN TGADL + L NLT
Sbjct: 68 HNLSRANLISADLARVNLTGANLTGADLARVNLTGANLTGADLIYANLAGADLTRVNLTR 127
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDA 211
A + T LT +DL GA + G D ++A
Sbjct: 128 ARMKLTNLTGADLTGADLAGGDLTNA 153
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 53/107 (49%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L A + N AN T AD+ ++ +G+ L +A N TGADL+ +
Sbjct: 88 ANLTGADLARVNLTGANLTGADLIYANLAGADLTRVNLTRARMKLTNLTGADLTGADLAG 147
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
L A+LTNA L LT DL GAI+ GA+ A + A++ L
Sbjct: 148 GDLTNADLTNADLTGAHLTNVDLTGAILTGANLGGANLAAARQLRLV 194
>gi|428223745|ref|YP_007107842.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427983646|gb|AFY64790.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 183
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 10/109 (9%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYKAN 162
F DLR+A N + ++D+R +++ G+K GA + A Y+AN
Sbjct: 20 FDEIDLREANLFNANLEAVSLQNSDLRSTYLPYTNLNKANLQGAKLQGAEMSDAQLYQAN 79
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GADL + + R L A+L A L L +DL GA ++GA+ DA
Sbjct: 80 LAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYGANLQGANLQDA 128
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 46/107 (42%), Gaps = 13/107 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 160
A A L+ A +AN AD+R S+ S + GA L+ A Y
Sbjct: 58 ANLQGAKLQGAEMSDAQLYQANLAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYG 117
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS---DLGGAIIE 204
AN GA+L D + R L++A L +L L R+ D GA ++
Sbjct: 118 ANLQGANLQDADLQRADLDQATLKATILANANLFRAQNIDWTGAAVD 164
>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 184
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+F N + + G++ NGA L A+ G DL+ +++ LN+ANL
Sbjct: 14 HRDFSHVNLVQVCLTNAKLVGARLNGAEL-----VGADLQGVDLTAAHLNQARLNQANLA 68
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDA 211
A +++ LTR+DL GA + GAD +DA
Sbjct: 69 GAEMIQACLTRADLSGAYLAGADLTDA 95
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ T AD+ +D SG+ GA L KA KA+ +GADL + L E +L++A L
Sbjct: 90 ADLTDADLSGADLSGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDG 149
Query: 191 TVLTRSDLGGAIIEGADFS 209
L +DL GA +E F+
Sbjct: 150 AYLGHADLTGADVERTRFN 168
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 43/90 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A + AN AD+R++D S + +GA L A AN DL
Sbjct: 83 SGAYLAGADLTDADLSGADLSGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDL 142
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
SD +D L A+LT A + RT +S L
Sbjct: 143 SDADLDGAYLGHADLTGADVERTRFNQSQL 172
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 15/106 (14%)
Query: 111 AQFGSADLR----KAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A+ ADL+ A H+ + +AN A+M ++ + + +GAYL A A+ +G
Sbjct: 40 AELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQACLTRADLSGAYLAGADLTDADLSG 99
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS ANL A L + L+++DL GA + GAD S A
Sbjct: 100 ADLS----------GANLGGADLRKADLSKADLSGADLRGADLSGA 135
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A A++ +D G A+L +A +AN GA++ + R L+ A L A L
Sbjct: 35 ARLNGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQACLTRADLSGAYLAGADLTD 94
Query: 191 TVLTRSDLGGAIIEGADFSDA 211
L+ +DL GA + GAD A
Sbjct: 95 ADLSGADLSGANLGGADLRKA 115
>gi|428300657|ref|YP_007138963.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428237201|gb|AFZ02991.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 516
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA +ADLR+A + N R+AN + +++ S +G+ A L K + + +GA+L
Sbjct: 119 AANLKNADLREATLRQANLRQANLSEVNLKGSLLTGANLEQANLSKTDLSRTDLSGANLR 178
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
DT + + L+ ANL+ A L L ++L GA + AD + A
Sbjct: 179 DTELKQSNLSRANLSGANLAGANLRWANLTGANLRWADLTGA 220
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N R AN T A++R +D +G+K +GA + TGA+LS+ + L ANL A
Sbjct: 201 NLRWANLTGANLRWADLTGAKLSGA----------DLTGANLSNANLSNCTLVHANLHQA 250
Query: 187 VLVRTVLTRSDLGGAIIEGADF 208
L++T +DL GA + GA
Sbjct: 251 RLIKTEWVGADLSGASLTGAKL 272
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +L+ ++ N +AN + D+ +D SG+ L+++ +AN +GA+L+
Sbjct: 140 ANLSEVNLKGSLLTGANLEQANLSKTDLSRTDLSGANLRDTELKQSNLSRANLSGANLAG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANLT A L LT + L GA + GA+ S+A
Sbjct: 200 A-----NLRWANLTGANLRWADLTGAKLSGADLTGANLSNA 235
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 10/93 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ A L ++ ++ N RA +A+++ +D L +A +AN A+L
Sbjct: 93 SHAELSKASLVRSELIRANLSRATLIAANLKNAD----------LREATLRQANLRQANL 142
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
S+ + +L ANL A L +T L+R+DL GA
Sbjct: 143 SEVNLKGSLLTGANLEQANLSKTDLSRTDLSGA 175
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 50/101 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L KA V+ RAN + A + ++ + A L +A +AN + +L
Sbjct: 90 ADLSHAELSKASLVRSELIRANLSRATLIAANLKNADLREATLRQANLRQANLSEVNLKG 149
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+L+ L +ANL+ L RT L+ ++L ++ ++ S A
Sbjct: 150 SLLTGANLEQANLSKTDLSRTDLSGANLRDTELKQSNLSRA 190
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 49/101 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S +A+L+ A N RA+ + A++ ++ S+ A L +A AN ADL
Sbjct: 68 SGVHLTNANLKGASLNVTNLVRADLSHAELSKASLVRSELIRANLSRATLIAANLKNADL 127
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ + + L +ANL+ L ++LT ++L A + D S
Sbjct: 128 REATLRQANLRQANLSEVNLKGSLLTGANLEQANLSKTDLS 168
>gi|332711043|ref|ZP_08430978.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332350169|gb|EGJ29774.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 343
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 14/164 (8%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR 129
+ LA A++ S N + L N A+ T+ + A +A L KA+ ++ N
Sbjct: 170 LIDIDLANAILHQASLNDAELTGANLTGADLTKANL---ARANLNTAKLSKALLIRANLS 226
Query: 130 RANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N + +AD+R +D SG+ F GA L A AN TG+D LN ANL
Sbjct: 227 KTNLSITELRNADLRNADLSGANFMGADLTGADLTSANLTGSDFR-----YAKLNGANLK 281
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
+A L LT ++L G + GAD + A ++ K+ N T
Sbjct: 282 HADLSGADLTDANLNGMDLTGADLTSANLEGISWNRQTKWKNAT 325
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 120 KAVHVKENF----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
KA+ V N+ R + +AD+ + D + + + A L A AN TGADL+ + R
Sbjct: 148 KALEVLNNYGVSMRGLDAPNADLIDIDLANAILHQASLNDAELTGANLTGADLTKANLAR 207
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LN A L+ A+L+R L++++L + AD +A
Sbjct: 208 ANLNTAKLSKALLIRANLSKTNLSITELRNADLRNA 243
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 17/135 (12%)
Query: 77 AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA 136
A A VA+ + + AL LN Y RG D A + + A A
Sbjct: 136 AGAGVATSHARVKALEVLNNYGVSMRG------------LDAPNADLIDIDLANAILHQA 183
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
+ +++ +G+ GA L KA N A+L+ + + +L ANL+ L T L +
Sbjct: 184 SLNDAELTGANLTGADLTKA-----NLARANLNTAKLSKALLIRANLSKTNLSITELRNA 238
Query: 197 DLGGAIIEGADFSDA 211
DL A + GA+F A
Sbjct: 239 DLRNADLSGANFMGA 253
>gi|218441428|ref|YP_002379757.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218174156|gb|ACK72889.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 362
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S Q G A+L H+ N R A T AD+ E+D + K +GA L A AN + +DL
Sbjct: 245 SGVQLGGANL---YHI--NLRGAVLTDADLGEADLNHGKLSGADLSGAYLGNANLSYSDL 299
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ L A+L A L L++++L GAI+EG F+D
Sbjct: 300 HKASLALTNLIGADLRGANLTEVNLSQANLSGAIVEGTRFAD 341
>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 319
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 61/133 (45%), Gaps = 10/133 (7%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
Q ADLR +FR + + A++RE DF+G+ AYL +A NFT A+L
Sbjct: 25 QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFTDANLEGA 84
Query: 172 LMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
+ ++ L +AN LT A L +T + GA + GA S A ++ A
Sbjct: 85 SLIKIYLIKANCYQTNFSGAYLTGAYLTKTNFKEAKFHGAYLNGAKLSGAKLEDAYYDHQ 144
Query: 222 CKYANGTNPITGV 234
++ +P T +
Sbjct: 145 TRFDTSFDPKTAL 157
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 17/105 (16%)
Query: 119 RKAVHVKE-------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
++A+ +KE NF+ AD+R + S + F G L A + +FTGADL D
Sbjct: 5 QEAIDLKERYEKGQRNFQEFQLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDA 64
Query: 172 LMDRMVL-----NEANLTNAVLVRTVLTR-----SDLGGAIIEGA 206
++ L +ANL A L++ L + ++ GA + GA
Sbjct: 65 YLNEADLTAVNFTDANLEGASLIKIYLIKANCYQTNFSGAYLTGA 109
>gi|159045175|ref|YP_001533969.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
gi|157912935|gb|ABV94368.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
Length = 245
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 59/137 (43%), Gaps = 25/137 (18%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 163
+ A+ ADLR A F AN AD+R++ FSG++ G ++ A F
Sbjct: 85 AGAELAGADLRDAYLTYAVFDGANLEGADLRDAFMPFAQFSGARMRGILFDRTNARDTVF 144
Query: 164 TGADLSDTLMDRMVLNEANLT--------------------NAVLVRTVLTRSDLGGAII 203
GADL M + L A LT NA LV VL +DL GA +
Sbjct: 145 AGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNARLVGAVLREADLTGARL 204
Query: 204 EGADFSDAVIDLAQKQA 220
GAD S+A + A QA
Sbjct: 205 TGADLSEADLTGAVTQA 221
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 10/91 (10%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS----------KFNGAYLEKAVAYKAN 162
F ADLR A V RA T AD+ +D SG+ + GA L +A A
Sbjct: 144 FAGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNARLVGAVLREADLTGAR 203
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVL 193
TGADLS+ + V A + AV RTV+
Sbjct: 204 LTGADLSEADLTGAVTQAAGFSGAVFCRTVM 234
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 33/68 (48%), Gaps = 5/68 (7%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 165
A G ADL A NF A A +RE+D +G++ GA L + AV A F+G
Sbjct: 167 ADLGGADLSGAFLEGANFGNARLVGAVLREADLTGARLTGADLSEADLTGAVTQAAGFSG 226
Query: 166 ADLSDTLM 173
A T+M
Sbjct: 227 AVFCRTVM 234
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 44/105 (41%), Gaps = 30/105 (28%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---------------DRM----- 176
D+ ++ +G+ AYL AV AN GADL D M DR
Sbjct: 83 DLAGAELAGADLRDAYLTYAVFDGANLEGADLRDAFMPFAQFSGARMRGILFDRTNARDT 142
Query: 177 -----VLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDA 211
L A++ L R LT +DLG GA +EGA+F +A
Sbjct: 143 VFAGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNA 187
>gi|424513094|emb|CCO66678.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 140
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 66/141 (46%), Gaps = 31/141 (21%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+ DL + K + +RANF R ++ SG GA LE+A +FTGA+L +
Sbjct: 19 YHDQDLTQTYFTKGSLKRANF-----RGANLSGISLFGANLEEA-----DFTGANLEN-- 66
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----------DFSDAVIDLAQKQAL 221
ANL L++T T ++L AI+ GA D+S +I +
Sbjct: 67 --------ANLGQCNLLKTNFTGANLTNAIVSGASNLETVKANDSDWSQVIIRKDVLMGI 118
Query: 222 CKYANGTNPITGVSTRKSLGC 242
C A+G +P++G T+ +L C
Sbjct: 119 CANADGVSPVSGDPTKMTLEC 139
>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
Length = 273
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 77/160 (48%), Gaps = 13/160 (8%)
Query: 79 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 138
A++ SCS + A+ ++ T + S + SAD +A + N R+A+ A
Sbjct: 115 ALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAV- 172
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S L
Sbjct: 173 ----FALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQL 228
Query: 199 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 238
GGA GA+ A DL+Q + + T + G T++
Sbjct: 229 GGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 261
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 10/96 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F L++A+ F A FT RE+ F+ F+ A L + + + G D
Sbjct: 33 SRAHFKDTQLQEALFDHCTFAEATFTELLFRETWFTQCGFHRATLNACIFMELSLPGLDF 92
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
SD A LT +++ L R+ GA+++
Sbjct: 93 SD----------AKLTKTTFLKSTLERATFNGALLD 118
>gi|443327376|ref|ZP_21056002.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442792998|gb|ELS02459.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 187
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 55/102 (53%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L KAV + NF+ +D+ E+D + F+ + KA +K+ A+L+ +
Sbjct: 47 FTGANLGKAVFYRTVVELGNFSQSDLGEADLREANFSQSLFYKASLFKSQLQKANLNQVI 106
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
R +ANL +AVL L +++L A + GAD S+A ++
Sbjct: 107 AIRAFFRDANLNHAVLTSANLQQANLTNADLRGADLSNANLE 148
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 5/75 (6%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I A F A+L AV N ++AN T+AD+R +D S A LE A AN GA
Sbjct: 106 IAIRAFFRDANLNHAVLTSANLQQANLTNADLRGADLS-----NANLESAFLVGANLLGA 160
Query: 167 DLSDTLMDRMVLNEA 181
L D ++R +L +A
Sbjct: 161 SLVDANLERAILTDA 175
Score = 40.4 bits (93), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
E G S + G ADLR+A NF ++ F A + +S + N + +A +A F
Sbjct: 63 ELGNFSQSDLGEADLREA-----NFSQSLFYKASLFKSQLQKANLN-----QVIAIRAFF 112
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+L+ ++ L +ANLTNA L L+ ++L A + GA+
Sbjct: 113 RDANLNHAVLTSANLQQANLTNADLRGADLSNANLESAFLVGANL 157
>gi|428222289|ref|YP_007106459.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
gi|427995629|gb|AFY74324.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
Length = 563
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 5/97 (5%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMD 174
+ V V+ + NF + D+ ++ +G+ +G + ++ + +F +DL+ +M
Sbjct: 396 RKVIVEYGHGKRNFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMT 455
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ LN ANL A + R +LT++DLGGA + AD +A
Sbjct: 456 QVKLNGANLAQAKMQRAILTKADLGGACLNQADLREA 492
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 163
S A +L V + +F +D+ + F+G+ K NGA L +A +A
Sbjct: 415 SKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMTQVKLNGANLAQAKMQRAIL 474
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T ADL +++ L EANL +A + + L+ +DL GA ++GA S A
Sbjct: 475 TKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYLSQA 522
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 11/123 (8%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
E+G G F + DL KA N + + + E+DF S A A+ +
Sbjct: 401 EYGHGKR-NFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMTQVKL 459
Query: 164 TGADLSDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GA+L+ M R +L N+A+L A L ++++DL GA + GA+ A +
Sbjct: 460 NGANLAQAKMQRAILTKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYL 519
Query: 214 DLA 216
A
Sbjct: 520 SQA 522
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ + + I + A G A L +A + N + A + AD+ +D +G+ GAYL +A
Sbjct: 465 AQAKMQRAILTKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYLSQANL 524
Query: 159 YKANFTGADL 168
N +GADL
Sbjct: 525 RGTNLSGADL 534
>gi|452966664|gb|EME71673.1| putative low-complexity protein [Magnetospirillum sp. SO-1]
Length = 241
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 48/101 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L+ AV + A F ADM +D S + GA L A F GA L D
Sbjct: 70 ANLSGASLKGAVFAGADLFHAIFDEADMTGADLSDTYLFGANLIATRLVGAEFKGAFLKD 129
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LM+R L++A + ++R V + L GA + GAD + A
Sbjct: 130 VLMERADLSQAKMAGVYMLRGVFEEAKLAGADLSGADMTGA 170
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 43/104 (41%), Gaps = 10/104 (9%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
V F+ A M +D S +K G Y+ + V +A GADLS M A+
Sbjct: 118 VGAEFKGAFLKDVLMERADLSQAKMAGVYMLRGVFEEAKLAGADLSGADMTGAAAEGADF 177
Query: 184 TNAVLVRTVLT----------RSDLGGAIIEGADFSDAVIDLAQ 217
T A L T L+ R+DL GA AD V D A+
Sbjct: 178 TGANLKGTRLSGASMRFARFVRADLDGADFAKADLLHTVFDGAR 221
>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 346
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 73/151 (48%), Gaps = 2/151 (1%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
NW L+ A +A+ + + L+ N A+ + IG+ S DLR+A
Sbjct: 95 NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ +A+ T A++R++D +G+K + L A AN TGA+L + + LN ANLT A
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTGANLKQANLSQAHLNWANLTKA 212
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L L ++L A + D ++ + AQ
Sbjct: 213 DLREANLCGANLSKANLSQTDLTEVCLKDAQ 243
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F R + AD+ E++ SG GA L KA AN + A+LS + L ANLT A
Sbjct: 29 FNRLSLAKADLSEANLSGVYLGGASLTKANLSGANLSRANLSGASLSGANLTGANLTGAN 88
Query: 188 LVRTVLTRSDLGGAIIEGADFSDAVI 213
L L +DL GA + GA+ ++A +
Sbjct: 89 LAGAHLNWADLSGANLSGANLANADV 114
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 15/118 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES----------DFSGSKFNGAYLEKAVAYK 160
A ADLR+A N +AN + D+ E +FSG+ G L +
Sbjct: 207 ANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSGINFSGANLTGVDLSNKLLTG 266
Query: 161 ANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AN +GA+ LS + + L EANL+ A L+ + L +DL A + GA+ S A +
Sbjct: 267 ANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMDADLTKANLSGANLSQANV 324
Score = 43.5 bits (101), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 10/118 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L A N AN A + +D SG+ +GA L A AN +GA+L
Sbjct: 65 SRANLSGASLSGANLTGANLTGANLAGAHLNWADLSGANLSGANLANADVSGANLSGANL 124
Query: 169 SDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S +++ L EANL+ A L + LT+++L A + GA + ++LA
Sbjct: 125 SGAKLNQTYLIGTNLKSVDLREANLSLASLNKADLTKANLRQADLTGAKLKQSNLNLA 182
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 52/108 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S G A L KA N RAN + A + ++ +G+ GA L A A+ +GA+L
Sbjct: 45 SGVYLGGASLTKANLSGANLSRANLSGASLSGANLTGANLTGANLAGAHLNWADLSGANL 104
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S + ++ ANL+ A L L ++ L G ++ D +A + LA
Sbjct: 105 SGANLANADVSGANLSGANLSGAKLNQTYLIGTNLKSVDLREANLSLA 152
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 60/121 (49%), Gaps = 20/121 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK----------AVAYK 160
A A+L++A + + AN T AD+RE++ G+ + A L + A
Sbjct: 187 ANLTGANLKQANLSQAHLNWANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSG 246
Query: 161 ANFTGADLSDT-LMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
NF+GA+L+ L ++++ L+ ANL+ A L++T L ++L A + G+ D
Sbjct: 247 INFSGANLTGVDLSNKLLTGANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMD 306
Query: 211 A 211
A
Sbjct: 307 A 307
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 63/139 (45%), Gaps = 15/139 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A N AN +AD+ ++ SG+ +GA L + N DL +
Sbjct: 92 AHLNWADLSGA-----NLSGANLANADVSGANLSGANLSGAKLNQTYLIGTNLKSVDLRE 146
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ---KQALCKY 224
+ LN+A+LT A L + LT + L + + AD + A + +L Q QA +
Sbjct: 147 ANLSLASLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTGANLKQANLSQAHLNW 206
Query: 225 ANGTNPITGVSTRKSLGCG 243
AN +T R++ CG
Sbjct: 207 AN----LTKADLREANLCG 221
>gi|427414830|ref|ZP_18905017.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425755483|gb|EKU96348.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 1182
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L KA + N + N + A++ ++DFSG+ +GA L KA+ + +L
Sbjct: 642 SEANLSEANLSKANLRETNLHKTNLSKANLSKTDFSGANLSGANLSGTNLRKADLSKLNL 701
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + LN ANL+ A L RT L++++LG A + A+ A
Sbjct: 702 KEINLTGANLNGANLSEADLSRTNLSKANLGKANLGAANLEGA 744
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F A+L A N R+A+ + +++E + +G+ NGA L +A + N + A+L
Sbjct: 672 SKTDFSGANLSGANLSGTNLRKADLSKLNLKEINLTGANLNGANLSEADLSRTNLSKANL 731
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ L ANLT + L +T L +++L G + GA+ ++A +D
Sbjct: 732 GKANLGAANLEGANLTGSNLNKTDLHQANLNGTDLTGANLNEANLD 777
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 48/103 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F +LR A K N AN SA++ +++ S + A L KA N GADL
Sbjct: 532 SKMDFTGVNLRGANLRKTNLCEANLNSAELNQANLSEANLRKANLSKAKLLGTNLQGADL 591
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L+E NL A++ L + +L + G + SDA
Sbjct: 592 RGVTLTEINLSEVNLHGAIISEAALNKINLAKTNLCGINLSDA 634
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 49/90 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL + K N +AN +A++ ++ +GS N L +A + TGA+L
Sbjct: 712 NGANLSEADLSRTNLSKANLGKANLGAANLEGANLTGSNLNKTDLHQANLNGTDLTGANL 771
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
++ +D + L++A LT A L++ L ++ L
Sbjct: 772 NEANLDEVNLHQAKLTKAKLIKVDLRKTKL 801
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 127 NFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
N RAN + ADM +D G+ A L + +AN +GA+LSD + L+ A
Sbjct: 409 NLSRANLSGADMHLANLNRTDLRGAVLCEAKLTRVTLEEANLSGANLSDAAVFEANLSRA 468
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADF 208
NL+ A L +T L S+L GA + D
Sbjct: 469 NLSGAKLYKTYLVESNLIGANLSETDL 495
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 10/106 (9%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G ADLR A + A+ + A++ E++ +K A LE + +AN +GAD
Sbjct: 365 GICPDLSGADLRSA-----DLTEADLSRANLSEANLCRAKLCAANLEGSNLSRANLSGAD 419
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
M LN +L AVL LTR L A + GA+ SDA +
Sbjct: 420 -----MHLANLNRTDLRGAVLCEAKLTRVTLEEANLSGANLSDAAV 460
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 7/109 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S ++ DL K + N N + AD+ + DF+G GA L K +AN A+L
Sbjct: 502 SESKLTRDDLTKMNLRETNLHGINLSGADLSKMDFTGVNLRGANLRKTNLCEANLNSAEL 561
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
++ L+EANL A L + L ++L GA + G ++ I+L++
Sbjct: 562 -----NQANLSEANLRKANLSKAKLLGTNLQGADLRGVTLTE--INLSE 603
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 58/126 (46%), Gaps = 25/126 (19%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTS----------ADMR----------ESDFSGSKFNG 150
A SA+L +A + N R+AN + AD+R E + G+ +
Sbjct: 554 ANLNSAELNQANLSEANLRKANLSKAKLLGTNLQGADLRGVTLTEINLSEVNLHGAIISE 613
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEG 205
A L K K N G +LSD + +M L+EANL+ A L + T L +++L A +
Sbjct: 614 AALNKINLAKTNLCGINLSDADLSKMNLSEANLSEANLSKANLRETNLHKTNLSKANLSK 673
Query: 206 ADFSDA 211
DFS A
Sbjct: 674 TDFSGA 679
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 53/124 (42%), Gaps = 25/124 (20%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
A A+LRKA K N AD+R E + G+ + A L K K
Sbjct: 564 ANLSEANLRKANLSKAKLLGTNLQGADLRGVTLTEINLSEVNLHGAIISEAALNKINLAK 623
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL---------------TRSDLGGAIIEG 205
N G +LSD + +M L+EANL+ A L + L +++D GA + G
Sbjct: 624 TNLCGINLSDADLSKMNLSEANLSEANLSKANLRETNLHKTNLSKANLSKTDFSGANLSG 683
Query: 206 ADFS 209
A+ S
Sbjct: 684 ANLS 687
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 18/124 (14%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
LA+LN+ + RG A A L + + N AN + A + E++ S + +G
Sbjct: 422 LANLNR--TDLRG-------AVLCEAKLTRVTLEEANLSGANLSDAAVFEANLSRANLSG 472
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEG 205
A L K ++N GA+LS+T + LN A+L+ + L R LT+ ++L G + G
Sbjct: 473 AKLYKTYLVESNLIGANLSETDL----LNGASLSESKLTRDDLTKMNLRETNLHGINLSG 528
Query: 206 ADFS 209
AD S
Sbjct: 529 ADLS 532
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A L KA +K + R+ D+ E D S + L K + G +LS D
Sbjct: 784 AKLTKAKLIKVDLRKTKLNKTDLCEIDLRESNLSKINLSKTNLSRTQLAGTNLS--FAD- 840
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L E+NL+ A L L+++ L GA ++GAD S+A ++
Sbjct: 841 --LRESNLSKADLYGADLSQAMLCGANLKGADLSEAKLN 877
>gi|291571143|dbj|BAI93415.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 331
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 79/163 (48%), Gaps = 21/163 (12%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + V +F+ AN +A ++E++ GS F L+ A +KAN T + +
Sbjct: 135 ADLEQVTLVDTDFKEANLKTAKLQEANLKGSTFELTQLQGANLWKANLQECFFLLTQLQK 194
Query: 176 MVLNEANLTNAV-----LVRTVLTRSDLGGAII----EGADFSDAVIDLAQKQ------A 220
+ LN ANL NA L+ L +++L GA I +GA+F +A + A Q A
Sbjct: 195 VNLNAANLQNAELQGVNLLEANLQQANLQGAYILGNLQGANFQEANLKGANLQGAYLQDA 254
Query: 221 LCKYAN--GTN----PITGVSTRKSLGCGNSRRNAYGSPSSPL 257
K AN G N +TGV+ ++ G + +NA G + +
Sbjct: 255 NFKRANLRGVNLKDANLTGVNFEEAHLQGANLQNAQGLTTQQI 297
>gi|30696344|ref|NP_851183.1| thylakoid lumenal protein [Arabidopsis thaliana]
gi|38503418|sp|P81760.2|TL17_ARATH RecName: Full=Thylakoid lumenal 17.4 kDa protein, chloroplastic;
AltName: Full=P17.4; Flags: Precursor
gi|13899115|gb|AAK48979.1|AF370552_1 thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|9759188|dbj|BAB09725.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|28059599|gb|AAO30073.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|332008985|gb|AED96368.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 236
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 123 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 182
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 183 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235
>gi|158313419|ref|YP_001505927.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
gi|158108824|gb|ABW11021.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
Length = 299
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 55/117 (47%), Gaps = 6/117 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR + R A AD+R++D S + GA L A+ A TGADL
Sbjct: 103 AYLSGADLRG-----TDLRDACLRGADLRDADLSQAALGGADLAGALLAGAFLTGADLHG 157
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 226
T + L+ A+L A L R L +D G I+ GAD A D +QA + A+
Sbjct: 158 TDLHGAFLHNADLRKAFLARADLRGADADGIIMRGADLRAADATDAVLRQADLRAAD 214
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G ADL A+ A T AD+ +D G+ + A L KA +A+ GAD
Sbjct: 131 SQAALGGADLAGALLAG-----AFLTGADLHGTDLHGAFLHNADLRKAFLARADLRGADA 185
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDA 211
+M L A+ T+AVL + L +D L GAI+ G D A
Sbjct: 186 DGIIMRGADLRAADATDAVLRQADLRAADLRGIRLAGAILRGVDLRGA 233
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 37/71 (52%)
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
AD+ +D +G G L A + A +GADL T + L A+L +A L + L
Sbjct: 78 ADLTGADLAGVCLTGRILRGAQLHGAYLSGADLRGTDLRDACLRGADLRDADLSQAALGG 137
Query: 196 SDLGGAIIEGA 206
+DL GA++ GA
Sbjct: 138 ADLAGALLAGA 148
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 36/78 (46%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ T AD+ +G GA L A A+ G DL D + L +A+L+ A L
Sbjct: 78 ADLTGADLAGVCLTGRILRGAQLHGAYLSGADLRGTDLRDACLRGADLRDADLSQAALGG 137
Query: 191 TVLTRSDLGGAIIEGADF 208
L + L GA + GAD
Sbjct: 138 ADLAGALLAGAFLTGADL 155
>gi|30696347|ref|NP_200161.2| thylakoid lumenal protein [Arabidopsis thaliana]
gi|332008984|gb|AED96367.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 235
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 122 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 181
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 182 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 234
>gi|427419722|ref|ZP_18909905.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425762435|gb|EKV03288.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 308
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANF 163
S A ADL A+ + F AN A + +SDFS + F GA L +A ANF
Sbjct: 115 SFATLTQADLSNAIGHRTRFSWANLVKAQLIDSDFSEAVFEGANLTRSNWHRATVRGANF 174
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
ADL + LN NLT A L+ +L ++ L GA++ A
Sbjct: 175 QQADLEAARLRAANLNGVNLTKANLLNAILEQTQLDGAVLMAA 217
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 5/116 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKA 161
IG +F A+L KA + +F A F A++ S++ G+ F A LE A A
Sbjct: 128 IGHRTRFSWANLVKAQLIDSDFSEAVFEGANLTRSNWHRATVRGANFQQADLEAARLRAA 187
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
N G +L+ + +L + L AVL+ + L GA + D++DA + AQ
Sbjct: 188 NLNGVNLTKANLLNAILEQTQLDGAVLMAAQADWATLNGASLIETDWTDASMMGAQ 243
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 58/126 (46%), Gaps = 15/126 (11%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ S A+F A L A + R F RE +KF+GA+L +A A T
Sbjct: 66 LAVLSDARFDKARLDAAELTRARLERGIF-----RELQAPKAKFHGAHLTEADLSFATLT 120
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRT----------VLTRSDLGGAIIEGADFSDAVID 214
ADLS+ + R + ANL A L+ + LTRS+ A + GA+F A ++
Sbjct: 121 QADLSNAIGHRTRFSWANLVKAQLIDSDFSEAVFEGANLTRSNWHRATVRGANFQQADLE 180
Query: 215 LAQKQA 220
A+ +A
Sbjct: 181 AARLRA 186
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F A+L ++ + R ANF AD+ + + NG L KAN A L
Sbjct: 150 SEAVFEGANLTRSNWHRATVRGANFQQADLEAARLRAANLNGVNL-----TKANLLNAIL 204
Query: 169 SDTLMDRMVL-----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T +D VL + A L A L+ T T + + GA +EGA+ + A
Sbjct: 205 EQTQLDGAVLMAAQADWATLNGASLIETDWTDASMMGAQLEGANLAGA 252
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 52/124 (41%), Gaps = 12/124 (9%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N + A RG A F ADL A N N T A++ + ++ +GA L
Sbjct: 163 NWHRATVRG-------ANFQQADLEAARLRAANLNGVNLTKANLLNAILEQTQLDGAVLM 215
Query: 155 KAVAYKANFTGA-----DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A A A GA D +D M L ANL A L L +++L A + DF+
Sbjct: 216 AAQADWATLNGASLIETDWTDASMMGAQLEGANLAGANLAGVNLQQANLENANLTAVDFT 275
Query: 210 DAVI 213
DA +
Sbjct: 276 DAQV 279
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 49/98 (50%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
D KA V A F A + ++ + ++ + A KA F GA L++ +
Sbjct: 58 DFSKATLVLAVLSDARFDKARLDAAELTRARLERGIFRELQAPKAKFHGAHLTEADLSFA 117
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L +A+L+NA+ RT + ++L A + +DFS+AV +
Sbjct: 118 TLTQADLSNAIGHRTRFSWANLVKAQLIDSDFSEAVFE 155
>gi|425467207|ref|ZP_18846491.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9809]
gi|389830088|emb|CCI28159.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9809]
Length = 442
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 163
S A A L KA N RRAN + A++ + SG+ GA L K A+ + AN
Sbjct: 310 SKANLSWAKLSKAKLSGANLRRANLSKANLSWAFMSGANLIGAILSKANLRGAILWGANL 369
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 222
+GA+LS L+ ANL+ A L L+++DL GA +E A F DA I QKQ L
Sbjct: 370 SGANLSGA-----NLSGANLSKADLSGANLSKADLSGAKVENAIFIDATGITPEQKQDLI 424
Query: 223 K 223
+
Sbjct: 425 R 425
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 65/145 (44%), Gaps = 37/145 (25%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT--------------------SA 136
Y+ + G I S A ADL A N RRAN + A
Sbjct: 235 YKVDLSG--AILSGAILSGADLSGA-----NLRRANLSWAFLSWADLIEADLSWAFLRRA 287
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN----------A 186
D+ ++D SG+K +GA L KA KAN + A LS + L ANL+ A
Sbjct: 288 DLIDADLSGAKLSGANLNKANLSKANLSWAKLSKAKLSGANLRRANLSKANLSWAFMSGA 347
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L+ +L++++L GAI+ GA+ S A
Sbjct: 348 NLIGAILSKANLRGAILWGANLSGA 372
Score = 43.9 bits (102), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 54/101 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A N +AN + A++ + S +K +GA L +A KAN + A +S
Sbjct: 287 ADLIDADLSGAKLSGANLNKANLSKANLSWAKLSKAKLSGANLRRANLSKANLSWAFMSG 346
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ +L++ANL A+L L+ ++L GA + GA+ S A
Sbjct: 347 ANLIGAILSKANLRGAILWGANLSGANLSGANLSGANLSKA 387
Score = 37.0 bits (84), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 6/166 (3%)
Query: 50 FPDCSNNQCAGPYAKLK-NWRVFVS---TALAAAVVASCSSNISALADLNKYEAETRGEF 105
+ + + P KLK + R+F+ L S + AL LN+ +++ E
Sbjct: 144 LQETARDNTIKPIMKLKGSIRLFLEGSEEGLKHLADLHQSGELQAL--LNELKSDDIPEI 201
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
+ A A + + + + R + + D SG+ +GA L A AN
Sbjct: 202 IVKKAEFTTDAKVIEKAELIKAIREGTIDKKTLYKVDLSGAILSGAILSGADLSGANLRR 261
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+LS + L EA+L+ A L R L +DL GA + GA+ + A
Sbjct: 262 ANLSWAFLSWADLIEADLSWAFLRRADLIDADLSGAKLSGANLNKA 307
>gi|428211266|ref|YP_007084410.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999647|gb|AFY80490.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 279
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 71/142 (50%), Gaps = 15/142 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A A+L + N R AN +A++ +++ S + + A + +A+ +AN A L
Sbjct: 53 TGANLREANLMGVTLHQANLREANLINANLSKANLSEADLSLANISRAIVERANLERAKL 112
Query: 169 -----SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
S+T + L EA + A L R L+ +DL GA +EGA+ + A++ QA+ +
Sbjct: 113 VQALASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAIL----IQAIME 168
Query: 224 YANGTNP------ITGVSTRKS 239
N TN +TGV+ R S
Sbjct: 169 KVNLTNATLNGANLTGVNLRDS 190
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
+ S + G A+L++A + N RAN + AD+ ++ G+ A L +A+ K N T A
Sbjct: 116 LASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAILIQAIMEKVNLTNA 175
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
LN ANLT L + L+R+++ G+ + GAD +
Sbjct: 176 ----------TLNGANLTGVNLRDSDLSRANMSGSNLAGADLT 208
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 15/98 (15%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
+DL +A N A+ T + +R ++ S + G LE A Y+AN ++LS
Sbjct: 190 SDLSRANMSGSNLAGADLTKSQLRGTNVSWTTMRGTNLEGASLYRANLGWSNLSG----- 244
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
ANLTNA+L+ T L R++L DF+ A++
Sbjct: 245 -----ANLTNAILMDTNLYRTNL-----RDVDFTGAIM 272
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 49/95 (51%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ +FR N A++ D S F A L +A TGA+L + + + L++ANL
Sbjct: 14 ERDFRNLNLIGANLAGLDLSEVTFRDADLRQANLTCTKLTGANLREANLMGVTLHQANLR 73
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
A L+ L++++L A + A+ S A+++ A +
Sbjct: 74 EANLINANLSKANLSEADLSLANISRAIVERANLE 108
>gi|76819210|ref|YP_336861.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
1710b]
gi|76583683|gb|ABA53157.1| pentapeptide repeat family protein [Burkholderia pseudomallei
1710b]
Length = 862
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G+AA+ + A ++ + A+ T D+ D G++ GA LE A A+ TGAD
Sbjct: 526 GAAARARRECVASAAAAGQSLQGADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGAD 585
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 586 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 622
Score = 41.2 bits (95), Expect = 0.48, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 780 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 839
Score = 37.4 bits (85), Expect = 6.4, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 567 AGAMLENADLSDADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 626
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 213
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 627 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 686
>gi|410472731|ref|YP_006896012.1| hypothetical protein BN117_2075 [Bordetella parapertussis Bpp5]
gi|408442841|emb|CCJ49408.1| Hypothetical protein BN117_2075 [Bordetella parapertussis Bpp5]
Length = 329
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 2/107 (1%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L +A + N RAN A++ ++ + + GA L +A +AN GA+L+D
Sbjct: 66 ADLAGANLARANLARANLARANLAGANLADAYLADADLAGANLARANLARANLAGANLAD 125
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ R L +A L +A L L R++L A + GAD + A DLA+
Sbjct: 126 AYLARAYLADAYLADAYLADADLARANLACANLAGADLAGA--DLAR 170
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 70/168 (41%), Gaps = 19/168 (11%)
Query: 104 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
E + S A ADL A N AN A + ++D +G+ A L +A +AN
Sbjct: 34 EQAVKSGANLARADLAGA-----NLAGANLADAYLADADLAGANLARANLARANLARANL 88
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DL 215
GA+L+D + L ANL A L R L ++L A + A +DA + DL
Sbjct: 89 AGANLADAYLADADLAGANLARANLARANLAGANLADAYLARAYLADAYLADAYLADADL 148
Query: 216 AQKQALCKYANGTNPITGVSTRKSLGCGN------SRRNAYGSPSSPL 257
A+ C G + R +L N +R N G+ + P+
Sbjct: 149 ARANLACANLAGADLAGADLARANLAGANLAGAYLARANLAGARNLPV 196
>gi|409993957|ref|ZP_11277081.1| hypothetical protein APPUASWS_22623 [Arthrospira platensis str.
Paraca]
gi|409935173|gb|EKN76713.1| hypothetical protein APPUASWS_22623 [Arthrospira platensis str.
Paraca]
Length = 336
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 79/163 (48%), Gaps = 21/163 (12%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + V +F+ AN +A ++E++ GS F L+ A +KAN T + +
Sbjct: 140 ADLEQVTLVDTDFKEANLKTAKLQEANLKGSTFELTQLQGANLWKANLQECFFLLTQLQK 199
Query: 176 MVLNEANLTNAV-----LVRTVLTRSDLGGAII----EGADFSDAVIDLAQKQ------A 220
+ LN ANL NA L+ L +++L GA I +GA+F +A + A Q A
Sbjct: 200 VNLNAANLQNAELQGVNLLEANLQQANLQGAYILGNLQGANFQEANLKGANLQGAYLQDA 259
Query: 221 LCKYAN--GTN----PITGVSTRKSLGCGNSRRNAYGSPSSPL 257
K AN G N +TGV+ ++ G + +NA G + +
Sbjct: 260 NFKRANLRGVNLKDANLTGVNFEEAHLQGANLQNAQGLTTQQI 302
>gi|427715910|ref|YP_007063904.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348346|gb|AFY31070.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 1031
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 56/108 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F A+L A N RAN + + ++ SG+ +GA L A +AN G L
Sbjct: 843 SGGNFSRANLSGANLSVANLSRANLSGTNFSRANLSGANLSGADLSTANLSRANLNGVYL 902
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
S ++R L+ AN + A L R L+ +DL GA + GAD SDA ++ A
Sbjct: 903 SRANLNRANLSGANFSRADLSRANLSGADLSGADLSGADLSDANLNRA 950
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
RG F G ADL A + AN + A++ ++D SG F+ A L A A
Sbjct: 801 RGNFNSVVGQFLGGADLSGANLSDADLSLANLSHANLSDADLSGGNFSRANLSGANLSVA 860
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
N + A+LS T R L+ ANL+ A L L+R++L G + A+ + A
Sbjct: 861 NLSRANLSGTNFSRANLSGANLSGADLSTANLSRANLNGVYLSRANLNRA 910
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 55/128 (42%), Gaps = 30/128 (23%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
S A ADL A N AN + AD+ +FS + +GA L A +AN +G
Sbjct: 818 SGANLSDADLSLA-----NLSHANLSDADLSGGNFSRANLSGANLSVANLSRANLSGTNF 872
Query: 166 ----------------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
A+L+ + R LN ANL+ A R L+R++L GA +
Sbjct: 873 SRANLSGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADL 932
Query: 204 EGADFSDA 211
GAD S A
Sbjct: 933 SGADLSGA 940
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A + N + A++ ++ SG+ F+ A L +A A+ +GADL
Sbjct: 878 SGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADLSGADL 937
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + LN ANL+ A L R L+ ++L A + G + S A
Sbjct: 938 SGADLSDANLNRANLSRANLKRANLSDANLSSANLSGDNLSRA 980
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A L +A + N ANF+ AD+ ++ SG+ +GA L A AN A+L
Sbjct: 893 SRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADLSGADLSGADLSDANLNRANL 952
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
S + R L++ANL++A L L+R++L A + A+ D
Sbjct: 953 SRANLKRANLSDANLSSANLSGDNLSRANLSRANLSDANLGD 994
>gi|334188366|ref|NP_001190531.1| thylakoid lumenal protein [Arabidopsis thaliana]
gi|332008986|gb|AED96369.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 250
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 137 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 196
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 197 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 249
>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris sp. CCMEE 5410]
Length = 514
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 20/116 (17%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F + DLR A+ + NF RANFT A++R ++ L +A A+ ADL
Sbjct: 411 KFQNTDLRDAILINANFGRANFTGANLRNAN----------LMQAYMSHADLANADLRG- 459
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
ANL++A L L ++L GA + GA S++ + AQ L Y NG
Sbjct: 460 ---------ANLSDAYLSHANLRGANLCGADLSGAKLSESQLSFAQTNWLTVYPNG 506
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-LMDRMVLNE---------ANLTNAVLV 189
+ DFSG L K ANF +T L D +++N ANL NA L+
Sbjct: 384 QRDFSGQDLRNLNLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGANLRNANLM 443
Query: 190 RTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ ++ +DL A + GA+ SDA + A
Sbjct: 444 QAYMSHADLANADLRGANLSDAYLSHA 470
>gi|416402943|ref|ZP_11687479.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
gi|357261803|gb|EHJ11028.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
Length = 330
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 20/121 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A DL +A + N + AN AD+R++D + + GA L A TGA+L++
Sbjct: 161 ANMKGVDLSRANLMGANLKEANLRDADLRKADLTNANLKGALLTDTNLTGAKLTGANLTN 220
Query: 171 TLMDR--------------------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
T M R VLN ANL A L T + +DL A + GA+ +D
Sbjct: 221 TNMVRAQLSQAELSDIMAKGAILTHAVLNRANLNQADLTLTRMNHADLSRANLSGANLTD 280
Query: 211 A 211
A
Sbjct: 281 A 281
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ A+L + A A++ ++D + ++ N A L +A AN T ADL +
Sbjct: 226 AQLSQAELSDIMAKGAILTHAVLNRANLNQADLTLTRMNHADLSRANLSGANLTDADLVE 285
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
R L ANLTNA L R L ++L G I+ GA D +
Sbjct: 286 AFFARANLMGANLTNANLTRAELMSANLAGVILRGATMPDGKV 328
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL--- 183
N N A + +++ S GA L AV +AN ADL+ T M+ L+ ANL
Sbjct: 217 NLTNTNMVRAQLSQAELSDIMAKGAILTHAVLNRANLNQADLTLTRMNHADLSRANLSGA 276
Query: 184 --TNAVLVRTVLTRSDLGGAIIEGADFSDA 211
T+A LV R++L GA + A+ + A
Sbjct: 277 NLTDADLVEAFFARANLMGANLTNANLTRA 306
>gi|186684326|ref|YP_001867522.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466778|gb|ACC82579.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 413
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 163
S A A+L KA+ V N + NFT A++ E+D S GS F A L KA +AN
Sbjct: 216 SNADLTEANLSKAIFVGANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKATLPEANL 275
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
G L + + + +ANL A+L L + L A +EGA DA
Sbjct: 276 QGVILRKANLSKAIFYDANLEGAILCDANLVGAILCDANLEGAILCDA 323
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 61/123 (49%), Gaps = 11/123 (8%)
Query: 96 KYEAETRGEFGIGSAAQ-----FGSADLRK-AVHVKENFRRANFTSADMRESDFSGSKFN 149
KYE E + + + Q G D K V+ K + R + ++AD+ E++ S + F
Sbjct: 172 KYEDELQVSSKLPTDIQTAITVIGRRDSHKDPVNQKLDLRNTDLSNADLTEANLSKAIFV 231
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
GA L+ +AN + ADLS T + V EANL+ A L ++L G I+ A+ S
Sbjct: 232 GANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKA-----TLPEANLQGVILRKANLS 286
Query: 210 DAV 212
A+
Sbjct: 287 KAI 289
>gi|162456757|ref|YP_001619124.1| pentapeptide repeat-containing protein [Sorangium cellulosum So
ce56]
gi|161167339|emb|CAN98644.1| pentapeptide repeats hypothetical protein [Sorangium cellulosum So
ce56]
Length = 895
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/111 (34%), Positives = 58/111 (52%), Gaps = 11/111 (9%)
Query: 108 GSAAQFGSADLRKAVHVK-ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
G F SA L++ V V +F A+F+ ADM ++ G+ GA L++A A+ +G
Sbjct: 747 GERVSFRSACLQQGVVVHGSSFPEADFSDADMERANLRGTVLAGARLDRANLRGADLSGC 806
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
D S EA+L AVL +L R+DL A ++GA+ DA+ A+
Sbjct: 807 DAS----------EASLERAVLQGGLLIRTDLVNASLQGANLMDALASKAR 847
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 11/118 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVA-YKAN 162
S +F ADL +A V+ A+F+SA +R++ F F A L++ V + ++
Sbjct: 708 SGVRFTGADLSEANLVESTLDGADFSSATLRKTTFVACHGERVSFRSACLQQGVVVHGSS 767
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
F AD SD M+R ANL VL L R++L GA + G D S+A ++ A Q
Sbjct: 768 FPEADFSDADMER-----ANLRGTVLAGARLDRANLRGADLSGCDASEASLERAVLQG 820
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 10/102 (9%)
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
A+ V E+ +FT A++ SG +GA+LE A+ +G DLS T ++ VL
Sbjct: 570 ALEVGESLANRDFTGANLAGMCLSGVDLSGAFLE-----SADLSGCDLSRTNLEGAVLAR 624
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDAVIDLAQ 217
ANL A L L ++LGGA + G AD +AV+ A+
Sbjct: 625 ANLAGANLADARLRGANLGGAALRGASLDRADLKEAVLSRAE 666
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 55/173 (31%), Positives = 81/173 (46%), Gaps = 36/173 (20%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGS----ADLRKAVHVKENFR 129
T L AV+A + LA N +A RG +G AA G+ ADL++AV +
Sbjct: 615 TNLEGAVLARAN-----LAGANLADARLRGA-NLGGAALRGASLDRADLKEAVLSRAELE 668
Query: 130 RANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDRMVLN 179
RA F+ AD+ +D+ G+ F GA L + K + FTGADLS+ + L+
Sbjct: 669 RARFSGADLTGADWFETKPGGADFTGATLGQCNLLKVDLSGVRFTGADLSEANLVESTLD 728
Query: 180 EANLTNAVLVRTVLT---------RSDL--GGAIIEG-----ADFSDAVIDLA 216
A+ ++A L +T RS G ++ G ADFSDA ++ A
Sbjct: 729 GADFSSATLRKTTFVACHGERVSFRSACLQQGVVVHGSSFPEADFSDADMERA 781
Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats.
Identities = 34/118 (28%), Positives = 47/118 (39%), Gaps = 20/118 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL--------------- 153
S +L AV + N AN A +R ++ G+ GA L
Sbjct: 608 SGCDLSRTNLEGAVLARANLAGANLADARLRGANLGGAALRGASLDRADLKEAVLSRAEL 667
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
E+A A+ TGAD +T A+ T A L + L + DL G GAD S+A
Sbjct: 668 ERARFSGADLTGADWFETKP-----GGADFTGATLGQCNLLKVDLSGVRFTGADLSEA 720
>gi|434388230|ref|YP_007098841.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428019220|gb|AFY95314.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 193
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 45/96 (46%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G A ADLR A N + N AD+R +D +G GA L +A AN T AD
Sbjct: 97 GDRASLHKADLRLASLQGANLSQVNLVGADLRYADLTGVNLTGANLSRANLTGANLTKAD 156
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
L + + +L NL A L+ L+ DL AI+
Sbjct: 157 LRGVTLAQAILENTNLCEASLIDVDLSCVDLRHAIL 192
>gi|427738633|ref|YP_007058177.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427373674|gb|AFY57630.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 436
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 7/135 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A DL A N ANFT+ ++ ++F+ + GA LE A AN T ADLS
Sbjct: 239 ADLSGIDLCDANFSDANLEGANFTNVNLEGANFTNANLEGANLENAKLNNANLTNADLSY 298
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----DFSDAVIDLAQKQALCKYA 225
T + + L ANL N+ L +R++L AI+ GA +FSDA +L + Y
Sbjct: 299 TNLRKADLRCANLINSDLSNADASRANLSDAIVNGANLIQSNFSDA--NLRGCNLIKTYL 356
Query: 226 NGTNPITGVSTRKSL 240
+G N I R +L
Sbjct: 357 SGANLIRADLKRANL 371
>gi|300863629|ref|ZP_07108569.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300338371|emb|CBN53713.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 386
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 51/90 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQ ADL +A + AN + A++ ++ +G+ A L KA KAN ADL
Sbjct: 160 SQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANLTRADLTKANLLKANLRRADL 219
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+++ ++ L EA+L+ A+L R L+++DL
Sbjct: 220 TESYLNWASLGEADLSEAILTRANLSKADL 249
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A A L A N +AN DM ++FSG+ N A L KA K+N + A L
Sbjct: 105 TGANLTGAHLNWANLSTANLSKANLKGTDMSAANFSGAILNDANLGKAYLIKSNLSQAQL 164
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+D + + L +A+LT+A L L R++L GA + AD + A
Sbjct: 165 NDADLTQANLKDADLTDANLSGAELARANLAGANLTRADLTKA 207
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 73/147 (49%), Gaps = 11/147 (7%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L +VV S S + L N E + R + IG A ADL KA + RAN +
Sbjct: 25 LVLSVVDSHSGDTPTLVLANINEQQNR-PYLIG--ANLSEADLSKA-----HLSRANLSK 76
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
AD+ ++ G+ GA L A AN TGA+L+ ++ L+ ANL+ A L T ++
Sbjct: 77 ADLSGANLCGANLVGASLSGANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDMSA 136
Query: 196 SDLGGAIIEGADFSDAVI---DLAQKQ 219
++ GAI+ A+ A + +L+Q Q
Sbjct: 137 ANFSGAILNDANLGKAYLIKSNLSQAQ 163
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A V + AN T A++ ++ +G+ N A L A KAN G D+
Sbjct: 75 SKADLSGANLCGANLVGASLSGANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDM 134
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S +LN+ANL A L+++ L+++ L A + A+ DA
Sbjct: 135 SAANFSGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDA 177
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L KA +K N +A AD+ +++ + A L A +AN GA+L
Sbjct: 140 SGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANL 199
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ R L +ANL A L R LT S L A + AD S+A++
Sbjct: 200 T-----RADLTKANLLKANLRRADLTESYLNWASLGEADLSEAIL 239
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L +A K N +AN AD+ ES + + A L +A+ +AN + ADLS
Sbjct: 192 ANLAGANLTRADLTKANLLKANLRRADLTESYLNWASLGEADLSEAILTRANLSKADLSK 251
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T + ++VL+ +L+ L L DL ++ G + + A +
Sbjct: 252 TYLRKIVLHGCHLSGINLSGADLGGLDLSKKLLTGINLASAYL 294
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A G DL K + N AN + A + E++ S + GA L A KANF
Sbjct: 270 SGADLGGLDLSKKLLTGINLASAYLSEANLSGAYLIEANLSDANLCGADLSDACLMKANF 329
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA M + L+ ANLT A L + L ++L GAI+ AD A
Sbjct: 330 IGAR-----MGNINLSNANLTGAKLCKADLMGANLRGAILTEADMRGA 372
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 2/108 (1%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A + N AN T AD+ +++ + A L ++ A+ ADLS+
Sbjct: 177 ADLTDANLSGAELARANLAGANLTRADLTKANLLKANLRRADLTESYLNWASLGEADLSE 236
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
++ R L++A+L+ L + VL L G + GAD +DL++K
Sbjct: 237 AILTRANLSKADLSKTYLRKIVLHGCHLSGINLSGADLGG--LDLSKK 282
>gi|119488080|ref|ZP_01621524.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
gi|119455369|gb|EAW36508.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
Length = 351
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 48/85 (56%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N A +A++ + S + GA L K AN +GADLS+ + + +L EA L A
Sbjct: 27 NLMAAQLNAANLNRVNLSYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGA 86
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L +T+L +++L GA++ G+ S+A
Sbjct: 87 SLTQTLLVQANLSGALLSGSILSEA 111
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 36/139 (25%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 166
S A +A+L +A+ ++ A+ T + +++ SG+ +G+ L +A AN TGA
Sbjct: 64 SGADLSNANLSQAILIEATLNGASLTQTLLVQANLSGALLSGSILSEADLSGANLTGASL 123
Query: 167 -----------------------------DLSDTLMDRMVLNE-----ANLTNAVLVRTV 192
DLS + R +L+E ANL++A L+R
Sbjct: 124 IGTSLLNGSKLIEATLIGATLSRATLSAIDLSGVNLTRAILSESELGGANLSSACLIRAY 183
Query: 193 LTRSDLGGAIIEGADFSDA 211
L RS+L GA + GAD S+A
Sbjct: 184 LNRSNLSGANLMGADLSEA 202
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AAQ +A+L + N AN + + ++ SG+ + A L +A+ +A GA L+
Sbjct: 30 AAQLNAANLNRVNLSYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGASLT 89
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
TL+ +ANL+ A+L ++L+ +DL GA + GA
Sbjct: 90 QTLLV-----QANLSGALLSGSILSEADLSGANLTGASL 123
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L K + N A+ ++A++ ++ + NGA L + + +AN +GA L
Sbjct: 44 SYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGASLTQTLLVQANLSGALL 103
Query: 169 SDTLMDRMVLNEANLTNAVLVRT-VLTRSDLGGAIIEGADFSDAVI 213
S +++ L+ ANLT A L+ T +L S L A + GA S A +
Sbjct: 104 SGSILSEADLSGANLTGASLIGTSLLNGSKLIEATLIGATLSRATL 149
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL +A N AN T A+++ +D G+ NGA L A N A+L
Sbjct: 190 SGANLMGADLSEASLCNANLCVANLTRANLQGADLEGANLNGAQLSGANLKSTNLKNANL 249
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ ++ L A+L+ A L LT ++L GA + AD A
Sbjct: 250 NGLILHEADLRLADLSQANLRGANLTGANLAGASLLEADLRGA 292
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S ++ G A+L A ++ R+N + A++ +D S + A L A +AN GA
Sbjct: 163 ILSESELGGANLSSACLIRAYLNRSNLSGANLMGADLSEASLCNANLCVANLTRANLQGA 222
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
DL + LN A L+ A L T L ++L G I+ AD A DL+Q
Sbjct: 223 DL-----EGANLNGAQLSGANLKSTNLKNANLNGLILHEADLRLA--DLSQ 266
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 46/96 (47%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A L +A + N T A + ES+ G+ + A L +A ++N +GA+L +
Sbjct: 142 ATLSRATLSAIDLSGVNLTRAILSESELGGANLSSACLIRAYLNRSNLSGANLMGADLSE 201
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL A L R L +DL GA + GA S A
Sbjct: 202 ASLCNANLCVANLTRANLQGADLEGANLNGAQLSGA 237
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEA 181
N RAN AD+ ++ +G++ +GA L+ AN G ADL + + L A
Sbjct: 213 NLTRANLQGADLEGANLNGAQLSGANLKSTNLKNANLNGLILHEADLRLADLSQANLRGA 272
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NLT A L L +DL GA + A+ A
Sbjct: 273 NLTGANLAGASLLEADLRGANLSHANLKGA 302
>gi|409912856|ref|YP_006891321.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
KN400]
gi|298506440|gb|ADI85163.1| pentapeptide repeat domain protein [Geobacter sulfurreducens KN400]
Length = 259
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 51/98 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+F A+L A K N + NF+ A++ ++FSG+K A L AV NF+ ADLS
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
T + + L AN A T+L + L GA GAD
Sbjct: 177 TDLGSLDLEGANFRGATFNGTLLRDAKLKGADFTGADL 214
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 73/162 (45%), Gaps = 6/162 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ A L +A+ + R A+ + A + + F G+ +GA + K K NF+ A+L
Sbjct: 85 TGAQMDGASLDEAIFDTADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANL 144
Query: 169 SDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADFSDAVID-LAQKQALC 222
++ L ANL AVL T L+ +DLG +EGA+F A + + A
Sbjct: 145 TNANFSGAKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLLRDAKL 204
Query: 223 KYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK 264
K A+ T S S+ ++ N G P+ A Q+
Sbjct: 205 KGADFTGADLRQSRFHSVSIYDTATNRLGESFDPVRCADLQE 246
>gi|334117107|ref|ZP_08491199.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461927|gb|EGK90532.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 520
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 2/138 (1%)
Query: 74 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANF 133
T L+ A + N++ L+ N Y+A G I + A ADLR+A V+ R+
Sbjct: 50 TNLSNANMRKAKLNVARLSGANLYKANLSG--AILNVANLIRADLREAQLVEATMIRSEL 107
Query: 134 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 193
A++ ++ +G+ + A L +A +AN ADLS + L ANL A L R L
Sbjct: 108 IRANLSSANLTGANLSEADLREATLREANLEQADLSGAHLRGASLTAANLERANLHRADL 167
Query: 194 TRSDLGGAIIEGADFSDA 211
+R+DL G + A+ A
Sbjct: 168 SRADLRGVNLCNAELRQA 185
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 63/111 (56%), Gaps = 9/111 (8%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+ +A+ N N ++A+MR++ + ++ +GA L YKAN +GA L+ + R
Sbjct: 35 ANFSEAILSLTNMSGTNLSNANMRKAKLNVARLSGANL-----YKANLSGAILNVANLIR 89
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
L EA L A ++R+ L R++L A + GA+ S+A DL ++A + AN
Sbjct: 90 ADLREAQLVEATMIRSELIRANLSSANLTGANLSEA--DL--REATLREAN 136
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 51/92 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+A+LR+A + N A+ A++R +D SG+ GA L++A AN GA+LS+ +
Sbjct: 179 NAELRQANLSQANLSGADLRGANLRWADLSGANLTGADLDEARLSGANLYGANLSNVNLL 238
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
L A+LT A L+ +DL GA + GA
Sbjct: 239 NATLVHADLTQANLIHADWVGADLTGAALTGA 270
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADLR+A + N +A+ + A +R + + + A L +A +A+ G +L
Sbjct: 118 TGANLSEADLREATLREANLEQADLSGAHLRGASLTAANLERANLHRADLSRADLRGVNL 177
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + + L++ANL+ A L L +DL GA + GAD +A
Sbjct: 178 CNAELRQANLSQANLSGADLRGANLRWADLSGANLTGADLDEA 220
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 10/134 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 163
+AA A+L +A + + R N +A++R+++ S G+ GA L A AN
Sbjct: 153 TAANLERANLHRADLSRADLRGVNLCNAELRQANLSQANLSGADLRGANLRWADLSGANL 212
Query: 164 TGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
TGADL + + L ANL+ NA LV LT+++L A GAD + A + A+
Sbjct: 213 TGADLDEARLSGANLYGANLSNVNLLNATLVHADLTQANLIHADWVGADLTGAALTGAKI 272
Query: 219 QALCKYANGTNPIT 232
A+ ++ + IT
Sbjct: 273 YAVSRFDVKADDIT 286
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 51/110 (46%), Gaps = 3/110 (2%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A LR A N RAN AD+ +D G A L +A +AN +GADL
Sbjct: 140 ADLSGAHLRGASLTAANLERANLHRADLSRADLRGVNLCNAELRQANLSQANLSGADLRG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 217
+ L+ ANLT A L L+ ++L GA + + +A + DL Q
Sbjct: 200 ANLRWADLSGANLTGADLDEARLSGANLYGANLSNVNLLNATLVHADLTQ 249
>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 377
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 59/111 (53%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT- 164
A A+L A+ VK + + RAN T AD+RE+D SG++ A L KA KAN +
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSL 199
Query: 165 ----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L + ++ + EANL NA L ++ L ++L A + A+ S A
Sbjct: 200 ANLDSANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKA 250
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 57/121 (47%), Gaps = 15/121 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A S DL +A R+N A++ E+D SG+ L A+ AN + DLS
Sbjct: 65 ANLSSTDLVRANLRSARLDRSNLVRANLYEADLSGASLVNINLSNAICASANLSHVDLSQ 124
Query: 171 TLM----------DRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDAVIDL 215
+ + DR L +ANL+ A+LV+ +L R++L A + AD S A + L
Sbjct: 125 SNLSSTNLSLANLDRADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYL 184
Query: 216 A 216
A
Sbjct: 185 A 185
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L+AA++ S L N EA+ R E + S AQ A L KA K N AN S
Sbjct: 147 LSAAILVKASLKQVILNRANLTEADLR-EADL-SGAQLYLAVLSKANLAKANLSLANLDS 204
Query: 136 ADMRESDFSGSKFNGAYLE---------------KAVAYKANFTGADLSDTLMDRMVLNE 180
A++ E+ GS F A LE KA KAN + A+L+ ++ + L
Sbjct: 205 ANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKANLTSAILSQANLLG 264
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
ANL A L + L SD GA ++G + S A ++
Sbjct: 265 ANLAGASLAKANLAESDCFGANLQGTNLSQANVE 298
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A ADL A V N A N + D+ +S+ S + + A L++A +AN +
Sbjct: 90 ANLYEADLSGASLVNINLSNAICASANLSHVDLSQSNLSSTNLSLANLDRADLTQANLSA 149
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
A L + +++LN ANLT A L L+ + L A++ A+ + A + LA
Sbjct: 150 AILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSLA 200
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFTGADLSD 170
A+L KA K N +AN TSA + +++ G+ GA L KA + AN G +LS
Sbjct: 235 ANLTKANLRKANLSKANLTSAILSQANLLGANLAGASLAKANLAESDCFGANLQGTNLSQ 294
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ + L E++L A LV ++L GA + GA+ DA
Sbjct: 295 ANVEAVDLRESDLAKANLV-----GANLAGANLFGAELLDA 330
>gi|297796179|ref|XP_002865974.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
gi|297311809|gb|EFH42233.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
Length = 236
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 123 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 182
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 242
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 183 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235
>gi|220906448|ref|YP_002481759.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863059|gb|ACL43398.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 309
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 55/115 (47%), Gaps = 9/115 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
L +YEA R GI +LR A + N R N A + +++F G+ GA L
Sbjct: 134 LQRYEAGERNFQGI---------NLRGAQLNQLNLRAINLEQAQLEDANFQGTVLEGANL 184
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+A +AN GA L + +D L A+L A L T L R++L A + G +F
Sbjct: 185 RQANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNF 239
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 5/121 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L+ A + AN TSAD+ + + + A L A NF ADL
Sbjct: 187 ANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNFWLADLQS 246
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+ L ANL + R ++L G + GAD DA+ D Q C + G NP
Sbjct: 247 VNFTQANLTGANLGGTDVSRANFKAANLTGVNLSGADRRDAIYD----QFTC-FPEGFNP 301
Query: 231 I 231
+
Sbjct: 302 L 302
>gi|15892731|ref|NP_360445.1| hypothetical protein RC0808 [Rickettsia conorii str. Malish 7]
gi|15619907|gb|AAL03346.1| unknown [Rickettsia conorii str. Malish 7]
Length = 957
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 607
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A+ KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEAKLKQANLKAA 667
Query: 226 N 226
N
Sbjct: 668 N 668
Score = 42.4 bits (98), Expect = 0.24, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIFKEAKLKQANLKAANLAGINKEGADFDKAKINDATK 689
Score = 40.4 bits (93), Expect = 0.87, Method: Composition-based stats.
Identities = 26/109 (23%), Positives = 53/109 (48%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 382 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 441
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M +++++ ++ TN+ L L +D+ ++G ++A++D A
Sbjct: 442 MVNADAEKLIIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 490
>gi|302522367|ref|ZP_07274709.1| OxyO [Streptomyces sp. SPB78]
gi|302431262|gb|EFL03078.1| OxyO [Streptomyces sp. SPB78]
Length = 233
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 64/139 (46%), Gaps = 16/139 (11%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE---------------KAVAYK 160
+DL A+ N AN T A+++ S S + N A+L KA ++
Sbjct: 78 SDLSHAMLYGANLAYANLTDANLKYSSLSSTHLNEAWLSHSVLSHASLSLADLSKANLHE 137
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
A+ T AD+S + L A +TNA RT L+ ++L GA + GAD S V +L QKQ
Sbjct: 138 ADLTKADVSGANLSEADLAGAKMTNANFFRTNLSGAELTGADLSGADLS-TVKNLTQKQV 196
Query: 221 LCKYANGTNPITGVSTRKS 239
N T + TR S
Sbjct: 197 SSARTNRTTRLPSGLTRAS 215
>gi|414077930|ref|YP_006997248.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413971346|gb|AFW95435.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 189
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N + N D +D G+ A L A KAN GA L+ + ++L A+LT A
Sbjct: 26 NLQGVNLGGVDFGRADLRGANLTAASLSGANLSKANLQGAILARAHLSEVILCGADLTQA 85
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
L L SDL GA++ GA+ DA + +A A
Sbjct: 86 TLTTAHLNESDLSGALLSGANLCDANLHMASISA 119
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 20/113 (17%)
Query: 109 SAAQFGSADLRKAV----HVKE------NFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
S A A+L+ A+ H+ E + +A T+A + ESD SG+ +GA L A
Sbjct: 53 SGANLSKANLQGAILARAHLSEVILCGADLTQATLTTAHLNESDLSGALLSGANLCDANL 112
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ A+ + A+L ANL+ A + + ++DL GA + GAD S+A
Sbjct: 113 HMASISAANLQG----------ANLSGAKMGGVRMWKADLQGADLSGADLSEA 155
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A +DL A+ N AN A + ++ G+ +GA + +KA+ GADL
Sbjct: 88 TTAHLNESDLSGALLSGANLCDANLHMASISAANLQGANLSGAKMGGVRMWKADLQGADL 147
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S + L E NLT A L T ++ + L GAI+
Sbjct: 148 SGADLSEANLCEVNLTGANLDDTDMSETFLTGAIM 182
>gi|167921391|ref|ZP_02508482.1| pentapeptide repeat protein [Burkholderia pseudomallei BCC215]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQVADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|332710578|ref|ZP_08430523.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332350633|gb|EGJ30228.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 185
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 51/106 (48%), Gaps = 15/106 (14%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 162
G+ A F +A+L A N RA+F+ A + +D SGS F GA L +KAN
Sbjct: 88 GTGATFRNANLDSAYATGANMSRADFSGASVVWANFISADLSGSSFRGADLSNTTFFKAN 147
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
GADLS ANLTNA + LT ++L A + GA
Sbjct: 148 LNGADLSG----------ANLTNANFINADLTNANLDNANLTGAQL 183
>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 405
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 59/118 (50%), Gaps = 12/118 (10%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ RG F S A ADLR+A AN + AD+ E++ SG+ GA L A+
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 297
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 211
+ AN GA LS + +L+ ANL A L L+ ++L GAI+ AD +A
Sbjct: 298 WGANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 355
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 56/121 (46%), Gaps = 17/121 (14%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
T+ EF A A+L KA+ R +++ D SG+ GA L A +
Sbjct: 202 TKAEFTT-DAKVIKKAELIKAI------REGTIDKTTLQQVDLSGAILRGADLRGAFLSE 254
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDL 215
AN GADL R L+EANL+ A L L+ +DL GAI+ GA+ A + L
Sbjct: 255 ANLKGADLR-----RAFLSEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSL 309
Query: 216 A 216
A
Sbjct: 310 A 310
>gi|443475216|ref|ZP_21065173.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443020003|gb|ELS34017.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 352
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 52/102 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L A V A A + ++DF+G NGA + A + GADL+D + R
Sbjct: 228 ANLSSASLVGAVLNNAKLERAILIDADFNGVTLNGAIMADIKASRVQMQGADLTDAKLSR 287
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L+ ANL A++VR L + L + AD +DA+++ A+
Sbjct: 288 ADLSRANLKGAIMVRANLIEAYLARTNLADADLTDAILNRAE 329
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 15/113 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A K + R AN A + ++ S +K N AYL+ +AN + A L
Sbjct: 176 SGADLRGANLSGADLYKADLRGANLQEATLSGANLSEAKLNNAYLQGVFLTEANLSSASL 235
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII----------EGADFSDA 211
VLN A L A+L+ L GAI+ +GAD +DA
Sbjct: 236 VGA-----VLNNAKLERAILIDADFNGVTLNGAIMADIKASRVQMQGADLTDA 283
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 68/137 (49%), Gaps = 9/137 (6%)
Query: 88 ISALADLNKYEAETR----GEFGIGSAAQFGSADLRKAVHVKE----NFRRANFTSADMR 139
++ L D N +A+ R G +G A G A+LR+ V + + R + A +
Sbjct: 113 LANLMDANLIDADMRTINLGGANLGGACMRG-ANLRQERAVGDRDEIDVSRKKRSIASLI 171
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
++ SG+ GA L A YKA+ GA+L + + L+EA L NA L LT ++L
Sbjct: 172 GANLSGADLRGANLSGADLYKADLRGANLQEATLSGANLSEAKLNNAYLQGVFLTEANLS 231
Query: 200 GAIIEGADFSDAVIDLA 216
A + GA ++A ++ A
Sbjct: 232 SASLVGAVLNNAKLERA 248
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F L A+ R AD+ ++ S + + A L+ A+ +AN A L+
Sbjct: 253 ADFNGVTLNGAIMADIKASRVQMQGADLTDAKLSRADLSRANLKGAIMVRANLIEAYLA- 311
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
R L +A+LT+A+L R L+ ++L GAI++GA D +
Sbjct: 312 ----RTNLADADLTDAILNRAELSSANLVGAILKGATLPDGKV 350
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 10/93 (10%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+LR A+ V N AN A + + S + +GA L+ + AN + A+L D
Sbjct: 64 ANLRGALMVGANLCGANLNQASLSNVNLSNADLHGASLQGTTLFGANLSLANLMD----- 118
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
ANL +A + L ++LGGA + GA+
Sbjct: 119 -----ANLIDADMRTINLGGANLGGACMRGANL 146
>gi|381205231|ref|ZP_09912302.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 236
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ ADL A + + AN A+++ ++ +G+ A L A YKAN GADL
Sbjct: 99 AQLVGADLEGADLDRADLFEANLEIANLQWANLAGASLENANLGLANLYKANLQGADLRG 158
Query: 171 TLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAV 212
+ +L EANL+N A L+ L+R++L GA ++GA +A+
Sbjct: 159 ANLTGAMLGEANLSNANLEGARLMVVNLSRANLKGANLKGAKIHEAI 205
Score = 37.0 bits (84), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 41/86 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G A+L KA + R AN T A + E++ S + GA L +AN GA+L
Sbjct: 139 ANLGLANLYKANLQGADLRGANLTGAMLGEANLSNANLEGARLMVVNLSRANLKGANLKG 198
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRS 196
+ + + A+LT+ + + R+
Sbjct: 199 AKIHEAIFSGADLTDVEMTDAQICRT 224
>gi|75906828|ref|YP_321124.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75700553|gb|ABA20229.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 727
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 47/103 (45%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L + + + AN T D + SD SG+ +AN + ADLS
Sbjct: 581 ANLYGARLSRVIAIGAQLSFANLTKTDWQSSDLSGADLE----------RANLSNADLSA 630
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T M +L A L NA L L+ DL GA + GADF D ++
Sbjct: 631 TRMTGAILRSAQLENANLRNADLSLVDLRGANVAGADFKDTIL 673
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)
Query: 133 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 173
F SA++ ++ F GS+F A L +A +ANFT A+LS LM
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRLDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528
Query: 174 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 211
R LN ANL+NA L+ L +DL G ++E GAD DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 24/137 (17%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFS 144
A+ADL++ + + A F A+L + + + + RAN ++A + ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549
Query: 145 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 194
++ GA L V A+ TGADL D + R++ A L+ A L +T
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609
Query: 195 RSDLGGAIIEGADFSDA 211
SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626
>gi|359459044|ref|ZP_09247607.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 256
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 53/98 (54%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR A + + AD+R ++ +G+ A L K +AN +GA LS +
Sbjct: 43 ADLRGADLEGIDLNHIDLCWADLRGTNLAGANLQAANLMKTDFCQANLSGAILSGASLQD 102
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
V+ +A+L A+L++T + ++ L GAI+ GA+ A I
Sbjct: 103 AVMTQADLNGAILIKTSMIQTRLRGAILRGANLKQARI 140
>gi|332705303|ref|ZP_08425383.1| hypothetical protein LYNGBM3L_05720 [Moorea producens 3L]
gi|332355929|gb|EGJ35389.1| hypothetical protein LYNGBM3L_05720 [Moorea producens 3L]
Length = 240
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F A+L + NF A T A++ E+ GS F+ A L A KAN GA+LS
Sbjct: 133 AVNFTKANLSRV-----NFTEAVMTGANLNEAQLIGSNFDKANLTGADLVKANLKGANLS 187
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ L EANL+ L + LT +DL A ++GA+ +A +D
Sbjct: 188 QANLSYTNLREANLSETNLRKANLTGADLTHANLQGANLIEAELD 232
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 10/82 (12%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSD-----TLMDRMVLNEANLTNA 186
D+ E D SG NG L +A AN +G+ DL++ + D+ +L A L A
Sbjct: 30 DLMEVDLSGQNLNGFNLFQAELMGANLSGSLLIYTDLTEACVVGAIFDKAILRHAYLNRA 89
Query: 187 VLVRTVLTRSDLGGAIIEGADF 208
L RT R+DL +E A+
Sbjct: 90 KLTRTSFQRADLTMTSLEDANL 111
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKA 161
I A A L +A + +F+RA+ T A++ +FS + GA L +
Sbjct: 75 IFDKAILRHAYLNRAKLTRTSFQRADLTMTSLEDANLIRVNFSLADLEGANLFRTNLIAV 134
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDA 211
NFT A+LS V+ ANL A L+ + LT +DL A ++GA+ S A
Sbjct: 135 NFTKANLSRVNFTEAVMTGANLNEAQLIGSNFDKANLTGADLVKANLKGANLSQA 189
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A A+L +A + NF +AN T AD+ +++ G+ + A L +Y N A+L
Sbjct: 147 TEAVMTGANLNEAQLIGSNFDKANLTGADLVKANLKGANLSQANL----SY-TNLREANL 201
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S+T + + L A+LT+A L L ++L G I+
Sbjct: 202 SETNLRKANLTGADLTHANLQGANLIEAELDGVIL 236
>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 403
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 59/118 (50%), Gaps = 12/118 (10%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ RG F S A ADLR+A AN + AD+ E++ SG+ GA L A+
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 295
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 211
+ AN GA LS + +L+ ANL A L L+ ++L GAI+ AD +A
Sbjct: 296 WGANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 353
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 56/121 (46%), Gaps = 17/121 (14%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
T+ EF A A+L KA+ R +++ D SG+ GA L A +
Sbjct: 200 TKAEFTT-DAKVIKKAELIKAI------REGTIDKTTLQQVDLSGAILRGADLRGAFLSE 252
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDL 215
AN GADL R L+EANL+ A L L+ +DL GAI+ GA+ A + L
Sbjct: 253 ANLKGADLR-----RAFLSEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSL 307
Query: 216 A 216
A
Sbjct: 308 A 308
>gi|126455703|ref|YP_001074295.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
1106a]
gi|167896768|ref|ZP_02484170.1| pentapeptide repeat protein [Burkholderia pseudomallei 7894]
gi|242312992|ref|ZP_04812009.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
gi|254195379|ref|ZP_04901807.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
gi|126229471|gb|ABN92884.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106a]
gi|169652126|gb|EDS84819.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
gi|242136231|gb|EES22634.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQVADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|428316245|ref|YP_007114127.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428239925|gb|AFZ05711.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 410
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA F SA+L + NFR A AD +++ +KF GA L A A+ +GADL
Sbjct: 292 SAVDFSSANLDRV-----NFRGATLNDADFSDANLQNAKFGGADLSGAFLGNADLSGADL 346
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ L+ ANL+ A L+ LT ++ GA +E A F +
Sbjct: 347 HKASLALANLSGANLSGANLLEVNLTNTNFSGANVESARFGN 388
>gi|378826441|ref|YP_005189173.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
HH103]
gi|365179493|emb|CCE96348.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
HH103]
Length = 250
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 11/122 (9%)
Query: 111 AQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A +A+L KA V+ + +ANF+ + DFSG GA + +A+FTG
Sbjct: 88 ADLTAANLEKATLVRASLAGAKADKANFSRVEGYRGDFSGISAEGALFVSSELQRADFTG 147
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQKQ 219
A L+ ++ L AN AV+ T L+R+DL GA+ EG DF A + L + +
Sbjct: 148 ARLTGADFEKAELGRANFGKAVVTGTRFSVANLSRADLSGAVFEGPIDFDRAFLFLTRIE 207
Query: 220 AL 221
L
Sbjct: 208 GL 209
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 47/111 (42%), Gaps = 5/111 (4%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
G A + + R+ + + + N D +D +G+ A LEKA +A+ GA
Sbjct: 50 GPGADWQDCNKRQLMLGGSDLKGGNLVDTDFASTDLNGADLTAANLEKATLVRASLAGAK 109
Query: 168 LSDTLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
R+ + + A+ V + L R+D GA + GADF A +
Sbjct: 110 ADKANFSRVEGYRGDFSGISAEGALFVSSELQRADFTGARLTGADFEKAEL 160
>gi|53721218|ref|YP_110203.1| hypothetical protein BPSS0182 [Burkholderia pseudomallei K96243]
gi|167818308|ref|ZP_02449988.1| hypothetical protein Bpse9_24431 [Burkholderia pseudomallei 91]
gi|418395056|ref|ZP_12969100.1| type VI secretion system [Burkholderia pseudomallei 354a]
gi|418554994|ref|ZP_13119746.1| type VI secretion system [Burkholderia pseudomallei 354e]
gi|52211632|emb|CAH37627.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
gi|385369399|gb|EIF74730.1| type VI secretion system [Burkholderia pseudomallei 354e]
gi|385374364|gb|EIF79254.1| type VI secretion system [Burkholderia pseudomallei 354a]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQVADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L+D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILID 802
>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
Length = 831
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R A F A + E+ FSGS+ GA A KANF A +D + +L ANL A
Sbjct: 537 DLRGAKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAADAVFRGAILANANLRAA 596
Query: 187 VLVRTVLTRSDLGGA-----IIEGADFSDAVI 213
+RT DL GA + GADF+ A +
Sbjct: 597 TFLRTNFQNVDLTGADFAFSDLRGADFTGATL 628
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F++ + + A++ +S F G F GA L A K +FT A+L+ L N TNA
Sbjct: 231 FKKTDLSGAELEQSHFGGCDFTGADLSHAKLQKTDFTAANLAGATCVDADLRGTNFTNAD 290
Query: 188 LVR-----TVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
L + L +DL GA + GADF+ A + A+ L
Sbjct: 291 LRKANFRGANLAGADLTGANVAGADFTGANLTGAKVDGL 329
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 38/62 (61%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +A+L A V + R NFT+AD+R+++F G+ GA L A A+FTGA+L+
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTGANLTGAK 325
Query: 173 MD 174
+D
Sbjct: 326 VD 327
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 9/119 (7%)
Query: 52 DCSNNQCAGPYAKLKNWRV----FVSTALAAAVVASCSSNISALADLNKYEAE---TRGE 104
D SN + AG A+L N + F L+ A + ++ AD+ +A R
Sbjct: 522 DLSNEKLAG--ARLNNLDLRGAKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAA 579
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ A +A+LR A ++ NF+ + T AD SD G+ F GA L+ A +A F
Sbjct: 580 DAVFRGAILANANLRAATFLRTNFQNVDLTGADFAFSDLRGADFTGATLKNASFSQAKF 638
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 5/114 (4%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAV 157
G F + F DL A + +F +FT AD+ +++DF+ + GA A
Sbjct: 221 GSFTRATDCTFKKTDLSGAELEQSHFGGCDFTGADLSHAKLQKTDFTAANLAGATCVDAD 280
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NFT ADL L A+LT A + T ++L GA ++G D S A
Sbjct: 281 LRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTGANLTGAKVDGLDASKA 334
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 42/94 (44%), Gaps = 4/94 (4%)
Query: 93 DLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
DL+ E E + FG F ADL A K +F AN A ++D G+ F A
Sbjct: 235 DLSGAELE-QSHFG---GCDFTGADLSHAKLQKTDFTAANLAGATCVDADLRGTNFTNAD 290
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
L KA AN GADL+ + ANLT A
Sbjct: 291 LRKANFRGANLAGADLTGANVAGADFTGANLTGA 324
>gi|126442493|ref|YP_001061349.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
668]
gi|126221984|gb|ABN85489.1| pentapeptide repeat protein [Burkholderia pseudomallei 668]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|17228308|ref|NP_484856.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
7120]
gi|535436|gb|AAB59979.1| HglK [Nostoc sp. PCC 7120]
gi|17130158|dbj|BAB72770.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
7120]
gi|1585247|prf||2124368C hglK gene
Length = 727
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 47/103 (45%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L + + + AN T D + SD SG+ +AN + ADLS
Sbjct: 581 ANLYGARLSRVIAIGAQLSFANLTKTDWQSSDLSGADLE----------RANLSNADLSA 630
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
T M +L A L NA L L+ DL GA + GADF D ++
Sbjct: 631 TRMTGAILRSAQLENANLRNADLSLVDLRGANVAGADFKDTIL 673
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)
Query: 133 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 173
F SA++ ++ F GS+F A L +A +ANFT A+LS LM
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRWDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528
Query: 174 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 211
R LN ANL+NA L+ L +DL G ++E GAD DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 24/137 (17%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFS 144
A+ADL++ + + A F A+L + + + + RAN ++A + ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549
Query: 145 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 194
++ GA L V A+ TGADL D + R++ A L+ A L +T
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609
Query: 195 RSDLGGAIIEGADFSDA 211
SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626
>gi|428768931|ref|YP_007160721.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428683210|gb|AFZ52677.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 320
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+ F A+L+ K NF ANFT A++ +D SG GA +A AN G DL +
Sbjct: 115 SDFSYANLQNCKLTKANFMGANFTRANLSGADLSGVNLTGADFTRADLSGANLQGCDLEE 174
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L+++ L NA L ++L +L A + GA+FS AV+
Sbjct: 175 ANLRCADLSKSILRNADLSESILQGVNLENANLRGANFSGAVL 217
>gi|73668253|ref|YP_304268.1| hypothetical protein Mbar_A0710 [Methanosarcina barkeri str.
Fusaro]
gi|72395415|gb|AAZ69688.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 381
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 62/117 (52%), Gaps = 1/117 (0%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADL K N + +F D+ +++ + GA LE+A +AN GA+L +
Sbjct: 152 ADFQGADLEKVNLQGTNLKETSFKRTDLEKTNLQEADLQGADLEEANLQRANLQGANLKE 211
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
+ R L +AN+ A L + +++L GA ++ A+F ++ A+ K+A+ + AN
Sbjct: 212 ANLQRTDLRKANIQGADLGKANFEQANLKGANLKKANFEKTNLEEAKLKEAILQGAN 268
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 54/104 (51%), Gaps = 5/104 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L++A + + R+AN AD+ +++F + GA L+KA NF +L +
Sbjct: 202 ANLQGANLKEANLQRTDLRKANIQGADLGKANFEQANLKGANLKKA-----NFEKTNLEE 256
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ +L ANL A L++ L +++L A GA+ A ++
Sbjct: 257 AKLKEAILQGANLIKAKLIKAKLQKANLKSANFNGANLIKAKLE 300
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 165
A F A+L+ A K NF + N A ++E+ G+ K A L+KA ANF G
Sbjct: 232 ANFEQANLKGANLKKANFEKTNLEEAKLKEAILQGANLIKAKLIKAKLQKANLKSANFNG 291
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
A+L ++ L ANL A L R + A ++GA F +A ++ AQ
Sbjct: 292 ANLIKAKLEGANLQRANLKEANFNGADLQRVNFRKANLQGAKFKEANLEGAQ 343
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDT 171
DL+ A +K A+F A++ E++F GS F GA LEKA G +L +
Sbjct: 8 DLQGANFIKTKLEGADFMGANLEEANFIGSNLKGANFKGANLEKANLQATELQGVNLQEA 67
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ R L A L A L R L ++L GA ++ AD +
Sbjct: 68 NLHRAKLQVATLYGADLQRANLQEANLQGANLQRADLQE 106
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 20/121 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG---------------SKFNGAYLEK 155
A A ++ A+ + + + ANF AD++ +DF G + F LEK
Sbjct: 122 ANLEKAKVQGAIFCEADLQEANFQGADLQGADFQGADLEKVNLQGTNLKETSFKRTDLEK 181
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
+A+ GADL + + R ANL A L L R+DL A I+GAD A +
Sbjct: 182 TNLQEADLQGADLEEANLQR-----ANLQGANLKEANLQRTDLRKANIQGADLGKANFEQ 236
Query: 216 A 216
A
Sbjct: 237 A 237
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA--------- 161
A F A+L +A + N + ANF A++ +++ ++ G L++A ++A
Sbjct: 22 ADFMGANLEEANFIGSNLKGANFKGANLEKANLQATELQGVNLQEANLHRAKLQVATLYG 81
Query: 162 ------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
N A+L + R L E NL A L RT L ++L A ++GA F +A
Sbjct: 82 ADLQRANLQEANLQGANLQRADLQEVNLQEANLQRTDLVEANLEKAKVQGAIFCEA 137
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 6/122 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 168
A A+L++A + N + AN D+ E++ +K GA +A +ANF GADL
Sbjct: 92 ANLQGANLQRADLQEVNLQEANLQRTDLVEANLEKAKVQGAIFCEADLQEANFQGADLQG 151
Query: 169 ---SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-ALCKY 224
++++ L NL RT L +++L A ++GAD +A + A Q A K
Sbjct: 152 ADFQGADLEKVNLQGTNLKETSFKRTDLEKTNLQEADLQGADLEEANLQRANLQGANLKE 211
Query: 225 AN 226
AN
Sbjct: 212 AN 213
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 59/122 (48%), Gaps = 3/122 (2%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F DL K + + + A+ A+++ ++ G+ A L++ KAN GADL
Sbjct: 174 FKRTDLEKTNLQEADLQGADLEEANLQRANLQGANLKEANLQRTDLRKANIQGADLGKAN 233
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYA--NGTN 229
++ L ANL A +T L + L AI++GA+ A +I ++A K A NG N
Sbjct: 234 FEQANLKGANLKKANFEKTNLEEAKLKEAILQGANLIKAKLIKAKLQKANLKSANFNGAN 293
Query: 230 PI 231
I
Sbjct: 294 LI 295
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 50/108 (46%), Gaps = 3/108 (2%)
Query: 107 IGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
IGS A F A+L KA + N A++ + + GA L++A +AN
Sbjct: 35 IGSNLKGANFKGANLEKANLQATELQGVNLQEANLHRAKLQVATLYGADLQRANLQEANL 94
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA+L + + L EANL LV L ++ + GAI AD +A
Sbjct: 95 QGANLQRADLQEVNLQEANLQRTDLVEANLEKAKVQGAIFCEADLQEA 142
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA NF +AN A++++++F + A L++A+ AN A L
Sbjct: 222 ANIQGADLGKA-----NFEQANLKGANLKKANFEKTNLEEAKLKEAILQGANLIKAKLIK 276
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L AN A L++ L ++L A ++ A+F+ A
Sbjct: 277 AKLQKANLKSANFNGANLIKAKLEGANLQRANLKEANFNGA 317
>gi|217423045|ref|ZP_03454547.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
gi|217393953|gb|EEC33973.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|167905147|ref|ZP_02492352.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
gi|237508538|ref|ZP_04521253.1| pentapeptide repeat family protein [Burkholderia pseudomallei
MSHR346]
gi|235000743|gb|EEP50167.1| pentapeptide repeat family protein [Burkholderia pseudomallei
MSHR346]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|86606920|ref|YP_475683.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555462|gb|ABD00420.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 154
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 59/128 (46%), Gaps = 16/128 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
S AQ A+LR V R A+ + AD+RE D SG+ +GA L A + N G
Sbjct: 32 SGAQLSGANLRGIV-----LRDADLSGADLREGDLSGADLSGADLRGAKLRRVNLIGAKL 86
Query: 166 --ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALC 222
ADL + R L A+L+ A L R L +DL GAII F A+ D
Sbjct: 87 VKADLRGANLYRAKLLRADLSEADLSRADLRIGADLRGAIITNTRFRGALYD-----EYT 141
Query: 223 KYANGTNP 230
K+ G NP
Sbjct: 142 KFPEGFNP 149
>gi|341583996|ref|YP_004764487.1| hypothetical protein Rh054_04430 [Rickettsia heilongjiangensis 054]
gi|340808221|gb|AEK74809.1| hypothetical protein Rh054_04430 [Rickettsia heilongjiangensis 054]
Length = 959
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMKRADLTKANFTKAVLENADMQAAEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 39.3 bits (90), Expect = 1.8, Method: Composition-based stats.
Identities = 37/143 (25%), Positives = 59/143 (41%), Gaps = 27/143 (18%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A + + ++ + L++ +AE G ++ A+ N + ANF +
Sbjct: 576 LTNATLTNATAQFAKLSNATLEKAEAEG------------LNISDAIAKNINAKEANFKN 623
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
A M+ +D + + F A LE A D+ + EANL A L
Sbjct: 624 AIMKRADLTKANFTKAVLENA----------DMQAAEAAEAIFKEANLKQA-----NLKA 668
Query: 196 SDLGGAIIEGADFSDAVIDLAQK 218
++L G EGADF A I+ A K
Sbjct: 669 ANLAGINKEGADFDKAKINDATK 691
Score = 38.1 bits (87), Expect = 4.1, Method: Composition-based stats.
Identities = 38/144 (26%), Positives = 61/144 (42%), Gaps = 13/144 (9%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
LKN +F S L +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMD 174
+ RA D+ E + + SKFN + A A K +N TG L+ M
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQ 475
Query: 175 RMVLNEANLTNAVLVRTVLTRSDL 198
R+ + L NA+L + + +DL
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTDL 499
>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 739
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 5/112 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+AQ +AD R+A+ A+ T A++ E+ FS S +GA L K A +++F+ ADL
Sbjct: 561 SSAQLINADFRRAI-----LENASLTGANLGEAKFSLSSLHGARLGKVSAVRSDFSSADL 615
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
S + L+ ANL+NA L + L GA + A +A + A A
Sbjct: 616 SQSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKLRYANLSA 667
>gi|428202846|ref|YP_007081435.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980278|gb|AFY77878.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 253
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTG 165
A ADL A ++ N AN A++ E+D S S GAYL +A YKA
Sbjct: 110 ANLRRADLSAAKLIRSNLSEANLVDANLNEADLSQSNLYEAEAIGAYLYRATLYKAKLVE 169
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A LS + L EA+L L LT+++LGGA + A+ S A
Sbjct: 170 AHLSKVYLVGADLREAHLYRTDLRYAHLTKANLGGAHLLEANLSGA 215
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 75/172 (43%), Gaps = 14/172 (8%)
Query: 44 TESDGQFPDCSNNQCAGPYAKLKNWR-------VFVSTALAAAVVASCSSNISALADLNK 96
+E++ D S+ G K N R + + L+ A + + N + L+ N
Sbjct: 88 SEANLSGADLSHANLIGTVLKKANLRRADLSAAKLIRSNLSEANLVDANLNEADLSQSNL 147
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
YEAE G + A L KA V+ + + AD+RE+ + A+L KA
Sbjct: 148 YEAEAIGAY-------LYRATLYKAKLVEAHLSKVYLVGADLREAHLYRTDLRYAHLTKA 200
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+ A+LS + + L ANL A L L ++DL GA ++GA F
Sbjct: 201 NLGGAHLLEANLSGANLRKANLRGANLQGADLRCANLHQADLRGANLQGALF 252
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN +A++ E++ SG+ + A L V KAN ADLS + R L+EANL +A L
Sbjct: 80 ANLIAANLSEANLSGADLSHANLIGTVLKKANLRRADLSAAKLIRSNLSEANLVDANLNE 139
Query: 191 TVLTRSDLGGAIIEGADFSDAVI 213
L++S+L A GA A +
Sbjct: 140 ADLSQSNLYEAEAIGAYLYRATL 162
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 50/93 (53%), Gaps = 2/93 (2%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ N + A ++A++ + SG + A L A +AN +GADLS + VL +ANL
Sbjct: 54 QNNLQNAELSNANLVGVNLSGVDLSDANLIAANLSEANLSGADLSHANLIGTVLKKANLR 113
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
A L L RS+L A + A+ ++A DL+Q
Sbjct: 114 RADLSAAKLIRSNLSEANLVDANLNEA--DLSQ 144
>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 830
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 67/133 (50%), Gaps = 12/133 (9%)
Query: 90 ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
ALADL + +R F S A+ ADLR+ + + +FR A+ AD RE+
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281
Query: 146 SKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
+ F+GA L +A+ + TG A+L+ +++ L+ L +V+ L S+L G
Sbjct: 282 ANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLEGADLSRLALAGVKMVKANLAGSNLYG 341
Query: 201 AIIEGADFSDAVI 213
A + G D +DA +
Sbjct: 342 ADLRGVDLTDASL 354
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 42/80 (52%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N D+R +F+ S+ +G L+ A A+F+ ADL + L EA L +A L R
Sbjct: 163 NLAGLDLRGVNFADSRLHGVNLQGANLRGADFSRADLMHADLSEADLREAKLVDANLARA 222
Query: 192 VLTRSDLGGAIIEGADFSDA 211
L +DLGGA + AD S A
Sbjct: 223 SLALADLGGADLRRADLSRA 242
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRA----------NFTSADMRESDFSGSKFNGAYLEKAVAYK 160
A ADLR+A V N RA + AD+ ++FS ++ A L + + +
Sbjct: 202 ADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQADLRQVLFSE 261
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++F AD L +AN + A L R + + +DL G + + A+ + AV++
Sbjct: 262 SDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLE 315
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 77/180 (42%), Gaps = 38/180 (21%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 129
VF LA AV+ + ALA + +A G G A DL A ++ +
Sbjct: 303 VFQQANLAGAVLEGADLSRLALAGVKMVKANLAGSNLYG--ADLRGVDLTDASLLEADLS 360
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTGADL- 168
A+ A + ++ F+G +GA L AVA +A+FTGA+L
Sbjct: 361 AADLAGARLDKAVFAGGTLHGARLLSAVARNADFRAANLTRVAAQQADFSQADFTGANLT 420
Query: 169 ----SDTLMDRMVLNEANLTNAVL-----------VRTVLTRSDLGGAIIEGADFSDAVI 213
S+ +M L EANLTNA L +R LT + L A + GAD S+A++
Sbjct: 421 AAVFSEAIMAGAKLLEANLTNANLDGADLTSRVSMIRGNLTNASLQKADLHGADLSNAIV 480
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 58/126 (46%), Gaps = 26/126 (20%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR---------------ESDFSGSKFNGAYLEK 155
A F +A+L + + +F +A+FT A++ E++ + + +GA L
Sbjct: 392 ADFRAANLTRVAAQQADFSQADFTGANLTAAVFSEAIMAGAKLLEANLTNANLDGADLTS 451
Query: 156 AVAY-----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
V+ KA+ GADLS+ ++ VL EANL L L R+DL A I
Sbjct: 452 RVSMIRGNLTNASLQKADLHGADLSNAIVTGAVLREANLRRVRLSHASLNRADLSWATIV 511
Query: 205 GADFSD 210
AD S+
Sbjct: 512 DADLSN 517
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 39/82 (47%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR A V N R A+ AD+ +D + A L ++ N T ADLS ++
Sbjct: 554 DLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDLRWVNLTDADLSGAILSGA 613
Query: 177 VLNEANLTNAVLVRTVLTRSDL 198
LN+A+ AV LTR+ L
Sbjct: 614 SLNDADFNRAVFAEANLTRASL 635
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+F RA+ AD+ E+D +K A L +A A+ GADL + R ++A L A
Sbjct: 193 DFSRADLMHADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQA 252
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
L + + + SD A ADF +A +
Sbjct: 253 DLRQVLFSESDFRHADARRADFREATL 279
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 15/78 (19%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADLR+A N RAN + +D+R + + + +GA L +GA L+D
Sbjct: 573 ADLSNADLRQA-----NLARANLSRSDLRWVNLTDADLSGAIL----------SGASLND 617
Query: 171 TLMDRMVLNEANLTNAVL 188
+R V EANLT A L
Sbjct: 618 ADFNRAVFAEANLTRASL 635
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 63/130 (48%), Gaps = 17/130 (13%)
Query: 118 LRKAV-----HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
LR AV ++ + R AN +A++R++D + + + A L +A +AN + +DL
Sbjct: 540 LRSAVSLGGRMIRYDLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDL---- 595
Query: 173 MDRMV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA----VIDLAQKQALCKYANG 227
R V L +A+L+ A+L L +D A+ A+ + A V +L K + A G
Sbjct: 596 --RWVNLTDADLSGAILSGASLNDADFNRAVFAEANLTRASLFNVKNL-DKARMLDQAQG 652
Query: 228 TNPITGVSTR 237
P G +R
Sbjct: 653 YEPKAGDDSR 662
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 59/148 (39%), Gaps = 36/148 (24%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA---------- 158
S + F AD R+A + R+ANF+ AD+ + FSG+ G ++A
Sbjct: 260 SESDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLEGADL 319
Query: 159 --------------------YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
Y A+ G DL+D + L+ A+L A L + V L
Sbjct: 320 SRLALAGVKMVKANLAGSNLYGADLRGVDLTDASLLEADLSAADLAGARLDKAVFAGGTL 379
Query: 199 GG-----AIIEGADFSDA-VIDLAQKQA 220
G A+ ADF A + +A +QA
Sbjct: 380 HGARLLSAVARNADFRAANLTRVAAQQA 407
>gi|73670411|ref|YP_306426.1| hypothetical protein Mbar_A2951 [Methanosarcina barkeri str.
Fusaro]
gi|72397573|gb|AAZ71846.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 286
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 53/107 (49%), Gaps = 5/107 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+ + R AN AD+RE++ G+ GA L + N GADL +
Sbjct: 72 ANLEGADLRETNLGGADLREANLGGADLREANLEGADLEGADL-----RETNLGGADLRE 126
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ L EANL A L T L ++L GA +EGA+ A ++ A
Sbjct: 127 ANLGGADLREANLEGADLRETNLLEANLEGASLEGANLKVANLERAN 173
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 50/103 (48%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
G ADLR+A + R AN AD+RE++ + GA LE A AN A+L
Sbjct: 118 NLGGADLREANLGGADLREANLEGADLRETNLLEANLEGASLEGANLKVANLERANLKGV 177
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ L+ A L A LV + L ++ GA +E D + A ++
Sbjct: 178 NLIEAELSWAELKGANLVESYLVGTNFTGANLEWVDLTKANLE 220
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 7/119 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G ADLR+A N A+ AD+RE++ G+ A L A +AN GADL +
Sbjct: 92 ANLGGADLREA-----NLEGADLEGADLRETNLGGADLREANLGGADLREANLEGADLRE 146
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
T + L A+L A L L R++L G + A+ S A +L + Y GTN
Sbjct: 147 TNLLEANLEGASLEGANLKVANLERANLKGVNLIEAELSWA--ELKGANLVESYLVGTN 203
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N RAN ++ E++ S ++ GA L ++ NFTGA+L + + L +ANL A
Sbjct: 168 NLERANLKGVNLIEAELSWAELKGANLVESYLVGTNFTGANL-----EWVDLTKANLEEA 222
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
+ L +++ GA I+GA+ +A
Sbjct: 223 IFTWADLEGANISGANIKGANLKEA 247
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 48/96 (50%), Gaps = 15/96 (15%)
Query: 125 KENFRRANFTS-----ADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMD 174
++N +++NF A+++E F G GA LEKA AN GADL +T +
Sbjct: 26 EDNLKKSNFIGTCLIGANLKELSFEGVNLREANLLGANLEKANLLGANLEGADLRETNLG 85
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L EANL A L ++L GA +EGAD +
Sbjct: 86 GADLREANLGGADL-----REANLEGADLEGADLRE 116
>gi|307150734|ref|YP_003886118.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306980962|gb|ADN12843.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 231
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 15/105 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F +D R + K NF A F AD E+ G+ F+ A LEKA+ + + +GA
Sbjct: 33 SRADFSYSDFRSSRLGKTNFSAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGA-- 90
Query: 169 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADF 208
+L EANLT L++ L+ + L GAI+ ADF
Sbjct: 91 --------ILTEANLTQVNLIKATLGGANLSLAQLPGAIVYEADF 127
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 2/117 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA F AD +A+ +F +AN A +RE D SG+ A L + KA GA+L
Sbjct: 53 SAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGAILTEANLTQVNLIKATLGGANL 112
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCK 223
S + ++ EA+ RT LT+++L A + A + A + AQ LC+
Sbjct: 113 SLAQLPGAIVYEADFRPTSEQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCR 169
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 18/121 (14%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A A L A+ + +FR R N T A++ ++ S +K NGA L +A A
Sbjct: 110 ANLSLAQLPGAIVYEADFRPTSEQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCR 169
Query: 166 ADLSDTLMDRMV---LNEANLTNA----------VLVRTVLTRSDLGGAIIEGADFSDAV 212
ADLS + + L+EANL NA +L LT +DL G I+ D + A+
Sbjct: 170 ADLSKGIWQNCLPTDLSEANLQNADLSYADLSGAILCYADLTGADLTGTILTNVDLTGAI 229
Query: 213 I 213
+
Sbjct: 230 L 230
>gi|218442709|ref|YP_002381029.1| hypothetical protein PCC7424_5734 [Cyanothece sp. PCC 7424]
gi|218175067|gb|ACK73799.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 266
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 66/143 (46%), Gaps = 26/143 (18%)
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
AV K N A +A+++ +D G+ GAYL A + TGA+L D + L
Sbjct: 125 AVGPKANLNGAFLNTANLKNADLKGANLRGAYLSGA-----DLTGANLEDAALSGANLQG 179
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID-----------LAQ------KQALCK 223
A LT A L + L ++L GA + AD +DA ++ LAQ K LC
Sbjct: 180 ALLTGAYLRKARLIGAELQGADLRAADLTDANLEQLQNLAGADFTLAQGLTEDTKAMLCS 239
Query: 224 YAN---GT-NPITGVSTRKSLGC 242
GT NP T +T +SLGC
Sbjct: 240 RPAQELGTWNPFTRSNTAQSLGC 262
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 55/113 (48%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+L KA+ +N +AN ++ + D S + + A L A + N GA+L + +
Sbjct: 7 ELTKALSEGKNLAKANLQGINLAQMDLSNADLSAANLIGANLSETNLKGANLEGADLRGV 66
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
L++ANL A L + L RS+L G ++ A A I LA+ + + G N
Sbjct: 67 NLSKANLEGANLQNSYLFRSNLEGCCLKEAQLQGAKIQLARYDSYTVWPEGYN 119
>gi|416394625|ref|ZP_11686208.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
gi|357263221|gb|EHJ12255.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
Length = 164
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
E+ R A+ +D+R+++ G+ A LE A AN GA+L+ ++ LN++NL
Sbjct: 62 NEDLRYAHLIGSDLRDANLEGAILIEANLEGADLTGANLEGANLTGAMLSNASLNDSNLD 121
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
N L ++ +D+ GA +E D ++A I
Sbjct: 122 NVNLASAIIYDADVTGASMENLDITNAQI 150
>gi|189347104|ref|YP_001943633.1| pentapeptide repeat-containing protein [Chlorobium limicola DSM
245]
gi|189341251|gb|ACD90654.1| pentapeptide repeat protein [Chlorobium limicola DSM 245]
Length = 408
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ SA LR A+ V+ + +A +AD+ ++ + F GA+++ AV KA+ TGAD S
Sbjct: 92 ARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAASFKGAFMQTAVLKKADCTGADFSG 151
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L E N A L +LT +DL + AD S +V+
Sbjct: 152 A-----DLRETNFREARLAGALLTGADLRATYLWRADMSRSVL 189
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 44/78 (56%)
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
R +D SG++ G L A A+ +GADL+ + + + L+ A L +AVL +L R+ L
Sbjct: 50 RVADLSGAQLKGMNLRGADLSYADLSGADLASSDLSKARLDHARLDSAVLRSALLVRASL 109
Query: 199 GGAIIEGADFSDAVIDLA 216
A + AD DAV++ A
Sbjct: 110 DKARLHNADLEDAVLEAA 127
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 1/120 (0%)
Query: 113 FGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
F D+RK E N R+A F ++ +D + ++ GA KA + A+ ADLS
Sbjct: 285 FAWNDMRKRNRAMEVNLRQAKFDQKNLSYADLAHARLQGASFRKADLFDADLRNADLSGC 344
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 231
M L +A+L A L L R++LG A + G S + + K+A K+A + +
Sbjct: 345 DMREANLEKADLGGADLSGVNLWRANLGRARLNGVKVSASTVLDTGKKADQKWAERHDAV 404
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 52/126 (41%), Gaps = 20/126 (15%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N Y A G S AQ +LR A + A+ + AD+ SD S ++ + A L+
Sbjct: 41 NSYRAGLGGRVADLSGAQLKGMNLRGA-----DLSYADLSGADLASSDLSKARLDHARLD 95
Query: 155 KAVAY----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
AV KA ADL D VL A+ A + VL ++D GA
Sbjct: 96 SAVLRSALLVRASLDKARLHNADLEDA-----VLEAASFKGAFMQTAVLKKADCTGADFS 150
Query: 205 GADFSD 210
GAD +
Sbjct: 151 GADLRE 156
>gi|254189534|ref|ZP_04896044.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
gi|157937212|gb|EDO92882.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
Length = 825
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQVADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSDADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|254486622|ref|ZP_05099827.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
gi|214043491|gb|EEB84129.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
Length = 200
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 18/112 (16%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
G ADL A + A T A++ S+ SG+ GAYLE A A TGADL+
Sbjct: 94 NLGGADLSGA-----DLTGAVLTQANLEMSNLSGATLTGAYLELANLAGARVTGADLT-- 146
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ANLT+A L VL + L GA++ GAD A + + LCK
Sbjct: 147 --------KANLTSANLRGAVLLEAKLVGAVLLGADLDGASL---EGAILCK 187
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 58/113 (51%), Gaps = 16/113 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADLR+ + K ++ + + +D++ D +G+ GA L A + A+ + ADLS
Sbjct: 4 AAFDEADLRQLLDTKV-CQKCDLSGSDLKGVDLAGANLAGANLSGAKLWAADLSKADLSG 62
Query: 171 TLMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ L +ANL+ A L T ++LGGA + GAD + AV+
Sbjct: 63 VNLEAATLTAANLAGANLADANLSGAYL-----TTTNLGGADLSGADLTGAVL 110
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 52/123 (42%), Gaps = 20/123 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG--------------------SKF 148
S + DL A N A +AD+ ++D SG +
Sbjct: 26 SGSDLKGVDLAGANLAGANLSGAKLWAADLSKADLSGVNLEAATLTAANLAGANLADANL 85
Query: 149 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+GAYL A+ +GADL+ ++ + L +NL+ A L L ++L GA + GAD
Sbjct: 86 SGAYLTTTNLGGADLSGADLTGAVLTQANLEMSNLSGATLTGAYLELANLAGARVTGADL 145
Query: 209 SDA 211
+ A
Sbjct: 146 TKA 148
>gi|428320809|ref|YP_007118691.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244489|gb|AFZ10275.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 290
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 62/120 (51%), Gaps = 9/120 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
L +YE R +F + A A+L A+ V N RAN + A++ + + ++ NGA L
Sbjct: 7 LKEYENGNR-DF---ADANLSGANLSGAILVGVNLSRANLSGANLSRAHLTKAELNGANL 62
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
Y+AN + A + + L +ANL+ A LV+ L R+ L GA + G++ A++
Sbjct: 63 -----YRANLSFAKMGQARLADAELTKANLSGAFLVKAKLPRAKLSGAQLIGSNLRSAIL 117
>gi|334118359|ref|ZP_08492448.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459366|gb|EGK87979.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 280
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR+ + NFR N AD++ + S + F + + A AN TGA+L + + +
Sbjct: 7 LRQYAAGERNFREINLVGADLKGVNLSEANFTRSNFQDANLKGANLTGANLREVKLAGVD 66
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L EANL+ A L+ T L+R++L GA + GA+
Sbjct: 67 LTEANLSEANLIGTDLSRANLSGANLMGANL 97
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR ++ + N +AN T A++ E++F+ + A L A + N A+L
Sbjct: 88 SGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEANLFAANLTDASMIRINLMKANL 147
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + L NLTNA+L ++L R++L AI+ GA S A
Sbjct: 148 SWS-----TLKAVNLTNAILSESLLERANLNQAILSGAMLSGA 185
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 65/131 (49%), Gaps = 14/131 (10%)
Query: 111 AQFGSADLRKA----VHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A A+LR+ V + E N AN D+ ++ SG+ GA L ++A + N T
Sbjct: 50 ANLTGANLREVKLAGVDLTEANLSEANLIGTDLSRANLSGANLMGANLRGSMAREVNMTK 109
Query: 166 ADLSDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
A+L++ + EANL T+A ++R L +++L + ++ + ++A++ ++
Sbjct: 110 ANLTEANLTEANFTEANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAIL----SES 165
Query: 221 LCKYANGTNPI 231
L + AN I
Sbjct: 166 LLERANLNQAI 176
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYK 160
A +A+L A ++ N +AN T+A + ES + N A L A+
Sbjct: 125 ANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAILSESLLERANLNQAILSGAMLSG 184
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
AN TGADL M L+EANL+NA L ++ S L A + GA+
Sbjct: 185 ANLTGADLRQVTMVGANLSEANLSNANLRVANVSWSTLAKANLSGANL 232
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 5/95 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLR+ V N AN ++A++R ++ S S A L A Y+A ++L
Sbjct: 183 SGANLTGADLRQVTMVGANLSEANLSNANLRVANVSWSTLAKANLSGANLYRAKLCWSNL 242
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S ++ VL +ANL RT +DL AI+
Sbjct: 243 SGAVLLEAVLIDANLN-----RTNFRDADLRRAIM 272
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 56/102 (54%), Gaps = 7/102 (6%)
Query: 116 ADLRKAVHVKE-NFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLS 169
ADL K V++ E NF R+NF A+++ ++ +G+ K G L +A +AN G DLS
Sbjct: 25 ADL-KGVNLSEANFTRSNFQDANLKGANLTGANLREVKLAGVDLTEANLSEANLIGTDLS 83
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L ANL ++ +T+++L A + A+F++A
Sbjct: 84 RANLSGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEA 125
>gi|262196377|ref|YP_003267586.1| pentapeptide repeat-containing protein [Haliangium ochraceum DSM
14365]
gi|262079724|gb|ACY15693.1| pentapeptide repeat protein [Haliangium ochraceum DSM 14365]
Length = 903
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 59/127 (46%), Gaps = 12/127 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR A F +AN A++ +++F ++F GA L A AN A L + +
Sbjct: 768 ADLRHA-----GFEQANLVQANLIQANFGYARFLGADLRGAQLLGANLQDAKLQNANLQG 822
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 235
L ANL A L L +DL GA + A+ S A AQ K+ +G +P
Sbjct: 823 ANLQGANLQGAKLQNANLQGADLQGADLRAANLSAANFLGAQYSTETKWPDGVDP----- 877
Query: 236 TRKSLGC 242
++LGC
Sbjct: 878 --EALGC 882
Score = 42.0 bits (97), Expect = 0.33, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 42/85 (49%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ RA AD+ +D +G+ + A+LE+A +ANF A L + + L A A
Sbjct: 719 DLARAYLAGADLAGADLAGADLSLAHLERASLERANFRSAKLLYSNLRYADLRHAGFEQA 778
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
LV+ L +++ G A GAD A
Sbjct: 779 NLVQANLIQANFGYARFLGADLRGA 803
>gi|428313290|ref|YP_007124267.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254902|gb|AFZ20861.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 283
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
++ + AN ++R + G+ A L+ + AN TGA+LS + LNEANL
Sbjct: 27 LEPDLSEANLIGVNLRGAHLQGTNLRKALLDHTLLIAANLTGANLSQANLSHASLNEANL 86
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADF 208
A L+ T L +DL A + GA+
Sbjct: 87 VEACLIDTTLISADLSHAELTGANL 111
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 58/113 (51%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQ +L +A V+ N N ++A++ E++ G+ YL KA KAN + A L
Sbjct: 147 SGAQLLRTNLSEAKLVQANLSHTNLSNANLHEAELIGT-----YLYKAELQKANLSEAHL 201
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDAVIDLA 216
S + R L EA+L A L L+RS DL GA + GA+ S A ++ A
Sbjct: 202 SGAYLSRANLREADLERADLRWANLSRSNLCEADLKGANLRGANLSKANLERA 254
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 18/131 (13%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEF-------------GIGSAAQFGSAD 117
+ T L+ A + + + + L++ N +EAE G + S A A+
Sbjct: 151 LLRTNLSEAKLVQANLSHTNLSNANLHEAELIGTYLYKAELQKANLSEAHLSGAYLSRAN 210
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR+A + + R AN + +++ E+D G+ GA L KA +A+ GA+L T
Sbjct: 211 LREADLERADLRWANLSRSNLCEADLKGANLRGANLSKANLERADLRGANLRGT-----N 265
Query: 178 LNEANLTNAVL 188
LN+ANL A++
Sbjct: 266 LNKANLQGAMM 276
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 55/114 (48%), Gaps = 10/114 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L +A V+ SAD+ ++ +G+ GA L Y AN G DL
Sbjct: 72 SQANLSHASLNEANLVEACLIDTTLISADLSHAELTGANLIGADL-----YGANLKGVDL 126
Query: 169 SD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
SD T + R+ L A+L+ A L+RT L+ + L A + + S+A + A+
Sbjct: 127 SDANLIGTNLRRVNLQGADLSGAQLLRTNLSEAKLVQANLSHTNLSNANLHEAE 180
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 45/101 (44%), Gaps = 15/101 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +LRKA+ AN T A++ +++ S + N A L +A ADLS
Sbjct: 44 AHLQGTNLRKALLDHTLLIAANLTGANLSQANLSHASLNEANLVEACLIDTTLISADLS- 102
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A LT A L+ +DL GA ++G D SDA
Sbjct: 103 ---------HAELTGANLI-----GADLYGANLKGVDLSDA 129
>gi|428226949|ref|YP_007111046.1| hypothetical protein GEI7407_3527 [Geitlerinema sp. PCC 7407]
gi|427986850|gb|AFY67994.1| Tetratricopeptide TPR_1 repeat-containing protein [Geitlerinema sp.
PCC 7407]
Length = 575
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 69/148 (46%), Gaps = 1/148 (0%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKE 126
WR + AL A + + ++ + K ETR S +G A+L
Sbjct: 15 WRSLAALALVVAPMVGTDAALAEKPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNS 74
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N A+F SAD++ +DFS + A LE+A +A+F A L+ + L+ ANL+N+
Sbjct: 75 NLENADFESADLQRTDFSSANLRRADLERADLERADFQSAILNGADLSNSDLSYANLSNS 134
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVID 214
L L+ SDL GA GA+ A D
Sbjct: 135 DLSYADLSGSDLDGANFWGANLFQANFD 162
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 43/97 (44%), Gaps = 5/97 (5%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
K H K+ S D+ D+ + +G L + A+F ADL R +
Sbjct: 38 KPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNSNLENADFESADLQ-----RTDFS 92
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
ANL A L R L R+D AI+ GAD S++ + A
Sbjct: 93 SANLRRADLERADLERADFQSAILNGADLSNSDLSYA 129
>gi|282898833|ref|ZP_06306820.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281196360|gb|EFA71270.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 189
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 45/91 (49%), Gaps = 2/91 (2%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N R N D +D S S G L A +AN GA L + + ++L A+LT A
Sbjct: 26 NLRGVNLGGIDFARADLSWSDLTGISLSGANLSQANLRGAKLENAHLSEVILCGADLTQA 85
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+L+ L SDL GA++ A+ DA DL Q
Sbjct: 86 ILINAHLNESDLSGALLVDANLCDA--DLHQ 114
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 10/98 (10%)
Query: 111 AQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A +DL A+ V N +A+ T+A+++ + +G+K G + KA A+ TG
Sbjct: 90 AHLNESDLSGALLVDANLCDADLHQASITAANLQSAKLNGAKMGGVRMWKADLQGADLTG 149
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
ADLS+ M + L+ ANL+ + T LT GAI+
Sbjct: 150 ADLSEANMCGVNLSMANLSATDMSETFLT-----GAIM 182
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 63/135 (46%), Gaps = 21/135 (15%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVK-ENFRRANFTSADMRESDFSG------- 145
LN+Y RGE F LR AV+++ N +F AD+ SD +G
Sbjct: 7 LNRY---ARGE------RNFNGICLR-AVNLRGVNLGGIDFARADLSWSDLTGISLSGAN 56
Query: 146 ---SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
+ GA LE A + GADL+ ++ LNE++L+ A+LV L +DL A
Sbjct: 57 LSQANLRGAKLENAHLSEVILCGADLTQAILINAHLNESDLSGALLVDANLCDADLHQAS 116
Query: 203 IEGADFSDAVIDLAQ 217
I A+ A ++ A+
Sbjct: 117 ITAANLQSAKLNGAK 131
>gi|154251684|ref|YP_001412508.1| pentapeptide repeat-containing protein [Parvibaculum
lavamentivorans DS-1]
gi|154155634|gb|ABS62851.1| pentapeptide repeat protein [Parvibaculum lavamentivorans DS-1]
Length = 363
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 10/93 (10%)
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
+RA+FT D+ DFS + GA+ +A+ ANF ++ +L A+ +NA+L
Sbjct: 272 QRADFTRMDLSRKDFSRAVLAGAHFREAILADANF----------EKAILAAADFSNAIL 321
Query: 189 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 221
R L +DL GA + GAD +A D +K L
Sbjct: 322 FRANLAGADLRGADLRGADLKNARQDDTKKGEL 354
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 37/77 (48%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 197
M+E D SG F +FTG DL D L AN +A L RT +R+D
Sbjct: 62 MKECDLSGLDFRNLNFSHGHFIGCDFTGCDLEDAHFSGANLFSANFDHANLTRTNFSRAD 121
Query: 198 LGGAIIEGADFSDAVID 214
L GA E A+ +DA +D
Sbjct: 122 LRGANFEDAEMADAQLD 138
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 16/104 (15%)
Query: 111 AQFGSADLRKAVHVK---------EN--FRRANFTSADMRE-----SDFSGSKFNGAYLE 154
AQ ADLR+ ++ EN FR A +M E +DF G+ +GA L+
Sbjct: 135 AQLDGADLRRGAVIRRGASAPVGRENSSFRGARMYGTNMAECKLLDADFEGASISGASLQ 194
Query: 155 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
A ANF GA+L + L +A+ AV+ + R D+
Sbjct: 195 GADLRGANFAGAELKGVELSGANLADADFRRAVMDEATIARGDM 238
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+FR NF+ DF+G A+ A + ANF A+L+ T R L AN +A
Sbjct: 71 DFRNLNFSHGHFIGCDFTGCDLEDAHFSGANLFSANFDHANLTRTNFSRADLRGANFEDA 130
Query: 187 VLVRTVLTRSDL-GGAII 203
+ L +DL GA+I
Sbjct: 131 EMADAQLDGADLRRGAVI 148
>gi|94263119|ref|ZP_01286937.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93456490|gb|EAT06604.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 355
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 113 FGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYKAN 162
F D R A + F++ +F T D+R+ + G+ F GA L K + AN
Sbjct: 41 FKGVDFRGAKITRTGFKKCSFAGARFDETDLTMVDLRQLELPGASFKGARLHKTLLGGAN 100
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
G D S + +L EA+L+ A + RS L A E ADFS+AV+
Sbjct: 101 LAGCDFSQARIFWSLLQEADLSRASFRQAEFERSILQDANCEEADFSEAVL 151
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 10/104 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTG 165
ADL +A + F R+ A+ E+DFS S+ G L +A +K +G
Sbjct: 119 ADLSRASFRQAEFERSILQDANCEEADFSEAVLFKTILLNSRLKGINLRQAKMHKVLLSG 178
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
DL+ M E N NA L +R+D+ G + GAD S
Sbjct: 179 CDLAGQDFSDMRFREVNFANAKLGGADFSRADISGCVFTGADLS 222
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+F RA+ + +D S S+ +G +++ N GADL + + L E+NL A
Sbjct: 205 DFSRADISGCVFTGADLSASRLSGVIARQSMFAGTNLQGADLEGAGLVQAYLGESNLEGA 264
Query: 187 VLVRTVLTRSDLGGAIIEGADFS 209
LV L + L A GADF+
Sbjct: 265 SLVGANLESASLEKARAMGADFT 287
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 5/105 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFTG 165
A F A L K + N +F+ A ++E+D S + F A E+++ AN
Sbjct: 84 ASFKGARLHKTLLGGANLAGCDFSQARIFWSLLQEADLSRASFRQAEFERSILQDANCEE 143
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
AD S+ ++ + +L + L L + + + L G + G DFSD
Sbjct: 144 ADFSEAVLFKTILLNSRLKGINLRQAKMHKVLLSGCDLAGQDFSD 188
>gi|304404631|ref|ZP_07386292.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
gi|304346438|gb|EFM12271.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
Length = 288
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%)
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
+ F + + SDFSG+ G+ + + +ANF GA+L+D + L +A+ ++L
Sbjct: 99 HKGQFKGSALHGSDFSGADLTGSSFKGSDVREANFDGANLTDCSFTALDLTKASFNKSIL 158
Query: 189 VRTVLTRSDLGGAIIEGADFSDAVIDL 215
VRT ++S L GA +G +D V+ L
Sbjct: 159 VRTNFSKSGLDGAAFKGVKLTDVVLTL 185
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 9/110 (8%)
Query: 102 RGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
+G+F GSA + F ADL + + R ANF A++ + F+ A K++
Sbjct: 100 KGQFK-GSALHGSDFSGADLTGSSFKGSDVREANFDGANLTDCSFTALDLTKASFNKSIL 158
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ NF+ + L D LT+ VL T L ++ G + +G DF
Sbjct: 159 VRTNFSKSGL-----DGAAFKGVKLTDVVLTLTDLRKTSFEGCLFDGVDF 203
>gi|427709341|ref|YP_007051718.1| endoribonuclease L-PSP [Nostoc sp. PCC 7107]
gi|427361846|gb|AFY44568.1| endoribonuclease L-PSP [Nostoc sp. PCC 7107]
Length = 433
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 60/138 (43%), Gaps = 35/138 (25%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFS----------GSKFNGAYL 153
S A S+DL A V N +AN + AD+ ESDF+ G+K
Sbjct: 133 SGANLSSSDLSVASLVGANLNKANLSKADLGDAYLMESDFTLANLTEATLIGAKLQNVKF 192
Query: 154 EKAVAYKANFTGADLSDT--------------------LMDRMVLNEANLTNAVLVRTVL 193
+A Y+ N +G +L+D ++R+ L ANLTNA L L
Sbjct: 193 HRANLYQVNLSGMNLTDVDFTAASLQSTNLIKSRLQGANLERVNLRGANLTNANLDGANL 252
Query: 194 TRSDLGGAIIEGADFSDA 211
R+DL GA I GA F DA
Sbjct: 253 RRADLTGADIYGASFIDA 270
>gi|428306403|ref|YP_007143228.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247938|gb|AFZ13718.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 276
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 62/127 (48%), Gaps = 8/127 (6%)
Query: 95 NKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N YEA G G++ +F A L+ + + NF+ A + S+ + GA
Sbjct: 146 NLYEARLSGALLSGASLNGVKFSRAFLKDVDLNGADLQGINFSEARLGGSNLESANLVGA 205
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGA 206
L A Y+ N T ADLS + R L +ANLT A L + LT + L GA ++GA
Sbjct: 206 DLSDAHLYQVNLTAADLSGANLIRASLEQANLTWINLSKANLCQANLTNAILKGANLDGA 265
Query: 207 DFSDAVI 213
D +DA++
Sbjct: 266 DLTDAIL 272
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SA+LR A + N A+ A++RE++ SG N A L +A A +GA L
Sbjct: 103 SGANLNSANLRGANLREANLSSASLQRANLREANLSGVNLNWANLYEARLSGALLSGASL 162
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 211
+ R L + +L A L + + LGG+ +E GAD SDA
Sbjct: 163 NGVKFSRAFLKDVDLNGADLQGINFSEARLGGSNLESANLVGADLSDA 210
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVA 158
S A+ A+L + V +K + RAN A++ ++D + + GA +EKA
Sbjct: 38 SGAKLMGANLSRTVMIKSDLSRANLNWANLSFAKMSAVKLGDADLTKANLQGAVMEKAKL 97
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+A +GA+L+ + L EANL++A L R L ++L G + A+ +A
Sbjct: 98 PRAKLSGANLNSANLRGANLREANLSSASLQRANLREANLSGVNLNWANLYEA 150
>gi|186681457|ref|YP_001864653.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186463909|gb|ACC79710.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 539
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SADLR+A + N R AN + + +R + +G+ A L + + + +GA+L
Sbjct: 133 SEADLTSADLREATLRQANLRHANLSESVLRGASMTGANLEMANLNASDLSRCDLSGANL 192
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
DT + + L+ ANL+ A L L +DL GA + AD S A
Sbjct: 193 RDTELRQANLSHANLSGADLSGANLRWADLSGANLRWADLSGA 235
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 58/119 (48%), Gaps = 1/119 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +LR+A N A+ + A++R +D SG+ A L A A GADL
Sbjct: 188 SGANLRDTELRQANLSHANLSGADLSGANLRWADLSGANLRWADLSGAKLSGATLIGADL 247
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKYAN 226
++ + + A+LT A L+R +DL GA + GA ++ + L + +C++ +
Sbjct: 248 TNANLTNTIFIHADLTQAKLIRAEWIGADLTGATLTGAKLYATSRFGLKTEGMICEWVD 306
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 15/92 (16%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 186
NF+ D+ E++ SG K +G L A N +GA+LS+ + LN A NL+NA
Sbjct: 31 NFSGIDLAEANLSGVKLSGVNLSDANLSIVNLSGANLSEANLSNAKLNVARLSGVNLSNA 90
Query: 187 V----------LVRTVLTRSDLGGAIIEGADF 208
+ L+R L+R+ L GA++ A+
Sbjct: 91 ILNNASLNVANLIRADLSRAQLKGALLIRAEL 122
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N N + A++ E++ S +K N A L A A L+ + R L+ A L A
Sbjct: 56 NLSIVNLSGANLSEANLSNAKLNVARLSGVNLSNAILNNASLNVANLIRADLSRAQLKGA 115
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ---KQALCKYAN 226
+L+R L R+DL A + AD + A DL + +QA ++AN
Sbjct: 116 LLIRAELIRADLSRADLSEADLTSA--DLREATLRQANLRHAN 156
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 49/101 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A+ +L A+ + AN AD+ + G+ A L +A +A+ + ADL
Sbjct: 78 NVARLSGVNLSNAILNNASLNVANLIRADLSRAQLKGALLIRAELIRADLSRADLSEADL 137
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ + L +ANL +A L +VL + + GA +E A+ +
Sbjct: 138 TSADLREATLRQANLRHANLSESVLRGASMTGANLEMANLN 178
>gi|428307960|ref|YP_007144785.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428249495|gb|AFZ15275.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 201
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 54/100 (54%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A G ADL +A + N AN A + +D +G+ F A L A ++AN +GA+LS
Sbjct: 95 ANLGGADLIEADLFEANLTGANLIGAKLIGADLTGANFREANLMGADLFEANLSGANLSG 154
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
+ L ANL+ A L+ L+R L GA I+GA+ ++
Sbjct: 155 ANLSGANLTLANLSGANLMGVDLSRVTLMGASIDGANLNN 194
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGA 166
F ADL KA+ + N AN AD+ E++FS + GA L +A ++AN TGA
Sbjct: 56 NFSKADLSKAILMGANLMGANLCEADIMEANFSKANLCEANLGGADLIEADLFEANLTGA 115
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+L + L AN A L+ L ++L GA + GA+ S A + LA
Sbjct: 116 NLIGAKLIGADLTGANFREANLMGADLFEANLSGANLSGANLSGANLTLAN 166
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F A+L A K N NF+ AD+ ++ G+ GA L +A +ANF+ A+L
Sbjct: 33 SGANFSKANLSGAHFSKANLIGVNFSKADLSKAILMGANLMGANLCEADIMEANFSKANL 92
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L EA+L A L L + L GA + GA+F +A
Sbjct: 93 CEANLGGADLIEADLFEANLTGANLIGAKLIGADLTGANFREA 135
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 59/125 (47%), Gaps = 23/125 (18%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKF 148
L +YEA R +F + DL +A + N ANF+ A++ + FS G F
Sbjct: 7 LARYEAGER---------EFHNCDLIEANLIGANLSGANFSKANLSGAHFSKANLIGVNF 57
Query: 149 NGAYLEKAVAYKANFTGADLSDTLMDRMVLN-------EANLTNAVLVRTVLTRSDLGGA 201
+ A L KA+ AN GA+L + D M N EANL A L+ L ++L GA
Sbjct: 58 SKADLSKAILMGANLMGANLCEA--DIMEANFSKANLCEANLGGADLIEADLFEANLTGA 115
Query: 202 IIEGA 206
+ GA
Sbjct: 116 NLIGA 120
>gi|428216569|ref|YP_007101034.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988351|gb|AFY68606.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 330
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 81/177 (45%), Gaps = 18/177 (10%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQ 112
NQ AKL + + AL A + + + L D N A+ R G+ A
Sbjct: 73 NQAHLSEAKLNDVDLH-GAALVGATLVNADLTFAVLIDANLMNADLRSANLSGANLAGAC 131
Query: 113 FGSADLRKAVH-----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
A LR+A N RRA+ AD+ E++ +G+ GA L +A + TGA+
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRGADLSEANLAGADLRGADLSEANLANTDLTGAN 191
Query: 168 LSDTLMDRMVLNEANLTNAVL-------VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L++ +M L EANLT A L VRT R++L A ++G + AV+ +A
Sbjct: 192 LAEAIMRGTGLTEANLTGANLANAYMQNVRT--ERANLSEADLQGTNLDLAVMSMAN 246
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 77/163 (47%), Gaps = 12/163 (7%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRG----EFGIGSAAQFGSADLRKAVHVKENFRRA 131
L A + S NI++L + N A+ RG E + A G ADL +A + A
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRGADLSEANLAGADLRG-ADLSEANLANTDLTGA 190
Query: 132 NFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N A MR E++ +G+ AY++ +AN + ADL T +D V++ ANL+ +
Sbjct: 191 NLAEAIMRGTGLTEANLTGANLANAYMQNVRTERANLSEADLQGTNLDLAVMSMANLSKS 250
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 229
L L R++L G + + S A +L + Q + Y TN
Sbjct: 251 NLSEASLYRANLNGTDLSRTNLSGA--NLREAQLVESYMARTN 291
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 7/122 (5%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFN 149
N EA RG G+ + A A+L A RAN + AD++ ++ S + +
Sbjct: 191 NLAEAIMRGT-GL-TEANLTGANLANAYMQNVRTERANLSEADLQGTNLDLAVMSMANLS 248
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L +A Y+AN G DLS T + L EA L + + RT LT +DL A++ A+ S
Sbjct: 249 KSNLSEASLYRANLNGTDLSRTNLSGANLREAQLVESYMARTNLTNADLADALLARAELS 308
Query: 210 DA 211
A
Sbjct: 309 SA 310
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 8/97 (8%)
Query: 95 NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N EA+ +G + + S A ++L +A + RAN D+ ++ SG+ A
Sbjct: 226 NLSEADLQGTNLDLAVMSMANLSKSNLSEA-----SLYRANLNGTDLSRTNLSGANLREA 280
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
L ++ + N T ADL+D L+ R L+ ANL NA L
Sbjct: 281 QLVESYMARTNLTNADLADALLARAELSSANLLNANL 317
>gi|452964739|gb|EME69773.1| serine/threonine protein kinase [Magnetospirillum sp. SO-1]
Length = 137
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 46/89 (51%), Gaps = 15/89 (16%)
Query: 141 SDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLNEAN-----LTN 185
SDFSGS N A L +AV ANF GA DL++ R VLN AN L
Sbjct: 8 SDFSGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKG 67
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A+L V+ +DL A +EGAD A+I+
Sbjct: 68 AILAGAVMNNADLSCATLEGADLRGAIIN 96
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 59/111 (53%), Gaps = 1/111 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + +ADLR+AV + NF A A + ++D + ++F + L A + A GA L
Sbjct: 11 SGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKGAIL 70
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+ +M+ L+ A L A L ++ +DL GA + GAD + A ++L + Q
Sbjct: 71 AGAVMNNADLSCATLEGADLRGAIINNADLSGADLRGADLTGA-LNLTRDQ 120
>gi|428225932|ref|YP_007110029.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427985833|gb|AFY66977.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 180
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + SA LR + N R AN AD+++S+ G+ A L A +A+ GADL
Sbjct: 66 SKSNLYSAKLRGSDLGLANLREANLGDADLKQSNLRGADLRNANLLGASLIEADLRGADL 125
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
D ANLTNA L L +++L GA++ G F AV+
Sbjct: 126 RD----------ANLTNANLDGADLRQTNLQGAVLTGVSFRGAVL 160
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 24/132 (18%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N ++N SA +R SD + A L A ++N GADL + ANL A
Sbjct: 64 NLSKSNLYSAKLRGSDLGLANLREANLGDADLKQSNLRGADLRN----------ANLLGA 113
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTNPITGVSTRKSLGCGNS 245
L+ +DL GA + A+ ++A +D A +Q + A +TGVS R ++ CG +
Sbjct: 114 SLI-----EADLRGADLRDANLTNANLDGADLRQTNLQGA----VLTGVSFRGAVLCGAT 164
Query: 246 RRNA----YGSP 253
N YG P
Sbjct: 165 MPNGLAARYGCP 176
>gi|428224795|ref|YP_007108892.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984696|gb|AFY65840.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 284
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL +A N RRANFT+A MR + S GA + + Y+A ++LS
Sbjct: 185 ANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRAKLNWSNLS- 243
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
EA+L+ A+L+ T L R++L A ++ A+ S A +
Sbjct: 244 ---------EADLSEAILIDTNLRRANLRDANLQNANLSGATM 277
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 5/117 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 165
A + R+A ++ N +N T + RE++ SGS GA L++ A AN G
Sbjct: 30 ADLRDVNFREAHLIEVNLSGSNLTGVNFREANLSGSNLGGAMLQECNLIGANLLGANLMG 89
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
+DLS + + L+E NL +A L ++ ++L GA + G + + A + A+ C
Sbjct: 90 SDLSGSSLRSANLSEVNLRSANLSDAIVGEANLSGANLYGTNLTGAHLSRARLVETC 146
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 50/94 (53%), Gaps = 10/94 (10%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEAN 182
R AN + ++R ++ S + A L A Y N TGA LS +T ++ +L+ AN
Sbjct: 97 LRSANLSEVNLRSANLSDAIVGEANLSGANLYGTNLTGAHLSRARLVETCLEHAILDNAN 156
Query: 183 LTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 211
L+ +VL LT ++ L GA +EGA+ SDA
Sbjct: 157 LSGSVLNGANLTGARLSQAVLSGASLEGANLSDA 190
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 67/146 (45%), Gaps = 11/146 (7%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANF 163
S + G A L++ + N AN +D+ R ++ S A L A+ +AN
Sbjct: 63 SGSNLGGAMLQECNLIGANLLGANLMGSDLSGSSLRSANLSEVNLRSANLSDAIVGEANL 122
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+GA+L T + L+ A L L +L ++L G+++ GA+ + A + QA+
Sbjct: 123 SGANLYGTNLTGAHLSRARLVETCLEHAILDNANLSGSVLNGANLTGARL----SQAVLS 178
Query: 224 YAN--GTNPITGVSTRKSLGCGNSRR 247
A+ G N TR +LG N RR
Sbjct: 179 GASLEGANLSDADLTRANLGSTNLRR 204
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 68/153 (44%), Gaps = 25/153 (16%)
Query: 103 GEFGIGSAAQFGS----ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
GE + A +G+ A L +A V+ A +A++ S +G+ GA L +AV
Sbjct: 118 GEANLSGANLYGTNLTGAHLSRARLVETCLEHAILDNANLSGSVLNGANLTGARLSQAVL 177
Query: 159 YKANFTGADLSDTLMDR-----MVLNEANLTN---------------AVLVRTVLTRSDL 198
A+ GA+LSD + R L AN TN A ++R L R+ L
Sbjct: 178 SGASLEGANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRAKL 237
Query: 199 GGAIIEGADFSDAV-IDLAQKQALCKYANGTNP 230
+ + AD S+A+ ID ++A + AN N
Sbjct: 238 NWSNLSEADLSEAILIDTNLRRANLRDANLQNA 270
>gi|300868761|ref|ZP_07113372.1| hypothetical protein OSCI_3800094 [Oscillatoria sp. PCC 6506]
gi|300333322|emb|CBN58564.1| hypothetical protein OSCI_3800094 [Oscillatoria sp. PCC 6506]
Length = 195
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 117 DLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
DLR A NF A+F AD+R ++ F GA L AN GAD
Sbjct: 61 DLRGAPLAGINFAGADFKEVRLYFADLRGANLELCDFRGADLSDTNLSDANLAGADFEGC 120
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
M + L +ANL+NA L+R VLT S+L A GAD + A++
Sbjct: 121 FMMSINLTKANLSNAQLMRVVLTGSNLVEANFSGADLTGALL 162
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 5/98 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR A N +F AD+ +++ S + GA E N T A+LS+ + R
Sbjct: 85 ADLRGA-----NLELCDFRGADLSDTNLSDANLAGADFEGCFMMSINLTKANLSNAQLMR 139
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+VL +NL A LT + L GA +EG F A++
Sbjct: 140 VVLTGSNLVEANFSGADLTGALLLGAKLEGKVFDGAIL 177
>gi|163760882|ref|ZP_02167961.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
gi|162281926|gb|EDQ32218.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
Length = 239
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
S DLR++ N AN A + + +GSK GA ++ AY+A+F+ D +
Sbjct: 71 STDLRES-----NLIEANLEKATLFRASLAGSKATGARFDRIEAYRADFSNLDATGASFG 125
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ A L N++L T T++DLG A +GAD S + LA
Sbjct: 126 SAEMQRAKLNNSMLANTDFTKADLGRAQFDGADISGSRFSLA 167
>gi|434407898|ref|YP_007150783.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428262153|gb|AFZ28103.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 182
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+F A + +D SG+K GA E + +AN GADLS+ L + LT A LVR
Sbjct: 90 ADFRGAQLNHADLSGAKLCGANFEGCLMVRANLAGADLSNA-----SLAGSALTGANLVR 144
Query: 191 TVLTRSDLGGAIIEGADFSDAVID 214
+++DL A++ GA+ DAV D
Sbjct: 145 ANFSQADLTNAVLFGAETEDAVFD 168
>gi|162450992|ref|YP_001613359.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
gi|161161574|emb|CAN92879.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
Length = 579
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 163
+ A+ A+LR+A+ R AD+ ++D G+ GA LE+A+ AN
Sbjct: 286 TGAELTGANLRRALLQGAILRGQRLAGADLEMTLLVDADLEGADLQGARLERAILDGANL 345
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
GADL+ L+ + +L A L +L + + R DL G ++G
Sbjct: 346 RGADLTRALLLQTLLRGAALDGVILDKAIFDRVDLTGTDLQG 387
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 5/104 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A DL A N RRA A +R G + GA LE + A+ GADL
Sbjct: 278 AMLAGCDLTGAELTGANLRRALLQGAILR-----GQRLAGADLEMTLLVDADLEGADLQG 332
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++R +L+ ANL A L R +L ++ L GA ++G A+ D
Sbjct: 333 ARLERAILDGANLRGADLTRALLLQTLLRGAALDGVILDKAIFD 376
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 52/99 (52%), Gaps = 5/99 (5%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR A + + R A FT +D+R + G+ +GA L +A A+ GADL+ TL+
Sbjct: 66 LRGASLDRCDLRGATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGAD 125
Query: 178 LNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 211
L A LT A L R L ++L GA+++GA + A
Sbjct: 126 LTGARLTGAKLDRIRLDFAKLPGAELAGAVLQGASLNKA 164
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 45/97 (46%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL + + E F T D+R G++ GA L+ +A+ G+DL DT R
Sbjct: 5 DLARRLRAGEPFAGKTITRFDLRGKQLGGARLRGAKLKDIHLDEADLAGSDLQDTQWFRC 64
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L A+L L T SDL GA + GA+ S A +
Sbjct: 65 PLRGASLDRCDLRGATFTGSDLRGARLRGANLSGAKL 101
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTN 185
AN + +R ++ + GA LE+A+ + TGA+L+ + R +L A L
Sbjct: 253 ANLAGSSLRGTNLRNANLRGANLEQAMLAGCDLTGAELTGANLRRALLQGAILRGQRLAG 312
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A L T+L +DL GA ++GA A++D
Sbjct: 313 ADLEMTLLVDADLEGADLQGARLERAILD 341
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 18/111 (16%)
Query: 112 QFGSADLR----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
Q G A LR K +H+ E A+ +D++++ + GA L++ A FTG+D
Sbjct: 30 QLGGARLRGAKLKDIHLDE----ADLAGSDLQDTQWFRCPLRGASLDRCDLRGATFTGSD 85
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 213
L L ANL+ A L+R L +DL GA ++ GAD + A +
Sbjct: 86 LRGA-----RLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARL 131
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ DLR+A +F +NFT AD+R +D S A L +A +A+ +GA +
Sbjct: 403 AKLAGMDLREA-----DFTGSNFTRADLRGADLRSSVLTRATLMEADLARADLSGATAKE 457
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L A +A L R TR+DL A + GAD D V+
Sbjct: 458 AFFGDAALAGARARDARLRRATFTRADLDHADLSGADLGDVVM 500
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 48/108 (44%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L +A+ N R A+ T A + ++ G+ +G L+KA+ + + TG DL
Sbjct: 328 ADLQGARLERAILDGANLRGADLTRALLLQTLLRGAALDGVILDKAIFDRVDLTGTDLQG 387
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 213
+ M + + A L L +D G A + GAD +V+
Sbjct: 388 VRLAGMTMTQCCFIEAKLAGMDLREADFTGSNFTRADLRGADLRSSVL 435
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 5/109 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 165
A+ A+L A ++ N A+ AD+ + D +G++ GA L++ A G
Sbjct: 89 ARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGAKLDRIRLDFAKLPG 148
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A+L+ ++ LN+A+LT A+L +T S A + GAD A ++
Sbjct: 149 AELAGAVLQGASLNKADLTRALLRDARITGSTFYDARLGGADLGGATLE 197
>gi|378721493|ref|YP_005286380.1| hypothetical protein RPL_04520 [Rickettsia rickettsii str.
Colombia]
gi|376326517|gb|AFB23756.1| hypothetical protein RPL_04520 [Rickettsia rickettsii str.
Colombia]
Length = 959
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.29, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 37.7 bits (86), Expect = 5.4, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 169
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 170 DTLMDRMVLNEANLTNAVLVRTVL 193
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
Score = 37.7 bits (86), Expect = 6.4, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M ++ ++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKFIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|307155293|ref|YP_003890677.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985521|gb|ADN17402.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 145
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR A N A+ AD+R ++ SG+ A LE A AN GADL+ ++
Sbjct: 42 DLRGA-----NLSAAHLIGADLRNANLSGANLVEANLEGADLTGANLQGADLTGAMVTNA 96
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
LN +NL + +L +D+ GA++EG + +A I
Sbjct: 97 SLNNSNLKDVNFTNAMLYDADVTGALMEGLNLKNAQI 133
>gi|448677922|ref|ZP_21689112.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
DSM 12282]
gi|445773597|gb|EMA24630.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
DSM 12282]
Length = 428
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A F + LR A + +AN +SAD+RE+D SG+ A L A KA+ +GADL
Sbjct: 49 NAISFENTGLRGADLSDADLGKANLSSADLREADLSGADLGSADLSGANLQKADLSGADL 108
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSD 210
S + L A+L++A L RT L+ +DL A + DFSD
Sbjct: 109 SYANLSGADLENADLSSADLRRTNLSGVKFVETDLADADLRNIDFSD 155
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADLR+ F + AD+R DFS ++ G L A + + +GADL
Sbjct: 121 ADLSSADLRRTNLSGVKFVETDLADADLRNIDFSDTELVGTDLSGADFFATDLSGADLRV 180
Query: 171 TLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 211
M + L EA+L+ A L T L+ +DL GA + G D SDA
Sbjct: 181 ADMSNVNLREADLSGADLGGTDLSDANLREADLSGADLGGVDLSDA 226
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 51/98 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A SADLR+A + A+ + A+++++D SG+ + A L A A+ + ADL
Sbjct: 71 ANLSSADLREADLSGADLGSADLSGANLQKADLSGADLSYANLSGADLENADLSSADLRR 130
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
T + + E +L +A L + ++L G + GADF
Sbjct: 131 TNLSGVKFVETDLADADLRNIDFSDTELVGTDLSGADF 168
>gi|374319450|ref|YP_005065949.1| hypothetical protein Rsl_930 [Rickettsia slovaca 13-B]
gi|383751452|ref|YP_005426553.1| hypothetical protein MC3_04505 [Rickettsia slovaca str. D-CWPP]
gi|360041999|gb|AEV92381.1| hypothetical protein Rsl_930 [Rickettsia slovaca 13-B]
gi|379774466|gb|AFD19822.1| hypothetical protein MC3_04505 [Rickettsia slovaca str. D-CWPP]
Length = 959
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 38.5 bits (88), Expect = 3.6, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M +++++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKLIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|157828677|ref|YP_001494919.1| hypothetical protein A1G_04555 [Rickettsia rickettsii str. 'Sheila
Smith']
gi|165933396|ref|YP_001650185.1| hypothetical protein RrIowa_0959 [Rickettsia rickettsii str. Iowa]
gi|378722841|ref|YP_005287727.1| hypothetical protein RPO_04530 [Rickettsia rickettsii str. Arizona]
gi|378724195|ref|YP_005289079.1| hypothetical protein RPM_04500 [Rickettsia rickettsii str. Hauke]
gi|379016252|ref|YP_005292487.1| hypothetical protein RPN_02430 [Rickettsia rickettsii str. Brazil]
gi|379017982|ref|YP_005294217.1| hypothetical protein RPJ_04485 [Rickettsia rickettsii str. Hino]
gi|157801158|gb|ABV76411.1| hypothetical protein A1G_04555 [Rickettsia rickettsii str. 'Sheila
Smith']
gi|165908483|gb|ABY72779.1| hypothetical protein RrIowa_0959 [Rickettsia rickettsii str. Iowa]
gi|376324776|gb|AFB22016.1| hypothetical protein RPN_02430 [Rickettsia rickettsii str. Brazil]
gi|376327865|gb|AFB25103.1| hypothetical protein RPO_04530 [Rickettsia rickettsii str. Arizona]
gi|376330548|gb|AFB27784.1| hypothetical protein RPJ_04485 [Rickettsia rickettsii str. Hino]
gi|376333210|gb|AFB30443.1| hypothetical protein RPM_04500 [Rickettsia rickettsii str. Hauke]
Length = 959
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 37.7 bits (86), Expect = 5.3, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 169
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 170 DTLMDRMVLNEANLTNAVLVRTVL 193
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
Score = 37.7 bits (86), Expect = 6.3, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M ++ ++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKFIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|34581546|ref|ZP_00143026.1| hypothetical protein [Rickettsia sibirica 246]
gi|28262931|gb|EAA26435.1| unknown [Rickettsia sibirica 246]
Length = 957
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 607
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 667
Query: 226 N 226
N
Sbjct: 668 N 668
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 689
Score = 40.4 bits (93), Expect = 0.86, Method: Composition-based stats.
Identities = 26/109 (23%), Positives = 53/109 (48%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 382 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 441
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M +++++ ++ TN+ L L +D+ ++G ++A++D A
Sbjct: 442 MVNADAEKLIIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 490
>gi|427716392|ref|YP_007064386.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348828|gb|AFY31552.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 521
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 71/149 (47%), Gaps = 10/149 (6%)
Query: 92 ADLNKYEAETRG--------EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
ADL+ EA RG E +A G DL A R+AN + A++ +D
Sbjct: 140 ADLS--EATLRGASLTGANLEMANLNATDMGRTDLSGANLRDTELRQANLSHANLSGADL 197
Query: 144 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
SG+ A L KA A+ +GA LS + L+ ANLTNA L+ LT++ L A
Sbjct: 198 SGANLRWADLSKANLRWADLSGAKLSGATLIGADLSNANLTNASLIHANLTQAKLIKAEW 257
Query: 204 EGADFSDAVIDLAQKQALCKYANGTNPIT 232
GAD + A++ A+ A ++ T +T
Sbjct: 258 IGADLTGAILTGAKLYATSRFGLKTEGMT 286
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 71/149 (47%), Gaps = 13/149 (8%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSA-------------AQFGSADLRKAV 122
LA+A++ + S N++ L + A+ RG I + A ADLR+A
Sbjct: 72 LASAILNNTSLNVANLIRADLSRAQLRGASLIRAELIRAELSRADLFEANLSGADLREAT 131
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
+ N RRA+ + A +R + +G+ A L + + +GA+L DT + + L+ AN
Sbjct: 132 LRQANLRRADLSEATLRGASLTGANLEMANLNATDMGRTDLSGANLRDTELRQANLSHAN 191
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+ A L L +DL A + AD S A
Sbjct: 192 LSGADLSGANLRWADLSKANLRWADLSGA 220
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N AN + A + + SG+ A L AN ADLS R L A+L A
Sbjct: 51 NLSEANLSDAKLNVARLSGANLASAILNNTSLNVANLIRADLS-----RAQLRGASLIRA 105
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
L+R L+R+DL A + GAD +A +
Sbjct: 106 ELIRAELSRADLFEANLSGADLREATL 132
>gi|379712563|ref|YP_005300902.1| hypothetical protein RSA_04485 [Rickettsia philipii str. 364D]
gi|376329208|gb|AFB26445.1| hypothetical protein RSA_04485 [Rickettsia philipii str. 364D]
Length = 959
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 38.5 bits (88), Expect = 3.0, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVVQNV--TARNAGFLFADLKKSKIE 415
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 169
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 170 DTLMDRMVLNEANLTNAVLVRTVL 193
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
>gi|376007502|ref|ZP_09784697.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
gi|375324138|emb|CCE20450.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
Length = 179
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 1/111 (0%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
RGE+ G AD+ R+AN T+A+M + DF+G+ F + L +
Sbjct: 40 RGEYSSCQGCNLGGADMSNQSRRNAQLRQANLTNANMSDGDFTGAFFTCSNLSNSNLSGG 99
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDA 211
NF A+ D + + L+ A+L+ A L ++ +R++L GA ++GA DA
Sbjct: 100 NFNFANFVDANLSGVDLSNADLSRADLSGAIIDSRTNLDGANLDGARLWDA 150
>gi|86610069|ref|YP_478831.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558611|gb|ABD03568.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 160
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N AD+R +D S + GA L A ++AN GADLS + L+ A L A L R
Sbjct: 67 NLQEADLRGADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHGAYLWEAKLTRA 126
Query: 192 VLTRSDL-----GGAIIEGADFSDAVI 213
L SDL GGA++ GAD A++
Sbjct: 127 QLQGSDLSGAKIGGAVLTGADLRGAIL 153
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 7/98 (7%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L+ +N EA+ RG A SA+L A N AN AD+ +D + +G
Sbjct: 63 LSGINLQEADLRG-------ADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHG 115
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
AYL +A +A G+DLS + VL A+L A+L
Sbjct: 116 AYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLRGAIL 153
>gi|354567300|ref|ZP_08986470.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353543601|gb|EHC13059.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 1022
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ FG DL A + N AN A++ ++ +G+ A L A AN TGA+L
Sbjct: 863 AGVNFGQIDLSNANFMGANLVGANLQDANLAGANLTGANLTDANLSGANLASANLTGANL 922
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L NLTN L + VL +D AI+ GA FS
Sbjct: 923 TGA-----NLQSTNLTNTCLFQAVLQETDKEIAILNGAIFS 958
Score = 41.6 bits (96), Expect = 0.43, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 38/77 (49%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
N + AD+ ++ +G F L A AN GA+L D + L ANLT+A L
Sbjct: 851 NLSGADLSQAMLAGVNFGQIDLSNANFMGANLVGANLQDANLAGANLTGANLTDANLSGA 910
Query: 192 VLTRSDLGGAIIEGADF 208
L ++L GA + GA+
Sbjct: 911 NLASANLTGANLTGANL 927
>gi|300866980|ref|ZP_07111651.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335015|emb|CBN56817.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 300
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 65/120 (54%), Gaps = 9/120 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
L +YE R +F + A SA+L A+ + N AN + A++ + + S+ NGA L
Sbjct: 7 LKRYENGDR-DF---AGADLSSANLSGAILIGVNLSGANLSGANLSRAFLTKSELNGASL 62
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++AN + A + + + L +AN++ A LV++ L R+ L GA I GA+ +A++
Sbjct: 63 -----HRANLSFAKMGEIRLADADLTKANISGAFLVKSKLPRAKLSGANITGANLRNAIL 117
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 109 SAAQFGSADLRKAVHVKE-----NFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVA 158
S A A+L +A K + RAN + A M E +D + + +GA+L K+
Sbjct: 38 SGANLSGANLSRAFLTKSELNGASLHRANLSFAKMGEIRLADADLTKANISGAFLVKSKL 97
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+A +GA+++ + +L ANL +A L T L ++L GA + A+F A + A+
Sbjct: 98 PRAKLSGANITGANLRNAILWNANLCSAELQLTNLRGANLTGANLNWANFYGAKLSGAKL 157
Query: 219 QALCKYANGTNPITGVSTRKS 239
+TG+S RK+
Sbjct: 158 FGA--------QLTGISLRKA 170
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANF-- 163
A+ A L A + R+A+ D+ D +G +K NG LE + ANF
Sbjct: 150 AKLSGAKLFGAQLTGISLRKAHLNGIDLGGVDLNGVNLSEAKLNGVNLEGSNLVGANFYA 209
Query: 164 --------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
TGADL+ + R L +ANL + L + L+++DL A + GA+F
Sbjct: 210 AQLRSVKLTGADLTKANLVRACLVQANLNWSRLSQANLSQADLSEATLMGANF 262
>gi|298246992|ref|ZP_06970797.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297549651|gb|EFH83517.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 381
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 56/119 (47%), Gaps = 8/119 (6%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAV 157
G +GS + GSA + ++ + A A MR S D S + GA L KA
Sbjct: 236 GHDALGSQGERGSA---RHPDLQAHLSHAQLAGAKMRGSYLSGVDLSQANLRGADLSKAY 292
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
Y AN GADLS + L EAN+ A L L+++ L GA + AD S A + LA
Sbjct: 293 FYGANLQGADLSGANLTETTLTEANIEGANLTEANLSKATLIGANLRQADLSGARLTLA 351
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 48/104 (46%), Gaps = 21/104 (20%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
GS A G ADL+K V + N D+R +F +AN GAD
Sbjct: 146 GSKALVG-ADLQKIV-----LPQINLAQMDLRRVNFR---------------EANLQGAD 184
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LS + R L+ ANL++A L L +DL G + GAD SD+
Sbjct: 185 LSGVNLYRADLSGANLSHATLKGADLRGADLRGTDLTGADLSDS 228
>gi|162453209|ref|YP_001615576.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
gi|161163791|emb|CAN95096.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
Length = 890
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
A+Q + A E + T AD R D G +F A+LE A A+ +GA L
Sbjct: 552 EASQMARVVVDAAREAGEPLDERDLTGADFRGVDLRGMRFARAFLEGADLRGADLSGAVL 611
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ + L+ ANLT A L L +++L GA+ + AD ++AV+
Sbjct: 612 EGAVLAKADLSGANLTGARLRGANLGKANLEGAVFDDADLTEAVL 656
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F DLR F RA AD+R +D SG A LE AV KA+ +GA+L
Sbjct: 577 TGADFRGVDLRGM-----RFARAFLEGADLRGADLSG-----AVLEGAVLAKADLSGANL 626
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+ + L +ANL AV LT + L GA + GA A ++ A
Sbjct: 627 TGARLRGANLGKANLEGAVFDDADLTEAVLMGARLAGASLKRAKLERA 674
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 50/102 (49%), Gaps = 15/102 (14%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F + DLR A F RA T D+ E+D + + F+ A ++ A+ + N
Sbjct: 777 FRTTDLRGA-----RFDRAQMTMTDLSEADATDATFDRAVMKNALLIRTN---------- 821
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+DR L NLT A+L ++ L +D GA + ADFS A D
Sbjct: 822 LDRASLRGCNLTEAILSKSRLAGADFTGAQLCRADFSRARGD 863
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 16/117 (13%)
Query: 108 GSAAQFGSADLRKAVHV-KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
G+ A+F A + V V K A+F A + ++ F + GA ++A + + A
Sbjct: 741 GAKARFAGARFSEGVAVHKSGLPEADFRDAVLDKTCFRTTDLRGARFDRAQMTMTDLSEA 800
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIE-----GADFSDAVI 213
D +D DR V+ NA+L+RT L R+ L G AI+ GADF+ A +
Sbjct: 801 DATDATFDRAVMK-----NALLIRTNLDRASLRGCNLTEAILSKSRLAGADFTGAQL 852
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-MDRMVLNEANLTN 185
N RA + +DFSG++ + L KA F GA S+ + + + L EA+ +
Sbjct: 710 NLERAMLLECSLDGTDFSGARLHKTSLMSCTGAKARFAGARFSEGVAVHKSGLPEADFRD 769
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
AVL +T +DL GA + A + + DL++ A
Sbjct: 770 AVLDKTCFRTTDLRGARFDRAQMT--MTDLSEADA 802
>gi|75911106|ref|YP_325402.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704831|gb|ABA24507.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 268
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 70/161 (43%), Gaps = 39/161 (24%)
Query: 81 VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
VA+ S +I ADL + F IG A F A+LR A+ + N +F+SAD+R+
Sbjct: 93 VANLSQSILTQADL------SHAHF-IG--ADFSGANLRGAIVAEANLIGTDFSSADLRD 143
Query: 141 SDFSGSKF------------------------------NGAYLEKAVAYKANFTGADLSD 170
+D +G+K GAYL KA YKAN A L
Sbjct: 144 ADLAGAKLIRSNLCFANLIAANLIAADFSEANLYQAEVMGAYLYKANFYKANLHKAHLGG 203
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R L A+L A L LT ++L GA + GA+ A
Sbjct: 204 AYLFRANLTAADLRGADLAWANLTSANLAGANLSGANLRGA 244
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 62/126 (49%), Gaps = 17/126 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 158
S A SA+L +A + N ANF T AD+ + F G+ F+GA L A+
Sbjct: 67 SGADLSSANLYQAKISEANLSAANFSVANLSQSILTQADLSHAHFIGADFSGANLRGAIV 126
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+AN G D S L +A+L A L+R+ L ++L A + ADFS+A +L Q
Sbjct: 127 AEANLIGTDFSSA-----DLRDADLAGAKLIRSNLCFANLIAANLIAADFSEA--NLYQA 179
Query: 219 QALCKY 224
+ + Y
Sbjct: 180 EVMGAY 185
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
++ + AN ++R ++ G+ + L A+ +AN +GADLS + + ++EAN
Sbjct: 26 QIEPDLSTANLQENNLRGANLEGTNLSRVDLSHALLVRANLSGADLSSANLYQAKISEAN 85
Query: 183 LTNAV-----LVRTVLTRSDLGGAIIEGADFSDA 211
L+ A L +++LT++DL A GADFS A
Sbjct: 86 LSAANFSVANLSQSILTQADLSHAHFIGADFSGA 119
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +LR A N R + + A + ++ SG+ + A L +A +AN + A+
Sbjct: 32 STANLQENNLRGANLEGTNLSRVDLSHALLVRANLSGADLSSANLYQAKISEANLSAANF 91
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 211
S + + +L +A+L++A + + ++L GAI+ G DFS A
Sbjct: 92 SVANLSQSILTQADLSHAHFIGADFSGANLRGAIVAEANLIGTDFSSA 139
Score = 37.4 bits (85), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 10/104 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
AA F A+L +A + +ANF A++ ++ GAYL ++AN T ADL
Sbjct: 168 AADFSEANLYQAEVMGAYLYKANFYKANLHKA-----HLGGAYL-----FRANLTAADLR 217
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L ANL A L L ++L GA + G + + ++
Sbjct: 218 GADLAWANLTSANLAGANLSGANLRGANLKGANLNGVNLQETIM 261
>gi|379019292|ref|YP_005295526.1| hypothetical protein RPK_04435 [Rickettsia rickettsii str. Hlp#2]
gi|376331872|gb|AFB29106.1| hypothetical protein RPK_04435 [Rickettsia rickettsii str. Hlp#2]
Length = 959
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.29, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 37.7 bits (86), Expect = 5.5, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 169
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 170 DTLMDRMVLNEANLTNAVLVRTVL 193
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
Score = 37.4 bits (85), Expect = 6.6, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 173 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
M ++ ++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKFIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|381206178|ref|ZP_09913249.1| hypothetical protein SclubJA_11179 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 205
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 68/136 (50%), Gaps = 15/136 (11%)
Query: 93 DLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVKE----NFRRANFTSADMRESDFSGSK 147
DL+K +A + S A G A+L A +H N + AN AD+RE+D +
Sbjct: 55 DLDKLQATNKCIRCDLSGADLGGANLSDANLHFANLQGTNLKGANLNWADLREADLRKAD 114
Query: 148 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN-----AVLVRTVLTRSD----- 197
N A L+ A +A+ GA+L + + + EANL A L R L+R++
Sbjct: 115 LNWARLKNADLRRADLYGANLGEAFLQYSDMREANLREVDLEAADLYRAELSRANLEDAR 174
Query: 198 LGGAIIEGADFSDAVI 213
LGGAI++ A S+A++
Sbjct: 175 LGGAILKFASMSEAIL 190
>gi|158338763|ref|YP_001519940.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158309004|gb|ABW30621.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 299
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-VAY----KANFTGADLSDT 171
DLR A +N + A+ A+++ ++ G + A LE A + Y KA A L+ T
Sbjct: 63 DLRGANLQDQNLKGASLQGANLQGANLQGVNLDDANLESANLKYANLSKATLRRASLTTT 122
Query: 172 LMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVIDLAQ 217
L L +ANLT A LV+T L R++L A +E ADFS AV++ Q
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQ 173
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 53/103 (51%), Gaps = 5/103 (4%)
Query: 116 ADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+DLR+A K N + A ++A++ E++ G+K A L+ A ADL
Sbjct: 181 SDLREANFNKSNLKNATLNQVYLSNANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRK 240
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ + L+EA+L++A L + + +L GA + AD S A +
Sbjct: 241 ASLESVNLSEADLSSAHLGKIAMKDVNLRGANLSNADLSGAKL 283
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 14/122 (11%)
Query: 107 IGSAAQFGSADLRKAVHVKE-----NFRRANFTSADMRESDFSGSKFNGAY--------- 152
+ A A+L +A VK + RRAN A + ++DFS +
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQGIRRVYFSD 182
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
L +A K+N A L+ + L+EANL A L + L ++L GA + AD A
Sbjct: 183 LREANFNKSNLKNATLNQVYLSNANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRKAS 242
Query: 213 ID 214
++
Sbjct: 243 LE 244
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 54/127 (42%), Gaps = 18/127 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A L + N + AN T A + ++ G+ A L +A A+F+ A +
Sbjct: 110 SKATLRRASLTTTLKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVV 169
Query: 169 SDTLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
T R V N++NL NA L + L+ ++L A ++GA KQ
Sbjct: 170 ETTQGIRRVYFSDLREANFNKSNLKNATLNQVYLSNANLSEANLKGAKL---------KQ 220
Query: 220 ALCKYAN 226
A KY N
Sbjct: 221 AQLKYTN 227
>gi|334137987|ref|ZP_08511411.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
gi|333604520|gb|EGL15910.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
Length = 242
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 72/166 (43%), Gaps = 24/166 (14%)
Query: 50 FPDCSNNQCAGPYAKLKNWRVFVSTALAAAV--VASCSSNISALADLNKYEAETRGEFGI 107
DC ++ A++K+ + +ST + V C+ N+S + L K GI
Sbjct: 90 IADCVLSEATLRNAQMKDAEIKISTCIETCFDEVELCNGNLSG-STLIKATFRQANLHGI 148
Query: 108 -GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
S A F +DLR A V +F ++F SA++ E D S + F G L A+ NFTG
Sbjct: 149 SASKAYFDESDLRGANLVNGDFEESDFISANLSEVDASYANFTGGNLTGAILCNGNFTG- 207
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
AN TN L GAI+EGA F DAV
Sbjct: 208 --------------ANCTN-----VKLNDVSWKGAIVEGAIFDDAV 234
>gi|332706458|ref|ZP_08426519.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354342|gb|EGJ33821.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 345
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTN 185
A+F AD++E DFS A L +A ++ N GA+L + R L +ANL+N
Sbjct: 231 ADFRGADLKERDFSNRNLQSANLSQANLKDAFLHRVNLAGANLEGANLFRANLFQANLSN 290
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A L L +D+ GA + GAD S A +
Sbjct: 291 ANLREANLIGADMSGADLSGADLSGAKV 318
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 10/98 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A F ADL++ N + AN + A+++++ GA LE A ++AN A+L
Sbjct: 229 SGADFRGADLKERDFSNRNLQSANLSQANLKDAFLHRVNLAGANLEGANLFRANLFQANL 288
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
S+ ANL A L+ ++ +DL GA + GA
Sbjct: 289 SN----------ANLREANLIGADMSGADLSGADLSGA 316
>gi|226194659|ref|ZP_03790253.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
gi|386863935|ref|YP_006276883.1| type VI secretion system [Burkholderia pseudomallei 1026b]
gi|418534996|ref|ZP_13100802.1| type VI secretion system [Burkholderia pseudomallei 1026a]
gi|225933225|gb|EEH29218.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
gi|385357281|gb|EIF63347.1| type VI secretion system [Burkholderia pseudomallei 1026a]
gi|385661063|gb|AFI68485.1| type VI secretion system [Burkholderia pseudomallei 1026b]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|427714529|ref|YP_007063153.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427378658|gb|AFY62610.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 333
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 18/125 (14%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G A+ R+A + R AN T AD+ ES + +GA LEKA+ A+ T ADL
Sbjct: 51 SFALLGRANFRRANLAGADLRGANLTQADLTESLLQEANLHGASLEKAILVGADITLADL 110
Query: 169 SDTLMDRMVLNEANLTNAVLV----------------RTVLTRSDLGGAIIEGADFSDAV 212
+D + L +ANL++ V RTVL +DL A + A+ ++A
Sbjct: 111 TDCNLIEADLRQANLSSTRFVGACFRGANLRKDNYQERTVLRGTDLEKADFQSANLAEA- 169
Query: 213 IDLAQ 217
DLA+
Sbjct: 170 -DLAR 173
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 55/107 (51%), Gaps = 10/107 (9%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFT-----GA 166
+L+KA N R A F A ++E+DF + F+ A+L++ + KANFT A
Sbjct: 210 NLKKAGLAWANLREARFDRAQLQEADFFQADCYQANFSHAHLDQIIGEKANFTQAIFTKA 269
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
DL + L EA L A L RT LT +DL GA + A+ S ++
Sbjct: 270 DLRRANLRGSTLKEARLIEAYLARTDLTGADLTGANLIRAEISSTLL 316
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 5/83 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A L + + K NF +A FT AD+R ++ GS A L +A + + TGADL+
Sbjct: 244 ANFSHAHLDQIIGEKANFTQAIFTKADLRRANLRGSTLKEARLIEAYLARTDLTGADLTG 303
Query: 171 TLMDR-----MVLNEANLTNAVL 188
+ R +L +ANLT+ +
Sbjct: 304 ANLIRAEISSTLLLDANLTDVTM 326
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 66/145 (45%), Gaps = 6/145 (4%)
Query: 75 ALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT 134
+L A++ ++ L D N EA+ R S+ +F A R A K+N++
Sbjct: 94 SLEKAILVGADITLADLTDCNLIEADLRQ--ANLSSTRFVGACFRGANLRKDNYQERTV- 150
Query: 135 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 194
+R +D + F A L +A + N TGA+L + + L ANL A T LT
Sbjct: 151 ---LRGTDLEKADFQSANLAEADLARVNLTGANLKEANLRGADLMGANLERAYCQMTRLT 207
Query: 195 RSDLGGAIIEGADFSDAVIDLAQKQ 219
++L A + A+ +A D AQ Q
Sbjct: 208 DTNLKKAGLAWANLREARFDRAQLQ 232
>gi|254182800|ref|ZP_04889393.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
gi|184213334|gb|EDU10377.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|302039057|ref|YP_003799379.1| hypothetical protein NIDE3778 [Candidatus Nitrospira defluvii]
gi|300607121|emb|CBK43454.1| conserved exported protein of unknown function [Candidatus
Nitrospira defluvii]
Length = 476
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLRKA+ VK + R A ++ G+ F A LE+A A+ GADLS+
Sbjct: 133 ANLEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSN 192
Query: 171 -TLMDRMV----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
TL+D L++ NLT+A L T L R++L A + A+ A++D
Sbjct: 193 ATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLD 241
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 53/104 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+L+ A+ + RA+F AD++ +D S + Y A K N T ADL+
Sbjct: 158 AAFYGANLQGALFREALLERAHFEDADLQGADLSNATLLDGYFYGANLSKTNLTDADLAG 217
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
T + R L +ANL A L +L ++L GA + AD A +D
Sbjct: 218 TDLRRTNLRQANLRRANLQGALLDSANLDGASLIEADLESAYLD 261
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 15/121 (12%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 162
G A DLR+ V N R N A ++R + + GA +AV AN
Sbjct: 75 GRRANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDAN 134
Query: 163 FTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
GADL L+ + LN ANL A+ +L R+ A ++GAD S+A
Sbjct: 135 LEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSNAT 194
Query: 213 I 213
+
Sbjct: 195 L 195
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 46/103 (44%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SAA A+L AV + RA+ AD+ E + A L +A AN ADL
Sbjct: 326 SAANLHGANLHHAVLIGTQLARADLRKADLTEIYGPNAHLQQARLSEANLELANLVAADL 385
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + V+ + NL L L+ SDL GA++ AD A
Sbjct: 386 SQADISHAVVVQTNLQETNLRGANLSASDLTGALLNNADLGQA 428
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL A + F AN + ++ ++D +G+ L +A +AN GA L
Sbjct: 183 ADLQGADLSNATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLDS 242
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+D L EA+L +A L L +DL A + GADF
Sbjct: 243 ANLDGASLIEADLESAYLDDASLANADLHEASLRGADF 280
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 63/138 (45%), Gaps = 32/138 (23%)
Query: 89 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
++LA+ + +EA RG AD R N +R N +A+M + S+
Sbjct: 263 ASLANADLHEASLRG------------ADFRFTHLGGANLQRVNLENANMEGATLVKSRL 310
Query: 149 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG--------- 199
+ A L V YKAN + A+ L+ ANL +AVL+ T L R+DL
Sbjct: 311 DSATLTMTVLYKANLSAAN----------LHGANLHHAVLIGTQLARADLRKADLTEIYG 360
Query: 200 -GAIIEGADFSDAVIDLA 216
A ++ A S+A ++LA
Sbjct: 361 PNAHLQQARLSEANLELA 378
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 15/104 (14%)
Query: 129 RRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
RRAN D+R+ + G+ G+ L A +A+ GAD S ++D L
Sbjct: 76 RRANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDANL 135
Query: 179 NEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVIDLAQ 217
A+L A+LV+ L R + GA ++GA F +A+++ A
Sbjct: 136 EGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAH 179
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 55/128 (42%), Gaps = 25/128 (19%)
Query: 111 AQFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYLEKAVAYK 160
A DLR+ + N RRAN A + E+D + + A L A ++
Sbjct: 213 ADLAGTDLRRTNLRQANLRRANLQGALLDSANLDGASLIEADLESAYLDDASLANADLHE 272
Query: 161 ANFTGADLSDTL-----MDRMVLNEANLTNAVLVR----------TVLTRSDLGGAIIEG 205
A+ GAD T + R+ L AN+ A LV+ TVL +++L A + G
Sbjct: 273 ASLRGADFRFTHLGGANLQRVNLENANMEGATLVKSRLDSATLTMTVLYKANLSAANLHG 332
Query: 206 ADFSDAVI 213
A+ AV+
Sbjct: 333 ANLHHAVL 340
>gi|409994207|ref|ZP_11277325.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
Paraca]
gi|409934955|gb|EKN76501.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
Paraca]
Length = 519
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 10/97 (10%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 179
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 29 RVNLSQANFTEAILSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 88
Query: 180 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ANL +A L+R L R++L A++ GA+ ++A
Sbjct: 89 RADLSQANLIDASLIRAELMRAELSEAVVNGANLTEA 125
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 62/129 (48%), Gaps = 12/129 (9%)
Query: 101 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +AV
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLIDASLIRAELMRAELSEAVV 117
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNA----------VLVRTVLTRSDLGGAIIEGADF 208
AN T ADL + + L +ANL+ A L R+ LTRSDL A + G +
Sbjct: 118 NGANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNL 177
Query: 209 SDAVIDLAQ 217
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 48/96 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L +A + N R+N T +D+ +D G A L +A A+ GA+LS
Sbjct: 140 ANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNLRNAELRQAELSGADLRGANLSG 199
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+ L+ ANL+ A L T L+ + L GA + GA
Sbjct: 200 ANLRWANLSGANLSGANLEATQLSGASLRGANLSGA 235
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L +AV N A+ A +R ++ + +GA L +A +N ++L+
Sbjct: 105 AELMRAELSEAVVNGANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTR 164
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + R L NL NA L + L+ +DL GA + GA+
Sbjct: 165 SDLTRADLRGVNLRNAELRQAELSGADLRGANLSGANL 202
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 49/91 (53%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+A+LR+A + R AN + A++R ++ SG+ +GA LE A+ GA+LS +
Sbjct: 179 NAELRQAELSGADLRGANLSGANLRWANLSGANLSGANLEATQLSGASLRGANLSGASLL 238
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
A+LT A L+ T +DL G+ + G
Sbjct: 239 NCTAIHADLTQANLIECDWTDADLRGSALTG 269
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAILSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 202 IIEGADFSDAVIDLA 216
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
+DL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 SDLTRADLRGVNLRNAELRQAELSGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCTAIHADLTQANLIECDWTDA 260
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 58/139 (41%), Gaps = 18/139 (12%)
Query: 91 LADLNKYEAE-TRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE------- 140
L D + AE R E + + A ADLR+A ++AN + A++ E
Sbjct: 97 LIDASLIRAELMRAELSEAVVNGANLTEADLREATLRHTELQQANLSGANLSEACLILSN 156
Query: 141 --------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
SD + + G L A +A +GADL + L ANL+ A L
Sbjct: 157 LERSNLTRSDLTRADLRGVNLRNAELRQAELSGADLRGANLSGANLRWANLSGANLSGAN 216
Query: 193 LTRSDLGGAIIEGADFSDA 211
L + L GA + GA+ S A
Sbjct: 217 LEATQLSGASLRGANLSGA 235
>gi|113476307|ref|YP_722368.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110167355|gb|ABG51895.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 225
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 46/88 (52%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
+ NF N +A+M ++ +G F GA L A AN TGA+L + R +N ANL
Sbjct: 50 TRANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADINRANL 109
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDA 211
TN L T L +DL A + A+ +DA
Sbjct: 110 TNTNLTSTRLLEADLTLANLNHANLTDA 137
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 1/117 (0%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +L+ A N NF AD+ ++ SG+ GA LEKA Y+A+ A+L++
Sbjct: 52 ANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADINRANLTN 111
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-CKYAN 226
T + L EA+LT A L LT ++L A + GA + A ++ A +YAN
Sbjct: 112 TNLTSTRLLEADLTLANLNHANLTDANLLEARLWGASLAGANLNNASLHGTNLEYAN 168
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 9/97 (9%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD---------RMV 177
N AN T A++ E+ G+ GA L A + N A+L+D+ + R
Sbjct: 128 NLNHANLTDANLLEARLWGASLAGANLNNASLHGTNLEYANLADSNLSGADFHSFSFRSY 187
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
E NL+NA L ++T +DL G I+ G D I+
Sbjct: 188 KQETNLSNANLEGAIITNTDLSGVIMRGTIMPDGSIN 224
>gi|217968703|ref|YP_002353937.1| pentapeptide repeat-containing protein [Thauera sp. MZ1T]
gi|217506030|gb|ACK53041.1| pentapeptide repeat protein [Thauera sp. MZ1T]
Length = 215
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 1/109 (0%)
Query: 110 AAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
A G D+R+ N AN ++ S F+ S +GA L A +ANF+ A++
Sbjct: 29 AGNIGGCDIRRGTLCTNLNLNGANLEGVNLANSQFTRSDLSGANLRGATLNEANFSQAEM 88
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ +D+ + ANL A L +DL AI++ AD DA + AQ
Sbjct: 89 AGATLDKASMLRANLRGARLTGASFKEADLRNAILQNADLHDADLTAAQ 137
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A +L + + + AN A + E++FS ++ GA L+KA +AN GA L
Sbjct: 49 NGANLEGVNLANSQFTRSDLSGANLRGATLNEANFSQAEMAGATLDKASMLRANLRGARL 108
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ EA+L NA+L L +DL A + GAD ++A
Sbjct: 109 TGA-----SFKEADLRNAILQNADLHDADLTAAQLGGADLTNA 146
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 11/115 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A F ADLR A+ +AD+ ++D + ++ GA L A A ++ F GA+L
Sbjct: 109 TGASFKEADLRNAI----------LQNADLHDADLTAAQLGGADLTNARAERSRFDGAEL 158
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 223
+ + + L A+ A L+ TR+ L A + GA+ A+ AQ CK
Sbjct: 159 TRSNLRAAKLAGASFVGANLLGANFTRAQLSNADLTGANLDQAIFLNAQTDG-CK 212
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 49/105 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ A L KA ++ N R A T A +E+D + A L A A GADL
Sbjct: 84 SQAEMAGATLDKASMLRANLRGARLTGASFKEADLRNAILQNADLHDADLTAAQLGGADL 143
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ +R + A LT + L L + GA + GA+F+ A +
Sbjct: 144 TNARAERSRFDGAELTRSNLRAAKLAGASFVGANLLGANFTRAQL 188
>gi|167913453|ref|ZP_02500544.1| pentapeptide repeat family protein [Burkholderia pseudomallei 112]
gi|403521532|ref|YP_006657101.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
BPC006]
gi|403076599|gb|AFR18178.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
BPC006]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|428214427|ref|YP_007087571.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002808|gb|AFY83651.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 155
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ S D+ ++D SG + A L A +AN TGA+LS + L EANLT+A L T
Sbjct: 61 DLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANLQNT 120
Query: 192 VLTRSDLGGAI-----IEGADFSDAVID 214
T +DL AI + GADF+ A +D
Sbjct: 121 NFTSADLEDAILTNANVTGADFTGADLD 148
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 11/109 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S S DL KA + AN T+AD+ E++ +G+ + A L A +AN T A+L
Sbjct: 58 SGCDLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANL 117
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+T N T+A L +LT +++ GA GAD D+VI L +
Sbjct: 118 QNT----------NFTSADLEDAILTNANVTGADFTGADL-DSVIGLTR 155
>gi|381395251|ref|ZP_09920956.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
611]
gi|379329152|dbj|GAB56089.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
611]
Length = 258
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 61/116 (52%), Gaps = 9/116 (7%)
Query: 107 IGSAAQFGSADLR----KAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
IGS F AD+R K V + F R+ T+ADMR DF G F+ A LE A A
Sbjct: 139 IGST--FIDADMRDSSLKNVRARSAMFTRSVLTNADMRWGDFEGVDFSNANLEGADLTMA 196
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
N GA+L+ + +L NL A+L T++ + + GA ++ DF+ +DL+Q
Sbjct: 197 NLRGANLTAANLKNAMLLYTNLEGAILNGTIMDGAQIVGANMKRVDFTK--VDLSQ 250
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 52/121 (42%), Gaps = 15/121 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTS---------------ADMRESDFSGSKFNGAYL 153
S + ++ V VK F+R+N T+ AD+ ES+ + FN A L
Sbjct: 69 SGSNLTGSNFSSTVLVKAKFKRSNLTNTNFQNANLGAAQLLGADLSESNLRNANFNKAVL 128
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ A G+ D M L +A+ R+VLT +D+ EG DFS+A +
Sbjct: 129 QYTGFIDATLIGSTFIDADMRDSSLKNVRARSAMFTRSVLTNADMRWGDFEGVDFSNANL 188
Query: 214 D 214
+
Sbjct: 189 E 189
>gi|284008185|emb|CBA74448.1| conserved pentapeptide repeat protein [Arsenophonus nasoniae]
Length = 1253
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A L A+ + N AN S D+ + + +K N A L +A + N ADL +
Sbjct: 931 ATLHKASLNGAILHRVNLNNANLISVDLYRAILNDAKLNNANLLRANLKETNLVNADLIN 990
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVID 214
+ + L+ NL +A L LT++DL A I G +FS A++D
Sbjct: 991 ADLTQATLSHTNLMHANLAHADLTQTDLSHANLQQVSIHGTNFSGAILD 1039
Score = 40.0 bits (92), Expect = 1.0, Method: Composition-based stats.
Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N N T+A + ++ +G A+ ++ N A+L + R +LN+A L NA
Sbjct: 922 NLSNLNLTNATLHKASLNG----------AILHRVNLNNANLISVDLYRAILNDAKLNNA 971
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVI 213
L+R L ++L A + AD + A +
Sbjct: 972 NLLRANLKETNLVNADLINADLTQATL 998
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 24/83 (28%), Positives = 37/83 (44%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N +N ++ ++ + + NGA L + AN DL +++ LN ANL A
Sbjct: 917 NLSGSNLSNLNLTNATLHKASLNGAILHRVNLNNANLISVDLYRAILNDAKLNNANLLRA 976
Query: 187 VLVRTVLTRSDLGGAIIEGADFS 209
L T L +DL A + A S
Sbjct: 977 NLKETNLVNADLINADLTQATLS 999
>gi|428309842|ref|YP_007120819.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428251454|gb|AFZ17413.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 289
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 52/95 (54%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+LR+A + + R N +AD+ E+D +K +GA L A +AN D S + R+
Sbjct: 30 NLREANLREAHLRYVNLCTADLSEADLFNAKLSGADLTGANLTRANLFLVDFSTADLTRV 89
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ANLT A L T LT ++L GA + GA+ +A
Sbjct: 90 DLTGANLTRANLFFTNLTGANLTGANLTGANLKEA 124
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 50/101 (49%), Gaps = 10/101 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ ADL A + N +F++AD+ D +G A L +A + N TGA+L+
Sbjct: 59 AKLSGADLTGANLTRANLFLVDFSTADLTRVDLTG-----ANLTRANLFFTNLTGANLTG 113
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L EAN +NA L R +DL GA + AD S A
Sbjct: 114 ANLTGANLKEANFSNAGLCR-----ADLSGANLNRADLSKA 149
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L A N + ANF++A + +D SG+ N A L KA N +GADLS + +
Sbjct: 109 ANLTGANLTGANLKEANFSNAGLCRADLSGANLNRADLSKADLRNINLSGADLSGANLGK 168
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN----PI 231
L+ ANL A L L ++L G + A+F A +L+ +A GTN +
Sbjct: 169 ANLSGANLCAANLSGANLCAANLSGTNLCAANFKRA--NLSGASLSNTHALGTNFEQARL 226
Query: 232 TGV 234
TGV
Sbjct: 227 TGV 229
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 50/96 (52%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +ADL + N RAN ++ ++ +G+ GA L++A A ADLS
Sbjct: 81 FSTADLTRVDLTGANLTRANLFFTNLTGANLTGANLTGANLKEANFSNAGLCRADLSGAN 140
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
++R L++A+L N L L+ ++LG A + GA+
Sbjct: 141 LNRADLSKADLRNINLSGADLSGANLGKANLSGANL 176
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 48/103 (46%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F +A L +A N RA+ + AD+R + SG+ +GA L KA AN A+LS
Sbjct: 124 ANFSNAGLCRADLSGANLNRADLSKADLRNINLSGADLSGANLGKANLSGANLCAANLSG 183
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L+ NL A R L+ + L G +F A +
Sbjct: 184 ANLCAANLSGTNLCAANFKRANLSGASLSNTHALGTNFEQARL 226
>gi|254414225|ref|ZP_05027992.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196178900|gb|EDX73897.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 963
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 50/101 (49%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+ A + N +RAN A++ E++ + F GA LE+A +AN GA+L +
Sbjct: 776 ANFEGANFEGANLEEANLKRANLFEANLFEANLFEANFEGANLERANLKRANLEGANLEE 835
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L EANL A L R+ L A +E A+ A
Sbjct: 836 ANLKGANLEEANLEEANFEGANLKRATLFEANLEWANLKRA 876
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 11/117 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A+L +A N +RAN A++ E++ G+ A LE+A NF GA+L
Sbjct: 811 ANFEGANLERA-----NLKRANLEGANLEEANLKGANLEEANLEEA-----NFEGANLKR 860
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
+ L ANL A L L ++ GA +EGA A + A K+A K AN
Sbjct: 861 ATLFEANLEWANLKRANLFEANLFDANFEGANLEGAHLKGANLKRANLKRANLKRAN 917
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 47/168 (27%), Positives = 71/168 (42%), Gaps = 25/168 (14%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L + N +EA G A A+L++A N AN A++ E++ + F G
Sbjct: 803 LFEANLFEANFEG-------ANLERANLKRANLEGANLEEANLKGANLEEANLEEANFEG 855
Query: 151 AYLEKAVAYKANFTGADLSDT------LMDRMV---------LNEANLTNAVLVRTVLTR 195
A L++A ++AN A+L L D L ANL A L R L R
Sbjct: 856 ANLKRATLFEANLEWANLKRANLFEANLFDANFEGANLEGAHLKGANLKRANLKRANLKR 915
Query: 196 SDLGGAIIEGADFSDAVIDLA---QKQALCKYANGTNPITGVSTRKSL 240
++L A EGA+F A ++ A + G PI+ T ++L
Sbjct: 916 ANLFEANFEGANFEGATLEWANLFEANLKGTILEGKVPISSPETEQTL 963
Score = 47.8 bits (112), Expect = 0.006, Method: Composition-based stats.
Identities = 35/102 (34%), Positives = 50/102 (49%), Gaps = 5/102 (4%)
Query: 117 DLRKAVHVKE-----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
DLR V V + NF ANF A++ E++ + A L +A ++ANF GA+L
Sbjct: 762 DLRGCVLVFKDFYWANFEGANFEGANLEEANLKRANLFEANLFEANLFEANFEGANLERA 821
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ R L ANL A L L ++L A EGA+ A +
Sbjct: 822 NLKRANLEGANLEEANLKGANLEEANLEEANFEGANLKRATL 863
>gi|254299592|ref|ZP_04967041.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
gi|418542641|ref|ZP_13108060.1| type VI secretion system [Burkholderia pseudomallei 1258a]
gi|418549165|ref|ZP_13114243.1| type VI secretion system [Burkholderia pseudomallei 1258b]
gi|157809489|gb|EDO86659.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
gi|385355180|gb|EIF61399.1| type VI secretion system [Burkholderia pseudomallei 1258a]
gi|385356028|gb|EIF62174.1| type VI secretion system [Burkholderia pseudomallei 1258b]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|434392917|ref|YP_007127864.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428264758|gb|AFZ30704.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 313
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A+L KA+ + N RAN T AD+ +++ SG + + A L +AV A+
Sbjct: 93 SGVNLWRANLNKAILCEANLSRANLDEANLTGADLSKANLSGIQLSKANLTEAVIVDAHL 152
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L++T + R L L A L+ + LT +DL A +EGA+ S+A
Sbjct: 153 NRANLTETKLMRSHLCGTQLERAELIASDLTAADLSRANLEGANLSEA 200
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 118 LRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
L++A+ + R+ AD+ +++ + ++FNG++L AN TGADLS
Sbjct: 37 LKRAILEATDLSRSILVGADLNGVILKQATMTATRFNGSHLVGVDLTAANLTGADLSGVN 96
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R ANL A+L L+R++L A + GAD S A
Sbjct: 97 LWR-----ANLNKAILCEANLSRANLDEANLTGADLSKA 130
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 15/116 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA--VAYKA---NFTG 165
A+ ++DL A + N AN + A++ +++ SG+ G L +A +A KA N G
Sbjct: 175 AELIASDLTAADLSRANLEGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLRG 234
Query: 166 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L + L EA NL+ A L R +LT +L AI+ GA+ DA
Sbjct: 235 ANLEQAELITTNLTEADLSWANLSKTNLSGADLHRAILTDVNLNSAILRGANLIDA 290
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 68/154 (44%), Gaps = 36/154 (23%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFT--------------------SADMRESDFSGSKF 148
S Q A+L +AV V + RAN T ++D+ +D S +
Sbjct: 133 SGIQLSKANLTEAVIVDAHLNRANLTETKLMRSHLCGTQLERAELIASDLTAADLSRANL 192
Query: 149 NGAYLEKAVAYKANFTGADLSDTLMDR--MV--------LNEANLTNAVLVRTVLTRSDL 198
GA L +A +AN +GA+L+ + R ++ L ANL A L+ T LT +DL
Sbjct: 193 EGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLRGANLEQAELITTNLTEADL 252
Query: 199 GGA-----IIEGADFSDAVI-DLAQKQALCKYAN 226
A + GAD A++ D+ A+ + AN
Sbjct: 253 SWANLSKTNLSGADLHRAILTDVNLNSAILRGAN 286
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 42/77 (54%), Gaps = 10/77 (12%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVA----------YKANFTGADLSDTLMDRMVLNEA 181
+ T+A++ +D SG A L KA+ +AN TGADLS + + L++A
Sbjct: 81 DLTAANLTGADLSGVNLWRANLNKAILCEANLSRANLDEANLTGADLSKANLSGIQLSKA 140
Query: 182 NLTNAVLVRTVLTRSDL 198
NLT AV+V L R++L
Sbjct: 141 NLTEAVIVDAHLNRANL 157
>gi|86607938|ref|YP_476700.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86556480|gb|ABD01437.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 154
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 61/128 (47%), Gaps = 16/128 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 165
S AQ A+L+ + R A+ + AD+RE+D SG+ +GA L A + N G
Sbjct: 32 SGAQLSGANLKGII-----LRDADLSGADLREADLSGADLSGADLRGAKLRRVNLIGAKL 86
Query: 166 --ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALC 222
ADL + R L A+L+ A L R L +DL GAII F A+ D
Sbjct: 87 VKADLRGANLYRAKLLRADLSEAELNRADLRIGADLRGAIITNTHFRGALYD-----EYT 141
Query: 223 KYANGTNP 230
K+ +G NP
Sbjct: 142 KFPDGFNP 149
>gi|53715998|ref|YP_106439.1| pentapeptide repeat-containing protein [Burkholderia mallei ATCC
23344]
gi|121597894|ref|YP_990510.1| pentapeptide repeat-containing protein [Burkholderia mallei SAVP1]
gi|124382797|ref|YP_001025000.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
10229]
gi|126447556|ref|YP_001079344.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
10247]
gi|166999172|ref|ZP_02265018.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
gi|238561876|ref|ZP_00441284.2| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
4]
gi|254176522|ref|ZP_04883180.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
gi|254203434|ref|ZP_04909795.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
gi|254205313|ref|ZP_04911666.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
gi|254356120|ref|ZP_04972397.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
gi|52421968|gb|AAU45538.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 23344]
gi|121225692|gb|ABM49223.1| pentapeptide repeat family protein [Burkholderia mallei SAVP1]
gi|126240410|gb|ABO03522.1| pentapeptide repeat family protein [Burkholderia mallei NCTC 10247]
gi|147745673|gb|EDK52752.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
gi|147754899|gb|EDK61963.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
gi|148025103|gb|EDK83272.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
gi|160697564|gb|EDP87534.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
gi|238523698|gb|EEP87135.1| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
4]
gi|243064727|gb|EES46913.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
gi|261826983|gb|ABM99323.2| pentapeptide repeat family protein [Burkholderia mallei NCTC 10229]
Length = 825
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|354565480|ref|ZP_08984655.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549439|gb|EHC18881.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 182
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 73/161 (45%), Gaps = 19/161 (11%)
Query: 94 LNKYEAETRGEFGIG-----------SAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
L++YE R G+ S A F ADL A + N NF+ A++ ++D
Sbjct: 7 LSRYETGERDFVGVNLHKVNLREVDLSGANFCGADLSGADLSQANLSGCNFSRANLTDAD 66
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
+ + NGA L + N GADL + ++ L+ A+L A LVR LT+++L A
Sbjct: 67 LTRADLNGANLS-----EINLIGADLINANLEGTNLSRADLRGANLVRANLTKANLSEAE 121
Query: 203 IEGADFSDAVI---DLAQKQALCKYANGTNPITGVSTRKSL 240
+ GAD S A + +L + NG N T K +
Sbjct: 122 LSGADLSGANLNQANLIETNLNEAELNGVNITGATVTEKEM 162
>gi|344339023|ref|ZP_08769953.1| pentapeptide repeat protein [Thiocapsa marina 5811]
gi|343800943|gb|EGV18887.1| pentapeptide repeat protein [Thiocapsa marina 5811]
Length = 284
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 11/103 (10%)
Query: 120 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----------VAYKANFTGADL 168
KA+ ++ N R AN AD+R+ + + GA + +A V ANF GADL
Sbjct: 149 KALFIRANLREANLCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADL 208
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + + E N NA L LT + LGGAI+ AD ++A
Sbjct: 209 RGSKLRSVSAQETNFRNANLTDVDLTNAVLGGAILRRADVTNA 251
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 50/106 (47%), Gaps = 6/106 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLR A + + R AN ADMR++DF GS KA+ +AN A+L
Sbjct: 103 SKANLERADLRHADVRRADLRGANLAHADMRDTDFQGSDLCHVVAPKALFIRANLREANL 162
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG------AIIEGADF 208
+ LN+ANL A + LT + GG A EGAD
Sbjct: 163 CGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADL 208
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 50/118 (42%), Gaps = 16/118 (13%)
Query: 111 AQFGSADLRKAVHVKE-----------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
A G ADL +A E N +AN AD+R +D + GA L A
Sbjct: 74 ADLGGADLTQAHLGAERPSRAATLNGANLSKANLERADLRHADVRRADLRGANLAHADMR 133
Query: 160 KANFTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+F G+DL L R L EANL A L L ++L GA + AD + A+
Sbjct: 134 DTDFQGSDLCHVVAPKALFIRANLREANLCGADLRDCHLNDANLAGASMHEADLTSAL 191
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 14/108 (12%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETR----GEFGIGSAAQFGSADLR----KAVHVKE- 126
L A + C N + LA + +EA+ G F + + A F ADLR ++V +E
Sbjct: 162 LCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADLRGSKLRSVSAQET 221
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
NFR AN T D+ + + GA L +A A+F+G +L+ M+
Sbjct: 222 NFRNANLTDVDL-----TNAVLGGAILRRADVTNADFSGVELASVTME 264
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 51/113 (45%), Gaps = 11/113 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A ADLR + R A+ +SA++ SD G+ +GA L A+ + T ADL
Sbjct: 18 ADTLAGADLRDMHLNGADLRGADLSSANLESSDLVGALLSGARLIDAILVATDLTDADLG 77
Query: 170 DTLMDR-----------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + LN ANL+ A L R L +D+ A + GA+ + A
Sbjct: 78 GADLTQAHLGAERPSRAATLNGANLSKANLERADLRHADVRRADLRGANLAHA 130
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 16/129 (12%)
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
R+ V E AD+R+ +G+ GA L A ++ GA +L
Sbjct: 7 RETGQVLERIDADTLAGADLRDMHLNGADLRGADLSSANLESSDLVGA----------LL 56
Query: 179 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 238
+ A L +A+LV T LT +DLGGA + A A++ + NG N R
Sbjct: 57 SGARLIDAILVATDLTDADLGGADLTQAHLG------AERPSRAATLNGANLSKANLERA 110
Query: 239 SLGCGNSRR 247
L + RR
Sbjct: 111 DLRHADVRR 119
>gi|307150160|ref|YP_003885544.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306980388|gb|ADN12269.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 215
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 3/119 (2%)
Query: 96 KYEAETRGEFGIGSAAQ---FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
+ E + RG +G+ Q ADLR + V AN A++ D G+ N A
Sbjct: 25 QLEIDLRGIYGLNLDLQGINLEKADLRGSYLVGAFLEGANLVGANLSGVDLKGANLNNAN 84
Query: 153 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L A +N +L L+ R LN L A L + L ++DL GAI+EGA ++A
Sbjct: 85 LTDAHLVGSNLREVNLKGALLTRAFLNGVYLNAANLDESDLRQADLRGAILEGASMTNA 143
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 66/155 (42%), Gaps = 8/155 (5%)
Query: 91 LADLNKYEAETRGEFGIGS--------AAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
L +N +A+ RG + +G+ A DL+ A N A+ +++RE +
Sbjct: 40 LQGINLEKADLRGSYLVGAFLEGANLVGANLSGVDLKGANLNNANLTDAHLVGSNLREVN 99
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G+ A+L AN +DL + +L A++TNA L L R GA
Sbjct: 100 LKGALLTRAFLNGVYLNAANLDESDLRQADLRGAILEGASMTNANLREADLRRCQFEGAN 159
Query: 203 IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 237
+EG+ DA++ + L + N N V ++
Sbjct: 160 LEGSLLIDAILQDQGQDHLIIWENFYNNSNTVESK 194
>gi|427711398|ref|YP_007060022.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427375527|gb|AFY59479.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 449
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 8/114 (7%)
Query: 92 ADLNKY---EAETRGEFGIGS----AAQFGSADLRKAVHVKENFRRANFTSADMRESDFS 144
ADL++ EA+ RG G A A+L V+ + R + AD+ ++ S
Sbjct: 115 ADLSRVDLAEADLRG-LGFNQVNLRGANLQGANLHNTEMVQADLGRVDLIEADLSNANLS 173
Query: 145 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
G+ +GA L +A AN +GADLS + + L+EANLT A L L ++DL
Sbjct: 174 GANLSGANLSRANLANANLSGADLSRVDLTEVKLSEANLTKANLSGAELGKADL 227
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 7/110 (6%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGAD 167
F A L A + + N AD+ + SG+K N GA L A+ K + + A+
Sbjct: 17 FNRASLSNAELINVDLSGINLARADLEWVNLSGTKLNNANLSGAELINAILIKTDLSQAN 76
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L+ + R L+ ANL+ L R+ L+ + L GA ++GAD S +DLA+
Sbjct: 77 LTGVNLSRTDLSWANLSYTNLSRSELSEATLRGANLQGADLSR--VDLAE 124
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ +LR A V N A AD+ ++D + +G L + AN +GA+L
Sbjct: 268 SRAKLVGTNLRGANLVGANLTGATLDGADLSQADMRSANLSGLLLNGVILRGANLSGANL 327
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + LN+ANL+ A L+ L+R+ + G + A S+A
Sbjct: 328 RE-----IELNQANLSRADLIEANLSRAKMAGVNLSRATLSEA 365
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A DL + + N +AN + A++ ++D S + L A N +GA+L
Sbjct: 193 SGADLSRVDLTEVKLSEANLTKANLSGAELGKADLSALELCDVNLSGA-----NLSGANL 247
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 226
++T + R L+ ANL L R L ++L GA + GA+ + A +D A QA + AN
Sbjct: 248 ANTNLSRADLSGANLRGVNLSRAKLVGTNLRGANLVGANLTGATLDGADLSQADMRSAN 306
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 12/114 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 163
S ++ A LR A N + A+ + D+ E+D G FN GA L+ A +
Sbjct: 98 SRSELSEATLRGA-----NLQGADLSRVDLAEADLRGLGFNQVNLRGANLQGANLHNTEM 152
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
ADL + L+ ANL+ A L L+R++L A + GAD S +DL +
Sbjct: 153 VQADLGRVDLIEADLSNANLSGANLSGANLSRANLANANLSGADLSR--VDLTE 204
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR+ +AN + AD+ E++ S +K G L +A +AN + A LS
Sbjct: 320 ANLSGANLREI-----ELNQANLSRADLIEANLSRAKMAGVNLSRATLSEANMSRATLSG 374
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R+ L+ + L L+ +DLG A + GA+ S A
Sbjct: 375 ATLSRVTLSGDTIGKVDLSGVNLSGADLGDAQLLGANLSRA 415
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 54/116 (46%), Gaps = 15/116 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 165
A A+L A+ +K + +AN T ++ +D S + + + L +A AN G
Sbjct: 55 ANLSGAELINAILIKTDLSQANLTGVNLSRTDLSWANLSYTNLSRSELSEATLRGANLQG 114
Query: 166 ADLSDTLM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
ADLS + +++ L ANL A L T + ++DLG + AD S+A
Sbjct: 115 ADLSRVDLAEADLRGLGFNQVNLRGANLQGANLHNTEMVQADLGRVDLIEADLSNA 170
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 50/109 (45%), Gaps = 7/109 (6%)
Query: 109 SAAQFGSADLRKAVHV------KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
S A+ G ADL A+ + N AN + ++ +D SG+ G L +A N
Sbjct: 218 SGAELGKADL-SALELCDVNLSGANLSGANLANTNLSRADLSGANLRGVNLSRAKLVGTN 276
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA+L + L+ A+L+ A + L+ L G I+ GA+ S A
Sbjct: 277 LRGANLVGANLTGATLDGADLSQADMRSANLSGLLLNGVILRGANLSGA 325
>gi|428208320|ref|YP_007092673.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428010241|gb|AFY88804.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 160
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DL A N RAN +A+++ ++ S S GA L A AN + A+L +TL
Sbjct: 42 DLSNAPLNNLNLSRANLRNANLQGANLSRSILAGADLSDANLETANISSANLFETL---- 97
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
L ANL +AVLV + LT + L A +EGA+ A++DL +
Sbjct: 98 -LIGANLKSAVLVNSNLTGAGLMAANLEGANLRGAIMDLVNSR 139
>gi|241663874|ref|YP_002982234.1| pentapeptide repeat-containing protein [Ralstonia pickettii 12D]
gi|240865901|gb|ACS63562.1| pentapeptide repeat protein [Ralstonia pickettii 12D]
Length = 277
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A A AD+ +D SG+ +GAYL +GADL
Sbjct: 82 SGAYLSGADLSGAYLSDAYLSGAYLRGADLSGADLSGADLSGAYL----------SGADL 131
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANG 227
S + L A L++A L L+ +DL GA + GAD SDA VI+ ++ YA
Sbjct: 132 SGAYLSDAYLRGAYLSDAYLSDADLSGADLSGAYLSGADLSDAPVIENIHQKV---YAAA 188
Query: 228 TNP 230
+ P
Sbjct: 189 SQP 191
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV---AY--KANFTGADLSDTL 172
+ +AV A+ + AD+ +D SG+ +GAYL A AY A +GADLS
Sbjct: 26 VEQAVKGGAYLSGADLSGADLSGADLSGAYLSGAYLSDAYLRGAYLSGAYLSGADLSGAY 85
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L+ A L++A L L +DL GA + GAD S A +
Sbjct: 86 LSGADLSGAYLSDAYLSGAYLRGADLSGADLSGADLSGAYL 126
>gi|229818699|ref|YP_002880225.1| pentapeptide repeat-containing protein [Beutenbergia cavernae DSM
12333]
gi|229564612|gb|ACQ78463.1| pentapeptide repeat-containing protein [Beutenbergia cavernae DSM
12333]
Length = 205
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 12/131 (9%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVH-----VKENFRRANFTSADMRESDFSG 145
D++ EA TRG + S F + + H V FRR NF A G
Sbjct: 37 FVDVDLTEASTRGT--VFSECVFSNVAFNVSHHASTAFVNCTFRRCNFFDATFTGCKLVG 94
Query: 146 SKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
+ F+G +++ A F GADL + L E++LT+A R+V DL G
Sbjct: 95 AMFDGCSFGIMKVDRGDWSFAGFPGADLEGVEFTGVRLRESDLTHARCARSVFAGCDLSG 154
Query: 201 AIIEGADFSDA 211
+ + GADF+DA
Sbjct: 155 SWLHGADFTDA 165
>gi|209523090|ref|ZP_03271646.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376002100|ref|ZP_09779947.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423066405|ref|ZP_17055195.1| hypothetical protein SPLC1_S430130 [Arthrospira platensis C1]
gi|209496241|gb|EDZ96540.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375329486|emb|CCE15700.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406712077|gb|EKD07268.1| hypothetical protein SPLC1_S430130 [Arthrospira platensis C1]
Length = 336
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 9/105 (8%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + V +F+ AN +A ++E++ GS F L+ A +KAN T + +
Sbjct: 140 ADLEQVTLVDTDFKEANLKTAKLQEANLKGSTFELTQLQGANLWKANLQECFFLLTQLQK 199
Query: 176 MVLNEANLTNAV-----LVRTVLTRSDLGGAII----EGADFSDA 211
+ LN ANL NA L+ L +++L GA I +GA+F +A
Sbjct: 200 VNLNAANLENAELQGVNLLEANLQQANLQGAYILGNLQGANFQEA 244
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSA---------DMRESDFSGSKFNGAYLEKAVAY 159
+AA +A+L+ ++ N ++AN A + +E++ G+ GAYL+ A
Sbjct: 203 NAANLENAELQGVNLLEANLQQANLQGAYILGNLQGANFQEANLKGANLQGAYLQDANFK 262
Query: 160 KANFTGADLSDTLMDRMVLNEANLTNAVL 188
+AN G +L D + + EANL +A L
Sbjct: 263 RANLRGVNLKDANLTGVNFEEANLQSANL 291
>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
Length = 268
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 86/198 (43%), Gaps = 52/198 (26%)
Query: 81 VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
VA+ S ++ ADL + F IG A F A+LR A+ + N +F+SAD+R+
Sbjct: 93 VANLSQSVLTHADL------SHAHF-IG--ADFSGANLRGAIVTEANLIGTDFSSADLRD 143
Query: 141 SDFSGSKF------------------------------NGAYLEKAVAYKANFTGADLSD 170
+D +G+K GAYL KA YKAN A L
Sbjct: 144 ADLAGAKLIRSNLCFANLIAANFIAVDFSEANLYQAEVMGAYLYKANFYKANLHQAHLGG 203
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
+ R L A+L A L LT ++L GA + GA+ A + NG N
Sbjct: 204 AYLFRANLTAADLRGADLAWANLTSANLAGANLSGANLRGANL------------NGAN- 250
Query: 231 ITGVSTRKSLGCGNSRRN 248
+ GV+ ++++ +SR +
Sbjct: 251 LNGVNLQETIMPDSSRHD 268
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 59/126 (46%), Gaps = 17/126 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 158
S A SA+L A + N ANF T AD+ + F G+ F+GA L A+
Sbjct: 67 SGADLSSANLHHAKLSEANLSAANFSVANLSQSVLTHADLSHAHFIGADFSGANLRGAIV 126
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+AN G D S L +A+L A L+R+ L ++L A DFS+A +L Q
Sbjct: 127 TEANLIGTDFSSA-----DLRDADLAGAKLIRSNLCFANLIAANFIAVDFSEA--NLYQA 179
Query: 219 QALCKY 224
+ + Y
Sbjct: 180 EVMGAY 185
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 54/98 (55%), Gaps = 5/98 (5%)
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
+K ++ + +N ++R ++ +G+ + L +A+ +AN +GADLS + L
Sbjct: 22 QKNPQIEPDLSTSNLQENNLRGANLAGANLSRVDLSRALLIRANLSGADLSSANLHHAKL 81
Query: 179 NEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDA 211
+EANL+ A L ++VLT +DL A GADFS A
Sbjct: 82 SEANLSAANFSVANLSQSVLTHADLSHAHFIGADFSGA 119
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 50/100 (50%), Gaps = 5/100 (5%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
+LR A N R + + A + ++ SG+ + A L A +AN + A+ S + +
Sbjct: 40 NLRGANLAGANLSRVDLSRALLIRANLSGADLSSANLHHAKLSEANLSAANFSVANLSQS 99
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 211
VL A+L++A + + ++L GAI+ G DFS A
Sbjct: 100 VLTHADLSHAHFIGADFSGANLRGAIVTEANLIGTDFSSA 139
>gi|359460749|ref|ZP_09249312.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 299
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 10/111 (9%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSDT 171
DLR A +N + A+ A+++ ++ G + A LE A AN + A L+ T
Sbjct: 63 DLRGANLQDQNLKGASLQGANLQGANLQGVNLDDANLESANLKSANLSKATLRRASLTTT 122
Query: 172 LMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVIDLAQ 217
L L +ANLT A LV+T L R++L A +E ADFS AV++ Q
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQ 173
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 116 ADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+DLR+A K N + A +A++ E++ G+K A L+ A ADL
Sbjct: 181 SDLREANFNKSNLKNATLNQVYLANANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRK 240
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ + L+EA+L++A L + + +L GA + AD S A +
Sbjct: 241 ASLESVNLSEADLSSAHLGKIAMKDVNLRGANLSNADLSGAKL 283
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 58/122 (47%), Gaps = 14/122 (11%)
Query: 107 IGSAAQFGSADLRKAVHVKE-----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-- 159
+ A A+L +A VK + RRAN A + ++DFS + + V +
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQGIRRVYFSD 182
Query: 160 --KANFTGADLSDTLMDRMVL-----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
+ANF ++L + ++++ L +EANL A L + L ++L GA + AD A
Sbjct: 183 LREANFNKSNLKNATLNQVYLANANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRKAS 242
Query: 213 ID 214
++
Sbjct: 243 LE 244
>gi|334117108|ref|ZP_08491200.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461928|gb|EGK90533.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 509
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 55/117 (47%), Gaps = 9/117 (7%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADL K E S +AD+R+A + N AN + A+++ +D +G+ NGA
Sbjct: 171 ADLTKAEL---------SGVNLSNADMRQASLQQVNLSSANLSGANLKWADLTGANLNGA 221
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L A AN GADL +T + A+LT L+ +DL GA + GA
Sbjct: 222 DLSFAKLSGANLNGADLRNTNLGSASFVHADLTETNLINADWVGADLRGATLTGAKL 278
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLR+ + N AN + A++R + S + F A L A KA +G +L
Sbjct: 124 SGANLTEADLREVKLTEANLCGANLSGANLRGASASSANFQEANLHGADLTKAELSGVNL 183
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
S+ M + L + NL++A L L +DL GA + GAD S
Sbjct: 184 SNADMRQASLQQVNLSSANLSGANLKWADLTGANLNGADLS 224
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 163
S+A F A+L A K N ++ADMR++ + S + +GA L+ A AN
Sbjct: 159 SSANFQEANLHGADLTKAELSGVNLSNADMRQASLQQVNLSSANLSGANLKWADLTGANL 218
Query: 164 TGADLSDTLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GADLS + LN A+L N A V LT ++L A GAD A +
Sbjct: 219 NGADLSFAKLSGANLNGADLRNTNLGSASFVHADLTETNLINADWVGADLRGATL 273
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 49/100 (49%), Gaps = 5/100 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR A NF+ AN AD+ +++ SG + A + +A + N + A+LS
Sbjct: 146 ANLSGANLRGASASSANFQEANLHGADLTKAELSGVNLSNADMRQASLQQVNLSSANLSG 205
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
L A+LT A L L+ + L GA + GAD +
Sbjct: 206 A-----NLKWADLTGANLNGADLSFAKLSGANLNGADLRN 240
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+FT ++ E++ S + + L +A + N +GA+LS+ L+EANL A L T
Sbjct: 22 DFTGINLNEANLSRINLSQSILRRASLFVTNLSGANLSEA-----NLSEANLNVARLSST 76
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQ 217
L+R+ L GA I A+ A + AQ
Sbjct: 77 NLSRAILNGATINVANLVRADLSAAQ 102
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 2/136 (1%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L+ A ++ + N++ L+ N A G + A ADL A ++ + R+
Sbjct: 58 LSEANLSEANLNVARLSSTNLSRAILNG--ATINVANLVRADLSAAQLIRASLIRSELVR 115
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
++ +++FSG+ A L + +AN GA+LS + + AN A L LT+
Sbjct: 116 CELSKTNFSGANLTEADLREVKLTEANLCGANLSGANLRGASASSANFQEANLHGADLTK 175
Query: 196 SDLGGAIIEGADFSDA 211
++L G + AD A
Sbjct: 176 AELSGVNLSNADMRQA 191
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN------- 179
N R N + + +R + + +GA L +A +AN A LS T + R +LN
Sbjct: 32 NLSRINLSQSILRRASLFVTNLSGANLSEANLSEANLNVARLSSTNLSRAILNGATINVA 91
Query: 180 ---EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A+L+ A L+R L RS+L + +FS A
Sbjct: 92 NLVRADLSAAQLIRASLIRSELVRCELSKTNFSGA 126
>gi|239816752|ref|YP_002945662.1| pentapeptide repeat-containing protein [Variovorax paradoxus S110]
gi|239803329|gb|ACS20396.1| pentapeptide repeat protein [Variovorax paradoxus S110]
Length = 866
Score = 51.2 bits (121), Expect = 4e-04, Method: Composition-based stats.
Identities = 39/113 (34%), Positives = 50/113 (44%), Gaps = 9/113 (7%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L +A NF AD+ + D G+ F+GA LE A N A LSD + V
Sbjct: 538 LAEAAPGARNFSGMRLVGADLSDMDLRGADFSGAALEDA-----NLDNAQLSDANFNGAV 592
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ----KQALCKYAN 226
L A L+ L ++LGGA E ADFS A + A + A C AN
Sbjct: 593 LARARLSRTSLASATFRNANLGGAHCEFADFSGADLSSANCEKTRFASCSMAN 645
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 21/144 (14%)
Query: 111 AQFGSADLRKAVHVKENFR----------RANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
A F ADL A K F + FT+++M DF GS ++ +L K
Sbjct: 621 ADFSGADLSSANCEKTRFASCSMANTVLDQTRFTASEMSHCDFRGSDWHQVFLTKLRMSG 680
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 220
F GA + L + NA LVR SD ++ DFSDA +D
Sbjct: 681 MAFDGASFQQVVWLECTLADVRFANASLVRCSFVTSDCSRSV----DFSDARLD------ 730
Query: 221 LCKYANGTNPITGVSTRKSLG-CG 243
C +A+G+ V R +L CG
Sbjct: 731 ACSFAHGSTLAGAVLRRAALKQCG 754
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 10/110 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFT 164
+ A LR+A + R AD+RE+ DFS GA LE+ VA ++ F
Sbjct: 737 GSTLAGAVLRRAALKQCGLRTTPLQQADLREARLDNCDFSECALQGAKLERLVAGESLFV 796
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
ADL+ L ANL +A + V ++DL GA + D S ++ID
Sbjct: 797 RADLTGA-----SLRGANLIDANFSKAVFVQADLSGANLFRTDVSQSLID 841
Score = 40.8 bits (94), Expect = 0.65, Method: Composition-based stats.
Identities = 28/100 (28%), Positives = 44/100 (44%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A NF A A + + + + F A L A A+F+GADL
Sbjct: 569 SGAALEDANLDNAQLSDANFNGAVLARARLSRTSLASATFRNANLGGAHCEFADFSGADL 628
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
S ++ ++ N VL +T T S++ G+D+
Sbjct: 629 SSANCEKTRFASCSMANTVLDQTRFTASEMSHCDFRGSDW 668
>gi|134280632|ref|ZP_01767342.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
gi|134247654|gb|EBA47738.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
Length = 825
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
C+ + A A L+ A R E + SAA G ++ + A+ T AD+ D
Sbjct: 476 QCAQHQDAPARLHGAAARARREC-VASAAAAG-----------QSLQGADLTGADLSGMD 523
Query: 143 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
G++ GA LE A A+ TGADLS R VL A+LT A LV LT ++L A
Sbjct: 524 LRGARLAGAMLENADLSGADLTGADLS-----RTVLVRADLTRAKLVDARLTAANLSLAH 578
Query: 203 IEGADFS 209
E DFS
Sbjct: 579 CERTDFS 585
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
>gi|440683010|ref|YP_007157805.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
cylindrica PCC 7122]
gi|428680129|gb|AFZ58895.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
cylindrica PCC 7122]
Length = 535
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L+ V + +F N + ++ +D SG+ F+ A L++ N GA+L +T R
Sbjct: 401 LQAYVKGRRDFASYNISMLSLQGADLSGTNFHHAQLKQT-----NLQGANLQNTDFGRAS 455
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
L +ANL +A L + L+ +DL GA + GAD S A + A
Sbjct: 456 LMQANLRDANLTKAYLSNADLEGADLRGADLSYAYMSQA 494
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 49/101 (48%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF A +++ ++ + F A L +A AN T A LS+ ++ L A+L+ A
Sbjct: 430 NFHHAQLKQTNLQGANLQNTDFGRASLMQANLRDANLTKAYLSNADLEGADLRGADLSYA 489
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
+ + L ++L GA + GA +D I LA+ L NG
Sbjct: 490 YMSQANLRGANLCGANLTGAKVTDEQIALAKTNWLTVRPNG 530
>gi|428217541|ref|YP_007102006.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427989323|gb|AFY69578.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 353
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 4/126 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A FGSA+L A + N +AN AD+ ++D G+K G L +A +AN D +
Sbjct: 54 ANFGSANLLGANLSEANLTKANLREADLYKADLGGAKLIGTSLIRAYLREANLRDCDCNS 113
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230
T + L E L NA L L+ S+L A + A DA++ A+ YAN
Sbjct: 114 TALIGADLTEVCLENADLTGANLSESNLSSANLNFAILKDAIL----SNAIASYANMNET 169
Query: 231 ITGVST 236
I ++
Sbjct: 170 IMDMAV 175
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 14/145 (9%)
Query: 82 ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES 141
A S+ I++ A++N ET + + AQ D A V+ + R AN AD+ +
Sbjct: 154 AILSNAIASYANMN----ETIMDMAVLDRAQLNFVDFNGAAMVQASLRHANLCGADLSGA 209
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE----------ANLTNAVLVRT 191
+ S + +GA L +A+ AN + A+LS ++ L+ ANLT+A+L
Sbjct: 210 NLSYANLSGANLCEAILSNANLSHANLSGAILRDASLSNANLSGADLSGANLTDAILSDA 269
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLA 216
L+R++L AI+ GA A ++ A
Sbjct: 270 DLSRANLSEAILAGAQLISAKLEAA 294
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 112 QFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
+ A+LR+A + N R ANF SA++ ++ S + A L +A YKA+ GA
Sbjct: 30 ELSGANLRRATLREVNLSGVDLRWANFGSANLLGANLSEANLTKANLREADLYKADLGGA 89
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L T + R L EANL + T L +DL +E AD + A
Sbjct: 90 KLIGTSLIRAYLREANLRDCDCNSTALIGADLTEVCLENADLTGA 134
Score = 43.9 bits (102), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL---- 183
A+ T A++ ES+ S + N A L+ A+ A + A++++T+MD VL+ A L
Sbjct: 126 LENADLTGANLSESNLSSANLNFAILKDAILSNAIASYANMNETIMDMAVLDRAQLNFVD 185
Query: 184 -TNAVLVRTVLTRSDLGGAIIEGADFS 209
A +V+ L ++L GA + GA+ S
Sbjct: 186 FNGAAMVQASLRHANLCGADLSGANLS 212
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 163
S A A+L +A+ N AN + A +R++ + SG+ +GA L A+ A+
Sbjct: 212 SYANLSGANLCEAILSNANLSHANLSGAILRDASLSNANLSGADLSGANLTDAILSDADL 271
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ A+LS+ ++ L A L A LV T L +++L A ++G DA
Sbjct: 272 SRANLSEAILAGAQLISAKLEAAFLVGTDLIKANLRLASLKGVSLKDA 319
>gi|427418755|ref|ZP_18908938.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425761468|gb|EKV02321.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 312
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
G + AQ DLR+A N A+FT A + S+ + + GA E A + N
Sbjct: 170 GSYAQLYMAQIQGCDLRQA-----NLNHADFTQAVLTRSNLNQATLIGANGEAATLEQVN 224
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI------DLA 216
T A+LS + L ANL AV+V LT S+L A ++ A +AVI D+
Sbjct: 225 LTRANLS-----HVNLTSANLQQAVMVHATLTESNLSEANLQNATLDNAVIRQCYLRDIN 279
Query: 217 QKQALCKYANGTNPI 231
+QA + + PI
Sbjct: 280 WQQASVQGTHFCQPI 294
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 15/119 (12%)
Query: 111 AQFGSADLRKAVHVKENF-----RRANFTS-----ADMRESDFSGSKFNGAYLEKAVAYK 160
AQ +L A+ N RRAN A + ++ S + GA L +A YK
Sbjct: 68 AQMSDVNLSGAILTSANLTATSLRRANLLGAVLMFATLEQATLSHANLAGANLTEAELYK 127
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
ANFT ADLS ++ R A+L NA R + + + GA + G D S A + +AQ Q
Sbjct: 128 ANFTEADLSHAMLRR-----ASLVNANFHRACMKQVNANGAELYGIDGSYAQLYMAQIQ 181
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 50/104 (48%), Gaps = 10/104 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADL A+ + + ANF A M++ + +G++ G A Y A G DL
Sbjct: 128 ANFTEADLSHAMLRRASLVNANFHRACMKQVNANGAELYGIDGSYAQLYMAQIQGCDLR- 186
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ANL +A + VLTRS+L A + GA+ A ++
Sbjct: 187 ---------QANLNHADFTQAVLTRSNLNQATLIGANGEAATLE 221
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
EN R +FT AD+ SG+ L A N +GA L+ + L ANL
Sbjct: 43 ENLERVDFTRADL-----SGANLERTQLIDAQMSDVNLSGAILTSANLTATSLRRANLLG 97
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDA 211
AVL+ L ++ L A + GA+ ++A
Sbjct: 98 AVLMFATLEQATLSHANLAGANLTEA 123
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 10/104 (9%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F ADL A N R A M + + SG+ A L +AN GA L
Sbjct: 50 FTRADLSGA-----NLERTQLIDAQMSDVNLSGAILTSANLTATSLRRANLLGAVLMFAT 104
Query: 173 MDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+++ L+ ANL T A L + T +DL A++ A +A
Sbjct: 105 LEQATLSHANLAGANLTEAELYKANFTEADLSHAMLRRASLVNA 148
>gi|443475349|ref|ZP_21065301.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019796|gb|ELS33834.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 243
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 15/120 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVA 158
S ++ ADL +A + N ANF+ +D+ E+D S G+ GA L KA
Sbjct: 53 SNSKLNGADLNRAKLYRSNLVSANFSGSDLGETDLSEANLSDARLYGANLYGAILNKAKL 112
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 213
+A TGA++ ++R L+EA L +A L ++L R+DL A + GA+ + A++
Sbjct: 113 PRAKLTGANMGKAKLNRADLSEAILRDARLFGASLNESMLQRADLSRASLNGANLNKAML 172
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 67/136 (49%), Gaps = 21/136 (15%)
Query: 94 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
+N Y++ R G+ S DL A + AN + + + ++ S SK NGA
Sbjct: 7 INSYQSGRRNFAGVNLSKTDMNGIDLSNA-----DLSGANLSESSLYGANLSNSKLNGAD 61
Query: 153 LEKAVAYK-----ANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSD 197
L +A Y+ ANF+G+DL +T + L++ ANL A+L R LT ++
Sbjct: 62 LNRAKLYRSNLVSANFSGSDLGETDLSEANLSDARLYGANLYGAILNKAKLPRAKLTGAN 121
Query: 198 LGGAIIEGADFSDAVI 213
+G A + AD S+A++
Sbjct: 122 MGKAKLNRADLSEAIL 137
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 60/119 (50%), Gaps = 2/119 (1%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L+D Y A G I + A+ A L A K RA+ + A +R++ G+ N
Sbjct: 92 LSDARLYGANLYG--AILNKAKLPRAKLTGANMGKAKLNRADLSEAILRDARLFGASLNE 149
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L++A +A+ GA+L+ ++ + L A+L A L L+ +DL A ++G+D +
Sbjct: 150 SMLQRADLSRASLNGANLNKAMLCEVDLTFASLYGASLCDADLSEADLTSANLQGSDLT 208
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 55/109 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ ADL +A+ A+ + ++ +D S + NGA L KA+ + + T A L
Sbjct: 125 AKLNRADLSEAILRDARLFGASLNESMLQRADLSRASLNGANLNKAMLCEVDLTFASLYG 184
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+ L+EA+LT+A L + LTR + A + + F+D V + Q +
Sbjct: 185 ASLCDADLSEADLTSANLQGSDLTRVNFYKANLSKSKFADTVTEGMQTR 233
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 109 SAAQFGSADLRKAV--HVKENFRR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A A+L KA+ V F A+ AD+ E+D + + G+ L + YKAN
Sbjct: 158 SRASLNGANLNKAMLCEVDLTFASLYGASLCDADLSEADLTSANLQGSDLTRVNFYKANL 217
Query: 164 TGADLSDTLMDRMVLNEANLTNAV 187
+ + +DT+ + M EANLT +
Sbjct: 218 SKSKFADTVTEGMQTREANLTGII 241
>gi|386720786|ref|YP_006187111.1| Pentapeptide repeat-containing protein [Paenibacillus mucilaginosus
K02]
gi|384087910|gb|AFH59346.1| Pentapeptide repeat-containing protein [Paenibacillus mucilaginosus
K02]
Length = 201
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 61/122 (50%), Gaps = 10/122 (8%)
Query: 112 QFGSADL-----RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
+FG A+L R+ + +F +A + + D+S + L K F A
Sbjct: 73 KFGGANLFVSKFRECKMIGSDFAKAQLDGITIEQGDWSYTNLRQTNLGKQDLRNVKFMEA 132
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQKQAL 221
DLS+ +++ L EA+LT A+L R L+ SDL GA ++G DF A +DLAQ AL
Sbjct: 133 DLSECNLEKANLREADLTRALLGRARLSGSDLRGAKMDGVDFRAMDVKGARMDLAQAVAL 192
Query: 222 CK 223
+
Sbjct: 193 AR 194
>gi|428200510|ref|YP_007079099.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427977942|gb|AFY75542.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 174
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+A + AN + AD++E++ SG+ + A L AV KAN +GA L
Sbjct: 60 ASLDRADLREACLIV-----ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRG 114
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++ + L E+NL A L L +DL GA + AD S
Sbjct: 115 AILAGVNLAESNLRGANLQGANLYGADLRGADLRNADLS 153
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 9/136 (6%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KF 148
L +Y A R +F S DL +A N RAN + A +R ++ SG+
Sbjct: 7 LTRYAAGER-DF---SRIDLHGVDLAQAKLSGANLIRANLSGALLRGANLSGAFLVVASL 62
Query: 149 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ A L +A AN +GADL + + L+EA LT AVL + L+ + L GAI+ G +
Sbjct: 63 DRADLREACLIVANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRGAILAGVNL 122
Query: 209 SDAVIDLAQKQALCKY 224
+++ + A Q Y
Sbjct: 123 AESNLRGANLQGANLY 138
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 5/83 (6%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 165
A A+L +AV ++AN + A +R + +G S GA L+ A Y A+ G
Sbjct: 85 ANLSGANLSEAVLTGAVLQKANLSGAKLRGAILAGVNLAESNLRGANLQGANLYGADLRG 144
Query: 166 ADLSDTLMDRMVLNEANLTNAVL 188
ADL + + R L ANL ++
Sbjct: 145 ADLRNADLSRTNLRGANLERTIM 167
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL++A N A T A +++++ SG+K GA L ++N GA+L
Sbjct: 75 ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRGAILAGVNLAESNLRGANLQG 134
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+ L A+L NA L RT L ++L
Sbjct: 135 ANLYGADLRGADLRNADLSRTNLRGANL 162
>gi|409993775|ref|ZP_11276905.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
Paraca]
gi|291572160|dbj|BAI94432.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935380|gb|EKN76914.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
Paraca]
Length = 741
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 52/101 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +LR A N A+ AD+R +D G+ GA L +A Y+AN T + +
Sbjct: 581 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITEGNFNG 640
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R+ N ++L +A L+R L++S L A + GA+ S +
Sbjct: 641 AKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQS 681
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 48/103 (46%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL+ A N RANF A++ E +F+G+K ++ A DLS
Sbjct: 606 ADLRGADLQGANLKGANLYRANFYQANITEGNFNGAKLRRVNFNRSDLRDAELIRVDLSK 665
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ + L ANL+ + L LTR+DL GAD S +I
Sbjct: 666 SRLRSACLRGANLSQSNLKGADLTRADLSNVKFTGADLSCTLI 708
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 7/121 (5%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L +N A RG G A ADLR A N + AN A+ +++ + FNG
Sbjct: 583 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITEGNFNG 640
Query: 151 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
A L + NF +DL D + R+ L+++ L +A L L++S+L GA + AD S+
Sbjct: 641 AKLR-----RVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGADLTRADLSN 695
Query: 211 A 211
Sbjct: 696 V 696
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 95 NKYEAE-TRGEFGIGSA--AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N Y+A T G F F +DLR A ++ + ++ SA +R ++ S S GA
Sbjct: 627 NFYQANITEGNFNGAKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGA 686
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 202
L +A FTGADLS TL+ L+ A+L NA L + L S+ G I
Sbjct: 687 DLTRADLSNVKFTGADLSCTLIRHANLSGADLRNAKLEKANLFGSNTVGCI 737
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 164
+ A A+L++A VK + RRA+ ++ + + +K A L A +AN
Sbjct: 494 AVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSACLIEANLMAASL 553
Query: 165 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 554 EGCDLKGADLSNANLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 606
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
+QF DLR+ N ++ + T ADMRE + G L KAN + A L+
Sbjct: 431 SQFQGLDLRQTNLKGVNLKKMDLTGADMREKNLEGMSLIQLDLRLVNLAKANLSHAILNG 490
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
+ + L ANL A LV+ L R+DL + A + A + A ++ C
Sbjct: 491 SKLAVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSAC 542
>gi|218442155|ref|YP_002380484.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218174883|gb|ACK73616.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 180
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 14/126 (11%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG--- 150
L++Y+A+ R F L +A V N +R NFT D+ SD +G+ +G
Sbjct: 7 LHRYQAQER---------NFEELSLHQANLVGANLQRINFTRTDLSGSDLNGADLSGSCL 57
Query: 151 --AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A L A KAN GA+L + + L EANL A L + L ++L GA + GA+
Sbjct: 58 KQANLTDADLEKANLVGANLVEVNLIGADLKEANLAGADLTKADLRCANLEGANLTGANL 117
Query: 209 SDAVID 214
+ ++
Sbjct: 118 TQVNLE 123
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 44/87 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA V N N AD++E++ +G+ A L A AN TGA+L+
Sbjct: 60 ANLTDADLEKANLVGANLVEVNLIGADLKEANLAGADLTKADLRCANLEGANLTGANLTQ 119
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSD 197
++ L ANL+ A ++ T L +D
Sbjct: 120 VNLEGANLKGANLSEAQIIGTDLNVAD 146
>gi|153873268|ref|ZP_02001907.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
gi|152070268|gb|EDN68095.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
Length = 159
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 49/84 (58%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
F RAN + D+ +D SG+ +GA L +A ANFT A+LS+ + +ANLT+A
Sbjct: 46 FFRANLSHVDLTNTDLSGANLSGANLNEANLTNANFTKANLSEANLCESYFAKANLTDAN 105
Query: 188 LVRTVLTRSDLGGAIIEGADFSDA 211
L LT++ L + + GA+ S+A
Sbjct: 106 LSEANLTKAYLIESFLSGANLSEA 129
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 10/106 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L +A NF +AN + A++ ES Y KA AN + A+L
Sbjct: 62 SGANLSGANLNEANLTNANFTKANLSEANLCES----------YFAKANLTDANLSEANL 111
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+ + L+ ANL+ A L R+ L SDL A + GA+ A D
Sbjct: 112 TKAYLIESFLSGANLSEANLFRSNLFESDLFRANLTGANLYKAKFD 157
>gi|428773363|ref|YP_007165151.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428687642|gb|AFZ47502.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 319
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L A ANFT AD+ E++ SG A L +A +N G
Sbjct: 113 SNANLTGANLTGATLTGATLTGANFTRADLTEANLSGLNLMEADLTRANLSASNLQGCSF 172
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ R L EA+L N++L L R++L A + GA+FS AV+
Sbjct: 173 NEANFSRADLREADLKNSILEGVFLHRANLSRANLRGANFSGAVL 217
>gi|428307821|ref|YP_007144646.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428249356|gb|AFZ15136.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 263
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 22/107 (20%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKA----VHVKE------NFRRANFTSADMR-----ES 141
YEAE G A F ADL KA H+ E N +A AD+R ++
Sbjct: 157 YEAELIG-------AYFYKADLFKANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKA 209
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
+F+G+ GA L A KANFTGA+L+D +D + L++ANL A++
Sbjct: 210 NFTGANLVGANLRGANLSKANFTGANLTDANLDTVNLHKANLEGAIM 256
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYK 160
A A+L+ A ++ N R A+F +A++ ++ S S N GAY KA +K
Sbjct: 114 ANLMGANLKGADLIEANMRGADFINANLMSANLSNSFLNYAKFYEAELIGAYFYKADLFK 173
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN + A L + + L++A L A L T L++++ GA + GA+ A
Sbjct: 174 ANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKANFTGANLVGANLRGA 224
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 165
A+F A+L A K + +AN ++A + E+ G+ + A L+KA KANFTG
Sbjct: 154 AKFYEAELIGAYFYKADLFKANLSNAHLGEAYLFGANLSQAELKKADLRWTNLSKANFTG 213
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L + L++AN T A L L +L A +EGA D I
Sbjct: 214 ANLVGANLRGANLSKANFTGANLTDANLDTVNLHKANLEGAIMPDGTI 261
>gi|157825867|ref|YP_001493587.1| hypothetical protein A1C_04030 [Rickettsia akari str. Hartford]
gi|157799825|gb|ABV75079.1| Uncharacterized low-complexity protein [Rickettsia akari str.
Hartford]
Length = 954
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 59/121 (48%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKANLDKANLEYADLTNATLTNATAQFVKLSNATLEKAEA-----EGLNISDV 609
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS-----DAVIDLAQ-KQALCKYA 225
+ + EAN N ++ R LT++D A++E AD DA+ A KQA K A
Sbjct: 610 IATNINAKEANFKNVIMQRADLTKADFTKAMLENADMQAVEALDAIFKEANLKQANLKAA 669
Query: 226 N 226
N
Sbjct: 670 N 670
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 41/168 (24%), Positives = 70/168 (41%), Gaps = 14/168 (8%)
Query: 51 PDCSNNQCAGPYAKLKNWR--VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG 108
PD S+ +G N + +F S L +++C+ + + N + A +
Sbjct: 342 PDLSDINLSGKTLTNLNMKNTLFASANLENINISNCNLDFTNFEGANLHNAVFQDV--TA 399
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-------- 160
A F ADL+ + + RA D+ E++ + SKFN + A A K
Sbjct: 400 RNAVFLFADLKNSKIENSDMSRAYMPKVDLSEAEVTNSKFNAIMMVNADAEKLIMQDSEW 459
Query: 161 --ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
+N TG L+ M R+ + L NA+L + + +DL A + A
Sbjct: 460 QNSNLTGISLAYADMQRVQMQGVILNNALLDQANIVSTDLENAFMNNA 507
>gi|434393337|ref|YP_007128284.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428265178|gb|AFZ31124.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 213
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 107 IGSAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
I + F DL +A K N R NFT A + ++D SGS + L +A A
Sbjct: 105 IATQVGFLETDLERANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNA 164
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
N +GA+LS+ + R L ANL A L L+R+ L GAI+
Sbjct: 165 NLSGANLSNADLSRADLRNANLIGANLDGANLSRAKLEGAIM 206
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 51/91 (56%)
Query: 124 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 183
++ + RAN ++R+ D S + F A LEKA +N + +LS + L+ ANL
Sbjct: 112 LETDLERANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNANLSGANL 171
Query: 184 TNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+NA L R L ++L GA ++GA+ S A ++
Sbjct: 172 SNADLSRADLRNANLIGANLDGANLSRAKLE 202
>gi|428301952|ref|YP_007140258.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238496|gb|AFZ04286.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 267
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 54/113 (47%), Gaps = 15/113 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L+ A+ RAN A++ +D SG+ A LEKA N GA L
Sbjct: 57 SKANLQGANLQGAILNYALLGRANLEGANLSNADLSGTFLGEANLEKA-----NLQGAKL 111
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA----------IIEGADFSDA 211
S + + L ANL+NA L T LTR++L GA I+ AD DA
Sbjct: 112 SQAFLYKANLEGANLSNAYLSGTALTRANLRGANLRKSVIFVSILSEADLQDA 164
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 61/122 (50%), Gaps = 7/122 (5%)
Query: 95 NKYEAETRGEFGIGSA----AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
N A+ G F +G A A A L +A K N AN ++A + + + + G
Sbjct: 85 NLSNADLSGTF-LGEANLEKANLQGAKLSQAFLYKANLEGANLSNAYLSGTALTRANLRG 143
Query: 151 AYLEKAVAYKANFTGADLSD-TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A L K+V + + + ADL D LM+ +L +NL A L R LT++ L AI++ A+ +
Sbjct: 144 ANLRKSVIFVSILSEADLQDANLMEAKLL-SSNLERANLARANLTKAQLHNAILQDANLT 202
Query: 210 DA 211
A
Sbjct: 203 QA 204
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 123 HVKE----------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
HVK+ + A+ A + +++ G+ GA L A+ +AN GA+LS+
Sbjct: 31 HVKQLLNTNSCPSCDLSNADLYGAKLSKANLQGANLQGAILNYALLGRANLEGANLSNAD 90
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L EANL A L L+++ L A +EGA+ S+A +
Sbjct: 91 LSGTFLGEANLEKANLQGAKLSQAFLYKANLEGANLSNAYL 131
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LRK+V + + AD+++++ +K + LE+A +AN T A L +
Sbjct: 139 ANLRGANLRKSV-----IFVSILSEADLQDANLMEAKLLSSNLERANLARANLTKAQLHN 193
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
+L +ANLT A LV+ L ++ L A + AD + A++ A
Sbjct: 194 A-----ILQDANLTQAKLVKAELNQASLARANLLNADLTGAILQQA 234
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S A A+L +A + N RAN A++ + A L A+ AN T A
Sbjct: 155 ILSEADLQDANLMEAKLLSSNLERANLARANLTK----------AQLHNAILQDANLTQA 204
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L +++ L ANL NA L +L ++ L A I GA F +A +
Sbjct: 205 KLVKAELNQASLARANLLNADLTGAILQQATLYLANINGAIFKEAFL 251
>gi|433593191|ref|YP_007282677.1| putative low-complexity protein [Natrinema pellirubrum DSM 15624]
gi|448335744|ref|ZP_21524879.1| hypothetical protein C488_20057 [Natrinema pellirubrum DSM 15624]
gi|433308229|gb|AGB34039.1| putative low-complexity protein [Natrinema pellirubrum DSM 15624]
gi|445615954|gb|ELY69591.1| hypothetical protein C488_20057 [Natrinema pellirubrum DSM 15624]
Length = 644
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F + DLR + N +FT A +RE+ F+ S +GA L +A+ T ADLS+ L
Sbjct: 24 FSNTDLRGTTFGEANLADTDFTEAILREAQFAASDLSGASL-----TQADLTDADLSNAL 78
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L ANL NA L + L + L A +EGA F +A +
Sbjct: 79 APMVNLTGANLRNADLANSDLRQVTLTNAHLEGASFREARL 119
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 15/121 (12%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMR----------ESDFSGSKFNGAYLEK 155
A+ SA LR A V + R + +FT D+R E++F G++ A L +
Sbjct: 177 ARLQSATLRGATLVHSDLRSTFCRQTDFTECDLRNVTAERMYAPEAEFDGARLTEANLRQ 236
Query: 156 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
A A+F G D S + L+ + ++A L ++DL GA + GAD S A +
Sbjct: 237 AEVTSASFDGVDASGIDVTEADLSATDWSDADLSGATFDQADLSGATLSGADLSGATFNQ 296
Query: 216 A 216
A
Sbjct: 297 A 297
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
SA + ADL A F +A+ + A + +D SG+ FN A L+ A A+ T +L
Sbjct: 260 SATDWSDADLSGAT-----FDQADLSGATLSGADLSGATFNQATLKDADLSGADLTDVEL 314
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
SDT + +L E L T +DL GA I F D+
Sbjct: 315 SDTALTGALLRETRLAPETACGADFTEADLTGADISSGQFDDSTF 359
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 77/174 (44%), Gaps = 28/174 (16%)
Query: 63 AKLKNWRVFVSTALAAAVVAS------CSSNISALADLNKYEAET----RGEFGIGSAAQ 112
A L+N R+ +T A +V S C DL AE EF A+
Sbjct: 172 AALENARLQSATLRGATLVHSDLRSTFCRQTDFTECDLRNVTAERMYAPEAEF---DGAR 228
Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 167
A+LR+A +F + T AD+ +D+S + +GA ++A A +GAD
Sbjct: 229 LTEANLRQAEVTSASFDGVDASGIDVTEADLSATDWSDADLSGATFDQADLSGATLSGAD 288
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE----------GADFSDA 211
LS ++ L +A+L+ A L L+ + L GA++ GADF++A
Sbjct: 289 LSGATFNQATLKDADLSGADLTDVELSDTALTGALLRETRLAPETACGADFTEA 342
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL A+ N AN +AD+ SD A+LE A +A GADL
Sbjct: 65 TQADLTDADLSNALAPMVNLTGANLRNADLANSDLRQVTLTNAHLEGASFREARLWGADL 124
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+D + + L A+L + L L++ +L AD S A++
Sbjct: 125 ADADLTVVALAGADLQESTLRGARLSQCELDNTSFREADLSGAIL 169
>gi|119490887|ref|ZP_01623170.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119453705|gb|EAW34864.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 517
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 58/116 (50%), Gaps = 10/116 (8%)
Query: 108 GSAAQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAV 157
G++ ADLR+A VK N R+ N T AD+R+++ SG+ A L A
Sbjct: 157 GASTNLQRADLRRANLVKANLPKADFSHAEMRQTNLTYADLRQANLSGANLRWADLRGAN 216
Query: 158 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+ +GA+LS + L+ A L A LV LT+++L A GAD S A +
Sbjct: 217 LLGADLSGANLSGANLSGANLSRATLAKASLVHVDLTQANLIKADWMGADISGATL 272
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 51/98 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F A++R+ + R+AN + A++R +D G+ GA L A AN +GA+LS
Sbjct: 180 ADFSHAEMRQTNLTYADLRQANLSGANLRWADLRGANLLGADLSGANLSGANLSGANLSR 239
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
+ + L +LT A L++ +D+ GA + GA
Sbjct: 240 ATLAKASLVHVDLTQANLIKADWMGADISGATLTGAKL 277
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADLR+A + NF +AN + A++R + + A L +A KAN AD
Sbjct: 123 TKANLNGADLREARVGQANFSQANLSGANLRGVSGASTNLQRADLRRANLVKANLPKADF 182
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S M + L A+L A L L +DL GA + GAD S A
Sbjct: 183 SHAEMRQTNLTYADLRQANLSGANLRWADLRGANLLGADLSGA 225
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 56/104 (53%), Gaps = 5/104 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGAD 167
F +L +A + N +AN + A + ++ SG+ +G L +A +AN TGA+
Sbjct: 22 FTGINLNEANLSRINLSQANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGAN 81
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
LS ++ L A+L++A+LV T+ RS+L A + A+ + A
Sbjct: 82 LSRATLNVANLVRADLSDAILVETLAIRSELIRARLNNANLTKA 125
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
NFT ++ E++ S + A L A N +GA+LS + R LN + L+ A L
Sbjct: 21 NFTGINLNEANLSRINLSQANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGA 80
Query: 192 VLTRSDLGGAIIEGADFSDAVI 213
L+R+ L A + AD SDA++
Sbjct: 81 NLSRATLNVANLVRADLSDAIL 102
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N RAN + + +++ +G+ + A L A +A+ + A L +TL R L A L NA
Sbjct: 61 NLSRANLNVSRLSQANLTGANLSRATLNVANLVRADLSDAILVETLAIRSELIRARLNNA 120
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L + L +DL A + A+FS A
Sbjct: 121 NLTKANLNGADLREARVGQANFSQA 145
>gi|197106790|ref|YP_002132167.1| pentapeptide repeat-containing protein [Phenylobacterium zucineum
HLK1]
gi|196480210|gb|ACG79738.1| pentapeptide repeat family protein [Phenylobacterium zucineum HLK1]
Length = 412
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 85/208 (40%), Gaps = 47/208 (22%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A L A+ + N RA AD+RE+D G+ A L A AN +GA+L+
Sbjct: 65 ADLAGVKLEGAMLARANLSRAILFGADLREADLRGANMKRADLRGACLKGANLSGAELAG 124
Query: 171 --------TLMDRM------------------VLNEANLTNAVLVRTVLTRSD-----LG 199
L D++ VL+ ANL A + TV SD L
Sbjct: 125 CDLREGRIALQDKLDGFRILRHEHRPGELNYAVLSGANLAGAQMAGTVAMASDFTDANLT 184
Query: 200 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPS--SPL 257
GA++ GA + AV+D A L G +TGVS ++++ G + A S +
Sbjct: 185 GAVLAGARLTRAVLDGAD---LSGADLGGADLTGVSLKRAVLAGANLDQARLEDVDLSEV 241
Query: 258 LSAPP-----------QKLLDRDGFCDS 274
L APP + L D + +CDS
Sbjct: 242 LRAPPPIVYVDDRSLEEVLADHEAYCDS 269
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 191
+ D+R++D +G K GA L +A +A GADL + L AN+ A L
Sbjct: 56 SLEGVDLRDADLAGVKLEGAMLARANLSRAILFGADLREA-----DLRGANMKRADLRGA 110
Query: 192 VLTRSDLGGAIIEGADFSDAVIDLAQK 218
L ++L GA + G D + I L K
Sbjct: 111 CLKGANLSGAELAGCDLREGRIALQDK 137
>gi|126656707|ref|ZP_01727921.1| hypothetical protein CY0110_23751 [Cyanothece sp. CCY0110]
gi|126621927|gb|EAZ92635.1| hypothetical protein CY0110_23751 [Cyanothece sp. CCY0110]
Length = 257
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 66/124 (53%), Gaps = 6/124 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ A+L KA + N ++N A ++E++ + N + ++ A YKA TG+ L
Sbjct: 69 TEAQLKQANLTKANLFEANLSQSNLEEAILQEANLINTNLNKSIIKNANLYKALLTGSKL 128
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALC 222
S+ ++ L +A+L+ + T+L R++L A ++ +D A + AQ +Q+
Sbjct: 129 SNANLEGANLEQADLSTYEELPTLLNRTNLNKANLKQSDLEGAWLVKAQLIEANLQQSNL 188
Query: 223 KYAN 226
KYAN
Sbjct: 189 KYAN 192
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 12/124 (9%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
+L KA N N A+++ ++ ++ A L++A KAN A+LS + ++
Sbjct: 36 VNLEKANLQHANLHETNLNKANLKNANLQQTRLTEAQLKQANLTKANLFEANLSQSNLEE 95
Query: 176 MVLNEANLTN----------AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
+L EANL N A L + +LT S L A +EGA+ A DL+ + L
Sbjct: 96 AILQEANLINTNLNKSIIKNANLYKALLTGSKLSNANLEGANLEQA--DLSTYEELPTLL 153
Query: 226 NGTN 229
N TN
Sbjct: 154 NRTN 157
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 54/99 (54%), Gaps = 10/99 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A+L++ + ++AN T A++ E++ S S LE+A+ +AN +L
Sbjct: 56 ANLKNANLQQTRLTEAQLKQANLTKANLFEANLSQSN-----LEEAILQEANLINTNL-- 108
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++ ++ ANL A+L + L+ ++L GA +E AD S
Sbjct: 109 ---NKSIIKNANLYKALLTGSKLSNANLEGANLEQADLS 144
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTGADLSDTLMDRMVLNEANLTNAV 187
+ D+++ D G A L+ A + KAN A+L T + L +ANLT A
Sbjct: 23 LENVDLQQLDLEGVNLEKANLQHANLHETNLNKANLKNANLQQTRLTEAQLKQANLTKAN 82
Query: 188 LVRTVLTRSDLGGAIIEGADF 208
L L++S+L AI++ A+
Sbjct: 83 LFEANLSQSNLEEAILQEANL 103
>gi|418728079|ref|ZP_13286659.1| NifU-like N-terminal domain protein [Leptospira interrogans str. UI
12758]
gi|410777124|gb|EKR57092.1| NifU-like N-terminal domain protein [Leptospira interrogans str. UI
12758]
Length = 263
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
+ VK + R + +S + + F G F+GA L A ++F GA+ S + LN A
Sbjct: 141 LKVKGSLRDEDLSSIILEKLKFDGVDFSGANLGHAFLQNSSFVGANFSGAKLRGSFLNNA 200
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID----LAQKQ 219
+L N+ L + L GA +EGADF+DA+ D L QKQ
Sbjct: 201 DLRNSNFRGADLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|257061367|ref|YP_003139255.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591533|gb|ACV02420.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 371
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 10/136 (7%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 194
E+ FSG+ +GAYL A KA+F A L+ + L EANL A L+
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADFHRASLAVANLIGANLTEANLREANLI----- 332
Query: 195 RSDLGGAIIEGADFSD 210
++L GA ++ A F +
Sbjct: 333 DANLSGATVKDAKFGE 348
>gi|428316180|ref|YP_007114062.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428239860|gb|AFZ05646.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 298
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 9/120 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
LN YE R +F + A +L A+ + N RAN + A++ + + + GA L
Sbjct: 7 LNNYEKGER-DF---TGADLSGKNLSGAILIGVNLSRANLSGANLSRAFLTKATLKGALL 62
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
++ N + A + +T + L +ANL+ A LV+ L R L GA + GA+ AV+
Sbjct: 63 -----HRTNLSFAKMGETQLSGADLTKANLSGAFLVKAKLPRVKLSGATLTGANLRGAVL 117
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF+ AN A + + G++ G L +A N GADLS + L ANL +
Sbjct: 141 NFKWANLYGARLNSAKLFGAQLTGVSLRRAQLTGVNLCGADLSGVNVSEAKLMGANLEGS 200
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
L T + + L G + GA+ + A
Sbjct: 201 NLAGTNFSAAQLRGVKLAGANLTGA 225
>gi|428774386|ref|YP_007166174.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanobacterium stanieri PCC 7202]
gi|428688665|gb|AFZ48525.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanobacterium stanieri PCC 7202]
Length = 506
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 41/71 (57%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F AD +A V+ N +A+ A+++ +DF + GA LE A YKAN GA+L+D
Sbjct: 420 ANFYHADFSRARLVRANLTKAHLFKAELQYADFRNANLTGANLEGANLYKANLCGANLTD 479
Query: 171 TLMDRMVLNEA 181
+D + L EA
Sbjct: 480 ANIDDIQLQEA 490
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 51/119 (42%), Gaps = 10/119 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A + A +K NF AN A+ +DFS ++ A L KA +KA AD
Sbjct: 393 SNASLPRVNFHHAKFIKTNFEDANLVEANFYHADFSRARLVRANLTKAHLFKAELQYADF 452
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 227
+ ANLT A L L +++L GA + A+ D + A+ + +G
Sbjct: 453 RN----------ANLTGANLEGANLYKANLCGANLTDANIDDIQLQEAETNWATIFPDG 501
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 185
E F N ++A + +F +KF E A +ANF AD S + R L +A+L
Sbjct: 385 ECFNNFNLSNASLPRVNFHHAKFIKTNFEDANLVEANFYHADFSRARLVRANLTKAHLFK 444
Query: 186 AVLVRTVLTRSDLGGAIIEGADFSDA 211
A L ++L GA +EGA+ A
Sbjct: 445 AELQYADFRNANLTGANLEGANLYKA 470
>gi|326506328|dbj|BAJ86482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 181
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 58/121 (47%), Gaps = 6/121 (4%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
+G R ++F D + S + F GA L A + A+ TGADLSD
Sbjct: 61 YGQQVTRGQDLTGKDFSGQTLIKQDFKTSILRQTNFKGANLLGASFFDADLTGADLSDAD 120
Query: 173 M---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 226
+ D + N + NLTNA L ++T + G+ I GADF+D + Q+ LCK A+
Sbjct: 121 LRNADFSLANVTKVNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIAD 180
Query: 227 G 227
G
Sbjct: 181 G 181
>gi|428215789|ref|YP_007088933.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004170|gb|AFY85013.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 222
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 5/81 (6%)
Query: 138 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTV 192
+ ++ S + +GA L+ A AN +GA+LS+ M + L+EANLT NA L +
Sbjct: 65 LNGANLSNANLSGALLKDAKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENAL 124
Query: 193 LTRSDLGGAIIEGADFSDAVI 213
+++ DL GA + GAD DA+I
Sbjct: 125 MSKVDLTGADLTGADLIDAII 145
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 163
S A A L+ A N AN ++A+M E++ +G+ + A LE A+ K +
Sbjct: 71 SNANLSGALLKDAKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENALMSKVDL 130
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
TGADL+ + ++++ANL+NA + + L ++ L + + GADFS
Sbjct: 131 TGADLTGADLIDAIISDANLSNASVTQAQLKKAILSRSNLSGADFS 176
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 5/81 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANF 163
+ A +A+L A+ K + A+ T AD+ +++ S + A L+KA+ ++N
Sbjct: 111 TGANLSNAELENALMSKVDLTGADLTGADLIDAIISDANLSNASVTQAQLKKAILSRSNL 170
Query: 164 TGADLSDTLMDRMVLNEANLT 184
+GAD S + M L +ANLT
Sbjct: 171 SGADFSSSSMRDTKLADANLT 191
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 5/81 (6%)
Query: 133 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 192
+ D+ D GS NGA L A N +GA L D + L+ ANL+NA +
Sbjct: 50 LSGVDLSGKDLYGSALNGANLSNA-----NLSGALLKDAKLQTANLSGANLSNAEMSGIT 104
Query: 193 LTRSDLGGAIIEGADFSDAVI 213
L+ ++L GA + A+ +A++
Sbjct: 105 LSEANLTGANLSNAELENALM 125
>gi|374300595|ref|YP_005052234.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
gi|332553531|gb|EGJ50575.1| Protein of unknown function DUF2169 [Desulfovibrio africanus str.
Walvis Bay]
Length = 1248
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 29/93 (31%), Positives = 50/93 (53%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R + + + ++ G+ GA L KA+ +A+F+GA LS + VL + +L A
Sbjct: 949 DLRGIDLSGTQLGKTLMCGTNLAGANLSKAMGQEADFSGACLSGANLTGAVLQKTSLVEA 1008
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
+L L ++ L G+ + GAD SDA +D+ Q
Sbjct: 1009 ILSGACLKQAVLNGSDLSGADLSDATLDMVVIQ 1041
Score = 45.8 bits (107), Expect = 0.021, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
L K + N AN + A +E+DFSG+ +GA L AV K + A LS + + V
Sbjct: 960 LGKTLMCGTNLAGANLSKAMGQEADFSGACLSGANLTGAVLQKTSLVEAILSGACLKQAV 1019
Query: 178 LN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
LN A+L++A L V+ ++ L GA + A VI+
Sbjct: 1020 LNGSDLSGADLSDATLDMVVIQKAKLDGADVRRASLKMCVIE 1061
Score = 39.7 bits (91), Expect = 1.6, Method: Composition-based stats.
Identities = 41/140 (29%), Positives = 63/140 (45%), Gaps = 7/140 (5%)
Query: 84 CSSNISALADLNK---YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
C +N++ A+L+K EA+ G S A A L+K V+ A A +
Sbjct: 966 CGTNLAG-ANLSKAMGQEADFSG--ACLSGANLTGAVLQKTSLVEAILSGACLKQAVLNG 1022
Query: 141 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 200
SD SG+ + A L+ V KA GAD+ + +M + E A T+ L
Sbjct: 1023 SDLSGADLSDATLDMVVIQKAKLDGADVRRASL-KMCVIEGPAAGADFRGARFTQCVLKR 1081
Query: 201 AIIEGADFSDAVIDLAQKQA 220
+++GADFS A ++ QA
Sbjct: 1082 MLLDGADFSGAALNSTVLQA 1101
>gi|418019711|ref|ZP_12659144.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
gi|347604938|gb|EGY29471.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
Length = 381
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 1/104 (0%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S DL K + N +AN T A++RE D +G+ GA LE+A +A ADL
Sbjct: 76 SHTYLAGLDLSKMDLSRVNLEKANLTGANLREMDLTGANLTGANLERARLVRAILEWADL 135
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
++ + +L +A+L A+L L R+ + GA + D +D+V
Sbjct: 136 TNANLFEAILLDASLNGAILKNANLERTFVEGAHMSTVD-TDSV 178
>gi|354567192|ref|ZP_08986362.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353543493|gb|EHC12951.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 206
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 5/91 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
+ R NF AD+R ++ SG+ GA L +A + NF ADLS + L +ANL A
Sbjct: 39 DLSRINFKGADLRSANLSGAILTGANLREANLQQVNFCDADLS-----QADLTQANLCGA 93
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
L R L+ S L GA + AD +A + AQ
Sbjct: 94 CLWRVQLSDSQLWGASLCNADLREADLSAAQ 124
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 47/81 (58%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
A+ +AD+RE+D S ++ A L +A +AN T A L +++ N+ANLTNA L
Sbjct: 108 ASLCNADLREADLSAAQLIEASLVEANLVRANLTKAKLCGSVLIEANFNQANLTNADLKW 167
Query: 191 TVLTRSDLGGAIIEGADFSDA 211
T L ++ A +E A+F +A
Sbjct: 168 TNLMAANFSEANLENANFKNA 188
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 55/103 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR+A + NF A+ + AD+ +++ G+ L + + A+ ADL
Sbjct: 56 SGAILTGANLREANLQQVNFCDADLSQADLTQANLCGACLWRVQLSDSQLWGASLCNADL 115
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + L EA+L A LVR LT++ L G+++ A+F+ A
Sbjct: 116 READLSAAQLIEASLVEANLVRANLTKAKLCGSVLIEANFNQA 158
>gi|158335878|ref|YP_001517052.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158306119|gb|ABW27736.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 170
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 5/100 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL + ++ N R AN A++R + S A LE A NFT A L + ++
Sbjct: 45 ADLSGLILIRANLRNANLQGANLRNTSLLLSNLENANLENA-----NFTAAYLYGSNLEN 99
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
L + T AVL L +D+ A + GAD +DA +DL
Sbjct: 100 TQLTSTDFTQAVLRSAKLQGADVCTATLAGADLTDADVDL 139
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%)
Query: 142 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 201
D SG+ +G L +A AN GA+L +T + L ANL NA L S+L
Sbjct: 41 DLSGADLSGLILIRANLRNANLQGANLRNTSLLLSNLENANLENANFTAAYLYGSNLENT 100
Query: 202 IIEGADFSDAVIDLAQKQ 219
+ DF+ AV+ A+ Q
Sbjct: 101 QLTSTDFTQAVLRSAKLQ 118
>gi|119483470|ref|ZP_01618884.1| hypothetical protein L8106_05436 [Lyngbya sp. PCC 8106]
gi|119458237|gb|EAW39359.1| hypothetical protein L8106_05436 [Lyngbya sp. PCC 8106]
Length = 301
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 9/115 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
L +YE +GE + ADLR + + N +AN T A++ + G+ F GA L
Sbjct: 7 LRRYE---QGEIDF-TGIDLQGADLRGVILIGVNLSKANLTGANLSRAFLMGANFEGACL 62
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
++ N + ++ + + L +ANL+ A +VR+ L+++ + GAI+ GA+
Sbjct: 63 -----HRTNLSFVKMNKAHLAKADLTKANLSGAFVVRSKLSKAQMSGAILVGANL 112
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 49/107 (45%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A L + N R A ++ +++ G + A L +AN TGA+ S
Sbjct: 150 ARLQGAKLTGHLLTGVNLRGAYLNGVNLAQAELEGVNLSEAKLNGVNLTRANLTGANFSF 209
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
M LN ANLT A L L R++L A + AD SDA + AQ
Sbjct: 210 AQMRAARLNGANLTGANLEGVCLKRANLNFAQLSNADLSDADLTDAQ 256
>gi|358461868|ref|ZP_09172018.1| pentapeptide repeat protein [Frankia sp. CN3]
gi|357072553|gb|EHI82089.1| pentapeptide repeat protein [Frankia sp. CN3]
Length = 376
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 61/114 (53%), Gaps = 20/114 (17%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GAD 167
S DLR + R+A+F A + ++ +G++ +GA L A Y+A+ + GA+
Sbjct: 219 LASGDLRDV-----DLRQADFRDARLFYANLTGARLHGANLTNADLYQADLSFARLHGAN 273
Query: 168 LSDTLMDRM-----VLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDA 211
L+ ++R LNEANLTN AVL VL ++L GA + GA+ +DA
Sbjct: 274 LTSARLERADLSTAELNEANLTNGQLHEAVLYSAVLHGANLTGARLHGANLTDA 327
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 55/114 (48%), Gaps = 16/114 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +ADL +A AN TSA + +D S ++ N +AN T L +
Sbjct: 252 ANLTNADLYQADLSFARLHGANLTSARLERADLSTAELN----------EANLTNGQLHE 301
Query: 171 TLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVIDLAQKQ 219
++ VL+ ANLT A L LT R++L GA + G D S V++L Q+Q
Sbjct: 302 AVLYSAVLHGANLTGARLHGANLTDAQPYRANLTGAQLHGVDLSR-VVNLTQEQ 354
>gi|254413874|ref|ZP_05027643.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179471|gb|EDX74466.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 359
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 62/117 (52%), Gaps = 12/117 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYK 160
A A+LR AV + +AN A++ +++ G+ N GA+L +A Y
Sbjct: 92 ADLRQANLRGAVLSNADLTQANLEGANLTDANLEGTTLNYANLKMVDLRGAHLYQAYLYA 151
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
AN + A L + + L EANL A ++R L +++L GA ++GA+ S++ D++Q
Sbjct: 152 ANVSEAKLRGANLGKTDLREANLKQASIIRAYLGQANLQGADLDGANLSES--DMSQ 206
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 90/200 (45%), Gaps = 34/200 (17%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKE-----------NFRRANFTSADMRESDF 143
N EA+ RG A G DLR+A ++K+ N + A+ A++ ESD
Sbjct: 153 NVSEAKLRG-------ANLGKTDLREA-NLKQASIIRAYLGQANLQGADLDGANLSESDM 204
Query: 144 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
S +K N A L A+F+ +DLS + R + A+L A L L R++L GA +
Sbjct: 205 SQAKLNRAKLRNTQLRNADFSLSDLSQATLIRANASHAHLIRANLRGADLIRTNLTGADL 264
Query: 204 EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN-SRRNAYGS---------- 252
+GAD S A + LA L N TN I + LG N SR N +
Sbjct: 265 QGADLSLADLSLANLY-LANLGN-TNLIRANLSIAELGGANLSRANLNQADLRGANVENA 322
Query: 253 --PSSPLLSAPPQKLLDRDG 270
S+P LS +++L R G
Sbjct: 323 EFASNPGLSEEMKRVLKRRG 342
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 25/135 (18%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
VVA+ + N++ LA + G+ A F ++LR N +AN AD+
Sbjct: 16 VVAAETDNLAQLAAM----------VGLNLARDFAESNLRDTNLKGANLVKANLRGADLH 65
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
++ ++ GA L A +AN GADL +ANL A VL+ +DL
Sbjct: 66 GANLMKARLCGADLRGADLIQANLCGADLR----------QANLRGA-----VLSNADLT 110
Query: 200 GAIIEGADFSDAVID 214
A +EGA+ +DA ++
Sbjct: 111 QANLEGANLTDANLE 125
>gi|193214429|ref|YP_001995628.1| pentapeptide repeat-containing protein [Chloroherpeton thalassium
ATCC 35110]
gi|193087906|gb|ACF13181.1| pentapeptide repeat protein [Chloroherpeton thalassium ATCC 35110]
Length = 694
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 10/107 (9%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFT 164
+A ADLR A N + A+ +SA+++ +D S + GA L + AV + AN
Sbjct: 482 SANLQGADLRAA-----NLQGADLSSANLQGADLSSANLQGAVLWLANLQGAVLWLANLQ 536
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GADLSD + VL+ ANL A L L +DL A ++GAD A
Sbjct: 537 GADLSDAKLQGAVLSFANLQGADLRSAKLQGADLRSANLQGADLRSA 583
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 47/99 (47%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F ADLR A + AN AD+ ++ G+ A L+ A AN GADLS
Sbjct: 455 FQGADLRAANLQGADLISANLQGADLISANLQGADLRAANLQGADLSSANLQGADLSSAN 514
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ VL ANL AVL L +DL A ++GA S A
Sbjct: 515 LQGAVLWLANLQGAVLWLANLQGADLSDAKLQGAVLSFA 553
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 45/98 (45%), Gaps = 1/98 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADLR A + R AN AD+R ++ G+ A L+ A A GADL
Sbjct: 551 SFANLQGADLRSAKLQGADLRSANLQGADLRSANLQGAYLRSANLQGAYLRSAKLQGADL 610
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEG 205
S+ + L+ A L A L + ++D GA +G
Sbjct: 611 SEANLQGADLDSAKLQGAYLRNIEIDEKTDFNGATADG 648
>gi|443314200|ref|ZP_21043780.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786200|gb|ELR95960.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 185
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 73/146 (50%), Gaps = 28/146 (19%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S A ADL+ A N + A T AD+RE++ S + AYL+ +AN TGA
Sbjct: 43 ILSGADLKGADLKGA-----NLKVATLTGADLREANLSKANLMLAYLD-----EANLTGA 92
Query: 167 DLSDTLMD-----RMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVIDLA 216
+LS++ M+ + L+ ANL+NA + + L +D LGGAI+ A +
Sbjct: 93 NLSNSQMNGAQMPHVNLHGANLSNAEMTQVNLLEADLSDANLGGAIMLSVKLGTANL--- 149
Query: 217 QKQALCKYANGTNPITGVSTRKSLGC 242
K A K AN + GV+ ++L C
Sbjct: 150 -KGANLKGAN----LRGVNRSQALFC 170
>gi|428211194|ref|YP_007084338.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999575|gb|AFY80418.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 190
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 5/94 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L A N AN + AD+ +D + + GA L A + +FTGA+L R
Sbjct: 70 ANLSGANLTGANLTGANLSGADLSGADLTDADLGGADLSYATLHYTDFTGANLF-----R 124
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+L +A L +A LVR L ++L GAI+EGA FS
Sbjct: 125 AMLVDAKLNHAKLVRVRLRSANLNGAIVEGAIFS 158
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+ +FR + AD+ ++ SG F L A ++A G D + + L ANL+
Sbjct: 14 ERDFRDTDLFRADLSNAELSGVSFFRTSLFGANLFRAKLIGCDFFRSTLIGANLYCANLS 73
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDA 211
A L LT ++L GA + GAD +DA
Sbjct: 74 GANLTGANLTGANLSGADLSGADLTDA 100
>gi|334117594|ref|ZP_08491685.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333460703|gb|EGK89311.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 290
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 63/120 (52%), Gaps = 9/120 (7%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
L +YE R +F + A A+L A+ V NF RAN + A++ + + ++ N A L
Sbjct: 7 LKEYENGNR-DF---AGANLSGANLSGAILVGVNFSRANLSGANLSRAHLTKAELNDANL 62
Query: 154 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
Y+AN + A + + L +ANL+ A LV+ L R+ L GA + G++ + A++
Sbjct: 63 -----YRANLSFAKMGQARLADADLTKANLSGAFLVKAKLPRAKLSGAQLIGSNLAMAIL 117
>gi|428320925|ref|YP_007118807.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244605|gb|AFZ10391.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 214
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G ADL +A+ V+ N RA A++ ++D S KA +A GA+L
Sbjct: 68 SKADLGGADLTEALLVEANLNRAELMGANLSKADLS----------KASLIQATLIGANL 117
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S + + R L+ NL L R VLT DL GA + D S A
Sbjct: 118 SRSTLSRADLHGVNLYGVNLRRAVLTECDLIGANLSKVDLSGA 160
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 59/117 (50%), Gaps = 2/117 (1%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
YEAE G + A G A L KA + NF +AN ++ ++D G+ A L +A
Sbjct: 28 YEAELIGA-NLYEADLIG-AHLSKAKLNRVNFGKANLCKINLSKADLGGADLTEALLVEA 85
Query: 157 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+A GA+LS + + L +A L A L R+ L+R+DL G + G + AV+
Sbjct: 86 NLNRAELMGANLSKADLSKASLIQATLIGANLSRSTLSRADLHGVNLYGVNLRRAVL 142
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL KA ++ AN + + + +D G G L +AV + + GA+LS
Sbjct: 95 ANLSKADLSKASLIQATLIGANLSRSTLSRADLHGVNLYGVNLRRAVLTECDLIGANLSK 154
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L A+L A L +L+ SDL GA + GA+ +
Sbjct: 155 VDLSGADLMGASLIRADLTEAILSASDLSGANLLGANLT 193
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 5/74 (6%)
Query: 140 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 199
E +FSG + YL +A AN ADL + + LN N A L + L+++DLG
Sbjct: 14 ERNFSGVYLHEVYLYEAELIGANLYEADLIGAHLSKAKLNRVNFGKANLCKINLSKADLG 73
Query: 200 GAIIEGADFSDAVI 213
GAD ++A++
Sbjct: 74 -----GADLTEALL 82
>gi|209522801|ref|ZP_03271359.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|209496850|gb|EDZ97147.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
Length = 274
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ A+L A NF RAN T A MR S N A L A +ANFT A+
Sbjct: 70 AQLADANLISANLTDANFSRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFIG 129
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L N+ L+RT L +++L GA ++GA+ ++ ++
Sbjct: 130 ----------AHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 91 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
LAD N A T F S A A +R ++ AN T A++ E++F+ + F
Sbjct: 72 LADANLISANLTDANF---SRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFI 128
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIE 204
GA+L + + N A+LS +D ANLTN ++ + L + + L GA++
Sbjct: 129 GAHLVNSTLIRTNLLKANLSGANLD-----GANLTNVIMRDSTLEGANLSNATLSGAMLM 183
Query: 205 GADFSDA 211
GA+F A
Sbjct: 184 GANFHRA 190
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADL + V + AN + A++R ++ S + GA + +A Y+ ++LS
Sbjct: 185 ANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSRARLYRTKLNWSNLSG 244
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAII 203
NL AV++ TVL R+ DL GAI+
Sbjct: 245 V----------NLIEAVMLDTVLYRANLRDADLRGAIL 272
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A RG I A+L A + NF ANF A + S + L KA
Sbjct: 95 ASMRGS--ISKNVTLNMANLTDANLAEANFTEANFIGAHLVNSTLIRTN-----LLKANL 147
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVI 213
AN GA+L++ +M L ANL+NA L +L R+DL + GAD +DA +
Sbjct: 148 SGANLDGANLTNVIMRDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANL 207
Query: 214 DLAQKQA 220
A +A
Sbjct: 208 SEANLRA 214
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A L A+ + NF RA+ + M +D + + + A L A + GA++S
Sbjct: 170 ANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSR 229
Query: 171 TLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSD 210
+ R LN +NL+ AV++ TVL R++L A + GA D
Sbjct: 230 ARLYRTKLNWSNLSGVNLIEAVMLDTVLYRANLRDADLRGAILPD 274
Score = 37.0 bits (84), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 83 SCSSNISALADLNKYEAE-TRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADM 138
+ + N++ L D N EA T F IG+ + +L KA N AN T+ M
Sbjct: 104 NVTLNMANLTDANLAEANFTEANF-IGAHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 188
R+S G+ + A L A+ ANF ADLS M L +ANL+ A L
Sbjct: 163 RDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANL 212
>gi|157964675|ref|YP_001499499.1| hypothetical protein RMA_0846 [Rickettsia massiliae MTU5]
gi|157844451|gb|ABV84952.1| hypothetical protein RMA_0846 [Rickettsia massiliae MTU5]
Length = 964
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A L+KA A G ++SD
Sbjct: 560 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLKKAEA-----EGLNISDA 614
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A++ A KQA K A
Sbjct: 615 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIVKEANLKQANLKAA 674
Query: 226 N 226
N
Sbjct: 675 N 675
Score = 42.4 bits (98), Expect = 0.24, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 589 ATAQF--AKLSNATLKKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 646
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA + A L + L ++L G EGADF A I+ A K
Sbjct: 647 ENADMQAVEAAEAIVKEANLKQANLKAANLAGINKEGADFDKAEINNATK 696
Score = 38.1 bits (87), Expect = 3.8, Method: Composition-based stats.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 16/161 (9%)
Query: 51 PDCSNNQCAGPYA---KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGI 107
PD S +G LKN +F S L +++C+ + + N A +
Sbjct: 347 PDLSATNLSGKILTNLNLKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNV--T 403
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK------- 160
A F ADL+K+ + RA D+ E + + SKFN + A A K
Sbjct: 404 ARNAGFLFADLQKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSE 463
Query: 161 ---ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+N TG L+ M R+ + L NA+L + + +DL
Sbjct: 464 WKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDL 504
>gi|443319118|ref|ZP_21048355.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442781316|gb|ELR91419.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 331
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 65/152 (42%), Gaps = 20/152 (13%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S ++ G L + A D+R D SG + A L A N TGA+L
Sbjct: 169 SGSRMGGVALDRTQLADVTLEGAYLNGVDLRGMDLSGVNLSQARLNGAKLDLVNLTGANL 228
Query: 169 SDTLMDRMVLNEANLTNAVLVRTV----------LTRSD-----LGGAIIE-----GADF 208
S + R L +ANLT +L V LTR+D L GA+++ GA+F
Sbjct: 229 SQATLRRASLQQANLTGTILTGAVLWHADMQGVNLTRADLSQANLAGALLQATSITGAEF 288
Query: 209 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
+DA++ + L A G + TR++L
Sbjct: 289 TDAILPEESRNGLYALATGETLWSHRLTRETL 320
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 1/91 (1%)
Query: 122 VHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 180
+H+ N + + D+ E+ G+ A+L KA Y+AN A+LS T + + L +
Sbjct: 41 IHLNSVNLSQRILVAVDLAEASLVGADLARAFLTKANLYRANLHRANLSFTKLSDVNLRQ 100
Query: 181 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++L+ A L T + ++ L GA + GA+ A
Sbjct: 101 SDLSKADLRSTFMVKAHLEGANLSGANLGQA 131
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 48/109 (44%), Gaps = 10/109 (9%)
Query: 116 ADLRKAVHVKE----------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
ADLR VK N +AN A++ ++ G+ GA L A +AN +
Sbjct: 106 ADLRSTFMVKAHLEGANLSGANLGQANLRGANLEGANLCGANLQGANLRGANLSQANLSW 165
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A+LS + M + L+ L + L L DL G + G + S A ++
Sbjct: 166 ANLSGSRMGGVALDRTQLADVTLEGAYLNGVDLRGMDLSGVNLSQARLN 214
>gi|24213719|ref|NP_711200.1| hypothetical protein LA_1019 [Leptospira interrogans serovar Lai
str. 56601]
gi|386073300|ref|YP_005987617.1| hypothetical protein LIF_A0826 [Leptospira interrogans serovar Lai
str. IPAV]
gi|418709761|ref|ZP_13270547.1| NifU-like N-terminal domain protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|24194537|gb|AAN48218.1| hypothetical protein LA_1019 [Leptospira interrogans serovar Lai
str. 56601]
gi|353457089|gb|AER01634.1| hypothetical protein LIF_A0826 [Leptospira interrogans serovar Lai
str. IPAV]
gi|410769996|gb|EKR45223.1| NifU-like N-terminal domain protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|456968595|gb|EMG09774.1| NifU-like N-terminal domain protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 263
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 181
+ VK + R + +S + + F G F+GA L A ++F GA+ S + LN A
Sbjct: 141 LKVKGSLRDEDLSSIILEKLKFDGVDFSGANLGHAFLQNSSFVGANFSGAKLRGSFLNNA 200
Query: 182 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID----LAQKQ 219
+L N+ L + L GA +EGADF+DA+ D L QKQ
Sbjct: 201 DLRNSNFRGADLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|386001277|ref|YP_005919576.1| Pentapeptide repeat protein [Methanosaeta harundinacea 6Ac]
gi|357209333|gb|AET63953.1| Pentapeptide repeat protein [Methanosaeta harundinacea 6Ac]
Length = 385
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A++R + + F RA F D+ SD S S F+ AYL +AN + A+L+
Sbjct: 178 AHMNWAEMRGSYLNRGQFSRAEFYGTDLSGSDLSDSDFSRAYL-----MRANLSDANLNW 232
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L L EA L+ + L T L+ +DL GA + GAD +DA
Sbjct: 233 ALFAYADLTEAKLSRSTLRGTKLSYADLTGADLSGADLTDA 273
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S + +D +A ++ N AN F AD+ E+ S S G L A A+
Sbjct: 206 SGSDLSDSDFSRAYLMRANLSDANLNWALFAYADLTEAKLSRSTLRGTKLSYADLTGADL 265
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
+GADL+D + + L ++NL+N + R L DL G + GA DA ID
Sbjct: 266 SGADLTDADLTAIRLIKSNLSNTKMGRAYLQGLDLRGVDLSGAYLRDATID 316
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 53/104 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L+KA + A+ + AD+ E D S +K GA + A A T ADL+ T +
Sbjct: 83 ANLKKANLAGADLSGADLSEADLSEVDLSEAKLWGAKISGASLVDATLTKADLTRTDITD 142
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 219
L A +TNA L R LT + + G + G +F A ++ A+ +
Sbjct: 143 ADLTGAEMTNARLFRADLTGATMTGVYLIGGNFVGAHMNWAEMR 186
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 50/99 (50%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADL A + A+ T+ + +S+ S +K AYL+ + +GA L D +DR
Sbjct: 258 ADLTGADLSGADLTDADLTAIRLIKSNLSNTKMGRAYLQGLDLRGVDLSGAYLRDATIDR 317
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
L +ANLT A L L+ ++ GA + GA+ A +D
Sbjct: 318 TYLTDANLTGADLRGATLSSVEMTGADLAGANLIRAKVD 356
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 58/112 (51%), Gaps = 10/112 (8%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
+A DL A + + A+ + AD+R++ + GA L+KA A+ +GADLS
Sbjct: 42 SADLSGRDLVGAHLNQSDLSGADLSGADLRDAYLRSTWLLGANLKKANLAGADLSGADLS 101
Query: 170 DTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + + L+E A+L +A L + LTR+D+ A + GA+ ++A
Sbjct: 102 EADLSEVDLSEAKLWGAKISGASLVDATLTKADLTRTDITDADLTGAEMTNA 153
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 137 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 196
D+R +D SG GA+L ++ A+ +GADL D + L ANL A
Sbjct: 39 DLRSADLSGRDLVGAHLNQSDLSGADLSGADLRDAYLRSTWLLGANLKKA---------- 88
Query: 197 DLGGAIIEGADFSDA---VIDLAQKQALCKYANGTNPITGVSTRKSL 240
+L GA + GAD S+A +DL++ + +G + + T+ L
Sbjct: 89 NLAGADLSGADLSEADLSEVDLSEAKLWGAKISGASLVDATLTKADL 135
>gi|383481718|ref|YP_005390633.1| hypothetical protein MCC_05165 [Rickettsia rhipicephali str.
3-7-female6-CWPP]
gi|378934057|gb|AFC72560.1| hypothetical protein MCC_05165 [Rickettsia rhipicephali str.
3-7-female6-CWPP]
Length = 957
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A L+KA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLKKAEA-----EGLNISDA 607
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A++ A KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIVKEANLKQANLKAA 667
Query: 226 N 226
N
Sbjct: 668 N 668
Score = 42.4 bits (98), Expect = 0.24, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLKKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA + A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIVKEANLKQANLKAANLAGINKEGADFDKAEINNATK 689
Score = 38.5 bits (88), Expect = 3.5, Method: Composition-based stats.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 16/161 (9%)
Query: 51 PDCSNNQCAGPYA---KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGI 107
PD S +G LKN +F S L +++C+ + + N A +
Sbjct: 340 PDLSATNLSGKILTNLNLKN-TLFASANLENIKISNCNLDFTNFEGANLQNAIFQNV--T 396
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK------- 160
A F ADL+K+ + RA D+ E + + SKFN + A A K
Sbjct: 397 ARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSE 456
Query: 161 ---ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+N TG L+ M R+ + L NA+L + + +DL
Sbjct: 457 WKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDL 497
>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 443
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 15/124 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVA 158
++ +F ADLR+A V N +F++A D+ +D SG+ +GAY A
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378
Query: 159 YKANFTGADLS-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
AN GADLS D + L A+L+ A L+ ++L GA + GAD +D I
Sbjct: 379 SDANLQGADLSGAYFYDADLSGANLQGADLSGAYFYDADLSGANLQGANLNGADLTDTYI 438
Query: 214 DLAQ 217
D A+
Sbjct: 439 DRAK 442
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 45/94 (47%), Gaps = 5/94 (5%)
Query: 128 FRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 182
R AN AD+R +SDF+ + GA L A AN GADLS+ + LN
Sbjct: 168 LRGANLARADLRGTKLNQSDFTNANLAGADLRDADLTNANLAGADLSNADLTNANLNSVQ 227
Query: 183 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 216
L A L+ L +DL A + GA DA I+ A
Sbjct: 228 LVKAQLINARLVDTDLRKANLNGAYLIDANINRA 261
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 168
A ADLR + +F AN AD+R++D + + GA L A AN L
Sbjct: 171 ANLARADLRGTKLNQSDFTNANLAGADLRDADLTNANLAGADLSNADLTNANLNSVQLVK 230
Query: 169 SDTLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + R+V L +ANL A L+ + R++L G + AD + A
Sbjct: 231 AQLINARLVDTDLRKANLNGAYLIDANINRANLSGTNLSNADLTSA 276
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 56/122 (45%), Gaps = 12/122 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-RRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN----- 162
S +ADL A ++E F NF A++ DFSG NG L A AN
Sbjct: 264 SGTNLSNADLTSA-KLRETFPSNTNFCGANLSGIDFSGFILNGINLRWAKLIGANLTSTK 322
Query: 163 FTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
F GADL + +D + + ANL+ L L+ +DL GA + GA F DA + A
Sbjct: 323 FIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADLSDAN 382
Query: 218 KQ 219
Q
Sbjct: 383 LQ 384
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 10/109 (9%)
Query: 111 AQFGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A+ DLRKA + N RAN + ++ +D + +K L + NF G
Sbjct: 236 ARLVDTDLRKANLNGAYLIDANINRANLSGTNLSNADLTSAK-----LRETFPSNTNFCG 290
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
A+LS +LN NL A L+ LT + GA + A+F A +D
Sbjct: 291 ANLSGIDFSGFILNGINLRWAKLIGANLTSTKFIGADLREANFVGANLD 339
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 20/119 (16%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----------K 160
A ADLR A + AN AD+ +D + + N L KA K
Sbjct: 191 ANLAGADLRDA-----DLTNANLAGADLSNADLTNANLNSVQLVKAQLINARLVDTDLRK 245
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVID 214
AN GA L D ++R L+ NL+NA L T + ++ GA + G DFS +++
Sbjct: 246 ANLNGAYLIDANINRANLSGTNLSNADLTSAKLRETFPSNTNFCGANLSGIDFSGFILN 304
>gi|379713712|ref|YP_005302050.1| hypothetical protein RMB_03905 [Rickettsia massiliae str. AZT80]
gi|376334358|gb|AFB31590.1| hypothetical protein RMB_03905 [Rickettsia massiliae str. AZT80]
Length = 957
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+ +ADL KA K N A+ T+A + + +K + A L+KA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLKKAEA-----EGLNISDA 607
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 225
+ + EAN NA++ R LT+++ A++E AD ++A++ A KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIVKEANLKQANLKAA 667
Query: 226 N 226
N
Sbjct: 668 N 668
Score = 42.4 bits (98), Expect = 0.24, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLKKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
+ M + EA + A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIVKEANLKQANLKAANLAGINKEGADFDKAEINNATK 689
Score = 38.1 bits (87), Expect = 4.0, Method: Composition-based stats.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 16/161 (9%)
Query: 51 PDCSNNQCAGPYA---KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGI 107
PD S +G LKN +F S L +++C+ + + N A +
Sbjct: 340 PDLSATNLSGKILTNLNLKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNV--T 396
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK------- 160
A F ADL+K+ + RA D+ E + + SKFN + A A K
Sbjct: 397 ARNAGFLFADLQKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSE 456
Query: 161 ---ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
+N TG L+ M R+ + L NA+L + + +DL
Sbjct: 457 WKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDL 497
>gi|409989952|ref|ZP_11273410.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291567017|dbj|BAI89289.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409939186|gb|EKN80392.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 274
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ A+L A NF RAN T A MR S N A L A +ANFT A+
Sbjct: 70 AQLADANLISANLTDANFSRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFIG 129
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L N+ L+RT L +++L GA ++GA+ ++ ++
Sbjct: 130 ----------AHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 91 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
LAD N A T F S A A +R ++ AN T A++ E++F+ + F
Sbjct: 72 LADANLISANLTDANF---SRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFI 128
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIE 204
GA+L + + N A+LS +D ANLTN ++ + L + + L GA++
Sbjct: 129 GAHLVNSTLIRTNLLKANLSGANLD-----GANLTNVIMRDSTLEGANLSNATLSGAMLM 183
Query: 205 GADFSDA 211
GA+F A
Sbjct: 184 GANFHQA 190
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADL + V + AN + A++R ++ S + GA + +A Y+ ++LS
Sbjct: 185 ANFHQADLSRVTMVGADLTDANLSEANLRAANISWTSLRGANMSRARLYRTKLNWSNLSG 244
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAII 203
NL AV++ TVL R+ DL GAI+
Sbjct: 245 V----------NLIEAVMLDTVLYRANLRDADLRGAIL 272
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A RG I A+L A + NF ANF A + S + L KA
Sbjct: 95 ASMRGS--ISKNVTLNMANLTDANLAEANFTEANFIGAHLVNSTLIRTN-----LLKANL 147
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVI 213
AN GA+L++ +M L ANL+NA L +L ++DL + GAD +DA +
Sbjct: 148 SGANLDGANLTNVIMRDSTLEGANLSNATLSGAMLMGANFHQADLSRVTMVGADLTDANL 207
Query: 214 DLAQKQA 220
A +A
Sbjct: 208 SEANLRA 214
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 63/133 (47%), Gaps = 10/133 (7%)
Query: 83 SCSSNISALADLNKYEAE-TRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADM 138
+ + N++ L D N EA T F IG+ + +L KA N AN T+ M
Sbjct: 104 NVTLNMANLTDANLAEANFTEANF-IGAHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Query: 139 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 198
R+S G+ + A L A+ ANF ADLS R+ + A+LT+A L L +++
Sbjct: 163 RDSTLEGANLSNATLSGAMLMGANFHQADLS-----RVTMVGADLTDANLSEANLRAANI 217
Query: 199 GGAIIEGADFSDA 211
+ GA+ S A
Sbjct: 218 SWTSLRGANMSRA 230
>gi|307154067|ref|YP_003889451.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984295|gb|ADN16176.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 334
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 60/106 (56%), Gaps = 5/106 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A+L++A+ + + + AN + A++ + +G+ + A L KA + +GA+L+
Sbjct: 114 ANLSNANLKQAILINADLKSANLSGANLMGVNLTGANLSRADLSKANLSNIDLSGANLNR 173
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDA 211
+ R LN A+L+ A L + L+RS DL GAI++GA+ A
Sbjct: 174 VDLSRANLNGADLSGANLYKADLSRSNLRNGDLEGAILQGANLHKA 219
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 9/101 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A ADL A N +A+ + +++R D G+ GA L KA AN +GA L
Sbjct: 177 SRANLNGADLSGA-----NLYKADLSRSNLRNGDLEGAILQGANLHKANLKGANLSGAQL 231
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
++ ++ + L+E +L L +R DL A + GA+ S
Sbjct: 232 KESNLNLVNLSEFSLHAGRLS----SRIDLSSANLAGANLS 268
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 47/98 (47%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ A+L + N N T A++ +D +K A L + A+ + ADLS
Sbjct: 44 AELMEANLSRTALDWSNLSGTNLTRANLNRADLISAKLISATLIQTDLTGADLSNADLSW 103
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
++ L ANL+NA L + +L +DL A + GA+
Sbjct: 104 VNLEGAKLTYANLSNANLKQAILINADLKSANLSGANL 141
>gi|84687546|ref|ZP_01015422.1| hypothetical protein 1099457000249_RB2654_04949 [Maritimibacter
alkaliphilus HTCC2654]
gi|84664455|gb|EAQ10943.1| hypothetical protein RB2654_04949 [Rhodobacterales bacterium
HTCC2654]
Length = 158
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 18/129 (13%)
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT- 184
E+F A AD+ D +GS F+GA+L +AN TG + D +L +A +T
Sbjct: 18 EDFSSAMLKGADLSGQDLAGSDFSGAFLG-----EANLTGTTVDGATFDGALLTDATMTG 72
Query: 185 ----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
A+ VL +DL G + GADF+ A+++ A +TG S R +L
Sbjct: 73 CSAKGAIFTGAVLKDADLSGCALAGADFTGALLEGASLAG--------ADLTGASLRSTL 124
Query: 241 GCGNSRRNA 249
G A
Sbjct: 125 MAGADLTGA 133
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 47/103 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A G A+L F A T A M G+ F GA L+ A GAD
Sbjct: 41 SGAFLGEANLTGTTVDGATFDGALLTDATMTGCSAKGAIFTGAVLKDADLSGCALAGADF 100
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L++ L A+LT A L T++ +DL GA + DF++A
Sbjct: 101 TGALLEGASLAGADLTGASLRSTLMAGADLTGATVTEVDFTEA 143
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 52/105 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+A F SA L+ A ++ ++F+ A + E++ +G+ +GA + A+ A TG
Sbjct: 16 TAEDFSSAMLKGADLSGQDLAGSDFSGAFLGEANLTGTTVDGATFDGALLTDATMTGCSA 75
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ VL +A+L+ L T + L GA + GAD + A +
Sbjct: 76 KGAIFTGAVLKDADLSGCALAGADFTGALLEGASLAGADLTGASL 120
>gi|328541950|ref|YP_004302059.1| Pentapeptide repeat protein [Polymorphum gilvum SL003B-26A1]
gi|326411700|gb|ADZ68763.1| Pentapeptide repeat protein [Polymorphum gilvum SL003B-26A1]
Length = 276
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 10/107 (9%)
Query: 109 SAAQFGSADLRKAVHVKEN-----FRRANFTSADMRESDFSGSKFNGAY-----LEKAVA 158
+ A+F ADLR A ++ FR A AD+ +D SG+ F+GA L++ A
Sbjct: 140 AGARFAGADLRNASLIRAAATGAAFRNAKMDGADLERADLSGADFSGARLPYSDLDRVRA 199
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
A+F GADL+ + L+ A+L+ A + T+L + L GA + G
Sbjct: 200 AGASFRGADLTGVRLSSADLSGADLSGADMTDTLLRNTRLPGADLTG 246
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKA-----VAYKA 161
+F ADLR+A + + RR++F+ A DM ++D SG+ +GA L A A
Sbjct: 83 KFAKADLRRAELERADLRRSDFSGASMRAVDMEKADLSGAVLDGADLRDADLNGTSLAGA 142
Query: 162 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 206
F GADL + + R A NA + L R+DL GA GA
Sbjct: 143 RFAGADLRNASLIRAAATGAAFRNAKMDGADLERADLSGADFSGA 187
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 51/106 (48%), Gaps = 10/106 (9%)
Query: 116 ADLRKAV-----HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AD+++ V K + RRA AD+R SDFSG+ +EKA A GADL D
Sbjct: 72 ADMKEVVLPDGKFAKADLRRAELERADLRRSDFSGASMRAVDMEKADLSGAVLDGADLRD 131
Query: 171 TLMDRMVL-----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ L A+L NA L+R T + A ++GAD A
Sbjct: 132 ADLNGTSLAGARFAGADLRNASLIRAAATGAAFRNAKMDGADLERA 177
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 10/115 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESD----------FSGSKFNGAYLEKAVA 158
S A ADLR A + A F AD+R + F +K +GA LE+A
Sbjct: 120 SGAVLDGADLRDADLNGTSLAGARFAGADLRNASLIRAAATGAAFRNAKMDGADLERADL 179
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+F+GA L + +DR+ A+ A L L+ +DL GA + GAD +D ++
Sbjct: 180 SGADFSGARLPYSDLDRVRAAGASFRGADLTGVRLSSADLSGADLSGADMTDTLL 234
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR + A+ T A +RE+D +++ V F ADL ++R
Sbjct: 48 DLRLVTLRGVEIQGADLTRAILREAD----------MKEVVLPDGKFAKADLRRAELERA 97
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L ++ + A + + ++DL GA+++GAD DA
Sbjct: 98 DLRRSDFSGASMRAVDMEKADLSGAVLDGADLRDA 132
>gi|428311473|ref|YP_007122450.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253085|gb|AFZ19044.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 580
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 53/95 (55%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F A+LR+A + N + A ++ E++ + GA L +A ++A TGAD+S
Sbjct: 154 ATNFTGANLREANLEQANLQEATLVGVNLTEANLNNVYLRGANLRQADLHRAILTGADMS 213
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 204
+ + L+ ANLT A L+R L ++DL A+++
Sbjct: 214 EANCEGADLSRANLTGAYLLRASLRKADLLRAVLQ 248
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 45/79 (56%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 190
AN T A++R+ + +G+ +GA L ANFTGA+L + +L+ AN T ++L R
Sbjct: 25 ANLTGANLRKINLTGANLSGANLSWCCFSHANFTGANLHQANLHSAILDNANFTQSILSR 84
Query: 191 TVLTRSDLGGAIIEGADFS 209
L++ DL A + AD +
Sbjct: 85 AKLSKVDLRLANLREADLN 103
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 51/96 (53%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L A+ + AN ++ ++F+G+ A LE+A +A G +L++ ++
Sbjct: 130 ANLNHALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATLVGVNLTEANLNN 189
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L ANL A L R +LT +D+ A EGAD S A
Sbjct: 190 VYLRGANLRQADLHRAILTGADMSEANCEGADLSRA 225
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 52/108 (48%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
I A F + L +A K + R AN AD+ +D S S +GA L+ + N
Sbjct: 70 AILDNANFTQSILSRAKLSKVDLRLANLREADLNWADLSASNLSGADLQNTQLDQINLEH 129
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L+ L+ L EANL L+ T T ++L A +E A+ +A +
Sbjct: 130 ANLNHALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATL 177
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A ++ + R+A+ A ++E + + A L A KA+ +GA L D
Sbjct: 220 ADLSRANLTGAYLLRASLRKADLLRAVLQEVYLLRTDLSEANLRGADLRKADLSGAYLKD 279
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 225
TL+ L+ A L + L+RT L R++L G I + +DL+ + C+Y
Sbjct: 280 TLLSEANLSGAYLLESYLIRTKLDRAELTGCCIHQWHLEE--VDLSYVE--CRYV 330
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 5/84 (5%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 187
R AN AD+ + +G+ + A E A +AN TGA L R L +A+L AV
Sbjct: 192 LRGANLRQADLHRAILTGADMSEANCEGADLSRANLTGAYLL-----RASLRKADLLRAV 246
Query: 188 LVRTVLTRSDLGGAIIEGADFSDA 211
L L R+DL A + GAD A
Sbjct: 247 LQEVYLLRTDLSEANLRGADLRKA 270
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 49/106 (46%), Gaps = 20/106 (18%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTG 165
A ADL +A+ T ADM E +D S + GAYL +A KA+
Sbjct: 195 ANLRQADLHRAI----------LTGADMSEANCEGADLSRANLTGAYLLRASLRKADLLR 244
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
A L + + R L+EANL A L ++DL GA ++ S+A
Sbjct: 245 AVLQEVYLLRTDLSEANLRGA-----DLRKADLSGAYLKDTLLSEA 285
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 15/112 (13%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYKAN 162
F DL A N R+ N T A++ ++F+G+ + A L A+ AN
Sbjct: 17 FAHIDLSGANLTGANLRKINLTGANLSGANLSWCCFSHANFTGANLHQANLHSAILDNAN 76
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
FT + LS + ++ L ANL A L +DL + + GAD + +D
Sbjct: 77 FTQSILSRAKLSKVDLRLANLREA-----DLNWADLSASNLSGADLQNTQLD 123
>gi|332704952|ref|ZP_08425038.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
gi|332356304|gb|EGJ35758.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
Length = 544
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 10/111 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE----------KAVAYK 160
A ADL N A+ + AD+ +D SG+ FN A L +A +
Sbjct: 236 ANLSDADLSDTKLSGANLCDADLSGADLSGADLSGADFNDANLSGADLSSANLIRANLIR 295
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
AN +GA+LSD + L ANL+NA L R++L GA + GAD S+A
Sbjct: 296 ANLSGANLSDVKVIGGNLGNANLSNANFSSAKLIRANLSGADLSGADLSNA 346
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-------- 164
G+A+L A RAN + AD+ +D S + F+GA L A AN +
Sbjct: 313 LGNANLSNANFSSAKLIRANLSGADLSGADLSNANFSGASLYSANLSNANLSSANLRGTE 372
Query: 165 -------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
GADL T + L+ ANL+NA L+ + L ++L GA + GA+ A +
Sbjct: 373 LSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASL 428
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYK 160
A ADL A ++ N RAN + A++ + ++ S + F+ A L +A
Sbjct: 276 ANLSGADLSSANLIRANLIRANLSGANLSDVKVIGGNLGNANLSNANFSSAKLIRANLSG 335
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
A+ +GADLS+ L ANL+NA L L ++L GA + GAD
Sbjct: 336 ADLSGADLSNANFSGASLYSANLSNANLSSANLRGTELSGANLSGADL 383
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +A+ A N AN +SA++R ++ SG+ +GA L AN +GA+L
Sbjct: 339 SGADLSNANFSGASLYSANLSNANLSSANLRGTELSGANLSGADLRGTKLSGANLSGANL 398
Query: 169 SDT-LMDRMV----LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
S+ L+D + L+ ANL+ A L L ++L GA + GA
Sbjct: 399 SNAKLIDSNLRGTELSGANLSGANLRGASLYSANLSGANLRGASL 443
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR N AN ++A ++R ++ SG+ +GA L A Y AN
Sbjct: 374 SGANLSGADLRGTKLSGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASLYSANL 433
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+GA+L L ANL+ A L L+ ++L + G DFS A
Sbjct: 434 SGANLRGA-----SLYSANLSGANLSGANLSLANLCPMRVSGTDFSAA 476
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 54/125 (43%), Gaps = 6/125 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+LR A N AN A + ++ SG+ +GA L A +G D
Sbjct: 414 SGANLSGANLRGASLYSANLSGANLRGASLYSANLSGANLSGANLSLANLCPMRVSGTDF 473
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANG 227
S L+ ANL A L R L +DL A + GAD S A ++ A K A Y G
Sbjct: 474 S-----AANLSGANLGGAYLYRADLKDTDLSSANLTGADLSSANLNGADVKNARFGYIVG 528
Query: 228 TNPIT 232
+ T
Sbjct: 529 IDEST 533
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN- 162
S A SA+L A N R AN + AD+R + SG+ +GA L A +N
Sbjct: 349 SGASLYSANLSNANLSSANLRGTELSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNL 408
Query: 163 ----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+GA+LS + L ANL+ A L L ++L GA + GA+ S
Sbjct: 409 RGTELSGANLSGANLRGASLYSANLSGANLRGASLYSANLSGANLSGANLS 459
>gi|441166522|ref|ZP_20968750.1| pentapeptide repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440615904|gb|ELQ79069.1| pentapeptide repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 388
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 53/104 (50%), Gaps = 10/104 (9%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F ADLR NF R + AD+RE+D G+ A L A +AN GA L
Sbjct: 208 FRRADLRGM-----NFERVDLGGADLREADLRGASLRDADLSGAGLREANLRGAGLVRAR 262
Query: 173 MDRMVLNEANLTNAVL----VR-TVLTRSDLGGAIIEGADFSDA 211
+ + L EANL +A+L +R T L +DL A +EGAD + A
Sbjct: 263 LAKADLQEANLRDALLWFADLRDTNLQAADLTEADLEGADLTRA 306
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 5/84 (5%)
Query: 130 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 189
R + D R +D G F L A +A+ GA L D + L EANL A LV
Sbjct: 200 RVDLRGIDFRRADLRGMNFERVDLGGADLREADLRGASLRDADLSGAGLREANLRGAGLV 259
Query: 190 RTVLTRSDLGGAIIEGADFSDAVI 213
R L ++DL + A+ DA++
Sbjct: 260 RARLAKADL-----QEANLRDALL 278
>gi|428209167|ref|YP_007093520.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428011088|gb|AFY89651.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 163
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 4/107 (3%)
Query: 115 SADLRKAVHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
S++++K + K+ N AN +AD+ E++ G+ A L+ A +AN GA+L
Sbjct: 43 SSEVQKLLKTKQCPGCNLSGANLQNADLDEANLQGANLQNANLQNADLEEANLQGANLQG 102
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
+ R L +ANL +A L + L R+D+ GA + A+ + A + A+
Sbjct: 103 ANLIRADLEKANLQSANLQQASLQRADIEGANLTKANITGANLQQAE 149
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +ADL +A N + AN +AD+ E++ G+ GA L +A KAN A+L
Sbjct: 61 SGANLQNADLDEANLQGANLQNANLQNADLEEANLQGANLQGANLIRADLEKANLQSANL 120
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ R + AN LT++++ GA ++ A+F + V+
Sbjct: 121 QQASLQRADIEGAN----------LTKANITGANLQQAEFENTVM 155
>gi|75910293|ref|YP_324589.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704018|gb|ABA23694.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 143
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 63/119 (52%), Gaps = 5/119 (4%)
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 178
RK + +FR A + AD++E+ G +GA L+ A KA+ +GA+LS + + L
Sbjct: 17 RKYEAGERDFRAAELSKADLQETYLEGVDLSGANLDAAKLSKADLSGANLSGVYLRKANL 76
Query: 179 NEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 232
ANL+ A LV L+ +DL GA + GA+ + A ++ A+ + + +PI+
Sbjct: 77 RGANLSGADLFKDKLVGADLSEADLRGADLRGANLNGANLNAAKYDSYTVFPEDFDPIS 135
>gi|425434939|ref|ZP_18815403.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9432]
gi|389675416|emb|CCH95473.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9432]
Length = 470
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 67/135 (49%), Gaps = 21/135 (15%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----- 159
+ I S A+ A+LR+A N R AN + AD+R+++ SG+ GA L +A +
Sbjct: 324 WAILSGAKLSGANLREA-----NLREANLSGADLRKANLSGANLWGAILIEANLWGAILI 378
Query: 160 ----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
AN G +LS + + L+ NL+ A+L L+ ++L GA IE A F
Sbjct: 379 EANLRGVNLSGANLRGVNLSGVNLRGVNLSGVNLSGAILRGANLSGANLSGADIENAIFI 438
Query: 210 DAV-IDLAQKQALCK 223
DA I QKQ L +
Sbjct: 439 DATGITPEQKQDLIR 453
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 5/106 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD- 167
S A A+LR A ++ N AN AD+ + SG+K +GA L +A +AN +GAD
Sbjct: 293 SDANLRGANLRWADLMEANLSGANLIEADLSWAILSGAKLSGANLREANLREANLSGADL 352
Query: 168 ----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
LS + +L EANL A+L+ L +L GA + G + S
Sbjct: 353 RKANLSGANLWGAILIEANLWGAILIEANLRGVNLSGANLRGVNLS 398
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 5/103 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL +A N R AN AD+ E++ SG+ A L A+ A +GA+L +
Sbjct: 280 ADLSWADLIEADLSDANLRGANLRWADLMEANLSGANLIEADLSWAILSGAKLSGANLRE 339
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L EANL+ A L + L+ ++L GAI+ A+ A++
Sbjct: 340 A-----NLREANLSGADLRKANLSGANLWGAILIEANLWGAIL 377
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 54/105 (51%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 166
I S A A L +A ++ + A+ + AD+ E+D S + GA L A +AN +GA
Sbjct: 256 ILSEAILIGAALIEADLIEADLIEADLSWADLIEADLSDANLRGANLRWADLMEANLSGA 315
Query: 167 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+L + + +L+ A L+ A L L ++L GA + A+ S A
Sbjct: 316 NLIEADLSWAILSGAKLSGANLREANLREANLSGADLRKANLSGA 360
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
+R+ K R + + AD+R ++ G+ GA L +A+ A ADL +
Sbjct: 222 IREGTIDKTTLRFVDLSGADLRRANLIGANLKGAILSEAILIGAALIEADLIEA-----D 276
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L EA+L+ A L+ L+ ++L GA + AD +A
Sbjct: 277 LIEADLSWADLIEADLSDANLRGANLRWADLMEA 310
>gi|126659170|ref|ZP_01730309.1| pentapeptide repeat family protein [Cyanothece sp. CCY0110]
gi|126619577|gb|EAZ90307.1| pentapeptide repeat family protein [Cyanothece sp. CCY0110]
Length = 301
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 60/139 (43%), Gaps = 28/139 (20%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 165
A F A L+ K N + ANF++A ++ DFS + KFNG+ L K N TG
Sbjct: 151 ANFEEAKLKNINFSKANLKNANFSNAKLQNIDFSEANLYEVKFNGSDLYKIDFRDKNLTG 210
Query: 166 ADLS--------------------DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
D S D + + L ANLTNA L L + L GAI++G
Sbjct: 211 GDFSGADFWNVNLDNANLTDTNFSDANLKVINLKNANLTNADLSVANLAHAKLEGAILDG 270
Query: 206 ADFSDAVIDLAQKQALCKY 224
A+ A I + LC Y
Sbjct: 271 ANLEGAAI---RGTVLCDY 286
>gi|392412448|ref|YP_006449055.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
gi|390625584|gb|AFM26791.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
Length = 241
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A+ DL F R NF+ A++ +++F+ S G+ L AV A TG+DL
Sbjct: 37 SEAELSQVDLSSLNLSGMKFMRCNFSRANLTKTNFADSDLTGSNLTTAVLVAATLTGSDL 96
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++T + L A+L N+ L+ L + L A ++GAD S A
Sbjct: 97 TETNLTGADLTAADLVNSTLINADLYWARLTLATLDGADLSQA 139
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 10/94 (10%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR+ ++F A + D+ + SG KF +A K NF +DL+ +
Sbjct: 26 LREKARPGDDFSEAELSQVDLSSLNLSGMKFMRCNFSRANLTKTNFADSDLTGS------ 79
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
NLT AVLV LT SDL + GAD + A
Sbjct: 80 ----NLTTAVLVAATLTGSDLTETNLTGADLTAA 109
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADL +A K + A+ T AD+ +D G+ G L KAV AN + A
Sbjct: 129 ATLDGADLSQANLSKSDLTLASLTGADLFWADLGGATLVGTNLSKAVLTVANLSKA---- 184
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L A+L+ A+L L+ +DL A + GAD S+A
Sbjct: 185 ------ALMMADLSGAILAGADLSGADLSEANLTGADLSEA 219
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 12/114 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ A +ADL + + + A T AD+ +++ S S A L A + A+
Sbjct: 102 TGADLTAADLVNSTLINADLYWARLTLATLDGADLSQANLSKSDLTLASLTGADLFWADL 161
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
GA L T + + VL ANL+ A L+ +DL GAI+ GAD S A DL++
Sbjct: 162 GGATLVGTNLSKAVLTVANLSKAALMM-----ADLSGAILAGADLSGA--DLSE 208
>gi|254416808|ref|ZP_05030557.1| Leucine Rich Repeat domain protein [Coleofasciculus chthonoplastes
PCC 7420]
gi|196176354|gb|EDX71369.1| Leucine Rich Repeat domain protein [Coleofasciculus chthonoplastes
PCC 7420]
Length = 1492
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F A+L+K R +F+ AD+ DF+G+KFN NF+GA+L+D
Sbjct: 390 FSGANLKKGSFNSSTLTRIDFSQADLESVDFTGTKFN----------SINFSGANLTDAE 439
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
M L+E NL A LV+ L +L + + S+A +
Sbjct: 440 MSGADLHEMNLQGATLVKVYLCNGNLSSQNLRNQNLSEANL 480
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 5/111 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A A+L ++ NF AN T + S+ SGS F A L + +A A+F A+
Sbjct: 129 SDADLSLANLDESDLQNINFSGANLTQCQLNNSNLSGSNFQDANLTRILARTASFKQANF 188
Query: 169 SDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++ ++RM + AN NA L L S+L GA GA+ S ++D
Sbjct: 189 NEAKLNRMNCDRCDFSGANFQNADLSGAFLQNSNLKGADFRGANLSGTLLD 239
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 53/115 (46%), Gaps = 3/115 (2%)
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
TR +F S A S D NF AN T A+M +D GA L K
Sbjct: 406 TRIDF---SQADLESVDFTGTKFNSINFSGANLTDAEMSGADLHEMNLQGATLVKVYLCN 462
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
N + +L + + L EANL++A L L++++L + + GA+ DA +++
Sbjct: 463 GNLSSQNLRNQNLSEANLREANLSHADLSGANLSQANLNRSDLTGANLQDANLEM 517
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 49/106 (46%), Gaps = 10/106 (9%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTGAD 167
F A+L A N AN A++ E++ + NGA L++ A ANF GAD
Sbjct: 924 FSHANLTAANLENANLENANLEGANLEEANLENANLNGANLKQLEGNYANFCGANFVGAD 983
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
LS + +EANLT A L + GA ++GA DA +
Sbjct: 984 LSYAEFEDSSFSEANLTGANL-----SHGTFNGAYLQGACLRDARL 1024
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 48/108 (44%), Gaps = 11/108 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-AVAYKANFTGAD 167
S + ADL +A + N RANFT A+ F A LE A ANFT AD
Sbjct: 1343 SQSDLTDADLTRAFLSRTNLDRANFTRAN----------FQNASLENIAKISAANFTDAD 1392
Query: 168 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 215
S+ L+ + + + + A+ L+ S A + GAD +DL
Sbjct: 1393 FSEALIQNVDFHNSTMQKAIFQSARLSMSRFSYADLTGADLRKMQMDL 1440
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 10/82 (12%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFN-----GAYLEKAVA 158
S + F A+L + + +F++ANF A + DFSG+ F GA+L+ +
Sbjct: 164 SGSNFQDANLTRILARTASFKQANFNEAKLNRMNCDRCDFSGANFQNADLSGAFLQNSNL 223
Query: 159 YKANFTGADLSDTLMDRMVLNE 180
A+F GA+LS TL+D L +
Sbjct: 224 KGADFRGANLSGTLLDNSCLED 245
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 53/124 (42%), Gaps = 26/124 (20%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR N+ AN +A++ +F G+ GA L+ +K NF ADLS +++M
Sbjct: 660 DLRDQDFQGVNWDGANLENANLSNCNFVGASLVGANLKGTYLFKTNFDRADLSRANLEKM 719
Query: 177 V------------------LNEANLTN--------AVLVRTVLTRSDLGGAIIEGADFSD 210
L+ ANL N A L LT +DL GA + AD D
Sbjct: 720 RGELEKVIGYLPLSCRQANLSHANLQNHKLFDLFGANLSHANLTGADLAGANLNDADLRD 779
Query: 211 AVID 214
A ++
Sbjct: 780 ANLN 783
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 36/80 (45%), Gaps = 10/80 (12%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
E++ + D+R+ DF G ++GA LE A NF GA L ANL
Sbjct: 648 NESYSGQDLQGMDLRDQDFQGVNWDGANLENANLSNCNFVGASLVG----------ANLK 697
Query: 185 NAVLVRTVLTRSDLGGAIIE 204
L +T R+DL A +E
Sbjct: 698 GTYLFKTNFDRADLSRANLE 717
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 14/124 (11%)
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA--- 156
+ G + A F ADL A +F AN T A++ F+G+ GA L A
Sbjct: 966 QLEGNYANFCGANFVGADLSYAEFEDSSFSEANLTGANLSHGTFNGAYLQGACLRDARLC 1025
Query: 157 -VAY----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 205
V + +A+ ADL + L+ ANL N L +L GA ++G
Sbjct: 1026 HVTFDWSDDGTNLEEADLENADLEGANLQGCYLSNANLKNINWTGAKLDNVELQGANLQG 1085
Query: 206 ADFS 209
AD S
Sbjct: 1086 ADLS 1089
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 43/109 (39%), Gaps = 26/109 (23%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD----------------R 175
N AD+ +D G+ G YL A N+TGA L + +
Sbjct: 1037 NLEEADLENADLEGANLQGCYLSNANLKNINWTGAKLDNVELQGANLQGADLSKVAQFVE 1096
Query: 176 MVLNEANLTNAVLVRTVLTRS----------DLGGAIIEGADFSDAVID 214
+ L ANL A L+RS DL GA ++GA F+D ID
Sbjct: 1097 VALRGANLQGANFRGLNLSRSYWQGVNLSGVDLRGANLQGAHFADCQID 1145
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 12/92 (13%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
+E+F A+ T+AD+ E AYLE+ AN TG + ++ +++ L+ ANL
Sbjct: 811 QEDFA-ADLTNADLSE----------AYLERCQLSGANLTGVNFTNANLEQADLSHANLE 859
Query: 185 NAVLVRTVLTRSDLGGAIIEGA-DFSDAVIDL 215
NA L L ++L G I GA + SD + +L
Sbjct: 860 NANLEGANLEEANLIGTAISGALNVSDRIREL 891
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY-LEKAVAYKANFTGAD 167
+ A ADLR A N + N T + ++ K+ + L + A+ T AD
Sbjct: 768 AGANLNDADLRDA-----NLNQVNLTRTTINQNTQLDPKWRLVWQLLNQEDFAADLTNAD 822
Query: 168 LSDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
LS+ ++R L+ ANL TNA L + L+ ++L A +EGA+ +A +
Sbjct: 823 LSEAYLERCQLSGANLTGVNFTNANLEQADLSHANLENANLEGANLEEANL 873
>gi|428776740|ref|YP_007168527.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691019|gb|AFZ44313.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 157
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 25/121 (20%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVA 158
S A +ADL +A + N RAN T+ D+ ++D G+ GA L +A+
Sbjct: 38 SEADLSNADLSQATLCRSNLSRANLTNTDLNQADLRSANLSQVNLIGASLVGAKLGRAIL 97
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 218
A+ GADLSD A+LT A LT ++L GA++ GA+ D ++ A
Sbjct: 98 TGADLRGADLSD----------ADLTGA-----NLTDAELSGAVLTGANIEDVELEKAAT 142
Query: 219 Q 219
+
Sbjct: 143 E 143
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
N + A D+ E+D S + + A L ++ +AN T DL+ + L++ NL A
Sbjct: 26 NLKSAYLEEIDLSEADLSNADLSQATLCRSNLSRANLTNTDLNQADLRSANLSQVNLIGA 85
Query: 187 VLVRTVLTRSDLGGAIIEGADFSDA 211
LV L R+ L GA + GAD SDA
Sbjct: 86 SLVGAKLGRAILTGADLRGADLSDA 110
>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
Length = 261
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 60/139 (43%), Gaps = 35/139 (25%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F S L A K + NFT AD++ +DFSG++ N A L A+ A F ADLS+
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATFADADLSN 174
Query: 171 T---------------LMD---------------RMVLNEAN-----LTNAVLVRTVLTR 195
L+D R L +AN LT A L VLT
Sbjct: 175 ARIIGGGKGVNFRNAKLIDADLGADPANQGMAPVRAELPDANFDGADLTRANLTHAVLTG 234
Query: 196 SDLGGAIIEGADFSDAVID 214
++ AI+ GA F AV+D
Sbjct: 235 ANFTAAIVSGARFDYAVLD 253
>gi|359459695|ref|ZP_09248258.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 332
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 10/113 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A DL+ + + N A T A++ +++ + + +GA L++AV +KA+ T ADL
Sbjct: 48 SLAYLNRVDLQTSNLTQSNLSGATLTQANLSQANLTDAALHGANLQRAVLFKADLTLADL 107
Query: 169 SD-TLMD----RMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDA 211
+D LM+ + L NLT A L L +DL GAI++G D A
Sbjct: 108 TDANLMEADLREVTLRSTNLTGACLRSANLREENRNCADLRGAILDGVDLQGA 160
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 55/117 (47%), Gaps = 3/117 (2%)
Query: 95 NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N+ A+ RG G A ADL K N R AN +A++ +D G+ A
Sbjct: 141 NRNCADLRGAILDGVDLQGANLRGADLSKVSLQGANLRNANLRAANLAGADLQGANLEQA 200
Query: 152 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 208
L +A +AN + A L+D ++R+ L A L N+ L L S+L A ++GA
Sbjct: 201 LLIEANLQQANLSHATLADAKLERVNLQMAQLVNSDLSDCTLVESELSQANLQGATL 257
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 47/98 (47%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
ADLR A+ + + AN AD+ + G+ A L A A+ GA+L L+
Sbjct: 145 ADLRGAILDGVDLQGANLRGADLSKVSLQGANLRNANLRAANLAGADLQGANLEQALLIE 204
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
L +ANL++A L L R +L A + +D SD +
Sbjct: 205 ANLQQANLSHATLADAKLERVNLQMAQLVNSDLSDCTL 242
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 10/104 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 165
A A+L A R N A + SD S S+ + A L+ A Y++
Sbjct: 205 ANLQQANLSHATLADAKLERVNLQMAQLVNSDLSDCTLVESELSQANLQGATLYRSRLNR 264
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
A+LS R L ANL A L++ L R+DL A ++GA+ +
Sbjct: 265 ANLS-----RANLTAANLQEAFLIQAFLARTDLTDAHLQGANLT 303
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A ADLR+ N A SA++RE + + + GA L+ AN GADLS
Sbjct: 110 ANLMEADLREVTLRSTNLTGACLRSANLREENRNCADLRGAILDGVDLQGANLRGADLS- 168
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ L ANL NA L L +DL GA +E A +A
Sbjct: 169 ----KVSLQGANLRNANLRAANLAGADLQGANLEQALLIEA 205
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 6/122 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTG 165
A A+L++AV K + A+ T AD+RE + GA L A + N
Sbjct: 85 AALHGANLQRAVLFKADLTLADLTDANLMEADLREVTLRSTNLTGACLRSANLREENRNC 144
Query: 166 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKY 224
ADL ++D + L ANL A L + L ++L A + A+ + A + A +QAL
Sbjct: 145 ADLRGAILDGVDLQGANLRGADLSKVSLQGANLRNANLRAANLAGADLQGANLEQALLIE 204
Query: 225 AN 226
AN
Sbjct: 205 AN 206
>gi|158341491|ref|YP_001522656.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
MBIC11017]
gi|158311732|gb|ABW33342.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
MBIC11017]
Length = 1037
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 58/108 (53%), Gaps = 4/108 (3%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 174
SADLR A+ ++ N N ++ ++ +D S + + A L A +AN +GADL +T +
Sbjct: 884 SADLRNAILIRANLFSTNLSNVNLYSADLSSTDMSSANLSNADLIRANLSGADLHNTDLF 943
Query: 175 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 222
L+ ANL+NA L +L S+L E + + A ++ A+ +C
Sbjct: 944 YANLSNANLSNANLSNAILLSSNLR----ETKNLTQAQLEGAEHPLIC 987
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 10/109 (9%)
Query: 111 AQFGSADLRKAVHVKEN----------FRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
A+ ADLR A+ ++ N F A+ AD+R +D + + FN A L
Sbjct: 805 AKLRHADLRSAILIRANLFAADLNFTDFSDADLRYADLRRTDLNFTDFNHANLNFTKLGN 864
Query: 161 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
AN G +LSD + L A+L NA+L+R L ++L + AD S
Sbjct: 865 ANLNGTNLSDANLIGTNLYSADLRNAILIRANLFSTNLSNVNLYSADLS 913
>gi|186684179|ref|YP_001867375.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466631|gb|ACC82432.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 223
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 58/129 (44%), Gaps = 17/129 (13%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
C+ + L D N A G A +ADL +A + N + ANF AD+ + +
Sbjct: 104 CNLTGAMLKDANLQAANLEG-------ANLQNADLERANLQQTNLQGANFQGADLGKVNL 156
Query: 144 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
G+ GA L A KAN GA+L ANL A L +T LT +++ G +
Sbjct: 157 LGANLLGANLFDADLEKANLLGANLQ----------MANLQGADLEKTNLTNANIQGVNL 206
Query: 204 EGADFSDAV 212
G D DA+
Sbjct: 207 MGVDLEDAI 215
>gi|385802320|ref|YP_005838722.1| pentapeptide repeat-containing protein [Haloquadratum walsbyi C23]
gi|339730551|emb|CCC41895.1| pentapeptide repeat family protein [Haloquadratum walsbyi C23]
Length = 554
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 5/99 (5%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+ +A L+ A ++ N AN +SAD+ ++ + + F+G+ L A+FT DL D
Sbjct: 391 ARLSNASLQGADLIRANLSGANLSSADLSNANLNQADFSGSSL-----TDADFTHTDLID 445
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 209
+ L+ ANL+NA L RT L+ DL A ++ AD S
Sbjct: 446 ADLSEANLSRANLSNADLNRTNLSDVDLSDASLKKADLS 484
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 18/119 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A LR A + N + A +R+ DFSG+ +G L A A A+F+GADL++
Sbjct: 175 ADLSNASLRNASLRDADLSDTNLSGASLRDVDFSGADLSGVDLTLASAIDADFSGADLTN 234
Query: 171 T------LMD-RMV-----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L D R+ L ANL+N L++ L+ SDL + AD DA
Sbjct: 235 ADLSNADLFDPRLTDVSGADLTNTDLRMANLSNTSLIQADLSDSDLSSTDLTDADLRDA 293
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 5/107 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S+A +A+L +A + A+FT D+ ++D S + + A L A + N + DL
Sbjct: 414 SSADLSNANLNQADFSGSSLTDADFTHTDLIDADLSEANLSRANLSNADLNRTNLSDVDL 473
Query: 169 SDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSD 210
SD + + L++ ANLT+A L T L+ +DL GA + D SD
Sbjct: 474 SDASLKKADLSQTNLSGANLTDADLPGTNLSNADLSGASLNKTDLSD 520
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 18/116 (15%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADM-------RESDFSGSKFNGAYLEKAVAYKA 161
S+ ADLR A N AN T AD+ ++D SG+ GA L +A A
Sbjct: 281 SSTDLTDADLRDA-----NLAGANLTDADLSSGSRPLTKTDLSGADLTGATLTQANLTGA 335
Query: 162 NFTGADLSDT-LMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ + AD++D L D + +EA L + ++ +T+L+ +L A + AD +DA
Sbjct: 336 SVSDADITDAQLTDARFEYTIPQDEAILPHNIIPQTILSGGNLSDAYLREADLADA 391
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 6/102 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKF------NGAYLEKAVAYKANFTGADLS 169
ADL A + N A+ + AD+ ++ + ++F + A L + + +G +LS
Sbjct: 320 ADLTGATLTQANLTGASVSDADITDAQLTDARFEYTIPQDEAILPHNIIPQTILSGGNLS 379
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
D + L +A L+NA L L R++L GA + AD S+A
Sbjct: 380 DAYLREADLADARLSNASLQGADLIRANLSGANLSSADLSNA 421
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 55/122 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S F DL + A +AD+ ++ + + GA L A +A +GA++
Sbjct: 22 SGMNFTDTDLSGVDLYNSDLSDARLLNADLSGANLTNANLAGADLSAASLSEATLSGANI 81
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 228
SD + L+ A+LT+ L L +DL GA + GA +DA + AQ + +
Sbjct: 82 SDANLTGTDLSSADLTDTDLSSAYLLDADLTGASLSGACVTDAQLADAQFEYTIPHDEDV 141
Query: 229 NP 230
+P
Sbjct: 142 SP 143
>gi|307154970|ref|YP_003890354.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985198|gb|ADN17079.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 231
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 186
NF+ + D+RE++ + + N A L KA +AN +GA+LS +++ +L ANL A
Sbjct: 45 NFKDTDLQGTDLREANLTQANLNWANLHKADLTQANLSGANLSQAFLEKAILIAANLREA 104
Query: 187 VLV-----RTVLTRSDLGGAIIEGADFSDAVI 213
L+ + L +DL + A+F +A++
Sbjct: 105 WLIGSDFEKANLRDADLSKTLAAKANFKNAIL 136
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 52/98 (53%), Gaps = 5/98 (5%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 176
DLR+A + N AN AD+ +++ SG+ + A+LEKA+ AN A L + ++
Sbjct: 55 DLREANLTQANLNWANLHKADLTQANLSGANLSQAFLEKAILIAANLREAWLIGSDFEK- 113
Query: 177 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
ANL +A L +T+ +++ AI+ G D ID
Sbjct: 114 ----ANLRDADLSKTLAAKANFKNAILTGVCLHDWSID 147
>gi|21673746|ref|NP_661811.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
gi|21646871|gb|AAM72153.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
Length = 382
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 51/92 (55%), Gaps = 5/92 (5%)
Query: 125 KENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLN 179
K +F + + ADMR+SDF S+F +GA L+ +V + FTGAD++ + +
Sbjct: 24 KIDFSQTSLAGADMRQSDFGRSEFRDADLSGAKLDGSVLAGSRFTGADMNQASLAGALCA 83
Query: 180 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
++ + A + TVL R+D G A + G D S A
Sbjct: 84 GSDFSGAKMASTVLRRADCGEAKLRGTDLSGA 115
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AD+R++ + FR A+ + A + S +GS+F GA + +A A G+D
Sbjct: 28 SQTSLAGADMRQSDFGRSEFRDADLSGAKLDGSVLAGSRFTGADMNQASLAGALCAGSDF 87
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
S M VL A+ A L T L+ +DL A +E AD S A
Sbjct: 88 SGAKMASTVLRRADCGEAKLRGTDLSGADLREANLEHADLSRA 130
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
AD+ +D + + LEKA AN GADL + R L +A+L A L R
Sbjct: 283 ADLHGADLEKASLKRSDLEKADLKSANLRGADLRSANLQRADLRQADLRGANLWLANTGR 342
Query: 196 SDLGGAIIEGADFSDAVIDLAQK 218
++ GAI+ S+ V+D +K
Sbjct: 343 AEFEGAIVS----SETVLDTGKK 361
>gi|428213326|ref|YP_007086470.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428001707|gb|AFY82550.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 340
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 51/105 (48%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A SA+L A + + A A +R ++ S S GA L +A+ +GADL
Sbjct: 192 SGAVLNSANLSGASVRQAFLQGAQMEGASLRNTNMSTSNLRGALL-----TQADLSGADL 246
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
D M +VLNEA L N L L + L G I+ GAD A++
Sbjct: 247 LDADMQGVVLNEAILINTQLRNVQLQGASLEGTILSGADLEGAIL 291
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 53/104 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A N R AN T ++ ++ S S+ GA L A+ T A+LS
Sbjct: 94 ANLTGANLTGANLQGVNLRGANLTGVNLTGANLSRSQLVGAVLFLINLANADLTEANLSG 153
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
T + R+ + +ANL A L + LT ++L G + A+ S AV++
Sbjct: 154 TDLSRIYIEQANLNGAQLQGSNLTGAELFGVTLNNANLSGAVLN 197
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A + DL A N A+F +D+R ++ +G+ GA L+ AN TG +L+
Sbjct: 64 ANLSNTDLTGADLSGSNLTNASFRGSDLRGANLTGANLTGANLQGVNLRGANLTGVNLTG 123
Query: 171 TLMDR--MV-------------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R +V L EANL+ L R + +++L GA ++G++ + A
Sbjct: 124 ANLSRSQLVGAVLFLINLANADLTEANLSGTDLSRIYIEQANLNGAQLQGSNLTGA 179
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+L ++ V N +AD+ E++ SG+ + Y+E+A A G++L+ +
Sbjct: 124 ANLSRSQLVGAVLFLINLANADLTEANLSGTDLSRIYIEQANLNGAQLQGSNLTGAELFG 183
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ LN ANL+ AVL L+ + + A ++GA A
Sbjct: 184 VTLNNANLSGAVLNSANLSGASVRQAFLQGAQMEGA 219
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 41/104 (39%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ A LR N R A T AD+ +D + G L +A+ L
Sbjct: 214 AQMEGASLRNTNMSTSNLRGALLTQADLSGADLLDADMQGVVLNEAILINTQLRNVQLQG 273
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 214
++ +L+ A+L A+L L G + GAD + I+
Sbjct: 274 ASLEGTILSGADLEGAILTGATFRNVQLTGTNLRGADLTQIEIE 317
>gi|376007406|ref|ZP_09784602.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423063332|ref|ZP_17052122.1| pentapeptide repeat-containing protein [Arthrospira platensis C1]
gi|375324195|emb|CCE20355.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406715454|gb|EKD10610.1| pentapeptide repeat-containing protein [Arthrospira platensis C1]
Length = 274
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
AQ A+L A NF RAN T A MR S N A L A +ANFT A+
Sbjct: 70 AQLADANLISANLTDANFSRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFIG 129
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
A+L N+ L+RT L +++L GA ++GA+ ++ ++
Sbjct: 130 ----------AHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 91 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
LAD N A T F S A A +R ++ AN T A++ E++F+ + F
Sbjct: 72 LADANLISANLTDANF---SRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFI 128
Query: 150 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIE 204
GA+L + + N A+LS +D ANLTN ++ + L + + L GA++
Sbjct: 129 GAHLVNSTLIRTNLLKANLSGANLD-----GANLTNVIMRDSTLEGANLSNATLSGAMLM 183
Query: 205 GADFSDA 211
GA+F A
Sbjct: 184 GANFHRA 190
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A F ADL + V + AN + A++R ++ S + GA + +A Y+ ++LS
Sbjct: 185 ANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSRARLYRTKLNWSNLSG 244
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAII 203
NL AV++ TVL R+ DL GAI+
Sbjct: 245 V----------NLIEAVMLDTVLYRANLRDADLRGAIL 272
Score = 40.4 bits (93), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A RG I A+L A + NF ANF A + S + L KA
Sbjct: 95 ASMRGS--ISKNVTLNMANLTDANLAEANFTEANFIGAHLVNSTLIRTN-----LLKANL 147
Query: 159 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVI 213
AN GA+L++ +M L ANL+NA L +L R+DL + GAD +DA +
Sbjct: 148 SGANLDGANLTNVIMRDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANL 207
Query: 214 DLAQKQA 220
A +A
Sbjct: 208 SEANLRA 214
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A +A L A+ + NF RA+ + M +D + + + A L A + GA++S
Sbjct: 170 ANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSR 229
Query: 171 TLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSD 210
+ R LN +NL+ AV++ TVL R++L A + GA D
Sbjct: 230 ARLYRTKLNWSNLSGVNLIEAVMLDTVLYRANLRDADLRGAILPD 274
>gi|254413444|ref|ZP_05027214.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179551|gb|EDX74545.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 768
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 59/108 (54%), Gaps = 7/108 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 163
++ Q+G DL + + N +AN T AD++ S + GA LE+ A ++A+
Sbjct: 561 TSLQYG--DLSEVDLSEANLSQANLTGADLQRSQLDQANLEGATLEQANLSGASLFRADL 618
Query: 164 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ A+LS+ +++ +L ANL L R +L+ ++L GA + AD S A
Sbjct: 619 SQANLSNAQLNQAMLRGANLQEVRLRRAILSHANLEGANLSRADLSRA 666
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 58/112 (51%), Gaps = 3/112 (2%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S + + +LR+A + R N AD+ + + + GA L+ A + + DL
Sbjct: 509 SRSNLQNINLRRASLIGAKLRHTNLQQADLSHGNLNQAILTGANLKNANLRQTSLQYGDL 568
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 217
S+ + L++ANLT A L R+ L +++L GA +E A+ S A + DL+Q
Sbjct: 569 SEVDLSEANLSQANLTGADLQRSQLDQANLEGATLEQANLSGASLFRADLSQ 620
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 68/159 (42%), Gaps = 16/159 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S A +A L +A+ N + A + ++ G+ + A L +A N GADL
Sbjct: 619 SQANLSNAQLNQAMLRGANLQEVRLRRAILSHANLEGANLSRADLSRADLSHLNLRGADL 678
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQALCKYANG 227
S T + + L A+L A L L ++L G +EGA F +A + AQ + L +
Sbjct: 679 SHTFLRHVNLTNADLRQANLTGANLFNANLSGVKVEGAIFKQNAGLSAAQGKELEQRG-A 737
Query: 228 TNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLL 266
T ++ + +R RN + P PP LL
Sbjct: 738 TVELSQLKSRD--------RNVWKHP------LPPHSLL 762
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 55/106 (51%), Gaps = 14/106 (13%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR-----MVLNEANLTN 185
AN +A++R++ + L +A +AN TGADL + +D+ L +ANL+
Sbjct: 551 ANLKNANLRQTSLQYGDLSEVDLSEANLSQANLTGADLQRSQLDQANLEGATLEQANLSG 610
Query: 186 AVLVRTVLTRSDLGG-----AIIEGADFSDAVIDLAQKQALCKYAN 226
A L R L++++L A++ GA+ + + ++A+ +AN
Sbjct: 611 ASLFRADLSQANLSNAQLNQAMLRGANLQEVRL----RRAILSHAN 652
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 54/116 (46%), Gaps = 20/116 (17%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL------- 168
AD+R + + R AN +SAD+ E++ S +K GA L A+ A+ T DL
Sbjct: 441 ADIRNSDLSAADLREANLSSADLSEANLSLAKLGGANLSSAILLGADLTVTDLNSANLNG 500
Query: 169 --------SDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDA 211
S + + + L A+L A L T L ++D L AI+ GA+ +A
Sbjct: 501 ANLNNANLSRSNLQNINLRRASLIGAKLRHTNLQQADLSHGNLNQAILTGANLKNA 556
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 47/101 (46%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+L A + + +AN ++A + ++ G+ L +A+ AN GA+LS
Sbjct: 601 ATLEQANLSGASLFRADLSQANLSNAQLNQAMLRGANLQEVRLRRAILSHANLEGANLSR 660
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ R L+ NL A L T L +L A + A+ + A
Sbjct: 661 ADLSRADLSHLNLRGADLSHTFLRHVNLTNADLRQANLTGA 701
>gi|427419123|ref|ZP_18909306.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425761836|gb|EKV02689.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 365
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 52/96 (54%)
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 177
LR+A+ ++ R N + D S+FSG+ A L +A+ Y AN + A L + +
Sbjct: 33 LREALLIRAELPRVNLSQVDGCISNFSGANLFQANLSQAIFYTANLSQAKLDWARLTGVD 92
Query: 178 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ EANL +A L+R ++ A + G++F +A++
Sbjct: 93 MREANLRDASLIRVDGQHTNFAMADLHGSNFREAIL 128
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 50/93 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A A+LR+A +FR AN T A + + + + F A L++A+ A A+L+
Sbjct: 166 ADLQQANLRQAKLTSADFREANLTQAILEDINACHTSFARAILDRAILTGALLADANLTM 225
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 203
+ +LN+ANL+NA + VLT + L GA +
Sbjct: 226 ANLKLTILNKANLSNAQVQNAVLTEASLVGATL 258
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 54/99 (54%), Gaps = 10/99 (10%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 172
F +ADL++A N R+A TSAD RE++ + A LE A +F A L +
Sbjct: 163 FYTADLQQA-----NLRQAKLTSADFREANLTQ-----AILEDINACHTSFARAILDRAI 212
Query: 173 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ +L +ANLT A L T+L +++L A ++ A ++A
Sbjct: 213 LTGALLADANLTMANLKLTILNKANLSNAQVQNAVLTEA 251
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 58/133 (43%), Gaps = 11/133 (8%)
Query: 108 GSAAQFGSADL-----RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
G F ADL R+A+ NF +A+ D + S + L+ ++ Y A+
Sbjct: 108 GQHTNFAMADLHGSNFREAILTGSNFYKASLAGVDGCMAQMSQCNLSHTDLQNSLFYTAD 167
Query: 163 FTGADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 217
A+L + EANLT A+L T R+ L AI+ GA +DA + +A
Sbjct: 168 LQQANLRQAKLTSADFREANLTQAILEDINACHTSFARAILDRAILTGALLADANLTMAN 227
Query: 218 -KQALCKYANGTN 229
K + AN +N
Sbjct: 228 LKLTILNKANLSN 240
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 175
A+LR A ++ + + NF AD+ GS F A L + YKA+ G D M +
Sbjct: 96 ANLRDASLIRVDGQHTNFAMADLH-----GSNFREAILTGSNFYKASLAGVDGCMAQMSQ 150
Query: 176 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
L+ +L N++ L +++L A + ADF +A
Sbjct: 151 CNLSHTDLQNSLFYTADLQQANLRQAKLTSADFREA 186
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 5/105 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
S AQ +A L +A V +A +++M ++D + F+ A LE A+ +GA+L
Sbjct: 239 SNAQVQNAVLTEASLVGATLCQAQLQNSNMAQTDCRNTLFSEANLENVNLQGADLSGANL 298
Query: 169 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
S + L L +A L T L+ DL + GAD S A++
Sbjct: 299 S-----KAALQGGCLKDAKLKHTNLSHVDLRHTDLTGADLSHAIL 338
>gi|218439263|ref|YP_002377592.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218171991|gb|ACK70724.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 294
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 72/161 (44%), Gaps = 25/161 (15%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
NKY+A R + + DLR NF+ A+ + A++RE + SG+ A+L
Sbjct: 12 NKYDAGERDFCNL----ELRRIDLRGLNLSHANFKGADLSYANLREINLSGADLREAFLN 67
Query: 155 KAVAYKANFTGADLSDTLMDRMVL-----NEANLTNAVLVRTVLTRSDL----------G 199
+A AN GA+L T + + L EANL+ A L L++S+L
Sbjct: 68 EADLTGANLQGANLEGTYLIKAYLMKTNLQEANLSKAYLTGAYLSKSNLTKANLSGAYLN 127
Query: 200 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 240
GA + GAD +D D + + P+ V T+K L
Sbjct: 128 GAKLSGADLTDISYD------ETTHFDVNFPLNKVETKKEL 162
>gi|418744036|ref|ZP_13300395.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
CBC379]
gi|418751631|ref|ZP_13307915.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
MOR084]
gi|409968104|gb|EKO35917.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
MOR084]
gi|410795431|gb|EKR93328.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
CBC379]
Length = 263
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 4/99 (4%)
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 184
K + + + +S + + +F G F+GA L A ++F GA+ S + LN ANL
Sbjct: 144 KGSLKGEDLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNANLR 203
Query: 185 NAVLVRTVLTRSDLGGAIIEGADFSDAVID----LAQKQ 219
N L + L GA +EGADF+DA+ D L QKQ
Sbjct: 204 NTNFRGADLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.128 0.372
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,288,672,049
Number of Sequences: 23463169
Number of extensions: 166809880
Number of successful extensions: 478096
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4073
Number of HSP's successfully gapped in prelim test: 761
Number of HSP's that attempted gapping in prelim test: 398954
Number of HSP's gapped (non-prelim): 43242
length of query: 282
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 141
effective length of database: 9,050,888,538
effective search space: 1276175283858
effective search space used: 1276175283858
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)
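
Note on the statistics above: the bit values in parentheses can be reproduced from the raw scores with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, and the X drop-off values with Lambda * X / ln 2. The sketch below is only a consistency check, not part of the BLAST output. It assumes that the first Lambda/K/H line is the ungapped parameter set (used for X1 and S1) and the second is the gapped set (used for X2, X3, S2 and the reported HSP scores); that pairing is inferred from the numbers agreeing, and the function names are hypothetical.

```python
import math

# Assumption: first "Lambda K H" line = ungapped, second = gapped.
UNGAPPED = (0.315, 0.128)   # (Lambda, K)
GAPPED = (0.267, 0.0410)    # (Lambda, K)


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits: (Lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X drop-off value to bits: Lambda*X / ln 2 (no K term)."""
    return lam * raw / math.log(2)


# Cutoff scores and drop-offs from the footer.
s1_bits = bit_score(41, *UNGAPPED)
s2_bits = bit_score(76, *GAPPED)
x1_bits = dropoff_bits(16, UNGAPPED[0])
x2_bits = dropoff_bits(38, GAPPED[0])
x3_bits = dropoff_bits(64, GAPPED[0])

# Two HSP scores reported in the alignments above (raw 112 and raw 120).
hsp_112_bits = bit_score(112, *GAPPED)
hsp_120_bits = bit_score(120, *GAPPED)

# Effective search space = effective query length * effective database
# length, using the values exactly as printed in the footer.
search_space = 141 * 9_050_888_538

if __name__ == "__main__":
    for name, value in [("S1", s1_bits), ("S2", s2_bits),
                        ("X1", x1_bits), ("X2", x2_bits), ("X3", x3_bits),
                        ("HSP 112", hsp_112_bits), ("HSP 120", hsp_120_bits)]:
        print(f"{name}: {value:.1f} bits")
    print("effective search space:", search_space)
```

Under these assumptions the computed values agree, to one decimal place, with the bit values printed in the report, and the product matches the reported effective search space.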