BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023997
(274 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
Length = 280
Score = 433 bits (1113), Expect = e-119, Method: Compositional matrix adjust.
Identities = 221/280 (78%), Positives = 238/280 (85%), Gaps = 6/280 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTE------SDGQFPGPY 54
MA +SISPLSIKS+N SSS+ PY L + SKP + CQ++++ E S ++ +
Sbjct: 1 MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLATEREDRILDCSTTRYKVHH 60
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 114
+K KNWR VSTALAAA + + A ADLNK+EAE RGEFGIGSAAQFGSADLRKAV
Sbjct: 61 SKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRKAV 120
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
HV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN
Sbjct: 121 HVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 180
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTN ITGVSTRKSLGC
Sbjct: 181 LTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNSITGVSTRKSLGC 240
Query: 235 GNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
GNSRRNAYGSPSSPLLSAPPQKLLDRDGFCD TGLCDAK
Sbjct: 241 GNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280
>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
Length = 261
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 216/274 (78%), Positives = 229/274 (83%), Gaps = 13/274 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MALSS+SPL I SK P L + SKP V C+I + G + A+ K W
Sbjct: 1 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNYCRANAESKKW 48
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 49 QRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVNENF 107
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
RRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL
Sbjct: 108 RRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 167
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRN 240
RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SLGCGNSRR+
Sbjct: 168 ARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASLGCGNSRRS 227
Query: 241 AYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 228 AYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 261
>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
vinifera]
Length = 596
Score = 416 bits (1068), Expect = e-114, Method: Compositional matrix adjust.
Identities = 216/274 (78%), Positives = 229/274 (83%), Gaps = 13/274 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MALSS+SPL I SK P L + SKP V C+I + G + A+ K W
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNYCRANAESKKW 383
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 384 QRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVNENF 442
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
RRANFTSADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL
Sbjct: 443 RRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 502
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRN 240
RTVLTRSDLGGA+IEGADFSDAVIDL QKQALCKYA+GTNPITGVSTR SLGCGNSRR+
Sbjct: 503 ARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASLGCGNSRRS 562
Query: 241 AYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
AYGSPSSPLLSAPP KLLDRDGFCD GTGLCDAK
Sbjct: 563 AYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 596
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 108/171 (63%), Positives = 123/171 (71%), Gaps = 11/171 (6%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MALSS+SPL I SK P L +LSKP V C+I + E++ + A+ K W
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRENNWRGEA-NAESKKW 50
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 51 QRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRKAVHVNENF 109
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
RRANFTSADMRESDFSGS FNG YLEKAVAYKA+ TG D +MVL+
Sbjct: 110 RRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTGPDAPHARPYKMVLH 160
>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
Length = 275
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 211/282 (74%), Positives = 233/282 (82%), Gaps = 15/282 (5%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPG-------- 52
MA +SIS +SIKS N + P+++ +LSKP +A Q+ TE QF
Sbjct: 1 MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQL--DTERGNQFADCSKNGYEV 53
Query: 53 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 112
AK KNW VST L AA ++ S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54 ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
AVH+ ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 173
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
+NLTNAVLVR+VLTRSDLGGA+I GADFSDAVIDL QKQALCKYA+GTNPITGVSTR SL
Sbjct: 174 SNLTNAVLVRSVLTRSDLGGALIAGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 233
Query: 233 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
GCGNSRRNAYG+PSSPLLSAPPQKLLDRDGFCD GTGLCDAK
Sbjct: 234 GCGNSRRNAYGTPSSPLLSAPPQKLLDRDGFCDQGTGLCDAK 275
>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
Length = 279
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/278 (73%), Positives = 228/278 (82%), Gaps = 6/278 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTE-----SDGQFPGPYA 55
MALSSIS LS+K L SS S+ P L K + + QI+ + + S+ + G
Sbjct: 1 MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKDQTQDCSERKHIGKIT 59
Query: 56 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 115
+ K W+ VSTALAAA V SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRKAVH
Sbjct: 60 EPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRKAVH 119
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
+ ENFRRANFTSADMRESDFSG FNGAYLEKAVAYK NF+GADLSDTLMDRMVLNEAN
Sbjct: 120 INENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANF 179
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 235
TNAVLVR+VLTRSDLGGAII GADFSDAVIDL QKQALCKYA+GTNP+TGVSTR SLGCG
Sbjct: 180 TNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCG 239
Query: 236 NSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 273
NSRRNAYG+PSSPLLSAPPQ+LLDRDGFCD TGLC+A
Sbjct: 240 NSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQDTGLCEA 277
>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
Length = 273
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 206/279 (73%), Positives = 230/279 (82%), Gaps = 16/279 (5%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPGPY 54
MAL+S+SPLSI ++N SS+ +L H S P+ V CQ++S + +
Sbjct: 2 MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRDHPQES---- 55
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 114
K W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRKAV
Sbjct: 56 ---KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFGSADLRKAV 111
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
HV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEAN
Sbjct: 112 HVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEAN 171
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTNA+LVRTVLTRSDLGG+IIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SLGC
Sbjct: 172 LTNAILVRTVLTRSDLGGSIIEGADFSDAVLDLTQKLALCKYASGTNPVTGVSTRVSLGC 231
Query: 235 GNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 273
GN RRNAYG+PSSPLLSAPPQKLL+RDGFCD TGLCD+
Sbjct: 232 GNKRRNAYGTPSSPLLSAPPQKLLNRDGFCDEATGLCDS 270
>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Glycine max]
Length = 260
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 207/274 (75%), Positives = 228/274 (83%), Gaps = 14/274 (5%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V ++++ W
Sbjct: 1 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES-------------TKW 47
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 48 GKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRKAVHVNENF 106
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
RRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNA+L
Sbjct: 107 RRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAIL 166
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRN 240
+RTVLTRSDLGGAIIEGADFSDAV+DL QKQALCKYA+GTNP+TGVSTR SLGCGN RRN
Sbjct: 167 LRTVLTRSDLGGAIIEGADFSDAVLDLPQKQALCKYASGTNPVTGVSTRVSLGCGNKRRN 226
Query: 241 AYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
AYGSPSSPLLSAPPQKLLDRDGFCD TGLCDAK
Sbjct: 227 AYGSPSSPLLSAPPQKLLDRDGFCDDATGLCDAK 260
>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 262
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 200/274 (72%), Positives = 221/274 (80%), Gaps = 12/274 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MAL+S +PLSI S + S +SK V C++S + P KNW
Sbjct: 1 MALNSFTPLSINSHHVSCYPSS-----SKVSKSSQVICKMSLNNDH------PQESNKNW 49
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV ENF
Sbjct: 50 GKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVNENF 108
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
RRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNEANLTNA+L
Sbjct: 109 RRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNEANLTNAIL 168
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRN 240
RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SLGCGN RRN
Sbjct: 169 SRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSLGCGNKRRN 228
Query: 241 AYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
AYG+PSSPLLSAPPQKLLDRDGFCD +GLCD+K
Sbjct: 229 AYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 262
>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 232
Score = 379 bits (972), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/217 (84%), Positives = 198/217 (91%), Gaps = 1/217 (0%)
Query: 58 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 117
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV
Sbjct: 17 KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTGADLSDTLMDRMVLNEANLTN
Sbjct: 76 ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLMDRMVLNEANLTN 135
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
A+L RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTR SLGCGN
Sbjct: 136 AILSRTVLTRSDLGGAIIEGADFSDAVLDLPQKLALCKYASGTNPVTGVSTRVSLGCGNK 195
Query: 238 RRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
RRNAYG+PSSPLLSAPPQKLLDRDGFCD +GLCD+K
Sbjct: 196 RRNAYGTPSSPLLSAPPQKLLDRDGFCDEASGLCDSK 232
>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
Length = 291
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 183/233 (78%), Positives = 203/233 (87%), Gaps = 3/233 (1%)
Query: 41 SSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG 100
+ + + D Q + KNW+ ++ ALA V+ + ++A ADLNKYEAETRGEFGIG
Sbjct: 58 TDQHKKDAQPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEAETRGEFGIG 114
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAAQFGSA+LRK VH ENFRRANFTSAD+RESDFSGS FNGAYLEKAVAYK NFTGADL
Sbjct: 115 SAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAYKTNFTGADL 174
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
SDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGADFSDAVID QKQALCKYA+GT
Sbjct: 175 SDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDFTQKQALCKYASGT 234
Query: 221 NPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDA 273
NPITG+STRKSLGCGNSRRNAYG+PS+PLLSAPP+KLLD+DGFCDS TGLCDA
Sbjct: 235 NPITGISTRKSLGCGNSRRNAYGTPSAPLLSAPPEKLLDKDGFCDSSTGLCDA 287
>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
gi|194694816|gb|ACF81492.1| unknown [Zea mays]
gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
Length = 268
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/194 (91%), Positives = 187/194 (96%)
Query: 80 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 134 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 193
Query: 200 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 259
FSDAVIDL+QKQALCKYA+GTNP+TGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK+LD
Sbjct: 194 FSDAVIDLSQKQALCKYASGTNPMTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKILD 253
Query: 260 RDGFCDSGTGLCDA 273
RDGFCD TG+CDA
Sbjct: 254 RDGFCDPATGMCDA 267
>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
Length = 270
Score = 367 bits (941), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 178/194 (91%), Positives = 185/194 (95%)
Query: 80 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS
Sbjct: 76 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGAIIEGAD
Sbjct: 136 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGAD 195
Query: 200 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 259
FSDAVIDL QKQALCKYA+GTN ITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD
Sbjct: 196 FSDAVIDLPQKQALCKYASGTNSITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 255
Query: 260 RDGFCDSGTGLCDA 273
RDGFCD TG+C+A
Sbjct: 256 RDGFCDPATGMCEA 269
>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
Length = 276
Score = 366 bits (939), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 200/280 (71%), Positives = 221/280 (78%), Gaps = 10/280 (3%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQ------FPGPY 54
MAL + SPL+ + C+ + + L + V+CQ + DG
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGNSLSTSAAAAAA 60
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 114
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAV
Sbjct: 61 SPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAV 116
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
HV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNEAN
Sbjct: 117 HVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNEAN 176
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSLGC
Sbjct: 177 LTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSLGC 236
Query: 235 GNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
GNSRRNAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 237 GNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 276
>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 277
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 200/281 (71%), Positives = 221/281 (78%), Gaps = 11/281 (3%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQ-------FPGP 53
MAL + SPL+ + C+ + + L + V+CQ + DG
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGNSLSTSAAAAAA 60
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+KA
Sbjct: 61 ASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKKA 116
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
VHV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFTGADLSDTLMDRMVLNEA
Sbjct: 117 VHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLMDRMVLNEA 176
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
NLTNAVLVR+VLTRSDLGGAIIEGADFSDAVIDL QKQALCKYANGTNP+TGVSTRKSLG
Sbjct: 177 NLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNPLTGVSTRKSLG 236
Query: 234 CGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
CGNSRRNAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 237 CGNSRRNAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 277
>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 206
Score = 364 bits (934), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 178/207 (85%), Positives = 191/207 (92%), Gaps = 1/207 (0%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRRANFTS
Sbjct: 1 MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct: 60 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 119
Query: 188 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 247
SDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLGCGNSRRNAYGSPSS
Sbjct: 120 SDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSS 179
Query: 248 PLLSAPPQKLLDRDGFCDSGTGLCDAK 274
PLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 180 PLLSAPPQRLLGRDGFCDEKTGLCDVK 206
>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
Length = 280
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 204/284 (71%), Positives = 231/284 (81%), Gaps = 14/284 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPG----- 52
MA SS+SPL +KSL+ SSS PY H PL Q+SS++ S+ +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNSEIKDSSNAREG 57
Query: 53 --PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 110
++ W+ +S A+AAAV+AS SS++ A+A+LN++EA+TRGEFGIGSAAQ+GSADL
Sbjct: 58 CCSRSESNTWKRILSAAMAAAVIAS-SSSVPAMAELNRFEADTRGEFGIGSAAQYGSADL 116
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
K +H ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVL
Sbjct: 117 SKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVL 176
Query: 171 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 230
NEANLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYANGTNP+TGV TRK
Sbjct: 177 NEANLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYANGTNPLTGVDTRK 236
Query: 231 SLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
SLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCDAK
Sbjct: 237 SLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDAK 280
>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
Length = 280
Score = 363 bits (932), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 201/281 (71%), Positives = 226/281 (80%), Gaps = 8/281 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKT-----ESDGQFPG--P 53
MA SS+SPL +KSL+ SSS + + L Q+SS++ +S G
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSRSNLEIKDSSNTREGCCS 60
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K
Sbjct: 61 SAESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKT 119
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
VH ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEA
Sbjct: 120 VHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEA 179
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
NLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLG
Sbjct: 180 NLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLG 239
Query: 234 CGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
CGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 240 CGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
Flags: Precursor
gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 280
Score = 363 bits (932), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 201/281 (71%), Positives = 226/281 (80%), Gaps = 8/281 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKT-----ESDGQFPG--P 53
MA SS+SPL +KSL+ SSS + + L Q+SS++ +S G
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSRSNLEIKDSSNTREGCCS 60
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K
Sbjct: 61 SAESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKT 119
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
VH ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEA
Sbjct: 120 VHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEA 179
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
NLTNAVLVR+VLTRSDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLG
Sbjct: 180 NLTNAVLVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLG 239
Query: 234 CGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
CGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 240 CGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Brachypodium distachyon]
Length = 268
Score = 361 bits (926), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 198/275 (72%), Positives = 219/275 (79%), Gaps = 8/275 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPL-WVACQISSKTESDGQFPGPYAKLKN 59
MAL+S SPL+ + ++S+ LS+ L +CQ ++ G
Sbjct: 1 MALASTSPLAAATTARPTTSTP---STGCLSRRLPRFSCQATTDGARGGNVSSTSPTPPK 57
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 119
WRV VS ALAAA+V + + A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV EN
Sbjct: 58 WRVAVSAALAAAIVTA----MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNEN 113
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
FRRANFTSADMRESDFSGS FNGAY+EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV
Sbjct: 114 FRRANFTSADMRESDFSGSTFNGAYMEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 173
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 239
L RTVLTRSDLGGA IEGADFSDAV+DL QK ALCKYA+GTNP+TGVSTRKSLGCGNSRR
Sbjct: 174 LARTVLTRSDLGGATIEGADFSDAVLDLQQKLALCKYASGTNPVTGVSTRKSLGCGNSRR 233
Query: 240 NAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
NAYGSPSSPLLSAPP KLLDRDGFCD TG+CDAK
Sbjct: 234 NAYGSPSSPLLSAPPPKLLDRDGFCDEATGMCDAK 268
>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 355 bits (911), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 199/276 (72%), Positives = 217/276 (78%), Gaps = 11/276 (3%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPGPYAKLK 58
MAL+S SPL+ + K P L S+ L ++CQ ++ G
Sbjct: 1 MALASTSPLAATV-----ARPKAPASLTRCRSRRLQRISCQATTDRSGGGNASNTSPAPP 55
Query: 59 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 118
WRV VS ALAAAVV + + A ADLNKYEA+ RGEFGIGSAAQFG+ADL+ VHV E
Sbjct: 56 RWRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADLKNTVHVNE 111
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NFRRANFTSADMRESDFSGS FNGAY+EKAVA++ANFTGADLSDTLMDRMVLNEANLTNA
Sbjct: 112 NFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLMDRMVLNEANLTNA 171
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSR 238
VL RTVLTRSDLGGA IEGADFSDAVIDL QK ALCKYA+GTNPITGVSTRKSLGCGNSR
Sbjct: 172 VLSRTVLTRSDLGGATIEGADFSDAVIDLPQKLALCKYASGTNPITGVSTRKSLGCGNSR 231
Query: 239 RNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
RNAYGSPSSPLLSAPP KLLDRDGFCD +GLCDAK
Sbjct: 232 RNAYGSPSSPLLSAPPPKLLDRDGFCDEASGLCDAK 267
>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
Length = 293
Score = 340 bits (872), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 170/224 (75%), Positives = 181/224 (80%), Gaps = 29/224 (12%)
Query: 80 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
+ A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRRANFTSADMRESDFSGS
Sbjct: 70 VPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMRESDFSGST 129
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTRSDLGGA IEGAD
Sbjct: 130 FNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAKIEGAD 189
Query: 200 FSDAVIDLAQKQ-----------------------------ALCKYANGTNPITGVSTRK 230
FSDAVIDL QKQ ALCKYA GTNP+TGV TRK
Sbjct: 190 FSDAVIDLLQKQVTTTHHYIYPSFRSTIKKYFTNGFHNVLKALCKYATGTNPLTGVDTRK 249
Query: 231 SLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 274
SLGCGNSRRNAYGSPSSPLLSAPPQ+LL RDGFCD TGLCD K
Sbjct: 250 SLGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDVK 293
>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
Length = 196
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 156/192 (81%), Positives = 177/192 (92%)
Query: 80 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENFRRANFTSADMRE+DFSGS
Sbjct: 1 MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLTRSDLGGA IEGAD
Sbjct: 61 FNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLTRSDLGGAKIEGAD 120
Query: 200 FSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 259
FSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS+P+LSAPP++LLD
Sbjct: 121 FSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPSAPILSAPPERLLD 180
Query: 260 RDGFCDSGTGLC 271
+DGFCD TG C
Sbjct: 181 KDGFCDDATGKC 192
>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
Length = 219
Score = 334 bits (857), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 161/205 (78%), Positives = 184/205 (89%), Gaps = 4/205 (1%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RRANFT 126
LAA V+A+ ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENF RRANFT
Sbjct: 14 LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70
Query: 127 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 186
SADMRE+DFSGS FNG YLEKAVAY+ NF+GADLSDTLMDRMVLNEA+LTNA+LVR VLT
Sbjct: 71 SADMREADFSGSTFNGGYLEKAVAYRTNFSGADLSDTLMDRMVLNEADLTNALLVRAVLT 130
Query: 187 RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPS 246
RSDLGGA IEGADFSDAV+DLAQKQALCKYANG NP+TG+ TRKSLGCGN+RRNAYG+PS
Sbjct: 131 RSDLGGAKIEGADFSDAVLDLAQKQALCKYANGVNPVTGMDTRKSLGCGNARRNAYGTPS 190
Query: 247 SPLLSAPPQKLLDRDGFCDSGTGLC 271
+P+LSAPP++LLD+DGFCD TG C
Sbjct: 191 APILSAPPERLLDKDGFCDDATGKC 215
>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 225
Score = 319 bits (818), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 154/193 (79%), Positives = 171/193 (88%), Gaps = 2/193 (1%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
+LADLN EA TRGEFGIGSA QFGSADL+K H ENFRR NFTSADM+E++FS S FN
Sbjct: 28 SLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSNSTFN 87
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
GAYLEKAVAY+ NF+GADLSDTLMDRMVLNEANL+NA+LVR VLTRSDLG AIIEGADFS
Sbjct: 88 GAYLEKAVAYRTNFSGADLSDTLMDRMVLNEANLSNALLVRAVLTRSDLGSAIIEGADFS 147
Query: 202 DAVIDLAQKQ--ALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLD 259
DAV+DL QKQ ALCKYA+GTNP+TG+STRKSLGCGN+RRNAYGSPSSP LSAPP LLD
Sbjct: 148 DAVLDLTQKQAFALCKYASGTNPVTGMSTRKSLGCGNARRNAYGSPSSPELSAPPPILLD 207
Query: 260 RDGFCDSGTGLCD 272
++GFCD+ TG CD
Sbjct: 208 KNGFCDNSTGKCD 220
>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
At1g12250, chloroplastic-like [Glycine max]
Length = 222
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 166/243 (68%), Positives = 186/243 (76%), Gaps = 21/243 (8%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MAL+S SPLS+ SL+ S SS + + S P V CQ +S + +
Sbjct: 1 MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-----------RQG 47
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
V VS LAAA++A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 48 NV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRKAVHVNENF 105
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
R +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF G DLSDTL DRMVLNEANL+NA+L
Sbjct: 106 RXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANFPGVDLSDTLTDRMVLNEANLSNAIL 165
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRN 240
+RTVLTRSDLGGAIIEGADFSDAV+DL QK ALCKY +T VSTR SLGCGN RRN
Sbjct: 166 LRTVLTRSDLGGAIIEGADFSDAVLDLPQKHALCKY------VTRVSTRVSLGCGNKRRN 219
Query: 241 AYG 243
AYG
Sbjct: 220 AYG 222
>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
Length = 239
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 101/167 (60%), Positives = 120/167 (71%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
ALADLN YEA T GEFGIGSA Q+G AD++ ++ RR+NFTSAD R + F GS
Sbjct: 51 ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
GAY KAV Y+ NF A+LSD LMDR + EANL NA+L RTV TRSDL A+IEGADF+
Sbjct: 111 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVIEGADFT 170
Query: 202 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 248
+A++D Q ALCKYA+GTNP+TG TRKSLGCG RR PS+P
Sbjct: 171 NALLDKTQVMALCKYASGTNPVTGADTRKSLGCGGKRRYQASYPSNP 217
>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
Length = 214
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 100/167 (59%), Positives = 121/167 (72%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
A ADLN YEAE GEFGIGSA Q+G AD++ ++ RR+NFTSAD R ++F GS
Sbjct: 26 AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
GAY KAV Y+ NF A+LSD LMDR + EANL NAVL R V TRSDL A++EGADF+
Sbjct: 86 GAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLRNAVLQRAVFTRSDLKDAVVEGADFT 145
Query: 202 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 248
+A++D Q ALCKYA+G NP+TGVSTRKSLGCG+ RR PS+P
Sbjct: 146 NALLDKTQVMALCKYADGVNPVTGVSTRKSLGCGSQRRYKASYPSNP 192
>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
Length = 199
Score = 200 bits (509), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 115/168 (68%), Positives = 130/168 (77%), Gaps = 9/168 (5%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V CQI+S + + W
Sbjct: 2 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRDHRQEST-------KW 53
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 54 GKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRKAVHVNENF 112
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
RRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRM
Sbjct: 113 RRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRM 160
>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
Length = 217
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/185 (56%), Positives = 125/185 (67%), Gaps = 2/185 (1%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
A+ADLNKYEA GEFG G+A Q+G ADL+ E+ RR+NFT+AD R +F S
Sbjct: 29 AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
GAY K+V KANF A+LSD LMDR VLNEANL NA R VLTRSDLGGA I G DF+
Sbjct: 89 GAYFIKSVVPKANFENANLSDVLMDRAVLNEANLRNANFQRAVLTRSDLGGADINGTDFT 148
Query: 202 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 261
+A++D Q+ ALC+YA+GTN TGV TRKSLGCG+ RR SPS+P P +D+
Sbjct: 149 NALLDKTQQIALCRYADGTNTETGVETRKSLGCGSRRRFRESSPSNP--EGPQVADVDKK 206
Query: 262 GFCDS 266
F S
Sbjct: 207 AFVKS 211
>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
Length = 201
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/161 (65%), Positives = 118/161 (73%), Gaps = 11/161 (6%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MALSS+SPL I SK P L +LSKP V C+I + E++ + A+ K W
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRENNWRGEA-NAESKKW 50
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRKAVHV ENF
Sbjct: 51 QRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRKAVHVNENF 109
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
RRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T A S
Sbjct: 110 RRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTDAQSS 150
>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
Length = 259
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/167 (53%), Positives = 119/167 (71%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
A A+LNKYE GEF +G+A Q+G AD++ ++ +R+NFT+AD R+++F SK
Sbjct: 71 ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
AY K+V +AN ADLSD LMDR V+ +ANL AVL R +LTRSDL + I GADF+
Sbjct: 131 AAYFMKSVLARANLENADLSDALMDRAVIVDANLRGAVLQRAILTRSDLDRSDIYGADFT 190
Query: 202 DAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSP 248
+A++D Q+ ALCKYA+G NP+TGVSTRKSL CG+SRR SPS+P
Sbjct: 191 NALVDKTQQMALCKYADGVNPMTGVSTRKSLNCGSSRRFKASSPSNP 237
>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 277
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 104/198 (52%), Positives = 130/198 (65%), Gaps = 8/198 (4%)
Query: 78 SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFRRANFTSADMRESD 134
S+ +A A+LN EA GEF GSA QFG DLR V + + R +NFT A+MR +
Sbjct: 81 SSPAAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAK 140
Query: 135 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
G+ GAYL KAVA++A+F GA+LSD LMDR VLN AN +A+L R VLT SDLG A
Sbjct: 141 LRGANLTGAYLMKAVAFEADFEGANLSDALMDRAVLNSANFRDAILTRVVLTSSDLGDAK 200
Query: 195 IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPL---LS 251
I+GADFSDA+ID +Q+Q LC+YA+GTN +TGVSTR+SL CG R + SPS + S
Sbjct: 201 IDGADFSDALIDKSQQQKLCQYASGTNSVTGVSTRRSLNCGGGVRTS--SPSRYMTDETS 258
Query: 252 APPQKLLDRDGFCDSGTG 269
A P+ D F GTG
Sbjct: 259 AKPEAAFDASRFSAYGTG 276
>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 231
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/179 (56%), Positives = 116/179 (64%), Gaps = 6/179 (3%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF----RRANFTSADMRESDFSG 137
A+A+LN EA GEF GSA QFG DLR A +V E + R +NFT A+MR+S G
Sbjct: 39 AVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTDLRLSNFTGAEMRDSKLVG 97
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
+K NGAYL KAVA A+FT ADLSD LMDR V AN TNA+L R VLT SDL GA I
Sbjct: 98 AKLNGAYLMKAVAANADFTDADLSDALMDRGVFVNANFTNAILARVVLTSSDLNGANITN 157
Query: 198 ADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK 256
ADFSDA++D + LCK A GTNP TGV+TRKSL C R N GSPS + QK
Sbjct: 158 ADFSDALLDNTMQMKLCKIATGTNPTTGVNTRKSLNCTGGRGNV-GSPSRYMTEEDAQK 215
>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 147
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 99/158 (62%), Positives = 113/158 (71%), Gaps = 14/158 (8%)
Query: 1 MALSSISPLSIKSLNF-CSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKN 59
MAL+S +PLSI S + C SS +SK V C++S + P KN
Sbjct: 1 MALNSFTPLSINSHHVSCYPSSS------KVSKSSQVICKMSLNNDH------PQESNKN 48
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 119
W VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV EN
Sbjct: 49 WGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVNEN 107
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
FRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFTG
Sbjct: 108 FRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTG 145
>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
Length = 247
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/184 (52%), Positives = 116/184 (63%), Gaps = 6/184 (3%)
Query: 58 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 117
K V S ALA A S + A A+LN+ EA GEF GSA QFG DL K K
Sbjct: 34 KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90
Query: 118 E---NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
E + R +NFT ADMR + G+ GAY+ K VA + +FTGAD+SD LMDR VL AN
Sbjct: 91 EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRSVLVGAN 150
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
T+AVL R VLT SD+ AIIE ADF+DA++D +QALCK A+G NP TGV+TR SLGC
Sbjct: 151 FTDAVLNRVVLTSSDMKDAIIENADFTDALLDPKTQQALCKTASGKNPETGVATRVSLGC 210
Query: 235 GNSR 238
R
Sbjct: 211 SGGR 214
>gi|255087366|ref|XP_002505606.1| predicted protein [Micromonas sp. RCC299]
gi|226520876|gb|ACO66864.1| predicted protein [Micromonas sp. RCC299]
Length = 146
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 71/108 (65%), Positives = 85/108 (78%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
MR++ G+ GAYL KAVA+ A+F GA+LSD LMDR VLN AN +A++ R VLT SD
Sbjct: 1 MRKAKLRGANLTGAYLMKAVAFAADFEGANLSDALMDRAVLNNANFKDAIMTRVVLTSSD 60
Query: 190 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
LG A+IEGADFSDA+ID+ Q+QALCKYANG N +TGVSTRKSL CG S
Sbjct: 61 LGDAVIEGADFSDALIDVKQQQALCKYANGVNSVTGVSTRKSLNCGGS 108
>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 114
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/111 (61%), Positives = 84/111 (75%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NFT AD+R + G+ GAY+ K VA + +FTGAD+SD LMDR VL +AN TNA+L R
Sbjct: 4 NFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFTNAILNRV 63
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
VLT SDL GAI+E ADF+DA++D+ +QALCK A+G NP TGVSTR SLGC
Sbjct: 64 VLTSSDLEGAIVENADFTDALLDVKTQQALCKTASGKNPETGVSTRVSLGC 114
>gi|224125144|ref|XP_002329904.1| predicted protein [Populus trichocarpa]
gi|222871141|gb|EEF08272.1| predicted protein [Populus trichocarpa]
Length = 108
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/81 (77%), Positives = 68/81 (83%), Gaps = 4/81 (4%)
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 227
MV+NEANLTNAVLVR+ LTR DLGGA I GAD SD+VIDL QKQ YA+GTNP TGVS
Sbjct: 1 MVINEANLTNAVLVRSALTRCDLGGAQIAGADSSDSVIDLPQKQ----YASGTNPTTGVS 56
Query: 228 TRKSLGCGNSRRNAYGSPSSP 248
R SLGCGNSRRNAYG+PSSP
Sbjct: 57 NRASLGCGNSRRNAYGTPSSP 77
>gi|434390855|ref|YP_007125802.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262696|gb|AFZ28642.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 176
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 59/110 (53%), Positives = 77/110 (70%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MRE++F G+ A L K V +AN GA+L+ L+DR+ L+EANL NA+L +
Sbjct: 66 FVAAEMREANFQGADLTNAILTKGVLLRANLEGANLTGALVDRVTLDEANLKNAILQEAI 125
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS L A I GADF+DA+ID Q LC A+G NP+TGVSTR+SLGC
Sbjct: 126 LTRSRLFDADITGADFTDALIDRYQVSLLCDRADGVNPVTGVSTRESLGC 175
>gi|67921246|ref|ZP_00514765.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67857363|gb|EAM52603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 172
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 84/138 (60%), Gaps = 10/138 (7%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
+G F DL K V F +ADMRE++F GS + A +A KAN
Sbjct: 44 YGELQQQDFSHKDLEKGV----------FAAADMREANFEGSNLSYAIFTEATLLKANLK 93
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
GA+L+ +L+DR+ L+ A+LT+A+L+ + TR+ A+I GADF+DAVID Q +C+
Sbjct: 94 GANLTSSLLDRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCER 153
Query: 217 ANGTNPITGVSTRKSLGC 234
A G NP+TGVSTR SLGC
Sbjct: 154 AEGVNPVTGVSTRDSLGC 171
>gi|416382245|ref|ZP_11684306.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
0003]
gi|357265427|gb|EHJ14194.1| Pentapeptide repeat containing protein [Crocosphaera watsonii WH
0003]
Length = 171
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 84/138 (60%), Gaps = 10/138 (7%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
+G F DL K V F +ADMRE++F GS + A +A KAN
Sbjct: 43 YGELQQQDFSHKDLEKGV----------FAAADMREANFEGSNLSYAIFTEATLLKANLK 92
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
GA+L+ +L+DR+ L+ A+LT+A+L+ + TR+ A+I GADF+DAVID Q +C+
Sbjct: 93 GANLTSSLLDRVTLDFADLTDAILIDAIATRTRFYDAVITGADFTDAVIDRYQVSLMCER 152
Query: 217 ANGTNPITGVSTRKSLGC 234
A G NP+TGVSTR SLGC
Sbjct: 153 AEGVNPVTGVSTRDSLGC 170
>gi|254421873|ref|ZP_05035591.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196189362|gb|EDX84326.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 187
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 77/117 (65%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F +AD R+++F G+ +G L KA + N GAD + T DR++ + A+LTN
Sbjct: 69 KNLSGTSFAAADARDANFEGADMSGTILTKATFLRTNLKGADFTKTFADRVLFDGADLTN 128
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+ V + T S G II GADFSDA+ID Q + +CK A+G NP+TG+STR+SLGC
Sbjct: 129 AIFVEAIATSSSFGDTIITGADFSDAIIDRFQVKKMCKRADGINPVTGISTRESLGC 185
>gi|218247318|ref|YP_002372689.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167796|gb|ACK66533.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 172
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 83/130 (63%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DL KAV F +A+MRE++F GS + A L + V KAN A+L+ +L
Sbjct: 52 FSHRDLEKAV----------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDANLTGSL 101
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+DR+ L+ A+LTNA+LV + TR+ II GADF+DAVID Q +C+ A+G NP+T
Sbjct: 102 LDRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVT 161
Query: 225 GVSTRKSLGC 234
GV+TR SLGC
Sbjct: 162 GVATRDSLGC 171
>gi|434384986|ref|YP_007095597.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428015976|gb|AFY92070.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 165
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 83/144 (57%), Gaps = 15/144 (10%)
Query: 106 GSADLRKAVHVKENFRRAN---------------FTSADMRESDFSGSKFNGAYLEKAVA 150
G D+ AV + NF R N F S+++R + SG+ A L AV
Sbjct: 21 GINDVTLAVSSQTNFSRINLTDRDFGGQDLTGGVFVSSELRGVNMSGANLTNAMLTMAVL 80
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
K N +GA+L+ L DR +EA+LTNA+L LTRS GA I GADF+DA+ID AQ
Sbjct: 81 LKTNLSGANLTGALADRATFDEADLTNAILTEATLTRSRFYGAKITGADFTDALIDRAQA 140
Query: 211 QALCKYANGTNPITGVSTRKSLGC 234
+ LC A+G NP+TGVSTR SLGC
Sbjct: 141 KLLCDRADGINPVTGVSTRDSLGC 164
>gi|257061347|ref|YP_003139235.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591513|gb|ACV02400.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 172
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 82/130 (63%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DL KAV F +A+MRE++F GS + A L + V KAN +L+ +L
Sbjct: 52 FSHRDLEKAV----------FAAAEMRETNFEGSNLSYAILTEGVLLKANLKDVNLTGSL 101
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+DR+ L+ A+LTNA+LV + TR+ II GADF+DAVID Q +C+ A+G NP+T
Sbjct: 102 LDRVTLDFADLTNAILVDAIATRTRFYDTIITGADFTDAVIDRYQVALMCERADGVNPVT 161
Query: 225 GVSTRKSLGC 234
GV+TR SLGC
Sbjct: 162 GVATRDSLGC 171
>gi|354555882|ref|ZP_08975181.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|353552206|gb|EHC21603.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 182
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 84/134 (62%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ + +L++ +N + F +ADMRE++F GS + + + + AN GA+L
Sbjct: 48 NTVNYTYGELQQQDFSHKNLEKGVFAAADMREANFEGSNLSYSIFTEGILLGANLKGANL 107
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S +L+DR+ L+ A+LTNA+LV + TR+ A I GADF++AVID Q +C+ A G
Sbjct: 108 SSSLLDRVTLDFADLTNAILVDAIATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGV 167
Query: 221 NPITGVSTRKSLGC 234
NP+TGVSTR SLGC
Sbjct: 168 NPVTGVSTRDSLGC 181
>gi|172037118|ref|YP_001803619.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|171698572|gb|ACB51553.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
Length = 184
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 84/134 (62%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ + +L++ +N + F +ADMRE++F GS + + + + AN GA+L
Sbjct: 50 NTVNYTYGELQQQDFSHKNLEKGVFAAADMREANFEGSNLSYSIFTEGILLGANLKGANL 109
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S +L+DR+ L+ A+LTNA+LV + TR+ A I GADF++AVID Q +C+ A G
Sbjct: 110 SSSLLDRVTLDFADLTNAILVDAIATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGV 169
Query: 221 NPITGVSTRKSLGC 234
NP+TGVSTR SLGC
Sbjct: 170 NPVTGVSTRDSLGC 183
>gi|254412921|ref|ZP_05026693.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196180085|gb|EDX75077.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 180
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 76/117 (64%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F AD+R + F G+ G+ L KA ++A+ TGA+LS+TL DR+V + ANLTN
Sbjct: 63 QNLEGTSFAGADLRGASFRGASLQGSILTKAAFFEADLTGANLSETLADRVVFDGANLTN 122
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+ + +RS I GADFS A++D Q +C+ A+G NP+TGVSTR SLGC
Sbjct: 123 AIFTNAIASRSRFFDTTITGADFSGAILDTYQISLMCQRADGVNPVTGVSTRDSLGC 179
>gi|126658078|ref|ZP_01729230.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
gi|126620716|gb|EAZ91433.1| hypothetical protein CY0110_05667 [Cyanothece sp. CCY0110]
Length = 181
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 83/134 (61%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ + +L++ +N + F +ADMRE++F GS + + + + AN G DL
Sbjct: 47 NTVNYTYGELQQEDFSHKNLQGGVFAAADMREANFEGSNLSYSIFTEGILLGANLKGVDL 106
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S +L+DR+ L+ A+LTNA+LV + TR+ A I GADF++AVID Q +C+ A G
Sbjct: 107 SSSLLDRVTLDFADLTNAILVDAIATRTRFYDATITGADFTNAVIDRYQVSLMCERAEGV 166
Query: 221 NPITGVSTRKSLGC 234
NP+TGVSTR SLGC
Sbjct: 167 NPVTGVSTRDSLGC 180
>gi|75908890|ref|YP_323186.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75702615|gb|ABA22291.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 168
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 83/143 (58%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
+T F + + +A+L + NF +A+MR ++F G+ A L K V
Sbjct: 25 DTHPAFAQINTINYNNANLENRDFANADLVGVNFVAAEMRGTNFQGANLTNAILTKGVLL 84
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
KAN + A+L+ L+DR+ L+ ANL NA+ LTRS A I GADF+DA+ID Q
Sbjct: 85 KANLSEANLTGALVDRVTLDNANLKNAIFTEATLTRSRFYDADITGADFTDAIIDRYQVS 144
Query: 212 ALCKYANGTNPITGVSTRKSLGC 234
LC+ A+G NP+TGV+TR SLGC
Sbjct: 145 LLCERADGVNPVTGVATRDSLGC 167
>gi|332712340|ref|ZP_08432267.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332348814|gb|EGJ28427.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 169
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 76/116 (65%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N R F A+MR ++F G+ +G+ K KAN GA+L+D+L DR++L++ANLTNA
Sbjct: 53 NLVRGVFAGAEMRGTNFQGADLSGSIFTKGNLLKANLEGANLTDSLADRVILDQANLTNA 112
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L ++ + A I GADF+DA+ID Q + +C A G NP+TG+STR SLGC
Sbjct: 113 ILTDAIMNSTRFYDAEITGADFTDALIDRYQAKLMCGRATGVNPVTGISTRDSLGC 168
>gi|428316344|ref|YP_007114226.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240024|gb|AFZ05810.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 169
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 72/110 (65%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A L K V AN +GA+LS L DR+ + ANLTNA +
Sbjct: 59 FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFTEAI 118
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+TR+ A I GADFSDA+ID Q LC+ A+G NP+TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFSDAIIDAYQVSILCEKADGVNPVTGVSTRESLGC 168
>gi|434405844|ref|YP_007148729.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428260099|gb|AFZ26049.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 168
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 72/110 (65%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A L K V KAN GA+L+ L+DR+ L+ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLAGALVDRVTLDGANLKNAIFTEAT 117
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS A + GADF+DA+ID Q LCK A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDADVTGADFTDALIDRYQVALLCKSADGVNPVTGISTRDSLGC 167
>gi|427720966|ref|YP_007068960.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427353402|gb|AFY36126.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 168
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 81/141 (57%)
Query: 94 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
R F + + + + +L E+ A F +A+MR ++F G+ A L K V KA
Sbjct: 27 RPAFALTNVINYNNINLENRDFAHEDLTGATFVAAEMRGANFQGANLTNAVLTKGVLLKA 86
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
+ + A+L+ L+DR+ L+ ANL NA+ LTRS A I GADF+DA+ID Q +
Sbjct: 87 DLSDANLTGALVDRVTLDGANLKNAIFTEATLTRSRFYDAEITGADFTDALIDRYQVSLM 146
Query: 214 CKYANGTNPITGVSTRKSLGC 234
C A G NP+TGVSTR SLGC
Sbjct: 147 CDRAAGINPVTGVSTRDSLGC 167
>gi|295293762|gb|ADF88289.1| pentapeptide repeat-containing protein [Aphanizomenon sp. 10E6]
Length = 168
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A K V KAN A+L+ L+DR+ L+ ANL NA+ +
Sbjct: 58 FVAAEMRGTNFQGANLTNAIFTKGVLLKANLEAANLTGALVDRVTLDSANLRNAIFTKAT 117
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGVSTR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPVTGVSTRDSLGC 167
>gi|186685193|ref|YP_001868389.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186467645|gb|ACC83446.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 168
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 72/110 (65%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A L K V KAN GA+LS L+DR+ ++ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANLSGALVDRVTMDGANLKNAIFTEAT 117
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS A I GADF+DA+ID Q +C+ A+G NP+TG+STR SLGC
Sbjct: 118 LTRSRFFDAEITGADFTDALIDRYQVSLMCERADGVNPVTGMSTRDSLGC 167
>gi|428224803|ref|YP_007108900.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984704|gb|AFY65848.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 176
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 71/110 (64%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F SA+MR ++F G+ + A L K V AN GA+L+ L DR+ +ANL NA+LV
Sbjct: 67 FVSAEMRNANFEGANLSNAILTKGVLLNANLEGANLTGALADRVFWLDANLRNAILVDVT 126
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
TR+ G + GADFSDA++D + + LCK A G NP+TGV+TR SLGC
Sbjct: 127 ATRTSFEGVDVTGADFSDAILDRYELKELCKRAEGVNPVTGVATRDSLGC 176
>gi|428778133|ref|YP_007169920.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428692412|gb|AFZ45706.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 174
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 62/138 (44%), Positives = 82/138 (59%), Gaps = 10/138 (7%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
+ I S F + DL AV F +A+MR+++FSGS A K A+ +
Sbjct: 46 YTIVSERDFSNKDLVGAV----------FAAAEMRKTNFSGSNLENAMFTKGTLINADLS 95
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
+LS LMDR+ L+ A+L NAVL T LTRS L G IEGADF+DA+++ Q + LC+
Sbjct: 96 NTNLSGALMDRVSLDGADLRNAVLQGTFLTRSTLEGTKIEGADFTDAILNRYQVKLLCER 155
Query: 217 ANGTNPITGVSTRKSLGC 234
A G NP TGV+TR SLGC
Sbjct: 156 AEGVNPKTGVATRDSLGC 173
>gi|427728139|ref|YP_007074376.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364058|gb|AFY46779.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 168
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 71/110 (64%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A K V AN +GA+L+ L+DR L+ ANL NA+
Sbjct: 58 FVAAEMRGTNFQGANLTNAIFTKGVLLNANLSGANLTGALVDRATLDSANLKNAIFTEAT 117
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS A I GADF+DA+ID Q LC+ A+G NP+TGV+TR+SLGC
Sbjct: 118 LTRSRFYDADITGADFTDAIIDRYQVSLLCERADGINPVTGVATRESLGC 167
>gi|359460928|ref|ZP_09249491.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 172
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 94/180 (52%), Gaps = 13/180 (7%)
Query: 56 KLKNWRVFVSTALAAA-VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 114
++K W +S A +V SC + ++ + A F ADLR
Sbjct: 4 RIKPWLRTISVVFAVVWLVGSC------------FVLNSQPTWADDGAQNFTFADLRYED 51
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
+NF A+ A + +++ + + G L A ++N T ADL++T DR++ NEA+
Sbjct: 52 FENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLTETFADRVLFNEAD 111
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTNA+ +LT S A I GADFS A +D Q +C+YA+G NP+TGVSTR+SL C
Sbjct: 112 LTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVNPVTGVSTRESLEC 171
>gi|158337601|ref|YP_001518776.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307842|gb|ABW29459.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 172
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 79/133 (59%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A F ADLR +NF A+ A + +++ + + G L A ++N T ADL+
Sbjct: 39 AQNFTFADLRYEDFENKNFEGASLAGAILLKANLTNANLKGTILTMATFQRSNLTNADLT 98
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+T DR++ NEA+LTNA+ +LT S A I GADFS A +D Q +C+YA+G N
Sbjct: 99 ETFADRVLFNEADLTNAIFTDAMLTSSKFYDATITGADFSYAFLDRDQVTMMCEYADGVN 158
Query: 222 PITGVSTRKSLGC 234
P+TGVSTR+SL C
Sbjct: 159 PVTGVSTRESLEC 171
>gi|443314355|ref|ZP_21043921.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786047|gb|ELR95821.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 173
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 98/182 (53%), Gaps = 13/182 (7%)
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ-FGSADLRK 112
+ + WR + L A+ I+A A IG Q F +DL +
Sbjct: 3 WQRSGEWRQILRGGLLFAIAIVLWGGIAARA------------IAIGEITQDFTYSDLNR 50
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
EN A+ +AD RE++FSG+ + L K YKA GA+L+ + DR++ +
Sbjct: 51 QDFAGENLAGASLAAADAREANFSGADLSQTILTKGNFYKAKLVGANLTQSFADRVIFDG 110
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
A+L+NA++V ++T + G A I+GADFS ++D Q +C+YA+G NP+TGV+TR SL
Sbjct: 111 ADLSNALVVDAIMTSTSFGEATIQGADFSGTILDRYQVAQMCEYADGVNPVTGVATRDSL 170
Query: 233 GC 234
GC
Sbjct: 171 GC 172
>gi|334119379|ref|ZP_08493465.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333458167|gb|EGK86786.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 169
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A L K V AN +GA+LS L DR+ + ANLTNA +
Sbjct: 59 FVAAEMRGTNFQGADLTNAILTKGVLLNANLSGANLSGALADRVTFDGANLTNANFSEAI 118
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+TR+ A I GADF+DA+ID Q LC+ A+G NP TGVSTR+SLGC
Sbjct: 119 MTRTRFFDAAISGADFTDAIIDAYQVSILCEKADGVNPATGVSTRESLGC 168
>gi|428299988|ref|YP_007138294.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428236532|gb|AFZ02322.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 193
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 79/134 (58%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A + +A+L + NF +A+MR +F G+ A L K V KAN GA+L
Sbjct: 59 NAMNYNNANLENRDFSHADLVGINFVAAEMRGINFEGANLTNAMLTKGVMLKANLEGANL 118
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ L+DR+ L+ ANL NA LTRS L A I GADFS+A+ID Q + LC A+GT
Sbjct: 119 TAALVDRVALDGANLKNANFTDATLTRSRLFDADITGADFSNALIDTYQMKLLCDRASGT 178
Query: 221 NPITGVSTRKSLGC 234
NP+TGV TR SL C
Sbjct: 179 NPVTGVDTRDSLEC 192
>gi|440681954|ref|YP_007156749.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428679073|gb|AFZ57839.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 168
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 71/110 (64%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ + A L K V KAN A+L+ L+DR+ L+ ANL NA+
Sbjct: 58 FVAAEMRGANFQGANLSNAILTKGVLLKANLEDANLTGALVDRVTLDSANLKNAIFTEAT 117
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS A I GADF+DA+ID Q LC+ ANG N +TG+STR SLGC
Sbjct: 118 LTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNSVTGISTRDSLGC 167
>gi|16331228|ref|NP_441956.1| hypothetical protein sll0301 [Synechocystis sp. PCC 6803]
gi|383322971|ref|YP_005383824.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326140|ref|YP_005386993.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492024|ref|YP_005409700.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437292|ref|YP_005652016.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
gi|451815384|ref|YP_007451836.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
gi|1001404|dbj|BAA10026.1| sll0301 [Synechocystis sp. PCC 6803]
gi|339274324|dbj|BAK50811.1| hypothetical protein SYNGTS_2063 [Synechocystis sp. PCC 6803]
gi|359272290|dbj|BAL29809.1| hypothetical protein SYNGTI_2062 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275460|dbj|BAL32978.1| hypothetical protein SYNPCCN_2061 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278630|dbj|BAL36147.1| hypothetical protein SYNPCCP_2061 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781353|gb|AGF52322.1| hypothetical protein MYO_120830 [Synechocystis sp. PCC 6803]
Length = 169
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 80/126 (63%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL ++ ++ +A F +AD+RES+F GS + + L AV A+ GA+LS +L+DR+
Sbjct: 43 DLARSDFSHQDLNKAVFAAADLRESNFEGSDLSFSILTDAVFLHASLRGANLSGSLVDRV 102
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
L+ A+L + + + TR+ I GADFSDAVID Q + +C+ A G NP+TGV+T
Sbjct: 103 TLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERAEGVNPVTGVAT 162
Query: 229 RKSLGC 234
R SLGC
Sbjct: 163 RDSLGC 168
>gi|17227682|ref|NP_484230.1| hypothetical protein all0186 [Nostoc sp. PCC 7120]
gi|17135164|dbj|BAB77710.1| all0186 [Nostoc sp. PCC 7120]
Length = 168
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 71/112 (63%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
NF +A+MR ++F G+ A L K V KAN + A+L+ L+DR L+ ANL NA+
Sbjct: 56 VNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANLTGALVDRATLDNANLKNAIFTE 115
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTRS A I GADF+DA+ID Q LC+ ANG N +TG++TR SLGC
Sbjct: 116 ATLTRSRFYDADITGADFTDALIDRYQVSLLCERANGVNRVTGIATRDSLGC 167
>gi|407961395|dbj|BAM54635.1| hypothetical protein BEST7613_5704 [Synechocystis sp. PCC 6803]
Length = 147
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 80/126 (63%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL ++ ++ +A F +AD+RES+F GS + + L AV A+ GA+LS +L+DR+
Sbjct: 21 DLARSDFSHQDLNKAVFAAADLRESNFEGSDLSFSILTDAVFLHASLRGANLSGSLVDRV 80
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
L+ A+L + + + TR+ I GADFSDAVID Q + +C+ A G NP+TGV+T
Sbjct: 81 TLDFADLRDTIFTEAIATRTRFYDTDITGADFSDAVIDAYQVKLMCERAEGVNPVTGVAT 140
Query: 229 RKSLGC 234
R SLGC
Sbjct: 141 RDSLGC 146
>gi|428779391|ref|YP_007171177.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428693670|gb|AFZ49820.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 171
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 82/138 (59%), Gaps = 10/138 (7%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
+ + S F + DL AV F +A+MR ++FSGS A K A+ +
Sbjct: 43 YTVVSERDFSNKDLVGAV----------FAAAEMRRTNFSGSNLENAMFTKGTLINADLS 92
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
+LS LMDR+ L+ A+L+NAVL T LTRS L G I GADF+DA+++ Q + LC+
Sbjct: 93 NTNLSGALMDRVNLDGADLSNAVLNGTFLTRSTLEGTKITGADFTDAILNRYQVKLLCEK 152
Query: 217 ANGTNPITGVSTRKSLGC 234
A G NP TGVSTR+SLGC
Sbjct: 153 AEGVNPKTGVSTRESLGC 170
>gi|170077406|ref|YP_001734044.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169885075|gb|ACA98788.1| Pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 169
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 77/120 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
EN + A+F AD+R SDF+GS + A L + +AN T A+LS+ MD++ + ANLT
Sbjct: 50 HENLQAASFARADVRGSDFTGSDLSRAILTEGKFMEANLTEANLSEAFMDQVNMEGANLT 109
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 236
NA+ V V ++ AII+GADFS A++D Q LCK A+GTN ITG+ TR SL C N
Sbjct: 110 NALFVDAVAPGTNFAEAIIDGADFSGALLDRYQLSELCKRASGTNTITGIDTRYSLNCKN 169
>gi|428308896|ref|YP_007119873.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250508|gb|AFZ16467.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 176
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 77/134 (57%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+ + S DL +N A F +A+MR ++F S A L K V AN A+L
Sbjct: 42 SSINYSSTDLTNRDFSHKNLVGAVFVAAEMRGTNFQESDLTNAILTKGVMLGANLQDANL 101
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ L+DR+ L+ ANL NA+ + RS A I GADF+DA+ID Q LC+ A+G
Sbjct: 102 TGALVDRVTLDNANLKNAIFQEATMIRSRFYDADITGADFTDAIIDRYQVSLLCEKASGV 161
Query: 221 NPITGVSTRKSLGC 234
NPITGV+TR SLGC
Sbjct: 162 NPITGVATRDSLGC 175
>gi|255083653|ref|XP_002508401.1| predicted protein [Micromonas sp. RCC299]
gi|226523678|gb|ACO69659.1| predicted protein [Micromonas sp. RCC299]
Length = 187
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 75/133 (56%), Gaps = 5/133 (3%)
Query: 112 KAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
KA H+ E+F + +T D+R SDFSGS A +AV N GAD+S++ +D
Sbjct: 30 KAEHINEDFSHEDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAVMPGVNLEGADMSNSFLD 89
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
+VL +N+ + RSDLG + ADF++AVID Q LC A+GTNP TGV
Sbjct: 90 YVVLRGSNMRGVIAREANFVRSDLGDCDVTDADFTEAVIDRYQAIGLCDSASGTNPFTGV 149
Query: 227 STRKSLGCGNSRR 239
TR SLGC +R
Sbjct: 150 DTRDSLGCERLKR 162
>gi|411119939|ref|ZP_11392315.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410710095|gb|EKQ67606.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 169
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 81/135 (60%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G +G+A+L ++ F SA+MR ++FSG+ A K AN +GA+
Sbjct: 34 GKFLNYGNANLTNQDFSNQDLGGGVFVSAEMRGTNFSGAILTNAMFTKGNLLGANLSGAN 93
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L L+DR L +A+L+NA+L+ L+ S L A ++GADF++A++D LCK A G
Sbjct: 94 LEGALLDRTTLYKADLSNAILIDATLSNSILDEATVDGADFTNAIVDRYAVSQLCKRAQG 153
Query: 220 TNPITGVSTRKSLGC 234
TNP TG+STR+SLGC
Sbjct: 154 TNPTTGISTRESLGC 168
>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
Length = 203
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 76/117 (64%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+ +F R++DFSG+ +G+ L +A +++F+GADLSD LMDR + +L+
Sbjct: 86 QQLANTSFAGVMARDADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSG 145
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L + S GA+I+ ADFSDA++D + ++ALC+ A GTNP TGVSTR SL C
Sbjct: 146 ALLRGVIAAGSSFSGAVIDDADFSDALLDRSDQRALCRRAQGTNPTTGVSTRLSLDC 202
>gi|434407744|ref|YP_007150629.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428261999|gb|AFZ27949.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 162
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 85/130 (65%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + ++ + A F++A++ ++F+G+ GA L +V KAN GADL++ +
Sbjct: 32 FSNAELGRQDFSGQSLQAAEFSNANLELTNFTGADLRGAVLSASVMTKANLHGADLTNAM 91
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L A+L++AV + +L R+ IEGADF+DA++D AQ + LC+ A+G N T
Sbjct: 92 VDQVNLTRADLSDAVFIEALLLRAIFTDVNIEGADFTDAILDRAQVKELCEKASGVNSQT 151
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 152 GVQTRDSLGC 161
>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 166
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 92/179 (51%), Gaps = 27/179 (15%)
Query: 64 VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 115
++T L A +V C + ALA KY AE +G+ F LR A
Sbjct: 6 LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
N R NFT AD+R + FS S V AN GADLS+ ++D++ A+L
Sbjct: 57 SNANLERTNFTDADLRGTIFSAS----------VMTHANLHGADLSNAMIDQVSFTNADL 106
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
++AVL +++ RS I GADFSDA++D AQ + LC A G N TGVSTR SLGC
Sbjct: 107 SDAVLTESIMLRSTFDNVDITGADFSDAILDGAQIKELCTKATGVNSQTGVSTRDSLGC 165
>gi|443313318|ref|ZP_21042930.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442776723|gb|ELR87004.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 182
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 77/131 (58%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +A+L+ N F +A+MR ++F G+ A + K V AN GA+LS
Sbjct: 51 NYNNANLQNRDFSHTNLIGGVFVAAEMRGANFQGADLTNAIMTKGVLLGANLEGANLSGA 110
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
L+DR+ L+ ANL NA+ LTRS A I GADFS+A+ID Q LC A GTNP+
Sbjct: 111 LVDRVTLDNANLKNAIFTDATLTRSRFFDADITGADFSNALIDRYQINLLCDRATGTNPV 170
Query: 224 TGVSTRKSLGC 234
TG++T +SLGC
Sbjct: 171 TGITTTESLGC 181
>gi|443328655|ref|ZP_21057250.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791786|gb|ELS01278.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 222
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 81/134 (60%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
++ + ++LR +F F +AD+R S+F GS + + L KA+ N +G DL
Sbjct: 88 NSVNYTYSELRNEDLSHRDFSGGVFAAADVRGSNFEGSDLSNSILTKAIFTDTNLSGVDL 147
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+++ MDR+ L+ +NL+NA+L + T ++ I GADFS A+ID Q LC+ A G
Sbjct: 148 TNSFMDRVDLSNSNLSNAILQDIIATSTNFYNTDITGADFSGAIIDRYQTYVLCQRAAGV 207
Query: 221 NPITGVSTRKSLGC 234
NP+TGVSTR SLGC
Sbjct: 208 NPVTGVSTRYSLGC 221
>gi|440684176|ref|YP_007158971.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428681295|gb|AFZ60061.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 162
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 86/130 (66%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + ++ + A F++A++ ++F+G+ GA L +V +AN GADL++ +
Sbjct: 33 FSNAELGRQDFSGQSLQAAEFSNANLEMANFTGADLRGAVLSASVMTQANLHGADLTNAM 92
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ LN A+L++A+L+ +L RS I GADF+DA++D AQ + LC+ A+G N T
Sbjct: 93 IDQVKLNGADLSDAILLEALLLRSIFTDVNIAGADFTDAILDKAQIKELCQKASGVNSRT 152
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 153 GVETRDSLGC 162
>gi|427706655|ref|YP_007049032.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359160|gb|AFY41882.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 168
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 79/141 (56%)
Query: 94 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
R F + + +A+L + F +A+MR ++F + A K V KA
Sbjct: 27 RPAFAQINTINYSNANLENRDFANADLAGVTFVAAEMRGTNFQAANLTNAIFTKGVLLKA 86
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
N GA+L+ L+DR+ L+ ANL NA LTRS A I GADF+DA+ID Q L
Sbjct: 87 NLEGANLTGALVDRVTLDGANLKNANFTEATLTRSRFYDADITGADFTDALIDRYQISLL 146
Query: 214 CKYANGTNPITGVSTRKSLGC 234
C+ A+G NP+TGV+TR+SLGC
Sbjct: 147 CERADGVNPVTGVATRESLGC 167
>gi|298489879|ref|YP_003720056.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231797|gb|ADI62933.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 163
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 85/130 (65%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + ++ + A F++A++ ++F+G+ G +V KAN GA+L++ +
Sbjct: 33 FSNAELGRQDFSGQSLQAAEFSNANLEMANFAGADLRGTVFSASVMTKANLHGANLTNAM 92
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ + LN A+L++A+L+ +L RS IEGADFSDA++D +Q Q LCK A+G N T
Sbjct: 93 VNEVKLNGADLSDAILLEALLLRSIFTDVNIEGADFSDAILDRSQIQELCKKASGVNSQT 152
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 153 GVETRESLGC 162
>gi|443322626|ref|ZP_21051645.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442787675|gb|ELR97389.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 164
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 76/130 (58%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+ S +L+ ++ A F AD+R ++F + + L + V AN T A+L+D L
Sbjct: 33 YTSTELQNRDFSGQDLEGAVFADADLRGANFQAANLANSILTQGVFLNANLTKANLTDAL 92
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
DR EANLT+A+LV + +RS AII GADFS A++D Q LC A GTNP+T
Sbjct: 93 ADRATFAEANLTDAILVNIIASRSSFVDAIITGADFSGAILDKYQVALLCDRAQGTNPVT 152
Query: 225 GVSTRKSLGC 234
GVSTR SL C
Sbjct: 153 GVSTRASLNC 162
>gi|17230233|ref|NP_486781.1| hypothetical protein alr2741 [Nostoc sp. PCC 7120]
gi|17131834|dbj|BAB74440.1| alr2741 [Nostoc sp. PCC 7120]
Length = 182
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 82/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + E+ + A F++A++ ++F G+ GA L +V +AN GADL++ +
Sbjct: 52 FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L ANL++ VL +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 172 GVETRDSLGC 181
>gi|75910505|ref|YP_324801.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704230|gb|ABA23906.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 182
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 82/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + E+ + A F++A++ ++F G+ GA L +V +AN GADL++ +
Sbjct: 52 FSNAELSRHNFAGESLQAAEFSNANLEMTNFVGADLRGAVLSASVMTQANLQGADLTNAM 111
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L ANL++ VL +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 112 VDQVNLTGANLSDVVLKEALLLRAIFANVNIEGADFTDAILDKAQIKELCTKASGVNTKT 171
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 172 GVKTRDSLGC 181
>gi|425438309|ref|ZP_18818714.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
gi|425452591|ref|ZP_18832408.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
gi|440756403|ref|ZP_20935604.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|443646807|ref|ZP_21129485.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159025958|emb|CAO87888.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|389676535|emb|CCH94452.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9432]
gi|389765527|emb|CCI08587.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 7941]
gi|440173625|gb|ELP53083.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|443335636|gb|ELS50100.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 166
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + GS + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGSDLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|425469693|ref|ZP_18848608.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
gi|389880432|emb|CCI38813.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9701]
Length = 166
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|428203864|ref|YP_007082453.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981296|gb|AFY78896.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 170
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 72/110 (65%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +ADMR +F S + L + V AN GA+L+++LMDR+ L+ A+LTNA+ V +
Sbjct: 60 FAAADMRGINFEDSDLSNTILTEGVLLGANLKGANLTNSLMDRVTLDFADLTNAIFVDAI 119
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
TR+ I GADFS AV+D Q + LC A+G NP+TG+STR+SLGC
Sbjct: 120 ATRTRFYDTTITGADFSGAVLDRYQVKLLCDRADGVNPVTGISTRESLGC 169
>gi|425465439|ref|ZP_18844748.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389832325|emb|CCI24153.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 166
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 77/118 (65%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR ++ G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGANLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|422303610|ref|ZP_16390961.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
gi|389791366|emb|CCI12792.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9806]
Length = 166
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|86609913|ref|YP_478675.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558455|gb|ABD03412.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 173
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 83/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +ADL+ +++R ++F SA+++ +D G+ GA KA AN +GADLS++L
Sbjct: 43 FNNADLQGQDLSGQDWRGSSFVSANLQGADLHGANLAGAAFTKANLAGANLSGADLSNSL 102
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D L A+L A L + R+ GA I GADFS+A +D A K+ LC+ A G++PIT
Sbjct: 103 LDLANLAGADLRGAKLTGAIAARAVWQGAQIAGADFSEAYVDRAAKRQLCERAEGSHPIT 162
Query: 225 GVSTRKSLGC 234
GV+TR+SLGC
Sbjct: 163 GVTTRESLGC 172
>gi|300868096|ref|ZP_07112733.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300333934|emb|CBN57911.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 174
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 68/110 (61%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A L K V AN + A+LS L DR+ + ANLTNA +
Sbjct: 64 FVAAEMRNTNFEGADLTNAILTKGVLLNANLSNANLSGALADRVTFDGANLTNANFTEAI 123
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LTR+ I GADF+DA+ID Q LC+ A G N +TGVSTR+SLGC
Sbjct: 124 LTRTRFYDTAISGADFTDAIIDSYQVNLLCEKAEGVNSVTGVSTRESLGC 173
>gi|414075538|ref|YP_006994856.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413968954|gb|AFW93043.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 168
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 74/131 (56%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +A+L N F +A+MR ++F + A K V KAN A+L+
Sbjct: 37 NYNNANLENRDFSHTNLVGGTFVAAEMRGTNFQDANLTNAIFTKGVLLKANLESANLTGA 96
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
L+DR+ + ANL NA+ LTRS A I GADF+DA+ID Q LC+ A+G NP+
Sbjct: 97 LVDRVTFDSANLRNAIFAEATLTRSRFYDADITGADFTDALIDRYQVSLLCQRADGVNPV 156
Query: 224 TGVSTRKSLGC 234
TG+STR SLGC
Sbjct: 157 TGISTRDSLGC 167
>gi|166365075|ref|YP_001657348.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
gi|166087448|dbj|BAG02156.1| hypothetical protein MAE_23340 [Microcystis aeruginosa NIES-843]
Length = 166
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|425439807|ref|ZP_18820122.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
gi|425456970|ref|ZP_18836676.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
gi|389719892|emb|CCH96344.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9717]
gi|389801790|emb|CCI19079.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9807]
Length = 166
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|425446471|ref|ZP_18826474.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
gi|389733275|emb|CCI02926.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9443]
Length = 166
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + TRS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|427716094|ref|YP_007064088.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348530|gb|AFY31254.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 163
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 83/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L++ E + A F++A++ +++F+G+ GA L +V + N GADL+D L
Sbjct: 33 FSNAELKRHDFSGETLQGAEFSNANLEQANFAGADLRGAVLSASVMTQTNLHGADLTDAL 92
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L +A+L++AVL +L R+ I ADF+DAV+D AQ + LC A+G N T
Sbjct: 93 VDQVNLTKADLSDAVLKEALLLRAIFTDVNINSADFTDAVLDRAQIKELCGKASGVNSKT 152
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 153 GVQTRDSLGC 162
>gi|390440134|ref|ZP_10228485.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
gi|389836418|emb|CCI32609.1| Similar to Pentapeptide repeat [Microcystis sp. T1-4]
Length = 166
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 76/118 (64%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R F +A MR + G+ + + L +AV KAN GADL+ +L+DR+ L+ A+LT
Sbjct: 48 HQDLRGGVFAAAAMRGVNLEGADLSYSILTEAVLLKANLKGADLTASLVDRVTLDFADLT 107
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N + + +RS II GADF++AVID Q + +C+ A+G NP+TGV+TR SLGC
Sbjct: 108 NTIFTDAIASRSRFYDTIITGADFTNAVIDAYQVKLMCERADGINPVTGVATRDSLGC 165
>gi|428316951|ref|YP_007114833.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240631|gb|AFZ06417.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 165
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 82/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + + R A F++A+M ++FS + GA + +V +AN GA+L++ +
Sbjct: 35 FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLRGAIMSASVMTQANLHGANLTNAM 94
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ A+L++A+L T+L RS G I GADF+DA++D +Q + LC A G N T
Sbjct: 95 IDQVKFTNADLSDAILAETILLRSTFDGVDITGADFTDAIMDGSQVKELCTKATGINSQT 154
Query: 225 GVSTRKSLGC 234
G+STR SLGC
Sbjct: 155 GISTRDSLGC 164
>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
Length = 167
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 72/111 (64%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+F A R +DFSG+ +GA + +A+F+ ADLSD+LMDR + NLTNA+L
Sbjct: 57 SFAGAVGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGV 116
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ + S GA IEGADFSDA++D LC+ A G NPITG++TR SLGC
Sbjct: 117 IASGSSFAGASIEGADFSDALLDRDDVVRLCRDAEGVNPITGMATRDSLGC 167
>gi|318040416|ref|ZP_07972372.1| hypothetical protein SCB01_01865 [Synechococcus sp. CB0101]
Length = 174
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+ +F A + ++F+G+ +GA + +A+F+GADLSD LMDR ++ NL N
Sbjct: 58 QQLVNTSFAGAVGKGANFAGANLHGAIFTQGAFPEADFSGADLSDVLMDRTDMSHTNLRN 117
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVLV + + GA + GADFSDA+ID A ++ LC A+GTNP TG TR SLGC
Sbjct: 118 AVLVGVIAAGASFSGADVTGADFSDALIDRADQRQLCAKASGTNPSTGADTRASLGC 174
>gi|434400337|ref|YP_007134341.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271434|gb|AFZ37375.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 169
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 82/134 (61%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA + +++R ++ A F +AD R ++F GS + + L KAV AN +L
Sbjct: 35 SAVNYTYSEIRDQDFSHKDLAGAVFAAADARGANFEGSDLSNSILTKAVFSNANLAEINL 94
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ +LMDR+ L+ +NLTNA++ V T ++ GA I GADFSD+++D Q LCK A G
Sbjct: 95 TKSLMDRVALDNSNLTNAIIREAVATSTNFDGATITGADFSDSILDRYQIYLLCKRAEGV 154
Query: 221 NPITGVSTRKSLGC 234
NP TGVSTR SLGC
Sbjct: 155 NPTTGVSTRDSLGC 168
>gi|428206519|ref|YP_007090872.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428008440|gb|AFY87003.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 192
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 84/130 (64%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+ +A+L ++ R A F++A+M + +F+ + GA + +V +AN GADLS +
Sbjct: 62 YSNAELTGKDFSRQILRAAEFSNANMEQVNFTDADLRGAIMSASVMTQANLHGADLSIAM 121
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ + A+L++AVL +L R+ G I GADFSDA++D AQ + LC+ A+G N T
Sbjct: 122 VDQVKMTGADLSDAVLQEALLLRTIFTGVDITGADFSDAILDGAQVKELCQRASGINSKT 181
Query: 225 GVSTRKSLGC 234
G++TR+SLGC
Sbjct: 182 GIATRESLGC 191
>gi|425462969|ref|ZP_18842432.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
gi|389823905|emb|CCI27601.1| Similar to Pentapeptide repeat [Microcystis aeruginosa PCC 9808]
Length = 166
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 75/130 (57%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DLR V R AN AD+ S L +AV KAN GADL+ +L
Sbjct: 46 FSHQDLRGGVFAAAAMRGANLEEADLSYS----------ILTEAVLLKANLKGADLTASL 95
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+DR+ L+ A+LTN + + TRS II GADF++AVID Q + +C+ A+G NP+T
Sbjct: 96 VDRVTLDFADLTNTIFTDAIATRSRFYDTIITGADFTNAVIDNYQVKLMCERADGINPVT 155
Query: 225 GVSTRKSLGC 234
GV+TR SLGC
Sbjct: 156 GVATRDSLGC 165
>gi|434390929|ref|YP_007125876.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428262770|gb|AFZ28716.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 163
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 80/130 (61%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L+ + R + F++A+M +++F+ + GA +V KAN GA+L++ +
Sbjct: 33 FSNAELKGRDFSGQMLRASEFSNANMEQTNFTDADLRGAIFSASVMTKANLHGANLTNAM 92
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
D++ A+L+ AVL T+L RS I ADFSDA++D Q + LC+ A+G NP T
Sbjct: 93 ADQVNFTNADLSAAVLAETILLRSVFDNTDITAADFSDAILDGVQIKELCQRASGVNPTT 152
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 153 GVDTRESLGC 162
>gi|428209239|ref|YP_007093592.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428011160|gb|AFY89723.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 165
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N RA F + + E++FS + GA AV KAN G D S + L+ A+L++
Sbjct: 48 QNLVRAEFNNTKLAEANFSSADLRGAVFNSAVLRKANLHGVDFSYGIAYLSDLSAADLSD 107
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L ++ RS+ GA + GADFS+AV+D Q LC+YA+G NP+TGV TR+SLGC
Sbjct: 108 AILTSAMMLRSNFKGAKVTGADFSEAVLDREQVVQLCEYASGVNPVTGVDTRESLGC 164
>gi|334116781|ref|ZP_08490873.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461601|gb|EGK90206.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 165
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 82/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + + R A F++A+M ++FS + GA + +V +AN GA+L++ +
Sbjct: 35 FSNAELTRRDFSGQMLRAAEFSNANMDLTNFSNADLQGAIMSASVMTQANLHGANLTNAM 94
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ A+L++A+L T+L RS G I GADF+DA++D +Q + LC A+G N T
Sbjct: 95 IDQVKFTNADLSDAILAETILLRSTFEGVDITGADFTDAIMDGSQIKELCTKASGINSQT 154
Query: 225 GVSTRKSLGC 234
G+ TR SLGC
Sbjct: 155 GIYTRDSLGC 164
>gi|186684198|ref|YP_001867394.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466650|gb|ACC82451.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 174
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 85/130 (65%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + ++ + A F++A+M ++FS + GA + +V KAN GADL++ +
Sbjct: 44 FSNAELSRRDFSGDSLQAAEFSNANMELANFSNADLRGAVMSASVMTKANLHGADLTNAM 103
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L +A+L++A+ +L R+ I+GADF+DA++D AQ + LC+ A+G N T
Sbjct: 104 VDQVNLTKADLSDAIFKEALLLRAIFNDVNIDGADFTDAILDRAQIKELCRKASGVNSKT 163
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 164 GVQTRESLGC 173
>gi|307153777|ref|YP_003889161.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984005|gb|ADN15886.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 173
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 75/118 (63%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++N R A F +ADMR + F S + A L + + AN GA+L+ TL+DR+ L+ A+L
Sbjct: 54 EKNLRGAVFAAADMRGASFENSDLSYAILTEGILLNANLKGANLTGTLLDRVTLDFADLR 113
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+A+L + TR+ + I GADF+ AVID Q +C+ A+G N ITGVSTR SLGC
Sbjct: 114 DAILTDAIATRTRFYDSDITGADFTGAVIDTYQISLMCERADGVNSITGVSTRDSLGC 171
>gi|86605651|ref|YP_474414.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554193|gb|ABC99151.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 165
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 79/130 (60%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +ADL+ +++R ++F SA+++ +D G+ G KA AN GADLS++L
Sbjct: 35 FSNADLQGQDLSGQDWRGSSFVSANLQGADLQGANLAGVAFTKANLAGANLAGADLSNSL 94
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D L A+L A L + R+ GA I GADFSDA +D A + LC+ A G++PIT
Sbjct: 95 LDLANLAGADLRGANLRGAIAARAVWDGAQIAGADFSDAYVDRAALRQLCQRAEGSHPIT 154
Query: 225 GVSTRKSLGC 234
GVSTR SLGC
Sbjct: 155 GVSTRASLGC 164
>gi|354568879|ref|ZP_08988040.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353539391|gb|EHC08878.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 172
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
+DLR ++ +F A+M+ ++F G+ +G L K +A+ + A+L++ DR
Sbjct: 45 SDLRYRDFSHQDLHGTSFAGAEMQGANFQGANLSGTILTKGSFLQADLSNANLAEAFADR 104
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 227
++ N+ANLTNA+ +L S A I GADFS A++D Q + +C A+G NP+TGVS
Sbjct: 105 VIFNKANLTNAIFRDAMLASSRFFEAEITGADFSGAIVDPYQVKLMCDRADGINPVTGVS 164
Query: 228 TRKSLGC 234
TR+SLGC
Sbjct: 165 TRESLGC 171
>gi|427722287|ref|YP_007069564.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354007|gb|AFY36730.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 175
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 73/120 (60%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + A+F AD+R SDFSGS + A L + N +GADL++ MD++ L+ ANLT
Sbjct: 56 HQQLQAASFARADVRSSDFSGSDLSRAILSEGKFMDTNLSGADLTEAFMDQVNLSGANLT 115
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN 236
NA+ V ++ A I GADFS A++D Q LCK A+GTN ITG+ TR SL C N
Sbjct: 116 NAIFTDAVAPGTNFTDANIAGADFSGALLDRYQLSQLCKRASGTNAITGIETRYSLNCEN 175
>gi|414079521|ref|YP_007000945.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413972800|gb|AFW96888.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 162
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 83/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+ +A+L + ++ + A F++A++ ++F+G+ G +V KAN GADL++ +
Sbjct: 32 YSNAELSRQDFSGQSLQAAEFSNANLEMANFTGADLRGTVFSASVMTKANLHGADLTNAM 91
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ + L A+L+NAVL+ +L R+ I GADF+DA++D AQ + LC+ A+G N T
Sbjct: 92 VNEVKLAGADLSNAVLIEALLLRTVFTDVNITGADFTDAILDKAQIKELCQKASGVNSQT 151
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 152 GVETRESLGC 161
>gi|427729477|ref|YP_007075714.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427365396|gb|AFY48117.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 170
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 83/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + ++ + A F++A++ +DF+G+ GA L +V +AN ADL++ +
Sbjct: 41 FSNAELARHDFAGDSLQAAEFSNANLEMTDFTGADLRGAVLSASVMTQANLHKADLTNAM 100
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L A+L++AV +L R+ IEGADF+DA++D AQ + LC A+G N T
Sbjct: 101 VDQVNLTGADLSDAVFKEALLLRAIFNDVNIEGADFTDALLDKAQIKELCTKASGVNSQT 160
Query: 225 GVSTRKSLGC 234
GV+TR SLGC
Sbjct: 161 GVATRDSLGC 170
>gi|428777417|ref|YP_007169204.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691696|gb|AFZ44990.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 165
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 77/133 (57%), Gaps = 5/133 (3%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A Q + D ++ F N +AD +++ G+ FNGA L + AN+ G + S
Sbjct: 37 AVQAETQDFSGQTLIEAEFYDENLEAADFHDANLEGAVFNGATL-----HNANWRGVNFS 91
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+ + +LTNAVL ++ RS GAI+EGADF++AV+D Q + LC+ A+G N
Sbjct: 92 NGIAYLTDFTGVDLTNAVLTEAMMLRSKFEGAIVEGADFTNAVVDRLQVKKLCERASGVN 151
Query: 222 PITGVSTRKSLGC 234
P TGVSTR+SLGC
Sbjct: 152 PTTGVSTRESLGC 164
>gi|33862602|ref|NP_894162.1| hypothetical protein PMT0329 [Prochlorococcus marinus str. MIT
9313]
gi|33634518|emb|CAE20504.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 179
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 77/123 (62%), Gaps = 1/123 (0%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
K H ++ ++F A R +DFS S +GA L + ++NF+GADLSD LMDR+
Sbjct: 58 KDFHAQD-LSNSSFAGAVARAADFSNSNLHGAILTQGTFTQSNFSGADLSDALMDRVDFV 116
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
+ +L N VL + + S GA I+GADFSDA++DL ++ LC A+G N ITG++T +S
Sbjct: 117 DTDLRNCVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFES 176
Query: 232 LGC 234
L C
Sbjct: 177 LNC 179
>gi|428224653|ref|YP_007108750.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984554|gb|AFY65698.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 187
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 70/110 (63%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F++A++ ++F G+ G +V AN GA+L++ LMD+ L A+L A+L +
Sbjct: 77 FSNANLERANFEGADVRGGVFSASVLTDANLQGANLTNALMDQANLTRADLRGAILSEAI 136
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L S I GADFSDA++D AQ +ALC+ A G NP+TG+STR+SLGC
Sbjct: 137 LLGSTFAETAIAGADFSDAILDGAQIKALCQRAEGVNPVTGLSTRESLGC 186
>gi|428770661|ref|YP_007162451.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684940|gb|AFZ54407.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 165
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 76/126 (60%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL + +N + F++ + ++FS S GA A +ANF GADL++ +
Sbjct: 39 DLSQQDFSSQNLQSMEFSNVKLNGANFSNSDLRGAVFNAARLEEANFHGADLTNGFIYVT 98
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
LN A+LT+A+L ++ R+ L GA ++GADF+ AV+D Q LCK A G NP+TG ST
Sbjct: 99 SLNRADLTDAILREAIMKRTTLKGANVDGADFTFAVLDNEQVIELCKNAQGINPVTGAST 158
Query: 229 RKSLGC 234
R+SLGC
Sbjct: 159 RQSLGC 164
>gi|124023686|ref|YP_001017993.1| hypothetical protein P9303_19861 [Prochlorococcus marinus str. MIT
9303]
gi|123963972|gb|ABM78728.1| Uncharacterized low-complexity proteins [Prochlorococcus marinus
str. MIT 9303]
Length = 179
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 76/123 (61%), Gaps = 1/123 (0%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
K H ++ +F A R +DFS S GA L + ++NF+GADLSD LMDR+
Sbjct: 58 KDFHAQD-LSNTSFAGAVARAADFSNSNLRGAILTQGTFTQSNFSGADLSDALMDRVDFV 116
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
+ +L N+VL + + S GA I+GADFSDA++DL ++ LC A+G N ITG++T +S
Sbjct: 117 DTDLRNSVLKGVIASGSSFAGAQIDGADFSDALLDLDDQRRLCLDADGINQITGIATFES 176
Query: 232 LGC 234
L C
Sbjct: 177 LNC 179
>gi|56750202|ref|YP_170903.1| hypothetical protein syc0193_c [Synechococcus elongatus PCC 6301]
gi|81300170|ref|YP_400378.1| hypothetical protein Synpcc7942_1361 [Synechococcus elongatus PCC
7942]
gi|56685161|dbj|BAD78383.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169051|gb|ABB57391.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 167
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 69/110 (62%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F S +MR+++ + A L V ANF GADLS L+DR+ L A+LT+A+LV
Sbjct: 57 FVSTEMRKANLEEANLRNAILTLGVFLDANFHGADLSGALLDRVFLVGADLTDALLVDVT 116
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
TR+ I GADF+DA+ID +++ LC A+G NP TGV+TR SLGC
Sbjct: 117 ATRTSFQDVKITGADFTDAIIDRYEQKQLCLRADGVNPKTGVATRDSLGC 166
>gi|427708609|ref|YP_007050986.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427361114|gb|AFY43836.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 189
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 82/130 (63%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + + + A F++A+M ++F+G+ GA L +V KAN ADL++ +
Sbjct: 59 FSNAELARRDFSGQTLQAAEFSNANMEMANFTGADLRGAVLSASVMTKANLHQADLTNAM 118
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L A+L++AV +L R+ I+GADF+DAV+D AQ + LC A+G N T
Sbjct: 119 VDQVNLTGADLSDAVFKEALLLRALFTDVNIQGADFTDAVLDKAQIKELCSKASGVNSKT 178
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 179 GVETRESLGC 188
>gi|282902031|ref|ZP_06309929.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281193118|gb|EFA68117.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 162
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 81/130 (62%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + +N + A F++A++ ++F+ + GA +V +AN GADL++ +
Sbjct: 32 FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L +A+L++A+ + +L RS+ I+GADFS A++D Q + LCK A G N T
Sbjct: 92 LDQVKLTDADLSDAIFIEAILLRSNFAKTNIDGADFSKAILDRGQIRDLCKSARGINSRT 151
Query: 225 GVSTRKSLGC 234
V TR SLGC
Sbjct: 152 HVQTRDSLGC 161
>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 171
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ +F A R ++FSG+ +GA + +A+F+GADLSD LMDR NL +
Sbjct: 55 QHLANTSFAGAVGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNLRD 114
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL + + S A I GADFSDA++DL ++ LC+ A+G NP+TGV+T SLGC
Sbjct: 115 AVLTGIIASGSSFSDAQIAGADFSDALLDLDDQRRLCRDADGVNPVTGVATLDSLGC 171
>gi|443312247|ref|ZP_21041866.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442777717|gb|ELR87991.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 162
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 80/130 (61%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L+ + R A F++A+M ++FS + GA +V A+ GADLS+ +
Sbjct: 32 FSNAELKSRDFSGQTLRAAEFSNANMELANFSNADLRGAVFSASVMTGASLHGADLSNAM 91
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L +A+L++AVL +L R+ I ADF+DA++D AQ + LC A+G NP T
Sbjct: 92 VDQVNLTKADLSDAVLTEALLLRAIFDDVSIVNADFTDAILDRAQIKELCAKASGVNPKT 151
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 152 GVETRYSLGC 161
>gi|78212794|ref|YP_381573.1| hypothetical protein Syncc9605_1263 [Synechococcus sp. CC9605]
gi|78197253|gb|ABB35018.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 169
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 87/149 (58%), Gaps = 21/149 (14%)
Query: 92 ETRGEFGIGS-AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-- 148
E RG+F + +A DL++ +K + R N + D+R + + S+ GA L A
Sbjct: 34 ELRGQFAVQEISADMHGLDLKEKEFLKADLREVNLSGTDLRGAVINTSQLQGADLRDANL 93
Query: 149 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
V + ++F GADL AN TNA+++++ T A I+GADF++AVI
Sbjct: 94 SDVVGFASHFEGADLRG----------ANFTNAMMMQSRFT-----DAQIDGADFTNAVI 138
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
DL Q++ALC A+G+NPI+GVSTR+SLGC
Sbjct: 139 DLPQQRALCARADGSNPISGVSTRESLGC 167
>gi|411116478|ref|ZP_11388965.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410712581|gb|EKQ70082.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 165
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 72/117 (61%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N R+ F +D++ ++F+G+ GA A AN G D SD + A+L++
Sbjct: 48 QNLVRSEFGDSDLQGANFAGADLRGAVFNGAKLTNANLHGVDFSDGIAYITDFANADLSD 107
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L +L +S GA I GADFSDA ID AQ ALC+ A+GTNP+TGV TR+SLGC
Sbjct: 108 AILNSAMLLKSSFKGANITGADFSDAAIDRAQVLALCQTASGTNPVTGVDTRESLGC 164
>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 175
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+ +F A + ++FSG+ +GA L + ANF GADLSD L+DR ++ +L N
Sbjct: 59 QQLANTSFAGAVGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRN 118
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVLV + + S GA +E ADF+DA++D A ++ C A+GTNP TG +TR SLGC
Sbjct: 119 AVLVGVIASGSTFTGAQVENADFTDALLDRADQRNFCISASGTNPTTGANTRASLGC 175
>gi|428306100|ref|YP_007142925.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247635|gb|AFZ13415.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 174
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 79/130 (60%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F + DL AV F +A M+ ++F GS + A L + ANF A+L++ L
Sbjct: 54 FSNTDLTGAV----------FAAAQMKGANFQGSNLSNAILSQGTLSNANFADANLTNAL 103
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L+ A+LTNA+ + + ++ + I GADF+DA+ID Q + LC+ A+G NP+T
Sbjct: 104 VDQVTLDGADLTNAIFRQATMVGTNFNDSAIAGADFTDAIIDRYQLKQLCQRASGVNPVT 163
Query: 225 GVSTRKSLGC 234
VSTR+SLGC
Sbjct: 164 AVSTRESLGC 173
>gi|282900610|ref|ZP_06308552.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281194410|gb|EFA69365.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 167
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 72/118 (61%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + R ++FT A++R+SDFSGS G A ANFTGADL++ +D ANLT
Sbjct: 49 QRDLRDSSFTKANLRQSDFSGSNLTGVSFFAANLESANFTGADLTNATLDSARFIGANLT 108
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA+L + + GAII GADF+D ++ ++ LC+ ANG NP TG TR++L C
Sbjct: 109 NAILEGSFAASAKFDGAIIAGADFTDVLLRRDEQNKLCQVANGINPTTGRHTRETLFC 166
>gi|427735661|ref|YP_007055205.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370702|gb|AFY54658.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 168
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 76/138 (55%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
F + + S +L ++ NF SA+MR ++F G+ A K AN
Sbjct: 30 FAQTNTINYSSTNLENRDFSNQDLTAVNFISAEMRGTNFQGADLTNAMFTKGNLLGANLE 89
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
GA+ ++ L+D++ L+ ANL NA + ++RS A I GADF+DA+ID Q + +C
Sbjct: 90 GANFTNALVDQVTLDNANLKNANFTQATMSRSRFFDADITGADFTDAIIDRYQVKLMCDR 149
Query: 217 ANGTNPITGVSTRKSLGC 234
A+G NP TGV TR SLGC
Sbjct: 150 ASGVNPETGVETRYSLGC 167
>gi|428222027|ref|YP_007106197.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995367|gb|AFY74062.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 161
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 74/117 (63%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+ R + F +A+M ++F + GA ++ AN A+ + ++D++ A+LT+
Sbjct: 44 QELRGSGFANANMENANFERADLRGAVFSASILRNANLRAANFTTGMLDQIDFANADLTD 103
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+LV T+L RS A I+GADF+DA++D AQ + LC A GTNP TGVSTR+SLGC
Sbjct: 104 AILVDTLLLRSTFDFAKIDGADFTDALLDGAQIKWLCSKAKGTNPFTGVSTRESLGC 160
>gi|412992118|emb|CCO19831.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 293
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 79/144 (54%), Gaps = 10/144 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F + DLR + V+ R NF+ +DMR GA + +++ A+ G+D+
Sbjct: 145 SNEDFSNLDLRGTIWVEAELRNTNFSKSDMR----------GAVMTRSIMPNADVHGSDV 194
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S+ L D ++L AN +AV V RSD+G I+ ADF++AVID Q LC+ A G
Sbjct: 195 SNVLFDYVLLRGANFEDAVAVGANFIRSDMGEMKIKNADFTEAVIDRYQVLGLCETAEGV 254
Query: 221 NPITGVSTRKSLGCGNSRRNAYGS 244
NP TGV TR SLGC + + GS
Sbjct: 255 NPYTGVDTRMSLGCDSFVKKYEGS 278
>gi|87303664|ref|ZP_01086439.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
gi|87281769|gb|EAQ73734.1| hypothetical protein WH5701_12843 [Synechococcus sp. WH 5701]
Length = 153
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 59/146 (40%), Positives = 81/146 (55%), Gaps = 8/146 (5%)
Query: 97 FGIGSAAQFGSADLRKAVHVKE--------NFRRANFTSADMRESDFSGSKFNGAYLEKA 148
G+GSAA + +LR +++ + R+ F A M D SGS GA +
Sbjct: 7 MGVGSAAAITAPELRGQRALQDLQPDMHGRDLRQQEFLKASMGGFDLSGSDLRGAVFNSS 66
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
N + A+L D + + A+L+ AVL +L +S GA IEGADFSDAV+DL+
Sbjct: 67 DLTNTNLSAANLEDAVAFATRFDGADLSGAVLRNAMLMQSRFTGAQIEGADFSDAVLDLS 126
Query: 209 QKQALCKYANGTNPITGVSTRKSLGC 234
Q +ALC A+G NP TGVST +SLGC
Sbjct: 127 QVKALCSRADGVNPSTGVSTVESLGC 152
>gi|116072323|ref|ZP_01469590.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
gi|116064845|gb|EAU70604.1| hypothetical protein BL107_11066 [Synechococcus sp. BL107]
Length = 186
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F A R +DF + +GA L + +A+F GADLSD LMDR ++L +
Sbjct: 70 QNLANTSFAGATGRGADFRDAILHGAILTQGAFAEADFRGADLSDALMDRADFVASDLRD 129
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL+ + + S A+IEGADF+DA++D ++ LC+ A+G NP TGVST SLGC
Sbjct: 130 AVLIGVIASGSSFSKALIEGADFTDALLDRDDQRRLCRDADGINPTTGVSTFDSLGC 186
>gi|260435516|ref|ZP_05789486.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
gi|260413390|gb|EEX06686.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
Length = 163
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 87/149 (58%), Gaps = 21/149 (14%)
Query: 92 ETRGEFGIGS-AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-- 148
E RG+F + +A DL++ +K + R N + D+R + + S+ GA L A
Sbjct: 28 ELRGQFAVQEISADMHGLDLKEKEFLKADLREVNLSGTDLRGAVINTSQLQGADLRDADL 87
Query: 149 ---VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
V + ++F GADL AN TNA+++++ T A I+GADF++AVI
Sbjct: 88 SDVVGFASHFEGADL----------RGANFTNAMMMQSRFT-----DAQIDGADFTNAVI 132
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
DL Q++ALC A+G+NPI+GVSTR+SLGC
Sbjct: 133 DLPQQRALCVRADGSNPISGVSTRESLGC 161
>gi|87125517|ref|ZP_01081362.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
gi|86166817|gb|EAQ68079.1| hypothetical protein RS9917_02051 [Synechococcus sp. RS9917]
Length = 180
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 71/117 (60%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R +F A R +DFS + +GA + A+F GADLSD LMDR + +L
Sbjct: 64 QDLRNTSFAGAVGRGADFSDANLHGAIFTQGAFANADFHGADLSDALMDRADFSGTDLRG 123
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L + + S GA IEGADFSDA++D + LC+ A G++P TGVSTR+SLGC
Sbjct: 124 TLLSGVIASGSSFAGAQIEGADFSDALLDRDDVRRLCRDAEGSHPHTGVSTRESLGC 180
>gi|428302010|ref|YP_007140316.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238554|gb|AFZ04344.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 162
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 80/130 (61%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A L + ++ + A F++A+M +DF G+ GA + + KAN GA+L++ L
Sbjct: 32 FSNAQLARQDFSGQSLQAAEFSNANMELADFRGADLRGAVMSASTMTKANLHGANLANAL 91
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L A+L++AVL +L R+ I GADF+DA++D AQ + LC A+G N T
Sbjct: 92 VDQVNLTGADLSDAVLQEALLLRAIFTDVKINGADFTDAILDGAQIRELCNIASGVNSQT 151
Query: 225 GVSTRKSLGC 234
GV TR SLGC
Sbjct: 152 GVETRYSLGC 161
>gi|411119374|ref|ZP_11391754.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711237|gb|EKQ68744.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 182
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 76/130 (58%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F LR A N R NFT+AD+R GA + + + GADL+ +
Sbjct: 63 FSGQILRVAEFSNANLNRVNFTNADLR----------GAVMSASTMVDTSLHGADLTQAM 112
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ + +L++A+L T+L R+ +EGADF+DA++D AQ +ALC++A+G N T
Sbjct: 113 LDQVKMIRTDLSDAILANTILLRTTFENINLEGADFTDAILDGAQVKALCQFASGANSKT 172
Query: 225 GVSTRKSLGC 234
GVSTR SLGC
Sbjct: 173 GVSTRDSLGC 182
>gi|427420479|ref|ZP_18910662.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425756356|gb|EKU97210.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 169
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 75/122 (61%), Gaps = 5/122 (4%)
Query: 118 ENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
NF AN +A++R ++F G+ + L KA + + TGA+LS+T DR+
Sbjct: 46 RNFENANLAGTSLAAAEVRNANFRGADLSATILTKAKFIRTDLTGANLSETFADRVEFTG 105
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
++LTNAV+ ++T S A I GADFS ++D Q + LC+ A+G NP+TGVSTR+SL
Sbjct: 106 SDLTNAVVTDALMTSSTFADATITGADFSYTILDRFQVKYLCERADGMNPVTGVSTRESL 165
Query: 233 GC 234
GC
Sbjct: 166 GC 167
>gi|78185103|ref|YP_377538.1| hypothetical protein Syncc9902_1536 [Synechococcus sp. CC9902]
gi|78169397|gb|ABB26494.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 182
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F A R +DF + +GA L + +A+F GADLSD LMDR +L +
Sbjct: 66 QNLANTSFAGATGRGADFRDANLHGAILTQGAFAEADFRGADLSDALMDRADFVATDLRD 125
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL+ + + S A+IEGADF+DA++D ++ LC+ A+G NP TG+ST SLGC
Sbjct: 126 AVLIGVIASGSSFSKALIEGADFTDALLDRDDQRLLCRDADGINPTTGISTFDSLGC 182
>gi|298492040|ref|YP_003722217.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298233958|gb|ADI65094.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 167
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/132 (43%), Positives = 76/132 (57%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F DLR + K N R++NFT A++R G F A LE A N GADL++
Sbjct: 45 ADFSRRDLRDSSFTKANLRQSNFTGANLR-----GVSFFAANLESA-----NLEGADLTN 94
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D L ANLTNAVL + GAI++GADF+DA++ +++ LC A GTNP
Sbjct: 95 ATLDSARLIRANLTNAVLEGAFAASAKFDGAIVDGADFTDALLRQDEQKKLCNLAKGTNP 154
Query: 223 ITGVSTRKSLGC 234
ITG TR++L C
Sbjct: 155 ITGRDTRETLFC 166
>gi|428772631|ref|YP_007164419.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428686910|gb|AFZ46770.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 166
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 73/130 (56%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F S L+ +N + A FT ++ ++ F+ S GA A ANF+G D+SD L
Sbjct: 36 FESKSLKGEDFTNQNLQLAEFTKVNLEDAKFNDSDLRGAVFNGVNAEGANFSGVDMSDGL 95
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+ N +L+NA+ ++ R+ A +EGADF+ AV+D Q LCK A+G NP+T
Sbjct: 96 VYVTSFNNTDLSNAIFRDAIMLRTTFKNANVEGADFTFAVLDSEQVNQLCKNASGVNPVT 155
Query: 225 GVSTRKSLGC 234
STR+SLGC
Sbjct: 156 NASTRQSLGC 165
>gi|148240085|ref|YP_001225472.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147848624|emb|CAK24175.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
Length = 174
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/110 (45%), Positives = 66/110 (60%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F A + +DFSG+ GA + ANF GADLSD LMDR +L +AVL+ +
Sbjct: 65 FAGAAGKGADFSGANLQGAIFTQGAFADANFHGADLSDALMDRADFTGTDLRDAVLIGVI 124
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ S GA ++GADFSDA++D ++ LC+ A G NP TGV TR SL C
Sbjct: 125 ASGSSFAGAQVDGADFSDALLDRDDQRRLCQEAEGVNPTTGVLTRDSLSC 174
>gi|119509637|ref|ZP_01628783.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
gi|119465656|gb|EAW46547.1| hypothetical protein N9414_21581 [Nodularia spumigena CCY9414]
Length = 221
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 66/110 (60%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+MR ++F G+ A L K V AN A+L L+DR+ ++ ANL NA+
Sbjct: 111 FVAAEMRGANFQGANLKNAILTKGVLLNANLENANLEGALVDRVTMDGANLKNAIFTEAT 170
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+TRS A I GADF+DA+ID Q +C A G N +TGV+TR SLGC
Sbjct: 171 MTRSRFFDADITGADFTDALIDRYQVALMCDRAAGINSVTGVATRDSLGC 220
>gi|317969830|ref|ZP_07971220.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 178
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 78/135 (57%), Gaps = 9/135 (6%)
Query: 104 QFGSADLRKAVH----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
Q + DL+ +H ++ F +A+ D+ E+D G+ FN A L+ A N + AD
Sbjct: 48 QRSAQDLQPDMHGRNLQQQEFLKASMEGFDLSETDLRGAVFNTANLQNA-----NLSAAD 102
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L D + + A+L+ AV +L S GA+IEG DF+DAV+DL Q++ALC A+G
Sbjct: 103 LEDAVAFATRFDNADLSGAVFRNAMLMNSKFTGAVIEGTDFTDAVLDLPQQKALCARASG 162
Query: 220 TNPITGVSTRKSLGC 234
NP TGV TR+SL C
Sbjct: 163 VNPRTGVDTRESLAC 177
>gi|220905675|ref|YP_002480986.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219862286|gb|ACL42625.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 162
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 68/110 (61%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A M E++F G+ A L KA +ANF GA+L+D L D + ++L+NA+L
Sbjct: 52 FAAAVMPEANFEGANLRNAILSKAELSQANFRGANLTDVLADGVSWANSDLSNAILAGAT 111
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L + G I GADFSDA+ID LC+ A G NP+TG++TR+SLGC
Sbjct: 112 LIGTTFTGVTITGADFSDALIDRYDVSLLCQRAEGINPVTGIATRESLGC 161
>gi|282895655|ref|ZP_06303780.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281199349|gb|EFA74214.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 171
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 70/118 (59%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + R ++FT A++R+SDFSGS G A ANFTGADL++ +D ANLT
Sbjct: 53 QRDLRDSSFTKANLRQSDFSGSNLTGVSFFAANLESANFTGADLTNATLDSARFIGANLT 112
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA+L + GAII GADF+D ++ ++ LC+ A G NP TG TRK+L C
Sbjct: 113 NAILEGAFAASAKFDGAIITGADFTDVLLRRDEQNKLCQLAKGINPTTGRHTRKTLFC 170
>gi|308814214|ref|XP_003084412.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
gi|116056297|emb|CAL56680.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
Length = 186
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 76/140 (54%), Gaps = 10/140 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A S DL A++ + + R AN ++ D R GA +A+ D S+
Sbjct: 34 ADLASNDLTGAIYAESDLRNANISNTDAR----------GAVFSRAIMPGVKLNATDASN 83
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+ D VL A++ + V R+D+G A+IEGADFS+AVID + LC+ A+GTNP
Sbjct: 84 AMFDYAVLRGADMRDGVFANANFVRADMGEAMIEGADFSEAVIDRYEAIRLCERASGTNP 143
Query: 223 ITGVSTRKSLGCGNSRRNAY 242
TG+ TR +LGC +SR + Y
Sbjct: 144 WTGIETRATLGCDDSRVSKY 163
>gi|16332305|ref|NP_443033.1| hypothetical protein sll0577 [Synechocystis sp. PCC 6803]
gi|383324046|ref|YP_005384900.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383327215|ref|YP_005388069.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383493099|ref|YP_005410776.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384438367|ref|YP_005653092.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
gi|451816456|ref|YP_007452908.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
gi|1653935|dbj|BAA18845.1| sll0577 [Synechocystis sp. PCC 6803]
gi|339275400|dbj|BAK51887.1| hypothetical protein SYNGTS_3139 [Synechocystis sp. PCC 6803]
gi|359273366|dbj|BAL30885.1| hypothetical protein SYNGTI_3138 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276536|dbj|BAL34054.1| hypothetical protein SYNPCCN_3137 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279706|dbj|BAL37223.1| hypothetical protein SYNPCCP_3137 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407960039|dbj|BAM53279.1| hypothetical protein BEST7613_4348 [Synechocystis sp. PCC 6803]
gi|451782425|gb|AGF53394.1| hypothetical protein MYO_131750 [Synechocystis sp. PCC 6803]
Length = 169
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 77/140 (55%), Gaps = 10/140 (7%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKAN 154
G A+ F + L + ++ A FT+ D+ S D GS FNGA L A N
Sbjct: 34 GGASAFENMVLAETDFRDQDLLTAQFTNVDLTSSIFEAMDLRGSVFNGANLTDA-----N 88
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
G DL++ L N ANL NA+L ++ R+ A I+GADFS AV+D Q ALC
Sbjct: 89 LKGVDLTNGLTYLTSFNGANLENAILAEAIMLRTSFKNAKIQGADFSLAVLDTEQIAALC 148
Query: 215 KYANGTNPITGVSTRKSLGC 234
K A+G NP TG+STR+SLGC
Sbjct: 149 KVADGVNPKTGISTRESLGC 168
>gi|218439896|ref|YP_002378225.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172624|gb|ACK71357.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 170
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 78/131 (59%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ A+L +N A F +A+MR + S + + L +AV AN GA+L+ +
Sbjct: 38 NYTYAELADQDFSNKNLYGAVFAAANMRGASLENSDLSYSILTEAVLLNANLKGANLTGS 97
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
L+DR+ L+ A+LTNA+ + +R+ I GADFS A++D Q +C+ A+G NP+
Sbjct: 98 LVDRVTLDFADLTNAIFTDAIASRTRFYDTTITGADFSGAILDQYQVYLMCERASGVNPV 157
Query: 224 TGVSTRKSLGC 234
TGVSTR+SLGC
Sbjct: 158 TGVSTRESLGC 168
>gi|428227020|ref|YP_007111117.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986921|gb|AFY68065.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 166
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 74/120 (61%)
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
+ ++ +A F++A+++ +DFSG+ GA + AN G D SD + ++AN
Sbjct: 46 YAGQSLLQAEFSNANLKNADFSGADLRGAVFNGSTLVHANLRGVDFSDGIAYISDFSDAN 105
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L++AVL +L +S GA + GADF+DAV+D AQ LCK A+G N ITG TR+SLGC
Sbjct: 106 LSDAVLSSAMLLKSRFTGADVTGADFTDAVLDRAQVLQLCKTASGVNSITGADTRESLGC 165
>gi|318041364|ref|ZP_07973320.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 170
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 77/130 (59%), Gaps = 9/130 (6%)
Query: 109 DLRKAVH----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
DL+ +H ++ F +AN D ESD G+ FN A L+ A N ADL D +
Sbjct: 45 DLQPDMHGRNLQQQEFLKANLEGFDFSESDLRGAVFNTANLQGA-----NLHAADLEDAV 99
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+ A+L++AVL +L S G++I+GADF+DAV+DL Q++ALC+ A GTN T
Sbjct: 100 AFASRFDNADLSDAVLRNAMLMNSKFAGSVIDGADFTDAVLDLPQQKALCERAGGTNART 159
Query: 225 GVSTRKSLGC 234
GV+TR SL C
Sbjct: 160 GVNTRDSLNC 169
>gi|282897737|ref|ZP_06305736.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281197416|gb|EFA72313.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 162
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 80/130 (61%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L + +N + A F++A++ ++F+ + GA +V +AN GADL++ +
Sbjct: 32 FSNAELGRHNFSGQNLQAAEFSNANLEMANFANADLRGAVFSASVMTQANLHGADLTNAM 91
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D++ L A+L++A+ + +L RS A I+GADF++A++D Q LCK A G N T
Sbjct: 92 LDQVKLTGADLSDAIFLEAILLRSIFTEANIDGADFTEAILDRGQVGELCKSARGVNSQT 151
Query: 225 GVSTRKSLGC 234
V TR SLGC
Sbjct: 152 HVQTRDSLGC 161
>gi|443478408|ref|ZP_21068166.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443016315|gb|ELS31005.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 150
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 76/130 (58%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A L+ N + F +A+M ++F + GA ++ KAN G D S L
Sbjct: 20 FSHAQLKNRDFSGRNLVGSGFANANMEGANFENADVRGAVFSASILRKANLKGTDFSGGL 79
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D+ +A+L+NA+LV T+L RS I+GADF+DA++D AQ++ LC A GTN T
Sbjct: 80 LDQADFAKADLSNALLVETILLRSTFDFVNIDGADFTDAIMDGAQRKWLCSKAKGTNAKT 139
Query: 225 GVSTRKSLGC 234
G++TR+SL C
Sbjct: 140 GINTRESLEC 149
>gi|260435480|ref|ZP_05789450.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
gi|260413354|gb|EEX06650.1| secreted pentapeptide repeats protein [Synechococcus sp. WH 8109]
Length = 173
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 70/117 (59%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F A R ++F G+ +GA L + +A+F GADLSD LMDR +L N
Sbjct: 57 QNLANTSFAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVATDLRN 116
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL + + S A IEGADF+DA++D ++ LC A+G NP TGVST SLGC
Sbjct: 117 AVLTGIIASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVSTFDSLGC 173
>gi|254415547|ref|ZP_05029307.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196177728|gb|EDX72732.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 165
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 68/112 (60%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+F AD+RES+FS ++ G A ANF GA+LS + +D+ LN ANL NAVL
Sbjct: 53 ASFNQADLRESNFSHAELQGVSFFGANLKLANFEGANLSYSTLDKARLNGANLKNAVLEG 112
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ GA IEGADF+DA +D ++ LC+ A GTNP TG TR +L C
Sbjct: 113 AYAFNAQFDGATIEGADFTDAFLDPKAEEKLCQMATGTNPTTGRQTRDTLFC 164
>gi|88808683|ref|ZP_01124193.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
gi|88787671|gb|EAR18828.1| hypothetical protein WH7805_03297 [Synechococcus sp. WH 7805]
Length = 176
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 98/190 (51%), Gaps = 29/190 (15%)
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA----ETRGEFGIGS-AAQFGSAD 109
A L N R ++TAL AA+V L D EA E RG+ + + D
Sbjct: 5 ALLCNLRRHLTTALLAALVVFTG----VLIDGPSVEAITAPELRGQRAVQDITSDMHGRD 60
Query: 110 LRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
L++ +K + R + AD+R S G+ GA LE VA+ + F GADL +
Sbjct: 61 LKEKEFLKADLREVDLGEADLRGAVINTSQLQGADLRGADLEDVVAFSSRFDGADLRN-- 118
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
AN TNA+L++ S A IEG DF++AVIDL+Q +ALC A+G N ++
Sbjct: 119 --------ANFTNAMLMQ-----SRFNDAEIEGTDFTNAVIDLSQLKALCGRASGVNSLS 165
Query: 225 GVSTRKSLGC 234
GVST++SLGC
Sbjct: 166 GVSTKESLGC 175
>gi|443321745|ref|ZP_21050787.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442788515|gb|ELR98206.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 149
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ A+F A +RES+FS + G A NF GA+L++ +D LN+ANL N
Sbjct: 32 QDLTDASFDLASLRESNFSHANLTGVRFFSANLESVNFEGANLTNATLDSARLNDANLKN 91
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L+ ++ + + G IEGADF+DA+I +++ LCK A GTNP+TG TR++L C
Sbjct: 92 AILIGAFVSNAKVQGVNIEGADFTDALILPYEQKLLCKVAQGTNPVTGRDTRETLFC 148
>gi|88809155|ref|ZP_01124664.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
gi|88787097|gb|EAR18255.1| hypothetical protein WH7805_05666 [Synechococcus sp. WH 7805]
Length = 180
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 66/111 (59%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+F A + +DFSG+ GA + ANF GADLSD LMDR +L +AVL+
Sbjct: 70 SFAGAAAKGADFSGANLQGAIFTQGAFADANFRGADLSDALMDRADFTGTDLRDAVLIGV 129
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ + S A ++GADFSDA++D ++ LC+ A G NP TGV TR SL C
Sbjct: 130 IASGSSFARAQVDGADFSDALLDRDDQRKLCQEAEGLNPTTGVLTRDSLSC 180
>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 182
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 72/112 (64%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
++F A R++ F + +GA L +A +A+F GADLSD LMD++ ++ +LT AVL
Sbjct: 70 SSFAGATGRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRG 129
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ + S+ GA + ADF+DA++D ++ LC+ A GTNP+TG TR SL C
Sbjct: 130 AIASGSNFTGATVTDADFTDALLDRVDQRNLCREARGTNPVTGADTRLSLDC 181
>gi|428211433|ref|YP_007084577.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999814|gb|AFY80657.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 166
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 76/114 (66%)
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
R A F++AD++ ++FS + GA ++ +ANF GADL++++++ L A+ T+AVL
Sbjct: 52 RTAEFSNADLQFTNFSNVQAEGAIFSLSMMKEANFHGADLTNSMLEWTNLTNADFTDAVL 111
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
V + +++ + GADF+DA++D AQ + LC+ A+G N TGV TR+SLGC
Sbjct: 112 VEALFLGANVKKMKVTGADFTDAILDGAQVKQLCENASGVNSKTGVDTRESLGC 165
>gi|87124337|ref|ZP_01080186.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
gi|86167909|gb|EAQ69167.1| hypothetical protein RS9917_12025 [Synechococcus sp. RS9917]
Length = 178
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 67/116 (57%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ + F AD++ D SGS GA + + A+ GADLSD + + A+L NA
Sbjct: 62 DLKEKEFLKADLQGVDLSGSDLRGAVINTSSLQGADLQGADLSDVVAFASRFDGADLRNA 121
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
V +L +S G A I+GADF+DAVIDL Q +ALC A G N TGV TR SLGC
Sbjct: 122 VFTNAMLMQSRFGDAQIDGADFTDAVIDLPQLKALCARAAGENSRTGVLTRDSLGC 177
>gi|157413511|ref|YP_001484377.1| hypothetical protein P9215_11761 [Prochlorococcus marinus str. MIT
9215]
gi|254526043|ref|ZP_05138095.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
gi|157388086|gb|ABV50791.1| conserved hpothetical protein [Prochlorococcus marinus str. MIT
9215]
gi|221537467|gb|EEE39920.1| Pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
Length = 172
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 73/130 (56%), Gaps = 10/130 (7%)
Query: 115 HVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+V+ N NF D+ R++DFS +G L + +N G DL+DTL
Sbjct: 42 YVRSNITGFNFHGEDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTL 101
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
DR+ + +L NAVL+ + + S GA IEGADFS A++D ++ LC+ A+G NP T
Sbjct: 102 SDRVNFQKTDLRNAVLINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCEIADGINPTT 161
Query: 225 GVSTRKSLGC 234
GVSTR SL C
Sbjct: 162 GVSTRDSLEC 171
>gi|126657693|ref|ZP_01728847.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
gi|126620910|gb|EAZ91625.1| hypothetical protein CY0110_25878 [Cyanothece sp. CCY0110]
Length = 167
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
GS A + L +++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 32 GSTASYEDVKLIGEDFSEKSLTYAQFTNADLTDSNFSKADLRGAVFNGSALIGADLHGAD 91
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L A+LT+AVL ++ R+ A I GADFS AV+D+ + + LC A+G
Sbjct: 92 LTNGLAYLTSFKGADLTDAVLTEAIMMRTKFDDAKITGADFSLAVLDIYEVEKLCDRADG 151
Query: 220 TNPITGVSTRKSLGC 234
NP TG+STR+SLGC
Sbjct: 152 VNPKTGISTRESLGC 166
>gi|72382551|ref|YP_291906.1| hypothetical protein PMN2A_0712 [Prochlorococcus marinus str.
NATL2A]
gi|72002401|gb|AAZ58203.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 184
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 74/131 (56%), Gaps = 6/131 (4%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 58 EDLHLSSIAGAMARDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRN 117
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
++LV + + S GA IEGADF+ A++D ++ LCK A+G NP TGVSTR SL C
Sbjct: 118 SILVNMIASGSSFAGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGD 177
Query: 238 RRNAYGSPSSP 248
+ PS P
Sbjct: 178 K------PSMP 182
>gi|352096257|ref|ZP_08957137.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351676951|gb|EHA60102.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 177
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F A R +DFS + G +A +ANF GA+LSD LMDR ++ +L +
Sbjct: 61 QNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDLRD 120
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L + S GA IEGADF+DA++D ++ LC+ A+G NP +GV+TR SL C
Sbjct: 121 ALLQGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNPSSGVATRDSLDC 177
>gi|254409676|ref|ZP_05023457.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183673|gb|EDX78656.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 163
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 74/117 (63%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N + A F +A+++ S+F+ + GA +V AN GADLS ++D+ L A+L++
Sbjct: 46 QNLQTAEFANANLQLSNFAYADLRGAIFSGSVMTHANLHGADLSYGMLDQADLTGADLSD 105
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+LV T+L S +I GADF+DA++D AQ + LC+ A+G N TGV+T SLGC
Sbjct: 106 VILVETLLLGSVFDNTLITGADFTDALLDGAQLKHLCQQASGINSKTGVATSDSLGC 162
>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
Length = 180
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 72/117 (61%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ +F A R +DFSG+ +GA L +A +A+F GADLS LMD++ + A+ T
Sbjct: 64 QDLANTSFAGAAGRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTG 123
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A L + + S+ GA + ADF+ A+ID ++ LC+ A GT+P+TG TR SLGC
Sbjct: 124 ADLSDVIASGSNFSGATVTNADFTGALIDRVDQRLLCRDAEGTHPLTGADTRLSLGC 180
>gi|172036187|ref|YP_001802688.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354552985|ref|ZP_08972292.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171697641|gb|ACB50622.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353554815|gb|EHC24204.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 179
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
GS+A + L ++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 44 GSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGAD 103
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L A+LTNAVL ++ R+ A I GADFS AV+D+ + LC A+G
Sbjct: 104 LTNGLAYLTSFKGADLTNAVLTEAIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADG 163
Query: 220 TNPITGVSTRKSLGC 234
NP TGVSTR+SLGC
Sbjct: 164 VNPKTGVSTRESLGC 178
>gi|123968679|ref|YP_001009537.1| hypothetical protein A9601_11461 [Prochlorococcus marinus str.
AS9601]
gi|126696485|ref|YP_001091371.1| hypothetical protein P9301_11471 [Prochlorococcus marinus str. MIT
9301]
gi|123198789|gb|ABM70430.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
gi|126543528|gb|ABO17770.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 172
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRN 114
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL+ + + S GA IEGADFS A++D ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 115 AVLINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171
>gi|78779436|ref|YP_397548.1| hypothetical protein PMT9312_1053 [Prochlorococcus marinus str. MIT
9312]
gi|78712935|gb|ABB50112.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 172
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLSDRVNFQKTDLRN 114
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL+ + + S GA IEGADFS A++D ++ LC+ A+G NP TGVSTR+SL C
Sbjct: 115 AVLINMIASGSSFAGAKIEGADFSYAILDSEDQRNLCEIADGINPTTGVSTRESLEC 171
>gi|78212400|ref|YP_381179.1| hypothetical protein Syncc9605_0856 [Synechococcus sp. CC9605]
gi|78196859|gb|ABB34624.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 181
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 70/117 (59%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F A R ++F G+ +GA L + +A+F GADLSD LMDR +L N
Sbjct: 65 QNLANTSFAGAVGRGANFRGANLHGAILTQGAFAEADFQGADLSDALMDRADFVGTDLRN 124
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL + + S A IEGADF+DA++D ++ LC A+G NP TGV+T SLGC
Sbjct: 125 AVLNGIIASGSSFSNAQIEGADFTDALLDRDDQRRLCGEADGINPSTGVATFDSLGC 181
>gi|124026254|ref|YP_001015370.1| hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
NATL1A]
gi|123961322|gb|ABM76105.1| Hypothetical protein NATL1_15481 [Prochlorococcus marinus str.
NATL1A]
Length = 184
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 74/131 (56%), Gaps = 6/131 (4%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 58 EDLHLSSIAGAMARDADFSNVDLHGTTLTLSDLKGSNLNGVDLTDTLSDRVNFQKTDLRN 117
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
++LV + + S GA IEGADF+ A++D ++ LCK A+G NP TGVSTR SL C
Sbjct: 118 SILVNMIASGSSFAGAQIEGADFTFAILDSEDQRNLCKIADGVNPTTGVSTRASLECKGD 177
Query: 238 RRNAYGSPSSP 248
+ PS P
Sbjct: 178 K------PSIP 182
>gi|220907989|ref|YP_002483300.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864600|gb|ACL44939.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 171
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 69/117 (58%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N + F A + ++ SG+ GA AV AN G + SD + ++A+L N
Sbjct: 54 QNLEQVEFGDARLSGANLSGANLRGAVFNAAVLTGANLQGVNFSDGIGYLCDFSDADLEN 113
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL +L +S+ GA I GADFS A++D Q LC+YA+G NP TGVSTR+SLGC
Sbjct: 114 AVLDSAMLLKSEFKGAKINGADFSFALLDRPQVLQLCEYASGVNPTTGVSTRESLGC 170
>gi|303287274|ref|XP_003062926.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455562|gb|EEH52865.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 182
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 74/142 (52%), Gaps = 14/142 (9%)
Query: 112 KAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
KA HV E+F ++ +T D+R SDFSGS A +A+ N G+D+ + +D
Sbjct: 17 KAEHVNEDFSHSDLVGAIYTEGDLRGSDFSGSDLRAAIFSRAIMPGVNLEGSDMQNAFLD 76
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA---------LCKYA 217
+VL AN+ + RSDLG + ADF++AVID Q ++ LC A
Sbjct: 77 YVVLRGANMRGVIASGANFVRSDLGDVDVTNADFTEAVIDRYQARSISHWSPYDPLCDGA 136
Query: 218 NGTNPITGVSTRKSLGCGNSRR 239
+G N TGV TR SLGC +R
Sbjct: 137 SGVNEFTGVDTRDSLGCDRLKR 158
>gi|113953693|ref|YP_729958.1| hypothetical protein sync_0742 [Synechococcus sp. CC9311]
gi|113881044|gb|ABI46002.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 190
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 71/117 (60%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N +F A R +DFS + G +A +ANF GA+LSD LMDR ++ +L +
Sbjct: 74 QNLVNTSFAGATGRGADFSDANLQGTIFTQAEFPEANFHGANLSDALMDRADFSKTDLRD 133
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+LV + S GA IEGADF+DA++D ++ LC+ A+G N +GVSTR SL C
Sbjct: 134 ALLVGVIAAGSSFAGADIEGADFTDALLDREDQRRLCQDADGVNSSSGVSTRDSLDC 190
>gi|123966365|ref|YP_001011446.1| hypothetical protein P9515_11321 [Prochlorococcus marinus str. MIT
9515]
gi|123200731|gb|ABM72339.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 172
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 69/117 (58%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSDVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRN 114
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
++L+ + + S GA IEGADFS A++D ++ LCK A G NP TGVSTR SL C
Sbjct: 115 SILINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171
>gi|352094392|ref|ZP_08955563.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351680732|gb|EHA63864.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 172
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 81/149 (54%), Gaps = 21/149 (14%)
Query: 92 ETRGEFGIGSAA-QFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYL 145
E RG+F + + DL++ +K + R + + D+R S G+ +GA L
Sbjct: 38 ELRGQFAVQDISNDMHGRDLKEKEFLKADLRGVDLSDTDLRGAVINTSQLQGADLHGANL 97
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
E VA+ + F DLSD AN TNA+L+++ A IEG DF++AVI
Sbjct: 98 EDVVAFSSRFDETDLSD----------ANFTNAMLMQSRFV-----DARIEGTDFTNAVI 142
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
DL Q +ALC A+G N ++GVSTR+SLGC
Sbjct: 143 DLTQMKALCGRASGVNSVSGVSTRESLGC 171
>gi|254430802|ref|ZP_05044505.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
gi|197625255|gb|EDY37814.1| secreted pentapeptide repeats protein [Cyanobium sp. PCC 7001]
Length = 173
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 74/126 (58%), Gaps = 1/126 (0%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+ +H + N R+ F A + DFS + GA + +A+ + A+L D +
Sbjct: 48 DLQPDMHGR-NLRQQEFLKASLEGFDFSEADLRGAVFNGSSLREADLSAANLEDVVAYAT 106
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+++NL A+L +L +S G+ I GADFSDAV+DL +++ALC A G NP TGVST
Sbjct: 107 RFDDSNLEGAILRNAMLMQSRFKGSSITGADFSDAVLDLPEQKALCARATGVNPSTGVST 166
Query: 229 RKSLGC 234
R+SL C
Sbjct: 167 RESLAC 172
>gi|116070665|ref|ZP_01467934.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
gi|116066070|gb|EAU71827.1| hypothetical protein BL107_13505 [Synechococcus sp. BL107]
Length = 169
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 77/131 (58%), Gaps = 20/131 (15%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDT 163
DL++ +K N R N + AD+R + + ++ GA L A V + + F GADL
Sbjct: 52 DLKEKEFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANLSDVVGFASRFDGADL--- 108
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
A LTNA+L+++ T A IEGADF+DAVIDL Q++ALC A+G NP
Sbjct: 109 -------RGAVLTNAMLMQSRFT-----DAQIEGADFTDAVIDLPQQRALCSSADGVNPQ 156
Query: 224 TGVSTRKSLGC 234
+GVSTR+SLGC
Sbjct: 157 SGVSTRESLGC 167
>gi|78184792|ref|YP_377227.1| hypothetical protein Syncc9902_1219 [Synechococcus sp. CC9902]
gi|78169086|gb|ABB26183.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 169
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 77/131 (58%), Gaps = 20/131 (15%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDT 163
DL++ +K N R N + AD+R + + ++ GA L A V + + F GADL
Sbjct: 52 DLKEKEFLKANLRDVNLSGADLRGAVINTTQLQGADLRDANLSDVVGFASRFDGADL--- 108
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
A LTNA+L+++ T A IEGADF+DAVIDL Q++ALC A+G NP
Sbjct: 109 -------RGAVLTNAMLMQSRFT-----DAQIEGADFTDAVIDLPQQRALCSSADGVNPQ 156
Query: 224 TGVSTRKSLGC 234
+GVSTR+SLGC
Sbjct: 157 SGVSTRESLGC 167
>gi|113953830|ref|YP_730899.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
gi|113881181|gb|ABI46139.1| Secreted pentapeptide repeats protein [Synechococcus sp. CC9311]
Length = 172
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 82/149 (55%), Gaps = 21/149 (14%)
Query: 92 ETRGEFGIGSAAQ-FGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYL 145
E RG+F + ++ DL++ +K + R + + D+R S G+ +GA L
Sbjct: 38 ELRGQFALQDISEDMHGRDLKEKEFLKADLRGIDLSDTDLRGAVINTSQLQGADLHGANL 97
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
E VA+ + F DLSD AN TNA+L+++ A IEG DF++AVI
Sbjct: 98 EDVVAFSSRFDETDLSD----------ANFTNAMLMQSRFV-----DARIEGTDFTNAVI 142
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
DL Q +ALC A+G N ++GVSTR+SLGC
Sbjct: 143 DLTQLKALCGRASGVNSVSGVSTRESLGC 171
>gi|33865660|ref|NP_897219.1| hypothetical protein SYNW1126 [Synechococcus sp. WH 8102]
gi|33632830|emb|CAE07641.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 190
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 5/164 (3%)
Query: 71 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 130
++VA+ +S L N +A T E A Q +AD+ + +KE F AD+
Sbjct: 30 SLVAAILVVVSTLLWTNSAQAITAPELRGQRAVQEITADM-HGLDLKEK----EFLKADL 84
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
RE + S + GA + + A+ GADLS+ + + A+L A +L +S
Sbjct: 85 REVNLSDTDLRGAVINTSQLQGADLRGADLSNVVGFASRFDGADLRGATFTNAMLMQSRF 144
Query: 191 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A IEGADF+DAV+DL Q++ LC A G +P++GVSTR+SLGC
Sbjct: 145 ADARIEGADFTDAVLDLPQQKLLCATAAGEHPVSGVSTRESLGC 188
>gi|33861598|ref|NP_893159.1| hypothetical protein PMM1042 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33634175|emb|CAE19501.1| conserved hpothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 172
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 69/117 (58%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ A R++DFS +G L + +N G DL+DTL DR+ + +L N
Sbjct: 55 EDLHLSSIAGAVARDADFSEVDLHGTTLTLSDLKGSNLNGIDLTDTLADRVNFQKTDLRN 114
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
++L+ + + S GA IEGADFS A++D ++ LCK A G NP TGVSTR SL C
Sbjct: 115 SILINMIASGSSFAGAQIEGADFSYAILDSEDQRNLCKIAEGVNPTTGVSTRDSLEC 171
>gi|33240611|ref|NP_875553.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33238139|gb|AAQ00206.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 183
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 75/131 (57%), Gaps = 2/131 (1%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ +++ A R+S+ S +G + A +N G +L+DTL DR+ + +L N
Sbjct: 53 QDLSKSSIAGATARDSNLSDVDLHGTVVTLADLKGSNLNGINLTDTLSDRVNFQKTDLRN 112
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
AVLV + + S GA IEGADFS AV+D ++ LC+ A GTNP TG+STR+SL C S
Sbjct: 113 AVLVNMIASGSSFAGAQIEGADFSYAVLDSDDQRNLCEIAEGTNPQTGISTRESLEC--S 170
Query: 238 RRNAYGSPSSP 248
R P P
Sbjct: 171 ERGVGYKPPMP 181
>gi|414079727|ref|YP_007001151.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413973006|gb|AFW97094.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 167
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 16/179 (8%)
Query: 56 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 115
K +NW +S L A + + ++ A +Y E I +A F DL +
Sbjct: 4 KHRNWISILSLLLWAIISTTALASFVPTAVALEYNKE------ILISADFSGRDLTDSSF 57
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
K N R +NF+ +++R G F A LE A N GADL++T +D L +A+L
Sbjct: 58 TKANLRYSNFSHSNLR-----GVSFFAANLESA-----NLQGADLTNTTLDSARLIKADL 107
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
TNA+L + GAII+GADF+D ++ +++ LCK A GTNP+T TR +L C
Sbjct: 108 TNAILEGAFAANARFDGAIIDGADFTDVLLRQDEQKKLCKLAKGTNPVTKRDTRDTLYC 166
>gi|145356305|ref|XP_001422373.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582615|gb|ABP00690.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 123
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 68/118 (57%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+E+ R A + AD+R SD S GA +AV + AD SD + D +L ++ T
Sbjct: 6 REDLRGAIYAEADLRRSDLRESDARGAVFSRAVMPGVDARDADFSDAMFDYALLRGSDFT 65
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
N+V V R+DLG + ADF++AVID Q +LC+ A+GTNP TG +TR SL C
Sbjct: 66 NSVFVGANFVRADLGEVVATNADFTEAVIDRYQTLSLCERASGTNPYTGANTRDSLLC 123
>gi|148239470|ref|YP_001224857.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147848009|emb|CAK23560.1| Secreted pentapeptide repeats protein [Synechococcus sp. WH 7803]
Length = 176
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 81/149 (54%), Gaps = 21/149 (14%)
Query: 92 ETRGEFGIGS-AAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYL 145
E RG+ + ++ DL++ +K + R + AD+R S G+ GA L
Sbjct: 42 ELRGQRAVQDISSNMHGRDLKEKEFLKADLREVDLGDADLRGAVINTSQLQGADLRGADL 101
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
E VA+ + F GADL D AN TNA+L++ S A IEG DF++AVI
Sbjct: 102 EDVVAFSSRFDGADLRD----------ANFTNAMLMQ-----SRFNDAQIEGTDFTNAVI 146
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
DL Q +ALC A+G N ++GVST++SLGC
Sbjct: 147 DLPQLKALCGRASGVNSLSGVSTKESLGC 175
>gi|427701840|ref|YP_007045062.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345008|gb|AFY27721.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 184
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 75/126 (59%), Gaps = 5/126 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR ++ F +A+ D+R++D G+ FN L +A + GADL D +
Sbjct: 63 DLRGRNLQQQEFLKASMEGFDLRDADLRGAVFNSTDLRQA-----DLRGADLEDVVAFAT 117
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ A+L A +L +S A I+GADFSDAV+DL +++ALC A+G++P+TGV T
Sbjct: 118 RFDGADLRGAQFRNAMLMQSRFRDARIDGADFSDAVLDLPEQKALCARASGSHPLTGVDT 177
Query: 229 RKSLGC 234
R+SLGC
Sbjct: 178 RESLGC 183
>gi|33240300|ref|NP_875242.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33237827|gb|AAP99894.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 170
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 76/134 (56%), Gaps = 20/134 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S +F DLR NFR +N T A S +G+ +GA L+ A+AY ++F ADL
Sbjct: 57 SGYEFVKFDLRGI-----NFRDSNLTGAVFNNSKLNGADLHGANLKDALAYASDFEDADL 111
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+D+ NL+NA+L+ S AIIEGADF+DAV+ Q++ LC A+GT
Sbjct: 112 TDS----------NLSNALLME-----SSFNNAIIEGADFTDAVLSRIQQKQLCSIADGT 156
Query: 221 NPITGVSTRKSLGC 234
N TG+ST SLGC
Sbjct: 157 NSSTGISTSYSLGC 170
>gi|123968372|ref|YP_001009230.1| hypothetical protein A9601_08391 [Prochlorococcus marinus str.
AS9601]
gi|123198482|gb|ABM70123.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 170
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 65/116 (56%), Gaps = 15/116 (12%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +N A S SKFNGA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSESNLEGAVFNNSKLQNSKFNGANLRDALAYATDFTDADLSDV----------NFTNA 119
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|428202122|ref|YP_007080711.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427979554|gb|AFY77154.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 168
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 74/135 (54%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA F +L +N + A FT+ D+ ++FS + GA + + N GAD
Sbjct: 33 GAAASFEDKNLSGQDFSGQNLQTAQFTNVDLTSANFSNTDLRGAVFNGSALKETNLHGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L N A+L++AVL ++ R+ GA I GADF+ AV+D Q LC A+G
Sbjct: 93 LTNGLAYLSSFNGADLSDAVLTEAIMLRTTFDGANITGADFTLAVLDGDQVAKLCTIASG 152
Query: 220 TNPITGVSTRKSLGC 234
N TGV TR SLGC
Sbjct: 153 VNSKTGVETRASLGC 167
>gi|302768839|ref|XP_002967839.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
gi|300164577|gb|EFJ31186.1| hypothetical protein SELMODRAFT_408705 [Selaginella moellendorffii]
Length = 126
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 71/130 (54%), Gaps = 10/130 (7%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-----L 164
+R A ++ R A F + D R+ + GS L+ + A F G DL DT L
Sbjct: 1 MRGADLSGQDLRGAVFAACDCRKINLRGSN-----LDSSTDTFAGFEGGDLQDTSWVQAL 55
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
DR+V NL NA+ +LT S GA I GADF++A++D Q+ LCK A GTN IT
Sbjct: 56 ADRVVFRMTNLQNAIFTNAILTGSQFDGADITGADFTEAILDNYQRLKLCKRATGTNSIT 115
Query: 225 GVSTRKSLGC 234
GV TR+SL C
Sbjct: 116 GVETRESLAC 125
>gi|159903694|ref|YP_001551038.1| hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
9211]
gi|159888870|gb|ABX09084.1| Hypothetical protein P9211_11531 [Prochlorococcus marinus str. MIT
9211]
Length = 183
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 72/120 (60%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ +++ A R+++ S +G + A +N G DL+DTL DR+ + +L N
Sbjct: 53 QDLSKSSIAGATARDANLSDVDLHGTVVTLADLKGSNLNGIDLTDTLSDRVNFQKTDLRN 112
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
AVLV + + S GA+I GADFSD+V+D ++ LC+ A G NP TG++TR SL C ++
Sbjct: 113 AVLVNMIASGSSFAGALIAGADFSDSVLDRDDQRNLCEIAEGVNPKTGIATRDSLECSDN 172
>gi|170078800|ref|YP_001735438.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169886469|gb|ACB00183.1| secreted pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 165
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 69/128 (53%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
S DL + +N + F+ ++ +D SGS GA + N GAD ++ +
Sbjct: 38 SEDLAGSNFAGQNLQGVEFSQVNLTNADLSGSDLRGAVFNSTLLETTNLHGADFTNGIAY 97
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
A+LT+A+ V +L RS A I+GADFS AV+D Q++ LC A G NP+TG+
Sbjct: 98 LSKFTGADLTDAIFVEAILLRSTFENAKIDGADFSFAVLDGPQQKKLCAVATGVNPVTGI 157
Query: 227 STRKSLGC 234
+T SLGC
Sbjct: 158 ATADSLGC 165
>gi|440684721|ref|YP_007159516.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428681840|gb|AFZ60606.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 167
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 74/132 (56%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F DL + K N R++NF+ A++R G F A LE A N GADL++
Sbjct: 45 ADFSGRDLTDSSFTKANLRQSNFSHANLR-----GVSFFAANLESA-----NLEGADLTN 94
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D L ANLTN +L + GAII+GADF+DA++ +++ LCK A G NP
Sbjct: 95 ATLDSARLIRANLTNTILEGAFAASARFDGAIIDGADFTDALLRGDEQKKLCKVAKGNNP 154
Query: 223 ITGVSTRKSLGC 234
+TG TR++L C
Sbjct: 155 VTGRDTRETLFC 166
>gi|124023397|ref|YP_001017704.1| hypothetical protein P9303_16951 [Prochlorococcus marinus str. MIT
9303]
gi|123963683|gb|ABM78439.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 198
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 79/149 (53%), Gaps = 25/149 (16%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYL 145
EF G A + S D+ ++NF +A+ D+ E+D G+ FN GA L
Sbjct: 63 EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
E VA+ + F GADL AN TNA+L++ S A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
D Q+ LC ANGTN ++G +T SLGC
Sbjct: 168 DRRQQNELCSRANGTNAVSGSNTIDSLGC 196
>gi|428773304|ref|YP_007165092.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428687583|gb|AFZ47443.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 164
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 75/130 (57%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F + DL AV + RR + MR SD + + + L A + NF+GA L
Sbjct: 43 FSNQDLVGAVFAASSMRRVS-----MRNSDLTNAMMTESVLLDADLHGVNFSGA-----L 92
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+DR+ + ++L++A+L+ + TR+ I GADF+DAVID Q +C+ A+G NP+T
Sbjct: 93 IDRVTFDFSDLSDAILIGAIATRTRFYDTDITGADFTDAVIDRYQVSLMCERADGVNPVT 152
Query: 225 GVSTRKSLGC 234
GV+TR SLGC
Sbjct: 153 GVATRDSLGC 162
>gi|220907029|ref|YP_002482340.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863640|gb|ACL43979.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 174
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 75/137 (54%), Gaps = 4/137 (2%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G AA + L ++ R + FT A++ SDFS S G A AN +GAD
Sbjct: 38 GWAADYTKESLVGVDFSGKDLRDSEFTQANLSRSDFSQSDLRGVSFFAANLESANLSGAD 97
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYA 217
L T +D L ANLTNA+L + GA I GADF+D +DL Q + LC+ A
Sbjct: 98 LRLTTLDNARLTHANLTNAILEGAFAFNARFQGATITGADFTD--VDLRQDAQTILCQGA 155
Query: 218 NGTNPITGVSTRKSLGC 234
+GTNP+TG +TR++LGC
Sbjct: 156 SGTNPVTGRNTRETLGC 172
>gi|434386546|ref|YP_007097157.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428017536|gb|AFY93630.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 212
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 72/117 (61%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N + A FT+A + +++F+G+ G + + + N GA+L+ L+D++ A+L++
Sbjct: 95 KNLQTAVFTTAKLDDTNFAGADLTGVVISSSTLNRTNLHGANLTQGLLDQVRFVGADLSD 154
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV V ++ RS I GADF+DA++ Q++ LC+ A G N TGV+TR SLGC
Sbjct: 155 AVFVEAMMLRSTFTDVNIAGADFTDAILGKLQQKELCQIATGVNSKTGVATRDSLGC 211
>gi|119389531|pdb|2G0Y|A Chain A, Crystal Structure Of A Lumenal Pentapeptide Repeat Protein
From Cyanothece Sp 51142 At 2.3 Angstrom Resolution.
Tetragonal Crystal Form
Length = 184
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 75/135 (55%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
GS+A + L ++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 49 GSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGAD 108
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L A+LTNAVL ++ R+ A I GADFS AV+D+ + LC A+G
Sbjct: 109 LTNGLAYLTSFKGADLTNAVLTEAIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDRADG 168
Query: 220 TNPITGVSTRKSLGC 234
NP TGVSTR+SL C
Sbjct: 169 VNPKTGVSTRESLRC 183
>gi|428298761|ref|YP_007137067.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235305|gb|AFZ01095.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 169
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 93/181 (51%), Gaps = 18/181 (9%)
Query: 56 KLKN--WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
KL N WR+ +S L + + ++ +A +Y E I + F DL +
Sbjct: 4 KLSNNFWRIVLSALLGTVIWMISTWGLTPIAFALEYNKE------ILIQSDFSGRDLSDS 57
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
K N +++NF++ ++R G F A LE + TGADLS++ +D L +A
Sbjct: 58 SFTKANLKQSNFSNTNLR-----GVSFFAANLESV-----DLTGADLSNSTLDSARLVKA 107
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
NLTNA+L + GAII+GADF+D ++ ++ LCK A GTNP T +TR +L
Sbjct: 108 NLTNAILEGAFAISAKFEGAIIDGADFTDILLRDDEQARLCKIATGTNPTTKRNTRDTLM 167
Query: 234 C 234
C
Sbjct: 168 C 168
>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 164
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 88/178 (49%), Gaps = 15/178 (8%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
+K+WRVF LA V+ + L+ + +R Q +AD +
Sbjct: 1 MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+E F + A+ +D G+ FN AYLEKA N GAD ++ + + +A+L+
Sbjct: 51 REEFTKVKLDKANFSNADLRGAVFNNAYLEKA-----NLHGADFTNGIAYLVDFRDADLS 105
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+A+ T+L S I G DF++AV+D + + LC ANG N TGVSTR+SL C
Sbjct: 106 DAIFTDTMLLYSTFDNVEITGTDFTNAVLDGPELKKLCARANGVNSKTGVSTRESLEC 163
>gi|443314247|ref|ZP_21043822.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786146|gb|ELR95911.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 166
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 69/136 (50%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
+ A F +LR ++ R ++T ADM E + +G+ G L KAN GA
Sbjct: 30 VAQAESFDRQNLRMRDFSGQDLRGNDYTRADMAEVNLTGANLQGVRLFDTNLTKANLEGA 89
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
DL +D ANLTNA+L + +D AII+GADF+D +D LC A
Sbjct: 90 DLRGATLDGARFLAANLTNAILAGSYAFNTDFRKAIIDGADFTDVFLDPKTNDLLCAVAQ 149
Query: 219 GTNPITGVSTRKSLGC 234
GTNP+TG TR +L C
Sbjct: 150 GTNPVTGRDTRDTLYC 165
>gi|17232102|ref|NP_488650.1| hypothetical protein alr4610 [Nostoc sp. PCC 7120]
gi|17133747|dbj|BAB76309.1| alr4610 [Nostoc sp. PCC 7120]
Length = 164
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 101/190 (53%), Gaps = 39/190 (20%)
Query: 57 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 109
+K+WRV VS LA + A+ SS+I+ A + G+ IGS +F + D
Sbjct: 1 MKDWRVVVSFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTL 164
L EN ANF++AD+R F+G+ G L + +AY ANF ADLSD +
Sbjct: 59 L-------EN---ANFSNADLRGGVFNGTVLEGVNLHGVDFSEGIAYLANFKNADLSDAI 108
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
LTNA+++R++ + + GADF++AV+D+ Q + LC ANG N T
Sbjct: 109 ----------LTNAMMLRSIFDNVN-----VTGADFTNAVLDITQVKKLCLKANGVNSKT 153
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 154 GVDTRESLGC 163
>gi|67922694|ref|ZP_00516198.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|416392485|ref|ZP_11685875.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
gi|67855476|gb|EAM50731.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|357263639|gb|EHJ12621.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
Length = 170
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 74/135 (54%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G++A + L ++ A FT+AD+ +S+FS + GA + + AD
Sbjct: 35 GASASYEDVQLIGEDFSGKSLTYAQFTNADLTDSNFSDADLRGAVFNGSALIGTDLHQAD 94
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L A+LTNAVL ++ R+ A I GADFS AV+DL Q LCK A+G
Sbjct: 95 LTNGLAYLTSFEGADLTNAVLTEAIMMRTTFKNANITGADFSLAVLDLQQVAELCKRADG 154
Query: 220 TNPITGVSTRKSLGC 234
N TG+STR+SLGC
Sbjct: 155 VNSKTGISTRESLGC 169
>gi|428203139|ref|YP_007081728.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980571|gb|AFY78171.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 177
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F A++R S+FS +K G A ANF GADL+ ++ L AN TNA+LV
Sbjct: 67 FDHANLRGSNFSNAKLQGVRFFAANLESANFEGADLTGADLESARLVRANFTNAILVGAF 126
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
T + GAII+GADF+D ++ ++ LC+ A GTNP+TG +TR +L C
Sbjct: 127 ATNTLFNGAIIDGADFTDVLLRPDTEKKLCEIARGTNPVTGRNTRDTLNC 176
>gi|119389418|pdb|2F3L|A Chain A, Crystal Structure Of A Lumenal Rfr-Domain Protein
(Contig83.1_1_243_746) From Cyanothece Sp. 51142 At 2.1
Angstrom Resolution
Length = 184
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 74/135 (54%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
GS+A + L ++ A FT+AD+ +S+FS + GA + A+ GAD
Sbjct: 49 GSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFSEADLRGAVFNGSALIGADLHGAD 108
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L A+LTNAVL + R+ A I GADFS AV+D+ + LC A+G
Sbjct: 109 LTNGLAYLTSFKGADLTNAVLTEAIXXRTKFDDAKITGADFSLAVLDVYEVDKLCDRADG 168
Query: 220 TNPITGVSTRKSLGC 234
NP TGVSTR+SL C
Sbjct: 169 VNPKTGVSTRESLRC 183
>gi|427723591|ref|YP_007070868.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427355311|gb|AFY38034.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 165
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 67/128 (52%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
S D +NF+ A FT + R +D S + GA + N GAD+S+ +
Sbjct: 38 SEDFANENFAGQNFQGAEFTQVNFRNADMSNTDLRGAVFNSSQLQNTNLHGADMSNGIAY 97
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
A+L+ A+ +L RS A I+GADFS AV+D +Q++ LC A G NP+TG+
Sbjct: 98 LSAFTGADLSGAIFEEAILLRSTFDDANIDGADFSFAVLDGSQQKKLCAAATGVNPVTGI 157
Query: 227 STRKSLGC 234
T SLGC
Sbjct: 158 ETADSLGC 165
>gi|434400099|ref|YP_007134103.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271196|gb|AFZ37137.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 167
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 72/118 (61%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
++ R A F A +R SDFS S +G L + + NFTGA+LS+ ++ L AN T
Sbjct: 49 HQDLRDAIFDHASLRGSDFSYSDLSGVRLFGSNLSRVNFTGANLSNADLESCRLTRANFT 108
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA+L +T + L AIIEGADF++ ++ ++ LC+ A+GTNP TG +T+ +L C
Sbjct: 109 NAILTGAFMTNTLLDEAIIEGADFTNVLLSPTTEKMLCENASGTNPTTGRNTKDTLFC 166
>gi|157413206|ref|YP_001484072.1| hypothetical protein P9215_08711 [Prochlorococcus marinus str. MIT
9215]
gi|254525828|ref|ZP_05137880.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
MIT 9202]
gi|157387781|gb|ABV50486.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
gi|221537252|gb|EEE39705.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
MIT 9202]
Length = 170
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 64/116 (55%), Gaps = 15/116 (12%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +N A S SKF GA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNA 119
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|186686067|ref|YP_001869263.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186468519|gb|ACC84320.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 191
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 65/110 (59%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
FT A++R+S+FS + NG A AN G+DL + +D L ANLTNA+L
Sbjct: 81 FTKANLRQSNFSRANLNGVSFFAANLESANLEGSDLRNATLDSARLVRANLTNALLEGAF 140
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ GAII+GADF+D ++ +++ LCK A GTNP TG TR +L C
Sbjct: 141 AANARFDGAIIDGADFTDTLLRPDEQKKLCKLAKGTNPTTGRDTRDTLFC 190
>gi|75909862|ref|YP_324158.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75703587|gb|ABA23263.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 194
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 74/133 (55%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A ++ L +A + ++FT A++R+S+FS S G A AN G +L+
Sbjct: 61 ALEYNKEILVEADFSGRDLTDSSFTKANLRQSNFSKSNLTGVSFFAANLESANLEGTNLT 120
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+ +D L +ANLTNAVL + GAII+GADF+D ++ +++ LCK A GTN
Sbjct: 121 NATLDSARLIKANLTNAVLEGAFAASTKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTN 180
Query: 222 PITGVSTRKSLGC 234
P TG TR +L C
Sbjct: 181 PTTGRETRDTLFC 193
>gi|427706684|ref|YP_007049061.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359189|gb|AFY41911.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 169
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 74/132 (56%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F DL + K N R++NF+++++R G F A LE A N G DL++
Sbjct: 47 ADFSGRDLTDSSFTKANLRQSNFSNSNLR-----GVSFFAANLESA-----NLQGTDLTN 96
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D L +A+LTNAVL + GAII+GADF+D ++ +++ LCK A GTNP
Sbjct: 97 ATLDSARLMKADLTNAVLEGAFAANAKFDGAIIDGADFTDVLLRPDEQKKLCKVAKGTNP 156
Query: 223 ITGVSTRKSLGC 234
TG TR +L C
Sbjct: 157 TTGRDTRDTLFC 168
>gi|126696175|ref|YP_001091061.1| hypothetical protein P9301_08371 [Prochlorococcus marinus str. MIT
9301]
gi|91070292|gb|ABE11210.1| conserved hypothetical protein [uncultured Prochlorococcus marinus
clone HF10-88D1]
gi|126543218|gb|ABO17460.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 170
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 64/116 (55%), Gaps = 15/116 (12%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +N A S SKF GA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSESNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNA 119
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|33865584|ref|NP_897143.1| hypothetical protein SYNW1050 [Synechococcus sp. WH 8102]
gi|33632753|emb|CAE07565.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 162
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R A F +++RE++ SGS GA L A A+ +G DL + +D V+ NL+N
Sbjct: 45 KDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTNLSN 104
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 235
AVL + I GADF+D + Q ++LC A+GTNP+TG STR SLGCG
Sbjct: 105 AVLEGAFAFNTRFVDVTISGADFTDVPMRGDQLKSLCAVADGTNPVTGRSTRDSLGCG 162
>gi|427731475|ref|YP_007077712.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427367394|gb|AFY50115.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 185
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 9/149 (6%)
Query: 95 GEFGIGSAAQFG----SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYL 145
G GI + A F + D K + ++ +F ++FT A++R+S+FS S G
Sbjct: 36 GILGITTIAGFAPTALALDYNKEILIEADFSGRDLTDSSFTKANLRQSNFSNSNLQGVSF 95
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A AN G +LS+ +D L +A+LTNAVL + GAII+GADF+D ++
Sbjct: 96 FAANLESANLQGVNLSNATLDSARLIKADLTNAVLEGAFAANAKFDGAIIDGADFTDVLL 155
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
+++ LCK A GTNP TG T +L C
Sbjct: 156 RPDEQKKLCKVAKGTNPTTGRDTHDTLYC 184
>gi|166362955|ref|YP_001655228.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
gi|166085328|dbj|BAG00036.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
Length = 186
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 50 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 109
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 110 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 169
Query: 220 TNPITGVSTRKSLGC 234
N TG+ST +SLGC
Sbjct: 170 VNSKTGISTPESLGC 184
>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 171
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 72/118 (61%), Gaps = 10/118 (8%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K N R +NFT+AD+R G F A +E+A ANFTGA L + RM+ +ANLT
Sbjct: 62 KANLRNSNFTNADLR-----GVSFFAANMEEANLEGANFTGATLD---LARMM--KANLT 111
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA+L + L GA+I+GADF+D ++ + LCK A GTNP+TG TR++L C
Sbjct: 112 NAILEGAFAYNTRLEGAVIDGADFTDTLLRDDMIEKLCKVAKGTNPVTGRDTRETLFC 169
>gi|119490210|ref|ZP_01622723.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
gi|119454096|gb|EAW35249.1| hypothetical protein L8106_15969 [Lyngbya sp. PCC 8106]
Length = 177
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 63/110 (57%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F A++R+S+FS S G L A + NF ADLS +D LN ANLTNA+L
Sbjct: 67 FDFANLRDSNFSHSNLRGVSLFGAKLQRTNFEAADLSYATLDTARLNRANLTNAILEGAF 126
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+D A+I GADF+D ++ ++ LC A GTNP+TG TR +L C
Sbjct: 127 AYNTDFSDAMIAGADFTDVLLRRDMQEKLCALAEGTNPVTGRDTRDTLYC 176
>gi|443666115|ref|ZP_21133744.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159030126|emb|CAO91018.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331286|gb|ELS45952.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 220 TNPITGVSTRKSLGC 234
N TGVST +SLGC
Sbjct: 153 VNSKTGVSTPESLGC 167
>gi|390438199|ref|ZP_10226689.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|425441109|ref|ZP_18821396.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|425454770|ref|ZP_18834496.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|425466166|ref|ZP_18845469.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|425468563|ref|ZP_18847571.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389718271|emb|CCH97753.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389804467|emb|CCI16499.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|389831470|emb|CCI25816.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389838386|emb|CCI30813.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|389884775|emb|CCI34954.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 220 TNPITGVSTRKSLGC 234
N TGVST +SLGC
Sbjct: 153 VNSKTGVSTPESLGC 167
>gi|168067322|ref|XP_001785569.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662809|gb|EDQ49618.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 545
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 67/130 (51%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F ADLR +N R F + D R+ + GS +G+ A N +
Sbjct: 416 FDHADLRGRDMSNQNLRGVVFAACDCRKINLEGSTMDGSTDTFAGFEGGNLKNSSWIRAF 475
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
DR+V ANL NA VL+ S GA I GADF+DA++D Q+ +C+ A G NP T
Sbjct: 476 ADRVVFRGANLENANFTDAVLSGSQFDGADITGADFTDALVDNYQRLQMCRRAKGVNPTT 535
Query: 225 GVSTRKSLGC 234
GV+TR+SL C
Sbjct: 536 GVATRESLFC 545
>gi|425445790|ref|ZP_18825810.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389734131|emb|CCI02174.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAEQIKNLCERAEG 152
Query: 220 TNPITGVSTRKSLGC 234
N TGVST +SLGC
Sbjct: 153 VNSKTGVSTPESLGC 167
>gi|194476536|ref|YP_002048715.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
gi|171191543|gb|ACB42505.1| hypothetical protein PCC_0045 [Paulinella chromatophora]
Length = 167
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 74/130 (56%), Gaps = 20/130 (15%)
Query: 110 LRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
L++ +K + ++ +F+ +D+R SD + N A L+ VA+ + F GADL T
Sbjct: 53 LQQQEFLKADLQKIDFSESDLRGTVFNNSDLRNANLNAADLQDVVAFASRFDGADLRQT- 111
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
NL N +L++ S +IEGADF+DA++DL Q++ LC +ANGTN T
Sbjct: 112 ---------NLRNGMLIQ-----SKFKDTLIEGADFTDAILDLKQQKILCSFANGTNLKT 157
Query: 225 GVSTRKSLGC 234
GV T++SL C
Sbjct: 158 GVDTKESLRC 167
>gi|354567943|ref|ZP_08987110.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353541617|gb|EHC11084.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 73/133 (54%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A + + L A K++ ++F A++R S+FS S G + +FTGADLS
Sbjct: 36 AINYNNRTLEAADFSKQDLTDSSFDHANLRNSNFSNSNLRGVRFFSSNLASVDFTGADLS 95
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
++ + +ANLTNA+L T + GAII+GADF+D I LC+ A GTN
Sbjct: 96 YADLESARMTKANLTNAILEGAFTTGTMFDGAIIDGADFTDTYIREDTLNKLCQVAKGTN 155
Query: 222 PITGVSTRKSLGC 234
P+TG +TR +L C
Sbjct: 156 PVTGRNTRDTLAC 168
>gi|119511413|ref|ZP_01630525.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
gi|119463958|gb|EAW44883.1| hypothetical protein N9414_20009 [Nodularia spumigena CCY9414]
Length = 126
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 73/117 (62%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + A F++A+M ++F+ + GA + +V +AN GADL++ ++D++ A+L++
Sbjct: 9 QSLQAAEFSNANMELANFADADLRGAVMSASVMTQANLHGADLTNAMVDQVKFAGADLSD 68
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV +L RS I+ ADF+DA++D Q + LC A+G N TGV TR SLGC
Sbjct: 69 AVFKEALLLRSTFTDVNIDSADFTDAILDGVQIKELCSKASGVNSKTGVETRYSLGC 125
>gi|422301609|ref|ZP_16388976.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
gi|389789327|emb|CCI14609.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFAEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 220 TNPITGVSTRKSLGC 234
N TG+ST +SLGC
Sbjct: 153 VNSKTGISTLESLGC 167
>gi|116074723|ref|ZP_01471984.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
gi|116067945|gb|EAU73698.1| hypothetical protein RS9916_29354 [Synechococcus sp. RS9916]
Length = 173
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/189 (34%), Positives = 92/189 (48%), Gaps = 28/189 (14%)
Query: 56 KLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVH 115
+L N R S LA V C AL + +A T E A Q SAD+
Sbjct: 2 RLLNPRALCSGLLATLV---CCVISVALLPSSPAQAITAPELRGQKAVQDISADMHGRDL 58
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLM 165
++ F +A+ D+ E+D G+ N + L+ VA+ + F GADL D
Sbjct: 59 KEKEFLKADLQGVDLSEADLRGAVINTSLLQGSDLRSADLGDVVAFASRFDGADLRD--- 115
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 225
A NA+L+++ T ++ IEGADF++AVIDL Q +A+C A G N TG
Sbjct: 116 -------ARFVNAMLMQSRFTEAN-----IEGADFTNAVIDLPQLKAMCARAEGVNSATG 163
Query: 226 VSTRKSLGC 234
+STR+SLGC
Sbjct: 164 ISTRESLGC 172
>gi|78779169|ref|YP_397281.1| hypothetical protein PMT9312_0785 [Prochlorococcus marinus str. MIT
9312]
gi|78712668|gb|ABB49845.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 170
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 64/116 (55%), Gaps = 15/116 (12%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +N A S SKF GA L A+AY +FT ADLSD N TNA
Sbjct: 70 NFSDSNLEGAVFNNSKLQNSKFTGANLRDALAYATDFTDADLSD----------VNFTNA 119
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L+ S+ GA I+GADF+DAV+ Q++ LC ANGTN TG ST SLGC
Sbjct: 120 LLME-----SNFEGAKIDGADFTDAVLSRTQQKQLCAIANGTNSSTGESTEYSLGC 170
>gi|218438105|ref|YP_002376434.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218170833|gb|ACK69566.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 168
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 72/135 (53%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A F L A + A FT+ D+ E+DFS + GA + + GAD
Sbjct: 33 GATATFEDKKLVGADFSGQTLTLAQFTNVDLSEADFSNADLRGAVFNGSALIEGKLRGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L A+L++A+L ++ R+ A + GADFS AV+D Q LC+ A+G
Sbjct: 93 LTNALGYLSSFERADLSDAILAEVIMKRTSFKNADVTGADFSYAVLDGEQIANLCRTASG 152
Query: 220 TNPITGVSTRKSLGC 234
N TGVSTR+SLGC
Sbjct: 153 VNSKTGVSTRESLGC 167
>gi|428300991|ref|YP_007139297.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428237535|gb|AFZ03325.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 166
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 74/121 (61%), Gaps = 20/121 (16%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEA 173
N +ANF++AD+R + F+GS + A L+ + +AY ++F GA+LSD A
Sbjct: 59 NLEKANFSAADLRGAVFNGSMLHDANLQGIDFSEGIAYLSDFKGANLSD----------A 108
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
TNA+++R+ + D + GADF++AV+D + Q LC A+G NP TGV TR+SLG
Sbjct: 109 VFTNAMMLRSAFSDVD-----VTGADFTNAVLDRTEVQKLCVNASGVNPKTGVETRQSLG 163
Query: 234 C 234
C
Sbjct: 164 C 164
>gi|17230824|ref|NP_487372.1| hypothetical protein all3332 [Nostoc sp. PCC 7120]
gi|17132427|dbj|BAB75031.1| all3332 [Nostoc sp. PCC 7120]
Length = 206
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 81/147 (55%), Gaps = 9/147 (6%)
Query: 97 FGIGSAAQFG----SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEK 147
FG+ + A F + + K + V+ +F ++FT A++R+S+FS S G
Sbjct: 59 FGMITIANFTPPAFALEYNKEILVEADFSGRDLTDSSFTKANLRQSNFSKSNLTGVSFFA 118
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
A AN G++L++ +D L +ANL NAVL + GAII+GADF+D ++
Sbjct: 119 ANLESANLEGSNLTNATLDSARLIKANLKNAVLEGAFAASTKFDGAIIDGADFTDVLLRP 178
Query: 208 AQKQALCKYANGTNPITGVSTRKSLGC 234
+++ LCK A GTNP TG TR +L C
Sbjct: 179 DEQKKLCKVAKGTNPTTGRETRDTLFC 205
>gi|116070732|ref|ZP_01468001.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
gi|116066137|gb|EAU71894.1| hypothetical protein BL107_13840 [Synechococcus sp. BL107]
Length = 165
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 74/135 (54%), Gaps = 5/135 (3%)
Query: 105 FGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F + D+ K V + ++ A F +++RE+D SGS GA L A A+ + D
Sbjct: 30 FAAVDVAKQVLIGADYANKDLVGATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTD 89
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L + +D V+ NL+NAV+ + +I GADF+D + Q ++LC A+G
Sbjct: 90 LREATLDSAVMTGTNLSNAVMEGAFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADG 149
Query: 220 TNPITGVSTRKSLGC 234
TNP+TG STR+SLGC
Sbjct: 150 TNPVTGRSTRESLGC 164
>gi|428781463|ref|YP_007173249.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428695742|gb|AFZ51892.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 165
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTGAD 159
F L +A E RA+F +A++ + F+G+ + G +AY +FTG D
Sbjct: 45 FSGESLIEAEFYDEELERADFHNANLEAAVFNGANLTNANWQGVNFTNGIAYLTDFTGVD 104
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
TNA+L ++ RS A +EG DF++AV+D Q + LC+ A+G
Sbjct: 105 F---------------TNAILTEAMMLRSTFNDATVEGVDFTNAVVDRLQVKRLCERASG 149
Query: 220 TNPITGVSTRKSLGC 234
NP TGVSTR+SLGC
Sbjct: 150 VNPTTGVSTRESLGC 164
>gi|428770110|ref|YP_007161900.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684389|gb|AFZ53856.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 193
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/131 (40%), Positives = 69/131 (52%), Gaps = 15/131 (11%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----------GADLSDT----- 163
NF A D D +GS F + L A Y++N T GADL +T
Sbjct: 60 NFTYAQLEGEDFSHRDLTGSVFAASNLRNASFYQSNLTNSVMTEGILFGADLRETNFTGS 119
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
L+DR+ L+ A+L NA+ + TR+ IEGADF+ AVID Q +C A+G N I
Sbjct: 120 LIDRVTLDFADLRNAIFTDAIATRTRFYDTNIEGADFTGAVIDRYQVALMCDRASGVNSI 179
Query: 224 TGVSTRKSLGC 234
TGV+TR SLGC
Sbjct: 180 TGVATRDSLGC 190
>gi|119512324|ref|ZP_01631410.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119463037|gb|EAW43988.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 170
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 67/110 (60%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
FT A++R+SDF+ + G A AN ADLS +D L +ANLTNA+L
Sbjct: 60 FTKANLRQSDFNHANLRGVSFFAANLESANLESADLSFATLDSARLIKANLTNAILEGAF 119
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ + GAII+GADF+D ++ +++ LC+ A GTNP+TG +TR +L C
Sbjct: 120 ASNARFDGAIIDGADFTDILLRQDEEKKLCQLAKGTNPVTGRNTRDTLFC 169
>gi|425436672|ref|ZP_18817106.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|425449430|ref|ZP_18829270.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|425458879|ref|ZP_18838365.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440755734|ref|ZP_20934936.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389678572|emb|CCH92580.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|389763888|emb|CCI09674.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|389823689|emb|CCI27950.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440175940|gb|ELP55309.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 169
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ A + + +L +N + A FT+ ++++S+FS + GA A + NF GAD
Sbjct: 33 GANASYENQNLTGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGAD 92
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L++ L ++L++A+ ++ R+ G I GADFS AV+D Q + LC+ A G
Sbjct: 93 LTNGLAYLSTFKNSDLSDAIFSEAIMLRTIFEGVNINGADFSFAVLDAQQIKNLCERAEG 152
Query: 220 TNPITGVSTRKSLGC 234
N TG+ST +SLGC
Sbjct: 153 VNSKTGISTPESLGC 167
>gi|427715923|ref|YP_007063917.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348359|gb|AFY31083.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 169
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 73/132 (55%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F DL + K N R++NF++A++ SG F A LE A N GA+L++
Sbjct: 47 ADFSGRDLTDSSFTKANLRQSNFSNANL-----SGVSFFAANLESA-----NLQGANLTN 96
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D + NLTNAVL + GAII+GADF+D ++ +++ LCK A GTNP
Sbjct: 97 ATLDSARFIKTNLTNAVLEGAFAANAKFDGAIIDGADFTDVLLRQDEQKKLCKVAKGTNP 156
Query: 223 ITGVSTRKSLGC 234
TG TR +L C
Sbjct: 157 TTGRDTRDTLFC 168
>gi|443326265|ref|ZP_21054925.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442794122|gb|ELS03549.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 172
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 67/118 (56%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+N A F ++ SDFS S G A +ANF A+L ++ L++ANLT
Sbjct: 54 HQNLTDATFDHTNLIGSDFSDSNLFGVRFFAANLREANFANANLKFADLEAARLSDANLT 113
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NAVL LT + L G IIEGADFS A++D ++ LC A GTNP TG +TR +L C
Sbjct: 114 NAVLAGAYLTNALLDGVIIEGADFSGALLDRNDEKMLCDIATGTNPTTGRNTRDTLFC 171
>gi|33862830|ref|NP_894390.1| hypothetical protein PMT0557 [Prochlorococcus marinus str. MIT
9313]
gi|33634746|emb|CAE20732.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 198
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 77/149 (51%), Gaps = 25/149 (16%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYL 145
EF G A + S D+ ++NF +A+ D+ E+D G+ FN GA L
Sbjct: 63 EFRGGQAIEEISKDMHGRDLKEQNFLKADLRGVDLSEADLRGAVFNSSQLQEADLQGADL 122
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
E VA+ + F GADL AN TNA+L++ S A+IEGADFS+AV+
Sbjct: 123 ENVVAFASRFDGADLRG----------ANFTNAMLMQ-----SQFKDALIEGADFSNAVL 167
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGC 234
D Q+ LC A+GTN +G T SLGC
Sbjct: 168 DRRQQNELCARADGTNAASGSQTLDSLGC 196
>gi|443312459|ref|ZP_21042076.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442777437|gb|ELR87713.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 167
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 70/126 (55%), Gaps = 5/126 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
D V +F +AN ++++ SD +G F A LE A N GA+L++ +D
Sbjct: 46 DFSGQVLTDASFTKANLRNSNLSHSDLTGVSFFAANLESA-----NLEGANLTNATLDAA 100
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ + NLTNAVL + GAII+GADF+D ++ ++ LCK A GTNP TG T
Sbjct: 101 RIIKTNLTNAVLTGAFAANAKFDGAIIDGADFTDVLLRQDEQDKLCKVAQGTNPTTGKQT 160
Query: 229 RKSLGC 234
R++L C
Sbjct: 161 RETLMC 166
>gi|159903526|ref|YP_001550870.1| hypothetical protein P9211_09851 [Prochlorococcus marinus str. MIT
9211]
gi|159888702|gb|ABX08916.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 169
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 20/152 (13%)
Query: 88 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNG 142
K E R + + + + DL VK + R NF +D+ S+ + ++FNG
Sbjct: 33 KRPPEIRNQDDLNISQDMHAQDLSGREFVKFDLRGINFKDSDLSGAVFNNSNLTNAQFNG 92
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A + ++AY NF DLSD ANLTNA+L+ + + I+GADF+D
Sbjct: 93 ADMHDSLAYATNFENTDLSD----------ANLTNALLMESTFVNTK-----IDGADFTD 137
Query: 203 AVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV+ Q++ LC A+GTN TG+ T SLGC
Sbjct: 138 AVLSRIQQKQLCSIASGTNSNTGIDTEYSLGC 169
>gi|428225171|ref|YP_007109268.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427985072|gb|AFY66216.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 170
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +AN S+++ ++ G F GA LE A N GA L+ +D L +ANLTNA
Sbjct: 59 NFTKANMRSSNLSRANLQGVSFFGANLESA-----NLEGAQLNYATLDSARLVKANLTNA 113
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L T + GA IEGADF+DA++ + + LC+ A+G NP TG +TR+SL C
Sbjct: 114 ILEGTYAFNAKFAGATIEGADFTDALLRDDEIEHLCEVASGVNPTTGRATRESLMC 169
>gi|119486074|ref|ZP_01620136.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
gi|119456849|gb|EAW37977.1| hypothetical protein L8106_06120 [Lyngbya sp. PCC 8106]
Length = 161
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 68/112 (60%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A F ++++ ++F S+ G+ KA+ AN GADL+ ++D++ + A+L+N++
Sbjct: 49 AEFANSNLESANFDHSQLVGSVFSKAMMKNANMRGADLTYAMLDQVDFSNADLSNSIFTE 108
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ S I GADF+DA++D Q + LC A+G NP TGVSTR SLGC
Sbjct: 109 VLFFGSTFKDTKITGADFTDALLDGEQLRQLCITASGVNPKTGVSTRYSLGC 160
>gi|425455123|ref|ZP_18834848.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9807]
gi|389804043|emb|CCI17099.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9807]
Length = 161
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 69/128 (53%), Gaps = 10/128 (7%)
Query: 117 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 169 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG
Sbjct: 93 SARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGR 152
Query: 227 STRKSLGC 234
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|425470227|ref|ZP_18849097.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9701]
gi|389884202|emb|CCI35462.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9701]
Length = 161
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 70/128 (54%), Gaps = 10/128 (7%)
Query: 117 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 169 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
L +AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG
Sbjct: 93 SARLTKANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPITGR 152
Query: 227 STRKSLGC 234
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|148242344|ref|YP_001227501.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850654|emb|CAK28148.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
Length = 164
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 79/149 (53%), Gaps = 18/149 (12%)
Query: 100 GSAAQFGSADLRKAVHVKE--------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
G+AA + +LR A +++ N ++ F D+ +DFS S G
Sbjct: 21 GAAAAITAPELRGAKSMQDLSSDMHGRNLQQKEFLKMDLEGTDFSDSDLRGTVFNTTQLQ 80
Query: 152 KANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+NF+GADL D + DR L++A L N +L+++ T A I+GADF++AV+D
Sbjct: 81 DSNFSGADLRDVVAFSSRFDRADLSQARLDNGMLLQSKFT-----DATIDGADFTNAVLD 135
Query: 207 LAQKQALCKYANGTNPITGVSTRKSLGCG 235
L Q + LC A G N +G+ST SLGCG
Sbjct: 136 LPQIKQLCARATGVNERSGLSTADSLGCG 164
>gi|425440692|ref|ZP_18820990.1| Pentapeptide repeat family protein (modular protein) [Microcystis
aeruginosa PCC 9717]
gi|389718807|emb|CCH97279.1| Pentapeptide repeat family protein (modular protein) [Microcystis
aeruginosa PCC 9717]
Length = 213
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 69/126 (54%), Gaps = 10/126 (7%)
Query: 119 NFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM----- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 87 NLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESA 146
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG +T
Sbjct: 147 RLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGRNT 206
Query: 229 RKSLGC 234
R +L C
Sbjct: 207 RDTLFC 212
>gi|443663881|ref|ZP_21133269.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443331763|gb|ELS46407.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 150
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 70/131 (53%), Gaps = 10/131 (7%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
FG DLR + N R +NF+ A++ G +F A LE A AN DL
Sbjct: 29 DFGGQDLRDSTFDHSNLRASNFSHANLE-----GVRFFSANLEGADFSDANMRNVDLESA 83
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+ R AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPI
Sbjct: 84 RLTR-----ANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIAKGTNPI 138
Query: 224 TGVSTRKSLGC 234
TG +TR +L C
Sbjct: 139 TGRNTRDTLFC 149
>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 171
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 71/118 (60%), Gaps = 10/118 (8%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K N R +NFT+AD+R G F A +E+A NF GA+L+ +D + +ANLT
Sbjct: 62 KANLRNSNFTNADLR-----GVSFFAANMEEA-----NFEGANLTGATLDLARMMKANLT 111
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA+L + L GA+I+GADF++ ++ + LCK A GTNP+TG TR++L C
Sbjct: 112 NAILEGAFAYNTRLEGAVIDGADFTETLLRDDMIEKLCKVAKGTNPVTGRDTRETLFC 169
>gi|172037018|ref|YP_001803519.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354555787|ref|ZP_08975086.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171698472|gb|ACB51453.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353552111|gb|EHC21508.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 167
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 73/130 (56%), Gaps = 10/130 (7%)
Query: 115 HVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 164
+ K+N +F+S D+R+SDF G F+ A L+ + ANF GADL
Sbjct: 37 YAKQNLVERDFSSQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ L N TNA+L T + GA+I+GADF+D ++ L ++ LC+ A GTNPIT
Sbjct: 97 LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCEIAKGTNPIT 156
Query: 225 GVSTRKSLGC 234
G +T+ +L C
Sbjct: 157 GRNTKDTLFC 166
>gi|422302957|ref|ZP_16390315.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9806]
gi|389792132|emb|CCI12113.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9806]
Length = 161
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/128 (41%), Positives = 70/128 (54%), Gaps = 10/128 (7%)
Query: 117 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 169 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
L +AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG
Sbjct: 93 SARLTKANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGR 152
Query: 227 STRKSLGC 234
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|425434011|ref|ZP_18814483.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9432]
gi|425451971|ref|ZP_18831790.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
7941]
gi|440753099|ref|ZP_20932302.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389678210|emb|CCH92885.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9432]
gi|389766463|emb|CCI07918.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
7941]
gi|440177592|gb|ELP56865.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 161
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 69/128 (53%), Gaps = 10/128 (7%)
Query: 117 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFGGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 169 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNPITG
Sbjct: 93 SARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCEIATGTNPITGR 152
Query: 227 STRKSLGC 234
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|78184858|ref|YP_377293.1| hypothetical protein Syncc9902_1285 [Synechococcus sp. CC9902]
gi|78169152|gb|ABB26249.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 162
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 65/112 (58%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A F +++RE+D SGS GA L A A+ + DL + +D V+ NL+NAV+
Sbjct: 50 ATFNLSNLREADLSGSDLRGASLYGAKLQDADLSDTDLREATLDSAVMTGTNLSNAVMEG 109
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ +I GADF+D + Q ++LC A+GTNP+TG STR+SLGC
Sbjct: 110 AFAFNTRFKDVVITGADFTDVPMRPDQLKSLCSVADGTNPVTGRSTRESLGC 161
>gi|254414183|ref|ZP_05027950.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196178858|gb|EDX73855.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 178
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 70/139 (50%), Gaps = 20/139 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 155
S F +L + K N + N ++ D+R +S + + GA ++AYK NF
Sbjct: 54 STMDFSGQNLAELEISKMNLTQTNLSNTDLRSVVISDSTMTDANLQGADFSYSIAYKVNF 113
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
GADLSD AVL +L S L I GADFS+AV+D Q Q+LC
Sbjct: 114 KGADLSD---------------AVLEEAILLGSRLDDVNITGADFSNAVLDRVQVQSLCT 158
Query: 216 YANGTNPITGVSTRKSLGC 234
A+G N TGV TR+SLGC
Sbjct: 159 KASGVNSKTGVETRESLGC 177
>gi|291566844|dbj|BAI89116.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 174
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 57 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRFATLDTARLVRANLTN 116
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 117 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 173
>gi|390440388|ref|ZP_10228721.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
gi|389836192|emb|CCI32847.1| Pentapeptide repeat family protein [Microcystis sp. T1-4]
Length = 161
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 124 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 173
+F D+R+S F GS F+ A LE + AN GAD SD M + L A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLESARLTRA 99
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 234 C 234
C
Sbjct: 160 C 160
>gi|409992571|ref|ZP_11275753.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409936565|gb|EKN78047.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 149
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 32 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTN 91
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 92 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 148
>gi|67922307|ref|ZP_00515820.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67855883|gb|EAM51129.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 164
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
F DLRK A F A++R+S+FS + G A ANF GADL
Sbjct: 41 VDFSGQDLRK---------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRY 91
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
++ L + N TNA+L T + GAII+GADF+D ++D ++ LC A GTNP
Sbjct: 92 ADLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNP 151
Query: 223 ITGVSTRKSLGC 234
ITG +T+ +L C
Sbjct: 152 ITGRNTKDTLYC 163
>gi|425463375|ref|ZP_18842714.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9809]
gi|389833543|emb|CCI21857.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9809]
Length = 161
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 69/121 (57%), Gaps = 10/121 (8%)
Query: 124 NFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 173
+F D+R+S F GS F+ A LE + AN GAD SD M + L +A
Sbjct: 40 DFAGQDLRDSTFDHSNLRGSNFSRANLEGVRFFSANLEGADFSDANMRNVDLESARLTKA 99
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
N TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG +TR +L
Sbjct: 100 NFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGRNTRDTLF 159
Query: 234 C 234
C
Sbjct: 160 C 160
>gi|416389980|ref|ZP_11685429.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
gi|357264135|gb|EHJ13061.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
Length = 164
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
F DLRK A F A++R+S+FS + G A ANF GADL
Sbjct: 41 VDFSGQDLRK---------EALFDHANLRDSNFSNANVQGVRFFSANLDSANFEGADLRY 91
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
++ L + N TNA+L T + GAII+GADF+D ++D ++ LC A GTNP
Sbjct: 92 ADLEVARLTKVNFTNAILEGAFATNILVQGAIIDGADFTDVLLDPKTEKYLCTIATGTNP 151
Query: 223 ITGVSTRKSLGC 234
ITG +T+ +L C
Sbjct: 152 ITGRNTKDTLYC 163
>gi|428305184|ref|YP_007142009.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428246719|gb|AFZ12499.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 169
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 67/130 (51%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F L A K N R NF+ AD+R G GA LE N GA+LS+
Sbjct: 49 FSGRVLTDATFTKANLRNCNFSHADLR-----GVSLFGANLELV-----NLEGANLSNAT 98
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D +ANLTNAVL + GAII+GADF+D ++ ++ LCK A GTNP T
Sbjct: 99 LDTAKFTKANLTNAVLEGAFAFNAKFDGAIIDGADFTDVLVRQDVQKQLCKIATGTNPTT 158
Query: 225 GVSTRKSLGC 234
G TR +L C
Sbjct: 159 GRETRDTLLC 168
>gi|425447360|ref|ZP_18827349.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9443]
gi|389732098|emb|CCI03919.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9443]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 68/128 (53%), Gaps = 10/128 (7%)
Query: 117 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFAGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 169 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
L AN TNAVL T + GAII+GADF+DA+I + LC+ A GTNPITG
Sbjct: 93 SARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEIYLCEIAKGTNPITGR 152
Query: 227 STRKSLGC 234
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|428310976|ref|YP_007121953.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428252588|gb|AFZ18547.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 167
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 65/121 (53%), Gaps = 20/121 (16%)
Query: 119 NFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
N + NF +AD+R FS S +GA +AY ++FTGADLSD
Sbjct: 61 NLEQTNFNNADLRNVVFSSSTLKQASLHGADFTSGIAYLSDFTGADLSD----------- 109
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
AVL ++ RS A I GADF+DAV+D Q + LC A G N TG++TR+SLG
Sbjct: 110 ----AVLTEAIMLRSRFDEADITGADFTDAVLDGVQIKKLCARATGVNSKTGMATRESLG 165
Query: 234 C 234
C
Sbjct: 166 C 166
>gi|434386960|ref|YP_007097571.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428017950|gb|AFY94044.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 168
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 69/133 (51%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A F A L A ++ FT A +R F + G L A+ TGA+L+
Sbjct: 35 ADDFTKATLENADFSGKDLTSYEFTQASVRNGKFINANLTGVSLIGGNFDSADMTGANLT 94
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+ L+D N TNA+LV + ++ GAII+GADF+D ++ ++ LCK A GTN
Sbjct: 95 NALLDTARFTRTNFTNAILVGAFTSVTNFDGAIIDGADFTDVLLRKDIQKKLCKVAKGTN 154
Query: 222 PITGVSTRKSLGC 234
P TG TR+SL C
Sbjct: 155 PTTGRDTRESLEC 167
>gi|427420100|ref|ZP_18910283.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425762813|gb|EKV03666.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 165
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 66/133 (49%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A + LR+ ++ R N+TS DM E+D S + G L KAN AD+S
Sbjct: 32 AKNYDRQSLRQQSFAGQDLRGNNYTSTDMAEADLSNTDLRGVRLFDTNLTKANLESADMS 91
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+D ANL NA+ +D A IEGADF+D +D+ LC+ A G N
Sbjct: 92 GATLDGARFIRANLKNAIFEGAYAFSTDFRKANIEGADFTDVDLDVKTNDMLCEVATGVN 151
Query: 222 PITGVSTRKSLGC 234
P+TG +T+ +L C
Sbjct: 152 PVTGRATKDTLYC 164
>gi|423066922|ref|ZP_17055712.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711687|gb|EKD06887.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 137
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 20 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDVNLESADLRLATLDTARLVRANLTN 79
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 80 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136
>gi|425458741|ref|ZP_18838229.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9808]
gi|389824728|emb|CCI26060.1| Pentapeptide repeat family protein [Microcystis aeruginosa PCC
9808]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/128 (41%), Positives = 69/128 (53%), Gaps = 10/128 (7%)
Query: 117 KENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTGADLSDTLMDRM--- 168
N +F D+R+S F GS F+ A LE + AN GAD SD M +
Sbjct: 33 NRNLTDNDFGGQDLRDSTFDHSNLRGSNFSHANLEGVRFFSANLEGADFSDANMRNVDLE 92
Query: 169 --VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
L AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+TG
Sbjct: 93 SARLTRANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVTGR 152
Query: 227 STRKSLGC 234
+TR +L C
Sbjct: 153 NTRDTLFC 160
>gi|318041291|ref|ZP_07973247.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 161
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
D+ K V + +F R A F ++RE+DF GS GA L A AN +G DL+D
Sbjct: 30 DVAKQVLIGHDFAGMDLRGATFNLTNLREADFHGSDLRGASLFGAKLQDANLSGTDLTDA 89
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+D VL+ +L NAVL + +IEGADF++ + LC A+GTNP+
Sbjct: 90 TLDSAVLDGTDLRNAVLENAFAFNTRFNNVLIEGADFTNVPFRGDVLKTLCASASGTNPV 149
Query: 224 TGVSTRKSLGC 234
TG +TR +L C
Sbjct: 150 TGRNTRDTLEC 160
>gi|434404813|ref|YP_007147698.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428259068|gb|AFZ25018.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 172
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 64/110 (58%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F A++R+S+ S + NG A AN GADL ++ +D L ANLTNA+L
Sbjct: 62 FAKANLRQSNLSHTNLNGVSFFAANLESANLEGADLRNSTLDSARLVRANLTNALLEGAF 121
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ GAII+GADF+D ++ +++ LCK A GTNP+T TR +L C
Sbjct: 122 AANARFDGAIIDGADFTDMLLRQDEQKKLCKLAKGTNPVTLRDTRDTLFC 171
>gi|116074641|ref|ZP_01471902.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
gi|116067863|gb|EAU73616.1| hypothetical protein RS9916_28944 [Synechococcus sp. RS9916]
Length = 158
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 59/110 (53%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F ++RE+DFSGS GA L A AN T +L D +D VL+ NLTNAVL
Sbjct: 49 FNLTNLREADFSGSDLQGASLYGAKLQDANLTDTNLRDATLDSAVLDGTNLTNAVLEDAF 108
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ II GADF++ + LC A GTNP+TG TR +LGC
Sbjct: 109 AFNTRFSNVIITGADFTNVPFRGDALKTLCAAAEGTNPVTGRDTRDTLGC 158
>gi|218245449|ref|YP_002370820.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|257058486|ref|YP_003136374.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|218165927|gb|ACK64664.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
gi|256588652|gb|ACU99538.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 168
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 71/118 (60%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+++ + F +AD+ E++FS S GA +ANF GA+L++ L +A+L+
Sbjct: 50 RQDLKEVKFANADLTEANFSDSDLRGAVFNGVELKQANFHGANLTNGLAYLSSFRDADLS 109
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+A+L ++ R+ A I GADF+ AV+D + LC+ A+G N TG+STR+SLGC
Sbjct: 110 DAILSEVIMLRTVFDNANITGADFTLAVLDGEEVAKLCQRADGVNSKTGMSTRESLGC 167
>gi|209527449|ref|ZP_03275954.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376003366|ref|ZP_09781178.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
gi|209492122|gb|EDZ92472.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375328288|emb|CCE16931.1| pentapeptide repeat-containing protein [Arthrospira sp. PCC 8005]
Length = 137
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 67/117 (57%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + + F A+++ S+FS + G L A N ADL +D L ANLTN
Sbjct: 20 QDLKDSEFDFANLQGSNFSHTDLRGVSLFGAKMQDINLESADLRLATLDTARLVRANLTN 79
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L +D GAII GADF+D ++ Q+Q LC+ A+GTNP+TG TR++L C
Sbjct: 80 ALLEEAYAYNADFRGAIITGADFTDVMLRRDQQQLLCEVADGTNPVTGRDTRETLYC 136
>gi|443475471|ref|ZP_21065420.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019714|gb|ELS33767.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 164
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 90/189 (47%), Gaps = 37/189 (19%)
Query: 57 LKNWRVFVSTALAAAV------VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 110
+K W++ S L+ V V + SS+ + LN E F +L
Sbjct: 1 MKYWQLITSIVLSIFVFLMPLPVQAASSSSVTRSILNAVGGE-----------DFSGKNL 49
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLM 165
+A + ANFT+AD+R + F+G +GA L +AY + F DLSD
Sbjct: 50 IRAEFTSVTLKNANFTNADLRGAIFNGVLLDGANLHGSDFSSGIAYISRFKNVDLSDA-- 107
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 225
VLN+ N+ RS + GADF++A++D+ Q + LC A+GTN TG
Sbjct: 108 ---VLNDTNML----------RSTFDNVEVTGADFTNALLDIQQLKKLCINASGTNSKTG 154
Query: 226 VSTRKSLGC 234
VSTR+SLGC
Sbjct: 155 VSTRESLGC 163
>gi|166364098|ref|YP_001656371.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
NIES-843]
gi|166086471|dbj|BAG01179.1| pentapeptide repeat family protein [Microcystis aeruginosa
NIES-843]
Length = 161
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 74/130 (56%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DLR + N R +NF+ A++ G +F A LE A NF+ A++ +
Sbjct: 41 FAGQDLRDSTFDHSNLRGSNFSRANL-----EGVRFFSANLEGA-----NFSDANMRNVD 90
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ L +AN TNAVL T + GAII+GADF+DA+I ++ LC+ A GTNP+T
Sbjct: 91 LESARLTKANFTNAVLEGAFATNILIKGAIIDGADFTDAIIRSDVEKYLCERATGTNPVT 150
Query: 225 GVSTRKSLGC 234
G +TR +L C
Sbjct: 151 GRNTRDTLFC 160
>gi|126696874|ref|YP_001091760.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9301]
gi|126543917|gb|ABO18159.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
Length = 186
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 71/128 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
S D+ K + + D+ D + GAY+ A ++F GA+++D +
Sbjct: 46 SVDVLKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAY 105
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
+ A+ T+A L L +S GAII+GADF+DA +DL+Q+++LC+ A+GTN TGV
Sbjct: 106 ATRFDNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNSQTGV 165
Query: 227 STRKSLGC 234
+T SL C
Sbjct: 166 NTIDSLEC 173
>gi|123969083|ref|YP_001009941.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
gi|123199193|gb|ABM70834.1| Pentapeptide repeats [Prochlorococcus marinus str. AS9601]
Length = 186
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 71/128 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
S D+ K + + D+ D + GAY+ A ++F GA+++D +
Sbjct: 46 SVDVLKDDLHGADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMTDLIAY 105
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
+ A+ T+A L L +S GAII+GADF+DA +DL+Q+++LC+ A+GTN TGV
Sbjct: 106 ATRFDNADFTDANLTNGELMKSVFDGAIIDGADFTDANLDLSQRKSLCERASGTNTKTGV 165
Query: 227 STRKSLGC 234
+T SL C
Sbjct: 166 NTIDSLEC 173
>gi|434397761|ref|YP_007131765.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428268858|gb|AFZ34799.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 166
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 67/130 (51%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F D R +N + +F D+ ++FS + GA + AN G D S
Sbjct: 36 FSEVDFRSKDFSGKNLQSIDFAKVDLESANFSNADLRGAVFNASNLANANLQGVDFSYGF 95
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+ A+LT+A+ T+L+ S GA I+ ADF+ AV++ Q + LC A+G NP T
Sbjct: 96 AYLTNFDGADLTDAIFQETILSFSTFEGAKIKNADFTFAVLEKWQVKQLCANASGVNPKT 155
Query: 225 GVSTRKSLGC 234
GV TR+SLGC
Sbjct: 156 GVDTRESLGC 165
>gi|443320013|ref|ZP_21049146.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442790267|gb|ELR99867.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 164
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 65/131 (49%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+F + DLR +N + FT ++ +F+ + G +AN G D S
Sbjct: 33 RFDNRDLRGESFANQNLQTVEFTKVKLQGVNFANADLIGVVFNSTALDQANLQGVDFSQG 92
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+ + +L +A+LV +L RS I GADFS AV+D Q LC YA+G N
Sbjct: 93 IAYLTSFDGVDLRDALLVEALLLRSTFKDTKISGADFSSAVLDQDQLDKLCSYADGVNSK 152
Query: 224 TGVSTRKSLGC 234
TGV TR+SLGC
Sbjct: 153 TGVKTRESLGC 163
>gi|359460819|ref|ZP_09249382.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 164
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 70/127 (55%), Gaps = 10/127 (7%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN------ 171
E+ +F+ D+RE++FS ++ GA +A F G DL+ + + +
Sbjct: 37 EDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTGGMAYL 96
Query: 172 ----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 227
EA+L+ A+L +L +S L A + ADFS AVID Q + LC+ A+G NP+TGV
Sbjct: 97 SSFAEADLSGAILTEAMLLQSSLRNATVTDADFSFAVIDKDQVKILCETASGVNPVTGVD 156
Query: 228 TRKSLGC 234
TR SLGC
Sbjct: 157 TRDSLGC 163
>gi|428220990|ref|YP_007105160.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994330|gb|AFY73025.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 165
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 62/110 (56%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F D+ ++F + G L A A+FTGADL + +D +N ANLTNAVL
Sbjct: 54 FNKTDLHNANFRNANLAGVSLFGANMTAADFTGADLRYSTLDTARMNGANLTNAVLEGAF 113
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ + G +I+GADFSD + + LCK A GTNP+TG TR++L C
Sbjct: 114 VYGTSFVGTVIDGADFSDVDLRNTTRSLLCKVAKGTNPVTGRDTRETLEC 163
>gi|254431831|ref|ZP_05045534.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
gi|197626284|gb|EDY38843.1| pentapeptide repeat protein [Cyanobium sp. PCC 7001]
Length = 174
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 64/117 (54%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R F ++R++D SGS GA L A A+ + +L +T +D V N +LTN
Sbjct: 57 QDLRGGTFNLTNLRDADLSGSDLQGASLFGAKLQDADLSNTNLRETTLDSAVFNGTDLTN 116
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL + II+GADF++ + +ALC A GTNP+TG TR +LGC
Sbjct: 117 AVLEDAFAFNTKFSDVIIDGADFTNVPLRGDALKALCAVARGTNPVTGRQTRDTLGC 173
>gi|33861334|ref|NP_892895.1| hypothetical protein PMM0777 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33633911|emb|CAE19236.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 170
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 74/140 (52%), Gaps = 29/140 (20%)
Query: 109 DLRKAVHVKE----NFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKAN 154
DL + +H ++ F + N D +S+ G+ FN GA L A+AY +
Sbjct: 46 DLEEDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATMTGANLSDALAYATD 105
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
FT ADLSD N TNA+L+ S+ GA I+GADF++AV+ Q++ LC
Sbjct: 106 FTDADLSD----------VNFTNALLME-----SNFEGAKIDGADFTNAVLSRIQQKELC 150
Query: 215 KYANGTNPITGVSTRKSLGC 234
+ ANGTN TG ST SLGC
Sbjct: 151 EIANGTNSSTGESTEYSLGC 170
>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 169
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 64/112 (57%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A FT A++R S+FS + G A ANF GA+L +D + + NLTNA+L
Sbjct: 57 AQFTKANLRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLTNAILEG 116
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ AI++GADF+D +I + LCK A GTNP+TG +TR++L C
Sbjct: 117 AFAYNTKFERAIVDGADFTDILIRDDMVEKLCKVARGTNPVTGRNTRETLFC 168
>gi|123966041|ref|YP_001011122.1| hypothetical protein P9515_08061 [Prochlorococcus marinus str. MIT
9515]
gi|123200407|gb|ABM72015.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 170
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/148 (37%), Positives = 76/148 (51%), Gaps = 20/148 (13%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLE 146
E R + + DL VK N +F+ +++ + F+ SK NGA L
Sbjct: 38 EIRNQQDLDLEQDMHGQDLSGNEFVKFNLNGFDFSQSNLEGAVFNNSKLQNATLNGANLT 97
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A+AY +FT ADLSD N TNA+L+ S+ GA I+GADF++AV+
Sbjct: 98 DALAYATDFTDADLSD----------VNFTNALLME-----SNFEGAKIDGADFTNAVLS 142
Query: 207 LAQKQALCKYANGTNPITGVSTRKSLGC 234
Q++ LC ANGTN TG ST SLGC
Sbjct: 143 RIQQKELCAIANGTNSSTGESTEYSLGC 170
>gi|428313239|ref|YP_007124216.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254851|gb|AFZ20810.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 169
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 63/112 (56%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
++FT A++R S+FS S G A ANF GA+L + +D L A+L NAVL
Sbjct: 57 SSFTKANLRSSNFSHSNLEGVSFFSANLESANFEGANLRNATLDTARLTRASLKNAVLEG 116
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ GA IEGADF++ + ++ LC A+GTNP TG STR +L C
Sbjct: 117 AFAFNTKFDGATIEGADFTEVLFRQDVQKQLCHVASGTNPTTGRSTRDTLFC 168
>gi|158337467|ref|YP_001518642.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307708|gb|ABW29325.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 164
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 70/127 (55%), Gaps = 10/127 (7%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN------ 171
E+ +F+ D+RE++FS ++ GA +A F G DL+ + + +
Sbjct: 37 EDIVTQDFSGQDLREAEFSNNQLAGANFSEADLTAVVFNGVDLTGASLKNVDMTGGMAYL 96
Query: 172 ----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 227
EA+L+ A+L +L +S L A + ADFS AVID Q + LC+ A+G NP+TGV
Sbjct: 97 SSFAEADLSGAILTEAMLLQSSLRDATVTDADFSFAVIDKDQVKILCETASGVNPVTGVD 156
Query: 228 TRKSLGC 234
TR SLGC
Sbjct: 157 TRDSLGC 163
>gi|113475775|ref|YP_721836.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110166823|gb|ABG51363.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 165
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 63/110 (57%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F A M +++F G+ L K +AN T AD + T DR+ ++++LTNA+ +
Sbjct: 55 FAGATMWKANFQGANLQNTILTKGDFLRANLTEADFTGTFADRVSFDKSDLTNAIFTDAM 114
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L S A + G DFS A++D Q + +C+ A+G N TGV TR+SLGC
Sbjct: 115 LMSSTFRDATVIGTDFSGAMVDRYQIKLMCETASGKNKTTGVETRESLGC 164
>gi|81300649|ref|YP_400857.1| hypothetical protein Synpcc7942_1840 [Synechococcus elongatus PCC
7942]
gi|81169530|gb|ABB57870.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 168
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 70/130 (53%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A++ + + ++ +A F S ++ F G+ GA +ANF AD +D +
Sbjct: 37 FDDAEVTRQDYSGQSLIQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGI 96
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
L N NA L +L +S+L G+ + GADFS AV+ Q ALC+ A+GTNP T
Sbjct: 97 AYVSDLRNVNFRNANLTSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKT 156
Query: 225 GVSTRKSLGC 234
G TR+SLGC
Sbjct: 157 GADTRESLGC 166
>gi|56752263|ref|YP_172964.1| hypothetical protein syc2254_d [Synechococcus elongatus PCC 6301]
gi|24251237|gb|AAN46157.1| unknown protein [Synechococcus elongatus PCC 7942]
gi|56687222|dbj|BAD80444.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 171
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 70/130 (53%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A++ + + ++ +A F S ++ F G+ GA +ANF AD +D +
Sbjct: 40 FDDAEVTRQDYSGQSLIQAEFASVRLKGVSFRGADLRGAVFNGVDLREANFEDADFTDGI 99
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
L N NA L +L +S+L G+ + GADFS AV+ Q ALC+ A+GTNP T
Sbjct: 100 AYVSDLRNVNFRNANLTSAMLLQSELQGSDVTGADFSFAVLSKQQITALCETASGTNPKT 159
Query: 225 GVSTRKSLGC 234
G TR+SLGC
Sbjct: 160 GADTRESLGC 169
>gi|113474577|ref|YP_720638.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110165625|gb|ABG50165.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 144
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DL K R++NF++A++ SG GA+LE A N GA+LS +
Sbjct: 24 FSGKDLTNDSFTKSILRKSNFSNANL-----SGVSLFGAHLEGA-----NLEGANLSYST 73
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D V N+ANLTNA+L + AII+GADF+DA + + LCK A G N IT
Sbjct: 74 LDDAVFNKANLTNAILEGAFAFHTQFRDAIIDGADFTDAFLRKDTTKDLCKIAQGKNSIT 133
Query: 225 GVSTRKSLGC 234
G TR +L C
Sbjct: 134 GKETRDTLFC 143
>gi|428771687|ref|YP_007163477.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428685966|gb|AFZ55433.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 159
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 65/115 (56%), Gaps = 5/115 (4%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F + N SA++ +SD G F GA ++ N GA+L+++++D L ANL NAV
Sbjct: 49 FNKTNLRSANLSQSDLQGVSFFGANMDSI-----NLEGANLTNSILDSARLTRANLRNAV 103
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L T + GA IEGADF+D ++ ++ LC+ A G NP TG TR +L C
Sbjct: 104 LEGAFATNTKFEGANIEGADFTDVILRPDVEEMLCEKAKGVNPTTGRKTRDTLYC 158
>gi|78212716|ref|YP_381495.1| hypothetical protein Syncc9605_1185 [Synechococcus sp. CC9605]
gi|78197175|gb|ABB34940.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 165
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 68/120 (56%)
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
+ ++ R A F +++RE++ SGS GA L A A+ +G DL + +D V+ N
Sbjct: 45 YSNKDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTGTN 104
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L +AVL + +I GADF+D + Q ++LC A+GTN +TG STR+SLGC
Sbjct: 105 LEDAVLEGAFAFNTRFSDVLITGADFTDVPMRGDQLKSLCAVADGTNSVTGRSTRESLGC 164
>gi|428215647|ref|YP_007088791.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004028|gb|AFY84871.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 183
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 71/133 (53%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A + +L A + +A+F A++R+S+ S + GA L A AN GA+LS
Sbjct: 50 AQNYNKENLLGADFSGRDLTQASFNHANLRKSNLSHANLQGASLFAAHLEDANLEGANLS 109
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+T +D NL NA+L + + GA IEGADF+D + + LC+ A GTN
Sbjct: 110 NTTLDTARFIRTNLKNAILEGSFAFSAKFNGANIEGADFTDVFLRDDANEILCELATGTN 169
Query: 222 PITGVSTRKSLGC 234
P+TG +TR +L C
Sbjct: 170 PVTGRNTRDTLYC 182
>gi|428780675|ref|YP_007172461.1| low-complexity protein [Dactylococcopsis salina PCC 8305]
gi|428694954|gb|AFZ51104.1| putative low-complexity protein [Dactylococcopsis salina PCC 8305]
Length = 167
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 77/147 (52%), Gaps = 7/147 (4%)
Query: 90 EAETRGEFGIGS--AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 147
EA+T F S +A DL AN T AD+ +D GS F + ++
Sbjct: 25 EAQTSTRFQRQSLISADLSEEDLSGETLQLREISDANLTGADLSNADLRGSIFTASVMKN 84
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
A + ANFT T+++ + A+L+ A+L +L+R+ L I GADF++AV+D
Sbjct: 85 ANLHGANFTF-----TVLNGVDFTNADLSQAILEDAILSRAILKDVDITGADFTNAVLDN 139
Query: 208 AQKQALCKYANGTNPITGVSTRKSLGC 234
Q LC+ A G N TGV+TR+SLGC
Sbjct: 140 QQYNQLCEMATGVNEETGVATRESLGC 166
>gi|427734374|ref|YP_007053918.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369415|gb|AFY53371.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 167
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 71/131 (54%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
D K + ++ +F ++FT A++R+S+FS S G AN A+L
Sbjct: 36 DYNKEILIEADFSGQDLTDSSFTKANLRDSNFSNSNLQGVRFFATNLESANLRNANLRYA 95
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+D L +A+LTNAVL + + GAII+GADF+D ++ ++ LCK A GTNP
Sbjct: 96 TLDSARLVKADLTNAVLEGAFASNARFDGAIIDGADFTDVLLRADEQDKLCKLAKGTNPT 155
Query: 224 TGVSTRKSLGC 234
TG TR +L C
Sbjct: 156 TGRDTRDTLFC 166
>gi|449018152|dbj|BAM81554.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 321
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 75/129 (58%), Gaps = 1/129 (0%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+L A K++ +F + +R+ DFSGS A A ANF A+LS ++
Sbjct: 193 NLEGANFAKQDLHGVSFQQSIVRDVDFSGSNLQDASFFDADCSGANFQNANLSRANLELA 252
Query: 169 VLNEANLTNAVLVRT-VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 227
L +A+L NA+L V+ ++ L G IEG+D++D ++ Q++ LCK A+G NP+T ++
Sbjct: 253 NLRKADLRNAILTNAYVVGQTKLEGIQIEGSDWTDVLLRPDQRRLLCKRASGENPVTHIA 312
Query: 228 TRKSLGCGN 236
T+ SLGC +
Sbjct: 313 TKDSLGCAD 321
>gi|218438527|ref|YP_002376856.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218171255|gb|ACK69988.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 172
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 64/117 (54%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R A F A++R S+FS G A ANF GA+L ++ L N TN
Sbjct: 54 QDLRDAKFDHANLRSSNFSNVNAEGVRFFAANLESANFEGANLRYADLESARLTRVNFTN 113
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL T + GAII+GADF+D ++ +Q LC A GTNP+TG +T+ +L C
Sbjct: 114 AVLEGAFATNTLFKGAIIDGADFTDVLLRPDTEQYLCTIAKGTNPVTGRNTKDTLYC 170
>gi|78779832|ref|YP_397944.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
gi|78713331|gb|ABB50508.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
Length = 186
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 75/153 (49%), Gaps = 20/153 (13%)
Query: 100 GSAAQFGSADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
G ADL+ VK ++ AN A M + + S F A ++ +AY
Sbjct: 49 GLKEDLHGADLQNNEFVKYDLSNQDLGEANLQGAYMSVTTAANSSFKSANMKDLIAYAVR 108
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
F ADLSD ANLTN L+++V GA I+GADF+DA +DL Q+++LC
Sbjct: 109 FDNADLSD----------ANLTNGELMKSVF-----DGATIDGADFTDATLDLPQRKSLC 153
Query: 215 KYANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 247
+ A GTN TGV T SL C R +P +
Sbjct: 154 ERATGTNSKTGVDTVDSLECSGLRGYIPATPEA 186
>gi|434395414|ref|YP_007130361.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428267255|gb|AFZ33201.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 168
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 63/110 (57%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F A++R S+FS + G L A AN GA+L++ +D L+ ANL +AVL
Sbjct: 58 FNHANLRNSNFSHANLEGVSLFAANLESANLEGANLTNATLDSARLSNANLKDAVLEGAF 117
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ AII+GADF+D ++ ++ LCK A GTNP TG TR++L C
Sbjct: 118 AANAKFDKAIIDGADFTDVLLRRDEQDKLCKVAKGTNPTTGRETRETLMC 167
>gi|123966744|ref|YP_001011825.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9515]
gi|123201110|gb|ABM72718.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
Length = 192
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 71/132 (53%), Gaps = 20/132 (15%)
Query: 108 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADL+ +VK ++ AN A M + S F GA ++ +AY F AD SD
Sbjct: 63 ADLQNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 123 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 167
Query: 223 ITGVSTRKSLGC 234
TGV T +SL C
Sbjct: 168 RTGVDTFESLEC 179
>gi|124025420|ref|YP_001014536.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL1A]
gi|123960488|gb|ABM75271.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
Length = 156
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 68/134 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA +G L + + + A F +D+++SDFSGS GA A AN + ++
Sbjct: 23 SALDYGKQTLIGSDFSNIDLKGATFYLSDLQDSDFSGSDLQGASFFDAKLENANLSNTNM 82
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
D MD +LN ANL+N+VL + IIEGADF+D +I + LC ANG
Sbjct: 83 RDVTMDAAILNGANLSNSVLEGAFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGI 142
Query: 221 NPITGVSTRKSLGC 234
N +T T +L C
Sbjct: 143 NSVTNKKTSDTLDC 156
>gi|422295781|gb|EKU23080.1| pentapeptide repeat protein [Nannochloropsis gaditana CCMP526]
Length = 217
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 70/117 (59%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++F + +F+ A + ++F G+K GA K+ +A+FTGADL+ + + +A L +
Sbjct: 100 KDFSKKDFSGAFAQRANFKGAKLMGARFYKSALTEADFTGADLTSASFEGANMVDAILKD 159
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A++ T + L IEGADFSD ++D ++ LC+ A GTNP T V TR+SL C
Sbjct: 160 AIVNNAYFTETVLKVGSIEGADFSDTLLDRFVQKKLCEKATGTNPKTKVDTRESLLC 216
>gi|113954335|ref|YP_730803.1| pentapeptide repeat-containing protein [Synechococcus sp. CC9311]
gi|113881686|gb|ABI46644.1| pentapeptide repeat protein [Synechococcus sp. CC9311]
Length = 157
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 68/135 (50%), Gaps = 5/135 (3%)
Query: 105 FGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F + D K V + +F + F ++RE+D SGS GA L A AN + ++
Sbjct: 22 FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNSN 81
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L D +D V + NLTNAVL + +EGADF++ + + LC A G
Sbjct: 82 LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRTDALKVLCANAEG 141
Query: 220 TNPITGVSTRKSLGC 234
NP+TG TR++LGC
Sbjct: 142 VNPVTGRDTRETLGC 156
>gi|157413912|ref|YP_001484778.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9215]
gi|157388487|gb|ABV51192.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
str. MIT 9215]
Length = 186
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 20/132 (15%)
Query: 108 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADL +VK ++ AN A M + S F GA ++ +AY F AD +D
Sbjct: 57 ADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFTD 116
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 117 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 161
Query: 223 ITGVSTRKSLGC 234
TGV+T SL C
Sbjct: 162 QTGVNTADSLEC 173
>gi|434392213|ref|YP_007127160.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428264054|gb|AFZ30000.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 165
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
+N + A F +AD+ ++FS + G A KAN GAD ++ + + ANL++
Sbjct: 49 QNLQTAEFANADLEAANFSNADLRGVVFNGAKLIKANLHGADFTNGIAYIVDFTGANLSD 108
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV+ ++ RS I GADF++AV+D + LC A+G N TGV+TR SLGC
Sbjct: 109 AVMEEAMMLRSIFNDVDITGADFTNAVLDRTVVKKLCAQASGVNSKTGVATRDSLGC 165
>gi|72382023|ref|YP_291378.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|124025522|ref|YP_001014638.1| hypothetical protein NATL1_08151 [Prochlorococcus marinus str.
NATL1A]
gi|72001873|gb|AAZ57675.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
gi|123960590|gb|ABM75373.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 170
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 63/116 (54%), Gaps = 15/116 (12%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +N T A S +G+ +GA L A+AY ++F GADL D + +L E+N T+A
Sbjct: 70 NFSESNLTGAVFNNSKLNGADLHGAQLNDALAYASDFEGADLRDVDFNGALLMESNFTDA 129
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+IEGADF+DAVI Q++ LC A+GTN T T SLGC
Sbjct: 130 ---------------LIEGADFTDAVISRIQQKELCNMASGTNSKTDEDTSYSLGC 170
>gi|254526458|ref|ZP_05138510.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
gi|221537882|gb|EEE40335.1| pentapeptide repeat protein [Prochlorococcus marinus str. MIT 9202]
Length = 179
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 20/132 (15%)
Query: 108 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADL +VK ++ AN A M + S F GA ++ +AY F AD +D
Sbjct: 50 ADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFTD 109
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 110 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 154
Query: 223 ITGVSTRKSLGC 234
TGV+T SL C
Sbjct: 155 QTGVNTADSLEC 166
>gi|428205702|ref|YP_007090055.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428007623|gb|AFY86186.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 169
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 62/110 (56%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F+ A++R S+FS S G L A ANF GA+L+ +D L ANL +A+L
Sbjct: 59 FSHANLRSSNFSHSNLEGVSLFAANLDSANFEGANLASATLDSARLTRANLKDAILEGAF 118
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ GA+I+GADF+D ++ + LC+ A G NP TG +TR +L C
Sbjct: 119 AANTKFDGAVIDGADFTDVLMRRDVQDKLCQVAKGVNPTTGRATRDTLFC 168
>gi|91070378|gb|ABE11292.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
HF10-88H9]
Length = 186
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 20/132 (15%)
Query: 108 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADL +VK ++ AN A M + S F GA ++ +AY F AD +D
Sbjct: 57 ADLHNTEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFTD 116
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
ANLTN L+++V GAII+GADF+DA +DL +++LC+ A GTN
Sbjct: 117 ----------ANLTNGELMKSV-----FDGAIIDGADFTDANLDLKTRKSLCERATGTNS 161
Query: 223 ITGVSTRKSLGC 234
TGV+T SL C
Sbjct: 162 QTGVNTADSLEC 173
>gi|22298403|ref|NP_681650.1| hypothetical protein tll0860 [Thermosynechococcus elongatus BP-1]
gi|22294582|dbj|BAC08412.1| tll0860 [Thermosynechococcus elongatus BP-1]
Length = 178
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 69/130 (53%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DLR + F +AN +++ ++ G F GA LE A N GADL
Sbjct: 54 FSGRDLRGS-----EFTKANLFHSNLSHTNLQGVSFFGANLETA-----NLEGADLRYAT 103
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D L +ANLTNA+L ++ AII GADF+D + ++ LCK A+GTNP+T
Sbjct: 104 LDTARLTKANLTNAILEGAFAFNTNFDDAIITGADFTDVELREDAQRKLCKVASGTNPVT 163
Query: 225 GVSTRKSLGC 234
G T ++L C
Sbjct: 164 GRKTWETLHC 173
>gi|72381929|ref|YP_291284.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|72001779|gb|AAZ57581.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
Length = 156
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 68/134 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA +G L + + + A F +D++ SDFSGS GA A AN + ++
Sbjct: 23 SALDYGKQTLIGSDFSNIDLKGATFYLSDLQNSDFSGSDLQGASFFDAKLENANLSNTNM 82
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
D MD +LN ANL+N++L + IIEGADF+D +I + LC ANG
Sbjct: 83 RDVTMDAAILNGANLSNSILEGAFAYNAKFENVIIEGADFTDVLIANDVRNKLCLIANGI 142
Query: 221 NPITGVSTRKSLGC 234
N +T T ++L C
Sbjct: 143 NSVTNKKTSETLDC 156
>gi|218248608|ref|YP_002373979.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218169086|gb|ACK67823.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 152
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DLR A+ N R +NF+ A+++ G +F A LE A NF GADL
Sbjct: 28 FSGQDLRDALFDHANLRGSNFSHANLQ-----GVRFFSANLEGA-----NFEGADLRGAD 77
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ L N TNA+L T + G II+GADF+D ++ ++ LC A GTNP+T
Sbjct: 78 LESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVT 137
Query: 225 GVSTRKSLGC 234
G +T+ +L C
Sbjct: 138 GRNTKDTLFC 147
>gi|307151213|ref|YP_003886597.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306981441|gb|ADN13322.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 174
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 70/137 (51%), Gaps = 20/137 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 157
A F L A + +ANF+ AD+R + F+GS K +GA L A+AY ++F G
Sbjct: 46 ADFSGQRLTLAQFTNVDLTQANFSDADLRGAVFNGSALKEVKLHGADLTNALAYLSSFEG 105
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
ADLSD A+ +L R+ A + G DFS AV+D + LCK A
Sbjct: 106 ADLSD---------------AIFAEAILKRTSFKNADVTGTDFSFAVLDGEEIANLCKSA 150
Query: 218 NGTNPITGVSTRKSLGC 234
+G N TGVSTR SL C
Sbjct: 151 SGVNSKTGVSTRDSLRC 167
>gi|56751209|ref|YP_171910.1| hypothetical protein syc1200_c [Synechococcus elongatus PCC 6301]
gi|81299124|ref|YP_399332.1| hypothetical protein Synpcc7942_0313 [Synechococcus elongatus PCC
7942]
gi|56686168|dbj|BAD79390.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168005|gb|ABB56345.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 170
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 68/131 (51%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
D K + ++ NF ANFT A++R SDFS S G A + GADLS+T
Sbjct: 39 DFTKEILIESNFSNRDLSDANFTKANLRSSDFSNSVLVGVRFYGANLESVDLHGADLSNT 98
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
++D+ + +LT+A+L + GA I GADF+D ++ + LC A G N
Sbjct: 99 ILDQARMTNTDLTDAILEGAYAFNALFQGAKITGADFTDVLMRQDAQDLLCSVAEGVNSK 158
Query: 224 TGVSTRKSLGC 234
TG +TR +L C
Sbjct: 159 TGRATRDTLDC 169
>gi|257061674|ref|YP_003139562.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591840|gb|ACV02727.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 167
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DLR A+ N R +NF+ A+++ G +F A LE A NF GADL
Sbjct: 47 FSGQDLRDALFDHANLRGSNFSHANLQ-----GVRFFSANLEGA-----NFEGADLRGAD 96
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ L N TNA+L T + G II+GADF+D ++ ++ LC A GTNP+T
Sbjct: 97 LESARLTRVNFTNALLEGAFATNVLIKGVIIDGADFTDVLLRPDVEKQLCAIAQGTNPVT 156
Query: 225 GVSTRKSLGC 234
G +T+ +L C
Sbjct: 157 GRNTKDTLFC 166
>gi|434406341|ref|YP_007149226.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428260596|gb|AFZ26546.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 165
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 68/121 (56%), Gaps = 20/121 (16%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEA 173
N ANF+ AD+R F+G+ G L + +AY NF GAD +D A
Sbjct: 59 NLENANFSDADLRGVVFNGTLLKGVNLHGVDFSQGIAYLVNFKGADFTD----------A 108
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
T+A+++R++ + + GADF++AV+D+ Q + LC A+G N TGV+TR+SLG
Sbjct: 109 VFTDAMMLRSLFDDVN-----VTGADFTNAVLDMQQVKKLCLKASGVNSQTGVNTRESLG 163
Query: 234 C 234
C
Sbjct: 164 C 164
>gi|428774426|ref|YP_007166214.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428688705|gb|AFZ48565.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 158
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 72/132 (54%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+ F DL + K N R ++FT+A++ F G+ + A LE GA+L++
Sbjct: 36 SDFSGQDLSGSTFNKTNLRSSDFTNANLSNVSFFGANLDSANLE----------GANLTN 85
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
++D + ANL NAVL T + A IEGADF+D ++ ++ LC+ A+G NP
Sbjct: 86 AVLDSARVTRANLHNAVLEGAFATNTKFEKANIEGADFTDVLLRPDVEEMLCEVASGINP 145
Query: 223 ITGVSTRKSLGC 234
+TG +TR +L C
Sbjct: 146 VTGRNTRDTLYC 157
>gi|126659509|ref|ZP_01730642.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
gi|126619243|gb|EAZ89979.1| hypothetical protein CY0110_07279 [Cyanothece sp. CCY0110]
Length = 167
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 115 HVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYK-----ANFTGADLSDTL 164
+ K+N +F+ D+R+SDF G F+ A L+ + ANF GADL
Sbjct: 37 YAKQNLVERDFSGQDLRDSDFEHANLRGCNFSHANLQGVRFFASNLEGANFEGADLRYAD 96
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
++ L N TNA+L T + GA+I+GADF+D ++ L ++ LC A GTNP+T
Sbjct: 97 LESARLVRVNFTNAILEGAFATNTLFNGAVIDGADFTDVLLRLDTEKKLCDIAKGTNPVT 156
Query: 225 GVSTRKSLGC 234
+T+ +L C
Sbjct: 157 RRNTKDTLFC 166
>gi|427728200|ref|YP_007074437.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364119|gb|AFY46840.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 164
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 60/112 (53%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A FT+AD+ ++FS + G V N G D S+ + ANL++AVL
Sbjct: 52 AEFTNADLENANFSDADLRGGVFNGTVLEGVNLHGVDFSNGIAYLAKFKNANLSDAVLTD 111
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
++ RS I G DF++AV+D Q + LC A+G N TGV TR+SLGC
Sbjct: 112 AMMLRSTFDNVDITGTDFTNAVLDGPQVKKLCTKASGVNSKTGVDTRESLGC 163
>gi|443328810|ref|ZP_21057403.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791546|gb|ELS01040.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 170
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F L++ K + ANF+++D+R SD S + +GA AY +F GAD
Sbjct: 49 FSGKTLQRLDFAKVDLSEANFSNSDLRGAVFNASDLSNANLHGADFTYGFAYLTDFQGAD 108
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
LSD A+ T+L+ S A+I+GADF+ A+++ Q LC+ A G
Sbjct: 109 LSD---------------AIFRETILSFSSFEDAMIDGADFTLAILEKWQVNQLCENATG 153
Query: 220 TNPITGVSTRKSLGC 234
N TGV TR+SLGC
Sbjct: 154 VNSQTGVDTRRSLGC 168
>gi|452821017|gb|EME28052.1| thylakoid lumenal protein [Galdieria sulphuraria]
Length = 217
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 68/122 (55%), Gaps = 1/122 (0%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+++ +F + +RE+DF G+K A A AN ADL++ ++ L A L
Sbjct: 96 EQDLSGVSFQQSLLRETDFHGAKLVSASFFGAELSYANLEDADLTEANLELANLRSAKLK 155
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCG 235
NAVL R + + L I+GADFS+ ++ QK+ LC ANGTN TGV T+ SLGC
Sbjct: 156 NAVLRRAYFSGNTRLENVDIDGADFSEVILRKDQKKYLCNIANGTNSHTGVETKTSLGCN 215
Query: 236 NS 237
+S
Sbjct: 216 SS 217
>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
Length = 194
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 16/178 (8%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 32 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
N R A+FT A+++ + F + +GA LE A A +F A L+ ANL
Sbjct: 86 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 135
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G IEGAD +D ++ + LC A GTNP+TG T+++L C
Sbjct: 136 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 193
>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
Length = 196
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 16/178 (8%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 34 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
N R A+FT A+++ + F + +GA LE A A +F A L+ ANL
Sbjct: 88 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDFESARLT----------HANLR 137
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G IEGAD +D ++ + LC A GTNP+TG T+++L C
Sbjct: 138 NARLEGSFGTNTKFGEVDIEGADLTDIILRPDTEDYLCGLAKGTNPVTGRETKETLFC 195
>gi|88770664|gb|ABD51935.1| chloroplast thylakoid 11 kDa protein [Guillardia theta]
Length = 242
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 2/139 (1%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
G +AA+ DLR+ ++ + A M++ DFS KF A + K A +A F G
Sbjct: 102 GQANAARDKLYDLRECPMAGKDATGFDLAGALMQKGDFSKVKFKDAVMSKVFADEATFDG 161
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
AD S+ +MDR +++ A+ VL+ S+ G+ + +DFSD + + +CK
Sbjct: 162 ADFSNAVMDRGTWRKSSFKGAIFANAVLSGSEFEGSDLTDSDFSDTYMGDFDNKKICKNP 221
Query: 218 --NGTNPITGVSTRKSLGC 234
GTNP+TGV TR S C
Sbjct: 222 TLQGTNPVTGVDTRASASC 240
>gi|352094203|ref|ZP_08955374.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
gi|351680543|gb|EHA63675.1| pentapeptide repeat protein [Synechococcus sp. WH 8016]
Length = 159
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 66/135 (48%), Gaps = 5/135 (3%)
Query: 105 FGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F + D K V + +F + F ++RE+D SGS GA L A AN + +
Sbjct: 24 FAAMDYAKQVLIGADFSNREMQGVTFNLTNLREADLSGSDLQGASLYGAKLQDANLSNTN 83
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
L D +D V + NLTNAVL + +EGADF++ + + LC A G
Sbjct: 84 LRDATLDSAVFDGTNLTNAVLEDAFAFNTRFINVTVEGADFTNVPLRADALKVLCANAEG 143
Query: 220 TNPITGVSTRKSLGC 234
NP+TG T ++LGC
Sbjct: 144 VNPVTGRDTSETLGC 158
>gi|33861906|ref|NP_893467.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
gi|33640274|emb|CAE19809.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
Length = 192
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 72/136 (52%), Gaps = 20/136 (14%)
Query: 108 ADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADL+ +VK ++ AN A M + S F GA ++ +AY F AD SD
Sbjct: 63 ADLQNNEYVKYDLSNQDLGEANLQGAYMSVTTAKNSSFKGANMKDLIAYATRFDNADFSD 122
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
ANLTN L+++V GA I+GADF++A +DL +++LC+ A+GTN
Sbjct: 123 ----------ANLTNGELMKSVFD-----GATIDGADFTNANLDLKTRKSLCERASGTNS 167
Query: 223 ITGVSTRKSLGCGNSR 238
TGV T +SL C R
Sbjct: 168 QTGVDTFESLECSGLR 183
>gi|317969761|ref|ZP_07971151.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 160
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 66/131 (50%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
D K V + +F A F ++RE+DF G+ GA L A AN GADLSD
Sbjct: 29 DYAKQVLIGHDFAGVDLHGATFNLTNLREADFHGADLRGASLYGAKLQDANLAGADLSDA 88
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+D VL +L NAVL + +I+GADF++ + LC A+GTNP+
Sbjct: 89 TLDSAVLEGTDLRNAVLENAFAFNTRFKDVLIDGADFTNVPFRGDVLKTLCASASGTNPV 148
Query: 224 TGVSTRKSLGC 234
TG T+ +L C
Sbjct: 149 TGRVTKDTLEC 159
>gi|37523524|ref|NP_926901.1| hypothetical protein gll3955 [Gloeobacter violaceus PCC 7421]
gi|35214528|dbj|BAC91896.1| gll3955 [Gloeobacter violaceus PCC 7421]
Length = 159
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 79/175 (45%), Gaps = 17/175 (9%)
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 119
WR V LAA +V +SA AD+ + A L ++N
Sbjct: 2 WRSGVLAGLAAGLV--LPGLVSAQADIQN---------------NYNGAYLEGRSVAEQN 44
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
++A F A++R DFS S GA L A ANF A L D + L A L AV
Sbjct: 45 LKQAQFYKANLRGVDFSSSDLRGASLFAASLRGANFNKARLDDAELSNADLQGAKLDQAV 104
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L +T + L ++GADF+ +I+ QK C A GTN +T TR++LGC
Sbjct: 105 LAGAYMTAARLKDVSVDGADFTGTIINNQQKTYQCGRATGTNGLTKRQTRRTLGC 159
>gi|254423673|ref|ZP_05037391.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196191162|gb|EDX86126.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 190
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 67/133 (50%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A F +LR+ ++ ++T AD+ E+D S + L +AN GA+L+
Sbjct: 57 ADNFDRMNLRQQDFSGQDLTDNDYTRADLTEADLSHTNLERVRLFTTRLNRANLEGANLT 116
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+D L ANL +AVL D G IEGADF+D ++D LC+ A GTN
Sbjct: 117 GATLDGASLVGANLKDAVLEGAYAINIDFRGIDIEGADFTDVLLDPKDNDKLCEIATGTN 176
Query: 222 PITGVSTRKSLGC 234
P TG T+++L C
Sbjct: 177 PTTGRKTKETLYC 189
>gi|86605126|ref|YP_473889.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86553668|gb|ABC98626.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 176
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 70/133 (52%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A + DL+ A ++ F A++R+SD S K GA L A KAN GADL
Sbjct: 43 AEDYSKRDLQGANFAGQDLSGWKFLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLR 102
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+D L A+L A L +++ + + G I+GADF++A+I LC+ A G N
Sbjct: 103 GATLDMANLQGADLREAQLQDSMMWLARVEGIQIDGADFTNALIRQDALSILCERATGVN 162
Query: 222 PITGVSTRKSLGC 234
P+TG +TR +L C
Sbjct: 163 PVTGRATRDTLEC 175
>gi|428164857|gb|EKX33868.1| hypothetical protein GUITHDRAFT_155908 [Guillardia theta CCMP2712]
Length = 237
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 2/139 (1%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
G +AA+ DLR+ ++ + A M++ DFS KF A + K A +A F G
Sbjct: 97 GQANAARDKLYDLRECPMAGKDATGFDLAGALMQKGDFSKVKFKDAVMSKVFADEATFDG 156
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
AD S+ +MDR +++ A+ VL+ S+ G+ + +DFSD + + +CK
Sbjct: 157 ADFSNAVMDRGTWRKSSFKGAIFANAVLSGSEFEGSDLTDSDFSDTYMGDFDNKKICKNP 216
Query: 218 --NGTNPITGVSTRKSLGC 234
GTNP+TGV TR S C
Sbjct: 217 TLQGTNPVTGVDTRASASC 235
>gi|443477206|ref|ZP_21067069.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443017715|gb|ELS32099.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 167
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 68/131 (51%), Gaps = 20/131 (15%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
QF +DLR A V + + +F +A+M+E AN TGA+LS +
Sbjct: 56 QFNESDLRNASFVNADAQGVSFFAANMKE--------------------ANLTGANLSYS 95
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+D L++ANLTNAV+ + + II+GADF+D + +Q LCK A G NP
Sbjct: 96 TLDNARLDKANLTNAVIEGSFAYGTSFNNVIIDGADFTDVDLRTPIRQKLCKSAKGQNPT 155
Query: 224 TGVSTRKSLGC 234
TG TR +L C
Sbjct: 156 TGRLTRDTLEC 166
>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 471
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 62/99 (62%), Gaps = 5/99 (5%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTL 164
L +A + + R+AN + A M +D SG+ GA LE AVA+KANFTGA+LSD L
Sbjct: 126 LHEANLCQADLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFTGANLSDGL 185
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+D+ L+E++L+NA L ++L +DL AI+ G S A
Sbjct: 186 LDQANLSESDLSNANLHNSILDETDLSKAILRGTTLSKA 224
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 10/112 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 157
A A L AV + + +A+ + A +RE++ +G+ +GA L KA + Y+A G
Sbjct: 59 ASLQGARLENAVLYRTSLFKADLSEASIREANMTGANLSGATLHKADLQRVILYRATLAG 118
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAV 204
A+L DT + L +A+L A L + +DL GA I+EG D DAV
Sbjct: 119 ANLFDTTLHEANLCQADLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAV 170
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 8/97 (8%)
Query: 124 NFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ AD+ +++FSG+ GA LE AV Y+ + ADLS+ + + ANL+ A
Sbjct: 40 DLMGADLSQTNFSGANLVRASLQGARLENAVLYRTSLFKADLSEASIREANMTGANLSGA 99
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
L + L R L A + GA+ D + A LC+
Sbjct: 100 TLHKADLQRVILYRATLAGANLFDTTLHEAN---LCQ 133
>gi|307154028|ref|YP_003889412.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306984256|gb|ADN16137.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 172
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 64/117 (54%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R + F A++R S+FS + G + ANF GA+L ++ L N TN
Sbjct: 54 QDLRDSKFDHANLRSSNFSNANLEGVRFFASNLESANFEGANLRYADLESARLIRVNFTN 113
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL T + GAII+GADF+D ++ ++ LC A GTNP+TG T+ +L C
Sbjct: 114 AVLEGAFATNTLFKGAIIDGADFTDVLLRPDVEKYLCTIAKGTNPVTGRDTKDTLYC 170
>gi|158337082|ref|YP_001518257.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158307323|gb|ABW28940.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 175
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 60/112 (53%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+FT AD+R SDFS S G A N GA+LS +D ANLTNA L
Sbjct: 63 ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
++ AII+GADF+D + + LC A GTNP+TG +TR +L C
Sbjct: 123 AFAFNTEFRRAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174
>gi|359460626|ref|ZP_09249189.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 175
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 60/112 (53%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+FT AD+R SDFS S G A N GA+LS +D ANLTNA L
Sbjct: 63 ASFTKADLRGSDFSNSDLRGVSFFAANLEDVNLEGANLSVATLDSARFARANLTNANLEG 122
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
++ AII+GADF+D + + LC A GTNP+TG +TR +L C
Sbjct: 123 AFAFNAEFRKAIIDGADFTDVDLRDDTLEILCAAAQGTNPVTGRNTRDTLYC 174
>gi|384252144|gb|EIE25621.1| hypothetical protein COCSUDRAFT_83628, partial [Coccomyxa
subellipsoidea C-169]
Length = 122
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 66/117 (56%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++FR AD+R ++FS + GA L A A F GA L++ ++ + A+L+
Sbjct: 5 KDFRGQKLYKADLRGTNFSKANMEGASLFGAFCKDAKFVGAHLNNADLESVDFENADLSE 64
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L +T + I G+D++D V+ +Q LCK A+GTNPITG TR++L C
Sbjct: 65 AILEGAQVTNAKFKNVNIAGSDWTDVVLRRDVQQQLCKIASGTNPITGQDTRETLIC 121
>gi|427736970|ref|YP_007056514.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427372011|gb|AFY55967.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 164
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 20/116 (17%)
Query: 124 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF++ADMR + F+GS +G +AY +NF +DLSD + TNA
Sbjct: 63 NFSNADMRGAVFNGSLLENSNLHGVDFTDGIAYLSNFKDSDLSDAI----------FTNA 112
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+++RT+ D + GADFS A++D + + LC+ A+G N TGVSTR SL C
Sbjct: 113 MMLRTIFRNVD-----VTGADFSGAILDRVEVKKLCETASGVNSKTGVSTRASLEC 163
>gi|428180855|gb|EKX49721.1| hypothetical protein GUITHDRAFT_135885 [Guillardia theta CCMP2712]
Length = 244
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/227 (30%), Positives = 105/227 (46%), Gaps = 26/227 (11%)
Query: 20 SSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSN 79
S KGP+ L + KP+ + + + E+D + VS AL +A++ S
Sbjct: 28 SLKGPHALSGM-KPVTRSHPAAVRMEADAD------AFDAKKFAVSLALGSALLFSSGMP 80
Query: 80 ISALADLNKYEAETRGEFGI--GSAAQFGSAD----LRKAVHVKENFRRAN-----FTSA 128
I A A + G F + G+A+ S R A+ NF N F +
Sbjct: 81 IPAFA-------QQGGSFKVLKGAASTQDSGSRRTITRGALLEGSNFDGQNLPGISFQQS 133
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT-VLTR 187
R+ F G+ GA AN GAD+S+ + + +ANL NA++ + +
Sbjct: 134 LCRDCSFVGTNLKGASFFDGDLTNANMEGADVSNVNFELTCMKDANLKNAIVNNAYIQST 193
Query: 188 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ L G IEGADF+D + Q++ LCK A+GTNP TGV T+ SL C
Sbjct: 194 TKLDGINIEGADFTDTELRKDQQRYLCKRASGTNPKTGVDTKDSLRC 240
>gi|75908971|ref|YP_323267.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75702696|gb|ABA22372.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 164
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 67/121 (55%), Gaps = 20/121 (16%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEA 173
+ ANF++AD+R F+G+ G L +AY A F ADLSD A
Sbjct: 58 DLENANFSNADLRGGVFNGTVLEGVNLHGVDFSNGIAYLARFKNADLSD----------A 107
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
LT+A+++R+V D + GADF++AV+D + + LC A+G N TGV TR+SLG
Sbjct: 108 VLTDAMMLRSVFDNVD-----VSGADFTNAVLDGTEVKKLCVKASGVNSKTGVDTRESLG 162
Query: 234 C 234
C
Sbjct: 163 C 163
>gi|428776639|ref|YP_007168426.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428690918|gb|AFZ44212.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 167
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 67/112 (59%), Gaps = 5/112 (4%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN ++AD+ ++D GS F + ++ A + ANFT T+++ + A+L+ +L
Sbjct: 60 ANLSAADLSDTDMRGSIFTASVMKDANLHGANFTF-----TVLNGVDFTNADLSQTILED 114
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+L+R+ I GADF++AV+D Q LC+ A+G N TG++TR SLGC
Sbjct: 115 AILSRATFENTDITGADFTNAVLDSRQIDQLCETASGVNEETGMATRDSLGC 166
>gi|124023314|ref|YP_001017621.1| hypothetical protein P9303_16121 [Prochlorococcus marinus str. MIT
9303]
gi|123963600|gb|ABM78356.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 158
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 65/125 (52%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
L A ++ R F A++RE++ SGS G+ L A + AN + +L D+ +D +
Sbjct: 33 LVNADFSNQDLRGDTFNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDSTLDSAI 92
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+ +LTNAVL + I GADF++ + LC+ A GTNPITG +T
Sbjct: 93 FDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPITGRNTA 152
Query: 230 KSLGC 234
SLGC
Sbjct: 153 DSLGC 157
>gi|72382760|ref|YP_292115.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL2A]
gi|72002610|gb|AAZ58412.1| secreted pentapeptide repeats protein [Prochlorococcus marinus str.
NATL2A]
Length = 182
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 63/117 (53%), Gaps = 7/117 (5%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + + D+ D G+ F GAY + ++ TGA++++ + + ANLTN
Sbjct: 53 KDLQNTEYVKYDLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATRFDNANLTN 112
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L L +S G I+GADF+DAV+D +Q++ LCK A G ST +SLGC
Sbjct: 113 VNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STAESLGC 162
>gi|124026482|ref|YP_001015597.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. NATL1A]
gi|123961550|gb|ABM76333.1| Pentapeptide repeats [Prochlorococcus marinus str. NATL1A]
Length = 182
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 63/117 (53%), Gaps = 7/117 (5%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ + + D+ D G+ F GAY + ++ TGA++++ + + ANLTN
Sbjct: 53 KDLQNTEYVKYDLSGKDLGGTNFTGAYFSVSTLKDSDLTGANMTNVIAYATRFDNANLTN 112
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L L +S G I+GADF+DAV+D +Q++ LCK A G ST +SLGC
Sbjct: 113 VNLTGAELLKSVFDGVTIDGADFTDAVLDRSQQKNLCKVATG-------STAESLGC 162
>gi|86609869|ref|YP_478631.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558411|gb|ABD03368.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 176
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 69/133 (51%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A + DL+ ++ F A++R+SD S K GA L A KAN GADL
Sbjct: 43 AEDYTKRDLQGVSFAGQDLSGWKFLKANLRQSDLSHVKAAGANLFGANLSKANLRGADLR 102
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+D L A+L A L +++ + + G I+GADF++A+I LC+ A G N
Sbjct: 103 GATLDMANLQGADLREAQLQDSMMWLARVEGIQIDGADFTNALIRQDALSILCERATGVN 162
Query: 222 PITGVSTRKSLGC 234
P+TG +TR +L C
Sbjct: 163 PVTGRATRDTLEC 175
>gi|186683889|ref|YP_001867085.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466341|gb|ACC82142.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 165
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 63/115 (54%), Gaps = 10/115 (8%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
ANF++AD+R G FNG LE N G D S+ + +A+L++AV
Sbjct: 60 LENANFSNADLR-----GGVFNGTLLEGV-----NLHGVDFSEGIAYLTRFKDADLSDAV 109
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L ++ RS + GADF++A++D Q + LC A+G N TGV TR+SLGC
Sbjct: 110 LTDAMMLRSTFDDVNVTGADFTNAILDGTQVKKLCVKASGVNSKTGVDTRQSLGC 164
>gi|33862899|ref|NP_894459.1| hypothetical protein PMT0626 [Prochlorococcus marinus str. MIT
9313]
gi|33634815|emb|CAE20801.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 158
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 65/125 (52%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
L A ++ R F A++RE++ SGS G+ L A + AN + +L D+ +D +
Sbjct: 33 LVNADFSNQDLRGDTFNLANLREANLSGSDLEGSTLFGAKLHDANLSNTNLRDSTLDSAI 92
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+ +LTNAVL + I GADF++ + LC+ A GTNPITG +T
Sbjct: 93 FDGTDLTNAVLEDAFAFNTRFKNVTITGADFTNVPLRGDALTTLCEVAEGTNPITGRNTA 152
Query: 230 KSLGC 234
+LGC
Sbjct: 153 DTLGC 157
>gi|428306980|ref|YP_007143805.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428248515|gb|AFZ14295.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 160
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 66/139 (47%), Gaps = 20/139 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 155
S F L + V+ N NF +AD+R F+GS G+ L A +AY A+F
Sbjct: 36 SGKDFSGQTLISSEFVEANLDNTNFNNADIRGVVFNGSTLKGSSLHSADFTNGLAYAADF 95
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ ADLSD AV ++L +S I G DFS V+D + LC
Sbjct: 96 SNADLSD---------------AVFSESILLKSRFDEVNINGTDFSGVVLDGTNVKKLCD 140
Query: 216 YANGTNPITGVSTRKSLGC 234
A+G N TGV+TR SLGC
Sbjct: 141 VADGVNSKTGVATRASLGC 159
>gi|282897571|ref|ZP_06305571.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
gi|281197494|gb|EFA72390.1| Pentapeptide repeat protein [Raphidiopsis brookii D9]
Length = 164
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 88/185 (47%), Gaps = 29/185 (15%)
Query: 57 LKNWRVFVSTALAAAVV-------ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 109
+K W++FV L A A+ SS+I+ A + G+ +G
Sbjct: 1 MKYWQIFVGLVLTAVFFVSNLPAQAASSSSITRSAGSEIEIQDYSGKSLVG--------- 51
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
++ ++K ANF++AD+R G FNG L AN G + SD +
Sbjct: 52 -KEFTNIK--LENANFSNADLR-----GVVFNGTLL-----IDANLHGVNFSDGISYLSN 98
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+NL++A+ ++ RS + GADF++A++D + + LC A+G N TGV TR
Sbjct: 99 FKNSNLSDAIFTNAMMLRSTFNNVDVTGADFTNAILDGVEVKKLCANASGVNSQTGVDTR 158
Query: 230 KSLGC 234
KSLGC
Sbjct: 159 KSLGC 163
>gi|148241708|ref|YP_001226865.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850018|emb|CAK27512.1| Secreted pentapeptide repeats protein [Synechococcus sp. RCC307]
Length = 156
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 61/117 (52%), Gaps = 7/117 (5%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ ++F A +R +DFSG+K +GA + +NF GADLSD LMDR NL+
Sbjct: 46 QDLEGSSFAGAVVRNADFSGAKLHGAIFTQGAFAGSNFAGADLSDVLMDRADFTGTNLSG 105
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
L V S A IEGADF+ A++D + LC+ A G TR SL C
Sbjct: 106 TNLSGVVANGSSFAKAEIEGADFTGALLDRDDQITLCRKAKG-------ETRLSLDC 155
>gi|427719897|ref|YP_007067891.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427352333|gb|AFY35057.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 165
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 63/117 (53%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ +A FTS +++ ++FS + G + N GAD S+ + +L++
Sbjct: 48 QSLIQAEFTSVNLKNTNFSNADLRGGVFNSTLLEGVNLHGADFSEGIAYLARFKNTDLSD 107
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+L ++ RS I GADF++AV+D Q + LC A+G N TG TR+SLGC
Sbjct: 108 AILTDAMMLRSTFDDVDITGADFTNAVLDGVQIKKLCVNASGVNSKTGTDTRESLGC 164
>gi|332706397|ref|ZP_08426459.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354834|gb|EGJ34312.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 126
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 66/118 (55%), Gaps = 1/118 (0%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E+ ++ T ++ +++ + +K A ++ AN GADL+ + + N+A+LT+
Sbjct: 8 EDLQKVKITYCNLDQANLADAKLIQASIKHTTLNNANLHGADLTKSDTYNISFNDADLTD 67
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVID-LAQKQALCKYANGTNPITGVSTRKSLGC 234
+ +L R+ GA I GADF+ +I + ++ LC A+G NP TGV TR SLGC
Sbjct: 68 VIFTGALLQRASFDGADITGADFTSTLIQPVRERLKLCDVASGVNPTTGVVTRDSLGC 125
>gi|87124267|ref|ZP_01080116.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
gi|86167839|gb|EAQ69097.1| hypothetical protein RS9917_11675 [Synechococcus sp. RS9917]
Length = 183
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 66/131 (50%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
D K V + +F + F ++RE+D SGS GA L A A+ + +L D
Sbjct: 53 DYAKQVLIGADFSGREMQGVTFNLTNLREADLSGSDLQGASLFGAKLQDADLSNTNLRDA 112
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+D VL+ NL+NAVL + I GADF++ + + LC A GTNP+
Sbjct: 113 TLDSAVLDGTNLSNAVLEDAFAFNTRFINVTISGADFTNVPLRGDVLKTLCAVAEGTNPV 172
Query: 224 TGVSTRKSLGC 234
TG +TR +LGC
Sbjct: 173 TGRNTRDTLGC 183
>gi|440680470|ref|YP_007155265.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428677589|gb|AFZ56355.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 168
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 64/125 (51%), Gaps = 30/125 (24%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLE----------KAVAYKANFTGADLSDTLMDRMV 169
NF++AD+R G FNGA LE + +AY A F D SD
Sbjct: 63 LENTNFSNADLR-----GGVFNGALLEGVNLHGVDFRQGIAYLARFKNTDFSD------- 110
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
A LT+A+++RT D + GADF++A++D+ Q + LC A G N TGV TR
Sbjct: 111 ---AVLTDAMMLRTTFDDVD-----VTGADFTNAILDMTQVKKLCVNARGVNSQTGVDTR 162
Query: 230 KSLGC 234
+SLGC
Sbjct: 163 ESLGC 167
>gi|33863821|ref|NP_895381.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9313]
gi|33635404|emb|CAE21729.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9313]
Length = 209
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 67/121 (55%), Gaps = 6/121 (4%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F + + D+ E+D GS F+ L+ A N G +L D L + A+L+ ++
Sbjct: 88 FVKYDLAGYDLSEADLRGSTFSVTSLKNA-----NLHGTNLEDVLAYATRFDNADLSESI 142
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNSR 238
L L +S+ GA+I+GADF++A++D +++ALC A G N TGV T SL C G S
Sbjct: 143 LRNANLRKSEFAGALIDGADFTNALLDKQEQKALCARATGKNSKTGVDTYSSLDCSGISE 202
Query: 239 R 239
R
Sbjct: 203 R 203
>gi|148239424|ref|YP_001224811.1| pentapeptide repeat-containing protein [Synechococcus sp. WH 7803]
gi|147847963|emb|CAK23514.1| Secreted pentapeptide repeat protein [Synechococcus sp. WH 7803]
Length = 158
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 5/143 (3%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
FG+ + + D K V + +F + F ++RE+D SGS GA L A
Sbjct: 16 FGLLLPSAEAAMDYAKQVLIGADFSNRDMQGVTFNLTNLREADLSGSDLQGASLYGAKLQ 75
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
AN + +L D +D VLN +LT+AVL + I GADF++ + +
Sbjct: 76 DANLSRTNLRDATLDSAVLNGTDLTDAVLEDAFAFNTRFIDVTISGADFTNVPLRGDVLK 135
Query: 212 ALCKYANGTNPITGVSTRKSLGC 234
LC A GTNP+TG TR +LGC
Sbjct: 136 TLCAAAEGTNPVTGRDTRDTLGC 158
>gi|409993003|ref|ZP_11276163.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409936150|gb|EKN77654.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 162
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 50 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 109
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 110 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 161
>gi|291569983|dbj|BAI92255.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 170
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 58 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMLDQVDFSQADLSDSIFTE 117
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 118 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSTTGVDTRYSLGC 169
>gi|87302765|ref|ZP_01085576.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
gi|87282648|gb|EAQ74606.1| hypothetical protein WH5701_13470 [Synechococcus sp. WH 5701]
Length = 168
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADLR N R AN + ADMR + G+K A+ G DL +
Sbjct: 46 ADFHDADLRGVTFNLTNLRDANLSGADMRNASLFGAKLQ----------DADMHGVDLRE 95
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D VL +L AVL + IEGADF++ + +LC A+GTNP
Sbjct: 96 ATLDSAVLEGTDLREAVLEDAFAFNTKFVDVAIEGADFTNVPLRGDVLTSLCAIASGTNP 155
Query: 223 ITGVSTRKSLGC 234
+TG TR +LGC
Sbjct: 156 VTGRVTRDTLGC 167
>gi|124022089|ref|YP_001016396.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9303]
gi|123962375|gb|ABM77131.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9303]
Length = 202
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 67/121 (55%), Gaps = 6/121 (4%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F + + D+ E+D GS F+ L+ A N G +L D L + A+L+ ++
Sbjct: 81 FVKYDLAGYDLSEADLRGSTFSVTTLKNA-----NLHGTNLEDVLAYATRFDNADLSESI 135
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC-GNSR 238
L L +S+ GA+I+GADF++A++D +++ALC A G N TGV T SL C G S
Sbjct: 136 LRNANLRKSEFAGALIDGADFTNALLDRQEQKALCARATGKNSKTGVDTYTSLDCSGISE 195
Query: 239 R 239
R
Sbjct: 196 R 196
>gi|88808450|ref|ZP_01123960.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
gi|88787438|gb|EAR18595.1| hypothetical protein WH7805_02132 [Synechococcus sp. WH 7805]
Length = 159
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 58/110 (52%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F ++RE+D SGS GA L A AN + +L D +D VLN +LT+AVL
Sbjct: 50 FNLTNLREADLSGSDLQGASLYGAKLQDANLSRTNLRDATLDSAVLNGTDLTDAVLEDAF 109
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ I GADF++ + + LC A GTNP+TG TR +LGC
Sbjct: 110 AFNTRFIDVTISGADFTNVPLRGDVLKTLCAAAEGTNPVTGRDTRDTLGC 159
>gi|427714384|ref|YP_007063008.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427378513|gb|AFY62465.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 177
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 82/186 (44%), Gaps = 21/186 (11%)
Query: 49 QFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSA 108
FP + K + +FV+ LA +V + A Y E F F
Sbjct: 10 HFPAWF-KFQAGSIFVAFLLAMLLVLGFALPAQA----ENYTKEALVNF------DFSGK 58
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR + K N +N + D+R G F A LE A + TGADL +D
Sbjct: 59 DLRDSEFTKANLFHSNLSHTDLR-----GVSFFAANLETA-----DLTGADLRVATLDTA 108
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ANLT+A L + GAII+GADF+D + ++ LC A G NP+TG +T
Sbjct: 109 RFTKANLTDANLEGAFAFNTIFDGAIIDGADFTDVDLRPDARKMLCSVAKGVNPVTGRAT 168
Query: 229 RKSLGC 234
+L C
Sbjct: 169 HDTLEC 174
>gi|411119230|ref|ZP_11391610.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711093|gb|EKQ68600.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 192
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 59/110 (53%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
FT A++RES+F G+ +G A AN GADL + +D L+ +NL NA L
Sbjct: 82 FTKANLRESNFRGADLHGVSFFGANLEGANLEGADLRNATLDTARLSRSNLKNANLEGAF 141
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ GA I+GADF+ + + ALC A GTNP T +TR +L C
Sbjct: 142 AFNAKFDGATIDGADFTGVDMRQDVQHALCDRAAGTNPTTKRNTRDTLNC 191
>gi|255570589|ref|XP_002526251.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
[Ricinus communis]
gi|223534416|gb|EEF36120.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor, putative
[Ricinus communis]
Length = 228
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 110/243 (45%), Gaps = 27/243 (11%)
Query: 3 LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNWR 61
+++IS PLS++SL SS + + + L P+ + C S+ DG P+ +L++
Sbjct: 1 MATISFPLSVRSL----SSERSRFPVPQLHPPIKIICSGSA----DGSKSKPFKELQS-- 50
Query: 62 VFVSTALAAAVVASCSSNISALADL-------NKYEAETRGE-FGIGSAAQFGSADLRKA 113
LAA V S S I+A L N+ E G G + DLR
Sbjct: 51 -VACGLLAAWAVTSASPVIAASQRLPPLSTEPNRCEKAFVGNTIGQANGVYDKPIDLRFC 109
Query: 114 VHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
+ E N + + +A M ++ F G+ + + KA A A+F G D S+ ++DR+
Sbjct: 110 DYTNEKSNLKGKSLAAALMSDAKFDGADMSEVVMSKAYAVGASFKGVDFSNAVLDRVNFG 169
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
+ANL AV TVL+ S A + A F D +I Q LCK N + R+
Sbjct: 170 KANLQGAVFKNTVLSGSTFDEAQLADAVFEDTIIGYIDLQKLCK-----NTSINLEGREI 224
Query: 232 LGC 234
LGC
Sbjct: 225 LGC 227
>gi|217071608|gb|ACJ84164.1| unknown [Medicago truncatula]
Length = 240
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 101/235 (42%), Gaps = 20/235 (8%)
Query: 13 SLNFCSSSSKGP-YQLHALSKPLWVACQISSKTESDGQFPGP-YAKLKNWRVFVSTALAA 70
SL+ + S+K P + AL P + C + + E DG P L + LAA
Sbjct: 12 SLSIRNFSTKRPCFTTSAL--PFTITCSVVGEAELDGTENKPRLLSLNKIKGVACGILAA 69
Query: 71 AVVASCSSNISALA--------DLNKYEAETRGE-FGIGSAAQFGSADLRKA--VHVKEN 119
V S S ++A D N+ E G G + + DLRK + K N
Sbjct: 70 YAVTSASFPVTAATQRLPPLSTDPNRCERAFVGNTIGQANGVYDKALDLRKCDFTNEKSN 129
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
+ ++A M ++ F G+ + KA A +F G D S+ ++DR+ +A+L AV
Sbjct: 130 LKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFGKADLQGAV 189
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
TVL+ S A +EGA F D +I Q +C+ N G R LGC
Sbjct: 190 FRNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKICR-----NTTIGDEGRAELGC 239
>gi|209525582|ref|ZP_03274120.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423065234|ref|ZP_17054024.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493915|gb|EDZ94232.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406713366|gb|EKD08537.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 177
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 62/112 (55%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A F ++++ ++F S+ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 65 AEFANSNLEYANFDESELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176
>gi|119511352|ref|ZP_01630465.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119463974|gb|EAW44898.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 164
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 20/116 (17%)
Query: 124 NFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF++AD R F+GS+ G L +AY F GADL+D A TNA
Sbjct: 63 NFSNADFRGGVFNGSRLEGVNLHGVDFSDGIAYLTQFKGADLTD----------AVFTNA 112
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+++R+V D I GADF++A++D Q + LC A+G N TG TR+SL C
Sbjct: 113 MMLRSVFDDVD-----ITGADFTNAILDGTQIKKLCTQASGVNSQTGADTRESLEC 163
>gi|33240260|ref|NP_875202.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33237787|gb|AAP99854.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 158
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 69/139 (49%), Gaps = 5/139 (3%)
Query: 101 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
+ + F S D K V E+F + A F +++++D SGS GA L A +N
Sbjct: 19 TQSSFASIDYGKQTLVGEDFSKLDLKGATFYLTNLQDADLSGSDLEGASLFGAKLLNSNL 78
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ A+L + +D V NL NAVL + + I+G+DF++ ++ LC
Sbjct: 79 SNANLHNATLDSAVFEGTNLENAVLEDAFVFNARFSDVNIQGSDFTNVILRNQDLSYLCS 138
Query: 216 YANGTNPITGVSTRKSLGC 234
ANGTNP+T T+ +L C
Sbjct: 139 IANGTNPVTKRKTKDTLQC 157
>gi|427710138|ref|YP_007052515.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427362643|gb|AFY45365.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 164
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 10/116 (8%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ ANF++AD+R G FNG LE N G D S+ + A+L++A
Sbjct: 58 DLENANFSNADLR-----GGVFNGIVLEGV-----NMHGVDFSNGIAYLARFKNADLSDA 107
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
VL ++ RS I GADF++AV+D Q + LC A+G N T V TR+SLGC
Sbjct: 108 VLTDAMMLRSTFDNVEITGADFTNAVLDGTQVKKLCAKASGVNSKTSVDTRESLGC 163
>gi|224006618|ref|XP_002292269.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971911|gb|EED90244.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 255
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 69/122 (56%), Gaps = 4/122 (3%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+N + F + +R+SDFS S GA A +NF AD++ ++ N ANL
Sbjct: 132 NQNLKGVAFQQSIVRDSDFSNSNLYGASFFDATLDGSNFENADMTLCNVEMAQFNRANLK 191
Query: 177 NAVLVRTVLTRSDL--GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSL 232
NA++ ++ + L G IEG+D+S+ + Q++ LC + A GTNP+TGV+TR+SL
Sbjct: 192 NAIVKDMYVSGATLFEGVKDIEGSDWSETQLRKDQQKYLCNHPTAKGTNPVTGVNTRESL 251
Query: 233 GC 234
C
Sbjct: 252 MC 253
>gi|332705869|ref|ZP_08425945.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332355661|gb|EGJ35125.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 150
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 65/129 (50%), Gaps = 7/129 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A + A +K F + + D+ DF + F+ L+ A NF GA+++ +
Sbjct: 26 AQAQSAATIKATFANTDLSGQDLSGQDFHNAVFSSVNLQSANLSNVNFKGANIT-----K 80
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI--TG 225
+ ANL A + + GA I GADF+ A++D Q + LCK A+ TNPI TG
Sbjct: 81 VNFTNANLQGADFSYAFINVCNFKGANITGADFTFAILDSKQYRELCKNASATNPITDTG 140
Query: 226 VSTRKSLGC 234
V TR SLGC
Sbjct: 141 VDTRYSLGC 149
>gi|414077638|ref|YP_006996956.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413971054|gb|AFW95143.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 165
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 20/116 (17%)
Query: 124 NFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF +AD+R + F+G+ F+G + +AY + F +DLSD A T A
Sbjct: 64 NFNNADLRGAVFNGTLLDTVNFHGVDFSQGIAYLSRFKNSDLSD----------AVFTEA 113
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+++R+ + D + GADF++A++D+ Q + +C A+G N TGV TR SLGC
Sbjct: 114 MMLRSTFDQVD-----VTGADFTNAILDMIQIKKICINASGVNSKTGVDTRASLGC 164
>gi|376005445|ref|ZP_09782948.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
gi|375326159|emb|CCE18701.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
Length = 177
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 62/112 (55%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A F ++++ ++F ++ G+ +A+ ADL+ ++D++ ++A+L++++
Sbjct: 65 AEFANSNLEYANFDEAELRGSVFSRAIMLGVTMRKADLTYAMVDQVDFSQADLSDSIFTE 124
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ S I GADF+DA+ D Q + LC A G N TGV TR SLGC
Sbjct: 125 ALFLGSTFADTKITGADFTDAIFDREQLRQLCLRAEGVNSRTGVDTRYSLGC 176
>gi|427701765|ref|YP_007044987.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427344933|gb|AFY27646.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 175
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 65/132 (49%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADLR F ++R++D SG+ GA L A A+ +G+DL D
Sbjct: 53 ADFHGADLRGVT----------FNLTNLRDADLSGADLRGASLFGAKLQDADLSGSDLRD 102
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D V +L NA L + G +I+GADF++ + +LC A+GTNP
Sbjct: 103 ATLDSAVFEGTDLRNARLDDAFAFNTKFRGVLIDGADFTNVPLRGDALTSLCAAASGTNP 162
Query: 223 ITGVSTRKSLGC 234
+TG TR +L C
Sbjct: 163 VTGRLTRDTLNC 174
>gi|148242416|ref|YP_001227573.1| pentapeptide repeat-containing protein [Synechococcus sp. RCC307]
gi|147850726|emb|CAK28220.1| Secreted pentapeptide repeat protein [Synechococcus sp. RCC307]
Length = 162
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 63/131 (48%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
D K V + +F + F ++RE+D SGS A L A AN +G+DL +
Sbjct: 31 DYAKQVLIGADFSSRDLKGVTFNLTNLREADLSGSDLRAASLFGAKLQDANLSGSDLREA 90
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+D V N +L++A L + G I GADFSD + LC A GTN +
Sbjct: 91 TLDSAVFNGTDLSDARLEGAFAFNTRFSGVTITGADFSDVPLRGDALSTLCAVAEGTNSV 150
Query: 224 TGVSTRKSLGC 234
TG TR +LGC
Sbjct: 151 TGRDTRDTLGC 161
>gi|282900932|ref|ZP_06308865.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281194023|gb|EFA68987.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 164
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/183 (28%), Positives = 85/183 (46%), Gaps = 25/183 (13%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
+K W++FV L A S S +A + A ++ E + L
Sbjct: 1 MKYWQIFVGLVLTAVFFVSNLSAQAASSSSITRSAGSKIEI-----QDYSGKSLVGKEFT 55
Query: 117 KENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
ANF++AD+R F+G+ +G ++Y +NF ++LSD +
Sbjct: 56 NIKLENANFSNADLRGVVFNGTLLIDTNLHGVNFSDGISYLSNFKNSNLSDAI------- 108
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
TNA+++R+ D I GADF++A++D + + LC A+G N TGV TR+S
Sbjct: 109 ---FTNAMMLRSTFNNVD-----ITGADFTNAILDGVEVKKLCADASGVNSQTGVDTRES 160
Query: 232 LGC 234
LGC
Sbjct: 161 LGC 163
>gi|449018747|dbj|BAM82149.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 269
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 64/115 (55%), Gaps = 2/115 (1%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
+ +F+ + R+++FSGS +GA KA +ANF A L +++ VL +N NAVL
Sbjct: 153 QKDFSGSTCRKTNFSGSDLSGARFFKADLTEANFENAQLIGASLEQTVLRGSNFQNAVLR 212
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 234
T T S L A IE D++DA+++ + LC A G N +T TR+SL C
Sbjct: 213 STYWTESVLTIANIENTDWTDALLEPTWQMKLCSRSDAKGMNTLTNTDTRESLMC 267
>gi|427723472|ref|YP_007070749.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427355192|gb|AFY37915.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 170
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 66/134 (49%), Gaps = 6/134 (4%)
Query: 107 SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
+ D K ++E+F R ++ + +R SDFS G A NF GAD+
Sbjct: 36 AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGSDFSYCDLRGVRFFSANLEFVNFEGADMR 95
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 220
++D + AN TNA L L + +I+GADF+DA+I + LC A GT
Sbjct: 96 GAVLDSARIGHANFTNANLEGAYLASVKITPSTVIDGADFTDALILKNENDKLCDLATGT 155
Query: 221 NPITGVSTRKSLGC 234
NP TGV T +SL C
Sbjct: 156 NPDTGVDTAESLYC 169
>gi|302837694|ref|XP_002950406.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
nagariensis]
gi|300264411|gb|EFJ48607.1| hypothetical protein VOLCADRAFT_120854 [Volvox carteri f.
nagariensis]
Length = 182
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 66/117 (56%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R+ T A++R+++F+ + G L +++ A F GA+L + ++ A+ TN
Sbjct: 65 KDLRKLKLTKANLRQTNFTDANLEGVSLFGSLSESAIFRGANLRNADLESGNYEFADFTN 124
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AVL + + I G+D++D V+ ++ LC A+G NP TGVSTR+SL C
Sbjct: 125 AVLEGAFVNNAQFVKVTITGSDWTDVVLRKDVQKELCAIADGVNPTTGVSTRESLLC 181
>gi|302756827|ref|XP_002961837.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
gi|300170496|gb|EFJ37097.1| hypothetical protein SELMODRAFT_76876 [Selaginella moellendorffii]
Length = 180
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 71/136 (52%), Gaps = 6/136 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+GS R ++F + T D + S + F GA L A + AN TGAD SD
Sbjct: 44 YGSEVTRGQDLSGKDFSGRDLTKQDFKTSILRQANFKGAKLFGASFFDANLTGADFSDAD 103
Query: 165 MDRMVLN-----EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ L+ +AN TNA L ++T + L GA I GADF+D + Q+ LC+ A+
Sbjct: 104 LRGADLSLADATKANFTNANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIAD 163
Query: 219 GTNPITGVSTRKSLGC 234
G NP+T STR++L C
Sbjct: 164 GINPVTSNSTRETLLC 179
>gi|302798106|ref|XP_002980813.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
gi|300151352|gb|EFJ17998.1| hypothetical protein SELMODRAFT_178497 [Selaginella moellendorffii]
Length = 180
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 71/136 (52%), Gaps = 6/136 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+GS R ++F + T D + S + F GA L A + AN TGAD SD
Sbjct: 44 YGSEVTRGQDLSGKDFSGRDLTKQDFKTSILRQANFKGAKLFGASFFDANLTGADFSDAD 103
Query: 165 MDRMVLN-----EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ L+ +AN TNA L ++T + L GA I GADF+D + Q+ LC+ A+
Sbjct: 104 LRGADLSLADATKANFTNANLEGALVTGNTSLKGANITGADFTDVLWREDQRSYLCRIAD 163
Query: 219 GTNPITGVSTRKSLGC 234
G NP+T STR++L C
Sbjct: 164 GINPVTSNSTRETLLC 179
>gi|159467845|ref|XP_001692102.1| hypothetical protein CHLREDRAFT_115715 [Chlamydomonas reinhardtii]
gi|158278829|gb|EDP04592.1| predicted protein [Chlamydomonas reinhardtii]
Length = 124
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 65/131 (49%), Gaps = 20/131 (15%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDT 163
DLRK K N R+ N T A++ GS F GA L A N+ AD SD
Sbjct: 8 DLRKLKLTKANLRQTNLTGANLEGVSLFGSLSEGAVFKGANLRNADLESGNYEDADFSDA 67
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+++ +N NA VR I+G+D++D V+ ++ALC A+G NP
Sbjct: 68 ILEGAFVN-----NAQFVRVN----------IKGSDWTDVVLRKDIQKALCAIADGVNPT 112
Query: 224 TGVSTRKSLGC 234
TGVSTR+SL C
Sbjct: 113 TGVSTRESLMC 123
>gi|298715141|emb|CBJ27829.1| Thylakoid lumenal 15 kDa protein, chloroplast precursor (p15)
[Ectocarpus siliculosus]
Length = 245
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 71/146 (48%), Gaps = 13/146 (8%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA G RK V N A++ D+ F S G + A A F ADLS
Sbjct: 99 AASTGDKGARKTVTRGVNIENADYHDKDLSSVSFQQSLVRGTNFKNAKLVAAGFFDADLS 158
Query: 162 DTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGG------AIIEGADFSDAVIDLAQK 210
+ + +N+ ANL+ A + ++T + + G AIIEGADF+D + Q
Sbjct: 159 NCNFESANMNQANLELANLSGANMKNALVTEAYVSGATKMEPAIIEGADFTDTFLRKDQV 218
Query: 211 QALC--KYANGTNPITGVSTRKSLGC 234
+ LC + A GTNP++GV TR SLGC
Sbjct: 219 RYLCGLETAKGTNPVSGVDTRDSLGC 244
>gi|225449424|ref|XP_002282933.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic [Vitis
vinifera]
gi|296086195|emb|CBI31636.3| unnamed protein product [Vitis vinifera]
Length = 221
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 69/123 (56%), Gaps = 6/123 (4%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 172
++F + D + S + F GA L A + A+ TGADLSD + D + N +
Sbjct: 98 KDFSGKSLIKQDFKTSILRQANFKGANLLGASFFDADLTGADLSDADLRGADFSLANVTK 157
Query: 173 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
ANL+NA L + T + G+II GADF+D + Q++ LCK A+G NP TG +TR++
Sbjct: 158 ANLSNANLEGALATGNTSFRGSIITGADFTDVPLREDQREYLCKVADGVNPTTGNATRET 217
Query: 232 LGC 234
L C
Sbjct: 218 LLC 220
>gi|170079322|ref|YP_001735960.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169886991|gb|ACB00705.1| Pentapeptide repeat containing protein [Synechococcus sp. PCC 7002]
Length = 166
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 65/134 (48%), Gaps = 6/134 (4%)
Query: 107 SADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
+ D K ++E+F R ++ + +R DFS S G A NF GADL
Sbjct: 32 AVDYNKRTFIQEDFSHQDLRDNSYDLSSLRGCDFSYSDLRGVRFFSANLEFVNFEGADLR 91
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQKQALCKYANGT 220
++D + AN NA L L + +IEGADF+DA+I + LC+ A+GT
Sbjct: 92 GAVLDSARIGHANFKNANLEGAFLASVKITPSTVIEGADFTDALILARENDKLCELASGT 151
Query: 221 NPITGVSTRKSLGC 234
NP TG T +L C
Sbjct: 152 NPTTGRDTAATLYC 165
>gi|397595313|gb|EJK56448.1| hypothetical protein THAOC_23663 [Thalassiosira oceanica]
Length = 238
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 69/145 (47%), Gaps = 2/145 (1%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
+ G +AA+ DLR+ N + + + M ++D S + F A K +NF
Sbjct: 94 KMGQANAARDKLYDLRECKLSGVNGQEFDLSGVIMSKTDLSKANFREAQFSKGYLRDSNF 153
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
AD ++ ++DR ++L + VLT + GA +E ADF+DA I + LCK
Sbjct: 154 EEADFTNAIVDRATFKGSSLKGTIFSNAVLTATSFEGADVENADFTDAYIGDFDIRNLCK 213
Query: 216 YAN--GTNPITGVSTRKSLGCGNSR 238
G NP+TG TR S CG R
Sbjct: 214 NPTLKGENPLTGADTRLSANCGPGR 238
>gi|224098455|ref|XP_002311180.1| predicted protein [Populus trichocarpa]
gi|222851000|gb|EEE88547.1| predicted protein [Populus trichocarpa]
Length = 218
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 104 KQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDADLRSADFSLTNVTKANLS 158
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G+ I GADF+D + Q++ LCK+A+G NP TG +TR +L C
Sbjct: 159 NANLEGALATGNTSFRGSNITGADFTDVPLREDQREYLCKFADGVNPTTGNATRDTLLC 217
>gi|449016903|dbj|BAM80305.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 341
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 68/128 (53%), Gaps = 11/128 (8%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA---------- 173
+ +S D+ + +G+ +GA L A ++ +GA+L D +L+EA
Sbjct: 211 DLSSVDLSTAALAGADLHGAALSHANLFQVQLSGANLRGAKFDASILDEAALDGADLSGA 270
Query: 174 NLTNAVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
+L A++ RT+L + L I I+GADFS A+ID ++ LC+ A G N TGV+T SL
Sbjct: 271 DLRQALVRRTLLLGARLDANISIDGADFSGALIDRTNQRLLCELAQGVNSRTGVATATSL 330
Query: 233 GCGNSRRN 240
C + N
Sbjct: 331 ACPEPKTN 338
>gi|255073547|ref|XP_002500448.1| predicted protein [Micromonas sp. RCC299]
gi|226515711|gb|ACO61706.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 66/128 (51%), Gaps = 5/128 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR+ +V ++ + A M ++ F G+ + KA A A+FTGA+ ++ ++DR+
Sbjct: 93 DLRQCNYVDKDLSTKTLSGALMVDATFKGANMTEVVMSKAYAVNADFTGANFTNAVVDRV 152
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ ANL+NA V+T + G + GA F +A+I + LC+ NP T
Sbjct: 153 TFDGANLSNANFFNAVITGATFEGTNLAGAQFDEALIGKEDVKKLCE-----NPTLVEET 207
Query: 229 RKSLGCGN 236
R +GC N
Sbjct: 208 RFQVGCRN 215
>gi|428166498|gb|EKX35473.1| hypothetical protein GUITHDRAFT_97823, partial [Guillardia theta
CCMP2712]
Length = 230
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 63/119 (52%), Gaps = 2/119 (1%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++F + +F+ +E+ F G+K G KA A+FTGADLS ++ L+ L N
Sbjct: 112 KDFSKKDFSGCAAKEAKFVGTKLRGTRFFKADLTGADFTGADLSTASLEDAKLDGVVLKN 171
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 234
A+L + I GADF+DA++ LCK A GTNP+T TR+SLGC
Sbjct: 172 AILSNSYTNLGLDKVKDISGADFTDALVRPDILAKLCKRSDATGTNPVTKADTRESLGC 230
>gi|298492954|ref|YP_003723131.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298234872|gb|ADI66008.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 164
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 64/120 (53%), Gaps = 20/120 (16%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEAN 174
+ NF+++D+R F+G+ G L + +AY F AD SD +
Sbjct: 59 LQNTNFSNSDLRGGVFNGTLLEGVNLHGVDFSQGIAYLVKFNNADFSDAI---------- 108
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
LT+A+++R+V D + GADF++A++D + + LC A+G N T V TR+SLGC
Sbjct: 109 LTDAMMLRSVFDNVD-----VTGADFTNAILDGVEIKKLCLKASGVNSKTAVDTRESLGC 163
>gi|302143933|emb|CBI23038.3| unnamed protein product [Vitis vinifera]
Length = 232
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 105/241 (43%), Gaps = 17/241 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MA SI PLS++ SS + + + L P ++C S + + +LKN
Sbjct: 1 MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASWDSPELKASSSQFKELKNV 55
Query: 61 R---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGSADLRKAVH 115
+ V AA+ V + S + L+ + N+ E G G + DLR +
Sbjct: 56 AFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKPIDLRFCDY 115
Query: 116 VKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
E N + + +A M E+ F G+ + + KA A A+F G D ++ ++DR+ +A
Sbjct: 116 TNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVLDRVNFGKA 175
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
NL AV TVL+ S A +E A F D +I Q +C TN R LG
Sbjct: 176 NLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSINADGRAELG 230
Query: 234 C 234
C
Sbjct: 231 C 231
>gi|224112717|ref|XP_002316270.1| predicted protein [Populus trichocarpa]
gi|222865310|gb|EEF02441.1| predicted protein [Populus trichocarpa]
Length = 219
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 100/235 (42%), Gaps = 35/235 (14%)
Query: 14 LNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNWRV---FVSTALAA 70
LN + P +L+KP + S + S + P P A + N ++ F T L A
Sbjct: 5 LNVSLCTKIPPKPPLSLTKPSLSIPRFLSLSHS--RCPNPQALILNKQLLEDFAKTGLLA 62
Query: 71 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 130
+ S ALA +GS R ++F T D
Sbjct: 63 LLSVSLFFTDPALA--------------FKGGGPYGSEVTRGQDLTGKDFSGRTLTKQDF 108
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ S + F GA L A + A+ TGADLSD L A+L+ A + + L+ ++L
Sbjct: 109 KTSILRQANFKGAKLLGASFFDADLTGADLSDA-----DLRSADLSLANVAKVNLSNANL 163
Query: 191 GGAI-----------IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
GA+ I GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 164 EGALATGNTSFRGSNITGADFTDVPLREDQREYLCKVADGVNPTTGNATRDTLLC 218
>gi|359490718|ref|XP_002275994.2| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic [Vitis
vinifera]
Length = 244
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 105/241 (43%), Gaps = 17/241 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNW 60
MA SI PLS++ SS + + + L P ++C S + + +LKN
Sbjct: 13 MATLSI-PLSLQH----SSPKRHRFSVPELHSPFRISCSASWDSPELKASSSQFKELKNV 67
Query: 61 R---VFVSTALAAAVVASCSSNISALA-DLNKYEAETRGE-FGIGSAAQFGSADLRKAVH 115
+ V AA+ V + S + L+ + N+ E G G + DLR +
Sbjct: 68 AFGILAVCAVTAASPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKPIDLRFCDY 127
Query: 116 VKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
E N + + +A M E+ F G+ + + KA A A+F G D ++ ++DR+ +A
Sbjct: 128 TNEKSNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGVDFTNAVLDRVNFGKA 187
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG 233
NL AV TVL+ S A +E A F D +I Q +C TN R LG
Sbjct: 188 NLQGAVFKNTVLSGSTFDQAQLEDAVFEDTIIGYIDLQKIC-----TNTSINADGRAELG 242
Query: 234 C 234
C
Sbjct: 243 C 243
>gi|219116308|ref|XP_002178949.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409716|gb|EEC49647.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 131
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 66/124 (53%), Gaps = 4/124 (3%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K+N + F + +R DFS S GA A +NF ++L + ++ L +
Sbjct: 8 KQNLKGVAFQQSIVRNCDFSNSDLRGASFFDATLTDSNFENSNLENVNLEMAQLTRVSFK 67
Query: 177 NAVLVRTVLTRSDL--GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSL 232
NAV+ ++ + + G +EG+D+S+ + QK+ LC + A GTNP+TGV TR+SL
Sbjct: 68 NAVVTDAYVSGATIFDGVKDVEGSDWSETYLRADQKKLLCNHPTAKGTNPVTGVDTRESL 127
Query: 233 GCGN 236
C N
Sbjct: 128 MCPN 131
>gi|255645177|gb|ACU23086.1| unknown [Glycine max]
Length = 222
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 108 KQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTKANLS 162
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L ++T + G+ + GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 163 NANLEGALVTGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDTLFC 221
>gi|449441422|ref|XP_004138481.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
[Cucumis sativus]
Length = 214
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 115/234 (49%), Gaps = 29/234 (12%)
Query: 9 LSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTAL 68
+++ + + C S+ K P + H L P + + S ++ Q +L+ R FV +
Sbjct: 1 MALLNASLCCSTPKIPSRSHLLLPP---SISLHSTSQVFNQ------QLQATRNFVLSLG 51
Query: 69 AAAVVASCSSNISALADLNKYEAETRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFT 126
+A S+++ LAD + G +G G D +K++F+
Sbjct: 52 QPTFLAFVSASL-FLAD-PALAFKGGGPYGAGVTRGQDLSGKDFSGKTLIKQDFK----- 104
Query: 127 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLV 181
++ +R+++F G+ GA + A+ TGADLSD + D + N +ANL+NA L
Sbjct: 105 TSILRQANFKGANLLGASF-----FDADLTGADLSDADLRGADFSLANVTKANLSNANLE 159
Query: 182 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 160 GALATGNTSFRGSTINGADFTDVPLREDQREYLCKVADGVNPTTGNATRETLLC 213
>gi|298705858|emb|CBJ29003.1| thylakoid lumenal protein [Ectocarpus siliculosus]
Length = 199
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 59/105 (56%), Gaps = 2/105 (1%)
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
E++FS F + KA A +N+ AD ++ ++DR+ + +++ A+ VLT +
Sbjct: 92 EANFSKGDFKEVVMSKAYARSSNWEEADFTNAVVDRVSFDGSSMKGAIFQNAVLTSTSFT 151
Query: 192 GAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 234
GA +E ADF++A + ++ LCK GTNP+T TR S GC
Sbjct: 152 GADVENADFTEAYMGDFDQKNLCKNPTLKGTNPVTNADTRASAGC 196
>gi|147774410|emb|CAN74472.1| hypothetical protein VITISV_013914 [Vitis vinifera]
Length = 221
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 75/150 (50%), Gaps = 18/150 (12%)
Query: 89 YEAE-TRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
Y AE TRG+ G S D + ++ + NF+ AN A ++D +G+ + A
Sbjct: 85 YGAEVTRGQDLTGKDFSGKSLIKQDFKTSILRQANFKXANLLGASFFDADLTGADLSDAD 144
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
L A AN T A+LS+ ANL A+ R G+II GADF+D
Sbjct: 145 LRGADFSLANVTKANLSN----------ANLEGALATGNTSFR----GSIITGADFTDVP 190
Query: 205 IDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ Q++ LCK A+G NP TG +TR++L C
Sbjct: 191 LREDQREYLCKVADGVNPTTGNATRETLLC 220
>gi|351722845|ref|NP_001236746.1| uncharacterized protein LOC100500352 [Glycine max]
gi|255630103|gb|ACU15405.1| unknown [Glycine max]
Length = 224
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 110 KQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTKANLS 164
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 165 NANLEGALATGNTSFKGSNITGADFTDVPLREDQREYLCKVADGVNPTTGNATRDALFC 223
>gi|388521435|gb|AFK48779.1| unknown [Lotus japonicus]
Length = 225
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + ++ TGADLSD + D + N +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFFLANVTKANLS 165
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224
>gi|168022043|ref|XP_001763550.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685343|gb|EDQ71739.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 68/114 (59%), Gaps = 1/114 (0%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
+ +F ++ +R+++F G+K GA + A+ T ADL + L++ANLTNA L
Sbjct: 51 KQDFKTSILRQANFKGAKLLGASFFDSDLTGADLTDADLRGADLSLARLSKANLTNANLE 110
Query: 182 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+T + L G+II GADF++ Q++ LC A+G NP+TG +TR++L C
Sbjct: 111 GASVTGNTYLKGSIITGADFTEVNWRDDQRKELCLIADGVNPVTGNATRETLLC 164
>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 189
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 65/123 (52%), Gaps = 12/123 (9%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A RG + + F A+L++A N RAN + AD+ +D SG+ GA L A
Sbjct: 30 AHLRGAHLVEADLSF--ANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARL 87
Query: 151 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGG-----AIIEGADF 200
+AN TGA+L D L++R L E ANL NA V + L R+DLG A+ +GAD
Sbjct: 88 MRANLTGANLRDALVNRADLTEALLVDANLRNAHFVESTLVRADLGDANALKAVFKGADL 147
Query: 201 SDA 203
S A
Sbjct: 148 SGA 150
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ A + E+D S + A L A +AN +GADL + L ANLT A L+R
Sbjct: 30 AHLRGAHLVEADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMR 89
Query: 183 TVLTRSDLGGAIIEGADFSDAVI 205
LT ++L A++ AD ++A++
Sbjct: 90 ANLTGANLRDALVNRADLTEALL 112
>gi|443326649|ref|ZP_21055296.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442793770|gb|ELS03210.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 920
Score = 67.4 bits (163), Expect = 7e-09, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 56/104 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A V+ N RAN A++ ++ +G+ GA LEKA+ ANF GA+L++
Sbjct: 801 ANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGANLEKAILEGANFRGANLNE 860
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ L+EAN A R L R D A +GADF A++D
Sbjct: 861 ANLRGAHLSEANFQEADFDRADLQRVDFDRADFQGADFDRAIMD 904
Score = 44.7 bits (104), Expect = 0.047, Method: Composition-based stats.
Identities = 30/103 (29%), Positives = 51/103 (49%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+L +A + N RAN A++ ++ G+ A L +A +AN GA+L+ +++
Sbjct: 782 NLYRANLYRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAILEGA 841
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
L +A L A L ++L GA + A+F +A D A Q
Sbjct: 842 NLEKAILEGANFRGANLNEANLRGAHLSEANFQEADFDRADLQ 884
Score = 41.6 bits (96), Expect = 0.42, Method: Composition-based stats.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 5/74 (6%)
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAI 194
N L +A Y+AN A+L +D L ANL A LVR L R ++L GAI
Sbjct: 778 LNWLNLYRANLYRANLYRANLVRANLDGANLEGANLVRANLVRANLVRANLDGANLNGAI 837
Query: 195 IEGADFSDAVIDLA 208
+EGA+ A+++ A
Sbjct: 838 LEGANLEKAILEGA 851
>gi|298711847|emb|CBJ32870.1| Pentapeptide repeat [Ectocarpus siliculosus]
Length = 238
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 63/125 (50%), Gaps = 12/125 (9%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K ++ +F+ + +E +FSGS G KA KA+FTGA+L L EA+L
Sbjct: 116 KGKYKSKDFSGSIAKEVNFSGSDLRGVRFFKADLKKADFTGANLGTA-----SLEEADLE 170
Query: 177 NAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTR 229
++ V T S G + I GADF+DA+I + LC A GTNP TG TR
Sbjct: 171 GTIMTNAVATGSYFGNNMNNVGDISGADFTDALIRKDVAKILCARPDAKGTNPTTGTDTR 230
Query: 230 KSLGC 234
SL C
Sbjct: 231 DSLLC 235
>gi|425437827|ref|ZP_18818239.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9432]
gi|389677087|emb|CCH93934.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9432]
Length = 976
Score = 67.0 bits (162), Expect = 8e-09, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 55/102 (53%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
+A A+L A + N RAN A++ E++ G+ GAYLE A +AN GA+L
Sbjct: 861 SANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERANLYGANLE 920
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++R L ANL A L L R++L GA + GA+F DA
Sbjct: 921 GANLERANLERANLKGANLEGANLERANLEGAFLRGANFKDA 962
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 46/102 (45%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A A+L+ A N RAN A + + + AYLE+A Y AN A+L
Sbjct: 811 GANLERANLKGANLYMANLERANLYRAYLYRAYLYRAYLERAYLERANLYSANLERANLY 870
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++R L ANL A L L + L GA +EGA+ A
Sbjct: 871 MANLERANLERANLKRANLYEANLYGAYLAGAYLEGANLERA 912
Score = 46.2 bits (108), Expect = 0.014, Method: Composition-based stats.
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 1/111 (0%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+ + + + RAN A++ ++ G+ GA LE+A AN A+L + R
Sbjct: 778 DLQNCLLIHRDLYRANLERANLERANLYGAYLYGANLERANLKGANLYMANLERANLYRA 837
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
L A L A L R L R++L A +E A+ A ++ A ++A K AN
Sbjct: 838 YLYRAYLYRAYLERAYLERANLYSANLERANLYMANLERANLERANLKRAN 888
Score = 42.0 bits (97), Expect = 0.31, Method: Composition-based stats.
Identities = 29/93 (31%), Positives = 42/93 (45%), Gaps = 3/93 (3%)
Query: 83 LADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
L N YEA G + G+ A A+L A N RAN A+++ ++ G+
Sbjct: 884 LKRANLYEANLYGAYLAGAYLEGANLERANLYGANLEGANLERANLERANLKGANLEGAN 943
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
A LE A ANF A++ T++D V E
Sbjct: 944 LERANLEGAFLRGANFKDANVKGTILDTEVKTE 976
Score = 40.8 bits (94), Expect = 0.69, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 54/125 (43%), Gaps = 4/125 (3%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
A+L + E +G A A+L +A N AN A++ + + A
Sbjct: 792 ANLERANLERANLYG----AYLYGANLERANLKGANLYMANLERANLYRAYLYRAYLYRA 847
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
YLE+A +AN A+L + L ANL A L R L ++L GA + GA A
Sbjct: 848 YLERAYLERANLYSANLERANLYMANLERANLERANLKRANLYEANLYGAYLAGAYLEGA 907
Query: 204 VIDLA 208
++ A
Sbjct: 908 NLERA 912
Score = 40.4 bits (93), Expect = 0.83, Method: Composition-based stats.
Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 5/106 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L +A + RA A + + + A LE+A Y AN A+L + R
Sbjct: 827 ANLERANLYRAYLYRAYLYRAYLERAYLERANLYSANLERANLYMANLERANLERANLKR 886
Query: 168 MVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVIDLA 208
L EANL A L L R++L GA +EGA+ A ++ A
Sbjct: 887 ANLYEANLYGAYLAGAYLEGANLERANLYGANLEGANLERANLERA 932
>gi|359806262|ref|NP_001240959.1| uncharacterized protein LOC100806792 [Glycine max]
gi|255626639|gb|ACU13664.1| unknown [Glycine max]
Length = 222
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + A+ TGADLSD + D + N +ANL+
Sbjct: 108 KQDFKTSILRQANFKGAKLIGASF-----FDADLTGADLSDADLRNADFSLANVTKANLS 162
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G+ + GADF+D + Q++ LCK A+G NP TG +TR +L C
Sbjct: 163 NANLEGALATGNTSFRGSNVTGADFTDVPLREDQREYLCKVADGVNPTTGNATRDTLFC 221
>gi|388510406|gb|AFK43269.1| unknown [Lotus japonicus]
Length = 225
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + ++ TGADLSD + D + N +ANL+
Sbjct: 111 KQDFKTSILRQANFKGAKLLGASF-----FDSDLTGADLSDADLRSADFSLANVTKANLS 165
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G+ I GADF+D + Q++ LCK A+G NP TG +TR++L C
Sbjct: 166 NANLEGALATGNTSFKGSNITGADFTDVPLRDDQREYLCKVADGVNPTTGNATRETLLC 224
>gi|413968546|gb|AFW90610.1| chloroplast thylakoid lumenal 17.4 kDa protein [Solanum tuberosum]
Length = 228
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 98/227 (43%), Gaps = 25/227 (11%)
Query: 3 LSSIS-PLSIKSLNFCSSSSKGPYQLHALSKPLWVAC----QISSKTESDGQFPGPYAKL 57
++SIS PL+ KS + S P QLH+ P+ + C S+ ES QF
Sbjct: 1 MASISIPLAYKSHSLRRSPIYRPSQLHS---PIQIKCSASKDCSNSEESSTQF------- 50
Query: 58 KNWRVFVSTALAAAVVASCSSNISA-------LADLNKYEAETRGE-FGIGSAAQFGSAD 109
K R LA ++S S I+A D N+ E G G + D
Sbjct: 51 KQLRNVACGFLAVWALSSVSPVIAAGQRLPPLSTDPNRCERAFVGSTIGQANGVYDKPLD 110
Query: 110 LRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
LR + E N + + +A M ++ F G+ + KA A A+F D S+ ++DR
Sbjct: 111 LRFCDYTNEKTNLKGKSLAAALMSDAKFDGADMTEVIMSKAYAVGASFKAMDFSNAVLDR 170
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+ +ANL A TVL+ S A ++G DF D +I Q +C
Sbjct: 171 VNFEKANLQGASFKNTVLSGSTFNDAQLDGVDFEDTIIGYIDLQKIC 217
>gi|116785879|gb|ABK23895.1| unknown [Picea sitchensis]
gi|116792150|gb|ABK26251.1| unknown [Picea sitchensis]
Length = 239
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 7/128 (5%)
Query: 109 DLRKA--VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
DLR + K N R + +A M ++ F G+ + + KA A A+F G D S+ ++D
Sbjct: 116 DLRSCDFTNEKTNLRGKSLAAALMSDAKFDGADMSEVIMSKAYAVGASFKGVDFSNAVID 175
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
R+ +AN+ +AV TVL+ S A +EGA F D +I Q LC TN
Sbjct: 176 RVNFGKANMQDAVFRNTVLSGSTFVDANLEGAKFEDTIIGYIDLQKLC-----TNQTLSD 230
Query: 227 STRKSLGC 234
R LGC
Sbjct: 231 EGRDILGC 238
>gi|307108672|gb|EFN56912.1| hypothetical protein CHLNCDRAFT_51710 [Chlorella variabilis]
Length = 155
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 61/126 (48%), Gaps = 5/126 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR N + A M E+D SG+ L KA A AN GADL++ ++DR+
Sbjct: 33 DLRFCKFAGANLSGKTLSGAYMNEADMSGANMREVVLTKAYAVGANLRGADLTNAVIDRV 92
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ +L A LV V+T + GA ++ A+F DA+I + LC NP +
Sbjct: 93 AFDGVDLEGAQLVNAVITGTTFTGANLKDANFEDALIGSEDAKRLC-----ANPTLVGES 147
Query: 229 RKSLGC 234
R +GC
Sbjct: 148 RDQVGC 153
>gi|159474024|ref|XP_001695129.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
gi|158276063|gb|EDP01837.1| thylakoid lumenal 17.4 kDa protein [Chlamydomonas reinhardtii]
Length = 185
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 65/131 (49%), Gaps = 5/131 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
D+R + ++ A + ++D S + A L KA A KANF GAD+++ ++DR+
Sbjct: 60 DMRLCSYAGKDLHGRVLAGALLADADLSNTNLQEAVLTKAYAVKANFEGADMTNAVVDRV 119
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
ANL + TV+T + GA +EG+ + DA+I LC+ NP +
Sbjct: 120 DFTNANLKRVKFINTVVTGASFAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGES 174
Query: 229 RKSLGCGNSRR 239
R +GC R+
Sbjct: 175 RAQVGCRAVRK 185
>gi|357133836|ref|XP_003568528.1| PREDICTED: thylakoid lumenal 15 kDa protein 1, chloroplastic-like
[Brachypodium distachyon]
Length = 200
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
+ +F ++ +R+++F G+K GA + A+ TGADLSDT + D + N + NLT
Sbjct: 86 KQDFKTSILRQTNFKGAKLLGASF-----FDADLTGADLSDTDLRNADFSLANVTKVNLT 140
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L ++T + G+ I GADF+D + Q+ LCK A+G N TG +T+++L C
Sbjct: 141 NANLEGALVTGNTSFKGSTIYGADFTDVPLRDDQRDYLCKIADGVNTTTGNATKETLFC 199
>gi|219116042|ref|XP_002178816.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409583|gb|EEC49514.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 109
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ ++F S G KA +A+F+GADL ++ ++EA L + V V + S +
Sbjct: 3 KSTNFGKSNLKGCRFYKAYLVRADFSGADLRGASLEDTSMDEALLKDTVAVGAYFSASIM 62
Query: 191 GGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 234
+E ADF+DA + LC+ A GTNP+TGV TR+SL C
Sbjct: 63 DTLTVENADFTDAQFPIKTLPLLCERSDATGTNPVTGVDTRESLMC 108
>gi|428219116|ref|YP_007103581.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990898|gb|AFY71153.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 179
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 69/133 (51%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA++ L A ++ R A F A +R +F+ + +G L + AN +GA+L
Sbjct: 46 AAKYDRRVLEGADFSGKDLRDAQFNKAVLRSVNFANANLSGVSLFGSDLTNANLSGANLR 105
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+ +D + +L+NA+L + + I GADF+D + ++ LC+ A GTN
Sbjct: 106 YSSLDTSRMVGTDLSNAILEGAFVYGAKFKNLKIAGADFTDVDLRETIREELCEVATGTN 165
Query: 222 PITGVSTRKSLGC 234
P TG TR++LGC
Sbjct: 166 PTTGRDTRETLGC 178
>gi|172036979|ref|YP_001803480.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354554778|ref|ZP_08974082.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171698433|gb|ACB51414.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353553587|gb|EHC22979.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 325
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 106/219 (48%), Gaps = 37/219 (16%)
Query: 3 LSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQF---PGPYAKLKN 59
L+ IS +++K + P+QL L++ + E+D QF ++ N
Sbjct: 95 LTQISGVTVKQFKLVKTH---PFQLEDLAEQI---------DENDPQFLLIERIMSQGGN 142
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 119
+ F L+ A++ C++N+ LADL EA G S A ADL A + N
Sbjct: 143 DQDFREANLSGAIL--CNANL-ILADL--REANLMGTDL--SGANLMGADLSGADLLGAN 195
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVA---------------YKANFTGADLSDTL 164
AN A++ E++ +G+ A L++A +AN GA L+D+L
Sbjct: 196 LTGANLMGANLTEANLTGADLGDAILQEADLCWADLSEVNLIGADLSQANLKGAILTDSL 255
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ LNEANL+ A+L R++L++++L G+I+ D ++A
Sbjct: 256 LSHTNLNEANLSEAILNRSILSKTNLSGSILSQTDLTNA 294
Score = 37.4 bits (85), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 28/115 (24%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ F A+L A+ AN AD+RE++ G+ +GA L A+ +GAD
Sbjct: 141 GNDQDFREANLSGAI-----LCNANLILADLREANLMGTDLSGANL-----MGADLSGAD 190
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
L ANLT A L+ LT ++L GA D DA++ Q+ LC
Sbjct: 191 LLG----------ANLTGANLMGANLTEANLTGA-----DLGDAIL---QEADLC 227
>gi|116792169|gb|ABK26257.1| unknown [Picea sitchensis]
Length = 237
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 71/136 (52%), Gaps = 6/136 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+G+ R A ++F N D + S +KF GA L A + A+ TGADLSD
Sbjct: 102 YGAEVTRGADLSGKDFSGKNLIQQDFKTSILRQAKFKGAKLIGASFFDADLTGADLSDAD 161
Query: 165 M---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ D + N + NL+NA L ++T + G+ I GADF+D + Q++ LC A+
Sbjct: 162 LRGADFSLANVTKVNLSNANLEGALVTGNTSFKGSNISGADFTDVPLRDDQRRYLCNIAD 221
Query: 219 GTNPITGVSTRKSLGC 234
G N TG +TR +L C
Sbjct: 222 GVNLTTGNATRDTLLC 237
>gi|260434702|ref|ZP_05788672.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
gi|260412576|gb|EEX05872.1| secreted pentapeptide repeat protein [Synechococcus sp. WH 8109]
Length = 160
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 63/122 (51%), Gaps = 1/122 (0%)
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
A + ++ R A F +++RE++ SGS GA L A A+ +G DL + +D V+
Sbjct: 39 ADYSNKDLRGATFNLSNLREANLSGSDLRGASLYGAKLQDADLSGTDLREATLDAAVMTG 98
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
NL +AVL + +I GADF+D + L + TN +TG STR+SL
Sbjct: 99 TNLEDAVLEGAFAFNTRFRDVLITGADFTDVPCAGTNSKPL-RRCRRTNSVTGRSTRESL 157
Query: 233 GC 234
GC
Sbjct: 158 GC 159
>gi|159903302|ref|YP_001550646.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
gi|159888478|gb|ABX08692.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
Length = 158
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F DLR A N + AN SGS GA L A K + + +L +
Sbjct: 36 ADFSDTDLRGATFYLTNLQNANL----------SGSNLEGASLFGAKLLKTDLSNTNLKN 85
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+D +L+ A+LTNA L + I G+DF++ +I Q+ LC A+GTN
Sbjct: 86 ATLDSSILDGADLTNAYLEDAFAFNTQFKDVKISGSDFTNVLITNDQRNYLCSIASGTNS 145
Query: 223 ITGVSTRKSLGC 234
++ +TR SL C
Sbjct: 146 VSTRNTRDSLEC 157
>gi|356509222|ref|XP_003523350.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Glycine max]
Length = 240
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 7/128 (5%)
Query: 109 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
DLR+ E N + + ++A M ++ F G+ + KA A A+F G D S+ ++D
Sbjct: 117 DLRQCDFTDEKTNLKGKSLSAALMSDAKFDGADMTEVVMSKAYAVGASFKGVDFSNAVLD 176
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
R+ +A+L AV TVL+ S A ++ A F D +I Q LC TN G
Sbjct: 177 RVNFEKADLEGAVFKNTVLSGSTFDDAKLDNAVFEDTIIGYIDLQKLC-----TNKTIGD 231
Query: 227 STRKSLGC 234
R LGC
Sbjct: 232 EWRVELGC 239
>gi|126656956|ref|ZP_01728134.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
gi|126621794|gb|EAZ92503.1| hypothetical protein CY0110_02219 [Cyanothece sp. CCY0110]
Length = 1084
Score = 64.3 bits (155), Expect = 6e-08, Method: Composition-based stats.
Identities = 55/161 (34%), Positives = 75/161 (46%), Gaps = 21/161 (13%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A+ RG + G A G ADL A + A+ T AD+R +D +G+ GAYLE A
Sbjct: 931 ADLRGAYLEG--ADLGGADLTGA-----DLEGADLTGADLRGADLTGAYLEGAYLEGADL 983
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DL 207
A+ TGA L ++ L A+LT A L L +DLGGA + GAD + A + DL
Sbjct: 984 TGADLTGAYLEGAYLEGADLGGADLTGADLEGADLRGADLGGADLGGADLTGADLRGADL 1043
Query: 208 AQ-----------KQALCKYANGTNPITGVSTRKSLGCGNS 237
+ KQ NG + I K LG G++
Sbjct: 1044 TKTDLNEARYLTVKQVQEAKNNGKDAIYDEEMEKKLGLGDN 1084
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 51/106 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ ADL A + A+ T AD+ +D G+ GAYLE A A+ TGADL
Sbjct: 896 AKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGADLEG 955
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ L A+LT A L L +DL GA + GA A ++ A
Sbjct: 956 ADLTGADLRGADLTGAYLEGAYLEGADLTGADLTGAYLEGAYLEGA 1001
Score = 45.1 bits (105), Expect = 0.036, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 40/75 (53%)
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
++ E+ +G+ GAYLE A A+ TGADL+ ++ L A L A L LT +
Sbjct: 892 ELYEAKLTGADLTGAYLEGADLGGADLTGADLTGADLEGADLRGAYLEGADLGGADLTGA 951
Query: 189 DLGGAIIEGADFSDA 203
DL GA + GAD A
Sbjct: 952 DLEGADLTGADLRGA 966
>gi|115434488|ref|NP_001042002.1| Os01g0144100 [Oryza sativa Japonica Group]
gi|13486898|dbj|BAB40127.1| unknown protein [Oryza sativa Japonica Group]
gi|113531533|dbj|BAF03916.1| Os01g0144100 [Oryza sativa Japonica Group]
gi|215678959|dbj|BAG96389.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765141|dbj|BAG86838.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 198
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
R +F ++ +R+++F G+K GA + A+ TGADLSD + D + N + NLT
Sbjct: 84 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDADLRGADFSLANVSKVNLT 138
Query: 177 NAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L + T + G+ I GADF+D + Q++ LCK A+G N TG +T+++L C
Sbjct: 139 NANLEGALATGNTTFKGSNIYGADFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 197
>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 351
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 83/179 (46%), Gaps = 18/179 (10%)
Query: 30 LSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKY 89
+SKP KT + + YA+ R F +L AA+ + N L+ N
Sbjct: 1 MSKP---------KTVTVNKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLS 49
Query: 90 EAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
EA IG S +Q ADL AV + N A T + ++D SG+ +GA L
Sbjct: 50 EALMVHTRLIGANLSRSQLSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILS 109
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ N TGA L T + LN + LT+A+LV LTRS L GA + GA+ + +++
Sbjct: 110 QVNLTGVNLTGASLIGTCL----LNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLTGANL 249
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSD 162
A L ++V + AN + + E D SG+ GA +L + AN TGADLS+
Sbjct: 142 ATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSE 201
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+++ ANLT A L LT ++L GA + GA+ + A
Sbjct: 202 SVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGA 242
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 152
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTGANLTGANLNGLTLQSADLRLANLSKADLRG 271
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|440804190|gb|ELR25067.1| pentapeptide repeatcontaining protein [Acanthamoeba castellanii
str. Neff]
Length = 293
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ ADLR+A +AN AD+RE++ SG+ A L A+ +A+ +GA L +
Sbjct: 162 AQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAILRRADLSGAALPN 221
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 205
+ R L ANLT A L LT +D L GA + GAD S++ +
Sbjct: 222 VELQRASLRRANLTGANLTWATLTDADCTQANLSGANLSGADLSNSTL 269
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 54/117 (46%), Gaps = 30/117 (25%)
Query: 119 NFRRANFTS---------------ADMRE----------SDFSGSKFNGAYLEKAVAYKA 153
+F+ AN T A+MRE ++ SG+ + A L KA +A
Sbjct: 93 DFQWANLTEATLTDCNLTGANLKGANMREVQLASTNLTRANLSGANLHLARLGKAQLRRA 152
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVI 205
N +GA+L + ++ L +ANL NA + + L +D L GA++ AD A++
Sbjct: 153 NLSGANLEEAQLEDADLRQANLANAKMTKANLMHADLREANLSGAVMLRADLRSAIL 209
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 44/96 (45%), Gaps = 12/96 (12%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD----------LSDTLMDRMVLNEA 173
+ T A + D G F A L +A N TGA+ L+ T + R L+ A
Sbjct: 78 DLTGARLFRCDLRGVDFQWANLTEATLTDCNLTGANLKGANMREVQLASTNLTRANLSGA 137
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
NL A L + L R++L GA +E A DA DL Q
Sbjct: 138 NLHLARLGKAQLRRANLSGANLEEAQLEDA--DLRQ 171
>gi|303279747|ref|XP_003059166.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459002|gb|EEH56298.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 213
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 5/126 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLRK + ++ + A M ++ F G+ + KA A A+FTGA+ ++ ++DR+
Sbjct: 91 DLRKCEYDGKDLSTKTLSGALMVDASFKGTNLTEVVMSKAYALNADFTGANFTNAVVDRV 150
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ ANL NA V+T + G + GA F +A+I + LC NP T
Sbjct: 151 TFDGANLANADFHNAVITGTTYEGTDLTGATFEEALIGKEDVKRLCD-----NPTVKGPT 205
Query: 229 RKSLGC 234
R +GC
Sbjct: 206 RFEVGC 211
>gi|449456995|ref|XP_004146234.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Cucumis sativus]
gi|449522387|ref|XP_004168208.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Cucumis sativus]
Length = 237
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K + + +A M ++ F G+ + + KA A A+F G D S+ ++DR+ +ANL
Sbjct: 124 KNQLKGKSLAAALMSDAKFDGADLSEVVMSKAYAVGASFKGVDFSNAVLDRVNFGKANLQ 183
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+ TVL+ S A +E A F D +I Q LC NP R LGC
Sbjct: 184 GALFKNTVLSGSTFDDAQLEDAVFEDTIIGYIDLQKLC-----VNPTISPEGRAELGC 236
>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
Length = 351
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 83/179 (46%), Gaps = 18/179 (10%)
Query: 30 LSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKY 89
+SKP KT + + YA+ R F +L AA+ + N L+ N
Sbjct: 1 MSKP---------KTVTVNKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLA 49
Query: 90 EAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
EA IG S +Q ADL AV + N A T + ++D SG+ +GA L
Sbjct: 50 EALMVHTRLIGANLSRSQLSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILS 109
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ N TGA L T + LN + LT+A+LV LTRS L GA + GA+ + +++
Sbjct: 110 QVNLTGVNLTGASLIGTCL----LNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 49/101 (48%), Gaps = 12/101 (11%)
Query: 101 SAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
S A A L + VH+ + N AN T AD+ ES S F A L A AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
LN ANLT A L R LTR++L G ++ AD
Sbjct: 229 ----------LNGANLTRANLTRANLTRANLNGLTLQSADL 259
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL ++V NF AN T A++ ++ +G+ NGA L +A +AN T A+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLTRANL 249
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 48/103 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A A+L A + N RAN T A++ + A L KA AN TGA+L
Sbjct: 220 TGANLTGANLNGANLTRANLTRANLTRANLNGLTLQSADLRLANLSKADLRGANLTGANL 279
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 280 AGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANF 155
+ A A+L +A + N RAN SAD+R ++ S + GA L A N
Sbjct: 225 TGANLNGANLTRANLTRANLTRANLNGLTLQSADLRLANLSKADLRGANLTGA-----NL 279
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
GA+L + + L +ANL A L+ T L ++L GA + A+ A + +A
Sbjct: 280 AGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQANLIGASLSVAN 333
>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 351
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 83/179 (46%), Gaps = 18/179 (10%)
Query: 30 LSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKY 89
+SKP KT + + YA+ R F +L AA+ + N L+ N
Sbjct: 1 MSKP---------KTVTVNKLLTRYAQ--GERNFSDISLMAAIFNEVTLNRINLSGANLA 49
Query: 90 EAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
EA IG S +Q ADL AV + N A T + ++D SG+ +GA L
Sbjct: 50 EALMVHTRLIGANLSRSQLSYADLSMAVLIDANLTGATMTETVLHQADLSGASLSGAILS 109
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ N TGA L T + LN + LT+A+LV LTRS L GA + GA+ + +++
Sbjct: 110 QVNLTGVNLTGASLIGTCL----LNGSQLTDAILVGATLTRSVLSGAHMTGANLNRSIL 164
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL ++V NF AN T A++ ++ +G+ NGA L A +AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLTGANL 249
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQSADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 48/101 (47%), Gaps = 12/101 (11%)
Query: 101 SAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
S A A L + VH+ + N AN T AD+ ES S F A L A AN TGA+
Sbjct: 170 SGANLTGATLIR-VHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGAN 228
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
LN ANLT A L R LT ++L G ++ AD
Sbjct: 229 ----------LNGANLTGANLTRANLTGANLNGLTLQSADL 259
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSD 162
A L ++V + AN + + E D SG+ GA +L + AN TGADLS+
Sbjct: 142 ATLTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSE 201
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+++ ANLT A L LT ++L GA + GA+ + A
Sbjct: 202 SVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRA 242
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 19/151 (12%)
Query: 70 AAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA 128
A+++ +C N S L D A TR S A A+L +++ + + AN T A
Sbjct: 121 ASLIGTCLLNGSQLTDAILVGATLTRSVL---SGAHMTGANLNRSILSEIDLSGANLTGA 177
Query: 129 -----DMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEANLTNA 178
+ + + SG+ GA L ++V +NF TGA+L+ + LN ANLT A
Sbjct: 178 TLIRVHLNQGNLSGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGA 237
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L TR++L GA + G A + LA
Sbjct: 238 NL-----TRANLTGANLNGLTLQSADLRLAN 263
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 152
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLTGANLTGANLNGANLTGANLTRANLTGANLNGLTLQSADLRLANLSKADLRG 271
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
Length = 378
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L+ A ++ N A+ + AD+R +D SG+ A L KA +AN T DL
Sbjct: 43 SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTNTDL 102
Query: 161 SDTLMDR-----MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S ++R +L++ANL NA L T L +DLG A +E AD S+A +D
Sbjct: 103 SSANLNRASLDYALLSKANLINADLSGTNLVGADLGRANLENADLSNATLD 153
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADL +A R N ++AD+ +D G+ +GA LE A KA+ A+L++
Sbjct: 40 ADLSNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLTN 99
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
T + LN A+L A+L + L +DL G + GAD
Sbjct: 100 TDLSSANLNRASLDYALLSKANLINADLSGTNLVGADL 137
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 8/129 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A N +A+ A++ +D S + N A L+ A+ KAN ADL
Sbjct: 68 SWADLRGADLSGANLENANLSKASLDQANLTNTDLSSANLNRASLDYALLSKANLINADL 127
Query: 161 SDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA--- 212
S T + R L A+L+NA L ++L ++ G A ++ A +A I+ A +
Sbjct: 128 SGTNLVGADLGRANLENADLSNATLDNSILISANFGAANLKKASLCNANIERASLEGANL 187
Query: 213 LCKYANGTN 221
+ NGTN
Sbjct: 188 ISANLNGTN 196
>gi|168060251|ref|XP_001782111.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666451|gb|EDQ53105.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 158
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 60/118 (50%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K N + ++A M E+ F G+ + KA A A+F G+ ++ ++DR+ +++++
Sbjct: 44 KTNLKGKTLSAALMSEAKFDGADLTEVIMSKAYAVGASFKGSVFTNAVVDRVAFDKSDMQ 103
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ TVL+ S GA +EGA F +A+I Q LCK NP +R L C
Sbjct: 104 GVQFINTVLSGSTFEGANLEGASFENALIGYVDIQKLCK-----NPTLPEESRIDLAC 156
>gi|307109822|gb|EFN58059.1| hypothetical protein CHLNCDRAFT_57123 [Chlorella variabilis]
Length = 608
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R+ +T AD+R ++ S + G L A+A ANF+GA+L + ++ + L A+L+N
Sbjct: 49 QDLRKNKYTKADLRGTNLSNANLEGVTLFGALATNANFSGANLRNADLELVELEGADLSN 108
Query: 178 AVLVRTVLTRSDLGGAI-IEGADFSDAVIDLAQKQALCKYA 217
AVL +LT + LG I GADF+D V LC+ A
Sbjct: 109 AVLEGAMLTNAQLGRVKSITGADFTDVVFRKDVMMGLCRIA 149
>gi|18406661|ref|NP_566030.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
gi|20141847|sp|O22160.2|TL15A_ARATH RecName: Full=Thylakoid lumenal 15 kDa protein 1, chloroplastic;
AltName: Full=p15; Flags: Precursor
gi|20196925|gb|AAM14836.1| pentapeptide repeat family protein [Arabidopsis thaliana]
gi|330255391|gb|AEC10485.1| thylakoid lumenal protein 1 [Arabidopsis thaliana]
Length = 224
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 172
+ R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N +
Sbjct: 106 QTLIRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTK 160
Query: 173 ANLTNAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
NLTNA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +
Sbjct: 161 VNLTNANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDT 220
Query: 232 LGC 234
L C
Sbjct: 221 LLC 223
>gi|222423354|dbj|BAH19651.1| AT2G44920 [Arabidopsis thaliana]
Length = 224
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 67/119 (56%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NLT
Sbjct: 110 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGGDFSLANVTKVNLT 164
Query: 177 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 165 NANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223
>gi|388504750|gb|AFK40441.1| unknown [Lotus japonicus]
Length = 239
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K N + ++A M ++ F G+ + KA A +F G D S+ ++DR+ +A+L
Sbjct: 126 KSNLKGKTLSAALMSDAKFDGADMTEVVMSKAYAVGGSFKGVDFSNAVLDRVNFEKADLQ 185
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S A +EGA F D +I Q LC+ N R LGC
Sbjct: 186 GAVFKNTVLSGSTFDDAKLEGAVFEDTIIGYIDLQKLCR-----NKTIADDWRVELGC 238
>gi|443477350|ref|ZP_21067204.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443017546|gb|ELS31963.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 670
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 73/146 (50%), Gaps = 12/146 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SA+L+ A V N R+AN A++ +S+ + N A LE A A+ A+L
Sbjct: 521 SEADLNSANLKGANLVLTNLRKANLVKANLSDSNLGAANLNDAILEGADLSAADLRSAEL 580
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----------IIEGADFSDAV-IDLA 208
+ T + L+ ANLT A LV ++L GA IE ADF++AV +D
Sbjct: 581 NLTNLSNANLSSANLTAAKLVLIEFAGANLNGANFRNAIVENIGSIESADFTNAVNLDPI 640
Query: 209 QKQALCKYANGTNPITGVSTRKSLGC 234
++ C A+G +G ST+ +L C
Sbjct: 641 VRKYFCSLASGNVADSGNSTKSTLNC 666
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN + A++ + +K A L+KA K N + ADL+ + L NL A
Sbjct: 484 NLTEANLSQANLLRVNLFQAKLGSANLQKAELMKTNLSEADLNSANLKGANLVLTNLRKA 543
Query: 179 VLVRTVLTRSDLGG-----AIIEGADFSDA 203
LV+ L+ S+LG AI+EGAD S A
Sbjct: 544 NLVKANLSDSNLGAANLNDAILEGADLSAA 573
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S + DLR+ + N D+R D S + A L + +AN + A+L
Sbjct: 436 SGSVLERVDLRQVILKNANLNGVKIVKVDLRGGDLSNASAIDANLSFSNLTEANLSQANL 495
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + L ANL A L++T L+ +DL A ++GA+
Sbjct: 496 LRVNLFQAKLGSANLQKAELMKTNLSEADLNSANLKGANL 535
>gi|397570889|gb|EJK47511.1| hypothetical protein THAOC_33758, partial [Thalassiosira oceanica]
Length = 122
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 64/118 (54%), Gaps = 10/118 (8%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + F + +R++DF G+ GA A ++F GAD++ ++ ++ E ++ A
Sbjct: 11 NLKGVAFQQSIVRDTDFRGTNLFGASFFDATLDGSDFEGADMTLCNVENAIVKEMYVSGA 70
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY--ANGTNPITGVSTRKSLGC 234
L V + IE +D+SD + Q++ LC++ A GTNP+TGV TR+SL C
Sbjct: 71 TLFEGVKS--------IENSDWSDTQLRKDQQKYLCEHPTAKGTNPVTGVDTRESLMC 120
>gi|363807626|ref|NP_001241901.1| uncharacterized protein LOC100785667 [Glycine max]
gi|255647148|gb|ACU24042.1| unknown [Glycine max]
Length = 239
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 63/128 (49%), Gaps = 7/128 (5%)
Query: 109 DLRKA--VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
DLR+ + K N + + ++A M ++ F G+ + KA A A+F G D S+ ++D
Sbjct: 116 DLRQCDFTNEKTNLKGKSPSAALMSDAKFDGADMTEVVMSKAYAAGASFKGVDFSNAVLD 175
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
R+ +A+L A+ TVL+ S A ++ A F D +I Q LC TN G
Sbjct: 176 RVNFEKADLEGAIFKNTVLSGSPFDDAKLDNAVFEDTIIGYIDFQKLC-----TNKTIGD 230
Query: 227 STRKSLGC 234
R LGC
Sbjct: 231 EWRVELGC 238
>gi|224120874|ref|XP_002318440.1| predicted protein [Populus trichocarpa]
gi|222859113|gb|EEE96660.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K N + + +A M ++ F G+ + KA A A+F G D S+ ++DR+ +A+L
Sbjct: 127 KSNLKGKSLAAALMSDAKFDGADMTEVVMSKAYAVGASFRGVDFSNAVLDRVNFGKADLK 186
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S A +E A F D +I Q +C+ N G R LGC
Sbjct: 187 GAVFKNTVLSGSTFDEAQLEDAIFEDTIIGYIDLQKICR-----NTSIGPDGRAELGC 239
>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
Length = 220
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 76/154 (49%), Gaps = 10/154 (6%)
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 117
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
N A+ T + ++D SG+ +GA L + N TGA L T + LN + LT+
Sbjct: 81 ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCL----LNGSQLTD 136
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAV---IDLA 208
A+LV +TRS L GA + GA+ + ++ IDL+
Sbjct: 137 AILVGATMTRSVLSGAHMTGANLNRSILSEIDLS 170
Score = 37.0 bits (84), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE-----ANLTNAV 179
A M S SG+ GA L +++ + + +GA+L+ + R+ LN+ ANLT A
Sbjct: 139 LVGATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGAD 198
Query: 180 LVRTVLTRSDLGGAIIEGADFS 201
L +V+ S+ A + GA+ +
Sbjct: 199 LSESVIQNSNFCIANLTGANLT 220
>gi|297824527|ref|XP_002880146.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
gi|297325985|gb|EFH56405.1| thylakoid lumenal 15 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
Length = 226
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 172
+ R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N +
Sbjct: 108 QTLIRQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTK 162
Query: 173 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
NLTNA L T + G+ I GADF+D + Q++ LCK A+G N TG +TR +
Sbjct: 163 VNLTNANLEGATATGNTSFKGSNITGADFTDVPLRDDQREYLCKIADGVNATTGNATRDT 222
Query: 232 LGC 234
L C
Sbjct: 223 LLC 225
>gi|302831317|ref|XP_002947224.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
nagariensis]
gi|300267631|gb|EFJ51814.1| hypothetical protein VOLCADRAFT_120426 [Volvox carteri f.
nagariensis]
Length = 244
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 64/130 (49%), Gaps = 5/130 (3%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR + ++ A + ++D S + A L KA A KANF AD+++ ++DR+
Sbjct: 101 DLRLCSYSGKDLHGRVLAGALLADADLSNTNLQEAVLTKAYAVKANFENADMTNAVVDRV 160
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
+ ANL TV+T + GA +EG+ + DA+I LC+ NP +
Sbjct: 161 DFSGANLRGVRFNNTVVTGAQFAGADLEGSVWEDALIGSQDVGKLCE-----NPTLTGES 215
Query: 229 RKSLGCGNSR 238
R +GC SR
Sbjct: 216 RMQVGCRVSR 225
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 30/59 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
A L KA VK NF A+ T+A + DFSG+ G V A F GADL ++ +
Sbjct: 135 AVLTKAYAVKANFENADMTNAVVDRVDFSGANLRGVRFNNTVVTGAQFAGADLEGSVWE 193
>gi|219130181|ref|XP_002185250.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403429|gb|EEC43382.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 235
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 2/107 (1%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
M +D S + F AY K + GAD ++ ++DR ++L A+ VLT +
Sbjct: 128 MTNTDASNANFAEAYFSKGYLRDSMLDGADFTNAIVDRATFKGSSLRGAIFANAVLTGTG 187
Query: 190 LGGAIIEGADFSDAVIDLAQKQALCK--YANGTNPITGVSTRKSLGC 234
GA +E ADF+DA I + LCK G NP TG TR S C
Sbjct: 188 FEGADVENADFTDAYIGDFDIRLLCKNPTLKGENPKTGADTRMSANC 234
>gi|384246084|gb|EIE19575.1| hypothetical protein COCSUDRAFT_31020 [Coccomyxa subellipsoidea
C-169]
Length = 203
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 68/145 (46%), Gaps = 5/145 (3%)
Query: 90 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 149
A T G +A DLR ++ + A ++++ S L KA
Sbjct: 61 RAYTGNTIGQANAVSDKVLDLRMCDFTGKDLSGKTLSGALLKDAILPNSTMRETVLTKAY 120
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
A ANF+GAD+++ ++DR+ +ANL+N + V+T + GA ++GA F DA+I
Sbjct: 121 AVGANFSGADMTNAVIDRVDFRKANLSNVKFINAVITGTAFDGANLDGAIFEDALIGNED 180
Query: 210 KQALCKYANGTNPITGVSTRKSLGC 234
+ LC NP +R +GC
Sbjct: 181 VKRLC-----LNPTLTGESRMGVGC 200
>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 351
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 73/148 (49%), Gaps = 7/148 (4%)
Query: 61 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVK 117
R F +L AA+ + N L+ N EA IG S +Q ADL AV +
Sbjct: 21 RNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQLSYADLSMAVLID 80
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
N A+ T + ++D SG+ +GA L + N TGA L T + LN + LT+
Sbjct: 81 ANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLTGASLIGTCL----LNGSQLTD 136
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+LV +TRS L GA + GA+ + +++
Sbjct: 137 AILVGATMTRSVLSGAHMTGANLNRSIL 164
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN TGA+L
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLTGANL 249
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + L ANL+ A L LT ++L GA + AD
Sbjct: 250 NGLTLQCADLRLANLSKADLRGANLTGANLAGANLLEADL 289
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSD 162
A + ++V + AN + + E D SG+ GA +L + AN TGADLS+
Sbjct: 142 ATMTRSVLSGAHMTGANLNRSILSEIDLSGANLTGATLIRVHLNQGNLSGANLTGADLSE 201
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+++ ANLT A L L ++L GA + GA+ + A
Sbjct: 202 SVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGA 242
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEKAVAYK 152
A A+L A N AN T A++ ++ +G+ NG A L KA
Sbjct: 212 ANLTGANLAGANLAGANLNGANLTGANLTGANLTGANLNGLTLQCADLRLANLSKADLRG 271
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN TGA+L+ + L ANLT+A L L + L GA + GA+ + A
Sbjct: 272 ANLTGANLAGANLLEADLRLANLTDANLCGAGLLLTSLRGANLAGANLNQA 322
>gi|212721648|ref|NP_001132583.1| uncharacterized protein LOC100194054 [Zea mays]
gi|194694818|gb|ACF81493.1| unknown [Zea mays]
gi|413933909|gb|AFW68460.1| hypothetical protein ZEAMMB73_478838 [Zea mays]
Length = 225
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 7/128 (5%)
Query: 109 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
DLR + E N + + +A M E+ F G+ + + KA A A+F G D ++ ++D
Sbjct: 102 DLRFCDYTNEKTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVID 161
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
R+ +A+LT A+ TVL+ S A ++ F D +I Q LC TN
Sbjct: 162 RVNFEKADLTGAIFKNTVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISP 216
Query: 227 STRKSLGC 234
R LGC
Sbjct: 217 DARLELGC 224
>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 390
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 10/117 (8%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
+A ADL +A+ +K NF +A+ +SA++ +S+ + F AYL KA +A+ ADLS
Sbjct: 111 SAHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLS 170
Query: 162 DTLMDRMVLNEANLTN----------AVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ + L+ ANLT A L LT+++LG A + GA+ +DA ++LA
Sbjct: 171 SANLKDVNLSAANLTECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLNLA 227
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 25/160 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE---------------- 146
A F ADL A N + NF+ A++ ++ SGS NGA L+
Sbjct: 57 ADFSEADLSGAHLSLANLSKVNFSGANLTGANLSGSSLNGANLQGATLSAVNLESAHLNW 116
Query: 147 ----KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+A+ K NF ADLS + + L AN A L++ L+ +DL A + A+ D
Sbjct: 117 ADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLSSANLKD 176
Query: 203 AVIDLAQKQALCKY--AN--GTNPITGVSTRKSLGCGNSR 238
+ A CK AN G N T+ +LG N R
Sbjct: 177 VNLSAANLTE-CKMTRANLMGANLTEADLTKANLGRANLR 215
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL +A N RAN + A++ ++ NGA+L K A+ G DLS
Sbjct: 227 ASLVEADLHQA-----NLTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSH 281
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L+ + L A L+ A LV +L ++L A + GA+ +A +
Sbjct: 282 KLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQNACL 324
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 44/90 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A N AN T M ++ G+ A L KA +AN GA+L
Sbjct: 160 SEADLFQADLSSANLKDVNLSAANLTECKMTRANLMGANLTEADLTKANLGRANLRGANL 219
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+D ++ L EA+L A L R L+R++L
Sbjct: 220 TDAYLNLASLVEADLHQANLTRANLSRANL 249
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 48/108 (44%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A G DL + N A A + E++ S + +GA L+ A A+
Sbjct: 270 SGADLGGVDLSHKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQNACLINADL 329
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GA +DR+ L +ANLT A L + L ++L AI+ G + A
Sbjct: 330 RGA-----YLDRVDLTDANLTGANLTKADLREANLRAAILAGVELKGA 372
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 57/123 (46%), Gaps = 11/123 (8%)
Query: 84 ADLNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
ADL + + GI A A A L A+ ++ N AN + A+++ + + G
Sbjct: 272 ADLGGVDLSHKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQNACLINADLRG 331
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
AYL++ AN TGA+L+ + L EANL A+L +L GA + GA +
Sbjct: 332 AYLDRVDLTDANLTGANLT-----KADLREANLRAAILAGV-----ELKGAQLAGATLPN 381
Query: 203 AVI 205
I
Sbjct: 382 GKI 384
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 5/90 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L +A K N RAN A++ ++ + + A L +A +AN + A+LS
Sbjct: 192 ANLMGANLTEADLTKANLGRANLRGANLTDAYLNLASLVEADLHQANLTRANLSRANLSK 251
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
T + + LN A+LT + L+ +DLGG
Sbjct: 252 TYLRDICLNGAHLT-----KVNLSGADLGG 276
>gi|326523645|dbj|BAJ92993.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524189|dbj|BAJ97105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 200
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 65/123 (52%), Gaps = 6/123 (4%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--E 172
++F D + S + F GA L A + A+ TGADLSD + D + N +
Sbjct: 77 KDFSGQTLIKQDFKTSILRQTNFKGANLLGASFFDADLTGADLSDADLRNADFSLANVTK 136
Query: 173 ANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKS 231
NLTNA L ++T + G+ I GADF+D + Q+ LCK A+G N TG +T+++
Sbjct: 137 VNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIADGVNTTTGNATKET 196
Query: 232 LGC 234
L C
Sbjct: 197 LFC 199
>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 517
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 28/180 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 208
++ + L A+L+ A L+R + +DL GA + GA +D + +DL+
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 209 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 254
+++L K+ N T PI + SL + N Y + P++ PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 66/136 (48%), Gaps = 19/136 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
+ A+ +A+L KA+ + AN D+ E+ S A L +A KANFT
Sbjct: 77 NVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIRAELIRAKLTKANFTQANL 136
Query: 157 -GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEGADFSDAVI 205
GADL +T + + N ANL+ A L T T++DL GA + ADFS+A +
Sbjct: 137 NGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAEL 196
Query: 206 DLAQKQALCKYANGTN 221
+QA YAN +N
Sbjct: 197 ----RQANLTYANLSN 208
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 128 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 167
AD+RE+ + FNGA L A K N AD S+ + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+NA + DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN +++D+RE + S + N A L A KA A ++ + R+ L EA L N++L+R
Sbjct: 59 ANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIR 118
Query: 183 T-----VLTRSDLGGAIIEGADFSD 202
LT+++ A + GAD +
Sbjct: 119 AELIRAKLTKANFTQANLNGADLRE 143
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 104 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + R LN A L+NA L + +L ++ + A + D ++A
Sbjct: 68 EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109
>gi|340707640|pdb|3N90|A Chain A, The 1.7 Angstrom Resolution Crystal Structure Of
At2g44920, A Pentapeptide Repeat Protein From
Arabidopsis Thaliana Thylakoid Lumen
Length = 152
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 68/119 (57%), Gaps = 11/119 (9%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 176
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NLT
Sbjct: 30 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNLT 84
Query: 177 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
NA L T++ + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 85 NANLEGATMMGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 143
>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 517
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 28/180 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---------SDAV----IDLA- 208
++ + L A+L+ A L+R + +DL GA + GA +D + +DL+
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 209 ----------QKQALCKYANGTNPITGVSTRKSL--GCGNSRRNAYGSPSS--PLLSAPP 254
+++L K+ N T PI + SL + N Y + P++ PP
Sbjct: 309 NGDHSQVYHFDRESLRKFFNQTRPIVEILVNSSLDQDANMALANIYHKIAQEFPVMERPP 368
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 66/136 (48%), Gaps = 19/136 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
+ A+ +A+L KA+ + AN D+ E+ S A L +A KANFT
Sbjct: 77 NVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIRAELIRAKLTKANFTQANL 136
Query: 157 -GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEGADFSDAVI 205
GADL +T + + N ANL+ A L T T++DL GA + ADFS+A +
Sbjct: 137 NGADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAEL 196
Query: 206 DLAQKQALCKYANGTN 221
+QA YAN +N
Sbjct: 197 ----RQANLTYANLSN 208
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 128 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 167
AD+RE+ + FNGA L A K N AD S+ + +
Sbjct: 139 ADLRETKLQQTNFNGANLSGANLRGASGALTKFTKTDLRGADLVKVNLPKADFSNAELRQ 198
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+NA + DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANMRWIDLQGADLSGANLTEA 234
Score = 40.4 bits (93), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN +++D+RE + S + N A L A KA A ++ + R+ L EA L N++L+R
Sbjct: 59 ANLSASDLREVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEAQLINSLLIR 118
Query: 183 T-----VLTRSDLGGAIIEGADFSD 202
LT+++ A + GAD +
Sbjct: 119 AELIRAKLTKANFTQANLNGADLRE 143
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 104 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + R LN A L+NA L + +L ++ + A + D ++A
Sbjct: 68 EVNLSRANLNVARLSNANLTKAILNQATINVANLARVDLTEA 109
>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
Length = 275
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 14/128 (10%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD--------- 134
+D + EAE R +F S + F A++R K N +ANF AD+R+ D
Sbjct: 85 SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140
Query: 135 -FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
F G+ ++VA KA+F GA + D ++R+ LN AN +A + + L R A
Sbjct: 141 IFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGANFQDARMRQAKLDRVKAQNA 200
Query: 194 IIEGADFS 201
GADFS
Sbjct: 201 NFSGADFS 208
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 62/124 (50%), Gaps = 8/124 (6%)
Query: 91 AETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY--- 144
AE RG E G + DL++A+ NF+ ++F + +DFSGS F+GA
Sbjct: 50 AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109
Query: 145 --LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
LEKA KANF ADL D ++ + NEA A + + TRS A +GA D
Sbjct: 110 VDLEKANLNKANFQDADLRDGDLNTVEANEAIFDGADMRNVLFTRSVANKASFKGAKMDD 169
Query: 203 AVID 206
A ++
Sbjct: 170 ANLE 173
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 85 DLNKYEAETRGEFGIGSAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSK 139
DLN EA + A F AD+R ++V K +F+ A A++ D +G+
Sbjct: 131 DLNTVEA---------NEAIFDGADMRNVLFTRSVANKASFKGAKMDDANLERVDLNGAN 181
Query: 140 FNGAY-----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
F A L++ A ANF+GAD S + L ANLT +L R+ L GA
Sbjct: 182 FQDARMRQAKLDRVKAQNANFSGADFSGVRLVSSDLTGANLTGVDFDGALLRRTRLAGAD 241
Query: 195 IEGA 198
+ GA
Sbjct: 242 LSGA 245
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
G A+LR V +F N D++E+ + F + + A +A+F+G+D S
Sbjct: 47 LGLAELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGAN 106
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 205
M + L +ANL A L DL AI +GAD + +
Sbjct: 107 MRSVDLEKANLNKANFQDADLRDGDLNTVEANEAIFDGADMRNVLF 152
>gi|323452967|gb|EGB08840.1| hypothetical protein AURANDRAFT_25565 [Aureococcus anophagefferens]
Length = 176
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 68/135 (50%), Gaps = 5/135 (3%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G + A ++ + F +F+ D +++F+ SK GA KA +A+F+GAD
Sbjct: 46 GGGKDYAEATIKGQDFSGKTFNNKDFSGCDAVDTNFAKSKLRGARFFKADLARADFSGAD 105
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
LS ++ L LT A+ T +++ L + GADF+DAVI ++ LC G
Sbjct: 106 LSAASLEGANLEGTKLTGALAEGTAFSQTILDAGDLTGADFTDAVIQPYVQKGLC----G 161
Query: 220 TNPITGVSTRKSLGC 234
+TG +TR SL C
Sbjct: 162 RKDVTG-ATRDSLFC 175
>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 267
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 56/105 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +A+ +KA + +N T AD+ ++D +G + A L +A + NFTG DL
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTGGDL 191
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S ++ L E NLT L L+R++L G ++ GA+ + ++
Sbjct: 192 SRVMLVGANLAETNLTAVNLSDANLSRAELNGVVLAGANLNRVIL 236
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 58/104 (55%), Gaps = 5/104 (4%)
Query: 104 QFGSADLRKA--VHVK---ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
F A+L KA VH NF A ++A++ +++ S + F A L A + +N T A
Sbjct: 100 NFSEANLIKANLVHAALYCANFFMAMMSAANLSQANMSAANFQKATLISAYLHNSNLTQA 159
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
DLSD + + L++ANL+ A L+RT T DL ++ GA+ ++
Sbjct: 160 DLSDADLTGINLSDANLSQATLIRTNFTGGDLSRVMLVGANLAE 203
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 15/118 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVA 150
S A A+L +A N RAN + A ++ E +FS + A L A
Sbjct: 57 SGANLSGANLIRANLTGANLSRANLSGATLAEVNLSRTNLTEVNFSEANLIKANLVHAAL 116
Query: 151 YKANF-----TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
Y ANF + A+LS M +A L +A L + LT++DL A + G + SDA
Sbjct: 117 YCANFFMAMMSAANLSQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDA 174
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 157
++ DLR N + D+RE++ SG+ +GA L +A +AN +G
Sbjct: 24 SELSQMDLRGMSLCGANLAGMDLRGKDLREANLSGANLSGANLIRANLTGANLSRANLSG 83
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
A L++ + R L E N + A L++ L + L
Sbjct: 84 ATLAEVNLSRTNLTEVNFSEANLIKANLVHAAL 116
>gi|302779862|ref|XP_002971706.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
gi|300160838|gb|EFJ27455.1| hypothetical protein SELMODRAFT_95422 [Selaginella moellendorffii]
Length = 157
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 59/118 (50%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
KE + ++A M ++ F G+ + KA A +F G D ++ ++DR+V ++A++
Sbjct: 43 KEGLKGKTLSAALMADAKFDGADMTEVVMSKAYAVGGSFKGTDFTNAVLDRVVFDKADMK 102
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S GA +E ADF +A+I + LC NP + L C
Sbjct: 103 GAVFRNTVLSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155
>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 508
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 71/145 (48%), Gaps = 10/145 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F DLR+A + N AN + A++R +D SG+ GA L +A AN GA+L
Sbjct: 179 NGADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANLYGANL 238
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S+ ANLTNA LV LT ++L GA GAD S + + A+ + ++
Sbjct: 239 SN----------ANLTNASLVHADLTLANLNGADWVGADLSGSTLSGAKLYDVPRFGIKA 288
Query: 221 NPITGVSTRKSLGCGNSRRNAYGSP 245
+T S NS+ +GSP
Sbjct: 289 EEVTCEWVDLSSNGDNSQVYRFGSP 313
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 13/152 (8%)
Query: 65 STALAAAVVASCSSNISAL--ADLNKYE----AETRGEF-------GIGSAAQFGSADLR 111
S+ L A++ + N++ L ADL++ + A RGE S A ADLR
Sbjct: 75 SSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANLTGADLR 134
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
+A + NF AN + A++R + + + F A L A KA+ GAD S T + + L
Sbjct: 135 EAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKADLNGADFSGTDLRQANLC 194
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ NL+ A L L +DL GA + GAD ++A
Sbjct: 195 QVNLSGANLSGANLRWADLSGANLRGADLNEA 226
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 155
+ A+ S+ L +A+ AN AD+ E+ G+ + A L KA KAN
Sbjct: 69 NVARLSSSHLVRAILQGATLNVANLVRADLSEAQLMGAALIRGELIRAELSKANFSKANL 128
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
TGADL + + + +EANL+ A L T ++ A + GAD S A
Sbjct: 129 TGADLREAKLTEVNFSEANLSGANLRGASGTAANFELANLHGADLSKA 176
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 45/80 (56%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NFT ++ E++ S + A L +A + N +GA+L++ + LN A L+++ LVR
Sbjct: 22 NFTGINLNEANLSRINLSQANLSEASLFVTNLSGANLNEVNLSNANLNVARLSSSHLVRA 81
Query: 184 VLTRSDLGGAIIEGADFSDA 203
+L + L A + AD S+A
Sbjct: 82 ILQGATLNVANLVRADLSEA 101
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N R N + A++ E+ + +GA L + AN A LS + + R +L A L A
Sbjct: 32 NLSRINLSQANLSEASLFVTNLSGANLNEVNLSNANLNVARLSSSHLVRAILQGATLNVA 91
Query: 179 VLVRTVLTRSDL-GGAIIEG 197
LVR L+ + L G A+I G
Sbjct: 92 NLVRADLSEAQLMGAALIRG 111
>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
Length = 439
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 15/118 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRA---------------NFTSADMRESDFSGSKFNGAYLEK 147
A+ G DLRKA K +F RA NF ADM+E++ G+ GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A A+ +GA+L ++ +L ANL A+L L ++L A ++GAD + A +
Sbjct: 345 AFLKGADLSGANLKGAILYGAMLYGANLDGAILTNVSLFDANLEKASLKGADLTGATL 402
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L A K N +A+ + A + +++ G+ + YL+KA N A L
Sbjct: 56 SKANLEDAKLNGANLSKANLSKADLSGASLDKANLEGANLSMTYLKKANMKAVNAAHAWL 115
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+D ++ + +A+L A L R L + + GA +E A DAV+
Sbjct: 116 ADANLNGAFMKDASLKAANLARANLRWAKMSGADLEQASLKDAVL 160
>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
Length = 367
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 60/108 (55%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A N RRAN + A++ E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 212
Query: 161 SDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L+E ANL+ A L L+R+DL GA + AD S A
Sbjct: 213 SRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANLRRADLSGA 260
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A + + AN + A++ E+D S + +GA L +A AN +GA+L
Sbjct: 98 SGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSGANL 157
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S+ + R L+ ANL A L L+ +DL GA + GA+ S+A
Sbjct: 158 SEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEA 200
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A N RA+ + A++ E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 252
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ L A+L+ A L R L+ ++L A + GAD
Sbjct: 253 RRADLSGANLRRADLSGANLRRADLSEANLSEANLSGADL 292
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 7/144 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A + + RA+ + A++ E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 113 SEADLSGANLSGANLSEADLSRADLSGANLSEADLSGANLSGANLSEADLSRADLSGANL 172
Query: 161 SDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ L+E ANL+ A L L+R+DL GA + AD S A +L++
Sbjct: 173 RRANLSGANLSEADLSGANLSGANLSEADLSRADLSGANLSRADLSGA--NLSEADLSGA 230
Query: 216 YANGTNPITGVSTRKSLGCGNSRR 239
+G N +R L N RR
Sbjct: 231 NLSGANLSEADLSRADLSGANLRR 254
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 111 RKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+KA+ + N AN + A+ + E+D SG+ +GA L +A +A+ +GA+L
Sbjct: 84 KKAI-LDYNLSGANLSGANLSEADLSRADLSEADLSGANLSGANLSEADLSRADLSGANL 142
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S+ + L+ ANL+ A L R L+ ++L A + GA+ S+A
Sbjct: 143 SEADLSGANLSGANLSEADLSRADLSGANLRRANLSGANLSEA 185
>gi|302819846|ref|XP_002991592.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
gi|300140625|gb|EFJ07346.1| hypothetical protein SELMODRAFT_133757 [Selaginella moellendorffii]
Length = 157
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 60/118 (50%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K+ + ++A M ++ F G+ + KA A A+F G D ++ ++DR+V ++A++
Sbjct: 43 KDGLKGKTLSAALMADAKFDGADMTEVVMSKAYAVGASFKGTDFTNAVLDRVVFDKADMK 102
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S GA +E ADF +A+I + LC NP + L C
Sbjct: 103 GAVFRNTVLSGSTFQGANLENADFENALIGYNDARKLC-----LNPTLSEESTIELAC 155
>gi|33240880|ref|NP_875822.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
gi|33238409|gb|AAQ00475.1| Secreted pentapeptide repeats protein [Prochlorococcus marinus
subsp. marinus str. CCMP1375]
Length = 184
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 65/126 (51%), Gaps = 8/126 (6%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
L ++H +N + + D+ D + +G+Y + KA+ GA++ + +
Sbjct: 46 LDTSLH-GQNLQNTEYVKYDLSGRDLGDADLSGSYFSVSNLQKADLRGANMQNVIAYATR 104
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+ A+L+NA L +S GA+I+G +F++AV+DL Q ++LC+ A G T
Sbjct: 105 FDNADLSNANFSGAELLKSRFDGAVIDGTNFTNAVLDLPQVKSLCERATG-------QTA 157
Query: 230 KSLGCG 235
+SL CG
Sbjct: 158 ESLECG 163
>gi|428222198|ref|YP_007106368.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995538|gb|AFY74233.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 225
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 10/106 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ ADL A N +A + A++ ++ SG+ + +L +AV AN ADL+
Sbjct: 26 AELNDADLSGA-----NLSKARMSGAELNRANMSGANLHSTHLNRAVMKNANLENADLTG 80
Query: 163 TLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAIIEGADFSDA 203
M + L+EANLTNA L V + LT ++L GAI+ ADFS++
Sbjct: 81 AKMMEVNLSEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNS 126
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 53/119 (44%), Gaps = 10/119 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
S A S L +AV N A+ T A M E + S + A L ++N T
Sbjct: 54 SGANLHSTHLNRAVMKNANLENADLTGAKMMEVNLSEANLTNANLSNVSGVESNLTMANL 113
Query: 157 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
AD S++ + ++ L A+L A+ T LT +DL G ++G + S A + +A
Sbjct: 114 AGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNLTGADLSGINLKGVNLSGANLSMAN 172
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 58/122 (47%), Gaps = 10/122 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 155
S A +A+L V+ N AN A + +DFS S + GA L+ A+ N
Sbjct: 89 SEANLTNANLSNVSGVESNLTMANLAGAILSSADFSNSNLSKVNLVGADLQGAIFSNTNL 148
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
TGADLS + + L+ ANL+ A L +L GGA I A+F+ + A + +
Sbjct: 149 TGADLSGINLKGVNLSGANLSMANLSGAIL-----GGANITKANFAQTDLSNADLRDVNI 203
Query: 216 YA 217
YA
Sbjct: 204 YA 205
>gi|298243143|ref|ZP_06966950.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297556197|gb|EFH90061.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 338
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 54/90 (60%), Gaps = 10/90 (11%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN A++RE+DFSG+ +G+ + +GADLS ++ R +L A+L+ A+L
Sbjct: 95 ANLVGANLREADFSGNDLSGS----------DLSGADLSRAILRRAILRRADLSEAILRD 144
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
VL R+DL A + GAD +DA + A++ A
Sbjct: 145 AVLRRADLTDADLRGADLTDADLTGAKRDA 174
>gi|449016876|dbj|BAM80278.1| similar to thylakoid lumenal 17.4 kD protein, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 288
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 68/150 (45%), Gaps = 20/150 (13%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS----- 161
S+ + +A ++ F F D+R DFSG +G LE A A +A F LS
Sbjct: 141 SSTIGQANAARDKFLDGRF--CDLRGRDFSGYDLSGVLLEGATADEARFRSTQLSKAYAP 198
Query: 162 ----------DTLMDRMVLNEANLTNAVLVRTVLTRSDLG-GAIIEGADFSDAVIDLAQK 210
D ++DR+ A+L+ +V VL+ S G + DF+D I
Sbjct: 199 GFKCRRCDFEDAVVDRVNFENADLSGSVFRNAVLSDSMFSDGTNVRDVDFTDVYIGEYGL 258
Query: 211 QALCKYA--NGTNPITGVSTRKSLGCGNSR 238
+ LC+ +G NP+TG TR SLGC R
Sbjct: 259 RRLCRNPTLDGENPLTGAPTRASLGCRAER 288
>gi|413947393|gb|AFW80042.1| putative homeobox DNA-binding domain superfamily protein [Zea mays]
Length = 202
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 1/114 (0%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
+ +F ++ +R+++F G+ GA A A+ + ADL + L +ANL+NA L
Sbjct: 88 KQDFKTSILRQANFKGANLLGASFFDADLTSADLSDADLRGADLSLANLTKANLSNANLE 147
Query: 182 RTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ T + GA I GADF+D + Q++ LCK A+G N TG T+++L C
Sbjct: 148 GALATGNTSFKGADITGADFTDVPLRDDQREYLCKIADGVNSTTGNPTKETLFC 201
>gi|428308708|ref|YP_007119685.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250320|gb|AFZ16279.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 294
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 48/84 (57%), Gaps = 5/84 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NFRRA T+A + G+ GA L A + N GADLS ++R L +ANLT A
Sbjct: 186 NFRRAKLTAATL-----EGANLTGANLTDAQLNRVNLQGADLSGANLERACLEDANLTGA 240
Query: 179 VLVRTVLTRSDLGGAIIEGADFSD 202
+L RT L+ +++ G + G DFSD
Sbjct: 241 ILRRTQLSEANMSGTKLYGVDFSD 264
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
++ RAN + A M +S G+K +GA L A AN GA+L + ++R+ L +ANL
Sbjct: 78 IETELTRANLSGAFMVKSLLPGAKMSGADLMGANLRGANLWGANLCGSQLERVNLRDANL 137
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFS 201
L+ + L GA++ G+ +
Sbjct: 138 MGVNFKWANLSEARLMGAMLYGSSLN 163
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 50/111 (45%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA----------YK 152
+Q +LR A + NF+ AN + A + + GS N A + +A
Sbjct: 125 SQLERVNLRDANLMGVNFKWANLSEARLMGAMLYGSSLNFANMSRAWLKGVDLGGFNLEG 184
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NF A L+ ++ L ANLT+A L R L +DL GA +E A DA
Sbjct: 185 VNFRRAKLTAATLEGANLTGANLTDAQLNRVNLQGADLSGANLERACLEDA 235
>gi|434384824|ref|YP_007095435.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428015814|gb|AFY91908.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 377
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 91/210 (43%), Gaps = 24/210 (11%)
Query: 33 PLWVACQISSKTESDGQFP---GPYAKLKNWRVF---------------VSTALAAAVVA 74
P+ +A QI S+ + D P K + W ++ + ALA +
Sbjct: 105 PVAIASQIQSERDVDKLLKILQHPEEKERIWAIYELQSAIEVNPELHWEIMQALATFIRT 164
Query: 75 SCSSNISALADLNKYEA-ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES 133
+ + + + EA G I DL + ++ N +RAN A++ +
Sbjct: 165 NSPEDKQGEIESDIQEALNVIGNRNIDRDIPLSLIDLAQTNLIRANLKRANLQGANLEGA 224
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
D G+ GA L+KA +AN GA+L ++ + L ANL A+L+R L ++L GA
Sbjct: 225 DLEGANLQGANLKKANLKRANLQGANLMIANLEGINLVRANLEGAILIRANLEGANLEGA 284
Query: 194 IIEG-----ADFSDAVIDLAQKQALCKYAN 218
+EG A+F A + A QA +AN
Sbjct: 285 NLEGAILLLANFKGAYLSKANLQACHGHAN 314
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N RAN A + ++ G+ GA LE A+ ANF GA LS + + AN A
Sbjct: 260 NLVRANLEGAILIRANLEGANLEGANLEGAILLLANFKGAYLSKANL-QACHGHANFAGA 318
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L + +DL GA +EGA+ A
Sbjct: 319 YLSKANFEGADLEGANLEGANLQRA 343
>gi|218187501|gb|EEC69928.1| hypothetical protein OsI_00358 [Oryza sativa Indica Group]
Length = 191
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 14/117 (11%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
R +F ++ +R+++F G+K GA + A+ TGADLSD L A+ + A +
Sbjct: 84 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSDA-----DLRGADFSLANVS 133
Query: 182 RTVLTRSDLGGAIIEG----ADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ LT ++L GA+ G DF+D + Q++ LCK A+G N TG +T+++L C
Sbjct: 134 KVNLTNANLEGALATGNTTFKDFTDVPLRDDQREYLCKIADGVNTTTGNATKETLFC 190
>gi|150016367|ref|YP_001308621.1| pentapeptide repeat-containing protein [Clostridium beijerinckii
NCIMB 8052]
gi|149902832|gb|ABR33665.1| pentapeptide repeat protein [Clostridium beijerinckii NCIMB 8052]
Length = 1084
Score = 60.5 bits (145), Expect = 8e-07, Method: Composition-based stats.
Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 10/144 (6%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
ADL++ + G S F ADL A+ V+ +A+F+ A + E+ G+ FN +
Sbjct: 914 ADLSRASMDYTGL----SYCNFEKADLSYAILVESGVSKADFSEASLSEAHIEGTFFNKS 969
Query: 144 YLEKAV-----AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
EKA ++++F + + + V+ E+N NA + T L DL A + GA
Sbjct: 970 KFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESNFKNATFINTCLRNVDLEEADLTGA 1029
Query: 199 DFSDAVIDLAQ-KQALCKYANGTN 221
D S+A + A+ +A+ + N TN
Sbjct: 1030 DMSNANLSNAKINKAIFEGTNLTN 1053
Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats.
Identities = 42/127 (33%), Positives = 54/127 (42%), Gaps = 22/127 (17%)
Query: 101 SAAQFGSADLRKAVHV------KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
S A F A L +A H+ K F +A+ M SDF FN A L AV ++N
Sbjct: 947 SKADFSEASLSEA-HIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVMRESN 1005
Query: 155 FTGADLSDTLMDRMVLNEANLT---------------NAVLVRTVLTRSDLGGAIIEGAD 199
F A +T + + L EA+LT A+ T LT DL IE D
Sbjct: 1006 FKNATFINTCLRNVDLEEADLTGADMSNANLSNAKINKAIFEGTNLTNVDLTNVDIENID 1065
Query: 200 FSDAVID 206
FS +ID
Sbjct: 1066 FSKTIID 1072
Score = 43.1 bits (100), Expect = 0.14, Method: Composition-based stats.
Identities = 33/113 (29%), Positives = 52/113 (46%), Gaps = 11/113 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES--DFSG---SKFNGAYLEKAV-----AYK 152
A FG A+L + H+ NF AD+ + D++G F A L A+ K
Sbjct: 890 ANFGYANLNDS-HISGTLYNCNFKEADLSRASMDYTGLSYCNFEKADLSYAILVESGVSK 948
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+F+ A LS+ ++ N++ A L+ T + RSD A+ S AV+
Sbjct: 949 ADFSEASLSEAHIEGTFFNKSKFEKASLIMTQMWRSDFEDCNFNHANLSSAVM 1001
Score = 40.4 bits (93), Expect = 0.88, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 43/104 (41%), Gaps = 11/104 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A L K N ANF A++ +S SG+ +N NF ADLS
Sbjct: 870 ADFSYAKLDNLEIGKLNAENANFGYANLNDSHISGTLYN-----------CNFKEADLSR 918
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
MD L+ N A L +L S + A A S+A I+
Sbjct: 919 ASMDYTGLSYCNFEKADLSYAILVESGVSKADFSEASLSEAHIE 962
>gi|428219581|ref|YP_007104046.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427991363|gb|AFY71618.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 508
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 74/140 (52%), Gaps = 8/140 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A F A+L A K + ANF+ AD+R ++ SG+ NGA L +A +AN
Sbjct: 172 SVASFNGANLTGASLAKLDLSGLDLSDANFSGADLRGANLSGANLNGADLSRANLSRANL 231
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALC 214
+ A+LS T R LNEANL+ A L + L+R+DL A + AD A + +++ A
Sbjct: 232 SRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANLSMSKLAGAYL 291
Query: 215 KYAN--GTNPITGVSTRKSL 232
AN GTN I+ TR L
Sbjct: 292 VRANLLGTNLISADLTRAVL 311
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 63/136 (46%), Gaps = 17/136 (12%)
Query: 101 SAAQFGSADLRKAVHVKEN----------FRRANFTSADMRESDFSG-----SKFNGAYL 145
S A DL KA V+ N F AN T A + + D SG + F+GA L
Sbjct: 147 SGANLSQVDLSKATLVEANLKDAKLSVASFNGANLTGASLAKLDLSGLDLSDANFSGADL 206
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A AN GADLS + R L+ ANL+ VRT L ++L A + G++ S A
Sbjct: 207 RGANLSGANLNGADLSRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLSRA-- 264
Query: 206 DLAQKQALCKYANGTN 221
DL++ + +G N
Sbjct: 265 DLSRANLIKADLHGAN 280
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
SADL +AV ++ + RAN T A++ +D + + A +A AN G DL+ +
Sbjct: 303 SADLTRAVLIEADLFRANLTEANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLT 362
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ +A + A+L++T L+ + L GA A+ S A++
Sbjct: 363 GVYAIDAEIVGAILIKTNLSEASLAGANFVRANLSRAIL 401
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + RAN AD+ ++ S SK GAYL +A N ADL+ R VL EA+L
Sbjct: 263 RADLSRANLIKADLHGANLSMSKLAGAYLVRANLLGTNLISADLT-----RAVLIEADLF 317
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDA 203
A L L+R+DL A + A F +A
Sbjct: 318 RANLTEANLSRADLNRANLTEASFIEA 344
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ DL +A + N +RAN T A + +D A L +A +AN GA+LS
Sbjct: 49 AELSRIDLSRADLSESNLKRANLTEAVLVGADLISINLGRATLTEANLNRANLIGANLSG 108
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+L EA+L L + LT++DL GA + GAD S A
Sbjct: 109 A-----ILVEADLARCDLRVSNLTKADLMGANLSGADLSVA 144
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 15/116 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY---------------LEK 147
A ADL +A + +F AN SA++ +D + + G Y L +
Sbjct: 324 ANLSRADLNRANLTEASFIEANLISANLCGTDLTRANLTGVYAIDAEIVGAILIKTNLSE 383
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A ANF A+LS ++ L+EANL A L ++ ++L GA +E AD S A
Sbjct: 384 ASLAGANFVRANLSRAILSGASLSEANLGRANLYGANMSEANLSGANLENADLSRA 439
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 5/114 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L +A + NF R A++ E+ SGS + A L +A KA+ GA+L
Sbjct: 222 SRANLSRANLSRANLSRTNFVRTELNEANLSEASLSGSNLSRADLSRANLIKADLHGANL 281
Query: 161 SDTLMDRMVLNEANL--TNAV---LVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
S + + L ANL TN + L R VL +DL A + A+ S A ++ A
Sbjct: 282 SMSKLAGAYLVRANLLGTNLISADLTRAVLIEADLFRANLTEANLSRADLNRAN 335
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 45/101 (44%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DLR + K + AN + AD+ ++ SG+ + L KA +AN A LS
Sbjct: 114 ADLARCDLRVSNLTKADLMGANLSGADLSVANLSGANLSQVDLSKATLVEANLKDAKLSV 173
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
N ANLT A L + L+ DL A GAD A
Sbjct: 174 A-----SFNGANLTGASLAKLDLSGLDLSDANFSGADLRGA 209
>gi|427713339|ref|YP_007061963.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427377468|gb|AFY61420.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 327
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 60/122 (49%), Gaps = 16/122 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ ADLR AV N A+ AD+R G+ GA L K KAN TGADL
Sbjct: 48 SGAKLQRADLRGAVLSAINLNHADLIGADLR-----GAMLMGADLRKVNLRKANLTGADL 102
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANG 219
+ ANLT A+L LT +D+ AI+ GAD + + LA+ +Q AN
Sbjct: 103 T----------RANLTGAILSEANLTAADMSQAILRGADLTLTDLTLAELEQVNLSQANL 152
Query: 220 TN 221
TN
Sbjct: 153 TN 154
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 66/152 (43%), Gaps = 45/152 (29%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLRK N R+AN T AD+ ++ +G+ + A L A +A GADL+
Sbjct: 80 AMLMGADLRKV-----NLRKANLTGADLTRANLTGAILSEANLTAADMSQAILRGADLTL 134
Query: 163 T-----LMDRMVLNEANLTNA----------VLVRTVLTRSDLGGA-------------- 193
T ++++ L++ANLTNA +L+ L +++L GA
Sbjct: 135 TDLTLAELEQVNLSQANLTNAYLRGADMADAILLEATLIQANLRGANLRNCNLQGANLQK 194
Query: 194 -----------IIEGADFSDAVIDLAQKQALC 214
+EGA+ +A + A + C
Sbjct: 195 TNLRGANLRQARLEGANLREATLTEANLRYAC 226
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 48/103 (46%), Gaps = 10/103 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
AD+ A+ ++ +AN A++R + G+ L A +A GA+L +
Sbjct: 160 ADMADAILLEATLIQANLRGANLRNCNLQGANLQKTNLRGANLRQARLEGANLREA---- 215
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 205
L EANL A L L +DL GA ++ GA ++A++
Sbjct: 216 -TLTEANLRYACLDEACLIGADLRGASLARAMLRGAQLNEAIL 257
>gi|427729960|ref|YP_007076197.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427365879|gb|AFY48600.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 937
Score = 60.1 bits (144), Expect = 9e-07, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 57/104 (54%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A A+L+ A + N + AN A++ ++ G+ GA L++A+ +A GA+L
Sbjct: 812 GANLYGANLQGANLQRANLQGANLQRANLYGANLEGANLYGANLQRAILQRAILEGANLQ 871
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ R L ANL A+L R L ++L GA +EGA+ +A++
Sbjct: 872 RAILQRANLEGANLQRAILQRANLEGANLEGANLEGANLQEAIL 915
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S+ F A+ ++A N + AN A+++ ++ + GA L++A Y AN GA
Sbjct: 789 ILSSKDFYMANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGANLEGA 848
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+L + R +L A L A L R +L R++L EGA+ A++ A
Sbjct: 849 NLYGANLQRAILQRAILEGANLQRAILQRANL-----EGANLQRAILQRA 893
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 33/100 (33%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+ + ++F ANF A+++ ++ G+ GA L+ A +AN GA+L +
Sbjct: 784 DLQNCILSSKDFYMANFQRANLQGANLQGANLYGANLQGANLQRANLQGANLQRANLYGA 843
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
L ANL A L R +L R AI+EGA+ A++ A
Sbjct: 844 NLEGANLYGANLQRAILQR-----AILEGANLQRAILQRA 878
>gi|443476541|ref|ZP_21066442.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018491|gb|ELS32731.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 400
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 89/188 (47%), Gaps = 29/188 (15%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----G 137
L + N EA F I A A L +A V N AN TSA M +D S G
Sbjct: 61 LVEANLAEANLTSAFLI--RADLQRACLNQAYLVAANLNSANLTSASMVNADLSLATLTG 118
Query: 138 SKFNGAYLEKA-----VAYKANFTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTR 187
+ NGA L +A ++N GADLSD+ LM + L+ ANL+ A L+ LT
Sbjct: 119 ACLNGANLSRAKLNGTFFIESNLLGADLSDSDFTGALMIKANLSGANLSQACLMNVDLTE 178
Query: 188 SDLGGAIIEGADFSDAVIDLAQKQAL-CKYANGTNPITGVSTRKS-------LGCGNSRR 239
++L GA ++G D + A+++ A A+ YAN ++GVS ++ LG +
Sbjct: 179 ANLTGAELQGVDLAGAILNAANLNAVDLVYAN----LSGVSLSRANLSWANLLGTNLEKT 234
Query: 240 NAYGSPSS 247
N GS S
Sbjct: 235 NLVGSDLS 242
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 46/92 (50%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+A+L A N AN + AD+ D GS L A+ +AN TGA+L +
Sbjct: 280 NLSNANLSGANLSGANLMGANLSGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEA 339
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
+++ LN ANL A L R LT ++L GA +
Sbjct: 340 VLNGASLNRANLNRASLTRASLTGANLKGAFM 371
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +L A +K N AN ++ ++ SG+ +GA L AN +GADLS+
Sbjct: 254 ADLSWTNLTGAFLMKSNLSGANLNGVNLSNANLSGANLSGANL-----MGANLSGADLSN 308
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ L NL NA+L LT ++L A++ GA + A ++
Sbjct: 309 VDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLN 352
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 50/99 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L + + + N T A + +S+ SG+ NG L A AN +GA+L +
Sbjct: 244 ANLNETNLAEADLSWTNLTGAFLMKSNLSGANLNGVNLSNANLSGANLSGANLMGANLSG 303
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L+ +L + L+RT L + L A + GA+ +AV++
Sbjct: 304 ADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLN 342
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 15/104 (14%)
Query: 105 FGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F A+L K++ N + R N + A + ++ S + GA+L +A +AN A+
Sbjct: 6 FTKANLTKSILEGINLKGADLKRVNLSEAKLADAKLSKANLTGAFLHRADLNRANLVEAN 65
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L+ EANLT+A L+R L R+ L A + A+ + A
Sbjct: 66 LA----------EANLTSAFLIRADLQRACLNQAYLVAANLNSA 99
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 10/100 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A + DLR + ++ N AN T A++ E+ +G+ N A L +A +A+
Sbjct: 302 SGADLSNVDLRGSYLIRTNLHNAILNEANLTGANLDEAVLNGASLNRANLNRASLTRASL 361
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
TGA+L M NL A ++ T L +++ GAI+
Sbjct: 362 TGANLKGAFMLW-----TNLRGAFMLWTNLDGANMTGAIL 396
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 17/107 (15%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +L K V + AN ++ E+D S + GA+L K+N +GA+L
Sbjct: 222 SWANLLGTNLEKTNLVGSDLSWANLNETNLAEADLSWTNLTGAFL-----MKSNLSGANL 276
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
N NL+NA L L+ ++L GA + GAD S+ +DL
Sbjct: 277 ----------NGVNLSNANLSGANLSGANLMGANLSGADLSN--VDL 311
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 20/99 (20%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL +A V+ N AN TSA + +D + N AYL A
Sbjct: 54 ADLNRANLVEANLAEANLTSAFLIRADLQRACLNQAYLVAAN------------------ 95
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
LN ANLT+A +V L+ + L GA + GA+ S A ++
Sbjct: 96 --LNSANLTSASMVNADLSLATLTGACLNGANLSRAKLN 132
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 25/126 (19%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-VAY---------- 151
A A+L +A + + AN T A+++ D +G+ N A L + Y
Sbjct: 159 ANLSGANLSQACLMNVDLTEANLTGAELQGVDLAGAILNAANLNAVDLVYANLSGVSLSR 218
Query: 152 --------------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
K N G+DLS ++ L EA+L+ L L +S+L GA + G
Sbjct: 219 ANLSWANLLGTNLEKTNLVGSDLSWANLNETNLAEADLSWTNLTGAFLMKSNLSGANLNG 278
Query: 198 ADFSDA 203
+ S+A
Sbjct: 279 VNLSNA 284
>gi|159903945|ref|YP_001551289.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
gi|159889121|gb|ABX09335.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9211]
Length = 184
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 63/125 (50%), Gaps = 8/125 (6%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
L ++H +N + + D+ D + +G+Y + A+ GA+L + +
Sbjct: 46 LDTSLH-GQNLQNTEYVKYDLSGRDLGDANLSGSYFSVSSLKNADLRGANLQNVIAYATR 104
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+ A+L+ A L L +S GA+IEG DF++AV+DL Q ++LC+ A G T
Sbjct: 105 FDNADLSGANLSGAELLKSVFNGAVIEGTDFTNAVLDLPQVKSLCERATG-------KTA 157
Query: 230 KSLGC 234
+SL C
Sbjct: 158 ESLQC 162
>gi|91070460|gb|ABE11370.1| pentapeptide repeats [uncultured Prochlorococcus marinus clone
HOT0M-10G7]
Length = 157
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 66/134 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A +G L A + + A F D+++++ SG + A L A N + ++L
Sbjct: 21 AALDYGKQSLVGADFSGSDLKGATFYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNL 80
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ +D VL+ +L+N L + + I+GADF++ + + C+ A+GT
Sbjct: 81 REVTLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVREFCEIASGT 140
Query: 221 NPITGVSTRKSLGC 234
NPIT TR++L C
Sbjct: 141 NPITNRDTRETLEC 154
>gi|332712234|ref|ZP_08432162.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332349040|gb|EGJ28652.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 280
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 54/99 (54%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A NF RA+ + A++ ++ +G+ F GA L A AN TGA+LS+T +
Sbjct: 171 ADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNTDLKG 230
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L ANL L R L RSDL A+ GA+F + ++
Sbjct: 231 SNLTGANLNGTDLARADLERSDLRDAMTNGANFENTNLN 269
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 2/98 (2%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ + AD+ ++ +G+ F+ A L +A AN TGAD + + L+ ANLT A L T
Sbjct: 167 DLSGADLTNANLTGANFSRADLSQANLSNANLTGADFAGADLANADLSGANLTGANLSNT 226
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
L S+L GA + G D + A DL + NG N
Sbjct: 227 DLKGSNLTGANLNGTDLARA--DLERSDLRDAMTNGAN 262
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 11/99 (11%)
Query: 105 FGSADLRKAVHVKENFRR-ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
GS + K + + + AN + AD D SG+ A L ANF+ ADLS
Sbjct: 137 LGSGHINKCTPLIDKYLSGANLSGADCTNVDLSGADLTNANL-----TGANFSRADLS-- 189
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L+ ANLT A L +DL GA + GA+ S+
Sbjct: 190 ---QANLSNANLTGADFAGADLANADLSGANLTGANLSN 225
>gi|397645344|gb|EJK76787.1| hypothetical protein THAOC_01435 [Thalassiosira oceanica]
Length = 224
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 57/118 (48%), Gaps = 2/118 (1%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N++ +FT + + FS S G KA A+F+GAD + ++ ANL N
Sbjct: 106 NYKGKDFTQIIAKGTIFSKSNLQGCRFYKAYLVNADFSGADARGAAFEDTSMDGANLRNI 165
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 234
V + +S L +EG DF+DA I + +C + GTNP TG TR SL C
Sbjct: 166 VASGSYFGQSLLDVESLEGGDFTDAQIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 223
>gi|242034055|ref|XP_002464422.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
gi|241918276|gb|EER91420.1| hypothetical protein SORBIDRAFT_01g017890 [Sorghum bicolor]
Length = 221
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K N + + +A M E+ F G+ + + KA A A+F G D ++ ++DR+ +A+LT
Sbjct: 108 KTNLKGKSLAAALMSEAKFDGADMSEVVMSKAYAVGASFKGTDFTNAVIDRVNFEKADLT 167
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+ VL+ S A ++ F D +I Q LC TN +R LGC
Sbjct: 168 GAIFKNAVLSGSTFDDAKMDDVVFEDTIIGYIDLQKLC-----TNTSISPDSRLELGC 220
>gi|428224166|ref|YP_007108263.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984067|gb|AFY65211.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 583
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 61/107 (57%), Gaps = 7/107 (6%)
Query: 103 AQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A G A+LR+ VH++E N R A A + E++ SG A L + ++A GADLS
Sbjct: 155 ADLGGANLRE-VHLEEANLREAKLVEASLIEANLSGCYLRQANLSGSDLHRAILAGADLS 213
Query: 162 DTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDA 203
+ ++ L+ ANLT A L++T L R+DL A++ ADFS+A
Sbjct: 214 EAVLHGADLSRANLTGAYLLKTSLRNARLLRADLQDALLLRADFSEA 260
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DL A N + + + AD+ + S + N A L +A + AN A L+ L
Sbjct: 17 FAQVDLTGANLSGANLQDIDLSGADLTGVNLSWAYLNRANLTEASLHHANLRNASLNSAL 76
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+DR VL+ A+LT A L +L +D AI++ AD S A + AQ
Sbjct: 77 LDRAVLSGADLTKAELCLALLRGADCNWAILQEADLSGANLHGAQ 121
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 59/112 (52%), Gaps = 10/112 (8%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAY-----KA 153
A+LR+A V+ + AN + +R+++ SGS + GA L +AV + +A
Sbjct: 166 HLEEANLREAKLVEASLIEANLSGCYLRQANLSGSDLHRAILAGADLSEAVLHGADLSRA 225
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
N TGA L T + L A+L +A+L+R + ++L GA + AD S A +
Sbjct: 226 NLTGAYLLKTSLRNARLLRADLQDALLLRADFSEANLRGADLRRADLSGAYL 277
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ S L +A ++ N +RA+ +AD+ G+ +LE+A +A A L +
Sbjct: 130 AKLNSTLLNEAKLMEANLKRASLVNADL-----GGANLREVHLEEANLREAKLVEASLIE 184
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L +ANL+ + L R +L +DL A++ GAD S A
Sbjct: 185 ANLSGCYLRQANLSGSDLHRAILAGADLSEAVLHGADLSRA 225
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 110 LRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
LR+A + RA AD+ E+ D S + GAYL K A ADL D L
Sbjct: 192 LRQANLSGSDLHRAILAGADLSEAVLHGADLSRANLTGAYLLKTSLRNARLLRADLQDAL 251
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ R +EANL A L R L+ + L +I+ AD +A +
Sbjct: 252 LLRADFSEANLRGADLRRADLSGAYLSHSILCEADLGEAYL 292
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 4/114 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A +K + R A AD++++ + F+ A L A +A+ +GA LS
Sbjct: 220 ADLSRANLTGAYLLKTSLRNARLLRADLQDALLLRADFSEANLRGADLRRADLSGAYLSH 279
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
+++ L EA L + +RT L + L G I+ D +DL+ Q C+Y
Sbjct: 280 SILCEADLGEAYLLQSHFIRTNLDNACLTGCCIDNWQLED--VDLSNVQ--CQY 329
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA R A+ A ++E+D SG+ +GA L++ +A L+
Sbjct: 80 AVLSGADLTKAELCLALLRGADCNWAILQEADLSGANLHGAQLDQVTLERAK-----LNS 134
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
TL++ L EANL A LV +DLGGA + +A
Sbjct: 135 TLLNEAKLMEANLKRASLV-----NADLGGANLREVHLEEA 170
>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
8327]
gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 439
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 72/128 (56%), Gaps = 5/128 (3%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S + G A+L + N ++++F SAD+ +++ +G+ G +A KAN GA+L
Sbjct: 279 SEEKLGDANLEEVDLSNANLKQSDFESADLDKANLAGANLAGGNFSRADMEKANLKGANL 338
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANG 219
++DR + +A+L+NA L L + L GA ++GAD ++A + D ++A K G
Sbjct: 339 EGAVLDRAFMKQADLSNANLRNANLFGAMLSGANLDGADLTNASLFDANLEKASLK---G 395
Query: 220 TNPITGVS 227
TN +TG +
Sbjct: 396 TN-LTGAN 402
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 68/149 (45%), Gaps = 7/149 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LRKA +RA+ AD+ E+ + A+L+ A +AN +G +L
Sbjct: 81 SGASLDQANLRKANLSMTYLKRADLKKADLSEAWMVSANLRDAFLKDARLSRANLSGTNL 140
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQALCKYANG 219
+ L +ANL +A L T R++L G + A F +AV++ A K +N
Sbjct: 141 RWAKLWDADLGQANLKDANLFETSFERANLKGTLFTKARFLENAVMNDA------KVSNN 194
Query: 220 TNPITGVSTRKSLGCGNSRRNAYGSPSSP 248
T +G + ++ R PS+P
Sbjct: 195 TVIPSGEPASRGWAMRHNSRFVQEEPSAP 223
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 61/128 (47%), Gaps = 15/128 (11%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-T 163
F SADL KA N NF+ ADM +++ G+ GA L++A +A+ + A+L +
Sbjct: 303 FESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLSNANLRNAN 362
Query: 164 LMDRMV--------------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L M+ L +ANL A L T LT ++L G + GA S + + +
Sbjct: 363 LFGAMLSGANLDGADLTNASLFDANLEKASLKGTNLTGANLIGINLTGAAISSSTLTPSG 422
Query: 210 KQALCKYA 217
K A +A
Sbjct: 423 KPATRSWA 430
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 12/106 (11%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANFTGADLSDT 163
DL KA N AN + A++ ++D SG+ + A L KA + Y +A+ ADLS+
Sbjct: 54 DLSKANLEDANLDGANLSEANLSKADLSGASLDQANLRKANLSMTYLKRADLKKADLSEA 113
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
M ANL +A L L+R++L G + A DA DL Q
Sbjct: 114 WMV-----SANLRDAFLKDARLSRANLSGTNLRWAKLWDA--DLGQ 152
>gi|242052129|ref|XP_002455210.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
gi|241927185|gb|EES00330.1| hypothetical protein SORBIDRAFT_03g006310 [Sorghum bicolor]
Length = 200
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 63/119 (52%), Gaps = 4/119 (3%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
+K++F+ + A+ + ++ G+ F A L A A+ GAD S + + L+ ANL
Sbjct: 85 IKQDFKTSILRQANFKGANLLGASFFDADLTSADLSDADLRGADFSLANLTKTNLSNANL 144
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
A+ V + GA I GADF+D + Q++ LCK A+G N TG T+++L C
Sbjct: 145 EGAL----VTGNTSFKGANITGADFTDVPLRDDQREYLCKIADGVNSTTGNPTKETLFC 199
>gi|78779034|ref|YP_397146.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
gi|78712533|gb|ABB49710.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9312]
Length = 157
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 69/136 (50%), Gaps = 4/136 (2%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A +G L A + + A F D+++++ SG + A L A N + ++L
Sbjct: 21 AALDYGKQSLVGADFSGSDLKGATFYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNL 80
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--DLAQKQALCKYAN 218
+ +D VL+ +L+N L + + I+GADF++ + D+ +K C+ A+
Sbjct: 81 REVTLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIVRK--FCESAS 138
Query: 219 GTNPITGVSTRKSLGC 234
GTNPIT TR++L C
Sbjct: 139 GTNPITNRDTRETLEC 154
>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 1011
Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 15/103 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +ADLR A N RAN + A++R ++ SG+ +G YL A +AN A+
Sbjct: 850 SGADLRTADLRSA-----NLIRANLSDANLRSANLSGANLSGVYLNSADLRRANLNDAN- 903
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LN+A+L+ A L L+ +DL GA + ADFS A
Sbjct: 904 ---------LNDADLSGANLRSADLSGADLSGADLSVADFSSA 937
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD-----RMVLNEA 173
N R ++ + AD+R +D + A L A AN +GA+LS ++ R LN+A
Sbjct: 843 NLRTSDLSGADLRTADLRSANLIRANLSDANLRSANLSGANLSGVYLNSADLRRANLNDA 902
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NL +A L L +DL GA + GAD S A
Sbjct: 903 NLNDADLSGANLRSADLSGADLSGADLSVA 932
Score = 43.1 bits (100), Expect = 0.14, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 37/77 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S SADLR+A N A+ + A++R +D SG+ +GA L A AN A+L
Sbjct: 885 SGVYLNSADLRRANLNDANLNDADLSGANLRSADLSGADLSGADLSVADFSSANLGAANL 944
Query: 161 SDTLMDRMVLNEANLTN 177
+ L+ NL N
Sbjct: 945 GAANLSGANLSGVNLNN 961
>gi|158340319|ref|YP_001521675.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310560|gb|ABW32174.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 284
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 15/141 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F ++ L++++ + A+F+ AD+R +DFS +K + A L++ +AN GADL
Sbjct: 68 SGVNFKASKLQRSLAIWVQAYWADFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127
Query: 161 SDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
SD++ ANLTNA L + + L GA + +D S + +
Sbjct: 128 SDSVAQDSCFKGANLWGVWAQRANLTNACLSHVDMATAKLTGAQLLDSDLSWSCL----S 183
Query: 211 QALCKYANGTNP-ITGVSTRK 230
QA+CK AN T+ + G RK
Sbjct: 184 QAVCKGANLTSACLEGSDLRK 204
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 50/119 (42%), Gaps = 20/119 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA---- 158
A F A L A + +F +AN AD+ +S S F GA L A +AN T A
Sbjct: 100 ADFSCAKLSAAQLKRTDFSQANLMGADLSDSVAQDSCFKGANLWGVWAQRANLTNACLSH 159
Query: 159 ----------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
DLS + + + V ANLT+A L + L + D A + AD S
Sbjct: 160 VDMATAKLTGAQLLDSDLSWSCLSQAVCKGANLTSACLEGSDLRKIDFRDACLSRADLS 218
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 60/139 (43%), Gaps = 31/139 (22%)
Query: 76 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE--- 132
CS ++ + KY A R QF +L A + N +F+ + + E
Sbjct: 14 CSKDLQKFWE--KYHASER---------QFAGTNLPGANFYQMNLSGFDFSHSRLSEVNL 62
Query: 133 --SDFSGSKFNGAYLEKAVA-----YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 185
+D SG F + L++++A Y A+F+ AD L A+ + A L L
Sbjct: 63 IWADISGVNFKASKLQRSLAIWVQAYWADFSDAD----------LRHADFSCAKLSAAQL 112
Query: 186 TRSDLGGAIIEGADFSDAV 204
R+D A + GAD SD+V
Sbjct: 113 KRTDFSQANLMGADLSDSV 131
>gi|83955651|ref|ZP_00964231.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
gi|83839945|gb|EAP79121.1| hypothetical protein NAS141_07590 [Sulfitobacter sp. NAS-14.1]
Length = 189
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 64/117 (54%), Gaps = 11/117 (9%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N EA+ RG + A G ADLR A + R A+ + A++ +D SG+K GA L
Sbjct: 12 NLTEADLRGA-DLREADLSGRADLRGA-----DLREADLSGAELFYADLSGAKLIGAILS 65
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+A+ AN +GADL R+ L+ A+L+ +L+ LT +DL GA + AD S A
Sbjct: 66 RAILISANLSGADLR-----RVDLSGADLSGTILIGANLTGADLTGANLSSADLSGA 117
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 2/119 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ A L +A+ + N A+ D+ +D SG+ GA L A AN + ADL
Sbjct: 55 SGAKLIGAILSRAILISANLSGADLRRVDLSGADLSGTILIGANLTGADLTGANLSSADL 114
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
S + M+L ANL+ A L R L+ ++L GA + AD A +L + Y NG
Sbjct: 115 SGANLSGMILRGANLSGANLSRADLSGANLSGASVTEADLGGA--NLTEANLTRTYLNG 171
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 46/95 (48%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + + N A+ T A++ +D SG+ +G L A AN + ADLS +
Sbjct: 87 ADLSGTILIGANLTGADLTGANLSSADLSGANLSGMILRGANLSGANLSRADLSGANLSG 146
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ EA+L A L LTR+ L GA + SD
Sbjct: 147 ASVTEADLGGANLTEANLTRTYLNGATLCNTTMSD 181
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 20/138 (14%)
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYE---AETRGEFGIG--------SA 102
YA L + + L+ A++ S+N+S ADL + + A+ G IG +
Sbjct: 51 YADLSGAK-LIGAILSRAIL--ISANLSG-ADLRRVDLSGADLSGTILIGANLTGADLTG 106
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADL A R AN + A++ +D SG+ +GA + +A AN T A+L+
Sbjct: 107 ANLSSADLSGANLSGMILRGANLSGANLSRADLSGANLSGASVTEADLGGANLTEANLT- 165
Query: 163 TLMDRMVLNEANLTNAVL 180
R LN A L N +
Sbjct: 166 ----RTYLNGATLCNTTM 179
>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 331
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 121
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 122 -RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
R + S ++R +D G+ G L A +AN TGA+L++ ++ +LN+ NL+ L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLTECVLRGAILNQTNLSETNL 199
Query: 181 VRTVLTRSDLGGAIIEGA 198
+LT +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 6/126 (4%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSK 139
LN+Y + + G+ A+ +ADL A +F+ ANF A ++ ++ ++
Sbjct: 7 LNQYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAR 66
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
GA L +A A T AD T++ L +ANLT A LV L ++DL GA ++GAD
Sbjct: 67 LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126
Query: 200 FSDAVI 205
A +
Sbjct: 127 LRGACL 132
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 51/119 (42%), Gaps = 17/119 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYL----- 145
S AQ AD + + R+AN T AD+R ++ G+ GA L
Sbjct: 78 SGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGADLRGACLRGANM 137
Query: 146 --EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
E+ + N GADL T + + L A+LT A L LT L GAI+ + S+
Sbjct: 138 RYERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLTECVLRGAILNQTNLSE 196
>gi|428226754|ref|YP_007110851.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427986655|gb|AFY67799.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 330
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 72/143 (50%), Gaps = 13/143 (9%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L D N ++A+ G G A ADL AV + N A+ +A++ +D +G+ G
Sbjct: 77 LVDANLHDADLHGASLRG--ADLRGADLSLAVLLDANLMDADLRNANLSGADLTGACLRG 134
Query: 143 AYLEK-----------AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
A L + ++ YKA+ G +LS + R+ L EANLT A L T L+ +DL
Sbjct: 135 ANLRQEMRSQHTNLRGSILYKADLRGVNLSGADLTRVDLREANLTEASLRETDLSGADLS 194
Query: 192 GAIIEGADFSDAVIDLAQKQALC 214
GA + GA SDA ++ A + C
Sbjct: 195 GANLTGALLSDACLEGAILEGAC 217
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR A + N +AN A+++ + ++ GA L++ + +A T DLS +
Sbjct: 218 LRNAKLERANLSQANLFRANLQNALLPQARLTGAGLQQTIFAQAKLTDVDLSRADLFEAD 277
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L EANLT A L RT LTR++L A++ A+ S A
Sbjct: 278 LREANLTGAYLARTNLTRANLSDALLVRAELSSA 311
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 22/110 (20%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 148
Y+A+ RG ADL + + R AN T A +RE+D SG+ +G
Sbjct: 154 YKADLRG-------VNLSGADL-----TRVDLREANLTEASLRETDLSGADLSG------ 195
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
AN TGA LSD ++ +L A L NA L R L++++L A ++ A
Sbjct: 196 ----ANLTGALLSDACLEGAILEGACLRNAKLERANLSQANLFRANLQNA 241
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A S+DL A + N R+AN A + ++ S + A L A + A+ GADL
Sbjct: 38 SQADLRSSDLFFAYLNRANLRQANLLGARLSGANLSQATLVDANLHDADLHGASLRGADL 97
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ VL +ANL +A L L+ +DL GA + GA+
Sbjct: 98 RGADLSLAVLLDANLMDADLRNANLSGADLTGACLRGANL 137
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 42/85 (49%), Gaps = 15/85 (17%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + AD+R SD F AYL +A +AN GA LS ANL+ A LV
Sbjct: 36 NLSQADLRSSDLF---F--AYLNRANLRQANLLGARLSG----------ANLSQATLVDA 80
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLA 208
L +DL GA + GAD A + LA
Sbjct: 81 NLHDADLHGASLRGADLRGADLSLA 105
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 150
S +ADL + N +A+ S+D +R+++ G++ +GA L +A
Sbjct: 18 SGRNLSNADLTNVDLIGINLSQADLRSSDLFFAYLNRANLRQANLLGARLSGANLSQATL 77
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AN ADL + L A+L+ AVL+ L +DL A + GAD + A +
Sbjct: 78 VDANLHDADLHGASLRGADLRGADLSLAVLLDANLMDADLRNANLSGADLTGACL 132
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A+ A+L +A + N + +A T A ++++ F+ +K L +A ++A+
Sbjct: 221 AKLERANLSQANLFRANLQNALLPQARLTGAGLQQTIFAQAKLTDVDLSRADLFEADLRE 280
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+L+ + R L ANL++A+LVR L+ ++L A ++ A D +
Sbjct: 281 ANLTGAYLARTNLTRANLSDALLVRAELSSANLMDANLQRAVLPDGKV 328
>gi|115482792|ref|NP_001064989.1| Os10g0502000 [Oryza sativa Japonica Group]
gi|22165076|gb|AAM93693.1| hypothetical protein [Oryza sativa Japonica Group]
gi|31432906|gb|AAP54482.1| Thylakoid lumenal 17.4 kDa protein, chloroplast precursor,
putative, expressed [Oryza sativa Japonica Group]
gi|113639598|dbj|BAF26903.1| Os10g0502000 [Oryza sativa Japonica Group]
gi|125532544|gb|EAY79109.1| hypothetical protein OsI_34214 [Oryza sativa Indica Group]
gi|125575308|gb|EAZ16592.1| hypothetical protein OsJ_32066 [Oryza sativa Japonica Group]
gi|215704684|dbj|BAG94312.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 236
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 61/128 (47%), Gaps = 7/128 (5%)
Query: 109 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
DLR + E N + + +A M +S F G+ + + KA A A+F G D ++ ++D
Sbjct: 113 DLRFCDYTNEKTNLKGKSLAAALMSDSKFDGADMSEVVMSKAYAVGASFKGTDFTNAVID 172
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
R+ +A+L A+ TVL+ S A ++ F D +I Q LC TN
Sbjct: 173 RVNFEKADLQGAIFRNTVLSGSTFDDAKMQDVVFEDTIIGYIDLQKLC-----TNTSISA 227
Query: 227 STRKSLGC 234
+R LGC
Sbjct: 228 DSRLELGC 235
>gi|383763954|ref|YP_005442936.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384222|dbj|BAM01039.1| hypothetical protein CLDAP_29990 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 244
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 64/123 (52%), Gaps = 12/123 (9%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK--- 139
L + N YEA+ S A ADLR A + R A AD+R+++ +G+
Sbjct: 87 LREANLYEADL-------SNAVLDQADLRYATLERAVLRSATLRGADLRDANLAGADLRV 139
Query: 140 --FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
F+GA +E+A+ A+ A+L++ ++ R L ANL NAVL L +DL GA + G
Sbjct: 140 ADFSGAQMERAILTGASLVDANLANAVLRRADLRNANLRNAVLRYADLRGADLSGADLMG 199
Query: 198 ADF 200
AD
Sbjct: 200 ADL 202
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + R NFT A + +++ S + A L +A +AN ADLS+ ++D+ L A L
Sbjct: 54 RADLNRVNFTEASLNQANLSRATLLMAILSRAQLREANLYEADLSNAVLDQADLRYATLE 113
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
AVL L +DL A + GAD A AQ +
Sbjct: 114 RAVLRSATLRGADLRDANLAGADLRVADFSGAQME 148
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 50/121 (41%), Gaps = 20/121 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A+ + R AN AD+ + + A LE+AV A GADL D
Sbjct: 70 ANLSRATLLMAILSRAQLREANLYEADLSNAVLDQADLRYATLERAVLRSATLRGADLRD 129
Query: 163 ---------------TLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
M+R +L +ANL NAVL R L ++L A++ AD
Sbjct: 130 ANLAGADLRVADFSGAQMERAILTGASLVDANLANAVLRRADLRNANLRNAVLRYADLRG 189
Query: 203 A 203
A
Sbjct: 190 A 190
>gi|119486763|ref|ZP_01620738.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
gi|119456056|gb|EAW37189.1| hypothetical protein L8106_10952 [Lyngbya sp. PCC 8106]
Length = 331
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 62/123 (50%), Gaps = 9/123 (7%)
Query: 80 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR--RANFTSADMRESDFSG 137
++ L D N +A+ RG F ADLR A N R R + ++R +D G
Sbjct: 104 LAILLDANLIQADLRG-------VNFQGADLRGACLRGANLRYERRIYDGVNLRGADLRG 156
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
+ G L A +AN GA+L++T++ +L +ANLT A L LT +DL GA + G
Sbjct: 157 ADLQGVNLTGADLTRANLRGANLAETVLRGAILKQANLTQANLQSAFLTEADLSGARLIG 216
Query: 198 ADF 200
A+
Sbjct: 217 ANL 219
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 60/106 (56%), Gaps = 10/106 (9%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTGADLSDTL 164
LR A+ + N +AN SA + E+D SG++ GA LE+A+ +A G +L D++
Sbjct: 184 LRGAILKQANLTQANLQSAFLTEADLSGARLIGANLRKVKLERAILIEAQLPGVELCDSI 243
Query: 165 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 205
+ + L+ ANL+ A L RT L TR+DL A + AD +DA +
Sbjct: 244 LPDVKLSSANLSGADLSRTNLVRADLTRTDLSNANLTQADLTDASV 289
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 12/106 (11%)
Query: 104 QFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
Q G D R +H++ N A+ A+ +D GS+F AYL +AN A LS
Sbjct: 11 QAGERDFRD-IHLRNANLNSADLIDANFNHADLQGSEFVFAYLNSVNFVRANLGSAKLSG 69
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+++ L+ ANL++A DL GA+++GADF A + LA
Sbjct: 70 AYLNKANLSGANLSDA----------DLHGAVLQGADFRKANLSLA 105
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 59/134 (44%), Gaps = 24/134 (17%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF----------- 135
N+Y+A R F LR A + ANF AD++ S+F
Sbjct: 8 NRYQAGER---------DFRDIHLRNANLNSADLIDANFNHADLQGSEFVFAYLNSVNFV 58
Query: 136 ----SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
+K +GAYL KA AN + ADL ++ +ANL+ A+L+ L ++DL
Sbjct: 59 RANLGSAKLSGAYLNKANLSGANLSDADLHGAVLQGADFRKANLSLAILLDANLIQADLR 118
Query: 192 GAIIEGADFSDAVI 205
G +GAD A +
Sbjct: 119 GVNFQGADLRGACL 132
>gi|254424332|ref|ZP_05038050.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
gi|196191821|gb|EDX86785.1| DnaJ domain protein [Synechococcus sp. PCC 7335]
Length = 411
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 10/77 (12%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ + A+++E DFSG +GA N +GADLSDT M ++ LN ANL A L R
Sbjct: 298 DMSGANLKEKDFSGRNLSGA----------NLSGADLSDTFMHKVNLNRANLRKARLFRA 347
Query: 184 VLTRSDLGGAIIEGADF 200
L ++DL A + GAD
Sbjct: 348 NLLQADLSHADLSGADL 364
>gi|223995969|ref|XP_002287658.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
[Thalassiosira pseudonana CCMP1335]
gi|220976774|gb|EED95101.1| thylakoid lumenal 17.4 kDa protein, chloroplast precursor
[Thalassiosira pseudonana CCMP1335]
Length = 245
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 5/110 (4%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
M ++D S +F A K +NF GAD ++ ++DR ++L AV VLT +
Sbjct: 128 MTKTDVSNGQFKEAQFSKGYLRDSNFDGADFTNAIVDRASFKGSSLKGAVFKNAVLTATS 187
Query: 190 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRR 239
GA +E ADF+DA I + LCK NP VS + N++R
Sbjct: 188 FEGADVENADFTDAYIGDFDIRTLCK-----NPTLKVSRFYRMTYRNAQR 232
>gi|158316060|ref|YP_001508568.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
gi|158111465|gb|ABW13662.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
Length = 411
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + RRAN T A++ ++D +G++ A L A+ ++A TGA L + L A+LT
Sbjct: 282 RADLRRANLTDAELVDADLTGARLADATLAGALLFRATLTGAQLGRADLTGAQLGGADLT 341
Query: 177 NAVLVRTVLTRSDLGGA-----IIEGADFSDAVIDLAQKQ 211
NAVL +L + L GA ++GAD + A LAQKQ
Sbjct: 342 NAVLDEAILADAVLSGANLTNARLDGADLT-AATGLAQKQ 380
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 6/99 (6%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+FT + + D + + A L A A+ TGA L+D + +L A LT A
Sbjct: 269 DFTGGSLDDVDLARADLRRANLTDAELVDADLTGARLADATLAGALLFRATLTGA----- 323
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLA-QKQALCKYANGTN 221
L R+DL GA + GAD ++AV+D A A+ AN TN
Sbjct: 324 QLGRADLTGAQLGGADLTNAVLDEAILADAVLSGANLTN 362
>gi|386828484|ref|ZP_10115591.1| putative low-complexity protein [Beggiatoa alba B18LD]
gi|386429368|gb|EIJ43196.1| putative low-complexity protein [Beggiatoa alba B18LD]
Length = 986
Score = 58.5 bits (140), Expect = 3e-06, Method: Composition-based stats.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 25/109 (22%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTG 157
+N R +F+ D+R +DFSG+ A + A+ Y ANF+
Sbjct: 645 QNLRGQDFSGQDLRYADFSGADLTDALFKNAILYHVNFSNATLKNADFTKTDLSNANFSD 704
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
ADL+D L +L AN ++A L T++DL A+F+DA+ D
Sbjct: 705 ADLTDALFKNAILQHANFSDATLKNADFTKTDL-----SNANFTDAICD 748
Score = 40.4 bits (93), Expect = 0.83, Method: Composition-based stats.
Identities = 22/76 (28%), Positives = 36/76 (47%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A L+ A K + ANF+ AD+ ++ F + A A A+FT DLS+
Sbjct: 682 FSNATLKNADFTKTDLSNANFSDADLTDALFKNAILQHANFSDATLKNADFTKTDLSNAN 741
Query: 165 MDRMVLNEANLTNAVL 180
+ +E +L A +
Sbjct: 742 FTDAICDEVSLLGATV 757
>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 268
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 50/85 (58%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN +AD+ E++ ++ NGAYL KA YKAN A LS + R +EANL+ A
Sbjct: 160 NLIEANLINADLSEANLYEAQLNGAYLYKANFYKANLHQAHLSGAYLFRANFSEANLSCA 219
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L + LT ++L GA ++GA+ A
Sbjct: 220 NLTWSNLTGANLAGANLQGANLRGA 244
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 52/101 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A+L+ A+ + N + A++RE+D S +K A L A +AN ADLS+
Sbjct: 114 ADLSTANLQGAIIAEANLIGTDLRDANLRETDLSTAKLIRANLGFANLIEANLINADLSE 173
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ LN A L A + L ++ L GA + A+FS+A
Sbjct: 174 ANLYEAQLNGAYLYKANFYKANLHQAHLSGAYLFRANFSEA 214
Score = 43.5 bits (101), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA +LR A N + + + A + ++ S + +GA L +A +AN + A+L
Sbjct: 32 SAANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEANLSEANL 91
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 203
S + L +ANL+ A L+ L+ ++L GAII G D DA
Sbjct: 92 SVANLSGATLTQANLSYAHLIGADLSTANLQGAIIAEANLIGTDLRDA 139
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD--RMV--- 169
+K + AN ++R ++ G N L A+ +AN + ADLS + +++
Sbjct: 26 QMKLDISAANLKGENLRGANLQGVNLNKVDLSHALLVRANLSNADLSGANLHQAKLIEAN 85
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L+EANL+ A L LT+++L A + GAD S A
Sbjct: 86 LSEANLSVANLSGATLTQANLSYAHLIGADLSTA 119
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 55/119 (46%), Gaps = 15/119 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 150
S A A+L +A ++ N AN T A++ + G+ + A L+ A+
Sbjct: 67 SNADLSGANLHQAKLIEANLSEANLSVANLSGATLTQANLSYAHLIGADLSTANLQGAII 126
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+AN G DL D L E +L+ A L+R L ++L A + AD S+A + AQ
Sbjct: 127 AEANLIGTDLRDA-----NLRETDLSTAKLIRANLGFANLIEANLINADLSEANLYEAQ 180
>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
WPP163]
Length = 846
Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats.
Identities = 46/161 (28%), Positives = 77/161 (47%), Gaps = 13/161 (8%)
Query: 70 AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSAD 129
A++ SCS + A+ ++ T + S + SAD +A + N R+A+ A
Sbjct: 687 GALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAV 745
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S
Sbjct: 746 -----FALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQ 800
Query: 190 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 230
LGGA GA+ A DL+Q + + T + G T++
Sbjct: 801 LGGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834
Score = 39.3 bits (90), Expect = 2.0, Method: Composition-based stats.
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 10/96 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F L++A+ F A FT RE+ F+ F+ A L + + + G D
Sbjct: 606 SRAHFKDTQLQEALFDHCTFAEATFTELLFRETWFTQCGFHRATLNACIFMELSLPGLDF 665
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
SD A LT +++ L R+ GA+++
Sbjct: 666 SD----------AKLTKTTFLKSTLERATFNGALLD 691
Score = 37.7 bits (86), Expect = 4.7, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 47/96 (48%), Gaps = 11/96 (11%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGAD-LSDTLMDRMVLNE 172
+F A T +S + FNGA L+ + A +A FTGA L+ + +N
Sbjct: 664 DFSDAKLTKTTFLKSTLERATFNGALLDSCSWVETQANEARFTGATWLTSAVASGSSMNS 723
Query: 173 ANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDA 203
A+ T A L ++ L ++ L GA+ +E +D S+A
Sbjct: 724 ADFTQATLRQSNLRQASLIGAVFALAKLENSDLSEA 759
>gi|428216484|ref|YP_007100949.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988266|gb|AFY68521.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 673
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 7/110 (6%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGAD 159
F + DL + N +A + E+DFS ++ GA L A+A A+ F+GAD
Sbjct: 416 FANVDLSGEILSGAELNEINLQNALLSETDFSDARLGGANLTGAIATGADLRGVDFSGAD 475
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L++ + +++E NLT A L+R L ++DL A++ GA+ A DL+Q
Sbjct: 476 LTEANLTNAIMSEVNLTGARLLRANLKQADLNFAVLRGAELMRA--DLSQ 523
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 10/102 (9%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKA 148
I S A+ +L+ A+ + +F A T AD+R DFSG+ A L A
Sbjct: 425 ILSGAELNEINLQNALLSETDFSDARLGGANLTGAIATGADLRGVDFSGADLTEANLTNA 484
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ + N TGA L + + LN A L A L+R L+++DL
Sbjct: 485 IMSEVNLTGARLLRANLKQADLNFAVLRGAELMRADLSQTDL 526
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 28/131 (21%)
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A+++ES+ S ++ A LE AV A+ A+L ++ L E +L++A L T+
Sbjct: 549 ANLQESNLSAAELENAQLEAAVLLLADLRSANLKLANLNYADLREVDLSSADL-----TQ 603
Query: 188 SDLGG----------------AIIEGADFSDAVIDLAQ--KQALCKYANGT----NPITG 225
++L G A I+GADF+D V++LA K CK A G +P
Sbjct: 604 ANLIGANLSGANLRGTDVNQLASIDGADFTD-VVNLADTSKTYFCKIAAGQTFAESPEQR 662
Query: 226 VSTRKSLGCGN 236
+TR +L C N
Sbjct: 663 RATRATLDCPN 673
>gi|123965950|ref|YP_001011031.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9515]
gi|123200316|gb|ABM71924.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9515]
Length = 157
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +DL+ A + + AN + D++ + G+K N + ++L +
Sbjct: 35 FSGSDLKGATFYLTDLQDANLSDCDLQNASLYGAKLK----------DTNLSNSNLREVT 84
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D VL+ +LTN L + + I+GADF++ + + CK A+GTNP T
Sbjct: 85 LDSAVLDGTDLTNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIVREFCKEASGTNPFT 144
Query: 225 GVSTRKSLGC 234
TR++L C
Sbjct: 145 NRETRETLEC 154
>gi|73669894|ref|YP_305909.1| hypothetical protein Mbar_A2409 [Methanosarcina barkeri str.
Fusaro]
gi|72397056|gb|AAZ71329.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 234
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 62/112 (55%), Gaps = 3/112 (2%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L++A +K N +A+ A++ +D G+ GA ++A +AN GADLS+
Sbjct: 27 ANFQDANLQEAYLIKANLTQADLQGANLYRADLRGADLRGANFQEANLQEANLQGADLSN 86
Query: 163 TLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
+ + + L ANL A L LTR++L GA ++GA+ + I LA Q
Sbjct: 87 SYLLEGIGTNLQGANLQGANLQGANLTRANLKGANLKGANLQLSNIHLANLQ 138
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 71/169 (42%), Gaps = 27/169 (15%)
Query: 40 ISSKTESDGQFPGPYAKLKN--WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEF 97
I K PG + + N W F L A + + + L N Y A+ RG
Sbjct: 4 IEKKNYRGVNLPGAHLEKNNLIWANFQDANLQEAYLIKANLTQADLQGANLYRADLRG-- 61
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF---NGAYLEKAVAYKAN 154
ADLR A NF+ AN A+++ +D S S G L+ A AN
Sbjct: 62 ----------ADLRGA-----NFQEANLQEANLQGADLSNSYLLEGIGTNLQGANLQGAN 106
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GA+L+ R L ANL A L + + ++L GA ++GA+F A
Sbjct: 107 LQGANLT-----RANLKGANLKGANLQLSNIHLANLQGANLQGANFQGA 150
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
+ + ++ G GA+LEK ANF A+L + + + ANLT A L L R+D
Sbjct: 4 IEKKNYRGVNLPGAHLEKNNLIWANFQDANLQEAYLIK-----ANLTQADLQGANLYRAD 58
Query: 190 LGGAIIEGADFSDAVIDLAQKQ 211
L GA + GA+F +A + A Q
Sbjct: 59 LRGADLRGANFQEANLQEANLQ 80
>gi|85860772|ref|YP_462974.1| pentapeptide repeat-containing protein [Syntrophus aciditrophicus
SB]
gi|85723863|gb|ABC78806.1| pentapeptide repeat domain protein [Syntrophus aciditrophicus SB]
Length = 306
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A + DLR+A + AN T AD+++S+ S + N L +AN +GADL
Sbjct: 157 SEANLSNTDLREADLHGADLSDANLTGADLQKSNLSKANLNWTRL-----REANLSGADL 211
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S+ + R L +ANL+ A LV L R++L G + GAD +A +
Sbjct: 212 SEAYLKRADLRKANLSRANLVDANLNRANLRGTDLRGADLGNANL 256
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A K N +AN +RE++ SG+ + AYL++A KAN + A+L D
Sbjct: 174 ADLSDANLTGADLQKSNLSKANLNWTRLREANLSGADLSEAYLKRADLRKANLSRANLVD 233
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ANL A L T L +DLG A + GAD +A +
Sbjct: 234 ----------ANLNRANLRGTDLRGADLGNANLAGADLREANL 266
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 5/104 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G DL A + +A+ + A+++E+D SG+ + A L A N GADLS+
Sbjct: 39 ADLGGMDLCNA-----DLGKADLSEANLQETDLSGANLHKADLNGANLKGVNLVGADLSE 93
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++ L+EA+L A L RT L++ +L G + A+ S+ +D
Sbjct: 94 ACLNGADLSEADLGKADLRRTCLSKVNLRGTKLIEANLSNTDLD 137
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A + DL + +N RR A++ E++ S + A L A AN TGADL
Sbjct: 129 ANLSNTDLDEVELRGQNLRRTKLIGANLSEANLSNTDLREADLHGADLSDANLTGADLQK 188
Query: 163 TLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ + + LN EANL+ A L L R+DL A + A+ DA ++ A
Sbjct: 189 SNLSKANLNWTRLREANLSGADLSEAYLKRADLRKANLSRANLVDANLNRA 239
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 10/100 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLRKA N RAN A++ ++ G+ GA L AN GADL
Sbjct: 212 SEAYLKRADLRKA-----NLSRANLVDANLNRANLRGTDLRGADL-----GNANLAGADL 261
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + + L A L A L T L+ +D G + AD
Sbjct: 262 REANLGKTCLRGARLQGAKLNETDLSDADFTGVDLSEADL 301
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 49/118 (41%), Gaps = 29/118 (24%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G ADLR+ K N R A++ +D + G L + GA+L
Sbjct: 102 SEADLGKADLRRTCLSKVNLRGTKLIEANLSNTDLDEVELRGQNL-----RRTKLIGANL 156
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQALCK 215
S EANL+N +DL A + GAD SDA + DL QK L K
Sbjct: 157 S----------EANLSN----------TDLREADLHGADLSDANLTGADL-QKSNLSK 193
>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 517
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 68/122 (55%), Gaps = 5/122 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +A+LR+A N A+F+ A++R +D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY----A 217
++ + L A+L+ A L+R + +DL GA + GA + + +L + C++ A
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 218 NG 219
NG
Sbjct: 309 NG 310
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 73/135 (54%), Gaps = 10/135 (7%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN 154
G+ +F DLR A +K N +A+FT+A++R+++ S + F+GA L A+
Sbjct: 166 GALTKFTKTDLRGADLLKANLPKADFTNAELRQANLTYANLSNADFSGANLRWTDLQGAD 225
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDAVIDLAQ 209
+GA+L++ + L+ ANL++AVLV+ L +DL A + GAD S A + A+
Sbjct: 226 LSGANLTEANLSGANLSGANLSSAVLVKASLVHADLSQANLIRANWSGADLSGATLTGAK 285
Query: 210 KQALCKYANGTNPIT 224
+ ++ + IT
Sbjct: 286 LYQVSRFNLKADEIT 300
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 128 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 167
AD+RES + FNGA L A KAN AD ++ + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+NA L +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 22/144 (15%)
Query: 93 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
TR + A+ +A+L KA+ + AN AD+ E+ + A L +A K
Sbjct: 72 TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128
Query: 153 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 197
ANFT GADL ++ + + N ANL+ A L T T++DL GA +
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188
Query: 198 ADFSDAVIDLAQKQALCKYANGTN 221
ADF++A + +QA YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 104 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYAIVKRYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R LN A L+NA L + +L ++ + A + AD ++A
Sbjct: 68 GVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEA 109
>gi|323454309|gb|EGB10179.1| hypothetical protein AURANDRAFT_23610 [Aureococcus anophagefferens]
Length = 107
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII---- 195
FN A L A + A+ G D M ++ L A+L+NA L LT + + GA+I
Sbjct: 6 FNKAQLFSASFFDADLAGTTFVDADMKQVNLEMADLSNADLTNADLTEAYMAGAVIKDLK 65
Query: 196 --EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
+ D++D + Q+ LC A GTNP TG+ TR +L C
Sbjct: 66 KIDNTDWTDVDMRKDQRTYLCSIAKGTNPKTGMDTRDTLMC 106
>gi|126696014|ref|YP_001090900.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9301]
gi|126543057|gb|ABO17299.1| Pentapeptide repeats [Prochlorococcus marinus str. MIT 9301]
Length = 157
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 64/134 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A +G L A + + A F D+++++ SG + A L A N + ++L
Sbjct: 21 AALDYGKQSLVGADFSGSDLKGATFYLTDLQDANLSGCELQNATLYGAKLKDTNLSNSNL 80
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ +D VL+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 81 REVTLDSAVLDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIKKFCESATGT 140
Query: 221 NPITGVSTRKSLGC 234
NP T TR++L C
Sbjct: 141 NPFTNRETRETLEC 154
>gi|119487930|ref|ZP_01621427.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455506|gb|EAW36644.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 276
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SADL A + N R N T++ + E+ G+ AYL +A NFT ADL
Sbjct: 38 SGANLISADLSHANLCQTNLRGINLTNSTLSEARLRGADLCDAYLSEA-----NFTRADL 92
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S+ + L EANLT+A LV T L ++L A ++ A+ S+A
Sbjct: 93 SEAQLLNAYLKEANLTHAQLVNTNLNGANLSNAKLQNANLSNA 135
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ ADL A ANFT AD+ E+ + A L A N GA+L
Sbjct: 68 SEARLRGADLCDAY-----LSEANFTRADLSEAQLLNAYLKEANLTHAQLVNTNLNGANL 122
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S+ L ANL+NA L+ TVLT +L GA + GA+ +
Sbjct: 123 SNA-----KLQNANLSNANLLNTVLTGVNLTGANLNGANLT 158
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 5/96 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F ADL +A + + AN T A + ++ NGA L A AN + A+L
Sbjct: 83 SEANFTRADLSEAQLLNAYLKEANLTHAQLVNTNL-----NGANLSNAKLQNANLSNANL 137
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
+T++ + L ANL A L L R +L G I+
Sbjct: 138 LNTVLTGVNLTGANLNGANLTGVELCRVNLNGTQID 173
>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
Length = 567
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 157
A A+L KAV V N RR N + A++ ++ + F+GAYL +A +AN G
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLEGANLKK 477
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+LS M L A+L A L L R DL GA + G F DA
Sbjct: 478 ANLSGANMSHASLRGADLRRATLKDANLKRVDLVGANLAGVTFLDA 523
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 10/87 (11%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NFRRANF + K AYL A ++AN G +L + L +A L +
Sbjct: 354 NFRRANFAAL----------KLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGS 403
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L++ L +++L A +EGA+ + AV+
Sbjct: 404 ILIKAKLQKANLYRASLEGANLTKAVL 430
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 52/104 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADL +A R A +A+++++ GS A L+KA Y+A+ GA+L+
Sbjct: 368 AYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKANLYRASLEGANLTK 427
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++ L NL+ A L T L ++ GA + A S A ++
Sbjct: 428 AVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANLE 471
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 50/108 (46%), Gaps = 15/108 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A S +LR ANF+ A +RE+ S + GA L+KA AN + A L
Sbjct: 441 SGANLNSTNLRA----------ANFSGAYLREAKLSRANLEGANLKKANLSGANMSHASL 490
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 203
+ R L +ANL LV L +DL GA ++GA+ +A
Sbjct: 491 RGADLRRATLKDANLKRVDLVGANLAGVTFLDADLQGANLKGANLKNA 538
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 48/110 (43%), Gaps = 15/110 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A + A LR A + N R A ++ ++ ++ G+ L KA KAN A L
Sbjct: 361 AALKLEDAYLRNADLFQANLRGVELRGARLQNANLKKAQLQGSILIKAKLQKANLYRASL 420
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 205
ANLT AVLV L R +L GA + A+FS A +
Sbjct: 421 EG----------ANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYL 460
>gi|334117749|ref|ZP_08491840.1| stress protein [Microcoleus vaginatus FGP-2]
gi|333460858|gb|EGK89466.1| stress protein [Microcoleus vaginatus FGP-2]
Length = 578
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 57/105 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+A +A L + + N + AN S +++ +D + +GA L KA+ Y A A+L
Sbjct: 312 SSANLANAKLIQVNLIGSNLQGANLNSTNLQSADLIEANLSGANLTKAILYYARLIHANL 371
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + L++ANLT A L R LT++ LG A + GAD S + +
Sbjct: 372 SQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADLSQSKV 416
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L A + N +AN + A + +++ + + + A L +A AN TGADL
Sbjct: 352 SGANLTKAILYYARLIHANLSQANLSEAKLDKANLTTANLSRANLTQASLGSANLTGADL 411
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S + + ++ L+ ANL+ L LT +L G + G + S
Sbjct: 412 SQSKVTKVNLSGANLSGVNLTGVSLTGVNLQGVNLSGMNLS 452
>gi|157803630|ref|YP_001492179.1| hypothetical protein A1E_02245 [Rickettsia canadensis str. McKiel]
gi|157784893|gb|ABV73394.1| Uncharacterized low-complexity protein [Rickettsia canadensis str.
McKiel]
Length = 956
Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 6/113 (5%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+A++ KA+ K N AN T A + ++ +K + A LEKA A G +++D +
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 218
M EAN NA++ R LT+++L AI+E AD A +D K+A K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666
Score = 40.4 bits (93), Expect = 0.79, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 27/148 (18%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A++ S+ + L++ +AE G ++ A+ N + ANF +
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG------------LNIADAIAKNMNAKEANFKN 624
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A M+ +D + A LEKA+ A+ A+ D + EANL A L L R
Sbjct: 625 AIMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLAR 674
Query: 188 SDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ GADF A +D A K K
Sbjct: 675 INKA-----GADFDQAKVDDATKMHYTK 697
Score = 39.3 bits (90), Expect = 2.1, Method: Composition-based stats.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)
Query: 121 RRANFTSADMRESDFSGSKFNG------AYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
++ N S R + FS ++F A L A+ + N A+++ L+D+ L AN
Sbjct: 517 KQCNMKSITARNAYFSDAEFENILSLEEADLRNAIMERVNLVNANMNKALLDKANLEYAN 576
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNPI 223
LT A+L + L A +E A+ + D K K AN N I
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEGLNIADAIAKNMNAKEANFKNAI 626
>gi|428314577|ref|YP_007151024.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256301|gb|AFZ22256.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 281
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADL +A + N RA N + A + +++ S + + A+L +A AN
Sbjct: 123 SRANLSRADLSEANLSRANLSRADLSDANLSPASLSDANLSRANLSRAFLSRANLSDANL 182
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ A+LSD + R L+ ANL+ A L R L+ ++LGGA + GA+F ++ ID
Sbjct: 183 SRANLSDANLSRADLSRANLSRANLSRADLSGANLGGANLSGANFRNSEID 233
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 52/95 (54%), Gaps = 5/95 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 173
N R A +A++RE + S + + A L +A +AN + ADLSD + L++A
Sbjct: 101 NVRNAPLENANLREINLSEANLSRANLSRADLSEANLSRANLSRADLSDANLSPASLSDA 160
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
NL+ A L R L+R++L A + A+ SDA + A
Sbjct: 161 NLSRANLSRAFLSRANLSDANLSRANLSDANLSRA 195
>gi|411118568|ref|ZP_11390949.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410712292|gb|EKQ69798.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 321
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 56/105 (53%), Gaps = 5/105 (4%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA A+L +A+ N AN T A++ E+ S ++ GA L++A KAN T ADLS
Sbjct: 194 AANLSGANLGRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKANLTEADLS 253
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L+ A L AV+V VL AI+ GADFSDA ID
Sbjct: 254 WASFRGTNLSAATLHKAVMVDVVLD-----AAILRGADFSDATID 293
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NF + D+ + + GA L KAV ++A+ TGA+L D + + L NLT A L
Sbjct: 16 NFDTVDLSGVNLRQADLRGASLRKAVLFEADLTGANLVDVELHGVALRHTNLTAACLAGV 75
Query: 184 VLTRSDLGGAIIEGADFSDA 203
L +DL A + AD S A
Sbjct: 76 KLVGADLSAAQLVRADLSGA 95
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 61/126 (48%), Gaps = 20/126 (15%)
Query: 101 SAAQFGSADL----------RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
SAAQ ADL R A N R N +A++ E+D + ++ + A L +A
Sbjct: 83 SAAQLVRADLSGANLWRSLLRNANLHAANLERTNLHAANLVEADLTTARLSHANLAEANL 142
Query: 151 YKANFTGADL----------SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+ TGA L S + + + L +A+L AVLV L+R++L A + GA+
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202
Query: 201 SDAVID 206
A+++
Sbjct: 203 GRALLE 208
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 5/93 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTG 157
A+ A++R A + +AN T AD+ + F G+ + A L KAV A G
Sbjct: 225 ARLSLAEMRGAKLDQAELTKANLTEADLSWASFRGTNLSAATLHKAVMVDVVLDAAILRG 284
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
AD SD +D LN+++LT +L VL S L
Sbjct: 285 ADFSDATIDPACLNQSSLTWVILPSGVLQISSL 317
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 51/103 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A LR V+ F R+ D+ ++D + L +A AN +GA+L
Sbjct: 143 SDADLTGATLRWVNGVEAMFSRSRLRGVDLEQADLKKAVLVEVDLSRANLEAANLSGANL 202
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L++ + L ANLT A L+ L+ +++ GA ++ A+ + A
Sbjct: 203 GRALLEGVNLIGANLTQANLIEARLSLAEMRGAKLDQAELTKA 245
>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 517
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 68/122 (55%), Gaps = 5/122 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +A+LR+A N A+F+ A++R +D G+ +GA L +A AN +GA+LS
Sbjct: 189 ADFTNAELRQANLTYANLSNADFSGANLRWTDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY----A 217
++ + L A+L+ A L+R + +DL GA + GA + + +L + C++ A
Sbjct: 249 AVLVKASLVHADLSQANLIRANWSGADLSGATLTGAKLYQVSRFNLKADEITCEWVDLSA 308
Query: 218 NG 219
NG
Sbjct: 309 NG 310
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 73/135 (54%), Gaps = 10/135 (7%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKAN 154
G+ +F DLR A +K N +A+FT+A++R+++ S + F+GA L A+
Sbjct: 166 GALTKFTKTDLRGADLLKANLPKADFTNAELRQANLTYANLSNADFSGANLRWTDLQGAD 225
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDAVIDLAQ 209
+GA+L++ + L+ ANL++AVLV+ L +DL A + GAD S A + A+
Sbjct: 226 LSGANLTEANLSGANLSGANLSSAVLVKASLVHADLSQANLIRANWSGADLSGATLTGAK 285
Query: 210 KQALCKYANGTNPIT 224
+ ++ + IT
Sbjct: 286 LYQVSRFNLKADEIT 300
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 128 ADMRESDFSGSKFNGAYL--------------------EKAVAYKANFTGADLSDTLMDR 167
AD+RES + FNGA L A KAN AD ++ + +
Sbjct: 139 ADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPKADFTNAELRQ 198
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+NA L +DL GA + GA+ ++A
Sbjct: 199 ANLTYANLSNADFSGANLRWTDLQGADLSGANLTEA 234
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 22/144 (15%)
Query: 93 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
TR + A+ +A+L KA+ + AN AD+ E+ + A L +A K
Sbjct: 72 TRANLNV---ARLSNANLTKAILNQATINVANLVRADLTEAQLINTLLIRAELVRAKLSK 128
Query: 153 ANFT-----GADLSDTLMDRMVLNEANLTNAVL-----VRTVLTRSDLGGAI-----IEG 197
ANFT GADL ++ + + N ANL+ A L T T++DL GA +
Sbjct: 129 ANFTQANLNGADLRESKLQQTNFNGANLSGANLRGVSGALTKFTKTDLRGADLLKANLPK 188
Query: 198 ADFSDAVIDLAQKQALCKYANGTN 221
ADF++A + +QA YAN +N
Sbjct: 189 ADFTNAEL----RQANLTYANLSN 208
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 50/102 (49%), Gaps = 2/102 (1%)
Query: 104 QFGSADLRKAVHVKENFR--RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
Q +D+ K + + +R +F ++ E + S GA L A AN + +DL
Sbjct: 8 QNSESDVLKVYEIVKKYRDGERDFEDINLNEINLSRINLAGANLSGASLSVANLSASDLR 67
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + R LN A L+NA L + +L ++ + A + AD ++A
Sbjct: 68 EVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEA 109
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ++DLR+ N RAN A + ++ + + N A + A +A+ T A L
Sbjct: 57 SVANLSASDLREV-----NLTRANLNVARLSNANLTKAILNQATINVANLVRADLTEAQL 111
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+TL+ R A LVR L++++ A + GAD ++
Sbjct: 112 INTLLIR----------AELVRAKLSKANFTQANLNGADLRES 144
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R N T A++ + S + A L +A AN ADL+ EA L N
Sbjct: 65 DLREVNLTRANLNVARLSNANLTKAILNQATINVANLVRADLT----------EAQLINT 114
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+L+R L R+ L A A+ + A DL + + NG N ++G + R
Sbjct: 115 LLIRAELVRAKLSKANFTQANLNGA--DLRESKLQQTNFNGAN-LSGANLR 162
>gi|427707611|ref|YP_007049988.1| pentapeptide repeat-containing protein [Nostoc sp. PCC 7107]
gi|427360116|gb|AFY42838.1| pentapeptide repeat protein [Nostoc sp. PCC 7107]
Length = 521
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 54/101 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADLR+A K N RRAN + A ++ S +G+ A L A ++ + +GA+L D
Sbjct: 120 ANLSNADLREATLRKANLRRANLSEASLKGSSLAGTNLEMANLNAADLHRTDLSGANLRD 179
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L ANL+ A L L +DL GA + AD S A
Sbjct: 180 AELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGA 220
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 1/119 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L++ N A+ + A++R +D SG+ + A L A AN GA+L
Sbjct: 173 SGANLRDAELKQTNLTHANLSGADLSGANLRWADLSGANLSWADLSGAKLSGANLMGANL 232
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCKYAN 218
S+ + ANLT A L++ +DL GA + GA A L + +C++ +
Sbjct: 233 SNANLTNTSFVHANLTEATLIKAEWIGADLTGATLTGAKLHSASRFGLKTEGMICEWVD 291
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 47/105 (44%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F A+L N AN + A + + SG F GA + A AN ADL
Sbjct: 33 SGINFSEANLSVVNLSGANLSDANLSHAKLNVARLSGVNFVGAIMNYASLNVANLIRADL 92
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S R L A+L A L+R L+R+DL A + AD +A +
Sbjct: 93 S-----RAQLRGASLVRAELIRAELSRADLFEANLSNADLREATL 132
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A LR A V+ RA + AD+ E++ S + A L KA +AN + A L
Sbjct: 90 ADLSRAQLRGASLVRAELIRAELSRADLFEANLSNADLREATLRKANLRRANLSEASLKG 149
Query: 163 TLMDRMVLNEANLTNAVLVRTVLT----------RSDLGGAIIEGADFSDA 203
+ + L ANL A L RT L+ +++L A + GAD S A
Sbjct: 150 SSLAGTNLEMANLNAADLHRTDLSGANLRDAELKQTNLTHANLSGADLSGA 200
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 41/80 (51%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NF+ ++ E++ SG+K +G +A N +GA+LSD + LN A L+ V
Sbjct: 16 NFSGIELCEANLSGAKLSGINFSEANLSVVNLSGANLSDANLSHAKLNVARLSGVNFVGA 75
Query: 184 VLTRSDLGGAIIEGADFSDA 203
++ + L A + AD S A
Sbjct: 76 IMNYASLNVANLIRADLSRA 95
>gi|379022817|ref|YP_005299478.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
gi|376323755|gb|AFB20996.1| hypothetical protein RCA_02115 [Rickettsia canadensis str. CA410]
Length = 956
Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 6/113 (5%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+A++ KA+ K N AN T A + ++ +K + A LEKA A G +++D +
Sbjct: 559 NANMNKALLDKANLEYANLTGAILTDASAQFAKLSNATLEKAEA-----EGLNIADAIAK 613
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 218
M EAN NA++ R LT+++L AI+E AD A +D K+A K AN
Sbjct: 614 NMNAKEANFKNAIMKRADLTKANLEKAILENADMQAAEALDAIFKEANLKQAN 666
Score = 40.4 bits (93), Expect = 0.91, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 27/148 (18%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A++ S+ + L++ +AE G ++ A+ N + ANF +
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEG------------LNIADAIAKNMNAKEANFKN 624
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A M+ +D + A LEKA+ A+ A+ D + EANL A L L R
Sbjct: 625 AIMKRADLTK-----ANLEKAILENADMQAAEALDA-----IFKEANLKQANLKAANLAR 674
Query: 188 SDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ GADF A +D A K K
Sbjct: 675 INKA-----GADFDQAKVDDATKMHYTK 697
Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)
Query: 121 RRANFTSADMRESDFSGSKFNG------AYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
++ N S R + FS ++F A L A+ + N A+++ L+D+ L AN
Sbjct: 517 KQCNMKSITARNAYFSDAEFENILSLEEADLRNAIMERVNLVNANMNKALLDKANLEYAN 576
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGTNPI 223
LT A+L + L A +E A+ + D K K AN N I
Sbjct: 577 LTGAILTDASAQFAKLSNATLEKAEAEGLNIADAIAKNMNAKEANFKNAI 626
>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 340
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 80/159 (50%), Gaps = 12/159 (7%)
Query: 58 KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 108
NWR VF S L+AA ++S + +++ L +N A ++ S A G A
Sbjct: 18 NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+L +A + N ANF AD+ + S S + A L AVA ANF A+LS T
Sbjct: 75 NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANFIMANLSGTYFSES 134
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
+ ANL++A L +L +++L G+ + A+F+ A + +
Sbjct: 135 DFSRANLSSANLTEAILVKTNLTGSYLSKANFTSANLSM 173
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+LR+A + N + AN + A + ++ +G+ GA L A AN GA
Sbjct: 239 ANFYQANLREANLDRANAQNANLSEAYLSNANLTGTILEGANLSSAYISNANLVGA---- 294
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
VL A+LT A+L+ LT+++ GA ++GADF+ A++
Sbjct: 295 ------VLKGADLTGAILIGANLTKANFSGAKLDGADFTSAIM 331
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A SA+L +A+ VK N +ANFTSA++ +D S + + A + A AN
Sbjct: 137 SRANLSSANLTEAILVKTNLTGSYLSKANFTSANLSMTDLSEADLSSANMHLADLSMANL 196
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ A+L ++ + L +ANLT A L LT +DL + + GA+F A
Sbjct: 197 SSANLIGAILTDVDLRQANLTGAYLNTANLTGADLATSTLVGANFYQA 244
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 10/110 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 155
S+A ADL A N A T D+R+++ +G+ N GA L + ANF
Sbjct: 182 SSANMHLADLSMANLSSANLIGAILTDVDLRQANLTGAYLNTANLTGADLATSTLVGANF 241
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+L + +DR AN NA L L+ ++L G I+EGA+ S A I
Sbjct: 242 YQANLREANLDR-----ANAQNANLSEAYLSNANLTGTILEGANLSSAYI 286
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S + A+L AV + NF AN + ESDFS + + A L +A+ K N TG+ L
Sbjct: 102 SESNLSRANLGNAVAIAANFIMANLSGTYFSESDFSRANLSSANLTEAILVKTNLTGSYL 161
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S +AN T+A L T L+ +DL A + AD S A
Sbjct: 162 S----------KANFTSANLSMTDLSEADLSSANMHLADLSMA 194
>gi|359459933|ref|ZP_09248496.1| hypothetical protein ACCM5_14478 [Acaryochloris sp. CCMEE 5410]
Length = 315
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 124 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
++ AD++E DFSG + A L + A +K N GA+L++ + R L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
L LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293
>gi|220910076|ref|YP_002485387.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866687|gb|ACL47026.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 332
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN D+RE+D SG+ GA L ++AN GADLS++ + + L ANL A
Sbjct: 177 NLEEANLREVDLREADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQA 236
Query: 179 VLVRTVLTRSDLGGAIIEGADF 200
L LT ++L G +++ A
Sbjct: 237 KLTGATLTEANLAGVMMQRAQM 258
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR A+ N +AN AD+ S+ G A L++A A T A+L+
Sbjct: 191 ADLSGANLRGALLTDVNLFQANLAGADLSNSNLKGVDLQRANLQQAKLTGATLTEANLAG 250
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+M R + + L A L R L +DL GA + GA+ +
Sbjct: 251 VMMQRAQMFQVRLNRANLSRANLQGADLRGASLIGANLA 289
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 16/95 (16%)
Query: 124 NFTSADMRESDFSGSKFNGA----------------YLEKAVAYKANFTGADLSDTLMDR 167
N D+RE+D SG+ GA L A+ + GA+LS + R
Sbjct: 111 NLIETDLREADLSGANLTGACLRSANLRTERRGTPVNLRGAILAGVDLRGANLSGASLVR 170
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L ANL A L L +DL GA + GA +D
Sbjct: 171 VNLQGANLEEANLREVDLREADLSGANLRGALLTD 205
>gi|37520785|ref|NP_924162.1| hypothetical protein gll1216 [Gloeobacter violaceus PCC 7421]
gi|35211780|dbj|BAC89157.1| gll1216 [Gloeobacter violaceus PCC 7421]
Length = 287
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 63/130 (48%), Gaps = 8/130 (6%)
Query: 97 FGIGSAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
F + A ADL ++V++K + R A AD+R + G+ +G+ LE A K
Sbjct: 137 FAVLPFADLSGADLSRSVNLKRADLRGARLVGADLRGAFLHGANLSGSRLEAADLMKVAL 196
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI-----IEGADFSDAVIDLAQK 210
GA+LS + R L A+L A L RT L +DL GA +EGAD A ++ A
Sbjct: 197 AGANLSGADLSRANLRAAHLEGADLRRTNLGEADLAGAFLRGARLEGADLRRARLEGADL 256
Query: 211 QALCKYANGT 220
+ C GT
Sbjct: 257 E--CAATEGT 264
>gi|308813604|ref|XP_003084108.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
gi|116055991|emb|CAL58524.1| COG1357: Uncharacterized low-complexity proteins (ISS)
[Ostreococcus tauri]
Length = 177
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 66/139 (47%), Gaps = 27/139 (19%)
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
A K + +RANF A++ F G+ GA F GA+L + + + L++
Sbjct: 48 AFFTKGSLKRANFDGANLEGITFFGADLTGA----------TFRGANLQNANLGQANLSK 97
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------------ALCK 215
A+LT+A+L +++ + IEG+D+S+ ++ + + LCK
Sbjct: 98 ADLTDAILSGAIVSSAQFDDVKIEGSDWSEVIVRKREAKDDTTDDLFCVAYQDILTGLCK 157
Query: 216 YANGTNPITGVSTRKSLGC 234
A G NP+TG+ T +L C
Sbjct: 158 VAKGENPVTGLPTELTLMC 176
>gi|110597243|ref|ZP_01385531.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
gi|110341079|gb|EAT59547.1| Pentapeptide repeat [Chlorobium ferrooxidans DSM 13031]
Length = 447
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S + F SA L +A N + NF ADM+ + G+ GA L++A A+ + +L
Sbjct: 304 SGSSFKSASLDEANLAGANLSKVNFHKADMKGAHLQGANLQGANLDRAFLKDADLSNTNL 363
Query: 161 SD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S+ T++ L ANL NA L L ++LGGA ++GA+ +DA
Sbjct: 364 SNAVLFGTILTGANLQNANLENASLFEADLEEANLGGANLKGANITDA 411
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 17/145 (11%)
Query: 73 VASCSSNI----SALADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF 125
+AS ++NI + L D + EA G + S ++ A+L+ A N A
Sbjct: 46 LASPAANIDLYKAVLEDADLSEANLGGALLVRSDLSGSKLNRANLKGA-----NLMMAFI 100
Query: 126 TSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
ADM+ +D SG + G+++++A+ AN GA+L +++ + +ANL N VL
Sbjct: 101 KKADMKGTDLSGACLIKANMKGSFMKEAIFRGANLQGANLRWVMLEEADMEDANLANTVL 160
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVI 205
L ++L GA ++ A F D +
Sbjct: 161 FEANLENANLKGANLKDAVFLDQAL 185
>gi|418020640|ref|ZP_12659878.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
gi|347604005|gb|EGY28733.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
Length = 148
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 12/112 (10%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRES------------DFSGSKFNGAYLEKAVAYKANF 155
A+L+ A + R + ADMRE+ + SG+ GA L + KA
Sbjct: 4 ANLQNATLNDADMREVDLVGADMREAKLIGKKTNLEGANLSGADLQGAELYHTILIKAVL 63
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
+ ADLS+ ++R+ L EANL +A+L T L + L A +EG + DAV+++
Sbjct: 64 SWADLSNAKLERVNLREANLYHAILEETSLYITKLENANLEGVNLKDAVLEV 115
>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
Length = 309
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA A+L +A+ + N RRA A++RE F + A L+KA N GADL
Sbjct: 183 SATNLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKA-----NLVGADL 237
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + +L ANL++A+L+ L ++L GA + GA+ +A++
Sbjct: 238 RGVSLAQALLRGANLSSAILIGANLMGANLSGADLRGANLIEAIL 282
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 70/144 (48%), Gaps = 10/144 (6%)
Query: 81 SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 137
+AL N A+ RG G S A ADLR + V + R + +R+++ +G
Sbjct: 45 AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVS-----LRKANLTG 99
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
+ A L A +AN TGA LS+ ++ L +LT A L R LTR++L A + G
Sbjct: 100 ADLTRANLANADLSEANLTGAQLSEAIVRDANLTLTDLTLAELERANLTRANLTEAYLRG 159
Query: 198 ADFSDAVIDLAQKQALCKYANGTN 221
AD +DAV L + Q L G N
Sbjct: 160 ADLTDAV--LRESQLLQANLRGAN 181
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 15/108 (13%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A+ A+LR+ + N R +AN AD+R + + GA L A+ AN G
Sbjct: 205 ARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQALLRGANLSSAILIGANLMG 264
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+LS A+L A L+ +LT + L G + D S+A++
Sbjct: 265 ANLSG----------ADLRGANLIEAILTGASLNGVDLSAVDMSEAIL 302
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 17/132 (12%)
Query: 83 LADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 138
L DL E E TR + A ADL AV R + A++R ++ S +
Sbjct: 134 LTDLTLAELERANLTRANL---TEAYLRGADLTDAV-----LRESQLLQANLRGANLSAT 185
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----A 193
A LE+A+ AN A L + + + EANL +A L + L +DL G A
Sbjct: 186 NLQQANLERAILIGANLRRARLEEANLREVAFKEANLRHACLDKANLVGADLRGVSLAQA 245
Query: 194 IIEGADFSDAVI 205
++ GA+ S A++
Sbjct: 246 LLRGANLSSAIL 257
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 9/125 (7%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
L +Y R GI A L + + + RA+ T A ++ ++ + GA L
Sbjct: 7 LKRYSVGDRDFAGI----HLRRAHLSRCILTGIDLSRADLTDAALQSTNLQRADLRGAIL 62
Query: 146 EKAVAYKANFTGADLSDTLMD----RMV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A +A+ GADL ++ R V L +ANLT A L R L +DL A + GA
Sbjct: 63 TGANLSQADLRGADLRGVILVSADLRWVSLRKANLTGADLTRANLANADLSEANLTGAQL 122
Query: 201 SDAVI 205
S+A++
Sbjct: 123 SEAIV 127
>gi|428310629|ref|YP_007121606.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
gi|428252241|gb|AFZ18200.1| serine/threonine protein kinase [Microcoleus sp. PCC 7113]
Length = 542
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 46/87 (52%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
R +F S D+ D +G +A K NF GADLS+ R LN +NL +A L
Sbjct: 415 RRDFASQDLSGLDLHKVDLSGGIFHQAKLAKTNFQGADLSNADFGRASLNRSNLRDANLG 474
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLA 208
R L+ +DL GA + GAD S A ++ A
Sbjct: 475 RAYLSYADLEGADLRGADLSYAYLNHA 501
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A FG A L ++ N RA + AD+ +D G+ + AYL A AN GA+L
Sbjct: 454 SNADFGRASLNRSNLRDANLGRAYLSYADLEGADLRGADLSYAYLNHANLKGANLCGANL 513
Query: 161 SDTLMDRMVLNEANLTNAVLV 181
S+ + L +A A ++
Sbjct: 514 SNAKISEEQLTQAKTNWATVL 534
>gi|347755497|ref|YP_004863061.1| putative low-complexity protein [Candidatus Chloracidobacterium
thermophilum B]
gi|347588015|gb|AEP12545.1| putative low-complexity protein [Candidatus Chloracidobacterium
thermophilum B]
Length = 419
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 60/123 (48%), Gaps = 8/123 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 160
A SA LR A V+ N AN AD+ ++ G+ GA L +A AN GADL
Sbjct: 57 ANLASASLRDAFLVRANLEGANLRGADLESANLEGANLRGADLSRANLEGANLEGADLTG 116
Query: 161 ----SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCK 215
S L+D L A L NAV L + LGGA + DF +A+++ A ++AL
Sbjct: 117 ARLPSAQLID-AKLGVATLENAVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLT 175
Query: 216 YAN 218
AN
Sbjct: 176 GAN 178
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 50/99 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +ADLR A N +F +A + ++F + GA L AV +A GADLS
Sbjct: 137 AVFANADLRNAYLGGANLTAVDFQNAILEAANFEEALLTGANLRDAVLRRAVLPGADLSG 196
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
++R VL A+L+ L+ + GA ++GA FS
Sbjct: 197 AKLERAVLEGADLSQVSLLEADCRHATFQGARLKGAKFS 235
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTG 157
A ADL A + + RRAN SA +R+ ++ G+ GA LE A AN G
Sbjct: 37 ANLRRADLEGANLEEASLRRANLASASLRDAFLVRANLEGANLRGADLESANLEGANLRG 96
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADLS ++ L A+LT A L L + LG A +E A F++A
Sbjct: 97 ADLSRANLEGANLEGADLTGARLPSAQLIDAKLGVATLENAVFANA 142
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 5/88 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ G A L AV + R A A++ DF + A E+A+ TGA+L D
Sbjct: 127 AKLGVATLENAVFANADLRNAYLGGANLTAVDFQNAILEAANFEEAL-----LTGANLRD 181
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
++ R VL A+L+ A L R VL +DL
Sbjct: 182 AVLRRAVLPGADLSGAKLERAVLEGADL 209
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL KA N RRA+ A++ E+ + A L A +AN GA+L ++
Sbjct: 28 DLAKANLDNANLRRADLEGANLEEASLRRANLASASLRDAFLVRANLEGANLRGADLESA 87
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L ANL A L+R++L GA +EGAD + A + AQ
Sbjct: 88 NLEGANLRGA-----DLSRANLEGANLEGADLTGARLPSAQ 123
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 63/142 (44%), Gaps = 29/142 (20%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAV------ 149
+A A+LR A + N AN AD+ + +K A LE AV
Sbjct: 85 ESANLEGANLRGADLSRANLEGANLEGADLTGARLPSAQLIDAKLGVATLENAVFANADL 144
Query: 150 --AY--KANFTGADLSDTLM-----DRMVLNEANLTNAVLVRTVLTRSDLGG-----AII 195
AY AN T D + ++ + +L ANL +AVL R VL +DL G A++
Sbjct: 145 RNAYLGGANLTAVDFQNAILEAANFEEALLTGANLRDAVLRRAVLPGADLSGAKLERAVL 204
Query: 196 EGADFSDAVIDLAQKQALCKYA 217
EGAD S + +A C++A
Sbjct: 205 EGADLSQVSL----LEADCRHA 222
>gi|145355959|ref|XP_001422212.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582452|gb|ABP00529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 125
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 61/135 (45%), Gaps = 20/135 (14%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+ F L++A NF AN T + +D S + F A L A +AN TGAD
Sbjct: 11 GTGEYFTKGSLKRA-----NFNDANLTGITLFGADLSNATFVNANLSNANLGQANLTGAD 65
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
TNA+L +++ + L + +D+SD ++ LCK A+G
Sbjct: 66 F---------------TNAILSGAIVSSAQLDEVKLTNSDWSDVIVRKDVLTGLCKVADG 110
Query: 220 TNPITGVSTRKSLGC 234
NP+TG T SL C
Sbjct: 111 ENPVTGNITALSLMC 125
>gi|440793397|gb|ELR14582.1| K+ channel tetramerisation subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 381
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+F DLR + RRANF D+ D +K NGA L + AN +GA
Sbjct: 229 KFNGCDLRGFDFHAMHLRRANFHRCDLTGVDLRHAKLNGACLVECCLRDANLSGA----- 283
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
VL+ +LT+A R LT +DL GA++ GAD S+A +D A
Sbjct: 284 -----VLSGVDLTDADCRRADLTNADLRGAVLSGADLSEAKLDRA 323
>gi|427417538|ref|ZP_18907721.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425760251|gb|EKV01104.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 397
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 25/103 (24%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+F A+++E DFSG + K+N GADLSDT + ++ LN+ANL A L R
Sbjct: 283 ADFKGANLKEKDFSGRNLS----------KSNLEGADLSDTFLHKVNLNQANLHKAKLFR 332
Query: 183 TVLTR---------------SDLGGAIIEGADFSDAVIDLAQK 210
L + +DL GA + GAD S A+I K
Sbjct: 333 ANLLQANLSHANLREANLIGADLSGADLSGADLSGAIIGYGDK 375
>gi|158336687|ref|YP_001517861.1| hypothetical protein AM1_3555 [Acaryochloris marina MBIC11017]
gi|158306928|gb|ABW28545.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 315
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 124 NFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
++ AD++E DFSG + A L + A +K N GA+L++ + R L +ANLT A
Sbjct: 202 DWHGADLQERDFSGRNLSQANLANVNLKDAFMHKVNLAGANLTNANLTRANLLQANLTQA 261
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
L LT +DL GA + GADF+ A + + +K
Sbjct: 262 NLQGANLTAADLSGADLRGADFTGANMGIGKK 293
>gi|357014784|ref|ZP_09079783.1| hypothetical protein PelgB_35370 [Paenibacillus elgii B69]
Length = 843
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL--EKAVAYKANFTGADLSDTLMD 166
DL A + + +F AD+ +D SG GA + F A LS M+
Sbjct: 154 DLTWAYMASADLKSVSFEDADLSHADLSGCNLYGALFTGDDLKLSHTVFASATLSYARMN 213
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
+V++ A+ TNAV+ LT S+L G + GAD +DA+I+ AQ Q
Sbjct: 214 EIVIDSADFTNAVMTNVYLTNSNLQGNSLTGADMTDALINGAQFQ 258
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 72/172 (41%), Gaps = 12/172 (6%)
Query: 51 PGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 110
PG A ++ + V T L A +AS + D + A+ G G A F DL
Sbjct: 139 PG-LADIQATKAVVQTDLTWAYMASADLKSVSFEDADLSHADLSGCNLYG--ALFTGDDL 195
Query: 111 RKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 165
+ + V F A + A M E +DF+ + YL + + TGAD++D L+
Sbjct: 196 KLSHTV---FASATLSYARMNEIVIDSADFTNAVMTNVYLTNSNLQGNSLTGADMTDALI 252
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKY 216
+ A+LT A L T + A + AD + A+I D A+ Y
Sbjct: 253 NGAQFQNADLTGAKLYGATATETRFDKANLTKADLTRAMITDFHIPGAMLAY 304
Score = 37.4 bits (85), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 103 AQFGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
AQF +ADL A + F +AN T AD+ + + GA L T
Sbjct: 255 AQFQNADLTGAKLYGATATETRFDKANLTKADLTRAMITDFHIPGAMLAYTKLDNQTLTT 314
Query: 158 ADL-SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A++ +DT + L+N +L +DL GA+++G D +
Sbjct: 315 AEIDADTDFTGASMQNVFLSNCMLQGVTFAHADLTGAVLDGTDLT 359
>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 331
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 9/138 (6%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 121
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 122 -RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
R + S ++R +D G+ G L A +AN GA+L++ ++ +LN+ NL+ L
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLTECVLRGAILNQTNLSETNL 199
Query: 181 VRTVLTRSDLGGAIIEGA 198
+LT +L GA + G+
Sbjct: 200 QGAILTEVNLSGANLIGS 217
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 6/126 (4%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSK 139
LNKY + + G+ A+ +ADL A +F+ ANF A ++ ++ +K
Sbjct: 7 LNKYRSGEKLFRGVNLRNAELSNADLIGANLSGGDFQGANFVLAYLNGVNLTRANLEKAK 66
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
GA L +A A T AD T++ L +ANLT A LV L ++DL GA ++GAD
Sbjct: 67 LGGANLSRANLSGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGAD 126
Query: 200 FSDAVI 205
A +
Sbjct: 127 LRGACL 132
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYL----- 145
S AQ AD + + R+AN T AD+R ++ G+ GA L
Sbjct: 78 SGAQLTDADFHGTILQAADLRKANLTLATLVDANLIQADLRGANLQGADLRGACLRGANM 137
Query: 146 --EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
E+ + N GADL T + + L A+LT A L+ LT L GAI+ + S+
Sbjct: 138 RYERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLTECVLRGAILNQTNLSE 196
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 48/104 (46%), Gaps = 10/104 (9%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR A+ + N N A + E + SG+ G+ + K +A T A + + +
Sbjct: 184 LRGAILNQTNLSETNLQGAILTEVNLSGANLIGSRMVKVKLERAILTNAQMPRVELCDSI 243
Query: 170 LNEANLTN----------AVLVRTVLTRSDLGGAIIEGADFSDA 203
L +ANL+N A LVR L R++L A + AD +DA
Sbjct: 244 LPDANLSNANLSHANLSRANLVRAELNRTNLSSANLTQADLTDA 287
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 40/76 (52%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN ++A++ ++ S + A L + AN T ADL+D + R L ANL+ A L R
Sbjct: 247 ANLSNANLSHANLSRANLVRAELNRTNLSSANLTQADLTDASLGRTNLRNANLSYAYLTR 306
Query: 183 TVLTRSDLGGAIIEGA 198
T + ++ G + GA
Sbjct: 307 TEFSSANTIGVNLHGA 322
>gi|116754331|ref|YP_843449.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
PT]
gi|116665782|gb|ABK14809.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
Length = 389
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F A L A + FR + F+ A++ ++ +G+ +G+ ++ +A TGADL
Sbjct: 177 SHANFVGAHLSWADMSRSRFRESQFSRAELYGANLTGTDLSGSDFTRSYMMRARMTGADL 236
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
SD +D L EA L + L + +DL GA + GAD S+ V+D
Sbjct: 237 SDASLDYADLTEAELRDTDLSGCKMRYADLSGANLAGADISEVVLD 282
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 72/156 (46%), Gaps = 30/156 (19%)
Query: 55 AKLKNWRVFVSTALAAAV-VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
AKL+N R+ ++ + A + +A C+ + + D++ +AE G +F DL A
Sbjct: 119 AKLRNARLSGASLVNANLTMADCTEAL--MDDVSLEDAEMTG-------TRFFRTDLTGA 169
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
V + ANF A + +D S S+F + +A Y AN TG DLS +
Sbjct: 170 VFSGASLSHANFVGAHLSWADMSRSRFRESQFSRAELYGANLTGTDLSGS---------- 219
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ T + ++R +T GAD SDA +D A
Sbjct: 220 DFTRSYMMRARMT----------GADLSDASLDYAD 245
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 70/176 (39%), Gaps = 32/176 (18%)
Query: 48 GQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISAL-------ADLNKYE---AETRGEF 97
G Y + + +STA A + S L A LN+ + A+ RG
Sbjct: 16 GIMSSGYKMIIILTILMSTAYAVDICDRSDLRFSDLRGRDLSGASLNQSDLTGADLRG-- 73
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG----------AYLEK 147
A A LR A V N A+ AD+ +D SG+ +G A L
Sbjct: 74 -----ANLNGAYLRSAWLVNANLEGASLAGADLSMADLSGANLSGTDLSRAKLRNARLSG 128
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A AN T AD ++ LMD + L +A +T RT DL GA+ GA S A
Sbjct: 129 ASLVNANLTMADCTEALMDDVSLEDAEMTGTRFFRT-----DLTGAVFSGASLSHA 179
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ T A++R++D SG K A L A N GAD+S+ ++D + NL+ A+L +
Sbjct: 244 ADLTEAELRDTDLSGCKMRYADLSGA-----NLAGADISEVVLDSVKTTGVNLSGAILYK 298
Query: 183 TVLTRSDLGGAIIEGADFSDAVID 206
T L DL + G A +D
Sbjct: 299 TSLFNLDLRDIDMHGVQIKKAKMD 322
>gi|254409695|ref|ZP_05023476.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183692|gb|EDX78675.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 350
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 35/136 (25%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL----------------- 145
A ADL+ A+ ++ +A+ T+A +RE+D SG+ GA L
Sbjct: 66 ADLSKADLKNALLIEATLSQADLTAAILREADLSGAILTGATLLDADLRHATLIGTSLID 125
Query: 146 ---EKAVAYKANFTG----------ADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTR 187
++A KAN TG ADL +++R +L++ ANL A +R L R
Sbjct: 126 AKMKRAKLAKANCTGASFSRANLKAADLQGVILNRAILSQADLRGANLRGACFIRAYLHR 185
Query: 188 SDLGGAIIEGADFSDA 203
+DL A + GAD SDA
Sbjct: 186 ADLRDANLTGADLSDA 201
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 6/119 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +ADL+ + RA + AD+R ++ G+ F AYL +A AN TGADL
Sbjct: 144 SRANLKAADLQGVI-----LNRAILSQADLRGANLRGACFIRAYLHRADLRDANLTGADL 198
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA-LCKYAN 218
SD + L+ ANL+ A L L+ ++L GA + GA +A + LA L K AN
Sbjct: 199 SDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSGLLLKKAN 257
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 47/92 (51%), Gaps = 5/92 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSDTLMDRMVLN 171
+ NF N AD+ E++ S + A+L++A KA GA DLS + +L
Sbjct: 20 ERNFPGVNLIRADLTEANLSRINLSAAHLQRANLAKAKLIGAQLKDADLSKADLKNALLI 79
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
EA L+ A L +L +DL GAI+ GA DA
Sbjct: 80 EATLSQADLTAAILREADLSGAILTGATLLDA 111
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL+ A N RAN + A++ ++ +G+ GA+L+ A AN +G
Sbjct: 194 TGADLSDADLKGADLSHANLSRANLSCANLSHANLTGANLTGAHLQNANLSLANLSG--- 250
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++L +ANL +A L + L R++L A + GA+ +A ++
Sbjct: 251 -------LLLKKANLQSAQLSKANLNRANLYKANLSGANLLEANLE 289
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR A F RA AD+R+++ +G+ + A L+ A AN + A+LS
Sbjct: 161 AILSQADLRGANLRGACFIRAYLHRADLRDANLTGADLSDADLKGADLSHANLSRANLSC 220
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L ANLT A L L+ ++L G +++ A+ A +
Sbjct: 221 ANLSHANLTGANLTGAHLQNANLSLANLSGLLLKKANLQSAQL 263
>gi|33861206|ref|NP_892767.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
gi|33639938|emb|CAE19108.1| Pentapeptide repeats [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
Length = 157
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 10/130 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +DL+ A + + AN + D++ + G+K N + ++L +
Sbjct: 35 FSGSDLQGATFYLTDLQDANLSDCDLQNASLYGAKLK----------DTNLSNSNLREVT 84
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPIT 224
+D VL+ +LTN L + + I+GADF++ + + CK A+GTNP T
Sbjct: 85 LDSAVLDGTDLTNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDVLREFCKDASGTNPFT 144
Query: 225 GVSTRKSLGC 234
TR++L C
Sbjct: 145 NRETRETLEC 154
>gi|300863681|ref|ZP_07108615.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338313|emb|CBN53761.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 238
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 8/109 (7%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+F ADLR++ K NF +A+F D+ ES G+ A L +AV +A+ +GA L+D
Sbjct: 36 EFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLTEANLYRAVLREADLSGAKLTDA 95
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF---SDAVIDLAQ 209
L EANL A L L R+ L AI+ AD SD + DL Q
Sbjct: 96 -----NLEEANLMKACLSGANLVRAKLLRAILFEADLRSTSDQITDLGQ 139
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F DL +++ + AN A +RE+D SG+K A LE+A KA +GA+L
Sbjct: 53 TQASFQETDLSESILWGTDLTEANLYRAVLREADLSGAKLTDANLEEANLMKACLSGANL 112
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ R +L EA+L + T +DLG AI+ AD S
Sbjct: 113 VRAKLLRAILFEADLRS-----TSDQITDLGQAILTNADLS 148
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 8/95 (8%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK--------ANFTGADL 160
DL +A+ + +N + A + +++ G+K A+L + + + A+ GADL
Sbjct: 136 DLGQAILTNADLSYSNLSGALLYQANLDGAKLCRAHLNETIQQRFLATNLSEASLQGADL 195
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S + +L +ANL A + RT+LT +DL GAI+
Sbjct: 196 SYADLSGAILRKANLRGADMTRTILTNTDLEGAIM 230
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
K +F + D+ ++ G +F+ A L ++ K NFT A +T + +L +LT
Sbjct: 14 KRSFHQVKLQEIDLLNAELQGIEFDRADLRQSRLGKTNFTQASFQETDLSESILWGTDLT 73
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDA 203
A L R VL +DL GA + A+ +A
Sbjct: 74 EANLYRAVLREADLSGAKLTDANLEEA 100
>gi|75910595|ref|YP_324891.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704320|gb|ABA23996.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 521
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 6/116 (5%)
Query: 107 SADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
SA+LR A + NFR AN + AD+ R +D SG + A L A AN GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 216
+ + L ANL A L+R +DL AI+ GA +S + L + +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SADLR+A N R AN A ++ + G+ A L + + + T A+L
Sbjct: 118 SEANLNSADLREATLRHANLRHANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANL 177
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
D + ++ ANL+ A L L +DL G + AD S+A
Sbjct: 178 RDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNA 220
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 9/107 (8%)
Query: 117 KENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
+ N AN + +++ E+DFS +K N GA L A+ ++ A+L + + R L
Sbjct: 39 QANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRSDLSRAQLR 98
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
A+L A L+R L+R DL A + AD +A + + A ++AN
Sbjct: 99 GASLVRAELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +DL + N R A + R ++ SG+ +GA L A N + ADLS+
Sbjct: 160 ANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSN 219
Query: 163 TLMD--RMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ +V L+ ANLTNA LV L ++ L A GAD + A++
Sbjct: 220 AKLSGANLVGADLSNANLTNASLVHANLIQAKLIRAEWVGADLTSAIL 267
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 62/136 (45%), Gaps = 8/136 (5%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRAN 124
L A+ S N++ L + A+ RG + + A+ DL +A + R A
Sbjct: 72 LTNAIFNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREAT 131
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
A++R ++ +G+ GA L A AN G+DLS R L ANL +A L +
Sbjct: 132 LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS-----RCDLTSANLRDAELKQVN 186
Query: 185 LTRSDLGGAIIEGADF 200
++L GA + GA+
Sbjct: 187 FRHANLSGADLSGANL 202
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 59/121 (48%), Gaps = 26/121 (21%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 178
NF+ D+ E++ SG K G +A AN +G++LS+ LN A NLTNA
Sbjct: 16 NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75
Query: 179 V----------LVRTVLTRSDLGGAIIEGADFSDAV---IDLAQ--------KQALCKYA 217
+ L+R+ L+R+ L GA + A+ A +DL++ ++A ++A
Sbjct: 76 IFNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135
Query: 218 N 218
N
Sbjct: 136 N 136
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A+ A+L A+ F ++ A++ SD S ++ GA L +A +A + DL
Sbjct: 63 NVARLSGANLTNAI-----FNHSSLNVANLIRSDLSRAQLRGASLVRAELIRAELSRVDL 117
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S+ LN A+L A L L ++L GA ++GA A +++A
Sbjct: 118 SEA-----NLNSADLREATLRHANLRHANLNGASLKGASLVGANLEMA 160
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L+ A V N AN +D+ D + + A L++ AN +GADLS
Sbjct: 140 ANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L A+L+ L L+ + L GA + GAD S+A
Sbjct: 200 A-----NLRWADLSGVNLSWADLSNAKLSGANLVGADLSNA 235
>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 418
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 155
S A F +DL A+ ++ + RRAN + A++ E +D SG F+G+ L +A +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
G + S R L EAN +N L+ SDL GA + A+F++A
Sbjct: 203 LGTNFS-----RTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEA 245
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A + + RRA A + E+D S + G + A ANFTG+DL
Sbjct: 93 SGADLRGANLSTADLIGADLRRATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDL 152
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S +M R L AN++ A L ++R+DL G G++ S A
Sbjct: 153 SGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQA 195
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 10/108 (9%)
Query: 103 AQFGSAD-----LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A F AD LR+A + NF N + AD+R + SG+ GA L A+ G
Sbjct: 55 ANFSGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLS-----TADLIG 109
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ADL ++ +L EA+L+ LV T +T ++L A G+D S A++
Sbjct: 110 ADLRRATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIM 157
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 70/140 (50%), Gaps = 20/140 (14%)
Query: 87 NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N E + G IG S A F ADLR+A N ANF +A+++E+D SG+ GA
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRA-----NLVGANFNNANLKEADLSGAYLIGA 275
Query: 144 YLEKAVAYKANF----------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
L A +A+F TGADL+ + L+ ANL++ L LT +DL A
Sbjct: 276 TLVNANIVRADFRRANLIGADLTGADLTGADLVGANLSGANLSDCNLTSVSLTSADLSMA 335
Query: 194 IIEGADFSDAVIDLAQKQAL 213
D ++A +L++ QAL
Sbjct: 336 NFANCDLTNA--NLSRVQAL 353
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 50/102 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F DL + + + ANF AD+R ++ G+ FN A L++A A GA L
Sbjct: 218 SNTNFREVDLSGSDLIGADLSNANFAEADLRRANLVGANFNNANLKEADLSGAYLIGATL 277
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ + R ANL A L LT +DL GA + GA+ SD
Sbjct: 278 VNANIVRADFRRANLIGADLTGADLTGADLVGANLSGANLSD 319
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 9/132 (6%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N EA+ G + IG A L A V+ +FRRAN AD+ +D +G+ GA L
Sbjct: 261 NLKEADLSGAYLIG-------ATLVNANIVRADFRRANLIGADLTGADLTGADLVGANLS 313
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A N T L+ + +LTNA L R ++ GA++ GA+ D ++
Sbjct: 314 GANLSDCNLTSVSLTSADLSMANFANCDLTNANLSRVQALSTNFSGAMLTGANLEDWSVN 373
Query: 207 LAQK--QALCKY 216
K C Y
Sbjct: 374 SKTKLDDVECDY 385
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 50/100 (50%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
F +L +A NFR + + +D+ +D S + F A L +A ANF A+L +
Sbjct: 206 NFSRTNLIEANFSNTNFREVDLSGSDLIGADLSNANFAEADLRRANLVGANFNNANLKEA 265
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L A L NA +VR R++L GA + GAD + A
Sbjct: 266 DLSGAYLIGATLVNANIVRADFRRANLIGADLTGADLTGA 305
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 11/90 (12%)
Query: 115 HVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
H+++ + +FT A++ E DF+G+ KANF+GADLS + R E
Sbjct: 26 HIQDLDLSDCDFTGANLSEVDFAGTDL----------QKANFSGADLSRAKLRRATFGET 75
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
N +N L L R +L GA + GA+ S A
Sbjct: 76 NFSNTNLSEADLRRVNLSGADLRGANLSTA 105
Score = 43.9 bits (102), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA---------------YL 145
S A A LR+A + NF N + AD+R + SG+ GA L
Sbjct: 58 SGADLSRAKLRRATFGETNFSNTNLSEADLRRVNLSGADLRGANLSTADLIGADLRRATL 117
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
E A+ +A+ + +L T M L+ AN T + L ++ R+DL A I A+ ++A I
Sbjct: 118 EGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIMIRADLRRANISRANLNEADI 177
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 5/98 (5%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL N +F D+++++FSG+ + A L +A + NF+ +LS+ + R+
Sbjct: 31 DLSDCDFTGANLSEVDFAGTDLQKANFSGADLSRAKLRRATFGETNFSNTNLSEADLRRV 90
Query: 169 VLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
L+ ANL+ A L+ L R+ L GAI+ AD S
Sbjct: 91 NLSGADLRGANLSTADLIGADLRRATLEGAILAEADLS 128
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADLR+A + + + R N +M +++ S + F G+ L A+ +A+
Sbjct: 103 STADLIGADLRRATLEGAILAEADLSRTNLVGTNMTDANLSMANFTGSDLSGAIMIRADL 162
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A++S R LNEA+++ A L + S+L A E A+F
Sbjct: 163 RRANIS-----RANLNEADISRADLSGVDFSGSNLSQANFEEANF 202
>gi|254526129|ref|ZP_05138181.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9202]
gi|221537553|gb|EEE40006.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9202]
Length = 148
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A +G L A + + A F D+++++ S + A L A N + ++L
Sbjct: 12 AALDYGKQSLIGADFSGSDLKGATFYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNL 71
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ +D +L+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 72 REVTLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGT 131
Query: 221 NPITGVSTRKSLGC 234
NPIT TR++L C
Sbjct: 132 NPITNRDTRETLEC 145
>gi|436841883|ref|YP_007326261.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
14728]
gi|432170789|emb|CCO24160.1| Pentapeptide repeat protein [Desulfovibrio hydrothermalis AM13 = DSM
14728]
Length = 1278
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 6/100 (6%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
AD R A K F+ + AD R++D + FNGA K V K NF GA+L R
Sbjct: 1094 ADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGA---KGV--KVNFAGANLDKLRTGR 1148
Query: 168 MV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
EA+ T A L + +DL GA+ GAD +A++D
Sbjct: 1149 NAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVD 1188
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 23/135 (17%)
Query: 78 SNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKA-VH---------VKENFRRAN 124
S +S AD EA+ R F I + AD RKA VH VK NF AN
Sbjct: 1085 SMVSGKAD----EADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGAKGVKVNFAGAN 1140
Query: 125 FT------SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+A+ E+DF+G+ + + A F GADL + L+D +L +ANL A
Sbjct: 1141 LDKLRTGRNAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVDNCMLVDANLNGA 1200
Query: 179 VLVRTVLTRSDLGGA 193
T+S+L GA
Sbjct: 1201 SAKGARFTKSNLEGA 1215
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 40/138 (28%), Positives = 69/138 (50%), Gaps = 16/138 (11%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEK----- 147
G A F +A ++K++ A+F AD+ E+ F+G+K F GA L+K
Sbjct: 1089 GKADEADFRNAFIKKSIFKGSTLDGADFRKADVHETLFNGAKGVKVNFAGANLDKLRTGR 1148
Query: 148 -AVAYKANFTGADLS-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A +A+FTGA L +T + + A+L NA++ +L ++L GA +GA F+
Sbjct: 1149 NAEFPEADFTGATLRSSAFRETDLTGALFRGADLENALVDNCMLVDANLNGASAKGARFT 1208
Query: 202 DAVIDLAQKQALCKYANG 219
+ ++ A +A + G
Sbjct: 1209 KSNLEGASMRAFNLFMGG 1226
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 34/125 (27%), Positives = 62/125 (49%), Gaps = 14/125 (11%)
Query: 99 IGSAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKA 148
+G +A F A L+ +A+ F ++ T A R++ F GS F GA L + A
Sbjct: 1006 MGRSADFTKASLKGVNFERAMLGNAIFEESDLTGAQARQASFKGSSFKGATLADAVFDMA 1065
Query: 149 VAYKANFTGADLSDTLMDRMVL----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ K +F+ A+LS ++ ++ +EA+ NA + +++ S L GA AD + +
Sbjct: 1066 ILEKTDFSKANLSGARINMSMVSGKADEADFRNAFIKKSIFKGSTLDGADFRKADVHETL 1125
Query: 205 IDLAQ 209
+ A+
Sbjct: 1126 FNGAK 1130
Score = 42.7 bits (99), Expect = 0.16, Method: Composition-based stats.
Identities = 24/86 (27%), Positives = 41/86 (47%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN T ++ +DF + + A L + + A+FT A L +R +L A + L
Sbjct: 980 ANLTGCQLKNTDFKETCLDNAKLIQTMGRSADFTKASLKGVNFERAMLGNAIFEESDLTG 1039
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLA 208
++ G+ +GA +DAV D+A
Sbjct: 1040 AQARQASFKGSSFKGATLADAVFDMA 1065
>gi|17228637|ref|NP_485185.1| hypothetical protein alr1142 [Nostoc sp. PCC 7120]
gi|17130488|dbj|BAB73099.1| alr1142 [Nostoc sp. PCC 7120]
Length = 521
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 6/116 (5%)
Query: 107 SADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
SA+LR A + NFR AN + AD+ R +D SG + A L A AN GADLS
Sbjct: 174 SANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNAKLSGANLVGADLS 233
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 216
+ + L ANL A L+R +DL AI+ GA +S + L + +C++
Sbjct: 234 NANLTNASLVHANLIQAKLIRAEWVGADLTSAILTGAKLYSTSRFGLKTEGLICQW 289
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SADLR+A N R AN A ++ + G+ A L + + + T A+L
Sbjct: 118 SEANLNSADLREATLRHANLRHANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANL 177
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
D + ++ ANL+ A L L +DL G + AD S+A
Sbjct: 178 RDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSNA 220
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 9/107 (8%)
Query: 117 KENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
+ N AN + +++ E+DFS +K N GA L A+ ++ A+L + R L
Sbjct: 39 QANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNAIFNHSSLNVANLIRADLSRAQLR 98
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
A+L A L+R L+R DL A + AD +A + + A ++AN
Sbjct: 99 GASLVRAELIRAELSRVDLSEANLNSADLREATL----RHANLRHAN 141
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +DL + N R A + R ++ SG+ +GA L A N + ADLS+
Sbjct: 160 ANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSGANLRWADLSGVNLSWADLSN 219
Query: 163 TLMD--RMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ +V L+ ANLTNA LV L ++ L A GAD + A++
Sbjct: 220 AKLSGANLVGADLSNANLTNASLVHANLIQAKLIRAEWVGADLTSAIL 267
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 62/136 (45%), Gaps = 8/136 (5%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRAN 124
L A+ S N++ L + A+ RG + + A+ DL +A + R A
Sbjct: 72 LTNAIFNHSSLNVANLIRADLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREAT 131
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
A++R ++ +G+ GA L A AN G+DLS R L ANL +A L +
Sbjct: 132 LRHANLRHANLNGASLKGASLVGANLEMANLNGSDLS-----RCDLTSANLRDAELKQVN 186
Query: 185 LTRSDLGGAIIEGADF 200
++L GA + GA+
Sbjct: 187 FRHANLSGADLSGANL 202
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 26/121 (21%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 178
NF+ D+ E++ SG K G +A AN +G++LS+ LN A NLTNA
Sbjct: 16 NFSGVDLSEANLSGVKLCGVNFSQANLSIANLSGSNLSEADFSHAKLNVARLSGANLTNA 75
Query: 179 V----------LVRTVLTRSDLGGAIIEGADFSDAV---IDLAQ--------KQALCKYA 217
+ L+R L+R+ L GA + A+ A +DL++ ++A ++A
Sbjct: 76 IFNHSSLNVANLIRADLSRAQLRGASLVRAELIRAELSRVDLSEANLNSADLREATLRHA 135
Query: 218 N 218
N
Sbjct: 136 N 136
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L+ A V N AN +D+ D + + A L++ AN +GADLS
Sbjct: 140 ANLNGASLKGASLVGANLEMANLNGSDLSRCDLTSANLRDAELKQVNFRHANLSGADLSG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L A+L+ L L+ + L GA + GAD S+A
Sbjct: 200 A-----NLRWADLSGVNLSWADLSNAKLSGANLVGADLSNA 235
>gi|434400818|ref|YP_007134822.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428271915|gb|AFZ37856.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 209
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 67/137 (48%), Gaps = 30/137 (21%)
Query: 119 NFRRANFTSADMRESDFSGS---------------KFNGAYLEKAVAYKANFTGADLSDT 163
NF +AN T AD RE D + + A LE+AV Y+A+ +LS +
Sbjct: 36 NFSQANLTGADFREIDLTQAILCEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQS 95
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAI----------IEGADFSDAVIDLAQKQAL 213
++ L EANLT A+L +T L ++ L GA+ + GA+ S A++ QA
Sbjct: 96 ILTEADLREANLTEALLYKTSLGKAQLQGAVLNRAILQRTFLRGANLSQAIL----SQAN 151
Query: 214 CKYANGTNP-ITGVSTR 229
+ AN T+ +TG + R
Sbjct: 152 LQEANLTDADLTGANLR 168
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 64/131 (48%)
Query: 76 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 135
C +N+S + + E + A +L +++ + + R AN T A + ++
Sbjct: 58 CEANLSQTILIEANLTKANLERAVLYRASLQLVNLSQSILTEADLREANLTEALLYKTSL 117
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
++ GA L +A+ + GA+LS ++ + L EANLT+A L L ++L GA +
Sbjct: 118 GKAQLQGAVLNRAILQRTFLRGANLSQAILSQANLQEANLTDADLTGANLRGANLQGAFL 177
Query: 196 EGADFSDAVID 206
A+ +A ++
Sbjct: 178 VEANLFEASLE 188
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 49/96 (51%), Gaps = 2/96 (2%)
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
++ +E R + R+ D S G L+ +AN TGAD + + + +L EA
Sbjct: 1 MNTEELLRLYAMGEREFRQVDLSYRVLRGVDLQAINFSQANLTGADFREIDLTQAILCEA 60
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
NL+ +L+ LT+++L A++ A +++L+Q
Sbjct: 61 NLSQTILIEANLTKANLERAVLYRASLQ--LVNLSQ 94
>gi|428220994|ref|YP_007105164.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994334|gb|AFY73029.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 283
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 157
A +A++ A N R A A++R S +G+ F GA L +AV N T
Sbjct: 169 ANLDTANISDADLTNANLRWATLRDANLRGSILTGANGNLANFTGANLSQAVLRGINLTN 228
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ADLS+ ++ L+ ANL A LV LT +DL GA I AD S AV+
Sbjct: 229 ADLSNAKLNAADLSNANLVGASLVGANLTSADLTGANITNADLSGAVM 276
>gi|56751008|ref|YP_171709.1| hypothetical protein syc0999_c [Synechococcus elongatus PCC 6301]
gi|81299332|ref|YP_399540.1| hypothetical protein Synpcc7942_0521 [Synechococcus elongatus PCC
7942]
gi|56685967|dbj|BAD79189.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168213|gb|ABB56553.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 195
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A+ V + RRA A +RE+D SG+ GA L ++ +A G++L +++
Sbjct: 49 ADLTGAILVGADLRRAWLRGAILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLND 108
Query: 168 MVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L EAN L A LVR +L R+D A + AD S A I
Sbjct: 109 SILAEANCRQACLQQADLVRAILYRTDFTAADLHEADLSHAFI 151
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 110 LRKAVHVKENFRRANFTSA-DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
LR+ V +R N T D+R++D S LE+A A GADL +
Sbjct: 10 LRRGTAVWSRWRSQNPTVIPDLRQADLSFVDLVNVDLERADLTGAILVGADLRRAWLRGA 69
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYA 217
+L EA+ + A L+ L RSDL A + G++ A++ D +A C+ A
Sbjct: 70 ILREADCSGANLLGADLLRSDLCRAQLVGSNLRRAMLNDSILAEANCRQA 119
>gi|119487545|ref|ZP_01621155.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
gi|119455714|gb|EAW36850.1| hypothetical protein L8106_26852 [Lyngbya sp. PCC 8106]
Length = 277
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 64/116 (55%), Gaps = 4/116 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ DL A ++ N AN T+A D +GS G+ + +AN T A+L++
Sbjct: 60 AKLMGVDLSDANLMEANLIGANLTNAKFDRCDLTGSNLRGSSSKLVSLTQANLTDANLTE 119
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ ANLTNA L+RT L +++L GA++EGA+ ++ ++ ++++ + AN
Sbjct: 120 ANLAEANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVIL----RESILEGAN 171
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 10/97 (10%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSDTLMDRMVLN 171
+ N AN T A++ E++F G+ A L + KAN TGA +L++ ++ +L
Sbjct: 109 QANLTDANLTEANLAEANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVILRESILE 168
Query: 172 EANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDA 203
ANL +A +L+ T +D+ + GAD SDA
Sbjct: 169 GANLIHATLSGALLISANFTDADMSRVTMIGADLSDA 205
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 58/137 (42%), Gaps = 30/137 (21%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 152
A F A+L A ++ N +AN T A +RES G+ A L A+
Sbjct: 125 ANFVGANLTNATLIRTNLMKANLTGAVLEGANLTNVILRESILEGANLIHATLSGALLIS 184
Query: 153 ANFT----------GADLSDTLMDRM----------VLNEANLTNAVLVRTVLTRSDLGG 192
ANFT GADLSD + + L ANL+ A L RT L+ S+L G
Sbjct: 185 ANFTDADMSRVTMIGADLSDANLSGVNLRAANVSWTTLRGANLSRARLYRTKLSWSNLSG 244
Query: 193 AIIEGADFSDAVIDLAQ 209
A + A D +D A
Sbjct: 245 ANLIEAVLLDTRLDHAN 261
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 10/101 (9%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
+A F AD+ + + + AN + ++R ++ S + GA L +A Y+ + ++LS
Sbjct: 184 SANFTDADMSRVTMIGADLSDANLSGVNLRAANVSWTTLRGANLSRARLYRTKLSWSNLS 243
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
ANL AVL+ T L ++L I GA D
Sbjct: 244 G----------ANLIEAVLLDTRLDHANLRDVDIRGAILPD 274
>gi|358458677|ref|ZP_09168884.1| pentapeptide repeat protein [Frankia sp. CN3]
gi|357077988|gb|EHI87440.1| pentapeptide repeat protein [Frankia sp. CN3]
Length = 377
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 55/109 (50%), Gaps = 5/109 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A+ DL A V A+ T A + +D +G++ + A L+ A AN TG
Sbjct: 223 ARLAGRDLTFATFVAARLTGADLTGAVLAKTKLTATDLAGTRLSRANLDGADLANANLTG 282
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A L D ++ L+EA L A+L R L R+DL GA + GAD + A +D
Sbjct: 283 ARLDDAVLTGAHLSEARLVGAILTRADLHRADLVGADLTGADLTGARLD 331
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
++T A + G++ G L A A TGADL+ ++ + L +L L R
Sbjct: 209 DWTIAHYPGAQLVGARLAGRDLTFATFVAARLTGADLTGAVLAKTKLTATDLAGTRLSRA 268
Query: 184 VLTRSDLGGAIIEGADFSDAVI 205
L +DL A + GA DAV+
Sbjct: 269 NLDGADLANANLTGARLDDAVL 290
>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
Length = 243
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 14/125 (11%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
LNKY+ R F S LR+ + N + NF SAD+R+S S FNGA L
Sbjct: 7 LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57
Query: 146 EKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
++A + + N +LS ++ L+ A LTNA L L+++ L GA + A+
Sbjct: 58 KQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANLAKANL 117
Query: 201 SDAVI 205
S AV+
Sbjct: 118 SHAVL 122
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL +++ N N + A +R++D SG++ A L A KA+ GA+L
Sbjct: 53 NGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNADLTNAYLSKASLCGANL 112
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + VL E +L RT L R++L + A S A++
Sbjct: 113 AKANLSHAVLYEVDLRPLSNRRTNLGRANLSSTDLSYAKLSSALL 157
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANFTGA 158
F SAD+R++ K NF A AD+ ES G+ L KA+ A T A
Sbjct: 36 NFESADIRQSRLGKSNFNGAILKQADLSESIIWGTNLENTNLSKAILRDTDLSGAELTNA 95
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
DL++ + + L ANL A L VL DL
Sbjct: 96 DLTNAYLSKASLCGANLAKANLSHAVLYEVDL 127
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 28/123 (22%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 158
S A DLR + + N RAN +S D+ A L A+ ++AN +GA
Sbjct: 118 SHAVLYEVDLRPLSNRRTNLGRANLSSTDLSY----------AKLSSALLFRANLSGAKL 167
Query: 159 ----------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
DL++ + L+ A+L NA+LV L +DL G ++GA+
Sbjct: 168 CRAELNQDPYKFPFLTDLTEANLQGADLSYADLGNAILVNANLKNADLTGTNLKGANLQG 227
Query: 203 AVI 205
A++
Sbjct: 228 AIM 230
>gi|434395496|ref|YP_007130443.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428267337|gb|AFZ33283.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 249
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L+ A N AN + AD+ E+D SG+ +GA L + AN + A L
Sbjct: 128 SGANLAQANLKGA-----NLTEANLSKADLTEADLSGADLSGATLSGVILSDANLSDAIL 182
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S ++ VL ANL+ AVL LT +L EGA+ S+AV+
Sbjct: 183 SRAILTLAVLQGANLSGAVLSGVNLTEVNL-----EGANLSNAVL 222
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 58/102 (56%), Gaps = 10/102 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
G A+L + N N + A++ +++ G+ A L KA +A+ +GADLS
Sbjct: 111 NLGGANLSQG-----NLSGVNLSGANLAQANLKGANLTEANLSKADLTEADLSGADLSGA 165
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ ++L++ANL++A+L R +LT A+++GA+ S AV+
Sbjct: 166 TLSGVILSDANLSDAILSRAILTL-----AVLQGANLSGAVL 202
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 44/78 (56%)
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
F+G L A +A ADLS+ ++ +L +A L+ A L RT+LT++DL A++ GA
Sbjct: 16 NFSGENLRSADLTRATLNAADLSEAILSEAILTQAELSEANLSRTILTKADLTEAVLAGA 75
Query: 199 DFSDAVIDLAQKQALCKY 216
+ A++ A+ + Y
Sbjct: 76 KLTGAILTEAELSRVNLY 93
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A+ + R N A M + +G+ L A + N +G +LS
Sbjct: 70 AVLAGAKLTGAILTEAELSRVNLYDAFMLGVNLTGANVTEGNLGGANLSQGNLSGVNLSG 129
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + L ANLT A L + LT +DL GA + GA S ++
Sbjct: 130 ANLAQANLKGANLTEANLSKADLTEADLSGADLSGATLSGVIL 172
>gi|428216301|ref|YP_007100766.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988083|gb|AFY68338.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 188
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 63/147 (42%), Gaps = 12/147 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DL A + + + AN ++R + S FN A L A N TGA D
Sbjct: 40 ADLHGCDLSGAYIIASDLQGANLADTNLRGASLKNSNFNRANLSWANMSWTNLTGASFMD 99
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----------ADFSDAV-IDLAQK 210
MD L+ ANL +A L L ++L G + G ADFS +D +
Sbjct: 100 ARMDVTNLSSANLIDADLRGANLQGANLRGTNLRGTQIEPLRSIDNADFSRVKNLDQRVR 159
Query: 211 QALCKYANGTNPITGVSTRKSLGCGNS 237
LC A G +P T STR +L C NS
Sbjct: 160 VYLCSIATGAHPFTKNSTRATLECNNS 186
>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 331
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 10/111 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
S A ++L KA ++ NF RAN T A + ++D S GA L A+ K N T
Sbjct: 65 SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLS-----GAILSSAIGTKTNLTAAIL 119
Query: 157 -GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
G L T + + L EANLT A L +LT S+L AI+ A S+A ++
Sbjct: 120 IGCSLVGTQLLKSKLKEANLTGASLTGAILTGSNLTRAILTRAILSNANLE 170
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 5/83 (6%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
+K NF+RA+ + D++ + + FN A L AN +GADLS + +++ L E N
Sbjct: 30 IKANFQRASLNNIDLKMAVLKKANFNQAQL-----INANLSGADLSQSNLEKAQLIETNF 84
Query: 176 TNAVLVRTVLTRSDLGGAIIEGA 198
+ A L L ++DL GAI+ A
Sbjct: 85 SRANLTEASLIQADLSGAILSSA 107
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSD 162
ADLR A N + AN +++++D + + + A LE AV AN G +L+
Sbjct: 202 ADLRGANLEGANLQGANLEGVNLQDADLTEANLSAANLEGAVLSNANLQQVILKGTNLTG 261
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T + L +ANL+ A L + L +DL GA + GAD + A
Sbjct: 262 TNLLNANLGQANLSQANLCQAGLLFTDLTGANLMGADLTSA 302
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
+R +H + N ++AN AD+R +D G+ GA L+ A N ADL++ +
Sbjct: 180 IRAYLH-RVNLKKANLEKADLRFADLRGANLEGANLQGANLEGVNLQDADLTEANLSAAN 238
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
L A L+NA L + +L ++L G + A+ A +L+Q LC+
Sbjct: 239 LEGAVLSNANLQQVILKGTNLTGTNLLNANLGQA--NLSQAN-LCQ 281
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 53/95 (55%), Gaps = 10/95 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A + DL+ AV ++ANF A + ++ SG+ + + LEKA + NF+ A+L++
Sbjct: 37 ASLNNIDLKMAV-----LKKANFNQAQLINANLSGADLSQSNLEKAQLIETNFSRANLTE 91
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
L +A+L+ A+L + T+++L AI+ G
Sbjct: 92 A-----SLIQADLSGAILSSAIGTKTNLTAAILIG 121
>gi|422295276|gb|EKU22575.1| hypothetical protein NGA_0469800 [Nannochloropsis gaditana CCMP526]
Length = 90
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
NF GAD S+ ++DR+ + +NL ++ VL+ + GA + +DF+D + + L
Sbjct: 7 NFEGADFSNAVVDRVSFDGSNLKGSIFSNAVLSGTSFVGADLTDSDFTDTYMGEFNLREL 66
Query: 214 CKYAN--GTNPITGVSTRKSLGC 234
CK GTNP+T T++S GC
Sbjct: 67 CKNPTLKGTNPVTQAPTKESAGC 89
>gi|300864770|ref|ZP_07109621.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300337239|emb|CBN54769.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 334
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DLR ++ N + N T AD+RE+D S + N A L+ A AN GA L
Sbjct: 230 ADLHDTDLRGGNLIQANLMKTNLTEADLREADLSHTNLNLANLKGADLSGANLQGAYLWA 289
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T +D L A+L A L +++ +DL AI+ GA D I
Sbjct: 290 TNLDGACLKGADLRGASLRNAIISGADLRDAILTGATMPDGKI 332
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 5/110 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S AQ A+L V R +AN AD+ ++D G A L K +A+
Sbjct: 198 SGAQLSGANLSGTVLSGARMRFTKLEQANLKQADLHDTDLRGGNLIQANLMKTNLTEADL 257
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ADLS T ++ L A+L+ A L L ++L GA ++GAD A +
Sbjct: 258 READLSHTNLNLANLKGADLSGANLQGAYLWATNLDGACLKGADLRGASL 307
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 148
EA G F G+ F A L+ + + + +A+ A + + D +G++ +GA L
Sbjct: 58 LEANLNGAFLYGANLSF--AKLKGSHLLGADLTKADLRGAQLAKVDLTGAQLSGAILSWV 115
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
++AN G +L + + L ANL A L LT + L GA ++GA
Sbjct: 116 SLFQANLPGVNLCGANLSGINLRSANLAGANLNWANLTGARLSGANLKGA 165
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 45/101 (44%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +LR A N AN T A + ++ G+ NG L KA N G D S
Sbjct: 130 ANLSGINLRSANLAGANLNWANLTGARLSGANLKGALLNGVKLNKAFLNGLNLAGIDFSG 189
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ + L+ A L+ A L TVL+ + + +E A+ A
Sbjct: 190 LELEDVKLSGAQLSGANLSGTVLSGARMRFTKLEQANLKQA 230
>gi|411117892|ref|ZP_11390273.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410711616|gb|EKQ69122.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 577
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 65/149 (43%), Gaps = 35/149 (23%)
Query: 103 AQFGSADLRKAVHVKENF----------RRANFTSADMRE---------------SDFSG 137
AQ A+LR+A V N R+AN T AD+ +D S
Sbjct: 165 AQLDEANLREATLVGTNLNEASLIGAYLRQANLTEADLHRVVLSSADLSEAILANADLSR 224
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
+ GAYL KA +KA+ ADL D + R L+EANL A L R L+ + L I+
Sbjct: 225 ANLAGAYLLKASFHKAHLLRADLQDVYLLRADLSEANLRGANLQRADLSGAYLNHTILSE 284
Query: 198 ADFSDAV----------IDLAQKQALCKY 216
AD S+A +D AQ A C Y
Sbjct: 285 ADLSEAYLLQSHLIHTNLDGAQLTACCIY 313
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 20/116 (17%)
Query: 103 AQFGSADLRKAVHVKENFRRANFT---------------SADMRESDFSGSKFNGAYLEK 147
A+ SA L+ A ++ N RRAN A++RE+ G+ N A L
Sbjct: 130 AKLNSAQLKGAELMEANLRRANLAGANLDQANLREAQLDEANLREATLVGTNLNEASLIG 189
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A +AN T ADL R+VL+ A+L+ A+L L+R++L GA + A F A
Sbjct: 190 AYLRQANLTEADLH-----RVVLSSADLSEAILANADLSRANLAGAYLLKASFHKA 240
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%)
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
D SD SG+ +G L A +AN T A+LS +++ +L ANL A L L+ +
Sbjct: 16 DFSHSDLSGANLSGFNLRGANFTEANLTEANLSWAFLNQAILTGANLRRADLRNASLSGA 75
Query: 189 DLGGAIIEGADFSDAVIDLAQKQ 211
DL AI+ GA+ S + LAQ Q
Sbjct: 76 DLNHAILHGANLSKIDLRLAQLQ 98
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +DL A N R ANFT A++ E++ S + N A L A +A+ A LS
Sbjct: 17 FSHSDLSGANLSGFNLRGANFTEANLTEANLSWAFLNQAILTGANLRRADLRNASLSGAD 76
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
++ +L+ ANL+ L L +++L A ++ AD A + A+
Sbjct: 77 LNHAILHGANLSKIDLRLAQLQQANLNWATLQDADMGGANLAFAK 121
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 50/121 (41%), Gaps = 15/121 (12%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADM---------------RESDFSGSKFNG 142
I A DLR A + N A ADM + + ++ G
Sbjct: 80 AILHGANLSKIDLRLAQLQQANLNWATLQDADMGGANLAFAKLDQVNLERAKLNSAQLKG 139
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A L +A +AN GA+L + L+EANL A LV T L + L GA + A+ ++
Sbjct: 140 AELMEANLRRANLAGANLDQANLREAQLDEANLREATLVGTNLNEASLIGAYLRQANLTE 199
Query: 203 A 203
A
Sbjct: 200 A 200
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 57/128 (44%), Gaps = 20/128 (15%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 155
+ A A L +A+ N RRA+ +A + +D + + +GA L K A +AN
Sbjct: 43 TEANLSWAFLNQAILTGANLRRADLRNASLSGADLNHAILHGANLSKIDLRLAQLQQANL 102
Query: 156 TGADLSDTLM---------------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A L D M +R LN A L A L+ L R++L GA ++ A+
Sbjct: 103 NWATLQDADMGGANLAFAKLDQVNLERAKLNSAQLKGAELMEANLRRANLAGANLDQANL 162
Query: 201 SDAVIDLA 208
+A +D A
Sbjct: 163 REAQLDEA 170
>gi|427715911|ref|YP_007063905.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348347|gb|AFY31071.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 589
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 5/110 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SA+L A + R N +SAD+ +D S + N A L A AN + ADL
Sbjct: 321 SHADLSSANLSGANLTNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSADL 380
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 205
S T + L++ANL+ L L R+DL G AI+ G + SD ++
Sbjct: 381 SHTHLFGANLSDANLSGVNLSHADLCRADLSGADMSKAILNGTNLSDTIL 430
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A AD+ KA+ N N + A + +D S +K NGA L A A F G
Sbjct: 408 ADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLNGAMFLG 467
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
ADLS + ++LN+A+L+ +L L+ +DL AI+ G D S A ++ A
Sbjct: 468 ADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRA 518
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 8/110 (7%)
Query: 95 GEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
GEF G A G A+L A NF AN + A + +++ +G F+GA L A
Sbjct: 252 GEFLRGGNFRGAYLGDANLTGA-----NFSGANLSGAYLGDANLTGVNFSGANLSGANLG 306
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
AN +GA+LS+ + L+ ANL+ A L T L R++L A + AD S
Sbjct: 307 DANLSGANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADLS 356
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 58/103 (56%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G A+L NF AN + A++ +++ SG+ + A L A AN +GA+L
Sbjct: 281 SGAYLGDANLTGV-----NFSGANLSGANLGDANLSGANLSNANLSHADLSSANLSGANL 335
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++T ++R L+ A+L++A L T L +DL A ++ A+ S A
Sbjct: 336 TNTDLNRTNLSSADLSSADLSSTNLNSADLSSANLKDANLSSA 378
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 13/103 (12%)
Query: 117 KENFRRAN---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-------- 165
K N+ R N F AD+ D SG N A L + +A+ + ADLSD ++
Sbjct: 454 KLNYARLNGAMFLGADLSGVDLSGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYA 513
Query: 166 --DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+R L+ +NL+ A+L L+ ++L AI+ GAD SDA ++
Sbjct: 514 NLNRANLSGSNLSGALLNGADLSHTNLSCAILGGADVSDANLE 556
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
V E R NF A + +++ +G+ F+GA L A AN TG + S + L +ANL
Sbjct: 251 VGEFLRGGNFRGAYLGDANLTGANFSGANLSGAYLGDANLTGVNFSGANLSGANLGDANL 310
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ A L L+ +DL A + GA+ ++
Sbjct: 311 SGANLSNANLSHADLSSANLSGANLTN 337
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 51/103 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ F A+L A N AN ++A++ +D S + +GA L + N + ADL
Sbjct: 291 TGVNFSGANLSGANLGDANLSGANLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADL 350
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + LN A+L++A L L+ +DL + GA+ SDA
Sbjct: 351 SSADLSSTNLNSADLSSANLKDANLSSADLSHTHLFGANLSDA 393
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+A S +L A N + AN +SAD+ + G+ + A L A+ ADL
Sbjct: 351 SSADLSSTNLNSADLSSANLKDANLSSADLSHTHLFGANLSDANLSGVNLSHADLCRADL 410
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S M + +LN NL++ +L T +L AI+ AD S A ++
Sbjct: 411 SGADMSKAILNGTNLSDTILFST-----NLSDAILIAADLSYAKLN 451
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 13/134 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
S A +L + N A +AD+ + +G+K N A L A+ A+ +G
Sbjct: 416 SKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDL 475
Query: 158 -------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DL 207
ADLS L+ L++A+L++A+L T L+ ++L A + G++ S A++ DL
Sbjct: 476 SGVILNDADLSGVLLSEADLSDADLSDAILFGTDLSYANLNRANLSGSNLSGALLNGADL 535
Query: 208 AQKQALCKYANGTN 221
+ C G +
Sbjct: 536 SHTNLSCAILGGAD 549
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G A+L A N AN + AD+ ++ SG+ L + A+ + ADL
Sbjct: 301 SGANLGDANLSGA-----NLSNANLSHADLSSANLSGANLTNTDLNRTNLSSADLSSADL 355
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S T ++ L+ ANL +A L L+ + L GA + A+ S
Sbjct: 356 SSTNLNSADLSSANLKDANLSSADLSHTHLFGANLSDANLS 396
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 7/99 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F ++ L + +H + NF+S + G F GAYL A N TGA+ S
Sbjct: 227 FFTSQLLRVIHYSDAIEIGNFSS--IVGEFLRGGNFRGAYLGDA-----NLTGANFSGAN 279
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L +ANLT L+ ++LG A + GA+ S+A
Sbjct: 280 LSGAYLGDANLTGVNFSGANLSGANLGDANLSGANLSNA 318
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 173
N A+ AD+ +D S + NG L + + N + ADLS ++ LN A
Sbjct: 399 NLSHADLCRADLSGADMSKAILNGTNLSDTILFSTNLSDAILIAADLSYAKLNGAKLNYA 458
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L A+ + L+ DL G I+ AD S ++
Sbjct: 459 RLNGAMFLGADLSGVDLSGVILNDADLSGVLL 490
>gi|443324431|ref|ZP_21053184.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442795950|gb|ELS05284.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 239
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 60/105 (57%), Gaps = 10/105 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGA 158
QF +L++A +K N + T+AD+R++ S F A L A + + +FT A
Sbjct: 16 QFSRINLQEAELIKVNLSNVDLTAADLRQARLGRSNFGHACLRSADLSESILWGTDFTQA 75
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
DLS + V+ EA+L+ A+L + L +++L +I+EGA+FS A
Sbjct: 76 DLS-----QAVMREADLSGAILTQANLEKANLIKSILEGANFSGA 115
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 94 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
R FG A SADL +++ +F +A+ + A MRE+D SG+ A LEKA K+
Sbjct: 49 RSNFG---HACLRSADLSESILWGTDFTQADLSQAVMREADLSGAILTQANLEKANLIKS 105
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GA+ S + ++ E +L A RT L+++DL A + A+ S A++
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALL 157
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 57/123 (46%), Gaps = 18/123 (14%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I A F A LR A+ ++ + R A+ ++ ++D S + + A L A+ Y+A GA
Sbjct: 106 ILEGANFSGAKLRHALMIEVDLRPASDYRTNLSQADLSYADLSYANLSMALLYQAKLDGA 165
Query: 159 ------------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
DL++ + L+ ANLT A+L R LT +DL G I+ D
Sbjct: 166 RLSRANLSAGRGENALATDLTEASLRDADLSYANLTGAILHRADLTGADLTGTILTNTDL 225
Query: 201 SDA 203
+A
Sbjct: 226 REA 228
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 59/128 (46%), Gaps = 6/128 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY-----LEKAVAYKANF 155
S A ADL A+ + N +AN + + ++FSG+K A L A Y+ N
Sbjct: 78 SQAVMREADLSGAILTQANLEKANLIKSILEGANFSGAKLRHALMIEVDLRPASDYRTNL 137
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ ADLS + L+ A L A L L+R++L E A +D + + + + A
Sbjct: 138 SQADLSYADLSYANLSMALLYQAKLDGARLSRANLSAGRGENALATD-LTEASLRDADLS 196
Query: 216 YANGTNPI 223
YAN T I
Sbjct: 197 YANLTGAI 204
>gi|167771967|ref|ZP_02444020.1| hypothetical protein ANACOL_03340 [Anaerotruncus colihominis DSM
17241]
gi|167665765|gb|EDS09895.1| pentapeptide repeat protein [Anaerotruncus colihominis DSM 17241]
Length = 314
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 67/129 (51%), Gaps = 9/129 (6%)
Query: 86 LNKYEAETRGEF-GIG---SAAQFGSADLRKAVH-----VKENFRRANFTSADMRESDFS 136
L+K+ A RGE G+ + A ADL KA N +AN + A++ ++ S
Sbjct: 7 LDKHAAWLRGEPEGVKADLTGANLPGADLSKANLSGANLFGANLSKANLSGANLFGANLS 66
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
G+ GA L KA AN +GADLS T + L++ANL+ A L L+R+ L GA +
Sbjct: 67 GANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLS 126
Query: 197 GADFSDAVI 205
A+ S A +
Sbjct: 127 KANLSKANL 135
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 65/133 (48%), Gaps = 11/133 (8%)
Query: 84 ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 137
ADL+K FG S A A+L A N +AN + A++ +D S
Sbjct: 33 ADLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLSR 92
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGG 192
+ GA L KA AN +GADLS T + + L++ANL+ A L L++++L G
Sbjct: 93 THLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSG 152
Query: 193 AIIEGADFSDAVI 205
A + GA+ S A +
Sbjct: 153 ANLFGANLSGANL 165
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL + + +AN + A++ +D S + GA L KA KAN +GA+L
Sbjct: 81 SGANLSGADLSRTHLPGADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANL 140
Query: 161 SDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 201
+ + L+ ANL + A L L++++L GA + GAD S
Sbjct: 141 FGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSGADLS 186
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL + + +AN + A++ ++ G+ + A L A + AN +GA+L
Sbjct: 106 SGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLSGANL 165
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + L+ ANL+ A L RT L +DL A + A+ S A +
Sbjct: 166 FGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANL 210
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL KA K N AN A++ +++ SG+ GA L A + AN + A+LS +
Sbjct: 123 ADLSKANLSKANLSGANLFGANLSKANLSGANLFGANLSGANLFGANLSKANLSGANLSG 182
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
L+ +L A L + L++++L GA + G
Sbjct: 183 ADLSRTHLPGADLSKANLSKANLSGANLSG 212
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 55/105 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A + + A+ + A++ +++ SG+ GA L KA AN GA+L
Sbjct: 101 SKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLFGANL 160
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + L++ANL+ A L L+R+ L GA + A+ S A +
Sbjct: 161 SGANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKANL 205
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 50/108 (46%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A A+L A + R + AD+ +++ S + +GA L A KAN +GA+L
Sbjct: 97 GADLSKANLSGANLSGADLSRTHLPGADLSKANLSKANLSGANLFGANLSKANLSGANLF 156
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ L ANL+ A L L+ +DL + GAD S A + A
Sbjct: 157 GANLSGANLFGANLSKANLSGANLSGADLSRTHLPGADLSKANLSKAN 204
>gi|312195986|ref|YP_004016047.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
gi|311227322|gb|ADP80177.1| pentapeptide repeat protein [Frankia sp. EuI1c]
Length = 377
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 10/105 (9%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA+ ADL ++ +K + +D +G++ + A L+ A AN TGA L
Sbjct: 237 AARLTGADLTGSILIKTK----------LTATDLAGARLSQANLDGADLANANLTGARLD 286
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
D ++ + L+E L +AVL R L R+DL GA + GAD + A +D
Sbjct: 287 DAILTGVHLSEGRLVDAVLTRANLHRADLVGADLTGADLTGARLD 331
>gi|123968240|ref|YP_001009098.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
gi|123198350|gb|ABM69991.1| Pentapeptide repeat-containing protein [Prochlorococcus marinus
str. AS9601]
Length = 157
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A +G L A + + A F D+++++ S + A L A N + ++L
Sbjct: 21 AALDYGKQSLIGADFSGSDLKGATFYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNL 80
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ +D +L+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 81 REVTLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVFLPKDIIRKFCESATGT 140
Query: 221 NPITGVSTRKSLGC 234
NPIT TR++L C
Sbjct: 141 NPITNRETRETLEC 154
>gi|78033474|emb|CAJ30090.1| hypothetical acidic protein, pentapeptide repeat [Magnetospirillum
gryphiswaldense MSR-1]
gi|144901135|emb|CAM77999.1| pentapeptide repeat containing protein [Magnetospirillum
gryphiswaldense MSR-1]
Length = 503
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 57/100 (57%), Gaps = 8/100 (8%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+LRKAV N R +N A + ++D SG+K GA L A +ANF+GA++ R
Sbjct: 28 ANLRKAVLSGANLRDSNLPRASLEDADLSGAKLQGANLAGATLLRANFSGANM------R 81
Query: 168 MV-LNEANLTNAVLVRTV-LTRSDLGGAIIEGADFSDAVI 205
M L ANL + + V LT ++L GA + GA+FS A +
Sbjct: 82 MANLAGANLAGRMDLSGVDLTGANLAGAKLMGANFSGATL 121
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN A + ++FSG+ GA L A A ANF+GADL+D + +L+ AN++ AV+ R
Sbjct: 104 ANLAGAKLMGANFSGATLTGANLAGADARNANFSGADLTDAVTAGTLLDGANMSGAVIRR 163
Query: 183 T 183
+
Sbjct: 164 S 164
Score = 40.8 bits (94), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
R + TSA++R ++ +G +G+ L A KA +GA+L D+ + R L +A+L+ A
Sbjct: 2 RPDLTSANLRGANLAGMDLSGSLLSLANLRKAVLSGANLRDSNLPRASLEDADLSGA--- 58
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLA 208
L ++L GA + A+FS A + +A
Sbjct: 59 --KLQGANLAGATLLRANFSGANMRMA 83
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES-----------DFSGSKFNGAYLEKAVAY 151
A A L+ A RANF+ A+MR + D SG GA L A
Sbjct: 53 ADLSGAKLQGANLAGATLLRANFSGANMRMANLAGANLAGRMDLSGVDLTGANLAGAKLM 112
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ANF+GA L+ + AN + A L V G +++GA+ S AVI
Sbjct: 113 GANFSGATLTGANLAGADARNANFSGADLTDAV-----TAGTLLDGANMSGAVI 161
>gi|220907082|ref|YP_002482393.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863693|gb|ACL44032.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 309
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L ++ N R A T AD+RE+ K N A L ++ +AN TGADL
Sbjct: 185 ADFQGANLSRSTLTGANLRGAYLTGADLREA-----KLNEANLRRSDLSQANLTGADLRG 239
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++R L ANL ++L+ L ++L A ++GA+ +AV+
Sbjct: 240 ANLNRATLRGANLRESILIGASLMGANLSQASLQGANLLEAVL 282
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 54/106 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A++ A+ + N A+ T A++ +++ G+ GAYL A TGA+L
Sbjct: 113 SEANLTGAEISAAILREANLTLADLTLAELSQTNLRGANLTGAYLRGAELLGTQLTGAEL 172
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S L EA+ A L R+ LT ++L GA + GAD +A ++
Sbjct: 173 SQANFRGTNLTEADFQGANLSRSTLTGANLRGAYLTGADLREAKLN 218
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 15/103 (14%)
Query: 108 ADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADLR+A + N RR AN T AD+R ++ + + GA L +++ A+ GA+LS
Sbjct: 210 ADLREAKLNEANLRRSDLSQANLTGADLRGANLNRATLRGANLRESILIGASLMGANLS- 268
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+A+L A L+ VLT ++L G + G D S V+
Sbjct: 269 ---------QASLQGANLLEAVLTGANLTGVDLTGVDLSATVM 302
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 48/93 (51%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLRK K N A+ T A++ +D S + GA + A+ +AN T ADL+ + +
Sbjct: 85 ADLRKVNLRKANLTGADLTGANLTGADLSEANLTGAEISAAILREANLTLADLTLAELSQ 144
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L ANLT A L L + L GA + A+F
Sbjct: 145 TNLRGANLTGAYLRGAELLGTQLTGAELSQANF 177
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 59/115 (51%), Gaps = 1/115 (0%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A L +A + + + + A++R + SG+ GA L A AN + ADL
Sbjct: 17 FTGASLYQANLNRVHLSQVDLQGANLRGAGLSGANLQGADLRGATLAAANLSNADLRGAD 76
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
+ ++L EA+L L + LT +DL GA + GAD S+A + A+ A+ + AN
Sbjct: 77 LRGVLLMEADLRKVNLRKANLTGADLTGANLTGADLSEANLTGAEISAAILREAN 131
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 52/109 (47%), Gaps = 15/109 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR A AN ++AD+R +D G A L K KAN TGADL
Sbjct: 48 SGANLQGADLRGAT-----LAAANLSNADLRGADLRGVLLMEADLRKVNLRKANLTGADL 102
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ ANLT A L LT +++ AI+ A+ + A + LA+
Sbjct: 103 TG----------ANLTGADLSEANLTGAEISAAILREANLTLADLTLAE 141
>gi|436670209|ref|YP_007317948.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428262481|gb|AFZ28430.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 309
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
+T E I S A DL K+ + RRA+ T AD+ E+D + A L +
Sbjct: 162 QTNWEGAILSQASLQRVDLEKSQLNETILRRADLTEADLVEADLRYADLTEAILCRVALE 221
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 205
AN GADLS + R L A+L AVL T L +DL A + G+DFSD+ +
Sbjct: 222 LANLVGADLSRATLKRASLFRADLEGAVLQDTNLVETDLRYANFKDTQLMGSDFSDSRV 280
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A V+ RAN ++E+D +G+ F+ A L + N+ GA LS
Sbjct: 118 ADLSEASLESACLVQAVLSRANLFKVSLKEADCTGANFDEANLR-----QTNWEGAILSQ 172
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ R+ L ++ L +L R LT +DL A + AD ++A++
Sbjct: 173 ASLQRVDLEKSQLNETILRRADLTEADLVEADLRYADLTEAIL 215
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 51/90 (56%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ NF+ + + D++ ++ S F+ A L A AN +G L T ++R L +ANLT
Sbjct: 17 ERNFQNLDLSRVDLKGTNLKSSDFSHANLNSADLSYANLSGTSLIWTDLNRANLRQANLT 76
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A L+R+ L +DL A + A+ S+A+++
Sbjct: 77 QACLLRSSLFWADLQEATLVNANLSNALLN 106
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 59/128 (46%), Gaps = 3/128 (2%)
Query: 79 NISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 138
NI +D+ K+ A+ F DL+ +F AN SAD+ ++ SG+
Sbjct: 2 NIINASDIVKHYADQERNF---QNLDLSRVDLKGTNLKSSDFSHANLNSADLSYANLSGT 58
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
L +A +AN T A L + + L EA L NA L +L +L A ++GA
Sbjct: 59 SLIWTDLNRANLRQANLTQACLLRSSLFWADLQEATLVNANLSNALLNHVNLTSACLKGA 118
Query: 199 DFSDAVID 206
D S+A ++
Sbjct: 119 DLSEASLE 126
>gi|392410624|ref|YP_006447231.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
gi|390623760|gb|AFM24967.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
Length = 285
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L+KA +F RA+ + AD+ +D SG+ +GA L A + + + DL
Sbjct: 161 SGADLFGAKLKKAALSAVDFSRADLSGADLSGADLSGAILSGARLNGANLSRVDLSFTDL 220
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
S + L+ ANLT A L + L+ +DL GA ++GAD +D
Sbjct: 221 SGAHLSGANLSAANLTGAYLPGSDLSGADLSGANLQGADITD 262
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A+ N +A AD+ +D G+K A L +A+ +GADL
Sbjct: 131 SKANLSQADLSRAILSGANLSKALLPFADLSGADLFGAKLKKAALSAVDFSRADLSGADL 190
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + +L+ A L A L R L+ +DL GA + GA+ S A
Sbjct: 191 SGADLSGAILSGARLNGANLSRVDLSFTDLSGAHLSGANLSAA 233
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+AA+ +L +A N A+ + A++ ++D S + +GA L KA+ A+ +GADL
Sbjct: 106 AAAKLVEINLTQANLCGANLCGADLSKANLSQADLSRAILSGANLSKALLPFADLSGADL 165
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L A L +R+DL GA + GAD S A++
Sbjct: 166 F----------GAKLKKAALSAVDFSRADLSGADLSGADLSGAIL 200
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 10/106 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A+ AN + D+ +D SG+ +GA L A A G+DL
Sbjct: 186 SGADLSGADLSGAILSGARLNGANLSRVDLSFTDLSGAHLSGANLSAANLTGAYLPGSDL 245
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S A+L+ A L +T SDL GA + GA+ +D
Sbjct: 246 S----------GADLSGANLQGADITDSDLSGANLNGANLDGTKLD 281
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 20/118 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT------ 156
AQ DL A N A +SAD+ E++ SG+ + A+L A +AN +
Sbjct: 43 AQLSGEDLSFA-----NLSNAKLSSADLSEANLSGASLDRAHLTVAKLDRANLSNANASC 97
Query: 157 ----GADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 205
GA L+ + + L +ANL A L + L+++DL AI+ GA+ S A++
Sbjct: 98 AGLLGARLAAAKLVEINLTQANLCGANLCGADLSKANLSQADLSRAILSGANLSKALL 155
>gi|86605499|ref|YP_474262.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554041|gb|ABC98999.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 330
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 58/118 (49%), Gaps = 3/118 (2%)
Query: 91 AETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 147
A+ RG +G Q G A+L++A+ + N AN + AD+ +D S + A L +
Sbjct: 207 ADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAACLRSAKLAR 266
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+AN GADL + L NL NA L +LTR+DL A + GA+ A +
Sbjct: 267 TDLSRANLAGADLRSASLVDAYLGRTNLENADLREAILTRADLSTANLAGANLRGATL 324
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 54/121 (44%), Gaps = 20/121 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G A L+KA V N AN + AD+ E+D + +G L+ A + AN A L D
Sbjct: 52 AYLGRAKLQKANLVGANLSGANLSQADLSEADLRDAHLHGTTLQGADLHGANLALALLID 111
Query: 163 TLMDRMVLNEANLTNA--------------------VLVRTVLTRSDLGGAIIEGADFSD 202
+ L ANLT+A VL L+R+DL GA + GAD +
Sbjct: 112 ANLLEADLRWANLTSANLGGACLRGANLRFESRRAAVLRSANLSRADLSGANLAGADLTR 171
Query: 203 A 203
A
Sbjct: 172 A 172
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 55/107 (51%), Gaps = 4/107 (3%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ RA+ D+ ++D G AYL +A KAN GA+LS + + L+EA+L +A
Sbjct: 28 DLSRADLIGIDLSQADLHGINLIFAYLGRAKLQKANLVGANLSGANLSQADLSEADLRDA 87
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITG 225
L T L +DL GA + A DA + +A ++AN T+ G
Sbjct: 88 HLHGTTLQGADLHGANLALALLIDANL----LEADLRWANLTSANLG 130
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 44/98 (44%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A N + A+ A ++ ++ + GA L A A+F G DL
Sbjct: 160 SGANLAGADLTRADLRGANLKEASLIGAHLQGANLQRACLRGALLSNADLRGASFLGGDL 219
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
M R L EA L+ L L+ +DL GA + A
Sbjct: 220 QGVQMGRANLKEAMLSQVNLAEANLSEADLAGADLSAA 257
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 7/135 (5%)
Query: 66 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANF 125
T L A + + ++ L D N EA+ R + ++A G A LR A E+ R A
Sbjct: 92 TTLQGADLHGANLALALLIDANLLEADLR--WANLTSANLGGACLRGANLRFESRRAAVL 149
Query: 126 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 185
SA++ +D SG+ GA L +A+ GA+L + + L ANL A L +L
Sbjct: 150 RSANLSRADLSGANLAGADL-----TRADLRGANLKEASLIGAHLQGANLQRACLRGALL 204
Query: 186 TRSDLGGAIIEGADF 200
+ +DL GA G D
Sbjct: 205 SNADLRGASFLGGDL 219
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 10/108 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG-----AYLEKAVAYKANFTG 157
A A+L++A R A ++AD+R + F G G A L++A+ + N
Sbjct: 187 AHLQGANLQRAC-----LRGALLSNADLRGASFLGGDLQGVQMGRANLKEAMLSQVNLAE 241
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+LS+ + L+ A L +A L RT L+R++L GA + A DA +
Sbjct: 242 ANLSEADLAGADLSAACLRSAKLARTDLSRANLAGADLRSASLVDAYL 289
>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 402
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 55/103 (53%), Gaps = 8/103 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA K NF ANFT A + E+ G+ F AYL +A+ TGA+L+
Sbjct: 281 AILAGADLTKA---KANFTGANFTGAILTEAILIGANFEKAYL-----IRADLTGANLTG 332
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T + R L EA+LT A L R L ++ L AI+E A++
Sbjct: 333 TNLTRADLTEADLTGANLTRAYLIKAILEEAILEEVILRGAIL 375
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 60/113 (53%), Gaps = 12/113 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAV-------A 150
A A+L++A+ + NF A FT AD+ E++F+ + GA E+A+
Sbjct: 231 AILAEANLKRAILIGANFEGAIFTRADLAEANFTRAILTEAILIGANFEEAILAGADLTK 290
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
KANFTGA+ + ++ +L AN A L+R LT ++L G + AD ++A
Sbjct: 291 AKANFTGANFTGAILTEAILIGANFEKAYLIRADLTGANLTGTNLTRADLTEA 343
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 69/145 (47%), Gaps = 9/145 (6%)
Query: 63 FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 120
F L A++ A+ I A ADL K +A G A F A L +A+ + NF
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIGANF 315
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
+A AD+ ++ +G+ A L +A AN T A L +++ +L E L A+L
Sbjct: 316 EKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEAILEEVILRGAIL 375
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVI 205
+LTR+ L GA ++GA D I
Sbjct: 376 RGAILTRAILRGANLKGATMPDGSI 400
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + A++ E++F + A L++A+ ANF GA + R L EAN T A+L
Sbjct: 217 NISKANLTEANFKRAILAEANLKRAILIGANFEGA-----IFTRADLAEANFTRAILTEA 271
Query: 184 VLTRSDLGGAIIEGADFSDA 203
+L ++ AI+ GAD + A
Sbjct: 272 ILIGANFEEAILAGADLTKA 291
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
++ KA + NF+RA A+++ + G+ F GA +A +ANFT A L++
Sbjct: 217 NISKANLTEANFKRAILAEANLKRAILIGANFEGAIFTRADLAEANFTRAILTEA----- 271
Query: 169 VLNEANLTNAVLVRTVLT--RSDLGGAIIEGADFSDAVI 205
+L AN A+L LT +++ GA GA ++A++
Sbjct: 272 ILIGANFEEAILAGADLTKAKANFTGANFTGAILTEAIL 310
>gi|316934318|ref|YP_004109300.1| pentapeptide repeat-containing protein [Rhodopseudomonas palustris
DX-1]
gi|315602032|gb|ADU44567.1| pentapeptide repeat protein [Rhodopseudomonas palustris DX-1]
Length = 273
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A N RA+ + A++ +D SG+ +GA L +A + AN +GADL
Sbjct: 57 SGANLSGADLSGANLSGANLYRADLSGANLSGADLSGANLSGANLYRAKLFSANLSGADL 116
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
S L+ ANL A L L R+DL GA + GAD S A + A
Sbjct: 117 SGA-----NLSGANLYRADLSGANLYRADLSGANLSGADLSGANLHRAN 160
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 53/103 (51%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A N RA A++ ++ SG+ +GA L A Y+A+ +GA+L
Sbjct: 27 SGANLSGADLSGANLSGANLYRAKLFGANLSGANLSGADLSGANLSGANLYRADLSGANL 86
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S L+ ANL+ A L R L ++L GA + GA+ S A
Sbjct: 87 SGA-----DLSGANLSGANLYRAKLFSANLSGADLSGANLSGA 124
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + AD+ ++ SG+ +GA L A Y+A GA+LS + L+ ANL+ A L R
Sbjct: 20 NLSGADLSGANLSGADLSGANLSGANLYRAKLFGANLSGANLSGADLSGANLSGANLYRA 79
Query: 184 VLTRSDLGGAIIEGADFSDA 203
L+ ++L GA + GA+ S A
Sbjct: 80 DLSGANLSGADLSGANLSGA 99
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 5/93 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 155
S A ADL A N RA SA++ +D SG+ +GA L +A Y+A+
Sbjct: 82 SGANLSGADLSGANLSGANLYRAKLFSANLSGADLSGANLSGANLYRADLSGANLYRADL 141
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
+GA+LS + L+ ANL+ A V L R+
Sbjct: 142 SGANLSGADLSGANLHRANLSGAKGVDLSLART 174
>gi|330509039|ref|YP_004385467.1| pentapeptide repeat-containing protein [Methanosaeta concilii GP6]
gi|328929847|gb|AEB69649.1| pentapeptide repeat protein [Methanosaeta concilii GP6]
Length = 386
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-VAY----KANF 155
S + AD +A ++ N AN ADM +D + + GA L+ A + Y KANF
Sbjct: 204 SGSDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVLTGASLKSAKMPYSDLTKANF 263
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
TGADLS+ +D +L A L NA L R L DL G + GA ++V+
Sbjct: 264 TGADLSEAYLDGAILAGATLRNAKLDRVNLREVDLRGLEMGGASLKNSVL 313
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +DL+ N A SA + S +GS A L AV +A+ TGADL+
Sbjct: 51 AHLNQSDLQGCNLNGSNLDGAYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTG 110
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R+ +++A L A +V+ LT +D+ + + AD +DA
Sbjct: 111 ANLIRVQMSKAKLNGARIVKADLTEADISDSDLSDADLTDA 151
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLTN 177
A+ +D++ + +GS +GAYL A ++ G ADL+ ++ L A+LT
Sbjct: 51 AHLNQSDLQGCNLNGSNLDGAYLRSAWLMASHLNGSTLENADLTGAVLTEADLTGADLTG 110
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L+R ++++ L GA I AD ++A I
Sbjct: 111 ANLIRVQMSKAKLNGARIVKADLTEADI 138
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADL AV + + A+ T A++ S +K NGA + KA +A+ + +DLSD +
Sbjct: 90 NADLTGAVLTEADLTGADLTGANLIRVQMSKAKLNGARIVKADLTEADISDSDLSDADLT 149
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L +L+ A L LT +++ GA I AD S A + Q
Sbjct: 150 DARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSVAYLSQGQ 192
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 59/118 (50%), Gaps = 15/118 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-------------V 149
A+ ADL +A + A+ T A + +D SG+K G YL A V
Sbjct: 126 ARIVKADLTEADISDSDLSDADLTDARLFRTDLSGAKLKGIYLTSANMIGAHISWADMSV 185
Query: 150 AY--KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AY + F+ A+L T + L++A+ T A L+R+ LT +++ A + AD ++AV+
Sbjct: 186 AYLSQGQFSRAELYSTNLSGSDLSDADFTRAYLMRSNLTGANIDWADMAYADLTEAVL 243
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 45/103 (43%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L+ A + +ANFT AD+ E+ G+ GA L A + N DL
Sbjct: 241 AVLTGASLKSAKMPYSDLTKANFTGADLSEAYLDGAILAGATLRNAKLDRVNLREVDLRG 300
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + A+L N+VL + +DL GAD DA +
Sbjct: 301 -----LEMGGASLKNSVLTGVFMAMTDLA-----GADLRDATL 333
>gi|157413067|ref|YP_001483933.1| pentapeptide repeat-containing protein [Prochlorococcus marinus
str. MIT 9215]
gi|157387642|gb|ABV50347.1| Pentapeptide repeat-containing proteins [Prochlorococcus marinus
str. MIT 9215]
Length = 157
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A +G L A + + A F D+++++ S + A L A N + ++L
Sbjct: 21 AALDYGKQSLIGADFSGSDLKGATFYLTDLQDANLSDCELQNATLYGAKLKDTNLSNSNL 80
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ +D +L+ +L+N L + + I+GADF++ + + C+ A GT
Sbjct: 81 REVTLDSAILDGTDLSNTNLEDSFAYSTQFENVKIQGADFTNVYLPKDIIREFCESATGT 140
Query: 221 NPITGVSTRKSLGC 234
NPIT TR++L C
Sbjct: 141 NPITNRDTRETLEC 154
>gi|427731151|ref|YP_007077388.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427367070|gb|AFY49791.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 572
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 55/103 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F A+L A N AN + AD+ +D S + GA L A Y+ +F+ ADL
Sbjct: 274 TGANFQDANLAGANLGDANLSGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADL 333
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S ++ + A+L+ A L T L R++L AI+ GA+ SDA
Sbjct: 334 SSCHLNDAEMGHADLSGANLRDTQLCRTNLTNAILFGANLSDA 376
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 53/104 (50%), Gaps = 5/104 (4%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A ADL A+ N N T A + +D S +K NGA L A A F G
Sbjct: 391 ADLSGADLSHAILNGTNLSDTILFSTNLTDASLMAADLSYAKLNGAKLIDAKLNGAMFLG 450
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
ADLS + R+VLN+A+L+ ++L L+ +DL AI+ G D S
Sbjct: 451 ADLSGVDLSRVVLNDADLSGSILSEADLSSADLSDAILLGTDLS 494
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A N AN T A + +DFS + + +L A A+ +GA+L
Sbjct: 294 SGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGANL 353
Query: 161 SDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
DT + R +L ANL++A L L+ +DL A + GAD S A+++
Sbjct: 354 RDTQLCRTNLTNAILFGANLSDANLKHINLSHADLCRADLSGADLSHAILN 404
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 65/136 (47%), Gaps = 10/136 (7%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAV 149
G+F G A F A L A NF+ AN + A + +++ +G+ F GA L A
Sbjct: 235 GQFLKG--ANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDAN 292
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA- 208
AN +GADLS + L ANLT A L RT +R+DL + A+ A + A
Sbjct: 293 LSGANLSGADLSSADLSSANLTGANLTGATLYRTDFSRADLSSCHLNDAEMGHADLSGAN 352
Query: 209 -QKQALCKYANGTNPI 223
+ LC+ N TN I
Sbjct: 353 LRDTQLCR-TNLTNAI 367
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
V + + ANF A + +++ +G+ F GA L A AN TGA+ D + L +ANL
Sbjct: 234 VGQFLKGANFRGAYLGDANLTGANFQGANLSGAYLGDANLTGANFQDANLAGANLGDANL 293
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ A L L+ +DL A + GA+ + A +
Sbjct: 294 SGANLSGADLSSADLSSANLTGANLTGATL 323
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA A L A + A F AD+ D S N A L ++ +A+ + ADLS
Sbjct: 425 AADLSYAKLNGAKLIDAKLNGAMFLGADLSGVDLSRVVLNDADLSGSILSEADLSSADLS 484
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
D ++ L+ ANL +A L+ S+L GA++ GAD S+A
Sbjct: 485 DAILLGTDLSFANLNSA-----NLSGSNLSGAMLNGADLSEA 521
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S A SADL A+ + + AN SA++ S+ SG+ NGA L +A A
Sbjct: 472 ILSEADLSSADLSDAILLGTDLSFANLNSANLSGSNLSGAMLNGADLSEANLSDAILEDT 531
Query: 159 DLSDTLMDRMVLNE 172
DLS+ +++M E
Sbjct: 532 DLSEANLEQMTWGE 545
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 7/99 (7%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F ++ L + ++ + NF+ ++ G+ F GAYL A ANF GA+LS
Sbjct: 210 FFTSQLLRVIYYSDAIEIGNFS--NIVGQFLKGANFRGAYLGDANLTGANFQGANLSGA- 266
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L +ANLT A L ++LG A + GA+ S A
Sbjct: 267 ----YLGDANLTGANFQDANLAGANLGDANLSGANLSGA 301
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ G ADL A R N T+A + ++ S + L A +A+ +GADLS
Sbjct: 341 AEMGHADLSGANLRDTQLCRTNLTNAILFGANLSDANLKHINLSHADLCRADLSGADLS- 399
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVID 206
+LN NL++ +L T LT +DL A + GA DA ++
Sbjct: 400 ----HAILNGTNLSDTILFSTNLTDASLMAADLSYAKLNGAKLIDAKLN 444
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 50/120 (41%), Gaps = 16/120 (13%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
+ +A FG A+L A N A+ AD+ +D S + NG L + + N T A
Sbjct: 363 LTNAILFG-ANLSDANLKHINLSHADLCRADLSGADLSHAILNGTNLSDTILFSTNLTDA 421
Query: 159 DLSDTLMDRMVLNEANLTNAV---------------LVRTVLTRSDLGGAIIEGADFSDA 203
L + LN A L +A L R VL +DL G+I+ AD S A
Sbjct: 422 SLMAADLSYAKLNGAKLIDAKLNGAMFLGADLSGVDLSRVVLNDADLSGSILSEADLSSA 481
>gi|163797086|ref|ZP_02191041.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
gi|159177602|gb|EDP62155.1| pentapeptide repeat protein [alpha proteobacterium BAL199]
Length = 421
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 13/116 (11%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADLR +V + +A F++A + + DF+G+K GA L A A ADL+D
Sbjct: 51 ALFAGADLRGSVFAGGHLEQAQFSTARLEQVDFAGAKLMGANLRGANLKGAKLMAADLTD 110
Query: 163 --------TLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
M+R + L++A+L+NA VRT L+ +++ I G F AV+
Sbjct: 111 ADLRPAKIVDMNRTIEQSANLHKADLSNAQFVRTNLSGANMSAIIAVGTAFQSAVL 166
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 48/102 (47%), Gaps = 10/102 (9%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLS 161
SA+L KA F R N + A+M G+ F A L +A K++F G+DL
Sbjct: 128 SANLHKADLSNAQFVRTNLSGANMSAIIAVGTAFQSAVLRNVNLSRADLSKSSFKGSDLR 187
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L N +AVL T LT SDL ++GAD S A
Sbjct: 188 GS-----NLRGVNFADAVLTDTDLTGSDLRSCNLDGADLSGA 224
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 50/121 (41%), Gaps = 21/121 (17%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 158
SAA F A L A + +FR AN + A + +D + S GA L A T A
Sbjct: 281 SAADFSGARLGGATLKQCSFRFANLSDATLERADLARSDLRGARLRSTRLDGATLTHARL 340
Query: 159 -------------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
DLSD + + ++A+ + AVL L R+DL A + GAD
Sbjct: 341 TPMQILNPATLELLREWPTDLSDASLVGVKADKADFSGAVLCGATLDRADLTHASLAGAD 400
Query: 200 F 200
Sbjct: 401 L 401
>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 328
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 3/119 (2%)
Query: 90 EAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
E + RG +G+ AQ A+L++A+ + N AN + AD+ +D S S A L
Sbjct: 204 ETDLRGVSFLGADLQGAQMARANLKEAILRQVNLTEANLSEADLAGADLSASSLCSAKLA 263
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ +AN GADL + L NL NA L +LTR+DL A + GA+ A +
Sbjct: 264 RTDLSRANLAGADLRCANLVDAYLGRTNLENADLGEAILTRADLSTANLSGANLRGATL 322
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 10/96 (10%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
G A L+KA V N AN + AD+ E+D ++ +GA L+ A + AN T A
Sbjct: 52 LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTLA------ 105
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+L +ANL +A L LT ++LGGA + GA+
Sbjct: 106 ----LLIDANLLDADLRWANLTSANLGGACLRGANL 137
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A N + A+ A+++ ++ ++ GA L + +F GADL
Sbjct: 158 SGANLSGADLTRADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADL 217
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
M R ANL A+L + LT ++L A + GAD S
Sbjct: 218 QGAQMAR-----ANLKEAILRQVNLTEANLSEADLAGADLS 253
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 61/118 (51%), Gaps = 15/118 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLE----------K 147
A A+L++A +K N + AN A ++ E+D G F GA L+ +
Sbjct: 170 ADLSGANLKEASLIKANLQGANLQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKE 229
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+ + N T A+LS+ + L+ ++L +A L RT L+R++L GA + A+ DA +
Sbjct: 230 AILRQVNLTEANLSEADLAGADLSASSLCSAKLARTDLSRANLAGADLRCANLVDAYL 287
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 84/167 (50%), Gaps = 9/167 (5%)
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 114
A L++ ++ +T L A + + ++ L D N +A+ R + ++A G A LR A
Sbjct: 80 ADLRDAQLHGAT-LQGADLHGANLTLALLIDANLLDADLR--WANLTSANLGGACLRGAN 136
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
++ R A +A++ +D SG+ +GA L +A+ +GA+L + + + L AN
Sbjct: 137 LRFDSRRGAVLRNANLSRADLSGANLSGADL-----TRADLSGANLKEASLIKANLQGAN 191
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGT 220
L A L +L+ +DL G GAD A + A K+A+ + N T
Sbjct: 192 LQQARLQGAILSETDLRGVSFLGADLQGAQMARANLKEAILRQVNLT 238
>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 447
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +DLR A + + + N T AD+RE+D + + GA L A +A+ TGA
Sbjct: 330 ANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS--- 386
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LN+ NL A L LTR+DL GA + GAD +A
Sbjct: 387 -------LNQVNLAEADLRGVDLTRADLRGANLSGADLREA 420
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 62/123 (50%), Gaps = 13/123 (10%)
Query: 83 LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
LAD N ++ RG IG++ ADLR+A + + R AN AD+RE+D +G+
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS 386
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
N + N ADL + R L ANL+ A L LT+++L A ++GA+
Sbjct: 387 LN----------QVNLAEADLRGVDLTRADLRGANLSGADLREADLTKANLHWANLDGAN 436
Query: 200 FSD 202
+D
Sbjct: 437 LTD 439
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 47/85 (55%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R A +SA++ ++D +G+ + A L KA AN G+DL + LN+ NLT A
Sbjct: 296 DLRGAMLSSANLSQADMTGTDLSRANLRKAYLADANMKGSDLRGADLIGASLNKVNLTQA 355
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L LTR+DL GA + AD +A
Sbjct: 356 DLREADLTRADLRGANLRLADLREA 380
Score = 43.5 bits (101), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 25/102 (24%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKA-----------VAYK---------ANFTGADLSDT 163
NF + D+ D G+ G+YL +A + Y AN +GADLSD
Sbjct: 29 NFMTPDLSNKDLIGASLRGSYLREAKLSGANLSEAILCYADLIGADLKGANLSGADLSDA 88
Query: 164 LMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADF 200
++ L+E+NLT A +LV T L+ +DL GA ++GA+
Sbjct: 89 NLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANL 130
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 59/127 (46%), Gaps = 5/127 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ A+L +A+ + A+ A++ +D S + N A L ++ ANF G+ L
Sbjct: 53 AKLSGANLSEAILCYADLIGADLKGANLSGADLSDANLNLANLSESNLTGANFKGSLLVG 112
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-----VIDLAQKQALCKYA 217
T + L ANL A L+ L ++L GA + G D S+A ++ A ++
Sbjct: 113 TDLSEADLRGANLKGANLIGAKLAEANLSGANLSGTDLSEADLRGTILQKAVYDLRTRFC 172
Query: 218 NGTNPIT 224
G +P T
Sbjct: 173 EGLDPQT 179
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 74/179 (41%), Gaps = 39/179 (21%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRAN 124
L+ A ++ + N++ L++ N A +G +G S A A+L+ A + AN
Sbjct: 80 LSGADLSDANLNLANLSESNLTGANFKGSLLVGTDLSEADLRGANLKGANLIGAKLAEAN 139
Query: 125 FTSADMRESDFSGSKFNGAYLEKAV-----------------AY---------KANFTGA 158
+ A++ +D S + G L+KAV AY AN +G
Sbjct: 140 LSGANLSGTDLSEADLRGTILQKAVYDLRTRFCEGLDPQTSGAYLIGADVALPAANLSGV 199
Query: 159 DLSDTLMDRMVLNEANLTNAVLV----------RTVLTRSDLGGAIIEGADFSDAVIDL 207
DL+ + R L ANL A L+ R L+ ++L G +G + AV DL
Sbjct: 200 DLTGFNLKRADLRGANLRYAKLIGANLEGANLFRANLSGANLTGVNFKGTNLQKAVYDL 258
>gi|298245086|ref|ZP_06968892.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297552567|gb|EFH86432.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 394
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 62/121 (51%), Gaps = 12/121 (9%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L +N Y+++ R A DLR+A + RAN A++RE+ +
Sbjct: 247 LYKINLYKSDLR-------EANLSKTDLREA-----DISRANLYKANLRETFLLKANLYE 294
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A L +A +AN + A+LS T + R L +ANL+ A L+ L+R DL GA + ADFS
Sbjct: 295 ADLHRANLSEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADLSKADFSG 354
Query: 203 A 203
A
Sbjct: 355 A 355
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 17/117 (14%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N Y+A+ RG AD KA N R AN A++RE+D S A+L
Sbjct: 206 NLYKADLRG------------ADFSKATLCGANLREANLCEANLREADIS-----RAFLY 248
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
K YK++ A+LS T + ++ ANL A L T L +++L A + A+ S+A
Sbjct: 249 KINLYKSDLREANLSKTDLREADISRANLYKANLRETFLLKANLYEADLHRANLSEA 305
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 49/94 (52%), Gaps = 4/94 (4%)
Query: 87 NKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
N YEA+ R S A A+L K + N +AN + AD+ ++ S +GA L
Sbjct: 291 NLYEADLHRANL---SEANLSEANLSKTDLSRTNLTKANLSKADLISANLSRGDLSGADL 347
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
KA AN +GA+LS ++ +LN+AN+ A+
Sbjct: 348 SKADFSGANLSGANLSGATLNEAILNKANIQQAL 381
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A S DL+ +F AN AD+R +DFS + GA L +A +AN AD+
Sbjct: 183 SQADMKSMDLKGVKAHNIDFSGANLYKADLRGADFSKATLCGANLREANLCEANLREADI 242
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + ++ L +++L A L +T L +D+ A + A+ + +
Sbjct: 243 SRAFLYKINLYKSDLREANLSKTDLREADISRANLYKANLRETFL 287
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 12/113 (10%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N Y+A R F + A ADL +A N AN + A++ ++D S + A L
Sbjct: 276 NLYKANLRETFLLK--ANLYEADLHRA-----NLSEANLSEANLSKTDLSRTNLTKANLS 328
Query: 147 KAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
KA AN + GADLS L+ ANL+ A L +L ++++ A+
Sbjct: 329 KADLISANLSRGDLSGADLSKADFSGANLSGANLSGATLNEAILNKANIQQAL 381
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 52/138 (37%), Gaps = 22/138 (15%)
Query: 90 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS------------ADMRESDFSG 137
+AE R + + G D + HV R A S ADM+ D G
Sbjct: 135 DAEVRKVARVRTLTVLGQLDAPRINHVFSFLREAQLVSSKPGESIVSLSQADMKSMDLKG 194
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL----------TNAVLVRTVLTR 187
K + A YKA+ GAD S + L EANL + A L + L +
Sbjct: 195 VKAHNIDFSGANLYKADLRGADFSKATLCGANLREANLCEANLREADISRAFLYKINLYK 254
Query: 188 SDLGGAIIEGADFSDAVI 205
SDL A + D +A I
Sbjct: 255 SDLREANLSKTDLREADI 272
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR+ +K N A+ A++ E++ S + + L + KAN + ADL
Sbjct: 273 SRANLYKANLRETFLLKANLYEADLHRANLSEANLSEANLSKTDLSRTNLTKANLSKADL 332
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
+ R +L+ A L + + ++L GA + GA ++A+++ A Q
Sbjct: 333 ISANLSR-----GDLSGADLSKADFSGANLSGANLSGATLNEAILNKANIQ 378
>gi|443310213|ref|ZP_21039874.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
gi|442779757|gb|ELR89989.1| putative low-complexity protein [Synechocystis sp. PCC 7509]
Length = 253
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 54/101 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SA+L +A ++ N AN T A + +D S + A L A+ YKA A+L+D
Sbjct: 139 ANLKSANLSEAKLIRANLNEANLTEAHLNYADLSHANLGSASLVGAILYKAELRQANLND 198
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L +ANL+ A L+ L ++L GA + GA+ + A
Sbjct: 199 AYLHKAYLFDANLSQARLINADLRWANLRGANLRGANLTGA 239
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 20/116 (17%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES---------------DFSGSKFNGAYLEK 147
A F A+L ++ +K N AN + A+++++ D G+ + A LE
Sbjct: 69 ANFTLANLSHSLLMKANLSNANLSIANLQDANLKGAFLGAANLIGADLQGANLSNADLEN 128
Query: 148 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
A AN A+LS+ + R LNEANLT A L L+ ++LG A + GA
Sbjct: 129 VNLIGANLQNANLKSANLSEAKLIRANLNEANLTEAHLNYADLSHANLGSASLVGA 184
>gi|448412419|ref|ZP_21576534.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
gi|445668180|gb|ELZ20811.1| hypothetical protein C475_19468 [Halosimplex carlsbadense 2-9-1]
Length = 561
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 54/120 (45%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 158
+A S D A +FRRA +A++R++D G+ F GA L A A+ TGA
Sbjct: 251 TAGTLESVDFGGATLTDASFRRAGLQNAELRDADLVGADFQGADLRNASLTNADLTGANF 310
Query: 159 -------------DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
DLS+ + L A+L +A L R L SDL A + AD SD +
Sbjct: 311 RDADLTDAHLRGADLSEADLKDATLCGADLKDATLTRASLWNSDLTEAYLRNADLSDGYL 370
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 70/151 (46%), Gaps = 14/151 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADLR A + A+ T A+ R++D + + GA L +A A GADL D
Sbjct: 288 ADFQGADLRNA-----SLTNADLTGANFRDADLTDAHLRGADLSEADLKDATLCGADLKD 342
Query: 163 TLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK-Y 216
+ R L EA L NA L L R DL A + AD + DL + +L + +
Sbjct: 343 ATLTRASLWNSDLTEAYLRNADLSDGYLRRVDLTDADLPAADLTG---DLNARCSLGRTF 399
Query: 217 ANGTNPITGVSTRKSLGCGNSRRNAYGSPSS 247
+ I+ + R+SL C ++ G P++
Sbjct: 400 SMPRCAISDHTGRRSLTCRSTSARPSGRPTT 430
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
+RE+D SG+ G+ L+ A+ A+ DL+ M VL EA+LT+ L + ++
Sbjct: 145 LREADLSGANLAGSTLKGAILTDASLREVDLTGADMMGAVLVEADLTSGTLAQLSGDKAV 204
Query: 190 LGGAIIEGADFSDAVI-DLAQKQALCKYAN 218
+ GAI++ A+ A + DL +A+ K A
Sbjct: 205 MRGAILKDANLERAHLWDLTAPEAVFKRAT 234
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 59/148 (39%), Gaps = 17/148 (11%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
+ AV+ LA L+ +A RG I A A L + F+RA
Sbjct: 180 MMGAVLVEADLTSGTLAQLSGDKAVMRG--AILKDANLERAHLWDLTAPEAVFKRATLCE 237
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV------ 181
A MR++ G+ F LE +F GA L+D R L A L +A LV
Sbjct: 238 ATMRDAVLPGASFTAGTLESV-----DFGGATLTDASFRRAGLQNAELRDADLVGADFQG 292
Query: 182 ----RTVLTRSDLGGAIIEGADFSDAVI 205
LT +DL GA AD +DA +
Sbjct: 293 ADLRNASLTNADLTGANFRDADLTDAHL 320
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 47/114 (41%), Gaps = 10/114 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VAYK 152
A ADL A + A T A +RE D +G+ GA L +A K
Sbjct: 143 AVLREADLSGANLAGSTLKGAILTDASLREVDLTGADMMGAVLVEADLTSGTLAQLSGDK 202
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A GA L D ++R L + AV R L + + A++ GA F+ ++
Sbjct: 203 AVMRGAILKDANLERAHLWDLTAPEAVFKRATLCEATMRDAVLPGASFTAGTLE 256
>gi|309792396|ref|ZP_07686863.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
DG-6]
gi|308225551|gb|EFO79312.1| pentapeptide repeat-containing protein [Oscillochloris trichoides
DG6]
Length = 314
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 60/125 (48%), Gaps = 9/125 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLRK N AN T A++R ++ S + F+GA L A N +G DL D
Sbjct: 89 ADLSDADLRKGDLAWANLEFANLTGANLRGANLSAADFSGANLYGANLSLCNLSGVDLRD 148
Query: 163 TLMDRMVLNEANLTNAVLVRTV--------LTRSDLGGAIIEGADFSDA-VIDLAQKQAL 213
T+M L EA L A LV L + LGGA ++G + S A ++ ++A
Sbjct: 149 TIMIGANLTEAQLREAQLVNLSGANLSGANLNKVSLGGASMQGVNLSGASLLSANLREAT 208
Query: 214 CKYAN 218
+ AN
Sbjct: 209 LREAN 213
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTG 157
A A LR+A + N AN + AD+ +D S + +G YL E A+ AN +
Sbjct: 202 ANLREATLREANLIGANLYEANLSEADLSAADLSMANLSGIYLSGANLEGAILTHANLSR 261
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
A+LS + LN NL A L LT +DL GA + D S +
Sbjct: 262 ANLSGCNLRGAQLNGCNLREASLADADLTGADLTGADLSECDLSGVI 308
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A LR+A V N AN + A++ + G+ G L A AN A L +
Sbjct: 154 ANLTEAQLREAQLV--NLSGANLSGANLNKVSLGGASMQGVNLSGASLLSANLREATLRE 211
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L EANL+ A L L+ ++L G + GA+ A++
Sbjct: 212 ANLIGANLYEANLSEADLSAADLSMANLSGIYLSGANLEGAIL 254
Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 10/106 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A L A N A+ SA++RE+ + GA L Y+AN + ADL
Sbjct: 175 SGANLNKVSLGGASMQGVNLSGASLLSANLREATLREANLIGANL-----YEANLSEADL 229
Query: 161 S--DTLMDRM---VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S D M + L+ ANL A+L L+R++L G + GA +
Sbjct: 230 SAADLSMANLSGIYLSGANLEGAILTHANLSRANLSGCNLRGAQLN 275
>gi|428313200|ref|YP_007124177.1| pentapeptide repeat protein,protein kinase family protein
[Microcoleus sp. PCC 7113]
gi|428254812|gb|AFZ20771.1| pentapeptide repeat protein,protein kinase family protein
[Microcoleus sp. PCC 7113]
Length = 464
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ NF N + +++E++ SG F A L K NF GADLSD + LN+ANL
Sbjct: 321 ERNFAFRNISGLNLQEANLSGGLFYSAKLAKT-----NFQGADLSDAYFGQANLNQANLR 375
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
NA L T + +DL GA ++GAD A +
Sbjct: 376 NANLGGTSFSNADLSGADLQGADLRFAYL 404
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 51/109 (46%), Gaps = 25/109 (22%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A FG A+L +A N R AN +D SG+ GA L A KAN GA+L
Sbjct: 360 SDAYFGQANLNQA-----NLRNANLGGTSFSNADLSGADLQGADLRFAYLSKANLKGANL 414
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
EANL+NA ++ GA + GA+ S+A+I AQ
Sbjct: 415 C----------EANLSNA----------NIKGANLCGANLSNAIITEAQ 443
>gi|119485665|ref|ZP_01619940.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
gi|119456990|gb|EAW38117.1| hypothetical protein L8106_24820 [Lyngbya sp. PCC 8106]
Length = 433
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 55/103 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ + DL A + N R A+ A+++++D G+K GA L A +AN A
Sbjct: 178 ARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVG 237
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ L++A+L A L +T +TR+DLG A ++ A DA +
Sbjct: 238 ANLENANLHQASLKGANLAKTQMTRADLGFANLQKASLGDAQL 280
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 59/110 (53%), Gaps = 5/110 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANF 155
S A F A+LR+A K N A+ + A + ++D G K GA L+ A ANF
Sbjct: 116 SGANFRDANLREAYLWKANLSNADLSDAYLEKADLRGVKLEGADLGYAMLKGANLGYANF 175
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L++T + L +ANL A LV L ++DL GA +EGA+ S+A +
Sbjct: 176 VRARLANTDLSNANLWQANLREAHLVDANLQQADLRGAKLEGANLSNAKL 225
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 63/137 (45%), Gaps = 13/137 (9%)
Query: 83 LADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSA----------D 129
L D N +A+ RG E S A+ A+L A+ V N AN A
Sbjct: 200 LVDANLQQADLRGAKLEGANLSNAKLVQANLESAIFVGANLENANLHQASLKGANLAKTQ 259
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
M +D + A L A +AN ADL++ + L +ANL NA+L + L +
Sbjct: 260 MTRADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQ 319
Query: 190 LGGAIIEGADFSDAVID 206
L GA +E A+ +DA+++
Sbjct: 320 LKGANLEDANLTDAILE 336
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA--------- 153
A G A+L+KA +AN SAD+ E+ +K A L A+ KA
Sbjct: 263 ADLGFANLQKASLGDAQLSQANLESADLTEAKLWVAKLEDANLNNAILEKAKLGFAQLKG 322
Query: 154 -NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
N A+L+D +++ ++L +ANL +A L L +++L GA ++ A+ ++A
Sbjct: 323 ANLEDANLTDAILEGVILEDANLEDANLEGAKLEQANLIGAYLKDANLTEA 373
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 57/124 (45%), Gaps = 27/124 (21%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFN-----GA 143
I A+ G A L+ A N AN T A ++ +++ G+K GA
Sbjct: 309 ILEKAKLGFAQLKGA-----NLEDANLTDAILEGVILEDANLEDANLEGAKLEQANLIGA 363
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
YL+ A +AN GADL L +ANL NA L L ++L GA ++GA+ D
Sbjct: 364 YLKDANLTEANLQGADLRGA-----NLTKANLRNAYLQGANLRGANLKGASLKGANLRD- 417
Query: 204 VIDL 207
+DL
Sbjct: 418 -VDL 420
>gi|428310592|ref|YP_007121569.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428252204|gb|AFZ18163.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 522
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 2/134 (1%)
Query: 86 LNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
L KY A R G+ A +A+L A + N AN + A++ ++ S +K N A
Sbjct: 7 LKKYAAGDRDFSGLNLAEVNLSAANLSGANLSEVNLSVANLSGANLSGANLSRAKLNVAR 66
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
L A KAN A L+ T + R L ANLT A L+R L R++L GA ++ A+ S A
Sbjct: 67 LSGANISKANLIQASLNVTNLIRADLRRANLTQAALIRAELIRAELSGATLKEANLSGAD 126
Query: 205 I-DLAQKQALCKYA 217
+ + A +QA+ A
Sbjct: 127 LREAALRQAILSRA 140
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 1/119 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L ++ + RRAN T A + ++ ++ +GA L++A A+ A L
Sbjct: 73 SKANLIQASLNVTNLIRADLRRANLTQAALIRAELIRAELSGATLKEANLSGADLREAAL 132
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
++ R L+EANL A L ++L ++L A + AD SD+ I A +QA +AN
Sbjct: 133 RQAILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFAN 191
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 47/93 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR+A N A+ + A++R +D SG+ A L A AN GADLS +
Sbjct: 180 ADLRQANLSFANLSGADLSRANLRWADLSGADLRWANLSDAKLSGANLMGADLSHANLHN 239
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L A+LT A L++ +DL GA + GA
Sbjct: 240 ASLVHADLTQASLIKVDWIGADLSGATMTGAKL 272
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+A + RA + A++R + + S G L KA +A+ + +++ +
Sbjct: 120 ANLSGADLREAALRQAILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIRE 179
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L+ ANL+ A L R L +DL GA + A+ SDA
Sbjct: 180 ADLRQANLSFANLSGADLSRANLRWADLSGADLRWANLSDA 220
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 48/106 (45%), Gaps = 5/106 (4%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
I S A A+LR A N AD+ +D S S A L +A AN +G
Sbjct: 135 AILSRATLSEANLRGAFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFANLSG 194
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADLS R L A+L+ A L L+ + L GA + GAD S A
Sbjct: 195 ADLS-----RANLRWADLSGADLRWANLSDAKLSGANLMGADLSHA 235
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 67/150 (44%), Gaps = 22/150 (14%)
Query: 90 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 149
EA RG F S + +L KA + RA+ + +++RE+D + + A L A
Sbjct: 144 EANLRGAFLTASILE--GTNLNKA-----DLNRADLSDSNIREADLRQANLSFANLSGAD 196
Query: 150 AYKANFTGADLSD------TLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAI 194
+AN ADLS L D + L+ ANL NA LV LT++ L
Sbjct: 197 LSRANLRWADLSGADLRWANLSDAKLSGANLMGADLSHANLHNASLVHADLTQASLIKVD 256
Query: 195 IEGADFSDAVIDLAQKQALCKYANGTNPIT 224
GAD S A + A+ A+ ++ T IT
Sbjct: 257 WIGADLSGATMTGAKLYAVSRFGLKTTGIT 286
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NFTG 157
A A+L +A ++ RA + A ++E++ SG+ A L +A+ +A N G
Sbjct: 90 ADLRRANLTQAALIRAELIRAELSGATLKEANLSGADLREAALRQAILSRATLSEANLRG 149
Query: 158 ADLSDTLMDRMVLNEANLTNAVL----VRTV-LTRSDLGGAIIEGADFSDA 203
A L+ ++++ LN+A+L A L +R L +++L A + GAD S A
Sbjct: 150 AFLTASILEGTNLNKADLNRADLSDSNIREADLRQANLSFANLSGADLSRA 200
>gi|90419937|ref|ZP_01227846.1| conserved hypothetical protein with pentapeptide repeats
[Aurantimonas manganoxydans SI85-9A1]
gi|90335978|gb|EAS49726.1| conserved hypothetical protein with pentapeptide repeats
[Aurantimonas manganoxydans SI85-9A1]
Length = 292
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 59/109 (54%), Gaps = 7/109 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY-LEKAVAYKANFTGADLS 161
A F ADL A + +F RA+F A+M+ +DFS N + L + V A+ TGADLS
Sbjct: 168 ATFDGADL-SAARIAGDFSRASFVRANMKGADFSADMRNQSMGLMRGVLNSADLTGADLS 226
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVI 205
+ R A+ T+A L LTR ++ G ++EGADF+DA +
Sbjct: 227 GANLSRAAAEFADFTDADLSGADLTRFEASGANFNGTMVEGADFADAEL 275
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL-- 180
A+ TSA + +D S ++ GA L++A ANFTGADLS + + + +A A L
Sbjct: 118 ADLTSAYLNGTDLSNARLAGAKLDQAWGLGANFTGADLSGASLFQSQMQDATFDGADLSA 177
Query: 181 --VRTVLTRSDLGGAIIEGADFS 201
+ +R+ A ++GADFS
Sbjct: 178 ARIAGDFSRASFVRANMKGADFS 200
>gi|428314172|ref|YP_007125149.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255784|gb|AFZ21743.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 276
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 10/114 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA A LR+A N + AN + D++ +D G+ GA L++A N +GADL
Sbjct: 104 SAATLKGAKLREA-----NLQGANLRAVDLKNADLCGANLQGADLKRADLINTNLSGADL 158
Query: 161 S-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
S D + +++ L EANL A L L+ +DL GA + A+ + A + AQ
Sbjct: 159 SGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQ 212
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 15/119 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGS-----KFNGAYLEKAVA 150
S A A+L + K N R AN A D+ E+D +G+ NGA L++A
Sbjct: 154 SGADLSGANLTDVIFEKVNLREANLRGANLQGLDLSEADLTGADLSEANLNGARLQEAQL 213
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+AN +G D M + L+ ANL A L L+++ L G + GA+ +A++D A+
Sbjct: 214 SQANLSGLD-----MTHLNLSGANLRQANLSEAQLSQAQLYGTDLRGANLDEAILDQAK 267
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+EN + + ++ +++ +K + A L+ A +AN GA+L + L ANL
Sbjct: 80 QENLVWMDLSGVNLSQANLQQAKLSAATLKGAKLREANLQGANLRAVDLKNADLCGANLQ 139
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A L R L ++L GA + GA+ +D + +
Sbjct: 140 GADLKRADLINTNLSGADLSGANLTDVIFE 169
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 57/127 (44%), Gaps = 20/127 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFTG 157
A + DL+ A N + A+ AD+ ++ SG+ +GA L EK +AN G
Sbjct: 121 ANLRAVDLKNADLCGANLQGADLKRADLINTNLSGADLSGANLTDVIFEKVNLREANLRG 180
Query: 158 ---------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
ADLS+ ++ L EA L+ A L +T +L GA + A+ S+
Sbjct: 181 ANLQGLDLSEADLTGADLSEANLNGARLQEAQLSQANLSGLDMTHLNLSGANLRQANLSE 240
Query: 203 AVIDLAQ 209
A + AQ
Sbjct: 241 AQLSQAQ 247
>gi|448473532|ref|ZP_21601674.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
gi|445819044|gb|EMA68893.1| RDD domain-containing protein [Halorubrum aidingense JCM 13560]
Length = 348
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 57/115 (49%), Gaps = 12/115 (10%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N R AN T AD+ S + A L KA Y AN +GADL+ L+D+ L A+L
Sbjct: 64 NLRGANITGADL-----SSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGV 118
Query: 179 VLVRTVLTRSDLGGAIIEGADFSD------AVIDLAQKQALCKYAN-GTNPITGV 226
LTR+DL A + GA+FSD AV D + A AN G +TGV
Sbjct: 119 GFTEAHLTRADLHSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGV 173
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVA 150
+ A SA+L A+ K N AN + AD+ R +D G F A+L +A
Sbjct: 71 TGADLSSANLTDALLTKANLYSANLSGADLTGALLDKANLRSADLRGVGFTEAHLTRADL 130
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
+ A+ GA+ SD + + +A+L A L L +DL G I+
Sbjct: 131 HSADLRGANFSDADLFGAAVTDADLRGADLTDANLGDTDLTGVIL 175
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 10/80 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADLR F A+ T AD+ +D G+ F+ A L A A+ GADL+D
Sbjct: 108 ANLRSADLRGV-----GFTEAHLTRADLHSADLRGANFSDADLFGAAVTDADLRGADLTD 162
Query: 163 TLMDRMVLNEANLTNAVLVR 182
L + +LT +L R
Sbjct: 163 A-----NLGDTDLTGVILAR 177
>gi|378579963|ref|ZP_09828623.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
DC283]
gi|377817422|gb|EHU00518.1| hypothetical protein CKS_2597 [Pantoea stewartii subsp. stewartii
DC283]
Length = 272
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
GS A ADLR A R A+ + AD+ +D SG+ GAYL A A+ +GAD
Sbjct: 24 GSRADLRGADLRGAY-----LRGADLSGADLSGADLSGADLRGAYLRDADLRGADLSGAD 78
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
LSD + L +A+L A L+ +DL GA + GAD S A +
Sbjct: 79 LSDADLRGAYLRDADLRGA-----DLSDADLSGAYLRGADLSGADL 119
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANF 155
S A ADLR A + R A+ + AD+ R +D SG+ GAYL A +
Sbjct: 75 SGADLSDADLRGAYLRDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDA-----DL 129
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GADLSD + L +A+L A L L + L A + GAD SDA +
Sbjct: 130 RGADLSDADLSGAYLRDADLRGADLRGADLRGAYLRDADLRGADLSDADL 179
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 55/115 (47%), Gaps = 2/115 (1%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A+ RG + G A ADL A + R A AD+R +D SG+ + A L A
Sbjct: 32 ADLRGAYLRG--ADLSGADLSGADLSGADLRGAYLRDADLRGADLSGADLSDADLRGAYL 89
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+ GADLSD + L A+L+ A L L +DL GA + AD S A +
Sbjct: 90 RDADLRGADLSDADLSGAYLRGADLSGADLRGAYLRDADLRGADLSDADLSGAYL 144
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 41/90 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A LR A + R A AD+R +D S + +GAYL A A+ GADL
Sbjct: 100 SDADLSGAYLRGADLSGADLRGAYLRDADLRGADLSDADLSGAYLRDADLRGADLRGADL 159
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ L A+L++A L L DL
Sbjct: 160 RGAYLRDADLRGADLSDADLSGAYLRDGDL 189
>gi|158340059|ref|YP_001521229.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310300|gb|ABW31915.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 483
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSAD----------MRESDFSGSKFNGAYLEKAVA 150
S + A+LR A NFR+AN + AD + ++D SG+ F+GAYL KA
Sbjct: 305 SYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANL 364
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A GADLS + ++L ANL +A L L+ +DL AI+ D +
Sbjct: 365 SSAFLIGADLSRANLSDVILRGANLLSANLSDASLSSADLNNAILLNTDLRE 416
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 50/100 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F A+L A N R+AN A + ++F + + A + KA A+ ADL
Sbjct: 290 SIANFIGANLGGANLSYSNLRKANLRHAHLSGANFRKANLSLADISKAHLGHAHLNDADL 349
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
S L +ANL++A L+ L+R++L I+ GA+
Sbjct: 350 SGAYFSGAYLYKANLSSAFLIGADLSRANLSDVILRGANL 389
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 3/124 (2%)
Query: 83 LADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
L D N A RG I + A A+L A NF AN A++ S+ +
Sbjct: 254 LIDANLSGANLRGANLIDANLRGANLIDANLSDAYLSIANFIGANLGGANLSYSNLRKAN 313
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
A+L A KAN + AD+S + LN+A+L+ A L +++L A + GAD
Sbjct: 314 LRHAHLSGANFRKANLSLADISKAHLGHAHLNDADLSGAYFSGAYLYKANLSSAFLIGAD 373
Query: 200 FSDA 203
S A
Sbjct: 374 LSRA 377
>gi|425458953|ref|ZP_18838439.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9808]
gi|389823440|emb|CCI28334.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9808]
Length = 425
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 4/120 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A+ ++ N R A + AD+ E+D SG+ A L KA+ +A A LS+
Sbjct: 285 ANLIKAILSWAILIEANLRGAILSEADLSEADLSGANLRRANLIKAILRRAILIEAILSE 344
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+ L ANL A+L+ +L +DL GA + A+ S+A I+ A+ A G P
Sbjct: 345 ADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFIDATGITP 400
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 4/106 (3%)
Query: 102 AAQFGSADLRKAVHV----KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A A+L KA+ K+ ++ + + AD+ E+D SG+ +GA L +A AN +G
Sbjct: 210 AKVIQKAELIKAIREGTINKKTLQQVDLSGADLSEADLSGAILSGANLSEANLSGANLSG 269
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+LS + L ANL A+L +L ++L GAI+ AD S+A
Sbjct: 270 ANLSWANLIDANLRRANLIKAILSWAILIEANLRGAILSEADLSEA 315
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 1/103 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR+A +K RRA A + E+D SG+ A L KA+ +A ADL
Sbjct: 313 SEADLSGANLRRANLIKAILRRAILIEAILSEADLSGANLRRANLIKAILIEAILIEADL 372
Query: 161 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 202
+ L+EA++ NA+ + T +T I GA F D
Sbjct: 373 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 415
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VA 150
S A ADL A+ N AN + A++ ++ S + A L +A +
Sbjct: 238 SGADLSEADLSGAILSGANLSEANLSGANLSGANLSWANLIDANLRRANLIKAILSWAIL 297
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+AN GA LS+ + L+ ANL A L++ +L R+ L AI+ AD S A
Sbjct: 298 IEANLRGAILSEADLSEADLSGANLRRANLIKAILRRAILIEAILSEADLSGA 350
>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
Length = 427
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 65/148 (43%), Gaps = 24/148 (16%)
Query: 86 LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
LN Y R + G + AQ DLR+A+ +FR A F A++ E+ +GS+ A
Sbjct: 23 LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82
Query: 144 YLEKAVAYKANFTGADL------SDTLMD----------------RMVLNEANLTNAVLV 181
L A K +F GADL S + D L+ A+L + V
Sbjct: 83 DLSGAKLVKTDFRGADLEQAKLTSSDITDADFRATTIGAPAGSDIATKLDGADLDHVKAV 142
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
RT LTR+ L GA GA F A +D A
Sbjct: 143 RTNLTRASLMGATARGAHFDGASLDRAN 170
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A + ADL V+ N RA+ A R G+ F+GA L++A AN A
Sbjct: 128 ATKLDGADLDHVKAVRTNLTRASLMGATAR-----GAHFDGASLDRANFKGANLEHATFV 182
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVI 205
+ + L E N +A L T LT +DL GA + GAD +D VI
Sbjct: 183 SSSLRGANLQEVNFADATLSNTDLTGADLRSCHLDGADMSGADLTDCVI 231
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 7/99 (7%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTL 164
+RK H N+ ADMR +G++ NG L +A+ A+ F GA+LS+
Sbjct: 16 IRKHGHFLNNYPGGQ--RADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEAT 73
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L A+L+ A LV+T +DL A + +D +DA
Sbjct: 74 LAGSQLRVADLSGAKLVKTDFRGADLEQAKLTSSDITDA 112
>gi|334119992|ref|ZP_08494075.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333457174|gb|EGK85799.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 566
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR+A V+ N R A ++ ESD +G++ A L A ++A GADL++
Sbjct: 155 ANLTGANLREAHLVEANLRSAILIGVNLIESDLNGAQMRSANLTGADLHRAVLAGADLTE 214
Query: 163 TLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDA 203
++D L+ ANL + L+ + +L R++L G + AD S+A
Sbjct: 215 AVLDNADLSRANLAGSYLLKASFKKALLLRANLQGVYLLRADLSEA 260
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 30/136 (22%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVA 150
S A ADLR A A F AD+ + +F+G+K +GA L A
Sbjct: 83 SGANLAKADLRLACLAAAELNWAAFPEADLGGANLQGVKSDQINFAGAKLDGAKLMAAEL 142
Query: 151 YKAN-----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG------------- 192
+AN GA+L+ + L EANL +A+L+ L SDL G
Sbjct: 143 MEANLNRASLVGANLTGANLREAHLVEANLRSAILIGVNLIESDLNGAQMRSANLTGADL 202
Query: 193 --AIIEGADFSDAVID 206
A++ GAD ++AV+D
Sbjct: 203 HRAVLAGADLTEAVLD 218
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 51/113 (45%), Gaps = 10/113 (8%)
Query: 103 AQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
A A+LR A+ + N R AN T AD+ + +G+ A L+ A +
Sbjct: 165 AHLVEANLRSAILIGVNLIESDLNGAQMRSANLTGADLHRAVLAGADLTEAVLDNADLSR 224
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AN G+ L + +L ANL L+R L+ ++L A + AD S A +
Sbjct: 225 ANLAGSYLLKASFKKALLLRANLQGVYLLRADLSEANLRSADLRKADLSGAYL 277
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 22/117 (18%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG---- 157
+A ADL +AV + A +AD+ ++ +GS A +KA+ +AN G
Sbjct: 194 SANLTGADLHRAVLAGADLTEAVLDNADLSRANLAGSYLLKASFKKALLLRANLQGVYLL 253
Query: 158 -ADLSDT----------------LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
ADLS+ LMD M L EA+L A L+ L R++L A + G
Sbjct: 254 RADLSEANLRSADLRKADLSGAYLMDAM-LGEADLREACLIECRLIRTNLEAAQLTG 309
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A+ A L A ++ N RA+ A++ +G+ A+L +A A G +L
Sbjct: 128 AGAKLDGAKLMAAELMEANLNRASLVGANL-----TGANLREAHLVEANLRSAILIGVNL 182
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ ++ + ANLT A L R VL +DL A+++ AD S A
Sbjct: 183 IESDLNGAQMRSANLTGADLHRAVLAGADLTEAVLDNADLSRA 225
>gi|209526319|ref|ZP_03274848.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376001485|ref|ZP_09779353.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423062694|ref|ZP_17051484.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493248|gb|EDZ93574.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375330094|emb|CCE15106.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406715650|gb|EKD10803.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 390
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 64/115 (55%), Gaps = 10/115 (8%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV-----AYKANFT 156
+A ADL +A+ +K N +A+ +SA++ +S+ + F AYL KA ++A+ +
Sbjct: 111 SAHLNWADLTEAIFIKTNLHKADLSSANLTKSNLQSANFVRAYLIKANLSEADLFQADLS 170
Query: 157 GADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A+L D + L+E ANL A L LT+++LG A + GA+ +DA ++
Sbjct: 171 SANLKDVNLSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLN 225
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADL A N + N + A++ ++ SGS NGA N GA LS
Sbjct: 57 ADFSEADLSGAHLSLANLSKVNLSGANLTGANLSGSSLNGA----------NLQGATLSG 106
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ LN A+LT A+ ++T L ++DL A + ++ A
Sbjct: 107 VNLESAHLNWADLTEAIFIKTNLHKADLSSANLTKSNLQSA 147
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 8/146 (5%)
Query: 68 LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSA-------AQFGSADLRKAVHVKEN 119
L+AA ++ C + L N EA+ T+ G + A SA L +A + N
Sbjct: 179 LSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANLTDAYLNSASLVEADLYQAN 238
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
RAN + A++ ++ NGA+L K A+ G DLS L+ + L A L+ A
Sbjct: 239 LTRANLSRANLSKTYLRDICLNGAHLTKVNLSGADLGGVDLSQKLLTGINLAGAYLSEAT 298
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVI 205
LV +L ++L A + GA+ A +
Sbjct: 299 LVGALLMEANLSAANLSGANLQSACL 324
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 44/90 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A N AN + M ++ G+ A L KA +AN GA+L
Sbjct: 160 SEADLFQADLSSANLKDVNLSAANLSECKMTRANLMGANLTEADLTKANLGRANLRGANL 219
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+D ++ L EA+L A L R L+R++L
Sbjct: 220 TDAYLNSASLVEADLYQANLTRANLSRANL 249
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 50/110 (45%), Gaps = 10/110 (9%)
Query: 101 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A G DL + + N A A + E++ S + +GA L+ A A+
Sbjct: 270 SGADLGGVDLSQKLLTGINLAGAYLSEATLVGALLMEANLSAANLSGANLQSACLIHADL 329
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GA +DR+ L +ANLT A L + L ++L AI+ G + A +
Sbjct: 330 GGA-----YLDRVDLTDANLTGANLTKADLREANLRAAILAGVELKGAQL 374
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 17/108 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L +A K N RAN A++ ++ + + A L +A +AN + A+LS
Sbjct: 192 ANLMGANLTEADLTKANLGRANLRGANLTDAYLNSASLVEADLYQANLTRANLSRANLSK 251
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
T + + LN A+LT + L+ +DLGG +DL+QK
Sbjct: 252 TYLRDICLNGAHLT-----KVNLSGADLGG------------VDLSQK 282
>gi|428203771|ref|YP_007082360.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427981203|gb|AFY78803.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 180
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 52/97 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA V N N AD+ ++ SG+ GA L A + AN + A+L +
Sbjct: 60 ANLTDADLIKANLVGANLIEINLIGADLTSANLSGADLTGADLRCANLHNANLSQANLRE 119
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
+D L+ ANL+ A+LV T L+ +D GA ++G D
Sbjct: 120 VHLDGADLSGANLSGAILVNTDLSVADTVGAKLDGID 156
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 50/95 (52%), Gaps = 10/95 (10%)
Query: 114 VHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+ V+E F R NF ++ ++ G KF G L +A+ +GADLS+T +
Sbjct: 1 MKVRELFIRYLKNQRNFEEVNLHIANLQGLKFQGINL-----TRADLSGADLSETDLSGA 55
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L +ANLT+A L++ L ++L + GAD + A
Sbjct: 56 CLKQANLTDADLIKANLVGANLIEINLIGADLTSA 90
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 47/102 (46%), Gaps = 15/102 (14%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-- 176
N RA+ + AD+ E+D SG+ A L A KAN GA+L + + L ANL+
Sbjct: 36 NLTRADLSGADLSETDLSGACLKQANLTDADLIKANLVGANLIEINLIGADLTSANLSGA 95
Query: 177 -------------NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
NA L + L L GA + GA+ S A++
Sbjct: 96 DLTGADLRCANLHNANLSQANLREVHLDGADLSGANLSGAIL 137
>gi|78189684|ref|YP_380022.1| pentapeptide repeat-containing protein [Chlorobium chlorochromatii
CaD3]
gi|78171883|gb|ABB28979.1| pentapeptide repeat family protein [Chlorobium chlorochromatii
CaD3]
Length = 389
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 5/88 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM-----DRMVLNEANLTN 177
ANF ADM+ + G+ GA+ ++A +AN GA+L+ L+ D+ L ANLT
Sbjct: 270 ANFYKADMKGAQLQGANLQGAHCDRAFLLQANLQGANLTKALLFGATLDKADLRNANLTE 329
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L +DL GAI+ A+ +DAV+
Sbjct: 330 ASLFGANCEGADLRGAILTRANVTDAVL 357
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 6/124 (4%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
LA N Y+A+ +G G+ Q D +A ++ N + AN T A + + +
Sbjct: 267 LAGANFYKADMKGAQLQGANLQGAHCD--RAFLLQANLQGANLTKALLFGATLDKADLRN 324
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG----AIIEGA 198
A L +A + AN GADL ++ R + +A LTNA++ T + S A+++ A
Sbjct: 325 ANLTEASLFGANCEGADLRGAILTRANVTDAVLTNALISSTTVLPSGKAATRQWALMQQA 384
Query: 199 DFSD 202
FS
Sbjct: 385 IFSQ 388
>gi|354564725|ref|ZP_08983901.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549851|gb|EHC19290.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 564
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 10/106 (9%)
Query: 101 SAAQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S ADLR + N R A +AD+ + +G+K NGA L A+ A+
Sbjct: 386 SGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGAKLNGANLRSAILLGADL 445
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
G DL+D ++LNEA+L+ VL L+ +D+ AI+ G D S
Sbjct: 446 GGVDLTD-----VILNEADLSGVVLNEADLSGADISDAILFGTDLS 486
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 57/114 (50%), Gaps = 7/114 (6%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
GEF GS F A L A NF AN TSA + +++ +G F+ A L A AN
Sbjct: 227 GEFLQGS--NFSGAYLGDANLTGVNFSAANLTSAYLGDANLTGVNFSAANLNAANLGDAN 284
Query: 155 FTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+GA+LS T + L+ ANL A L R L+ +DL A + GAD S A
Sbjct: 285 LSGANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHA 338
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ F +A+L A N AN + A++R +D S + +GA L A Y+A+ + ADL
Sbjct: 266 TGVNFSAANLNAANLGDANLSGANLSGANLRCTDLSSANLSGANLAGADLYRADLSHADL 325
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L+ ANL++A L L+ S L AI+ A+ SDA
Sbjct: 326 SSANLSGADLSHANLSSANLRDAELSSSYLSHAILFAANLSDA 368
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 50/100 (50%), Gaps = 5/100 (5%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA A L A N R A AD+ D + N A L V +A+ +GAD+S
Sbjct: 417 AADLSYAKLNGAKLNGANLRSAILLGADLGGVDLTDVILNEADLSGVVLNEADLSGADIS 476
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
D ++ L+ ANL++A L+ S+L GAI+ GAD S
Sbjct: 477 DAILFGTDLSYANLSSA-----NLSGSNLSGAILSGADLS 511
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ S+ L A+ N AN SA++ +D + +G L A + G++LSD
Sbjct: 348 AELSSSYLSHAILFAANLSDANLNSANLSYADLCRADLSGTNLNHA-----DLRGSNLSD 402
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T +L NL NA+L+ L+ + L GA + GA+ A++
Sbjct: 403 T-----ILFSTNLRNAILIAADLSYAKLNGAKLNGANLRSAIL 440
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 10/115 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A DL A N A+ AD+ +D S + +GA L A AN A+L
Sbjct: 291 SGANLRCTDLSSANLSGANLAGADLYRADLSHADLSSANLSGADLSHANLSSANLRDAEL 350
Query: 161 SDTLMDRMV----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + + + LN ANL+ A L R L+ ++L A + G++ SD ++
Sbjct: 351 SSSYLSHAILFAANLSDANLNSANLSYADLCRADLSGTNLNHADLRGSNLSDTIL 405
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 25/112 (22%)
Query: 119 NFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF-----TGADLSDTLMDRM 168
N AN + AD+ +D SG+ N G+ L + + N ADLS ++
Sbjct: 369 NLNSANLSYADLCRADLSGTNLNHADLRGSNLSDTILFSTNLRNAILIAADLSYAKLNGA 428
Query: 169 VLNEANLTNAVLV----------RTVLTRSDLGGAIIE-----GADFSDAVI 205
LN ANL +A+L+ +L +DL G ++ GAD SDA++
Sbjct: 429 KLNGANLRSAILLGADLGGVDLTDVILNEADLSGVVLNEADLSGADISDAIL 480
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 2/90 (2%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L D+ EA+ G + + A AD+ A+ + AN +SA++ S+ SG+ +G
Sbjct: 450 LTDVILNEADLSGV--VLNEADLSGADISDAILFGTDLSYANLSSANLSGSNLSGAILSG 507
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
A L A GADLSD ++++ NE
Sbjct: 508 ADLSYTNLSYAILGGADLSDANLEKVTWNE 537
>gi|158340188|ref|YP_001521358.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310429|gb|ABW32044.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 292
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 72/142 (50%), Gaps = 15/142 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F ++ L++++ + ++F+ AD+R +DFS +K + A L++ +AN GADL
Sbjct: 68 SGANFKASKLQRSLAIWVQAYWSDFSDADLRHADFSCAKLSAAQLKRTDFSQANLMGADL 127
Query: 161 SDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
SD+ A NLTN L + +T SDL A + +D S + +
Sbjct: 128 SDSEAQDACFKGANLWGVWAQRTNLTNVCLSQVDMTTSDLTEAQLSESDLSWSFL----S 183
Query: 211 QALCKYANGTNP-ITGVSTRKS 231
QA+C AN T+ + G +K+
Sbjct: 184 QAVCVGANLTSACLEGSDLKKT 205
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 10/89 (11%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLN 171
+ + T++D+ E+ S S + ++L +AV AN T A D D + R L+
Sbjct: 159 QVDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACLSRADLS 218
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+ NA L ++DL GA + GADF
Sbjct: 219 AADCENACFFNANLYKADLRGAKLCGADF 247
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 10/113 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A L A + +F +AN AD+ +S+ + F GA L A + N T LS
Sbjct: 100 ADFSCAKLSAAQLKRTDFSQANLMGADLSDSEAQDACFKGANLWGVWAQRTNLTNVCLSQ 159
Query: 163 TLMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
M L EA L+ AV V LT + L G+ ++ DF DA +
Sbjct: 160 VDMTTSDLTEAQLSESDLSWSFLSQAVCVGANLTSACLEGSDLKKTDFQDACL 212
>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 740
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 54/101 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +LR A N A+ AD+R +D G+ F GA L +A Y+AN T + +
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R+ N ++L +A L+R L++S L A ++GA+ S +
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L +N A RG G A ADLR A NF+ AN A+ +++ + FNG
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGG 192
A L + NF +DL D + R+ L++ ANL+ + L T TR+DL
Sbjct: 640 ANLR-----RVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSN 694
Query: 193 AIIEGADFSDAVI 205
A GAD S +I
Sbjct: 695 AKFNGADLSFTLI 707
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+QF DLR+ N + +F ADMRE + G L KAN + A L+
Sbjct: 430 SQFQGQDLRQKNLKGVNLKTIDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNG 489
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+ + L AN+ A LV+T L R+DL + A + A + A ++ C
Sbjct: 490 SKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 87 NKYEAE-TRGEFGIGSA--AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N Y+A T G F + F +DLR A ++ + ++ SA ++ ++ S S G
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
+A A F GADLS TL+ L+ A+LTNA L + L S+ G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
+ A A++++A VK + RRA+ ++ + + ++ A L A KAN
Sbjct: 493 AVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASL 552
Query: 157 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 553 EGCDLQGADLSNGNLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 605
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 46/103 (44%), Gaps = 10/103 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQ A+LR A +K N A+ D++ +D S A L +A AN G +L
Sbjct: 528 TTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAKLNQANLAHANLRGVNL 587
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ ANL L L +DL GA ++GA+F A
Sbjct: 588 RN----------ANLRGGNLEGAHLEGADLRGADLQGANFKGA 620
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 20/141 (14%)
Query: 105 FGSADLRK-----AVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKAN 154
F AD+R+ +K + R N A++ + +GSK GA +++A K +
Sbjct: 452 FKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNGSKLAVANLKGANMQEASLVKTD 511
Query: 155 FTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
ADL D + L ANL +A L++ L + L G ++GAD S+ ++ A+
Sbjct: 512 LRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAK 571
Query: 210 -KQALCKYANGTNPITGVSTR 229
QA +AN + GV+ R
Sbjct: 572 LNQANLAHAN----LRGVNLR 588
>gi|86606854|ref|YP_475617.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555396|gb|ABD00354.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 248
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 69/148 (46%), Gaps = 15/148 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----G 157
A F ++DLR + + +NFT+A + +S F G F+ + +A AN T
Sbjct: 89 ANFAASDLRGSSFSQALGDYSNFTAAKLDKSSFQGGHFSHSIFREASLVAANLTEGNFFA 148
Query: 158 AD----------LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
AD LS ++ L AN A+LV L + + GA GADF+DA +
Sbjct: 149 ADFRQANLFRCNLSQAILSSCQLQNANFDQALLVGANLQEAQIEGASFVGADFTDAKLSD 208
Query: 208 AQKQALCKYANGTNPITGVSTRKSLGCG 235
++ L + A+GTN +T T +L G
Sbjct: 209 EMRKFLLERASGTNELTQRDTLNTLLAG 236
>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 740
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 54/101 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +LR A N A+ AD+R +D G+ F GA L +A Y+AN T + +
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R+ N ++L +A L+R L++S L A ++GA+ S +
Sbjct: 640 ANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQS 680
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L +N A RG G A ADLR A NF+ AN A+ +++ + FNG
Sbjct: 582 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANITEGNFNG 639
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGG 192
A L + NF +DL D + R+ L++ ANL+ + L T TR+DL
Sbjct: 640 ANLR-----RVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGTDFTRADLSN 694
Query: 193 AIIEGADFSDAVI 205
A GAD S +I
Sbjct: 695 AKFNGADLSFTLI 707
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+QF DLR+ N + +F ADMRE + G L KAN + A L+
Sbjct: 430 SQFQGQDLRQKNLKGVNLKTIDFKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNG 489
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+ + L AN+ A LV+T L R+DL + A + A + A ++ C
Sbjct: 490 SKLAVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSAC 541
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 87 NKYEAE-TRGEFGIGSA--AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N Y+A T G F + F +DLR A ++ + ++ SA ++ ++ S S G
Sbjct: 626 NFYQANITEGNFNGANLRRVNFNRSDLRDAELIRVDLSKSRLRSACLQGANLSQSNLKGT 685
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
+A A F GADLS TL+ L+ A+LTNA L + L S+ G I
Sbjct: 686 DFTRADLSNAKFNGADLSFTLIRHANLSGADLTNAKLEKANLFGSNTVGCI 736
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
+ A A++++A VK + RRA+ ++ + + ++ A L A KAN
Sbjct: 493 AVANLKGANMQEASLVKTDLRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASL 552
Query: 157 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 553 EGCDLQGADLSNGNLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 605
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 46/103 (44%), Gaps = 10/103 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQ A+LR A +K N A+ D++ +D S A L +A AN G +L
Sbjct: 528 TTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAKLNQANLAHANLRGVNL 587
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ ANL L L +DL GA ++GA+F A
Sbjct: 588 RN----------ANLRGGNLEGAHLEGADLRGADLQGANFKGA 620
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 20/141 (14%)
Query: 105 FGSADLRK-----AVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKAN 154
F AD+R+ +K + R N A++ + +GSK GA +++A K +
Sbjct: 452 FKGADMREKNLKGMSLIKLDLRLVNLAKANLSHAILNGSKLAVANLKGANMQEASLVKTD 511
Query: 155 FTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
ADL D + L ANL +A L++ L + L G ++GAD S+ ++ A+
Sbjct: 512 LRRADLEDVNLSYASLTTAQLQRANLRSACLIKANLMAASLEGCDLQGADLSNGNLESAK 571
Query: 210 -KQALCKYANGTNPITGVSTR 229
QA +AN + GV+ R
Sbjct: 572 LNQANLAHAN----LRGVNLR 588
>gi|239909009|ref|YP_002955751.1| hypothetical protein DMR_43740 [Desulfovibrio magneticus RS-1]
gi|239798876|dbj|BAH77865.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 972
Score = 55.5 bits (132), Expect = 3e-05, Method: Composition-based stats.
Identities = 54/194 (27%), Positives = 83/194 (42%), Gaps = 29/194 (14%)
Query: 37 ACQISSKTESDGQFPGP------YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYE 90
AC +S T S+ +P P A+L+ R + L + + + S+N L ++ Y
Sbjct: 724 ACSKNSTTISNIAWPLPTSFGEWLARLQPQRNSPESTLGYSCLNNISANNQQLPMVDLYS 783
Query: 91 AETRGE-------------FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 137
+ F S F A L + NF SA +RES+F+
Sbjct: 784 SNLAKSSLKNCQLFNANFMFSNLSEVNFNGAKLDDVEFANAILNKTNFESASLRESNFTN 843
Query: 138 S-----KFNGAYLEKAVAYKA-----NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
+ F A +EK+ +KA NF ADL++T L ANL+N+ L LTR
Sbjct: 844 AICNNANFKKARMEKSNLHKATLINTNFEKADLTNTNFSEASLEGANLSNSKLKEANLTR 903
Query: 188 SDLGGAIIEGADFS 201
++L A + GA+ S
Sbjct: 904 ANLCDANLVGANLS 917
Score = 43.9 bits (102), Expect = 0.081, Method: Composition-based stats.
Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 157
A+ ++L KA + NF +A+ T+ + E+ G SK A L +A AN G
Sbjct: 854 ARMEKSNLHKATLINTNFEKADLTNTNFSEASLEGANLSNSKLKEANLTRANLCDANLVG 913
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCK 215
A+LS + + + N+ANL NA L+ S GA ++ A F D V IDL Q C+
Sbjct: 914 ANLSGSDLSKANFNKANLANANLLNCKFNFSKFLGANLDNAKFDDDVDIDLLTNQKRCQ 972
>gi|254409513|ref|ZP_05023294.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183510|gb|EDX78493.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 209
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 61/118 (51%), Gaps = 15/118 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 152
A +A+L +A ++ N +RAN T A +RE+ + +GA L +A+ +
Sbjct: 80 ANLTAAELVRATLIECNLKRANLTEAHLREASLMFANLAQACLYQADLHGAMLHQAILHW 139
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAIIEGADFSDAVI 205
A+ ADL ++ + A+L+ A L+R +L +DL GAI+ GA+F A++
Sbjct: 140 ASLKNADLIGAILQGADMRGADLSQACLIRADVSKAILMVADLRGAIVMGANFKAAIL 197
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
A A++R SD SG+ +GA L+ + +AN + A+LS + + LN+ANLT A
Sbjct: 27 LTEAILNGANLRRSDLSGANLSGASLKGSNLSEANLSQANLSVANLSKAELNDANLTAAE 86
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
LVR L +L A + A +A + A C Y
Sbjct: 87 LVRATLIECNLKRANLTEAHLREASLMFANLAQACLY 123
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 4/120 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR++ N A+ +++ E++ S + + A L KA AN T A+L
Sbjct: 30 AILNGANLRRSDLSGANLSGASLKGSNLSEANLSQANLSVANLSKAELNDANLTAAELVR 89
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+ L ANLT A L L ++L A + AD A++ QA+ +A+ N
Sbjct: 90 ATLIECNLKRANLTEAHLREASLMFANLAQACLYQADLHGAML----HQAILHWASLKNA 145
>gi|334121546|ref|ZP_08495612.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333454932|gb|EGK83604.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 388
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 65/118 (55%), Gaps = 8/118 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A+L KAV + N +A A++ ++F + + A L++A +A TGA+LS
Sbjct: 133 ADMSAANLTKAVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTGAELSK 192
Query: 163 -----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ R L++ANL A L RT LT++ L GA + G+D S+A +D A LCK
Sbjct: 193 ANLAGANLTRANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRAN---LCK 247
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A + + AN A++ ++ +G+ N A L A+ AN GAD+
Sbjct: 76 SKADLSGANLTGANLMAASLSGANLIGANLTGANLAGAHLNWANLTGAILPNANLIGADM 135
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S + + VL EANL+ A L++ A + GA+F DA + LA
Sbjct: 136 SAANLTKAVLTEANLSKAYLIK----------ANLNGANFQDAYLSLA 173
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN ADM ++ + + A L KA KAN GA+ D + L EA+LT A L
Sbjct: 128 ANLIGADMSAANLTKAVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTG 187
Query: 183 TVLTRSDLGGAIIEGADFSDA 203
L++++L GA + A+ S A
Sbjct: 188 AELSKANLAGANLTRANLSKA 208
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 20/126 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA--------- 153
A A+L KA +K N RR N T A + + GS + A L++A KA
Sbjct: 198 ANLTRANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRANLCKADLSKTYLRN 257
Query: 154 -----------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
NF+GADLS + R +L N+ A+L L+ + L A + GA+ S
Sbjct: 258 ITLNGSHLSGINFSGADLSGVDLSRKLLTGINMAEALLNEANLSGAYLMEANLSGANLSK 317
Query: 203 AVIDLA 208
A + LA
Sbjct: 318 ANLSLA 323
Score = 43.5 bits (101), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 155
S F ADL ++ N A + E++ SG+ +GA L KA A
Sbjct: 266 SGINFSGADLSGVDLSRKLLTGINMAEALLNEANLSGAYLMEANLSGANLSKANLSLAYL 325
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADLS++ + + L++ANL+ A L + LT ++L GAI+ AD + A
Sbjct: 326 INADLSNSCLHEINLSKANLSKASLQKADLTGANLRGAILTEADLTGA 373
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANF 155
+ A+ A+L A + N +AN A++R ++ + + NGA L +A +AN
Sbjct: 186 TGAELSKANLAGANLTRANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRANL 245
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
ADLS T + + LN ++L+ L+ DL ++ G + ++A+++
Sbjct: 246 CKADLSKTYLRNITLNGSHLSGINFSGADLSGVDLSRKLLTGINMAEALLN 296
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 59/143 (41%), Gaps = 37/143 (25%)
Query: 103 AQFGSADLRKAVHVKENFRRANF--------------------TSADMRESDFSGSKFNG 142
A A+L KA +K N ANF T A++ +++ +G+
Sbjct: 143 AVLTEANLSKAYLIKANLNGANFQDAYLSLASLKEADLTEAQLTGAELSKANLAGANLTR 202
Query: 143 AYLEKAVAYKAN---------------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A L KA KAN G+DLS+ +DR L +A+L+ L L
Sbjct: 203 ANLSKANLLKANLRRTNLTQAYLNGACLIGSDLSEACLDRANLCKADLSKTYLRNITLNG 262
Query: 188 SDLGGAIIEGADFSDAVIDLAQK 210
S L G GAD S +DL++K
Sbjct: 263 SHLSGINFSGADLSG--VDLSRK 283
>gi|304414054|ref|ZP_07395422.1| pentapeptide repeat-containing protein [Candidatus Regiella
insecticola LSR1]
gi|304283268|gb|EFL91664.1| pentapeptide repeat-containing protein [Candidatus Regiella
insecticola LSR1]
Length = 283
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 57/128 (44%), Gaps = 22/128 (17%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF----------------------SGS 138
S A +ADLR A N + A ADMRE D SG+
Sbjct: 122 SNATLSNADLRGAYMSWANLQNATLNDADMREVDLVGADMREAKLIGKKTNLEGANLSGA 181
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
GA L + KA + ADLS ++R+ L EANL +A+L T L + L A +E
Sbjct: 182 DLRGAELCHTILIKAALSWADLSYAKLERVNLREANLYHAILEETSLYLTKLENANLESV 241
Query: 199 DFSDAVID 206
+ DAV++
Sbjct: 242 NLKDAVLE 249
>gi|428224583|ref|YP_007108680.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984484|gb|AFY65628.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 156
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 71/138 (51%), Gaps = 8/138 (5%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A L +A + N ++AN +SAD+ +D S + +GA L +A A+ T ADL
Sbjct: 17 FQQAALHQADLEEVNLQQANLSSADLSSADLSHANLSGANLSRANLSNADLTNADLRSAD 76
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLAQKQALCKYANGTN 221
+ + L ANL+ A L R L ++DL AI+ ADF+ A +DL+ +GTN
Sbjct: 77 LSEVNLIGANLSGAKLGRANLFQADLRSAILTDADFTGANLEDVDLSGAD-----LSGTN 131
Query: 222 PITGVSTRKSLGCGNSRR 239
T ++ + G SRR
Sbjct: 132 LRTAELSKAASSHGVSRR 149
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S+A SADL A N RAN T+AD+R +D S GA L A +AN
Sbjct: 38 SSADLSSADLSHANLSGANLSRANLSNADLTNADLRSADLSEVNLIGANLSGAKLGRANL 97
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADL +L +A+ T A L L+ +DL G + A+ S A
Sbjct: 98 FQADLRSA-----ILTDADFTGANLEDVDLSGADLSGTNLRTAELSKA 140
>gi|78187857|ref|YP_375900.1| pentapeptide repeat-containing protein [Chlorobium luteolum DSM
273]
gi|78167759|gb|ABB24857.1| pentapeptide repeat family protein [Chlorobium luteolum DSM 273]
Length = 447
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 59/118 (50%), Gaps = 20/118 (16%)
Query: 103 AQFGSADLRKAVHVKE----------NFRRANFTSADMRESDFSGSKFNGAYLEKA---- 148
A+ ADLR+ V ++ N R AN A +R++D G+ GA+L KA
Sbjct: 63 AELAGADLRRTVLIRADLSGANLNGANLREANLAMAFIRKADMKGADMTGAWLVKANLKS 122
Query: 149 -----VAYK-ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+++ AN GA+L + + + L ANL+NAVL L +DL GA + GA F
Sbjct: 123 SFMNGASFRGANLLGANLRWSSLRKADLTGANLSNAVLFEANLAGADLSGANLSGATF 180
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 1/106 (0%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A+ F A L A N R A AD++ + G+ GA L++A A+ +GA+L
Sbjct: 306 ASSFNGATLDNADMRGANLRNAYMKKADLKSAKLGGACLEGANLDRAFLKDADLSGANLR 365
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 206
T++ L+ ANL A L L +DL GA ++GAD A V+D
Sbjct: 366 GTMLYGATLSGANLEGADLAGASLFDADLRGANLDGADLEGANVMD 411
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 5/98 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR+A +F A +ADMR G+ AY++KA A GA L +DR
Sbjct: 297 ADLRQADLGASSFNGATLDNADMR-----GANLRNAYMKKADLKSAKLGGACLEGANLDR 351
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L +A+L+ A L T+L + L GA +EGAD + A +
Sbjct: 352 AFLKDADLSGANLRGTMLYGATLSGANLEGADLAGASL 389
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 20/89 (22%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
KE ++ AD+R++D S FNGA L+ A + ANL
Sbjct: 286 KEKLESSSLEGADLRQADLGASSFNGATLDNAD--------------------MRGANLR 325
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
NA + + L + LGGA +EGA+ A +
Sbjct: 326 NAYMKKADLKSAKLGGACLEGANLDRAFL 354
>gi|428319029|ref|YP_007116911.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428242709|gb|AFZ08495.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 520
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 1/121 (0%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
L KY A R GI + A +L A N AN + A++ +++ +G+K N A
Sbjct: 7 LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKTNLTGAKLNIAR 66
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
L A A+ T ADL+ + R+ L +A L A L+R L R++L GA + GA+ S A
Sbjct: 67 LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126
Query: 205 I 205
+
Sbjct: 127 L 127
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+KA+ + RA A++ ++ SG+ +GA L +A AN A+L +
Sbjct: 91 DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L EANL A L L+R+DL GA + G + A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A + R AN A++R + SG+ A LE+A A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 203
S + L +ANLT AVL L+ +L AI+ G AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 52/113 (46%), Gaps = 10/113 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A + N +AN AD+ +D SG+ G L +A +A +GADLS
Sbjct: 140 ANLRGAHLSGACLTEANLEQANLQGADLSRADLSGADLRGTELRQANLTQAVLSGADLSG 199
Query: 163 TLMDRMV----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + L+EA L+ A L R L ++L A + AD S+A +
Sbjct: 200 VNLRWAILSGCNLRWADLSEAKLSGADLSRADLCHANLLNASLVHADLSNAYL 252
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR R+AN T A + +D SG A L A+ + A L
Sbjct: 168 SRADLSGADLRGT-----ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKL 222
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S + R L ANL NA LV L+ + L A GAD + A + A+ A+ + T
Sbjct: 223 SGADLSRADLCHANLLNASLVHADLSNAYLIRADWIGADLTGATLTGAKLHAVSRLGIKT 282
Query: 221 NPIT 224
+T
Sbjct: 283 EGMT 286
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G ADL A + A D++++ G+K A L +A AN +GA+L
Sbjct: 68 SGAHLGGADLTDA-----DLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122
Query: 161 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L +ANL A L LT ++L A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 15/85 (17%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NF ++ E++ SG +GA L+ A AN +GA+LS T NLT A L
Sbjct: 16 NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSKT----------NLTGAKL--- 62
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLA 208
+ L GA + GAD +DA +++A
Sbjct: 63 --NIARLSGAHLGGADLTDADLNVA 85
>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 521
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 52/99 (52%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G ADL A N RANF A ++E+D + + +GA+L A AN +GA LS
Sbjct: 88 AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLSGALLSR 147
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
++ L+ ANL NA L L +DL A ++ AD S
Sbjct: 148 ANLENADLSYANLENADLSYANLENADLSHANLKNADLS 186
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 59/136 (43%), Gaps = 21/136 (15%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHV-----KEN------FRRANFTSADMRESDFSG 137
Y ETRG+ Q L++ + V +EN FRR + + + E DFS
Sbjct: 3 YSGETRGKDYAAMTNQVLVELLKRGIGVWNSWREENLYENLDFRRVQLSGSYLSEVDFSH 62
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV----------LTR 187
+ F AYL A AN G +L+ + L ANL A L+R LT
Sbjct: 63 ANFEIAYLSSAKLSCANLEGINLNRAYLGGADLYSANLRGANLIRANFNDAHLKEADLTN 122
Query: 188 SDLGGAIIEGADFSDA 203
++L GA + GA+ +A
Sbjct: 123 ANLSGAHLRGANLLNA 138
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 6/109 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF A +SA + ++ G N AYL A Y AN GA+L + L EA+LTNA
Sbjct: 64 NFEIAYLSSAKLSCANLEGINLNRAYLGGADLYSANLRGANLIRANFNDAHLKEADLTNA 123
Query: 179 VL----VRTV-LTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTN 221
L +R L ++L GA++ A+ +A + A + A YAN N
Sbjct: 124 NLSGAHLRGANLLNANLSGALLSRANLENADLSYANLENADLSYANLEN 172
>gi|428216913|ref|YP_007101378.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988695|gb|AFY68950.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 227
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 65/120 (54%), Gaps = 4/120 (3%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+ + A+L V N A+ ++ ++ +D S S + A L + ANF+ A L
Sbjct: 21 SSVKLPGAELDGEVLHHANLADADLSAGNLNHADLSNSDLSRANLYRCSLKHANFSAAKL 80
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S+ + + LN+ANL++A+L L +DL GAI+ GAD S A DL + LC +AN T
Sbjct: 81 SNANLKDVQLNDANLSDAILSCANLAEADLSGAILVGADLSGA--DLTNAE-LC-HANLT 136
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 81/171 (47%), Gaps = 11/171 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ++DL +A + + + ANF++A + ++ + N A L A+ AN ADLS
Sbjct: 53 ADLSNSDLSRANLYRCSLKHANFSAAKLSNANLKDVQLNDANLSDAILSCANLAEADLSG 112
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALCKYA 217
++ L+ A+LTNA L LT ++L G ++ GA+F++A ++ AQ A
Sbjct: 113 AILVGADLSGADLTNAELCHANLTGANLEGVLLHNANLTGANFTNANMENAQLDG----A 168
Query: 218 NGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKL--LDRDGFCDS 266
+ TN +T ++ NS A ++ L Q L+ CD+
Sbjct: 169 DLTNANLSGTTLHNVNLANSNLQAVNLTNADLRGVNLQHTHNLETANLCDA 219
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S A ADL A+ V + A+ T+A++ ++ +G+ G L A ANFT A
Sbjct: 99 ILSCANLAEADLSGAILVGADLSGADLTNAELCHANLTGANLEGVLLHNANLTGANFTNA 158
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
++ + +D L ANL+ L L S+L + AD
Sbjct: 159 NMENAQLDGADLTNANLSGTTLHNVNLANSNLQAVNLTNADL 200
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA+ +A+L+ N A + A++ E+D SG+ GA L A A A+L
Sbjct: 76 SAAKLSNANLKDVQLNDANLSDAILSCANLAEADLSGAILVGADLSGADLTNAELCHANL 135
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ ++ ++L+ ANLT A T +++ A ++GAD ++A
Sbjct: 136 TGANLEGVLLHNANLTGA-----NFTNANMENAQLDGADLTNA 173
>gi|381207604|ref|ZP_09914675.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 255
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 15/111 (13%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL +A + N + A+ T D+ ++ G+ +GA L A AN GADL+D +
Sbjct: 96 ADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSE 155
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAI---------------IEGADFSDA 203
L+EANL+ A L L +DLG A+ ++GAD +DA
Sbjct: 156 ANLSEANLSEADLSGADLREADLGKAVLSQAKLVGANLHRIRLQGADLTDA 206
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 60/123 (48%), Gaps = 15/123 (12%)
Query: 94 RGE-FGIGSAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEK 147
+GE FG+ ADL KAV + R +AN AD++E++ G+ + L
Sbjct: 20 KGELFGV----DLSEADLPKAVLYSSDLREAKLSKANLAKADLQEANLVGAGLHRVDLNG 75
Query: 148 AVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A ++AN ADLS L+ D N EANL NA L L ++LGG + GA S
Sbjct: 76 ANLHQANLAQADLSGALLFFADLHEANAPEANLKNADLTEVDLLHANLGGTDLSGAKLSG 135
Query: 203 AVI 205
A +
Sbjct: 136 AKL 138
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 57/119 (47%), Gaps = 12/119 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G DL A R AN AD+ ++D S + + A L +A+ +GADL +
Sbjct: 121 ANLGGTDLSGAKLSGAKLRGANLVGADLTDADLSEANLSEANLS-----EADLSGADLRE 175
Query: 163 TLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVID--LAQKQALC 214
+ + VL++A L A L R LT +DL A + G D +A+ + L +K LC
Sbjct: 176 ADLGKAVLSQAKLVGANLHRIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKLC 234
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ A+L A + AN + A++ E+D SG+ A L KAV +A GA+L
Sbjct: 134 SGAKLRGANLVGADLTDADLSEANLSEANLSEADLSGADLREADLGKAVLSQAKLVGANL 193
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
R+ L A+LT+A L L DL AI E F A +
Sbjct: 194 H-----RIRLQGADLTDADLTDANLYGIDLREAITENTLFEKAKL 233
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADL++A V R AN A++ ++D SG+ A L +A A +AN
Sbjct: 49 SKANLAKADLQEANLVGAGLHRVDLNGANLHQANLAQADLSGALLFFADLHEANAPEANL 108
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADL++ + L ANL L L+ + L GA + GAD +DA
Sbjct: 109 KNADLTE-----VDLLHANLGGTDLSGAKLSGAKLRGANLVGADLTDA 151
>gi|434394300|ref|YP_007129247.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
gi|428266141|gb|AFZ32087.1| heat shock protein DnaJ domain protein [Gloeocapsa sp. PCC 7428]
Length = 213
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+F+RAN D+ DF + F GA L A +K N +GA+L + R L +ANL++A
Sbjct: 100 DFKRANLKEKDLSGRDFRNANFTGANLSDAFMHKVNLSGANLFQANLFRANLLQANLSHA 159
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
L L +DL G+ + GAD A I + +
Sbjct: 160 NLREANLVGADLSGSDLSGADLRGARIGVGDR 191
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 10/96 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L++ +FR ANFT A++ ++ +GA L +A ++AN A+LS
Sbjct: 99 ADFKRANLKEKDLSGRDFRNANFTGANLSDAFMHKVNLSGANLFQANLFRANLLQANLS- 157
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
ANL A LV L+ SDL GA + GA
Sbjct: 158 ---------HANLREANLVGADLSGSDLSGADLRGA 184
>gi|17227929|ref|NP_484477.1| hypothetical protein alr0433 [Nostoc sp. PCC 7120]
gi|17129778|dbj|BAB72391.1| alr0433 [Nostoc sp. PCC 7120]
Length = 143
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N ++A+ AD+R ++ +G+ A LE A AN GA+LS L+ NLTN
Sbjct: 47 NLQQAHLIGADLRNANLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTNV 106
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
L+ L +DL GA++ AD A++
Sbjct: 107 KLINAELYNADLEGAVLANADLRGAIL 133
Score = 43.5 bits (101), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 49/99 (49%), Gaps = 10/99 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR A N AN A++ +D +G+ GA L + A A+ + +L++
Sbjct: 51 AHLIGADLRNA-----NLAGANLKLANLEGADLTGANLKGANLSQVFASDASLSATNLTN 105
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L A L NA L VL +DL GAI+ GA +S
Sbjct: 106 -----VKLINAELYNADLEGAVLANADLRGAILFGALYS 139
>gi|390438023|ref|ZP_10226524.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
gi|389838556|emb|CCI30648.1| Pentapeptide repeat protein [Microcystis sp. T1-4]
Length = 275
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 69/139 (49%), Gaps = 18/139 (12%)
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
A+ K N A +A++R +D SG+ GAYL A AN A LS + R L
Sbjct: 135 AIGPKANLTGAYLNNANLRFADLSGANLRGAYLSGADLTGANLAAAALSGANLQRASLTG 194
Query: 173 ANLTNAVLVRTVLTRSDLGGAI-----------IEGADFS--DAVIDLAQKQALCKYAN- 218
A L +A LV L +DL GA +EGADFS + + DL ++ LC ++
Sbjct: 195 AFLRDARLVGVELQFADLRGADLTGAILEQIQNLEGADFSQVEGLSDL-ERSYLCGRSSR 253
Query: 219 --GT-NPITGVSTRKSLGC 234
GT NP T +T +SLGC
Sbjct: 254 ELGTWNPYTRSNTGQSLGC 272
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 52/100 (52%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
D+RKA ++ AN D+ + D + F GA L A AN TGA+L + R
Sbjct: 17 DVRKARDKGQSLSAANLEGIDLSQMDLKNADFTGAILLGADLAGANLTGANLEAADLRRA 76
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
L ++L A L T+L R+ L GA ++GAD + A I L+
Sbjct: 77 NLRGSDLRGANLRDTLLYRAILCGANLQGADLTGAKISLS 116
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 10/90 (11%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ + + A+FT A + +D +G+ GA LE A +AN G+DL ANL
Sbjct: 40 QMDLKNADFTGAILLGADLAGANLTGANLEAADLRRANLRGSDLRG----------ANLR 89
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ +L R +L ++L GA + GA S +V D
Sbjct: 90 DTLLYRAILCGANLQGADLTGAKISLSVYD 119
>gi|354569053|ref|ZP_08988212.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353539057|gb|EHC08553.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 519
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 1/117 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +A++R+A NF AN + A++R +D +G+ + A L +A AN GADL
Sbjct: 173 SGANCRNAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSEAKLSGANLIGADL 232
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKY 216
S+ + L A+LT A L++ +DL GA + GA +S + L + +C++
Sbjct: 233 SNANLTNASLVHADLTQAKLIKAEWVGADLSGATLTGAKLYSTSRFGLKTEGMICEW 289
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 70/144 (48%), Gaps = 20/144 (13%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKA----VHVKE-NFRRANFTSADMRES-----DF 135
L KYEA R F S DL +A V + E NF AN + ++ S DF
Sbjct: 7 LAKYEAGER---------DFRSVDLSEANLSGVKLNEANFSHANLSIVNLSGSHLCGTDF 57
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S ++ N A L A ++AN A L+ + R L+ A L +A L+R L R+DL A +
Sbjct: 58 SHAQINVARLSGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRADL 117
Query: 196 EGADFSDAVIDLAQ-KQALCKYAN 218
A+ + A + A + A+ +YAN
Sbjct: 118 FAANLNCADLREASLRHAILRYAN 141
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A + DL + N R A A++ S+FSG+ +GA L A AN + ADLS+
Sbjct: 160 ANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSE 219
Query: 163 TLMD--RMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ ++ L+ ANLTNA LV LT++ L A GAD S A +
Sbjct: 220 AKLSGANLIGADLSNANLTNASLVHADLTQAKLIKAEWVGADLSGATL 267
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 3/114 (2%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR ++ + N AN + D+ +D SG+ A + +A +NF+GA+LS
Sbjct: 140 ANLNEANLRDSLLTEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQAL 213
+ LN ANL+ A L L+ ++L GA + A+ ++A + DL Q + +
Sbjct: 200 ANLRWADLNGANLSWADLSEAKLSGANLIGADLSNANLTNASLVHADLTQAKLI 253
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 10/110 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGS-----KFNGAYLEKAVA 150
S AQ SA L +A ++ + RA N AD+RE+ + N A L ++
Sbjct: 93 SRAQLQSASLIRAELIRADLSRADLFAANLNCADLREASLRHAILRYANLNEANLRDSLL 152
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+AN GA+L++T + R + AN NA + + L+ S+ GA + GA+
Sbjct: 153 TEANLEGANLNNTDLSRTDCSGANCRNAEMRQANLSHSNFSGANLSGANL 202
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 50/102 (49%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA ADLR+A R AN A++R+S + + GA L + + +GA+
Sbjct: 119 AANLNCADLREASLRHAILRYANLNEANLRDSLLTEANLEGANLNNTDLSRTDCSGANCR 178
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ M + L+ +N + A L L +DL GA + AD S+A
Sbjct: 179 NAEMRQANLSHSNFSGANLSGANLRWADLNGANLSWADLSEA 220
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L+ A ++ RA+ + AD+ ++ + + A L A+ AN A+L D
Sbjct: 90 ADLSRAQLQSASLIRAELIRADLSRADLFAANLNCADLREASLRHAILRYANLNEANLRD 149
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+L L EANL A L T L+R+D GA A+ A
Sbjct: 150 SL-----LTEANLEGANLNNTDLSRTDCSGANCRNAEMRQA 185
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 48/101 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A N RA+ + A ++ + ++ A L +A + AN ADL
Sbjct: 68 SGAYLHQANLNHASLNVANLIRADLSRAQLQSASLIRAELIRADLSRADLFAANLNCADL 127
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ + +L ANL A L ++LT ++L GA + D S
Sbjct: 128 REASLRHAILRYANLNEANLRDSLLTEANLEGANLNNTDLS 168
>gi|428225059|ref|YP_007109156.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984960|gb|AFY66104.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 315
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 67/152 (44%), Gaps = 20/152 (13%)
Query: 101 SAAQFGSADLR----KAVHVKENFRRA------NFTSADMRESDFSGSKFNGAYL----- 145
S A+ A LR V +K+ F N D+R + SG+ +GA L
Sbjct: 148 SGARLSGATLRGSFLNGVKLKDAFLNGVDLNGINLDGVDLRSTKLSGATLHGANLAATNF 207
Query: 146 -----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
AN +GA+LS + R LN ANLT A L LT ++L GA IEGA+F
Sbjct: 208 SDAKMHGGSFTGANLSGANLSRAFLKRANLNWANLTRADLTDADLTEANLLGARIEGAEF 267
Query: 201 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
+ + ++ L A G P + TR +L
Sbjct: 268 TGVTLSDPTRRYLRLIATGVTPWSQQPTRSTL 299
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 155
S A A+LR + N AN + ++R +D +G+ N GA L A +
Sbjct: 103 SGANLNGANLRGSHLQHANLCGANLNAINLRGADLTGANLNWANLSGARLSGATLRGSFL 162
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
G L D ++ + LN NL L T L+ + L GA + +FSDA
Sbjct: 163 NGVKLKDAFLNGVDLNGINLDGVDLRSTKLSGATLHGANLAATNFSDA 210
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 23/111 (20%)
Query: 124 NFTSADMRESDFSG----------SKFNGAYLEKAVAYKANFTGA----------DLSDT 163
NF D+R +D SG + GA L +A +AN +GA LS+
Sbjct: 16 NFAGVDLRGADLSGVTLIAVDLSDANLMGANLSRAFLTQANLSGAFLNWADLRYVKLSEG 75
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+ + L +ANL+ A +V++ R+ L GA + GA+ + + Q LC
Sbjct: 76 CLTHVDLTKANLSGAFMVKSDFNRAKLSGANLNGANLRGSHL---QHANLC 123
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A VK +F RA + A++ ++ GS A L A N GADL+ ++
Sbjct: 85 ANLSGAFMVKSDFNRAKLSGANLNGANLRGSHLQHANLCGANLNAINLRGADLTGANLNW 144
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L+ A L+ A L + L L A + G D + +D
Sbjct: 145 ANLSGARLSGATLRGSFLNGVKLKDAFLNGVDLNGINLD 183
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 53/118 (44%), Gaps = 14/118 (11%)
Query: 101 SAAQFGSADLRKA-------VHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKA 148
S A ADLR HV + +AN + A M +SDF SG+ NGA L +
Sbjct: 58 SGAFLNWADLRYVKLSEGCLTHV--DLTKANLSGAFMVKSDFNRAKLSGANLNGANLRGS 115
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
AN GA+L+ + L ANL A L L+ + L G+ + G DA ++
Sbjct: 116 HLQHANLCGANLNAINLRGADLTGANLNWANLSGARLSGATLRGSFLNGVKLKDAFLN 173
>gi|300866933|ref|ZP_07111605.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335037|emb|CBN56767.1| exported hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 253
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 10/119 (8%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N+ + E E G Q DLR A N R AN AD+R ++ G+ GA L
Sbjct: 26 NRRDVEKLKETG-----QCSRCDLRDA-----NLRNANLQGADLRNANLRGANLRGAALR 75
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A A+ GADL D + R L ANL++A L L R+++ G +G D A +
Sbjct: 76 NADLSNADLRGADLRDADLSRSNLRNANLSDANLRNADLERAEVRGVNFQGTDLRGANV 134
>gi|412993172|emb|CCO16705.1| predicted protein [Bathycoccus prasinos]
Length = 163
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 5/140 (3%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
G +A + DLRK + + + + A M +S F S F+ + K A KA+F
Sbjct: 29 IGQANAVSDKTLDLRKCQYDNVSVKGITLSGALMVDSVFDNSDFSETVMSKVYATKASFK 88
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
+ ++ ++DR + +++T A VLT GA + GA+F +A+I + LC
Sbjct: 89 NVNFTNAVIDRATFDGSDMTGANFQNAVLTGVSYEGANLTGANFEEALIGDQDVKLLC-- 146
Query: 217 ANGTNPITGVSTRKSLGCGN 236
NP +R +GC N
Sbjct: 147 ---LNPTVVDESRMQIGCKN 163
>gi|186682860|ref|YP_001866056.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186465312|gb|ACC81113.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 589
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 52/104 (50%), Gaps = 5/104 (4%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A ADL A+ N N + A + +D S +K NGA L A A F G
Sbjct: 408 ADLSGADLSHAILNGTNLSDTILFSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLG 467
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
ADLS + R+ LNEA+L+ +L L+ +DL AI+ G DFS
Sbjct: 468 ADLSGVDLSRVSLNEADLSGVILSEADLSGADLTDAILFGTDFS 511
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 56/115 (48%), Gaps = 10/115 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANF 155
S A ADL N RAN + AD+ +D S + N A L +A NF
Sbjct: 316 SGANLSGADLSSTNLSGANLSRANLSRADLNRADLSSTNLNRADLSNTNLSRADLSSTNF 375
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 205
+ ADLS+ ++ L+EANL+N L L R+DL G AI+ G + SD ++
Sbjct: 376 SRADLSNAILFGANLSEANLSNVSLNHADLCRADLSGADLSHAILNGTNLSDTIL 430
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ F A+L A N AN + A++ +D S + +GA L +A +A+ ADL
Sbjct: 291 TGVNFIGANLSGANFGDANLSGANLSGANLSGADLSSTNLSGANLSRANLSRADLNRADL 350
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S T ++R L+ NL+ A L T +R+DL AI+ GA+ S+A
Sbjct: 351 SSTNLNRADLSNTNLSRADLSSTNFSRADLSNAILFGANLSEA 393
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G A+L + N ANF A++ ++ SG+ +GA L AN + A+L
Sbjct: 281 SLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGANLSGADLSSTNLSGANLSRANL 340
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S ++R L+ NL A L T L+R+DL AD S+A++
Sbjct: 341 SRADLNRADLSSTNLNRADLSNTNLSRADLSSTNFSRADLSNAIL 385
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 57/115 (49%), Gaps = 8/115 (6%)
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 114
AKL N R+ + L A + S +S LN EA+ G I S A ADL A+
Sbjct: 453 AKLNNARLNGAMFLGADLSGVDLSRVS----LN--EADLSGV--ILSEADLSGADLTDAI 504
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
+F AN SA++ S+ SG+ NGA L + A +GADLSD M++M
Sbjct: 505 LFGTDFSYANLNSANLSGSNLSGAILNGANLSHSNLSYAILSGADLSDANMEKMT 559
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA A L A A F AD+ D S N A L + +A+ +GADL+
Sbjct: 442 AADLSYAKLNGAKLNNARLNGAMFLGADLSGVDLSRVSLNEADLSGVILSEADLSGADLT 501
Query: 162 DTLM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
D ++ + L+ +NL+ A+L L+ S+L AI+ GAD SDA ++
Sbjct: 502 DAILFGTDFSYANLNSANLSGSNLSGAILNGANLSHSNLSYAILSGADLSDANME 556
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 48/90 (53%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NFR A A++ +DFSG+ + AYL A NF GA+LS L+ ANL+ A
Sbjct: 259 NFRSAYLGDANLTGADFSGADLSLAYLGDANLTGVNFIGANLSGANFGDANLSGANLSGA 318
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
L L+ ++L GA + A+ S A ++ A
Sbjct: 319 NLSGADLSSTNLSGANLSRANLSRADLNRA 348
Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+ F ADL A+ N AN ++ + +D + +GA L A+ N + L
Sbjct: 371 SSTNFSRADLSNAILFGANLSEANLSNVSLNHADLCRADLSGADLSHAILNGTNLSDTIL 430
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
T + +L A+L+ A L L + L GA+ GAD S
Sbjct: 431 FSTNLSDAILMAADLSYAKLNGAKLNNARLNGAMFLGADLS 471
>gi|113477234|ref|YP_723295.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168282|gb|ABG52822.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 227
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 63/122 (51%), Gaps = 10/122 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 152
A+F ADL +A ++ + A+F+ +M ++D S G+ G +A+ K
Sbjct: 90 AKFNKADLTRAKLIRADLSCADFSQVNMVDADLSRAILYEIDLHGANLYGVNFRRAILNK 149
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
A+ GA+L M + L EANLT A L +L+ +DL GA + GA+ SD + A QA
Sbjct: 150 ADLIGANLIRANMTGVDLIEANLTRANLTEAILSGADLNGASLLGANISDVNLVGAALQA 209
Query: 213 LC 214
+
Sbjct: 210 VI 211
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 66/154 (42%), Gaps = 25/154 (16%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFN 141
N +EA G A A+L + K N AN D+ E++ S G+KFN
Sbjct: 41 NFFEANLTG-------ANLSQANLSRVNLAKANLTGANLIGTDLSEANLSDTLLVGAKFN 93
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A L +A +A+ + AD S + N+ +A L R +L DL GA + G +F
Sbjct: 94 KADLTRAKLIRADLSCADFS----------QVNMVDADLSRAILYEIDLHGANLYGVNFR 143
Query: 202 DAVI---DLAQKQALCKYANGTNPITGVSTRKSL 232
A++ DL + G + I TR +L
Sbjct: 144 RAILNKADLIGANLIRANMTGVDLIEANLTRANL 177
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 58/119 (48%), Gaps = 6/119 (5%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
E ++ IG F L++A +K N ANF A++ ++ S + + L KA
Sbjct: 10 ELLRQYAIGEK-NFSGLYLQEAHLLKANLEGANFFEANLTGANLSQANLSRVNLAKANLT 68
Query: 152 KANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AN G D LSDTL+ N+A+LT A L+R L+ +D + AD S A++
Sbjct: 69 GANLIGTDLSEANLSDTLLVGAKFNKADLTRAKLIRADLSCADFSQVNMVDADLSRAIL 127
>gi|86605838|ref|YP_474601.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86554380|gb|ABC99338.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 158
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 15/112 (13%)
Query: 109 DLRKAVHVKENFRRANFT----------SADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
DL +A N R AN + AD+R +D S + GA L A ++AN GA
Sbjct: 30 DLVRATLQGANLRGANLSFGKLSGINLQEADLRGADLSSANLMGANLRGANLWEANLIGA 89
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVI 205
DLS + L+ A L A L R L SDL GGA++ GAD S A++
Sbjct: 90 DLSFADLREANLHGAYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLSGAIL 141
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 54/118 (45%), Gaps = 7/118 (5%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 122
V L A + + + L+ +N EA+ RG A SA+L A N
Sbjct: 31 LVRATLQGANLRGANLSFGKLSGINLQEADLRG-------ADLSSANLMGANLRGANLWE 83
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
AN AD+ +D + +GAYL +A +A G+DLS + VL A+L+ A+L
Sbjct: 84 ANLIGADLSFADLREANLHGAYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLSGAIL 141
>gi|427735760|ref|YP_007055304.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370801|gb|AFY54757.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 263
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 46/88 (52%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++FRRAN D+ + S +K NGA L A +K GADLSD + R L A++
Sbjct: 149 QDFRRANLKGRDLSGRNLSYAKLNGANLSDAFMHKVVLRGADLSDANLFRANLLLADMKE 208
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L L +DL GA + GAD A I
Sbjct: 209 ANLQGADLIGADLSGADLRGADLRGARI 236
>gi|58613539|gb|AAW79356.1| chloroplast thylakoid 11kDa protein [Heterocapsa triquetra]
Length = 91
Score = 55.1 bits (131), Expect = 4e-05, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQ 211
A+FTGA L+ ++ L A+LTNA++ + + L +GADF+D + Q+
Sbjct: 7 ADFTGAVLTQANLELAQLTGADLTNAIVTEAYINGTTKLEVKSADGADFTDTPLRKDQQM 66
Query: 212 ALCKYANGTNPITGVSTRKSLGC 234
LC A GTNP+T V TR+S+ C
Sbjct: 67 YLCGIAKGTNPVTKVDTRESMAC 89
>gi|359464087|ref|ZP_09252650.1| hypothetical protein ACCM5_35600 [Acaryochloris sp. CCMEE 5410]
Length = 237
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 61/123 (49%), Gaps = 20/123 (16%)
Query: 103 AQFGSADLRKAVHVKENF-----RRANF----------TSADMR-----ESDFSGSKFNG 142
A F ADLR++ + NF RRAN TSADMR E+D SG+K
Sbjct: 35 ADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSADMRQANLREADLSGAKLIQ 94
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
L +A KA GA+LS MD +L E +L RT L R++L GA + A+ S
Sbjct: 95 TQLTEANLLKACLCGANLSAVQMDGAILIEVDLRPTSDQRTDLGRANLAGADLSYANLSQ 154
Query: 203 AVI 205
A++
Sbjct: 155 ALL 157
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +LR+A + A+F+ AD+R+S F + F+ +A + F GADL+
Sbjct: 17 FHRIELREAELINSELCGADFSDADLRQSRFGRTNFSYTCFRRANLSETIFWGADLTSAD 76
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
M + L EA+L+ A L++T LT ++L A + GA+ S +D
Sbjct: 77 MRQANLREADLSGAKLIQTQLTEANLLKACLCGANLSAVQMD 118
>gi|152980852|ref|YP_001353914.1| pentapeptide repeat-containing protein [Janthinobacterium sp.
Marseille]
gi|151280929|gb|ABR89339.1| Uncharacterized conserved protein, pentapeptide repeat family
[Janthinobacterium sp. Marseille]
Length = 243
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 5/116 (4%)
Query: 88 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 147
+++ E AA A+LR A N R AN AD+R++D SG+ A L
Sbjct: 16 EHDIEDNTMLATVKAALAAGANLRDADLSGANLRGANLRDADLRDADLSGANLRDADLSG 75
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A A+ +GA+LSD L+ ANL+ A L L ++LGGA + GAD S A
Sbjct: 76 ANLRDADLSGANLSDA-----DLSGANLSGADLSGANLGGANLGGANLSGADLSGA 126
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR A N R A+ + A++R++D SG+ + A L A N +GADLS
Sbjct: 51 ANLRDADLRDADLSGANLRDADLSGANLRDADLSGANLSDADLSGA-----NLSGADLSG 105
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L ANL+ A L L+ ++L GA + GA+ D
Sbjct: 106 ANLGGANLGGANLSGADLSGANLSGANLRGANLSGANLRD 145
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 46/98 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A + AN + AD+ ++ SG+ +GA L A AN +GADL
Sbjct: 64 SGANLRDADLSGANLRDADLSGANLSDADLSGANLSGADLSGANLGGANLGGANLSGADL 123
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
S + L ANL+ A L + D+ A+ E A
Sbjct: 124 SGANLSGANLRGANLSGANLRDYPVKIKDIHKAVYEAA 161
>gi|186685487|ref|YP_001868683.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186467939|gb|ACC83740.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 146
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 107 SADLRKAVHVKE---------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
SA +R+ + +E N + A+ D+R ++ G+ GA LE A AN
Sbjct: 28 SAPVRRLLETRECLGCNLAGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKS 87
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L++ + +LN ANLTN L + L +D+ GA++ D S A
Sbjct: 88 ANLTEAFVSDTILNNANLTNVNLSNSRLYNTDVDGAVLANIDLSGA 133
>gi|425454434|ref|ZP_18834174.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9807]
gi|389804880|emb|CCI15729.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9807]
Length = 962
Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats.
Identities = 34/103 (33%), Positives = 52/103 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A LR A+ N + AN A+++E++ + F GA L +A +AN GA+L +
Sbjct: 798 ANLEGAILRGAILEGANLKEANLKEANLKEANLEEAFFEGAILAEANLERANLYGANLGE 857
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ L ANL A L R L + L GA +E A+ A +
Sbjct: 858 ANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFL 900
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 51/176 (28%), Positives = 76/176 (43%), Gaps = 23/176 (13%)
Query: 49 QFPGP------YAKLKNWRV-----FVSTALAAAVVASCSSNISALADLNKYEAETRGEF 97
Q+P P A+L+ R+ FV L+ + +C L N A G
Sbjct: 745 QWPSPEAFGNWLARLQGQRIDFEPMFVLDCLSFLDLKNCLLICRDLYKANLERANLEG-- 802
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
I A A+L++A N + AN A++ E+ F G+ A LE+A Y AN
Sbjct: 803 AILRGAILEGANLKEA-----NLKEANLKEANLEEAFFEGAILAEANLERANLYGANLGE 857
Query: 158 ADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
A+L + ++ L ANL A L+ L R++L GA + GA A I+ A
Sbjct: 858 ANLEEAFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQWADIERA 913
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 34/106 (32%), Positives = 51/106 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A A+ + N RAN A++ E++ + GA LE+A +AN GA L
Sbjct: 828 ANLEEAFFEGAILAEANLERANLYGANLGEANLEEAFLAGANLEEAFLERANLKGAFLMG 887
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
++R L A L A L + R++L GA +E A F A ++ A
Sbjct: 888 AFLERANLKGAFLMGAFLQWADIERANLDGANLETASFYGANLERA 933
Score = 37.7 bits (86), Expect = 5.4, Method: Composition-based stats.
Identities = 26/94 (27%), Positives = 43/94 (45%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L +A + N + A A + ++ G+ GA+L+ A +AN GA+L
Sbjct: 863 AFLAGANLEEAFLERANLKGAFLMGAFLERANLKGAFLMGAFLQWADIERANLDGANLET 922
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
L ANL A LV +++ G I++
Sbjct: 923 ASFYGANLERANLERANLVGANFKDANVKGTILD 956
>gi|303289212|ref|XP_003063894.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454962|gb|EEH52267.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 124
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 65/138 (47%), Gaps = 32/138 (23%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL K + K + +RANF++ S+ SG G L A NFTGADLS+
Sbjct: 6 DLTKEFYTKGSMKRANFSN-----SNLSGVTLFGGDLSYA-----NFTGADLSN------ 49
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----------DFSDAVIDLAQKQALC-KY 216
AN+ L T+ T ++L GAI+ GA D++D ++ +C K
Sbjct: 50 ----ANIGQCNLTGTIFTNANLSGAIVSGANMDELGDITGSDWTDVIVRKDVNDKICAKG 105
Query: 217 ANGTNPITGVSTRKSLGC 234
+G NP+TG T +L C
Sbjct: 106 VSGENPVTGNPTAMTLFC 123
>gi|428221053|ref|YP_007105223.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994393|gb|AFY73088.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 270
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 58/105 (55%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ + +ADL + R+AN T+AD+ ++ + +GA L A AN + A+L
Sbjct: 121 TGSNLSNADLVYVNLENADLRQANLTNADLIYANLKNANLSGANLSGANLSGANLSDANL 180
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
D L+ + L+ ANL +A T+L R++L GA + GA F +A++
Sbjct: 181 EDALLHKAKLSNANLKSANFSGTILVRANLIGADLTGAIFKEAIL 225
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 2/113 (1%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A G + IG A DL + V N R N D++ +D + A + +
Sbjct: 63 ANLMGAYLIG--ANLSHVDLSGSNLVGANLRSINLNDTDLKGADLRETILRNARMARVNL 120
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+N + ADL ++ L +ANLTNA L+ L ++L GA + GA+ S A
Sbjct: 121 TGSNLSNADLVYVNLENADLRQANLTNADLIYANLKNANLSGANLSGANLSGA 173
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A + N + AN + A++ ++ SG+ + A LE A+ +KA + A+L
Sbjct: 138 ADLRQANLTNADLIYANLKNANLSGANLSGANLSGANLSDANLEDALLHKAKLSNANLK- 196
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AN + +LVR L +DL GAI + A A +
Sbjct: 197 ---------SANFSGTILVRANLIGADLTGAIFKEAILVHATM 230
>gi|434392029|ref|YP_007126976.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428263870|gb|AFZ29816.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 532
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 10/115 (8%)
Query: 102 AAQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
A Q +A+L + + NF A+ + AD+R++D SG+ GA L A
Sbjct: 310 ATQLNNANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANLT 369
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
GADLS+ + +LN A L NA++ +T LT +D A + GAD +A+ D
Sbjct: 370 GVELDGADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNATLTGADLKEAIGD 424
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 16/117 (13%)
Query: 103 AQFGSADLRKAV-HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT----- 156
A ADL++A+ NF AN A + F GS F A L KANFT
Sbjct: 411 ATLTGADLKEAIGDSLTNFTGANLNGASLEVGSFIGSNFTDAALRDTNLIKANFTDALFI 470
Query: 157 --------GADL-SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
GADL S T +D + +N + NA+LV LT+++ GA + GA+ S A+
Sbjct: 471 DGSDANSVGADLTSSTFIDGIAIN-GDFRNALLVNANLTKANFTGANLAGANLSGAI 526
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 73/170 (42%), Gaps = 28/170 (16%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS--------------- 127
LA LN +A+ G A+F A L + N NF+S
Sbjct: 263 LAGLNLADADLTG-------ARFNGAILNNFIGGDLNLSGVNFSSFVASNGQVFATQLNN 315
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A++ +S G+ F+ E A+ +GADL D + L ANL+ A L L
Sbjct: 316 ANLSDSQLIGANFSNVVAEDIFLENADLSGADLRDADLSGANLKGANLSGANLTGVELDG 375
Query: 188 SDLGGAIIEGADFSDAVID--LAQKQAL--CKYANGTNPITGVSTRKSLG 233
+DL A + GA + AV+D L QK L + N T +TG ++++G
Sbjct: 376 ADLSEANLAGAILNGAVLDNALVQKTDLTGADFTNAT--LTGADLKEAIG 423
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 5/113 (4%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
++ D+ DFS + + G + NF GAD+ + ++ ANL + V V +
Sbjct: 2434 DYAGIDISGQDFSNASYKGKDFTQV---NTNFEGADVRGVSFEDTSMDNANLKDIVAVGS 2490
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN--GTNPITGVSTRKSLGC 234
+S + +E DF+DA I + +C + GTNP TG TR SL C
Sbjct: 2491 YFGQSLVDVKTLENGDFTDATIPPKTLKLVCDREDVKGTNPTTGADTRDSLMC 2543
>gi|254409899|ref|ZP_05023679.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196182935|gb|EDX77919.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 478
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 19/151 (12%)
Query: 87 NKYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF---------------TSA 128
N EA RG F G+ A +ADL ++ NFR A F + A
Sbjct: 141 NLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLSCANLAGADLSGA 200
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
++R +D SG+ + A L +A AN TGADL+ + L A+LT A L+ +
Sbjct: 201 NLRWTDLSGANLSWANLSEAKLSGANLTGADLTHANLLNTSLVHADLTQARLIHADWIGA 260
Query: 189 DLGGAIIEGADFSD-AVIDLAQKQALCKYAN 218
DL GA + GA + + L + +C++ +
Sbjct: 261 DLTGATLTGAKLHGVSRVGLKTQGIVCEWVD 291
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 3/129 (2%)
Query: 83 LADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
L++ N A G + IG S A+ A L A K N +AN A++ +D G++
Sbjct: 37 LSEANLSVANLSGAYLIGTNLSRARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQ 96
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
A + +A +A +GA L++ + L EA L +A L R L+ ++L GA + GA+
Sbjct: 97 LTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRGAFVTGAN 156
Query: 200 FSDAVIDLA 208
A ++ A
Sbjct: 157 LEGANLNAA 165
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 66/139 (47%), Gaps = 8/139 (5%)
Query: 79 NISALADLNKYEAETRGEFGIGSA-AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 137
N++A+ L KY A + G A A +L A + N AN + A + ++ S
Sbjct: 2 NVTAI--LKKYAAGVKNFSGANLAEANLSGINLSGADLSEANLSVANLSGAYLIGTNLSR 59
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-- 195
++ N A L A KAN T A+L+ + R L A LT A ++R L R+ L GA +
Sbjct: 60 ARLNVARLSGANLTKANLTKANLNVANLIRADLGGAQLTQAAMIRAELIRAKLSGATLTE 119
Query: 196 ---EGADFSDAVIDLAQKQ 211
GAD +A + A+ Q
Sbjct: 120 ANLSGADLREAALRDAKLQ 138
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G A L +A ++ RA + A + E++ SG+ A L A +AN + A+L
Sbjct: 90 ADLGGAQLTQAAMIRAELIRAKLSGATLTEANLSGADLREAALRDAKLQRANLSEANLRG 149
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L ANL A L R+ L+ S+ A + A+ S
Sbjct: 150 AFVTGANLEGANLNAADLSRSDLSNSNFRHAEFKQANLS 188
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+A +RAN + A++R + +G+ GA L A +++ + ++
Sbjct: 120 ANLSGADLREAALRDAKLQRANLSEANLRGAFVTGANLEGANLNAADLSRSDLSNSNFRH 179
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L+ ANL A L L +DL GA + A+ S+A
Sbjct: 180 AEFKQANLSCANLAGADLSGANLRWTDLSGANLSWANLSEA 220
>gi|119490886|ref|ZP_01623169.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119453704|gb|EAW34863.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 517
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
I +AA ADLR+A + + R+AN SA++R++ S G L A +A+ G
Sbjct: 115 AILTAANLSEADLREATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRADLRG 174
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L + + + L++ANL+ A L L +DL GA + GA+ A
Sbjct: 175 ANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGANLEQA 220
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 10/110 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTS----------ADMRESDFSGSKFNGAYLEKAVAYK 152
A ADLR A V R+AN + A++R +D +G+ GA LE+A
Sbjct: 165 ADLTRADLRGANLVNAELRQANLSQANLSGANLKGANLRWADLNGADLRGANLEQARLSG 224
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A+ GADLS + L A+LT A L T ++L GA + GA D
Sbjct: 225 ASLYGADLSHASLLYTHLIHADLTQANLTGADWTGAELTGAALTGAKLYD 274
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SA+LR AV + N N AD+ +D G+ A L +A +AN +GA+L
Sbjct: 140 ANLKSANLRDAVLIASNLEGTNLHGADLTRADLRGANLVNAELRQANLSQANLSGANLKG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ LN A+L A L ++ L GA + GAD S A
Sbjct: 200 ANLRWADLNGADLRGA-----NLEQARLSGASLYGADLSHA 235
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
Query: 119 NFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
NF +A N + A++ ++ S +K N A L A AN TGA L+ + R L+ A
Sbjct: 36 NFSQAVLSITNLSGANLSGTNLSQAKLNVAKLSGANLSGANLTGAILNVANLIRADLSHA 95
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L NA +R+ L R+DL AI+ A+ S+A
Sbjct: 96 TLINASAIRSELIRADLSHAILTAANLSEA 125
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A+ N RA+ + A + + S+ A L A+ AN + ADL
Sbjct: 68 SGANLSGANLTGAILNVANLIRADLSHATLINASAIRSELIRADLSHAILTAANLSEADL 127
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + ++ L +ANL +A L VL S+L G + GAD + A
Sbjct: 128 REATLRQVDLRQANLKSANLRDAVLIASNLEGTNLHGADLTRA 170
>gi|428311554|ref|YP_007122531.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253166|gb|AFZ19125.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 411
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 68/133 (51%), Gaps = 9/133 (6%)
Query: 69 AAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA 128
A +VAS S+ + L D N ++A G A+ G A+LR A N A +
Sbjct: 65 ADLIVASLSA--ADLRDANLHDANLIG-------AKLGVANLRDADLSGANLSGAELSCT 115
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
D+ S+ +G+ +GA L KA +AN GA+LS T M L+ ANL A L L +
Sbjct: 116 DLTCSNLNGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGANLIEA 175
Query: 189 DLGGAIIEGADFS 201
DLGGA ++GA S
Sbjct: 176 DLGGANLQGAKLS 188
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A A+L KA + N + AN + +M +D SG+ GA L A +A+ GA+L
Sbjct: 123 NGAYISGANLIKAKLSRANLQGANLSVTNMIGADLSGANLQGANLGGANLIEADLGGANL 182
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ R L NL N+ L L+ S+L G + AD +
Sbjct: 183 QGAKLSRSNLAYVNLANSDLSNADLSDSNLAGTNLTNADLDN 224
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 11/137 (8%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
L +Y A R G A DLR N + + AD+R++D G+ GA
Sbjct: 7 LERYAAGERCLRGADLHGADLRGVDLRGIDLSDANLSDTDLSDADLRDADLIGANLRGAD 66
Query: 145 L----------EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
L A + AN GA L + L+ ANL+ A L T LT S+L GA
Sbjct: 67 LIVASLSAADLRDANLHDANLIGAKLGVANLRDADLSGANLSGAELSCTDLTCSNLNGAY 126
Query: 195 IEGADFSDAVIDLAQKQ 211
I GA+ A + A Q
Sbjct: 127 ISGANLIKAKLSRANLQ 143
>gi|428216610|ref|YP_007101075.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988392|gb|AFY68647.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 373
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 56/111 (50%), Gaps = 14/111 (12%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A L KA V+ N + AN + A++ E++ SG+ A L A+ AN +GA+LS
Sbjct: 189 AQLNKAYFVRANLQNANLSDANLTEANLSGADLREADLSGAILCGANLSGANLS------ 242
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
EANL A VL SDL GA + GAD +A + QA K AN
Sbjct: 243 ----EANLRTANFKAAVLIGSDLSGADLSGADLYNANL----SQADLKIAN 285
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 59/120 (49%), Gaps = 9/120 (7%)
Query: 76 CSSNISA--LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES 133
C +N+S L++ N A + IGS ADL A + AN + AD++ +
Sbjct: 232 CGANLSGANLSEANLRTANFKAAVLIGS--DLSGADLSGA-----DLYNANLSQADLKIA 284
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
+ S + + L A AY AN + ADLS ++ L ANL+ A LV+T ++LG A
Sbjct: 285 NLSLADLGASNLFGANAYGANLSLADLSMANLNDAFLYGANLSWANLVQTNFAGANLGAA 344
>gi|383312720|ref|YP_005365521.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
str. GAT-30V]
gi|378931380|gb|AFC69889.1| hypothetical protein MCE_05120 [Candidatus Rickettsia amblyommii
str. GAT-30V]
Length = 958
Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ SADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKSADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVI-DLAQKQALCKYA 217
+ + EAN NA++ R LT++D A++E AD ++A+ ++ KQA K A
Sbjct: 610 IAQNINAKEANFKNAIMQRADLTKADFTKAVLENADMQAVAAAEAIFKEVNLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 37.7 bits (86), Expect = 5.0, Method: Composition-based stats.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ +A L KA N A + + +E++ F A +++A KA+FT A L +
Sbjct: 589 AKLSNATLEKAEAEGLNISDAIAQNINAKEAN-----FKNAIMQRADLTKADFTKAVLEN 643
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
M + EA L + L ++L G EGADF A I+ A K
Sbjct: 644 ADMQAVAAAEAIFKEVNLKQANLKAANLAGINQEGADFDKAKINDATK 691
>gi|357146891|ref|XP_003574148.1| PREDICTED: thylakoid lumenal 17.4 kDa protein, chloroplastic-like
[Brachypodium distachyon]
Length = 227
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 7/128 (5%)
Query: 109 DLRKAVHVKE--NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
DLR + E N + ++A M ++ F G+ + KA A A+F G D ++ ++D
Sbjct: 104 DLRFCDYTNEKNNLKGKTLSAALMSDAKFDGADLTEVVMSKAYAVGASFKGTDFTNAVID 163
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGV 226
R +A+L A+ TVL+ S A ++ F D +I Q LC+ N
Sbjct: 164 RANFGKADLEGAIFKNTVLSGSTFDDANMKDVVFEDTIIGYIDLQKLCR-----NMSINE 218
Query: 227 STRKSLGC 234
R LGC
Sbjct: 219 DARLDLGC 226
>gi|428223553|ref|YP_007107650.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427983454|gb|AFY64598.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 521
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 60/128 (46%), Gaps = 11/128 (8%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
E +G G FG DLR+A + N + + AD+ ++ SG+ +GA L A
Sbjct: 5 ELLKRYGAGER-NFGGMDLREANLSRANLSHIDLSGADLSVANLSGANLSGADLRGARLN 63
Query: 152 KANFTGADLSDTLMDRMVLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFS 201
A +GA+LS + +LN ANL A L+R L R+DL A ++ AD
Sbjct: 64 VAKLSGANLSGANLSSCILNVANLVRADLTGANLNQAALIRAELMRADLKQATLDSADLG 123
Query: 202 DAVIDLAQ 209
A + AQ
Sbjct: 124 GAQLQEAQ 131
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 157
A F A+LR A + N RANF +A++R ++ S G+ +GA L A A G
Sbjct: 155 AVFDQANLRGADLNRANATRANFRNAELRLANLSEILLIGADLHGANLRWANLTGARLRG 214
Query: 158 ADLSDTLMDRMV-----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
ADL++ + L NLT+A L+ L+R++L G GA+ + A + A+
Sbjct: 215 ADLTEAKLSGAAIVGADLRNVNLTHASLIHADLSRANLIGTDWIGAELTGATLTGAKLHG 274
Query: 213 LCKY 216
+ +Y
Sbjct: 275 VSRY 278
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 74/164 (45%), Gaps = 30/164 (18%)
Query: 68 LAAAVVASCSSNISAL-------ADLNKYEAETRGEF-------GIGSAAQFGSADLRKA 113
L+ A ++SC N++ L A+LN+ A R E +A G A L++A
Sbjct: 72 LSGANLSSCILNVANLVRADLTGANLNQ-AALIRAELMRADLKQATLDSADLGGAQLQEA 130
Query: 114 VHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+ NF RAN F A + ++ F + GA L +A A +ANF A+L + +
Sbjct: 131 QLHQANFSRANLSEVNFHRATLADAVFDQANLRGADLNRANATRANFRNAELRLANLSEI 190
Query: 169 V----------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L ANLT A L LT + L GA I GAD +
Sbjct: 191 LLIGADLHGANLRWANLTGARLRGADLTEAKLSGAAIVGADLRN 234
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 5/112 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I + A ADL A + RA AD++++ + GA L++A ++ANF+ A
Sbjct: 81 ILNVANLVRADLTGANLNQAALIRAELMRADLKQATLDSADLGGAQLQEAQLHQANFSRA 140
Query: 159 DLSDTLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+LS+ R V ++ANL A L R TR++ A + A+ S+ ++
Sbjct: 141 NLSEVNFHRATLADAVFDQANLRGADLNRANATRANFRNAELRLANLSEILL 192
>gi|427728370|ref|YP_007074607.1| putative low-complexity protein [Nostoc sp. PCC 7524]
gi|427364289|gb|AFY47010.1| putative low-complexity protein [Nostoc sp. PCC 7524]
Length = 238
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 15/116 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F ADLR+ K N +A SAD+ ES G + A L +A+ +A+ TGA L
Sbjct: 34 TGADFSYADLRQTRLGKTNLSQACLQSADLSESILWGIDLSAADLYRAILREADLTGAKL 93
Query: 161 SDTLMD--RMV--------LNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFS 201
T ++ ++ LN ANL+N++L + L R+DLG A++ GAD S
Sbjct: 94 VKTRLESANLIKASLCGANLNGANLSNSLLFQADLRPSSNQRTDLGYAVLSGADLS 149
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +L++A N A+F+ AD+R++ + + A L+ A ++ G DLS
Sbjct: 18 FQRVNLQEAELTNVNLTGADFSYADLRQTRLGKTNLSQACLQSADLSESILWGIDLSAAD 77
Query: 165 MDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVI 205
+ R +L EA+LT A LV+T L ++ L GA + GA+ S++++
Sbjct: 78 LYRAILREADLTGAKLVKTRLESANLIKASLCGANLNGANLSNSLL 123
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 61/134 (45%), Gaps = 24/134 (17%)
Query: 97 FGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN- 154
+GI SAA A LR+A + SA++ ++ G+ NGA L ++ ++A+
Sbjct: 69 WGIDLSAADLYRAILREADLTGAKLVKTRLESANLIKASLCGANLNGANLSNSLLFQADL 128
Query: 155 --------------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS--------DLGG 192
+GADLS + L+ ANL A L R L ++ DL G
Sbjct: 129 RPSSNQRTDLGYAVLSGADLSYADLRATSLHHANLDRAKLCRANLGKTIQWGNLAADLTG 188
Query: 193 AIIEGADFSDAVID 206
A ++GAD S A +D
Sbjct: 189 ASLQGADLSYANLD 202
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 8/106 (7%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF--------TGAD 159
ADLR + + + + A + AD+ +D + + A L++A +AN AD
Sbjct: 126 ADLRPSSNQRTDLGYAVLSGADLSYADLRATSLHHANLDRAKLCRANLGKTIQWGNLAAD 185
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L+ + L+ ANL +A+L + L +DL GAI+ A+ A++
Sbjct: 186 LTGASLQGADLSYANLDSAILRKANLRGADLTGAILTDAELEGAIM 231
>gi|428305945|ref|YP_007142770.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247480|gb|AFZ13260.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 273
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 78/167 (46%), Gaps = 22/167 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADL + V N A+ SAD+ +D S + N AYL A AN
Sbjct: 123 SGASLLGADLSRINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLANLSNANL 182
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
TGA+LS + L+ A+L+NA L L ++L A + GAD S+AV +A +
Sbjct: 183 TGANLSGS-----ELHIADLSNANLSEAQLNSAELNNANLLGADLSNAVF----AEANLR 233
Query: 216 YANGT-NPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRD 261
N T N I+ + ++G G G+ +S +L P L DRD
Sbjct: 234 GTNLTSNQISSANLEGAIGLG------EGASASTVLD-QPTILEDRD 273
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S Q ADL V N +N A++R++D G A L +A A+ GADL
Sbjct: 78 SRVQLSGADL-----VDANLNSSNLIQANLRDTDMLGVDLREANLSEADLSGASLLGADL 132
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S R+ L ANL+NA L + +DL A + + +DA + LA
Sbjct: 133 S-----RINLVAANLSNAHLEGATMISADLSHADLSQTNINDAYLHLA 175
>gi|381206177|ref|ZP_09913248.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 210
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 71/145 (48%), Gaps = 6/145 (4%)
Query: 90 EAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
EA+ G +G+ + A L++A N AN + A++ E++ G+ G L
Sbjct: 42 EADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANLSEANLSEANLFGANLTGTNLT 101
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+A +A+ + ADLS+ + L+EAN + A L RT L ++L A + GAD A D
Sbjct: 102 EANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSA--D 159
Query: 207 LAQKQALCKYANGTNPITGVSTRKS 231
L + + Y N N + G RK+
Sbjct: 160 LREAVLVAAYLNEAN-LDGADMRKA 183
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 48/95 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL A + N AN + A+ +++ S + L+KA A+ ADL
Sbjct: 101 TEANLSEADLSWADLSEANLSEANLSEANFSKANLSRTNLRETNLQKADLRGADLRSADL 160
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
+ ++ LNEANL A + + L R+ +GGAI+
Sbjct: 161 REAVLVAAYLNEANLDGADMRKANLYRASMGGAIL 195
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 129 DMRESDFSGSKFNGAYL-----EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
D+ E+D GS GA L A +AN T A+LS+ + L+EANL A L T
Sbjct: 39 DLSEADLGGSLLMGATLISTNLTGAKLQEANLTNANLSEANLSEANLSEANLFGANLTGT 98
Query: 184 VLTRSDLGGAIIEGADFSDA 203
LT ++L A + AD S+A
Sbjct: 99 NLTEANLSEADLSWADLSEA 118
>gi|226365701|ref|YP_002783484.1| hypothetical protein ROP_62920 [Rhodococcus opacus B4]
gi|226244191|dbj|BAH54539.1| hypothetical protein [Rhodococcus opacus B4]
Length = 201
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 15/131 (11%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYL 145
+E R E I + F ADL ++ HV FR +FT ++ R F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 146 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 195
V + +FT GADL EANL L R VL +DL GGA +
Sbjct: 98 RPMVFDECDFTLVSLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKL 157
Query: 196 EGADFSDAVID 206
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|334120837|ref|ZP_08494914.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333455836|gb|EGK84476.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 197
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 59/111 (53%), Gaps = 6/111 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF----- 155
S A A+L++AV ++ N R A+ + AD+R +DF + GA A+ A+F
Sbjct: 42 SGANLAGANLQRAV-LRANLRGADLSGADLRGADFRNADLRGASFANALVRDASFGGAFL 100
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
TGA + + + + L A+L A L R +L +DL A + GAD S A ++
Sbjct: 101 TGASIGNLDLSGVDLRGADLRGAALARAILHSADLSNANLSGADLSGADLE 151
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 54/114 (47%), Gaps = 20/114 (17%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVAYK 152
A F +ADLR A R A+F A D+ D G+ GA L +A+ +
Sbjct: 73 ADFRNADLRGASFANALVRDASFGGAFLTGASIGNLDLSGVDLRGADLRGAALARAILHS 132
Query: 153 ANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
A+ + GADL + +++ VL ANLT A L+ ++ ++ GA+++
Sbjct: 133 ADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCAMIEQTLWDGALLD 186
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR A + A+ ++A++ +D SG+ A L AV AN TGA+L ++++
Sbjct: 118 ADLRGAALARAILHSADLSNANLSGADLSGADLEEAILNGAVLRGANLTGANLLCAMIEQ 177
Query: 168 MVLNEANLTNAVLVRTVLTR 187
+ + A L A L T L+R
Sbjct: 178 TLWDGALLDRACLQGTPLSR 197
>gi|443475539|ref|ZP_21065485.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019605|gb|ELS33670.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 222
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 64/118 (54%), Gaps = 15/118 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VAYK 152
A A+L + + + +F RA+ T A++ ++D S + A L KA +A
Sbjct: 88 ANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDSSIATD 147
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVI 205
ANF+ A +++T + +VL+ ANL+NA + + LT SDL GA GAD S++V+
Sbjct: 148 ANFSNAIVTETTLKSIVLSRANLSNADFSNSKMRNSRLTNSDLRGAKFGGADLSNSVM 205
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYK----- 152
A A+L A+ R AN + A++ E DFS + A L KA Y
Sbjct: 68 ANLSGANLSGALLNDSKLRGANLSRANLSEGILMGVDFSRADLTEANLSKADLYNSLLSS 127
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN T A+L + +D + +AN +NA++ T L L A + ADFS++
Sbjct: 128 ANLTKANLKSSTLDSSIATDANFSNAIVTETTLKSIVLSRANLSNADFSNS 178
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 16/120 (13%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ + AD+ D GS NGA L A N +GA L+D+ + L+ ANL+ +L+
Sbjct: 49 DLSGADLSYKDLYGSALNGANLSGA-----NLSGALLNDSKLRGANLSRANLSEGILMGV 103
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQ-----------KQALCKYANGTNPITGVSTRKSL 232
+R+DL A + AD ++++ A ++ AN +N I +T KS+
Sbjct: 104 DFSRADLTEANLSKADLYNSLLSSANLTKANLKSSTLDSSIATDANFSNAIVTETTLKSI 163
>gi|334117106|ref|ZP_08491198.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461926|gb|EGK90531.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 520
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
L KY A R GI + A +L A N AN + A++ +++ G+K N A
Sbjct: 7 LKKYAAGERNFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIAR 66
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
L A A+ T ADL+ + R+ L +A L A L+R L R++L GA + GA+ S A
Sbjct: 67 LSGAHLGGADLTDADLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANLSGAT 126
Query: 205 I 205
+
Sbjct: 127 L 127
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+KA+ + RA A++ ++ SG+ +GA L +A AN A+L +
Sbjct: 91 DLKKAILIGAKLIRAELIRAELSGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGA 150
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L EANL A L L+R+DL GA + G + A
Sbjct: 151 CLTEANLEQANLQGADLSRADLSGADLRGTELRQA 185
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A + R AN A++R + SG+ A LE+A A+ + ADL
Sbjct: 113 SGANLSGANLSGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRADL 172
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG-----ADFSDA 203
S + L +ANLT AVL L+ +L AI+ G AD S+A
Sbjct: 173 SGADLRGTELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEA 220
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 45/101 (44%), Gaps = 10/101 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A + N +AN AD+ +D SG+ G L +A +A +GADLS
Sbjct: 140 ANLRGAHLSGACLTEANLEQANLQGADLSRADLSGADLRGTELRQANLTQAVLSGADLSG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NL A+L L +DL A + GAD S A
Sbjct: 200 V----------NLRWAILSGCNLRWADLSEAKLSGADLSRA 230
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR R+AN T A + +D SG A L A+ + A L
Sbjct: 168 SRADLSGADLRGT-----ELRQANLTQAVLSGADLSGVNLRWAILSGCNLRWADLSEAKL 222
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S + R L ANL NA LV LT + L A GAD + A + A+ A+ + T
Sbjct: 223 SGADLSRADLCHANLLNASLVHADLTNAYLIRADWIGADLTGATLTGAKLHAVSRLGIKT 282
Query: 221 NPIT 224
+T
Sbjct: 283 EGMT 286
Score = 40.4 bits (93), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G ADL A + A D++++ G+K A L +A AN +GA+L
Sbjct: 68 SGAHLGGADLTDA-----DLNVAYLVRVDLKKAILIGAKLIRAELIRAELSGANLSGANL 122
Query: 161 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L +ANL A L LT ++L A ++GAD S A
Sbjct: 123 SGATLTEATLRGANLAQANLRGAHLSGACLTEANLEQANLQGADLSRA 170
Score = 37.4 bits (85), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 15/85 (17%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NF ++ E++ SG +GA L+ A AN +GA+LS T + LN A L+
Sbjct: 16 NFAGINLTEANLSGVNLSGANLKGANLSVANLSGANLSQTNLIGAKLNIARLS------- 68
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLA 208
GA + GAD +DA +++A
Sbjct: 69 --------GAHLGGADLTDADLNVA 85
>gi|282896932|ref|ZP_06304938.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
gi|281198341|gb|EFA73231.1| hglK (Pentapeptide repeat protein) [Raphidiopsis brookii D9]
Length = 689
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 58/105 (55%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AQ ADL A + + + + +++ ++++ G+ + +YL A ANF+ A+L
Sbjct: 536 SGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQGADLSESYLNHANLNSANFSAANL 595
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S +L AN+TNA L ++R+DL GA +EG DF A++
Sbjct: 596 SGA-----ILRSANMTNANLRNADISRADLRGANLEGTDFQGAIL 635
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 52/117 (44%), Gaps = 19/117 (16%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAY---------LEKAVAYKANFTGADLSDTLMDRM-- 168
+ AN A + S F +G + L KA +N + A+LS LM R+
Sbjct: 431 LKSANLNQASFKSSRFRSVGEDGRWDTYDDIIADLSKAQLKGSNLSSANLSRVLMSRVDL 490
Query: 169 ---VLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
VLN ANL N+ L+ R L SDL AI++ A + A I AQ Q YA
Sbjct: 491 SFSVLNRANLANSKLIGANLSRAQLVGSDLQQAILQDAILTGADISGAQLQEADLYA 547
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 65/131 (49%), Gaps = 14/131 (10%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSG 137
+ADL+K A+ +G + SA+L + + + + RAN ++ + ++ S
Sbjct: 462 IADLSK--AQLKG-------SNLSSANLSRVLMSRVDLSFSVLNRANLANSKLIGANLSR 512
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
++ G+ L++A+ A TGAD+S + L A L + + L+ S+L +G
Sbjct: 513 AQLVGSDLQQAILQDAILTGADISGAQLQEADLYAAQLARVSAIGSQLSHSNLTKTNWQG 572
Query: 198 ADFSDAVIDLA 208
AD S++ ++ A
Sbjct: 573 ADLSESYLNHA 583
>gi|307592031|ref|YP_003899622.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985676|gb|ADN17556.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 161
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 5/90 (5%)
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
+ AD+ E D +G K GA L KA Y AN +GA LS + L+ ANL+ + L
Sbjct: 46 HNCDLVEADLHEKDLAGVKLYGADLSKAKLYGANLSGASLSGANLSGASLSGANLSGSYL 105
Query: 181 VRT-----VLTRSDLGGAIIEGADFSDAVI 205
+ L +++L GA + GAD SDAV+
Sbjct: 106 QKANLKGAYLQKANLEGAALYGADLSDAVL 135
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 5/99 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL KA N A+ + A++ + SG+ +G+YL+KA N GA L ++
Sbjct: 68 ADLSKAKLYGANLSGASLSGANLSGASLSGANLSGSYLQKA-----NLKGAYLQKANLEG 122
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L A+L++AVL L + L GA +EGA A+ D
Sbjct: 123 AALYGADLSDAVLYGANLKGAKLKGANLEGAKTKGAIFD 161
>gi|119488469|ref|ZP_01621642.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
gi|119455280|gb|EAW36420.1| hypothetical protein L8106_23865 [Lyngbya sp. PCC 8106]
Length = 463
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 60/109 (55%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +ADL + + R A+ +SA + D + + A L +A +AN GA+L +
Sbjct: 119 ANFHNADLDAVNLISADLRGADLSSASLSWYDKVVANLSRADLTEANLSEANLCGANLLE 178
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
T + R LN+ANL +A L+RT+L SDL A + A DA ++ A+ Q
Sbjct: 179 TNLTRANLNKANLQDANLIRTILLESDLSLAELSNARLQDANLEGAKLQ 227
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 53/111 (47%), Gaps = 15/111 (13%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L KA N R +D+ ++ S ++ A LE A +AN TG +LS + R
Sbjct: 184 ANLNKANLQDANLIRTILLESDLSLAELSNARLQDANLEGAKLQQANLTGINLSRLNLAR 243
Query: 168 MVLNEANLTNAVLVRTV---------------LTRSDLGGAIIEGADFSDA 203
+ LN ANL NA L+ T L R++L A + GAD +DA
Sbjct: 244 VNLNRANLKNANLLETSFEGANLRIVNLNQANLIRANLSRASLIGADLTDA 294
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 55/105 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ +A L+ A ++AN T ++ + + N A L+ A + +F GA+L
Sbjct: 207 SLAELSNARLQDANLEGAKLQQANLTGINLSRLNLARVNLNRANLKNANLLETSFEGANL 266
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+++ L ANL+ A L+ LT ++L GA +E A+F AV+
Sbjct: 267 RIVNLNQANLIRANLSRASLIGADLTDANLYGANLENAEFLGAVM 311
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSA----DMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
A A+L + ++ N AN TSA D+R++ S + + A L +A + + TGA
Sbjct: 40 ADLTDANLNETKLMRANLSHANLTSANLRGDLRQATLSYATLSEADLGRAKLHGVDLTGA 99
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+L+ + LN ANL A L +L A + GAD S A
Sbjct: 100 NLTGANLTGASLNHANLKQANFHNADLDAVNLISADLRGADLSSA 144
>gi|440681606|ref|YP_007156401.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678725|gb|AFZ57491.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 943
Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats.
Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 13/108 (12%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
+FG A ADL +A NF R N + A M ++FS + FN A L +A +AN
Sbjct: 808 DFG---GANLSHADLSRANLNCANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANL 864
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
TGADLS ++ L+ A+L+ A +L GA +E A+FS A
Sbjct: 865 TGADLSSADLNYADLSLADLSGA----------NLSGANLEDANFSGA 902
Score = 47.0 bits (110), Expect = 0.008, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 41/72 (56%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F A+L +A ++ N A+ +SAD+ +D S + +GA L A ANF+GA L
Sbjct: 845 SEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSGANLSGANLEDANFSGAKL 904
Query: 161 SDTLMDRMVLNE 172
S+ L+ + +E
Sbjct: 905 SNGLLGDICWDE 916
Score = 45.1 bits (105), Expect = 0.031, Method: Composition-based stats.
Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 79 NISALADLNKYEAETRGE-FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE----- 132
+ + L D+ Y GE F + +LR A + +F AN + AD+
Sbjct: 767 DTTQLRDIINYSNCLSGENFNQIVGSFLSGTNLRGADLSEVDFGGANLSHADLSRANLNC 826
Query: 133 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
++FS + +GAY+ A +A F A+L + R L A+L++A L L+ +DL G
Sbjct: 827 ANFSRTNCSGAYMISANFSEALFNHANLHEANFIRANLTGADLSSADLNYADLSLADLSG 886
Query: 193 AIIEGADFSDA 203
A + GA+ DA
Sbjct: 887 ANLSGANLEDA 897
>gi|428215879|ref|YP_007089023.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004260|gb|AFY85103.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 284
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 62/128 (48%), Gaps = 26/128 (20%)
Query: 101 SAAQFGSADLRKAVHVKE------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
S A+L++A H+ E N RAN + D+ E+D SG AN
Sbjct: 133 SGINLSGANLQEA-HIAEVSFHNANLSRANLSGLDLSETDLSG---------------AN 176
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+ ADLSDT + +L ANLT A+L L + + G++++GAD S A + + A
Sbjct: 177 LSYADLSDTQLTEAILYGANLTGAILTSAQLDGAKMNGSLVDGADLSQANL----QDAEV 232
Query: 215 KYANGTNP 222
K+ + TN
Sbjct: 233 KWVDLTNA 240
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 54/105 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S +QF SA L+ A V+ N + +AD+R +D S + G+ L +A + N TGA+L
Sbjct: 43 SHSQFCSAILQGATLVEANLEQTKLRAADLRRADLSHANLMGSDLSRADMIETNLTGANL 102
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ ++ +E +A L R L +L G + GA+ +A I
Sbjct: 103 EQANLTEVIFSEVIFADANLSRANLQGLNLSGINLSGANLQEAHI 147
>gi|428219623|ref|YP_007104088.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427991405|gb|AFY71660.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 172
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 54/101 (53%), Gaps = 5/101 (4%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR A N R A +A +R SD +G+ A L A AN TGADL+ M+
Sbjct: 69 DLRGA-----NLRGAFLKNARLRGSDLTGADLRDATLTGAYFTGANLTGADLAGAEMEWA 123
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L +ANL +A L L+RSDL GA ++GAD A + A+
Sbjct: 124 NLRDANLQDANLQDANLSRSDLDGANLDGADLRGANLSRAK 164
>gi|194337742|ref|YP_002019536.1| pentapeptide repeat-containing protein [Pelodictyon
phaeoclathratiforme BU-1]
gi|194310219|gb|ACF44919.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 408
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 5/98 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A+ K N R+++FT S +G+ G++++ AV +AN GA+L
Sbjct: 96 ADLKGANLTMALIKKANLRKSDFTG-----SSLTGANLQGSFMKGAVLREANLEGANLRW 150
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+++ LN ANLT A L L +DL GA ++ A F
Sbjct: 151 AMLENGDLNRANLTGATLFEANLAGADLKGANLKNAHF 188
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 48/87 (55%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+F +A+ A+M+ + G+ GA L++A A+ + ++LS+ L+ L ANL+ A
Sbjct: 282 DFHKADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLSNALLYGAKLGNANLSGA 341
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
L L +DL GA +EGA+ A I
Sbjct: 342 NLEGASLFEADLEGANLEGANLKGANI 368
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLT 176
R + + ++ +G+ F+ A L KA A GADL +DR L A NL+
Sbjct: 265 RTRVEQSSFQNTNMAGADFHKADLHKAEMKSAKLQGADLQGANLDRAFLKGADLSNSNLS 324
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
NA+L L ++L GA +EGA +A ++
Sbjct: 325 NALLYGAKLGNANLSGANLEGASLFEADLE 354
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 44/96 (45%), Gaps = 20/96 (20%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL+ A N A A++R+SDF+GS + TGA+L + M
Sbjct: 96 ADLKGA-----NLTMALIKKANLRKSDFTGS---------------SLTGANLQGSFMKG 135
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
VL EANL A L +L DL A + GA +A
Sbjct: 136 AVLREANLEGANLRWAMLENGDLNRANLTGATLFEA 171
>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 179
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 49/95 (51%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+ A N AN +AD+ E++ G+ GA L+ A K N GA+L +
Sbjct: 65 DLQNANLQGANLEGANLQNADLEEANLQGANLAGANLQGADLEKGNLAGANLQTANLINA 124
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L EANL NA L L R+DL A + GA+ ++A
Sbjct: 125 DLEEANLQNANLQGASLQRADLEKANLTGANTNEA 159
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 5/79 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL K N + AN +AD+ E++ + GA L++A KAN TGA+
Sbjct: 97 AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASLQRADLEKANLTGANT 156
Query: 161 SDTLMDRMVLNEANLTNAV 179
++ L ANL NA+
Sbjct: 157 NEA-----NLQGANLENAI 170
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 68/143 (47%), Gaps = 9/143 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A S +LR+ + KE N + D++ ++ G+ GA L+ A +AN GA+L
Sbjct: 38 STAPEASTELRRLLDTKE-CAGCNLSGVDLQNANLQGANLEGANLQNADLEEANLQGANL 96
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ + L + NL A L L +DL A ++ A+ A + ++A + AN
Sbjct: 97 AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASL----QRADLEKAN-- 150
Query: 221 NPITGVSTRKSLGCGNSRRNAYG 243
+TG +T ++ G + NA G
Sbjct: 151 --LTGANTNEANLQGANLENAIG 171
>gi|428318916|ref|YP_007116798.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428242596|gb|AFZ08382.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 568
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 76/158 (48%), Gaps = 13/158 (8%)
Query: 59 NWRVFVSTALAAAVVASCSSNISALA----DLNKYEAETRGEFGIGSAAQFGS----ADL 110
NW F L A + S+ LA D K A E + A+ G+ A+L
Sbjct: 103 NWAAFPEADLGGANLQRVKSDQINLAGAKLDGAKLMAAELMEANLNRASLVGANLTGANL 162
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
R+A V+ N R A ++ E+D +G++ A L A ++A GADL++ ++D L
Sbjct: 163 REAHLVEANLRSAILLGVNLIEADLNGAQMRSANLAGADLHRAVLAGADLTEAVLDNADL 222
Query: 171 NEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDA 203
+ ANL + L+ + +L R++L G + AD S+A
Sbjct: 223 SRANLAGSYLLKASFQKALLLRANLQGVYLLRADLSEA 260
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 59/136 (43%), Gaps = 30/136 (22%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM----------RESDFSGSKFNGAYLEKAVA 150
S A ADLR A A F AD+ + + +G+K +GA L A
Sbjct: 83 SGANLAKADLRLACLEAAELNWAAFPEADLGGANLQRVKSDQINLAGAKLDGAKLMAAEL 142
Query: 151 YKANF-----TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA------------ 193
+AN GA+L+ + L EANL +A+L+ L +DL GA
Sbjct: 143 MEANLNRASLVGANLTGANLREAHLVEANLRSAILLGVNLIEADLNGAQMRSANLAGADL 202
Query: 194 ---IIEGADFSDAVID 206
++ GAD ++AV+D
Sbjct: 203 HRAVLAGADLTEAVLD 218
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 56/127 (44%), Gaps = 15/127 (11%)
Query: 103 AQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
A A+LR A+ + N R AN AD+ + +G+ A L+ A +
Sbjct: 165 AHLVEANLRSAILLGVNLIEADLNGAQMRSANLAGADLHRAVLAGADLTEAVLDNADLSR 224
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS-----DAVIDL 207
AN G+ L + +L ANL L+R L+ ++L A + AD S DA++
Sbjct: 225 ANLAGSYLLKASFQKALLLRANLQGVYLLRADLSEANLRSADLRKADLSGAYLMDAMLGE 284
Query: 208 AQKQALC 214
A +A C
Sbjct: 285 ADLRAAC 291
>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 300
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 67/126 (53%), Gaps = 12/126 (9%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSG 137
L+D N EA+ G + A+LR A + R + TSA++ E+DF+
Sbjct: 93 LSDANLVEADLSGSMLV-------EANLRGANLSRGKVRDVDLTSANLSDGFFIETDFTR 145
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
S+ + +++A +A TG +LS + ++++ L+ A+L NAVLV +T SDL A G
Sbjct: 146 SQMVRSKMQRAFLGRATLTGTNLSWSNLEKVNLDNADLQNAVLVDVDITSSDLVAANFSG 205
Query: 198 ADFSDA 203
AD DA
Sbjct: 206 ADLRDA 211
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 46/104 (44%), Gaps = 29/104 (27%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA F ADLR A + N R A+ T AD+R GA L ++ N TG
Sbjct: 200 AANFSGADLRDADLSEVNLRNADLTGADLR----------GARL----SFSQNMTG---- 241
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L NA+L L ++L GA IE ADFS A I
Sbjct: 242 -----------STLNNAILHSANLIGTNLNGADIEQADFSGAKI 274
>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 215
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 54/107 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A K N AN + AD+ ESD S + GA L A A+ +GADL
Sbjct: 15 ANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQNADLSGADLRS 74
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ R L+EANL +A L L +DL GA + GA+ A + +A
Sbjct: 75 ADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIAN 121
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 46/81 (56%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ + AD+R ++ S + +GA L+KA AN + ADLS++ + L A L NA L
Sbjct: 5 ADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNATLQN 64
Query: 183 TVLTRSDLGGAIIEGADFSDA 203
L+ +DL A + AD S+A
Sbjct: 65 ADLSGADLRSADLFRADLSEA 85
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
++AD+ +D G+ + A L+ A KAN GA+LS+ + L+ A+L A L
Sbjct: 2 LSNADLSGADLRGANLSEANLDGATLDKANLMGANLSEADLSESDLSSADLPGATLHNAT 61
Query: 185 LTRSDLGGAIIEGADF 200
L +DL GA + AD
Sbjct: 62 LQNADLSGADLRSADL 77
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 32/57 (56%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
S A SADL +A + N R A+ +SAD+R +D G+K GA L A AN TG
Sbjct: 68 SGADLRSADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIANVTG 124
>gi|332707026|ref|ZP_08427086.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354291|gb|EGJ33771.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 239
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 51/105 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F AD +A N AN A + + F +K A L+ A N GADL
Sbjct: 78 SGVDFSRADFSQANLSDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLKGADL 137
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
SD + + L A+L+ A+L RT L +DL GA +E AD S A I
Sbjct: 138 SDANLLNIRLANADLSTAILNRTELREADLTGANMEHADLSHASI 182
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 88/197 (44%), Gaps = 16/197 (8%)
Query: 27 LHALSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCS---SNISAL 83
LH LS V+ + + S P AK K + + T C ++S L
Sbjct: 21 LH-LSSQQQVSISLKTNDNSSKTIQHPKAKTKVLELMLKTK----TCVECDLMGIDLSGL 75
Query: 84 ADLNKYEAETRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
DL+ + +R +F S + +A+L+ A + F A TSAD+ +DF +
Sbjct: 76 -DLSGVDF-SRADFSQANLSDSNLENANLKDAKVIGARFENAKLTSADLDGADFKDTNLK 133
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
GA L A ADLS +++R L EA+LT A + L+ + + GAI+ A+ +
Sbjct: 134 GADLSDANLLNIRLANADLSTAILNRTELREADLTGANMEHADLSHASIYGAILREANLT 193
Query: 202 DAVIDLAQKQALCKYAN 218
A + +A +YAN
Sbjct: 194 GANL----YKANLRYAN 206
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA-----YKANFTGA 158
+ +ADL A+ + R A+ T A+M +D S + GA L +A YKAN A
Sbjct: 146 RLANADLSTAILNRTELREADLTGANMEHADLSHASIYGAILREANLTGANLYKANLRYA 205
Query: 159 DLSDTLMDRMVLNEANLTNAVL 180
+L D ++ L A+L AV+
Sbjct: 206 NLQDAVLKGTNLKGADLQFAVM 227
>gi|6226483|sp|Q52118.1|YMO3_ERWST RecName: Full=Uncharacterized protein in mobD 3'region
gi|886362|gb|AAA69501.1| unknown [Plasmid pSW200]
Length = 295
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 155
S A +ADL++A N A+ T+A++ ++D +GA L A +AY +A+
Sbjct: 170 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 229
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ A+LS+ + R L++ANL++A L L R+DL AI++GA+
Sbjct: 230 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 274
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANF 155
S A A+L A N AN T A + E+D S + +GA L A + N
Sbjct: 90 SDADLSDANLSDANLSGANLAHANLTMAYLSEADLSNANLSGADLTNANLNQTDLPNVNL 149
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+GA+L+ + L+EA+L+NA L L R+DL A + GAD ++A ++
Sbjct: 150 SGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLN 200
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DL N AN T A + E+D S + + A L++A AN +GADL++
Sbjct: 137 ANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTN 196
Query: 163 TLMDRM----------VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+++ L ANLT A L L+ ++L A ++ AD SDA
Sbjct: 197 ANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANLSNADLKRADLSDA 247
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 50/106 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A + AN D+ + SG+ A L A +A+ + A+LS+
Sbjct: 117 AYLSEADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANLSN 176
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ R L+ ANL+ A L L ++DL + GA+ + A + +A
Sbjct: 177 ADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMA 222
>gi|393766611|ref|ZP_10355166.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
gi|392727929|gb|EIZ85239.1| pentapeptide repeat-containing protein [Methylobacterium sp. GXF4]
Length = 448
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 177
A F A MR +D SG+ +GA +A + A+F+GAD DT+ +D L +ANLT+
Sbjct: 133 ARFGQAAMRFADLSGALLDGASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTH 192
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A LT++ L G+ + GA F+ A +D
Sbjct: 193 ADFEGASLTKASLAGSRLRGAKFTGAKLD 221
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 74/182 (40%), Gaps = 15/182 (8%)
Query: 66 TALAAAVVASCSSNISAL----ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 121
TALAA A + L ADL++ E A A+LR+A R
Sbjct: 41 TALAAGGTAPADAESGGLPLAEADLSRARIEE---------ADLSGANLRRASLTGAVGR 91
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
F A + E+D S + +GA VA + F A L D + + A+L+ A+L
Sbjct: 92 STRFVGAILEETDLSEADMSGADFTGIVAGQVKFASAMLEDARFGQAAMRFADLSGALLD 151
Query: 182 RTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNP-ITGVSTRKSLGCGNSRR 239
+DL GA GAD D V D +A AN T+ G S K+ G+ R
Sbjct: 152 GASFAEADLWGADFSGADADDTVFRDARLDEAKLADANLTHADFEGASLTKASLAGSRLR 211
Query: 240 NA 241
A
Sbjct: 212 GA 213
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 5/97 (5%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A L +A N A+F A + ++ +GS+ GA A A+ +GADLSDT
Sbjct: 175 FRDARLDEAKLADANLTHADFEGASLTKASLAGSRLRGAKFTGAKLDGADLSGADLSDTD 234
Query: 165 MDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIE 196
+ R+ L A A L T ++ LGGA+ E
Sbjct: 235 LVRLNLATCRLRHARFAGAWLNGTRMSVEQLGGAVGE 271
>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 204
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 13/131 (9%)
Query: 88 KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSA----------DMRESD 134
K A RG G+ A F +ADLR A+ + R A+F A D+ D
Sbjct: 63 KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGAIFNNLDLSGID 122
Query: 135 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
F G+ G L KA ++A + A+LS + L EANL+ AVL T L S+L A
Sbjct: 123 FRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEANLSGAVLRGTNLQSSNLLCAS 182
Query: 195 IEGADFSDAVI 205
+E AD + ++
Sbjct: 183 VEQADLTGTLL 193
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 11/110 (10%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
F A+L+KA ++ N R A+FT AD+R +DF + GA L A +A+F GA L+
Sbjct: 53 NFAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASFAGAFLNGA 111
Query: 164 LMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + + L+ NL+ A L R L+ ++L GA + AD +A
Sbjct: 112 IFNNLDLSGIDFRGADLRGVNLSKANLFRAELSNANLSGADLSSADLEEA 161
>gi|428314781|ref|YP_007150965.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256164|gb|AFZ22121.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 237
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 5/96 (5%)
Query: 105 FGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F ADL A + N +AN + A ++R++D +G+K A L A A+ TGA+
Sbjct: 130 FQGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATLVGAHMTGAN 189
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
L+ + VL A+LT AVL++T L +DL AI+
Sbjct: 190 LTGANFNNAVLRYADLTKAVLIKTNLKGADLSLAIM 225
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 57/110 (51%), Gaps = 10/110 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANF 155
S ADLR + RAN + ++++ SG++ N A L + A + +F
Sbjct: 76 SGLDLSGADLRNT-----DLSRANLKNTKLKDAKMSGARLNQANLTYADLDGADFQECDF 130
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GADLS+ + L +ANL+ A L RT L +DL GA +E A+ S+A +
Sbjct: 131 QGADLSNAQLLNTNLAKANLSMATLNRTELRDADLTGAKLESANLSNATL 180
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 58/123 (47%), Gaps = 4/123 (3%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A + L+ A +AN T AD+ +DF F GA L A N A+L
Sbjct: 91 SRANLKNTKLKDAKMSGARLNQANLTYADLDGADFQECDFQGADLSNAQLLNTNLAKANL 150
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S ++R L +A+LT A L L+ + L GA + GA+ + A + A+ +YA+ T
Sbjct: 151 SMATLNRTELRDADLTGAKLESANLSNATLVGAHMTGANLTGANFN----NAVLRYADLT 206
Query: 221 NPI 223
+
Sbjct: 207 KAV 209
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 12/85 (14%)
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
D+RE + SG +GA L +AN L D M LN+ANLT A
Sbjct: 69 DLREINLSGLDLSGADLRNTDLSRANLKNTKLKDAKMSGARLNQANLTYA---------- 118
Query: 189 DLGGAIIEGADFSDAVIDLAQKQAL 213
DL GA + DF A DL+ Q L
Sbjct: 119 DLDGADFQECDFQGA--DLSNAQLL 141
>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 332
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 7/125 (5%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 148
Y A+ RG I ADLR A +K N R AN ++RE+D G+ +GA L A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201
Query: 149 VAYKANFTGADLSDTLM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ N GA+L ++ +R +L+EA+LT L V+ L A + G + S A
Sbjct: 202 FLTEVNLMGANLRGAILKNVKLERAILSEADLTGVNLQGAVMPDVRLSKAQVSGGNLSFA 261
Query: 204 VIDLA 208
++ A
Sbjct: 262 RLNRA 266
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 5/98 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + RANF R+++ G++ +GA L +A N + ADL +
Sbjct: 40 ADLHGATLIFAYLSRANF-----RKANLVGTRLSGANLNQAWLSGVNLSNADLHGASLQS 94
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L ANLT A L+ L +DL GA + GAD + A +
Sbjct: 95 ADLRSANLTLASLLDANLMDADLRGANLSGADLTGACL 132
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 52/118 (44%), Gaps = 20/118 (16%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGS--------------------KFNGAYLEK 147
A+LR A+ RA + AD+ + G+ + N A L +
Sbjct: 211 ANLRGAILKNVKLERAILSEADLTGVNLQGAVMPDVRLSKAQVSGGNLSFARLNRADLSR 270
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+AN + +DL + + R L ANL+NA L R L+ ++L GA ++GA D I
Sbjct: 271 TNLREANLSDSDLIEAYLARTNLMGANLSNANLTRAELSTTNLMGANLQGATMPDGRI 328
>gi|428302093|ref|YP_007140399.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238637|gb|AFZ04427.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 146
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 71/143 (49%), Gaps = 17/143 (11%)
Query: 66 TALAAAVVASCSSNISALADLN---KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 122
TAL A + S ISA AD+ ++ ETR + + +LR A N +
Sbjct: 7 TALTIASTITLSLPISAQADMKSDVQHLLETRECY---------ACNLRGA-----NLKG 52
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ AD+R ++ G+ GA LE A A+ A+LS ++ LN ANLTNA L
Sbjct: 53 AHLIGADLRNANLKGANLAGANLEGADLTGADLEEANLSYAFVNSTSLNYANLTNANLSN 112
Query: 183 TVLTRSDLGGAIIEGADFSDAVI 205
L ++L GA++ GAD + A I
Sbjct: 113 AHLYSAELDGAVMVGADLAGADI 135
>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 305
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 25/137 (18%)
Query: 82 ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK- 139
A+ DL NKY+A R S + DLR + NF+ A+F+ A++RE DFSG+
Sbjct: 6 AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61
Query: 140 ----FN---------------GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
FN G+YL KA K N A+LS + L+++NLTNA L
Sbjct: 62 SEAFFNEADLTGANLQEANLQGSYLMKAYLMKTNLQSANLSKAYLTGAYLSKSNLTNANL 121
Query: 181 VRTVLTRSDLGGAIIEG 197
L S L GA + G
Sbjct: 122 TGAYLNGSKLNGADLTG 138
>gi|452962545|gb|EME67671.1| hypothetical protein H261_22313 [Magnetospirillum sp. SO-1]
Length = 542
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 8/100 (8%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+LRKAV N R N A + ++D SG+K GA L A +ANF+GA++ R
Sbjct: 54 ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFSGANM------R 107
Query: 168 MV-LNEANLTNAVLVRTV-LTRSDLGGAIIEGADFSDAVI 205
M L ANL + + V LT ++L GA + GA+FS A +
Sbjct: 108 MANLAGANLAGKMDLSGVDLTGANLAGAKLMGANFSGATL 147
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 38/61 (62%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN A + ++FSG+ GA L A A ANFTGADL+D + +L+ AN++ AV+ R
Sbjct: 130 ANLAGAKLMGANFSGATLAGANLAGADARNANFTGADLTDAVTAGALLDGANMSGAVIRR 189
Query: 183 T 183
T
Sbjct: 190 T 190
>gi|254417634|ref|ZP_05031369.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196175575|gb|EDX70604.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 470
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADLR A N A AD+R +D G+K N A L+ A AN +GA+LS
Sbjct: 214 ANLVSADLRNA-----NLTDAQLEVADIRSADLRGAKLNNANLDTVNADSANLSGANLS- 267
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ + A+ A+LVRT L + L G+ + AD + A + AQ
Sbjct: 268 ----QAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQ 310
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 69/158 (43%), Gaps = 22/158 (13%)
Query: 59 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 118
N + V T L AV+ + I+ L N A+ +G IG F A+L KA
Sbjct: 277 NGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKG---IG----FNRANLTKANLEGA 329
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL----------SDTLMDRM 168
+ A AD+ + +G+ + AYL A AN +G DL S+ +
Sbjct: 330 DLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQLREANLSNVTLVGA 389
Query: 169 VLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFS 201
L +ANL T A L T LTR DL GA + GAD S
Sbjct: 390 TLEDANLIRSTLTGANLTYTNLTRCDLRGANLTGADLS 427
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 72/159 (45%), Gaps = 15/159 (9%)
Query: 48 GQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGS 107
G FP A + NW V T L +N+ ADL Y A G + S A
Sbjct: 149 GYFP---AFIANWYAAVVTDLR-------DTNLQG-ADL--YRANLDG--ALLSRANLQD 193
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A L A V+ R A T+A++ +D + A LE A A+ GA L++ +D
Sbjct: 194 AQLDYANLVRTYLREATLTNANLVSADLRNANLTDAQLEVADIRSADLRGAKLNNANLDT 253
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ + ANL+ A L + +T +D GAI+ +AV++
Sbjct: 254 VNADSANLSGANLSQAYITNADFNGAILVRTTLREAVLN 292
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 5/110 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A +AD A+ V+ R A NF AD+ +++ G++ G +A KAN
Sbjct: 267 SQAYITNADFNGAILVRTTLREAVLNGSNFQIADLTQANLQGAQLKGIGFNRANLTKANL 326
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GADL++ + L A LT A+L L + L A + G D A +
Sbjct: 327 EGADLTNAKLAIADLTNAQLTGAILHSAYLHSATLANANLSGVDLQGAQL 376
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A SA L A N + A +RE++ S GA LE A ++ TGA+L
Sbjct: 347 TGAILHSAYLHSATLANANLSGVDLQGAQLREANLSNVTLVGATLEDANLIRSTLTGANL 406
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADFSDA 203
+ T + R L ANLT A L LT+++ A++ +GA+ SDA
Sbjct: 407 TYTNLTRCDLRGANLTGADLSYANLTQANFSQAVLMDASFQGANLSDA 454
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 23/122 (18%)
Query: 103 AQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYL-----EKAVAYK 152
A SADLR A N AN + A++ ++ + + FNGA L +AV
Sbjct: 234 ADIRSADLRGAKLNNANLDTVNADSANLSGANLSQAYITNADFNGAILVRTTLREAVLNG 293
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD---AVIDLAQ 209
+NF ADL+ +ANL A L R++L A +EGAD ++ A+ DL
Sbjct: 294 SNFQIADLT----------QANLQGAQLKGIGFNRANLTKANLEGADLTNAKLAIADLTN 343
Query: 210 KQ 211
Q
Sbjct: 344 AQ 345
>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 479
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL ++ N RA+ T A +RE++ G +F GA L++A KAN GA+L
Sbjct: 60 SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANLVGANL 119
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+EANLT A L L S L GAI++ A +++ I
Sbjct: 120 ----------HEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 48/100 (48%), Gaps = 10/100 (10%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
SADLR + + AN AD+RE+DF+G A AN +GADL
Sbjct: 335 NLSSADLRGVDLTRADLSGANLRDADLRETDFTG----------ATLLFANLSGADLRGV 384
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L+ A L A L + L R +L GA + AD SDA
Sbjct: 385 DLTKADLSGAKLNEADLRKADLMRVNLEGADLTEADLSDA 424
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 15/103 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 155
S A ADLR+ AN + AD+R ++D SG+K N A L KA + N
Sbjct: 352 SGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGAKLNEADLRKADLMRVNL 411
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
GADL+ EA+L++A L R L ++L G ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 51/97 (52%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR A ++ + AN + AD+ ES + + A L AV +AN G + + + +
Sbjct: 49 LRYADLIEADLSGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQAS 108
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L +ANL A L LTR++L GA + G+ S A++D
Sbjct: 109 LIKANLVGANLHEANLTRANLSGADLRGSQLSGAILD 145
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
R AN D+RE++ SG+ A L +A AN +GADL+++ LN ANLT A
Sbjct: 29 LRGANLRGTDLRETNLSGAMLRYADLIEADLSGANLSGADLAESF-----LNLANLTRAD 83
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVI 205
L VL ++L G GA+ A +
Sbjct: 84 LTGAVLREANLVGVEFTGANLKQASL 109
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + N +N TS +++ D S + AYL+ A + GADLS
Sbjct: 274 ADLNGSDLSGANLSASNLTSVNLKNVDLSRASLKKAYLKGANLEGTDLRGADLSGA---- 329
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+L++ NL++A L LTR+DL GA + AD +
Sbjct: 330 -ILHQVNLSSADLRGVDLTRADLSGANLRDADLRE 363
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 157
A DLR A + N +SAD+R D + + +GA L A + +FTG
Sbjct: 314 ANLEGTDLRGADLSGAILHQVNLSSADLRGVDLTRADLSGANLRDADLRETDFTGATLLF 373
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+LS + + L +A+L+ A L L ++DL +EGAD ++A
Sbjct: 374 ANLSGADLRGVDLTKADLSGAKLNEADLRKADLMRVNLEGADLTEA 419
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 63/156 (40%), Gaps = 31/156 (19%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV----------- 149
+ A A L KA V N AN T A++ +D GS+ +GA L+KAV
Sbjct: 100 TGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFPEDI 159
Query: 150 ---AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
A A N DL++ + L NL A+L L R++L GA
Sbjct: 160 DPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLAGAN 219
Query: 195 IEGADFSDA-----VIDLAQKQALCKYANGTNPITG 225
+ AD +A +++ A ++ G +P G
Sbjct: 220 LSAADLREASLSGTILEKAVYSNKTLFSEGIDPALG 255
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 49/106 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
++ + DL +A K + AN D+R +D SG+ + L A + T ADL
Sbjct: 292 TSVNLKNVDLSRASLKKAYLKGANLEGTDLRGADLSGAILHQVNLSSADLRGVDLTRADL 351
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S + L E + T A L+ L+ +DL G + AD S A ++
Sbjct: 352 SGANLRDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGAKLN 397
>gi|383482351|ref|YP_005391265.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
85-930]
gi|378934705|gb|AFC73206.1| hypothetical protein MCI_01270 [Rickettsia montanensis str. OSU
85-930]
Length = 959
Score = 53.9 bits (128), Expect = 6e-05, Method: Composition-based stats.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 10/118 (8%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ---KQALCKYAN 218
+ + EAN NA++ R LT++D A++E AD ++ A+ K+A+ K AN
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKADFTKALLENADMQ--AVEAAEAIFKEAILKQAN 665
Score = 38.1 bits (87), Expect = 3.8, Method: Composition-based stats.
Identities = 28/96 (29%), Positives = 44/96 (45%), Gaps = 10/96 (10%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------AN 154
F ADL+K+ + RA D+ E++ + SKFN + A A K +N
Sbjct: 404 FLFADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIIKNSEWKNSN 463
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
TG L+ M R+ + L NA+L + + +DL
Sbjct: 464 LTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDL 499
Score = 37.7 bits (86), Expect = 4.9, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R F AD+++S S + AY+ K +A T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNVGFLFADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVM 443
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M +++++ + N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKLIIKNSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|443651776|ref|ZP_21130709.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159027471|emb|CAO89436.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334417|gb|ELS48929.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 931
Score = 53.9 bits (128), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 2/117 (1%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
G A+L +A + + AN A++ ++ G+ GA L A +AN GA+L
Sbjct: 789 LGGANLERANLAEADIGGANLEGANLEGANLKGANLEGANLAMAFLKRANLEGANLRGAN 848
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
++ L ANL A L R L ++L GA + GA+ A +D A + Y G N
Sbjct: 849 LEEAYLEGANLAMAFLKRANLEGANLRGANLYGANLKGANLDWANLEG--AYLEGAN 903
Score = 45.1 bits (105), Expect = 0.030, Method: Composition-based stats.
Identities = 42/129 (32%), Positives = 56/129 (43%), Gaps = 5/129 (3%)
Query: 90 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 149
E E IG A G A+L A N AN A ++ ++ G+ GA LE+A
Sbjct: 795 ERANLAEADIGGANLEG-ANLEGANLKGANLEGANLAMAFLKRANLEGANLRGANLEEAY 853
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
AN A L ++ L ANL A L L ++L GA +EGA+ +D A
Sbjct: 854 LEGANLAMAFLKRANLEGANLRGANLYGANLKGANLDWANLEGAYLEGANLRGVFLDGAN 913
Query: 210 KQALCKYAN 218
KYAN
Sbjct: 914 ----FKYAN 918
>gi|46202237|ref|ZP_00053526.2| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
magnetotacticum MS-1]
Length = 542
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 8/100 (8%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+LRKAV N R N A + ++D SG+K GA L A +ANF+GA++ R
Sbjct: 54 ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFSGANM------R 107
Query: 168 MV-LNEANLTNAVLVRTV-LTRSDLGGAIIEGADFSDAVI 205
M L ANL + + V LT ++L GA + GA+FS A +
Sbjct: 108 MANLAGANLAGKMDLSGVDLTGANLAGAKLMGANFSGATL 147
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 38/61 (62%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN A + ++FSG+ GA L A A ANFTGADL+D + +L+ AN++ AV+ R
Sbjct: 130 ANLAGAKLMGANFSGATLAGANLAGADARNANFTGADLTDAVTAGTLLDGANMSGAVIRR 189
Query: 183 T 183
T
Sbjct: 190 T 190
>gi|325106774|ref|YP_004267842.1| pentapeptide repeat-containing protein [Planctomyces brasiliensis
DSM 5305]
gi|324967042|gb|ADY57820.1| pentapeptide repeat protein [Planctomyces brasiliensis DSM 5305]
Length = 194
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 72/159 (45%), Gaps = 16/159 (10%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
+K + E RAN + AD+ E+D G+ +GA L +A +A+ GADLS + L
Sbjct: 14 QKWLKGDEGGERANLSEADLSEADLRGADLSGANLSEADLSEADLRGADLSGANLSWANL 73
Query: 171 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYANGTNPIT 224
+ ANL+ A L L+ +DL A + GAD S A + A +A+ + G I
Sbjct: 74 SWANLSEADLSGANLSEADLSEADLRGADLSGANLRGANLSGANLSEAVARLDFGAWSIC 133
Query: 225 GVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGF 263
S+GC R + + L P D DGF
Sbjct: 134 VRKDVTSIGCRTYRNDRW-------LEWTPD---DVDGF 162
>gi|443476809|ref|ZP_21066696.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443018179|gb|ELS32476.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 330
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 68/131 (51%), Gaps = 12/131 (9%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L+ +N A G IG+ F +L +A + N + AN AD++ ++ + G
Sbjct: 183 LSRVNLQGANLSGAIAIGTI--FTEVNLSQANLTEVNLKGANLMKADLKNANLRLANLFG 240
Query: 143 AYLEKA---VAYKAN-------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
A L KA +A +N TG+DLS +L+DR L++A+L +A LVR L +DL
Sbjct: 241 ANLSKANLSMATLSNAGLIQAILTGSDLSRSLLDRANLSQASLVDAYLVRANLDGADLSN 300
Query: 193 AIIEGADFSDA 203
AI+ A+ S A
Sbjct: 301 AILTRAELSGA 311
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 11/136 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 157
A ++DL K + RR NF A + + F SG+K GA L +A+ AN T
Sbjct: 25 ANLFNSDLIGINLTKADLRRTNFVFAYLNKVTFNHANLSGAKLGGATLNQAIMMSANLTE 84
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
ADL ++ R+ L ANL+ A L+ L+ +DL + GA+ A++ AL +
Sbjct: 85 ADLHGAMLQRVNLFGANLSLANLMDANLSEADLRSVNLRGANLRCAIL----SAALMREE 140
Query: 218 NGTNP--ITGVSTRKS 231
G P + G + RK+
Sbjct: 141 RGYPPTNMVGANLRKA 156
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
V N R+A+ A++ SD +G +GA L +A + N GA+LS + + E NL
Sbjct: 149 VGANLRKADLRGANLSGSDLTGVDLSGANLSEATLSRVNLQGANLSGAIAIGTIFTEVNL 208
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ A LT +L GA + AD +A + LA
Sbjct: 209 SQA-----NLTEVNLKGANLMKADLKNANLRLAN 237
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F S +L A + N T AD+R ++F F AYL K AN +GA L
Sbjct: 17 FASLNLANANLFNSDLIGINLTKADLRRTNFV---F--AYLNKVTFNHANLSGAKLGGAT 71
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+++ ++ ANLT A L +L R +L GA + A+ DA
Sbjct: 72 LNQAIMMSANLTEADLHGAMLQRVNLFGANLSLANLMDA 110
>gi|344171276|emb|CCA83758.1| hypothetical protein, Pentapeptide repeat domains [blood disease
bacterium R229]
Length = 325
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 51/103 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A A + AD+ +D SG+ +GAYL A AN +GADL
Sbjct: 52 SGADLSGADLSGAYLSGAYLSGAYLSDADLSGADLSGADLSGAYLSGAYLSGANLSGADL 111
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L+ A+L+ A L L+ + L GA + GAD S A
Sbjct: 112 SGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGA 154
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A A + AD+ +D SG+ +GAYL A AN +GADL
Sbjct: 122 SGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADLSGAYLSGAYLSSANLSGADL 181
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S ANL+ A L L+ +DL GA + GA+ S A +
Sbjct: 182 SG----------ANLSGANLSGAYLSSADLSGANLSGANLSGAYL 216
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A + A+ + AD+ + SG+ +GAYL A A+ +GADL
Sbjct: 102 SGANLSGADLSGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADLSGADLSGADL 161
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L+ ANL+ A L L+ ++L GA + AD S A
Sbjct: 162 SGAYLSGAYLSSANLSGADLSGANLSGANLSGAYLSSADLSGA 204
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L A + AN + AD+ +D SG+ +GAYL A A +GADL
Sbjct: 92 SGAYLSGAYLSGANLSGADLSGANLSGADLSGADLSGADLSGAYLSGAYLSGAYLSGADL 151
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + L+ A L+ A L L+ +DL GA + GA+ S A +
Sbjct: 152 SGADLSGADLSGAYLSGAYLSSANLSGADLSGANLSGANLSGAYL 196
>gi|83310097|ref|YP_420361.1| hypothetical protein amb0998 [Magnetospirillum magneticum AMB-1]
gi|82944938|dbj|BAE49802.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
AMB-1]
Length = 542
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 8/100 (8%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+LRKAV N R N A + ++D SG+K GA L A +ANF+GA++ R
Sbjct: 54 ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFSGANM------R 107
Query: 168 MV-LNEANLTNAVLVRTV-LTRSDLGGAIIEGADFSDAVI 205
M L ANL + + V LT ++L GA + GA+FS A +
Sbjct: 108 MANLAGANLAGKMDLSGVDLTGANLAGAKLMGANFSGATL 147
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 38/61 (62%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN A + ++FSG+ GA L A A ANFTGADL+D + +L+ AN++ AV+ R
Sbjct: 130 ANLAGAKLMGANFSGATLAGANLAGADARNANFTGADLTDAVTAGALLDGANMSGAVIRR 189
Query: 183 T 183
T
Sbjct: 190 T 190
>gi|411117186|ref|ZP_11389673.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410713289|gb|EKQ70790.1| putative low-complexity protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 544
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +LR+A + N R AN T A++R +D SG+ + A L A AN TG +L
Sbjct: 173 SGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNL 232
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALCKYAN 218
S + +L A+LT A L+ SDL GA + GA + + + LC++ +
Sbjct: 233 SYANLLGTILVHADLTRASLIGADWAGSDLSGATLTGAKLHGVLRFGVKTEGILCEWVD 291
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
S+DL A R+AN + A++R ++ +G+ A L A A+ +GA LS
Sbjct: 167 LSSSDLSGADLSYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGAN 226
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
+ + L+ ANL +LV LTR+ L GA G+D S A + A+ + ++ T I
Sbjct: 227 LTGVNLSYANLLGTILVHADLTRASLIGADWAGSDLSGATLTGAKLHGVLRFGVKTEGI 285
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 3/135 (2%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A DL+ A + N AN + A ++ + F+ + + A L ++ +GADL
Sbjct: 118 SFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANLSQANLHGTDLSSSDLSGADL 177
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-CKYAN- 218
S T + + L+ ANL A L L +DL GA + AD S A + A + YAN
Sbjct: 178 SYTELRQANLSRANLRGANLTGANLRWADLSGADLSWADLSGARLSGANLTGVNLSYANL 237
Query: 219 -GTNPITGVSTRKSL 232
GT + TR SL
Sbjct: 238 LGTILVHADLTRASL 252
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 9/123 (7%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 155
S A +L KA N AN +++ E++ S +K N GA L KA +AN
Sbjct: 23 SEANLSGVNLSKANLNGANLSVANLCGSNLSEANLSKAKLNVAKLSGANLSKANLEEANL 82
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
A+L+ + L +A+L A + R L+ ++L A + G D DA + +QA
Sbjct: 83 NVANLTLADLSHAELRQASLVRAEMARAELSEANLSFANLSGVDLKDAKL----RQANLS 138
Query: 216 YAN 218
+AN
Sbjct: 139 HAN 141
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 62/131 (47%), Gaps = 15/131 (11%)
Query: 76 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM----- 130
C SN+S A+L+K + + A+ A+L KA + N AN T AD+
Sbjct: 48 CGSNLSE-ANLSKAKL---------NVAKLSGANLSKANLEEANLNVANLTLADLSHAEL 97
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
R++ ++ A L +A AN +G DL D + + L+ AN++ A L T ++L
Sbjct: 98 RQASLVRAEMARAELSEANLSFANLSGVDLKDAKLRQANLSHANISRASLKWATFTSANL 157
Query: 191 GGAIIEGADFS 201
A + G D S
Sbjct: 158 SQANLHGTDLS 168
>gi|158341150|ref|YP_001522487.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311391|gb|ABW33002.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 150
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 49/84 (58%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F +AN T+A + + F G+ F A L+ A AN +GA+L + + +L ANLT A
Sbjct: 20 FAKANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTGAD 79
Query: 180 LVRTVLTRSDLGGAIIEGADFSDA 203
L ++L R+ L AI++GA+ DA
Sbjct: 80 LRSSILCRAVLTDAILQGANLRDA 103
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 55/105 (52%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A L A + +F++AN +A + ++ SG+ A L A+ AN TGADL
Sbjct: 23 ANLTNAILHGATFIGTSFQQANLQAAGLISANLSGANLKEANLTNALLTTANLTGADLRS 82
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
+++ R VL +A L A L L +D A + GAD S A ++L
Sbjct: 83 SILCRAVLTDAILQGANLRDADLRETDFKNADLTGADLSGAKVNL 127
>gi|20090742|ref|NP_616817.1| hypothetical protein MA1892 [Methanosarcina acetivorans C2A]
gi|19915798|gb|AAM05297.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans
C2A]
Length = 560
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 54/106 (50%), Gaps = 10/106 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 157
A A+LR K N R + + AD+RE+D SG +GA L A +AN G
Sbjct: 389 ANLSGANLRGTNLSKANLREVDLSGADLREADLSGVDLSGANLSGADLSGVDLSRANLNG 448
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADL+ + R LNEANL+ +T L +DL A + GA S+A
Sbjct: 449 ADLNGIDLRRANLNEANLS-----KTNLNEADLSKAKLSGAYLSEA 489
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 5/91 (5%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
+A+ V N A+ + +D+R++ + N A L KA KAN + ADL M R L+
Sbjct: 272 QALLVINNLIGADLSESDLRDAFLHEAHLNEADLSKANLSKANLSEADLKGAYMRRANLS 331
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
EANL+ A L + DL GA + GAD ++
Sbjct: 332 EANLSKAKL-----SGVDLSGANLSGADLNE 357
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 157
A DLR+A N AN + ++ E+D S +K +GAYL +A A G
Sbjct: 449 ADLNGIDLRRA-----NLNEANLSKTNLNEADLSKAKLSGAYLSEAKLKGAKLKGAYMRK 503
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+LS+ ++ L EANL+ A L L+ DL GA + G +
Sbjct: 504 ANLSEADLNGADLREANLSEANLNGVDLSVIDLRGANLNGVNI 546
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 49/115 (42%), Gaps = 13/115 (11%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL + RAN AD+ D + N A L K +A+ + A L
Sbjct: 427 SGANLSGADLSGV-----DLSRANLNGADLNGIDLRRANLNEANLSKTNLNEADLSKAKL 481
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA--------VIDL 207
S + L A L A + + L+ +DL GA + A+ S+A VIDL
Sbjct: 482 SGAYLSEAKLKGAKLKGAYMRKANLSEADLNGADLREANLSEANLNGVDLSVIDL 536
>gi|300867251|ref|ZP_07111911.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334728|emb|CBN57077.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 520
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 9/117 (7%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
ADLN+ A+ RG +A+LR+A + N A+ A++R +D +G+ GA
Sbjct: 165 ADLNR--ADLRG-------VNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLTGA 215
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L++A AN GA+LS + +L A+LT A L+R +DL GA + GA
Sbjct: 216 DLDEARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKL 272
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A LR + V N RAN AD+ +D G + A L +A +AN +GADL
Sbjct: 140 ADLSGAHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 209
+ LN A+LT A L L+ ++L GA + A+ +A++ DL Q
Sbjct: 200 ANLRWADLNGADLTGADLDEARLSGANLYGANLSSANLLNAILVHADLTQ 249
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 47/80 (58%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + A++ + +F +K N A L A +AN +GA L+ + R LN A+L+ A L+R
Sbjct: 46 NMSGANLSDVNFRKAKLNVARLSGANLSRANLSGAILNVANLIRADLNSADLSEATLIRA 105
Query: 184 VLTRSDLGGAIIEGADFSDA 203
L R+D+ A + GA+ S+A
Sbjct: 106 ELIRADMSNASLSGANLSEA 125
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 5/127 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL +A N A A++ +++ SG+ GA L A A+ TGADL +
Sbjct: 160 ANLHRADLNRADLRGVNLSNAELRQANLSQANLSGADLRGANLRWADLNGADLTGADLDE 219
Query: 163 TLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
+ L+ ANL NA+LV LT+++L A GAD + A + A+ + ++
Sbjct: 220 ARLSGANLYGANLSSANLLNAILVHADLTQANLIRADWVGADLTGAALTGAKLYGVSRFG 279
Query: 218 NGTNPIT 224
+ IT
Sbjct: 280 LKADDIT 286
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADL +A + RA ADM + SG+ + A L + +AN ADLS
Sbjct: 90 ADLNSADLSEATLI-----RAELIRADMSNASLSGANLSEADLREGTLRQANLEQADLSG 144
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L ANL A L R L R+DL G + A+ A
Sbjct: 145 AHLRGSSLVSANLERANLHRADLNRADLRGVNLSNAELRQA 185
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 50/108 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F A L A N RAN + A + ++ + N A L +A +A AD+
Sbjct: 53 SDVNFRKAKLNVARLSGANLSRANLSGAILNVANLIRADLNSADLSEATLIRAELIRADM 112
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S+ + L+EA+L L + L ++DL GA + G+ A ++ A
Sbjct: 113 SNASLSGANLSEADLREGTLRQANLEQADLSGAHLRGSSLVSANLERA 160
>gi|300023195|ref|YP_003755806.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
51888]
gi|299525016|gb|ADJ23485.1| pentapeptide repeat protein [Hyphomicrobium denitrificans ATCC
51888]
Length = 282
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 3/113 (2%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
FG+ + + F ADL A+ N + F R +D SG+ +GA L +A + F+
Sbjct: 149 FGVFAGSNFAGADLTDAISAPLN--KTGFIEYIWR-TDLSGANLSGAQLTRANMTQTRFS 205
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
A L D + +L EA+L+ AVL L+ +DL GA + GAD + A +D A+
Sbjct: 206 FAVLRDASLHDTILREADLSGAVLTGADLSGADLTGADLSGADVTGANLDGAK 258
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 10/71 (14%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-----LMDRMVL-----N 171
R + +++ D +G F GA L K+ + A+ +GADLS T ++DR+ L +
Sbjct: 51 RPVLSGKNLQGLDLAGLDFKGADLSKSDLFGADLSGADLSSTNLSGAMLDRVTLIAARLD 110
Query: 172 EANLTNAVLVR 182
ANL NA L+R
Sbjct: 111 GANLDNASLMR 121
>gi|193083812|gb|ACF09494.1| pentapeptide repeat protein [uncultured marine crenarchaeote
SAT1000-23-F7]
Length = 741
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 61/110 (55%), Gaps = 13/110 (11%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NFR +NFTS ++ ++F+ +GA L + TGADL + L+ A+L+N
Sbjct: 495 NFRESNFTSTNIANANFTSVNLSGADLSMKDLTENILTGADLRNA-----NLSGADLSNN 549
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA----VID---LAQKQALCKYANGTN 221
LV T+LT +DL AI+ GAD S A +ID + QK L K AN TN
Sbjct: 550 QLVNTILTGADLTDAILSGADLSTANIFGIIDGINILQKTKL-KGANFTN 598
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 14/114 (12%)
Query: 99 IGSAAQFGSAD----LRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLEKAV 149
+ +A FG D L+K NF AN T+ D+ E+ G+ G LEKA
Sbjct: 571 LSTANIFGIIDGINILQKTKLKGANFTNANLTNINLIGVDISETILKGADLTGVKLEKAK 630
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+N DLS + ++ L ++NL+ RT+L+ +DL A + GA+ SDA
Sbjct: 631 VNNSNLEDLDLSFKNLSKIRLVDSNLS-----RTILSGADLSNAELMGANLSDA 679
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 15/119 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A DL + + + R AN + AD+ + + GA L A+ +GADL
Sbjct: 517 SGADLSMKDLTENILTGADLRNANLSGADLSNNQLVNTILTGADLTDAI-----LSGADL 571
Query: 161 SD----------TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
S ++ + L AN TNA L L D+ I++GAD + ++ A+
Sbjct: 572 STANIFGIIDGINILQKTKLKGANFTNANLTNINLIGVDISETILKGADLTGVKLEKAK 630
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 36/69 (52%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + N A+ T A + + G+K GA L A ANF ADL+ ++
Sbjct: 664 ADLSNAELMGANLSDADLTGAKLIGAKLIGAKLIGANLTNANLTGANFHMADLTGANLEG 723
Query: 168 MVLNEANLT 176
+++NE NL+
Sbjct: 724 VIINETNLS 732
>gi|428771470|ref|YP_007163260.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428685749|gb|AFZ55216.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 195
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 60/118 (50%), Gaps = 21/118 (17%)
Query: 103 AQFGSADLRKAVHVK---ENFRRA------------NFTSADMRESDFSGSKFNGAYLEK 147
A ADLR A+ + EN A + +AD+R +D G GA L+K
Sbjct: 77 ADLRGADLRGAILLSSQVENISLAGSFLAGAILTNLDLCNADLRGADLRGVNLVGACLQK 136
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A AN +GADLS + L EANL+ A+L T LT+++L AI+EG F D VI
Sbjct: 137 ADLSNANLSGADLS-----QADLEEANLSGAILHGTNLTQANLLCAIVEGVSF-DYVI 188
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 58/119 (48%), Gaps = 8/119 (6%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L KA+ ++ N R A+ T A ++ +D G+ GA L + + G+ L+ ++
Sbjct: 53 ANLEKAI-LRCNLRGADLTGASLQGADLRGADLRGAILLSSQVENISLAGSFLAGAILTN 111
Query: 168 MVLNEANLTNA-----VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
+ L A+L A LV L ++DL A + GAD S A DL + +GTN
Sbjct: 112 LDLCNADLRGADLRGVNLVGACLQKADLSNANLSGADLSQA--DLEEANLSGAILHGTN 168
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 54/123 (43%), Gaps = 13/123 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADLR A + R A S+ + +GS GA L A+ GADL
Sbjct: 70 TGASLQGADLRGA-----DLRGAILLSSQVENISLAGSFLAGAILTNLDLCNADLRGADL 124
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQALCKYA 217
+ L +A+L+NA L L+++DL E A+ S A++ +L Q LC
Sbjct: 125 RGVNLVGACLQKADLSNANLSGADLSQADL-----EEANLSGAILHGTNLTQANLLCAIV 179
Query: 218 NGT 220
G
Sbjct: 180 EGV 182
>gi|424851694|ref|ZP_18276091.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
gi|356666359|gb|EHI46430.1| pentapeptide repeat-containing protein [Rhodococcus opacus PD630]
Length = 194
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYL 145
+E R E I + F ADL ++ HV FR +FT ++ R F GS+F+ L
Sbjct: 31 SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 90
Query: 146 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 195
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 91 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 150
Query: 196 EGADFSDAVID 206
+GAD A ID
Sbjct: 151 DGADLRGARID 161
>gi|428218432|ref|YP_007102897.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990214|gb|AFY70469.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 403
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A L +A + +AN T A + ++D S GAYL A +AN GA
Sbjct: 9 ANLTNASLTRADLKGVDLVKANLTGASLSDADLSQVNLTGAYLNGADLNRANLAGA---- 64
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L+EANL A L+R L R+ L AI+ GA+F +A +
Sbjct: 65 ------ILDEANLAAAFLIRANLQRASLNEAILAGANFHEASL 101
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 58/119 (48%), Gaps = 15/119 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-------------- 148
A ADL+ VK N A+ + AD+ + + +G+ NGA L +A
Sbjct: 14 ASLTRADLKGVDLVKANLTGASLSDADLSQVNLTGAYLNGADLNRANLAGAILDEANLAA 73
Query: 149 -VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+AN A L++ ++ +EA+LT A L L+ +DL GA + GA+ SDA ++
Sbjct: 74 AFLIRANLQRASLNEAILAGANFHEASLTGANLRSADLSLADLAGADLAGANLSDACMN 132
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 10/108 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 152
A A L +A+ NF A+ T A++R +D S G+ + A + A +
Sbjct: 79 ANLQRASLNEAILAGANFHEASLTGANLRSADLSLADLAGADLAGANLSDACMNSAFFIE 138
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
AN GADLS T + L +ANL+ A L LT +DL A + GA+
Sbjct: 139 ANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSHATMTGAEL 186
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A S +++ A+ V+ N AN +A++ ++ + NGA L +A +AN +GA L
Sbjct: 302 SGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSGASL 361
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
+ + L ANL L+ L ++L GAI+
Sbjct: 362 TGANLKGAFLLWANLKGTFLLWANLDEANLTGAIL 396
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 48/94 (51%), Gaps = 10/94 (10%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR A K N AN SAD+ +D S + GA L ++ + TGA+L D+
Sbjct: 151 LRGASLAKANLSGANLRSADLTGADLSHATMTGAEL-----HQVDLTGANL-----DQTN 200
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LN A+L NA L L+R++LG A + G +A
Sbjct: 201 LNAADLVNASLDGAFLSRANLGWANLIGTTMKEA 234
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 12/120 (10%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL----- 145
A G F +G A A+L A N AN + AD+ ++ G+ +GA L
Sbjct: 259 ANLTGAFLMG--ANLNGANLNGA-----NLTNANLSGADLSNTNLMGTSLSGADLSSTEM 311
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ A+ + N GA+L++ + L +ANL A L L R++L GA + GA+ A +
Sbjct: 312 KGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRANLSGASLTGANLKGAFL 371
Score = 43.9 bits (102), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A + N A+ + A++ + G+ NGA L A AN +GADLS+
Sbjct: 234 ANLVGADLSWANLNEVNLAGADLSWANLTGAFLMGANLNGANLNGANLTNANLSGADLSN 293
Query: 163 TLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVID 206
T + L+ A+L++ A+LVRT L ++L A + GA+ A ++
Sbjct: 294 TNLMGTSLSGADLSSTEMKGAILVRTNLNGANLANANLTGANLEQANLN 342
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 15/92 (16%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
++AN T+A + +D G KAN TGA LSD L++ NLT A
Sbjct: 6 LKKANLTNASLTRADLKGVDL----------VKANLTGASLSDA-----DLSQVNLTGAY 50
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
L L R++L GAI++ A+ + A + A Q
Sbjct: 51 LNGADLNRANLAGAILDEANLAAAFLIRANLQ 82
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G A+L + N A+ + A++ E + +G+ + A L A AN GA+
Sbjct: 217 SRANLGWANLIGTTMKEANLVGADLSWANLNEVNLAGADLSWANLTGAFLMGANLNGAN- 275
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LN ANLTNA L L+ ++L G + GAD S
Sbjct: 276 ---------LNGANLTNANLSGADLSNTNLMGTSLSGADLS 307
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 47/101 (46%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A F AN AD+ + G+ A L A A+ TGADLS
Sbjct: 119 ADLAGANLSDACMNSAFFIEANLLGADLSLTSLRGASLAKANLSGANLRSADLTGADLSH 178
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
M L++ +LT A L +T L +DL A ++GA S A
Sbjct: 179 ATMTGAELHQVDLTGANLDQTNLNAADLVNASLDGAFLSRA 219
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A L KA N R A+ T AD+ + +G++ + L A + N ADL + +D
Sbjct: 154 ASLAKANLSGANLRSADLTGADLSHATMTGAELHQVDLTGANLDQTNLNAADLVNASLDG 213
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
L+ ANL A L+ T + ++L GA + A+ ++
Sbjct: 214 AFLSRANLGWANLIGTTMKEANLVGADLSWANLNE 248
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 5/107 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A + N AN A++ ++ SG+ + L + +GADLS
Sbjct: 254 ADLSWANLTGAFLMGANLNGANLNGANLTNANLSGADLSNTNL-----MGTSLSGADLSS 308
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
T M +L NL A L LT ++L A + GA+ +A ++ A
Sbjct: 309 TEMKGAILVRTNLNGANLANANLTGANLEQANLNGANLGEANLNRAN 355
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 15/116 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKA--------- 148
A ADL A + + T A++ ++ D + +GA+L +A
Sbjct: 169 ADLTGADLSHATMTGAELHQVDLTGANLDQTNLNAADLVNASLDGAFLSRANLGWANLIG 228
Query: 149 -VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+AN GADLS ++ + L A+L+ A L L ++L GA + GA+ ++A
Sbjct: 229 TTMKEANLVGADLSWANLNEVNLAGADLSWANLTGAFLMGANLNGANLNGANLTNA 284
>gi|67459256|ref|YP_246880.1| hypothetical protein RF_0864 [Rickettsia felis URRWXCal2]
gi|67004789|gb|AAY61715.1| Uncharacterized low-complexity protein [Rickettsia felis URRWXCal2]
Length = 959
Score = 53.9 bits (128), Expect = 7e-05, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT++D A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAQEANFKNAIMQRADLTKADFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 43.9 bits (102), Expect = 0.085, Method: Composition-based stats.
Identities = 28/111 (25%), Positives = 54/111 (48%), Gaps = 5/111 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L+ AV N R A F AD++ S S + AY+ K +A T + +
Sbjct: 382 ANFEGANLQNAVFQNVNARNAGFLFADLKNSKIENSDMSRAYMPKVDLSEAEVTNSKFNA 441
Query: 163 TLM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+M +++++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 442 VMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
Score = 39.7 bits (91), Expect = 1.6, Method: Composition-based stats.
Identities = 30/106 (28%), Positives = 47/106 (44%), Gaps = 10/106 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK---------- 152
A F ADL+ + + RA D+ E++ + SKFN + A A K
Sbjct: 402 AGFLFADLKNSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIIKDSEWKN 461
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
+N TG L+ M R+ + L NA+L + + +DL A + A
Sbjct: 462 SNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDLENAFMNNA 507
>gi|163797895|ref|ZP_02191839.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
gi|159176857|gb|EDP61425.1| pentapeptide repeat family protein [alpha proteobacterium BAL199]
Length = 396
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 47/95 (49%), Gaps = 10/95 (10%)
Query: 106 GSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 165
G+AD + A N +FT AD+RE DF+G+ GA A +A GADLS
Sbjct: 15 GAADGQPASFANANLFGFDFTGADLREVDFAGASLQGARFVGADLTRAVLVGADLSGVSF 74
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
VL EA+LT A LV GA+ EGAD
Sbjct: 75 RNAVLLEADLTGARLV----------GAVFEGADL 99
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 76/163 (46%), Gaps = 17/163 (10%)
Query: 55 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLR 111
A L+ R FV L AV+ + + + EA+ G +G+ A S LR
Sbjct: 47 ASLQGAR-FVGADLTRAVLVGADLSGVSFRNAVLLEADLTGARLVGAVFEGADLRSVSLR 105
Query: 112 KAVHV------KENFRR--ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
A V +E+ RR F A M ++ +G+KF E V + + TGA+L
Sbjct: 106 GASGVSAEPVTEESPRREAVTFAGARMHRANLTGAKF-----ENVVLAQTDLTGANLERA 160
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ R ++ A L NA+L+ L+ +DL +++ GAD S A +D
Sbjct: 161 SLRRASMSGAVLRNAILIDADLSHADLTDSLVTGADLSGAQLD 203
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ T AD+R + SG+ GA L +AV A ADLS + L NL+ A L
Sbjct: 270 DLTDADLRSLNLSGADLRGAVLRRAVLTDALLVLADLSGADLTLASLARCNLSGANLAGA 329
Query: 184 VLTRSDLGGAIIEGAD-FSDAVIDLAQKQ 211
L+R+DL AI+ A S A D ++Q
Sbjct: 330 NLSRADLTDAILTAAPILSQAGADTGRRQ 358
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
RAN T A + + GA LE+A +A+ +GA L + ++ L+ A+LT+++
Sbjct: 132 MHRANLTGAKFENVVLAQTDLTGANLERASLRRASMSGAVLRNAILIDADLSHADLTDSL 191
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVI-DLAQKQALCKYANGTNPITGVST 228
+ L+ + L GA +E A+F A + D+ + A T P V+T
Sbjct: 192 VTGADLSGAQLDGATVERANFVGARLRDVDLSRVDTSKARLTPPTDSVTT 241
>gi|443317576|ref|ZP_21046968.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442782825|gb|ELR92773.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 303
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G DL +A+ V+ N R++ + ++ +++ + + G L +A +ANFT A+L
Sbjct: 99 ADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFTEANLRR 158
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ R L +ANL TR++L A + ADFSDA++
Sbjct: 159 VELQRAQLGKANL----------TRANLADARMLHADFSDAIL 191
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 66/141 (46%), Gaps = 22/141 (15%)
Query: 66 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANF 125
T L+ A++ + N S L+ +N ++A IG +L +A N R ANF
Sbjct: 104 TDLSQAILVEANLNRSDLSGVNLHQANLTKASLIG-------VELNRA-----NLREANF 151
Query: 126 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 185
T A++R + ++ A L +A A AD SD +L E NL+ A L R L
Sbjct: 152 TEANLRRVELQRAQLGKANLTRANLADARMLHADFSDA-----ILQETNLSGARLNRANL 206
Query: 186 TRSDLGGAIIE-----GADFS 201
TR+DL A ++ GAD S
Sbjct: 207 TRTDLTAANLKETNLLGADLS 227
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G+ DL+ A + RAN + E+D SG+ L A KAN +GA+L+
Sbjct: 34 ANLGNFDLKGANLSGADLTRANCIGVILSEADLSGATLVRTDLSGADINKANLSGANLTK 93
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANGT 220
+ L E +L+ A+LV L RSDL G + A+ + A +I + +A + AN T
Sbjct: 94 ANLLGADLGETDLSQAILVEANLNRSDLSGVNLHQANLTKASLIGVELNRANLREANFT 152
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 20/123 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 157
AQ G A+L +A A+F+ A ++E++ SG++ N A L + A + N G
Sbjct: 164 AQLGKANLTRANLADARMLHADFSDAILQETNLSGARLNRANLTRTDLTAANLKETNLLG 223
Query: 158 ADLSDTLMDRMVLNEANLTNA---------------VLVRTVLTRSDLGGAIIEGADFSD 202
ADLS +L EANL+ A L T LT+++L GA + A+ +
Sbjct: 224 ADLSYANFTEALLAEANLSGADLSYANLAGLDLTGLNLAGTNLTQANLAGANLTEANLEE 283
Query: 203 AVI 205
AV+
Sbjct: 284 AVL 286
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + AN + AD+ ++ +G G L +AN GA+L++ ++
Sbjct: 224 ADLSYANFTEALLAEANLSGADLSYANLAGLDLTGLNLAGTNLTQANLAGANLTEANLEE 283
Query: 168 MVLNEANLTNAVL 180
VL EANLT A +
Sbjct: 284 AVLTEANLTQATM 296
>gi|227496450|ref|ZP_03926734.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
gi|226834032|gb|EEH66415.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
Length = 222
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 53/104 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+++ + R AN +DMR +D G+ G +L A+ GADL D
Sbjct: 98 ADMAGADLRRSILPRAELRNANLVDSDMRGADLRGADLRGTWLPYTDMRGADLAGADLRD 157
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++ L+ A+L ++ L LT ++L A + GAD A ID
Sbjct: 158 ADLEGADLHGASLQSSDLRGADLTDAELTDADLRGADLRGADID 201
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 43/93 (46%), Gaps = 6/93 (6%)
Query: 102 AAQFGSA-DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
A + G A DLR N R + T AD+R G+ +GA L + A+ T ADL
Sbjct: 16 AHRLGQAPDLRDTDLSNLNLRELDLTDADLR-----GANLDGADLSWSTLSTADLTDADL 70
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
+ R VL A LT A L + +D+ GA
Sbjct: 71 RGATLRRTVLTRAVLTRAALTQVYARDADMAGA 103
>gi|428304969|ref|YP_007141794.1| heat shock protein DnaJ domain-containing protein [Crinalium
epipsammum PCC 9333]
gi|428246504|gb|AFZ12284.1| heat shock protein DnaJ domain protein [Crinalium epipsammum PCC
9333]
Length = 242
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 124 NFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + AD++E DFSG + + A L A +K N GA+L + R L +ANL+NA
Sbjct: 128 NMSGADLKEKDFSGRNLSDANLSHANLSDAFLHKVNLQGANLYKANLFRANLLQANLSNA 187
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
L L +DL GA + GAD + A I K
Sbjct: 188 CLREANLIGADLSGADLRGADLTGAKIGFNDK 219
>gi|427724799|ref|YP_007072076.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427356519|gb|AFY39242.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 276
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 64/138 (46%), Gaps = 16/138 (11%)
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTGADLSDTLMDR 167
AV K N A +A++R +D G+ GAYL AN F+GA+L + +
Sbjct: 135 AVGPKANLSGAYLNTANLRGADLQGANLRGAYLSGTDFTGANLTGVAFSGANLKRSFLTG 194
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIE------GADFSDAV-IDLAQKQALC----KY 216
L EA L N L L +DL GA++E GADFSD + +++ LC K
Sbjct: 195 ACLREARLINVELEMADLRGADLTGAMLEQIESLAGADFSDVRGLSDSERSYLCSRSPKE 254
Query: 217 ANGTNPITGVSTRKSLGC 234
N T +TR SL C
Sbjct: 255 LGTWNSFTRKNTRASLNC 272
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 11/98 (11%)
Query: 115 HVKENFRRAN-FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
VKE R N +A++ + D +G + A L+ A+ NFTGA L+ + L A
Sbjct: 17 EVKEILERGNSLENANLEDLDLAGYDLSDANLQGAILIGVNFTGATLAGAQLQNADLRRA 76
Query: 174 NLTN----------AVLVRTVLTRSDLGGAIIEGADFS 201
NLTN A L RT+L DL GA+++GA+ +
Sbjct: 77 NLTNASLKGATLSEAYLQRTILNDCDLAGAVLDGANLT 114
>gi|86608719|ref|YP_477481.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557261|gb|ABD02218.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 207
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 64/127 (50%), Gaps = 4/127 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L++A+ + N + A+ + A + +D G+ +G+ A ++A+ A+L
Sbjct: 68 ADLSGANLKEAILRQANLQAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQ 127
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
T + L ANL+ A L R +LTR+DL GA + AD A + + A + A+ T
Sbjct: 128 TDLTGANLQVANLSGADLRRAILTRADLTGAKLHNADLRGADL----RGAFLEGADLTGA 183
Query: 223 ITGVSTR 229
+ TR
Sbjct: 184 LYNAQTR 190
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 56/113 (49%), Gaps = 1/113 (0%)
Query: 97 FGIGSAAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
GI +AA F +L + + F + + ++ ++ G+ +GA L++A+ +AN
Sbjct: 26 LGIPTAAAFAQLELDAQLGRSQIVFPSKDCPACNLTGAELPGADLSGANLKEAILRQANL 85
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
ADLS +++ L ANL+ + L +DL A ++ D + A + +A
Sbjct: 86 QAADLSQAILNLADLRGANLSGSAQAGAFLWEADLAQANLQQTDLTGANLQVA 138
>gi|217423045|ref|ZP_03454547.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
gi|217393953|gb|EEC33973.1| pentapeptide repeat protein [Burkholderia pseudomallei 576]
Length = 825
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 6.3, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSDADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|434407711|ref|YP_007150596.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428261966|gb|AFZ27916.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 268
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 52/101 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G++ L + N N +AD+ E+ ++ GAYL K YKAN T A LS
Sbjct: 144 ADLGTSKLHRTNLCFANLIAVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSG 203
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R L EA+L+NA L T L ++L GA + GA+ A
Sbjct: 204 AYLLRANLTEADLSNADLSWTNLRGANLTGANLRGANLRGA 244
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 43/98 (43%), Gaps = 15/98 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G ADL A N A A++ ++ S + GA L +A AN G+DLS
Sbjct: 64 ANLGGADLTGA-----NLYNAKLIEANLSAANLSAANLRGATLTQADMNCANLIGSDLS- 117
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
EANL AV+ L +DL GA + AD
Sbjct: 118 ---------EANLKGAVITDANLIGADLRGANLRDADL 146
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 56/112 (50%), Gaps = 10/112 (8%)
Query: 76 CSSNISAL----ADLNK---YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF 125
C +N+ A+ ADL++ +EAE G + + A A L A ++ N A+
Sbjct: 157 CFANLIAVNLIAADLSEATLHEAEVMGAYLYKTDLYKANLTEAHLSGAYLLRANLTEADL 216
Query: 126 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++AD+ ++ G+ GA L A AN TGA+LS + ++ ++++ N
Sbjct: 217 SNADLSWTNLRGANLTGANLRGANLRGANLTGANLSSVNLHETIMPDSSMHN 268
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 5/98 (5%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
+K + + R AN ++R ++ G + L A+ +AN GADL+ + L
Sbjct: 22 KKNPQIAPDLRGANLQGDNLRGANLQGVNLSKVDLSNALLVRANLGGADLTGANLYNAKL 81
Query: 171 NEANLTNAVL----VR-TVLTRSDLGGAIIEGADFSDA 203
EANL+ A L +R LT++D+ A + G+D S+A
Sbjct: 82 IEANLSAANLSAANLRGATLTQADMNCANLIGSDLSEA 119
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 51/109 (46%), Gaps = 15/109 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA +A+LR A T ADM ++ GS + A L+ AV AN GADL
Sbjct: 87 SAANLSAANLRGAT----------LTQADMNCANLIGSDLSEANLKGAVITDANLIGADL 136
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L +A+L + L RT L ++L + AD S+A + A+
Sbjct: 137 RGA-----NLRDADLGTSKLHRTNLCFANLIAVNLIAADLSEATLHEAE 180
>gi|167905147|ref|ZP_02492352.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
gi|237508538|ref|ZP_04521253.1| pentapeptide repeat family protein [Burkholderia pseudomallei
MSHR346]
gi|235000743|gb|EEP50167.1| pentapeptide repeat family protein [Burkholderia pseudomallei
MSHR346]
Length = 825
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 6.4, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSDADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|428312148|ref|YP_007123125.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253760|gb|AFZ19719.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 223
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 50/88 (56%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N N + D+R +DF G+ A L +A AN GA LS +++R VLN A L++A
Sbjct: 21 NLEGINLSDTDLRGADFRGADLFDANLARADLSDANLGGAILSRAVLNRAVLNRAVLSSA 80
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVID 206
+L L R+ L GA++ GA + A+++
Sbjct: 81 LLSNAFLNRAVLCGAVLRGAILNGAILN 108
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 58/125 (46%), Gaps = 6/125 (4%)
Query: 80 ISALADLNKYEAETRG-EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 138
++A+ L +Y A R +F DLR A +FR A+ A++ +D S +
Sbjct: 1 MNAIELLERYAAGERSFDFPNLEGINLSDTDLRGA-----DFRGADLFDANLARADLSDA 55
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
GA L +AV +A A LS L+ LN A L AVL +L + L GA + GA
Sbjct: 56 NLGGAILSRAVLNRAVLNRAVLSSALLSNAFLNRAVLCGAVLRGAILNGAILNGANLSGA 115
Query: 199 DFSDA 203
D A
Sbjct: 116 DLYHA 120
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 58/131 (44%), Gaps = 10/131 (7%)
Query: 59 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA----AQFGSADLRKAV 114
N V L A++ N + L+ + Y A G +G A A SA LR+A
Sbjct: 88 NRAVLCGAVLRGAILNGAILNGANLSGADLYHANLSGAL-LGYADLYHAYLNSALLREAD 146
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
R AN A++R ++ SG+ GA L AN GA+LSD L AN
Sbjct: 147 LYHAYLREANLFGANLRSANLSGADLTGANLMATNLRSANLFGANLSDA-----NLGGAN 201
Query: 175 LTNAVLVRTVL 185
+ A++ +T++
Sbjct: 202 MRCALICQTIM 212
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 5/89 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEAN 174
RA A +R + +G+ NGA L A Y AN +G ADL ++ +L EA+
Sbjct: 87 LNRAVLCGAVLRGAILNGAILNGANLSGADLYHANLSGALLGYADLYHAYLNSALLREAD 146
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L +A L L ++L A + GAD + A
Sbjct: 147 LYHAYLREANLFGANLRSANLSGADLTGA 175
>gi|16331795|ref|NP_442523.1| hypothetical protein slr0516 [Synechocystis sp. PCC 6803]
gi|383323538|ref|YP_005384392.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326707|ref|YP_005387561.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492591|ref|YP_005410268.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437859|ref|YP_005652584.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
gi|451815947|ref|YP_007452399.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
gi|6226382|sp|Q55837.1|Y516_SYNY3 RecName: Full=Uncharacterized protein slr0516
gi|1001755|dbj|BAA10593.1| slr0516 [Synechocystis sp. PCC 6803]
gi|339274892|dbj|BAK51379.1| hypothetical protein SYNGTS_2631 [Synechocystis sp. PCC 6803]
gi|359272858|dbj|BAL30377.1| hypothetical protein SYNGTI_2630 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276028|dbj|BAL33546.1| hypothetical protein SYNPCCN_2629 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279198|dbj|BAL36715.1| hypothetical protein SYNPCCP_2629 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407960570|dbj|BAM53810.1| hypothetical protein BEST7613_4879 [Bacillus subtilis BEST7613]
gi|451781916|gb|AGF52885.1| hypothetical protein MYO_126560 [Synechocystis sp. PCC 6803]
Length = 166
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 5/88 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN---- 174
+ R N +A + SD SG+ +G L +A+ +AN TGA+LS+T + L EAN
Sbjct: 49 DLREFNLENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSETDLTEAALTEANLAGA 108
Query: 175 -LTNAVLVRTVLTRSDLGGAIIEGADFS 201
L+ A L R+ L DL GA ++GA+ +
Sbjct: 109 DLSGANLERSFLRDVDLTGANLKGANLA 136
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N AD+RE FN LE A +++ +GA+LS + R +L+ ANLT A L T
Sbjct: 44 NLAGADLRE-------FN---LENARLNRSDLSGANLSGVNLRRALLDRANLTGANLSET 93
Query: 184 VLTRSDLGGAIIEGADFSDAVID 206
LT + L A + GAD S A ++
Sbjct: 94 DLTEAALTEANLAGADLSGANLE 116
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 16/121 (13%)
Query: 84 ADLNKYEAET-RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
ADL ++ E R S A +LR+A+ RAN T A++ E+D +
Sbjct: 48 ADLREFNLENARLNRSDLSGANLSGVNLRRAL-----LDRANLTGANLSETDLT------ 96
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+A +AN GADLS ++R L + +LT A L L ++L A + D +
Sbjct: 97 ----EAALTEANLAGADLSGANLERSFLRDVDLTGANLKGANLAWANLTAANLTDVDLEE 152
Query: 203 A 203
A
Sbjct: 153 A 153
>gi|428307284|ref|YP_007144109.1| serine/threonine protein kinase with pentapeptide repeats
[Crinalium epipsammum PCC 9333]
gi|428248819|gb|AFZ14599.1| serine/threonine protein kinase with pentapeptide repeats
[Crinalium epipsammum PCC 9333]
Length = 564
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 54/92 (58%), Gaps = 5/92 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ +F N ++ ++++++ SG F+ A L + NF GA+LS+T M + L+ A L
Sbjct: 437 RRDFGEQNLSNLNLQKANLSGGNFHQANLTQT-----NFQGANLSNTDMGQTSLSGAMLR 491
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+A LVR L+ +DL GA + GAD S A + A
Sbjct: 492 DANLVRAYLSYADLEGADLRGADLSFAYFNYA 523
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 49/117 (41%), Gaps = 10/117 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A + +A + NF+ AN ++ DM ++ SG+ A L +A A+ GADL
Sbjct: 453 ANLSGGNFHQANLTQTNFQGANLSNTDMGQTSLSGAMLRDANLVRAYLSYADLEGADLRG 512
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
+ N ANL A +L GA + GA +D I A+ NG
Sbjct: 513 ADLSFAYFNYANLRGA----------NLCGANLTGAKINDEQIAQAKTNWATVLPNG 559
>gi|410672126|ref|YP_006924497.1| pentapeptide repeat protein [Methanolobus psychrophilus R15]
gi|409171254|gb|AFV25129.1| pentapeptide repeat protein [Methanolobus psychrophilus R15]
Length = 418
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 67/131 (51%), Gaps = 4/131 (3%)
Query: 73 VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 132
++S + I A+L + + E G A A+L++A + N RRAN AD+
Sbjct: 197 LSSSMAEIKPQANLQRIDMEKTDLLG----ANLMEANLKEANLREANLRRANLEGADLMG 252
Query: 133 SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
++ G+ A L A A+ GA+L D + ++ LN+ANL A L+ T L R++L
Sbjct: 253 ANLMGADMREANLMLANLEGASLMGANLMDANLKKINLNKANLVGANLIGTNLLRAELTE 312
Query: 193 AIIEGADFSDA 203
A++ A+ DA
Sbjct: 313 ALLMNAEIIDA 323
>gi|398354158|ref|YP_006399622.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
gi|390129484|gb|AFL52865.1| hypothetical protein USDA257_c43260 [Sinorhizobium fredii USDA 257]
Length = 249
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 11/124 (8%)
Query: 101 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A+ +A+L KA V+ + +ANF+ + DFSG GA + +A+F
Sbjct: 85 SGAELTAANLEKATLVRASLAGAKADKANFSRVEAYRGDFSGISAEGALFVSSELQRADF 144
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQ 209
TGA L+ ++ L AN AVL T L+R++L GA+ EG DF A + L +
Sbjct: 145 TGARLTGADFEKAELGRANFGKAVLTGTRFSVANLSRANLSGALFEGPLDFDRAFLFLTR 204
Query: 210 KQAL 213
+ L
Sbjct: 205 IEGL 208
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 5/111 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G A + + R+ + + + ++ D +D SG++ A LEKA +A+ GA
Sbjct: 49 GPGADWRECNKRQLMLGGSDLKGSHLVDTDFASTDLSGAELTAANLEKATLVRASLAGAK 108
Query: 160 LSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
R+ + + A+ V + L R+D GA + GADF A +
Sbjct: 109 ADKANFSRVEAYRGDFSGISAEGALFVSSELQRADFTGARLTGADFEKAEL 159
>gi|167921391|ref|ZP_02508482.1| pentapeptide repeat protein [Burkholderia pseudomallei BCC215]
Length = 825
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQVADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 38.9 bits (89), Expect = 2.7, Method: Composition-based stats.
Identities = 38/153 (24%), Positives = 61/153 (39%), Gaps = 17/153 (11%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
+A+A A S ++ L + + RG G A +ADL A + R
Sbjct: 499 VASAAAAGQSLQVADLTGADLSGMDLRGARLAG--AMLENADLSDADLTGADLSRTVLVR 556
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ + ++ A L A + +F+G+DLSD + +++ L + +VL T
Sbjct: 557 ADLTRAKLVDARLTAANLSLAHCERTDFSGSDLSDGIFEQVHLRDCRFNGSVLASTRFDA 616
Query: 188 S-----DLGGAIIE----------GADFSDAVI 205
D G A + G FSDA I
Sbjct: 617 CRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|126455703|ref|YP_001074295.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
1106a]
gi|167896768|ref|ZP_02484170.1| pentapeptide repeat protein [Burkholderia pseudomallei 7894]
gi|242312992|ref|ZP_04812009.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
gi|254195379|ref|ZP_04901807.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
gi|126229471|gb|ABN92884.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106a]
gi|169652126|gb|EDS84819.1| pentapeptide repeat protein [Burkholderia pseudomallei S13]
gi|242136231|gb|EES22634.1| pentapeptide repeat protein [Burkholderia pseudomallei 1106b]
Length = 825
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQVADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 38.9 bits (89), Expect = 2.7, Method: Composition-based stats.
Identities = 38/153 (24%), Positives = 61/153 (39%), Gaps = 17/153 (11%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
+A+A A S ++ L + + RG G A +ADL A + R
Sbjct: 499 VASAAAAGQSLQVADLTGADLSGMDLRGARLAG--AMLENADLSDADLTGADLSRTVLVR 556
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ + ++ A L A + +F+G+DLSD + +++ L + +VL T
Sbjct: 557 ADLTRAKLVDARLTAANLSLAHCERTDFSGSDLSDGIFEQVHLRDCRFNGSVLASTRFDA 616
Query: 188 S-----DLGGAIIE----------GADFSDAVI 205
D G A + G FSDA I
Sbjct: 617 CRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|53721218|ref|YP_110203.1| hypothetical protein BPSS0182 [Burkholderia pseudomallei K96243]
gi|167818308|ref|ZP_02449988.1| hypothetical protein Bpse9_24431 [Burkholderia pseudomallei 91]
gi|418395056|ref|ZP_12969100.1| type VI secretion system [Burkholderia pseudomallei 354a]
gi|418554994|ref|ZP_13119746.1| type VI secretion system [Burkholderia pseudomallei 354e]
gi|52211632|emb|CAH37627.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
gi|385369399|gb|EIF74730.1| type VI secretion system [Burkholderia pseudomallei 354e]
gi|385374364|gb|EIF79254.1| type VI secretion system [Burkholderia pseudomallei 354a]
Length = 825
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQVADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 42.0 bits (97), Expect = 0.29, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 34/60 (56%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L+D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILID 802
Score = 38.5 bits (88), Expect = 2.8, Method: Composition-based stats.
Identities = 38/153 (24%), Positives = 61/153 (39%), Gaps = 17/153 (11%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
+A+A A S ++ L + + RG G A +ADL A + R
Sbjct: 499 VASAAAAGQSLQVADLTGADLSGMDLRGARLAG--AMLENADLSDADLTGADLSRTVLVR 556
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ + ++ A L A + +F+G+DLSD + +++ L + +VL T
Sbjct: 557 ADLTRAKLVDARLTAANLSLAHCERTDFSGSDLSDGIFEQVHLRDCRFNGSVLASTRFDA 616
Query: 188 S-----DLGGAIIE----------GADFSDAVI 205
D G A + G FSDA I
Sbjct: 617 CRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
Score = 37.0 bits (84), Expect = 9.6, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 10/102 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A + R+ V+ NF A +FT A +R +D G+K G+ +A+
Sbjct: 707 SFATLTEVNFRETQLVEANFGGARIGNCDFTDACLRAADLRGAKAEGSPF-----VRADL 761
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
T ADL DT + L A L A L R L R++L +I+
Sbjct: 762 TRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILIDA 803
>gi|451980423|ref|ZP_21928815.1| conserved hypothetical protein, contains pentapeptide repeats
[Nitrospina gracilis 3/211]
gi|451762323|emb|CCQ90046.1| conserved hypothetical protein, contains pentapeptide repeats
[Nitrospina gracilis 3/211]
Length = 289
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 64/136 (47%), Gaps = 30/136 (22%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GA------------ 143
S A+F A L++A N R+ F A M E++ +G +FN GA
Sbjct: 100 SGAKFHQALLKRAQFEGANLVRSEFLEAQMNEANLAGVRFNKSDLRGAMMIGINLAGAQI 159
Query: 144 ---YLEKAVAYKANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDL 190
+L K K + TG D L+ + + VL E N NA+L RT LT ++L
Sbjct: 160 PQSHLSKTNISKGDLTGTDVSGCNLTGSDLREAVLRETNFQNAILDRTFLKGADLTGANL 219
Query: 191 GGAIIEGADFSDAVID 206
GA + GADF++ V+D
Sbjct: 220 TGARLRGADFAETVLD 235
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 10/90 (11%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
++ +F R N + + R +D SG+KF+ A L+ +A F GA+L + +NEANL
Sbjct: 80 IRADFTRTNLSGVNFRNTDLSGAKFHQALLK-----RAQFEGANLVRSEFLEAQMNEANL 134
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
VR +SDL GA++ G + + A I
Sbjct: 135 AG---VR--FNKSDLRGAMMIGINLAGAQI 159
>gi|254189534|ref|ZP_04896044.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
gi|157937212|gb|EDO92882.1| pentapeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
Length = 825
Score = 53.9 bits (128), Expect = 8e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQVADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 38.5 bits (88), Expect = 2.7, Method: Composition-based stats.
Identities = 38/153 (24%), Positives = 61/153 (39%), Gaps = 17/153 (11%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
+A+A A S ++ L + + RG G A +ADL A + R
Sbjct: 499 VASAAAAGQSLQVADLTGADLSGMDLRGARLAG--AMLENADLSDADLTGADLSRTVLVR 556
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ + ++ A L A + +F+G+DLSD + +++ L + +VL T
Sbjct: 557 ADLTRAKLVDARLTAANLSLAHCERTDFSGSDLSDGIFEQVHLRDCRFNGSVLASTRFDA 616
Query: 188 S-----DLGGAIIE----------GADFSDAVI 205
D G A + G FSDA I
Sbjct: 617 CRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|418939072|ref|ZP_13492497.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054219|gb|EHS50602.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 202
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 10/102 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADLR A NF AN SAD++ +D + + GA L A +AN TGA
Sbjct: 63 TGANLTGADLRWADCDGANFTGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGA-- 120
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+L EA L A L++ + +++L G + GAD +D
Sbjct: 121 --------ILKEARLDKASLIQAIKQKANLQGVDLSGADLTD 154
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 50/104 (48%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L V + + A + +DF+G+ GA L A ANFTGA+L +
Sbjct: 35 ANLSNGVFAGADLEQVRLAGASLEGADFTGANLTGADLRWADCDGANFTGANLKSADLQH 94
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
L ANLT A L LT ++L GAI++ A A + A KQ
Sbjct: 95 TDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQ 138
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 15/110 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANF 155
+ A SADL+ N AN T A++ E++ +G+ A L+KA + KAN
Sbjct: 83 TGANLKSADLQHTDLTNANLTGANLTGANLTEANLTGAILKEARLDKASLIQAIKQKANL 142
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
G DLS A+LT+ L R T +L GAI++GA + A++
Sbjct: 143 QGVDLSG----------ADLTDMNLSRVDFTGVNLKGAILKGAILTGAIL 182
>gi|432333149|ref|ZP_19584958.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis
IFP 2016]
gi|430779982|gb|ELB95096.1| hypothetical protein Rwratislav_00760 [Rhodococcus wratislaviensis
IFP 2016]
Length = 220
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 51/108 (47%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADLR R A+ TS +M E D SG+ A L A AN ADL+D
Sbjct: 34 ANLRNADLRLGFLRDATLRNADLTSCNMYEVDLSGANLYLAQLSGAHMTGANLNNADLTD 93
Query: 163 TLMDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T + + +L E L A L R L +DL GA + G D SDA +
Sbjct: 94 TKLIKTQLSGAMLIEVELDGADLSRAFLQNADLTGAHLRGTDLSDATL 141
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 65/150 (43%), Gaps = 16/150 (10%)
Query: 87 NKYEAETRGE---FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N YE + G S A A+L A + + A + E + G+ + A
Sbjct: 60 NMYEVDLSGANLYLAQLSGAHMTGANLNNADLTDTKLIKTQLSGAMLIEVELDGADLSRA 119
Query: 144 YLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLTNAVLVRTVLTRSDLGGAIIEGA 198
+L+ A A+ G DLSD + + M N EA L +A L LT +DL GA + GA
Sbjct: 120 FLQNADLTGAHLRGTDLSDATLVGAELMATNLAEAELVDADLTDADLTFADLTGADLRGA 179
Query: 199 -----DFSDAVI---DLAQKQALCKYANGT 220
DF+DA + DL Q +Y + T
Sbjct: 180 NLTRTDFTDADLTGADLGTTQDKARYDDTT 209
>gi|428770507|ref|YP_007162297.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684786|gb|AFZ54253.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 355
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N R A+ T D+ E++ +K NG L A AN T A+L++ + L ANLTNA
Sbjct: 253 NLRGADLTDVDLSEANLQNTKLNGVDLSGAYLEGANLTNANLTNASLALSNLIGANLTNA 312
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 214
L T L + LG I++GA F++ + ++ +KQ L
Sbjct: 313 NLTNTNLQNTSLGQTIVKGAIFANNLGLNEEKKQELI 349
>gi|86608529|ref|YP_477291.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557071|gb|ABD02028.1| pentapeptide repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 248
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTG 157
A F ++DLR + + +NFT+A + +S F G +F+ + +A AN F
Sbjct: 89 ANFTASDLRGSSFSQALGDYSNFTAAKLDKSSFQGGRFSHSIFREASLVAANLAEGNFFA 148
Query: 158 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
AD + R L++A NL A+LV L + + + GADF+DA +
Sbjct: 149 ADFRQANLSRCNLSQAALVSCQLQFANLEQAILVGANLRDAQIEDTLFSGADFTDAKLSD 208
Query: 208 AQKQALCKYANGTNPITGVSTRKSLGCG 235
++ L + A+GTN +T T +L G
Sbjct: 209 ETRKLLIERASGTNELTQRDTLNTLLAG 236
>gi|17230606|ref|NP_487154.1| hypothetical protein all3114 [Nostoc sp. PCC 7120]
gi|17132208|dbj|BAB74813.1| all3114 [Nostoc sp. PCC 7120]
Length = 576
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 45/78 (57%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + A + +D S +K NGA L A A F GADLS + +VLN+A+L+ +L
Sbjct: 421 NLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEA 480
Query: 184 VLTRSDLGGAIIEGADFS 201
LT +DL AI+ G DFS
Sbjct: 481 DLTGADLSDAILLGTDFS 498
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 67/132 (50%), Gaps = 22/132 (16%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G A+L A NF+ AN T AD +++ S +GA L A AN TGA+L
Sbjct: 268 SGAYLGDANLTGA-----NFQDANLTGADFGDANLSSVNLSGANLSSADLSSANLTGANL 322
Query: 161 SDTLMDRMVLNEANLTNAVL--------------VRTV-LTRSDLGGAIIEGADFSDAVI 205
S + R L+ A+L++++L +R L R++L AI+ GA+ SDA +
Sbjct: 323 SGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAELRRANLSNAILFGANLSDANL 382
Query: 206 DLAQ--KQALCK 215
+ A + LC+
Sbjct: 383 NHADLSRADLCR 394
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 51/90 (56%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 173
NF+ A +A++ +FSG+ +GAYL A ANF TGAD D + + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANLSSVNLSGA 305
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NL++A L LT ++L GA ++ AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLQRADLSRA 335
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 117 KENFRRAN---FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
K N+ R N F AD+ D +G N A L + +A+ TGADLSD ++ + A
Sbjct: 441 KLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEADLTGADLSDAILLGTDFSFA 500
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
NL +A L+ S+L GAI+ GAD S A + A
Sbjct: 501 NLNSA-----NLSGSNLSGAILNGADLSSANLSYA 530
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 48/106 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SADL A N AN AD+ +D S S N A N A+L
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAEL 362
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ +L ANL++A L L+R+DL A + GAD + A ++
Sbjct: 363 RRANLSNAILFGANLSDANLNHADLSRADLCRADLSGADLTHATLN 408
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 55/110 (50%), Gaps = 5/110 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 155
S+A SA+L A N +RA+ + AD+ S +FS + +G L A +AN
Sbjct: 308 SSADLSSANLTGANLSGANLQRADLSRADLSSSILNDGEFSHANLSGVNLRDAELRRANL 367
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ A L + LN A+L+ A L R L+ +DL A + G + SD ++
Sbjct: 368 SNAILFGANLSDANLNHADLSRADLCRADLSGADLTHATLNGTNLSDTIL 417
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 18/115 (15%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
GEF S A +LR A RRAN ++A + ++ S + N A L +A +A+
Sbjct: 345 GEF---SHANLSGVNLRDA-----ELRRANLSNAILFGANLSDANLNHADLSRADLCRAD 396
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+GADL+ LN NL++ +L T +L AI+E AD S A ++ A+
Sbjct: 397 LSGADLT-----HATLNGTNLSDTILFST-----NLSDAILEAADLSYAKLNGAK 441
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
V E R NF A + ++ +G F+GA L A AN TGA+ D + +ANL
Sbjct: 238 VGEFLRGGNFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQDANLTGADFGDANL 297
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ L L+ +DL A + GA+ S A
Sbjct: 298 SSVNLSGANLSSADLSSANLTGANLSGA 325
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L+ +N +AE R + +A FG A+L A + RA+ AD+ +D + + NG
Sbjct: 352 LSGVNLRDAELR-RANLSNAILFG-ANLSDANLNHADLSRADLCRADLSGADLTHATLNG 409
Query: 143 AYLEKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
L + + N + ADLS ++ LN A L A+ + L+ DL G ++
Sbjct: 410 TNLSDTILFSTNLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLND 469
Query: 198 ADFSDAVI 205
AD S ++
Sbjct: 470 ADLSGGIL 477
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
GI S A ADL A+ + +F AN SA++ S+ SG+ NGA L A A
Sbjct: 475 GILSEADLTGADLSDAILLGTDFSFANLNSANLSGSNLSGAILNGADLSSANLSYAILDD 534
Query: 158 ADLSDTLMDRMVLNE 172
D+S+ ++ M E
Sbjct: 535 TDISEANLEEMTWGE 549
>gi|374723788|gb|EHR75868.1| Pentapeptide repeats containing protein [uncultured marine group II
euryarchaeote]
Length = 148
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 49/106 (46%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LRK H NFRR AD+ E DFS F A + + K+ F GAD + +
Sbjct: 35 LRKGRHAGSNFRRGILDGADLTEGDFSNCDFRKASMYEVDLMKSAFDGADFRGADLRKAR 94
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
LN +N N L AI +G+D+ +A +D ++AL K
Sbjct: 95 LNLSNFRNCKFAGADLRGIRGKYAIWQGSDWWNATMDEGLEKALAK 140
>gi|254413321|ref|ZP_05027092.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179941|gb|EDX74934.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 636
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 51/97 (52%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA +A+LR+ + K N R A A + E++ + A L +A Y+A T ADLS
Sbjct: 210 AANLTTANLREVLLEKANLRDAILVGATLTEANLRQACLRRANLTQAELYRAILTDADLS 269
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
+ DR+ L+ ANL A L+R L ++L +++
Sbjct: 270 EVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNV 306
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 10/97 (10%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------MV 169
+N T A + ++ ++ A L +A AN T A+L + L+++
Sbjct: 178 LNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGAT 237
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L EANL A L R LT+++L AI+ AD S+ D
Sbjct: 238 LTEANLRQACLRRANLTQAELYRAILTDADLSEVTGD 274
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 157
A ++L +A+ + N + + SA + +++ S + +GA L+ A +N TG
Sbjct: 126 AILKHSNLNQAILTRVNLSKVDGQSASLCQANLSWVEAPYCNLSGANLQAAQLNHSNLTG 185
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L T + L ANL A L+ LT ++L ++E A+ DA++
Sbjct: 186 ATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAIL 233
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 66/173 (38%), Gaps = 38/173 (21%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYE-----AETRGEFGIG---SAAQFGSADLRKAV 114
+ST L AA + S + L N E A R +G + A A LR+A
Sbjct: 193 LISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGATLTEANLRQACLRRAN 252
Query: 115 HVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKAN--------------- 154
+ RA T AD+ E + S + GAYL +A AN
Sbjct: 253 LTQAELYRAILTDADLSEVTGDRVNLSRANLMGAYLLRASLVNANLRRTVLQNVYCLQTN 312
Query: 155 ----------FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
ADLS ++ +L EANLT+A L+ + L R L A + G
Sbjct: 313 LTAANLQGADLRQADLSGAYLNETILTEANLTDAYLIGSYLIRPKLEQAQLTG 365
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
N AN +A + S+ +G+ + L A Y+A+ A+L+ + ++L +A
Sbjct: 167 NLSGANLQAAQLNHSNLTGATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKA 226
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NL +A+LV LT ++L A + A+ + A
Sbjct: 227 NLRDAILVGATLTEANLRQACLRRANLTQA 256
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 44/88 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A+LR+ V + N T+A+++ +D + +GAYL + + +AN T A L
Sbjct: 291 ASLVNANLRRTVLQNVYCLQTNLTAANLQGADLRQADLSGAYLNETILTEANLTDAYLIG 350
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ + R L +A LT + L DL
Sbjct: 351 SYLIRPKLEQAQLTGCCIHNWHLEEVDL 378
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 42/96 (43%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A L K + AN A + ++ + + LEKA A GA L++ + +
Sbjct: 186 ATLDKTQLISTQLMAANLYQASLIAANLTTANLREVLLEKANLRDAILVGATLTEANLRQ 245
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANLT A L R +LT +DL + + S A
Sbjct: 246 ACLRRANLTQAELYRAILTDADLSEVTGDRVNLSRA 281
Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 36/79 (45%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
N AD+R + SG+ L A AN +GA L + +L +NL A+L R
Sbjct: 81 VNLKGADLRTINLSGANLTNTNLSWASLDYANLSGACLHQADLHNAILKHSNLNQAILTR 140
Query: 183 TVLTRSDLGGAIIEGADFS 201
L++ D A + A+ S
Sbjct: 141 VNLSKVDGQSASLCQANLS 159
>gi|119489371|ref|ZP_01622151.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
gi|119454644|gb|EAW35790.1| hypothetical protein L8106_02407 [Lyngbya sp. PCC 8106]
Length = 166
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 5/92 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + AN + A + +FS + +GA L A +ANFT A+LS+ + L A LTNA
Sbjct: 68 NLKGANLSGALLDNVNFSQADLSGANLSSAALTQANFTEANLSEANLTGAFLRSAILTNA 127
Query: 179 VLVRTVLTRSDLG-----GAIIEGADFSDAVI 205
L L ++DL GA I+GADF +A++
Sbjct: 128 KLTNASLNKADLNTAKLEGAEIKGADFKEAIM 159
>gi|443321008|ref|ZP_21050077.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
gi|442789287|gb|ELR98951.1| putative low-complexity protein [Gloeocapsa sp. PCC 73106]
Length = 333
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 83/201 (41%), Gaps = 40/201 (19%)
Query: 41 SSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG 100
++K ES Q Y + R F+ L A + + N+ L A+ IG
Sbjct: 3 TTKLESVSQLLSLYQQ--GERNFIEVKLTLANLNQANLNLINLKRAMLKSAQIIEAKLIG 60
Query: 101 ---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-------------------- 137
S A +A+L ++ + N +AN A++ ++D SG
Sbjct: 61 ANLSEADLEAANLTRSTLIDINLSKANLNHANLTDADLSGANLSNSNLTGADLSNASLIS 120
Query: 138 ----------SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV-----LVR 182
SK A L AV KAN ADLS ++R +L EANL A+ L+R
Sbjct: 121 SSMIGSCLSKSKLKLANLTSAVLAKANLQYADLSFAGLNRAILTEANLRGAILKQATLIR 180
Query: 183 TVLTRSDLGGAIIEGADFSDA 203
+ L R DL GA ++G + S A
Sbjct: 181 SYLNRVDLSGANLQGCNLSLA 201
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I + A A L++A ++ R + + A+++ + S + GA L A AN GA
Sbjct: 162 ILTEANLRGAILKQATLIRSYLNRVDLSGANLQGCNLSLADLRGANLTGANLQGANLEGA 221
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+LSD + L+ ANLT A LV T L R++L GA + A+
Sbjct: 222 NLSD-----VNLSGANLTKANLVGTQLVRANLTGAKLSYANL 258
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 10/93 (10%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR A N + AN A++ + + SG+ A L +AN TGA LS
Sbjct: 201 ADLRGANLTGANLQGANLEGANLSDVNLSGANLTKANLVGTQLVRANLTGAKLS------ 254
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
ANL + L++ L++++L A + GA
Sbjct: 255 ----YANLKGSNLLKANLSQANLAAANLSGAGL 283
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + N + AD+R ++ +G+ GA LE A N +GA+L+ + L ANLT A
Sbjct: 192 NLQGCNLSLADLRGANLTGANLQGANLEGANLSDVNLSGANLTKANLVGTQLVRANLTGA 251
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L L S+L A + A+ + A
Sbjct: 252 KLSYANLKGSNLLKANLSQANLAAA 276
>gi|428314592|ref|YP_007151039.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428256316|gb|AFZ22271.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 237
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 20/119 (16%)
Query: 105 FGSADLRKAVHVKENFRRANF---------------TSADMRESDFSGSKFNGAYLEKAV 149
F +A+LR AV V++N + NF + D+ +D S + NGA L +A
Sbjct: 105 FANANLRCAVLVEQNLCQCNFSYVKLNFANLSGINLSGVDLTSADLSDACLNGANLSQAS 164
Query: 150 AY-----KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
Y +AN + A+L T + + LN+ANLT A L L+ +DL GAI++ A S A
Sbjct: 165 LYRTLLTRANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADLRGAILDEATLSGA 223
Score = 40.4 bits (93), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L +A + RAN + A++R ++ + N A L +A AN + ADL
Sbjct: 151 SDACLNGANLSQASLYRTLLTRANLSQANLRGTNLFKASLNDANLTQADLTGANLSFADL 210
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVL 185
++D L+ ANLT A L + L
Sbjct: 211 RGAILDEATLSGANLTGAKLTQGQL 235
>gi|427416432|ref|ZP_18906615.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425759145|gb|EKU99997.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 237
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 54/106 (50%), Gaps = 15/106 (14%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F + +L +++ + R AN A +RESD S + A LEKA KA+ GA+LSD
Sbjct: 57 FENCNLSESILWGSDLRNANLKQAQLRESDLSSALLTQANLEKANLIKASLCGANLSD-- 114
Query: 165 MDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 205
ANL NA L+ L R+DLG + + GAD S A +
Sbjct: 115 --------ANLANACLLDADLRSNSDQRTDLGQSNLSGADLSYAFL 152
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 47/83 (56%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
ANF+ +D+R+S + F E ++ G+DL + + + L E++L++A+L +
Sbjct: 35 ANFSQSDLRQSRLGRTHFCRVNFENCNLSESILWGSDLRNANLKQAQLRESDLSSALLTQ 94
Query: 183 TVLTRSDLGGAIIEGADFSDAVI 205
L +++L A + GA+ SDA +
Sbjct: 95 ANLEKANLIKASLCGANLSDANL 117
>gi|126442493|ref|YP_001061349.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
668]
gi|126221984|gb|ABN85489.1| pentapeptide repeat protein [Burkholderia pseudomallei 668]
Length = 825
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.53, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 7.4, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSDADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|94266194|ref|ZP_01289904.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93453242|gb|EAT03697.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 818
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N D+RE DF G++ +G ++A A+F+GADL + L A L A L R
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 217
L R+DLG A + AD + A + A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 67/152 (44%), Gaps = 28/152 (18%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----------EKAVA 150
AA AD A K NF AN T+A +R++D +G + A L +A
Sbjct: 375 AANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATL 434
Query: 151 YKANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+AN T GADLS+ +L A+L AVLVRT LT + L A + +
Sbjct: 435 IRANLTNASLREADLTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTL 489
Query: 201 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
SDA DL+ NG N G S +SL
Sbjct: 490 SDA--DLSNADLKEASLNGVNLGAGASVLQSL 519
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
SAA A L+K++ + S R +DF + + A A KANF G
Sbjct: 339 SAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSRADATGADFSKANFAGANL 398
Query: 158 -----------------ADLSDTLMD------RMVLNEANLTNAVLVRTVLTRSDLGGAI 194
A+L+D +D R L ANLTNA L LT +DL AI
Sbjct: 399 TAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREADLTGADLSNAI 458
Query: 195 IEGADFSDAVI 205
+ GAD +AV+
Sbjct: 459 LTGADLREAVL 469
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 25/113 (22%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS- 161
A G A LR+A+ RRA F D R+ D + F GA ++ NF+GADLS
Sbjct: 221 ANLGGARLRQAI-----LRRALFGETDARKVDARQADFRGATFQRG-----NFSGADLSR 270
Query: 162 ----DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 200
DT + +L E +L A L + L+R ++LGGA + GAD
Sbjct: 271 ARFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 7/123 (5%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
LA ++ E + RG G F A+LR A +F A+ AD+ E+D G+K G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A L + +A+ ADLS+ + R L A L A+L R + +D ADF
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255
Query: 203 AVI 205
A
Sbjct: 256 ATF 258
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A FG D RK + +FR R NF+ AD+ + F+ + +GA L++ A G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+DLS + + L +ANL A L L +DL A + AD S A DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F D+R +D G+ A L A A+ ADLS + R L ANLT+A+L T+
Sbjct: 529 FVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDLRWANLTDAILQGTI 588
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQ 209
L+ + L A A+F++A DL Q
Sbjct: 589 LSNASLNDANFNRANFAEA--DLTQ 611
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 2/121 (1%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADL A V+ + A+ A +++S +G+ +G+ L A A+F A+LS
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 220
++AN A L VL ++DL G + A+ +DA +D A +A AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440
Query: 221 N 221
N
Sbjct: 441 N 441
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 49/119 (41%), Gaps = 20/119 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAV--- 149
A ADLR A V N R N AD+ E+D S G++ A L +A+
Sbjct: 181 ADLSEADLRGAKLVGANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGE 240
Query: 150 -------AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A +A+F GA L+ A + L +L DL GA +EG+D S
Sbjct: 241 TDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEGSDLS 299
>gi|94266259|ref|ZP_01289965.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93453141|gb|EAT03609.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 818
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N D+RE DF G++ +G ++A A+F+GADL + L A L A L R
Sbjct: 142 NLAGMDLREVDFRGARLHGVSFQEANLRGADFSGADLMHADLSEADLRGAKLVGANLSRV 201
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYA 217
L R+DLG A + AD + A + A+ +QA+ + A
Sbjct: 202 NLARADLGEADLSEADLTRANLGGARLRQAILRRA 236
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 67/152 (44%), Gaps = 28/152 (18%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----------EKAVA 150
AA AD A K NF AN T+A +R++D +G + A L +A
Sbjct: 375 AANLSRADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATL 434
Query: 151 YKANFT----------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+AN T GADLS+ +L A+L AVLVRT LT + L A + +
Sbjct: 435 IRANLTNASLREADLTGADLSNA-----ILTGADLREAVLVRTRLTHAHLNRADLAWSTL 489
Query: 201 SDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
SDA DL+ NG N G S +SL
Sbjct: 490 SDA--DLSNADLKEASLNGVNLGAGASVLQSL 519
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
SAA A L+K++ + S R +DF + + A A KANF G
Sbjct: 339 SAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSRADATGADFSKANFAGANL 398
Query: 158 -----------------ADLSDTLMD------RMVLNEANLTNAVLVRTVLTRSDLGGAI 194
A+L+D +D R L ANLTNA L LT +DL AI
Sbjct: 399 TAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLTNASLREADLTGADLSNAI 458
Query: 195 IEGADFSDAVI 205
+ GAD +AV+
Sbjct: 459 LTGADLREAVL 469
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 25/113 (22%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS- 161
A G A LR+A+ RRA F D R+ D + F GA ++ NF+GADLS
Sbjct: 221 ANLGGARLRQAI-----LRRALFGETDARKVDARQADFRGATFQRG-----NFSGADLSR 270
Query: 162 ----DTLMDRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADF 200
DT + +L E +L A L + L+R ++LGGA + GAD
Sbjct: 271 ARFADTDLSGAILQEVDLAGAELEGSDLSRLALPGVRLVKANLGGANLYGADL 323
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 7/123 (5%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
LA ++ E + RG G F A+LR A +F A+ AD+ E+D G+K G
Sbjct: 143 LAGMDLREVDFRGARLHG--VSFQEANLRGA-----DFSGADLMHADLSEADLRGAKLVG 195
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A L + +A+ ADLS+ + R L A L A+L R + +D ADF
Sbjct: 196 ANLSRVNLARADLGEADLSEADLTRANLGGARLRQAILRRALFGETDARKVDARQADFRG 255
Query: 203 AVI 205
A
Sbjct: 256 ATF 258
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A FG D RK + +FR R NF+ AD+ + F+ + +GA L++ A G
Sbjct: 236 ALFGETDARKVDARQADFRGATFQRGNFSGADLSRARFADTDLSGAILQEVDLAGAELEG 295
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+DLS + + L +ANL A L L +DL A + AD S A DLA
Sbjct: 296 SDLSRLALPGVRLVKANLGGANLYGADLRAADLTDASLVEADLSAA--DLA 344
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F D+R +D G+ A L A A+ ADLS + R L ANLT+A+L T+
Sbjct: 529 FVRYDLRNADLRGANLRDADLADADLSNADLANADLSRANLSRSDLRWANLTDAILQGTI 588
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQ 209
L+ + L A A+F++A DL Q
Sbjct: 589 LSNASLNDANFNRANFAEA--DLTQ 611
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 2/121 (1%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADL A V+ + A+ A +++S +G+ +G+ L A A+F A+LS
Sbjct: 321 ADLRAADLTDASLVEADLSAADLAGAKLQKSIMAGATLHGSRLVSVTARNADFRAANLSR 380
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCKYANGT 220
++AN A L VL ++DL G + A+ +DA +D A +A AN T
Sbjct: 381 ADATGADFSKANFAGANLTAAVLRQTDLTGVEMLEANLTDAQLDQADLSSRATLIRANLT 440
Query: 221 N 221
N
Sbjct: 441 N 441
>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 479
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 51/97 (52%), Gaps = 10/97 (10%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
SADLR + + AN + AD+RE+DF+G A AN +GADL +
Sbjct: 338 SADLRGVDLTRADLSGANLSDADLRETDFTG----------ATLLFANLSGADLRGVDLT 387
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L+ ANLT A L + L R +L GA + AD SDA
Sbjct: 388 KADLSGANLTEADLRKADLMRVNLEGADLTEADLSDA 424
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 52/97 (53%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR A ++ + AN + AD+ ES + + A L AV +AN GA+ + + +
Sbjct: 49 LRYADLIEADLSGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQAS 108
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L +ANL A L LTR++L GA + G+ S A++D
Sbjct: 109 LIKANLVGANLHEANLTRANLSGADLRGSQLSGAILD 145
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
R AN D+RE++ SG+ A L +A AN +GADL+++ LN ANLT A
Sbjct: 29 LRGANLRGTDLRETNLSGAMLRYADLIEADLSGANLSGADLAESF-----LNLANLTRAD 83
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVI 205
L VL ++L GA GA+ A +
Sbjct: 84 LTGAVLREANLVGAEFTGANLKQASL 109
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 20/110 (18%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADL ++ AN T AD +RE++ G++F GA L++A KAN
Sbjct: 60 SGANLSGADLAESF-----LNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANL 114
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GA+ L+EANLT A L L S L GAI++ A +++ I
Sbjct: 115 VGAN----------LHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTI 154
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + N +N +S +++ DFS + AYL+ A + + GADLS
Sbjct: 274 ADLNGSDLSGANLSGSNLSSVNLKNVDFSRASLKKAYLKGANLEQTDLRGADLSGA---- 329
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+L++ NL++A L LTR+DL GA + AD +
Sbjct: 330 -ILHQVNLSSADLRGVDLTRADLSGANLSDADLRE 363
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 155
S A ADLR+ AN + AD+R ++D SG+ A L KA + N
Sbjct: 352 SGANLSDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGANLTEADLRKADLMRVNL 411
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
GADL+ EA+L++A L R L ++L G ++GA
Sbjct: 412 EGADLT----------EADLSDAHLFRVNLRGANLKGTNLKGA 444
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 68/159 (42%), Gaps = 36/159 (22%)
Query: 103 AQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAV-------- 149
A+F A+L++A +K N AN T A++ +D GS+ +GA L+KAV
Sbjct: 97 AEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAVYNNRTIFP 156
Query: 150 ------AYKA------------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
A A N DL++ + L NL A+L L R++L
Sbjct: 157 EDIDPGAMGAFLLAPNASLPGLNLAMVDLTEADLKGADLRRTNLYKAILFGAKLDRANLA 216
Query: 192 GAIIEGADFSDAVID--LAQKQALCK---YANGTNPITG 225
GA + AD +A + + +K K ++ G +P G
Sbjct: 217 GANLSAADLREASLSGTILEKAVYSKKTLFSEGIDPALG 255
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 43/99 (43%), Gaps = 5/99 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTG 157
A ADLR K + AN T AD+R++D G+ A L A ++ N G
Sbjct: 374 ANLSGADLRGVDLTKADLSGANLTEADLRKADLMRVNLEGADLTEADLSDAHLFRVNLRG 433
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
A+L T + L LT+A L T L DL + E
Sbjct: 434 ANLKGTNLKGASLKGVFLTDAYLSETDLADIDLSPSFFE 472
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+ + D +A K + AN D+R +D SG+ + L A + T ADL
Sbjct: 292 SSVNLKNVDFSRASLKKAYLKGANLEQTDLRGADLSGAILHQVNLSSADLRGVDLTRADL 351
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L E + T A L+ L+ +DL G + AD S A
Sbjct: 352 SGANLSDADLRETDFTGATLLFANLSGADLRGVDLTKADLSGA 394
>gi|334117701|ref|ZP_08491792.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333460810|gb|EGK89418.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 214
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 157
A G ADL +A+ V+ N RA A++ ++D SG + GA + +A +A+ G
Sbjct: 70 ADLGGADLTEALLVEANLNRAELMGANLSKADLSGASLIQATLIGANVSRATLSRADLHG 129
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L + R VL E +L A L + L+ +DL GA + AD ++A++
Sbjct: 130 VNLYGVNLRRAVLTECDLIGANLSKVDLSGADLMGASLIRADLTEAIL 177
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 2/117 (1%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 148
YEAE G + A G A+L KA + NF +AN ++ +D G+ A L +A
Sbjct: 28 YEAELIGA-NLYEADLIG-ANLSKAKLNRVNFGKANLCKINLMRADLGGADLTEALLVEA 85
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+A GA+LS + L +A L A + R L+R+DL G + G + AV+
Sbjct: 86 NLNRAELMGANLSKADLSGASLIQATLIGANVSRATLSRADLHGVNLYGVNLRRAVL 142
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 55/127 (43%), Gaps = 12/127 (9%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE----------SDFS 136
N YEA+ G S A+ + KA K N RA+ AD+ E ++
Sbjct: 36 NLYEADLIGANL--SKAKLNRVNFGKANLCKINLMRADLGGADLTEALLVEANLNRAELM 93
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
G+ + A L A +A GA++S + R L+ NL L R VLT DL GA +
Sbjct: 94 GANLSKADLSGASLIQATLIGANVSRATLSRADLHGVNLYGVNLRRAVLTECDLIGANLS 153
Query: 197 GADFSDA 203
D S A
Sbjct: 154 KVDLSGA 160
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 5/74 (6%)
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
E +FSG + YL +A AN ADL + + LN N A L + L R+DLG
Sbjct: 14 ERNFSGVYLHEVYLYEAELIGANLYEADLIGANLSKAKLNRVNFGKANLCKINLMRADLG 73
Query: 192 GAIIEGADFSDAVI 205
GAD ++A++
Sbjct: 74 -----GADLTEALL 82
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 55/115 (47%), Gaps = 13/115 (11%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN AD+ ++ S +K N KA K N ADL + +L EANL A L+
Sbjct: 35 ANLYEADLIGANLSKAKLNRVNFGKANLCKINLMRADLGGADLTEALLVEANLNRAELMG 94
Query: 183 TVLTRSDLGGA-IIE----GADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSL 232
L+++DL GA +I+ GA+ S A + A + Y GV+ R+++
Sbjct: 95 ANLSKADLSGASLIQATLIGANVSRATLSRADLHGVNLY--------GVNLRRAV 141
>gi|170751525|ref|YP_001757785.1| pentapeptide repeat-containing protein [Methylobacterium
radiotolerans JCM 2831]
gi|170658047|gb|ACB27102.1| pentapeptide repeat protein [Methylobacterium radiotolerans JCM
2831]
Length = 456
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLNEANLTN 177
A F A MR +D SG+ +G A + A+F+GAD DT+ +D L +ANLT+
Sbjct: 141 ARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTH 200
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A LT++ L G+ + GA F+ A +D
Sbjct: 201 ADFAEASLTKASLAGSRLRGAHFTGAKLD 229
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 47/105 (44%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L AV F A +AD+ E+D SG+ F G VA + F GA L
Sbjct: 84 SGANLRGASLTGAVGRSTRFTGAILEAADLSEADLSGADFTG-----IVAGQVKFAGAML 138
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
D + A+L+ A+L T +DL GA GAD D V
Sbjct: 139 EDARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVF 183
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A L +A N A+F A + ++ +GS+ GA+ A A+ +GADLSDT
Sbjct: 183 FRGARLDEAKLADANLTHADFAEASLTKASLAGSRLRGAHFTGAKLDGADLSGADLSDTD 242
Query: 165 MDRM-----VLNEANLTNAVLVRTVLTRSDLGGAIIE 196
+ R+ L A A L T ++ LGGA+ E
Sbjct: 243 LVRLNLATCRLRHARFAGAWLNGTRMSVEQLGGAVGE 279
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ + A M E+D SG+ GA L AV FTGA +++ L+EA+L+ A
Sbjct: 71 ADLSRARMEEADLSGANLRGASLTGAVGRSTRFTGA-----ILEAADLSEADLSGADFTG 125
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLA 208
V + GA++E A F +A + A
Sbjct: 126 IVAGQVKFAGAMLEDARFGEAAMRFA 151
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 51/106 (48%), Gaps = 15/106 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-----FNGAYLEKAVAYKANFTG 157
A+FG A +R A +F AD+ +DFSG+ F GA L++A AN T
Sbjct: 141 ARFGEAAMRFADLSGALLDGTDFAGADLWGADFSGADADDTVFRGARLDEAKLADANLTH 200
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AD + EA+LT A L + L + GA ++GAD S A
Sbjct: 201 ADFA----------EASLTKASLAGSRLRGAHFTGAKLDGADLSGA 236
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 10/98 (10%)
Query: 116 VKENFRRANFTSADMRESDFSG----------SKFNGAYLEKAVAYKANFTGADLSDTLM 165
V + RA AD+ ++ G ++F GA LE A +A+ +GAD + +
Sbjct: 69 VGADLSRARMEEADLSGANLRGASLTGAVGRSTRFTGAILEAADLSEADLSGADFTGIVA 128
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ A L +A + +DL GA+++G DF+ A
Sbjct: 129 GQVKFAGAMLEDARFGEAAMRFADLSGALLDGTDFAGA 166
>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
Length = 436
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 9/127 (7%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR A + + A+ + AD+ E+D SG+ A L +A KAN A+L
Sbjct: 289 SGADLSGANLRGANLSEADLSEADLSEADLSEADLSGANLIDANLRRANLIKANLRRANL 348
Query: 161 SDTLMDRMVLN-----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ ++ L+ ANL A+L+ +L +DL GA + A+ S+A I+ A+
Sbjct: 349 IEAILSEADLSGANLRRANLIKAILIEAILIEADLRGADLRWANLSEADIE----NAIFI 404
Query: 216 YANGTNP 222
A G P
Sbjct: 405 DATGITP 411
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 10/150 (6%)
Query: 93 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
T+ EF A A+L KA+ R +++ D SG+ GA L A+
Sbjct: 203 TKAEFT-TDAKVIEKAELIKAI------REGTIDKTTLQQVDLSGAILRGAILIGAILRG 255
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 209
AN +GA+LSD ++ +L+ A L+ A L L+ +DL GA + GA+ S+A + DL++
Sbjct: 256 ANLSGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSE 315
Query: 210 KQALCKYANGTNPITGVSTRKSLGCGNSRR 239
+G N I R +L N RR
Sbjct: 316 ADLSEADLSGANLIDANLRRANLIKANLRR 345
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 1/103 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR+A +K N RRAN A + E+D SG+ A L KA+ +A ADL
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEAILIEADL 383
Query: 161 SDTLMDRMVLNEANLTNAVLVR-TVLTRSDLGGAIIEGADFSD 202
+ L+EA++ NA+ + T +T I GA F D
Sbjct: 384 RGADLRWANLSEADIENAIFIDATGITPEQKQDLIRRGAIFGD 426
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A LR A+ + A + AD+ +D SG+ GA L +A +A+ + ADL
Sbjct: 259 SGANLSDAILRGAILSRAFLSGAFLSEADLSGADLSGANLRGANLSEADLSEADLSEADL 318
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S+ + L +ANL A L++ L R++L AI+ AD S A
Sbjct: 319 SEADLSGANLIDANLRRANLIKANLRRANLIEAILSEADLSGA 361
>gi|163797791|ref|ZP_02191737.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
gi|159176913|gb|EDP61479.1| hypothetical protein BAL199_22152 [alpha proteobacterium BAL199]
Length = 427
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 71/157 (45%), Gaps = 37/157 (23%)
Query: 84 ADLNK---YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFR------RANFTSADMR 131
ADLN A+ RG F GS A ADLR + N R+N +DM
Sbjct: 78 ADLNHALLIRADLRGAFMRGSNLAGANLKEADLRGGALISGNLAAPATIIRSNIGQSDMD 137
Query: 132 ESDFSGSKFNG----------AYLEKAVAYKANFTG----------ADLSDTLMD--RMV 169
E+D G+ +G A LEK + AN +G ADLS + R++
Sbjct: 138 EADMGGANLSGTDLSHSSMIGATLEKTLLCGANLSGVNLEGANLQGADLSGANLSSARII 197
Query: 170 ---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L+ ANL+ A++ RT +S+L GAI+E D S A
Sbjct: 198 GANLSGANLSGALIHRTQFQKSELHGAILENVDLSTA 234
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 15/115 (13%)
Query: 108 ADLRKAVHVKENFRR----------ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
ADLR+A+ V RR +N + AD+R+S+ +G GA L A A+ T
Sbjct: 294 ADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDSELAGINLAGANLTNARIAGADLTS 353
Query: 158 ADL---SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+L R+ + +NL+ AVLV LT + L GA ++GAD + A + AQ
Sbjct: 354 VELKGPDGQATGRLWV--SNLSGAVLVNADLTGARLTGANLKGADLTGAKLARAQ 406
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 3/94 (3%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN + ++ ++ G+ +GA L A AN +GA+LS L+ R ++ L A+L
Sbjct: 169 ANLSGVNLEGANLQGADLSGANLSSARIIGANLSGANLSGALIHRTQFQKSELHGAILEN 228
Query: 183 TVLTRSDLGGAII---EGADFSDAVIDLAQKQAL 213
L+ +DL GA + +G S ++ D+ + A+
Sbjct: 229 VDLSTADLSGANLTSGDGRGLSRSLRDILHEHAV 262
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 15/115 (13%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSG---------------SKFNGAYLEKA 148
QF ++L A+ + A+ + A++ D G + G +A
Sbjct: 215 QFQKSELHGAILENVDLSTADLSGANLTSGDGRGLSRSLRDILHEHAVWIREQGRGGSRA 274
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
K TG D+SD + L EA L +AV+ RT L SDL G+ + GAD D+
Sbjct: 275 QLAKTELTGIDVSDVNLSGADLREAILVSAVMRRTSLVMSDLSGSNLSGADLRDS 329
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 50/93 (53%), Gaps = 10/93 (10%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
R++ + N N ++ + + +K +GAY+ +A+ +GADLS + M L
Sbjct: 21 RRSGGARANLSGCNLADFNLAQVNLQSAKLSGAYMARAI-----LSGADLSYSDMFCANL 75
Query: 171 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ A+L +A+L+ R+DL GA + G++ + A
Sbjct: 76 DGADLNHALLI-----RADLRGAFMRGSNLAGA 103
>gi|448449600|ref|ZP_21591825.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
13561]
gi|445813229|gb|EMA63210.1| pentapeptide repeat-containing protein [Halorubrum litoreum JCM
13561]
Length = 822
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL AV + A+ + E+D SG+ GA L +A+ T ADLS+
Sbjct: 178 ASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKEADLTNADLSN 237
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ R+ L +A+L AVL +T +DL GA++ AD
Sbjct: 238 ADLYRVDLTDADLEGAVLTDADITDADLEGAVLTDADL 275
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 21/148 (14%)
Query: 77 SSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKE-----NFRRANFTSADM 130
S +I ADL+K + G + A G A+L A V+ N R A+ T AD+
Sbjct: 16 SEDIEPSADLSKVDLSDADLSGADLTNAYLGGANLSNATLVEADLTGANLRDADLTDADL 75
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT------- 183
+D + + G L A +A+ T A L + +L EA+LT+A L RT
Sbjct: 76 YRTDLTDAYLEGVNLSGATPVEADLTDASLKRANLSSTILMEADLTDADLYRTDFTDAYL 135
Query: 184 --------VLTRSDLGGAIIEGADFSDA 203
L+ SDL A +EGA+ +DA
Sbjct: 136 EGANLTNAYLSGSDLTNAYLEGANLTDA 163
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 69/147 (46%), Gaps = 10/147 (6%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L + N EA+ G G+ A LR+A N + T+A +RE+D +G+ G
Sbjct: 450 LTNANLREADLTGAHLKGT--DLTDASLREADLTDVNLEEIDLTNASLREADLTGAHLEG 507
Query: 143 -----AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
A+LE AN ADL+ +++ L ANLT+A L L+ +DL + G
Sbjct: 508 VDLTGAHLEGIDLTSANLNQADLTSANLNQADLRGANLTDASLREANLSGADLTDTELSG 567
Query: 198 ADFSDAVI---DLAQKQALCKYANGTN 221
AD S + DL + ++L +G N
Sbjct: 568 ADLSRTDLEKSDLHKSKSLPTNLSGAN 594
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 49/105 (46%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
E + + A ADL AV + + T A+++ +D +G+ A L A A
Sbjct: 251 EGAVLTDADITDADLEGAVLTDADLEGTDLTGANLKVADLTGANLKVADLTGADLEDAVL 310
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
T ADL T + L A+LT A L LT DLGGA++ AD
Sbjct: 311 TDADLERTDLIEASLLSADLTGASLKEADLTEVDLGGAVLTDADL 355
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 5/105 (4%)
Query: 103 AQFGSADLRKAVHVKEN-----FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A DL V N FR ++ T A +R SD S + GA+LE A+
Sbjct: 378 ADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTDASLRE 437
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
ADL+D ++ + L ANL A L L +DL A + AD +D
Sbjct: 438 ADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTD 482
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 11/120 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A LR+A N + T+A++RE+D +G+ G L A +A+ T +L + +
Sbjct: 433 ASLREADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASLREADLTDVNLEEIDLTN 492
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALCKYANGTN 221
L EA+LT A L DL GA +EG D + A ++ A QA + AN T+
Sbjct: 493 ASLREADLTGAHLEGV-----DLTGAHLEGIDLTSANLNQADLTSANLNQADLRGANLTD 547
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 51/96 (53%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L +AV + A+ A + ++D SG+ L +A A+ TGA+L +
Sbjct: 168 AELPRAVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANLRHGRLKE 227
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L A+L+NA L R LT +DL GA++ AD +DA
Sbjct: 228 ADLTNADLSNADLYRVDLTDADLEGAVLTDADITDA 263
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 2/100 (2%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL AV + A+ A + ++D G+ GA L+ A AN ADL+ ++
Sbjct: 248 ADLEGAVLTDADITDADLEGAVLTDADLEGTDLTGANLKVADLTGANLKVADLTGADLED 307
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
VL +A+L L+ L +DL GA ++ AD ++ +DL
Sbjct: 308 AVLTDADLERTDLIEASLLSADLTGASLKEADLTE--VDL 345
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + N AD+ ++D + F AYLE A A +G+DL++ ++
Sbjct: 98 ADLTDASLKRANLSSTILMEADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEG 157
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
L +A+ A L R VLT + L GA + GA +D
Sbjct: 158 ANLTDASPIGAELPRAVLTDASLLGADLPGAVLTD 192
Score = 43.9 bits (102), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 3/131 (2%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR + + A+ ++AD+ D + + GA L A A+ GA L
Sbjct: 211 SGADLTGANLRHGRLKEADLTNADLSNADLYRVDLTDADLEGAVLTDADITDADLEGAVL 270
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+D ++ L ANL A L L +DL GA +E A +DA + ++ L + + +
Sbjct: 271 TDADLEGTDLTGANLKVADLTGANLKVADLTGADLEDAVLTDADL---ERTDLIEASLLS 327
Query: 221 NPITGVSTRKS 231
+TG S +++
Sbjct: 328 ADLTGASLKEA 338
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 5/93 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL K ++ + A+ T A++R + A L A Y+ + T ADL +
Sbjct: 198 ADLIKTGLIEADLSGADLTGANLRHGRLKEADLTNADLSNADLYRVDLTDADL-----EG 252
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
VL +A++T+A L VLT +DL G + GA+
Sbjct: 253 AVLTDADITDADLEGAVLTDADLEGTDLTGANL 285
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADL A + + + A + ++D G+ AYL A+ ADL++
Sbjct: 323 ASLLSADLTGASLKEADLTEVDLGGAVLTDADLEGTALTEAYLPSPDLTGASLKEADLTE 382
Query: 163 TLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ VL +ANL T+A L + L+ +DL GA +EG D +DA +
Sbjct: 383 VDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTDASL 435
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + +F A A++ + SGS AYLE A A+ GA+L R
Sbjct: 118 ADLTDADLYRTDFTDAYLEGANLTNAYLSGSDLTNAYLEGANLTDASPIGAELP-----R 172
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGA------IIE----GADFSDAVIDLAQKQALCKYA 217
VL +A+L A L VLT +DL GA +IE GAD + A + + K A
Sbjct: 173 AVLTDASLLGADLPGAVLTDTDLSGADLIKTGLIEADLSGADLTGANL----RHGRLKEA 228
Query: 218 NGTN 221
+ TN
Sbjct: 229 DLTN 232
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 7/131 (5%)
Query: 81 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 140
+ L D N +E RG I A+ GS DL + + T A +RE+D +
Sbjct: 388 TVLTDANLRFSEFRGS-DITDASLRGS-DLSNTDLTGAHLEGIDLTDASLREADLTDVNL 445
Query: 141 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAII 195
L A +A+ TGA L T + L EA+LT+ L LT +DL GA +
Sbjct: 446 EEIDLTNANLREADLTGAHLKGTDLTDASLREADLTDVNLEEIDLTNASLREADLTGAHL 505
Query: 196 EGADFSDAVID 206
EG D + A ++
Sbjct: 506 EGVDLTGAHLE 516
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 5/103 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSD 162
A L++A + + T A++R S+F GS A L + + TGA DL+D
Sbjct: 373 ASLKEADLTEVDLEGTVLTDANLRFSEFRGSDITDASLRGSDLSNTDLTGAHLEGIDLTD 432
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L + NL L L +DL GA ++G D +DA +
Sbjct: 433 ASLREADLTDVNLEEIDLTNANLREADLTGAHLKGTDLTDASL 475
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
++A ADLR A + R AN + AD+ +++ SG+ + LEK+ +K+ +L
Sbjct: 531 TSANLNQADLRGANLTDASLREANLSGADLTDTELSGADLSRTDLEKSDLHKSKSLPTNL 590
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
S + + L+E NL++ L R L +L G
Sbjct: 591 SGANLRGLNLSEQNLSSVNLSRADLRDVNLIG 622
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 20/93 (21%)
Query: 123 ANFTSADMRESDFSGSKFNGAY-----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
A+ + D+ ++D SG+ AY L A +A+ TGA+L D A+LT+
Sbjct: 23 ADLSKVDLSDADLSGADLTNAYLGGANLSNATLVEADLTGANLRD----------ADLTD 72
Query: 178 AVLVRTVLTRSDLGGAIIEG-----ADFSDAVI 205
A L RT LT + L G + G AD +DA +
Sbjct: 73 ADLYRTDLTDAYLEGVNLSGATPVEADLTDASL 105
>gi|428296910|ref|YP_007135216.1| RDD domain-containing protein [Calothrix sp. PCC 6303]
gi|428233454|gb|AFY99243.1| RDD domain containing protein [Calothrix sp. PCC 6303]
Length = 718
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 10/116 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 155
S+AQ ADLR AV + A+ A + E++ G++ N GA L A K ++
Sbjct: 540 SSAQMVGADLRNAVLENASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLTKTDW 599
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
ADLS + +DR+ NLTNA L LT + L A +EGA+ +A + LA Q
Sbjct: 600 QAADLSGSYLDRV-----NLTNANLSTARLTGAILRSANLEGANLRNADLTLADFQ 650
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 11/110 (10%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
GE + A +G A L +A+ + AN T D + +D SGS YL++ N
Sbjct: 565 GEAKLNEAELYG-ARLNRAIAIGAQLSYANLTKTDWQAADLSGS-----YLDRV-----N 613
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
T A+LS + +L ANL A L LT +D GA + DF A+
Sbjct: 614 LTNANLSTARLTGAILRSANLEGANLRNADLTLADFQGANVANVDFQGAI 663
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 14/98 (14%)
Query: 123 ANFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRM 168
NF A++ ++ F S+F G A L +A +ANF+ A+LS L+++
Sbjct: 458 VNFKGANLDQASFKNSRFRGPGDDGLWDTFDDAIADLSQAQLKQANFSEANLSRVLLNKS 517
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L+ + L A L + L ++L A + GAD +AV++
Sbjct: 518 DLSRSTLNKANLAGSRLIGANLSSAQMVGADLRNAVLE 555
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 51/119 (42%), Gaps = 20/119 (16%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFN--------------------GAYLEK 147
ADL +A + NF AN + + +SD S S N GA L
Sbjct: 492 ADLSQAQLKQANFSEANLSRVLLNKSDLSRSTLNKANLAGSRLIGANLSSAQMVGADLRN 551
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
AV A+ TGADL + ++ L A L A+ + L+ ++L + AD S + +D
Sbjct: 552 AVLENASLTGADLGEAKLNEAELYGARLNRAIAIGAQLSYANLTKTDWQAADLSGSYLD 610
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 15/120 (12%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG------ 157
Q DLR+ V + + N S + D G F GA L++A + F G
Sbjct: 425 QMKKVDLRR-VRLGQTIDGQNTFSLSLDRVDLWGVNFKGANLDQASFKNSRFRGPGDDGL 483
Query: 158 --------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
ADLS + + +EANL+ +L ++ L+RS L A + G+ A + AQ
Sbjct: 484 WDTFDDAIADLSQAQLKQANFSEANLSRVLLNKSDLSRSTLNKANLAGSRLIGANLSSAQ 543
>gi|318042736|ref|ZP_07974692.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0101]
Length = 164
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + + R A+ AD+R S+ G+ +GA L A+ + + ADLSD
Sbjct: 54 ADLSGLLLNGIDLRDADLRGADLRGSNLEGADLSGADLRGAMLQDSWLSNADLSD----- 108
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L +ANL +AVL++ + L GA++ GADF+
Sbjct: 109 VDLRQANLRDAVLIQALTPGLQLEGAVLIGADFT 142
>gi|111023196|ref|YP_706168.1| hypothetical protein RHA1_ro06233 [Rhodococcus jostii RHA1]
gi|110822726|gb|ABG98010.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length = 201
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYL 145
+E R E I + F ADL ++ HV FR +FT ++ R F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 146 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 195
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 196 EGADFSDAVID 206
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|119356056|ref|YP_910700.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
DSM 266]
gi|119353405|gb|ABL64276.1| pentapeptide repeat protein [Chlorobium phaeobacteroides DSM 266]
Length = 446
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +ADLR + + ++A+ AD+RE+ A++EK++ KAN A+L
Sbjct: 82 SGANLNNADLRGSNLQQAFIKKADLKGADLREAYLVKVNLKEAFMEKSMLQKANLQSANL 141
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
T R L +NL +AVL T +DL GA ++GA
Sbjct: 142 RWTRFHRADLAGSNLQDAVLFETSFVDADLRGANLKGA 179
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA-----YLEKAVAYKANFTG 157
A F A+L +A+ + +A+F ADM++ G+ +GA ++E A AN +G
Sbjct: 307 ADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDRSFMEGADLRNANLSG 366
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+L ++ L+ ANL+ A L T L ++L GA ++GA+
Sbjct: 367 ANLFGAMLKDANLSGANLSGASLFETDLEGANLSGANLKGANL 409
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 49/96 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ +L+KA +F AN A M +D S + F A ++K AN +GA+L
Sbjct: 292 ARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLSGANLDR 351
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
+ M+ L ANL+ A L +L ++L GA + GA
Sbjct: 352 SFMEGADLRNANLSGANLFGAMLKDANLSGANLSGA 387
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGAD---- 159
DL KA + AN +++ + ++ SG+ N G+ L++A KA+ GAD
Sbjct: 55 DLDKAKLEDADLEGANLSNSSLVRAELSGANLNNADLRGSNLQQAFIKKADLKGADLREA 114
Query: 160 ------LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
L + M++ +L +ANL +A L T R+DL G+ ++ A
Sbjct: 115 YLVKVNLKEAFMEKSMLQKANLQSANLRWTRFHRADLAGSNLQDA 159
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 46/89 (51%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+E A +++++ G+ F A L++A+ A+ + AD M ++ L ANL+
Sbjct: 286 EEKLENARLKGVNLQKASMPGADFEDANLDEAMMEGADLSKADFQKADMKKVKLQGANLS 345
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L R+ + +DL A + GA+ A++
Sbjct: 346 GANLDRSFMEGADLRNANLSGANLFGAML 374
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 44/95 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F AD++K N AN + M +D + +GA L A+ AN +GA+L
Sbjct: 325 SKADFQKADMKKVKLQGANLSGANLDRSFMEGADLRNANLSGANLFGAMLKDANLSGANL 384
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S + L ANL+ A L L +L AII
Sbjct: 385 SGASLFETDLEGANLSGANLKGANLVEPNLKNAII 419
>gi|254182800|ref|ZP_04889393.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
gi|184213334|gb|EDU10377.1| pentapeptide repeat protein [Burkholderia pseudomallei 1655]
Length = 825
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSGADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.0 bits (84), Expect = 8.0, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSGADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|219849225|ref|YP_002463658.1| pentapeptide repeat-containing protein [Chloroflexus aggregans DSM
9485]
gi|219543484|gb|ACL25222.1| pentapeptide repeat protein [Chloroflexus aggregans DSM 9485]
Length = 311
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 56/114 (49%), Gaps = 13/114 (11%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLRKA N A A++R ++ S + F+GA L A N +GADL D
Sbjct: 89 ADLSDADLRKADLSWANLEFATLIGANLRGANLSAADFSGANLYGANLSLCNLSGADLRD 148
Query: 163 TLMDRMVLNE-------------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T+M L+E ANL+ A+L+R L ++L GA + GA+ A
Sbjct: 149 TVMIGANLSEAQLREAQLVNLSGANLSGAILLRVSLNGANLNGANLAGANLMHA 202
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN A++RE+ GA L + +A+ AD SD + + L+ A+L NA
Sbjct: 193 NLAGANLMHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNA 252
Query: 179 VLVRTVLTRSDLGGAIIEGADF 200
+ R L+R++L GA + GA+
Sbjct: 253 IFTRANLSRANLSGANLRGANL 274
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 29/55 (52%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A LR A+ + N RAN + A++R ++ G A L A A+ T ADL+D
Sbjct: 247 AHLRNAIFTRANLSRANLSGANLRGANLRGVNLREASLADADLTDADLTDADLTD 301
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 14/128 (10%)
Query: 83 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
L D N Y A+ + E G A A+LR+A RA+ + AD+R++D S +
Sbjct: 51 LTDANLYRADLSICELG---EANLSWANLREAKLNWAQLVRADLSDADLRKADLSWANLE 107
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A L A AN + AD S ANL A L L+ +DL ++ GA+ S
Sbjct: 108 FATLIGANLRGANLSAADFSG----------ANLYGANLSLCNLSGADLRDTVMIGANLS 157
Query: 202 DAVIDLAQ 209
+A + AQ
Sbjct: 158 EAQLREAQ 165
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N A+ +AD +++ SG +GA+L A+ +AN + A+LS + L NL
Sbjct: 221 ETNLSEASLCNADFSDANLSGIYLSGAHLRNAIFTRANLSRANLSGANLRGANLRGVNLR 280
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDA 203
A L LT +DL A + D S A
Sbjct: 281 EASLADADLTDADLTDADLTDCDLSGA 307
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 46/105 (43%), Gaps = 5/105 (4%)
Query: 103 AQFGSADLRKAVH-----VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A A+LR+A + N N + A + +DFS + +G YL A A FT
Sbjct: 197 ANLMHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNAIFTR 256
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A+LS + L ANL L L +DL A + AD +D
Sbjct: 257 ANLSRANLSGANLRGANLRGVNLREASLADADLTDADLTDADLTD 301
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN + A++ ++ SG+ + A L +A AN ADLS + L+ ANL A L
Sbjct: 24 ANLSGANLSAANLSGANLSEAKLSRARLTDANLYRADLSICELGEANLSWANLREAKLNW 83
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLA 208
L R+DL A + AD S A ++ A
Sbjct: 84 AQLVRADLSDADLRKADLSWANLEFA 109
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 50/114 (43%), Gaps = 4/114 (3%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A+ R IG A A LR+A V N AN + A + +G+ NGA L A
Sbjct: 144 ADLRDTVMIG--ANLSEAQLREAQLV--NLSGANLSGAILLRVSLNGANLNGANLAGANL 199
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
AN A L + L+E NL+ A L + ++L G + GA +A+
Sbjct: 200 MHANLREATLDEVNCIGANLSETNLSEASLCNADFSDANLSGIYLSGAHLRNAI 253
>gi|53715998|ref|YP_106439.1| pentapeptide repeat-containing protein [Burkholderia mallei ATCC
23344]
gi|121597894|ref|YP_990510.1| pentapeptide repeat-containing protein [Burkholderia mallei SAVP1]
gi|124382797|ref|YP_001025000.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
10229]
gi|126447556|ref|YP_001079344.1| pentapeptide repeat-containing protein [Burkholderia mallei NCTC
10247]
gi|166999172|ref|ZP_02265018.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
gi|238561876|ref|ZP_00441284.2| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
4]
gi|254176522|ref|ZP_04883180.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
gi|254203434|ref|ZP_04909795.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
gi|254205313|ref|ZP_04911666.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
gi|254356120|ref|ZP_04972397.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
gi|52421968|gb|AAU45538.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 23344]
gi|121225692|gb|ABM49223.1| pentapeptide repeat family protein [Burkholderia mallei SAVP1]
gi|126240410|gb|ABO03522.1| pentapeptide repeat family protein [Burkholderia mallei NCTC 10247]
gi|147745673|gb|EDK52752.1| pentapeptide repeat family protein [Burkholderia mallei FMH]
gi|147754899|gb|EDK61963.1| pentapeptide repeat family protein [Burkholderia mallei JHU]
gi|148025103|gb|EDK83272.1| pentapeptide repeat family protein [Burkholderia mallei 2002721280]
gi|160697564|gb|EDP87534.1| pentapeptide repeat family protein [Burkholderia mallei ATCC 10399]
gi|238523698|gb|EEP87135.1| pentapeptide repeat family protein [Burkholderia mallei GB8 horse
4]
gi|243064727|gb|EES46913.1| pentapeptide repeat family protein [Burkholderia mallei PRL-20]
gi|261826983|gb|ABM99323.2| pentapeptide repeat family protein [Burkholderia mallei NCTC 10229]
Length = 825
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSGADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 7.9, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSGADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|226194659|ref|ZP_03790253.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
gi|386863935|ref|YP_006276883.1| type VI secretion system [Burkholderia pseudomallei 1026b]
gi|418534996|ref|ZP_13100802.1| type VI secretion system [Burkholderia pseudomallei 1026a]
gi|225933225|gb|EEH29218.1| pentapeptide repeat protein [Burkholderia pseudomallei Pakistan 9]
gi|385357281|gb|EIF63347.1| type VI secretion system [Burkholderia pseudomallei 1026a]
gi|385661063|gb|AFI68485.1| type VI secretion system [Burkholderia pseudomallei 1026b]
Length = 825
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSGADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 7.9, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSGADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|167913453|ref|ZP_02500544.1| pentapeptide repeat family protein [Burkholderia pseudomallei 112]
gi|403521532|ref|YP_006657101.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
BPC006]
gi|403076599|gb|AFR18178.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
BPC006]
Length = 825
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSGADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 7.9, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSGADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|254299592|ref|ZP_04967041.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
gi|418542641|ref|ZP_13108060.1| type VI secretion system [Burkholderia pseudomallei 1258a]
gi|418549165|ref|ZP_13114243.1| type VI secretion system [Burkholderia pseudomallei 1258b]
gi|157809489|gb|EDO86659.1| pentapeptide repeat protein [Burkholderia pseudomallei 406e]
gi|385355180|gb|EIF61399.1| type VI secretion system [Burkholderia pseudomallei 1258a]
gi|385356028|gb|EIF62174.1| type VI secretion system [Burkholderia pseudomallei 1258b]
Length = 825
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSGADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.48, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.0 bits (84), Expect = 8.1, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSGADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
Length = 319
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
Q ADLR +FR +F+ A++RE DF+G+ AYL +A N TGA+L T
Sbjct: 25 QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLTGANLEGT 84
Query: 164 LMDRMVLNEAN-----LTNAVLVRTVLTRSD----------LGGAIIEGADFSDAVID 206
+ ++ L +AN + A L LT+SD L G + GA DA D
Sbjct: 85 SLIKIYLIKANCYQTDFSGAYLTGAYLTKSDFKEAKFNGAYLNGTKLSGAKLGDAYYD 142
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 107 SADLRKAVHV-KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 165
S DL++ + NF+ AD+R + S + F G A + +FTGADL D +
Sbjct: 7 SIDLKERYEKGQRNFQEFQLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYL 66
Query: 166 DRMVLNEANLTNAVLVRTVLTR----------SDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ L NLT A L T L + +D GA + GA + + D + +
Sbjct: 67 NEADLTGVNLTGANLEGTSLIKIYLIKANCYQTDFSGAYLTGAYLTKS--DFKEAKFNGA 124
Query: 216 YANGT 220
Y NGT
Sbjct: 125 YLNGT 129
>gi|378582929|ref|ZP_09831540.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
DC283]
gi|377814439|gb|EHT97579.1| hypothetical protein CKS_5479 [Pantoea stewartii subsp. stewartii
DC283]
Length = 375
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA---VAY--KANF 155
S A +ADL++A N A+ T+A++ ++D +GA L A +AY +A+
Sbjct: 250 SNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADL 309
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ A+LS+ + R L++ANL++A L L R+DL AI++GA+
Sbjct: 310 SNANLSNADLKRADLSDANLSDANLTNVDLKRADLSNAILKGANL 354
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-----ANF 155
S A A+L A N AN T A + E+D S + +GA L A + N
Sbjct: 170 SDADLSDANLSDANLSGANLAHANLTMAYLSEADLSNANLSGADLTNANLNQTDLPNVNL 229
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+GA+L+ + L+EA+L+NA L L R+DL A + GAD ++A
Sbjct: 230 SGANLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNA 277
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM---------- 168
N AN T A + E+D S + + A L++A AN +GADL++ +++
Sbjct: 233 NLAHANLTMAYLSEADLSNANLSNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGA 292
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANLT A L L+ ++L A ++ AD SDA
Sbjct: 293 NLAHANLTMAYLSEADLSNANLSNADLKRADLSDA 327
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 51/108 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL A + AN D+ + SG+ A L A +A+ + A+L
Sbjct: 195 TMAYLSEADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMAYLSEADLSNANL 254
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S+ + R L+ ANL+ A L L ++DL + GA+ + A + +A
Sbjct: 255 SNADLKRADLSNANLSGADLTNANLNQTDLPNVNLSGANLAHANLTMA 302
>gi|443329141|ref|ZP_21057730.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791290|gb|ELS00788.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 174
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 53/90 (58%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEA 173
NF RAN + A +R S+ SG+ F A L+KA + NF+GA+L + + + L+EA
Sbjct: 36 NFIRANLSQAILRNSNLSGAFFVLADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEA 95
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LT L LT ++L GAI+ GA+ S+A
Sbjct: 96 VLTGTQLQGANLTEANLQGAILAGANLSEA 125
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANFTG 157
A ADL A+ + NF AN A++ +S S G++ GA L +A A G
Sbjct: 60 ADLQKADLSGAILIVVNFSGANLQEANLTQSKLSEAVLTGTQLQGANLTEANLQGAILAG 119
Query: 158 ADLSDTLMDRMVLNEAN-----LTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+LS+ + L AN L NA L +T ++L GA +EGA + +I
Sbjct: 120 ANLSEANLRGGDLRGANLYGVDLRNADLTDAKITHANLRGANLEGAIMPEQLI 172
>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 371
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 67/136 (49%), Gaps = 10/136 (7%)
Query: 72 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 131
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 132 ES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 186
E+ FSG+ +GAYL A KA+F A L+ + L EANL A L+ T
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADFHRASLAVANLIGANLTEANLREANLIDT--- 334
Query: 187 RSDLGGAIIEGADFSD 202
+L GA ++ A F +
Sbjct: 335 --NLSGATVKNAKFGE 348
>gi|440681919|ref|YP_007156714.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428679038|gb|AFZ57804.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 269
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 7/133 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ A+L +A NF A+ T AD+ + + G+ + A L AV AN G D
Sbjct: 77 SQAKLIEANLSQANLSIANFSGADLTQADLSQVNLIGANLSDANLRNAVITDANLIGTDF 136
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
S+ +LN+A+L A L+R+ L+ ++L GA + AD S+A +L + + Y
Sbjct: 137 SNA-----ILNDADLAAAKLIRSNLSFANLIGANLIAADLSEA--NLYDAEVMTAYLYKA 189
Query: 221 NPITGVSTRKSLG 233
N TR LG
Sbjct: 190 NLSKANLTRVHLG 202
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 23/126 (18%)
Query: 91 AETRGEFGIGSAAQ---FGSADLRKAVHVKENFRRANFTSADMRES----------DFSG 137
A +GE G+ Q F DL A+ V+ N AN T+A++ ++ + S
Sbjct: 34 ANLQGENLRGANLQGVNFTKVDLSHALLVRTNLMFANLTNANLSQAKLIEANLSQANLSI 93
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
+ F+GA L +A + N GA+LSD ANL NAV+ L +D AI+
Sbjct: 94 ANFSGADLTQADLSQVNLIGANLSD----------ANLRNAVITDANLIGTDFSNAILND 143
Query: 198 ADFSDA 203
AD + A
Sbjct: 144 ADLAAA 149
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADL A ++ N AN +AD+ E++ ++ AYL KA KAN
Sbjct: 137 SNAILNDADLAAAKLIRSNLSFANLIGANLIAADLSEANLYDAEVMTAYLYKANLSKANL 196
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T L + + + L+EANLTNA L + L ++L GA ++ A+ A
Sbjct: 197 TRVHLGSSYLFKANLSEANLTNADLSWSNLRYANLAGANLQRANLRGA 244
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
E D S + G L A NFT DLS L+ R L ANLTNA L + L ++L
Sbjct: 28 EIDLSTANLQGENLRGANLQGVNFTKVDLSHALLVRTNLMFANLTNANLSQAKLIEANLS 87
Query: 192 GAIIEGADFSDAVIDLAQ 209
A + A+FS A DL Q
Sbjct: 88 QANLSIANFSGA--DLTQ 103
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 2/102 (1%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR AV N +F++A + ++D + +K + L A AN ADLS+
Sbjct: 114 ANLSDANLRNAVITDANLIGTDFSNAILNDADLAAAKLIRSNLSFANLIGANLIAADLSE 173
Query: 163 -TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L D V+ A L A L + LTR LG + + A+ S+A
Sbjct: 174 ANLYDAEVM-TAYLYKANLSKANLTRVHLGSSYLFKANLSEA 214
Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 15/105 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A++ A K N +AN T + S YL KA +AN T ADL
Sbjct: 172 SEANLYDAEVMTAYLYKANLSKANLTRVHLGSS----------YLFKANLSEANLTNADL 221
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + L ANL A L R L ++L GA ++GA+ D ++
Sbjct: 222 SWS-----NLRYANLAGANLQRANLRGANLQGANLKGANLQDTIM 261
>gi|427723149|ref|YP_007070426.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354869|gb|AFY37592.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 508
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ LR+A N RR + + A + ++D S + GAYL A Y AN GA+L
Sbjct: 67 SGAKLSKVHLRQAYLYGTNLRRTHLSEAFLFKADLSKTNLYGAYLYGAYLYGANLYGANL 126
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S + L+EA+L+ A L L+ +DL G + AD S
Sbjct: 127 S-----KADLSEADLSEADLSEADLSEADLSGVSLSEADLS 162
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQF A L N A+ + AD+ +D SG+K + +L +A Y N LS+
Sbjct: 34 AQFSGAHLSGVNLSGVNLSGADLSGADLSGADLSGAKLSKVHLRQAYLYGTNLRRTHLSE 93
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L++ NL A L L ++L GA + AD S+A
Sbjct: 94 AFLFKADLSKTNLYGAYLYGAYLYGANLYGANLSKADLSEA 134
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 50/98 (51%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A K + A+ + AD+ E+D S + +G L +A N +G +LS +
Sbjct: 119 ANLYGANLSKADLSEADLSEADLSEADLSEADLSGVSLSEADLSGVNLSGVNLSGVNLSG 178
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L+ NL+ A L T+ S L GA ++ AD + A I
Sbjct: 179 VNLSGVNLSGAKLCHTLCKLSTLVGASLKSADLTGACI 216
>gi|428219102|ref|YP_007103567.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990884|gb|AFY71139.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 698
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 54/101 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A A+L A NF +AN A++R + SG +GA L A AN +GA+L
Sbjct: 67 TGANLTGANLTGANLTGANFSKANLRGANLRGVNLSGVNLSGANLSGANLSGANLSGANL 126
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S + R+ L+ AN +NA L L+ DL GA + GA+FS
Sbjct: 127 SGVNLSRVNLSGANFSNANLNNFDLSGFDLTGANLTGANFS 167
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 47/85 (55%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N ANF++A++ D SG +G L A AN +GA+LS+ + + L + NL+ A
Sbjct: 210 NLSGANFSNANLNNFDLSGFDLSGVNLSGANLSGANLSGANLSEANLSEVDLYQINLSGA 269
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L R LT ++L GA GA+ S A
Sbjct: 270 NLSRIDLTGANLSGANFSGANLSGA 294
Score = 44.3 bits (103), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L + N ANF++A++ D SG GA L A NF+G +L
Sbjct: 117 SGANLSGANLSGVNLSRVNLSGANFSNANLNNFDLSGFDLTGANLTGA-----NFSGVNL 171
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + R L+ AN +NA L L+ DL G + GA+ S A
Sbjct: 172 SGVNLSRANLSGANFSNANLNNFDLSGFDLSGVNLSGANLSGA 214
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 13/122 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A DL + N R + T A++ ++FSG+ +GA A + +G DL
Sbjct: 252 SEANLSEVDLYQINLSGANLSRIDLTGANLSGANFSGANLSGANFSNANLNNFDLSGFDL 311
Query: 161 SDTLMDRMVLNEANLTNAVL---------VRTV-LTRSDLGGAIIEGADFSDA---VIDL 207
S + L+ ANL+ A L +R + L+ +DLGG + GA+ S+A +DL
Sbjct: 312 SGVNLSGANLSGANLSGANLNNFDLSGFDLRGINLSGADLGGTNLSGANLSEANLSEVDL 371
Query: 208 AQ 209
Q
Sbjct: 372 YQ 373
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 47/91 (51%), Gaps = 2/91 (2%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN + A++ D SG G L A N +GA+LS+ + + L + NL+ A
Sbjct: 320 NLSGANLSGANLNNFDLSGFDLRGINLSGADLGGTNLSGANLSEANLSEVDLYQINLSGA 379
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L R LT ++L GA + A+ ++ +DL Q
Sbjct: 380 NLSRIDLTGANLTGANLSEANLNE--VDLYQ 408
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%)
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
+RA AD+R +D +G+ GA L A AN TGA+ S + L NL+ L
Sbjct: 47 KRAYLRGADLRGADLTGANLTGANLTGANLTGANLTGANFSKANLRGANLRGVNLSGVNL 106
Query: 181 VRTVLTRSDLGGAIIEGADFS 201
L+ ++L GA + GA+ S
Sbjct: 107 SGANLSGANLSGANLSGANLS 127
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 2/95 (2%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN + A++ E D +GA L + AN TGA+LS+ ++ + L + NL+ A
Sbjct: 355 NLSGANLSEANLSEVDLYQINLSGANLSRIDLTGANLTGANLSEANLNEVDLYQINLSGA 414
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
L + DLGG ++ + + A +L + +AL
Sbjct: 415 NLSKVNFQGFDLGGFDLKNVNLTGA--NLREVKAL 447
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 55/128 (42%), Gaps = 25/128 (19%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF----- 155
+ A F +L + N ANF++A++ D SG +G L A ANF
Sbjct: 162 TGANFSGVNLSGVNLSRANLSGANFSNANLNNFDLSGFDLSGVNLSGANLSGANFSNANL 221
Query: 156 ---------------TGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAII 195
+GA+LS + L+EANL+ L + L+R DL GA +
Sbjct: 222 NNFDLSGFDLSGVNLSGANLSGANLSGANLSEANLSEVDLYQINLSGANLSRIDLTGANL 281
Query: 196 EGADFSDA 203
GA+FS A
Sbjct: 282 SGANFSGA 289
Score = 37.4 bits (85), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
R N + AD+ ++ SG+ + A L + Y+ N +GA+LS R+ L ANLT A
Sbjct: 341 LRGINLSGADLGGTNLSGANLSEANLSEVDLYQINLSGANLS-----RIDLTGANLTGAN 395
Query: 180 LVRTVLTRSDLGGAIIEGADFS 201
L L DL + GA+ S
Sbjct: 396 LSEANLNEVDLYQINLSGANLS 417
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A +++A + R A+ T A++ ++ +G+ GA L A KAN GA+L
Sbjct: 39 ANLSEAYVKRAYLRGADLRGADLTGANLTGANLTGANLTGANLTGANFSKANLRGANLRG 98
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVID 206
+ + L+ ANL+ A L L+ ++L G + GA+FS+A ++
Sbjct: 99 VNLSGVNLSGANLSGANLSGANLSGANLSGVNLSRVNLSGANFSNANLN 147
>gi|334121293|ref|ZP_08495365.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333455228|gb|EGK83883.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 299
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 9/120 (7%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
LN YE R +F + A A+L A+ + N RAN + A++ + + + GA L
Sbjct: 7 LNNYEKGHR-DF---TGADLSGANLSGAILIGVNLSRANLSGANLSRAFLTKATLQGAVL 62
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ N + A + +T + L +ANL+ A LV+ L R+ L GA + GA+ AV+
Sbjct: 63 QRT-----NLSFAKMGETQLSGADLTKANLSGAFLVKAKLPRAKLSGATLTGANLRGAVL 117
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 38/85 (44%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF+ AN A + + G++ G L +A N GADLS + L ANL +
Sbjct: 141 NFKWANLYGAKLNSAKLFGAQLTGVSLRRAQLTGVNLCGADLSGVNVSEAKLMGANLEGS 200
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L + + L G + GA+ + A
Sbjct: 201 NLTGANFSAAQLRGVKLAGANLTGA 225
>gi|134280632|ref|ZP_01767342.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
gi|134247654|gb|EBA47738.1| pentapeptide repeat protein [Burkholderia pseudomallei 305]
Length = 825
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T AD+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGADLSGMDLRGARLAGAMLENADLSGADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.49, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.0 bits (84), Expect = 8.6, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSGADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
Length = 251
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 66/137 (48%), Gaps = 8/137 (5%)
Query: 94 RGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA--- 148
R FG + A FGSAD+ ++ + ANF+ +++ SDFSG+ +GA + KA
Sbjct: 104 RANFGQANLTGADFGSADMNRSNFAQVKAAGANFSKSELNRSDFSGADLSGANISKAELA 163
Query: 149 -VAYK-ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVI 205
V ++ A G D S + + R L+ NL + L + +GGA + GA + I
Sbjct: 164 RVLFQSAKIAGVDFSYSNLSRSRLDGLNLQGVNFTGSYLYLTQIGGADLSGATGLTQEQI 223
Query: 206 DLAQKQALCKYANGTNP 222
D+A A K NP
Sbjct: 224 DIACGSAQTKLPPSINP 240
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 36/199 (18%)
Query: 33 PLWVACQISSKTESDGQFPG-PYAKLKNWRVFVS------TALAAAVVASCSSNISALAD 85
P W CQ DG PG ++ R+ ++ T +++ S +A
Sbjct: 22 PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVK-----ENFRRANFT-----SADMRESDF 135
N E E S +F ADL KA K NF +AN T SADM S+F
Sbjct: 75 ANLSETEV-------SRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNF 127
Query: 136 SGSKFNGAYLEKAVAYKANFTGADL-----SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ K GA K+ +++F+GADL S + R++ A + + L+RS L
Sbjct: 128 AQVKAAGANFSKSELNRSDFSGADLSGANISKAELARVLFQSAKIAGVDFSYSNLSRSRL 187
Query: 191 GGAIIEGADFSDAVIDLAQ 209
G ++G +F+ + + L Q
Sbjct: 188 DGLNLQGVNFTGSYLYLTQ 206
>gi|406937704|gb|EKD71085.1| hypothetical protein ACD_46C00278G0012 [uncultured bacterium]
Length = 585
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 75/148 (50%), Gaps = 14/148 (9%)
Query: 68 LAAAVVASCSSNISALADLN-KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT 126
+ AV+ + + + + D N +Y T+ F S + +++L KA+ NF AN +
Sbjct: 324 MNEAVLICTNMSDTTITDTNLQYANLTKTNF---SKSNLSNSNLSKAIFQGTNFSEANLS 380
Query: 127 SADMRESDFSGSKFNGAYLEKA----VAYKA-NFTGADLSDTLMDRMVLNEANLTNAVLV 181
A M+ESD S F+ L A +K+ NF+GADL ++ L+ A+L+NA L+
Sbjct: 381 HAIMKESDCSNIDFSNLCLYHANLANTKFKSTNFSGADLQKAILTDCDLSNADLSNANLI 440
Query: 182 RTVLTR-----SDLGGAIIEGADFSDAV 204
LTR +DL +E A +DA+
Sbjct: 441 HANLTRAYLGETDLSTTNLEHATLTDAM 468
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 10/100 (10%)
Query: 112 KAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+A V+ NF +NF DM+ + + F+ A L NF ADLS+T
Sbjct: 206 EACFVEANFTNSNFVKTRFFLCDMQRINAMNTDFSSAIL-----MGTNFANADLSNTNFT 260
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L++A L ++L T+L +L GA ++G + + +D
Sbjct: 261 NANLSQAKLDRSILTNTILKNVNLSGASLQGVSYPNKKLD 300
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N+ A++ S+F+ F+ + +AV N + ++DT L ANLT ++
Sbjct: 303 NWNKAELSHSNFNNDDFSFCDMNEAVLICTNMSDTTITDT-----NLQYANLTKTNFSKS 357
Query: 184 VLTRSDLGGAIIEGADFSDA 203
L+ S+L AI +G +FS+A
Sbjct: 358 NLSNSNLSKAIFQGTNFSEA 377
>gi|359457996|ref|ZP_09246559.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 464
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 68/129 (52%), Gaps = 20/129 (15%)
Query: 103 AQFGSADLR----KAVHVKE-NFR----------RANFTSADMRESDFSGSKFNGAYLEK 147
A+ G ADLR K ++KE N R RA+ AD+RE++ S ++ + LEK
Sbjct: 36 AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95
Query: 148 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A+ ++AN + A L+ + ++ L +ANL+ A L L R++LG A + A+ +
Sbjct: 96 SQLGAAILFRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155
Query: 203 AVIDLAQKQ 211
A + A+ Q
Sbjct: 156 ANLSQARLQ 164
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 58/119 (48%), Gaps = 7/119 (5%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N EA RG A+ ADL +A + + R AN +SA + S+ S+ A L
Sbjct: 52 NLKEANLRG-------AKLDGADLLRADLKQADLREANLSSAQLTLSNLEKSQLGAAILF 104
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+A +A T +DL + + L++ANLT A L R L ++ L A + A+ S A +
Sbjct: 105 RANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+AQ ++L K+ RAN + A + SD ++ A L +A +AN A+L
Sbjct: 84 SSAQLTLSNLEKSQLGAAILFRANLSQAQLTLSDLENAQLRDANLSQANLTEANLARANL 143
Query: 161 SDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADF 200
+++ L ANL+ NA LV T L ++L GA ++GA+
Sbjct: 144 GKAQLNQANLTTANLSQARLQNASLVGTQLINANLEGASLKGANL 188
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+ A + + + +F + +++ + S +GA L +A ++A+ TGA L +
Sbjct: 369 DLKTADLAQADLSQVDFFRVQLPQANLTQSILDGANLTEANLFRADLTGASLKAATLKNA 428
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAII 195
L EANL NA + T L + L GAI+
Sbjct: 429 NLAEANLENANIEGTNLDDAYLCGAIM 455
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
+A AD+R ++ GA L++A A GADL + + L EANL++A
Sbjct: 33 LDKAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQ 87
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
L + L +S LG AI+ A+ S A + L+
Sbjct: 88 LTLSNLEKSQLGAAILFRANLSQAQLTLS 116
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AQ +DL A R AN + A++ E++ + + A L +A AN + A L
Sbjct: 109 SQAQLTLSDLENA-----QLRDANLSQANLTEANLARANLGKAQLNQANLTTANLSQARL 163
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ + L ANL A L L +DL GA + AD +A +D
Sbjct: 164 QNASLVGTQLINANLEGASLKGANLIGADLTGANLVNADLREAKLD 209
>gi|448661888|ref|ZP_21683780.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
gi|445758247|gb|EMA09568.1| hypothetical protein C435_21969 [Haloarcula californiae ATCC 33799]
Length = 480
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 16/128 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A LR+A N + AN T A +R++D + + GA L A +A+ T A L +
Sbjct: 168 ANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLRE 227
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRS---------------DLGGAIIEGADFSDA-VID 206
+ LN ANLT +L + LT + DL GA + GADFS+A +I+
Sbjct: 228 VNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRYADLTGATLPGADFSEANLIN 287
Query: 207 LAQKQALC 214
++ L
Sbjct: 288 TTLREVLL 295
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 6/124 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----G 157
A ADL A + + AN T +R++D + + GA L A +A+ T G
Sbjct: 148 ADLTDADLWAAALPDADLKGANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKG 207
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKY 216
ADL + R L +A L L L R++L G I+ AD +D + +A A +Y
Sbjct: 208 ADLPGASLLRADLTDAFLREVNLTDAALNRANLTGTILHKADLTDTDLQVADFTNADLRY 267
Query: 217 ANGT 220
A+ T
Sbjct: 268 ADLT 271
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 71/156 (45%), Gaps = 11/156 (7%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR----- 122
L AV+A + + L + N EAE I A A L A N R
Sbjct: 55 LKGAVLADVNFAGADLVNANIKEAELTD--AILRQADLTDAALWDANLTGSNLLRTDLPG 112
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
ANF AD+ +++ GS F A L +A A ADL+D + L +A+L A L
Sbjct: 113 ANFLRADLHDANLKGSDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTD 172
Query: 183 TVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
T L ++DL A ++GA+ +DA + +QA AN
Sbjct: 173 TSLRQADLTDANLKGANLTDASL----RQADLTDAN 204
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTG 157
+ F A LR+A R+A+ T AD+ ++D G+ L +A AN G
Sbjct: 128 SDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKGANLTDTSLRQADLTDANLKG 187
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
A+L+D + + L +ANL A L L R+DL A + + +DA ++ A
Sbjct: 188 ANLTDASLRQADLTDANLKGADLPGASLLRADLTDAFLREVNLTDAALNRAN 239
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 25/119 (21%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTGADLSDT--- 163
K ++ + AN + A ++E+D +G + GA L+ AV NF GADL +
Sbjct: 17 KDIYPGADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADLVNANIK 76
Query: 164 ---LMDRMV---------LNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 205
L D ++ L +ANLT + L+RT L R+DL A ++G+DF+DA +
Sbjct: 77 EAELTDAILRQADLTDAALWDANLTGSNLLRTDLPGANFLRADLHDANLKGSDFTDAAL 135
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 157
A F ADL A N + ++FT A +R++D + + A L A + A+ G
Sbjct: 113 ANFLRADLHDA-----NLKGSDFTDAALRQADLTDATLRQADLTDADLWAAALPDADLKG 167
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+L+DT + + L +ANL A L L ++DL A ++GAD
Sbjct: 168 ANLTDTSLRQADLTDANLKGANLTDASLRQADLTDANLKGADL 210
Score = 43.5 bits (101), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD- 166
A+L+ AV NF A+ +A+++E++ + + A L A + AN TG++L T +
Sbjct: 53 ANLKGAVLADVNFAGADLVNANIKEAELTDAILRQADLTDAALWDANLTGSNLLRTDLPG 112
Query: 167 ----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
R L++ANL + L ++DL A + AD +DA
Sbjct: 113 ANFLRADLHDANLKGSDFTDAALRQADLTDATLRQADLTDA 153
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 6/82 (7%)
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII-----EGADF 200
+ +V+ K + GADL+D + L EA+LT A L RT LT ++L GA++ GAD
Sbjct: 11 DDSVSDKDIYPGADLTDANLSGAFLKEADLTGANLTRTDLTGANLKGAVLADVNFAGADL 70
Query: 201 SDAVIDLAQ-KQALCKYANGTN 221
+A I A+ A+ + A+ T+
Sbjct: 71 VNANIKEAELTDAILRQADLTD 92
>gi|427735932|ref|YP_007055476.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427370973|gb|AFY54929.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 713
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 68/153 (44%), Gaps = 32/153 (20%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY--------- 151
S+A ADLR AV A+ T AD+ E+ + + GA L + VA
Sbjct: 534 SSASLAKADLRNAV-----LENASLTGADLGEARLNDADLYGARLGRVVAIGTQLSNANL 588
Query: 152 -KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL---------------TRSDLGGAII 195
K + GADLS +DR L+ ANL+ A L +L + +DL GA +
Sbjct: 589 IKTEWQGADLSSAYLDRANLSNANLSAARLTGAILRSTNLQNVNLRNADLSLADLRGANL 648
Query: 196 EGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
GADF ++ Q+ K+ + P TG+ +
Sbjct: 649 AGADFQGTILSARQQNPADKFVD--TPTTGIQS 679
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 54/111 (48%), Gaps = 5/111 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 157
A A+L + + V+ N R+N A++ + G+ + A L KA V A+ TG
Sbjct: 496 ANLSGANLSRVLMVRTNLSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLENASLTG 555
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
ADL + ++ L A L V + T L+ ++L +GAD S A +D A
Sbjct: 556 ADLGEARLNDADLYGARLGRVVAIGTQLSNANLIKTEWQGADLSSAYLDRA 606
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 14/97 (14%)
Query: 124 NFTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMDRMV 169
+F A++ ++ F+GS+F G A L +A +AN +GA+LS LM R
Sbjct: 453 DFKYANLDKASFTGSRFRGPGKDGRWDTYDDWIANLSQAQLKQANLSGANLSRVLMVRTN 512
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L+ +NL A L L ++L A + AD +AV++
Sbjct: 513 LSRSNLNKANLSAARLVGANLSSASLAKADLRNAVLE 549
>gi|86606624|ref|YP_475387.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555166|gb|ABD00124.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 371
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 69/152 (45%), Gaps = 14/152 (9%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGI----GSAAQFGSADLRKA----VHVKE- 118
L A V+ S S L++ + E R + + G F DL KA + +++
Sbjct: 204 LRGAKVSGTSLRGSRLSEETRLEERLRHIWQLQNWGGQGQDFSGQDLSKADLRGLGLRQI 263
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLMDRMVLNEA 173
R AN D+R S+ G+ GA L++A AN GADL + + L A
Sbjct: 264 RLRGANLKRVDLRGSNLEGADLRGANLQRADLRGANLQNADLEGADLGGAELRQAQLQGA 323
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
NL A L R LT+++L GA IEG S + I
Sbjct: 324 NLRRADLSRANLTQANLEGAQIEGLKHSGSQI 355
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 4/100 (4%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A F A+LRKA NF A+ AD+R+++ G+K +GA L+ A A+ GA +S
Sbjct: 151 GANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGAVLQGADLRGADLRGAKVS 210
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
T + L+E L R + + GG +G DFS
Sbjct: 211 GTSLRGSRLSEETRLEERL-RHIWQLQNWGG---QGQDFS 246
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ NF A+ D++E+ G+ F A L KA NF GA L L +ANL
Sbjct: 131 ERNFAYADLEGVDLQEARLGGANFYEANLRKANLGLCNFNGAHLHQA-----DLRQANLQ 185
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFS 201
A L VL +DL GA + GA S
Sbjct: 186 GAKLSGAVLQGADLRGADLRGAKVS 210
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 105 FGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
F ADL + V ++E ANF A++R+++ FNGA+L +A +AN GA LS
Sbjct: 134 FAYADL-EGVDLQEARLGGANFYEANLRKANLGLCNFNGAHLHQADLRQANLQGAKLSGA 192
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDL 190
++ L A+L A + T L S L
Sbjct: 193 VLQGADLRGADLRGAKVSGTSLRGSRL 219
>gi|307944130|ref|ZP_07659471.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
gi|307772476|gb|EFO31696.1| pentapeptide repeat protein [Roseibium sp. TrichSKD4]
Length = 534
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 53/113 (46%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
I A+ ADLR A + + R A AD+R + +K GA L++A +A+
Sbjct: 63 LAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKLQQAKLWGADLQEADLQEADLR 122
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
GADL + L A L A L L +DL GA + GAD A ++ A+
Sbjct: 123 GADLRGAKLQEADLRGAKLQEADLRGAKLQEADLRGAKLRGADLRGAKLEWAK 175
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ ADLR A+ + + A+ A ++++D G+K A L A +A GADL +
Sbjct: 54 AKLQQADLRLAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKLQQAKLWGADLQE 113
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L EA+L A L L +DL GA ++ AD A +
Sbjct: 114 A-----DLQEADLRGADLRGAKLQEADLRGAKLQEADLRGAKL 151
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 64/153 (41%), Gaps = 13/153 (8%)
Query: 59 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVH 115
W L A + ++ L + EA+ RG + A+ ADLR A
Sbjct: 42 EWADLWGANLQQAKLQQADLRLAILQEAKLQEADLRGAKLQQADLRGAKLQQADLRLAKL 101
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
+ A+ AD++E+D G+ GA L+ +A+ GA L + + L EA+L
Sbjct: 102 QQAKLWGADLQEADLQEADLRGADLRGAKLQ-----EADLRGAKLQEADLRGAKLQEADL 156
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
A L +DL GA +E A A ++ A
Sbjct: 157 RGA-----KLRGADLRGAKLEWAKLEWAKLEWA 184
>gi|158338487|ref|YP_001519664.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158308728|gb|ABW30345.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 464
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 68/129 (52%), Gaps = 20/129 (15%)
Query: 103 AQFGSADLR----KAVHVKE-NFR----------RANFTSADMRESDFSGSKFNGAYLEK 147
A+ G ADLR K ++KE N R RA+ AD+RE++ S ++ + LEK
Sbjct: 36 AKLGGADLRNANLKGANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLTLSNLEK 95
Query: 148 -----AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A+ ++AN + A L+ + ++ L +ANL+ A L L R++LG A + A+ +
Sbjct: 96 SQLGAAILFRANLSQAQLTLSNLENAQLRDANLSQANLTEANLARANLGKAQLNQANLTT 155
Query: 203 AVIDLAQKQ 211
A + A+ Q
Sbjct: 156 ANLSQARLQ 164
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL+ A + + + +F A + +++ + S +GA L +A ++A+ TGA L +
Sbjct: 369 DLKTADLAQADLNQVDFFRAQLPQANLAQSILDGANLTEANLFRADLTGASLKAATLKNA 428
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAII 195
L EANL NA + T L + L GAI+
Sbjct: 429 NLAEANLENANIEGTNLDDAYLCGAIM 455
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLV 181
+A AD+R ++ GA L++A A GADL + + L EANL++A L
Sbjct: 35 KAKLGGADLRNANLK-----GANLKEANLRGAKLDGADLLRADLKQADLREANLSSAQLT 89
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ L +S LG AI+ A+ S A + L+
Sbjct: 90 LSNLEKSQLGAAILFRANLSQAQLTLS 116
>gi|427724651|ref|YP_007071928.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427356371|gb|AFY39094.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 281
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 66/116 (56%), Gaps = 9/116 (7%)
Query: 99 IGSAAQFGSADLRKA----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
I + A A+LR+A ++ N +AN S+++ E++ + +K + + A +A
Sbjct: 46 IFTGATLDQANLREADLSYASLQGNLSQANLISSNLTEANLTAAKMAYSGMRAANLTRAK 105
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 205
T ADLS +++ ++ EANL+ A LV R LT+++L GA ++GA+ + A++
Sbjct: 106 LTSADLSYCILNEAIMREANLSKATLVDAFIGRANLTQANLEGANLQGANLTSAIL 161
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 10/106 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYKANFTG 157
A+L KA V RAN T A++ ++ G+ GA L A + N TG
Sbjct: 124 ANLSKATLVDAFIGRANLTQANLEGANLQGANLTSAILIGANLRGANLANATLHGINATG 183
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ D + + LN ANLTN L T L + L + GAD ++A
Sbjct: 184 STADDADLSKSKLNSANLTNVKLRGTNLREAQLAWTTMRGADLTEA 229
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 109 DLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGA-----YLEKAVAYKANFTGA 158
+L +A + N AN T+A M R ++ + +K A L +A+ +AN + A
Sbjct: 70 NLSQANLISSNLTEANLTAAKMAYSGMRAANLTRAKLTSADLSYCILNEAIMREANLSKA 129
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L D + R L +ANL A L LT + L GA + GA+ ++A +
Sbjct: 130 TLVDAFIGRANLTQANLEGANLQGANLTSAILIGANLRGANLANATL 176
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S ++ SA+L N R A MR +D + +K L +A Y +NFTGA+L
Sbjct: 192 SKSKLNSANLTNVKLRGTNLREAQLAWTTMRGADLTEAK-----LFRAKLYWSNFTGANL 246
Query: 161 SDTLMDRMVLNEANLTNAVLVRTV 184
+ T++ +++ N NA+L T+
Sbjct: 247 TRTMLMDATMDQVNFRNAILDGTI 270
>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
Length = 221
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 15/107 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR + + R AN T AD+R +D G+ GA L +A +AN ADLS
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANLCDADLS- 169
Query: 163 TLMDRMVLNEANLTNAV-----LVRTVLTRSDLGGAIIEGADFSDAV 204
+ANL+ A+ L R L+ DLG A + GA D +
Sbjct: 170 ---------QANLSGAILSQVDLRRVTLSNVDLGQAELSGATVPDQL 207
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 48/100 (48%)
Query: 106 GSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM 165
G D R + N N T R + + + + A L++ +AN TGA L T +
Sbjct: 24 GERDFRGVDLQQINLSEVNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAKLWRTNL 83
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L EANL+ A ++R LTR +L AI+ AD ++
Sbjct: 84 KKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTIL 123
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 6/121 (4%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
L +Y A R G+ +L + + N AN + A ++E + + + GA
Sbjct: 18 LERYSAGERDFRGVDLQQINLSEVNLTGVIFRRVNLADANLSLAVLQEVNLNQANLTGAK 77
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGAD 199
L + K + A+LS M R L NL A+L R T+L +DL GA + AD
Sbjct: 78 LWRTNLKKTSLVEANLSQAFMIRANLTRVNLRQAILTRADLRLTILQDTDLRGANLTRAD 137
Query: 200 F 200
Sbjct: 138 L 138
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
A+L AV + N +AN T A + ++ + A L +A +AN T +L +
Sbjct: 53 LADANLSLAVLQEVNLNQANLTGAKLWRTNLKKTSLVEANLSQAFMIRANLTRVNLRQAI 112
Query: 165 MDR-----MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
+ R +L + +L A L R L +DL GA + GA
Sbjct: 113 LTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGA 151
>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
2638]
gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
Length = 1277
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 69/149 (46%), Gaps = 4/149 (2%)
Query: 62 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 121
+F AV+ + +++ L + EAE +G I G AD KA + N +
Sbjct: 1045 IFKGAQFPKAVLRDTNFDMAILEKTDFSEAELKGA-RINMCMISGKAD--KADFSQSNIK 1101
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV-LNEANLTNAVL 180
++ F ++ + +DFS + N + A+K NFT A+L R +++ +A L
Sbjct: 1102 KSIFKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSDFRHATL 1161
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L SD G+ GADF + +ID +Q
Sbjct: 1162 HGAALRESDFTGSDFRGADFENGLIDNSQ 1190
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 9/118 (7%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
IG +A F A LR+A + F +A F +D+ E++ + + F GA KAV NF
Sbjct: 1004 AIGMSADFSKASLRRADLSRGLFNKALFVESDLSEANGAQAIFKGAQFPKAVLRDTNFDM 1063
Query: 158 ADLSDTLMDRMVLNEANLT---------NAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A L T L A + A ++ + +S + + GADFS+A ++
Sbjct: 1064 AILEKTDFSEAELKGARINMCMISGKADKADFSQSNIKKSIFKASSLTGADFSEASVN 1121
Score = 44.7 bits (104), Expect = 0.039, Method: Composition-based stats.
Identities = 32/148 (21%), Positives = 65/148 (43%), Gaps = 4/148 (2%)
Query: 62 VFVSTALAAAVVASCSSNISALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVK 117
+F +++L A + S N S D++ ++ + G + F +D R A
Sbjct: 1104 IFKASSLTGADFSEASVNESLFNDVDAHKVNFTDANLDKLRTGRNSNFKDSDFRHATLHG 1163
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
R ++FT +D R +DF + + L +A + GA + + ++ + AN+
Sbjct: 1164 AALRESDFTGSDFRGADFENGLIDNSQLVRANLNGVSAKGARFTKSNLEGASMRAANVHM 1223
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + L +DL G+ + DF +V+
Sbjct: 1224 GGMRKARLVDTDLRGSNLFAVDFYKSVL 1251
Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats.
Identities = 34/108 (31%), Positives = 46/108 (42%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S ADL K K NF+ A F A +DFS + A L + + KA F
Sbjct: 972 SGLDLSGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALF 1031
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+DLS+ + + A AVL T + AI+E DFS+A
Sbjct: 1032 VESDLSEANGAQAIFKGAQFPKAVLRDT-----NFDMAILEKTDFSEA 1074
Score = 37.4 bits (85), Expect = 7.4, Method: Composition-based stats.
Identities = 39/141 (27%), Positives = 58/141 (41%), Gaps = 22/141 (15%)
Query: 86 LNKYEAETRGEFGIGSAAQFG-SADLRKAVHVKENFRR-----ANFTSADMRESDFSGSK 139
L K EA+ + A+ G SAD +A+ +E +R + A + D SG
Sbjct: 917 LKKLEAKELPDAAKAKLAEHGLSADSLRAL-TREEVQRYHEQGKSLVGAVLSGVDLSGLD 975
Query: 140 FNGAYLEKAVAYKANFTGA---------------DLSDTLMDRMVLNEANLTNAVLVRTV 184
+GA L K K NF GA D S + R L+ A+ V +
Sbjct: 976 LSGADLSKCQLQKTNFKGAILDNVKFVQAIGMSADFSKASLRRADLSRGLFNKALFVESD 1035
Query: 185 LTRSDLGGAIIEGADFSDAVI 205
L+ ++ AI +GA F AV+
Sbjct: 1036 LSEANGAQAIFKGAQFPKAVL 1056
>gi|434394477|ref|YP_007129424.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428266318|gb|AFZ32264.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 132
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 4/88 (4%)
Query: 107 SADLRKAVHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
S++L++ ++ K+ N R AN +A++ E++ SG+ GA L+ A KAN GA+L
Sbjct: 40 SSELQRLLNTKQCPGCNLRGANLRNANLEEANLSGANLQGANLQNADLEKANLQGANLQQ 99
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ L EANL NA L L +DL
Sbjct: 100 ANLSDADLQEANLQNANLQNANLRSADL 127
>gi|428298482|ref|YP_007136788.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235026|gb|AFZ00816.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 567
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ SADL RA+F A++R +DFSG+ N A A ANF+ ADL
Sbjct: 83 SDAKLNSADLS----------RADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADL 132
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
++ + L N +NA + T L R +L G + GAD S
Sbjct: 133 ANADFSGLDLYGVNFSNAKMRGTRLDRVNLSGVNLSGADLS 173
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F A+LR N ANF +AD+R ++FS + A Y NF+ A +
Sbjct: 93 SRADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADLANADFSGLDLYGVNFSNAKM 152
Query: 161 SDTLMDRMVLNEANLTNAVL----VRTV------LTRSDLGGAIIEGADF 200
T +DR+ L+ NL+ A L +R V LTR +L A + G DF
Sbjct: 153 RGTRLDRVNLSGVNLSGADLSGIDLRNVNLRGINLTRINLSHANLIGFDF 202
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + + + AN +D+R +D S +K N A L +A Y+AN D S ++
Sbjct: 55 ADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGANLNS 114
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L NA L +D G + G +FS+A
Sbjct: 115 ANFRNADLRNANFSNADLANADFSGLDLYGVNFSNA 150
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 11/131 (8%)
Query: 84 ADLNK---YEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 137
ADL++ Y+A R G ++A F +ADLR A + A+F+ D+ +FS
Sbjct: 90 ADLSRADFYQANLRNTDFSGANLNSANFRNADLRNANFSNADLANADFSGLDLYGVNFSN 149
Query: 138 SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
+K G L++ N +GADLS + L NL L R L+ ++L G G
Sbjct: 150 AKMRGTRLDRVNLSGVNLSGADLSG-----IDLRNVNLRGINLTRINLSHANLIGFDFRG 204
Query: 198 ADFSDAVIDLA 208
D +A + A
Sbjct: 205 TDLRNANLSYA 215
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R AN + AD+R SD S +K A L A NF ADL +D L A+L A
Sbjct: 206 DLRNANLSYADLRNSDLSNAKLESADLRNANLSGVNFRNADLIGVNLDGASLQNADLRGA 265
Query: 179 VLVRTVLTRSDLGGAIIEGADFS 201
L T L G + E D++
Sbjct: 266 NLNFTSLP----SGIVAEAEDYT 284
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
D SG+ + L++A Y AN +DL +T + LN A+L+ A + L +D GA
Sbjct: 51 DLSGADLSRKNLKRADLYNANLQRSDLRNTDLSDAKLNSADLSRADFYQANLRNTDFSGA 110
Query: 194 IIEGADFSDA 203
+ A+F +A
Sbjct: 111 NLNSANFRNA 120
>gi|407781954|ref|ZP_11129170.1| hypothetical protein P24_07031 [Oceanibaculum indicum P24]
gi|407206993|gb|EKE76937.1| hypothetical protein P24_07031 [Oceanibaculum indicum P24]
Length = 392
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 64/134 (47%), Gaps = 20/134 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL--------------EKA 148
A ADL A + + R+A A++ + F GS NGA L E A
Sbjct: 72 ADLTGADLTAATLDEASLRKAKLVDANLSGASFRGSDLNGADLRGAHGTVSMSSPGFEGA 131
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ N +GADLS +D +A+LT A+LV TVL + L GA + D +DA + A
Sbjct: 132 MLRLTNLSGADLSGANLD-----QADLTGAMLVGTVLRNASLAGANMRNTDLTDADLGAA 186
Query: 209 Q-KQALCKYANGTN 221
++AL AN +N
Sbjct: 187 NLREALLNGANLSN 200
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 47/101 (46%), Gaps = 10/101 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A+ V R A+ A+MR +D + + A L +A+ AN + A L
Sbjct: 144 SGANLDQADLTGAMLVGTVLRNASLAGANMRNTDLTDADLGAANLREALLNGANLSNAHL 203
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
N ANL A LV LT L GA EGA+F+
Sbjct: 204 ----------NGANLQRARLVGVTLTEGVLDGADTEGANFA 234
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 26/133 (19%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G A+ ADL A + R N + A +R ++ G+ NGA L + GAD
Sbjct: 262 GERAELDGADLTDA-----DLRGFNLSGASLRAANLRGALLNGALL-----VLTDLAGAD 311
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA----------------IIEGADFSDA 203
LS + R L+ ANL A L L+ + LG A ++EGAD ++A
Sbjct: 312 LSQASLVRANLSGANLRGAKLHSADLSGAKLGPAPLIGADGRPTGRSRATVLEGADLTEA 371
Query: 204 VIDLAQKQALCKY 216
V+D QK L +
Sbjct: 372 VLDDEQKSVLPDF 384
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ + D+R+ SG+ G L A A+ TGADL+ +D L +A L +A L
Sbjct: 43 DLSGRDLRKCQLSGAGLQGIRLTGANLEGADLTGADLTAATLDEASLRKAKLVDANLSGA 102
Query: 184 VLTRSDLGGAIIEGA 198
SDL GA + GA
Sbjct: 103 SFRGSDLNGADLRGA 117
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 74/173 (42%), Gaps = 27/173 (15%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L AA + S + L D N A RG + ADLR A H + F
Sbjct: 79 LTAATLDEASLRKAKLVDANLSGASFRG-------SDLNGADLRGA-HGTVSMSSPGFEG 130
Query: 128 ADMRESDFSGSKFNGAYLEKA----------VAYKANFTGADLSDTLMDRMVLNEANLTN 177
A +R ++ SG+ +GA L++A V A+ GA++ +T + L ANL
Sbjct: 131 AMLRLTNLSGADLSGANLDQADLTGAMLVGTVLRNASLAGANMRNTDLTDADLGAANLRE 190
Query: 178 AVLVRTVLTRSDLGGAIIE-----GADFSDAVIDLAQKQALCKYANGTNPITG 225
A+L L+ + L GA ++ G ++ V+D A + AN P+ G
Sbjct: 191 ALLNGANLSNAHLNGANLQRARLVGVTLTEGVLDGADTEG----ANFAPPLDG 239
>gi|220907270|ref|YP_002482581.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863881|gb|ACL44220.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 369
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 46/92 (50%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R AN D+ + S + + A L A +K NF GA+L + R L +ANLTNA
Sbjct: 256 DLRGANLAEKDLAGRNLSNANLSSANLSDAFLHKTNFHGANLFRANLFRANLLQANLTNA 315
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
L T L +DL GA + GAD A I K
Sbjct: 316 NLRETNLIGADLSGADLRGADLRGAKIGFDNK 347
>gi|428770347|ref|YP_007162137.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428684626|gb|AFZ54093.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 278
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 10/129 (7%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
Q DLR A N + + AD+R++D SG+ + YL +A AN TGA+L+
Sbjct: 25 QLRRIDLRNAQLKGVNLGGCDLSYADLRDADLSGADLSKCYLNEANLSGANLTGANLTGA 84
Query: 164 LM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
+ + ++ EA T + L R ++DL GA + GA + + A
Sbjct: 85 YLIKAYLTKVNFQKAIVKEAYFTGSFLTRANFYKADLSGAFLNGAHLNGGIFKDASYDNT 144
Query: 214 CKYANGTNP 222
++ G NP
Sbjct: 145 TRFDKGFNP 153
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 15/99 (15%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL- 175
+ NF + D+R + G G L A A+ +GADLS + LNEANL
Sbjct: 18 ERNFPKLQLRRIDLRNAQLKGVNLGGCDLSYADLRDADLSGADLS-----KCYLNEANLS 72
Query: 176 ---------TNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T A L++ LT+ + AI++ A F+ + +
Sbjct: 73 GANLTGANLTGAYLIKAYLTKVNFQKAIVKEAYFTGSFL 111
>gi|397736621|ref|ZP_10503302.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
gi|396927531|gb|EJI94759.1| pentapeptide repeats family protein [Rhodococcus sp. JVH1]
Length = 201
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYL 145
+E R E I + F ADL ++ HV FR +FT ++ R F GS+F+ L
Sbjct: 38 SELRTESVIFTECDFTGADLAESNHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 146 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 195
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 196 EGADFSDAVID 206
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 256
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 41/76 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR A N RAN T AD+R ++ +G+ G L +A +AN TGADL +
Sbjct: 51 ADLSGADLRGANLEGANLSRANLTGADLRSANLAGASLFGVNLSRAKLNEANLTGADLRN 110
Query: 163 TLMDRMVLNEANLTNA 178
T + + L ANL A
Sbjct: 111 TYLMNIELTNANLNGA 126
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 43/82 (52%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ + AD+R ++ G+ + A L A AN GA L + R LNEANLT A L
Sbjct: 51 ADLSGADLRGANLEGANLSRANLTGADLRSANLAGASLFGVNLSRAKLNEANLTGADLRN 110
Query: 183 TVLTRSDLGGAIIEGADFSDAV 204
T L +L A + GA+F AV
Sbjct: 111 TYLMNIELTNANLNGANFQGAV 132
>gi|419963472|ref|ZP_14479445.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
gi|432333027|ref|ZP_19584842.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
IFP 2016]
gi|414571123|gb|EKT81843.1| hypothetical protein WSS_A15164 [Rhodococcus opacus M213]
gi|430780078|gb|ELB95186.1| hypothetical protein Rwratislav_00170 [Rhodococcus wratislaviensis
IFP 2016]
Length = 201
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 59/131 (45%), Gaps = 15/131 (11%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYL 145
+E R E I + F ADL ++ HV FR +FT ++ R F GS+F+ L
Sbjct: 38 SELRTESVIFTDCDFTGADLAESRHVGTAFRSCSFTRTTLWHSEFRNCSFLGSEFDNCRL 97
Query: 146 EKAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAII 195
V + +FT GADL EANL L R VL +DL GGA
Sbjct: 98 RPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKF 157
Query: 196 EGADFSDAVID 206
+GAD A +D
Sbjct: 158 DGADLRGAHVD 168
>gi|392382587|ref|YP_005031784.1| conserved protein of unknown function; pentapeptide repeats
[Azospirillum brasilense Sp245]
gi|356877552|emb|CCC98392.1| conserved protein of unknown function; pentapeptide repeats
[Azospirillum brasilense Sp245]
Length = 493
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 60/124 (48%), Gaps = 26/124 (20%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-SGSKFNG---------------AY 144
+A+ ADLR A N RA T A++R +DF +GS NG A
Sbjct: 84 TASTLIGADLRGA-----NLHRAILTDANLRGADFRAGSLMNGTDDKPRSDGVTRLTEAK 138
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+E+++ ANFTG DLS LN+A+LT A + VL +D GA ++G F
Sbjct: 139 MERSILAGANFTGCDLSGA-----DLNDADLTGADMTAAVLVGADFWGATLDGVTFDGTT 193
Query: 205 IDLA 208
ID A
Sbjct: 194 IDEA 197
>gi|376002766|ref|ZP_09780588.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
gi|375328822|emb|CCE16341.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
Length = 529
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 171
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 39 RVNLSQANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 98
Query: 172 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ANL +A L+R L R++L AI+ GA+ ++A DL ++A ++A+
Sbjct: 99 RADLSQANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 146
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 155 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 214
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 215 ANLSGANLSGANLEATQLSGASLRGANLSGA 245
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 93 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +A+
Sbjct: 68 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 127
Query: 151 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 200
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 128 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 187
Query: 201 SDAVIDLAQ 209
+A + A+
Sbjct: 188 RNAELRQAE 196
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 48/98 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ A+L +A+ N A+ A +R +D + +GA L +A +N ++L+
Sbjct: 115 AELMRAELSEAIVNGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTR 174
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ R L NL NA L + L +DL GA + GA+
Sbjct: 175 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANL 212
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 175 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 234
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+ A L+ +DL A + D++DA
Sbjct: 235 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 270
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 26 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 80
Query: 194 IIEGADFSDAVIDLA 208
I++GA+ ++AV+++A
Sbjct: 81 ILQGANLNEAVLNVA 95
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
ADL + A+ RG +A+LR+A + R AN + A++R ++ SG+ +GA
Sbjct: 175 ADLTR--ADLRG-------VNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGA 225
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
LE A+ GA+LS + A+LT A L+ T ++L G+ + G
Sbjct: 226 NLEATQLSGASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDANLRGSALTG 279
>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 229
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 19/131 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDF---------------SGSKFNGAYLEK 147
A F A+L AV + AN AD+R++D SG+K + A L +
Sbjct: 100 ANFTGANLESAVLQHTDLTNANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASLIQ 159
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
A+A KAN G DLS + M L+ + T L + T ++L GAI GA A +
Sbjct: 160 AIAQKANLQGVDLSGADLTDMNLSRVDFTAVNLKGAIFTGTNLTGAIFSGAKLDKASL-- 217
Query: 208 AQKQALCKYAN 218
QA+ + AN
Sbjct: 218 --IQAIAQKAN 226
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 20/120 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYK----- 152
A F A+L+ A + ANFT AD++ +D G+ F GA LE AV
Sbjct: 60 ANFTEANLKGANLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQHTDLTN 119
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAV----------LVRTVLTRSDLGGAIIEGADFSD 202
AN ADL D + +L+ ANLT A+ L++ + +++L G + GAD +D
Sbjct: 120 ANLDRADLRDADLHGTILHRANLTGAILSGAKLDKASLIQAIAQKANLQGVDLSGADLTD 179
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 5/106 (4%)
Query: 105 FGSADLRK----AVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F ADL + +KE NF AN A++R +D G+ F A L+ A A+ GA+
Sbjct: 42 FAGADLEQVRLAGASLKEANFTEANLKGANLRGADCDGANFTRADLKSADLRWADCDGAN 101
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ ++ VL +LTNA L R L +DL G I+ A+ + A++
Sbjct: 102 FTGANLESAVLQHTDLTNANLDRADLRDADLHGTILHRANLTGAIL 147
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 45/96 (46%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+LR + + A ++E++F+ + GA L A ANFT ADL +
Sbjct: 35 ANLRNGDFAGADLEQVRLAGASLKEANFTEANLKGANLRGADCDGANFTRADLKSADLRW 94
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ AN T A L VL +DL A ++ AD DA
Sbjct: 95 ADCDGANFTGANLESAVLQHTDLTNANLDRADLRDA 130
>gi|282898711|ref|ZP_06306699.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
CS-505]
gi|281196579|gb|EFA71488.1| hglK (Pentapeptide repeat protein) [Cylindrospermopsis raciborskii
CS-505]
Length = 682
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AQ ADL A + + + + A++ ++++ G+ + +YL A ANF+ A+L
Sbjct: 529 SGAQLQEADLYAAQLARVSAIGSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANL 588
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S +L AN+TN L ++R+DL GA +EG DF A++
Sbjct: 589 SGA-----ILRYANMTNTNLRSADISRADLRGANLEGTDFQGAIL 628
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 59/125 (47%), Gaps = 16/125 (12%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
DL ++V RAN S+ + +++ S ++ G+ L++A A TGAD+S +
Sbjct: 481 VDLSRSV-----LNRANLASSKLIDANLSSAQLVGSDLQQATLQDAVLTGADISGAQLQE 535
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-----------ALCKY 216
L A L + + L+ ++L +GAD S++ ++ A A+ +Y
Sbjct: 536 ADLYAAQLARVSAIGSQLSHANLTKTNWQGADLSESYLNHANLNSANFSAANLSGAILRY 595
Query: 217 ANGTN 221
AN TN
Sbjct: 596 ANMTN 600
>gi|119491336|ref|ZP_01623390.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
gi|119453500|gb|EAW34662.1| hypothetical protein L8106_22104 [Lyngbya sp. PCC 8106]
Length = 122
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 7/76 (9%)
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
H + NF AN T AD+R+SD S ++ GA LE A N TGA+LS T + + L +A+
Sbjct: 47 HAQLNF--ANLTHADLRDSDLSHAQLIGATLEGA-----NLTGANLSHTNLSQANLKQAD 99
Query: 175 LTNAVLVRTVLTRSDL 190
LT A L T+ + S L
Sbjct: 100 LTEATLQDTIYSHSTL 115
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 49/102 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ A+L A V N + T A+++++ G+ F+ A L A A+ +DLS
Sbjct: 8 AKLTDANLESAKLVVANLSQTVITRANLQQAKCVGANFSHAQLNFANLTHADLRDSDLSH 67
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ L ANLT A L T L++++L A + A D +
Sbjct: 68 AQLIGATLEGANLTGANLSHTNLSQANLKQADLTEATLQDTI 109
>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 710
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 83 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFS 136
L + N ++A T F + A GSADL KA + N + F +D+RES++
Sbjct: 534 LIETNLHQANLTEATF---TGADLGSADLSKANLYRANLSKVKAEGTTFQLSDLRESNWQ 590
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
G+ +GA +AN ADLS L+ A L NA L T ++ +DL GA +
Sbjct: 591 GANLSGANFS-----RANLKKADLSLALLTNANFRNAQLQNANLRNTDISLADLRGANLS 645
Query: 197 GADFSDA 203
G DF A
Sbjct: 646 GTDFKGA 652
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 53/120 (44%), Gaps = 15/120 (12%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKA 148
I A A L KA+ +ANF+SA ++ E+ F+G+ A L KA
Sbjct: 503 IMKRADLFRATLSKAIMPGSTITQANFSSAKLIETNLHQANLTEATFTGADLGSADLSKA 562
Query: 149 VAYKANFTGADLSDTLMDRMVLNE-----ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
Y+AN + T L E ANL+ A R L ++DL A++ A+F +A
Sbjct: 563 NLYRANLSKVKAEGTTFQLSDLRESNWQGANLSGANFSRANLKKADLSLALLTNANFRNA 622
>gi|189499236|ref|YP_001958706.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189494677|gb|ACE03225.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 442
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 64/128 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ AD ++ +++RA+ R ++ G+ FN A+++KA A+ TGA L +
Sbjct: 307 AKLDHADFSESDLSSTSWKRASLVETVFRNANLQGADFNRAFMKKADLSGADLTGAQLRE 366
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
T + L ++NL+ L T LT +DL GA + GA+ ++D A A +G
Sbjct: 367 TRLQEADLKKSNLSKTNLYDTDLTCADLRGADLTGANLLYTILDNALISAETITPSGEKA 426
Query: 223 ITGVSTRK 230
TG + K
Sbjct: 427 TTGWAVLK 434
Score = 44.3 bits (103), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ AD R A + +R D++++D SG+ GA L+ + A +A F ADL+
Sbjct: 83 AKLNGADFRNAKLFSASLKRT-----DLKQTDLSGANLRGADLKNSYAKEAKFINADLTG 137
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T L A+LT AVL + ++L A + G + + A +
Sbjct: 138 TDFRYANLEGADLTGAVLENALFFDANLSSADLRGVNLTGAKM 180
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE----- 172
E+ A ADM++ D + S NGA L+ A +F+ +DLS T R L E
Sbjct: 282 EDLDDAGLKGADMKKLDMTSSTMNGAKLDHA-----DFSESDLSSTSWKRASLVETVFRN 336
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
ANL A R + ++DL GA + GA +
Sbjct: 337 ANLQGADFNRAFMKKADLSGADLTGAQLRE 366
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 5/103 (4%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+L KA A+ +A M + +G+K NGA A + A+ DL T +
Sbjct: 54 NLDKATLEDATLVNADLHNASMVNTRLNGAKLNGADFRNAKLFSASLKRTDLKQTDLSGA 113
Query: 169 VLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L A+L N A + LT +D A +EGAD + AV++
Sbjct: 114 NLRGADLKNSYAKEAKFINADLTGTDFRYANLEGADLTGAVLE 156
>gi|428218533|ref|YP_007102998.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427990315|gb|AFY70570.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 348
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 57/116 (49%), Gaps = 15/116 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA A+L A ++ N AN A+ E++ S + N AYL KA + AN T A+L
Sbjct: 47 SAVNLRGANLSMANLIRANLSGANLIEANFDEANLSMAYLNCAYLNKAYLHGANLTWANL 106
Query: 161 SDTLMDRMVLNEANLTNAVLVRT---------------VLTRSDLGGAIIEGADFS 201
S + + +EANL+ AVL T L+ +DLGGA + GA+ S
Sbjct: 107 SQSCLIDTDASEANLSGAVLSGTDAYGSNFSGANLSEAYLSVADLGGANLHGANLS 162
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 20/114 (17%)
Query: 107 SADLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGA---------------YLE 146
+AD+R A ++ + RA+ T AD+ ++ G++ +GA +LE
Sbjct: 208 AADIRGASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLE 267
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A Y A+ ADLS ++ LNEA L A+L T L +DL GA + GA+
Sbjct: 268 AANLYGASLYSADLSQANLNAAYLNEAFLFGAILKWTNLADADLSGAHLGGANL 321
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTG 157
A A+L + NF RAN ++A+M +++ + SKF A L++A Y A+ G
Sbjct: 154 ANLHGANLSSVYAIATNFERANLSNANMSKANCAKSKFGSAILDRANLSMSYLYAADIRG 213
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
A L +T + R L + +L A L L ++L GA + A+ A + L+ +A Y
Sbjct: 214 ASLIETDLSRADLTKVSLICADLSDAHLIGTELHGANLSQANLKHADLRLSHLEAANLY 272
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 57/110 (51%), Gaps = 10/110 (9%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
G+ + Q+ SA+ R ++ + A+ TS D+ ++D S GA L A +AN +G
Sbjct: 13 GVSTWNQWRSANSR----IQVDLTGADLTSVDLLDADLSAVNLRGANLSMANLIRANLSG 68
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VID 206
A+L + D EANL+ A L L ++ L GA + A+ S + +ID
Sbjct: 69 ANLIEANFD-----EANLSMAYLNCAYLNKAYLHGANLTWANLSQSCLID 113
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 48/105 (45%), Gaps = 20/105 (19%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR----------M 168
NF AN + A + +D G+ +GA L A NF A+LS+ M +
Sbjct: 135 NFSGANLSEAYLSVADLGGANLHGANLSSVYAIATNFERANLSNANMSKANCAKSKFGSA 194
Query: 169 VLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFSDA 203
+L+ ANL+ A L+ T L+R+DL + AD SDA
Sbjct: 195 ILDRANLSMSYLYAADIRGASLIETDLSRADLTKVSLICADLSDA 239
>gi|421082377|ref|ZP_15543263.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
gi|401702907|gb|EJS93144.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
Length = 846
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 44/161 (27%), Positives = 75/161 (46%), Gaps = 13/161 (8%)
Query: 70 AAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSAD 129
A++ SCS + A+ ++ T + S + AD +A + N R+A+ A
Sbjct: 687 GALLDSCSW-VETQANEARFVGATWLTSAVASGSSMNGADFTQATLRQSNLRQASLIGAV 745
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S
Sbjct: 746 -----FARAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLMGALLQKSQ 800
Query: 190 LGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 230
L GA GA+ A DL+Q + + T + G T++
Sbjct: 801 LSGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 834
Score = 40.4 bits (93), Expect = 0.90, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 36/84 (42%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+F+ D+R +DFS + A L ANF A LS T + L N NA L
Sbjct: 538 ADFSGMDLRGADFSKALLECADLSNCKLDGANFHSAMLSRTELHNTSLCGCNFENASLAL 597
Query: 183 TVLTRSDLGGAIIEGADFSDAVID 206
SD GA + +A+ D
Sbjct: 598 AQCCHSDFSGAHFKNTQLQEALFD 621
Score = 39.3 bits (90), Expect = 1.9, Method: Composition-based stats.
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 10/96 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F + L++A+ F A FT RE+ F+ +F A L + + + G D
Sbjct: 606 SGAHFKNTQLQEALFDDCTFAEATFTELLFRETWFTQCRFYRAMLNACIFMELSLPGLDF 665
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
SD A LT +++ L R GA+++
Sbjct: 666 SD----------AKLTKTTFLKSTLERGIFNGALLD 691
>gi|425473009|ref|ZP_18851753.1| Genome sequencing data, contig C314 [Microcystis aeruginosa PCC
9701]
gi|389880711|emb|CCI38594.1| Genome sequencing data, contig C314 [Microcystis aeruginosa PCC
9701]
Length = 453
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 59/117 (50%), Gaps = 5/117 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTG 157
A A+L +A + N RAN A++ + +G+ GA L +A +AN G
Sbjct: 286 ANLNGANLNRANLNRANLNRANLNGAELYRAYLNGANLKGANLNEANLIGANLNEANLIG 345
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
A+L+ ++ R LN ANL A L +L ++L GAI+ GA A +D Q ++ C
Sbjct: 346 ANLNGAILYRANLNGANLNGAYLNGAILYGANLYGAILYGAILWGAEVDPKQIKSAC 402
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 55/102 (53%), Gaps = 5/102 (4%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NFTGADLSDT 163
+LR+A + N RAN A++ ++ + + N A L A Y+A N GA+L++
Sbjct: 272 NLREANLILANLNRANLNGANLNRANLNRANLNRANLNGAELYRAYLNGANLKGANLNEA 331
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ LNEANL A L +L R++L GA + GA + A++
Sbjct: 332 NLIGANLNEANLIGANLNGAILYRANLNGANLNGAYLNGAIL 373
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN A++ E++ G+ NGA L Y+AN GA+L+ ++ +L ANL A
Sbjct: 327 NLNEANLIGANLNEANLIGANLNGAIL-----YRANLNGANLNGAYLNGAILYGANLYGA 381
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L +L +++ I+ A F + I
Sbjct: 382 ILYGAILWGAEVDPKQIKSACFWERAI 408
>gi|209526072|ref|ZP_03274604.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067543|ref|ZP_17056333.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493460|gb|EDZ93783.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711117|gb|EKD06319.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 519
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 171
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 29 RVNLSQANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 88
Query: 172 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ANL +A L+R L R++L AI+ GA+ ++A DL ++A ++A+
Sbjct: 89 RADLSQANLVDASLIRAELMRAELSEAIVNGANLTEA--DL--REATLRHAD 136
Score = 43.9 bits (102), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGA 235
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 93 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +A+
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAIV 117
Query: 151 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 200
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177
Query: 201 SDAVIDLAQ 209
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 48/98 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ A+L +A+ N A+ A +R +D + +GA L +A +N ++L+
Sbjct: 105 AELMRAELSEAIVNGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTR 164
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ R L NL NA L + L +DL GA + GA+
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANL 202
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 194 IIEGADFSDAVIDLA 208
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
ADL + A+ RG +A+LR+A + R AN + A++R ++ SG+ +GA
Sbjct: 165 ADLTR--ADLRG-------VNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGA 215
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
LE A+ GA+LS + A+LT A L+ T ++L G+ + G
Sbjct: 216 NLEATQLSGASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDANLRGSALTG 269
>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
Length = 1263
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 5/113 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S ++ ++L K+ K + N T DM SD A L +++ Y+AN + A
Sbjct: 484 ILSGSKLEKSNLHKSKLSKVDLSNCNLTLTDMSSSDLQK-----ADLSRSLFYRANLSSA 538
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
+L + M+ L+ NL++A L R L S L GA +EGADFS + A Q
Sbjct: 539 NLKSSNMNGADLSHCNLSSACLERASLYGSKLEGANLEGADFSHCDLSFAMLQ 591
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R F ++D R DFSGSK +G L KA +N + DL+ + M + L ANL A
Sbjct: 1033 DLRSCKFANSDFRGQDFSGSKLSGVQLSKANLTGSNLSSCDLTGSDMSKCHLERANLLGA 1092
Query: 179 VLVRTVLTRSDLGGAII 195
VL + L+++ L GA++
Sbjct: 1093 VLKGSDLSQARLKGAVL 1109
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 22/110 (20%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++ R + + AD+ DF+G+ F+G+ L +A ++ G DLS N
Sbjct: 931 KDLRNSKLSEADLSHQDFAGADFSGSKLSRANLRQSKLDGCDLS---------------N 975
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA-------LCKYANGT 220
L R++L + L GA+I G DFS+A ++ A A CK+A T
Sbjct: 976 CDLSRSILEGASLQGAVIRGTDFSNAKLEGAALPAWVEVDFECCKFAGAT 1025
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-----KANF 155
S++ ADL +++ + N AN S++M +D S + A LE+A Y AN
Sbjct: 516 SSSDLQKADLSRSLFYRANLSSANLKSSNMNGADLSHCNLSSACLERASLYGSKLEGANL 575
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
GAD S + +L NL A LT +D G+ +EGA D
Sbjct: 576 EGADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPD 622
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD------ 162
+L KA+ + + A ++ +D S + GA LE A +N + +LS
Sbjct: 254 NLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLEGADLSYSNLSQCNLSQASCSRI 313
Query: 163 ----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++M R LN+ + +A L LT S L + EGADF D+V+D
Sbjct: 314 MLQFSVMTRARLNDGDFGSANLSECDLTHSQLSSSCFEGADFRDSVLD 361
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 51/123 (41%), Gaps = 20/123 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----------- 151
A F DL A+ N R ANFT A + +DFSGS GA + Y
Sbjct: 578 ADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGYDLQGVCLSGTS 637
Query: 152 ---------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+AN ADL + + L +A+L+ A L L +DL G + G + S
Sbjct: 638 GFFKDKSARRANLCDADLRGQELSGVNLQQADLSFADLTGANLQGADLTGTKLNGTNLSQ 697
Query: 203 AVI 205
+ +
Sbjct: 698 SRL 700
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 47/103 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L +A N +N +S D+ ++ SG+ GA L A + + + L
Sbjct: 191 SRADLSEAKLCRADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKLFNCDLSRTSL 250
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
D + + +L +A L A L L+ +DL A +EGA A
Sbjct: 251 MDVNLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAKLEGA 293
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANF 155
S + + D AV NF RAN T A +MR + F + F A +
Sbjct: 411 SESNLTACDFSGAVMNDSNFERANLTKARFVGCEMRNASFQHATFASATFSDVKMEGVDL 470
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
TG DLS + +++L+ + L + L ++ L++ DL + D S
Sbjct: 471 TGCDLSSCDLSKLILSGSKLEKSNLHKSKLSKVDLSNCNLTLTDMS 516
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 50/91 (54%), Gaps = 10/91 (10%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A LR + V + + NF+ D+ +++ S S + A L +A +A+ T A+L+
Sbjct: 158 ATLRGSSFVSSSCAQTNFSRCDLSDANLSMSTLSRADLSEAKLCRADLTHANLT------ 211
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
E+NL++ L T+L+ ++LGGA + GA
Sbjct: 212 ----ESNLSSCDLSDTILSGANLGGADLSGA 238
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 47/113 (41%), Gaps = 16/113 (14%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
D+ + + R N +++ E +FS FNGA L Y N T DL + R
Sbjct: 730 DMNNSSWTGADLRGVNMAGSNLNECNFSEVSFNGADLTGCSIYNTNLTNCDLKGVNLSRA 789
Query: 169 VLNEANLTNAVL----------------VRTVLTRSDLGGAIIEGADFSDAVI 205
L ++L+++ + V T T + GA + ADFS AV+
Sbjct: 790 NLQYSDLSHSAMDGATLPEWSSGSFEGVVLTGATGINFVGADLRKADFSQAVL 842
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 15/82 (18%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NF AD+R++DFS + G L A +AN AD + E NLT L
Sbjct: 826 NFVGADLRKADFSQAVLKGHDLSAADLSQANLRNADFT----------ECNLTGCNL--- 872
Query: 184 VLTRSDLGGAIIEGADFSDAVI 205
T+S+L G +GA S A+I
Sbjct: 873 --TQSNLSGCNFDGAILSGAII 892
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 52/109 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A + L +A + RA+ T A++ ES+ S + L A A+ +GA L
Sbjct: 181 SDANLSMSTLSRADLSEAKLCRADLTHANLTESNLSSCDLSDTILSGANLGGADLSGAKL 240
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ + R L + NL+ A+L + L + L G + D SDA ++ A+
Sbjct: 241 FNCDLSRTSLMDVNLSKAMLQQARLQGAQLQGCNLSYNDLSDANLEGAK 289
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 41/90 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AQF +A+L R+NF+ +++ DFSG+ N + E+A KA F G ++
Sbjct: 386 SDAQFVNANLSNVKLNAARVLRSNFSESNLTACDFSGAVMNDSNFERANLTKARFVGCEM 445
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ A ++ + LT DL
Sbjct: 446 RNASFQHATFASATFSDVKMEGVDLTGCDL 475
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
RRAN AD+R + SG A L A AN GADL+ T ++ L+++ L A
Sbjct: 646 RRANLCDADLRGQELSGVNLQQADLSFADLTGANLQGADLTGTKLNGTNLSQSRLAGACF 705
Query: 181 VRTVLTRSDLGGAIIEGAD 199
+ D+ G + GA+
Sbjct: 706 --SCWAERDVSGIKLAGAE 722
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 5/85 (5%)
Query: 105 FGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F ADLRKA V + A+ + A++R +DF+ G L ++ NF GA
Sbjct: 827 FVGADLRKADFSQAVLKGHDLSAADLSQANLRNADFTECNLTGCNLTQSNLSGCNFDGAI 886
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTV 184
LS ++ ++ L+ L A+L +
Sbjct: 887 LSGAIIKQVDLSTTRLNGAILPELI 911
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 22/80 (27%), Positives = 43/80 (53%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
V + R+A+F+ A ++ D S + + A L A + N TG +L+ + + + A L
Sbjct: 828 VGADLRKADFSQAVLKGHDLSAADLSQANLRNADFTECNLTGCNLTQSNLSGCNFDGAIL 887
Query: 176 TNAVLVRTVLTRSDLGGAII 195
+ A++ + L+ + L GAI+
Sbjct: 888 SGAIIKQVDLSTTRLNGAIL 907
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 10/92 (10%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAV----------AYKANFTGADLSDTLMDRMVLNEA 173
N D++ F G++ + A L A + NF+ DLSD + L+ A
Sbjct: 134 NMQGLDLKNLCFDGARLDRATLRMATLRGSSFVSSSCAQTNFSRCDLSDANLSMSTLSRA 193
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L+ A L R LT ++L + + D SD ++
Sbjct: 194 DLSEAKLCRADLTHANLTESNLSSCDLSDTIL 225
>gi|77404498|ref|YP_345074.1| hypothetical protein pREC1_0013 [Rhodococcus erythropolis PR4]
gi|77019879|dbj|BAE46254.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length = 589
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A K N R A + A + E+D +G+ GA L AN +GADL+D
Sbjct: 403 ADLEDADLESAKLSKANLRLAILSGATLPEADLTGAVLIGANLTNTTFSGANLSGADLTD 462
Query: 163 -----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ L EANLT AVL+ L ++L A + A+ SDA
Sbjct: 463 ADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 9/141 (6%)
Query: 87 NKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N EA G + G+A A A L KA K A +AD++E+ G+ A
Sbjct: 349 NLAEANLTGAYMFGAALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATLEGADLEDA 408
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LE A KAN A LS L EA+LT AVL+ LT + GA + GAD +DA
Sbjct: 409 DLESAKLSKANLRLAILSGA-----TLPEADLTGAVLIGANLTNTTFSGANLSGADLTDA 463
Query: 204 VIDLAQ-KQALCKYANGTNPI 223
+ +A ++A AN T +
Sbjct: 464 DLSVADLEEADLTEANLTGAV 484
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 15/98 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L AV + N AN T AD+ +++ S + Y AN T A+LSD
Sbjct: 473 ADLTEANLTGAVLIGANLAHANLTDADLSKANLSDADL----------YSANLTDANLSD 522
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L+ A LT A L+ T+LTR DL GA++ G D
Sbjct: 523 A-----DLSGATLTRAGLMGTILTRVDLTGAVLTGLDL 555
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 10/109 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR A N RAN A + E++ + + GAY+ GA L
Sbjct: 316 SGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANLTGAYM----------FGAAL 365
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
++ ++ L +A+L L +L +DL A +EGAD DA ++ A+
Sbjct: 366 TEAVLTDATLTKAHLAKTTLAGALLINADLQEATLEGADLEDADLESAK 414
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A LR A + R AN AD++ ++ SG+ A L+ A+ +A+ TGA+L+D +
Sbjct: 223 ARLRGASLGFADLRAANLQGADLQTAELSGATLRLANLKGAILREADLTGANLTDATLTE 282
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L EA L A+LV L DL +E A+ S A +
Sbjct: 283 ADLAEAKLQGAILVNVNLQNFDLSRLDLEKANLSGATL 320
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL + K N A AD+R + +G+ A L A ++AN A+L+ M
Sbjct: 304 DLSRLDLEKANLSGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANLTGAYMFGA 363
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L EA LT+A L + L ++ L GA++ AD +A ++
Sbjct: 364 ALTEAVLTDATLTKAHLAKTTLAGALLINADLQEATLE 401
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 63/139 (45%), Gaps = 8/139 (5%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA--- 158
AA ADL+ A R AN A +RE+D +G+ A L +A +A GA
Sbjct: 237 AANLQGADLQTAELSGATLRLANLKGAILREADLTGANLTDATLTEADLAEAKLQGAILV 296
Query: 159 --DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQAL 213
+L + + R+ L +ANL+ A L L + L GA +E A+ + A + +LA+
Sbjct: 297 NVNLQNFDLSRLDLEKANLSGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANLT 356
Query: 214 CKYANGTNPITGVSTRKSL 232
Y G V T +L
Sbjct: 357 GAYMFGAALTEAVLTDATL 375
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 64/144 (44%), Gaps = 8/144 (5%)
Query: 62 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGS---AAQFGSADLRKAVHVKE 118
F L+ A + +++ L + + EA G IG+ A ADL KA
Sbjct: 449 TFSGANLSGADLTDADLSVADLEEADLTEANLTGAVLIGANLAHANLTDADLSKANLSDA 508
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ AN T A++ ++D SG+ A L + + + TGA L+ + L NLT+
Sbjct: 509 DLYSANLTDANLSDADLSGATLTRAGLMGTILTRVDLTGAVLTG-----LDLVGVNLTDV 563
Query: 179 VLVRTVLTRSDLGGAIIEGADFSD 202
L + DL GAI+ G D S+
Sbjct: 564 NLDNVNMDDVDLSGAILPGTDTSE 587
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 53/120 (44%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY---------------L 145
S A A+L+ A+ + + AN T A + E+D + +K GA L
Sbjct: 251 SGATLRLANLKGAILREADLTGANLTDATLTEADLAEAKLQGAILVNVNLQNFDLSRLDL 310
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
EKA A ADL + L ANL +A L L ++L GA + GA ++AV+
Sbjct: 311 EKANLSGATLFEADLRSATLTGANLERANLAHAKLFEANLAEANLTGAYMFGAALTEAVL 370
>gi|427738633|ref|YP_007058177.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427373674|gb|AFY57630.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 436
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 7/135 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DL A N ANFT+ ++ ++F+ + GA LE A AN T ADLS
Sbjct: 239 ADLSGIDLCDANFSDANLEGANFTNVNLEGANFTNANLEGANLENAKLNNANLTNADLSY 298
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----DFSDAVIDLAQKQALCKYA 217
T + + L ANL N+ L +R++L AI+ GA +FSDA +L + Y
Sbjct: 299 TNLRKADLRCANLINSDLSNADASRANLSDAIVNGANLIQSNFSDA--NLRGCNLIKTYL 356
Query: 218 NGTNPITGVSTRKSL 232
+G N I R +L
Sbjct: 357 SGANLIRADLKRANL 371
>gi|434405486|ref|YP_007148371.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428259741|gb|AFZ25691.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 808
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 57/103 (55%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A+ N +AN + A++R ++ G+ +GAY A A+ +GA L
Sbjct: 103 SGANLSGADLSGAILFGANLSQANLSQANLRGANLRGADLSGAYPSGADLRGADLSGAYL 162
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S+ + + L++ANL+ A L + L+ + L GA + GAD S A
Sbjct: 163 SEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYLSGADLSGA 205
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 56/96 (58%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + N AN + A++ E+ G+K + A L +A AN +GA+LS+ ++
Sbjct: 30 ADLLGADLLGANLSGANLSQANLSEAILFGAKLSQANLSQANLSGANLSGANLSEAILFG 89
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L++ANL+ A L L+ +DL GAI+ GA+ S A
Sbjct: 90 AKLSQANLSQANLSGANLSGADLSGAILFGANLSQA 125
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 1/105 (0%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
+ A FG A L +A + N AN + A++ E+ G+K + A L +A AN +GA
Sbjct: 52 LSEAILFG-AKLSQANLSQANLSGANLSGANLSEAILFGAKLSQANLSQANLSGANLSGA 110
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
DLS ++ L++ANL+ A L L +DL GA GAD A
Sbjct: 111 DLSGAILFGANLSQANLSQANLRGANLRGADLSGAYPSGADLRGA 155
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 54/105 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A + +A + A++ +++ S + +GAYL A A+ +GADL
Sbjct: 148 SGADLRGADLSGAYLSEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYLSGADLSGADL 207
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S + R L+ A+L+ A L L+ +DL A + GA S A +
Sbjct: 208 SGARLSRADLSRADLSAADLRGAYLSAADLSAAYLSGAYLSAAYL 252
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 55/123 (44%), Gaps = 20/123 (16%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
S A A L A + N +AN + A++ +D SG+ GA L +A +AN G
Sbjct: 78 SGANLSEAILFGAKLSQANLSQANLSGANLSGADLSGAILFGANLSQANLSQANLRGANL 137
Query: 158 -----------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
ADLS + L++A L+ A L + L+++DL GA + GA
Sbjct: 138 RGADLSGAYPSGADLRGADLSGAYLSEAKLSQAKLSQANLSQANLSQADLSGAYLTGAYL 197
Query: 201 SDA 203
S A
Sbjct: 198 SGA 200
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 59/124 (47%), Gaps = 3/124 (2%)
Query: 83 LADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 139
L+ N +A+ G + G S A ADL A + + RA+ ++AD+R + S +
Sbjct: 177 LSQANLSQADLSGAYLTGAYLSGADLSGADLSGARLSRADLSRADLSAADLRGAYLSAAD 236
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
+ AYL A A +GA L+ + L+ +L+ L L+ DL GA + GA+
Sbjct: 237 LSAAYLSGAYLSAAYLSGAYLNAAYLSGAYLSGFDLSGVNLSGVNLSGFDLSGANLSGAN 296
Query: 200 FSDA 203
S A
Sbjct: 297 LSGA 300
>gi|425447182|ref|ZP_18827173.1| Genome sequencing data, contig C314 (fragment) [Microcystis
aeruginosa PCC 9443]
gi|389732326|emb|CCI03724.1| Genome sequencing data, contig C314 (fragment) [Microcystis
aeruginosa PCC 9443]
Length = 285
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 46/78 (58%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ T A++ E+ +G+ NGA LE+A A+ GA+L + ++ L EANL A L+R
Sbjct: 137 ADLTEANLTEAKLNGADLNGANLEEAKLNGADLNGANLEEAKLNGAFLEEANLKRANLIR 196
Query: 183 TVLTRSDLGGAIIEGADF 200
L S L GA ++GA+
Sbjct: 197 ANLIGSGLWGANLKGANL 214
>gi|239947676|ref|ZP_04699429.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
scapularis]
gi|239921952|gb|EER21976.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
scapularis]
Length = 953
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 49/178 (27%), Positives = 79/178 (44%), Gaps = 22/178 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
L+N + + AL A C N+ + N Y ++T E + ADLR+A+
Sbjct: 494 LENAFMNKTHALEAKFKEQC--NMQGITARNAYFSDTEFE----NILSLKEADLREAIMQ 547
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----------KANFTGADLSDTLMD 166
+ + A+ T A + ++ + A L A A KA G ++SD +
Sbjct: 548 RVKLKNADLTKAKLDKAKLEYADLTNATLTNATAQFAKLSNATLEKAEAEGLNISDAIAK 607
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYAN 218
+ EAN NA++ R LT++D A++E AD ++A+ A KQA K AN
Sbjct: 608 NINAKEANFKNAIMQRADLTKADFTKAVLENADMQAMEAAEAIFKEANLKQANLKVAN 665
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 38/149 (25%), Positives = 64/149 (42%), Gaps = 13/149 (8%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L +++C+ + + N A + + F ADL+K+
Sbjct: 354 LKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNVTARNTGFLF--ADLKKSKIE 410
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMD 166
+ RA D+ E++ + SKFN + A A K +N TG L+ M
Sbjct: 411 NSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQ 470
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAII 195
R+ + L NA+L + + +DL A +
Sbjct: 471 RVQMQGVVLNNALLDQANIVSTDLENAFM 499
Score = 39.3 bits (90), Expect = 1.7, Method: Composition-based stats.
Identities = 39/137 (28%), Positives = 56/137 (40%), Gaps = 30/137 (21%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA----------VAYKA 153
+ +ADL KA K A+ T+A + + +K + A LEKA +A
Sbjct: 550 KLKNADLTKAKLDKAKLEYADLTNATLTNATAQFAKLSNATLEKAEAEGLNISDAIAKNI 609
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAV--------------------LVRTVLTRSDLGGA 193
N A+ + +M R L +A+ T AV L + L ++L G
Sbjct: 610 NAKEANFKNAIMQRADLTKADFTKAVLENADMQAMEAAEAIFKEANLKQANLKVANLAGI 669
Query: 194 IIEGADFSDAVIDLAQK 210
EGADF A ID A K
Sbjct: 670 NKEGADFDKAKIDDATK 686
>gi|416374431|ref|ZP_11683193.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
0003]
gi|357266721|gb|EHJ15312.1| hypothetical protein CWATWH0003_0051 [Crocosphaera watsonii WH
0003]
Length = 279
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADLR A + N A+ TSA++ ++ +G+ NGA L + AN +G DLS
Sbjct: 54 ATLASADLRGANLKQVNLSYADLTSANLSGANLTGAILNGAKLNRVDLSYANLSGVDLSG 113
Query: 163 TLMDR-----MVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDA 203
+ R + L EA+LTNA L + +++S D A ++GA+FS A
Sbjct: 114 ANLSRSDLSYVDLREADLTNANLYKADISQSKLHNTDFQEAFLQGANFSRA 164
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 6/98 (6%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR+A N +A+ + + + +DF + GA +A AN GA L + + +
Sbjct: 125 DLREADLTNANLYKADISQSKLHNTDFQEAFLQGANFSRANLKGANLGGASLREVNLSLV 184
Query: 169 VLNEANLTNAVLVRTV------LTRSDLGGAIIEGADF 200
L+E NL V + L +++L GAI+ A+
Sbjct: 185 NLSEFNLQRVTRVGEIDLSSANLQKANLQGAILRHANL 222
Score = 37.0 bits (84), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 45/103 (43%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL A + A+ T A+++ +D S A L A AN +L
Sbjct: 12 TGADLNRADLIYARLLSAKLIDADLTGANLQNADLSWVDLENATLASADLRGANLKQVNL 71
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + L+ ANLT A+L L R DL A + G D S A
Sbjct: 72 SYADLTSANLSGANLTGAILNGAKLNRVDLSYANLSGVDLSGA 114
>gi|332711043|ref|ZP_08430978.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332350169|gb|EGJ29774.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 343
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 14/164 (8%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR 121
+ LA A++ S N + L N A+ T+ + A +A L KA+ ++ N
Sbjct: 170 LIDIDLANAILHQASLNDAELTGANLTGADLTKANL---ARANLNTAKLSKALLIRANLS 226
Query: 122 RANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N + +AD+R +D SG+ F GA L A AN TG+D LN ANL
Sbjct: 227 KTNLSITELRNADLRNADLSGANFMGADLTGADLTSANLTGSDFR-----YAKLNGANLK 281
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+A L LT ++L G + GAD + A ++ K+ N T
Sbjct: 282 HADLSGADLTDANLNGMDLTGADLTSANLEGISWNRQTKWKNAT 325
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 112 KAVHVKENF----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
KA+ V N+ R + +AD+ + D + + + A L A AN TGADL+ + R
Sbjct: 148 KALEVLNNYGVSMRGLDAPNADLIDIDLANAILHQASLNDAELTGANLTGADLTKANLAR 207
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LN A L+ A+L+R L++++L + AD +A
Sbjct: 208 ANLNTAKLSKALLIRANLSKTNLSITELRNADLRNA 243
Score = 37.4 bits (85), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 19/144 (13%)
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 119
W++ V A A A VA+ + + AL LN Y RG D A + +
Sbjct: 129 WQI-VDNA-AGAGVATSHARVKALEVLNNYGVSMRG------------LDAPNADLIDID 174
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
A A + +++ +G+ GA L KA N A+L+ + + +L ANL+
Sbjct: 175 LANAILHQASLNDAELTGANLTGADLTKA-----NLARANLNTAKLSKALLIRANLSKTN 229
Query: 180 LVRTVLTRSDLGGAIIEGADFSDA 203
L T L +DL A + GA+F A
Sbjct: 230 LSITELRNADLRNADLSGANFMGA 253
>gi|308205942|gb|ADO19342.1| pentapeptide repeat protein [Nostoc flagelliforme str. Sunitezuoqi]
Length = 146
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 9/106 (8%)
Query: 107 SADLRKAVHVKE---------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
SA +R+ + +E N + A+ D+R ++ G+ GA LE A AN
Sbjct: 28 SAPVRRLLETRECFGCNLTGANLKGAHLIGVDLRNANLKGANLEGANLEGADLTGANLKY 87
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L+ + +LN ANLTN L + L SD+ GA++ D S A
Sbjct: 88 ANLTKAFVSDTILNNANLTNVNLSNSRLYNSDVDGAVMANIDLSGA 133
>gi|383501588|ref|YP_005414947.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
gi|378932599|gb|AFC71104.1| hypothetical protein MC5_03910 [Rickettsia australis str. Cutlack]
Length = 960
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 6/116 (5%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAKLDKANLEYADLTNATLTNATAQFVKLSNATLEKAEA-----EGLNISDV 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 218
+ + EAN N ++ R LT++D A++E AD +D K+A K AN
Sbjct: 610 IAKNINAKEANFKNVIMQRADLTKADFTKAVLENADMQAVEALDAIFKEATLKQAN 665
Score = 40.4 bits (93), Expect = 0.84, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 10/104 (9%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------AN 154
F ADL+K+ + RA D+ E++ + SKFN + A A K +N
Sbjct: 404 FLFADLKKSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAEKLIMQDSEWKNSN 463
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
TG L+ M R+ + L NA+L + + +DL A + A
Sbjct: 464 LTGISLAYADMQRVQMQGVVLNNALLDQANIISTDLENAFMNNA 507
>gi|162450958|ref|YP_001613325.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
gi|161161540|emb|CAN92845.1| Hypothetical WD-repeat protein [Sorangium cellulosum So ce56]
Length = 2305
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 40/130 (30%), Positives = 60/130 (46%), Gaps = 18/130 (13%)
Query: 89 YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDF---------- 135
+ ET G G+ Q DLR A N R AN + AD+ +D
Sbjct: 1111 WAEETAGWISEGADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAML 1170
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
SG+K +G L +A+A++A+FT A+ +++ A L+ AVL ++ T GA
Sbjct: 1171 SGAKLHGTILRRAIAHRADFTQAEAKGAIVEL-----AKLSGAVLRQSTWTGCRWNGAQA 1225
Query: 196 EGADFSDAVI 205
EG D S +I
Sbjct: 1226 EGTDLSACLI 1235
Score = 40.4 bits (93), Expect = 0.72, Method: Composition-based stats.
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 16/142 (11%)
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEANLTNAVLVR 182
AD+ +G GA L A AN +GADLS D + +L+ A L +L R
Sbjct: 1123 ADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAMLSGAKLHGTILRR 1182
Query: 183 TVLTRSDL-----GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNS 237
+ R+D GAI+E A S AV+ + C++ T +S C +
Sbjct: 1183 AIAHRADFTQAEAKGAIVELAKLSGAVLRQSTWTG-CRWNGAQAEGTDLS-----ACLIA 1236
Query: 238 RRNAYGSPSSPLLSAPPQKLLD 259
R A+ + L + PP +D
Sbjct: 1237 GRGAHPERARRLAATPPLAHVD 1258
>gi|218440553|ref|YP_002378882.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218173281|gb|ACK72014.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 320
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 58/111 (52%), Gaps = 15/111 (13%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A+L++A+ +F+ AN + A++ G K NGA L +A KAN +G DL+
Sbjct: 100 ANLSNANLKQAILTNVDFKSANLSGANL-----VGVKLNGANLSRADLSKANLSGIDLTG 154
Query: 163 TLMDRMVLNEANLT----------NAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R+ L+ ANL A L R+ L DL GAI++G++ A
Sbjct: 155 ANLSRVDLSRANLNGADLSGANLYKADLSRSNLRNGDLQGAILQGSNLHKA 205
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 55/124 (44%), Gaps = 16/124 (12%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA--- 148
E ++G G+ F DL+ ++ AN + + ++ SG+ N A L +A
Sbjct: 5 EILWQYGQGNR-DFSRLDLQNINIIQAELMEANLSRTALDWANLSGTNLNRANLNRADLM 63
Query: 149 -------VAYKANFTGADLSD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
+A+ GADLSD ++ L ANL+NA L + +LT D A +
Sbjct: 64 HAKLISAQLIEADLIGADLSDADLSWVNLEGAKLTYANLSNANLKQAILTNVDFKSANLS 123
Query: 197 GADF 200
GA+
Sbjct: 124 GANL 127
>gi|113474166|ref|YP_720227.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110165214|gb|ABG49754.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 1033
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 3/131 (2%)
Query: 91 AETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 147
A+ G + IG+ A ADLR A N A + A++ ++ SG+ +GA L
Sbjct: 865 ADLSGAYLIGANLIGADLSRADLRYADLSGANLSDAKLSGANLSDAKLSGAGLSGADLRY 924
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
A A+ + A LSD + L+ A L+ A L L+ +DL A + GAD SDA +
Sbjct: 925 ADLSGADLSRAKLSDAGLSGANLSVAGLSGADLRYADLSGADLRYADLSGADLSDANLSN 984
Query: 208 AQKQALCKYAN 218
+ + K++N
Sbjct: 985 VRWNSQTKWSN 995
Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats.
Identities = 40/112 (35%), Positives = 59/112 (52%), Gaps = 2/112 (1%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
ET G+F G+ ++ ADL A + N R A+ + A + +D SG+ GA L A
Sbjct: 826 ETVGQFLSGADLRY--ADLSGAYLIVANLRYADLSGAYLISADLSGAYLIGANLIGADLS 883
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+A+ ADLS + L+ ANL++A L L+ +DL A + GAD S A
Sbjct: 884 RADLRYADLSGANLSDAKLSGANLSDAKLSGAGLSGADLRYADLSGADLSRA 935
>gi|409993957|ref|ZP_11277081.1| hypothetical protein APPUASWS_22623 [Arthrospira platensis str.
Paraca]
gi|409935173|gb|EKN76713.1| hypothetical protein APPUASWS_22623 [Arthrospira platensis str.
Paraca]
Length = 336
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 79/163 (48%), Gaps = 21/163 (12%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + V +F+ AN +A ++E++ GS F L+ A +KAN T + +
Sbjct: 140 ADLEQVTLVDTDFKEANLKTAKLQEANLKGSTFELTQLQGANLWKANLQECFFLLTQLQK 199
Query: 168 MVLNEANLTNAV-----LVRTVLTRSDLGGAII----EGADFSDAVIDLAQKQ------A 212
+ LN ANL NA L+ L +++L GA I +GA+F +A + A Q A
Sbjct: 200 VNLNAANLQNAELQGVNLLEANLQQANLQGAYILGNLQGANFQEANLKGANLQGAYLQDA 259
Query: 213 LCKYAN--GTN----PITGVSTRKSLGCGNSRRNAYGSPSSPL 249
K AN G N +TGV+ ++ G + +NA G + +
Sbjct: 260 NFKRANLRGVNLKDANLTGVNFEEAHLQGANLQNAQGLTTQQI 302
>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 576
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 45/78 (57%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + A + +D S +K NGA L A A F GADLS + +VLN+A+L+ +L
Sbjct: 421 NLSDAILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEA 480
Query: 184 VLTRSDLGGAIIEGADFS 201
LT +DL A++ G DFS
Sbjct: 481 DLTGADLSDAVLLGTDFS 498
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 64/118 (54%), Gaps = 14/118 (11%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A FG A+L N AN +SAD+ ++ +G+ +GA LE +A+ + ADL
Sbjct: 288 TGADFGDANLSSV-----NLSGANLSSADLSSANLTGANLSGANLE-----RADLSRADL 337
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---VIDLAQKQALCK 215
S +++ L+ ANL+ L R++L AI+ GA+ SDA +DL++ LC+
Sbjct: 338 SSCILNDGELSHANLSGVNFRDAELCRANLSNAILFGANLSDANLNHVDLSRAD-LCR 394
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF-----TGADLSDTLMDRMVLNEA 173
NF+ A +A++ +FSG+ +GAYL A ANF TGAD D + + L+ A
Sbjct: 246 NFQGAYLGNANLTGVNFSGANLSGAYLGDANLTGANFQGANLTGADFGDANLSSVNLSGA 305
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NL++A L LT ++L GA +E AD S A
Sbjct: 306 NLSSADLSSANLTGANLSGANLERADLSRA 335
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 49/106 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SADL A N AN AD+ +D S N L A NF A+L
Sbjct: 303 SGANLSSADLSSANLTGANLSGANLERADLSRADLSSCILNDGELSHANLSGVNFRDAEL 362
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ +L ANL++A L L+R+DL A + GAD + A ++
Sbjct: 363 CRANLSNAILFGANLSDANLNHVDLSRADLCRADLSGADLTHATLN 408
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 57/111 (51%), Gaps = 6/111 (5%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRAN---FTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
I AA A L A K N+ R N F AD+ D +G N A L + +A+
Sbjct: 426 ILEAADLSYAKLNGA---KLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGILSEADL 482
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
TGADLSD ++ + ANL +A L + L+ + L GA + A+FS A++D
Sbjct: 483 TGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLSSANFSYAILD 533
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 56/117 (47%), Gaps = 13/117 (11%)
Query: 95 GEF---GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
GEF G A G+A+L NF AN + A + +++ +G+ F GA L A
Sbjct: 239 GEFLRDGNFQGAYLGNANLTGV-----NFSGANLSGAYLGDANLTGANFQGANLTGADFG 293
Query: 152 KANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN + GA+LS + L ANL+ A L R L+R+DL I+ + S A
Sbjct: 294 DANLSSVNLSGANLSSADLSSANLTGANLSGANLERADLSRADLSSCILNDGELSHA 350
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 42/80 (52%), Gaps = 10/80 (12%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
GI S A ADL AV + +F AN SA++ S+ SG+ NGA L ANF+
Sbjct: 475 GILSEADLTGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLS-----SANFSY 529
Query: 158 ADLSDTLMDRMVLNEANLTN 177
A L DT L+EANL +
Sbjct: 530 AILDDT-----DLSEANLED 544
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 6/112 (5%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-- 156
+ +A FG A+L A + RA+ AD+ +D + + NG L + + N +
Sbjct: 367 LSNAILFG-ANLSDANLNHVDLSRADLCRADLSGADLTHATLNGTNLSDTILFSTNLSDA 425
Query: 157 ---GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ADLS ++ LN A L A+ + L+ DL G ++ AD S ++
Sbjct: 426 ILEAADLSYAKLNGAKLNYARLNGAMFLGADLSGVDLTGVVLNDADLSGGIL 477
Score = 37.0 bits (84), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 15/96 (15%)
Query: 119 NFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
NFR RAN ++A + ++ S + N L +A +A+ +GADL+ LN
Sbjct: 356 NFRDAELCRANLSNAILFGANLSDANLNHVDLSRADLCRADLSGADLT-----HATLNGT 410
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
NL++ +L T +L AI+E AD S A ++ A+
Sbjct: 411 NLSDTILFST-----NLSDAILEAADLSYAKLNGAK 441
>gi|37522461|ref|NP_925838.1| hypothetical protein gll2892 [Gloeobacter violaceus PCC 7421]
gi|35213462|dbj|BAC90833.1| gll2892 [Gloeobacter violaceus PCC 7421]
Length = 457
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR A N AN AD+ +D +G+ N A+L A +AN GA+L+
Sbjct: 79 ANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNR 138
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L ANL+ L R ++ +DL GA + GA+ S+A
Sbjct: 139 ANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEA 179
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 50/101 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G A+L +A N AN A++ +D SG+ NGA L A A+ A+L
Sbjct: 74 ANLGGANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGG 133
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++R L EANL A L L+R+ + GA + GAD A
Sbjct: 134 AELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGA 174
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 51/104 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A N A+ AD+RE++ G++ N A L +A AN +G LS
Sbjct: 99 ANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSR 158
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
M L A+L A L L ++LGGA ++GAD A ++
Sbjct: 159 AFMSGADLRGADLGGANLSEADLGGANLGGANLKGADLGGANLE 202
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G ADL A N AN + AD+R ++ + + N A L A A+ GA+L+
Sbjct: 59 ADLGGADLGGADLEGANLGGANLSEADLRGANLNWANLNWANLNWADLSGADLNGANLNW 118
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ L EANL A L R L ++LGGA + G S A +
Sbjct: 119 AHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFM 161
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 5/100 (5%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A+ A+LR+A N RA + AD+R +D G+ + A L A AN G
Sbjct: 134 AELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSEADLGGANLGGANLKG 193
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
ADL ++R L A+L A L RT LT L GA++EG
Sbjct: 194 ADLGGANLERTSLRGADLRGADLRRTRLTGCSLEGAVLEG 233
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 47/98 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+A RAN A++ ++ SG + A++ A A+ GA+LS+
Sbjct: 119 AHLNWADLREANLGGAELNRANLREANLGGANLSGVSLSRAFMSGADLRGADLGGANLSE 178
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ L ANL A L L R+ L GA + GAD
Sbjct: 179 ADLGGANLGGANLKGADLGGANLERTSLRGADLRGADL 216
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 66/143 (46%), Gaps = 7/143 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA G A+L + AN AD+ +D G+ GA LE A AN + ADL
Sbjct: 32 SAADLGGANLGGV-----DLGGANLGGADLDGADLGGADLGGADLEGANLGGANLSEADL 86
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN- 218
++ LN ANL A L L ++L A + AD +A + A+ +A + AN
Sbjct: 87 RGANLNWANLNWANLNWADLSGADLNGANLNWAHLNWADLREANLGGAELNRANLREANL 146
Query: 219 GTNPITGVSTRKSLGCGNSRRNA 241
G ++GVS ++ G R A
Sbjct: 147 GGANLSGVSLSRAFMSGADLRGA 169
>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 213
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 11/106 (10%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G Q A+L + + RAN A++ ++F+GSKF GA+LE AN GA+
Sbjct: 9 GELKQLAGANLEDENLSQTDLSRANLAGANLVGTNFAGSKFEGAHLE-----GANLMGAN 63
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L +T + ANL A L++ LT +D+ G+ + GA+ AVI
Sbjct: 64 LKETDL------RANLMGANLMQADLTGADVRGSNLRGANLMGAVI 103
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
K ++ N AN AD+ +D GS GA L AV + +F GA LS T + + L
Sbjct: 65 KETDLRANLMGANLMQADLTGADVRGSNLRGANLMGAVISEVSFAGAFLSGTNLINVDLQ 124
Query: 172 ----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DLAQKQAL 213
ANLT A L L+R+DL GA++ A+ +A + +LA L
Sbjct: 125 GVDLRGADLRGANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGANLAGANLL 184
Query: 214 CKYANGTNPITGV 226
C G N + GV
Sbjct: 185 CAELEGAN-VNGV 196
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + D+R +D G+ GA L+ A +A+ GA LS+ ++ L +ANL+ A
Sbjct: 117 NLINVDLQGVDLRGADLRGANLTGANLKGADLSRADLQGALLSEANLEEADLRKANLSGA 176
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
L L ++L GA + G DF A +
Sbjct: 177 NLAGANLLCAELEGANVNGVDFDRACL 203
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L AV + +F A + ++ D G GA L A AN GADLS +
Sbjct: 96 ANLMGAVISEVSFAGAFLSGTNLINVDLQGVDLRGADLRGANLTGANLKGADLSRADLQG 155
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+L+EANL A L + L+ ++L GA + A+ A ++
Sbjct: 156 ALLSEANLEEADLRKANLSGANLAGANLLCAELEGANVN 194
>gi|291571143|dbj|BAI93415.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 331
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 79/163 (48%), Gaps = 21/163 (12%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + V +F+ AN +A ++E++ GS F L+ A +KAN T + +
Sbjct: 135 ADLEQVTLVDTDFKEANLKTAKLQEANLKGSTFELTQLQGANLWKANLQECFFLLTQLQK 194
Query: 168 MVLNEANLTNAV-----LVRTVLTRSDLGGAII----EGADFSDAVIDLAQKQ------A 212
+ LN ANL NA L+ L +++L GA I +GA+F +A + A Q A
Sbjct: 195 VNLNAANLQNAELQGVNLLEANLQQANLQGAYILGNLQGANFQEANLKGANLQGAYLQDA 254
Query: 213 LCKYAN--GTN----PITGVSTRKSLGCGNSRRNAYGSPSSPL 249
K AN G N +TGV+ ++ G + +NA G + +
Sbjct: 255 NFKRANLRGVNLKDANLTGVNFEEAHLQGANLQNAQGLTTQQI 297
>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris marina MBIC11017]
gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
marina MBIC11017]
Length = 532
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 20/116 (17%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+F + DLR A+ + NF RANFT A++R ++ L +A A+ ADL
Sbjct: 429 KFQNTDLRDAILINANFGRANFTGANLRNAN----------LMQAYMSHADLANADLRG- 477
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
ANL++A L L ++L GA + GA S++ + AQ L Y NG
Sbjct: 478 ---------ANLSDAYLSHANLRGANLCGADLSGAKLSESQLSFAQTNWLTVYPNG 524
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-LMDRMVLNE---------ANLTNAVLV 181
+ DFSG L K ANF +T L D +++N ANL NA L+
Sbjct: 402 QRDFSGQDLRNLNLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGANLRNANLM 461
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ ++ +DL A + GA+ SDA + A
Sbjct: 462 QAYMSHADLANADLRGANLSDAYLSHA 488
>gi|119486130|ref|ZP_01620190.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
gi|119456621|gb|EAW37750.1| hypothetical protein L8106_17342 [Lyngbya sp. PCC 8106]
Length = 207
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 64/137 (46%), Gaps = 23/137 (16%)
Query: 88 KYEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANF---------------TSAD 129
K A RG G+ A +ADLR A+ + + R A+F T D
Sbjct: 62 KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGALDLTGVD 121
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
+R +D G + A L++A N +GADLS + L EANL+ AVL T L R++
Sbjct: 122 LRGADLRGVSLSQAILQQADLRNTNLSGADLS-----QADLEEANLSGAVLRGTNLERAN 176
Query: 190 LGGAIIEGADFSDAVID 206
L AI+E + ++D
Sbjct: 177 LLCAIVEQTQWFGTILD 193
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 62/131 (47%), Gaps = 19/131 (14%)
Query: 108 ADLRKAVHVKENFRRANFTS-----ADMRESDFSG----------SKFNGAYLEKAVAYK 152
A+L++A ++ N R A+ T AD+R +D G + F GA+L A
Sbjct: 56 ANLQRA-KLRANLRGADLTGTNLIGADLRNADLRGAILLDADVREASFAGAFLTGASCGA 114
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 209
+ TG DL + + L++A L A L T L+ +DL A +E A+ S AV+ +L +
Sbjct: 115 LDLTGVDLRGADLRGVSLSQAILQQADLRNTNLSGADLSQADLEEANLSGAVLRGTNLER 174
Query: 210 KQALCKYANGT 220
LC T
Sbjct: 175 ANLLCAIVEQT 185
>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 346
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 73/151 (48%), Gaps = 2/151 (1%)
Query: 59 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 118
NW L+ A +A+ + + L+ N A+ + IG+ S DLR+A
Sbjct: 95 NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ +A+ T A++R++D +G+K + L A AN TGA+L + + LN ANLT A
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTGANLKQANLSQAHLNWANLTKA 212
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L L ++L A + D ++ + AQ
Sbjct: 213 DLREANLCGANLSKANLSQTDLTEVCLKDAQ 243
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F R + AD+ E++ SG GA L KA AN + A+LS + L ANLT A
Sbjct: 29 FNRLSLAKADLSEANLSGVYLGGASLTKANLSGANLSRANLSGASLSGANLTGANLTGAN 88
Query: 180 LVRTVLTRSDLGGAIIEGADFSDAVI 205
L L +DL GA + GA+ ++A +
Sbjct: 89 LAGAHLNWADLSGANLSGANLANADV 114
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 15/118 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES----------DFSGSKFNGAYLEKAVAYK 152
A ADLR+A N +AN + D+ E +FSG+ G L +
Sbjct: 207 ANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSGINFSGANLTGVDLSNKLLTG 266
Query: 153 ANFTGAD-----LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AN +GA+ LS + + L EANL+ A L+ + L +DL A + GA+ S A +
Sbjct: 267 ANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMDADLTKANLSGANLSQANV 324
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 10/118 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L A N AN A + +D SG+ +GA L A AN +GA+L
Sbjct: 65 SRANLSGASLSGANLTGANLTGANLAGAHLNWADLSGANLSGANLANADVSGANLSGANL 124
Query: 161 SDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S +++ L EANL+ A L + LT+++L A + GA + ++LA
Sbjct: 125 SGAKLNQTYLIGTNLKSVDLREANLSLASLNKADLTKANLRQADLTGAKLKQSNLNLA 182
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 52/108 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S G A L KA N RAN + A + ++ +G+ GA L A A+ +GA+L
Sbjct: 45 SGVYLGGASLTKANLSGANLSRANLSGASLSGANLTGANLTGANLAGAHLNWADLSGANL 104
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S + ++ ANL+ A L L ++ L G ++ D +A + LA
Sbjct: 105 SGANLANADVSGANLSGANLSGAKLNQTYLIGTNLKSVDLREANLSLA 152
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 60/121 (49%), Gaps = 20/121 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK----------AVAYK 152
A A+L++A + + AN T AD+RE++ G+ + A L + A
Sbjct: 187 ANLTGANLKQANLSQAHLNWANLTKADLREANLCGANLSKANLSQTDLTEVCLKDAQLSG 246
Query: 153 ANFTGADLSDT-LMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
NF+GA+L+ L ++++ L+ ANL+ A L++T L ++L A + G+ D
Sbjct: 247 INFSGANLTGVDLSNKLLTGANLSGAELSLANLSGAYLIQTNLREANLSEANLMGSHLMD 306
Query: 203 A 203
A
Sbjct: 307 A 307
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 63/139 (45%), Gaps = 15/139 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A N AN +AD+ ++ SG+ +GA L + N DL +
Sbjct: 92 AHLNWADLSGA-----NLSGANLANADVSGANLSGANLSGAKLNQTYLIGTNLKSVDLRE 146
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ---KQALCKY 216
+ LN+A+LT A L + LT + L + + AD + A + +L Q QA +
Sbjct: 147 ANLSLASLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLTGANLKQANLSQAHLNW 206
Query: 217 ANGTNPITGVSTRKSLGCG 235
AN +T R++ CG
Sbjct: 207 AN----LTKADLREANLCG 221
>gi|359458687|ref|ZP_09247250.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 203
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 66/146 (45%), Gaps = 22/146 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDF----------SGSKFNGAYLEKAVAYK 152
A F SADLRKA + + R A AD+R ++ SG+ +GA L A+ Y
Sbjct: 53 ANFASADLRKAKLFRADLRAACLYRADLRGANLKGANLFGANLSGANLSGANLSNAMLYC 112
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF--SDAV-IDLAQ 209
AN GA+L T++D L N ++ L +L + L G EG +D + I+L Q
Sbjct: 113 ANLGGANLRGTILDSANLMRVNFSHGDLRNAMLRNAKLQGTHFEGTRMLQTDLIEINLNQ 172
Query: 210 KQALCKY---------ANGTNPITGV 226
Q Y A G ITG+
Sbjct: 173 AQIDGVYLMDPDANNTAMGNTAITGI 198
>gi|428300657|ref|YP_007138963.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428237201|gb|AFZ02991.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 516
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA +ADLR+A + N R+AN + +++ S +G+ A L K + + +GA+L
Sbjct: 119 AANLKNADLREATLRQANLRQANLSEVNLKGSLLTGANLEQANLSKTDLSRTDLSGANLR 178
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
DT + + L+ ANL+ A L L ++L GA + AD + A
Sbjct: 179 DTELKQSNLSRANLSGANLAGANLRWANLTGANLRWADLTGA 220
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N R AN T A++R +D +G+K +GA + TGA+LS+ + L ANL A
Sbjct: 201 NLRWANLTGANLRWADLTGAKLSGA----------DLTGANLSNANLSNCTLVHANLHQA 250
Query: 179 VLVRTVLTRSDLGGAIIEGADF 200
L++T +DL GA + GA
Sbjct: 251 RLIKTEWVGADLSGASLTGAKL 272
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +L+ ++ N +AN + D+ +D SG+ L+++ +AN +GA+L+
Sbjct: 140 ANLSEVNLKGSLLTGANLEQANLSKTDLSRTDLSGANLRDTELKQSNLSRANLSGANLAG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANLT A L LT + L GA + GA+ S+A
Sbjct: 200 A-----NLRWANLTGANLRWADLTGAKLSGADLTGANLSNA 235
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 10/93 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ A L ++ ++ N RA +A+++ +D L +A +AN A+L
Sbjct: 93 SHAELSKASLVRSELIRANLSRATLIAANLKNAD----------LREATLRQANLRQANL 142
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
S+ + +L ANL A L +T L+R+DL GA
Sbjct: 143 SEVNLKGSLLTGANLEQANLSKTDLSRTDLSGA 175
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 50/101 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L KA V+ RAN + A + ++ + A L +A +AN + +L
Sbjct: 90 ADLSHAELSKASLVRSELIRANLSRATLIAANLKNADLREATLRQANLRQANLSEVNLKG 149
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+L+ L +ANL+ L RT L+ ++L ++ ++ S A
Sbjct: 150 SLLTGANLEQANLSKTDLSRTDLSGANLRDTELKQSNLSRA 190
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 49/101 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S +A+L+ A N RA+ + A++ ++ S+ A L +A AN ADL
Sbjct: 68 SGVHLTNANLKGASLNVTNLVRADLSHAELSKASLVRSELIRANLSRATLIAANLKNADL 127
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ + + L +ANL+ L ++LT ++L A + D S
Sbjct: 128 REATLRQANLRQANLSEVNLKGSLLTGANLEQANLSKTDLS 168
>gi|428222289|ref|YP_007106459.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
gi|427995629|gb|AFY74324.1| serine/threonine protein kinase [Synechococcus sp. PCC 7502]
Length = 563
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 5/97 (5%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD-----TLMD 166
+ V V+ + NF + D+ ++ +G+ +G + ++ + +F +DL+ +M
Sbjct: 396 RKVIVEYGHGKRNFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMT 455
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ LN ANL A + R +LT++DLGGA + AD +A
Sbjct: 456 QVKLNGANLAQAKMQRAILTKADLGGACLNQADLREA 492
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 155
S A +L V + +F +D+ + F+G+ K NGA L +A +A
Sbjct: 415 SKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMTQVKLNGANLAQAKMQRAIL 474
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T ADL +++ L EANL +A + + L+ +DL GA ++GA S A
Sbjct: 475 TKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYLSQA 522
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 11/123 (8%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
E+G G F + DL KA N + + + E+DF S A A+ +
Sbjct: 401 EYGHGKR-NFANLDLSKASLAGTNLSGIVMSRSKLVETDFCQSDLTHASFTGAIMTQVKL 459
Query: 156 TGADLSDTLMDRMVL----------NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GA+L+ M R +L N+A+L A L ++++DL GA + GA+ A +
Sbjct: 460 NGANLAQAKMQRAILTKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYL 519
Query: 206 DLA 208
A
Sbjct: 520 SQA 522
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A+ + + I + A G A L +A + N + A + AD+ +D +G+ GAYL +A
Sbjct: 465 AQAKMQRAILTKADLGGACLNQADLREANLQSAYMSKADLSGADLTGANLKGAYLSQANL 524
Query: 151 YKANFTGADL 160
N +GADL
Sbjct: 525 RGTNLSGADL 534
>gi|307152112|ref|YP_003887496.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982340|gb|ADN14221.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 180
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 45/87 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA V N N AD+RE++ SG+ A L A AN TGA+L +
Sbjct: 60 ANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGANLRE 119
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSD 189
+D L ANLT+A ++ T L +D
Sbjct: 120 VNLDGANLMGANLTDAQIINTDLNMAD 146
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 66/128 (51%), Gaps = 6/128 (4%)
Query: 79 NISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS 138
NI L L +Y+A+ R +F + A+L A + N RA+ + AD+ E+D SG+
Sbjct: 2 NIQEL--LKRYKAKER-DF---QGSNLHQANLEGANLQRINLTRADLSGADLSEADLSGA 55
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
A L A KA+ GA+L + + L EANL+ A L + L ++L GA + GA
Sbjct: 56 CLMQANLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGANLTGA 115
Query: 199 DFSDAVID 206
+ + +D
Sbjct: 116 NLREVNLD 123
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 62/120 (51%), Gaps = 4/120 (3%)
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
++++E +R D + S+ + GA L++ +A+ +GADLS+ + L +A
Sbjct: 1 MNIQELLKRYKAKERDFQGSNLHQANLEGANLQRINLTRADLSGADLSEADLSGACLMQA 60
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQKQALCKYANGTNPITGVSTRK 230
NLT+A L++ L ++L + GAD +A + DL + C G N +TG + R+
Sbjct: 61 NLTDADLLKAHLVGANLVEINLIGADLREANLSGADLTKADLRCANLTGAN-LTGANLRE 119
>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 231
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 60/107 (56%), Gaps = 5/107 (4%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGA 158
QF + ++A +K N A+F+ AD R S +F+ + F GA L +A+ + +FTGA
Sbjct: 16 QFKTCKFQEAELIKVNLSGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGA 75
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L ++ + L+ A L+ A L L ++ LGGA + A+ +A++
Sbjct: 76 NLEKAILREVELSGAILSQANLTGVNLMKATLGGANLSLANLREAIL 122
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 70/142 (49%), Gaps = 25/142 (17%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F AD R + K NF A F AD+ E+ G+ F GA LEKA+ + +GA
Sbjct: 33 SGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGANLEKAILREVELSGA-- 90
Query: 161 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADF---SDAVIDLAQ--- 209
+L++ANLT L++ L+ ++L AI+ ADF S+ + +L Q
Sbjct: 91 --------ILSQANLTGVNLMKATLGGANLSLANLREAILYEADFRPTSEHITNLQQADL 142
Query: 210 KQALCKYANGTNPITGVSTRKS 231
+A YA + GV+ R++
Sbjct: 143 SEADLSYA----KLNGVNLRQA 160
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 54/109 (49%), Gaps = 13/109 (11%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A A+LR+A+ + +FR N AD+ E+D S +K NG L +A A
Sbjct: 110 ANLSLANLREAILYEADFRPTSEHITNLQQADLSEADLSYAKLNGVNLRQAKLMGAKLCR 169
Query: 158 ADLSDTLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADLS + + L EANL NA L+ +DL GAI+ AD + A
Sbjct: 170 ADLSKGIWQNSLPTDLCEANLRNA-----DLSYADLSGAILSYADLTGA 213
>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 266
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+A ++ N + + AD+R ++ G GA L KA + + A+LS+
Sbjct: 153 ADLNDADLREAQLIRANLSEVDLSGADLRAANLKGVNLRGADLNKA-----DLSRANLSE 207
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
+ LNEANL+ A L L ++L + GA F + ++D K++
Sbjct: 208 AYLYLANLNEANLSRADLSEANLHEANLSRVDLRGAIFCETIMDDGHKES 257
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 44/84 (52%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N N + D+ ++ SG+ +GA L +A + N + +L+ ++ LN+A+L
Sbjct: 102 QSNLSGVNLSGVDLSGANLSGADLSGADLSEADLSRVNLSRVNLNGANLNDADLNDADLR 161
Query: 177 NAVLVRTVLTRSDLGGAIIEGADF 200
A L+R L+ DL GA + A+
Sbjct: 162 EAQLIRANLSEVDLSGADLRAANL 185
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 1/91 (1%)
Query: 111 RKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
R + +KE + ++N + ++ D SG+ +GA L A +A+ + +LS ++
Sbjct: 90 RSLLSLKEFDLSQSNLSGVNLSGVDLSGANLSGADLSGADLSEADLSRVNLSRVNLNGAN 149
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
LN+A+L +A L L R++L + GAD
Sbjct: 150 LNDADLNDADLREAQLIRANLSEVDLSGADL 180
>gi|427715910|ref|YP_007063904.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348346|gb|AFY31070.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 1031
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 56/108 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F A+L A N RAN + + ++ SG+ +GA L A +AN G L
Sbjct: 843 SGGNFSRANLSGANLSVANLSRANLSGTNFSRANLSGANLSGADLSTANLSRANLNGVYL 902
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
S ++R L+ AN + A L R L+ +DL GA + GAD SDA ++ A
Sbjct: 903 SRANLNRANLSGANFSRADLSRANLSGADLSGADLSGADLSDANLNRA 950
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%)
Query: 94 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
RG F G ADL A + AN + A++ ++D SG F+ A L A A
Sbjct: 801 RGNFNSVVGQFLGGADLSGANLSDADLSLANLSHANLSDADLSGGNFSRANLSGANLSVA 860
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
N + A+LS T R L+ ANL+ A L L+R++L G + A+ + A
Sbjct: 861 NLSRANLSGTNFSRANLSGANLSGADLSTANLSRANLNGVYLSRANLNRA 910
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 55/128 (42%), Gaps = 30/128 (23%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
S A ADL A N AN + AD+ +FS + +GA L A +AN +G
Sbjct: 818 SGANLSDADLSLA-----NLSHANLSDADLSGGNFSRANLSGANLSVANLSRANLSGTNF 872
Query: 158 ----------------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
A+L+ + R LN ANL+ A R L+R++L GA +
Sbjct: 873 SRANLSGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADL 932
Query: 196 EGADFSDA 203
GAD S A
Sbjct: 933 SGADLSGA 940
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A + N + A++ ++ SG+ F+ A L +A A+ +GADL
Sbjct: 878 SGANLSGADLSTANLSRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADLSGADL 937
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + LN ANL+ A L R L+ ++L A + G + S A
Sbjct: 938 SGADLSDANLNRANLSRANLKRANLSDANLSSANLSGDNLSRA 980
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A L +A + N ANF+ AD+ ++ SG+ +GA L A AN A+L
Sbjct: 893 SRANLNGVYLSRANLNRANLSGANFSRADLSRANLSGADLSGADLSGADLSDANLNRANL 952
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
S + R L++ANL++A L L+R++L A + A+ D
Sbjct: 953 SRANLKRANLSDANLSSANLSGDNLSRANLSRANLSDANLGD 994
>gi|159030580|emb|CAO88243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 354
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 66/133 (49%)
Query: 72 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 131
+ A+ S + LA L + + T AA+ L A + N R AN T AD+
Sbjct: 204 IYAAVSDDFLELAQLAELDPLTDFTGANLLAAELSGISLGMANLYQANLRGANLTDADLS 263
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
E + S + F GA L A+ A+ + AD + + L +NLT A LV +T+++L
Sbjct: 264 EINGSHASFKGADLSGALLANADLSYADFYRSSLALANLIGSNLTGANLVEVNITQANLS 323
Query: 192 GAIIEGADFSDAV 204
GA ++GA F+D V
Sbjct: 324 GAKVQGAKFADNV 336
>gi|428777412|ref|YP_007169199.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691691|gb|AFZ44985.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 333
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-----MDRMVLN 171
+ N RRA+ S ++ E DF+ + A L +A KAN GADLS+ + + L
Sbjct: 154 RTNLRRADLESLNLDELDFTQANLTEANLVRATLTKANLQGADLSEANLFNADLSKANLK 213
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ANL A L+R L R+DL GA + GA ++A
Sbjct: 214 GANLRGANLIRANLERADLSGADLRGAYLNEA 245
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NF 155
S A +ADL KA N R AN A++ +D SG+ GAYL +A ++A N
Sbjct: 198 SEANLFNADLSKANLKGANLRGANLIRANLERADLSGADLRGAYLNEAKMFEASLDNVNL 257
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ A+L T M R AN +NA L + ++DL
Sbjct: 258 SQANLHRTRMIRASFKHANFSNANLTEANMRQADL 292
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 68/160 (42%), Gaps = 23/160 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG----- 157
+ F A L +A V N A+ +++ E++ +G+ +GA L A FTG
Sbjct: 85 SDFHGAILHRANLVDTNLTLASLLDSNLMEANLAGADLSGADLSGVCLLGAVFTGSEQRG 144
Query: 158 ---------------ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
ADL +D + +ANLT A LVR LT+++L GA + A+ +
Sbjct: 145 SRKSTTKLKRTNLRRADLESLNLDELDFTQANLTEANLVRATLTKANLQGADLSEANLFN 204
Query: 203 AVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAY 242
A DL++ G N I R L G R AY
Sbjct: 205 A--DLSKANLKGANLRGANLIRANLERADL-SGADLRGAY 241
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
+ A A+L +A K N + AN +AD+ +++ G+ GA L +A +A+
Sbjct: 173 TQANLTEANLVRATLTKANLQGADLSEANLFNADLSKANLKGANLRGANLIRANLERADL 232
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+GADL ++ + EA+L N L + L R+ + A + A+FS+A
Sbjct: 233 SGADLRGAYLNEAKMFEASLDNVNLSQANLHRTRMIRASFKHANFSNA 280
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A DL ++ N A A++ + +K NG L +A AN +D
Sbjct: 28 SGADLIGIDLSRSNLEGSNLAFAFLNEANLNRCNLVRAKLNGINLSQASLRFANLHDSDF 87
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
++ R L + NLT A L+ + L ++L GA + GAD S
Sbjct: 88 HGAILHRANLVDTNLTLASLLDSNLMEANLAGADLSGADLS 128
>gi|304393841|ref|ZP_07375766.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
gi|303294040|gb|EFL88415.1| pentapeptide repeat-containing protein [Ahrensia sp. R2A130]
Length = 247
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 59/115 (51%), Gaps = 5/115 (4%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
G+G + GS + V +F +FT A+M SDFSGS + K+ +ANFTG
Sbjct: 109 GVGLSKVEGS----RTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTG 164
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
ADLS ++ ++ ANL +A L T + S + A + G D S A L Q+Q
Sbjct: 165 ADLSGAMITFANISRANLADAKLDGTDFSSSWMYLAKVAGVDMS-ATKGLTQEQV 218
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 51/118 (43%), Gaps = 20/118 (16%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GADLSDTLM 165
R + NF AN D+ SD KF+GA + K++ +AN + G LS
Sbjct: 58 RNVILSGYNFSLANLNQTDLFGSDLRDVKFDGADMTKSILTRANLSNSSLKGVGLSKVEG 117
Query: 166 DRMVLNEANLTNAVLVRTVLTRSDLGGAIIE---------------GADFSDAVIDLA 208
R VL ++ T+ + + RSD G+I++ GAD S A+I A
Sbjct: 118 SRTVLQNSDFTDTDFTKAEMFRSDFSGSILKNVNMNKSEFSRANFTGADLSGAMITFA 175
>gi|427414830|ref|ZP_18905017.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425755483|gb|EKU96348.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 1182
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L KA + N + N + A++ ++DFSG+ +GA L KA+ + +L
Sbjct: 642 SEANLSEANLSKANLRETNLHKTNLSKANLSKTDFSGANLSGANLSGTNLRKADLSKLNL 701
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + LN ANL+ A L RT L++++LG A + A+ A
Sbjct: 702 KEINLTGANLNGANLSEADLSRTNLSKANLGKANLGAANLEGA 744
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F A+L A N R+A+ + +++E + +G+ NGA L +A + N + A+L
Sbjct: 672 SKTDFSGANLSGANLSGTNLRKADLSKLNLKEINLTGANLNGANLSEADLSRTNLSKANL 731
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ L ANLT + L +T L +++L G + GA+ ++A +D
Sbjct: 732 GKANLGAANLEGANLTGSNLNKTDLHQANLNGTDLTGANLNEANLD 777
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 48/103 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S F +LR A K N AN SA++ +++ S + A L KA N GADL
Sbjct: 532 SKMDFTGVNLRGANLRKTNLCEANLNSAELNQANLSEANLRKANLSKAKLLGTNLQGADL 591
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L+E NL A++ L + +L + G + SDA
Sbjct: 592 RGVTLTEINLSEVNLHGAIISEAALNKINLAKTNLCGINLSDA 634
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 49/90 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL + K N +AN +A++ ++ +GS N L +A + TGA+L
Sbjct: 712 NGANLSEADLSRTNLSKANLGKANLGAANLEGANLTGSNLNKTDLHQANLNGTDLTGANL 771
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
++ +D + L++A LT A L++ L ++ L
Sbjct: 772 NEANLDEVNLHQAKLTKAKLIKVDLRKTKL 801
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 119 NFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
N RAN + ADM +D G+ A L + +AN +GA+LSD + L+ A
Sbjct: 409 NLSRANLSGADMHLANLNRTDLRGAVLCEAKLTRVTLEEANLSGANLSDAAVFEANLSRA 468
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADF 200
NL+ A L +T L S+L GA + D
Sbjct: 469 NLSGAKLYKTYLVESNLIGANLSETDL 495
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 10/106 (9%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G ADLR A + A+ + A++ E++ +K A LE + +AN +GAD
Sbjct: 365 GICPDLSGADLRSA-----DLTEADLSRANLSEANLCRAKLCAANLEGSNLSRANLSGAD 419
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
M LN +L AVL LTR L A + GA+ SDA +
Sbjct: 420 -----MHLANLNRTDLRGAVLCEAKLTRVTLEEANLSGANLSDAAV 460
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 7/109 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S ++ DL K + N N + AD+ + DF+G GA L K +AN A+L
Sbjct: 502 SESKLTRDDLTKMNLRETNLHGINLSGADLSKMDFTGVNLRGANLRKTNLCEANLNSAEL 561
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
++ L+EANL A L + L ++L GA + G ++ I+L++
Sbjct: 562 -----NQANLSEANLRKANLSKAKLLGTNLQGADLRGVTLTE--INLSE 603
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 58/126 (46%), Gaps = 25/126 (19%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTS----------ADMR----------ESDFSGSKFNG 142
A SA+L +A + N R+AN + AD+R E + G+ +
Sbjct: 554 ANLNSAELNQANLSEANLRKANLSKAKLLGTNLQGADLRGVTLTEINLSEVNLHGAIISE 613
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEG 197
A L K K N G +LSD + +M L+EANL+ A L + T L +++L A +
Sbjct: 614 AALNKINLAKTNLCGINLSDADLSKMNLSEANLSEANLSKANLRETNLHKTNLSKANLSK 673
Query: 198 ADFSDA 203
DFS A
Sbjct: 674 TDFSGA 679
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 53/124 (42%), Gaps = 25/124 (20%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 152
A A+LRKA K N AD+R E + G+ + A L K K
Sbjct: 564 ANLSEANLRKANLSKAKLLGTNLQGADLRGVTLTEINLSEVNLHGAIISEAALNKINLAK 623
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL---------------TRSDLGGAIIEG 197
N G +LSD + +M L+EANL+ A L + L +++D GA + G
Sbjct: 624 TNLCGINLSDADLSKMNLSEANLSEANLSKANLRETNLHKTNLSKANLSKTDFSGANLSG 683
Query: 198 ADFS 201
A+ S
Sbjct: 684 ANLS 687
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 18/124 (14%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
LA+LN+ + RG A A L + + N AN + A + E++ S + +G
Sbjct: 422 LANLNR--TDLRG-------AVLCEAKLTRVTLEEANLSGANLSDAAVFEANLSRANLSG 472
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR-----SDLGGAIIEG 197
A L K ++N GA+LS+T + LN A+L+ + L R LT+ ++L G + G
Sbjct: 473 AKLYKTYLVESNLIGANLSETDL----LNGASLSESKLTRDDLTKMNLRETNLHGINLSG 528
Query: 198 ADFS 201
AD S
Sbjct: 529 ADLS 532
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A L KA +K + R+ D+ E D S + L K + G +LS D
Sbjct: 784 AKLTKAKLIKVDLRKTKLNKTDLCEIDLRESNLSKINLSKTNLSRTQLAGTNLS--FAD- 840
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L E+NL+ A L L+++ L GA ++GAD S+A ++
Sbjct: 841 --LRESNLSKADLYGADLSQAMLCGANLKGADLSEAKLN 877
>gi|319791261|ref|YP_004152901.1| hypothetical protein Varpa_0569 [Variovorax paradoxus EPS]
gi|315593724|gb|ADU34790.1| Protein of unknown function DUF2169 [Variovorax paradoxus EPS]
Length = 865
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 34/86 (39%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
++F + T AD D G F GA+LE A AN +GA+LS VL ANL
Sbjct: 544 KHFSGMDLTGADFSGLDLRGVNFTGAWLESANFENANLSGANLSHA-----VLAHANLRG 598
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDA 203
A+ V T L ++LGGA + A DA
Sbjct: 599 AIAVETSLVGANLGGARLASAVLEDA 624
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 35/115 (30%), Positives = 54/115 (46%), Gaps = 3/115 (2%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F DLR ANF +A++ ++ S + A L A+A + + GA+L
Sbjct: 552 TGADFSGLDLRGVNFTGAWLESANFENANLSGANLSHAVLAHANLRGAIAVETSLVGANL 611
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV---IDLAQKQA 212
+ VL +A+ NA T + L GA +EGA + D V +DL + QA
Sbjct: 612 GGARLASAVLEDADCRNARFDGCDWTGARLRGARLEGASWLDVVWGGVDLQRAQA 666
Score = 40.0 bits (92), Expect = 1.0, Method: Composition-based stats.
Identities = 40/145 (27%), Positives = 53/145 (36%), Gaps = 41/145 (28%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDF----------------------------- 135
F DLR V + ANF D+R D
Sbjct: 671 FYKQDLRGTVFTEAVLDDANFIECDLRGCDLRAAHMARATFVQCRLDGVHASGVQAEGVV 730
Query: 136 --SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT---------- 183
G GA L A ANF G DLS + +L+ ANL L R+
Sbjct: 731 FVEGCSLVGADLGHAAMGSANFGGMDLSQVSLVGSMLDGANLIGTRLARSDWRLASAKGV 790
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLA 208
+L ++DL A + GA+FS+AV+ A
Sbjct: 791 LLCKADLAHARMAGANFSNAVLQHA 815
>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 184
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+F N + + G++ NGA L A+ G DL+ +++ LN+ANL
Sbjct: 14 HRDFSHVNLVQVCLTNAKLVGARLNGAEL-----VGADLQGVDLTAAHLNQARLNQANLA 68
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDA 203
A +++ LTR+DL GA + GAD +DA
Sbjct: 69 GAEMIQACLTRADLSGAYLAGADLTDA 95
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ T AD+ +D SG+ GA L KA KA+ +GADL + L E +L++A L
Sbjct: 90 ADLTDADLSGADLSGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDG 149
Query: 183 TVLTRSDLGGAIIEGADFS 201
L +DL GA +E F+
Sbjct: 150 AYLGHADLTGADVERTRFN 168
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 43/90 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A + AN AD+R++D S + +GA L A AN DL
Sbjct: 83 SGAYLAGADLTDADLSGADLSGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDL 142
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
SD +D L A+LT A + RT +S L
Sbjct: 143 SDADLDGAYLGHADLTGADVERTRFNQSQL 172
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 55/108 (50%), Gaps = 15/108 (13%)
Query: 101 SAAQFGSADLR----KAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
+ A+ ADL+ A H+ + +AN A+M ++ + + +GAYL A A+
Sbjct: 38 NGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQACLTRADLSGAYLAGADLTDADL 97
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+GADLS ANL A L + L+++DL GA + GAD S A
Sbjct: 98 SGADLS----------GANLGGADLRKADLSKADLSGADLRGADLSGA 135
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A A++ +D G A+L +A +AN GA++ + R L+ A L A L
Sbjct: 35 ARLNGAELVGADLQGVDLTAAHLNQARLNQANLAGAEMIQACLTRADLSGAYLAGADLTD 94
Query: 183 TVLTRSDLGGAIIEGADFSDA 203
L+ +DL GA + GAD A
Sbjct: 95 ADLSGADLSGANLGGADLRKA 115
>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 293
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 63/125 (50%), Gaps = 23/125 (18%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTG 157
ADL +A + N +AN + A++ + S NGA L +A+ +AN
Sbjct: 84 ADLVEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLSEAILSEAIMREANLNQ 143
Query: 158 ADLSDTLMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA---V 204
A L D + R L+ +ANLTNA+L+ T L +++L A++ GA+F+ A
Sbjct: 144 AKLIDASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTE 203
Query: 205 IDLAQ 209
+DL+Q
Sbjct: 204 VDLSQ 208
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 20/124 (16%)
Query: 102 AAQFGSADLRKAVHVKENFRRAN----------FTSADMRESDFSGSKFNGAYLEKAVAY 151
+A A+L A+ ++ N ++AN FT AD+ E D S ++ NG L +A+
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222
Query: 152 KA----------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A GA+LS + R L +NLT A+L+ TVL +++ + GA +
Sbjct: 223 GAKLRGVSICWTTLRGANLSKANLYRAKLCWSNLTEAILLETVLLDANMDQVNLRGATLT 282
Query: 202 DAVI 205
A++
Sbjct: 283 GAIL 286
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +L A + N +AN T+A + E++ + N KA+ + ANFT ADL++
Sbjct: 149 ASLSRTNLSYATLISANLEKANLTNAILLETNLKQANLN-----KALLHGANFTQADLTE 203
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + LN NLT A+LV L + + GA+ S A
Sbjct: 204 VDLSQARLNGVNLTRAILVGAKLRGVSICWTTLRGANLSKA 244
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 65/137 (47%), Gaps = 21/137 (15%)
Query: 99 IGSAAQFGSAD--LRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLE 146
+G+ AD LR+ + +FRR N T A++R SD S S GA L+
Sbjct: 8 LGNQNTIMDADELLRRYAVGERDFRRVNLRNASLIGADLTHANLRGSDLSQSNLTGASLK 67
Query: 147 KAVAYKANFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+AN T GADL + + L +ANL+ A L+ L S L GA + A+ S
Sbjct: 68 LVNFREANLTQITLRGADLVEANLISSNLTQANLSEANLINASLRASTLNGANLSRANLS 127
Query: 202 DAVIDLAQKQALCKYAN 218
+A++ +A+ + AN
Sbjct: 128 EAIL----SEAIMREAN 140
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 75/167 (44%), Gaps = 9/167 (5%)
Query: 46 SDGQFPGPYAKLKNWR---VFVSTALAAAVVAS--CSSNISA--LADLNKYEAETRGEFG 98
S G KL N+R + T A +V + SSN++ L++ N A R
Sbjct: 57 SQSNLTGASLKLVNFREANLTQITLRGADLVEANLISSNLTQANLSEANLINASLRASTL 116
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
G A A+L +A+ + R AN A + ++ S + + A L A KAN T A
Sbjct: 117 NG--ANLSRANLSEAILSEAIMREANLNQAKLIDASLSRTNLSYATLISANLEKANLTNA 174
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L +T + + LN+A L A + LT DL A + G + + A++
Sbjct: 175 ILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAIL 221
>gi|119487879|ref|ZP_01621376.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
gi|119455455|gb|EAW36593.1| hypothetical protein L8106_28486 [Lyngbya sp. PCC 8106]
Length = 514
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 70/141 (49%), Gaps = 5/141 (3%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQ---FGSADLRKAVHVKE--NFRR 122
L A+++ + + S LAD N +A+ G G+ + A L + H++E N R
Sbjct: 265 LKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLRE 324
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN A++ ++ GA L++A +A GA+L D + R L EA L +A L R
Sbjct: 325 ANLKGANLTRANLREVNLQGANLQQANLQQAILQGANLKDANLIRANLREAKLQDAKLQR 384
Query: 183 TVLTRSDLGGAIIEGADFSDA 203
L R++L A + A+ S+A
Sbjct: 385 VNLERANLQAANLTDANLSNA 405
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L++A+ N + AN A++RE+ +K LE+A AN T A+LS+
Sbjct: 345 ANLQQANLQQAILQGANLKDANLIRANLREAKLQDAKLQRVNLERANLQAANLTDANLSN 404
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
ANLT+A L T L ++ A++ DF
Sbjct: 405 ----------ANLTDASLCDTCLNQTQFYQAVLIRVDF 432
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 21/126 (16%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYK--- 152
S A ADL+ A + NF+ AN A++ + +DF G+ ++L++A + +
Sbjct: 185 SGANLQGADLQGANLHETNFQGANLAGANLGGANLKCTDFQGTNLQESHLKQAYSVRKAK 244
Query: 153 -------------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD 199
N GA+L ++ + L+E+NL +A L + L ++L GA ++G +
Sbjct: 245 FAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTN 304
Query: 200 FSDAVI 205
S A +
Sbjct: 305 LSQAYL 310
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 15/113 (13%)
Query: 105 FGSADLRKAVHVKENF--RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
F +L+++ H+K+ + R+A F A++ DF G GA L++A+ + N + ++L+D
Sbjct: 224 FQGTNLQES-HLKQAYSVRKAKFAQANLSGVDFQGVNLRGANLKQAILSEVNLSESNLAD 282
Query: 163 TLMDR----------MVLNEANLTNAVLVRTVLTRS--DLGGAIIEGADFSDA 203
+++ L NL+ A LVRT R +L A ++GA+ + A
Sbjct: 283 ANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNLREANLKGANLTRA 335
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 69/162 (42%), Gaps = 20/162 (12%)
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN 119
W+V STA + V + + + AL DLN G I S A +L A V+ N
Sbjct: 113 WQVVDSTATSG--VFASRARLKALQDLNNEGVSLDG-LDI-SQAYLKEINLSGANLVEAN 168
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD----------TLMDRMV 169
AN A + ++ SG+ GA L+ A ++ NF GA+L+ T
Sbjct: 169 LEGANLQGASLSHANLSGANLQGADLQGANLHETNFQGANLAGANLGGANLKCTDFQGTN 228
Query: 170 LNEANLTNAVLVRTV------LTRSDLGGAIIEGADFSDAVI 205
L E++L A VR L+ D G + GA+ A++
Sbjct: 229 LQESHLKQAYSVRKAKFAQANLSGVDFQGVNLRGANLKQAIL 270
Score = 37.0 bits (84), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 55/113 (48%), Gaps = 27/113 (23%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK---AVAY------------- 151
A+L++A+ + N +N A++ ++D G++ GA L+ + AY
Sbjct: 263 ANLKQAILSEVNLSESNLADANLEQADLMGAELRGATLKGTNLSQAYLVRTNHLREVKNL 322
Query: 152 -KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+AN GA+L+ R L E NL A L +++L AI++GA+ DA
Sbjct: 323 REANLKGANLT-----RANLREVNLQGA-----NLQQANLQQAILQGANLKDA 365
>gi|114569789|ref|YP_756469.1| pentapeptide repeat-containing protein [Maricaulis maris MCS10]
gi|114340251|gb|ABI65531.1| pentapeptide repeat protein [Maricaulis maris MCS10]
Length = 493
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 77/166 (46%), Gaps = 30/166 (18%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F S +L A V+ N +RA A++ +D SG GA L A AN GADL
Sbjct: 339 TGANFTSVELSNARIVESNMQRAILAGANLSYADLSGIDLAGADLTGADLSGANLIGADL 398
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS---------------DLGGAIIEGADFSDAVI 205
+ + R ANLT A+L T LTR+ L GA ++ AD +DA +
Sbjct: 399 TGANLTR-----ANLTGAILFGTDLTRAILANARLNSAQLVGAQLSGARLDSADLTDANL 453
Query: 206 DLAQKQALCKYANGTNPITGVST--RKSLGCGNSRRNAYGSP-SSP 248
AQ A + P++G T R + G+ R ++ G+ SSP
Sbjct: 454 FGAQNAA-------SIPVSGTMTFCRTRMADGSDRSSSCGAAVSSP 492
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 5/94 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESD-----FSGSKFNGAYLEKAVAYKANF 155
S A A+L ++ N RA+ ++ D+ E+D F G+ F GA L AN
Sbjct: 65 SGANMSGANLSRSRFPDANLDRADLSNTDLTEADLSTGRFVGANFRGALLRNTSLTGANL 124
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
TGADL+ +N+A L N L TV+ D
Sbjct: 125 TGADLTGARELGYEINQARLCNTRLSATVVLNRD 158
Score = 37.0 bits (84), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 15/113 (13%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN------------- 154
A L +A+ +FR N T + +G+ F L A ++N
Sbjct: 311 ASLSQAIFPGNDFRTINLTGVQIYGMVLTGANFTSVELSNARIVESNMQRAILAGANLSY 370
Query: 155 --FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+G DL+ + L+ ANL A L LTR++L GAI+ G D + A++
Sbjct: 371 ADLSGIDLAGADLTGADLSGANLIGADLTGANLTRANLTGAILFGTDLTRAIL 423
>gi|443314210|ref|ZP_21043788.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786182|gb|ELR95944.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 516
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 1/124 (0%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
E + S A A+LR A + + AN A++R +D SG+ A L A A
Sbjct: 174 EDTVLSGAVLQRAELRHATLMGADLSGANLRGANLRWADLSGANLQEADLTDAKLSGATL 233
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 214
GADLS + +L +L+ L R SDL GA + GA + AV DL + C
Sbjct: 234 VGADLSGATLVNTILVHTDLSRTRLQRVYCVDSDLSGATLNGAFLAGAVCYDLVTAETTC 293
Query: 215 KYAN 218
+ +
Sbjct: 294 DWVD 297
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 5/114 (4%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
E R + S A +DLRK+ NF AN A + + + +GA L++A
Sbjct: 135 EARLRWARLSGANLSQSDLRKS-----NFLGANLEGAQLYAAQMEDTVLSGAVLQRAELR 189
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A GADLS + L A+L+ A L LT + L GA + GAD S A +
Sbjct: 190 HATLMGADLSGANLRGANLRWADLSGANLQEADLTDAKLSGATLVGADLSGATL 243
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L +A +AN + AD+RE+ ++ +GA L ++ K+NF GA+L
Sbjct: 104 SEASLIRAELLRADLSNATLNQANLSEADLREARLRWARLSGANLSQSDLRKSNFLGANL 163
Query: 161 ----------SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
DT++ VL A L +A L+ L+ ++L GA + AD S A
Sbjct: 164 EGAQLYAAQMEDTVLSGAVLQRAELRHATLMGADLSGANLRGANLRWADLSGA 216
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ LR+A N AN + +D++++ + S+F+GA L +A +A A+L
Sbjct: 36 AQIPHIVLRQANLNIVNLSTANLSFSDLQQASLNVSRFSGANLSQACLRQAQLNVANLI- 94
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
R VL A+L+ A L+R L R+DL A + A+ S+A
Sbjct: 95 ----RAVLVGADLSEASLIRAELLRADLSNATLNQANLSEA 131
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 49/105 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A LR+A N RA AD+ E+ ++ A L A +AN + ADL
Sbjct: 74 SGANLSQACLRQAQLNVANLIRAVLVGADLSEASLIRAELLRADLSNATLNQANLSEADL 133
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + L+ ANL+ + L ++ ++L GA + A D V+
Sbjct: 134 REARLRWARLSGANLSQSDLRKSNFLGANLEGAQLYAAQMEDTVL 178
>gi|425455658|ref|ZP_18835373.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
9807]
gi|389803408|emb|CCI17656.1| Genome sequencing data, contig C328 [Microcystis aeruginosa PCC
9807]
Length = 354
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 55/103 (53%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA+ L A + N R AN T AD+ E + S + F GA L A+ A+ + AD
Sbjct: 234 AAELSGISLGMANLYQANLRGANLTDADLSEINGSHASFKGADLSGALLANADLSYADFY 293
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ + L +NLT A LV +T+++L GA ++GA F+D V
Sbjct: 294 RSSLALANLIGSNLTGANLVEVNITQANLSGAKVQGAKFADNV 336
>gi|153871558|ref|ZP_02000700.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
gi|152071976|gb|EDN69300.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
Length = 179
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 64/123 (52%), Gaps = 14/123 (11%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL + + R A+ + AD+ E+D SG+ +G AN +GADL
Sbjct: 59 SGADLSGADLSNSDIRAGDLRVADLSEADLSEADLSGADLSG----------ANLSGADL 108
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ R +LN+ANL+ A L L+ +DL GA + GA+ S +DL++ + AN T
Sbjct: 109 RWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLS--RVDLSEAN--LEGANLT 164
Query: 221 NPI 223
+ I
Sbjct: 165 DAI 167
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL + + N AN SAD+ E+D SG+ +GA L + +AN GA+L
Sbjct: 104 SGADLRWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLSRVDLSEANLEGANL 163
Query: 161 SDTLMDRMVLNE 172
+D ++ + NE
Sbjct: 164 TDAILTGAIFNE 175
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 45/85 (52%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ AN AD++ ++ G+ N AYL +A+ + N +GADLS + + +L A
Sbjct: 22 DLSEANLNGADLKNANLRGADLNHAYLFRAILTQINLSGADLSGADLSNSDIRAGDLRVA 81
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L L+ +DL GA + GA+ S A
Sbjct: 82 DLSEADLSEADLSGADLSGANLSGA 106
>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris sp. CCMEE 5410]
Length = 514
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 20/116 (17%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+F + DLR A+ + NF RANFT A++R ++ L +A A+ ADL
Sbjct: 411 KFQNTDLRDAILINANFGRANFTGANLRNAN----------LMQAYMSHADLANADLRG- 459
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
ANL++A L L ++L GA + GA S++ + AQ L Y NG
Sbjct: 460 ---------ANLSDAYLSHANLRGANLCGADLSGAKLSESQLSFAQTNWLTVYPNG 506
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDT-LMDRMVLNE---------ANLTNAVLV 181
+ DFSG L K ANF +T L D +++N ANL NA L+
Sbjct: 384 QRDFSGQDLRNLNLRKFQLPSANFHEGKFQNTDLRDAILINANFGRANFTGANLRNANLM 443
Query: 182 RTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ ++ +DL A + GA+ SDA + A
Sbjct: 444 QAYMSHADLANADLRGANLSDAYLSHA 470
>gi|186684326|ref|YP_001867522.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466778|gb|ACC82579.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 413
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 155
S A A+L KA+ V N + NFT A++ E+D S GS F A L KA +AN
Sbjct: 216 SNADLTEANLSKAIFVGANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKATLPEANL 275
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
G L + + + +ANL A+L L + L A +EGA DA
Sbjct: 276 QGVILRKANLSKAIFYDANLEGAILCDANLVGAILCDANLEGAILCDA 323
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 61/123 (49%), Gaps = 11/123 (8%)
Query: 88 KYEAETRGEFGIGSAAQ-----FGSADLRK-AVHVKENFRRANFTSADMRESDFSGSKFN 141
KYE E + + + Q G D K V+ K + R + ++AD+ E++ S + F
Sbjct: 172 KYEDELQVSSKLPTDIQTAITVIGRRDSHKDPVNQKLDLRNTDLSNADLTEANLSKAIFV 231
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
GA L+ +AN + ADLS T + V EANL+ A L ++L G I+ A+ S
Sbjct: 232 GANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKA-----TLPEANLQGVILRKANLS 286
Query: 202 DAV 204
A+
Sbjct: 287 KAI 289
>gi|114799805|ref|YP_760951.1| pentapeptide repeat-containing protein [Hyphomonas neptunium ATCC
15444]
gi|114739979|gb|ABI78104.1| pentapeptide repeat domain protein [Hyphomonas neptunium ATCC
15444]
Length = 245
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
ADLR A F A F +A M++ +DFS ++ GA LEKA NF GA L
Sbjct: 88 ADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFEGASL-- 145
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L R L A+L+ A T+L R++L G I +GA+ S+A
Sbjct: 146 -LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183
>gi|359151325|ref|ZP_09184042.1| pentapeptide repeat-containing protein [Streptomyces sp. S4]
Length = 240
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 44/86 (51%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
N RAN SAD+ + +G+ GA L + AN TGADL + L NLT
Sbjct: 68 HNLSRANLISADLARVNLTGANLTGADLARVNLTGANLTGADLIYANLAGADLTRVNLTR 127
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDA 203
A + T LT +DL GA + G D ++A
Sbjct: 128 ARMKLTNLTGADLTGADLAGGDLTNA 153
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 53/107 (49%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A + N AN T AD+ ++ +G+ L +A N TGADL+ +
Sbjct: 88 ANLTGADLARVNLTGANLTGADLIYANLAGADLTRVNLTRARMKLTNLTGADLTGADLAG 147
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
L A+LTNA L LT DL GAI+ GA+ A + A++ L
Sbjct: 148 GDLTNADLTNADLTGAHLTNVDLTGAILTGANLGGANLAAARQLRLV 194
>gi|334117107|ref|ZP_08491199.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461927|gb|EGK90532.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 520
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 2/138 (1%)
Query: 66 TALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANF 125
T L+ A + N++ L+ N Y+A G I + A ADLR+A V+ R+
Sbjct: 50 TNLSNANMRKAKLNVARLSGANLYKANLSG--AILNVANLIRADLREAQLVEATMIRSEL 107
Query: 126 TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL 185
A++ ++ +G+ + A L +A +AN ADLS + L ANL A L R L
Sbjct: 108 IRANLSSANLTGANLSEADLREATLREANLEQADLSGAHLRGASLTAANLERANLHRADL 167
Query: 186 TRSDLGGAIIEGADFSDA 203
+R+DL G + A+ A
Sbjct: 168 SRADLRGVNLCNAELRQA 185
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 63/111 (56%), Gaps = 9/111 (8%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+ +A+ N N ++A+MR++ + ++ +GA L YKAN +GA L+ + R
Sbjct: 35 ANFSEAILSLTNMSGTNLSNANMRKAKLNVARLSGANL-----YKANLSGAILNVANLIR 89
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
L EA L A ++R+ L R++L A + GA+ S+A DL ++A + AN
Sbjct: 90 ADLREAQLVEATMIRSELIRANLSSANLTGANLSEA--DL--REATLREAN 136
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 51/92 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+A+LR+A + N A+ A++R +D SG+ GA L++A AN GA+LS+ +
Sbjct: 179 NAELRQANLSQANLSGADLRGANLRWADLSGANLTGADLDEARLSGANLYGANLSNVNLL 238
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
L A+LT A L+ +DL GA + GA
Sbjct: 239 NATLVHADLTQANLIHADWVGADLTGAALTGA 270
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADLR+A + N +A+ + A +R + + + A L +A +A+ G +L
Sbjct: 118 TGANLSEADLREATLREANLEQADLSGAHLRGASLTAANLERANLHRADLSRADLRGVNL 177
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + + L++ANL+ A L L +DL GA + GAD +A
Sbjct: 178 CNAELRQANLSQANLSGADLRGANLRWADLSGANLTGADLDEA 220
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 10/134 (7%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 155
+AA A+L +A + + R N +A++R+++ S G+ GA L A AN
Sbjct: 153 TAANLERANLHRADLSRADLRGVNLCNAELRQANLSQANLSGADLRGANLRWADLSGANL 212
Query: 156 TGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
TGADL + + L ANL+ NA LV LT+++L A GAD + A + A+
Sbjct: 213 TGADLDEARLSGANLYGANLSNVNLLNATLVHADLTQANLIHADWVGADLTGAALTGAKI 272
Query: 211 QALCKYANGTNPIT 224
A+ ++ + IT
Sbjct: 273 YAVSRFDVKADDIT 286
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 51/110 (46%), Gaps = 3/110 (2%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A LR A N RAN AD+ +D G A L +A +AN +GADL
Sbjct: 140 ADLSGAHLRGASLTAANLERANLHRADLSRADLRGVNLCNAELRQANLSQANLSGADLRG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 209
+ L+ ANLT A L L+ ++L GA + + +A + DL Q
Sbjct: 200 ANLRWADLSGANLTGADLDEARLSGANLYGANLSNVNLLNATLVHADLTQ 249
>gi|332708407|ref|ZP_08428384.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332352810|gb|EGJ32373.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 309
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAV-----HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S AQ ADLR+A + N + AN + + E++FSG+ + A LE A +
Sbjct: 115 SLAQLQKADLREATGKGITFINANLKMANLGAVNFPEANFSGASLDIASLEAANLMDTKW 174
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
GADL + R L A+LT+A L+ L +DL I+ GA ++ ++
Sbjct: 175 VGADLERANLSRASLVRADLTSANLIVANLRAADLTEVILRGAQLLESSLE 225
>gi|428320632|ref|YP_007118514.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244312|gb|AFZ10098.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 280
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 51/91 (56%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR+ + NFR N AD++ + S + F A + A AN TGA+L + + +
Sbjct: 7 LRQYAAGERNFREINLAGADLKGVNLSEANFTRANFQDANLKGANLTGANLREVKLAGVD 66
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L EANL+ A L+ T L+R++L GA + GA+
Sbjct: 67 LTEANLSEANLIGTDLSRANLSGANLMGANL 97
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 60/109 (55%), Gaps = 2/109 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR ++ + N +AN T A++ E++F+ + A L A + N A+L
Sbjct: 88 SGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEANLFAANLTDASMIRINLMKANL 147
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
S + + + L A L+ ++L R LT++ L GA++ GA+ + A DL Q
Sbjct: 148 SWSTLKAVNLTNAILSESLLERANLTQAILSGAMVSGANLTGA--DLRQ 194
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 14/131 (10%)
Query: 103 AQFGSADLRKA----VHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A A+LR+ V + E N AN D+ ++ SG+ GA L ++A + N T
Sbjct: 50 ANLTGANLREVKLAGVDLTEANLSEANLIGTDLSRANLSGANLMGANLRGSMAREVNMTK 109
Query: 158 ADLSDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
A+L++ + EANL T+A ++R L +++L + ++ + ++A++ ++
Sbjct: 110 ANLTEANLTEANFTEANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAIL----SES 165
Query: 213 LCKYANGTNPI 223
L + AN T I
Sbjct: 166 LLERANLTQAI 176
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA-----NF 155
S A ADLR+ V N AN ++A++R ++ S S A L A Y+A NF
Sbjct: 183 SGANLTGADLRQVTMVGANLTEANLSNANLRVANVSWSTLARANLSGANLYRAKLCWSNF 242
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVL 185
+GA L + ++ LN N +A L R ++
Sbjct: 243 SGAVLVEAVLIDANLNRTNFRDADLRRAIM 272
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 56/106 (52%), Gaps = 7/106 (6%)
Query: 104 QFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 157
ADL K V++ E NF RANF A+++ ++ +G+ K G L +A +AN G
Sbjct: 21 NLAGADL-KGVNLSEANFTRANFQDANLKGANLTGANLREVKLAGVDLTEANLSEANLIG 79
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
DLS + L ANL ++ +T+++L A + A+F++A
Sbjct: 80 TDLSRANLSGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEA 125
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 20/123 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYK 152
A +A+L A ++ N +AN T+A + ES + A L A+
Sbjct: 125 ANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAILSESLLERANLTQAILSGAMVSG 184
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVL-VRTV----LTRSDLGGAIIEGA-----DFSD 202
AN TGADL M L EANL+NA L V V L R++L GA + A +FS
Sbjct: 185 ANLTGADLRQVTMVGANLTEANLSNANLRVANVSWSTLARANLSGANLYRAKLCWSNFSG 244
Query: 203 AVI 205
AV+
Sbjct: 245 AVL 247
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 41/88 (46%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L+ A N R D+ E++ S + G L +A AN GA+L
Sbjct: 40 ANFQDANLKGANLTGANLREVKLAGVDLTEANLSEANLIGTDLSRANLSGANLMGANLRG 99
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
++ + + +ANLT A L T ++L
Sbjct: 100 SMAREVNMTKANLTEANLTEANFTEANL 127
>gi|158313419|ref|YP_001505927.1| pentapeptide repeat-containing protein [Frankia sp. EAN1pec]
gi|158108824|gb|ABW11021.1| pentapeptide repeat protein [Frankia sp. EAN1pec]
Length = 299
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 55/117 (47%), Gaps = 6/117 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR + R A AD+R++D S + GA L A+ A TGADL
Sbjct: 103 AYLSGADLRG-----TDLRDACLRGADLRDADLSQAALGGADLAGALLAGAFLTGADLHG 157
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYAN 218
T + L+ A+L A L R L +D G I+ GAD A D +QA + A+
Sbjct: 158 TDLHGAFLHNADLRKAFLARADLRGADADGIIMRGADLRAADATDAVLRQADLRAAD 214
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G ADL A+ A T AD+ +D G+ + A L KA +A+ GAD
Sbjct: 131 SQAALGGADLAGALLAG-----AFLTGADLHGTDLHGAFLHNADLRKAFLARADLRGADA 185
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDA 203
+M L A+ T+AVL + L +D L GAI+ G D A
Sbjct: 186 DGIIMRGADLRAADATDAVLRQADLRAADLRGIRLAGAILRGVDLRGA 233
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 37/71 (52%)
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ +D +G G L A + A +GADL T + L A+L +A L + L
Sbjct: 78 ADLTGADLAGVCLTGRILRGAQLHGAYLSGADLRGTDLRDACLRGADLRDADLSQAALGG 137
Query: 188 SDLGGAIIEGA 198
+DL GA++ GA
Sbjct: 138 ADLAGALLAGA 148
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 36/78 (46%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ T AD+ +G GA L A A+ G DL D + L +A+L+ A L
Sbjct: 78 ADLTGADLAGVCLTGRILRGAQLHGAYLSGADLRGTDLRDACLRGADLRDADLSQAALGG 137
Query: 183 TVLTRSDLGGAIIEGADF 200
L + L GA + GAD
Sbjct: 138 ADLAGALLAGAFLTGADL 155
>gi|218441428|ref|YP_002379757.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218174156|gb|ACK72889.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 362
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S Q G A+L H+ N R A T AD+ E+D + K +GA L A AN + +DL
Sbjct: 245 SGVQLGGANL---YHI--NLRGAVLTDADLGEADLNHGKLSGADLSGAYLGNANLSYSDL 299
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L A+L A L L++++L GAI+EG F+D
Sbjct: 300 HKASLALTNLIGADLRGANLTEVNLSQANLSGAIVEGTRFAD 341
>gi|428223745|ref|YP_007107842.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427983646|gb|AFY64790.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 183
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 10/109 (9%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYKAN 154
F DLR+A N + ++D+R +++ G+K GA + A Y+AN
Sbjct: 20 FDEIDLREANLFNANLEAVSLQNSDLRSTYLPYTNLNKANLQGAKLQGAEMSDAQLYQAN 79
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GADL + + R L A+L A L L +DL GA ++GA+ DA
Sbjct: 80 LAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYGANLQGANLQDA 128
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 46/107 (42%), Gaps = 13/107 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYK 152
A A L+ A +AN AD+R S+ S + GA L+ A Y
Sbjct: 58 ANLQGAKLQGAEMSDAQLYQANLAGADLRGSNLSRATLRYASLQQANLQGANLQGADLYG 117
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS---DLGGAIIE 196
AN GA+L D + R L++A L +L L R+ D GA ++
Sbjct: 118 ANLQGANLQDADLQRADLDQATLKATILANANLFRAQNIDWTGAAVD 164
>gi|428314300|ref|YP_007125277.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255912|gb|AFZ21871.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 355
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 56/112 (50%), Gaps = 7/112 (6%)
Query: 103 AQFGSADLR-----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A +ADLR KA ++ RA+ T A + E+D SG+ +GA L A A G
Sbjct: 61 ANLSNADLRVANFTKAQLIETTLSRADLTQAILSEADLSGAILSGALLSGADLKGATLIG 120
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L L+ L + NLT A L R +L ++DL AI+ A +A DL++
Sbjct: 121 VSLIGALIKGAKLTKVNLTGATLSRAILVQADLKKAILNRAILGEA--DLSE 170
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 10/114 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A N RAN T +++++ G++ + A L KA KAN +GA+L +
Sbjct: 226 ANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKANLHKANLSKANLSGANLLE 285
Query: 163 TLMDRMVLNEANL----------TNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ L++ANL TNA L T L ++L GA +EGA+ S+A ++
Sbjct: 286 ANLLDANLSQANLLRSGLLLTYLTNANLSSTNLNEANLIGANLEGANLSEASLE 339
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L +A + N R+AN AD+ E+D G+ +GA L AN +GADL
Sbjct: 169 SEANLSGASLVRAYLNRVNLRQANLEEADLSEADLKGANLSGANLS-----GANLSGADL 223
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L+ A+L A L R LT L A + GA+ S A
Sbjct: 224 REANLSHADLSGADLQGANLTRANLTGVLLKKANLRGAELSKA 266
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 41/82 (50%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NF+ A + D SGS N L A ANFT L + L AN T A L+ T
Sbjct: 22 NFSGAKLSGVDLSGSNLNRINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIET 81
Query: 184 VLTRSDLGGAIIEGADFSDAVI 205
L+R+DL AI+ AD S A++
Sbjct: 82 TLSRADLTQAILSEADLSGAIL 103
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM-----VLNEA 173
N R N +SA + ++F+ +K A L A ANFT A L +T + R +L+EA
Sbjct: 37 NLNRINLSSAHLNGANFTKTKLIRANLSNADLRVANFTKAQLIETTLSRADLTQAILSEA 96
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+L+ A+L +L+ +DL GA + G A+I
Sbjct: 97 DLSGAILSGALLSGADLKGATLIGVSLIGALI 128
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL+KA+ RA AD+ E++ SG+ AYL + +AN ADLS+ +
Sbjct: 151 ADLKKAI-----LNRAILGEADLSEANLSGASLVRAYLNRVNLRQANLEEADLSEADLKG 205
Query: 168 MVLNEANLTNAVL----VRTV-LTRSDLGGAIIEGADFSDA 203
L+ ANL+ A L +R L+ +DL GA ++GA+ + A
Sbjct: 206 ANLSGANLSGANLSGADLREANLSHADLSGADLQGANLTRA 246
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 48/94 (51%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
++ A K N A + A + ++D + N A L +A +AN +GA L ++R+
Sbjct: 128 IKGAKLTKVNLTGATLSRAILVQADLKKAILNRAILGEADLSEANLSGASLVRAYLNRVN 187
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L +ANL A L L ++L GA + GA+ S A
Sbjct: 188 LRQANLEEADLSEADLKGANLSGANLSGANLSGA 221
>gi|410472731|ref|YP_006896012.1| hypothetical protein BN117_2075 [Bordetella parapertussis Bpp5]
gi|408442841|emb|CCJ49408.1| Hypothetical protein BN117_2075 [Bordetella parapertussis Bpp5]
Length = 329
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 2/107 (1%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L +A + N RAN A++ ++ + + GA L +A +AN GA+L+D
Sbjct: 66 ADLAGANLARANLARANLARANLAGANLADAYLADADLAGANLARANLARANLAGANLAD 125
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ R L +A L +A L L R++L A + GAD + A DLA+
Sbjct: 126 AYLARAYLADAYLADAYLADADLARANLACANLAGADLAGA--DLAR 170
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 70/168 (41%), Gaps = 19/168 (11%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
E + S A ADL A N AN A + ++D +G+ A L +A +AN
Sbjct: 34 EQAVKSGANLARADLAGA-----NLAGANLADAYLADADLAGANLARANLARANLARANL 88
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI--------DL 207
GA+L+D + L ANL A L R L ++L A + A +DA + DL
Sbjct: 89 AGANLADAYLADADLAGANLARANLARANLAGANLADAYLARAYLADAYLADAYLADADL 148
Query: 208 AQKQALCKYANGTNPITGVSTRKSLGCGN------SRRNAYGSPSSPL 249
A+ C G + R +L N +R N G+ + P+
Sbjct: 149 ARANLACANLAGADLAGADLARANLAGANLAGAYLARANLAGARNLPV 196
>gi|428313439|ref|YP_007124416.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255051|gb|AFZ21010.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 167
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 111 RKAVHVKENFRRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADL 160
R+ + + NF RAN +D+R+ ++ S +GA L + Y+AN + ADL
Sbjct: 8 RRYLAGERNFHRANLNGSDLRKIPLMRADLLKANLHNSNLSGANLTRVNLYQANLSKADL 67
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
T+ + +L+ A LT A L R L ++DL A ++GA +
Sbjct: 68 RQTIFNEAILHGAELTGANLHRASLIKADLCEANLKGASLT 108
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYK----- 152
A +++L A + N +AN + AD+R++ F+ G++ GA L +A K
Sbjct: 40 ANLHNSNLSGANLTRVNLYQANLSKADLRQTIFNEAILHGAELTGANLHRASLIKADLCE 99
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN GA L+ T + L+ ANL NA L L ++DL A +EGAD S A
Sbjct: 100 ANLKGASLTHTNLGAAKLSGANLNNANLTWANLRKADLKNANLEGADLSGA 150
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 56/112 (50%), Gaps = 6/112 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL KA N AN T ++ +++ S + +A+ + A TGA+L + +
Sbjct: 35 ADLLKANLHNSNLSGANLTRVNLYQANLSKADLRQTIFNEAILHGAELTGANLHRASLIK 94
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
L EANL A LT ++LG A + GA+ ++A + A ++A K AN
Sbjct: 95 ADLCEANLKGA-----SLTHTNLGAAKLSGANLNNANLTWANLRKADLKNAN 141
>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
Length = 273
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 77/160 (48%), Gaps = 13/160 (8%)
Query: 71 AVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM 130
A++ SCS + A+ ++ T + S + SAD +A + N R+A+ A
Sbjct: 115 ALLDSCSW-VETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAV- 172
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
F+ +K + L +A + NF A+L+ +L R EAN T+A L+ +L +S L
Sbjct: 173 ----FALAKLENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQL 228
Query: 191 GGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 230
GGA GA+ A DL+Q + + T + G T++
Sbjct: 229 GGANFRGANLFRA--DLSQ-----AFTSNTTQLDGAWTKR 261
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 10/96 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F L++A+ F A FT RE+ F+ F+ A L + + + G D
Sbjct: 33 SRAHFKDTQLQEALFDHCTFAEATFTELLFRETWFTQCGFHRATLNACIFMELSLPGLDF 92
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
SD A LT +++ L R+ GA+++
Sbjct: 93 SD----------AKLTKTTFLKSTLERATFNGALLD 118
>gi|443324425|ref|ZP_21053179.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442795970|gb|ELS05303.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 305
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 68/134 (50%), Gaps = 5/134 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 157
A +A+L+ AV + AN ++AD+ ++ D S + GA L A ANF+
Sbjct: 71 ADLATANLQAAVLIGICLIEANLSNADLSDAYLMDGDLSNANLIGADLRDANCDHANFSN 130
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
A+L TLM ++ L ANLT A L RT L+ ++L A + AD S+A + A+ + Y
Sbjct: 131 ANLIGTLMRKVRLRHANLTGAKLQRTNLSEAELIEAHLSEADLSNANLYEAELLNIFGYK 190
Query: 218 NGTNPITGVSTRKS 231
+ ++T S
Sbjct: 191 TNFCRVQAIATHMS 204
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 18/121 (14%)
Query: 83 LADLNKYEAETRGEFGIGS------------------AAQFGSADLRKAVHVKENFRRAN 124
L++ N YEAE FG + A F A+L K N RAN
Sbjct: 173 LSNANLYEAELLNIFGYKTNFCRVQAIATHMSRAYLFQANFSEAELIKIDLRWANCDRAN 232
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
F +A+++++D G+ N A L++A +AN GA+L+ + L +AN+ +A+ +
Sbjct: 233 FRNANLQQADLRGTNLNQADLKQANLTRANLRGANLNHADLRGANLTDANIQDAIFKSAI 292
Query: 185 L 185
L
Sbjct: 293 L 293
>gi|428316245|ref|YP_007114127.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428239925|gb|AFZ05711.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 410
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA F SA+L + NFR A AD +++ +KF GA L A A+ +GADL
Sbjct: 292 SAVDFSSANLDRV-----NFRGATLNDADFSDANLQNAKFGGADLSGAFLGNADLSGADL 346
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L+ ANL+ A L+ LT ++ GA +E A F +
Sbjct: 347 HKASLALANLSGANLSGANLLEVNLTNTNFSGANVESARFGN 388
>gi|416402943|ref|ZP_11687479.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
gi|357261803|gb|EHJ11028.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
Length = 330
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 20/121 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DL +A + N + AN AD+R++D + + GA L A TGA+L++
Sbjct: 161 ANMKGVDLSRANLMGANLKEANLRDADLRKADLTNANLKGALLTDTNLTGAKLTGANLTN 220
Query: 163 TLMDR--------------------MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
T M R VLN ANL A L T + +DL A + GA+ +D
Sbjct: 221 TNMVRAQLSQAELSDIMAKGAILTHAVLNRANLNQADLTLTRMNHADLSRANLSGANLTD 280
Query: 203 A 203
A
Sbjct: 281 A 281
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ A+L + A A++ ++D + ++ N A L +A AN T ADL +
Sbjct: 226 AQLSQAELSDIMAKGAILTHAVLNRANLNQADLTLTRMNHADLSRANLSGANLTDADLVE 285
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
R L ANLTNA L R L ++L G I+ GA D +
Sbjct: 286 AFFARANLMGANLTNANLTRAELMSANLAGVILRGATMPDGKV 328
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL--- 175
N N A + +++ S GA L AV +AN ADL+ T M+ L+ ANL
Sbjct: 217 NLTNTNMVRAQLSQAELSDIMAKGAILTHAVLNRANLNQADLTLTRMNHADLSRANLSGA 276
Query: 176 --TNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T+A LV R++L GA + A+ + A
Sbjct: 277 NLTDADLVEAFFARANLMGANLTNANLTRA 306
>gi|291570913|dbj|BAI93185.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 484
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 61/112 (54%), Gaps = 14/112 (12%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 171
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 29 RVNLSQANFTEAVLSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 88
Query: 172 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ANL +A L+R L R++L A++ GA+ ++A DL ++A ++A+
Sbjct: 89 RADLSQANLVDASLIRAELMRAELSEAVVNGANLTEA--DL--REATLRHAD 136
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 47/93 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L +A + N R+N T AD+ +D G A L +A A+ GA+LS +
Sbjct: 145 ANLSEACLILSNLERSNLTRADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRW 204
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L+ ANL+ A L T L+ + L GA + GA
Sbjct: 205 ANLSGANLSGANLEATQLSGASLRGANLSGASL 237
Score = 43.9 bits (102), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 93 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +AV
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLVDASLIRAELMRAELSEAVV 117
Query: 151 YKANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSDLGGAIIEGADF 200
AN T ADL + + L + ANL+ A L+ R+ LTR+DL A + G +
Sbjct: 118 NGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTRADLTRADLRGVNL 177
Query: 201 SDAVIDLAQ 209
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ A+L +AV N A+ A +R +D + +GA L +A +N ++L+
Sbjct: 105 AELMRAELSEAVVNGANLTEADLREATLRHADLQQTNLSGANLSEACLILSNLERSNLTR 164
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ R L NL NA L + L +DL GA + GA+
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANL 202
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 ADLTRADLRGVNLRNAELRQAELNGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCSAIHADLTQANLIDCDWTDA 260
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAVLSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 194 IIEGADFSDAVIDLA 208
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
>gi|119488080|ref|ZP_01621524.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
gi|119455369|gb|EAW36508.1| hypothetical protein L8106_11802 [Lyngbya sp. PCC 8106]
Length = 351
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 48/85 (56%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N A +A++ + S + GA L K AN +GADLS+ + + +L EA L A
Sbjct: 27 NLMAAQLNAANLNRVNLSYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGA 86
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L +T+L +++L GA++ G+ S+A
Sbjct: 87 SLTQTLLVQANLSGALLSGSILSEA 111
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 36/139 (25%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 158
S A +A+L +A+ ++ A+ T + +++ SG+ +G+ L +A AN TGA
Sbjct: 64 SGADLSNANLSQAILIEATLNGASLTQTLLVQANLSGALLSGSILSEADLSGANLTGASL 123
Query: 159 -----------------------------DLSDTLMDRMVLNE-----ANLTNAVLVRTV 184
DLS + R +L+E ANL++A L+R
Sbjct: 124 IGTSLLNGSKLIEATLIGATLSRATLSAIDLSGVNLTRAILSESELGGANLSSACLIRAY 183
Query: 185 LTRSDLGGAIIEGADFSDA 203
L RS+L GA + GAD S+A
Sbjct: 184 LNRSNLSGANLMGADLSEA 202
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AAQ +A+L + N AN + + ++ SG+ + A L +A+ +A GA L+
Sbjct: 30 AAQLNAANLNRVNLSYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGASLT 89
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
TL+ +ANL+ A+L ++L+ +DL GA + GA
Sbjct: 90 QTLLV-----QANLSGALLSGSILSEADLSGANLTGASL 123
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A N AN T A+++ +D G+ NGA L A N A+L
Sbjct: 190 SGANLMGADLSEASLCNANLCVANLTRANLQGADLEGANLNGAQLSGANLKSTNLKNANL 249
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ ++ L A+L+ A L LT ++L GA + AD A
Sbjct: 250 NGLILHEADLRLADLSQANLRGANLTGANLAGASLLEADLRGA 292
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L K + N A+ ++A++ ++ + NGA L + + +AN +GA L
Sbjct: 44 SYANLTGANLSKTRLICANLSGADLSNANLSQAILIEATLNGASLTQTLLVQANLSGALL 103
Query: 161 SDTLMDRMVLNEANLTNAVLVRT-VLTRSDLGGAIIEGADFSDAVI 205
S +++ L+ ANLT A L+ T +L S L A + GA S A +
Sbjct: 104 SGSILSEADLSGANLTGASLIGTSLLNGSKLIEATLIGATLSRATL 149
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S ++ G A+L A ++ R+N + A++ +D S + A L A +AN GA
Sbjct: 163 ILSESELGGANLSSACLIRAYLNRSNLSGANLMGADLSEASLCNANLCVANLTRANLQGA 222
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
DL + LN A L+ A L T L ++L G I+ AD A DL+Q
Sbjct: 223 DL-----EGANLNGAQLSGANLKSTNLKNANLNGLILHEADLRLA--DLSQ 266
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 46/96 (47%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A L +A + N T A + ES+ G+ + A L +A ++N +GA+L +
Sbjct: 142 ATLSRATLSAIDLSGVNLTRAILSESELGGANLSSACLIRAYLNRSNLSGANLMGADLSE 201
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL A L R L +DL GA + GA S A
Sbjct: 202 ASLCNANLCVANLTRANLQGADLEGANLNGAQLSGA 237
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEA 173
N RAN AD+ ++ +G++ +GA L+ AN G ADL + + L A
Sbjct: 213 NLTRANLQGADLEGANLNGAQLSGANLKSTNLKNANLNGLILHEADLRLADLSQANLRGA 272
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NLT A L L +DL GA + A+ A
Sbjct: 273 NLTGANLAGASLLEADLRGANLSHANLKGA 302
>gi|428768931|ref|YP_007160721.1| pentapeptide repeat-containing protein [Cyanobacterium aponinum PCC
10605]
gi|428683210|gb|AFZ52677.1| pentapeptide repeat protein [Cyanobacterium aponinum PCC 10605]
Length = 320
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+ F A+L+ K NF ANFT A++ +D SG GA +A AN G DL +
Sbjct: 115 SDFSYANLQNCKLTKANFMGANFTRANLSGADLSGVNLTGADFTRADLSGANLQGCDLEE 174
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L+++ L NA L ++L +L A + GA+FS AV+
Sbjct: 175 ANLRCADLSKSILRNADLSESILQGVNLENANLRGANFSGAVL 217
>gi|428309179|ref|YP_007120156.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428250791|gb|AFZ16750.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 303
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 105 FGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
F + L +AV + +F A +F AD+RE+DF+ F+ A L +A AN A
Sbjct: 136 FWRSHLMRAVLRRVDFHEAILQETSFRQADLREADFTRVYFSEASLSEANLRGANLDQAL 195
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ T R L +A+L A L R V ++DL GA +GA AV
Sbjct: 196 VKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 117 KENFRRANFTSADMRESDFSGSKF----------NGAYLEKAVAYK-----ANFTGADLS 161
+ N RAN + A++ ++ SG++ N A LE A+ ++ AN GA L
Sbjct: 53 RTNLSRANLSRANLSHANLSGARLECVSLSRANLNQADLEGAILFQSNLSQANLIGASLP 112
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+T + L +ANLT A L T+ RS L A++ DF +A++
Sbjct: 113 ETDLQVATLFQANLTGACLRGTIFWRSHLMRAVLRRVDFHEAILQ 157
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 20/116 (17%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVA 150
S A A+L +A+ + +F R N A ++ ++D SG+ F GA L+ AV
Sbjct: 182 SEANLRGANLDQALVKRTSFWRTNLQQASLKGAYLKRIVFNQTDLSGASFQGAQLQGAVF 241
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
AN TGA+ ++R V ANLT ++L GA ++ A F + I+
Sbjct: 242 RGANLTGANFEGANLERAVFRGANLTG----------TNLKGASLQWAVFKEVNIE 287
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 20/103 (19%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A+ + N +AN A + E+D L+ A ++AN TGA L
Sbjct: 82 SRANLNQADLEGAILFQSNLSQANLIGASLPETD----------LQVATLFQANLTGACL 131
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T+ R + L+R VL R D AI++ F A
Sbjct: 132 RGTIFWR----------SHLMRAVLRRVDFHEAILQETSFRQA 164
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 51/121 (42%), Gaps = 20/121 (16%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMR---------------ESDFSGSKFNGAYLEKAV 149
F ADLR+A + F A+ + A++R ++ + GAYL++ V
Sbjct: 161 FRQADLREADFTRVYFSEASLSEANLRGANLDQALVKRTSFWRTNLQQASLKGAYLKRIV 220
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ + +GA + V ANLT A L R V ++L G ++GA AV
Sbjct: 221 FNQTDLSGASFQGAQLQGAVFRGANLTGANFEGANLERAVFRGANLTGTNLKGASLQWAV 280
Query: 205 I 205
Sbjct: 281 F 281
>gi|434388230|ref|YP_007098841.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428019220|gb|AFY95314.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 193
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 45/96 (46%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G A ADLR A N + N AD+R +D +G GA L +A AN T AD
Sbjct: 97 GDRASLHKADLRLASLQGANLSQVNLVGADLRYADLTGVNLTGANLSRANLTGANLTKAD 156
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
L + + +L NL A L+ L+ DL AI+
Sbjct: 157 LRGVTLAQAILENTNLCEASLIDVDLSCVDLRHAIL 192
>gi|75906828|ref|YP_321124.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75700553|gb|ABA20229.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 727
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 11/111 (9%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
G+ + A +G A L + + + AN T D + SD SG+ +AN
Sbjct: 574 GDAKLQEANLYG-ARLSRVIAIGAQLSFANLTKTDWQSSDLSGADLE----------RAN 622
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ ADLS T M +L A L NA L L+ DL GA + GADF D ++
Sbjct: 623 LSNADLSATRMTGAILRSAQLENANLRNADLSLVDLRGANVAGADFKDTIL 673
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)
Query: 125 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 165
F SA++ ++ F GS+F A L +A +ANFT A+LS LM
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRLDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528
Query: 166 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 203
R LN ANL+NA L+ L +DL G ++E GAD DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 24/137 (17%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFS 136
A+ADL++ + + A F A+L + + + + RAN ++A + ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 186
++ GA L V A+ TGADL D + R++ A L+ A L +T
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609
Query: 187 RSDLGGAIIEGADFSDA 203
SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626
>gi|425467207|ref|ZP_18846491.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9809]
gi|389830088|emb|CCI28159.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9809]
Length = 442
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANF 155
S A A L KA N RRAN + A++ + SG+ GA L K A+ + AN
Sbjct: 310 SKANLSWAKLSKAKLSGANLRRANLSKANLSWAFMSGANLIGAILSKANLRGAILWGANL 369
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV-IDLAQKQALC 214
+GA+LS L+ ANL+ A L L+++DL GA +E A F DA I QKQ L
Sbjct: 370 SGANLSGA-----NLSGANLSKADLSGANLSKADLSGAKVENAIFIDATGITPEQKQDLI 424
Query: 215 K 215
+
Sbjct: 425 R 425
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 65/145 (44%), Gaps = 37/145 (25%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT--------------------SA 128
Y+ + G I S A ADL A N RRAN + A
Sbjct: 235 YKVDLSG--AILSGAILSGADLSGA-----NLRRANLSWAFLSWADLIEADLSWAFLRRA 287
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN----------A 178
D+ ++D SG+K +GA L KA KAN + A LS + L ANL+ A
Sbjct: 288 DLIDADLSGAKLSGANLNKANLSKANLSWAKLSKAKLSGANLRRANLSKANLSWAFMSGA 347
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L+ +L++++L GAI+ GA+ S A
Sbjct: 348 NLIGAILSKANLRGAILWGANLSGA 372
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 54/101 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A N +AN + A++ + S +K +GA L +A KAN + A +S
Sbjct: 287 ADLIDADLSGAKLSGANLNKANLSKANLSWAKLSKAKLSGANLRRANLSKANLSWAFMSG 346
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ +L++ANL A+L L+ ++L GA + GA+ S A
Sbjct: 347 ANLIGAILSKANLRGAILWGANLSGANLSGANLSGANLSKA 387
>gi|427419722|ref|ZP_18909905.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425762435|gb|EKV03288.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 308
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANF 155
S A ADL A+ + F AN A + +SDFS + F GA L +A ANF
Sbjct: 115 SFATLTQADLSNAIGHRTRFSWANLVKAQLIDSDFSEAVFEGANLTRSNWHRATVRGANF 174
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
ADL + LN NLT A L+ +L ++ L GA++ A
Sbjct: 175 QQADLEAARLRAANLNGVNLTKANLLNAILEQTQLDGAVLMAA 217
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 5/116 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKA 153
IG +F A+L KA + +F A F A++ S++ G+ F A LE A A
Sbjct: 128 IGHRTRFSWANLVKAQLIDSDFSEAVFEGANLTRSNWHRATVRGANFQQADLEAARLRAA 187
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
N G +L+ + +L + L AVL+ + L GA + D++DA + AQ
Sbjct: 188 NLNGVNLTKANLLNAILEQTQLDGAVLMAAQADWATLNGASLIETDWTDASMMGAQ 243
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F A+L ++ + R ANF AD+ + + NG L KAN A L
Sbjct: 150 SEAVFEGANLTRSNWHRATVRGANFQQADLEAARLRAANLNGVNL-----TKANLLNAIL 204
Query: 161 SDTLMDRMVL-----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
T +D VL + A L A L+ T T + + GA +EGA+ + A
Sbjct: 205 EQTQLDGAVLMAAQADWATLNGASLIETDWTDASMMGAQLEGANLAGA 252
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 58/126 (46%), Gaps = 15/126 (11%)
Query: 97 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 156
+ S A+F A L A + R F RE +KF+GA+L +A A T
Sbjct: 66 LAVLSDARFDKARLDAAELTRARLERGIF-----RELQAPKAKFHGAHLTEADLSFATLT 120
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRT----------VLTRSDLGGAIIEGADFSDAVID 206
ADLS+ + R + ANL A L+ + LTRS+ A + GA+F A ++
Sbjct: 121 QADLSNAIGHRTRFSWANLVKAQLIDSDFSEAVFEGANLTRSNWHRATVRGANFQQADLE 180
Query: 207 LAQKQA 212
A+ +A
Sbjct: 181 AARLRA 186
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 52/124 (41%), Gaps = 12/124 (9%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N + A RG A F ADL A N N T A++ + ++ +GA L
Sbjct: 163 NWHRATVRG-------ANFQQADLEAARLRAANLNGVNLTKANLLNAILEQTQLDGAVLM 215
Query: 147 KAVAYKANFTGA-----DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A A A GA D +D M L ANL A L L +++L A + DF+
Sbjct: 216 AAQADWATLNGASLIETDWTDASMMGAQLEGANLAGANLAGVNLQQANLENANLTAVDFT 275
Query: 202 DAVI 205
DA +
Sbjct: 276 DAQV 279
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 49/98 (50%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
D KA V A F A + ++ + ++ + A KA F GA L++ +
Sbjct: 58 DFSKATLVLAVLSDARFDKARLDAAELTRARLERGIFRELQAPKAKFHGAHLTEADLSFA 117
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L +A+L+NA+ RT + ++L A + +DFS+AV +
Sbjct: 118 TLTQADLSNAIGHRTRFSWANLVKAQLIDSDFSEAVFE 155
>gi|376007502|ref|ZP_09784697.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
gi|375324138|emb|CCE20450.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
Length = 179
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 1/111 (0%)
Query: 94 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
RGE+ G AD+ R+AN T+A+M + DF+G+ F + L +
Sbjct: 40 RGEYSSCQGCNLGGADMSNQSRRNAQLRQANLTNANMSDGDFTGAFFTCSNLSNSNLSGG 99
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDA 203
NF A+ D + + L+ A+L+ A L ++ +R++L GA ++GA DA
Sbjct: 100 NFNFANFVDANLSGVDLSNADLSRADLSGAIIDSRTNLDGANLDGARLWDA 150
>gi|300863629|ref|ZP_07108569.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
gi|300338371|emb|CBN53713.1| Pentapeptide repeat protein [Oscillatoria sp. PCC 6506]
Length = 386
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A A L A N +AN DM ++FSG+ N A L KA K+N + A L
Sbjct: 105 TGANLTGAHLNWANLSTANLSKANLKGTDMSAANFSGAILNDANLGKAYLIKSNLSQAQL 164
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+D + + L +A+LT+A L L R++L GA + AD + A
Sbjct: 165 NDADLTQANLKDADLTDANLSGAELARANLAGANLTRADLTKA 207
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 51/90 (56%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AQ ADL +A + AN + A++ ++ +G+ A L KA KAN ADL
Sbjct: 160 SQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANLTRADLTKANLLKANLRRADL 219
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+++ ++ L EA+L+ A+L R L+++DL
Sbjct: 220 TESYLNWASLGEADLSEAILTRANLSKADL 249
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 73/147 (49%), Gaps = 11/147 (7%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L +VV S S + L N E + R + IG A ADL KA + RAN +
Sbjct: 25 LVLSVVDSHSGDTPTLVLANINEQQNR-PYLIG--ANLSEADLSKA-----HLSRANLSK 76
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ ++ G+ GA L A AN TGA+L+ ++ L+ ANL+ A L T ++
Sbjct: 77 ADLSGANLCGANLVGASLSGANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDMSA 136
Query: 188 SDLGGAIIEGADFSDAVI---DLAQKQ 211
++ GAI+ A+ A + +L+Q Q
Sbjct: 137 ANFSGAILNDANLGKAYLIKSNLSQAQ 163
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A V + AN T A++ ++ +G+ N A L A KAN G D+
Sbjct: 75 SKADLSGANLCGANLVGASLSGANLTGANLTGANLTGAHLNWANLSTANLSKANLKGTDM 134
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S +LN+ANL A L+++ L+++ L A + A+ DA
Sbjct: 135 SAANFSGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDA 177
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L KA +K N +A AD+ +++ + A L A +AN GA+L
Sbjct: 140 SGAILNDANLGKAYLIKSNLSQAQLNDADLTQANLKDADLTDANLSGAELARANLAGANL 199
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ R L +ANL A L R LT S L A + AD S+A++
Sbjct: 200 T-----RADLTKANLLKANLRRADLTESYLNWASLGEADLSEAIL 239
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L +A K N +AN AD+ ES + + A L +A+ +AN + ADLS
Sbjct: 192 ANLAGANLTRADLTKANLLKANLRRADLTESYLNWASLGEADLSEAILTRANLSKADLSK 251
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
T + ++VL+ +L+ L L DL ++ G + + A +
Sbjct: 252 TYLRKIVLHGCHLSGINLSGADLGGLDLSKKLLTGINLASAYL 294
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A G DL K + N AN + A + E++ S + GA L A KANF
Sbjct: 270 SGADLGGLDLSKKLLTGINLASAYLSEANLSGAYLIEANLSDANLCGADLSDACLMKANF 329
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GA M + L+ ANLT A L + L ++L GAI+ AD A
Sbjct: 330 IGAR-----MGNINLSNANLTGAKLCKADLMGANLRGAILTEADMRGA 372
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 2/108 (1%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A + N AN T AD+ +++ + A L ++ A+ ADLS+
Sbjct: 177 ADLTDANLSGAELARANLAGANLTRADLTKANLLKANLRRADLTESYLNWASLGEADLSE 236
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
++ R L++A+L+ L + VL L G + GAD +DL++K
Sbjct: 237 AILTRANLSKADLSKTYLRKIVLHGCHLSGINLSGADLGG--LDLSKK 282
>gi|452966664|gb|EME71673.1| putative low-complexity protein [Magnetospirillum sp. SO-1]
Length = 241
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 48/101 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L+ AV + A F ADM +D S + GA L A F GA L D
Sbjct: 70 ANLSGASLKGAVFAGADLFHAIFDEADMTGADLSDTYLFGANLIATRLVGAEFKGAFLKD 129
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LM+R L++A + ++R V + L GA + GAD + A
Sbjct: 130 VLMERADLSQAKMAGVYMLRGVFEEAKLAGADLSGADMTGA 170
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 43/104 (41%), Gaps = 10/104 (9%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
V F+ A M +D S +K G Y+ + V +A GADLS M A+
Sbjct: 118 VGAEFKGAFLKDVLMERADLSQAKMAGVYMLRGVFEEAKLAGADLSGADMTGAAAEGADF 177
Query: 176 TNAVLVRTVLT----------RSDLGGAIIEGADFSDAVIDLAQ 209
T A L T L+ R+DL GA AD V D A+
Sbjct: 178 TGANLKGTRLSGASMRFARFVRADLDGADFAKADLLHTVFDGAR 221
>gi|424513094|emb|CCO66678.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 140
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 66/141 (46%), Gaps = 31/141 (21%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+ DL + K + +RANF R ++ SG GA LE+A +FTGA+L +
Sbjct: 19 YHDQDLTQTYFTKGSLKRANF-----RGANLSGISLFGANLEEA-----DFTGANLEN-- 66
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----------DFSDAVIDLAQKQAL 213
ANL L++T T ++L AI+ GA D+S +I +
Sbjct: 67 --------ANLGQCNLLKTNFTGANLTNAIVSGASNLETVKANDSDWSQVIIRKDVLMGI 118
Query: 214 CKYANGTNPITGVSTRKSLGC 234
C A+G +P++G T+ +L C
Sbjct: 119 CANADGVSPVSGDPTKMTLEC 139
>gi|302522367|ref|ZP_07274709.1| OxyO [Streptomyces sp. SPB78]
gi|302431262|gb|EFL03078.1| OxyO [Streptomyces sp. SPB78]
Length = 233
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 64/139 (46%), Gaps = 16/139 (11%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE---------------KAVAYK 152
+DL A+ N AN T A+++ S S + N A+L KA ++
Sbjct: 78 SDLSHAMLYGANLAYANLTDANLKYSSLSSTHLNEAWLSHSVLSHASLSLADLSKANLHE 137
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
A+ T AD+S + L A +TNA RT L+ ++L GA + GAD S V +L QKQ
Sbjct: 138 ADLTKADVSGANLSEADLAGAKMTNANFFRTNLSGAELTGADLSGADLS-TVKNLTQKQV 196
Query: 213 LCKYANGTNPITGVSTRKS 231
N T + TR S
Sbjct: 197 SSARTNRTTRLPSGLTRAS 215
>gi|443327376|ref|ZP_21056002.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442792998|gb|ELS02459.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 187
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 55/102 (53%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L KAV + NF+ +D+ E+D + F+ + KA +K+ A+L+ +
Sbjct: 47 FTGANLGKAVFYRTVVELGNFSQSDLGEADLREANFSQSLFYKASLFKSQLQKANLNQVI 106
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
R +ANL +AVL L +++L A + GAD S+A ++
Sbjct: 107 AIRAFFRDANLNHAVLTSANLQQANLTNADLRGADLSNANLE 148
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 5/75 (6%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I A F A+L AV N ++AN T+AD+R +D S A LE A AN GA
Sbjct: 106 IAIRAFFRDANLNHAVLTSANLQQANLTNADLRGADLS-----NANLESAFLVGANLLGA 160
Query: 159 DLSDTLMDRMVLNEA 173
L D ++R +L +A
Sbjct: 161 SLVDANLERAILTDA 175
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 96 EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
E G S + G ADLR+ ANF+ + ++ S+ A L + +A +A F
Sbjct: 63 ELGNFSQSDLGEADLRE----------ANFSQSLFYKASLFKSQLQKANLNQVIAIRAFF 112
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+L+ ++ L +ANLTNA L L+ ++L A + GA+
Sbjct: 113 RDANLNHAVLTSANLQQANLTNADLRGADLSNANLESAFLVGANL 157
>gi|15892731|ref|NP_360445.1| hypothetical protein RC0808 [Rickettsia conorii str. Malish 7]
gi|15619907|gb|AAL03346.1| unknown [Rickettsia conorii str. Malish 7]
Length = 957
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 607
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A+ KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEAKLKQANLKAA 667
Query: 218 N 218
N
Sbjct: 668 N 668
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIFKEAKLKQANLKAANLAGINKEGADFDKAKINDATK 689
Score = 40.4 bits (93), Expect = 0.72, Method: Composition-based stats.
Identities = 26/109 (23%), Positives = 53/109 (48%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 382 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 441
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M +++++ ++ TN+ L L +D+ ++G ++A++D A
Sbjct: 442 MVNADAEKLIIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 490
Score = 37.0 bits (84), Expect = 8.6, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 60/144 (41%), Gaps = 18/144 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L +++C+ + + N A + A F ADL+K+
Sbjct: 357 LKN-TLFASANLENVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 413
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 161
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 414 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWTNSNLTGISLAYADMQ 473
Query: 162 DTLMDRMVLNEANLTNAVLVRTVL 185
M +VLN A L A +V T L
Sbjct: 474 RVQMQGVVLNNALLDQANIVSTNL 497
>gi|359459044|ref|ZP_09247607.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 256
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 53/98 (54%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR A + + AD+R ++ +G+ A L K +AN +GA LS +
Sbjct: 43 ADLRGADLEGIDLNHIDLCWADLRGTNLAGANLQAANLMKTDFCQANLSGAILSGASLQD 102
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
V+ +A+L A+L++T + ++ L GAI+ GA+ A I
Sbjct: 103 AVMTQADLNGAILIKTSMIQTRLRGAILRGANLKQARI 140
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 55/136 (40%), Gaps = 13/136 (9%)
Query: 63 FVSTALAAAVV--ASCSSNISALADLN------KYEAETRGEFGIGSAAQFGSADLRKAV 114
F L+ A++ AS + ADLN +TR I A A + +
Sbjct: 85 FCQANLSGAILSGASLQDAVMTQADLNGAILIKTSMIQTRLRGAILRGANLKQARILGSF 144
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
N ++ AD++ S F + NGA LE A +NF GADL + L AN
Sbjct: 145 LEDVNLKKGVLEKADLKGSKFKNANLNGATLEGADLRASNFCGADLEEA-----SLRGAN 199
Query: 175 LTNAVLVRTVLTRSDL 190
+ + L L R+D
Sbjct: 200 VRSVKLCDANLNRTDF 215
>gi|220906448|ref|YP_002481759.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219863059|gb|ACL43398.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 309
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 55/115 (47%), Gaps = 9/115 (7%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
L +YEA R GI +LR A + N R N A + +++F G+ GA L
Sbjct: 134 LQRYEAGERNFQGI---------NLRGAQLNQLNLRAINLEQAQLEDANFQGTVLEGANL 184
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+A +AN GA L + +D L A+L A L T L R++L A + G +F
Sbjct: 185 RQANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNF 239
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 5/121 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L+ A + AN TSAD+ + + + A L A NF ADL
Sbjct: 187 ANLSRANLKGARLDGSSLDNANLTSADLEGASLQSTSLDRANLTAANLMGVNFWLADLQS 246
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+ L ANL + R ++L G + GAD DA+ D Q C + G NP
Sbjct: 247 VNFTQANLTGANLGGTDVSRANFKAANLTGVNLSGADRRDAIYD----QFTC-FPEGFNP 301
Query: 223 I 223
+
Sbjct: 302 L 302
>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 319
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 61/133 (45%), Gaps = 10/133 (7%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
Q ADLR +FR + + A++RE DF+G+ AYL +A NFT A+L
Sbjct: 25 QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFTDANLEGA 84
Query: 164 LMDRMVLNEAN----------LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
+ ++ L +AN LT A L +T + GA + GA S A ++ A
Sbjct: 85 SLIKIYLIKANCYQTNFSGAYLTGAYLTKTNFKEAKFHGAYLNGAKLSGAKLEDAYYDHQ 144
Query: 214 CKYANGTNPITGV 226
++ +P T +
Sbjct: 145 TRFDTSFDPKTAL 157
Score = 40.0 bits (92), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 17/105 (16%)
Query: 111 RKAVHVKE-------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
++A+ +KE NF+ AD+R + S + F G L A + +FTGADL D
Sbjct: 5 QEAIDLKERYEKGQRNFQEFQLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDA 64
Query: 164 LMDRMVL-----NEANLTNAVLVRTVLTR-----SDLGGAIIEGA 198
++ L +ANL A L++ L + ++ GA + GA
Sbjct: 65 YLNEADLTAVNFTDANLEGASLIKIYLIKANCYQTNFSGAYLTGA 109
>gi|428211266|ref|YP_007084410.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999647|gb|AFY80490.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 279
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 71/142 (50%), Gaps = 15/142 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A A+L + N R AN +A++ +++ S + + A + +A+ +AN A L
Sbjct: 53 TGANLREANLMGVTLHQANLREANLINANLSKANLSEADLSLANISRAIVERANLERAKL 112
Query: 161 -----SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
S+T + L EA + A L R L+ +DL GA +EGA+ + A++ QA+ +
Sbjct: 113 VQALASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAIL----IQAIME 168
Query: 216 YANGTNP------ITGVSTRKS 231
N TN +TGV+ R S
Sbjct: 169 KVNLTNATLNGANLTGVNLRDS 190
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
+ S + G A+L++A + N RAN + AD+ ++ G+ A L +A+ K N T A
Sbjct: 116 LASETRLGWANLKEATMNQANLSRANLSEADLTGANLEGANLTIAILIQAIMEKVNLTNA 175
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LN ANLT L + L+R+++ G+ + GAD +
Sbjct: 176 ----------TLNGANLTGVNLRDSDLSRANMSGSNLAGADLT 208
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 15/98 (15%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
+DL +A N A+ T + +R ++ S + G LE A Y+AN ++LS
Sbjct: 190 SDLSRANMSGSNLAGADLTKSQLRGTNVSWTTMRGTNLEGASLYRANLGWSNLSG----- 244
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
ANLTNA+L+ T L R++L DF+ A++
Sbjct: 245 -----ANLTNAILMDTNLYRTNL-----RDVDFTGAIM 272
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 49/95 (51%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ +FR N A++ D S F A L +A TGA+L + + + L++ANL
Sbjct: 14 ERDFRNLNLIGANLAGLDLSEVTFRDADLRQANLTCTKLTGANLREANLMGVTLHQANLR 73
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
A L+ L++++L A + A+ S A+++ A +
Sbjct: 74 EANLINANLSKANLSEADLSLANISRAIVERANLE 108
>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 739
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 5/112 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S+AQ +AD R+A+ A+ T A++ E+ FS S +GA L K A +++F+ ADL
Sbjct: 561 SSAQLINADFRRAI-----LENASLTGANLGEAKFSLSSLHGARLGKVSAVRSDFSSADL 615
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
S + L+ ANL+NA L + L GA + A +A + A A
Sbjct: 616 SQSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKLRYANLSA 667
>gi|312194409|ref|YP_004014470.1| pentapeptide repeat-containing protein [Frankia sp. EuI1c]
gi|311225745|gb|ADP78600.1| pentapeptide repeat protein [Frankia sp. EuI1c]
Length = 2027
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/83 (36%), Positives = 45/83 (54%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ T D+ ++D +G+ A L+ A AN TGA L+ R+ L ANLT+A L R
Sbjct: 1243 ADLTGLDLSDADLAGANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLTDADLRR 1302
Query: 183 TVLTRSDLGGAIIEGADFSDAVI 205
LT DL G ++ G+ + A +
Sbjct: 1303 ARLTDPDLTGTVLTGSKWERAAL 1325
Score = 45.8 bits (107), Expect = 0.020, Method: Composition-based stats.
Identities = 29/92 (31%), Positives = 44/92 (47%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A ADL + AN T AD+ +++ +G+ GA L A + TGA+L+
Sbjct: 1237 GAHLEGADLTGLDLSDADLAGANLTDADLDDANLTGANLTGARLTGVRARRLRLTGANLT 1296
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
D + R L + +LT VL + R+ L GA
Sbjct: 1297 DADLRRARLTDPDLTGTVLTGSKWERAALLGA 1328
>gi|428226949|ref|YP_007111046.1| hypothetical protein GEI7407_3527 [Geitlerinema sp. PCC 7407]
gi|427986850|gb|AFY67994.1| Tetratricopeptide TPR_1 repeat-containing protein [Geitlerinema sp.
PCC 7407]
Length = 575
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 69/148 (46%), Gaps = 1/148 (0%)
Query: 60 WRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKE 118
WR + AL A + + ++ + K ETR S +G A+L
Sbjct: 15 WRSLAALALVVAPMVGTDAALAEKPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNS 74
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N A+F SAD++ +DFS + A LE+A +A+F A L+ + L+ ANL+N+
Sbjct: 75 NLENADFESADLQRTDFSSANLRRADLERADLERADFQSAILNGADLSNSDLSYANLSNS 134
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVID 206
L L+ SDL GA GA+ A D
Sbjct: 135 DLSYADLSGSDLDGANFWGANLFQANFD 162
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 20/118 (16%)
Query: 105 FGSADLRKAVHVKENFRRANFTSA--------------------DMRESDFSGSKFNGAY 144
F SA+LR+A + + RA+F SA D+ +D SGS +GA
Sbjct: 91 FSSANLRRADLERADLERADFQSAILNGADLSNSDLSYANLSNSDLSYADLSGSDLDGAN 150
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A ++ANF +LS+ ++ L ANL+NA L L + L G + GA+ S+
Sbjct: 151 FWGANLFQANFDRTNLSNISLNGARLEGANLSNAYLSEYTLRSARLSGVNLRGANLSN 208
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 43/97 (44%), Gaps = 5/97 (5%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
K H K+ S D+ D+ + +G L + A+F ADL R +
Sbjct: 38 KPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNSNLENADFESADLQ-----RTDFS 92
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
ANL A L R L R+D AI+ GAD S++ + A
Sbjct: 93 SANLRRADLERADLERADFQSAILNGADLSNSDLSYA 129
>gi|17228308|ref|NP_484856.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
7120]
gi|535436|gb|AAB59979.1| HglK [Nostoc sp. PCC 7120]
gi|17130158|dbj|BAB72770.1| heterocyst-specific glycolipids-directing protein [Nostoc sp. PCC
7120]
gi|1585247|prf||2124368C hglK gene
Length = 727
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 11/111 (9%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
G+ + A +G A L + + + AN T D + SD SG+ +AN
Sbjct: 574 GDAKLQEANLYG-ARLSRVIAIGAQLSFANLTKTDWQSSDLSGADLE----------RAN 622
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ ADLS T M +L A L NA L L+ DL GA + GADF D ++
Sbjct: 623 LSNADLSATRMTGAILRSAQLENANLRNADLSLVDLRGANVAGADFKDTIL 673
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 29/108 (26%)
Query: 125 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLM----- 165
F SA++ ++ F GS+F A L +A +ANFT A+LS LM
Sbjct: 469 FKSANLNQASFKGSRFRSVGDDGRWDTYDDAIADLSQAQMKQANFTDANLSRVLMTRSDL 528
Query: 166 DRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDA 203
R LN ANL+NA L+ L +DL G ++E GAD DA
Sbjct: 529 SRATLNRANLSNARLIGANLSSAQLVGADLRGTVLENASLTGADLGDA 576
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 24/137 (17%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFS 136
A+ADL++ + + A F A+L + + + + RAN ++A + ++ S
Sbjct: 499 AIADLSQAQMKQ---------ANFTDANLSRVLMTRSDLSRATLNRANLSNARLIGANLS 549
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMD----------RMVLNEANLTNAVLVRTVLT 186
++ GA L V A+ TGADL D + R++ A L+ A L +T
Sbjct: 550 SAQLVGADLRGTVLENASLTGADLGDAKLQEANLYGARLSRVIAIGAQLSFANLTKTDWQ 609
Query: 187 RSDLGGAIIEGADFSDA 203
SDL GA +E A+ S+A
Sbjct: 610 SSDLSGADLERANLSNA 626
>gi|332710578|ref|ZP_08430523.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332350633|gb|EGJ30228.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 185
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 51/106 (48%), Gaps = 15/106 (14%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 154
G+ A F +A+L A N RA+F+ A + +D SGS F GA L +KAN
Sbjct: 88 GTGATFRNANLDSAYATGANMSRADFSGASVVWANFISADLSGSSFRGADLSNTTFFKAN 147
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
GADLS ANLTNA + LT ++L A + GA
Sbjct: 148 LNGADLSG----------ANLTNANFINADLTNANLDNANLTGAQL 183
>gi|30696344|ref|NP_851183.1| thylakoid lumenal protein [Arabidopsis thaliana]
gi|38503418|sp|P81760.2|TL17_ARATH RecName: Full=Thylakoid lumenal 17.4 kDa protein, chloroplastic;
AltName: Full=P17.4; Flags: Precursor
gi|13899115|gb|AAK48979.1|AF370552_1 thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|9759188|dbj|BAB09725.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|28059599|gb|AAO30073.1| thylakoid lumenal 17.4 kD protein, chloroplast precursor (P17.4)
[Arabidopsis thaliana]
gi|332008985|gb|AED96368.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 236
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 123 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 182
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 183 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235
>gi|30696347|ref|NP_200161.2| thylakoid lumenal protein [Arabidopsis thaliana]
gi|332008984|gb|AED96367.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 235
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 122 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 181
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 182 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 234
>gi|381205231|ref|ZP_09912302.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 236
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ ADL A + + AN A+++ ++ +G+ A L A YKAN GADL
Sbjct: 99 AQLVGADLEGADLDRADLFEANLEIANLQWANLAGASLENANLGLANLYKANLQGADLRG 158
Query: 163 TLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ +L EANL+N A L+ L+R++L GA ++GA +A+
Sbjct: 159 ANLTGAMLGEANLSNANLEGARLMVVNLSRANLKGANLKGAKIHEAI 205
Score = 37.0 bits (84), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 41/86 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G A+L KA + R AN T A + E++ S + GA L +AN GA+L
Sbjct: 139 ANLGLANLYKANLQGADLRGANLTGAMLGEANLSNANLEGARLMVVNLSRANLKGANLKG 198
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRS 188
+ + + A+LT+ + + R+
Sbjct: 199 AKIHEAIFSGADLTDVEMTDAQICRT 224
>gi|76819210|ref|YP_336861.1| pentapeptide repeat-containing protein [Burkholderia pseudomallei
1710b]
gi|76583683|gb|ABA53157.1| pentapeptide repeat family protein [Burkholderia pseudomallei
1710b]
Length = 862
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T D+ D G++ GA LE A A+ TGAD
Sbjct: 526 GAAARARRECVASAAAAGQSLQGADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGAD 585
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 586 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 622
Score = 41.2 bits (95), Expect = 0.48, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 780 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 839
Score = 37.4 bits (85), Expect = 6.4, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 567 AGAMLENADLSDADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 626
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 627 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 686
>gi|414077930|ref|YP_006997248.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
gi|413971346|gb|AFW95435.1| pentapeptide repeat-containing protein [Anabaena sp. 90]
Length = 189
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + N D +D G+ A L A KAN GA L+ + ++L A+LT A
Sbjct: 26 NLQGVNLGGVDFGRADLRGANLTAASLSGANLSKANLQGAILARAHLSEVILCGADLTQA 85
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
L L SDL GA++ GA+ DA + +A A
Sbjct: 86 TLTTAHLNESDLSGALLSGANLCDANLHMASISA 119
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +DL A+ N AN A + ++ G+ +GA + +KA+ GADL
Sbjct: 88 TTAHLNESDLSGALLSGANLCDANLHMASISAANLQGANLSGAKMGGVRMWKADLQGADL 147
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S + L E NLT A L T ++ + L GAI+
Sbjct: 148 SGADLSEANLCEVNLTGANLDDTDMSETFLTGAIM 182
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 20/113 (17%)
Query: 101 SAAQFGSADLRKAV----HVKE------NFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
S A A+L+ A+ H+ E + +A T+A + ESD SG+ +GA L A
Sbjct: 53 SGANLSKANLQGAILARAHLSEVILCGADLTQATLTTAHLNESDLSGALLSGANLCDANL 112
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ A+ + A+L ANL+ A + + ++DL GA + GAD S+A
Sbjct: 113 HMASISAANLQG----------ANLSGAKMGGVRMWKADLQGADLSGADLSEA 155
>gi|229818699|ref|YP_002880225.1| pentapeptide repeat-containing protein [Beutenbergia cavernae DSM
12333]
gi|229564612|gb|ACQ78463.1| pentapeptide repeat-containing protein [Beutenbergia cavernae DSM
12333]
Length = 205
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 12/131 (9%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVH-----VKENFRRANFTSADMRESDFSG 137
D++ EA TRG + S F + + H V FRR NF A G
Sbjct: 37 FVDVDLTEASTRGT--VFSECVFSNVAFNVSHHASTAFVNCTFRRCNFFDATFTGCKLVG 94
Query: 138 SKFNGA-----YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
+ F+G +++ A F GADL + L E++LT+A R+V DL G
Sbjct: 95 AMFDGCSFGIMKVDRGDWSFAGFPGADLEGVEFTGVRLRESDLTHARCARSVFAGCDLSG 154
Query: 193 AIIEGADFSDA 203
+ + GADF+DA
Sbjct: 155 SWLHGADFTDA 165
>gi|254264016|ref|ZP_04954881.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
gi|254215018|gb|EET04403.1| pentapeptide repeat protein [Burkholderia pseudomallei 1710a]
Length = 825
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G+AA+ + A ++ + A+ T D+ D G++ GA LE A A+ TGAD
Sbjct: 489 GAAARARRECVASAAAAGQSLQGADLTGVDLSGMDLRGARLAGAMLENADLSDADLTGAD 548
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
LS R VL A+LT A LV LT ++L A E DFS
Sbjct: 549 LS-----RTVLVRADLTRAKLVDARLTAANLSLAHCERTDFS 585
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 33/60 (55%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+ADLR A F RA+ T AD+R++D + GA L+ A +AN A+LS L D
Sbjct: 743 AADLRGAKAEGSPFVRADLTRADLRDTDLIAAYLRGAKLDGADLRRANLFRANLSQILTD 802
Score = 37.4 bits (85), Expect = 6.4, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +ADL A + R AD+ + ++ A L A + +F+G+DL
Sbjct: 530 AGAMLENADLSDADLTGADLSRTVLVRADLTRAKLVDARLTAANLSLAHCERTDFSGSDL 589
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIE----------GADFSDAVI 205
SD + +++ L + +VL T D G A + G FSDA I
Sbjct: 590 SDGIFEQVHLRDCRFNGSVLASTRFDACRFDAVDFGRATLRELIFIEQSFSGVSFSDATI 649
>gi|332705303|ref|ZP_08425383.1| hypothetical protein LYNGBM3L_05720 [Moorea producens 3L]
gi|332355929|gb|EGJ35389.1| hypothetical protein LYNGBM3L_05720 [Moorea producens 3L]
Length = 240
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A F A+L + NF A T A++ E+ GS F+ A L A KAN GA+LS
Sbjct: 133 AVNFTKANLSRV-----NFTEAVMTGANLNEAQLIGSNFDKANLTGADLVKANLKGANLS 187
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ L EANL+ L + LT +DL A ++GA+ +A +D
Sbjct: 188 QANLSYTNLREANLSETNLRKANLTGADLTHANLQGANLIEAELD 232
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A A+L +A + NF +AN T AD+ +++ G+ + A L +Y N A+L
Sbjct: 147 TEAVMTGANLNEAQLIGSNFDKANLTGADLVKANLKGANLSQANL----SY-TNLREANL 201
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S+T + + L A+LT+A L L ++L G I+
Sbjct: 202 SETNLRKANLTGADLTHANLQGANLIEAELDGVIL 236
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 10/82 (12%)
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSD-----TLMDRMVLNEANLTNA 178
D+ E D SG NG L +A AN +G+ DL++ + D+ +L A L A
Sbjct: 30 DLMEVDLSGQNLNGFNLFQAELMGANLSGSLLIYTDLTEACVVGAIFDKAILRHAYLNRA 89
Query: 179 VLVRTVLTRSDLGGAIIEGADF 200
L RT R+DL +E A+
Sbjct: 90 KLTRTSFQRADLTMTSLEDANL 111
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 10/106 (9%)
Query: 108 ADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A L +A + +F+RA+ T A++ +FS + GA L + NFT A+LS
Sbjct: 84 AYLNRAKLTRTSFQRADLTMTSLEDANLIRVNFSLADLEGANLFRTNLIAVNFTKANLSR 143
Query: 163 TLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDA 203
V+ ANL A L+ + LT +DL A ++GA+ S A
Sbjct: 144 VNFTEAVMTGANLNEAQLIGSNFDKANLTGADLVKANLKGANLSQA 189
>gi|119485597|ref|ZP_01619872.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
gi|119456922|gb|EAW38049.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
Length = 253
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 61/121 (50%), Gaps = 17/121 (14%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
LAD N YEA R A ADLR+A + + RA+ AD+++++ F+
Sbjct: 93 LADANLYEANLR-------YANLQGADLRQADLSRASLTRADLRKADLQDANLFKVNFSE 145
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
AYL +A NF ADL + L +AN T+A L SDL A ++GADFS+
Sbjct: 146 AYLSEA-----NFENADLRQVTFFKANLADANFTDANLFG-----SDLRLANLKGADFSN 195
Query: 203 A 203
A
Sbjct: 196 A 196
>gi|186681457|ref|YP_001864653.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186463909|gb|ACC79710.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 539
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SADLR+A + N R AN + + +R + +G+ A L + + + +GA+L
Sbjct: 133 SEADLTSADLREATLRQANLRHANLSESVLRGASMTGANLEMANLNASDLSRCDLSGANL 192
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
DT + + L+ ANL+ A L L +DL GA + AD S A
Sbjct: 193 RDTELRQANLSHANLSGADLSGANLRWADLSGANLRWADLSGA 235
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 58/119 (48%), Gaps = 1/119 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +LR+A N A+ + A++R +D SG+ A L A A GADL
Sbjct: 188 SGANLRDTELRQANLSHANLSGADLSGANLRWADLSGANLRWADLSGAKLSGATLIGADL 247
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGAD-FSDAVIDLAQKQALCKYAN 218
++ + + A+LT A L+R +DL GA + GA ++ + L + +C++ +
Sbjct: 248 TNANLTNTIFIHADLTQAKLIRAEWIGADLTGATLTGAKLYATSRFGLKTEGMICEWVD 306
Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 15/92 (16%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTNA 178
NF+ D+ E++ SG K +G L A N +GA+LS+ + LN A NL+NA
Sbjct: 31 NFSGIDLAEANLSGVKLSGVNLSDANLSIVNLSGANLSEANLSNAKLNVARLSGVNLSNA 90
Query: 179 V----------LVRTVLTRSDLGGAIIEGADF 200
+ L+R L+R+ L GA++ A+
Sbjct: 91 ILNNASLNVANLIRADLSRAQLKGALLIRAEL 122
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N N + A++ E++ S +K N A L A A L+ + R L+ A L A
Sbjct: 56 NLSIVNLSGANLSEANLSNAKLNVARLSGVNLSNAILNNASLNVANLIRADLSRAQLKGA 115
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ---KQALCKYAN 218
+L+R L R+DL A + AD + A DL + +QA ++AN
Sbjct: 116 LLIRAELIRADLSRADLSEADLTSA--DLREATLRQANLRHAN 156
Score = 37.0 bits (84), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 49/101 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A+ +L A+ + AN AD+ + G+ A L +A +A+ + ADL
Sbjct: 78 NVARLSGVNLSNAILNNASLNVANLIRADLSRAQLKGALLIRAELIRADLSRADLSEADL 137
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ + L +ANL +A L +VL + + GA +E A+ +
Sbjct: 138 TSADLREATLRQANLRHANLSESVLRGASMTGANLEMANLN 178
>gi|434407898|ref|YP_007150783.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
gi|428262153|gb|AFZ28103.1| putative low-complexity protein [Cylindrospermum stagnale PCC 7417]
Length = 182
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+F A + +D SG+K GA E + +AN GADLS+ L + LT A LVR
Sbjct: 90 ADFRGAQLNHADLSGAKLCGANFEGCLMVRANLAGADLSNA-----SLAGSALTGANLVR 144
Query: 183 TVLTRSDLGGAIIEGADFSDAVID 206
+++DL A++ GA+ DAV D
Sbjct: 145 ANFSQADLTNAVLFGAETEDAVFD 168
>gi|334188366|ref|NP_001190531.1| thylakoid lumenal protein [Arabidopsis thaliana]
gi|332008986|gb|AED96369.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 250
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 137 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 196
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 197 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 249
>gi|300868761|ref|ZP_07113372.1| hypothetical protein OSCI_3800094 [Oscillatoria sp. PCC 6506]
gi|300333322|emb|CBN58564.1| hypothetical protein OSCI_3800094 [Oscillatoria sp. PCC 6506]
Length = 195
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 109 DLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
DLR A NF A+F AD+R ++ F GA L AN GAD
Sbjct: 61 DLRGAPLAGINFAGADFKEVRLYFADLRGANLELCDFRGADLSDTNLSDANLAGADFEGC 120
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
M + L +ANL+NA L+R VLT S+L A GAD + A++
Sbjct: 121 FMMSINLTKANLSNAQLMRVVLTGSNLVEANFSGADLTGALL 162
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 5/98 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR A N +F AD+ +++ S + GA E N T A+LS+ + R
Sbjct: 85 ADLRGA-----NLELCDFRGADLSDTNLSDANLAGADFEGCFMMSINLTKANLSNAQLMR 139
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+VL +NL A LT + L GA +EG F A++
Sbjct: 140 VVLTGSNLVEANFSGADLTGALLLGAKLEGKVFDGAIL 177
>gi|159045175|ref|YP_001533969.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
gi|157912935|gb|ABV94368.1| hypothetical protein Dshi_2635 [Dinoroseobacter shibae DFL 12]
Length = 245
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 59/137 (43%), Gaps = 25/137 (18%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 155
+ A+ ADLR A F AN AD+R++ FSG++ G ++ A F
Sbjct: 85 AGAELAGADLRDAYLTYAVFDGANLEGADLRDAFMPFAQFSGARMRGILFDRTNARDTVF 144
Query: 156 TGADLSDTLM-----DRMVLNEA---------------NLTNAVLVRTVLTRSDLGGAII 195
GADL M R L EA N NA LV VL +DL GA +
Sbjct: 145 AGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNARLVGAVLREADLTGARL 204
Query: 196 EGADFSDAVIDLAQKQA 212
GAD S+A + A QA
Sbjct: 205 TGADLSEADLTGAVTQA 221
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 10/91 (10%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS----------KFNGAYLEKAVAYKAN 154
F ADLR A V RA T AD+ +D SG+ + GA L +A A
Sbjct: 144 FAGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNARLVGAVLREADLTGAR 203
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVL 185
TGADLS+ + V A + AV RTV+
Sbjct: 204 LTGADLSEADLTGAVTQAAGFSGAVFCRTVM 234
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 33/68 (48%), Gaps = 5/68 (7%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 157
A G ADL A NF A A +RE+D +G++ GA L + AV A F+G
Sbjct: 167 ADLGGADLSGAFLEGANFGNARLVGAVLREADLTGARLTGADLSEADLTGAVTQAAGFSG 226
Query: 158 ADLSDTLM 165
A T+M
Sbjct: 227 AVFCRTVM 234
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 44/105 (41%), Gaps = 30/105 (28%)
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---------------DRM----- 168
D+ ++ +G+ AYL AV AN GADL D M DR
Sbjct: 83 DLAGAELAGADLRDAYLTYAVFDGANLEGADLRDAFMPFAQFSGARMRGILFDRTNARDT 142
Query: 169 -----VLNEANLTNAVLVRTVLTRSDLG-----GAIIEGADFSDA 203
L A++ L R LT +DLG GA +EGA+F +A
Sbjct: 143 VFAGADLRAASMVGVALPRATLTEADLGGADLSGAFLEGANFGNA 187
>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 377
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 59/111 (53%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT- 156
A A+L A+ VK + + RAN T AD+RE+D SG++ A L KA KAN +
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSL 199
Query: 157 ----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L + ++ + EANL NA L ++ L ++L A + A+ S A
Sbjct: 200 ANLDSANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKA 250
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 57/121 (47%), Gaps = 15/121 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A S DL +A R+N A++ E+D SG+ L A+ AN + DLS
Sbjct: 65 ANLSSTDLVRANLRSARLDRSNLVRANLYEADLSGASLVNINLSNAICASANLSHVDLSQ 124
Query: 163 TLM----------DRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDAVIDL 207
+ + DR L +ANL+ A+LV+ +L R++L A + AD S A + L
Sbjct: 125 SNLSSTNLSLANLDRADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYL 184
Query: 208 A 208
A
Sbjct: 185 A 185
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L+AA++ S L N EA+ R E + S AQ A L KA K N AN S
Sbjct: 147 LSAAILVKASLKQVILNRANLTEADLR-EADL-SGAQLYLAVLSKANLAKANLSLANLDS 204
Query: 128 ADMRESDFSGSKFNGAYLE---------------KAVAYKANFTGADLSDTLMDRMVLNE 172
A++ E+ GS F A LE KA KAN + A+L+ ++ + L
Sbjct: 205 ANLLEAKLEGSLFCEANLENANLSQSFLMEANLTKANLRKANLSKANLTSAILSQANLLG 264
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
ANL A L + L SD GA ++G + S A ++
Sbjct: 265 ANLAGASLAKANLAESDCFGANLQGTNLSQANVE 298
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A ADL A V N A N + D+ +S+ S + + A L++A +AN +
Sbjct: 90 ANLYEADLSGASLVNINLSNAICASANLSHVDLSQSNLSSTNLSLANLDRADLTQANLSA 149
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
A L + +++LN ANLT A L L+ + L A++ A+ + A + LA
Sbjct: 150 AILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSLA 200
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 52/96 (54%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L KA K N +AN TSA + +++ G+ GA L KA +++ GA+L T + +
Sbjct: 235 ANLTKANLRKANLSKANLTSAILSQANLLGANLAGASLAKANLAESDCFGANLQGTNLSQ 294
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ +L + L + L ++L GA + GA+ DA
Sbjct: 295 ANVEAVDLRESDLAKANLVGANLAGANLFGAELLDA 330
>gi|428312955|ref|YP_007123932.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254567|gb|AFZ20526.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 471
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 84/192 (43%), Gaps = 22/192 (11%)
Query: 30 LSKPLWVACQISSKTESDGQFPGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKY 89
L + +W+ +I + E G P + WR + V +N+S L ++ Y
Sbjct: 153 LERMMWLLNRIYADEE--GSQNTPINAEEFWRRYNERERDFTGVNLAGANLSNLP-MHSY 209
Query: 90 EAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 148
G+ S A A+LR N+ N A ++ +D S ++F A L A
Sbjct: 210 -------HGVNLSKANLNGANLRNV-----NWSSLNLMGASLKGADLSNNQFENANLRGA 257
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
AN +GA+LS + ++ L +ANL A L L +DL A + AD S+A ++ A
Sbjct: 258 NLDDANLSGANLSQSNLETATLKKANLNRANLRNAKLNSADLSHANLSDADVSEANLEGA 317
Query: 209 Q------KQALC 214
KQALC
Sbjct: 318 NLQEANLKQALC 329
>gi|424801888|ref|ZP_18227430.1| FIG01055523: hypothetical protein [Cronobacter sakazakii 696]
gi|423237609|emb|CCK09300.1| FIG01055523: hypothetical protein [Cronobacter sakazakii 696]
Length = 846
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 49/182 (26%), Positives = 83/182 (45%), Gaps = 19/182 (10%)
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
+A+L N FV + L AA + + + + + N EA SA A ++
Sbjct: 667 HARL-NKTTFVKSTLEAADFSDATLDSCSFVETNADEAR------FISATWITCAAASES 719
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRM 168
+ +F A +++R++ G++F A LE +A A+F A L +L R
Sbjct: 720 TLNRADFTHATLRQSNLRQTALCGARFELAKLENTDLSEADCRGASFQRASLVGSLFIRT 779
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVST 228
E + T+A L+ +L +S LGGA GA+ A DL+Q + NG ++G T
Sbjct: 780 DFREVDFTDANLMGALLQKSQLGGADFNGANLFRA--DLSQ-----SFTNGETRMSGAFT 832
Query: 229 RK 230
++
Sbjct: 833 KR 834
Score = 40.4 bits (93), Expect = 0.89, Method: Composition-based stats.
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-- 158
S A ADL NFR A++ + F GA L A ++F+GA
Sbjct: 551 SKALLECADLSHCQLDGANFRGTMLARAELHHTSLRDCNFEGASLSLAQCCHSDFSGARF 610
Query: 159 ---DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L +TL+D V ++A L + T TR A ++G F
Sbjct: 611 KDTQLQETLLDDCVFDDATLEGLLFRETWFTRCRFHRATLDGCVF 655
>gi|73668253|ref|YP_304268.1| hypothetical protein Mbar_A0710 [Methanosarcina barkeri str.
Fusaro]
gi|72395415|gb|AAZ69688.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 381
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 62/117 (52%), Gaps = 1/117 (0%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADL K N + +F D+ +++ + GA LE+A +AN GA+L +
Sbjct: 152 ADFQGADLEKVNLQGTNLKETSFKRTDLEKTNLQEADLQGADLEEANLQRANLQGANLKE 211
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
+ R L +AN+ A L + +++L GA ++ A+F ++ A+ K+A+ + AN
Sbjct: 212 ANLQRTDLRKANIQGADLGKANFEQANLKGANLKKANFEKTNLEEAKLKEAILQGAN 268
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 54/104 (51%), Gaps = 5/104 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L++A + + R+AN AD+ +++F + GA L+KA NF +L +
Sbjct: 202 ANLQGANLKEANLQRTDLRKANIQGADLGKANFEQANLKGANLKKA-----NFEKTNLEE 256
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ +L ANL A L++ L +++L A GA+ A ++
Sbjct: 257 AKLKEAILQGANLIKAKLIKAKLQKANLKSANFNGANLIKAKLE 300
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 157
A F A+L+ A K NF + N A ++E+ G+ K A L+KA ANF G
Sbjct: 232 ANFEQANLKGANLKKANFEKTNLEEAKLKEAILQGANLIKAKLIKAKLQKANLKSANFNG 291
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
A+L ++ L ANL A L R + A ++GA F +A ++ AQ
Sbjct: 292 ANLIKAKLEGANLQRANLKEANFNGADLQRVNFRKANLQGAKFKEANLEGAQ 343
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 20/121 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG---------------SKFNGAYLEK 147
A A ++ A+ + + + ANF AD++ +DF G + F LEK
Sbjct: 122 ANLEKAKVQGAIFCEADLQEANFQGADLQGADFQGADLEKVNLQGTNLKETSFKRTDLEK 181
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
+A+ GADL + + R ANL A L L R+DL A I+GAD A +
Sbjct: 182 TNLQEADLQGADLEEANLQR-----ANLQGANLKEANLQRTDLRKANIQGADLGKANFEQ 236
Query: 208 A 208
A
Sbjct: 237 A 237
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA--------- 153
A F A+L +A + N + ANF A++ +++ ++ G L++A ++A
Sbjct: 22 ADFMGANLEEANFIGSNLKGANFKGANLEKANLQATELQGVNLQEANLHRAKLQVATLYG 81
Query: 154 ------NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
N A+L + R L E NL A L RT L ++L A ++GA F +A
Sbjct: 82 ADLQRANLQEANLQGANLQRADLQEVNLQEANLQRTDLVEANLEKAKVQGAIFCEA 137
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLSDT 163
DL+ A +K A+F A++ E++F GS F GA LEKA G +L +
Sbjct: 8 DLQGANFIKTKLEGADFMGANLEEANFIGSNLKGANFKGANLEKANLQATELQGVNLQEA 67
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ R L A L A L R L ++L GA ++ AD +
Sbjct: 68 NLHRAKLQVATLYGADLQRANLQEANLQGANLQRADLQE 106
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 6/122 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 160
A A+L++A + N + AN D+ E++ +K GA +A +ANF GADL
Sbjct: 92 ANLQGANLQRADLQEVNLQEANLQRTDLVEANLEKAKVQGAIFCEADLQEANFQGADLQG 151
Query: 161 ---SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ-ALCKY 216
++++ L NL RT L +++L A ++GAD +A + A Q A K
Sbjct: 152 ADFQGADLEKVNLQGTNLKETSFKRTDLEKTNLQEADLQGADLEEANLQRANLQGANLKE 211
Query: 217 AN 218
AN
Sbjct: 212 AN 213
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 59/122 (48%), Gaps = 3/122 (2%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F DL K + + + A+ A+++ ++ G+ A L++ KAN GADL
Sbjct: 174 FKRTDLEKTNLQEADLQGADLEEANLQRANLQGANLKEANLQRTDLRKANIQGADLGKAN 233
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYA--NGTN 221
++ L ANL A +T L + L AI++GA+ A +I ++A K A NG N
Sbjct: 234 FEQANLKGANLKKANFEKTNLEEAKLKEAILQGANLIKAKLIKAKLQKANLKSANFNGAN 293
Query: 222 PI 223
I
Sbjct: 294 LI 295
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 50/108 (46%), Gaps = 3/108 (2%)
Query: 99 IGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
IGS A F A+L KA + N A++ + + GA L++A +AN
Sbjct: 35 IGSNLKGANFKGANLEKANLQATELQGVNLQEANLHRAKLQVATLYGADLQRANLQEANL 94
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GA+L + + L EANL LV L ++ + GAI AD +A
Sbjct: 95 QGANLQRADLQEVNLQEANLQRTDLVEANLEKAKVQGAIFCEADLQEA 142
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA NF +AN A++++++F + A L++A+ AN A L
Sbjct: 222 ANIQGADLGKA-----NFEQANLKGANLKKANFEKTNLEEAKLKEAILQGANLIKAKLIK 276
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L AN A L++ L ++L A ++ A+F+ A
Sbjct: 277 AKLQKANLKSANFNGANLIKAKLEGANLQRANLKEANFNGA 317
>gi|448677922|ref|ZP_21689112.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
DSM 12282]
gi|445773597|gb|EMA24630.1| pentapeptide repeat-containing protein [Haloarcula argentinensis
DSM 12282]
Length = 428
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+A F + LR A + +AN +SAD+RE+D SG+ A L A KA+ +GADL
Sbjct: 49 NAISFENTGLRGADLSDADLGKANLSSADLREADLSGADLGSADLSGANLQKADLSGADL 108
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSD 202
S + L A+L++A L RT L+ +DL A + DFSD
Sbjct: 109 SYANLSGADLENADLSSADLRRTNLSGVKFVETDLADADLRNIDFSD 155
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADLR+ F + AD+R DFS ++ G L A + + +GADL
Sbjct: 121 ADLSSADLRRTNLSGVKFVETDLADADLRNIDFSDTELVGTDLSGADFFATDLSGADLRV 180
Query: 163 TLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 203
M + L EA+L+ A L T L+ +DL GA + G D SDA
Sbjct: 181 ADMSNVNLREADLSGADLGGTDLSDANLREADLSGADLGGVDLSDA 226
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 51/98 (52%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A SADLR+A + A+ + A+++++D SG+ + A L A A+ + ADL
Sbjct: 71 ANLSSADLREADLSGADLGSADLSGANLQKADLSGADLSYANLSGADLENADLSSADLRR 130
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
T + + E +L +A L + ++L G + GADF
Sbjct: 131 TNLSGVKFVETDLADADLRNIDFSDTELVGTDLSGADF 168
>gi|297796179|ref|XP_002865974.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
gi|297311809|gb|EFH42233.1| thylakoid lumenal 17.4 kDa protein, chloroplast [Arabidopsis lyrata
subsp. lyrata]
Length = 236
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N + ++A M + F G+ + KA A +A+F G + ++ ++DR+ ++NL
Sbjct: 123 QTNLKGKTLSAALMVGAKFDGADMTEVVMSKAYAVEASFKGVNFTNAVIDRVNFGKSNLK 182
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 234
AV TVL+ S A +E F D +I Q +C+ N R LGC
Sbjct: 183 GAVFRNTVLSGSTFEEANLEDVVFEDTIIGYIDLQKICR-----NESINEEGRLVLGC 235
>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
Length = 831
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R A F A + E+ FSGS+ GA A KANF A +D + +L ANL A
Sbjct: 537 DLRGAKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAADAVFRGAILANANLRAA 596
Query: 179 VLVRTVLTRSDLGGA-----IIEGADFSDAVI 205
+RT DL GA + GADF+ A +
Sbjct: 597 TFLRTNFQNVDLTGADFAFSDLRGADFTGATL 628
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F++ + + A++ +S F G F GA L A K +FT A+L+ L N TNA
Sbjct: 231 FKKTDLSGAELEQSHFGGCDFTGADLSHAKLQKTDFTAANLAGATCVDADLRGTNFTNAD 290
Query: 180 LVR-----TVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
L + L +DL GA + GADF+ A + A+ L
Sbjct: 291 LRKANFRGANLAGADLTGANVAGADFTGANLTGAKVDGL 329
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 38/62 (61%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +A+L A V + R NFT+AD+R+++F G+ GA L A A+FTGA+L+
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTGANLTGAK 325
Query: 165 MD 166
+D
Sbjct: 326 VD 327
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 5/114 (4%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAV 149
G F + F DL A + +F +FT AD+ +++DF+ + GA A
Sbjct: 221 GSFTRATDCTFKKTDLSGAELEQSHFGGCDFTGADLSHAKLQKTDFTAANLAGATCVDAD 280
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NFT ADL L A+LT A + T ++L GA ++G D S A
Sbjct: 281 LRGTNFTNADLRKANFRGANLAGADLTGANVAGADFTGANLTGAKVDGLDASKA 334
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 42/94 (44%), Gaps = 4/94 (4%)
Query: 85 DLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
DL+ E E + FG F ADL A K +F AN A ++D G+ F A
Sbjct: 235 DLSGAELE-QSHFG---GCDFTGADLSHAKLQKTDFTAANLAGATCVDADLRGTNFTNAD 290
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
L KA AN GADL+ + ANLT A
Sbjct: 291 LRKANFRGANLAGADLTGANVAGADFTGANLTGA 324
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 32/65 (49%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A R + A +A+LR A ++ NF+ + T AD SD G+ F GA L+ A
Sbjct: 574 ASARAADAVFRGAILANANLRAATFLRTNFQNVDLTGADFAFSDLRGADFTGATLKNASF 633
Query: 151 YKANF 155
+A F
Sbjct: 634 SQAKF 638
>gi|94263119|ref|ZP_01286937.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
gi|93456490|gb|EAT06604.1| Pentapeptide repeat [delta proteobacterium MLMS-1]
Length = 355
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 105 FGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYKAN 154
F D R A + F++ +F T D+R+ + G+ F GA L K + AN
Sbjct: 41 FKGVDFRGAKITRTGFKKCSFAGARFDETDLTMVDLRQLELPGASFKGARLHKTLLGGAN 100
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
G D S + +L EA+L+ A + RS L A E ADFS+AV+
Sbjct: 101 LAGCDFSQARIFWSLLQEADLSRASFRQAEFERSILQDANCEEADFSEAVL 151
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 10/104 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTG 157
ADL +A + F R+ A+ E+DFS S+ G L +A +K +G
Sbjct: 119 ADLSRASFRQAEFERSILQDANCEEADFSEAVLFKTILLNSRLKGINLRQAKMHKVLLSG 178
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
DL+ M E N NA L +R+D+ G + GAD S
Sbjct: 179 CDLAGQDFSDMRFREVNFANAKLGGADFSRADISGCVFTGADLS 222
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 5/105 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFTG 157
A F A L K + N +F+ A ++E+D S + F A E+++ AN
Sbjct: 84 ASFKGARLHKTLLGGANLAGCDFSQARIFWSLLQEADLSRASFRQAEFERSILQDANCEE 143
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
AD S+ ++ + +L + L L + + + L G + G DFSD
Sbjct: 144 ADFSEAVLFKTILLNSRLKGINLRQAKMHKVLLSGCDLAGQDFSD 188
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+F RA+ + +D S S+ +G +++ N GADL + + L E+NL A
Sbjct: 205 DFSRADISGCVFTGADLSASRLSGVIARQSMFAGTNLQGADLEGAGLVQAYLGESNLEGA 264
Query: 179 VLVRTVLTRSDLGGAIIEGADFS 201
LV L + L A GADF+
Sbjct: 265 SLVGANLESASLEKARAMGADFT 287
>gi|443475216|ref|ZP_21065173.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443020003|gb|ELS34017.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 352
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 52/102 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A V A A + ++DF+G NGA + A + GADL+D + R
Sbjct: 228 ANLSSASLVGAVLNNAKLERAILIDADFNGVTLNGAIMADIKASRVQMQGADLTDAKLSR 287
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L+ ANL A++VR L + L + AD +DA+++ A+
Sbjct: 288 ADLSRANLKGAIMVRANLIEAYLARTNLADADLTDAILNRAE 329
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 15/113 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A K + R AN A + ++ S +K N AYL+ +AN + A L
Sbjct: 176 SGADLRGANLSGADLYKADLRGANLQEATLSGANLSEAKLNNAYLQGVFLTEANLSSASL 235
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII----------EGADFSDA 203
VLN A L A+L+ L GAI+ +GAD +DA
Sbjct: 236 VGA-----VLNNAKLERAILIDADFNGVTLNGAIMADIKASRVQMQGADLTDA 283
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 68/137 (49%), Gaps = 9/137 (6%)
Query: 80 ISALADLNKYEAETR----GEFGIGSAAQFGSADLRKAVHVKE----NFRRANFTSADMR 131
++ L D N +A+ R G +G A G A+LR+ V + + R + A +
Sbjct: 113 LANLMDANLIDADMRTINLGGANLGGACMRG-ANLRQERAVGDRDEIDVSRKKRSIASLI 171
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
++ SG+ GA L A YKA+ GA+L + + L+EA L NA L LT ++L
Sbjct: 172 GANLSGADLRGANLSGADLYKADLRGANLQEATLSGANLSEAKLNNAYLQGVFLTEANLS 231
Query: 192 GAIIEGADFSDAVIDLA 208
A + GA ++A ++ A
Sbjct: 232 SASLVGAVLNNAKLERA 248
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F L A+ R AD+ ++ S + + A L+ A+ +AN A L+
Sbjct: 253 ADFNGVTLNGAIMADIKASRVQMQGADLTDAKLSRADLSRANLKGAIMVRANLIEAYLA- 311
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
R L +A+LT+A+L R L+ ++L GAI++GA D +
Sbjct: 312 ----RTNLADADLTDAILNRAELSSANLVGAILKGATLPDGKV 350
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 10/93 (10%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+LR A+ V N AN A + + S + +GA L+ + AN + A+L D
Sbjct: 64 ANLRGALMVGANLCGANLNQASLSNVNLSNADLHGASLQGTTLFGANLSLANLMD----- 118
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
ANL +A + L ++LGGA + GA+
Sbjct: 119 -----ANLIDADMRTINLGGANLGGACMRGANL 146
>gi|73670411|ref|YP_306426.1| hypothetical protein Mbar_A2951 [Methanosarcina barkeri str.
Fusaro]
gi|72397573|gb|AAZ71846.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 286
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 53/107 (49%), Gaps = 5/107 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+ + R AN AD+RE++ G+ GA L + N GADL +
Sbjct: 72 ANLEGADLRETNLGGADLREANLGGADLREANLEGADLEGADL-----RETNLGGADLRE 126
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ L EANL A L T L ++L GA +EGA+ A ++ A
Sbjct: 127 ANLGGADLREANLEGADLRETNLLEANLEGASLEGANLKVANLERAN 173
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 51/105 (48%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
G ADLR+A + R AN AD+RE++ + GA LE A AN A+L
Sbjct: 118 NLGGADLREANLGGADLREANLEGADLRETNLLEANLEGASLEGANLKVANLERANLKGV 177
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ L+ A L A LV + L ++ GA +E D + A ++ A
Sbjct: 178 NLIEAELSWAELKGANLVESYLVGTNFTGANLEWVDLTKANLEEA 222
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 7/119 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G ADLR+A N A+ AD+RE++ G+ A L A +AN GADL +
Sbjct: 92 ANLGGADLREA-----NLEGADLEGADLRETNLGGADLREANLGGADLREANLEGADLRE 146
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
T + L A+L A L L R++L G + A+ S A +L + Y GTN
Sbjct: 147 TNLLEANLEGASLEGANLKVANLERANLKGVNLIEAELSWA--ELKGANLVESYLVGTN 203
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 15/108 (13%)
Query: 105 FGSADLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFN-----GAYLEKAVAYKAN 154
F + +A ++N +++NF A+++E F G GA LEKA AN
Sbjct: 14 FEETKVTRANLNEDNLKKSNFIGTCLIGANLKELSFEGVNLREANLLGANLEKANLLGAN 73
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
GADL +T + L EANL A L ++L GA +EGAD +
Sbjct: 74 LEGADLRETNLGGADLREANLGGADL-----REANLEGADLEGADLRE 116
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N RAN ++ E++ S ++ GA L ++ NFTGA+L + + L +ANL A
Sbjct: 168 NLERANLKGVNLIEAELSWAELKGANLVESYLVGTNFTGANL-----EWVDLTKANLEEA 222
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
+ L +++ GA I+GA+ +A
Sbjct: 223 IFTWADLEGANISGANIKGANLKEA 247
>gi|341583996|ref|YP_004764487.1| hypothetical protein Rh054_04430 [Rickettsia heilongjiangensis 054]
gi|340808221|gb|AEK74809.1| hypothetical protein Rh054_04430 [Rickettsia heilongjiangensis 054]
Length = 959
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMKRADLTKANFTKAVLENADMQAAEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 39.7 bits (91), Expect = 1.6, Method: Composition-based stats.
Identities = 37/143 (25%), Positives = 59/143 (41%), Gaps = 27/143 (18%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L A + + ++ + L++ +AE G ++ A+ N + ANF +
Sbjct: 576 LTNATLTNATAQFAKLSNATLEKAEAEG------------LNISDAIAKNINAKEANFKN 623
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A M+ +D + + F A LE A D+ + EANL A L
Sbjct: 624 AIMKRADLTKANFTKAVLENA----------DMQAAEAAEAIFKEANLKQA-----NLKA 668
Query: 188 SDLGGAIIEGADFSDAVIDLAQK 210
++L G EGADF A I+ A K
Sbjct: 669 ANLAGINKEGADFDKAKINDATK 691
Score = 38.1 bits (87), Expect = 3.8, Method: Composition-based stats.
Identities = 38/144 (26%), Positives = 61/144 (42%), Gaps = 13/144 (9%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMD 166
+ RA D+ E + + SKFN + A A K +N TG L+ M
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQ 475
Query: 167 RMVLNEANLTNAVLVRTVLTRSDL 190
R+ + L NA+L + + +DL
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTDL 499
>gi|409912856|ref|YP_006891321.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
KN400]
gi|298506440|gb|ADI85163.1| pentapeptide repeat domain protein [Geobacter sulfurreducens KN400]
Length = 259
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 51/98 (52%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+F A+L A K N + NF+ A++ ++FSG+K A L AV NF+ ADLS
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
T + + L AN A T+L + L GA GAD
Sbjct: 177 TDLGSLDLEGANFRGATFNGTLLRDAKLKGADFTGADL 214
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 73/162 (45%), Gaps = 6/162 (3%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQ A L +A+ + R A+ + A + + F G+ +GA + K K NF+ A+L
Sbjct: 85 TGAQMDGASLDEAIFDTADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANL 144
Query: 161 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVID-LAQKQALC 214
++ L ANL AVL T L+ +DLG +EGA+F A + + A
Sbjct: 145 TNANFSGAKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLLRDAKL 204
Query: 215 KYANGTNPITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQK 256
K A+ T S S+ ++ N G P+ A Q+
Sbjct: 205 KGADFTGADLRQSRFHSVSIYDTATNRLGESFDPVRCADLQE 246
>gi|428313290|ref|YP_007124267.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428254902|gb|AFZ20861.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 283
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
++ + AN ++R + G+ A L+ + AN TGA+LS + LNEANL
Sbjct: 27 LEPDLSEANLIGVNLRGAHLQGTNLRKALLDHTLLIAANLTGANLSQANLSHASLNEANL 86
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADF 200
A L+ T L +DL A + GA+
Sbjct: 87 VEACLIDTTLISADLSHAELTGANL 111
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 58/113 (51%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AQ +L +A V+ N N ++A++ E++ G+ YL KA KAN + A L
Sbjct: 147 SGAQLLRTNLSEAKLVQANLSHTNLSNANLHEAELIGT-----YLYKAELQKANLSEAHL 201
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAIIEGADFSDAVIDLA 208
S + R L EA+L A L L+RS DL GA + GA+ S A ++ A
Sbjct: 202 SGAYLSRANLREADLERADLRWANLSRSNLCEADLKGANLRGANLSKANLERA 254
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 18/131 (13%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYEAETRGEF-------------GIGSAAQFGSAD 109
+ T L+ A + + + + L++ N +EAE G + S A A+
Sbjct: 151 LLRTNLSEAKLVQANLSHTNLSNANLHEAELIGTYLYKAELQKANLSEAHLSGAYLSRAN 210
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR+A + + R AN + +++ E+D G+ GA L KA +A+ GA+L T
Sbjct: 211 LREADLERADLRWANLSRSNLCEADLKGANLRGANLSKANLERADLRGANLRGT-----N 265
Query: 170 LNEANLTNAVL 180
LN+ANL A++
Sbjct: 266 LNKANLQGAMM 276
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 55/114 (48%), Gaps = 10/114 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L +A V+ SAD+ ++ +G+ GA L Y AN G DL
Sbjct: 72 SQANLSHASLNEANLVEACLIDTTLISADLSHAELTGANLIGADL-----YGANLKGVDL 126
Query: 161 SD-----TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
SD T + R+ L A+L+ A L+RT L+ + L A + + S+A + A+
Sbjct: 127 SDANLIGTNLRRVNLQGADLSGAQLLRTNLSEAKLVQANLSHTNLSNANLHEAE 180
Score = 37.4 bits (85), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 45/101 (44%), Gaps = 15/101 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +LRKA+ AN T A++ +++ S + N A L +A ADLS
Sbjct: 44 AHLQGTNLRKALLDHTLLIAANLTGANLSQANLSHASLNEANLVEACLIDTTLISADLS- 102
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A LT A L+ +DL GA ++G D SDA
Sbjct: 103 ---------HAELTGANLI-----GADLYGANLKGVDLSDA 129
>gi|304404631|ref|ZP_07386292.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
gi|304346438|gb|EFM12271.1| pentapeptide repeat protein [Paenibacillus curdlanolyticus YK9]
Length = 288
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%)
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
+ F + + SDFSG+ G+ + + +ANF GA+L+D + L +A+ ++L
Sbjct: 99 HKGQFKGSALHGSDFSGADLTGSSFKGSDVREANFDGANLTDCSFTALDLTKASFNKSIL 158
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDL 207
VRT ++S L GA +G +D V+ L
Sbjct: 159 VRTNFSKSGLDGAAFKGVKLTDVVLTL 185
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 9/110 (8%)
Query: 94 RGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
+G+F GSA + F ADL + + R ANF A++ + F+ A K++
Sbjct: 100 KGQFK-GSALHGSDFSGADLTGSSFKGSDVREANFDGANLTDCSFTALDLTKASFNKSIL 158
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ NF+ + L D LT+ VL T L ++ G + +G DF
Sbjct: 159 VRTNFSKSGL-----DGAAFKGVKLTDVVLTLTDLRKTSFEGCLFDGVDF 203
>gi|282898833|ref|ZP_06306820.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281196360|gb|EFA71270.1| Pentapeptide repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 189
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 45/91 (49%), Gaps = 2/91 (2%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N R N D +D S S G L A +AN GA L + + ++L A+LT A
Sbjct: 26 NLRGVNLGGIDFARADLSWSDLTGISLSGANLSQANLRGAKLENAHLSEVILCGADLTQA 85
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+L+ L SDL GA++ A+ DA DL Q
Sbjct: 86 ILINAHLNESDLSGALLVDANLCDA--DLHQ 114
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 10/98 (10%)
Query: 103 AQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A +DL A+ V N +A+ T+A+++ + +G+K G + KA A+ TG
Sbjct: 90 AHLNESDLSGALLVDANLCDADLHQASITAANLQSAKLNGAKMGGVRMWKADLQGADLTG 149
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
ADLS+ M + L+ ANL+ + T LT GAI+
Sbjct: 150 ADLSEANMCGVNLSMANLSATDMSETFLT-----GAIM 182
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 63/135 (46%), Gaps = 21/135 (15%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVK-ENFRRANFTSADMRESDFSG------- 137
LN+Y RGE F LR AV+++ N +F AD+ SD +G
Sbjct: 7 LNRY---ARGE------RNFNGICLR-AVNLRGVNLGGIDFARADLSWSDLTGISLSGAN 56
Query: 138 ---SKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
+ GA LE A + GADL+ ++ LNE++L+ A+LV L +DL A
Sbjct: 57 LSQANLRGAKLENAHLSEVILCGADLTQAILINAHLNESDLSGALLVDANLCDADLHQAS 116
Query: 195 IEGADFSDAVIDLAQ 209
I A+ A ++ A+
Sbjct: 117 ITAANLQSAKLNGAK 131
>gi|416394625|ref|ZP_11686208.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
gi|357263221|gb|EHJ12255.1| pentapeptide repeat protein [Crocosphaera watsonii WH 0003]
Length = 164
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
E+ R A+ +D+R+++ G+ A LE A AN GA+L+ ++ LN++NL
Sbjct: 62 NEDLRYAHLIGSDLRDANLEGAILIEANLEGADLTGANLEGANLTGAMLSNASLNDSNLD 121
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
N L ++ +D+ GA +E D ++A I
Sbjct: 122 NVNLASAIIYDADVTGASMENLDITNAQI 150
>gi|344339023|ref|ZP_08769953.1| pentapeptide repeat protein [Thiocapsa marina 5811]
gi|343800943|gb|EGV18887.1| pentapeptide repeat protein [Thiocapsa marina 5811]
Length = 284
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 11/103 (10%)
Query: 112 KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----------VAYKANFTGADL 160
KA+ ++ N R AN AD+R+ + + GA + +A V ANF GADL
Sbjct: 149 KALFIRANLREANLCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADL 208
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + + E N NA L LT + LGGAI+ AD ++A
Sbjct: 209 RGSKLRSVSAQETNFRNANLTDVDLTNAVLGGAILRRADVTNA 251
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 50/106 (47%), Gaps = 6/106 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR A + + R AN ADMR++DF GS KA+ +AN A+L
Sbjct: 103 SKANLERADLRHADVRRADLRGANLAHADMRDTDFQGSDLCHVVAPKALFIRANLREANL 162
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG------AIIEGADF 200
+ LN+ANL A + LT + GG A EGAD
Sbjct: 163 CGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADL 208
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 50/118 (42%), Gaps = 16/118 (13%)
Query: 103 AQFGSADLRKAVHVKE-----------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 151
A G ADL +A E N +AN AD+R +D + GA L A
Sbjct: 74 ADLGGADLTQAHLGAERPSRAATLNGANLSKANLERADLRHADVRRADLRGANLAHADMR 133
Query: 152 KANFTGADLSDT-----LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+F G+DL L R L EANL A L L ++L GA + AD + A+
Sbjct: 134 DTDFQGSDLCHVVAPKALFIRANLREANLCGADLRDCHLNDANLAGASMHEADLTSAL 191
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 14/108 (12%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETR----GEFGIGSAAQFGSADLR----KAVHVKE- 118
L A + C N + LA + +EA+ G F + + A F ADLR ++V +E
Sbjct: 162 LCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADLRGSKLRSVSAQET 221
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
NFR AN T D+ + + GA L +A A+F+G +L+ M+
Sbjct: 222 NFRNANLTDVDL-----TNAVLGGAILRRADVTNADFSGVELASVTME 264
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 63/157 (40%), Gaps = 19/157 (12%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A ADLR + R A+ +SA++ SD G+ +GA L A+ + T ADL
Sbjct: 18 ADTLAGADLRDMHLNGADLRGADLSSANLESSDLVGALLSGARLIDAILVATDLTDADLG 77
Query: 162 DTLMDR-----------MVLNEANLTNAVLVRTVL-----TRSDLGGAIIEGADFSDAVI 205
+ + LN ANL+ A L R L R+DL GA + AD D
Sbjct: 78 GADLTQAHLGAERPSRAATLNGANLSKANLERADLRHADVRRADLRGANLAHADMRDTDF 137
Query: 206 DLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAY 242
Q LC + R++ CG R+ +
Sbjct: 138 ---QGSDLCHVVAPKALFIRANLREANLCGADLRDCH 171
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 16/129 (12%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
R+ V E AD+R+ +G+ GA L A ++ GA +L
Sbjct: 7 RETGQVLERIDADTLAGADLRDMHLNGADLRGADLSSANLESSDLVGA----------LL 56
Query: 171 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRK 230
+ A L +A+LV T LT +DLGGA + A A++ + NG N R
Sbjct: 57 SGARLIDAILVATDLTDADLGGADLTQAHLG------AERPSRAATLNGANLSKANLERA 110
Query: 231 SLGCGNSRR 239
L + RR
Sbjct: 111 DLRHADVRR 119
>gi|298246992|ref|ZP_06970797.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297549651|gb|EFH83517.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 381
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 56/119 (47%), Gaps = 8/119 (6%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAV 149
G +GS + GSA + ++ + A A MR S D S + GA L KA
Sbjct: 236 GHDALGSQGERGSA---RHPDLQAHLSHAQLAGAKMRGSYLSGVDLSQANLRGADLSKAY 292
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
Y AN GADLS + L EAN+ A L L+++ L GA + AD S A + LA
Sbjct: 293 FYGANLQGADLSGANLTETTLTEANIEGANLTEANLSKATLIGANLRQADLSGARLTLA 351
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 48/104 (46%), Gaps = 21/104 (20%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
GS A G ADL+K V + N D+R +F +AN GAD
Sbjct: 146 GSKALVG-ADLQKIV-----LPQINLAQMDLRRVNFR---------------EANLQGAD 184
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LS + R L+ ANL++A L L +DL G + GAD SD+
Sbjct: 185 LSGVNLYRADLSGANLSHATLKGADLRGADLRGTDLTGADLSDS 228
>gi|381206178|ref|ZP_09913249.1| hypothetical protein SclubJA_11179 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 205
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 68/136 (50%), Gaps = 15/136 (11%)
Query: 85 DLNKYEAETRGEFGIGSAAQFGSADLRKA-VHVKE----NFRRANFTSADMRESDFSGSK 139
DL+K +A + S A G A+L A +H N + AN AD+RE+D +
Sbjct: 55 DLDKLQATNKCIRCDLSGADLGGANLSDANLHFANLQGTNLKGANLNWADLREADLRKAD 114
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN-----AVLVRTVLTRSD----- 189
N A L+ A +A+ GA+L + + + EANL A L R L+R++
Sbjct: 115 LNWARLKNADLRRADLYGANLGEAFLQYSDMREANLREVDLEAADLYRAELSRANLEDAR 174
Query: 190 LGGAIIEGADFSDAVI 205
LGGAI++ A S+A++
Sbjct: 175 LGGAILKFASMSEAIL 190
>gi|217968703|ref|YP_002353937.1| pentapeptide repeat-containing protein [Thauera sp. MZ1T]
gi|217506030|gb|ACK53041.1| pentapeptide repeat protein [Thauera sp. MZ1T]
Length = 215
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 1/109 (0%)
Query: 102 AAQFGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
A G D+R+ N AN ++ S F+ S +GA L A +ANF+ A++
Sbjct: 29 AGNIGGCDIRRGTLCTNLNLNGANLEGVNLANSQFTRSDLSGANLRGATLNEANFSQAEM 88
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ +D+ + ANL A L +DL AI++ AD DA + AQ
Sbjct: 89 AGATLDKASMLRANLRGARLTGASFKEADLRNAILQNADLHDADLTAAQ 137
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A +L + + + AN A + E++FS ++ GA L+KA +AN GA L
Sbjct: 49 NGANLEGVNLANSQFTRSDLSGANLRGATLNEANFSQAEMAGATLDKASMLRANLRGARL 108
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ EA+L NA+L L +DL A + GAD ++A
Sbjct: 109 TGA-----SFKEADLRNAILQNADLHDADLTAAQLGGADLTNA 146
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 11/115 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F ADLR A+ +AD+ ++D + ++ GA L A A ++ F GA+L
Sbjct: 109 TGASFKEADLRNAI----------LQNADLHDADLTAAQLGGADLTNARAERSRFDGAEL 158
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ + + L A+ A L+ TR+ L A + GA+ A+ AQ CK
Sbjct: 159 TRSNLRAAKLAGASFVGANLLGANFTRAQLSNADLTGANLDQAIFLNAQTDG-CK 212
Score = 43.9 bits (102), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 49/105 (46%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ A L KA ++ N R A T A +E+D + A L A A GADL
Sbjct: 84 SQAEMAGATLDKASMLRANLRGARLTGASFKEADLRNAILQNADLHDADLTAAQLGGADL 143
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ +R + A LT + L L + GA + GA+F+ A +
Sbjct: 144 TNARAERSRFDGAELTRSNLRAAKLAGASFVGANLLGANFTRAQL 188
>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 403
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 59/118 (50%), Gaps = 12/118 (10%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A+ RG F S A ADLR+A AN + AD+ E++ SG+ GA L A+
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 295
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 203
+ AN GA LS + +L+ ANL A L L+ ++L GAI+ AD +A
Sbjct: 296 WGANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 353
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 56/121 (46%), Gaps = 17/121 (14%)
Query: 93 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
T+ EF A A+L KA+ R +++ D SG+ GA L A +
Sbjct: 200 TKAEFTT-DAKVIKKAELIKAI------REGTIDKTTLQQVDLSGAILRGADLRGAFLSE 252
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDL 207
AN GADL R L+EANL+ A L L+ +DL GAI+ GA+ A + L
Sbjct: 253 ANLKGADLR-----RAFLSEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSL 307
Query: 208 A 208
A
Sbjct: 308 A 308
>gi|452964739|gb|EME69773.1| serine/threonine protein kinase [Magnetospirillum sp. SO-1]
Length = 137
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 46/89 (51%), Gaps = 15/89 (16%)
Query: 133 SDFSGSKFNGAYLEKAVAYKANFTGA----------DLSDTLMDRMVLNEAN-----LTN 177
SDFSGS N A L +AV ANF GA DL++ R VLN AN L
Sbjct: 8 SDFSGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKG 67
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A+L V+ +DL A +EGAD A+I+
Sbjct: 68 AILAGAVMNNADLSCATLEGADLRGAIIN 96
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 59/111 (53%), Gaps = 1/111 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S + +ADLR+AV + NF A A + ++D + ++F + L A + A GA L
Sbjct: 11 SGSVLNAADLRQAVLIGANFEGAVLNHARLTDADLTEARFLRSVLNNANMHGACLKGAIL 70
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
+ +M+ L+ A L A L ++ +DL GA + GAD + A ++L + Q
Sbjct: 71 AGAVMNNADLSCATLEGADLRGAIINNADLSGADLRGADLTGA-LNLTRDQ 120
>gi|428217541|ref|YP_007102006.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427989323|gb|AFY69578.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 353
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 4/126 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A FGSA+L A + N +AN AD+ ++D G+K G L +A +AN D +
Sbjct: 54 ANFGSANLLGANLSEANLTKANLREADLYKADLGGAKLIGTSLIRAYLREANLRDCDCNS 113
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
T + L E L NA L L+ S+L A + A DA++ A+ YAN
Sbjct: 114 TALIGADLTEVCLENADLTGANLSESNLSSANLNFAILKDAIL----SNAIASYANMNET 169
Query: 223 ITGVST 228
I ++
Sbjct: 170 IMDMAV 175
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 14/145 (9%)
Query: 74 ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES 133
A S+ I++ A++N ET + + AQ D A V+ + R AN AD+ +
Sbjct: 154 AILSNAIASYANMN----ETIMDMAVLDRAQLNFVDFNGAAMVQASLRHANLCGADLSGA 209
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE----------ANLTNAVLVRT 183
+ S + +GA L +A+ AN + A+LS ++ L+ ANLT+A+L
Sbjct: 210 NLSYANLSGANLCEAILSNANLSHANLSGAILRDASLSNANLSGADLSGANLTDAILSDA 269
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLA 208
L+R++L AI+ GA A ++ A
Sbjct: 270 DLSRANLSEAILAGAQLISAKLEAA 294
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 104 QFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
+ A+LR+A + N R ANF SA++ ++ S + A L +A YKA+ GA
Sbjct: 30 ELSGANLRRATLREVNLSGVDLRWANFGSANLLGANLSEANLTKANLREADLYKADLGGA 89
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L T + R L EANL + T L +DL +E AD + A
Sbjct: 90 KLIGTSLIRAYLREANLRDCDCNSTALIGADLTEVCLENADLTGA 134
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL---- 175
A+ T A++ ES+ S + N A L+ A+ A + A++++T+MD VL+ A L
Sbjct: 126 LENADLTGANLSESNLSSANLNFAILKDAILSNAIASYANMNETIMDMAVLDRAQLNFVD 185
Query: 176 -TNAVLVRTVLTRSDLGGAIIEGADFS 201
A +V+ L ++L GA + GA+ S
Sbjct: 186 FNGAAMVQASLRHANLCGADLSGANLS 212
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 155
S A A+L +A+ N AN + A +R++ + SG+ +GA L A+ A+
Sbjct: 212 SYANLSGANLCEAILSNANLSHANLSGAILRDASLSNANLSGADLSGANLTDAILSDADL 271
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ A+LS+ ++ L A L A LV T L +++L A ++G DA
Sbjct: 272 SRANLSEAILAGAQLISAKLEAAFLVGTDLIKANLRLASLKGVSLKDA 319
>gi|428224795|ref|YP_007108892.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984696|gb|AFY65840.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 284
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL +A N RRANFT+A MR + S GA + + Y+A ++LS
Sbjct: 185 ANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRAKLNWSNLS- 243
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
EA+L+ A+L+ T L R++L A ++ A+ S A +
Sbjct: 244 ---------EADLSEAILIDTNLRRANLRDANLQNANLSGATM 277
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 5/117 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK-----AVAYKANFTG 157
A + R+A ++ N +N T + RE++ SGS GA L++ A AN G
Sbjct: 30 ADLRDVNFREAHLIEVNLSGSNLTGVNFREANLSGSNLGGAMLQECNLIGANLLGANLMG 89
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+DLS + + L+E NL +A L ++ ++L GA + G + + A + A+ C
Sbjct: 90 SDLSGSSLRSANLSEVNLRSANLSDAIVGEANLSGANLYGTNLTGAHLSRARLVETC 146
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 50/94 (53%), Gaps = 10/94 (10%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS-----DTLMDRMVLNEAN 174
R AN + ++R ++ S + A L A Y N TGA LS +T ++ +L+ AN
Sbjct: 97 LRSANLSEVNLRSANLSDAIVGEANLSGANLYGTNLTGAHLSRARLVETCLEHAILDNAN 156
Query: 175 LTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 203
L+ +VL LT ++ L GA +EGA+ SDA
Sbjct: 157 LSGSVLNGANLTGARLSQAVLSGASLEGANLSDA 190
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 67/146 (45%), Gaps = 11/146 (7%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANF 155
S + G A L++ + N AN +D+ R ++ S A L A+ +AN
Sbjct: 63 SGSNLGGAMLQECNLIGANLLGANLMGSDLSGSSLRSANLSEVNLRSANLSDAIVGEANL 122
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+GA+L T + L+ A L L +L ++L G+++ GA+ + A + QA+
Sbjct: 123 SGANLYGTNLTGAHLSRARLVETCLEHAILDNANLSGSVLNGANLTGARL----SQAVLS 178
Query: 216 YAN--GTNPITGVSTRKSLGCGNSRR 239
A+ G N TR +LG N RR
Sbjct: 179 GASLEGANLSDADLTRANLGSTNLRR 204
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 68/153 (44%), Gaps = 25/153 (16%)
Query: 95 GEFGIGSAAQFGS----ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
GE + A +G+ A L +A V+ A +A++ S +G+ GA L +AV
Sbjct: 118 GEANLSGANLYGTNLTGAHLSRARLVETCLEHAILDNANLSGSVLNGANLTGARLSQAVL 177
Query: 151 YKANFTGADLSDTLMDR-----MVLNEANLTN---------------AVLVRTVLTRSDL 190
A+ GA+LSD + R L AN TN A ++R L R+ L
Sbjct: 178 SGASLEGANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRAKL 237
Query: 191 GGAIIEGADFSDAV-IDLAQKQALCKYANGTNP 222
+ + AD S+A+ ID ++A + AN N
Sbjct: 238 NWSNLSEADLSEAILIDTNLRRANLRDANLQNA 270
>gi|334118359|ref|ZP_08492448.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459366|gb|EGK87979.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 280
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR+ + NFR N AD++ + S + F + + A AN TGA+L + + +
Sbjct: 7 LRQYAAGERNFREINLVGADLKGVNLSEANFTRSNFQDANLKGANLTGANLREVKLAGVD 66
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L EANL+ A L+ T L+R++L GA + GA+
Sbjct: 67 LTEANLSEANLIGTDLSRANLSGANLMGANL 97
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 8/112 (7%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR ++ + N +AN T A++ E++F+ + A L A + N A+L
Sbjct: 88 SGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEANLFAANLTDASMIRINLMKANL 147
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI---DLAQ 209
S + L NLTNA+L ++L R++L AI+ GA S A + DL Q
Sbjct: 148 SWS-----TLKAVNLTNAILSESLLERANLNQAILSGAMLSGANLTGADLRQ 194
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 65/131 (49%), Gaps = 14/131 (10%)
Query: 103 AQFGSADLRKA----VHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A A+LR+ V + E N AN D+ ++ SG+ GA L ++A + N T
Sbjct: 50 ANLTGANLREVKLAGVDLTEANLSEANLIGTDLSRANLSGANLMGANLRGSMAREVNMTK 109
Query: 158 ADLSDTLMDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
A+L++ + EANL T+A ++R L +++L + ++ + ++A++ ++
Sbjct: 110 ANLTEANLTEANFTEANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAIL----SES 165
Query: 213 LCKYANGTNPI 223
L + AN I
Sbjct: 166 LLERANLNQAI 176
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 5/95 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR+ V N AN ++A++R ++ S S A L A Y+A ++L
Sbjct: 183 SGANLTGADLRQVTMVGANLSEANLSNANLRVANVSWSTLAKANLSGANLYRAKLCWSNL 242
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S ++ VL +ANL RT +DL AI+
Sbjct: 243 SGAVLLEAVLIDANLN-----RTNFRDADLRRAIM 272
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVAYK 152
A +A+L A ++ N +AN T+A + ES + N A L A+
Sbjct: 125 ANLFAANLTDASMIRINLMKANLSWSTLKAVNLTNAILSESLLERANLNQAILSGAMLSG 184
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
AN TGADL M L+EANL+NA L ++ S L A + GA+
Sbjct: 185 ANLTGADLRQVTMVGANLSEANLSNANLRVANVSWSTLAKANLSGANL 232
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 56/102 (54%), Gaps = 7/102 (6%)
Query: 108 ADLRKAVHVKE-NFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTGADLS 161
ADL K V++ E NF R+NF A+++ ++ +G+ K G L +A +AN G DLS
Sbjct: 25 ADL-KGVNLSEANFTRSNFQDANLKGANLTGANLREVKLAGVDLTEANLSEANLIGTDLS 83
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L ANL ++ +T+++L A + A+F++A
Sbjct: 84 RANLSGANLMGANLRGSMAREVNMTKANLTEANLTEANFTEA 125
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 5/124 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+ F A+L+ A N R D+ E++ S + G L +A AN GA+L
Sbjct: 40 SNFQDANLKGANLTGANLREVKLAGVDLTEANLSEANLIGTDLSRANLSGANLMGANLRG 99
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ---ALCKYANG 219
++ + + +ANLT A L T ++L A + D S I+L + + K N
Sbjct: 100 SMAREVNMTKANLTEANLTEANFTEANLFAANL--TDASMIRINLMKANLSWSTLKAVNL 157
Query: 220 TNPI 223
TN I
Sbjct: 158 TNAI 161
>gi|307155293|ref|YP_003890677.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985521|gb|ADN17402.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 145
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR A N A+ AD+R ++ SG+ A LE A AN GADL+ ++
Sbjct: 42 DLRGA-----NLSAAHLIGADLRNANLSGANLVEANLEGADLTGANLQGADLTGAMVTNA 96
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
LN +NL + +L +D+ GA++EG + +A I
Sbjct: 97 SLNNSNLKDVNFTNAMLYDADVTGALMEGLNLKNAQI 133
>gi|218442709|ref|YP_002381029.1| hypothetical protein PCC7424_5734 [Cyanothece sp. PCC 7424]
gi|218175067|gb|ACK73799.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 266
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 66/143 (46%), Gaps = 26/143 (18%)
Query: 113 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 172
AV K N A +A+++ +D G+ GAYL A + TGA+L D + L
Sbjct: 125 AVGPKANLNGAFLNTANLKNADLKGANLRGAYLSGA-----DLTGANLEDAALSGANLQG 179
Query: 173 ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID-----------LAQ------KQALCK 215
A LT A L + L ++L GA + AD +DA ++ LAQ K LC
Sbjct: 180 ALLTGAYLRKARLIGAELQGADLRAADLTDANLEQLQNLAGADFTLAQGLTEDTKAMLCS 239
Query: 216 YAN---GT-NPITGVSTRKSLGC 234
GT NP T +T +SLGC
Sbjct: 240 RPAQELGTWNPFTRSNTAQSLGC 262
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 55/113 (48%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+L KA+ +N +AN ++ + D S + + A L A + N GA+L + +
Sbjct: 7 ELTKALSEGKNLAKANLQGINLAQMDLSNADLSAANLIGANLSETNLKGANLEGADLRGV 66
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
L++ANL A L + L RS+L G ++ A A I LA+ + + G N
Sbjct: 67 NLSKANLEGANLQNSYLFRSNLEGCCLKEAQLQGAKIQLARYDSYTVWPEGYN 119
>gi|262196377|ref|YP_003267586.1| pentapeptide repeat-containing protein [Haliangium ochraceum DSM
14365]
gi|262079724|gb|ACY15693.1| pentapeptide repeat protein [Haliangium ochraceum DSM 14365]
Length = 903
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 59/127 (46%), Gaps = 12/127 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADLR A F +AN A++ +++F ++F GA L A AN A L + +
Sbjct: 768 ADLRHA-----GFEQANLVQANLIQANFGYARFLGADLRGAQLLGANLQDAKLQNANLQG 822
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVS 227
L ANL A L L +DL GA + A+ S A AQ K+ +G +P
Sbjct: 823 ANLQGANLQGAKLQNANLQGADLQGADLRAANLSAANFLGAQYSTETKWPDGVDP----- 877
Query: 228 TRKSLGC 234
++LGC
Sbjct: 878 --EALGC 882
Score = 42.0 bits (97), Expect = 0.31, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 42/85 (49%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ RA AD+ +D +G+ + A+LE+A +ANF A L + + L A A
Sbjct: 719 DLARAYLAGADLAGADLAGADLSLAHLERASLERANFRSAKLLYSNLRYADLRHAGFEQA 778
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
LV+ L +++ G A GAD A
Sbjct: 779 NLVQANLIQANFGYARFLGADLRGA 803
>gi|427709341|ref|YP_007051718.1| endoribonuclease L-PSP [Nostoc sp. PCC 7107]
gi|427361846|gb|AFY44568.1| endoribonuclease L-PSP [Nostoc sp. PCC 7107]
Length = 433
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 60/138 (43%), Gaps = 35/138 (25%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFS----------GSKFNGAYL 145
S A S+DL A V N +AN + AD+ ESDF+ G+K
Sbjct: 133 SGANLSSSDLSVASLVGANLNKANLSKADLGDAYLMESDFTLANLTEATLIGAKLQNVKF 192
Query: 146 EKAVAYKANFTGADLSDT--------------------LMDRMVLNEANLTNAVLVRTVL 185
+A Y+ N +G +L+D ++R+ L ANLTNA L L
Sbjct: 193 HRANLYQVNLSGMNLTDVDFTAASLQSTNLIKSRLQGANLERVNLRGANLTNANLDGANL 252
Query: 186 TRSDLGGAIIEGADFSDA 203
R+DL GA I GA F DA
Sbjct: 253 RRADLTGADIYGASFIDA 270
>gi|86606920|ref|YP_475683.1| pentapeptide repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86555462|gb|ABD00420.1| pentapeptide repeat family protein [Synechococcus sp. JA-3-3Ab]
Length = 154
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 59/128 (46%), Gaps = 16/128 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
S AQ A+LR V R A+ + AD+RE D SG+ +GA L A + N G
Sbjct: 32 SGAQLSGANLRGIV-----LRDADLSGADLREGDLSGADLSGADLRGAKLRRVNLIGAKL 86
Query: 158 --ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALC 214
ADL + R L A+L+ A L R L +DL GAII F A+ D
Sbjct: 87 VKADLRGANLYRAKLLRADLSEADLSRADLRIGADLRGAIITNTRFRGALYD-----EYT 141
Query: 215 KYANGTNP 222
K+ G NP
Sbjct: 142 KFPEGFNP 149
>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 830
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 67/133 (50%), Gaps = 12/133 (9%)
Query: 82 ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 137
ALADL + +R F S A+ ADLR+ + + +FR A+ AD RE+
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281
Query: 138 SKFNGAYLEKAVAYKANFTG-----ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG 192
+ F+GA L +A+ + TG A+L+ +++ L+ L +V+ L S+L G
Sbjct: 282 ANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLEGADLSRLALAGVKMVKANLAGSNLYG 341
Query: 193 AIIEGADFSDAVI 205
A + G D +DA +
Sbjct: 342 ADLRGVDLTDASL 354
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 42/80 (52%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N D+R +F+ S+ +G L+ A A+F+ ADL + L EA L +A L R
Sbjct: 163 NLAGLDLRGVNFADSRLHGVNLQGANLRGADFSRADLMHADLSEADLREAKLVDANLARA 222
Query: 184 VLTRSDLGGAIIEGADFSDA 203
L +DLGGA + AD S A
Sbjct: 223 SLALADLGGADLRRADLSRA 242
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRA----------NFTSADMRESDFSGSKFNGAYLEKAVAYK 152
A ADLR+A V N RA + AD+ ++FS ++ A L + + +
Sbjct: 202 ADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQADLRQVLFSE 261
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++F AD L +AN + A L R + + +DL G + + A+ + AV++
Sbjct: 262 SDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLE 315
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 58/126 (46%), Gaps = 26/126 (20%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMR---------------ESDFSGSKFNGAYLEK 147
A F +A+L + + +F +A+FT A++ E++ + + +GA L
Sbjct: 392 ADFRAANLTRVAAQQADFSQADFTGANLTAAVFSEAIMAGAKLLEANLTNANLDGADLTS 451
Query: 148 AVAY-----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
V+ KA+ GADLS+ ++ VL EANL L L R+DL A I
Sbjct: 452 RVSMIRGNLTNASLQKADLHGADLSNAIVTGAVLREANLRRVRLSHASLNRADLSWATIV 511
Query: 197 GADFSD 202
AD S+
Sbjct: 512 DADLSN 517
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 77/180 (42%), Gaps = 38/180 (21%)
Query: 62 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 121
VF LA AV+ + ALA + +A G G A DL A ++ +
Sbjct: 303 VFQQANLAGAVLEGADLSRLALAGVKMVKANLAGSNLYG--ADLRGVDLTDASLLEADLS 360
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAY--------------------KANFTGADL- 160
A+ A + ++ F+G +GA L AVA +A+FTGA+L
Sbjct: 361 AADLAGARLDKAVFAGGTLHGARLLSAVARNADFRAANLTRVAAQQADFSQADFTGANLT 420
Query: 161 ----SDTLMDRMVLNEANLTNAVL-----------VRTVLTRSDLGGAIIEGADFSDAVI 205
S+ +M L EANLTNA L +R LT + L A + GAD S+A++
Sbjct: 421 AAVFSEAIMAGAKLLEANLTNANLDGADLTSRVSMIRGNLTNASLQKADLHGADLSNAIV 480
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 39/82 (47%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DLR A V N R A+ AD+ +D + A L ++ N T ADLS ++
Sbjct: 554 DLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDLRWVNLTDADLSGAILSGA 613
Query: 169 VLNEANLTNAVLVRTVLTRSDL 190
LN+A+ AV LTR+ L
Sbjct: 614 SLNDADFNRAVFAEANLTRASL 635
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+F RA+ AD+ E+D +K A L +A A+ GADL + R ++A L A
Sbjct: 193 DFSRADLMHADLSEADLREAKLVDANLARASLALADLGGADLRRADLSRANFSQARLRQA 252
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
L + + + SD A ADF +A +
Sbjct: 253 DLRQVLFSESDFRHADARRADFREATL 279
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 15/78 (19%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADLR+A N RAN + +D+R + + + +GA L +GA L+D
Sbjct: 573 ADLSNADLRQA-----NLARANLSRSDLRWVNLTDADLSGAIL----------SGASLND 617
Query: 163 TLMDRMVLNEANLTNAVL 180
+R V EANLT A L
Sbjct: 618 ADFNRAVFAEANLTRASL 635
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 63/130 (48%), Gaps = 17/130 (13%)
Query: 110 LRKAV-----HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
LR AV ++ + R AN +A++R++D + + + A L +A +AN + +DL
Sbjct: 540 LRSAVSLGGRMIRYDLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDL---- 595
Query: 165 MDRMV-LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA----VIDLAQKQALCKYANG 219
R V L +A+L+ A+L L +D A+ A+ + A V +L K + A G
Sbjct: 596 --RWVNLTDADLSGAILSGASLNDADFNRAVFAEANLTRASLFNVKNL-DKARMLDQAQG 652
Query: 220 TNPITGVSTR 229
P G +R
Sbjct: 653 YEPKAGDDSR 662
Score = 37.0 bits (84), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 59/148 (39%), Gaps = 36/148 (24%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA---------- 150
S + F AD R+A + R+ANF+ AD+ + FSG+ G ++A
Sbjct: 260 SESDFRHADARRADFREATLRQANFSGADLSRAIFSGTDLTGGVFQQANLAGAVLEGADL 319
Query: 151 --------------------YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
Y A+ G DL+D + L+ A+L A L + V L
Sbjct: 320 SRLALAGVKMVKANLAGSNLYGADLRGVDLTDASLLEADLSAADLAGARLDKAVFAGGTL 379
Query: 191 GG-----AIIEGADFSDA-VIDLAQKQA 212
G A+ ADF A + +A +QA
Sbjct: 380 HGARLLSAVARNADFRAANLTRVAAQQA 407
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 52/128 (40%), Gaps = 30/128 (23%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY---------------- 144
S A ADL A V A+ ++ D+RE++ +G
Sbjct: 496 SHASLNRADLSWATIVD-----ADLSNTDLREANLTGVNLGAGASVLQSLRSAVSLGGRM 550
Query: 145 ----LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT-----VLTRSDLGGAII 195
L A AN ADL+D + L +ANL A L R+ LT +DL GAI+
Sbjct: 551 IRYDLRNANLVNANLRDADLADADLSNADLRQANLARANLSRSDLRWVNLTDADLSGAIL 610
Query: 196 EGADFSDA 203
GA +DA
Sbjct: 611 SGASLNDA 618
>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 405
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 59/118 (50%), Gaps = 12/118 (10%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A+ RG F S A ADLR+A AN + AD+ E++ SG+ GA L A+
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 297
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVR-----TVLTRSDLGGAIIEGADFSDA 203
+ AN GA LS + +L+ ANL A L L+ ++L GAI+ AD +A
Sbjct: 298 WGANLKGAGLSLAFLRGAILSGANLGQADLWEANLSGANLSEANLSGAILWEADLIEA 355
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 56/121 (46%), Gaps = 17/121 (14%)
Query: 93 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
T+ EF A A+L KA+ R +++ D SG+ GA L A +
Sbjct: 202 TKAEFTT-DAKVIKKAELIKAI------REGTIDKTTLQQVDLSGAILRGADLRGAFLSE 254
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDAVIDL 207
AN GADL R L+EANL+ A L L+ +DL GAI+ GA+ A + L
Sbjct: 255 ANLKGADLR-----RAFLSEANLSGADLSEANLSGADLRGAILSGAILWGANLKGAGLSL 309
Query: 208 A 208
A
Sbjct: 310 A 310
>gi|434392917|ref|YP_007127864.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428264758|gb|AFZ30704.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 313
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A+L KA+ + N RAN T AD+ +++ SG + + A L +AV A+
Sbjct: 93 SGVNLWRANLNKAILCEANLSRANLDEANLTGADLSKANLSGIQLSKANLTEAVIVDAHL 152
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L++T + R L L A L+ + LT +DL A +EGA+ S+A
Sbjct: 153 NRANLTETKLMRSHLCGTQLERAELIASDLTAADLSRANLEGANLSEA 200
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 110 LRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
L++A+ + R+ AD+ +++ + ++FNG++L AN TGADLS
Sbjct: 37 LKRAILEATDLSRSILVGADLNGVILKQATMTATRFNGSHLVGVDLTAANLTGADLSGVN 96
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R ANL A+L L+R++L A + GAD S A
Sbjct: 97 LWR-----ANLNKAILCEANLSRANLDEANLTGADLSKA 130
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 15/116 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA--VAYKA---NFTG 157
A+ ++DL A + N AN + A++ +++ SG+ G L +A +A KA N G
Sbjct: 175 AELIASDLTAADLSRANLEGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLRG 234
Query: 158 ADLSDTLMDRMVLNEA----------NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L + L EA NL+ A L R +LT +L AI+ GA+ DA
Sbjct: 235 ANLEQAELITTNLTEADLSWANLSKTNLSGADLHRAILTDVNLNSAILRGANLIDA 290
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 68/154 (44%), Gaps = 36/154 (23%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFT--------------------SADMRESDFSGSKF 140
S Q A+L +AV V + RAN T ++D+ +D S +
Sbjct: 133 SGIQLSKANLTEAVIVDAHLNRANLTETKLMRSHLCGTQLERAELIASDLTAADLSRANL 192
Query: 141 NGAYLEKAVAYKANFTGADLSDTLMDR--MV--------LNEANLTNAVLVRTVLTRSDL 190
GA L +A +AN +GA+L+ + R ++ L ANL A L+ T LT +DL
Sbjct: 193 EGANLSEANLSQANLSGANLTGVNLHRANLIAAKAILANLRGANLEQAELITTNLTEADL 252
Query: 191 GGA-----IIEGADFSDAVI-DLAQKQALCKYAN 218
A + GAD A++ D+ A+ + AN
Sbjct: 253 SWANLSKTNLSGADLHRAILTDVNLNSAILRGAN 286
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 42/77 (54%), Gaps = 10/77 (12%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVA----------YKANFTGADLSDTLMDRMVLNEA 173
+ T+A++ +D SG A L KA+ +AN TGADLS + + L++A
Sbjct: 81 DLTAANLTGADLSGVNLWRANLNKAILCEANLSRANLDEANLTGADLSKANLSGIQLSKA 140
Query: 174 NLTNAVLVRTVLTRSDL 190
NLT AV+V L R++L
Sbjct: 141 NLTEAVIVDAHLNRANL 157
>gi|428306403|ref|YP_007143228.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247938|gb|AFZ13718.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 276
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 62/127 (48%), Gaps = 8/127 (6%)
Query: 87 NKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N YEA G G++ +F A L+ + + NF+ A + S+ + GA
Sbjct: 146 NLYEARLSGALLSGASLNGVKFSRAFLKDVDLNGADLQGINFSEARLGGSNLESANLVGA 205
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGA 198
L A Y+ N T ADLS + R L +ANLT A L + LT + L GA ++GA
Sbjct: 206 DLSDAHLYQVNLTAADLSGANLIRASLEQANLTWINLSKANLCQANLTNAILKGANLDGA 265
Query: 199 DFSDAVI 205
D +DA++
Sbjct: 266 DLTDAIL 272
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SA+LR A + N A+ A++RE++ SG N A L +A A +GA L
Sbjct: 103 SGANLNSANLRGANLREANLSSASLQRANLREANLSGVNLNWANLYEARLSGALLSGASL 162
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 203
+ R L + +L A L + + LGG+ +E GAD SDA
Sbjct: 163 NGVKFSRAFLKDVDLNGADLQGINFSEARLGGSNLESANLVGADLSDA 210
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVA 150
S A+ A+L + V +K + RAN A++ ++D + + GA +EKA
Sbjct: 38 SGAKLMGANLSRTVMIKSDLSRANLNWANLSFAKMSAVKLGDADLTKANLQGAVMEKAKL 97
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+A +GA+L+ + L EANL++A L R L ++L G + A+ +A
Sbjct: 98 PRAKLSGANLNSANLRGANLREANLSSASLQRANLREANLSGVNLNWANLYEA 150
>gi|209523090|ref|ZP_03271646.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376002100|ref|ZP_09779947.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423066405|ref|ZP_17055195.1| hypothetical protein SPLC1_S430130 [Arthrospira platensis C1]
gi|209496241|gb|EDZ96540.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375329486|emb|CCE15700.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406712077|gb|EKD07268.1| hypothetical protein SPLC1_S430130 [Arthrospira platensis C1]
Length = 336
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 9/105 (8%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + V +F+ AN +A ++E++ GS F L+ A +KAN T + +
Sbjct: 140 ADLEQVTLVDTDFKEANLKTAKLQEANLKGSTFELTQLQGANLWKANLQECFFLLTQLQK 199
Query: 168 MVLNEANLTNAV-----LVRTVLTRSDLGGAII----EGADFSDA 203
+ LN ANL NA L+ L +++L GA I +GA+F +A
Sbjct: 200 VNLNAANLENAELQGVNLLEANLQQANLQGAYILGNLQGANFQEA 244
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSA---------DMRESDFSGSKFNGAYLEKAVAY 151
+AA +A+L+ ++ N ++AN A + +E++ G+ GAYL+ A
Sbjct: 203 NAANLENAELQGVNLLEANLQQANLQGAYILGNLQGANFQEANLKGANLQGAYLQDANFK 262
Query: 152 KANFTGADLSDTLMDRMVLNEANLTNAVL 180
+AN G +L D + + EANL +A L
Sbjct: 263 RANLRGVNLKDANLTGVNFEEANLQSANL 291
>gi|157825867|ref|YP_001493587.1| hypothetical protein A1C_04030 [Rickettsia akari str. Hartford]
gi|157799825|gb|ABV75079.1| Uncharacterized low-complexity protein [Rickettsia akari str.
Hartford]
Length = 954
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 59/121 (48%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKANLDKANLEYADLTNATLTNATAQFVKLSNATLEKAEA-----EGLNISDV 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS-----DAVIDLAQ-KQALCKYA 217
+ + EAN N ++ R LT++D A++E AD DA+ A KQA K A
Sbjct: 610 IATNINAKEANFKNVIMQRADLTKADFTKAMLENADMQAVEALDAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 39.3 bits (90), Expect = 1.8, Method: Composition-based stats.
Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 12/147 (8%)
Query: 62 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR 121
+F S L +++C+ + + N + A + A F ADL+ + +
Sbjct: 363 LFASANLENINISNCNLDFTNFEGANLHNAVFQDV--TARNAVFLFADLKNSKIENSDMS 420
Query: 122 RANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMDRMVLN 171
RA D+ E++ + SKFN + A A K +N TG L+ M R+ +
Sbjct: 421 RAYMPKVDLSEAEVTNSKFNAIMMVNADAEKLIMQDSEWQNSNLTGISLAYADMQRVQMQ 480
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGA 198
L NA+L + + +DL A + A
Sbjct: 481 GVILNNALLDQANIVSTDLENAFMNNA 507
>gi|113476307|ref|YP_722368.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110167355|gb|ABG51895.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 225
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 46/88 (52%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
+ NF N +A+M ++ +G F GA L A AN TGA+L + R +N ANL
Sbjct: 50 TRANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADINRANL 109
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDA 203
TN L T L +DL A + A+ +DA
Sbjct: 110 TNTNLTSTRLLEADLTLANLNHANLTDA 137
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 1/117 (0%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +L+ A N NF AD+ ++ SG+ GA LEKA Y+A+ A+L++
Sbjct: 52 ANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADINRANLTN 111
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL-CKYAN 218
T + L EA+LT A L LT ++L A + GA + A ++ A +YAN
Sbjct: 112 TNLTSTRLLEADLTLANLNHANLTDANLLEARLWGASLAGANLNNASLHGTNLEYAN 168
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 9/97 (9%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD---------RMV 169
N AN T A++ E+ G+ GA L A + N A+L+D+ + R
Sbjct: 128 NLNHANLTDANLLEARLWGASLAGANLNNASLHGTNLEYANLADSNLSGADFHSFSFRSY 187
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
E NL+NA L ++T +DL G I+ G D I+
Sbjct: 188 KQETNLSNANLEGAIITNTDLSGVIMRGTIMPDGSIN 224
>gi|154251684|ref|YP_001412508.1| pentapeptide repeat-containing protein [Parvibaculum
lavamentivorans DS-1]
gi|154155634|gb|ABS62851.1| pentapeptide repeat protein [Parvibaculum lavamentivorans DS-1]
Length = 363
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 10/93 (10%)
Query: 121 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
+RA+FT D+ DFS + GA+ +A+ ANF ++ +L A+ +NA+L
Sbjct: 272 QRADFTRMDLSRKDFSRAVLAGAHFREAILADANF----------EKAILAAADFSNAIL 321
Query: 181 VRTVLTRSDLGGAIIEGADFSDAVIDLAQKQAL 213
R L +DL GA + GAD +A D +K L
Sbjct: 322 FRANLAGADLRGADLRGADLKNARQDDTKKGEL 354
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 37/77 (48%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSD 189
M+E D SG F +FTG DL D L AN +A L RT +R+D
Sbjct: 62 MKECDLSGLDFRNLNFSHGHFIGCDFTGCDLEDAHFSGANLFSANFDHANLTRTNFSRAD 121
Query: 190 LGGAIIEGADFSDAVID 206
L GA E A+ +DA +D
Sbjct: 122 LRGANFEDAEMADAQLD 138
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 16/104 (15%)
Query: 103 AQFGSADLRKAVHVK---------EN--FRRANFTSADMRE-----SDFSGSKFNGAYLE 146
AQ ADLR+ ++ EN FR A +M E +DF G+ +GA L+
Sbjct: 135 AQLDGADLRRGAVIRRGASAPVGRENSSFRGARMYGTNMAECKLLDADFEGASISGASLQ 194
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
A ANF GA+L + L +A+ AV+ + R D+
Sbjct: 195 GADLRGANFAGAELKGVELSGANLADADFRRAVMDEATIARGDM 238
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+FR NF+ DF+G A+ A + ANF A+L+ T R L AN +A
Sbjct: 71 DFRNLNFSHGHFIGCDFTGCDLEDAHFSGANLFSANFDHANLTRTNFSRADLRGANFEDA 130
Query: 179 VLVRTVLTRSDL-GGAII 195
+ L +DL GA+I
Sbjct: 131 EMADAQLDGADLRRGAVI 148
>gi|428225932|ref|YP_007110029.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427985833|gb|AFY66977.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 180
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S + SA LR + N R AN AD+++S+ G+ A L A +A+ GADL
Sbjct: 66 SKSNLYSAKLRGSDLGLANLREANLGDADLKQSNLRGADLRNANLLGASLIEADLRGADL 125
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
D ANLTNA L L +++L GA++ G F AV+
Sbjct: 126 RD----------ANLTNANLDGADLRQTNLQGAVLTGVSFRGAVL 160
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 24/132 (18%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N ++N SA +R SD + A L A ++N GADL + ANL A
Sbjct: 64 NLSKSNLYSAKLRGSDLGLANLREANLGDADLKQSNLRGADLRN----------ANLLGA 113
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANGTNPITGVSTRKSLGCGNS 237
L+ +DL GA + A+ ++A +D A +Q + A +TGVS R ++ CG +
Sbjct: 114 SLI-----EADLRGADLRDANLTNANLDGADLRQTNLQGA----VLTGVSFRGAVLCGAT 164
Query: 238 RRNA----YGSP 245
N YG P
Sbjct: 165 MPNGLAARYGCP 176
>gi|189347104|ref|YP_001943633.1| pentapeptide repeat-containing protein [Chlorobium limicola DSM
245]
gi|189341251|gb|ACD90654.1| pentapeptide repeat protein [Chlorobium limicola DSM 245]
Length = 408
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ SA LR A+ V+ + +A +AD+ ++ + F GA+++ AV KA+ TGAD S
Sbjct: 92 ARLDSAVLRSALLVRASLDKARLHNADLEDAVLEAASFKGAFMQTAVLKKADCTGADFSG 151
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L E N A L +LT +DL + AD S +V+
Sbjct: 152 A-----DLRETNFREARLAGALLTGADLRATYLWRADMSRSVL 189
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 44/78 (56%)
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
R +D SG++ G L A A+ +GADL+ + + + L+ A L +AVL +L R+ L
Sbjct: 50 RVADLSGAQLKGMNLRGADLSYADLSGADLASSDLSKARLDHARLDSAVLRSALLVRASL 109
Query: 191 GGAIIEGADFSDAVIDLA 208
A + AD DAV++ A
Sbjct: 110 DKARLHNADLEDAVLEAA 127
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 1/120 (0%)
Query: 105 FGSADLRKAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
F D+RK E N R+A F ++ +D + ++ GA KA + A+ ADLS
Sbjct: 285 FAWNDMRKRNRAMEVNLRQAKFDQKNLSYADLAHARLQGASFRKADLFDADLRNADLSGC 344
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPI 223
M L +A+L A L L R++LG A + G S + + K+A K+A + +
Sbjct: 345 DMREANLEKADLGGADLSGVNLWRANLGRARLNGVKVSASTVLDTGKKADQKWAERHDAV 404
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 52/126 (41%), Gaps = 20/126 (15%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N Y A G S AQ +LR A + A+ + AD+ SD S ++ + A L+
Sbjct: 41 NSYRAGLGGRVADLSGAQLKGMNLRGA-----DLSYADLSGADLASSDLSKARLDHARLD 95
Query: 147 KAVAY----------KANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
AV KA ADL D VL A+ A + VL ++D GA
Sbjct: 96 SAVLRSALLVRASLDKARLHNADLEDA-----VLEAASFKGAFMQTAVLKKADCTGADFS 150
Query: 197 GADFSD 202
GAD +
Sbjct: 151 GADLRE 156
>gi|427418755|ref|ZP_18908938.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425761468|gb|EKV02321.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 312
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 95 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
G + AQ DLR+A N A+FT A + S+ + + GA E A + N
Sbjct: 170 GSYAQLYMAQIQGCDLRQA-----NLNHADFTQAVLTRSNLNQATLIGANGEAATLEQVN 224
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI------DLA 208
T A+LS + L ANL AV+V LT S+L A ++ A +AVI D+
Sbjct: 225 LTRANLS-----HVNLTSANLQQAVMVHATLTESNLSEANLQNATLDNAVIRQCYLRDIN 279
Query: 209 QKQALCKYANGTNPI 223
+QA + + PI
Sbjct: 280 WQQASVQGTHFCQPI 294
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 15/119 (12%)
Query: 103 AQFGSADLRKAVHVKENF-----RRANFTS-----ADMRESDFSGSKFNGAYLEKAVAYK 152
AQ +L A+ N RRAN A + ++ S + GA L +A YK
Sbjct: 68 AQMSDVNLSGAILTSANLTATSLRRANLLGAVLMFATLEQATLSHANLAGANLTEAELYK 127
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
ANFT ADLS ++ R A+L NA R + + + GA + G D S A + +AQ Q
Sbjct: 128 ANFTEADLSHAMLRR-----ASLVNANFHRACMKQVNANGAELYGIDGSYAQLYMAQIQ 181
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 50/104 (48%), Gaps = 10/104 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADL A+ + + ANF A M++ + +G++ G A Y A G DL
Sbjct: 128 ANFTEADLSHAMLRRASLVNANFHRACMKQVNANGAELYGIDGSYAQLYMAQIQGCDLR- 186
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ANL +A + VLTRS+L A + GA+ A ++
Sbjct: 187 ---------QANLNHADFTQAVLTRSNLNQATLIGANGEAATLE 221
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
EN R +FT AD+ SG+ L A N +GA L+ + L ANL
Sbjct: 43 ENLERVDFTRADL-----SGANLERTQLIDAQMSDVNLSGAILTSANLTATSLRRANLLG 97
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDA 203
AVL+ L ++ L A + GA+ ++A
Sbjct: 98 AVLMFATLEQATLSHANLAGANLTEA 123
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 10/104 (9%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F ADL A N R A M + + SG+ A L +AN GA L
Sbjct: 50 FTRADLSGA-----NLERTQLIDAQMSDVNLSGAILTSANLTATSLRRANLLGAVLMFAT 104
Query: 165 MDRMVLNEANL-----TNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+++ L+ ANL T A L + T +DL A++ A +A
Sbjct: 105 LEQATLSHANLAGANLTEAELYKANFTEADLSHAMLRRASLVNA 148
>gi|428214427|ref|YP_007087571.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002808|gb|AFY83651.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 155
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ S D+ ++D SG + A L A +AN TGA+LS + L EANLT+A L T
Sbjct: 61 DLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANLQNT 120
Query: 184 VLTRSDLGGAI-----IEGADFSDAVID 206
T +DL AI + GADF+ A +D
Sbjct: 121 NFTSADLEDAILTNANVTGADFTGADLD 148
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 11/109 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S S DL KA + AN T+AD+ E++ +G+ + A L A +AN T A+L
Sbjct: 58 SGCDLQSVDLEKADLSGVDLSNANLTNADLEEANLTGANLSTADLTNADLEEANLTDANL 117
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+T N T+A L +LT +++ GA GAD D+VI L +
Sbjct: 118 QNT----------NFTSADLEDAILTNANVTGADFTGADL-DSVIGLTR 155
>gi|428200510|ref|YP_007079099.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427977942|gb|AFY75542.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 174
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLR+A + AN + AD++E++ SG+ + A L AV KAN +GA L
Sbjct: 60 ASLDRADLREACLIV-----ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRG 114
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
++ + L E+NL A L L +DL GA + AD S
Sbjct: 115 AILAGVNLAESNLRGANLQGANLYGADLRGADLRNADLS 153
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 9/136 (6%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KF 140
L +Y A R +F S DL +A N RAN + A +R ++ SG+
Sbjct: 7 LTRYAAGER-DF---SRIDLHGVDLAQAKLSGANLIRANLSGALLRGANLSGAFLVVASL 62
Query: 141 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ A L +A AN +GADL + + L+EA LT AVL + L+ + L GAI+ G +
Sbjct: 63 DRADLREACLIVANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRGAILAGVNL 122
Query: 201 SDAVIDLAQKQALCKY 216
+++ + A Q Y
Sbjct: 123 AESNLRGANLQGANLY 138
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL++A N A T A +++++ SG+K GA L ++N GA+L
Sbjct: 75 ANLSGADLKEANLSGANLSEAVLTGAVLQKANLSGAKLRGAILAGVNLAESNLRGANLQG 134
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ L A+L NA L RT L ++L
Sbjct: 135 ANLYGADLRGADLRNADLSRTNLRGANL 162
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 5/83 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 157
A A+L +AV ++AN + A +R + +G S GA L+ A Y A+ G
Sbjct: 85 ANLSGANLSEAVLTGAVLQKANLSGAKLRGAILAGVNLAESNLRGANLQGANLYGADLRG 144
Query: 158 ADLSDTLMDRMVLNEANLTNAVL 180
ADL + + R L ANL ++
Sbjct: 145 ADLRNADLSRTNLRGANLERTIM 167
>gi|378826441|ref|YP_005189173.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
HH103]
gi|365179493|emb|CCE96348.1| BTB/POZ domain-containing protein KCTD9 [Sinorhizobium fredii
HH103]
Length = 250
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 11/122 (9%)
Query: 103 AQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A +A+L KA V+ + +ANF+ + DFSG GA + +A+FTG
Sbjct: 88 ADLTAANLEKATLVRASLAGAKADKANFSRVEGYRGDFSGISAEGALFVSSELQRADFTG 147
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGA-DFSDAVIDLAQKQ 211
A L+ ++ L AN AV+ T L+R+DL GA+ EG DF A + L + +
Sbjct: 148 ARLTGADFEKAELGRANFGKAVVTGTRFSVANLSRADLSGAVFEGPIDFDRAFLFLTRIE 207
Query: 212 AL 213
L
Sbjct: 208 GL 209
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 47/111 (42%), Gaps = 5/111 (4%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
G A + + R+ + + + N D +D +G+ A LEKA +A+ GA
Sbjct: 50 GPGADWQDCNKRQLMLGGSDLKGGNLVDTDFASTDLNGADLTAANLEKATLVRASLAGAK 109
Query: 160 LSDTLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
R+ + + A+ V + L R+D GA + GADF A +
Sbjct: 110 ADKANFSRVEGYRGDFSGISAEGALFVSSELQRADFTGARLTGADFEKAEL 160
>gi|354565480|ref|ZP_08984655.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549439|gb|EHC18881.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 182
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 66/135 (48%), Gaps = 16/135 (11%)
Query: 86 LNKYEAETRGEFGIG-----------SAAQFGSADLRKAVHVKENFRRANFTSADMRESD 134
L++YE R G+ S A F ADL A + N NF+ A++ ++D
Sbjct: 7 LSRYETGERDFVGVNLHKVNLREVDLSGANFCGADLSGADLSQANLSGCNFSRANLTDAD 66
Query: 135 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
+ + NGA L + N GADL + ++ L+ A+L A LVR LT+++L A
Sbjct: 67 LTRADLNGANLS-----EINLIGADLINANLEGTNLSRADLRGANLVRANLTKANLSEAE 121
Query: 195 IEGADFSDAVIDLAQ 209
+ GAD S A ++ A
Sbjct: 122 LSGADLSGANLNQAN 136
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL +A N N AD+ ++ G+ + A L A +AN T A+L
Sbjct: 58 SRANLTDADLTRADLNGANLSEINLIGADLINANLEGTNLSRADLRGANLVRANLTKANL 117
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
S+ + L+ ANL A L+ T L ++L G I GA ++
Sbjct: 118 SEAELSGADLSGANLNQANLIETNLNEAELNGVNITGATVTE 159
>gi|427716392|ref|YP_007064386.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427348828|gb|AFY31552.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 521
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 71/149 (47%), Gaps = 10/149 (6%)
Query: 84 ADLNKYEAETRG--------EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 135
ADL+ EA RG E +A G DL A R+AN + A++ +D
Sbjct: 140 ADLS--EATLRGASLTGANLEMANLNATDMGRTDLSGANLRDTELRQANLSHANLSGADL 197
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
SG+ A L KA A+ +GA LS + L+ ANLTNA L+ LT++ L A
Sbjct: 198 SGANLRWADLSKANLRWADLSGAKLSGATLIGADLSNANLTNASLIHANLTQAKLIKAEW 257
Query: 196 EGADFSDAVIDLAQKQALCKYANGTNPIT 224
GAD + A++ A+ A ++ T +T
Sbjct: 258 IGADLTGAILTGAKLYATSRFGLKTEGMT 286
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 71/149 (47%), Gaps = 13/149 (8%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSA-------------AQFGSADLRKAV 114
LA+A++ + S N++ L + A+ RG I + A ADLR+A
Sbjct: 72 LASAILNNTSLNVANLIRADLSRAQLRGASLIRAELIRAELSRADLFEANLSGADLREAT 131
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
+ N RRA+ + A +R + +G+ A L + + +GA+L DT + + L+ AN
Sbjct: 132 LRQANLRRADLSEATLRGASLTGANLEMANLNATDMGRTDLSGANLRDTELRQANLSHAN 191
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L+ A L L +DL A + AD S A
Sbjct: 192 LSGADLSGANLRWADLSKANLRWADLSGA 220
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N AN + A + + SG+ A L AN ADLS R L A+L A
Sbjct: 51 NLSEANLSDAKLNVARLSGANLASAILNNTSLNVANLIRADLS-----RAQLRGASLIRA 105
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
L+R L+R+DL A + GAD +A +
Sbjct: 106 ELIRAELSRADLFEANLSGADLREATL 132
Score = 37.0 bits (84), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 14/128 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A SA L N RA+ + A +R + ++ A L +A ++AN +GADL
Sbjct: 68 SGANLASAILNNTSLNVANLIRADLSRAQLRGASLIRAELIRAELSRADLFEANLSGADL 127
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLT----------RSDLGGAIIEGADFSDAVIDLAQK 210
+ + + L A+L+ A L LT +D+G + GA+ D + +
Sbjct: 128 REATLRQANLRRADLSEATLRGASLTGANLEMANLNATDMGRTDLSGANLRDTEL----R 183
Query: 211 QALCKYAN 218
QA +AN
Sbjct: 184 QANLSHAN 191
>gi|254486622|ref|ZP_05099827.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
gi|214043491|gb|EEB84129.1| hypothetical protein RGAI101_1279 [Roseobacter sp. GAI101]
Length = 200
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 18/112 (16%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
G ADL A + A T A++ S+ SG+ GAYLE A A TGADL+
Sbjct: 94 NLGGADLSGA-----DLTGAVLTQANLEMSNLSGATLTGAYLELANLAGARVTGADLT-- 146
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCK 215
+ANLT+A L VL + L GA++ GAD A + + LCK
Sbjct: 147 --------KANLTSANLRGAVLLEAKLVGAVLLGADLDGASL---EGAILCK 187
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 58/113 (51%), Gaps = 16/113 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADLR+ + K ++ + + +D++ D +G+ GA L A + A+ + ADLS
Sbjct: 4 AAFDEADLRQLLDTKV-CQKCDLSGSDLKGVDLAGANLAGANLSGAKLWAADLSKADLSG 62
Query: 163 TLMDRMVLN----------EANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ L +ANL+ A L T ++LGGA + GAD + AV+
Sbjct: 63 VNLEAATLTAANLAGANLADANLSGAYL-----TTTNLGGADLSGADLTGAVL 110
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 52/123 (42%), Gaps = 20/123 (16%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG--------------------SKF 140
S + DL A N A +AD+ ++D SG +
Sbjct: 26 SGSDLKGVDLAGANLAGANLSGAKLWAADLSKADLSGVNLEAATLTAANLAGANLADANL 85
Query: 141 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+GAYL A+ +GADL+ ++ + L +NL+ A L L ++L GA + GAD
Sbjct: 86 SGAYLTTTNLGGADLSGADLTGAVLTQANLEMSNLSGATLTGAYLELANLAGARVTGADL 145
Query: 201 SDA 203
+ A
Sbjct: 146 TKA 148
>gi|307150734|ref|YP_003886118.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306980962|gb|ADN12843.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 231
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 52/105 (49%), Gaps = 15/105 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F +D R + K NF A F AD E+ G+ F+ A LEKA+ + + +GA
Sbjct: 33 SRADFSYSDFRSSRLGKTNFSAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGA-- 90
Query: 161 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADF 200
+L EANLT L++ L+ + L GAI+ ADF
Sbjct: 91 --------ILTEANLTQVNLIKATLGGANLSLAQLPGAIVYEADF 127
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 2/117 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA F AD +A+ +F +AN A +RE D SG+ A L + KA GA+L
Sbjct: 53 SAACFLGADFSEAILWGTDFSKANLEKAILREVDLSGAILTEANLTQVNLIKATLGGANL 112
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ--KQALCK 215
S + ++ EA+ RT LT+++L A + A + A + AQ LC+
Sbjct: 113 SLAQLPGAIVYEADFRPTSEQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCR 169
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 18/121 (14%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A A L A+ + +FR R N T A++ ++ S +K NGA L +A A
Sbjct: 110 ANLSLAQLPGAIVYEADFRPTSEQRTNLTQANLSAANLSYAKLNGANLYQAQLMNAQLCR 169
Query: 158 ADLSDTLMDRMV---LNEANLTNA----------VLVRTVLTRSDLGGAIIEGADFSDAV 204
ADLS + + L+EANL NA +L LT +DL G I+ D + A+
Sbjct: 170 ADLSKGIWQNCLPTDLSEANLQNADLSYADLSGAILCYADLTGADLTGTILTNVDLTGAI 229
Query: 205 I 205
+
Sbjct: 230 L 230
>gi|167042950|gb|ABZ07664.1| putative Pentapeptide repeats (8 copies) [uncultured marine
crenarchaeote HF4000_ANIW137N18]
Length = 842
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDT 163
DL + + + + NF D+ D +G++ +G L + + AN A L +
Sbjct: 711 DLTQTILRRADLSHTNFAGVDLIGVDLTGARLVGVDLSGKDLTQTILINANLENATLKNA 770
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ L+ ANLTNA L T+L ++L AI+ G+D ++AV+ A
Sbjct: 771 KLLNANLDSANLTNADLRNTLLVDTNLSNAILTGSDLTNAVLTRA 815
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 30/97 (30%), Positives = 45/97 (46%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
++D R + +FT A++ +D SG G L A+ NFTG DLS
Sbjct: 472 LSNSDFRWSNFDTAKISNVDFTRANLSYTDLSGRDMTGTILTGAIISYTNFTGVDLSGKD 531
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ +L +L+N L T+LT +DL + G D S
Sbjct: 532 LTGTILTGVDLSNVDLSGTILTGADLSHTNLTGVDLS 568
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 38/78 (48%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NFT D+ D +G+ G L TGADLS T + + L+ +LT +L
Sbjct: 521 NFTGVDLSGKDLTGTILTGVDLSNVDLSGTILTGADLSHTNLTGVDLSGKDLTGTILTGV 580
Query: 184 VLTRSDLGGAIIEGADFS 201
L+ DL G I+ GAD S
Sbjct: 581 DLSNVDLSGTILTGADLS 598
Score = 45.1 bits (105), Expect = 0.038, Method: Composition-based stats.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 10/84 (11%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A L+ A + N AN T+AD+R + + + A L TG+DL++
Sbjct: 760 ANLENATLKNAKLLNANLDSANLTNADLRNTLLVDTNLSNAIL----------TGSDLTN 809
Query: 163 TLMDRMVLNEANLTNAVLVRTVLT 186
++ R +L A+L NA+L +LT
Sbjct: 810 AVLTRAILTGADLENAILTNAILT 833
Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats.
Identities = 25/77 (32%), Positives = 36/77 (46%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
T D+ D SG+ GA L + +G DL+ T++ L+ NLT L T+
Sbjct: 577 LTGVDLSNVDLSGTILTGADLSHTNLTGVDLSGKDLTQTILTGADLSHTNLTGVYLSGTI 636
Query: 185 LTRSDLGGAIIEGADFS 201
LT +DL + G D S
Sbjct: 637 LTGADLSHTNLTGVDLS 653
Score = 43.5 bits (101), Expect = 0.097, Method: Composition-based stats.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 7/85 (8%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
T AD+ ++ +G +G L + +G DL++T +L A+L N L T+
Sbjct: 637 LTGADLSHTNLTGVDLSGKDLTGTRLAGVDLSGKDLTET-----ILTGADLLNVDLTGTI 691
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQ 209
LTR+DL + G D SD DL Q
Sbjct: 692 LTRADLSHTNLTGVDLSDN--DLTQ 714
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 25/99 (25%), Positives = 46/99 (46%), Gaps = 3/99 (3%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
T D+ D SG+ GA L + +G DL+ T++ + L+ +L+ +L
Sbjct: 537 LTGVDLSNVDLSGTILTGADLSHTNLTGVDLSGKDLTGTILTGVDLSNVDLSGTILTGAD 596
Query: 185 LTRSDLGGAIIEGADFSDAVI---DLAQKQALCKYANGT 220
L+ ++L G + G D + ++ DL+ Y +GT
Sbjct: 597 LSHTNLTGVDLSGKDLTQTILTGADLSHTNLTGVYLSGT 635
Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats.
Identities = 31/108 (28%), Positives = 44/108 (40%), Gaps = 5/108 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I + + DL + + N T D+ D +G+ G L TGA
Sbjct: 536 ILTGVDLSNVDLSGTILTGADLSHTNLTGVDLSGKDLTGTILTGVDLSNVDLSGTILTGA 595
Query: 159 DLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFS 201
DLS T + + L+ +LT +L T LT L G I+ GAD S
Sbjct: 596 DLSHTNLTGVDLSGKDLTQTILTGADLSHTNLTGVYLSGTILTGADLS 643
Score = 38.9 bits (89), Expect = 2.7, Method: Composition-based stats.
Identities = 27/107 (25%), Positives = 43/107 (40%), Gaps = 5/107 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I + + DL + + N T D+ D + + GA L N TG
Sbjct: 576 ILTGVDLSNVDLSGTILTGADLSHTNLTGVDLSGKDLTQTILTGADLSHT-----NLTGV 630
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
LS T++ L+ NLT L LT + L G + G D ++ ++
Sbjct: 631 YLSGTILTGADLSHTNLTGVDLSGKDLTGTRLAGVDLSGKDLTETIL 677
Score = 38.5 bits (88), Expect = 2.9, Method: Composition-based stats.
Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
T AD+ D +G+ L +A N TG DLSD + + +L A+L++
Sbjct: 677 LTGADLLNVDLTGT-----ILTRADLSHTNLTGVDLSDNDLTQTILRRADLSHTNFAGVD 731
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQ 209
L DL GA + G D S DL Q
Sbjct: 732 LIGVDLTGARLVGVDLSGK--DLTQ 754
Score = 37.7 bits (86), Expect = 6.1, Method: Composition-based stats.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 7/86 (8%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
T AD+ ++ +G +G L + + TGADLS T + + L+ LT A L T
Sbjct: 592 LTGADLSHTNLTGVDLSGKDLTQTI-----LTGADLSHTNLTGVYLSGTILTGADLSHTN 646
Query: 185 LTRSDLGGAIIEGADFSDAVIDLAQK 210
LT DL G + G A +DL+ K
Sbjct: 647 LTGVDLSGKDLTGTRL--AGVDLSGK 670
>gi|197106790|ref|YP_002132167.1| pentapeptide repeat-containing protein [Phenylobacterium zucineum
HLK1]
gi|196480210|gb|ACG79738.1| pentapeptide repeat family protein [Phenylobacterium zucineum HLK1]
Length = 412
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 85/208 (40%), Gaps = 47/208 (22%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A L A+ + N RA AD+RE+D G+ A L A AN +GA+L+
Sbjct: 65 ADLAGVKLEGAMLARANLSRAILFGADLREADLRGANMKRADLRGACLKGANLSGAELAG 124
Query: 163 --------TLMDRM------------------VLNEANLTNAVLVRTVLTRSD-----LG 191
L D++ VL+ ANL A + TV SD L
Sbjct: 125 CDLREGRIALQDKLDGFRILRHEHRPGELNYAVLSGANLAGAQMAGTVAMASDFTDANLT 184
Query: 192 GAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGSPS--SPL 249
GA++ GA + AV+D A L G +TGVS ++++ G + A S +
Sbjct: 185 GAVLAGARLTRAVLDGAD---LSGADLGGADLTGVSLKRAVLAGANLDQARLEDVDLSEV 241
Query: 250 LSAPP-----------QKLLDRDGFCDS 266
L APP + L D + +CDS
Sbjct: 242 LRAPPPIVYVDDRSLEEVLADHEAYCDS 269
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ D+R++D +G K GA L +A +A GADL + L AN+ A L
Sbjct: 56 SLEGVDLRDADLAGVKLEGAMLARANLSRAILFGADLREA-----DLRGANMKRADLRGA 110
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQK 210
L ++L GA + G D + I L K
Sbjct: 111 CLKGANLSGAELAGCDLREGRIALQDK 137
>gi|428320809|ref|YP_007118691.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244489|gb|AFZ10275.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 290
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 62/120 (51%), Gaps = 9/120 (7%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
L +YE R +F + A A+L A+ V N RAN + A++ + + ++ NGA L
Sbjct: 7 LKEYENGNR-DF---ADANLSGANLSGAILVGVNLSRANLSGANLSRAHLTKAELNGANL 62
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
Y+AN + A + + L +ANL+ A LV+ L R+ L GA + G++ A++
Sbjct: 63 -----YRANLSFAKMGQARLADAELTKANLSGAFLVKAKLPRAKLSGAQLIGSNLRSAIL 117
>gi|359460749|ref|ZP_09249312.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 299
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 10/111 (9%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA-----DLSDT 163
DLR A +N + A+ A+++ ++ G + A LE A AN + A L+ T
Sbjct: 63 DLRGANLQDQNLKGASLQGANLQGANLQGVNLDDANLESANLKSANLSKATLRRASLTTT 122
Query: 164 LMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVIDLAQ 209
L L +ANLT A LV+T L R++L A +E ADFS AV++ Q
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQ 173
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 108 ADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+DLR+A K N + A +A++ E++ G+K A L+ A ADL
Sbjct: 181 SDLREANFNKSNLKNATLNQVYLANANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRK 240
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ + L+EA+L++A L + + +L GA + AD S A +
Sbjct: 241 ASLESVNLSEADLSSAHLGKIAMKDVNLRGANLSNADLSGAKL 283
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 58/122 (47%), Gaps = 14/122 (11%)
Query: 99 IGSAAQFGSADLRKAVHVKE-----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAY-- 151
+ A A+L +A VK + RRAN A + ++DFS + + V +
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQGIRRVYFSD 182
Query: 152 --KANFTGADLSDTLMDRMVL-----NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ANF ++L + ++++ L +EANL A L + L ++L GA + AD A
Sbjct: 183 LREANFNKSNLKNATLNQVYLANANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRKAS 242
Query: 205 ID 206
++
Sbjct: 243 LE 244
>gi|332706458|ref|ZP_08426519.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332354342|gb|EGJ33821.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 345
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTN 177
A+F AD++E DFS A L +A ++ N GA+L + R L +ANL+N
Sbjct: 231 ADFRGADLKERDFSNRNLQSANLSQANLKDAFLHRVNLAGANLEGANLFRANLFQANLSN 290
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A L L +D+ GA + GAD S A +
Sbjct: 291 ANLREANLIGADMSGADLSGADLSGAKV 318
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 10/98 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F ADL++ N + AN + A+++++ GA LE A ++AN A+L
Sbjct: 229 SGADFRGADLKERDFSNRNLQSANLSQANLKDAFLHRVNLAGANLEGANLFRANLFQANL 288
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
S+ ANL A L+ ++ +DL GA + GA
Sbjct: 289 SN----------ANLREANLIGADMSGADLSGADLSGA 316
>gi|158338763|ref|YP_001519940.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158309004|gb|ABW30621.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 299
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 58/111 (52%), Gaps = 10/111 (9%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-VAY----KANFTGADLSDT 163
DLR A +N + A+ A+++ ++ G + A LE A + Y KA A L+ T
Sbjct: 63 DLRGANLQDQNLKGASLQGANLQGANLQGVNLDDANLESANLKYANLSKATLRRASLTTT 122
Query: 164 LMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVIDLAQ 209
L L +ANLT A LV+T L R++L A +E ADFS AV++ Q
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQ 173
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 53/103 (51%), Gaps = 5/103 (4%)
Query: 108 ADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+DLR+A K N + A ++A++ E++ G+K A L+ A ADL
Sbjct: 181 SDLREANFNKSNLKNATLNQVYLSNANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRK 240
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ + L+EA+L++A L + + +L GA + AD S A +
Sbjct: 241 ASLESVNLSEADLSSAHLGKIAMKDVNLRGANLSNADLSGAKL 283
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 14/122 (11%)
Query: 99 IGSAAQFGSADLRKAVHVKE-----NFRRANFTSADMRESDFSGSKFNGAY--------- 144
+ A A+L +A VK + RRAN A + ++DFS +
Sbjct: 123 LKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVVETTQGIRRVYFSD 182
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
L +A K+N A L+ + L+EANL A L + L ++L GA + AD A
Sbjct: 183 LREANFNKSNLKNATLNQVYLSNANLSEANLKGAKLKQAQLKYTNLNGAKLNNADLRKAS 242
Query: 205 ID 206
++
Sbjct: 243 LE 244
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 54/127 (42%), Gaps = 18/127 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A L + N + AN T A + ++ G+ A L +A A+F+ A +
Sbjct: 110 SKATLRRASLTTTLKQATNLQDANLTQATLVKTKLKGADLRRANLFEATLEDADFSVAVV 169
Query: 161 SDTLMDRMV---------LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
T R V N++NL NA L + L+ ++L A ++GA KQ
Sbjct: 170 ETTQGIRRVYFSDLREANFNKSNLKNATLNQVYLSNANLSEANLKGAKL---------KQ 220
Query: 212 ALCKYAN 218
A KY N
Sbjct: 221 AQLKYTN 227
>gi|379019292|ref|YP_005295526.1| hypothetical protein RPK_04435 [Rickettsia rickettsii str. Hlp#2]
gi|376331872|gb|AFB29106.1| hypothetical protein RPK_04435 [Rickettsia rickettsii str. Hlp#2]
Length = 959
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.27, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 37.7 bits (86), Expect = 4.7, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 161
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 162 DTLMDRMVLNEANLTNAVLVRTVL 185
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
Score = 37.7 bits (86), Expect = 5.8, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M ++ ++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKFIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|379712563|ref|YP_005300902.1| hypothetical protein RSA_04485 [Rickettsia philipii str. 364D]
gi|376329208|gb|AFB26445.1| hypothetical protein RSA_04485 [Rickettsia philipii str. 364D]
Length = 959
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.27, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 38.9 bits (89), Expect = 2.7, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVVQNV--TARNAGFLFADLKKSKIE 415
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 161
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 162 DTLMDRMVLNEANLTNAVLVRTVL 185
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
>gi|378721493|ref|YP_005286380.1| hypothetical protein RPL_04520 [Rickettsia rickettsii str.
Colombia]
gi|376326517|gb|AFB23756.1| hypothetical protein RPL_04520 [Rickettsia rickettsii str.
Colombia]
Length = 959
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.27, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 37.7 bits (86), Expect = 4.7, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 161
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 162 DTLMDRMVLNEANLTNAVLVRTVL 185
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
Score = 37.7 bits (86), Expect = 5.7, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M ++ ++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKFIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|354567192|ref|ZP_08986362.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353543493|gb|EHC12951.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 206
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 5/91 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
+ R NF AD+R ++ SG+ GA L +A + NF ADLS + L +ANL A
Sbjct: 39 DLSRINFKGADLRSANLSGAILTGANLREANLQQVNFCDADLS-----QADLTQANLCGA 93
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L R L+ S L GA + AD +A + AQ
Sbjct: 94 CLWRVQLSDSQLWGASLCNADLREADLSAAQ 124
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 47/81 (58%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
A+ +AD+RE+D S ++ A L +A +AN T A L +++ N+ANLTNA L
Sbjct: 108 ASLCNADLREADLSAAQLIEASLVEANLVRANLTKAKLCGSVLIEANFNQANLTNADLKW 167
Query: 183 TVLTRSDLGGAIIEGADFSDA 203
T L ++ A +E A+F +A
Sbjct: 168 TNLMAANFSEANLENANFKNA 188
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 55/103 (53%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR+A + NF A+ + AD+ +++ G+ L + + A+ ADL
Sbjct: 56 SGAILTGANLREANLQQVNFCDADLSQADLTQANLCGACLWRVQLSDSQLWGASLCNADL 115
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L EA+L A LVR LT++ L G+++ A+F+ A
Sbjct: 116 READLSAAQLIEASLVEANLVRANLTKAKLCGSVLIEANFNQA 158
>gi|157828677|ref|YP_001494919.1| hypothetical protein A1G_04555 [Rickettsia rickettsii str. 'Sheila
Smith']
gi|165933396|ref|YP_001650185.1| hypothetical protein RrIowa_0959 [Rickettsia rickettsii str. Iowa]
gi|378722841|ref|YP_005287727.1| hypothetical protein RPO_04530 [Rickettsia rickettsii str. Arizona]
gi|378724195|ref|YP_005289079.1| hypothetical protein RPM_04500 [Rickettsia rickettsii str. Hauke]
gi|379016252|ref|YP_005292487.1| hypothetical protein RPN_02430 [Rickettsia rickettsii str. Brazil]
gi|379017982|ref|YP_005294217.1| hypothetical protein RPJ_04485 [Rickettsia rickettsii str. Hino]
gi|157801158|gb|ABV76411.1| hypothetical protein A1G_04555 [Rickettsia rickettsii str. 'Sheila
Smith']
gi|165908483|gb|ABY72779.1| hypothetical protein RrIowa_0959 [Rickettsia rickettsii str. Iowa]
gi|376324776|gb|AFB22016.1| hypothetical protein RPN_02430 [Rickettsia rickettsii str. Brazil]
gi|376327865|gb|AFB25103.1| hypothetical protein RPO_04530 [Rickettsia rickettsii str. Arizona]
gi|376330548|gb|AFB27784.1| hypothetical protein RPJ_04485 [Rickettsia rickettsii str. Hino]
gi|376333210|gb|AFB30443.1| hypothetical protein RPM_04500 [Rickettsia rickettsii str. Hauke]
Length = 959
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.27, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 37.7 bits (86), Expect = 4.7, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L + +++C+ + + N A + A F ADL+K+
Sbjct: 359 LKN-TLFASANLESVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 415
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 161
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 416 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKFIIKDSEWKNSNLTGISLAYADMQ 475
Query: 162 DTLMDRMVLNEANLTNAVLVRTVL 185
M +VLN A L A +V T L
Sbjct: 476 RVQMQGVVLNNALLDQANIVSTNL 499
Score = 37.7 bits (86), Expect = 5.8, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M ++ ++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKFIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|34581546|ref|ZP_00143026.1| hypothetical protein [Rickettsia sibirica 246]
gi|28262931|gb|EAA26435.1| unknown [Rickettsia sibirica 246]
Length = 957
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 607
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 667
Query: 218 N 218
N
Sbjct: 668 N 668
Score = 42.0 bits (97), Expect = 0.27, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 689
Score = 40.4 bits (93), Expect = 0.79, Method: Composition-based stats.
Identities = 26/109 (23%), Positives = 53/109 (48%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 382 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 441
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M +++++ ++ TN+ L L +D+ ++G ++A++D A
Sbjct: 442 MVNADAEKLIIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 490
Score = 37.0 bits (84), Expect = 9.9, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 60/144 (41%), Gaps = 18/144 (12%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L +++C+ + + N A + A F ADL+K+
Sbjct: 357 LKN-TLFASANLENVKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLKKSKIE 413
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTG-----ADLS 161
+ RA D+ E + + SKFN + A A K +N TG AD+
Sbjct: 414 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWTNSNLTGISLAYADMQ 473
Query: 162 DTLMDRMVLNEANLTNAVLVRTVL 185
M +VLN A L A +V T L
Sbjct: 474 RVQMQGVVLNNALLDQANIVSTNL 497
>gi|163760882|ref|ZP_02167961.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
gi|162281926|gb|EDQ32218.1| hypothetical protein HPDFL43_07047 [Hoeflea phototrophica DFL-43]
Length = 239
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 5/103 (4%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
S DLR++ N AN A + + +GSK GA ++ AY+A+F+ D +
Sbjct: 71 STDLRES-----NLIEANLEKATLFRASLAGSKATGARFDRIEAYRADFSNLDATGASFG 125
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ A L N++L T T++DLG A +GAD S + LA
Sbjct: 126 SAEMQRAKLNNSMLANTDFTKADLGRAQFDGADISGSRFSLAN 168
>gi|162450992|ref|YP_001613359.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
gi|161161574|emb|CAN92879.1| hypothetical protein sce2720 [Sorangium cellulosum So ce56]
Length = 579
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 155
+ A+ A+LR+A+ R AD+ ++D G+ GA LE+A+ AN
Sbjct: 286 TGAELTGANLRRALLQGAILRGQRLAGADLEMTLLVDADLEGADLQGARLERAILDGANL 345
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
GADL+ L+ + +L A L +L + + R DL G ++G
Sbjct: 346 RGADLTRALLLQTLLRGAALDGVILDKAIFDRVDLTGTDLQG 387
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 5/104 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A DL A N RRA A +R G + GA LE + A+ GADL
Sbjct: 278 AMLAGCDLTGAELTGANLRRALLQGAILR-----GQRLAGADLEMTLLVDADLEGADLQG 332
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
++R +L+ ANL A L R +L ++ L GA ++G A+ D
Sbjct: 333 ARLERAILDGANLRGADLTRALLLQTLLRGAALDGVILDKAIFD 376
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 52/99 (52%), Gaps = 5/99 (5%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR A + + R A FT +D+R + G+ +GA L +A A+ GADL+ TL+
Sbjct: 66 LRGASLDRCDLRGATFTGSDLRGARLRGANLSGAKLLRANLAGADLAGADLTATLLLGAD 125
Query: 170 LNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDA 203
L A LT A L R L ++L GA+++GA + A
Sbjct: 126 LTGARLTGAKLDRIRLDFAKLPGAELAGAVLQGASLNKA 164
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 45/97 (46%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL + + E F T D+R G++ GA L+ +A+ G+DL DT R
Sbjct: 5 DLARRLRAGEPFAGKTITRFDLRGKQLGGARLRGAKLKDIHLDEADLAGSDLQDTQWFRC 64
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L A+L L T SDL GA + GA+ S A +
Sbjct: 65 PLRGASLDRCDLRGATFTGSDLRGARLRGANLSGAKL 101
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA-----NLTN 177
AN + +R ++ + GA LE+A+ + TGA+L+ + R +L A L
Sbjct: 253 ANLAGSSLRGTNLRNANLRGANLEQAMLAGCDLTGAELTGANLRRALLQGAILRGQRLAG 312
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A L T+L +DL GA ++GA A++D
Sbjct: 313 ADLEMTLLVDADLEGADLQGARLERAILD 341
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 18/111 (16%)
Query: 104 QFGSADLR----KAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGAD 159
Q G A LR K +H+ E A+ +D++++ + GA L++ A FTG+D
Sbjct: 30 QLGGARLRGAKLKDIHLDE----ADLAGSDLQDTQWFRCPLRGASLDRCDLRGATFTGSD 85
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVI 205
L L ANL+ A L+R L +DL GA ++ GAD + A +
Sbjct: 86 LRGA-----RLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARL 131
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ DLR+A +F +NFT AD+R +D S A L +A +A+ +GA +
Sbjct: 403 AKLAGMDLREA-----DFTGSNFTRADLRGADLRSSVLTRATLMEADLARADLSGATAKE 457
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L A +A L R TR+DL A + GAD D V+
Sbjct: 458 AFFGDAALAGARARDARLRRATFTRADLDHADLSGADLGDVVM 500
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 48/108 (44%), Gaps = 5/108 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L +A+ N R A+ T A + ++ G+ +G L+KA+ + + TG DL
Sbjct: 328 ADLQGARLERAILDGANLRGADLTRALLLQTLLRGAALDGVILDKAIFDRVDLTGTDLQG 387
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIEGADFSDAVI 205
+ M + + A L L +D G A + GAD +V+
Sbjct: 388 VRLAGMTMTQCCFIEAKLAGMDLREADFTGSNFTRADLRGADLRSSVL 435
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 5/109 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTG 157
A+ A+L A ++ N A+ AD+ + D +G++ GA L++ A G
Sbjct: 89 ARLRGANLSGAKLLRANLAGADLAGADLTATLLLGADLTGARLTGAKLDRIRLDFAKLPG 148
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A+L+ ++ LN+A+LT A+L +T S A + GAD A ++
Sbjct: 149 AELAGAVLQGASLNKADLTRALLRDARITGSTFYDARLGGADLGGATLE 197
>gi|158341491|ref|YP_001522656.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
MBIC11017]
gi|158311732|gb|ABW33342.1| peptidase C14, caspase catalytic subunit p20 [Acaryochloris marina
MBIC11017]
Length = 1037
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 58/108 (53%), Gaps = 4/108 (3%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
SADLR A+ ++ N N ++ ++ +D S + + A L A +AN +GADL +T +
Sbjct: 884 SADLRNAILIRANLFSTNLSNVNLYSADLSSTDMSSANLSNADLIRANLSGADLHNTDLF 943
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
L+ ANL+NA L +L S+L E + + A ++ A+ +C
Sbjct: 944 YANLSNANLSNANLSNAILLSSNLR----ETKNLTQAQLEGAEHPLIC 987
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 10/109 (9%)
Query: 103 AQFGSADLRKAVHVKEN----------FRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 152
A+ ADLR A+ ++ N F A+ AD+R +D + + FN A L
Sbjct: 805 AKLRHADLRSAILIRANLFAADLNFTDFSDADLRYADLRRTDLNFTDFNHANLNFTKLGN 864
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
AN G +LSD + L A+L NA+L+R L ++L + AD S
Sbjct: 865 ANLNGTNLSDANLIGTNLYSADLRNAILIRANLFSTNLSNVNLYSADLS 913
>gi|374319450|ref|YP_005065949.1| hypothetical protein Rsl_930 [Rickettsia slovaca 13-B]
gi|383751452|ref|YP_005426553.1| hypothetical protein MC3_04505 [Rickettsia slovaca str. D-CWPP]
gi|360041999|gb|AEV92381.1| hypothetical protein Rsl_930 [Rickettsia slovaca 13-B]
gi|379774466|gb|AFD19822.1| hypothetical protein MC3_04505 [Rickettsia slovaca str. D-CWPP]
Length = 959
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A LEKA A G ++SD
Sbjct: 555 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEA-----EGLNISDA 609
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A+ A KQA K A
Sbjct: 610 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIFKEANLKQANLKAA 669
Query: 218 N 218
N
Sbjct: 670 N 670
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 584 ATAQF--AKLSNATLEKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 641
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA A L + L ++L G EGADF A I+ A K
Sbjct: 642 ENADMQAVEAAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATK 691
Score = 38.5 bits (88), Expect = 3.4, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F A+L+ AV R A F AD+++S S + AY+ K + T + + +
Sbjct: 384 FEGANLQNAVFQNVTARNAGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVM 443
Query: 165 M-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
M +++++ ++ N+ L L +D+ ++G ++A++D A
Sbjct: 444 MVNADAEKLIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQA 492
>gi|86610069|ref|YP_478831.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558611|gb|ABD03568.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 160
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N AD+R +D S + GA L A ++AN GADLS + L+ A L A L R
Sbjct: 67 NLQEADLRGADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHGAYLWEAKLTRA 126
Query: 184 VLTRSDL-----GGAIIEGADFSDAVI 205
L SDL GGA++ GAD A++
Sbjct: 127 QLQGSDLSGAKIGGAVLTGADLRGAIL 153
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 7/98 (7%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L+ +N EA+ RG A SA+L A N AN AD+ +D + +G
Sbjct: 63 LSGINLQEADLRG-------ADLSSANLMGANLRGANLWEANLIGADLSFADLREANLHG 115
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
AYL +A +A G+DLS + VL A+L A+L
Sbjct: 116 AYLWEAKLTRAQLQGSDLSGAKIGGAVLTGADLRGAIL 153
>gi|354567300|ref|ZP_08986470.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353543601|gb|EHC13059.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 1022
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ FG DL A + N AN A++ ++ +G+ A L A AN TGA+L
Sbjct: 863 AGVNFGQIDLSNANFMGANLVGANLQDANLAGANLTGANLTDANLSGANLASANLTGANL 922
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L NLTN L + VL +D AI+ GA FS
Sbjct: 923 TGA-----NLQSTNLTNTCLFQAVLQETDKEIAILNGAIFS 958
Score = 41.6 bits (96), Expect = 0.42, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 38/77 (49%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
N + AD+ ++ +G F L A AN GA+L D + L ANLT+A L
Sbjct: 851 NLSGADLSQAMLAGVNFGQIDLSNANFMGANLVGANLQDANLAGANLTGANLTDANLSGA 910
Query: 184 VLTRSDLGGAIIEGADF 200
L ++L GA + GA+
Sbjct: 911 NLASANLTGANLTGANL 927
>gi|162453209|ref|YP_001615576.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
gi|161163791|emb|CAN95096.1| hypothetical protein sce4933 [Sorangium cellulosum So ce56]
Length = 890
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
A+Q + A E + T AD R D G +F A+LE A A+ +GA L
Sbjct: 552 EASQMARVVVDAAREAGEPLDERDLTGADFRGVDLRGMRFARAFLEGADLRGADLSGAVL 611
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ + L+ ANLT A L L +++L GA+ + AD ++AV+
Sbjct: 612 EGAVLAKADLSGANLTGARLRGANLGKANLEGAVFDDADLTEAVL 656
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A F DLR F RA AD+R +D SG A LE AV KA+ +GA+L
Sbjct: 577 TGADFRGVDLRGM-----RFARAFLEGADLRGADLSG-----AVLEGAVLAKADLSGANL 626
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ + L +ANL AV LT + L GA + GA A ++ A
Sbjct: 627 TGARLRGANLGKANLEGAVFDDADLTEAVLMGARLAGASLKRAKLERA 674
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 50/102 (49%), Gaps = 15/102 (14%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F + DLR A F RA T D+ E+D + + F+ A ++ A+ + N
Sbjct: 777 FRTTDLRGA-----RFDRAQMTMTDLSEADATDATFDRAVMKNALLIRTN---------- 821
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+DR L NLT A+L ++ L +D GA + ADFS A D
Sbjct: 822 LDRASLRGCNLTEAILSKSRLAGADFTGAQLCRADFSRARGD 863
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 16/117 (13%)
Query: 100 GSAAQFGSADLRKAVHV-KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
G+ A+F A + V V K A+F A + ++ F + GA ++A + + A
Sbjct: 741 GAKARFAGARFSEGVAVHKSGLPEADFRDAVLDKTCFRTTDLRGARFDRAQMTMTDLSEA 800
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGG-----AIIE-----GADFSDAVI 205
D +D DR V+ NA+L+RT L R+ L G AI+ GADF+ A +
Sbjct: 801 DATDATFDRAVMK-----NALLIRTNLDRASLRGCNLTEAILSKSRLAGADFTGAQL 852
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL-MDRMVLNEANLTN 177
N RA + +DFSG++ + L KA F GA S+ + + + L EA+ +
Sbjct: 710 NLERAMLLECSLDGTDFSGARLHKTSLMSCTGAKARFAGARFSEGVAVHKSGLPEADFRD 769
Query: 178 AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQA 212
AVL +T +DL GA + A + + DL++ A
Sbjct: 770 AVLDKTCFRTTDLRGARFDRAQMT--MTDLSEADA 802
>gi|302039057|ref|YP_003799379.1| hypothetical protein NIDE3778 [Candidatus Nitrospira defluvii]
gi|300607121|emb|CBK43454.1| conserved exported protein of unknown function [Candidatus
Nitrospira defluvii]
Length = 476
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 53/104 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L+ A+ + RA+F AD++ +D S + Y A K N T ADL+
Sbjct: 158 AAFYGANLQGALFREALLERAHFEDADLQGADLSNATLLDGYFYGANLSKTNLTDADLAG 217
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
T + R L +ANL A L +L ++L GA + AD A +D
Sbjct: 218 TDLRRTNLRQANLRRANLQGALLDSANLDGASLIEADLESAYLD 261
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADLRKA+ VK + R A ++ G+ F A LE+A A+ GADLS+
Sbjct: 133 ANLEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSN 192
Query: 163 -TLMDRMV----LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
TL+D L++ NLT+A L T L R++L A + A+ A++D
Sbjct: 193 ATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLD 241
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 15/121 (12%)
Query: 100 GSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKAN 154
G A DLR+ V N R N A ++R + + GA +AV AN
Sbjct: 75 GRRANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDAN 134
Query: 155 FTGADLSDTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
GADL L+ + LN ANL A+ +L R+ A ++GAD S+A
Sbjct: 135 LEGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAHFEDADLQGADLSNAT 194
Query: 205 I 205
+
Sbjct: 195 L 195
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 46/103 (44%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SAA A+L AV + RA+ AD+ E + A L +A AN ADL
Sbjct: 326 SAANLHGANLHHAVLIGTQLARADLRKADLTEIYGPNAHLQQARLSEANLELANLVAADL 385
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + V+ + NL L L+ SDL GA++ AD A
Sbjct: 386 SQADISHAVVVQTNLQETNLRGANLSASDLTGALLNNADLGQA 428
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A + F AN + ++ ++D +G+ L +A +AN GA L
Sbjct: 183 ADLQGADLSNATLLDGYFYGANLSKTNLTDADLAGTDLRRTNLRQANLRRANLQGALLDS 242
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+D L EA+L +A L L +DL A + GADF
Sbjct: 243 ANLDGASLIEADLESAYLDDASLANADLHEASLRGADF 280
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 63/138 (45%), Gaps = 32/138 (23%)
Query: 81 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 140
++LA+ + +EA RG AD R N +R N +A+M + S+
Sbjct: 263 ASLANADLHEASLRG------------ADFRFTHLGGANLQRVNLENANMEGATLVKSRL 310
Query: 141 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG--------- 191
+ A L V YKAN + A+ L+ ANL +AVL+ T L R+DL
Sbjct: 311 DSATLTMTVLYKANLSAAN----------LHGANLHHAVLIGTQLARADLRKADLTEIYG 360
Query: 192 -GAIIEGADFSDAVIDLA 208
A ++ A S+A ++LA
Sbjct: 361 PNAHLQQARLSEANLELA 378
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 15/104 (14%)
Query: 121 RRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
RRAN D+R+ + G+ G+ L A +A+ GAD S ++D L
Sbjct: 76 RRANLCRTDLRQLRLVGANLERINLEGAILKGSNLRTASLVQAHLKGADFSQAVLDDANL 135
Query: 171 NEANLTNAVLVRTVLTR-----SDLGGAIIEGADFSDAVIDLAQ 209
A+L A+LV+ L R + GA ++GA F +A+++ A
Sbjct: 136 EGADLRKALLVKAHLNRIAADEAAFYGANLQGALFREALLERAH 179
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A + A+ A +R +DF + GA L++ AN GA L
Sbjct: 248 ASLIEADLESAYLDDASLANADLHEASLRGADFRFTHLGGANLQRVNLENANMEGATLVK 307
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ +D +A L TVL +++L A + GA+ AV+
Sbjct: 308 SRLD----------SATLTMTVLYKANLSAANLHGANLHHAVL 340
>gi|153873268|ref|ZP_02001907.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
gi|152070268|gb|EDN68095.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
Length = 159
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 49/84 (58%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
F RAN + D+ +D SG+ +GA L +A ANFT A+LS+ + +ANLT+A
Sbjct: 46 FFRANLSHVDLTNTDLSGANLSGANLNEANLTNANFTKANLSEANLCESYFAKANLTDAN 105
Query: 180 LVRTVLTRSDLGGAIIEGADFSDA 203
L LT++ L + + GA+ S+A
Sbjct: 106 LSEANLTKAYLIESFLSGANLSEA 129
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 10/106 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L +A NF +AN + A++ ES Y KA AN + A+L
Sbjct: 62 SGANLSGANLNEANLTNANFTKANLSEANLCES----------YFAKANLTDANLSEANL 111
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ + L+ ANL+ A L R+ L SDL A + GA+ A D
Sbjct: 112 TKAYLIESFLSGANLSEANLFRSNLFESDLFRANLTGANLYKAKFD 157
>gi|434393337|ref|YP_007128284.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428265178|gb|AFZ31124.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 213
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 99 IGSAAQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
I + F DL +A K N R NFT A + ++D SGS + L +A A
Sbjct: 105 IATQVGFLETDLERANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNA 164
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
N +GA+LS+ + R L ANL A L L+R+ L GAI+
Sbjct: 165 NLSGANLSNADLSRADLRNANLIGANLDGANLSRAKLEGAIM 206
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 51/91 (56%)
Query: 116 VKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 175
++ + RAN ++R+ D S + F A LEKA +N + +LS + L+ ANL
Sbjct: 112 LETDLERANLKKVNLRDRDLSYTNFTKAKLEKADLSGSNLSHTNLSRAKLRNANLSGANL 171
Query: 176 TNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+NA L R L ++L GA ++GA+ S A ++
Sbjct: 172 SNADLSRADLRNANLIGANLDGANLSRAKLE 202
>gi|428301952|ref|YP_007140258.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238496|gb|AFZ04286.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 267
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 54/113 (47%), Gaps = 15/113 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L+ A+ RAN A++ +D SG+ A LEKA N GA L
Sbjct: 57 SKANLQGANLQGAILNYALLGRANLEGANLSNADLSGTFLGEANLEKA-----NLQGAKL 111
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA----------IIEGADFSDA 203
S + + L ANL+NA L T LTR++L GA I+ AD DA
Sbjct: 112 SQAFLYKANLEGANLSNAYLSGTALTRANLRGANLRKSVIFVSILSEADLQDA 164
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 61/122 (50%), Gaps = 7/122 (5%)
Query: 87 NKYEAETRGEFGIGSA----AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
N A+ G F +G A A A L +A K N AN ++A + + + + G
Sbjct: 85 NLSNADLSGTF-LGEANLEKANLQGAKLSQAFLYKANLEGANLSNAYLSGTALTRANLRG 143
Query: 143 AYLEKAVAYKANFTGADLSD-TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
A L K+V + + + ADL D LM+ +L +NL A L R LT++ L AI++ A+ +
Sbjct: 144 ANLRKSVIFVSILSEADLQDANLMEAKLL-SSNLERANLARANLTKAQLHNAILQDANLT 202
Query: 202 DA 203
A
Sbjct: 203 QA 204
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 115 HVKE----------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
HVK+ + A+ A + +++ G+ GA L A+ +AN GA+LS+
Sbjct: 31 HVKQLLNTNSCPSCDLSNADLYGAKLSKANLQGANLQGAILNYALLGRANLEGANLSNAD 90
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L EANL A L L+++ L A +EGA+ S+A +
Sbjct: 91 LSGTFLGEANLEKANLQGAKLSQAFLYKANLEGANLSNAYL 131
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LRK+V + + AD+++++ +K + LE+A +AN T A L +
Sbjct: 139 ANLRGANLRKSV-----IFVSILSEADLQDANLMEAKLLSSNLERANLARANLTKAQLHN 193
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+L +ANLT A LV+ L ++ L A + AD + A++ A
Sbjct: 194 A-----ILQDANLTQAKLVKAELNQASLARANLLNADLTGAILQQA 234
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S A A+L +A + N RAN A++ + A L A+ AN T A
Sbjct: 155 ILSEADLQDANLMEAKLLSSNLERANLARANLTK----------AQLHNAILQDANLTQA 204
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
L +++ L ANL NA L +L ++ L A I GA F +A +
Sbjct: 205 KLVKAELNQASLARANLLNADLTGAILQQATLYLANINGAIFKEAFL 251
>gi|427714529|ref|YP_007063153.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427378658|gb|AFY62610.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 333
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 18/125 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G A+ R+A + R AN T AD+ ES + +GA LEKA+ A+ T ADL
Sbjct: 51 SFALLGRANFRRANLAGADLRGANLTQADLTESLLQEANLHGASLEKAILVGADITLADL 110
Query: 161 SDTLMDRMVLNEANLTNAVLV----------------RTVLTRSDLGGAIIEGADFSDAV 204
+D + L +ANL++ V RTVL +DL A + A+ ++A
Sbjct: 111 TDCNLIEADLRQANLSSTRFVGACFRGANLRKDNYQERTVLRGTDLEKADFQSANLAEA- 169
Query: 205 IDLAQ 209
DLA+
Sbjct: 170 -DLAR 173
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 55/107 (51%), Gaps = 10/107 (9%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYLEKAVAYKANFTGA----- 158
+L+KA N R A F A ++E+DF + F+ A+L++ + KANFT A
Sbjct: 210 NLKKAGLAWANLREARFDRAQLQEADFFQADCYQANFSHAHLDQIIGEKANFTQAIFTKA 269
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
DL + L EA L A L RT LT +DL GA + A+ S ++
Sbjct: 270 DLRRANLRGSTLKEARLIEAYLARTDLTGADLTGANLIRAEISSTLL 316
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 5/83 (6%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A L + + K NF +A FT AD+R ++ GS A L +A + + TGADL+
Sbjct: 244 ANFSHAHLDQIIGEKANFTQAIFTKADLRRANLRGSTLKEARLIEAYLARTDLTGADLTG 303
Query: 163 TLMDR-----MVLNEANLTNAVL 180
+ R +L +ANLT+ +
Sbjct: 304 ANLIRAEISSTLLLDANLTDVTM 326
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 66/145 (45%), Gaps = 6/145 (4%)
Query: 67 ALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT 126
+L A++ ++ L D N EA+ R S+ +F A R A K+N++
Sbjct: 94 SLEKAILVGADITLADLTDCNLIEADLRQ--ANLSSTRFVGACFRGANLRKDNYQERTV- 150
Query: 127 SADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 186
+R +D + F A L +A + N TGA+L + + L ANL A T LT
Sbjct: 151 ---LRGTDLEKADFQSANLAEADLARVNLTGANLKEANLRGADLMGANLERAYCQMTRLT 207
Query: 187 RSDLGGAIIEGADFSDAVIDLAQKQ 211
++L A + A+ +A D AQ Q
Sbjct: 208 DTNLKKAGLAWANLREARFDRAQLQ 232
>gi|428307960|ref|YP_007144785.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428249495|gb|AFZ15275.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 201
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 54/100 (54%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A G ADL +A + N AN A + +D +G+ F A L A ++AN +GA+LS
Sbjct: 95 ANLGGADLIEADLFEANLTGANLIGAKLIGADLTGANFREANLMGADLFEANLSGANLSG 154
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
+ L ANL+ A L+ L+R L GA I+GA+ ++
Sbjct: 155 ANLSGANLTLANLSGANLMGVDLSRVTLMGASIDGANLNN 194
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGA 158
F ADL KA+ + N AN AD+ E++FS + GA L +A ++AN TGA
Sbjct: 56 NFSKADLSKAILMGANLMGANLCEADIMEANFSKANLCEANLGGADLIEADLFEANLTGA 115
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+L + L AN A L+ L ++L GA + GA+ S A + LA
Sbjct: 116 NLIGAKLIGADLTGANFREANLMGADLFEANLSGANLSGANLSGANLTLAN 166
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A F A+L A K N NF+ AD+ ++ G+ GA L +A +ANF+ A+L
Sbjct: 33 SGANFSKANLSGAHFSKANLIGVNFSKADLSKAILMGANLMGANLCEADIMEANFSKANL 92
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + L EA+L A L L + L GA + GA+F +A
Sbjct: 93 CEANLGGADLIEADLFEANLTGANLIGAKLIGADLTGANFREA 135
Score = 43.9 bits (102), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 61/123 (49%), Gaps = 19/123 (15%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKF 140
L +YEA R +F + DL +A + N ANF+ A++ + FS G F
Sbjct: 7 LARYEAGER---------EFHNCDLIEANLIGANLSGANFSKANLSGAHFSKANLIGVNF 57
Query: 141 NGAYLEKAVAYKANFTGADLSDT-LMD----RMVLNEANLTNAVLVRTVLTRSDLGGAII 195
+ A L KA+ AN GA+L + +M+ + L EANL A L+ L ++L GA +
Sbjct: 58 SKADLSKAILMGANLMGANLCEADIMEANFSKANLCEANLGGADLIEADLFEANLTGANL 117
Query: 196 EGA 198
GA
Sbjct: 118 IGA 120
>gi|332704952|ref|ZP_08425038.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
gi|332356304|gb|EGJ35758.1| hypothetical protein LYNGBM3L_00660 [Moorea producens 3L]
Length = 544
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 10/111 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE----------KAVAYK 152
A ADL N A+ + AD+ +D SG+ FN A L +A +
Sbjct: 236 ANLSDADLSDTKLSGANLCDADLSGADLSGADLSGADFNDANLSGADLSSANLIRANLIR 295
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
AN +GA+LSD + L ANL+NA L R++L GA + GAD S+A
Sbjct: 296 ANLSGANLSDVKVIGGNLGNANLSNANFSSAKLIRANLSGADLSGADLSNA 346
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-------- 156
G+A+L A RAN + AD+ +D S + F+GA L A AN +
Sbjct: 313 LGNANLSNANFSSAKLIRANLSGADLSGADLSNANFSGASLYSANLSNANLSSANLRGTE 372
Query: 157 -------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GADL T + L+ ANL+NA L+ + L ++L GA + GA+ A +
Sbjct: 373 LSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASL 428
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRE----------SDFSGSKFNGAYLEKAVAYK 152
A ADL A ++ N RAN + A++ + ++ S + F+ A L +A
Sbjct: 276 ANLSGADLSSANLIRANLIRANLSGANLSDVKVIGGNLGNANLSNANFSSAKLIRANLSG 335
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A+ +GADLS+ L ANL+NA L L ++L GA + GAD
Sbjct: 336 ADLSGADLSNANFSGASLYSANLSNANLSSANLRGTELSGANLSGADL 383
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +A+ A N AN +SA++R ++ SG+ +GA L AN +GA+L
Sbjct: 339 SGADLSNANFSGASLYSANLSNANLSSANLRGTELSGANLSGADLRGTKLSGANLSGANL 398
Query: 161 SDT-LMDRMV----LNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
S+ L+D + L+ ANL+ A L L ++L GA + GA
Sbjct: 399 SNAKLIDSNLRGTELSGANLSGANLRGASLYSANLSGANLRGASL 443
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANF 155
S A ADLR N AN ++A ++R ++ SG+ +GA L A Y AN
Sbjct: 374 SGANLSGADLRGTKLSGANLSGANLSNAKLIDSNLRGTELSGANLSGANLRGASLYSANL 433
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+GA+L L ANL+ A L L+ ++L + G DFS A
Sbjct: 434 SGANLRGA-----SLYSANLSGANLSGANLSLANLCPMRVSGTDFSAA 476
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 54/125 (43%), Gaps = 6/125 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR A N AN A + ++ SG+ +GA L A +G D
Sbjct: 414 SGANLSGANLRGASLYSANLSGANLRGASLYSANLSGANLSGANLSLANLCPMRVSGTDF 473
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYANG 219
S L+ ANL A L R L +DL A + GAD S A ++ A K A Y G
Sbjct: 474 S-----AANLSGANLGGAYLYRADLKDTDLSSANLTGADLSSANLNGADVKNARFGYIVG 528
Query: 220 TNPIT 224
+ T
Sbjct: 529 IDEST 533
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKAN- 154
S A SA+L A N R AN + AD+R + SG+ +GA L A +N
Sbjct: 349 SGASLYSANLSNANLSSANLRGTELSGANLSGADLRGTKLSGANLSGANLSNAKLIDSNL 408
Query: 155 ----FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+GA+LS + L ANL+ A L L ++L GA + GA+ S
Sbjct: 409 RGTELSGANLSGANLRGASLYSANLSGANLRGASLYSANLSGANLSGANLS 459
>gi|218442155|ref|YP_002380484.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218174883|gb|ACK73616.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 180
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 14/126 (11%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
L++Y+A+ R F L +A V N +R NFT D+ SD +G+ +G+ L
Sbjct: 7 LHRYQAQER---------NFEELSLHQANLVGANLQRINFTRTDLSGSDLNGADLSGSCL 57
Query: 146 EKAVA-----YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
++A KAN GA+L + + L EANL A L + L ++L GA + GA+
Sbjct: 58 KQANLTDADLEKANLVGANLVEVNLIGADLKEANLAGADLTKADLRCANLEGANLTGANL 117
Query: 201 SDAVID 206
+ ++
Sbjct: 118 TQVNLE 123
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 44/87 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA V N N AD++E++ +G+ A L A AN TGA+L+
Sbjct: 60 ANLTDADLEKANLVGANLVEVNLIGADLKEANLAGADLTKADLRCANLEGANLTGANLTQ 119
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSD 189
++ L ANL+ A ++ T L +D
Sbjct: 120 VNLEGANLKGANLSEAQIIGTDLNVAD 146
>gi|428774386|ref|YP_007166174.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanobacterium stanieri PCC 7202]
gi|428688665|gb|AFZ48525.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanobacterium stanieri PCC 7202]
Length = 506
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 41/71 (57%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F AD +A V+ N +A+ A+++ +DF + GA LE A YKAN GA+L+D
Sbjct: 420 ANFYHADFSRARLVRANLTKAHLFKAELQYADFRNANLTGANLEGANLYKANLCGANLTD 479
Query: 163 TLMDRMVLNEA 173
+D + L EA
Sbjct: 480 ANIDDIQLQEA 490
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 51/119 (42%), Gaps = 10/119 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A + A +K NF AN A+ +DFS ++ A L KA +KA AD
Sbjct: 393 SNASLPRVNFHHAKFIKTNFEDANLVEANFYHADFSRARLVRANLTKAHLFKAELQYADF 452
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
+ ANLT A L L +++L GA + A+ D + A+ + +G
Sbjct: 453 RN----------ANLTGANLEGANLYKANLCGANLTDANIDDIQLQEAETNWATIFPDG 501
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 47/104 (45%), Gaps = 10/104 (9%)
Query: 118 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTN 177
E F N ++A + +F +KF E A +ANF AD S + R L +A+L
Sbjct: 385 ECFNNFNLSNASLPRVNFHHAKFIKTNFEDANLVEANFYHADFSRARLVRANLTKAHLFK 444
Query: 178 AVLVRTVLTRSDLGGAIIE----------GADFSDAVIDLAQKQ 211
A L ++L GA +E GA+ +DA ID Q Q
Sbjct: 445 AELQYADFRNANLTGANLEGANLYKANLCGANLTDANIDDIQLQ 488
>gi|379713712|ref|YP_005302050.1| hypothetical protein RMB_03905 [Rickettsia massiliae str. AZT80]
gi|376334358|gb|AFB31590.1| hypothetical protein RMB_03905 [Rickettsia massiliae str. AZT80]
Length = 957
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A L+KA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLKKAEA-----EGLNISDA 607
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A++ A KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIVKEANLKQANLKAA 667
Query: 218 N 218
N
Sbjct: 668 N 668
Score = 42.7 bits (99), Expect = 0.19, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLKKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA + A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIVKEANLKQANLKAANLAGINKEGADFDKAEINNATK 689
Score = 37.7 bits (86), Expect = 5.1, Method: Composition-based stats.
Identities = 38/144 (26%), Positives = 61/144 (42%), Gaps = 13/144 (9%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L +++C+ + + N A + A F ADL+K+
Sbjct: 357 LKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLQKSKIE 413
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMD 166
+ RA D+ E + + SKFN + A A K +N TG L+ M
Sbjct: 414 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQ 473
Query: 167 RMVLNEANLTNAVLVRTVLTRSDL 190
R+ + L NA+L + + +DL
Sbjct: 474 RVQMQGVVLNNALLDQANIVSTDL 497
>gi|428202846|ref|YP_007081435.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980278|gb|AFY77878.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 253
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTG 157
A ADL A ++ N AN A++ E+D S S GAYL +A YKA
Sbjct: 110 ANLRRADLSAAKLIRSNLSEANLVDANLNEADLSQSNLYEAEAIGAYLYRATLYKAKLVE 169
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A LS + L EA+L L LT+++LGGA + A+ S A
Sbjct: 170 AHLSKVYLVGADLREAHLYRTDLRYAHLTKANLGGAHLLEANLSGA 215
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 64/138 (46%), Gaps = 7/138 (5%)
Query: 63 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 122
+ + L+ A + + N + L+ N YEAE G + A L KA V+ + +
Sbjct: 122 LIRSNLSEANLVDANLNEADLSQSNLYEAEAIGAY-------LYRATLYKAKLVEAHLSK 174
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AD+RE+ + A+L KA A+ A+LS + + L ANL A L
Sbjct: 175 VYLVGADLREAHLYRTDLRYAHLTKANLGGAHLLEANLSGANLRKANLRGANLQGADLRC 234
Query: 183 TVLTRSDLGGAIIEGADF 200
L ++DL GA ++GA F
Sbjct: 235 ANLHQADLRGANLQGALF 252
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 40/68 (58%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN +A++ E++ SG+ + A L V KAN ADLS + R L+EANL +A L
Sbjct: 80 ANLIAANLSEANLSGADLSHANLIGTVLKKANLRRADLSAAKLIRSNLSEANLVDANLNE 139
Query: 183 TVLTRSDL 190
L++S+L
Sbjct: 140 ADLSQSNL 147
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 50/93 (53%), Gaps = 2/93 (2%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ N + A ++A++ + SG + A L A +AN +GADLS + VL +ANL
Sbjct: 54 QNNLQNAELSNANLVGVNLSGVDLSDANLIAANLSEANLSGADLSHANLIGTVLKKANLR 113
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
A L L RS+L A + A+ ++A DL+Q
Sbjct: 114 RADLSAAKLIRSNLSEANLVDANLNEA--DLSQ 144
>gi|433593191|ref|YP_007282677.1| putative low-complexity protein [Natrinema pellirubrum DSM 15624]
gi|448335744|ref|ZP_21524879.1| hypothetical protein C488_20057 [Natrinema pellirubrum DSM 15624]
gi|433308229|gb|AGB34039.1| putative low-complexity protein [Natrinema pellirubrum DSM 15624]
gi|445615954|gb|ELY69591.1| hypothetical protein C488_20057 [Natrinema pellirubrum DSM 15624]
Length = 644
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F + DLR + N +FT A +RE+ F+ S +GA L +A+ T ADLS+ L
Sbjct: 24 FSNTDLRGTTFGEANLADTDFTEAILREAQFAASDLSGASL-----TQADLTDADLSNAL 78
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L ANL NA L + L + L A +EGA F +A +
Sbjct: 79 APMVNLTGANLRNADLANSDLRQVTLTNAHLEGASFREARL 119
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 15/121 (12%)
Query: 103 AQFGSADLRKAVHVKENFR-----RANFTSADMR----------ESDFSGSKFNGAYLEK 147
A+ SA LR A V + R + +FT D+R E++F G++ A L +
Sbjct: 177 ARLQSATLRGATLVHSDLRSTFCRQTDFTECDLRNVTAERMYAPEAEFDGARLTEANLRQ 236
Query: 148 AVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
A A+F G D S + L+ + ++A L ++DL GA + GAD S A +
Sbjct: 237 AEVTSASFDGVDASGIDVTEADLSATDWSDADLSGATFDQADLSGATLSGADLSGATFNQ 296
Query: 208 A 208
A
Sbjct: 297 A 297
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 84/194 (43%), Gaps = 31/194 (15%)
Query: 38 CQISSKTESDGQFPGPY---AKLKNWRVFVSTALAAAVVAS------CSSNISALADLNK 88
C++ + + + G A L+N R+ +T A +V S C DL
Sbjct: 152 CELDNTSFREADLSGAILQSAALENARLQSATLRGATLVHSDLRSTFCRQTDFTECDLRN 211
Query: 89 YEAET----RGEFGIGSAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSK 139
AE EF A+ A+LR+A +F + T AD+ +D+S +
Sbjct: 212 VTAERMYAPEAEF---DGARLTEANLRQAEVTSASFDGVDASGIDVTEADLSATDWSDAD 268
Query: 140 FNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE--- 196
+GA ++A A +GADLS ++ L +A+L+ A L L+ + L GA++
Sbjct: 269 LSGATFDQADLSGATLSGADLSGATFNQATLKDADLSGADLTDVELSDTALTGALLRETR 328
Query: 197 -------GADFSDA 203
GADF++A
Sbjct: 329 LAPETACGADFTEA 342
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
SA + ADL A F +A+ + A + +D SG+ FN A L+ A A+ T +L
Sbjct: 260 SATDWSDADLSGAT-----FDQADLSGATLSGADLSGATFNQATLKDADLSGADLTDVEL 314
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
SDT + +L E L T +DL GA I F D+
Sbjct: 315 SDTALTGALLRETRLAPETACGADFTEADLTGADISSGQFDDSTF 359
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADL A+ N AN +AD+ SD A+LE A +A GADL
Sbjct: 65 TQADLTDADLSNALAPMVNLTGANLRNADLANSDLRQVTLTNAHLEGASFREARLWGADL 124
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+D + + L A+L + L L++ +L AD S A++
Sbjct: 125 ADADLTVVALAGADLQESTLRGARLSQCELDNTSFREADLSGAIL 169
>gi|193214429|ref|YP_001995628.1| pentapeptide repeat-containing protein [Chloroherpeton thalassium
ATCC 35110]
gi|193087906|gb|ACF13181.1| pentapeptide repeat protein [Chloroherpeton thalassium ATCC 35110]
Length = 694
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 10/107 (9%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL-----EKAVAYKANFT 156
+A ADLR A N + A+ +SA+++ +D S + GA L + AV + AN
Sbjct: 482 SANLQGADLRAA-----NLQGADLSSANLQGADLSSANLQGAVLWLANLQGAVLWLANLQ 536
Query: 157 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GADLSD + VL+ ANL A L L +DL A ++GAD A
Sbjct: 537 GADLSDAKLQGAVLSFANLQGADLRSAKLQGADLRSANLQGADLRSA 583
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 47/99 (47%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F ADLR A + AN AD+ ++ G+ A L+ A AN GADLS
Sbjct: 455 FQGADLRAANLQGADLISANLQGADLISANLQGADLRAANLQGADLSSANLQGADLSSAN 514
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ VL ANL AVL L +DL A ++GA S A
Sbjct: 515 LQGAVLWLANLQGAVLWLANLQGADLSDAKLQGAVLSFA 553
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 45/98 (45%), Gaps = 1/98 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR A + R AN AD+R ++ G+ A L+ A A GADL
Sbjct: 551 SFANLQGADLRSAKLQGADLRSANLQGADLRSANLQGAYLRSANLQGAYLRSAKLQGADL 610
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEG 197
S+ + L+ A L A L + ++D GA +G
Sbjct: 611 SEANLQGADLDSAKLQGAYLRNIEIDEKTDFNGATADG 648
>gi|284008185|emb|CBA74448.1| conserved pentapeptide repeat protein [Arsenophonus nasoniae]
Length = 1253
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A L A+ + N AN S D+ + + +K N A L +A + N ADL +
Sbjct: 931 ATLHKASLNGAILHRVNLNNANLISVDLYRAILNDAKLNNANLLRANLKETNLVNADLIN 990
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGA-----IIEGADFSDAVID 206
+ + L+ NL +A L LT++DL A I G +FS A++D
Sbjct: 991 ADLTQATLSHTNLMHANLAHADLTQTDLSHANLQQVSIHGTNFSGAILD 1039
Score = 40.0 bits (92), Expect = 0.99, Method: Composition-based stats.
Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N N T+A + ++ +G A+ ++ N A+L + R +LN+A L NA
Sbjct: 922 NLSNLNLTNATLHKASLNG----------AILHRVNLNNANLISVDLYRAILNDAKLNNA 971
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVI 205
L+R L ++L A + AD + A +
Sbjct: 972 NLLRANLKETNLVNADLINADLTQATL 998
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 24/83 (28%), Positives = 37/83 (44%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N +N ++ ++ + + NGA L + AN DL +++ LN ANL A
Sbjct: 917 NLSGSNLSNLNLTNATLHKASLNGAILHRVNLNNANLISVDLYRAILNDAKLNNANLLRA 976
Query: 179 VLVRTVLTRSDLGGAIIEGADFS 201
L T L +DL A + A S
Sbjct: 977 NLKETNLVNADLINADLTQATLS 999
>gi|409993775|ref|ZP_11276905.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
Paraca]
gi|291572160|dbj|BAI94432.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935380|gb|EKN76914.1| hypothetical protein APPUASWS_21733 [Arthrospira platensis str.
Paraca]
Length = 741
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 52/101 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +LR A N A+ AD+R +D G+ GA L +A Y+AN T + +
Sbjct: 581 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITEGNFNG 640
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R+ N ++L +A L+R L++S L A + GA+ S +
Sbjct: 641 AKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQS 681
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 48/103 (46%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL+ A N RANF A++ E +F+G+K ++ A DLS
Sbjct: 606 ADLRGADLQGANLKGANLYRANFYQANITEGNFNGAKLRRVNFNRSDLRDAELIRVDLSK 665
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ + L ANL+ + L LTR+DL GAD S +I
Sbjct: 666 SRLRSACLRGANLSQSNLKGADLTRADLSNVKFTGADLSCTLI 708
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 3/111 (2%)
Query: 87 NKYEAE-TRGEFGIGSA--AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N Y+A T G F F +DLR A ++ + ++ SA +R ++ S S GA
Sbjct: 627 NFYQANITEGNFNGAKLRRVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGA 686
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
L +A FTGADLS TL+ L+ A+L NA L + L S+ G I
Sbjct: 687 DLTRADLSNVKFTGADLSCTLIRHANLSGADLRNAKLEKANLFGSNTVGCI 737
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 7/121 (5%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L +N A RG G A ADLR A N + AN A+ +++ + FNG
Sbjct: 583 LRGVNLRNANLRG--GNLEGAHLEGADLRGADLQGANLKGANLYRANFYQANITEGNFNG 640
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
A L + NF +DL D + R+ L+++ L +A L L++S+L GA + AD S+
Sbjct: 641 AKLR-----RVNFNRSDLRDAELIRVDLSKSRLRSACLRGANLSQSNLKGADLTRADLSN 695
Query: 203 A 203
Sbjct: 696 V 696
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---- 156
+ A A+L++A VK + RRA+ ++ + + +K A L A +AN
Sbjct: 494 AVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSACLIEANLMAASL 553
Query: 157 ------GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GADLS+ ++ LN+ANL +A L L ++L G +EGA A
Sbjct: 554 EGCDLKGADLSNANLESAKLNQANLAHANLRGVNLRNANLRGGNLEGAHLEGA 606
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 51/112 (45%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+QF DLR+ N ++ + T ADMRE + G L KAN + A L+
Sbjct: 431 SQFQGLDLRQTNLKGVNLKKMDLTGADMREKNLEGMSLIQLDLRLVNLAKANLSHAILNG 490
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALC 214
+ + L ANL A LV+ L R+DL + A + A + A ++ C
Sbjct: 491 SKLAVANLKGANLQEASLVKADLRRADLEEVNLSYASLTTAKLQRANLRSAC 542
>gi|157964675|ref|YP_001499499.1| hypothetical protein RMA_0846 [Rickettsia massiliae MTU5]
gi|157844451|gb|ABV84952.1| hypothetical protein RMA_0846 [Rickettsia massiliae MTU5]
Length = 964
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A L+KA A G ++SD
Sbjct: 560 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLKKAEA-----EGLNISDA 614
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A++ A KQA K A
Sbjct: 615 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIVKEANLKQANLKAA 674
Query: 218 N 218
N
Sbjct: 675 N 675
Score = 42.4 bits (98), Expect = 0.20, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 589 ATAQF--AKLSNATLKKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 646
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA + A L + L ++L G EGADF A I+ A K
Sbjct: 647 ENADMQAVEAAEAIVKEANLKQANLKAANLAGINKEGADFDKAEINNATK 696
Score = 37.7 bits (86), Expect = 5.2, Method: Composition-based stats.
Identities = 38/144 (26%), Positives = 61/144 (42%), Gaps = 13/144 (9%)
Query: 57 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 116
LKN +F S L +++C+ + + N A + A F ADL+K+
Sbjct: 364 LKN-TLFASANLENIKISNCNLDFTNFEGANLQNAVFQNV--TARNAGFLFADLQKSKIE 420
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK----------ANFTGADLSDTLMD 166
+ RA D+ E + + SKFN + A A K +N TG L+ M
Sbjct: 421 NSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWKNSNLTGISLAYADMQ 480
Query: 167 RMVLNEANLTNAVLVRTVLTRSDL 190
R+ + L NA+L + + +DL
Sbjct: 481 RVQMQGVVLNNALLDQANIVSTDL 504
>gi|428773363|ref|YP_007165151.1| pentapeptide repeat-containing protein [Cyanobacterium stanieri PCC
7202]
gi|428687642|gb|AFZ47502.1| pentapeptide repeat protein [Cyanobacterium stanieri PCC 7202]
Length = 319
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A ANFT AD+ E++ SG A L +A +N G
Sbjct: 113 SNANLTGANLTGATLTGATLTGANFTRADLTEANLSGLNLMEADLTRANLSASNLQGCSF 172
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ R L EA+L N++L L R++L A + GA+FS AV+
Sbjct: 173 NEANFSRADLREADLKNSILEGVFLHRANLSRANLRGANFSGAVL 217
>gi|409994207|ref|ZP_11277325.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
Paraca]
gi|409934955|gb|EKN76501.1| hypothetical protein APPUASWS_23863 [Arthrospira platensis str.
Paraca]
Length = 519
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 10/97 (10%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN----- 171
+ N +ANFT A + ++FSG+ G L +A + +GA L ++ VLN
Sbjct: 29 RVNLSQANFTEAILSVTNFSGANLTGVNLTRAKLNVSKLSGAILQGANLNEAVLNVANLI 88
Query: 172 -----EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ANL +A L+R L R++L A++ GA+ ++A
Sbjct: 89 RADLSQANLIDASLIRAELMRAELSEAVVNGANLTEA 125
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 62/129 (48%), Gaps = 12/129 (9%)
Query: 93 TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
TR + + S A A+L +AV N RA+ + A++ ++ ++ A L +AV
Sbjct: 58 TRAKLNVSKLSGAILQGANLNEAVLNVANLIRADLSQANLIDASLIRAELMRAELSEAVV 117
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNA----------VLVRTVLTRSDLGGAIIEGADF 200
AN T ADL + + L +ANL+ A L R+ LTRSDL A + G +
Sbjct: 118 NGANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNL 177
Query: 201 SDAVIDLAQ 209
+A + A+
Sbjct: 178 RNAELRQAE 186
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 48/96 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L +A + N R+N T +D+ +D G A L +A A+ GA+LS
Sbjct: 140 ANLSGANLSEACLILSNLERSNLTRSDLTRADLRGVNLRNAELRQAELSGADLRGANLSG 199
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
+ L+ ANL+ A L T L+ + L GA + GA
Sbjct: 200 ANLRWANLSGANLSGANLEATQLSGASLRGANLSGA 235
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 49/91 (53%)
Query: 107 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMD 166
+A+LR+A + R AN + A++R ++ SG+ +GA LE A+ GA+LS +
Sbjct: 179 NAELRQAELSGADLRGANLSGANLRWANLSGANLSGANLEATQLSGASLRGANLSGASLL 238
Query: 167 RMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
A+LT A L+ T +DL G+ + G
Sbjct: 239 NCTAIHADLTQANLIECDWTDADLRGSALTG 269
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ A+L +AV N A+ A +R ++ + +GA L +A +N ++L+
Sbjct: 105 AELMRAELSEAVVNGANLTEADLREATLRHTELQQANLSGANLSEACLILSNLERSNLTR 164
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ + R L NL NA L + L+ +DL GA + GA+
Sbjct: 165 SDLTRADLRGVNLRNAELRQAELSGADLRGANLSGANL 202
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
+DL +A N R A A++ +D G+ +GA L A AN +GA+L T +
Sbjct: 165 SDLTRADLRGVNLRNAELRQAELSGADLRGANLSGANLRWANLSGANLSGANLEATQLSG 224
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL+ A L+ +DL A + D++DA
Sbjct: 225 ASLRGANLSGASLLNCTAIHADLTQANLIECDWTDA 260
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
DFS A L + +ANFT A LS T + ANLT L R L S L GA
Sbjct: 16 DFSAILLCEANLSRVNLSQANFTEAILSVT-----NFSGANLTGVNLTRAKLNVSKLSGA 70
Query: 194 IIEGADFSDAVIDLA 208
I++GA+ ++AV+++A
Sbjct: 71 ILQGANLNEAVLNVA 85
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 58/139 (41%), Gaps = 18/139 (12%)
Query: 83 LADLNKYEAE-TRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE------- 132
L D + AE R E + + A ADLR+A ++AN + A++ E
Sbjct: 97 LIDASLIRAELMRAELSEAVVNGANLTEADLREATLRHTELQQANLSGANLSEACLILSN 156
Query: 133 --------SDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
SD + + G L A +A +GADL + L ANL+ A L
Sbjct: 157 LERSNLTRSDLTRADLRGVNLRNAELRQAELSGADLRGANLSGANLRWANLSGANLSGAN 216
Query: 185 LTRSDLGGAIIEGADFSDA 203
L + L GA + GA+ S A
Sbjct: 217 LEATQLSGASLRGANLSGA 235
>gi|418728079|ref|ZP_13286659.1| NifU-like N-terminal domain protein [Leptospira interrogans str. UI
12758]
gi|410777124|gb|EKR57092.1| NifU-like N-terminal domain protein [Leptospira interrogans str. UI
12758]
Length = 263
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
+ VK + R + +S + + F G F+GA L A ++F GA+ S + LN A
Sbjct: 141 LKVKGSLRDEDLSSIILEKLKFDGVDFSGANLGHAFLQNSSFVGANFSGAKLRGSFLNNA 200
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID----LAQKQ 211
+L N+ L + L GA +EGADF+DA+ D L QKQ
Sbjct: 201 DLRNSNFRGADLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|254414225|ref|ZP_05027992.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196178900|gb|EDX73897.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 963
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 50/101 (49%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+ A + N +RAN A++ E++ + F GA LE+A +AN GA+L +
Sbjct: 776 ANFEGANFEGANLEEANLKRANLFEANLFEANLFEANFEGANLERANLKRANLEGANLEE 835
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L EANL A L R+ L A +E A+ A
Sbjct: 836 ANLKGANLEEANLEEANFEGANLKRATLFEANLEWANLKRA 876
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 11/117 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L +A N +RAN A++ E++ G+ A LE+A NF GA+L
Sbjct: 811 ANFEGANLERA-----NLKRANLEGANLEEANLKGANLEEANLEEA-----NFEGANLKR 860
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
+ L ANL A L L ++ GA +EGA A + A K+A K AN
Sbjct: 861 ATLFEANLEWANLKRANLFEANLFDANFEGANLEGAHLKGANLKRANLKRANLKRAN 917
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 47/168 (27%), Positives = 71/168 (42%), Gaps = 25/168 (14%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L + N +EA G A A+L++A N AN A++ E++ + F G
Sbjct: 803 LFEANLFEANFEG-------ANLERANLKRANLEGANLEEANLKGANLEEANLEEANFEG 855
Query: 143 AYLEKAVAYKANFTGADLSDT------LMDRMV---------LNEANLTNAVLVRTVLTR 187
A L++A ++AN A+L L D L ANL A L R L R
Sbjct: 856 ANLKRATLFEANLEWANLKRANLFEANLFDANFEGANLEGAHLKGANLKRANLKRANLKR 915
Query: 188 SDLGGAIIEGADFSDAVIDLA---QKQALCKYANGTNPITGVSTRKSL 232
++L A EGA+F A ++ A + G PI+ T ++L
Sbjct: 916 ANLFEANFEGANFEGATLEWANLFEANLKGTILEGKVPISSPETEQTL 963
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 35/102 (34%), Positives = 50/102 (49%), Gaps = 5/102 (4%)
Query: 109 DLRKAVHVKE-----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
DLR V V + NF ANF A++ E++ + A L +A ++ANF GA+L
Sbjct: 762 DLRGCVLVFKDFYWANFEGANFEGANLEEANLKRANLFEANLFEANLFEANFEGANLERA 821
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ R L ANL A L L ++L A EGA+ A +
Sbjct: 822 NLKRANLEGANLEEANLKGANLEEANLEEANFEGANLKRATL 863
>gi|440683010|ref|YP_007157805.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
cylindrica PCC 7122]
gi|428680129|gb|AFZ58895.1| serine/threonine protein kinase with pentapeptide repeats [Anabaena
cylindrica PCC 7122]
Length = 535
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
L+ V + +F N + ++ +D SG+ F+ A L++ N GA+L +T R
Sbjct: 401 LQAYVKGRRDFASYNISMLSLQGADLSGTNFHHAQLKQT-----NLQGANLQNTDFGRAS 455
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
L +ANL +A L + L+ +DL GA + GAD S A + A
Sbjct: 456 LMQANLRDANLTKAYLSNADLEGADLRGADLSYAYMSQA 494
Score = 40.4 bits (93), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 49/101 (48%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
NF A +++ ++ + F A L +A AN T A LS+ ++ L A+L+ A
Sbjct: 430 NFHHAQLKQTNLQGANLQNTDFGRASLMQANLRDANLTKAYLSNADLEGADLRGADLSYA 489
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANG 219
+ + L ++L GA + GA +D I LA+ L NG
Sbjct: 490 YMSQANLRGANLCGANLTGAKVTDEQIALAKTNWLTVRPNG 530
>gi|126656707|ref|ZP_01727921.1| hypothetical protein CY0110_23751 [Cyanothece sp. CCY0110]
gi|126621927|gb|EAZ92635.1| hypothetical protein CY0110_23751 [Cyanothece sp. CCY0110]
Length = 257
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 66/124 (53%), Gaps = 6/124 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQ A+L KA + N ++N A ++E++ + N + ++ A YKA TG+ L
Sbjct: 69 TEAQLKQANLTKANLFEANLSQSNLEEAILQEANLINTNLNKSIIKNANLYKALLTGSKL 128
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ------KQALC 214
S+ ++ L +A+L+ + T+L R++L A ++ +D A + AQ +Q+
Sbjct: 129 SNANLEGANLEQADLSTYEELPTLLNRTNLNKANLKQSDLEGAWLVKAQLIEANLQQSNL 188
Query: 215 KYAN 218
KYAN
Sbjct: 189 KYAN 192
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 12/124 (9%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
+L KA N N A+++ ++ ++ A L++A KAN A+LS + ++
Sbjct: 36 VNLEKANLQHANLHETNLNKANLKNANLQQTRLTEAQLKQANLTKANLFEANLSQSNLEE 95
Query: 168 MVLNEANLTN----------AVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
+L EANL N A L + +LT S L A +EGA+ A DL+ + L
Sbjct: 96 AILQEANLINTNLNKSIIKNANLYKALLTGSKLSNANLEGANLEQA--DLSTYEELPTLL 153
Query: 218 NGTN 221
N TN
Sbjct: 154 NRTN 157
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 54/99 (54%), Gaps = 10/99 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A+L++ + ++AN T A++ E++ S S LE+A+ +AN +L
Sbjct: 56 ANLKNANLQQTRLTEAQLKQANLTKANLFEANLSQSN-----LEEAILQEANLINTNL-- 108
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
++ ++ ANL A+L + L+ ++L GA +E AD S
Sbjct: 109 ---NKSIIKNANLYKALLTGSKLSNANLEGANLEQADLS 144
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAY-----KANFTGADLSDTLMDRMVLNEANLTNAV 179
+ D+++ D G A L+ A + KAN A+L T + L +ANLT A
Sbjct: 23 LENVDLQQLDLEGVNLEKANLQHANLHETNLNKANLKNANLQQTRLTEAQLKQANLTKAN 82
Query: 180 LVRTVLTRSDLGGAIIEGADF 200
L L++S+L AI++ A+
Sbjct: 83 LFEANLSQSNLEEAILQEANL 103
>gi|428309842|ref|YP_007120819.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428251454|gb|AFZ17413.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 289
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 52/95 (54%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
+LR+A + + R N +AD+ E+D +K +GA L A +AN D S + R+
Sbjct: 30 NLREANLREAHLRYVNLCTADLSEADLFNAKLSGADLTGANLTRANLFLVDFSTADLTRV 89
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANLT A L T LT ++L GA + GA+ +A
Sbjct: 90 DLTGANLTRANLFFTNLTGANLTGANLTGANLKEA 124
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 50/101 (49%), Gaps = 10/101 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ ADL A + N +F++AD+ D +G A L +A + N TGA+L+
Sbjct: 59 AKLSGADLTGANLTRANLFLVDFSTADLTRVDLTG-----ANLTRANLFFTNLTGANLTG 113
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L EAN +NA L R +DL GA + AD S A
Sbjct: 114 ANLTGANLKEANFSNAGLCR-----ADLSGANLNRADLSKA 149
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A N + ANF++A + +D SG+ N A L KA N +GADLS + +
Sbjct: 109 ANLTGANLTGANLKEANFSNAGLCRADLSGANLNRADLSKADLRNINLSGADLSGANLGK 168
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN----PI 223
L+ ANL A L L ++L G + A+F A +L+ +A GTN +
Sbjct: 169 ANLSGANLCAANLSGANLCAANLSGTNLCAANFKRA--NLSGASLSNTHALGTNFEQARL 226
Query: 224 TGV 226
TGV
Sbjct: 227 TGV 229
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 50/96 (52%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
F +ADL + N RAN ++ ++ +G+ GA L++A A ADLS
Sbjct: 81 FSTADLTRVDLTGANLTRANLFFTNLTGANLTGANLTGANLKEANFSNAGLCRADLSGAN 140
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
++R L++A+L N L L+ ++LG A + GA+
Sbjct: 141 LNRADLSKADLRNINLSGADLSGANLGKANLSGANL 176
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 48/103 (46%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F +A L +A N RA+ + AD+R + SG+ +GA L KA AN A+LS
Sbjct: 124 ANFSNAGLCRADLSGANLNRADLSKADLRNINLSGADLSGANLGKANLSGANLCAANLSG 183
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L+ NL A R L+ + L G +F A +
Sbjct: 184 ANLCAANLSGTNLCAANFKRANLSGASLSNTHALGTNFEQARL 226
>gi|119490887|ref|ZP_01623170.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119453705|gb|EAW34864.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 517
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 58/116 (50%), Gaps = 10/116 (8%)
Query: 100 GSAAQFGSADLRKAVHVKENF----------RRANFTSADMRESDFSGSKFNGAYLEKAV 149
G++ ADLR+A VK N R+ N T AD+R+++ SG+ A L A
Sbjct: 157 GASTNLQRADLRRANLVKANLPKADFSHAEMRQTNLTYADLRQANLSGANLRWADLRGAN 216
Query: 150 AYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+ +GA+LS + L+ A L A LV LT+++L A GAD S A +
Sbjct: 217 LLGADLSGANLSGANLSGANLSRATLAKASLVHVDLTQANLIKADWMGADISGATL 272
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 51/96 (53%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A++R+ + R+AN + A++R +D G+ GA L A AN +GA+LS
Sbjct: 180 ADFSHAEMRQTNLTYADLRQANLSGANLRWADLRGANLLGADLSGANLSGANLSGANLSR 239
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
+ + L +LT A L++ +D+ GA + GA
Sbjct: 240 ATLAKASLVHVDLTQANLIKADWMGADISGATLTGA 275
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ A ADLR+A + NF +AN + A++R + + A L +A KAN AD
Sbjct: 123 TKANLNGADLREARVGQANFSQANLSGANLRGVSGASTNLQRADLRRANLVKANLPKADF 182
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S M + L A+L A L L +DL GA + GAD S A
Sbjct: 183 SHAEMRQTNLTYADLRQANLSGANLRWADLRGANLLGADLSGA 225
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 56/104 (53%), Gaps = 5/104 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA-----VAYKANFTGAD 159
F +L +A + N +AN + A + ++ SG+ +G L +A +AN TGA+
Sbjct: 22 FTGINLNEANLSRINLSQANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGAN 81
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
LS ++ L A+L++A+LV T+ RS+L A + A+ + A
Sbjct: 82 LSRATLNVANLVRADLSDAILVETLAIRSELIRARLNNANLTKA 125
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
NFT ++ E++ S + A L A N +GA+LS + R LN + L+ A L
Sbjct: 21 NFTGINLNEANLSRINLSQANLSDASLCVTNLSGANLSGINLSRANLNVSRLSQANLTGA 80
Query: 184 VLTRSDLGGAIIEGADFSDAVI 205
L+R+ L A + AD SDA++
Sbjct: 81 NLSRATLNVANLVRADLSDAIL 102
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N RAN + + +++ +G+ + A L A +A+ + A L +TL R L A L NA
Sbjct: 61 NLSRANLNVSRLSQANLTGANLSRATLNVANLVRADLSDAILVETLAIRSELIRARLNNA 120
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
L + L +DL A + A+FS A
Sbjct: 121 NLTKANLNGADLREARVGQANFSQA 145
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 119 NFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
N +AN AD+RE+ +FS + +GA L N ADL + + L +A
Sbjct: 121 NLTKANLNGADLREARVGQANFSQANLSGANLRGVSGASTNLQRADLRRANLVKANLPKA 180
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADF 200
+ ++A + +T LT +DL A + GA+
Sbjct: 181 DFSHAEMRQTNLTYADLRQANLSGANL 207
>gi|392412448|ref|YP_006449055.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
gi|390625584|gb|AFM26791.1| putative low-complexity protein [Desulfomonile tiedjei DSM 6799]
Length = 241
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ DL F R NF+ A++ +++F+ S G+ L AV A TG+DL
Sbjct: 37 SEAELSQVDLSSLNLSGMKFMRCNFSRANLTKTNFADSDLTGSNLTTAVLVAATLTGSDL 96
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++T + L A+L N+ L+ L + L A ++GAD S A
Sbjct: 97 TETNLTGADLTAADLVNSTLINADLYWARLTLATLDGADLSQA 139
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 10/94 (10%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMV 169
LR+ ++F A + D+ + SG KF +A K NF +DL+ +
Sbjct: 26 LREKARPGDDFSEAELSQVDLSSLNLSGMKFMRCNFSRANLTKTNFADSDLTGS------ 79
Query: 170 LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NLT AVLV LT SDL + GAD + A
Sbjct: 80 ----NLTTAVLVAATLTGSDLTETNLTGADLTAA 109
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL +A K + A+ T AD+ +D G+ G L KAV AN + A
Sbjct: 129 ATLDGADLSQANLSKSDLTLASLTGADLFWADLGGATLVGTNLSKAVLTVANLSKA---- 184
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L A+L+ A+L L+ +DL A + GAD S+A
Sbjct: 185 ------ALMMADLSGAILAGADLSGADLSEANLTGADLSEA 219
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 12/114 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANF 155
+ A +ADL + + + A T AD+ +++ S S A L A + A+
Sbjct: 102 TGADLTAADLVNSTLINADLYWARLTLATLDGADLSQANLSKSDLTLASLTGADLFWADL 161
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
GA L T + + VL ANL+ A L+ +DL GAI+ GAD S A DL++
Sbjct: 162 GGATLVGTNLSKAVLTVANLSKAALMM-----ADLSGAILAGADLSGA--DLSE 208
>gi|428209167|ref|YP_007093520.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428011088|gb|AFY89651.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 163
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 4/107 (3%)
Query: 107 SADLRKAVHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
S++++K + K+ N AN +AD+ E++ G+ A L+ A +AN GA+L
Sbjct: 43 SSEVQKLLKTKQCPGCNLSGANLQNADLDEANLQGANLQNANLQNADLEEANLQGANLQG 102
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
+ R L +ANL +A L + L R+D+ GA + A+ + A + A+
Sbjct: 103 ANLIRADLEKANLQSANLQQASLQRADIEGANLTKANITGANLQQAE 149
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +ADL +A N + AN +AD+ E++ G+ GA L +A KAN A+L
Sbjct: 61 SGANLQNADLDEANLQGANLQNANLQNADLEEANLQGANLQGANLIRADLEKANLQSANL 120
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ R + AN LT++++ GA ++ A+F + V+
Sbjct: 121 QQASLQRADIEGAN----------LTKANITGANLQQAEFENTVM 155
>gi|334117108|ref|ZP_08491200.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333461928|gb|EGK90533.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 509
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 55/117 (47%), Gaps = 9/117 (7%)
Query: 84 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
ADL K E S +AD+R+A + N AN + A+++ +D +G+ NGA
Sbjct: 171 ADLTKAEL---------SGVNLSNADMRQASLQQVNLSSANLSGANLKWADLTGANLNGA 221
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
L A AN GADL +T + A+LT L+ +DL GA + GA
Sbjct: 222 DLSFAKLSGANLNGADLRNTNLGSASFVHADLTETNLINADWVGADLRGATLTGAKL 278
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADLR+ + N AN + A++R + S + F A L A KA +G +L
Sbjct: 124 SGANLTEADLREVKLTEANLCGANLSGANLRGASASSANFQEANLHGADLTKAELSGVNL 183
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
S+ M + L + NL++A L L +DL GA + GAD S
Sbjct: 184 SNADMRQASLQQVNLSSANLSGANLKWADLTGANLNGADLS 224
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKANF 155
S+A F A+L A K N ++ADMR++ + S + +GA L+ A AN
Sbjct: 159 SSANFQEANLHGADLTKAELSGVNLSNADMRQASLQQVNLSSANLSGANLKWADLTGANL 218
Query: 156 TGADLSDTLMDRMVLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDAVI 205
GADLS + LN A+L N A V LT ++L A GAD A +
Sbjct: 219 NGADLSFAKLSGANLNGADLRNTNLGSASFVHADLTETNLINADWVGADLRGATL 273
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 49/100 (49%), Gaps = 5/100 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR A NF+ AN AD+ +++ SG + A + +A + N + A+LS
Sbjct: 146 ANLSGANLRGASASSANFQEANLHGADLTKAELSGVNLSNADMRQASLQQVNLSSANLSG 205
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSD 202
L A+LT A L L+ + L GA + GAD +
Sbjct: 206 A-----NLKWADLTGANLNGADLSFAKLSGANLNGADLRN 240
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+FT ++ E++ S + + L +A + N +GA+LS+ L+EANL A L T
Sbjct: 22 DFTGINLNEANLSRINLSQSILRRASLFVTNLSGANLSEA-----NLSEANLNVARLSST 76
Query: 184 VLTRSDLGGAIIEGADFSDAVIDLAQ 209
L+R+ L GA I A+ A + AQ
Sbjct: 77 NLSRAILNGATINVANLVRADLSAAQ 102
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 2/136 (1%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
L+ A ++ + N++ L+ N A G + A ADL A ++ + R+
Sbjct: 58 LSEANLSEANLNVARLSSTNLSRAILNG--ATINVANLVRADLSAAQLIRASLIRSELVR 115
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
++ +++FSG+ A L + +AN GA+LS + + AN A L LT+
Sbjct: 116 CELSKTNFSGANLTEADLREVKLTEANLCGANLSGANLRGASASSANFQEANLHGADLTK 175
Query: 188 SDLGGAIIEGADFSDA 203
++L G + AD A
Sbjct: 176 AELSGVNLSNADMRQA 191
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN------- 171
N R N + + +R + + +GA L +A +AN A LS T + R +LN
Sbjct: 32 NLSRINLSQSILRRASLFVTNLSGANLSEANLSEANLNVARLSSTNLSRAILNGATINVA 91
Query: 172 ---EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A+L+ A L+R L RS+L + +FS A
Sbjct: 92 NLVRADLSAAQLIRASLIRSELVRCELSKTNFSGA 126
>gi|86607938|ref|YP_476700.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86556480|gb|ABD01437.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 154
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 61/128 (47%), Gaps = 16/128 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG--- 157
S AQ A+L+ + R A+ + AD+RE+D SG+ +GA L A + N G
Sbjct: 32 SGAQLSGANLKGII-----LRDADLSGADLREADLSGADLSGADLRGAKLRRVNLIGAKL 86
Query: 158 --ADLSDTLMDRMVLNEANLTNAVLVRTVL-TRSDLGGAIIEGADFSDAVIDLAQKQALC 214
ADL + R L A+L+ A L R L +DL GAII F A+ D
Sbjct: 87 VKADLRGANLYRAKLLRADLSEAELNRADLRIGADLRGAIITNTHFRGALYD-----EYT 141
Query: 215 KYANGTNP 222
K+ +G NP
Sbjct: 142 KFPDGFNP 149
>gi|158335878|ref|YP_001517052.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158306119|gb|ABW27736.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 170
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 5/100 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + ++ N R AN A++R + S A LE A NFT A L + ++
Sbjct: 45 ADLSGLILIRANLRNANLQGANLRNTSLLLSNLENANLENA-----NFTAAYLYGSNLEN 99
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
L + T AVL L +D+ A + GAD +DA +DL
Sbjct: 100 TQLTSTDFTQAVLRSAKLQGADVCTATLAGADLTDADVDL 139
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
D SG+ +G L +A AN GA+L +T + L ANL NA L S+L
Sbjct: 41 DLSGADLSGLILIRANLRNANLQGANLRNTSLLLSNLENANLENANFTAAYLYGSNLENT 100
Query: 194 IIEGADFSDAVIDLAQKQ 211
+ DF+ AV+ A+ Q
Sbjct: 101 QLTSTDFTQAVLRSAKLQ 118
>gi|427711398|ref|YP_007060022.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427375527|gb|AFY59479.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 449
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 8/114 (7%)
Query: 84 ADLNKY---EAETRGEFGIGS----AAQFGSADLRKAVHVKENFRRANFTSADMRESDFS 136
ADL++ EA+ RG G A A+L V+ + R + AD+ ++ S
Sbjct: 115 ADLSRVDLAEADLRG-LGFNQVNLRGANLQGANLHNTEMVQADLGRVDLIEADLSNANLS 173
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
G+ +GA L +A AN +GADLS + + L+EANLT A L L ++DL
Sbjct: 174 GANLSGANLSRANLANANLSGADLSRVDLTEVKLSEANLTKANLSGAELGKADL 227
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 7/110 (6%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANFTGAD 159
F A L A + + N AD+ + SG+K N GA L A+ K + + A+
Sbjct: 17 FNRASLSNAELINVDLSGINLARADLEWVNLSGTKLNNANLSGAELINAILIKTDLSQAN 76
Query: 160 LSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L+ + R L+ ANL+ L R+ L+ + L GA ++GAD S +DLA+
Sbjct: 77 LTGVNLSRTDLSWANLSYTNLSRSELSEATLRGANLQGADLSR--VDLAE 124
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ +LR A V N A AD+ ++D + +G L + AN +GA+L
Sbjct: 268 SRAKLVGTNLRGANLVGANLTGATLDGADLSQADMRSANLSGLLLNGVILRGANLSGANL 327
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + LN+ANL+ A L+ L+R+ + G + A S+A
Sbjct: 328 RE-----IELNQANLSRADLIEANLSRAKMAGVNLSRATLSEA 365
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A DL + + N +AN + A++ ++D S + L A N +GA+L
Sbjct: 193 SGADLSRVDLTEVKLSEANLTKANLSGAELGKADLSALELCDVNLSGA-----NLSGANL 247
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ-KQALCKYAN 218
++T + R L+ ANL L R L ++L GA + GA+ + A +D A QA + AN
Sbjct: 248 ANTNLSRADLSGANLRGVNLSRAKLVGTNLRGANLVGANLTGATLDGADLSQADMRSAN 306
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 12/114 (10%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN-----GAYLEKAVAYKANF 155
S ++ A LR A N + A+ + D+ E+D G FN GA L+ A +
Sbjct: 98 SRSELSEATLRGA-----NLQGADLSRVDLAEADLRGLGFNQVNLRGANLQGANLHNTEM 152
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
ADL + L+ ANL+ A L L+R++L A + GAD S +DL +
Sbjct: 153 VQADLGRVDLIEADLSNANLSGANLSGANLSRANLANANLSGADLSR--VDLTE 204
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+LR+ +AN + AD+ E++ S +K G L +A +AN + A LS
Sbjct: 320 ANLSGANLREI-----ELNQANLSRADLIEANLSRAKMAGVNLSRATLSEANMSRATLSG 374
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R+ L+ + L L+ +DLG A + GA+ S A
Sbjct: 375 ATLSRVTLSGDTIGKVDLSGVNLSGADLGDAQLLGANLSRA 415
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 54/116 (46%), Gaps = 15/116 (12%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANFTG 157
A A+L A+ +K + +AN T ++ +D S + + + L +A AN G
Sbjct: 55 ANLSGAELINAILIKTDLSQANLTGVNLSRTDLSWANLSYTNLSRSELSEATLRGANLQG 114
Query: 158 ADLSDTLM----------DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
ADLS + +++ L ANL A L T + ++DLG + AD S+A
Sbjct: 115 ADLSRVDLAEADLRGLGFNQVNLRGANLQGANLHNTEMVQADLGRVDLIEADLSNA 170
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 7/109 (6%)
Query: 101 SAAQFGSADLRKAVHVKE------NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 154
S A+ G ADL A+ + + N AN + ++ +D SG+ G L +A N
Sbjct: 218 SGAELGKADL-SALELCDVNLSGANLSGANLANTNLSRADLSGANLRGVNLSRAKLVGTN 276
Query: 155 FTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
GA+L + L+ A+L+ A + L+ L G I+ GA+ S A
Sbjct: 277 LRGANLVGANLTGATLDGADLSQADMRSANLSGLLLNGVILRGANLSGA 325
>gi|381395251|ref|ZP_09920956.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
611]
gi|379329152|dbj|GAB56089.1| hypothetical protein GPUN_1974 [Glaciecola punicea DSM 14233 = ACAM
611]
Length = 258
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 61/116 (52%), Gaps = 9/116 (7%)
Query: 99 IGSAAQFGSADLR----KAVHVKE-NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 153
IGS F AD+R K V + F R+ T+ADMR DF G F+ A LE A A
Sbjct: 139 IGST--FIDADMRDSSLKNVRARSAMFTRSVLTNADMRWGDFEGVDFSNANLEGADLTMA 196
Query: 154 NFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
N GA+L+ + +L NL A+L T++ + + GA ++ DF+ +DL+Q
Sbjct: 197 NLRGANLTAANLKNAMLLYTNLEGAILNGTIMDGAQIVGANMKRVDFTK--VDLSQ 250
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 52/121 (42%), Gaps = 15/121 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTS---------------ADMRESDFSGSKFNGAYL 145
S + ++ V VK F+R+N T+ AD+ ES+ + FN A L
Sbjct: 69 SGSNLTGSNFSSTVLVKAKFKRSNLTNTNFQNANLGAAQLLGADLSESNLRNANFNKAVL 128
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ A G+ D M L +A+ R+VLT +D+ EG DFS+A +
Sbjct: 129 QYTGFIDATLIGSTFIDADMRDSSLKNVRARSAMFTRSVLTNADMRWGDFEGVDFSNANL 188
Query: 206 D 206
+
Sbjct: 189 E 189
>gi|358461868|ref|ZP_09172018.1| pentapeptide repeat protein [Frankia sp. CN3]
gi|357072553|gb|EHI82089.1| pentapeptide repeat protein [Frankia sp. CN3]
Length = 376
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 61/114 (53%), Gaps = 20/114 (17%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----GAD 159
S DLR + R+A+F A + ++ +G++ +GA L A Y+A+ + GA+
Sbjct: 219 LASGDLRDV-----DLRQADFRDARLFYANLTGARLHGANLTNADLYQADLSFARLHGAN 273
Query: 160 LSDTLMDRM-----VLNEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDA 203
L+ ++R LNEANLTN AVL VL ++L GA + GA+ +DA
Sbjct: 274 LTSARLERADLSTAELNEANLTNGQLHEAVLYSAVLHGANLTGARLHGANLTDA 327
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 55/114 (48%), Gaps = 16/114 (14%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +ADL +A AN TSA + +D S ++ N +AN T L +
Sbjct: 252 ANLTNADLYQADLSFARLHGANLTSARLERADLSTAELN----------EANLTNGQLHE 301
Query: 163 TLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVIDLAQKQ 211
++ VL+ ANLT A L LT R++L GA + G D S V++L Q+Q
Sbjct: 302 AVLYSAVLHGANLTGARLHGANLTDAQPYRANLTGAQLHGVDLSR-VVNLTQEQ 354
>gi|428216569|ref|YP_007101034.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427988351|gb|AFY68606.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 330
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 54/98 (55%), Gaps = 9/98 (9%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N RRA+ AD+ E++ +G+ GA L +A + TGA+L++ +M L EANLT A
Sbjct: 151 NLRRADLRGADLSEANLAGADLRGADLSEANLANTDLTGANLAEAIMRGTGLTEANLTGA 210
Query: 179 VL-------VRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L VRT R++L A ++G + AV+ +A
Sbjct: 211 NLANAYMQNVRT--ERANLSEADLQGTNLDLAVMSMAN 246
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 77/163 (47%), Gaps = 12/163 (7%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRG----EFGIGSAAQFGSADLRKAVHVKENFRRA 123
L A + S NI++L + N A+ RG E + A G ADL +A + A
Sbjct: 132 LKGATLRRASKNITSLRNANLRRADLRGADLSEANLAGADLRG-ADLSEANLANTDLTGA 190
Query: 124 NFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N A MR E++ +G+ AY++ +AN + ADL T +D V++ ANL+ +
Sbjct: 191 NLAEAIMRGTGLTEANLTGANLANAYMQNVRTERANLSEADLQGTNLDLAVMSMANLSKS 250
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTN 221
L L R++L G + + S A +L + Q + Y TN
Sbjct: 251 NLSEASLYRANLNGTDLSRTNLSGA--NLREAQLVESYMARTN 291
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 7/122 (5%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFN 141
N EA RG G+ + A A+L A RAN + AD++ ++ S + +
Sbjct: 191 NLAEAIMRGT-GL-TEANLTGANLANAYMQNVRTERANLSEADLQGTNLDLAVMSMANLS 248
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L +A Y+AN G DLS T + L EA L + + RT LT +DL A++ A+ S
Sbjct: 249 KSNLSEASLYRANLNGTDLSRTNLSGANLREAQLVESYMARTNLTNADLADALLARAELS 308
Query: 202 DA 203
A
Sbjct: 309 SA 310
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 8/97 (8%)
Query: 87 NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 143
N EA+ +G + + S A ++L +A + RAN D+ ++ SG+ A
Sbjct: 226 NLSEADLQGTNLDLAVMSMANLSKSNLSEA-----SLYRANLNGTDLSRTNLSGANLREA 280
Query: 144 YLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
L ++ + N T ADL+D L+ R L+ ANL NA L
Sbjct: 281 QLVESYMARTNLTNADLADALLARAELSSANLLNANL 317
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 19/103 (18%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA--NFTGADL 160
A +ADL AV + AN +AD+R ++ SG+ GA L+ A +A N T
Sbjct: 95 ATLVNADLTFAVLID-----ANLMNADLRSANLSGANLAGACLKGATLRRASKNITS--- 146
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L ANL A L L+ ++L GA + GAD S+A
Sbjct: 147 ---------LRNANLRRADLRGADLSEANLAGADLRGADLSEA 180
>gi|300866980|ref|ZP_07111651.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335015|emb|CBN56817.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 300
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 65/120 (54%), Gaps = 9/120 (7%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
L +YE R +F + A SA+L A+ + N AN + A++ + + S+ NGA L
Sbjct: 7 LKRYENGDR-DF---AGADLSSANLSGAILIGVNLSGANLSGANLSRAFLTKSELNGASL 62
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++AN + A + + + L +AN++ A LV++ L R+ L GA I GA+ +A++
Sbjct: 63 -----HRANLSFAKMGEIRLADADLTKANISGAFLVKSKLPRAKLSGANITGANLRNAIL 117
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG-----SKFNGAYLEKAVAYKANF-- 155
A+ A L A + R+A+ D+ D +G +K NG LE + ANF
Sbjct: 150 AKLSGAKLFGAQLTGISLRKAHLNGIDLGGVDLNGVNLSEAKLNGVNLEGSNLVGANFYA 209
Query: 156 --------TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
TGADL+ + R L +ANL + L + L+++DL A + GA+F
Sbjct: 210 AQLRSVKLTGADLTKANLVRACLVQANLNWSRLSQANLSQADLSEATLMGANF 262
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 101 SAAQFGSADLRKAVHVKE-----NFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVA 150
S A A+L +A K + RAN + A M E +D + + +GA+L K+
Sbjct: 38 SGANLSGANLSRAFLTKSELNGASLHRANLSFAKMGEIRLADADLTKANISGAFLVKSKL 97
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+A +GA+++ + +L ANL +A L T L ++L GA + A+F A + A+
Sbjct: 98 PRAKLSGANITGANLRNAILWNANLCSAELQLTNLRGANLTGANLNWANFYGAKLSGAKL 157
Query: 211 QALCKYANGTNPITGVSTRKS 231
+TG+S RK+
Sbjct: 158 FGA--------QLTGISLRKA 170
Score = 37.0 bits (84), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 15/106 (14%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR A+ AN SA+++ ++ G+ GA L A NF GA L
Sbjct: 103 SGANITGANLRNAI-----LWNANLCSAELQLTNLRGANLTGANLNWA-----NFYGAKL 152
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S L A LT L + L DLGG + G + S+A ++
Sbjct: 153 SGA-----KLFGAQLTGISLRKAHLNGIDLGGVDLNGVNLSEAKLN 193
>gi|383481718|ref|YP_005390633.1| hypothetical protein MCC_05165 [Rickettsia rhipicephali str.
3-7-female6-CWPP]
gi|378934057|gb|AFC72560.1| hypothetical protein MCC_05165 [Rickettsia rhipicephali str.
3-7-female6-CWPP]
Length = 957
Score = 51.2 bits (121), Expect = 4e-04, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 104 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 163
+ +ADL KA K N A+ T+A + + +K + A L+KA A G ++SD
Sbjct: 553 KLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLKKAEA-----EGLNISDA 607
Query: 164 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQ-KQALCKYA 217
+ + EAN NA++ R LT+++ A++E AD ++A++ A KQA K A
Sbjct: 608 IAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAAEAIVKEANLKQANLKAA 667
Query: 218 N 218
N
Sbjct: 668 N 668
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
+ AQF A L A K N + A + + + F A +++A KANFT A L
Sbjct: 582 ATAQF--AKLSNATLKKAEAEGLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVL 639
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+ M + EA + A L + L ++L G EGADF A I+ A K
Sbjct: 640 ENADMQAVEAAEAIVKEANLKQANLKAANLAGINKEGADFDKAEINNATK 689
Score = 38.1 bits (87), Expect = 4.6, Method: Composition-based stats.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 10/98 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK---------- 152
A F ADL+K+ + RA D+ E + + SKFN + A A K
Sbjct: 400 AGFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKLIIKDSEWKN 459
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+N TG L+ M R+ + L NA+L + + +DL
Sbjct: 460 SNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDL 497
>gi|428208320|ref|YP_007092673.1| pentapeptide repeat-containing protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428010241|gb|AFY88804.1| pentapeptide repeat protein [Chroococcidiopsis thermalis PCC 7203]
Length = 160
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRM 168
DL A N RAN +A+++ ++ S S GA L A AN + A+L +TL
Sbjct: 42 DLSNAPLNNLNLSRANLRNANLQGANLSRSILAGADLSDANLETANISSANLFETL---- 97
Query: 169 VLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
L ANL +AVLV + LT + L A +EGA+ A++DL +
Sbjct: 98 -LIGANLKSAVLVNSNLTGAGLMAANLEGANLRGAIMDLVNSR 139
>gi|443475349|ref|ZP_21065301.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019796|gb|ELS33834.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 243
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 15/120 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVA 150
S ++ ADL +A + N ANF+ +D+ E+D S G+ GA L KA
Sbjct: 53 SNSKLNGADLNRAKLYRSNLVSANFSGSDLGETDLSEANLSDARLYGANLYGAILNKAKL 112
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVI 205
+A TGA++ ++R L+EA L +A L ++L R+DL A + GA+ + A++
Sbjct: 113 PRAKLTGANMGKAKLNRADLSEAILRDARLFGASLNESMLQRADLSRASLNGANLNKAML 172
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 67/136 (49%), Gaps = 21/136 (15%)
Query: 86 LNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
+N Y++ R G+ S DL A + AN + + + ++ S SK NGA
Sbjct: 7 INSYQSGRRNFAGVNLSKTDMNGIDLSNA-----DLSGANLSESSLYGANLSNSKLNGAD 61
Query: 145 LEKAVAYK-----ANFTGADLSDTLMDRMVLNE-----ANLTNAVLV-----RTVLTRSD 189
L +A Y+ ANF+G+DL +T + L++ ANL A+L R LT ++
Sbjct: 62 LNRAKLYRSNLVSANFSGSDLGETDLSEANLSDARLYGANLYGAILNKAKLPRAKLTGAN 121
Query: 190 LGGAIIEGADFSDAVI 205
+G A + AD S+A++
Sbjct: 122 MGKAKLNRADLSEAIL 137
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 60/119 (50%), Gaps = 2/119 (1%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 142
L+D Y A G I + A+ A L A K RA+ + A +R++ G+ N
Sbjct: 92 LSDARLYGANLYG--AILNKAKLPRAKLTGANMGKAKLNRADLSEAILRDARLFGASLNE 149
Query: 143 AYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L++A +A+ GA+L+ ++ + L A+L A L L+ +DL A ++G+D +
Sbjct: 150 SMLQRADLSRASLNGANLNKAMLCEVDLTFASLYGASLCDADLSEADLTSANLQGSDLT 208
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 55/109 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A+ ADL +A+ A+ + ++ +D S + NGA L KA+ + + T A L
Sbjct: 125 AKLNRADLSEAILRDARLFGASLNESMLQRADLSRASLNGANLNKAMLCEVDLTFASLYG 184
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
+ L+EA+LT+A L + LTR + A + + F+D V + Q +
Sbjct: 185 ASLCDADLSEADLTSANLQGSDLTRVNFYKANLSKSKFADTVTEGMQTR 233
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 101 SAAQFGSADLRKAV--HVKENFRR---ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A A+L KA+ V F A+ AD+ E+D + + G+ L + YKAN
Sbjct: 158 SRASLNGANLNKAMLCEVDLTFASLYGASLCDADLSEADLTSANLQGSDLTRVNFYKANL 217
Query: 156 TGADLSDTLMDRMVLNEANLTNAV 179
+ + +DT+ + M EANLT +
Sbjct: 218 SKSKFADTVTEGMQTREANLTGII 241
>gi|428311473|ref|YP_007122450.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428253085|gb|AFZ19044.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 580
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 53/95 (55%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
A F A+LR+A + N + A ++ E++ + GA L +A ++A TGAD+S
Sbjct: 154 ATNFTGANLREANLEQANLQEATLVGVNLTEANLNNVYLRGANLRQADLHRAILTGADMS 213
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
+ + L+ ANLT A L+R L ++DL A+++
Sbjct: 214 EANCEGADLSRANLTGAYLLRASLRKADLLRAVLQ 248
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 45/79 (56%)
Query: 123 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 182
AN T A++R+ + +G+ +GA L ANFTGA+L + +L+ AN T ++L R
Sbjct: 25 ANLTGANLRKINLTGANLSGANLSWCCFSHANFTGANLHQANLHSAILDNANFTQSILSR 84
Query: 183 TVLTRSDLGGAIIEGADFS 201
L++ DL A + AD +
Sbjct: 85 AKLSKVDLRLANLREADLN 103
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 51/96 (53%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A+ + AN ++ ++F+G+ A LE+A +A G +L++ ++
Sbjct: 130 ANLNHALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATLVGVNLTEANLNN 189
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ L ANL A L R +LT +D+ A EGAD S A
Sbjct: 190 VYLRGANLRQADLHRAILTGADMSEANCEGADLSRA 225
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 52/108 (48%)
Query: 98 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
I A F + L +A K + R AN AD+ +D S S +GA L+ + N
Sbjct: 70 AILDNANFTQSILSRAKLSKVDLRLANLREADLNWADLSASNLSGADLQNTQLDQINLEH 129
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+L+ L+ L EANL L+ T T ++L A +E A+ +A +
Sbjct: 130 ANLNHALLMGAQLMEANLCRTNLIATNFTGANLREANLEQANLQEATL 177
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A+L A ++ + R+A+ A ++E + + A L A KA+ +GA L D
Sbjct: 220 ADLSRANLTGAYLLRASLRKADLLRAVLQEVYLLRTDLSEANLRGADLRKADLSGAYLKD 279
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYA 217
TL+ L+ A L + L+RT L R++L G I + +DL+ + C+Y
Sbjct: 280 TLLSEANLSGAYLLESYLIRTKLDRAELTGCCIHQWHLEE--VDLSYVE--CRYV 330
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 5/84 (5%)
Query: 120 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAV 179
R AN AD+ + +G+ + A E A +AN TGA L R L +A+L AV
Sbjct: 192 LRGANLRQADLHRAILTGADMSEANCEGADLSRANLTGAYLL-----RASLRKADLLRAV 246
Query: 180 LVRTVLTRSDLGGAIIEGADFSDA 203
L L R+DL A + GAD A
Sbjct: 247 LQEVYLLRTDLSEANLRGADLRKA 270
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 49/106 (46%), Gaps = 20/106 (18%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFTG 157
A ADL +A+ T ADM E +D S + GAYL +A KA+
Sbjct: 195 ANLRQADLHRAI----------LTGADMSEANCEGADLSRANLTGAYLLRASLRKADLLR 244
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
A L + + R L+EANL A L ++DL GA ++ S+A
Sbjct: 245 AVLQEVYLLRTDLSEANLRGA-----DLRKADLSGAYLKDTLLSEA 285
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LRK N AN + ++F+G+ + A L A+ ANFT + L
Sbjct: 23 SGANLTGANLRKINLTGANLSGANLSWCCFSHANFTGANLHQANLHSAILDNANFTQSIL 82
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
S + ++ L ANL A L +DL + + GAD + +D
Sbjct: 83 SRAKLSKVDLRLANLREA-----DLNWADLSASNLSGADLQNTQLD 123
>gi|428215789|ref|YP_007088933.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004170|gb|AFY85013.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 222
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 5/81 (6%)
Query: 130 MRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT-----NAVLVRTV 184
+ ++ S + +GA L+ A AN +GA+LS+ M + L+EANLT NA L +
Sbjct: 65 LNGANLSNANLSGALLKDAKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENAL 124
Query: 185 LTRSDLGGAIIEGADFSDAVI 205
+++ DL GA + GAD DA+I
Sbjct: 125 MSKVDLTGADLTGADLIDAII 145
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANF 155
S A A L+ A N AN ++A+M E++ +G+ + A LE A+ K +
Sbjct: 71 SNANLSGALLKDAKLQTANLSGANLSNAEMSGITLSEANLTGANLSNAELENALMSKVDL 130
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
TGADL+ + ++++ANL+NA + + L ++ L + + GADFS
Sbjct: 131 TGADLTGADLIDAIISDANLSNASVTQAQLKKAILSRSNLSGADFS 176
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 5/81 (6%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADM-----RESDFSGSKFNGAYLEKAVAYKANF 155
+ A +A+L A+ K + A+ T AD+ +++ S + A L+KA+ ++N
Sbjct: 111 TGANLSNAELENALMSKVDLTGADLTGADLIDAIISDANLSNASVTQAQLKKAILSRSNL 170
Query: 156 TGADLSDTLMDRMVLNEANLT 176
+GAD S + M L +ANLT
Sbjct: 171 SGADFSSSSMRDTKLADANLT 191
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 5/81 (6%)
Query: 125 FTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTV 184
+ D+ D GS NGA L A N +GA L D + L+ ANL+NA +
Sbjct: 50 LSGVDLSGKDLYGSALNGANLSNA-----NLSGALLKDAKLQTANLSGANLSNAEMSGIT 104
Query: 185 LTRSDLGGAIIEGADFSDAVI 205
L+ ++L GA + A+ +A++
Sbjct: 105 LSEANLTGANLSNAELENALM 125
>gi|428211194|ref|YP_007084338.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|427999575|gb|AFY80418.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 190
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 5/94 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L A N AN + AD+ +D + + GA L A + +FTGA+L R
Sbjct: 70 ANLSGANLTGANLTGANLSGADLSGADLTDADLGGADLSYATLHYTDFTGANLF-----R 124
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+L +A L +A LVR L ++L GAI+EGA FS
Sbjct: 125 AMLVDAKLNHAKLVRVRLRSANLNGAIVEGAIFS 158
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%)
Query: 117 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
+ +FR + AD+ ++ SG F L A ++A G D + + L ANL+
Sbjct: 14 ERDFRDTDLFRADLSNAELSGVSFFRTSLFGANLFRAKLIGCDFFRSTLIGANLYCANLS 73
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDA 203
A L LT ++L GA + GAD +DA
Sbjct: 74 GANLTGANLTGANLSGADLSGADLTDA 100
>gi|75911106|ref|YP_325402.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704831|gb|ABA24507.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 268
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 70/161 (43%), Gaps = 39/161 (24%)
Query: 73 VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 132
VA+ S +I ADL + F IG A F A+LR A+ + N +F+SAD+R+
Sbjct: 93 VANLSQSILTQADL------SHAHF-IG--ADFSGANLRGAIVAEANLIGTDFSSADLRD 143
Query: 133 SDFSGSKF------------------------------NGAYLEKAVAYKANFTGADLSD 162
+D +G+K GAYL KA YKAN A L
Sbjct: 144 ADLAGAKLIRSNLCFANLIAANLIAADFSEANLYQAEVMGAYLYKANFYKANLHKAHLGG 203
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ R L A+L A L LT ++L GA + GA+ A
Sbjct: 204 AYLFRANLTAADLRGADLAWANLTSANLAGANLSGANLRGA 244
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 62/126 (49%), Gaps = 17/126 (13%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANF----------TSADMRESDFSGSKFNGAYLEKAVA 150
S A SA+L +A + N ANF T AD+ + F G+ F+GA L A+
Sbjct: 67 SGADLSSANLYQAKISEANLSAANFSVANLSQSILTQADLSHAHFIGADFSGANLRGAIV 126
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
+AN G D S L +A+L A L+R+ L ++L A + ADFS+A +L Q
Sbjct: 127 AEANLIGTDFSSA-----DLRDADLAGAKLIRSNLCFANLIAANLIAADFSEA--NLYQA 179
Query: 211 QALCKY 216
+ + Y
Sbjct: 180 EVMGAY 185
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Query: 115 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
++ + AN ++R ++ G+ + L A+ +AN +GADLS + + ++EAN
Sbjct: 26 QIEPDLSTANLQENNLRGANLEGTNLSRVDLSHALLVRANLSGADLSSANLYQAKISEAN 85
Query: 175 LTNAV-----LVRTVLTRSDLGGAIIEGADFSDA 203
L+ A L +++LT++DL A GADFS A
Sbjct: 86 LSAANFSVANLSQSILTQADLSHAHFIGADFSGA 119
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A +LR A N R + + A + ++ SG+ + A L +A +AN + A+
Sbjct: 32 STANLQENNLRGANLEGTNLSRVDLSHALLVRANLSGADLSSANLYQAKISEANLSAANF 91
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE-----GADFSDA 203
S + + +L +A+L++A + + ++L GAI+ G DFS A
Sbjct: 92 SVANLSQSILTQADLSHAHFIGADFSGANLRGAIVAEANLIGTDFSSA 139
Score = 37.0 bits (84), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 10/104 (9%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
AA F A+L +A + +ANF A++ ++ G AYL ++AN T ADL
Sbjct: 168 AADFSEANLYQAEVMGAYLYKANFYKANLHKAHLGG-----AYL-----FRANLTAADLR 217
Query: 162 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L ANL A L L ++L GA + G + + ++
Sbjct: 218 GADLAWANLTSANLAGANLSGANLRGANLKGANLNGVNLQETIM 261
>gi|257061367|ref|YP_003139255.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591533|gb|ACV02420.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 371
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 10/136 (7%)
Query: 72 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 131
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 132 ES-----DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT 186
E+ FSG+ +GAYL A KA+F A L+ + L EANL A L+
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADFHRASLAVANLIGANLTEANLREANLI----- 332
Query: 187 RSDLGGAIIEGADFSD 202
++L GA ++ A F +
Sbjct: 333 DANLSGATVKDAKFGE 348
>gi|359462469|ref|ZP_09251032.1| periplasmic binding protein/LacI transcriptional regulator
[Acaryochloris sp. CCMEE 5410]
Length = 702
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 20/126 (15%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A K N + +N ++ E++ G+ GA LE A AN GA+
Sbjct: 218 SHANLEQANLAHANLEKANLKGSNLKGINLSEANLQGANLQGANLEGANLEGANLQGANF 277
Query: 161 SDTLMDRMVLNEANLT--------------------NAVLVRTVLTRSDLGGAIIEGADF 200
+D ++ + +LN AN T +A+L RT L +++L +I++G+D
Sbjct: 278 TDAVLHKSLLNNANFTKANLTRAKMHQVQGIWTKFNHAILHRTDLYQANLNRSILKGSDL 337
Query: 201 SDAVID 206
A ++
Sbjct: 338 YKANLE 343
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 56/122 (45%), Gaps = 7/122 (5%)
Query: 81 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 140
S L +N EA +G A A+L A N + ANFT A + +S + + F
Sbjct: 240 SNLKGINLSEANLQG-------ANLQGANLEGANLEGANLQGANFTDAVLHKSLLNNANF 292
Query: 141 NGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A L +A ++ + ++ R L +ANL ++L + L +++L + ++ DF
Sbjct: 293 TKANLTRAKMHQVQGIWTKFNHAILHRTDLYQANLNRSILKGSDLYKANLENSSLQSVDF 352
Query: 201 SD 202
D
Sbjct: 353 LD 354
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 41/72 (56%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
+ G + A LE+A AN A+L + + + L+EANL A L L ++L GA
Sbjct: 211 EHQGIDLSHANLEQANLAHANLEKANLKGSNLKGINLSEANLQGANLQGANLEGANLEGA 270
Query: 194 IIEGADFSDAVI 205
++GA+F+DAV+
Sbjct: 271 NLQGANFTDAVL 282
>gi|126659170|ref|ZP_01730309.1| pentapeptide repeat family protein [Cyanothece sp. CCY0110]
gi|126619577|gb|EAZ90307.1| pentapeptide repeat family protein [Cyanothece sp. CCY0110]
Length = 301
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 60/139 (43%), Gaps = 28/139 (20%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTG 157
A F A L+ K N + ANF++A ++ DFS + KFNG+ L K N TG
Sbjct: 151 ANFEEAKLKNINFSKANLKNANFSNAKLQNIDFSEANLYEVKFNGSDLYKIDFRDKNLTG 210
Query: 158 ADLS--------------------DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEG 197
D S D + + L ANLTNA L L + L GAI++G
Sbjct: 211 GDFSGADFWNVNLDNANLTDTNFSDANLKVINLKNANLTNADLSVANLAHAKLEGAILDG 270
Query: 198 ADFSDAVIDLAQKQALCKY 216
A+ A I + LC Y
Sbjct: 271 ANLEGAAI---RGTVLCDY 286
>gi|186684179|ref|YP_001867375.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466631|gb|ACC82432.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 223
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 58/129 (44%), Gaps = 17/129 (13%)
Query: 76 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 135
C+ + L D N A G A +ADL +A + N + ANF AD+ + +
Sbjct: 104 CNLTGAMLKDANLQAANLEG-------ANLQNADLERANLQQTNLQGANFQGADLGKVNL 156
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
G+ GA L A KAN GA+L ANL A L +T LT +++ G +
Sbjct: 157 LGANLLGANLFDADLEKANLLGANLQ----------MANLQGADLEKTNLTNANIQGVNL 206
Query: 196 EGADFSDAV 204
G D DA+
Sbjct: 207 MGVDLEDAI 215
>gi|24213719|ref|NP_711200.1| hypothetical protein LA_1019 [Leptospira interrogans serovar Lai
str. 56601]
gi|386073300|ref|YP_005987617.1| hypothetical protein LIF_A0826 [Leptospira interrogans serovar Lai
str. IPAV]
gi|418709761|ref|ZP_13270547.1| NifU-like N-terminal domain protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|24194537|gb|AAN48218.1| hypothetical protein LA_1019 [Leptospira interrogans serovar Lai
str. 56601]
gi|353457089|gb|AER01634.1| hypothetical protein LIF_A0826 [Leptospira interrogans serovar Lai
str. IPAV]
gi|410769996|gb|EKR45223.1| NifU-like N-terminal domain protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|456968595|gb|EMG09774.1| NifU-like N-terminal domain protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 263
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
+ VK + R + +S + + F G F+GA L A ++F GA+ S + LN A
Sbjct: 141 LKVKGSLRDEDLSSIILEKLKFDGVDFSGANLGHAFLQNSSFVGANFSGAKLRGSFLNNA 200
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID----LAQKQ 211
+L N+ L + L GA +EGADF+DA+ D L QKQ
Sbjct: 201 DLRNSNFRGADLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|409989952|ref|ZP_11273410.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291567017|dbj|BAI89289.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409939186|gb|EKN80392.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 274
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ A+L A NF RAN T A MR S N A L A +ANFT A+
Sbjct: 70 AQLADANLISANLTDANFSRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFIG 129
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+L N+ L+RT L +++L GA ++GA+ ++ ++
Sbjct: 130 ----------AHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 83 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
LAD N A T F S A A +R ++ AN T A++ E++F+ + F
Sbjct: 72 LADANLISANLTDANF---SRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFI 128
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIE 196
GA+L + + N A+LS +D ANLTN ++ + L + + L GA++
Sbjct: 129 GAHLVNSTLIRTNLLKANLSGANLD-----GANLTNVIMRDSTLEGANLSNATLSGAMLM 183
Query: 197 GADFSDA 203
GA+F A
Sbjct: 184 GANFHQA 190
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADL + V + AN + A++R ++ S + GA + +A Y+ ++LS
Sbjct: 185 ANFHQADLSRVTMVGADLTDANLSEANLRAANISWTSLRGANMSRARLYRTKLNWSNLSG 244
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAII 195
NL AV++ TVL R+ DL GAI+
Sbjct: 245 V----------NLIEAVMLDTVLYRANLRDADLRGAIL 272
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A RG I A+L A + NF ANF A + S + L KA
Sbjct: 95 ASMRGS--ISKNVTLNMANLTDANLAEANFTEANFIGAHLVNSTLIRTN-----LLKANL 147
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVI 205
AN GA+L++ +M L ANL+NA L +L ++DL + GAD +DA +
Sbjct: 148 SGANLDGANLTNVIMRDSTLEGANLSNATLSGAMLMGANFHQADLSRVTMVGADLTDANL 207
Query: 206 DLAQKQA 212
A +A
Sbjct: 208 SEANLRA 214
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 63/133 (47%), Gaps = 10/133 (7%)
Query: 75 SCSSNISALADLNKYEAE-TRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADM 130
+ + N++ L D N EA T F IG+ + +L KA N AN T+ M
Sbjct: 104 NVTLNMANLTDANLAEANFTEANF-IGAHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL 190
R+S G+ + A L A+ ANF ADLS R+ + A+LT+A L L +++
Sbjct: 163 RDSTLEGANLSNATLSGAMLMGANFHQADLS-----RVTMVGADLTDANLSEANLRAANI 217
Query: 191 GGAIIEGADFSDA 203
+ GA+ S A
Sbjct: 218 SWTSLRGANMSRA 230
>gi|326506328|dbj|BAJ86482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 181
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 58/121 (47%), Gaps = 6/121 (4%)
Query: 105 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTL 164
+G R ++F D + S + F GA L A + A+ TGADLSD
Sbjct: 61 YGQQVTRGQDLTGKDFSGQTLIKQDFKTSILRQTNFKGANLLGASFFDADLTGADLSDAD 120
Query: 165 M---DRMVLN--EANLTNAVLVRTVLT-RSDLGGAIIEGADFSDAVIDLAQKQALCKYAN 218
+ D + N + NLTNA L ++T + G+ I GADF+D + Q+ LCK A+
Sbjct: 121 LRNADFSLANVTKVNLTNANLEGALVTGNTSFKGSNIYGADFTDVPLRDDQRDYLCKIAD 180
Query: 219 G 219
G
Sbjct: 181 G 181
>gi|443478641|ref|ZP_21068371.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443016050|gb|ELS30796.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 341
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 60/105 (57%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A LR+A + + AN AD+ + FSG+ +GA L+ +++ + A+L
Sbjct: 99 SGAYLSRAHLREACLQRCDLSLANLQGADLTNAYFSGANLSGADLD-----ESDLSNANL 153
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++T + +L+ ANLTNA L R+ LT ++L A + ++ SD+ I
Sbjct: 154 NETNLSNAILSNANLTNADLRRSDLTNANLEYANLSNSNLSDSKI 198
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 63/128 (49%), Gaps = 25/128 (19%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-------- 152
S A A+L + VK N R A+ A++++++ SG+ + A+L +A +
Sbjct: 64 SNANLLGANLSSSDLVKANLREADLYKANLKDAEVSGAYLSRAHLREACLQRCDLSLANL 123
Query: 153 ------------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAII 195
AN +GADL ++ + LNE NL+NA+L LT RSDL A +
Sbjct: 124 QGADLTNAYFSGANLSGADLDESDLSNANLNETNLSNAILSNANLTNADLRRSDLTNANL 183
Query: 196 EGADFSDA 203
E A+ S++
Sbjct: 184 EYANLSNS 191
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A F AN + AD+ ESD S + N L A+ AN T ADL
Sbjct: 119 SLANLQGADLTNAY-----FSGANLSGADLDESDLSNANLNETNLSNAILSNANLTNADL 173
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+ + L ANL+N+ L + + ++L A ++ D S+ I+
Sbjct: 174 RRSDLTNANLEYANLSNSNLSDSKICTANLSHANLQECDLSNTTIN 219
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 66/147 (44%), Gaps = 30/147 (20%)
Query: 92 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS---------------ADMRESDFS 136
ET I S A +ADLR++ N AN ++ A+++E D S
Sbjct: 155 ETNLSNAILSNANLTNADLRRSDLTNANLEYANLSNSNLSDSKICTANLSHANLQECDLS 214
Query: 137 GSKFNGAYLEKA-----VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT-------- 183
+ N + L A + AN + A+LS T + VL+ A L+NA L R+
Sbjct: 215 NTTINQSNLSHANLADSILSNANLSNANLSYTNLKNAVLSNAILSNADLSRSNLEDTILS 274
Query: 184 --VLTRSDLGGAIIEGADFSDAVIDLA 208
+L+ ++L GAI+ GA A +D A
Sbjct: 275 DAILSNANLSGAILTGAQLVSAKLDAA 301
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 10/117 (8%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
N++ E S A+LR+A N R N ++A++ ++ S S A L
Sbjct: 25 NRWRLENPETIPNLSGTNLRRANLREADLSGVNLRWTNLSNANLLGANLSSSDLVKANLR 84
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+A YKAN A++S + R L EA L R DL A ++GAD ++A
Sbjct: 85 EADLYKANLKDAEVSGAYLSRAHLREA----------CLQRCDLSLANLQGADLTNA 131
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 52/113 (46%), Gaps = 5/113 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S A DL + N AN ++A++ ++ S + A L A+ A+
Sbjct: 204 SHANLQECDLSNTTINQSNLSHANLADSILSNANLSNANLSYTNLKNAVLSNAILSNADL 263
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLA 208
+ ++L DT++ +L+ ANL+ A+L L + L A + G + A + LA
Sbjct: 264 SRSNLEDTILSDAILSNANLSGAILTGAQLVSAKLDAAFLIGTNLVKANLRLA 316
>gi|21673746|ref|NP_661811.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
gi|21646871|gb|AAM72153.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
Length = 382
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 51/92 (55%), Gaps = 5/92 (5%)
Query: 117 KENFRRANFTSADMRESDFSGSKF-----NGAYLEKAVAYKANFTGADLSDTLMDRMVLN 171
K +F + + ADMR+SDF S+F +GA L+ +V + FTGAD++ + +
Sbjct: 24 KIDFSQTSLAGADMRQSDFGRSEFRDADLSGAKLDGSVLAGSRFTGADMNQASLAGALCA 83
Query: 172 EANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
++ + A + TVL R+D G A + G D S A
Sbjct: 84 GSDFSGAKMASTVLRRADCGEAKLRGTDLSGA 115
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S AD+R++ + FR A+ + A + S +GS+F GA + +A A G+D
Sbjct: 28 SQTSLAGADMRQSDFGRSEFRDADLSGAKLDGSVLAGSRFTGADMNQASLAGALCAGSDF 87
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S M VL A+ A L T L+ +DL A +E AD S A
Sbjct: 88 SGAKMASTVLRRADCGEAKLRGTDLSGADLREANLEHADLSRA 130
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
AD+ +D + + LEKA AN GADL + R L +A+L A L R
Sbjct: 283 ADLHGADLEKASLKRSDLEKADLKSANLRGADLRSANLQRADLRQADLRGANLWLANTGR 342
Query: 188 SDLGGAIIEGADFSDAVIDLAQK 210
++ GAI+ S+ V+D +K
Sbjct: 343 AEFEGAIVS----SETVLDTGKK 361
>gi|119493870|ref|ZP_01624435.1| hypothetical protein L8106_09096 [Lyngbya sp. PCC 8106]
gi|119452382|gb|EAW33573.1| hypothetical protein L8106_09096 [Lyngbya sp. PCC 8106]
Length = 459
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 10/113 (8%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-----G 157
A DLRKA + N ++D+R++DF+ + + + L A A+FT G
Sbjct: 312 ANLEGIDLRKANLTGASLLEVNLQNSDLRQADFTRANLDDSNLSNADLRSADFTQASLQG 371
Query: 158 ADLSDTLM-----DRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
D +DT + R L +ANL N L + LT+ +L GA + GAD S AV+
Sbjct: 372 VDFTDTDLRGIDFTRANLTQANLENVNLSQAELTKVNLEGANLCGADLSHAVL 424
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 75/168 (44%), Gaps = 29/168 (17%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRAN 124
L+ A++ S N++ L N A R IG + A A+L+K V N +AN
Sbjct: 62 LSKALLCEASINLANLTRANLSGANLREATLIGVELTGANLTQANLKKVNLVGANLDQAN 121
Query: 125 FTSADMRESDFSGSKFNGAYLEKAV-----------------AY---------KANFTGA 158
T A++ ++D G++ A L+ AV AY +AN +G
Sbjct: 122 LTGANLSDADLRGAQLFTAILKGAVYSNRTLFPSEIDPILAGAYLLAPDVFLQEANLSGV 181
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
DLS + L NL A L L+R++L GA + GAD +A+++
Sbjct: 182 DLSGADLKGANLRGVNLCKANLFGVNLSRANLAGANLSGADLREALLN 229
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 51/102 (50%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F A+L + + R A+FT A ++ DF+ + G +A +AN +LS
Sbjct: 342 ADFTRANLDDSNLSNADLRSADFTQASLQGVDFTDTDLRGIDFTRANLTQANLENVNLSQ 401
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
+ ++ L ANL A L VL + +L GA ++G +F AV
Sbjct: 402 AELTKVNLEGANLCGADLSHAVLFQVNLKGANLKGVNFKQAV 443
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
R + KE + AN A +R ++F G GA L +A+ A+ ADL++ + + +L
Sbjct: 9 RLGLQQKE-LQGANLIGAQLRGANFRGLNLRGANLSEALLVYADLIEADLTEVNLSKALL 67
Query: 171 NEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
EA++ A L R L+ ++L A + G + + A
Sbjct: 68 CEASINLANLTRANLSGANLREATLIGVELTGA 100
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 60/123 (48%), Gaps = 3/123 (2%)
Query: 90 EAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 146
+ E +G IG+ A F +LR A + A+ AD+ E + S + A +
Sbjct: 14 QKELQGANLIGAQLRGANFRGLNLRGANLSEALLVYADLIEADLTEVNLSKALLCEASIN 73
Query: 147 KAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A +AN +GA+L + + + L ANLT A L + L ++L A + GA+ SDA +
Sbjct: 74 LANLTRANLSGANLREATLIGVELTGANLTQANLKKVNLVGANLDQANLTGANLSDADLR 133
Query: 207 LAQ 209
AQ
Sbjct: 134 GAQ 136
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 10/88 (11%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR-----MVLNEANLTNA 178
N + D+ ES+ SG F GA L++ + T A+L +D+ + L +ANLT A
Sbjct: 268 NLSDVDLSESNLSGVNFCGANLKRVNLKNTDLTHANLKRASLDQANLEGIDLRKANLTGA 327
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDAVID 206
L+ L SDL ADF+ A +D
Sbjct: 328 SLLEVNLQNSDL-----RQADFTRANLD 350
>gi|422302289|ref|ZP_16389652.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9806]
gi|389788514|emb|CCI15758.1| Genome sequencing data, contig C319 [Microcystis aeruginosa PCC
9806]
Length = 405
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 3/115 (2%)
Query: 89 YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
EA RG I + A A LR A N R AN ++A++ +++ G+ + A L
Sbjct: 243 IEANLRGAILIEANLRGANLRGAKLRGANLRWANLRWANLSAANLSDANLRGANLSAANL 302
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF 200
A AN GA+LS + L +ANL++A+L+ L++++L GA + GAD
Sbjct: 303 SDANLRGANLRGANLSAANLSGADLRKANLSDAILIEANLSKANLSGANLRGADL 357
>gi|384101177|ref|ZP_10002229.1| hypothetical protein W59_07424 [Rhodococcus imtechensis RKJ300]
gi|383841319|gb|EID80601.1| hypothetical protein W59_07424 [Rhodococcus imtechensis RKJ300]
Length = 201
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 55/123 (44%), Gaps = 15/123 (12%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFSGSKFNGAYLEKAVAYKA 153
I + F ADL ++ HV FR +FT ++ R F GS+F+ L V +
Sbjct: 46 IFTDCDFTGADLAESRHVGTAFRSCSFTRPTLWHSEFRNCSFLGSEFDNCRLRPMVFDEC 105
Query: 154 NFT-----GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDL-----GGAIIEGADFSDA 203
+FT GADL EANL L R VL +DL GGA +GAD A
Sbjct: 106 DFTLASLGGADLRGLDFTDCRFREANLVRTDLRRAVLRSADLFGARTGGAKFDGADLRGA 165
Query: 204 VID 206
+D
Sbjct: 166 HVD 168
>gi|209522801|ref|ZP_03271359.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|209496850|gb|EDZ97147.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
Length = 274
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ A+L A NF RAN T A MR S N A L A +ANFT A+
Sbjct: 70 AQLADANLISANLTDANFSRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFIG 129
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+L N+ L+RT L +++L GA ++GA+ ++ ++
Sbjct: 130 ----------AHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 83 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
LAD N A T F S A A +R ++ AN T A++ E++F+ + F
Sbjct: 72 LADANLISANLTDANF---SRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFI 128
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIE 196
GA+L + + N A+LS +D ANLTN ++ + L + + L GA++
Sbjct: 129 GAHLVNSTLIRTNLLKANLSGANLD-----GANLTNVIMRDSTLEGANLSNATLSGAMLM 183
Query: 197 GADFSDA 203
GA+F A
Sbjct: 184 GANFHRA 190
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADL + V + AN + A++R ++ S + GA + +A Y+ ++LS
Sbjct: 185 ANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSRARLYRTKLNWSNLSG 244
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAII 195
NL AV++ TVL R+ DL GAI+
Sbjct: 245 V----------NLIEAVMLDTVLYRANLRDADLRGAIL 272
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A RG I A+L A + NF ANF A + S + L KA
Sbjct: 95 ASMRGS--ISKNVTLNMANLTDANLAEANFTEANFIGAHLVNSTLIRTN-----LLKANL 147
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVI 205
AN GA+L++ +M L ANL+NA L +L R+DL + GAD +DA +
Sbjct: 148 SGANLDGANLTNVIMRDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANL 207
Query: 206 DLAQKQA 212
A +A
Sbjct: 208 SEANLRA 214
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A L A+ + NF RA+ + M +D + + + A L A + GA++S
Sbjct: 170 ANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSR 229
Query: 163 TLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSD 202
+ R LN +NL+ AV++ TVL R++L A + GA D
Sbjct: 230 ARLYRTKLNWSNLSGVNLIEAVMLDTVLYRANLRDADLRGAILPD 274
Score = 37.0 bits (84), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 75 SCSSNISALADLNKYEAE-TRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADM 130
+ + N++ L D N EA T F IG+ + +L KA N AN T+ M
Sbjct: 104 NVTLNMANLTDANLAEANFTEANF-IGAHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
R+S G+ + A L A+ ANF ADLS M L +ANL+ A L
Sbjct: 163 RDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANL 212
>gi|158337957|ref|YP_001519133.1| periplasmic binding protein/LacI transcriptional regulator
[Acaryochloris marina MBIC11017]
gi|158308198|gb|ABW29815.1| periplasmic binding protein/LacI transcriptional regulator,
putative [Acaryochloris marina MBIC11017]
Length = 702
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 20/126 (15%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+L A K N + +N ++ E++ G+ GA LE A AN GA+
Sbjct: 218 SHANLEQANLAHANLEKANLKGSNLKGINLSEANLQGANLQGANLEGANLEGANLQGANF 277
Query: 161 SDTLMDRMVLNEANLT--------------------NAVLVRTVLTRSDLGGAIIEGADF 200
+D ++ + +LN AN T +A+L RT L +++L +I++G+D
Sbjct: 278 TDAVLHKSLLNNANFTKANLTRAKMHQVQGIWTKFNHAILHRTDLYQANLNRSILKGSDL 337
Query: 201 SDAVID 206
A ++
Sbjct: 338 YKANLE 343
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 61/135 (45%), Gaps = 7/135 (5%)
Query: 68 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 127
LA A + + S L +N EA +G A A+L A N + ANFT
Sbjct: 227 LAHANLEKANLKGSNLKGINLSEANLQG-------ANLQGANLEGANLEGANLQGANFTD 279
Query: 128 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 187
A + +S + + F A L +A ++ + ++ R L +ANL ++L + L +
Sbjct: 280 AVLHKSLLNNANFTKANLTRAKMHQVQGIWTKFNHAILHRTDLYQANLNRSILKGSDLYK 339
Query: 188 SDLGGAIIEGADFSD 202
++L + ++ DF D
Sbjct: 340 ANLENSSLQSVDFLD 354
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 41/72 (56%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
+ G + A LE+A AN A+L + + + L+EANL A L L ++L GA
Sbjct: 211 EHQGIDLSHANLEQANLAHANLEKANLKGSNLKGINLSEANLQGANLQGANLEGANLEGA 270
Query: 194 IIEGADFSDAVI 205
++GA+F+DAV+
Sbjct: 271 NLQGANFTDAVL 282
>gi|440233072|ref|YP_007346865.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
gi|440054777|gb|AGB84680.1| uncharacterized low-complexity protein [Serratia marcescens FGI94]
Length = 846
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 43/155 (27%), Positives = 70/155 (45%), Gaps = 14/155 (9%)
Query: 63 FVSTALAAAVVASCS----SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 118
F+ T L AA + S S + + A+ +++ T S + AD A
Sbjct: 675 FMKTTLEAASFSGASLESCSWVESHAEQARFDGATLVTCAAASESVLNGADFSNAT---- 730
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
++ N +R + F+ +K + L +A A+FT A+L +L R +AN ++A
Sbjct: 731 -LKQCNLRQTPLRGARFTLAKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFSDA 789
Query: 179 VLVRTVLTRSDLGGAIIEG-----ADFSDAVIDLA 208
L+ +L +S LGGA G AD S A+ D A
Sbjct: 790 NLMGAILQKSLLGGARFNGANLFRADLSQAITDDA 824
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A+ ++DL +A +F RAN F +D R+++FS + GA L+K++ A F G
Sbjct: 749 AKLENSDLSEACCQGADFTRANLVGSLFVRSDFRQANFSDANLMGAILQKSLLGGARFNG 808
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLT 186
A+L + + + ++A N + V T
Sbjct: 809 ANLFRADLSQAITDDATSLNGAWTKRVKT 837
>gi|428320925|ref|YP_007118807.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244605|gb|AFZ10391.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 214
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G ADL +A+ V+ N RA A++ ++D S KA +A GA+L
Sbjct: 68 SKADLGGADLTEALLVEANLNRAELMGANLSKADLS----------KASLIQATLIGANL 117
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
S + + R L+ NL L R VLT DL GA + D S A
Sbjct: 118 SRSTLSRADLHGVNLYGVNLRRAVLTECDLIGANLSKVDLSGA 160
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 59/117 (50%), Gaps = 2/117 (1%)
Query: 89 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 148
YEAE G + A G A L KA + NF +AN ++ ++D G+ A L +A
Sbjct: 28 YEAELIGA-NLYEADLIG-AHLSKAKLNRVNFGKANLCKINLSKADLGGADLTEALLVEA 85
Query: 149 VAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+A GA+LS + + L +A L A L R+ L+R+DL G + G + AV+
Sbjct: 86 NLNRAELMGANLSKADLSKASLIQATLIGANLSRSTLSRADLHGVNLYGVNLRRAVL 142
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL KA ++ AN + + + +D G G L +AV + + GA+LS
Sbjct: 95 ANLSKADLSKASLIQATLIGANLSRSTLSRADLHGVNLYGVNLRRAVLTECDLIGANLSK 154
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFS 201
+ L A+L A L +L+ SDL GA + GA+ +
Sbjct: 155 VDLSGADLMGASLIRADLTEAILSASDLSGANLLGANLT 193
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 5/74 (6%)
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
E +FSG + YL +A AN ADL + + LN N A L + L+++DLG
Sbjct: 14 ERNFSGVYLHEVYLYEAELIGANLYEADLIGAHLSKAKLNRVNFGKANLCKINLSKADLG 73
Query: 192 GAIIEGADFSDAVI 205
GAD ++A++
Sbjct: 74 -----GADLTEALL 82
>gi|254413874|ref|ZP_05027643.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179471|gb|EDX74466.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 359
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 62/117 (52%), Gaps = 12/117 (10%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN----------GAYLEKAVAYK 152
A A+LR AV + +AN A++ +++ G+ N GA+L +A Y
Sbjct: 92 ADLRQANLRGAVLSNADLTQANLEGANLTDANLEGTTLNYANLKMVDLRGAHLYQAYLYA 151
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
AN + A L + + L EANL A ++R L +++L GA ++GA+ S++ D++Q
Sbjct: 152 ANVSEAKLRGANLGKTDLREANLKQASIIRAYLGQANLQGADLDGANLSES--DMSQ 206
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 90/200 (45%), Gaps = 34/200 (17%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKE-----------NFRRANFTSADMRESDF 135
N EA+ RG A G DLR+A ++K+ N + A+ A++ ESD
Sbjct: 153 NVSEAKLRG-------ANLGKTDLREA-NLKQASIIRAYLGQANLQGADLDGANLSESDM 204
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAII 195
S +K N A L A+F+ +DLS + R + A+L A L L R++L GA +
Sbjct: 205 SQAKLNRAKLRNTQLRNADFSLSDLSQATLIRANASHAHLIRANLRGADLIRTNLTGADL 264
Query: 196 EGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGN-SRRNAYGS---------- 244
+GAD S A + LA L N TN I + LG N SR N +
Sbjct: 265 QGADLSLADLSLANLY-LANLGN-TNLIRANLSIAELGGANLSRANLNQADLRGANVENA 322
Query: 245 --PSSPLLSAPPQKLLDRDG 262
S+P LS +++L R G
Sbjct: 323 EFASNPGLSEEMKRVLKRRG 342
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 25/135 (18%)
Query: 72 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 131
VVA+ + N++ LA + G+ A F ++LR N +AN AD+
Sbjct: 16 VVAAETDNLAQLAAM----------VGLNLARDFAESNLRDTNLKGANLVKANLRGADLH 65
Query: 132 ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLG 191
++ ++ GA L A +AN GADL +ANL A VL+ +DL
Sbjct: 66 GANLMKARLCGADLRGADLIQANLCGADLR----------QANLRGA-----VLSNADLT 110
Query: 192 GAIIEGADFSDAVID 206
A +EGA+ +DA ++
Sbjct: 111 QANLEGANLTDANLE 125
>gi|386720786|ref|YP_006187111.1| Pentapeptide repeat-containing protein [Paenibacillus mucilaginosus
K02]
gi|384087910|gb|AFH59346.1| Pentapeptide repeat-containing protein [Paenibacillus mucilaginosus
K02]
Length = 201
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 61/122 (50%), Gaps = 10/122 (8%)
Query: 104 QFGSADL-----RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
+FG A+L R+ + +F +A + + D+S + L K F A
Sbjct: 73 KFGGANLFVSKFRECKMIGSDFAKAQLDGITIEQGDWSYTNLRQTNLGKQDLRNVKFMEA 132
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-----SDAVIDLAQKQAL 213
DLS+ +++ L EA+LT A+L R L+ SDL GA ++G DF A +DLAQ AL
Sbjct: 133 DLSECNLEKANLREADLTRALLGRARLSGSDLRGAKMDGVDFRAMDVKGARMDLAQAVAL 192
Query: 214 CK 215
+
Sbjct: 193 AR 194
>gi|443314200|ref|ZP_21043780.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786200|gb|ELR95960.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 185
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 73/146 (50%), Gaps = 28/146 (19%)
Query: 99 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGA 158
I S A ADL+ A N + A T AD+RE++ S + AYL+ +AN TGA
Sbjct: 43 ILSGADLKGADLKGA-----NLKVATLTGADLREANLSKANLMLAYLD-----EANLTGA 92
Query: 159 DLSDTLMD-----RMVLNEANLTNAVLVRTVLTRSD-----LGGAIIEGADFSDAVIDLA 208
+LS++ M+ + L+ ANL+NA + + L +D LGGAI+ A +
Sbjct: 93 NLSNSQMNGAQMPHVNLHGANLSNAEMTQVNLLEADLSDANLGGAIMLSVKLGTANL--- 149
Query: 209 QKQALCKYANGTNPITGVSTRKSLGC 234
K A K AN + GV+ ++L C
Sbjct: 150 -KGANLKGAN----LRGVNRSQALFC 170
>gi|428776740|ref|YP_007168527.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691019|gb|AFZ44313.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 157
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 25/121 (20%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVA 150
S A +ADL +A + N RAN T+ D+ ++D G+ GA L +A+
Sbjct: 38 SEADLSNADLSQATLCRSNLSRANLTNTDLNQADLRSANLSQVNLIGASLVGAKLGRAIL 97
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQK 210
A+ GADLSD A+LT A LT ++L GA++ GA+ D ++ A
Sbjct: 98 TGADLRGADLSD----------ADLTGA-----NLTDAELSGAVLTGANIEDVELEKAAT 142
Query: 211 Q 211
+
Sbjct: 143 E 143
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 178
N + A D+ E+D S + + A L ++ +AN T DL+ + L++ NL A
Sbjct: 26 NLKSAYLEEIDLSEADLSNADLSQATLCRSNLSRANLTNTDLNQADLRSANLSQVNLIGA 85
Query: 179 VLVRTVLTRSDLGGAIIEGADFSDA 203
LV L R+ L GA + GAD SDA
Sbjct: 86 SLVGAKLGRAILTGADLRGADLSDA 110
>gi|418019711|ref|ZP_12659144.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
gi|347604938|gb|EGY29471.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
Length = 381
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 1/104 (0%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S DL K + N +AN T A++RE D +G+ GA LE+A +A ADL
Sbjct: 76 SHTYLAGLDLSKMDLSRVNLEKANLTGANLREMDLTGANLTGANLERARLVRAILEWADL 135
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 204
++ + +L +A+L A+L L R+ + GA + D +D+V
Sbjct: 136 TNANLFEAILLDASLNGAILKNANLERTFVEGAHMSTVD-TDSV 178
>gi|376007406|ref|ZP_09784602.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423063332|ref|ZP_17052122.1| pentapeptide repeat-containing protein [Arthrospira platensis C1]
gi|375324195|emb|CCE20355.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406715454|gb|EKD10610.1| pentapeptide repeat-containing protein [Arthrospira platensis C1]
Length = 274
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
AQ A+L A NF RAN T A MR S N A L A +ANFT A+
Sbjct: 70 AQLADANLISANLTDANFSRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFIG 129
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
A+L N+ L+RT L +++L GA ++GA+ ++ ++
Sbjct: 130 ----------AHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 83 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 141
LAD N A T F S A A +R ++ AN T A++ E++F+ + F
Sbjct: 72 LADANLISANLTDANF---SRANLTGASMRGSISKNVTLNMANLTDANLAEANFTEANFI 128
Query: 142 GAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVL-----TRSDLGGAIIE 196
GA+L + + N A+LS +D ANLTN ++ + L + + L GA++
Sbjct: 129 GAHLVNSTLIRTNLLKANLSGANLD-----GANLTNVIMRDSTLEGANLSNATLSGAMLM 183
Query: 197 GADFSDA 203
GA+F A
Sbjct: 184 GANFHRA 190
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F ADL + V + AN + A++R ++ S + GA + +A Y+ ++LS
Sbjct: 185 ANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSRARLYRTKLNWSNLSG 244
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRS-----DLGGAII 195
NL AV++ TVL R+ DL GAI+
Sbjct: 245 V----------NLIEAVMLDTVLYRANLRDADLRGAIL 272
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 91 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 150
A RG I A+L A + NF ANF A + S + L KA
Sbjct: 95 ASMRGS--ISKNVTLNMANLTDANLAEANFTEANFIGAHLVNSTLIRTN-----LLKANL 147
Query: 151 YKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLT-----RSDLGGAIIEGADFSDAVI 205
AN GA+L++ +M L ANL+NA L +L R+DL + GAD +DA +
Sbjct: 148 SGANLDGANLTNVIMRDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANL 207
Query: 206 DLAQKQA 212
A +A
Sbjct: 208 SEANLRA 214
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A +A L A+ + NF RA+ + M +D + + + A L A + GA++S
Sbjct: 170 ANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANLRAANVSWTSLRGANMSR 229
Query: 163 TLMDRMVLNEANLT-----NAVLVRTVLTRSDLGGAIIEGADFSD 202
+ R LN +NL+ AV++ TVL R++L A + GA D
Sbjct: 230 ARLYRTKLNWSNLSGVNLIEAVMLDTVLYRANLRDADLRGAILPD 274
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 75 SCSSNISALADLNKYEAE-TRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADM 130
+ + N++ L D N EA T F IG+ + +L KA N AN T+ M
Sbjct: 104 NVTLNMANLTDANLAEANFTEANF-IGAHLVNSTLIRTNLLKANLSGANLDGANLTNVIM 162
Query: 131 RESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVL 180
R+S G+ + A L A+ ANF ADLS M L +ANL+ A L
Sbjct: 163 RDSTLEGANLSNATLSGAMLMGANFHRADLSRVTMVGADLTDANLSEANL 212
>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 443
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 15/125 (12%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSA----------DMRESDFSGSKFNGAYLEKAVA 150
++ +F ADLR+A V N +F++A D+ +D SG+ +GAY A
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378
Query: 151 YKANFTGADLS-----DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
AN GADLS D + L A+L+ A L+ ++L GA + GAD +D I
Sbjct: 379 SDANLQGADLSGAYFYDADLSGANLQGADLSGAYFYDADLSGANLQGANLNGADLTDTYI 438
Query: 206 DLAQK 210
D A+
Sbjct: 439 DRAKN 443
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 45/95 (47%), Gaps = 5/95 (5%)
Query: 120 FRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEAN 174
R AN AD+R +SDF+ + GA L A AN GADLS+ + LN
Sbjct: 168 LRGANLARADLRGTKLNQSDFTNANLAGADLRDADLTNANLAGADLSNADLTNANLNSVQ 227
Query: 175 LTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
L A L+ L +DL A + GA DA I+ A
Sbjct: 228 LVKAQLINARLVDTDLRKANLNGAYLIDANINRAN 262
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL-- 160
A ADLR + +F AN AD+R++D + + GA L A AN L
Sbjct: 171 ANLARADLRGTKLNQSDFTNANLAGADLRDADLTNANLAGADLSNADLTNANLNSVQLVK 230
Query: 161 SDTLMDRMV---LNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + R+V L +ANL A L+ + R++L G + AD + A
Sbjct: 231 AQLINARLVDTDLRKANLNGAYLIDANINRANLSGTNLSNADLTSA 276
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 56/122 (45%), Gaps = 12/122 (9%)
Query: 101 SAAQFGSADLRKAVHVKENF-RRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN----- 154
S +ADL A ++E F NF A++ DFSG NG L A AN
Sbjct: 264 SGTNLSNADLTSA-KLRETFPSNTNFCGANLSGIDFSGFILNGINLRWAKLIGANLTSTK 322
Query: 155 FTGADLSDTL-----MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQ 209
F GADL + +D + + ANL+ L L+ +DL GA + GA F DA + A
Sbjct: 323 FIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADLSDAN 382
Query: 210 KQ 211
Q
Sbjct: 383 LQ 384
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 10/109 (9%)
Query: 103 AQFGSADLRKA-----VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTG 157
A+ DLRKA + N RAN + ++ +D + +K L + NF G
Sbjct: 236 ARLVDTDLRKANLNGAYLIDANINRANLSGTNLSNADLTSAK-----LRETFPSNTNFCG 290
Query: 158 ADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
A+LS +LN NL A L+ LT + GA + A+F A +D
Sbjct: 291 ANLSGIDFSGFILNGINLRWAKLIGANLTSTKFIGADLREANFVGANLD 339
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 20/119 (16%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY----------K 152
A ADLR A + AN AD+ +D + + N L KA K
Sbjct: 191 ANLAGADLRDA-----DLTNANLAGADLSNADLTNANLNSVQLVKAQLINARLVDTDLRK 245
Query: 153 ANFTGADLSDTLMDRMVLNEANLTNAVLV-----RTVLTRSDLGGAIIEGADFSDAVID 206
AN GA L D ++R L+ NL+NA L T + ++ GA + G DFS +++
Sbjct: 246 ANLNGAYLIDANINRANLSGTNLSNADLTSAKLRETFPSNTNFCGANLSGIDFSGFILN 304
>gi|241663874|ref|YP_002982234.1| pentapeptide repeat-containing protein [Ralstonia pickettii 12D]
gi|240865901|gb|ACS63562.1| pentapeptide repeat protein [Ralstonia pickettii 12D]
Length = 277
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A ADL A A AD+ +D SG+ +GAYL +GADL
Sbjct: 82 SGAYLSGADLSGAYLSDAYLSGAYLRGADLSGADLSGADLSGAYL----------SGADL 131
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA-VIDLAQKQALCKYANG 219
S + L A L++A L L+ +DL GA + GAD SDA VI+ ++ YA
Sbjct: 132 SGAYLSDAYLRGAYLSDAYLSDADLSGADLSGAYLSGADLSDAPVIENIHQKV---YAAA 188
Query: 220 TNP 222
+ P
Sbjct: 189 SQP 191
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 110 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV---AY--KANFTGADLSDTL 164
+ +AV A+ + AD+ +D SG+ +GAYL A AY A +GADLS
Sbjct: 26 VEQAVKGGAYLSGADLSGADLSGADLSGAYLSGAYLSDAYLRGAYLSGAYLSGADLSGAY 85
Query: 165 MDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+ L+ A L++A L L +DL GA + GAD S A +
Sbjct: 86 LSGADLSGAYLSDAYLSGAYLRGADLSGADLSGADLSGAYL 126
>gi|222106865|ref|YP_002547656.1| hypothetical protein Avi_5902 [Agrobacterium vitis S4]
gi|221738044|gb|ACM38940.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 241
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 14/157 (8%)
Query: 54 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 113
+ +KN S + VV+ +NI+ AD G S + R+
Sbjct: 6 FQDVKNTARLASCLFSLVVVSMLGTNIAQAADCRS---------GPASKVDWSECRKRQL 56
Query: 114 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA 173
+ + R AN AD+ +D S S A LEKA +A+F GA+ DR+ A
Sbjct: 57 MLGGSDLRGANLYDADLSFTDLSNSSLQAADLEKATLIRASFAGANADGAKFDRIESYRA 116
Query: 174 NLT-----NAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
++ +A V + R +LGGA + GADF+ A +
Sbjct: 117 EMSAMSGVDASFVSAEMQRVNLGGANLAGADFTKAEL 153
>gi|334117594|ref|ZP_08491685.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333460703|gb|EGK89311.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 290
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 63/120 (52%), Gaps = 9/120 (7%)
Query: 86 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 145
L +YE R +F + A A+L A+ V NF RAN + A++ + + ++ N A L
Sbjct: 7 LKEYENGNR-DF---AGANLSGANLSGAILVGVNFSRANLSGANLSRAHLTKAELNDANL 62
Query: 146 EKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
Y+AN + A + + L +ANL+ A LV+ L R+ L GA + G++ + A++
Sbjct: 63 -----YRANLSFAKMGQARLADADLTKANLSGAFLVKAKLPRAKLSGAQLIGSNLAMAIL 117
>gi|307150160|ref|YP_003885544.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306980388|gb|ADN12269.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 215
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 3/119 (2%)
Query: 88 KYEAETRGEFGIGSAAQ---FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 144
+ E + RG +G+ Q ADLR + V AN A++ D G+ N A
Sbjct: 25 QLEIDLRGIYGLNLDLQGINLEKADLRGSYLVGAFLEGANLVGANLSGVDLKGANLNNAN 84
Query: 145 LEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L A +N +L L+ R LN L A L + L ++DL GAI+EGA ++A
Sbjct: 85 LTDAHLVGSNLREVNLKGALLTRAFLNGVYLNAANLDESDLRQADLRGAILEGASMTNA 143
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 66/155 (42%), Gaps = 8/155 (5%)
Query: 83 LADLNKYEAETRGEFGIGS--------AAQFGSADLRKAVHVKENFRRANFTSADMRESD 134
L +N +A+ RG + +G+ A DL+ A N A+ +++RE +
Sbjct: 40 LQGINLEKADLRGSYLVGAFLEGANLVGANLSGVDLKGANLNNANLTDAHLVGSNLREVN 99
Query: 135 FSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
G+ A+L AN +DL + +L A++TNA L L R GA
Sbjct: 100 LKGALLTRAFLNGVYLNAANLDESDLRQADLRGAILEGASMTNANLREADLRRCQFEGAN 159
Query: 195 IEGADFSDAVIDLAQKQALCKYANGTNPITGVSTR 229
+EG+ DA++ + L + N N V ++
Sbjct: 160 LEGSLLIDAILQDQGQDHLIIWENFYNNSNTVESK 194
>gi|425445509|ref|ZP_18825537.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389734499|emb|CCI01861.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 187
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 7/119 (5%)
Query: 76 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 135
C N +L DLN A G A ADL + N + N AD+ ++
Sbjct: 69 CDFNGISLKDLNLSSANLEG-------ANLSQADLERTNLQGANLKGTNLQGADLGKTLL 121
Query: 136 SGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAI 194
+G+ +GA L A KAN GA+L++ + + L +ANLTNA L L +D GAI
Sbjct: 122 AGADLSGANLLGADLEKANLQGANLTNANLQKADLEKANLTNARLDGANLQDADGEGAI 180
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 48/90 (53%), Gaps = 5/90 (5%)
Query: 119 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTGADLSDTLMDRMVLNEA 173
N AN + AD+ ++ G+ G L+ K + A+ +GA+L +++ L A
Sbjct: 85 NLEGANLSQADLERTNLQGANLKGTNLQGADLGKTLLAGADLSGANLLGADLEKANLQGA 144
Query: 174 NLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
NLTNA L + L +++L A ++GA+ DA
Sbjct: 145 NLTNANLQKADLEKANLTNARLDGANLQDA 174
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL--TN---AVLVRTVLTRSDLG 191
G FNG L+ AN GA+LS ++R L ANL TN A L +T+L +DL
Sbjct: 68 GCDFNGISLKDLNLSSANLEGANLSQADLERTNLQGANLKGTNLQGADLGKTLLAGADLS 127
Query: 192 GAIIEGADFSDA 203
GA + GAD A
Sbjct: 128 GANLLGADLEKA 139
>gi|298492301|ref|YP_003722478.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298234219|gb|ADI65355.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 264
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 55/106 (51%), Gaps = 5/106 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A ADL A ++ N AN A++ +D SG+ A L + Y+AN A+L++
Sbjct: 139 ANLKDADLAAAKLIRSNLSFANLVGANLITTDLSGANLYEAELMQTYLYQANLYKANLTN 198
Query: 163 TLMD-----RMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + R L+EANLTNA L LT ++L GA + GA+ A
Sbjct: 199 SHLGSSYLFRANLSEANLTNADLTCANLTGANLRGANLRGANLRGA 244
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 66/139 (47%), Gaps = 23/139 (16%)
Query: 109 DLRKAVHVKENFRRANFTSADMRESDFS----------GSKFNGAYLEKAVAYKANFTGA 158
DL A ENFR AN ++ + DFS G+ + A L +A +AN + A
Sbjct: 30 DLSTANLQGENFRGANLQGVNLTKVDFSHALLVRTNLSGANLSIANLHQAKLIEANLSEA 89
Query: 159 DLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA-----DFSDAVI---DLAQK 210
+LS + L +ANL+ L+ L+ ++L GA I GA DF +A + DLA
Sbjct: 90 NLSIANLRNATLTQANLSQVNLIGADLSEANLIGAAITGANLIGTDFRNANLKDADLAAA 149
Query: 211 QAL---CKYAN--GTNPIT 224
+ + +AN G N IT
Sbjct: 150 KLIRSNLSFANLVGANLIT 168
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 55/127 (43%), Gaps = 9/127 (7%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A A+LR A + N + N AD+ E++ G+ GA L AN ADL
Sbjct: 87 SEANLSIANLRNATLTQANLSQVNLIGADLSEANLIGAAITGANLIGTDFRNANLKDADL 146
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGT 220
+ + R L+ ANL A L+ T DL GA + A+ + QA AN T
Sbjct: 147 AAAKLIRSNLSFANLVGANLITT-----DLSGANLYEAELMQTYL----YQANLYKANLT 197
Query: 221 NPITGVS 227
N G S
Sbjct: 198 NSHLGSS 204
>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
Length = 268
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 86/198 (43%), Gaps = 52/198 (26%)
Query: 73 VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 132
VA+ S ++ ADL + F IG A F A+LR A+ + N +F+SAD+R+
Sbjct: 93 VANLSQSVLTHADL------SHAHF-IG--ADFSGANLRGAIVTEANLIGTDFSSADLRD 143
Query: 133 SDFSGSKF------------------------------NGAYLEKAVAYKANFTGADLSD 162
+D +G+K GAYL KA YKAN A L
Sbjct: 144 ADLAGAKLIRSNLCFANLIAANFIAVDFSEANLYQAEVMGAYLYKANFYKANLHQAHLGG 203
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 222
+ R L A+L A L LT ++L GA + GA+ A + NG N
Sbjct: 204 AYLFRANLTAADLRGADLAWANLTSANLAGANLSGANLRGANL------------NGAN- 250
Query: 223 ITGVSTRKSLGCGNSRRN 240
+ GV+ ++++ +SR +
Sbjct: 251 LNGVNLQETIMPDSSRHD 268
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 54/98 (55%), Gaps = 5/98 (5%)
Query: 111 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVL 170
+K ++ + +N ++R ++ +G+ + L +A+ +AN +GADLS + L
Sbjct: 22 QKNPQIEPDLSTSNLQENNLRGANLAGANLSRVDLSRALLIRANLSGADLSSANLHHAKL 81
Query: 171 NEANLTN-----AVLVRTVLTRSDLGGAIIEGADFSDA 203
+EANL+ A L ++VLT +DL A GADFS A
Sbjct: 82 SEANLSAANFSVANLSQSVLTHADLSHAHFIGADFSGA 119
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 64/160 (40%), Gaps = 39/160 (24%)
Query: 87 NKYEAETRGEFGIGSAAQFGSADLRKAVHVK--------------------ENFRRANF- 125
N E RG G A DL +A+ ++ N ANF
Sbjct: 35 NLQENNLRGANLAG--ANLSRVDLSRALLIRANLSGADLSSANLHHAKLSEANLSAANFS 92
Query: 126 ---------TSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLT 176
T AD+ + F G+ F+GA L A+ +AN G D S L +A+L
Sbjct: 93 VANLSQSVLTHADLSHAHFIGADFSGANLRGAIVTEANLIGTDFSSA-----DLRDADLA 147
Query: 177 NAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKY 216
A L+R+ L ++L A DFS+A +L Q + + Y
Sbjct: 148 GAKLIRSNLCFANLIAANFIAVDFSEA--NLYQAEVMGAY 185
>gi|418744036|ref|ZP_13300395.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
CBC379]
gi|418751631|ref|ZP_13307915.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
MOR084]
gi|409968104|gb|EKO35917.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
MOR084]
gi|410795431|gb|EKR93328.1| NifU-like N-terminal domain protein [Leptospira santarosai str.
CBC379]
Length = 263
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 124 NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRT 183
+ +S + + +F G F+GA L A ++F GA+ S + LN ANL N
Sbjct: 151 DLSSIILEKQNFDGVDFSGANLGHAFLQNSSFVGANFSSAKLRGSFLNNANLRNTNFRGA 210
Query: 184 VLTRSDLGGAIIEGADFSDAVID----LAQKQ 211
L + L GA +EGADF+DA+ D L QKQ
Sbjct: 211 DLRWAKLAGANVEGADFTDAIYDIGTRLDQKQ 242
>gi|149179551|ref|ZP_01858089.1| pentapeptide repeat domain protein [Planctomyces maris DSM 8797]
gi|148841608|gb|EDL56033.1| pentapeptide repeat domain protein [Planctomyces maris DSM 8797]
Length = 343
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 83 LADLNKYEAETRGEFGIGSAAQFGS----ADLRKAVHVKENFRRANFTSADMRESDFSGS 138
L+DL+ EA+ R + A GS A L +A + N RAN AD+ + +G+
Sbjct: 41 LSDLDLSEADLRNA-DLRDANLEGSDLSGAYLGQARLCQTNLCRANLQKADLTGGNLTGA 99
Query: 139 KFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGA 198
N A LE A ++NF+ ADL++T + L EAN NA L + L+ +DL GA ++ +
Sbjct: 100 ILNEANLEAAYLNQSNFSHADLNETKLAHTKLMEANFFNADLRKADLSGADLRGANLKWS 159
Query: 199 DFSDAVIDLAQ 209
+ S A + A+
Sbjct: 160 NLSGARLSAAE 170
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV-----AYKANFTGADLSD 162
ADLR+ + A+ +AD+R+++ GS +GAYL +A +AN ADL+
Sbjct: 34 ADLRRDNLSDLDLSEADLRNADLRDANLEGSDLSGAYLGQARLCQTNLCRANLQKADLTG 93
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ +LNEANL A L ++ + +DL
Sbjct: 94 GNLTGAILNEANLEAAYLNQSNFSHADL 121
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
+ F ADL + ANF +AD+R++D SG+ GA L+ +N +GA LS
Sbjct: 114 SNFSHADLNETKLAHTKLMEANFFNADLRKADLSGADLRGANLK-----WSNLSGARLSA 168
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDL 190
+ + L E +L++A L + T + L
Sbjct: 169 AELSKANLIETDLSDADLTEAIFTDAKL 196
>gi|427734465|ref|YP_007054009.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369506|gb|AFY53462.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 269
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A G ADLR+A N AN T A + ++ SGS +GA L A AN T +
Sbjct: 68 SEADLGEADLREANLKGANLTGANLTGATLMNANLSGSNLSGACLSGAKLSGANLTEVNF 127
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
+D ANL +A LV+ L R+DL A A+ AVI
Sbjct: 128 TD----------ANLKSASLVKAQLIRTDLTNADFTRANLHQAVI 162
>gi|427720942|ref|YP_007068936.1| RDD domain-containing protein [Calothrix sp. PCC 7507]
gi|427353378|gb|AFY36102.1| RDD domain containing protein [Calothrix sp. PCC 7507]
Length = 716
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 5/105 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 160
S A+ ADL A + + A++ ++D+ G+ +G YL+ A N + A+L
Sbjct: 563 SDAKLNEADLFAAHLGRVTAIGTQLSYANLTKTDWQGADLSGVYLDHA-----NLSNANL 617
Query: 161 SDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 205
S T + V+ ANL NA L L+ +DL GA + GADF A++
Sbjct: 618 SATRLTGAVMRSANLENANLQNADLSHADLQGANLAGADFRGAIL 662
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 29/113 (25%)
Query: 125 FTSADMRESDFSGSKFNG--------------AYLEKAVAYKANFTGADLSDTLMD---- 166
F SA++ + F GS+F A L + +AN T A+LS +M+
Sbjct: 458 FKSANLSQGSFKGSRFRSPGEDGRWDTYDDVIADLSQVEMKQANLTDANLSRVVMNRSDL 517
Query: 167 -RMVLNEANLTNAVLV-----RTVLTRSDLGGAIIE-----GADFSDAVIDLA 208
R LN ANL+N L+ T L +DL GA++E GAD SDA ++ A
Sbjct: 518 SRATLNRANLSNTRLIAANLSSTQLVGADLTGAVLENASLTGADLSDAKLNEA 570
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 61/132 (46%), Gaps = 14/132 (10%)
Query: 82 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFS 136
+ADL++ E + A A+L + V + + RA A++ + + S
Sbjct: 488 VIADLSQVEMKQ---------ANLTDANLSRVVMNRSDLSRATLNRANLSNTRLIAANLS 538
Query: 137 GSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIE 196
++ GA L AV A+ TGADLSD ++ L A+L + T L+ ++L +
Sbjct: 539 STQLVGADLTGAVLENASLTGADLSDAKLNEADLFAAHLGRVTAIGTQLSYANLTKTDWQ 598
Query: 197 GADFSDAVIDLA 208
GAD S +D A
Sbjct: 599 GADLSGVYLDHA 610
>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
Length = 261
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 60/139 (43%), Gaps = 35/139 (25%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A F S L A K + NFT AD++ +DFSG++ N A L A+ A F ADLS+
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATFADADLSN 174
Query: 163 T---------------LMD---------------RMVLNEAN-----LTNAVLVRTVLTR 187
L+D R L +AN LT A L VLT
Sbjct: 175 ARIIGGGKGVNFRNAKLIDADLGADPANQGMAPVRAELPDANFDGADLTRANLTHAVLTG 234
Query: 188 SDLGGAIIEGADFSDAVID 206
++ AI+ GA F AV+D
Sbjct: 235 ANFTAAIVSGARFDYAVLD 253
>gi|359459720|ref|ZP_09248283.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 170
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 5/100 (5%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL + ++ N R AN A++R + S A LE A NFT A L + ++
Sbjct: 45 ADLSGLILIRANLRNANLKGANLRNTSLLLSNLENANLENA-----NFTAAYLYGSNLEN 99
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDL 207
L + T AVL L +D+ A + GAD +DA +DL
Sbjct: 100 TQLTSTDFTQAVLRSAKLQGADVCTATLAGADLTDADVDL 139
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%)
Query: 134 DFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGA 193
D SG+ +G L +A AN GA+L +T + L ANL NA L S+L
Sbjct: 41 DLSGADLSGLILIRANLRNANLKGANLRNTSLLLSNLENANLENANFTAAYLYGSNLENT 100
Query: 194 IIEGADFSDAVIDLAQKQ 211
+ DF+ AV+ A+ Q
Sbjct: 101 QLTSTDFTQAVLRSAKLQ 118
>gi|386001277|ref|YP_005919576.1| Pentapeptide repeat protein [Methanosaeta harundinacea 6Ac]
gi|357209333|gb|AET63953.1| Pentapeptide repeat protein [Methanosaeta harundinacea 6Ac]
Length = 385
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 103 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 162
A A++R + + F RA F D+ SD S S F+ AYL +AN + A+L+
Sbjct: 178 AHMNWAEMRGSYLNRGQFSRAEFYGTDLSGSDLSDSDFSRAYL-----MRANLSDANLNW 232
Query: 163 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
L L EA L+ + L T L+ +DL GA + GAD +DA
Sbjct: 233 ALFAYADLTEAKLSRSTLRGTKLSYADLTGADLSGADLTDA 273
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)
Query: 101 SAAQFGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 155
S + +D +A ++ N AN F AD+ E+ S S G L A A+
Sbjct: 206 SGSDLSDSDFSRAYLMRANLSDANLNWALFAYADLTEAKLSRSTLRGTKLSYADLTGADL 265
Query: 156 TGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
+GADL+D + + L ++NL+N + R L DL G + GA DA ID
Sbjct: 266 SGADLTDADLTAIRLIKSNLSNTKMGRAYLQGLDLRGVDLSGAYLRDATID 316
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 53/104 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
A+L+KA + A+ + AD+ E D S +K GA + A A T ADL+ T +
Sbjct: 83 ANLKKANLAGADLSGADLSEADLSEVDLSEAKLWGAKISGASLVDATLTKADLTRTDITD 142
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQ 211
L A +TNA L R LT + + G + G +F A ++ A+ +
Sbjct: 143 ADLTGAEMTNARLFRADLTGATMTGVYLIGGNFVGAHMNWAEMR 186
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 50/99 (50%)
Query: 108 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDR 167
ADL A + A+ T+ + +S+ S +K AYL+ + +GA L D +DR
Sbjct: 258 ADLTGADLSGADLTDADLTAIRLIKSNLSNTKMGRAYLQGLDLRGVDLSGAYLRDATIDR 317
Query: 168 MVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVID 206
L +ANLT A L L+ ++ GA + GA+ A +D
Sbjct: 318 TYLTDANLTGADLRGATLSSVEMTGADLAGANLIRAKVD 356
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 58/112 (51%), Gaps = 10/112 (8%)
Query: 102 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 161
+A DL A + + A+ + AD+R++ + GA L+KA A+ +GADLS
Sbjct: 42 SADLSGRDLVGAHLNQSDLSGADLSGADLRDAYLRSTWLLGANLKKANLAGADLSGADLS 101
Query: 162 DTLMDRMVLNE----------ANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 203
+ + + L+E A+L +A L + LTR+D+ A + GA+ ++A
Sbjct: 102 EADLSEVDLSEAKLWGAKISGASLVDATLTKADLTRTDITDADLTGAEMTNA 153
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 129 DMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRS 188
D+R +D SG GA+L ++ A+ +GADL D + L ANL A
Sbjct: 39 DLRSADLSGRDLVGAHLNQSDLSGADLSGADLRDAYLRSTWLLGANLKKA---------- 88
Query: 189 DLGGAIIEGADFSDA---VIDLAQKQALCKYANGTNPITGVSTRKSL 232
+L GA + GAD S+A +DL++ + +G + + T+ L
Sbjct: 89 NLAGADLSGADLSEADLSEVDLSEAKLWGAKISGASLVDATLTKADL 135
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.129 0.370
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,195,212,456
Number of Sequences: 23463169
Number of extensions: 164326294
Number of successful extensions: 476375
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4073
Number of HSP's successfully gapped in prelim test: 746
Number of HSP's that attempted gapping in prelim test: 397348
Number of HSP's gapped (non-prelim): 43501
length of query: 274
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 134
effective length of database: 9,074,351,707
effective search space: 1215963128738
effective search space used: 1215963128738
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)