BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029172
(198 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
Length = 280
Score = 207 bits (528), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/173 (64%), Positives = 130/173 (75%), Gaps = 2/173 (1%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA +SISPLSIKS+N SSS+ PY L + SKP + CQ++ TE + + DCS +
Sbjct: 1 MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLA--TEREDRILDCSTTRYKV 58
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
++K KNWR VSTALAAA + + A ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 59 HHSKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRK 118
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFT ++ L+
Sbjct: 119 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLM 171
>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
Length = 275
Score = 202 bits (514), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 107/173 (61%), Positives = 127/173 (73%), Gaps = 7/173 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MA +SIS +SIKS N + P+++ +LSKP +A Q+ TE QF DCS N
Sbjct: 1 MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQLD--TERGNQFADCSKNGYEV 53
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
AK KNW VST L AA ++ S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54 ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVH+ ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANFT ++ L+
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLM 166
>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Cucumis sativus]
Length = 279
Score = 187 bits (476), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 102/173 (58%), Positives = 123/173 (71%), Gaps = 4/173 (2%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSSIS LS+K L SS S+ P L K + + QI+ + + Q DCS + G
Sbjct: 1 MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKD---QTQDCSERKHIG 56
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ K W+ VSTALAAA V SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRK
Sbjct: 57 KITEPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRK 116
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVH+ ENFRRANFTSADMRESDFSG FNGAYLEKAVAYK NF+ ++ L+
Sbjct: 117 AVHINENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLM 169
>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
Length = 261
Score = 186 bits (473), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/173 (63%), Positives = 122/173 (70%), Gaps = 21/173 (12%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L + SKP V C+I + G + C N
Sbjct: 1 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 43 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFT ++ L+
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLM 152
>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
Length = 201
Score = 186 bits (472), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 105/164 (64%), Positives = 115/164 (70%), Gaps = 19/164 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L +LSKP V C+I + E NN
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43 ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLT 145
>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
vinifera]
Length = 596
Score = 185 bits (470), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 108/164 (65%), Positives = 118/164 (71%), Gaps = 21/164 (12%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L + SKP V C+I + G + C N
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 377
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 378 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFT
Sbjct: 435 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFT 478
Score = 185 bits (469), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 105/164 (64%), Positives = 115/164 (70%), Gaps = 19/164 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MALSS+SPL I SK P L +LSKP V C+I + E NN
Sbjct: 1 MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
A+ K W+ VSTALAAAVV + S + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43 ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLT 145
>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
Length = 273
Score = 174 bits (442), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/179 (59%), Positives = 124/179 (69%), Gaps = 24/179 (13%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPDCS 54
MAL+S+SPLSI ++N SS+ +L H S P+ V CQ++S + P S
Sbjct: 2 MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRD----HPQES 55
Query: 55 NNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
K W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFG
Sbjct: 56 -----------KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFG 103
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
SADLRKAVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 104 SADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 162
>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
Length = 199
Score = 173 bits (438), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 105/173 (60%), Positives = 126/173 (72%), Gaps = 17/173 (9%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V CQI+S + + Q +
Sbjct: 2 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRD---------HRQEST 51
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ K+ VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 52 KWGKV------VSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 104
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 105 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 157
>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Glycine max]
Length = 260
Score = 165 bits (418), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 100/173 (57%), Positives = 119/173 (68%), Gaps = 22/173 (12%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S+SPLSI SL+ SSS+ H+ S P+ V ++++
Sbjct: 1 MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES---------------- 44
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
W VS LAAAV+A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45 -----TKWGKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 98
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 99 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 151
>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 147
Score = 163 bits (412), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 95/164 (57%), Positives = 110/164 (67%), Gaps = 20/164 (12%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S +PLSI S + + S + + Q+ K + P SN
Sbjct: 1 MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47 -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFT
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFT 144
>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 262
Score = 162 bits (411), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 96/173 (55%), Positives = 114/173 (65%), Gaps = 20/173 (11%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S +PLSI S + + S + + Q+ K + P SN
Sbjct: 1 MALNSFTPLSINSHHV---------SCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47 -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFT ++ L+
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLM 153
>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
Length = 232
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/108 (75%), Positives = 91/108 (84%), Gaps = 1/108 (0%)
Query: 66 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
KNW VS LAAAV+ SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV
Sbjct: 17 KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75
Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFT ++ L+
Sbjct: 76 ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLM 123
>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
At1g12250, chloroplastic-like [Glycine max]
Length = 222
Score = 158 bits (399), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 97/163 (59%), Positives = 112/163 (68%), Gaps = 23/163 (14%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL+S SPLS+ SL+ S SS + + S P V CQ +S +
Sbjct: 1 MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-------------- 44
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ V VS LAAA++A SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45 -----RQGNV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 97
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
AVHV ENFR +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF
Sbjct: 98 AVHVNENFRXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANF 140
>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
Length = 291
Score = 152 bits (384), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 81/134 (60%), Positives = 96/134 (71%), Gaps = 6/134 (4%)
Query: 40 ISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA 99
I+ K +D D Q A + KNW+ ++ ALA V+ + ++A ADLNKYEA
Sbjct: 52 ITGKISTDQHKKDA---QPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEA 105
Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
ETRGEFGIGSAAQFGSA+LRK VH ENFRRANFTSAD+RESDFSGS FNGAYLEKAVAY
Sbjct: 106 ETRGEFGIGSAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAY 165
Query: 160 KANFTVDEICLPLL 173
K NFT ++ L+
Sbjct: 166 KTNFTGADLSDTLM 179
>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
Flags: Precursor
gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 280
Score = 148 bits (374), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 98/176 (55%), Positives = 125/176 (71%), Gaps = 8/176 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
MA SS+SPL +KSL+ SSS + + L Q+SS+ S+ + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58
Query: 58 CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
C+ A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59 CSS--AESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
L K VH ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 171
>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
Length = 280
Score = 148 bits (374), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 98/176 (55%), Positives = 125/176 (71%), Gaps = 8/176 (4%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
MA SS+SPL +KSL+ SSS + + L Q+SS+ S+ + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58
Query: 58 CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
C+ A+ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59 CSS--AESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
L K VH ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 171
>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
Length = 206
Score = 145 bits (366), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 72/98 (73%), Positives = 85/98 (86%), Gaps = 1/98 (1%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRRANFTS
Sbjct: 1 MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
ADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 60 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 97
>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
lyrata]
Length = 280
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/179 (55%), Positives = 127/179 (70%), Gaps = 14/179 (7%)
Query: 1 MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ 57
MA SS+SPL +KSL+ SSS PY H PL Q+SS++ S + D SN +
Sbjct: 1 MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNS--EIKDSSNAR 55
Query: 58 ---CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
C+ ++ W+ +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+G
Sbjct: 56 EGCCS--RSESNTWKRILSAAMAAAVIASSSS-VPAMAELNRFEADTRGEFGIGSAAQYG 112
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
SADL K +H ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 113 SADLSKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 171
>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
gi|194694816|gb|ACF81492.1| unknown [Zea mays]
gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
Length = 268
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/86 (81%), Positives = 76/86 (88%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133
Query: 148 FNGAYLEKAVAYKANFTVDEICLPLL 173
FNGAYLEKAVAYKANFT ++ L+
Sbjct: 134 FNGAYLEKAVAYKANFTGADLSDTLM 159
>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
Length = 270
Score = 144 bits (362), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/86 (81%), Positives = 76/86 (88%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
+ A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS
Sbjct: 76 MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135
Query: 148 FNGAYLEKAVAYKANFTVDEICLPLL 173
FNGAYLEKAVAYKANFT ++ L+
Sbjct: 136 FNGAYLEKAVAYKANFTGADLSDTLM 161
>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
[Brachypodium distachyon]
Length = 268
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 67/86 (77%), Positives = 75/86 (87%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
+ A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV ENFRRANFTSADMRESDFSGS
Sbjct: 74 MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNENFRRANFTSADMRESDFSGST 133
Query: 148 FNGAYLEKAVAYKANFTVDEICLPLL 173
FNGAY+EKAVAYKANFT ++ L+
Sbjct: 134 FNGAYMEKAVAYKANFTGADLSDTLM 159
>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
Length = 276
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 92/173 (53%), Positives = 114/173 (65%), Gaps = 6/173 (3%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
MAL + SPL+ + C+ + + L + V+CQ + DG S + A
Sbjct: 1 MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGN--SLSTSAAAA 58
Query: 61 PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
+ WR VS ALAAA+V++ A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 59 AASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 114
Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
AVHV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFT ++ L+
Sbjct: 115 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLM 167
>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 277
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/82 (80%), Positives = 74/82 (90%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFT+ADMRES+FSGS FNGA
Sbjct: 87 ADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTAADMRESNFSGSTFNGA 146
Query: 152 YLEKAVAYKANFTVDEICLPLL 173
YLEKAVAY+ANFT ++ L+
Sbjct: 147 YLEKAVAYRANFTGADLSDTLM 168
>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
Length = 293
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 110/163 (67%), Gaps = 8/163 (4%)
Query: 11 IKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV 70
+KSL+ SSS + + L Q+SS+ S+ + D SN A+ W+
Sbjct: 1 MKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTS-----AESNTWKR 53
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
+S A AA V + SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRR
Sbjct: 54 ILSAA-MAAAVIASSSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRR 112
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+ ++ L+
Sbjct: 113 ANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 155
>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/175 (54%), Positives = 115/175 (65%), Gaps = 19/175 (10%)
Query: 1 MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPDCSNNQC 58
MAL+S SPL+ + K P L S+ L ++CQ ++ G + SN
Sbjct: 1 MALASTSPLAA-----TVARPKAPASLTRCRSRRLQRISCQATTDRSGGG---NASNTSP 52
Query: 59 AGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
A P WRV VS ALAAAVV + + A ADLNKYEA+ RGEFGIGSAAQFG+ADL
Sbjct: 53 APPR-----WRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADL 103
Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
+ VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA++ANFT ++ L+
Sbjct: 104 KNTVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLM 158
>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
Length = 196
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 61/91 (67%), Positives = 75/91 (82%)
Query: 88 ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENFRRANFTSADMRE+DFSGS
Sbjct: 1 MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60
Query: 148 FNGAYLEKAVAYKANFTVDEICLPLLVSLPM 178
FNG YLEKAVAY+ NF+ ++ L+ + +
Sbjct: 61 FNGGYLEKAVAYRTNFSGADLSDTLMDRMVL 91
>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
Length = 219
Score = 129 bits (324), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 66/104 (63%), Positives = 82/104 (78%), Gaps = 4/104 (3%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RRANFT 134
LAA V+A+ ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+ H ENF RRANFT
Sbjct: 14 LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70
Query: 135 SADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPM 178
SADMRE+DFSGS FNG YLEKAVAY+ NF+ ++ L+ + +
Sbjct: 71 SADMREADFSGSTFNGGYLEKAVAYRTNFSGADLSDTLMDRMVL 114
>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 225
Score = 122 bits (305), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 57/88 (64%), Positives = 69/88 (78%)
Query: 86 SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
++ +LADLN EA TRGEFGIGSA QFGSADL+K H ENFRR NFTSADM+E++FS
Sbjct: 24 TSTDSLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSN 83
Query: 146 SKFNGAYLEKAVAYKANFTVDEICLPLL 173
S FNGAYLEKAVAY+ NF+ ++ L+
Sbjct: 84 STFNGAYLEKAVAYRTNFSGADLSDTLM 111
>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
Length = 239
Score = 89.0 bits (219), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 43/74 (58%), Positives = 51/74 (68%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
ALADLN YEA T GEFGIGSA Q+G AD++ ++ RR+NFTSAD R + F GS
Sbjct: 51 ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110
Query: 150 GAYLEKAVAYKANF 163
GAY KAV Y+ NF
Sbjct: 111 GAYFIKAVTYRTNF 124
>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
nagariensis]
Length = 214
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/74 (56%), Positives = 51/74 (68%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
A ADLN YEAE GEFGIGSA Q+G AD++ ++ RR+NFTSAD R ++F GS
Sbjct: 26 AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85
Query: 150 GAYLEKAVAYKANF 163
GAY KAV Y+ NF
Sbjct: 86 GAYFIKAVTYRTNF 99
>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
Length = 217
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/74 (54%), Positives = 49/74 (66%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
A+ADLNKYEA GEFG G+A Q+G ADL+ E+ RR+NFT+AD R +F S
Sbjct: 29 AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88
Query: 150 GAYLEKAVAYKANF 163
GAY K+V KANF
Sbjct: 89 GAYFIKSVVPKANF 102
>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
Length = 231
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 56/88 (63%), Gaps = 5/88 (5%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF----RRANFTSADMRESDFSG 145
A+A+LN EA GEF GSA QFG DLR A +V E + R +NFT A+MR+S G
Sbjct: 39 AVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTDLRLSNFTGAEMRDSKLVG 97
Query: 146 SKFNGAYLEKAVAYKANFTVDEICLPLL 173
+K NGAYL KAVA A+FT ++ L+
Sbjct: 98 AKLNGAYLMKAVAANADFTDADLSDALM 125
>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
Length = 259
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 52/84 (61%)
Query: 90 ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
A A+LNKYE GEF +G+A Q+G AD++ ++ +R+NFT+AD R+++F SK
Sbjct: 71 ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130
Query: 150 GAYLEKAVAYKANFTVDEICLPLL 173
AY K+V +AN ++ L+
Sbjct: 131 AAYFMKSVLARANLENADLSDALM 154
>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
Length = 247
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 59/111 (53%), Gaps = 6/111 (5%)
Query: 66 KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
K V S ALA A S + A A+LN+ EA GEF GSA QFG DL K K
Sbjct: 34 KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90
Query: 126 E---NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
E + R +NFT ADMR + G+ GAY+ K VA + +FT ++ L+
Sbjct: 91 EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALM 141
>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 277
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 48/78 (61%), Gaps = 3/78 (3%)
Query: 89 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFRRANFTSADMRESDFSG 145
+A A+LN EA GEF GSA QFG DLR V + + R +NFT A+MR + G
Sbjct: 84 AAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAKLRG 143
Query: 146 SKFNGAYLEKAVAYKANF 163
+ GAYL KAVA++A+F
Sbjct: 144 ANLTGAYLMKAVAFEADF 161
>gi|427725361|ref|YP_007072638.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
gi|427357081|gb|AFY39804.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
Length = 919
Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 4/64 (6%)
Query: 109 SAAQFGSADLRK----AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A SA+LR+ A+ ++ NF AN T AD+ E++ +GS F+ A L+ AV ANFT
Sbjct: 748 SDANLSSANLRRSHLRAICLEANFTGANLTQADLCEANVTGSNFSDANLQGAVLKDANFT 807
Query: 165 VDEI 168
+ ++
Sbjct: 808 MTDL 811
>gi|332707710|ref|ZP_08427737.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332353413|gb|EGJ32926.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 285
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S A A+L +A+ + N R A+ AD+ +D +G+ GAY+ +A KAN T+ +
Sbjct: 58 SQATLTGANLSQAILREANLRGADLRGADLTGADLTGADLEGAYVNRADLRKANLTMANL 117
Query: 169 C-LPLLVSLPMATPVFPAGF 187
L V+L +P GF
Sbjct: 118 NETNLQVALYDRETTWPEGF 137
>gi|145219796|ref|YP_001130505.1| pentapeptide repeat-containing protein [Chlorobium phaeovibrioides
DSM 265]
gi|145205960|gb|ABP37003.1| pentapeptide repeat protein [Chlorobium phaeovibrioides DSM 265]
Length = 412
Score = 43.5 bits (101), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 31/54 (57%), Gaps = 5/54 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAV 157
S A F ADLR+A K FR AN +A RE+ DFSG+ GAYL +A+
Sbjct: 135 SGADFSGADLRRAECSKAGFRGANLQNAHFREASLRSVDFSGADLRGAYLWRAI 188
>gi|78033474|emb|CAJ30090.1| hypothetical acidic protein, pentapeptide repeat [Magnetospirillum
gryphiswaldense MSR-1]
gi|144901135|emb|CAM77999.1| pentapeptide repeat containing protein [Magnetospirillum
gryphiswaldense MSR-1]
Length = 503
Score = 43.5 bits (101), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A+LRKAV N R +N A + ++D SG+K GA L A +ANF+
Sbjct: 28 ANLRKAVLSGANLRDSNLPRASLEDADLSGAKLQGANLAGATLLRANFS 76
>gi|167907368|ref|ZP_02494573.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
Length = 269
Score = 43.5 bits (101), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 10/81 (12%)
Query: 84 CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
C +N+S ADL+ +A+ RG A ADLR A N AN + AD+ ++D
Sbjct: 67 CGANLSG-ADLS--DADLRG-------ADLSDADLRGADLSVANLSGANLSGADLSDADL 116
Query: 144 SGSKFNGAYLEKAVAYKANFT 164
SG+ +GAYL A AN +
Sbjct: 117 SGANLSGAYLSYANLSGANLS 137
>gi|407005745|gb|EKE21794.1| pentapeptide repeat protein [uncultured bacterium]
Length = 189
Score = 43.1 bits (100), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 9/123 (7%)
Query: 44 TESD---GQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAE 100
TE+D +F DC N+C +K+ N + + C + + ++NK+
Sbjct: 36 TETDFVGTKFIDCVFNECNFSNSKILN------CSFCNVIFKECKMSGVSFNEINKFLLV 89
Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
+ + F D++K+ ++ +F AD+ ESDFS S G + K
Sbjct: 90 WEFDNCVIKLCNFSKLDIKKSKFIQCVIHETDFVDADLSESDFSNSDLRGCKFQNTNLSK 149
Query: 161 ANF 163
NF
Sbjct: 150 VNF 152
>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 447
Score = 43.1 bits (100), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Query: 91 LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
LAD N ++ RG IG++ ADLR+A + + R AN AD+RE+D +G+
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS 386
Query: 148 FNGAYLEKAVAYKANFT 164
N L +A + T
Sbjct: 387 LNQVNLAEADLRGVDLT 403
>gi|338740277|ref|YP_004677239.1| hypothetical protein HYPMC_3462 [Hyphomicrobium sp. MC1]
gi|337760840|emb|CCB66673.1| protein of unknown function [Hyphomicrobium sp. MC1]
Length = 1588
Score = 42.7 bits (99), Expect = 0.070, Method: Composition-based stats.
Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 7/89 (7%)
Query: 83 SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
+CSS A A + + F + S F ADL+ A +E R A F++AD+R+ D
Sbjct: 876 NCSSGDCANAKMKGWN------FSVISQTDFSGADLKGAEFPRET-RGAKFSNADLRDVD 928
Query: 143 FSGSKFNGAYLEKAVAYKANFTVDEICLP 171
SG +F A +ANF E+ P
Sbjct: 929 ISGKQFQSCSFIGANLREANFGSSEVAGP 957
Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 27/109 (24%)
Query: 52 DCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLN--KYEAETRGEFGIGS 109
+CS+ CA AK+K W V + ++ S ADL ++ ETRG
Sbjct: 876 NCSSGDCAN--AKMKGWNFSVIS----------QTDFSG-ADLKGAEFPRETRG------ 916
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYL 153
A+F +ADLR + F+ +F A++RE++F +G F+G++L
Sbjct: 917 -AKFSNADLRDVDISGKQFQSCSFIGANLREANFGSSEVAGPNFSGSFL 964
>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
Length = 831
Score = 42.7 bits (99), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 60/146 (41%), Gaps = 23/146 (15%)
Query: 52 DCSNNQCAGPYAKLKNWRV----FVSTALAAAVVASCSSNISALADLNKYEAE---TRGE 104
D SN + AG A+L N + F L+ A + ++ AD+ +A R
Sbjct: 522 DLSNEKLAG--ARLNNLDLRGAKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAA 579
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ A +A+LR A ++ NF+ + T AD SD G+ F GA L+ A +A F
Sbjct: 580 DAVFRGAILANANLRAATFLRTNFQNVDLTGADFAFSDLRGADFTGATLKNASFSQAKFD 639
Query: 165 VDEICLPLLVSLPMATPVFPAGFCAP 190
D FP G AP
Sbjct: 640 AD--------------TKFPKGLTAP 651
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
F +A+L A V + R NFT+AD+R+++F G+ GA L A A+FT
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFT 317
>gi|431802241|ref|YP_007229144.1| pentapeptide repeat-containing protein [Pseudomonas putida HB3267]
gi|430793006|gb|AGA73201.1| pentapeptide repeat-containing protein [Pseudomonas putida HB3267]
Length = 219
Score = 42.7 bits (99), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q ADLR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93
>gi|78211810|ref|YP_380589.1| hypothetical protein Syncc9605_0258 [Synechococcus sp. CC9605]
gi|78196269|gb|ABB34034.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 147
Score = 42.7 bits (99), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
AD R+A + +FR ++ AD+RE++ G+ GA LE A AN T E+
Sbjct: 48 QADFRQAHLIGADFRGSDLRGADLREANLEGADLTGALLEGADLRGANLTNAEL 101
>gi|339487133|ref|YP_004701661.1| pentapeptide repeat-containing protein [Pseudomonas putida S16]
gi|338837976|gb|AEJ12781.1| pentapeptide repeat-containing protein [Pseudomonas putida S16]
Length = 219
Score = 42.7 bits (99), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q ADLR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93
>gi|421082377|ref|ZP_15543263.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
gi|401702907|gb|EJS93144.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
Length = 846
Score = 42.7 bits (99), Expect = 0.087, Method: Composition-based stats.
Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 8/116 (6%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQ 112
+ C+ + R +T L +AV + S N + ++ R IG+ A+
Sbjct: 691 DSCSWVETQANEARFVGATWLTSAVASGSSMNGADFTQATLRQSNLRQASLIGAVFARAK 750
Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
++DL +A + NF+RAN F D RE++F+ + GA L+K+ ANF
Sbjct: 751 LENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLMGALLQKSQLSGANF 806
>gi|157372424|ref|YP_001480413.1| pentapeptide repeat-containing protein [Serratia proteamaculans
568]
gi|157324188|gb|ABV43285.1| pentapeptide repeat protein [Serratia proteamaculans 568]
Length = 844
Score = 42.7 bits (99), Expect = 0.087, Method: Composition-based stats.
Identities = 33/141 (23%), Positives = 66/141 (46%), Gaps = 9/141 (6%)
Query: 30 LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
L K ++ C++ + + C+ + PYA+ K + A+ + ++ + +
Sbjct: 668 LRKTVFQQCELQAAVFNGAWLESCNWVESKLPYAQFKAASLLTCAAVMESDLSGADFSEA 727
Query: 90 ALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDF 143
L + N +A T+ F + A+ ++DL +A + NF RAN + D R+ +F
Sbjct: 728 TLKESNLRQALLTQANFTL---AKVENSDLSEADCQRANFTRANLVGSLLIRTDFRQVNF 784
Query: 144 SGSKFNGAYLEKAVAYKANFT 164
+G+ GA ++K A+FT
Sbjct: 785 TGANLMGALMQKTQLGGADFT 805
>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
Length = 309
Score = 42.7 bits (99), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 43/96 (44%), Gaps = 13/96 (13%)
Query: 89 SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFR-----RANFTSADMRE 140
+AL N A+ RG G S A ADLR + V + R +AN T AD+
Sbjct: 45 AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVSLRKANLTGADLTR 104
Query: 141 -----SDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
+D S + GA L +A+ AN T+ ++ L
Sbjct: 105 ANLANADLSEANLTGAQLSEAIVRDANLTLTDLTLA 140
>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
17734]
Length = 367
Score = 42.7 bits (99), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 36/67 (53%), Gaps = 3/67 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---V 165
S A ADL +A N RRA+ + A++R +D SG+ A L +A +AN + +
Sbjct: 233 SGANLSEADLSRADLSGANLRRADLSGANLRRADLSGANLRRADLSEANLSEANLSGADL 292
Query: 166 DEICLPL 172
D CLPL
Sbjct: 293 DFSCLPL 299
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 32/56 (57%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL +A N RRAN + A++ E+D SG+ +GA L +A +A+ +
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLS 208
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL +A N RA+ + A++ E+D SG+ +GA L +A +A+ +
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLS 248
>gi|421528695|ref|ZP_15975254.1| pentapeptide repeat-containing protein [Pseudomonas putida S11]
gi|402213838|gb|EJT85176.1| pentapeptide repeat-containing protein [Pseudomonas putida S11]
Length = 200
Score = 42.4 bits (98), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q ADLR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93
>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
Length = 436
Score = 42.4 bits (98), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
S A A+LR+A +K N RRAN A + E+D SG+ A L KA+ +A
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEA 376
>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 377
Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 5/60 (8%)
Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
A A+L A+ VK + + RAN T AD+RE+D SG++ A L KA KAN ++
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSL 199
>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
Length = 351
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 31/56 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL ++V NF AN T A++ ++ +G+ NGA L +A +AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLT 245
>gi|406706438|ref|YP_006756791.1| pentapeptide repeat-containing protein [alpha proteobacterium
HIMB5]
gi|406652214|gb|AFS47614.1| pentapeptide repeat protein [alpha proteobacterium HIMB5]
Length = 174
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 35/64 (54%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
FG + F A+L ++V + NF + NFT A++ ++DF GS A + A +ANFT
Sbjct: 80 FGTFPESTFYRANLYESVMIGANFEKTNFTGANLTKADFMGSTLIEANFQNANLMEANFT 139
Query: 165 VDEI 168
I
Sbjct: 140 SANI 143
>gi|325272495|ref|ZP_08138874.1| pentapeptide repeat-containing protein [Pseudomonas sp. TJI-51]
gi|324102372|gb|EGB99839.1| pentapeptide repeat-containing protein [Pseudomonas sp. TJI-51]
Length = 219
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q ADLR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93
>gi|119485597|ref|ZP_01619872.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
gi|119456922|gb|EAW38049.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
Length = 253
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 11/88 (12%)
Query: 92 ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRE 140
ADL K + + F + S A F +ADLR+ K N ANFT A D+R
Sbjct: 126 ADLRKADLQDANLFKVNFSEAYLSEANFENADLRQVTFFKANLADANFTDANLFGSDLRL 185
Query: 141 SDFSGSKFNGAYLEKAVAYKANFTVDEI 168
++ G+ F+ A L+ A+ N E+
Sbjct: 186 ANLKGADFSNANLQAAILVNTNIAQAEL 213
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 8/82 (9%)
Query: 91 LADLNKYEAETRGEFGIG--------SAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
LAD N YEA R G S A ADLRKA N + NF+ A + E++
Sbjct: 93 LADANLYEANLRYANLQGADLRQADLSRASLTRADLRKADLQDANLFKVNFSEAYLSEAN 152
Query: 143 FSGSKFNGAYLEKAVAYKANFT 164
F + KA ANFT
Sbjct: 153 FENADLRQVTFFKANLADANFT 174
>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 164
Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 10/100 (10%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
+K+WRVF LA V+ + L+ + +R Q +AD +
Sbjct: 1 MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+E F + A+ +D G+ FN AYLEKA + A+FT
Sbjct: 51 REEFTKVKLDKANFSNADLRGAVFNNAYLEKANLHGADFT 90
>gi|390438037|ref|ZP_10226537.1| hypothetical protein MICAI_1320003 [Microcystis sp. T1-4]
gi|389838536|emb|CCI30661.1| hypothetical protein MICAI_1320003 [Microcystis sp. T1-4]
Length = 260
Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 40/86 (46%), Gaps = 10/86 (11%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF------- 163
A ADL +A N R AN SA + E++ S GA L+ A Y+AN
Sbjct: 165 ANLAGADLFRANLRGANLRGANLHSAGLVEANLQSSDLAGAKLQMATLYRANLQDAKYTD 224
Query: 164 --TVDEICLPLLVSLPMATPVFPAGF 187
T ++C L +S P +T FP GF
Sbjct: 225 ASTSPKLCESLSLSYPCSTG-FPEGF 249
>gi|162450958|ref|YP_001613325.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
gi|161161540|emb|CAN92845.1| Hypothetical WD-repeat protein [Sorangium cellulosum So ce56]
Length = 2305
Score = 42.0 bits (97), Expect = 0.15, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 13/84 (15%)
Query: 97 YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDF---------- 143
+ ET G G+ Q DLR A N R AN + AD+ +D
Sbjct: 1111 WAEETAGWISEGADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAML 1170
Query: 144 SGSKFNGAYLEKAVAYKANFTVDE 167
SG+K +G L +A+A++A+FT E
Sbjct: 1171 SGAKLHGTILRRAIAHRADFTQAE 1194
>gi|260436217|ref|ZP_05790187.1| pentapeptide repeat protein [Synechococcus sp. WH 8109]
gi|260414091|gb|EEX07387.1| pentapeptide repeat protein [Synechococcus sp. WH 8109]
Length = 147
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 29/54 (53%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
AD R+A + +FR + AD+RE++ G+ GA LE A AN T E+
Sbjct: 48 QADFRQAHLIGADFRGTDLRGADLREANLEGADLTGALLEGADLRGANLTNAEL 101
>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
WPP163]
Length = 846
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 8/116 (6%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQ 112
+ C+ + R +T L +AV + S N + ++ R IG+ A+
Sbjct: 691 DSCSWVETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAVFALAK 750
Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
++DL +A + NF+RAN F D RE++F+ + GA L+K+ ANF
Sbjct: 751 LENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLGGANF 806
>gi|257061367|ref|YP_003139255.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
gi|256591533|gb|ACV02420.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
Length = 371
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANF 163
E+ FSG+ +GAYL A KA+F
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADF 306
>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
Length = 567
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 30/53 (56%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A A+L KAV V N RR N + A++ ++ + F+GAYL +A +AN
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANL 470
>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
Length = 371
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 80 VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
+ A+ + N++ L L + T G AA+ + +L A + NFR AN T A++
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277
Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANF 163
E+ FSG+ +GAYL A KA+F
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADF 306
>gi|334137987|ref|ZP_08511411.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
gi|333604520|gb|EGL15910.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
Length = 242
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%)
Query: 49 QFPDCSNNQCAGPYAKLKNWRVFVSTALAAAV--VASCSSNISALADLNKYEAETRGEFG 106
DC ++ A++K+ + +ST + V C+ N+S + L K G
Sbjct: 89 DIADCVLSEATLRNAQMKDAEIKISTCIETCFDEVELCNGNLSG-STLIKATFRQANLHG 147
Query: 107 I-GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I S A F +DLR A V +F ++F SA++ E D S + F G L A+ NFT
Sbjct: 148 ISASKAYFDESDLRGANLVNGDFEESDFISANLSEVDASYANFTGGNLTGAILCNGNFT 206
>gi|119486371|ref|ZP_01620430.1| hypothetical protein L8106_16994 [Lyngbya sp. PCC 8106]
gi|119456584|gb|EAW37714.1| hypothetical protein L8106_16994 [Lyngbya sp. PCC 8106]
Length = 772
Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 3/103 (2%)
Query: 62 YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
+A LKN + + +AA++ ++ S+ + L+ N A +G G A A+LR A
Sbjct: 532 HANLKNANLSTANLMAASLNSANLSD-ANLSHANLECANLKGANLTG--ANLSYANLRGA 588
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
N R AN + AD+R + S + + AYL A Y+AN +
Sbjct: 589 NLSGVNLRDANLSYADLRRVNLSQANLDSAYLRGANLYRANIS 631
>gi|427415571|ref|ZP_18905754.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
gi|425758284|gb|EKU99136.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
Length = 184
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 109 SAAQFGSADLRKA-VHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKA 156
S A G ADLRKA +H + + R A+ T A+++E+D S + +GAYL +A
Sbjct: 103 SGANLGGADLRKADLHKADLSDSDLRCADLTGANLQETDLSDANLDGAYLGEA 155
>gi|434398536|ref|YP_007132540.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428269633|gb|AFZ35574.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 284
Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 15/92 (16%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF----------SGSKFNGAYLEKAVA 158
S +A+L +A N AN T A++ +D +G+ GAYL ++
Sbjct: 49 SGVDLTNANLSQATLTNANLSGANLTGANLTGTDLRGINLTGANLTGANLEGAYLNRSDL 108
Query: 159 YKANFT---VDEICLPLLVSLPMATPVFPAGF 187
+ANFT +D + L VSL +FP GF
Sbjct: 109 RQANFTDAKLDNVKLQ--VSLYDQATIFPEGF 138
>gi|300864976|ref|ZP_07109808.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300337032|emb|CBN54958.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 279
Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 33/66 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ A ADLR+A+ + N AN T A++R ++ S S GA L A Y+A +
Sbjct: 183 NGANLSGADLRQAIAIGSNLSDANLTQANLRVANVSWSTLRGANLTGANLYRAKLNWSNL 242
Query: 169 CLPLLV 174
+LV
Sbjct: 243 SGAILV 248
>gi|345872411|ref|ZP_08824346.1| pentapeptide repeat protein [Thiorhodococcus drewsii AZ1]
gi|343918959|gb|EGV29716.1| pentapeptide repeat protein [Thiorhodococcus drewsii AZ1]
Length = 284
Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S AQ ADLR A N + AN + AD+R +DF GS + KA+ +AN ++
Sbjct: 103 SDAQLTGADLRCAEVRYANLKHANLSHADLRGTDFHGSDLSHMVAIKALLIRANLRETDL 162
Query: 169 C 169
C
Sbjct: 163 C 163
>gi|37521689|ref|NP_925066.1| hypothetical protein glr2120 [Gloeobacter violaceus PCC 7421]
gi|35212687|dbj|BAC90061.1| glr2120 [Gloeobacter violaceus PCC 7421]
Length = 278
Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 28/53 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A ADLR+A V N RRAN AD RESD + A L +A +KAN
Sbjct: 179 ANLEGADLREASFVSANLRRANLRRADCRESDLFDANLCEADLREAKLHKANL 231
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 5/45 (11%)
Query: 115 SADLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLE 154
ADLR+A K N R+A S AD+RE+D SG+ GA+LE
Sbjct: 218 EADLREAKLHKANLRQALLVSADLRGADLREADLSGANLQGAHLE 262
>gi|434394476|ref|YP_007129423.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
gi|428266317|gb|AFZ32263.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
Length = 183
Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A SADL +A N ++AN AD+ E+D G+ +GA L+ A +AN
Sbjct: 89 ANLQSADLDQANLRDANLQQANLRDADLEEADLQGANLSGANLQSADLEEANL 141
Score = 39.3 bits (90), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 29/55 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A SADL +A NF+ AN +AD+ ++ G+ F+GA L+ A N
Sbjct: 127 SGANLQSADLEEANLQNANFQNANLQNADLEDARVQGANFDGANLQGADLEGTNL 181
>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 471
Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 5/43 (11%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFT 164
+ R+AN + A M +D SG+ GA LE AVA+KANFT
Sbjct: 135 DLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFT 177
>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
Length = 1011
Score = 40.8 bits (94), Expect = 0.26, Method: Composition-based stats.
Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 5/55 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A +ADLR A N RAN + A++R ++ SG+ +G YL A +AN
Sbjct: 850 SGADLRTADLRSA-----NLIRANLSDANLRSANLSGANLSGVYLNSADLRRANL 899
>gi|434399306|ref|YP_007133310.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428270403|gb|AFZ36344.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 298
Score = 40.8 bits (94), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 35/71 (49%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVS 175
A+LR A + N R AN + AD++ ++ G+ F GA L KA ANF + + +
Sbjct: 224 ANLRDANLIGANLRGANLSQADLKGANLEGANFKGANLTKADLRGANFKGANLQDAIFKN 283
Query: 176 LPMATPVFPAG 186
+ + P G
Sbjct: 284 TKLQGTIMPDG 294
>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
Length = 439
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 15/69 (21%)
Query: 111 AQFGSADLRKAVHVKENFRRA---------------NFTSADMRESDFSGSKFNGAYLEK 155
A+ G DLRKA K +F RA NF ADM+E++ G+ GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344
Query: 156 AVAYKANFT 164
A A+ +
Sbjct: 345 AFLKGADLS 353
>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 332
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
Y A+ RG I ADLR A +K N R AN ++RE+D G+ +GA L A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201
Query: 157 VAYKANF 163
+ N
Sbjct: 202 FLTEVNL 208
>gi|354564859|ref|ZP_08984035.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549985|gb|EHC19424.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 166
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 30/56 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A +ADL KA N AN T+AD+ E++ +G+ GA ++A AN T
Sbjct: 89 SNANLTNADLEKANLSNANLSGANLTNADLEEANLTGANLRGANFQRADLEDANLT 144
>gi|150014700|gb|ABR57221.1| PedD [Pseudomonas putida]
Length = 219
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q A+LR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGANLRGAKLANQDLRKMNLAGADLRDADLRHARLDLANLEKARLQGANLT 93
>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 351
Score = 40.8 bits (94), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 30/56 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL ++V NF AN T A++ ++ +G+ NGA L A +AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLT 245
>gi|425440351|ref|ZP_18820656.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389719234|emb|CCH96913.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 333
Score = 40.8 bits (94), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 184
Score = 40.8 bits (94), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 30/53 (56%), Gaps = 5/53 (9%)
Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKA 156
S A G ADLRKA K + R A+ + A++RE+D S + +GAYL A
Sbjct: 103 SGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDGAYLGHA 155
>gi|330809494|ref|YP_004353956.1| hypothetical protein PSEBR_a2659 [Pseudomonas brassicacearum subsp.
brassicacearum NFM421]
gi|423697147|ref|ZP_17671637.1| pentapeptide repeat protein PedD [Pseudomonas fluorescens Q8r1-96]
gi|327377602|gb|AEA68952.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
brassicacearum NFM421]
gi|388004031|gb|EIK65358.1| pentapeptide repeat protein PedD [Pseudomonas fluorescens Q8r1-96]
Length = 214
Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 33/58 (56%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I ++Q A+LR A ++ R+ N + AD+R++D ++ + A LEKA AN T
Sbjct: 31 IAESSQCPGANLRGAKLANQDLRKMNLSGADLRDADLRHARLDLANLEKAQLQGANLT 88
>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 331
Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 14/101 (13%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 130 ------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
N AD+R +D G GA L +A AN T
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLT 180
>gi|166364712|ref|YP_001656985.1| hypothetical protein MAE_19710 [Microcystis aeruginosa NIES-843]
gi|425466893|ref|ZP_18846187.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|166087085|dbj|BAG01793.1| hypothetical protein MAE_19710 [Microcystis aeruginosa NIES-843]
gi|389830484|emb|CCI27530.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 333
Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|90019736|ref|YP_525563.1| hypothetical protein Sde_0087 [Saccharophagus degradans 2-40]
gi|89949336|gb|ABD79351.1| pentapeptide repeat [Saccharophagus degradans 2-40]
Length = 600
Score = 40.8 bits (94), Expect = 0.33, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 25/45 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
S A ADLR A NF+ A D+R++D SG +F+GA L
Sbjct: 197 SGANLRRADLRDAKFCSTNFKNAELNGVDLRKADLSGLEFDGADL 241
>gi|403357343|gb|EJY78297.1| hypothetical protein OXYTRI_24550 [Oxytricha trifallax]
Length = 290
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 28/53 (52%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
A F D KAV NF +A ++ADMRE DF S FN A L A +AN
Sbjct: 199 GANFMHVDFVKAVGKDCNFLKAKLSNADMREGDFENSNFNEASLHGANLERAN 251
>gi|425435715|ref|ZP_18816162.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|425462172|ref|ZP_18841646.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|440755045|ref|ZP_20934247.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
gi|389679721|emb|CCH91528.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389824858|emb|CCI25881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|440175251|gb|ELP54620.1| pentapeptide repeats family protein [Microcystis aeruginosa
TAIHU98]
Length = 333
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|46202237|ref|ZP_00053526.2| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
magnetotacticum MS-1]
Length = 542
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A+LRKAV N R N A + ++D SG+K GA L A +ANF+
Sbjct: 54 ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFS 102
>gi|425444319|ref|ZP_18824373.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|425455654|ref|ZP_18835369.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389730303|emb|CCI05384.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389803421|emb|CCI17652.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 333
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|390441606|ref|ZP_10229649.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389835072|emb|CCI33775.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 333
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|254414225|ref|ZP_05027992.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196178900|gb|EDX73897.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 963
Score = 40.4 bits (93), Expect = 0.35, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 30/54 (55%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A A L+ A + N +RAN A++ E++F G+ F GA LE A ++AN
Sbjct: 890 GANLEGAHLKGANLKRANLKRANLKRANLFEANFEGANFEGATLEWANLFEANL 943
>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 418
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 35/60 (58%), Gaps = 5/60 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
S A F +DL A+ ++ + RRAN + A++ E +D SG F+G+ L +A +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 95 NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N E + G IG S A F ADLR+A N ANF +A+++E+D SG+ GA
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRA-----NLVGANFNNANLKEADLSGAYLIGA 275
Query: 152 YLEKAVAYKANF 163
L A +A+F
Sbjct: 276 TLVNANIVRADF 287
>gi|443315235|ref|ZP_21044737.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442785176|gb|ELR95014.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 402
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 12/67 (17%)
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFN 149
N + A+ RG A S DLR+A+ + N R+ANF A+MR E+D G+
Sbjct: 318 NLHRADLRG-------ANLESTDLREAILRQANLRQANFRYANMRMAHLAEADLRGADLR 370
Query: 150 GAYLEKA 156
GA L A
Sbjct: 371 GADLTHA 377
>gi|452962545|gb|EME67671.1| hypothetical protein H261_22313 [Magnetospirillum sp. SO-1]
Length = 542
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A+LRKAV N R N A + ++D SG+K GA L A +ANF+
Sbjct: 54 ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFS 102
>gi|443662162|ref|ZP_21132897.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159030702|emb|CAO88375.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443332138|gb|ELS46762.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 333
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|422302321|ref|ZP_16389684.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389788496|emb|CCI15816.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 333
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|378950893|ref|YP_005208381.1| protein PedD [Pseudomonas fluorescens F113]
gi|359760907|gb|AEV62986.1| PedD [Pseudomonas fluorescens F113]
Length = 214
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 33/58 (56%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I ++Q A+LR A ++ R+ N + AD+R++D ++ + A LEKA AN T
Sbjct: 31 IAESSQCPGANLRGAKLANQDLRKMNLSGADLRDADLRHARLDLANLEKAQLQGANLT 88
>gi|254413444|ref|ZP_05027214.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179551|gb|EDX74545.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 768
Score = 40.4 bits (93), Expect = 0.37, Method: Composition-based stats.
Identities = 23/80 (28%), Positives = 41/80 (51%)
Query: 89 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
SA+++L Y + + + AD+R + + R AN +SAD+ E++ S +K
Sbjct: 414 SAVSELEAYRIFCQLKGANLRGSDLKGADIRNSDLSAADLREANLSSADLSEANLSLAKL 473
Query: 149 NGAYLEKAVAYKANFTVDEI 168
GA L A+ A+ TV ++
Sbjct: 474 GGANLSSAILLGADLTVTDL 493
>gi|83310097|ref|YP_420361.1| hypothetical protein amb0998 [Magnetospirillum magneticum AMB-1]
gi|82944938|dbj|BAE49802.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
AMB-1]
Length = 542
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A+LRKAV N R N A + ++D SG+K GA L A +ANF+
Sbjct: 54 ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFS 102
>gi|332711030|ref|ZP_08430965.1| hypothetical cyclic nucleotide-binding domain protein [Moorea
producens 3L]
gi|332350156|gb|EGJ29761.1| hypothetical cyclic nucleotide-binding domain protein [Moorea
producens 3L]
Length = 328
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 7/73 (9%)
Query: 86 SNISALADLNKYEAETRGEFG--IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
S ++ +A+LN YE R + +ADLR NFR AN T AD R ++
Sbjct: 34 SQLAEIAELNLYEDLARVDLSGVNLENVNLNNADLRGT-----NFRNANLTGADFRNANL 88
Query: 144 SGSKFNGAYLEKA 156
+G+ FN A L+ A
Sbjct: 89 TGADFNDAILDNA 101
>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 517
Score = 40.4 bits (93), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYLEKA 156
AD+RE+ + FNGA L A
Sbjct: 139 ADLRETKLQQTNFNGANLSGA 159
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 34/64 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN + +
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 171 PLLV 174
+LV
Sbjct: 249 AVLV 252
>gi|359460720|ref|ZP_09249283.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 294
Score = 40.4 bits (93), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 12/65 (18%)
Query: 111 AQFGSADLRKAVHVKE------NFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAY 159
A+F ADLR+ V++++ NF RAN T AD+RE+DF+ + A L +A
Sbjct: 173 ARFQDADLRR-VNLQQAFVKSANFARANLVGADLTKADLRETDFTRANLTQAVLTQAKLR 231
Query: 160 KANFT 164
ANF+
Sbjct: 232 DANFS 236
>gi|425453004|ref|ZP_18832819.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389764929|emb|CCI09042.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 333
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|354564871|ref|ZP_08984047.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
gi|353549997|gb|EHC19436.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
Length = 105
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL KA N AN T+AD+ E++ +G+ GA ++A AN T
Sbjct: 28 SNANLTGADLEKANLSNANLSGANLTNADLEEANLTGANLKGANFQRADLEDANLT 83
>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 517
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ LA ++ EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYLEKA 156
AD+RE+ + FNGA L A
Sbjct: 139 ADLRETKLQQTNFNGANLSGA 159
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 34/64 (53%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
A F +A+LR+A N A+F+ A+MR D G+ +GA L +A AN + +
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248
Query: 171 PLLV 174
+LV
Sbjct: 249 AVLV 252
>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 182
Score = 40.4 bits (93), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 5/63 (7%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKA 161
G A+F ADL A+ + F A+F AD M + D SG+ GA L A+A +
Sbjct: 76 TGRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRGAIASGS 135
Query: 162 NFT 164
NFT
Sbjct: 136 NFT 138
>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 331
Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 14/101 (13%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
F T L AA + + ++ L D N +A+ RG A ADLR A N R
Sbjct: 87 FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139
Query: 130 ------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
N AD+R +D G GA L +A AN T
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLT 180
>gi|407781463|ref|ZP_11128681.1| pentapeptide repeat-containing protein [Oceanibaculum indicum P24]
gi|407207680|gb|EKE77611.1| pentapeptide repeat-containing protein [Oceanibaculum indicum P24]
Length = 443
Score = 40.0 bits (92), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 38/75 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S A +ADLR A NFR A T ++ + +G+ F+GA L A + ANF ++
Sbjct: 171 SEANLSNADLRNADLRMSNFRNAIMTGVNLIGVNAAGADFHGAVLTNARIHDANFDGVDL 230
Query: 169 CLPLLVSLPMATPVF 183
+L + +PVF
Sbjct: 231 TGAILDLTHLTSPVF 245
>gi|344339023|ref|ZP_08769953.1| pentapeptide repeat protein [Thiocapsa marina 5811]
gi|343800943|gb|EGV18887.1| pentapeptide repeat protein [Thiocapsa marina 5811]
Length = 284
Score = 40.0 bits (92), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 30/61 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S A ADLR A + + R AN ADMR++DF GS KA+ +AN +
Sbjct: 103 SKANLERADLRHADVRRADLRGANLAHADMRDTDFQGSDLCHVVAPKALFIRANLREANL 162
Query: 169 C 169
C
Sbjct: 163 C 163
Score = 36.2 bits (82), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 19/126 (15%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETR----GEFGIGSAAQFGSADLR----KAVHVKE- 126
L A + C N + LA + +EA+ G F + + A F ADLR ++V +E
Sbjct: 162 LCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADLRGSKLRSVSAQET 221
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAG 186
NFR AN T D+ + + GA L +A A+F+ E L S+ M F
Sbjct: 222 NFRNANLTDVDL-----TNAVLGGAILRRADVTNADFSGVE-----LASVTMEFANFSKA 271
Query: 187 FCAPFP 192
A +P
Sbjct: 272 RNAVYP 277
>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 267
Score = 40.0 bits (92), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S A +A+ +KA + +N T AD+ ++D +G + A L +A + NFT ++
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTGGDL 191
Query: 169 CLPLLVSLPMATPVFPA 185
+LV +A A
Sbjct: 192 SRVMLVGANLAETNLTA 208
>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 179
Score = 40.0 bits (92), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 28/56 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ A ADL K N + AN +AD+ E++ + GA L++A KAN T
Sbjct: 97 AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASLQRADLEKANLT 152
>gi|313204014|ref|YP_004042671.1| pentapeptide repeat-containing protein [Paludibacter
propionicigenes WB4]
gi|312443330|gb|ADQ79686.1| pentapeptide repeat protein [Paludibacter propionicigenes WB4]
Length = 186
Score = 40.0 bits (92), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 45/129 (34%), Gaps = 6/129 (4%)
Query: 35 WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
++ C S D F DC+ C A LKN TAL+ C +
Sbjct: 27 FLNCNFYSSNLVDVSFRDCTFESCDFSLASLKN------TALSDIQFIGCKLVGVQFDEC 80
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
N + R E + A F L+K + N +FT ADM N A
Sbjct: 81 NPFLFSVRFENCVLKLAVFQKVKLKKTRFINCNLEETDFTEADMSSGVLDNCNLNRAIFH 140
Query: 155 KAVAYKANF 163
K KA+F
Sbjct: 141 KTNLEKADF 149
>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
Length = 427
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 42/94 (44%), Gaps = 2/94 (2%)
Query: 94 LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
LN Y R + G + AQ DLR+A+ +FR A F A++ E+ +GS+ A
Sbjct: 23 LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82
Query: 152 YLEKAVAYKANFTVDEICLPLLVSLPMATPVFPA 185
L A K +F ++ L S + F A
Sbjct: 83 DLSGAKLVKTDFRGADLEQAKLTSSDITDADFRA 116
>gi|348176753|ref|ZP_08883647.1| pentapeptide repeat-containing protein [Saccharopolyspora spinosa
NRRL 18395]
Length = 198
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 7/101 (6%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
F T L + + CS S+ AD + A T E + + G ADLR FR
Sbjct: 71 FERTVLGKSTLDGCSLLGSSFADC-RLRAWTLRETDL-TLVGMGKADLRGLDLRGIRFRE 128
Query: 131 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTVD 166
AN T D+R E+DF+G++ GA LE+A ++ D
Sbjct: 129 ANLTECDLRRCDLREADFTGARLLGARLEEADLRESRIDAD 169
>gi|271962831|ref|YP_003337027.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270506006|gb|ACZ84284.1| Uncharacterized low-complexity protein-like protein
[Streptosporangium roseum DSM 43021]
Length = 412
Score = 40.0 bits (92), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 41/87 (47%), Gaps = 7/87 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-VDEIC 169
A F A LR+A + N R A F AD+ E++ + ++ +GA A+ A+F D+
Sbjct: 177 ADFTRAKLREAKLGQANLRNATFKDADLSEAELTQAELDGAVFTGALVEGASFVQADDAD 236
Query: 170 L------PLLVSLPMATPVFPAGFCAP 190
L P +SLP + P G P
Sbjct: 237 LAGAKGTPKGLSLPTTDLLIPDGIFTP 263
>gi|397695427|ref|YP_006533310.1| pentapeptide repeat-containing protein [Pseudomonas putida DOT-T1E]
gi|421520705|ref|ZP_15967367.1| pentapeptide repeat-containing protein [Pseudomonas putida LS46]
gi|298682200|gb|ADI95267.1| PedD [Pseudomonas putida DOT-T1E]
gi|397332157|gb|AFO48516.1| pentapeptide repeat-containing protein [Pseudomonas putida DOT-T1E]
gi|402755315|gb|EJX15787.1| pentapeptide repeat-containing protein [Pseudomonas putida LS46]
Length = 219
Score = 40.0 bits (92), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q A+LR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93
>gi|425470595|ref|ZP_18849461.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389883733|emb|CCI35905.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 333
Score = 40.0 bits (92), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADLR A N AN T A++ SDF G+ GA L K A KANF
Sbjct: 199 SYADLRGADLRGADLRCANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253
>gi|148548300|ref|YP_001268402.1| pentapeptide repeat-containing protein [Pseudomonas putida F1]
gi|395448857|ref|YP_006389110.1| pentapeptide repeat-containing protein [Pseudomonas putida ND6]
gi|148512358|gb|ABQ79218.1| pentapeptide repeat protein [Pseudomonas putida F1]
gi|388562854|gb|AFK71995.1| pentapeptide repeat-containing protein [Pseudomonas putida ND6]
Length = 219
Score = 39.7 bits (91), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q A+LR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93
>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
Length = 221
Score = 39.7 bits (91), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 27/53 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A ADLR + + R AN T AD+R +D G+ GA L +A +AN
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANL 163
>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 331
Score = 39.7 bits (91), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 5/56 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ++L KA ++ NF RAN T A + ++D S GA L A+ K N T
Sbjct: 65 SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLS-----GAILSSAIGTKTNLT 115
>gi|126661305|ref|ZP_01732374.1| hypothetical protein CY0110_08576 [Cyanothece sp. CCY0110]
gi|126617401|gb|EAZ88201.1| hypothetical protein CY0110_08576 [Cyanothece sp. CCY0110]
Length = 368
Score = 39.7 bits (91), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 5/70 (7%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
+ +L A + NFR AN T AD+ E + FSG+ +GAYL A KA+F
Sbjct: 246 GTELSGVELNGANLTQSNFRGANLTDADLSEAILSYTRFSGADLSGAYLGNANLQKADFY 305
Query: 165 VDEICLPLLV 174
+ L L+
Sbjct: 306 RSSLALANLI 315
>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 328
Score = 39.7 bits (91), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 30/53 (56%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
G A L+KA V N AN + AD+ E+D ++ +GA L+ A + AN T+
Sbjct: 52 LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTL 104
>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
Length = 167
Score = 39.7 bits (91), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 5/62 (8%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKA 161
+G A F ADL A+ + F A+F+ AD+ +S DFSG+ A L +A +
Sbjct: 62 VGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGVIASGS 121
Query: 162 NF 163
+F
Sbjct: 122 SF 123
>gi|347735787|ref|ZP_08868588.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
gi|346920906|gb|EGY01818.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
Length = 451
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 7/70 (10%)
Query: 117 DLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
DLR A+ VK + ++ T AD+ E++ SG+K +GA L +A+ AN + +
Sbjct: 178 DLRGAIFVKADLSGSDLTGCNLEGADLSEANLSGTKLDGAVLTRALLRSANLSKASLLGA 237
Query: 172 LL--VSLPMA 179
LL V L MA
Sbjct: 238 LLDDVDLSMA 247
>gi|320156222|ref|YP_004188601.1| hypothetical protein VVMO6_01376 [Vibrio vulnificus MO6-24/O]
gi|319931534|gb|ADV86398.1| hypothetical protein VVMO6_01376 [Vibrio vulnificus MO6-24/O]
Length = 689
Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 28/60 (46%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S A SAD + ++ V NF +A+ T AD DF+ + GA L +A T I
Sbjct: 607 SKASLDSADFKSSIFVNANFEKADLTQADFGGCDFTNANLQGAELSGCDLTQARLTSSNI 666
>gi|434388230|ref|YP_007098841.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
gi|428019220|gb|AFY95314.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
Length = 193
Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 26/57 (45%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
G A ADLR A N + N AD+R +D +G GA L +A AN T
Sbjct: 97 GDRASLHKADLRLASLQGANLSQVNLVGADLRYADLTGVNLTGANLSRANLTGANLT 153
>gi|26989392|ref|NP_744817.1| pentapeptide repeat-containing protein [Pseudomonas putida KT2440]
gi|24984254|gb|AAN68281.1|AE016462_7 pentapeptide repeat family protein [Pseudomonas putida KT2440]
Length = 219
Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q A+LR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93
>gi|386012542|ref|YP_005930819.1| Pentapeptide repeat-containing protein [Pseudomonas putida BIRD-1]
gi|313499248|gb|ADR60614.1| Pentapeptide repeat-containing protein [Pseudomonas putida BIRD-1]
Length = 219
Score = 39.7 bits (91), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q A+LR A ++ R+ N AD+R++D ++ + A LEKA AN T
Sbjct: 36 IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93
>gi|156081718|ref|XP_001608352.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148800923|gb|EDL42328.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1301
Score = 39.7 bits (91), Expect = 0.69, Method: Composition-based stats.
Identities = 19/66 (28%), Positives = 34/66 (51%)
Query: 97 YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
Y G +G+ A AD+ +A ++ +F RA+F AD +D + + FN A + +A
Sbjct: 33 YREALPGRAALGTEADLSRADVSRADAIRADFNRADFNRADFNRADVNRADFNRADVSRA 92
Query: 157 VAYKAN 162
+A+
Sbjct: 93 NFNRAD 98
>gi|434398137|ref|YP_007132141.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428269234|gb|AFZ35175.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 223
Score = 39.7 bits (91), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 4/72 (5%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
ADL + + + G A ADL K N RAN AD+ +++ +G+ GA
Sbjct: 129 ADLERADLQKTNLIG----ANLQGADLGKTNIAGANLERANLFDADLEKANLAGTNLAGA 184
Query: 152 YLEKAVAYKANF 163
L+KA K N
Sbjct: 185 NLQKADLEKTNL 196
>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
Length = 256
Score = 39.7 bits (91), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 28/54 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A ADLR A N RAN T AD+R ++ +G+ G L +A +AN T
Sbjct: 51 ADLSGADLRGANLEGANLSRANLTGADLRSANLAGASLFGVNLSRAKLNEANLT 104
>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 204
Score = 39.7 bits (91), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
F A+L+KA ++ N R A+FT AD+R +DF + GA L A +A+F
Sbjct: 53 NFAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASF 103
>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 266
Score = 39.7 bits (91), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 28/54 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A ADLR+A ++ N + + AD+R ++ G GA L KA +AN +
Sbjct: 153 ADLNDADLREAQLIRANLSEVDLSGADLRAANLKGVNLRGADLNKADLSRANLS 206
>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
2638]
gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
Length = 1277
Score = 39.7 bits (91), Expect = 0.73, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 31/58 (53%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
IG +A F A LR+A + F +A F +D+ E++ + + F GA KAV NF
Sbjct: 1004 AIGMSADFSKASLRRADLSRGLFNKALFVESDLSEANGAQAIFKGAQFPKAVLRDTNF 1061
>gi|425452313|ref|ZP_18832131.1| Genome sequencing data, contig C306 [Microcystis aeruginosa PCC
7941]
gi|389765978|emb|CCI08285.1| Genome sequencing data, contig C306 [Microcystis aeruginosa PCC
7941]
Length = 188
Score = 39.7 bits (91), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 27/42 (64%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
+Q +LR A + N RRANFT AD+ +++F+G++ + Y
Sbjct: 128 SQLMDVNLRGASLINANIRRANFTGADVTDTNFTGAQCSDGY 169
>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
Length = 229
Score = 39.7 bits (91), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 30/56 (53%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ A A+LR A NF RA+ SAD+R +D G+ F GA LE AV + T
Sbjct: 63 TEANLKGANLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQHTDLT 118
>gi|144900552|emb|CAM77416.1| low-complexity proteins [Magnetospirillum gryphiswaldense MSR-1]
Length = 433
Score = 39.7 bits (91), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 3/84 (3%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRAN 132
L+ A++A+ S + L+D E+ G + + AAQ G A+L A + R AN
Sbjct: 300 LSGAILANASFREADLSDAFMAESRLDGADFRYAVLGAAQLGGANLGVAQLRHADMRLAN 359
Query: 133 FTSADMRESDFSGSKFNGAYLEKA 156
A +R +D SG++ +GA L A
Sbjct: 360 LEGAQLRGADLSGARLSGAKLSGA 383
>gi|443326309|ref|ZP_21054967.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
gi|442794049|gb|ELS03478.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
Length = 366
Score = 39.7 bits (91), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 5/93 (5%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-----R 130
L + + + IS LA+L K + +T G AA DL A K N R
Sbjct: 211 LIEQIYIAKTEQISELAELAKLDLKTDLAGGNLLAANLAGIDLNGANLQKTNLRGVILND 270
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A+ + ++R ++ G+ +GAYLE A AN
Sbjct: 271 ADLSETNLRHANLGGADLSGAYLENADLTHANL 303
>gi|428777412|ref|YP_007169199.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
gi|428691691|gb|AFZ44985.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
Length = 333
Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
S A +ADL KA N R AN A++ +D SG+ GAYL +A ++A ++D +
Sbjct: 198 SEANLFNADLSKANLKGANLRGANLIRANLERADLSGADLRGAYLNEAKMFEA--SLDNV 255
Query: 169 CLP 171
L
Sbjct: 256 NLS 258
>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 351
Score = 39.3 bits (90), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLT 245
>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 830
Score = 39.3 bits (90), Expect = 0.78, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 90 ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
ALADL + +R F S A+ ADLR+ + + +FR A+ AD RE+
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281
Query: 146 SKFNGAYLEKAVAYKANFT 164
+ F+GA L +A+ + T
Sbjct: 282 ANFSGADLSRAIFSGTDLT 300
>gi|16331545|ref|NP_442273.1| hypothetical protein slr0719 [Synechocystis sp. PCC 6803]
gi|383323287|ref|YP_005384141.1| hypothetical protein SYNGTI_2379 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326456|ref|YP_005387310.1| hypothetical protein SYNPCCP_2378 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492340|ref|YP_005410017.1| hypothetical protein SYNPCCN_2378 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437608|ref|YP_005652333.1| hypothetical protein SYNGTS_2380 [Synechocystis sp. PCC 6803]
gi|451815697|ref|YP_007452149.1| hypothetical protein MYO_124040 [Synechocystis sp. PCC 6803]
gi|1001199|dbj|BAA10343.1| slr0719 [Synechocystis sp. PCC 6803]
gi|339274641|dbj|BAK51128.1| hypothetical protein SYNGTS_2380 [Synechocystis sp. PCC 6803]
gi|359272607|dbj|BAL30126.1| hypothetical protein SYNGTI_2379 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275777|dbj|BAL33295.1| hypothetical protein SYNPCCN_2378 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278947|dbj|BAL36464.1| hypothetical protein SYNPCCP_2378 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407961067|dbj|BAM54307.1| hypothetical protein BEST7613_5376 [Bacillus subtilis BEST7613]
gi|451781666|gb|AGF52635.1| hypothetical protein MYO_124040 [Synechocystis sp. PCC 6803]
Length = 388
Score = 39.3 bits (90), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 5/70 (7%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFT 164
A S DL + NFR A T + D+R S+F G+ +GAYLE A + +F
Sbjct: 259 GANLNSIDLSNGQLMDSNFRGAILTDSDLSNTDLRRSNFRGADLSGAYLEGANLSQVDFR 318
Query: 165 VDEICLPLLV 174
+ L L+
Sbjct: 319 KSSLALATLI 328
>gi|162455067|ref|YP_001617434.1| hypothetical protein sce6785 [Sorangium cellulosum So ce56]
gi|161165649|emb|CAN96954.1| hypothetical protein sce6785 [Sorangium cellulosum So ce56]
Length = 973
Score = 39.3 bits (90), Expect = 0.82, Method: Composition-based stats.
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKAVHVKEN 127
F + A +A + ++LA + +A+ RG + A+ A+L +A+ + N
Sbjct: 854 FAGADFSGATLAGANLMGTSLAGTDLSDADLRGALLNEADLTEARLDRANLAEAMLTRAN 913
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
RA+ +AD+R+S + ++ GA EKA + A
Sbjct: 914 LTRASLYAADLRQSILNSARVEGASFEKASLFSA 947
>gi|116753519|ref|YP_842637.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
PT]
gi|116664970|gb|ABK13997.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
Length = 862
Score = 39.3 bits (90), Expect = 0.83, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 2/65 (3%)
Query: 101 TRGE-FGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
TR E FG S ADL KA ++ N A+ T A + ++DFSG+ GA + + V
Sbjct: 669 TRAELFGADLSGTDLSGADLVKAYALRANLSGADLTDAKLDDADFSGAILRGAKMPELVI 728
Query: 159 YKANF 163
NF
Sbjct: 729 RSVNF 733
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 26/52 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
A F A LR A + R NF AD+ ++D SG +F Y+ AV AN
Sbjct: 711 ADFSGAILRGAKMPELVIRSVNFGQADLSDADMSGCRFEALYVSNAVMRSAN 762
>gi|320353524|ref|YP_004194863.1| pentapeptide repeat-containing protein [Desulfobulbus propionicus
DSM 2032]
gi|320122026|gb|ADW17572.1| pentapeptide repeat protein [Desulfobulbus propionicus DSM 2032]
Length = 342
Score = 39.3 bits (90), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 44/100 (44%), Gaps = 3/100 (3%)
Query: 92 ADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
ADL + + E G + AQ ADL A K N R AN AD+ +D G+ G
Sbjct: 67 ADLRQSKLENANLEGANLTGAQLSLADLSGANLKKANLRNANLHGADLAYADLEGANLTG 126
Query: 151 AYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAGFCAP 190
A LE A+ +KA I LL + P PA +P
Sbjct: 127 ASLEGAI-FKATKMKGRIVNRLLHA-DQVRPETPAAPVSP 164
>gi|167034127|ref|YP_001669358.1| pentapeptide repeat-containing protein [Pseudomonas putida GB-1]
gi|166860615|gb|ABY99022.1| pentapeptide repeat protein [Pseudomonas putida GB-1]
Length = 219
Score = 39.3 bits (90), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 32/58 (55%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
I A+Q A+LR A ++ R+ N AD+R++D ++ + A LE+A AN T
Sbjct: 36 IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLERARLQGANLT 93
>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
Length = 275
Score = 39.3 bits (90), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 8/73 (10%)
Query: 99 AETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY--- 152
AE RG E G + DL++A+ NF+ ++F + +DFSGS F+GA
Sbjct: 50 AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109
Query: 153 --LEKAVAYKANF 163
LEKA KANF
Sbjct: 110 VDLEKANLNKANF 122
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 14/82 (17%)
Query: 92 ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD--------- 142
+D + EAE R +F S + F A++R K N +ANF AD+R+ D
Sbjct: 85 SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140
Query: 143 -FSGSKFNGAYLEKAVAYKANF 163
F G+ ++VA KA+F
Sbjct: 141 IFDGADMRNVLFTRSVANKASF 162
>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 479
Score = 39.3 bits (90), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL ++ N RA+ T A +RE++ G++F GA L++A KAN
Sbjct: 60 SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANL 114
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 5/53 (9%)
Query: 110 AAQFGSADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
A+F A+L++A +K N AN T A++ +D GS+ +GA L+KAV
Sbjct: 96 GAEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAV 148
>gi|302556667|ref|ZP_07309009.1| pentapeptide repeats-containing protein [Streptomyces griseoflavus
Tu4000]
gi|302474285|gb|EFL37378.1| pentapeptide repeats-containing protein [Streptomyces griseoflavus
Tu4000]
Length = 355
Score = 39.3 bits (90), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 40/93 (43%), Gaps = 16/93 (17%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF------- 163
A +A LR+A V + R A D+R++DF+G+ A L KA A+ A F
Sbjct: 226 ADLTTAVLRRARCVLADLRAAKLVETDLRDADFTGTDLREANLRKAGAHGAVFQRADLRM 285
Query: 164 ---------TVDEICLPLLVSLPMATPVFPAGF 187
T D + L +L +PAGF
Sbjct: 286 ADLRGTDLSTADLVAARLTGALASERTRWPAGF 318
>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris marina MBIC11017]
gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
marina MBIC11017]
Length = 532
Score = 39.3 bits (90), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 10/69 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
+F + DLR A+ + NF RANFT A++R +D + + GA L A
Sbjct: 428 GKFQNTDLRDAILINANFGRANFTGANLRNANLMQAYMSHADLANADLRGANLSDAYLSH 487
Query: 161 ANFTVDEIC 169
AN +C
Sbjct: 488 ANLRGANLC 496
>gi|193213002|ref|YP_001998955.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
8327]
gi|193086479|gb|ACF11755.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 193
Score = 39.3 bits (90), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 49/128 (38%), Gaps = 11/128 (8%)
Query: 35 WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
+V C ++ S F +CS QC AKL + T C +D
Sbjct: 34 FVQCNLAQADLSGFMFRECSFEQCDMGLAKL------IDTGFQEVKFIDCKLLGVQFSDC 87
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
K E + I + F + DL+ V F + AD E++ +GS+F+ L
Sbjct: 88 RKLLLEINFKRCILKLSVFTNLDLKNTV-----FDDCDMQEADFTEANLTGSRFDNCDLR 142
Query: 155 KAVAYKAN 162
A+ + N
Sbjct: 143 LAIFFHTN 150
>gi|354556796|ref|ZP_08976083.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|353551246|gb|EHC20655.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 253
Score = 39.3 bits (90), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 8/98 (8%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKE 126
+ ++ + V + N + L D N +A+ G + S A SA+LR A
Sbjct: 57 ILLNLRFTSKVTKKANLNYADLKDHNLSKADLSGADLNYANLSGANLTSANLRYA----- 111
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
N R A+ + AD+ E++F+ + +GA L A +AN T
Sbjct: 112 NLRGADLSGADLSETNFTYANLSGASLRYANLSRANLT 149
>gi|172055186|ref|YP_001806513.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|171701467|gb|ACB54447.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
Length = 280
Score = 39.3 bits (90), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 8/98 (8%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKE 126
+ ++ + V + N + L D N +A+ G + S A SA+LR A
Sbjct: 84 ILLNLRFTSKVTKKANLNYADLKDHNLSKADLSGADLNYANLSGANLTSANLRYA----- 138
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
N R A+ + AD+ E++F+ + +GA L A +AN T
Sbjct: 139 NLRGADLSGADLSETNFTYANLSGASLRYANLSRANLT 176
>gi|427714529|ref|YP_007063153.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
gi|427378658|gb|AFY62610.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
Length = 333
Score = 39.3 bits (90), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 32/59 (54%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
A G A+ R+A + R AN T AD+ ES + +GA LEKA+ A+ T+ ++
Sbjct: 53 ALLGRANFRRANLAGADLRGANLTQADLTESLLQEANLHGASLEKAILVGADITLADLT 111
>gi|443314265|ref|ZP_21043839.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
gi|442786137|gb|ELR95903.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
Length = 887
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 32/56 (57%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S ++F A L A ++ + R N ++ E++F ++F+GA L +VA KANF
Sbjct: 233 SESEFRGAKLAHAKFIRADLSRTNLIRTNLAEANFERARFHGANLNNSVAKKANFN 288
>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
[Acaryochloris sp. CCMEE 5410]
Length = 514
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 10/69 (14%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
+F + DLR A+ + NF RANFT A++R +D + + GA L A
Sbjct: 410 GKFQNTDLRDAILINANFGRANFTGANLRNANLMQAYMSHADLANADLRGANLSDAYLSH 469
Query: 161 ANFTVDEIC 169
AN +C
Sbjct: 470 ANLRGANLC 478
>gi|427718922|ref|YP_007066916.1| peptidase C14 caspase catalytic subunit p20 [Calothrix sp. PCC
7507]
gi|427351358|gb|AFY34082.1| peptidase C14 caspase catalytic subunit p20 [Calothrix sp. PCC
7507]
Length = 1102
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 28/56 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S+A G ADLR A R AN + AD+R +D G+ GA L A AN +
Sbjct: 839 SSANLGGADLRGADLSSAYLRGANLSYADLRGADLRGADLRGADLRGANLSSANLS 894
Score = 38.1 bits (87), Expect = 1.8, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 5/80 (6%)
Query: 85 SSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS 144
S+N+S ADL+ + + + G +A GSA+L +A + N RAN +SAD+ +++ S
Sbjct: 971 SANLSG-ADLSDADLSS-ADLG---SADLGSANLSRANLSRANLSRANLSSADLSDANLS 1025
Query: 145 GSKFNGAYLEKAVAYKANFT 164
+ + L A +AN +
Sbjct: 1026 SANLSSTDLSSADLRRANLS 1045
>gi|118593941|ref|ZP_01551297.1| PipB-like protein [Stappia aggregata IAM 12614]
gi|118433481|gb|EAV40152.1| PipB-like protein [Stappia aggregata IAM 12614]
Length = 162
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
AA A + ++ V F AN TS +M SDF+G+ F A + +A + NF
Sbjct: 7 DAANLTGASFKNSIGVNATFIEANLTSVEMNNSDFTGADFTKADMRHVIASETNF 61
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 30/62 (48%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVD 166
IG A F A+L +F A+FT ADMR S + F A + AVA ANF
Sbjct: 20 IGVNATFIEANLTSVEMNNSDFTGADFTKADMRHVIASETNFQEATFKDAVAINANFVAA 79
Query: 167 EI 168
+I
Sbjct: 80 DI 81
>gi|392384479|ref|YP_005033675.1| putative Pentapeptide repeat family protein [Azospirillum
brasilense Sp245]
gi|356881194|emb|CCD02176.1| putative Pentapeptide repeat family protein [Azospirillum
brasilense Sp245]
Length = 428
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%)
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMAT 180
AN + AD+R +DFS +K GA L AV A F ++ L ++PMAT
Sbjct: 180 ANLSGADLRGADFSMAKLKGAILNNAVVAGATFQGADLRDAELRNVPMAT 229
>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
Length = 251
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 61/146 (41%), Gaps = 29/146 (19%)
Query: 33 PLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV--------------FVSTALAA 78
P W CQ DG P + C+ L N + F S+ +A
Sbjct: 22 PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74
Query: 79 AVVASCSSNISAL--ADLNKYEAET----RGEFGIG--SAAQFGSADLRKAVHVKENFRR 130
A ++ + + ADL+K R FG + A FGSAD+ ++ +
Sbjct: 75 ANLSETEVSRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNFAQVKAAG 134
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKA 156
ANF+ +++ SDFSG+ +GA + KA
Sbjct: 135 ANFSKSELNRSDFSGADLSGANISKA 160
>gi|167645176|ref|YP_001682839.1| pentapeptide repeat-containing protein [Caulobacter sp. K31]
gi|167347606|gb|ABZ70341.1| pentapeptide repeat protein [Caulobacter sp. K31]
Length = 419
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
I + A F A L+ A V+ N ++ANF A++ +D SG+ GA L AV
Sbjct: 166 IATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLTGADLRNAV 216
>gi|288957355|ref|YP_003447696.1| hypothetical protein AZL_005140 [Azospirillum sp. B510]
gi|288909663|dbj|BAI71152.1| hypothetical protein AZL_005140 [Azospirillum sp. B510]
Length = 450
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
ADLRKA V N A+ T AD+ E+D +G+ GA L A
Sbjct: 395 ADLRKANLVGANLAGADLTGADLSEADLTGADLTGAMLTGA 435
>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 231
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
S A F AD R + K NF A F AD+ E+ G+ F GA LEKA+
Sbjct: 33 SGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGANLEKAI 81
>gi|166363932|ref|YP_001656205.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
NIES-843]
gi|166086305|dbj|BAG01013.1| pentapeptide repeat family protein [Microcystis aeruginosa
NIES-843]
Length = 164
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++AV + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 52 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 111
Query: 169 CL 170
CL
Sbjct: 112 CL 113
>gi|418019711|ref|ZP_12659144.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
gi|347604938|gb|EGY29471.1| putative low-complexity protein [Candidatus Regiella insecticola
R5.15]
Length = 381
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
DL K + N +AN T A++RE D +G+ GA LE+A +A
Sbjct: 84 DLSKMDLSRVNLEKANLTGANLREMDLTGANLTGANLERARLVRA 128
>gi|209964001|ref|YP_002296916.1| pentapeptide repeat-containing protein [Rhodospirillum centenum SW]
gi|209957467|gb|ACI98103.1| pentapeptide repeat family protein [Rhodospirillum centenum SW]
Length = 433
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 31/61 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
A A L KA V+ N R AN + AD+R +D +G+ A L A+ +A T + L
Sbjct: 367 ANLSGAKLVKASLVRANLRNANLSGADLRGADLTGANLIDANLRGALLDEAVLTGAALPL 426
Query: 171 P 171
P
Sbjct: 427 P 427
>gi|399075150|ref|ZP_10751398.1| putative low-complexity protein [Caulobacter sp. AP07]
gi|398039446|gb|EJL32581.1| putative low-complexity protein [Caulobacter sp. AP07]
Length = 380
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
I + A F A L+ A V+ N ++ANF A++ +D SG+ GA L AV
Sbjct: 127 IATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLTGADLRNAV 177
>gi|337286774|ref|YP_004626247.1| Ion transport 2 domain-containing protein [Thermodesulfatator
indicus DSM 15286]
gi|335359602|gb|AEH45283.1| Ion transport 2 domain protein [Thermodesulfatator indicus DSM
15286]
Length = 304
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 32/65 (49%), Gaps = 20/65 (30%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFT--------------------SADMRESDFSGSKF 148
AA FG A+L+KA FR A+FT AD+RE+DFSG+KF
Sbjct: 68 EAAGFGMANLKKARLFNAKFRHASFTKATLKGADAKCADFSLARLREADLREADFSGAKF 127
Query: 149 NGAYL 153
A+L
Sbjct: 128 KEAHL 132
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
F DL A + N +RA FT A+++ +DF+G+ GA LE A A F
Sbjct: 21 DFSGEDLAGAKFFRANLKRALFTGANLKGADFTGADLEGANLEGVDAEAAGF 72
>gi|332711043|ref|ZP_08430978.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332350169|gb|EGJ29774.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 343
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR 129
+ LA A++ S N + L N A+ T+ + A +A L KA+ ++ N
Sbjct: 170 LIDIDLANAILHQASLNDAELTGANLTGADLTKANL---ARANLNTAKLSKALLIRANLS 226
Query: 130 RANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ N + +AD+R +D SG+ F GA L A AN T
Sbjct: 227 KTNLSITELRNADLRNADLSGANFMGADLTGADLTSANLT 266
>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 351
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A ADL ++V NF AN T A++ ++ +G+ NGA L A AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLT 245
>gi|256397701|ref|YP_003119265.1| pentapeptide repeat-containing protein [Catenulispora acidiphila
DSM 44928]
gi|256363927|gb|ACU77424.1| pentapeptide repeat protein [Catenulispora acidiphila DSM 44928]
Length = 354
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 9/94 (9%)
Query: 70 VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-- 127
V VS A +A + LADL R + + A F ADLR+AV K
Sbjct: 218 VSVSLQHAEMRLAKLTEARCVLADLRG----ARMAEAVLNGADFTRADLREAVLRKTQAQ 273
Query: 128 ---FRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
F A+ +AD+R +D S ++ +GA E AVA
Sbjct: 274 NTVFHHADLRNADLRGADLSSAELDGARFEGAVA 307
>gi|407684714|ref|YP_006799888.1| pentapeptide repeat-containing protein [Alteromonas macleodii str.
'English Channel 673']
gi|407246325|gb|AFT75511.1| pentapeptide repeat-containing protein [Alteromonas macleodii str.
'English Channel 673']
Length = 451
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 5/58 (8%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
GIG + F SADLRKA N RRA+ A M +S+ + ++ + ++A A F
Sbjct: 267 GIGQLSLFDSADLRKA-----NLRRADIRQAQMNQSNLNDAELDYTIFDRAQLQSAQF 319
>gi|425467653|ref|ZP_18846932.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9809]
gi|389829528|emb|CCI29082.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9809]
Length = 220
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++AV + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 167
Query: 169 CL 170
CL
Sbjct: 168 CL 169
>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
Length = 319
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
Q ADLR +FR +F+ A++RE DF+G+ AYL +A N T
Sbjct: 25 QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLT 77
>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
Length = 203
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFT 164
A F ADL ++ + F R++F+ AD M +DFSG+ +GA L +A ++F+
Sbjct: 101 ADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSGALLRGVIAAGSSFS 159
>gi|254417642|ref|ZP_05031376.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196175560|gb|EDX70590.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 436
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 29/54 (53%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+A ADLR+A + R AN + AD+RE++ SG+ A L A +A F
Sbjct: 348 SADLSDADLREANLSGADLREANLSGADLREANLSGADLREANLSGANVKQAKF 401
>gi|378719423|ref|YP_005284312.1| pentapeptide repeat-containing protein [Gordonia polyisoprenivorans
VH2]
gi|375754126|gb|AFA74946.1| pentapeptide repeat family protein [Gordonia polyisoprenivorans
VH2]
Length = 481
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 27/54 (50%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A F AD R A + R AN T A++ + F+G+ GA L A +ANF
Sbjct: 394 GASFVGADGRLASFTGADLRGANLTGANLSQGSFTGANLTGANLSGANLTEANF 447
>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 215
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 31/56 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A SADL +A + N R A+ +SAD+R +D G+K GA L A AN T
Sbjct: 68 SGADLRSADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIANVT 123
>gi|425471163|ref|ZP_18850023.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9701]
gi|389882952|emb|CCI36586.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9701]
Length = 220
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++AV + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 108 NGVNLNSANLQQAVLIDADFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 167
Query: 169 CL 170
CL
Sbjct: 168 CL 169
>gi|384246084|gb|EIE19575.1| hypothetical protein COCSUDRAFT_31020 [Coccomyxa subellipsoidea
C-169]
Length = 203
Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 3/64 (4%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV---AYKANFTVD 166
A F AD+ AV + +FR+AN ++ + +G+ F+GA L+ A+ A N V
Sbjct: 123 GANFSGADMTNAVIDRVDFRKANLSNVKFINAVITGTAFDGANLDGAIFEDALIGNEDVK 182
Query: 167 EICL 170
+CL
Sbjct: 183 RLCL 186
>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 305
Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 90 ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
A+ DL NKY+A R S + DLR + NF+ A+F+ A++RE DFSG+
Sbjct: 6 AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61
Query: 149 NGAYLEKAVAYKANF 163
+ A+ +A AN
Sbjct: 62 SEAFFNEADLTGANL 76
>gi|451981277|ref|ZP_21929641.1| putative Pentapeptide repeat protein [Nitrospina gracilis 3/211]
gi|451761500|emb|CCQ90895.1| putative Pentapeptide repeat protein [Nitrospina gracilis 3/211]
Length = 484
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 2/89 (2%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L AV+ S SALA + +A+ +G + A A LR A VK + + A+
Sbjct: 377 LKEAVLGKASLKNSALAGADLRKAKLKG--AVLEGADLAGARLRHASLVKAHLKGADLHR 434
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFT 164
++ E+DFS + GA L A ++AN T
Sbjct: 435 TELDEADFSNADLQGANLTGAKLWEANLT 463
>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
Length = 175
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKA 161
+G AA F ADL A+ + F ANF AD+ + +D SG+ A L +A +
Sbjct: 70 VGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRNAVLVGVIASGS 129
Query: 162 NFT 164
FT
Sbjct: 130 TFT 132
>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 405
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ RG F S A ADLR+A AN + AD+ E++ SG+ GA L A+
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 297
Query: 159 YKANF 163
+ AN
Sbjct: 298 WGANL 302
>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 403
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
A+ RG F S A ADLR+A AN + AD+ E++ SG+ GA L A+
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 295
Query: 159 YKANF 163
+ AN
Sbjct: 296 WGANL 300
>gi|209967175|ref|YP_002300090.1| pentapeptide repeat-containing protein [Rhodospirillum centenum SW]
gi|209960641|gb|ACJ01278.1| pentapeptide repeat family protein [Rhodospirillum centenum SW]
Length = 429
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
I + A F + AV ++ +F AN D+R++D G+ F GA LE A
Sbjct: 152 IAAKADFSEVRMNGAVVLRADFTDANLARVDLRDADLRGANFRGANLEGA 201
>gi|254411535|ref|ZP_05025312.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196182036|gb|EDX77023.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 125
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 28/54 (51%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A ADLR+A N AN AD+RE++ +G+ GA++ A +AN
Sbjct: 22 GAHLIGADLREANLQGANLSHANLEGADLREANLAGANLTGAFVTNADMKEANL 75
>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 479
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 30/55 (54%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL ++ N RA+ T A +RE++ G +F GA L++A KAN
Sbjct: 60 SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANL 114
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 27/49 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
+ A A L KA V N AN T A++ +D GS+ +GA L+KAV
Sbjct: 100 TGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAV 148
>gi|428217541|ref|YP_007102006.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
gi|427989323|gb|AFY69578.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
Length = 353
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A FGSA+L A + N +AN AD+ ++D G+K G L +A +AN
Sbjct: 54 ANFGSANLLGANLSEANLTKANLREADLYKADLGGAKLIGTSLIRAYLREANL 106
>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
8327]
gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 439
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 30/54 (55%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ F SADL KA N NF+ ADM +++ G+ GA L++A +A+ +
Sbjct: 301 SDFESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLS 354
>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
Length = 180
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 5/62 (8%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKAN 162
G A F A+L A+ + F A+F AD M + DFSG+ F GA L +A +N
Sbjct: 76 GRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTGADLSDVIASGSN 135
Query: 163 FT 164
F+
Sbjct: 136 FS 137
>gi|392382619|ref|YP_005031816.1| conserved protein of unknown function; Pentapeptide repeat
[Azospirillum brasilense Sp245]
gi|356877584|emb|CCC98426.1| conserved protein of unknown function; Pentapeptide repeat
[Azospirillum brasilense Sp245]
Length = 439
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 26/47 (55%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
AA ADLR+A+ +AN T AD+ +D G+ GA L++A
Sbjct: 383 AANLMGADLRQAMLTDSRMVQANLTDADLESADLDGADLAGAKLQRA 429
>gi|417147800|ref|ZP_11988300.1| pentapeptide repeat protein [Escherichia coli 1.2264]
gi|432414449|ref|ZP_19657095.1| hypothetical protein WG9_04959 [Escherichia coli KTE39]
gi|432449036|ref|ZP_19691321.1| hypothetical protein A13S_05120 [Escherichia coli KTE191]
gi|432639030|ref|ZP_19874892.1| hypothetical protein A1UY_04405 [Escherichia coli KTE81]
gi|433026911|ref|ZP_20214794.1| hypothetical protein WI9_05012 [Escherichia coli KTE106]
gi|433186914|ref|ZP_20371055.1| hypothetical protein WGO_05288 [Escherichia coli KTE85]
gi|215272912|emb|CAT00693.1| protein mcbG [Escherichia coli]
gi|386162365|gb|EIH24165.1| pentapeptide repeat protein [Escherichia coli 1.2264]
gi|430931206|gb|ELC51659.1| hypothetical protein WG9_04959 [Escherichia coli KTE39]
gi|430969334|gb|ELC86475.1| hypothetical protein A13S_05120 [Escherichia coli KTE191]
gi|431167788|gb|ELE68043.1| hypothetical protein A1UY_04405 [Escherichia coli KTE81]
gi|431524910|gb|ELI01731.1| hypothetical protein WI9_05012 [Escherichia coli KTE106]
gi|431695578|gb|ELJ60883.1| hypothetical protein WGO_05288 [Escherichia coli KTE85]
Length = 187
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 30/57 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDE 167
F S L+K++ + FR F D+R+SDF+GS+FN + +F++ E
Sbjct: 97 VDFISLRLQKSIFLSSRFRDCLFEETDLRKSDFTGSEFNNTEFRHSDLSHCDFSMTE 153
>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 508
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 12/98 (12%)
Query: 73 STALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRAN 132
S+ L A++ + N++ L + EA+ G A +L +A K NF +AN
Sbjct: 75 SSHLVRAILQGATLNVANLVRADLSEAQLMG-------AALIRGELIRAELSKANFSKAN 127
Query: 133 FTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTV 165
T AD+RE+ +FS + +GA L A ANF +
Sbjct: 128 LTGADLREAKLTEVNFSEANLSGANLRGASGTAANFEL 165
Score = 36.2 bits (82), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 29/55 (52%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ A F DLR+A + N AN + A++R +D SG+ GA L +A AN
Sbjct: 179 NGADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANL 233
>gi|288957041|ref|YP_003447382.1| hypothetical protein AZL_002000 [Azospirillum sp. B510]
gi|288909349|dbj|BAI70838.1| hypothetical protein AZL_002000 [Azospirillum sp. B510]
Length = 424
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
AA F + L A + + R ANF+ AD+R +D +GS GA LE A
Sbjct: 166 AADFTNTRLAGARLDRTDLRDANFSGADLRGADLNGSDLRGAILEGA 212
>gi|158335471|ref|YP_001516643.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158305712|gb|ABW27329.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 502
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 13/97 (13%)
Query: 91 LADLNKYEAE-TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
L D+N A +R + S A +DL A + N R NF+ AD+ ++D S ++
Sbjct: 33 LKDINLINANLSRANLSLANLSGAFLAGSDLSDAFLSEANLSRVNFSRADLTKADLSFAR 92
Query: 148 FNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFP 184
GA L +A Y+A +L+ M +FP
Sbjct: 93 LQGATLIEATLYQA----------ILIEACMVQVIFP 119
>gi|149922858|ref|ZP_01911281.1| serine/threonine kinase [Plesiocystis pacifica SIR-1]
gi|149816325|gb|EDM75829.1| serine/threonine kinase [Plesiocystis pacifica SIR-1]
Length = 655
Score = 38.1 bits (87), Expect = 1.9, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A+ G L KA ++ + RA+ AD+R + F+ + +GA L +A+ + A+F
Sbjct: 556 SGARLGGLRLDKAEFIQASMARAHLRGADLRRARFNHADLSGADLREAIVWNADF 610
>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 213
Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 29/56 (51%)
Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
G Q A+L + + RAN A++ ++F+GSKF GA+LE A AN
Sbjct: 9 GELKQLAGANLEDENLSQTDLSRANLAGANLVGTNFAGSKFEGAHLEGANLMGANL 64
>gi|23014351|ref|ZP_00054172.1| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
magnetotacticum MS-1]
Length = 164
Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
R + G+ S A F ADL A V+ + RRA F A +R +D +G+K GA L A A
Sbjct: 87 RLDDGLFSDADFTKADLGGASLVRADLRRARFFHASLRGADLTGAKTLGAELLNADLSGA 146
Query: 162 NFT 164
+T
Sbjct: 147 RWT 149
>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
Length = 196
Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 6/99 (6%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 34 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
N R A+FT A+++ + F + +GA LE A A +F
Sbjct: 88 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDF 126
>gi|218440259|ref|YP_002378588.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218172987|gb|ACK71720.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 340
Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 37/81 (45%), Gaps = 10/81 (12%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTV 165
A+LR+A+ N R N SAD+ E+DF G+ +GA L A +AN
Sbjct: 246 ANLRQAILTYANLRGCNLLSADLAEADFEGANLSGAGLLLTYMRATNLRHANLDQANLIG 305
Query: 166 DEICLPLLVSLPMATPVFPAG 186
+ L++ +A + P G
Sbjct: 306 ASLVQTNLMAASLAQTILPNG 326
>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 740
Score = 38.1 bits (87), Expect = 2.1, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 27/54 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A +LR A N A+ AD+R +D G+ F GA L +A Y+AN T
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANIT 633
>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
Length = 740
Score = 38.1 bits (87), Expect = 2.1, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 27/54 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A +LR A N A+ AD+R +D G+ F GA L +A Y+AN T
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANIT 633
>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
Length = 194
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 6/99 (6%)
Query: 65 LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
L W+ V T + +VA+ + +LA + RG A F DLR ++
Sbjct: 32 LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85
Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
N R A+FT A+++ + F + +GA LE A A +F
Sbjct: 86 HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDF 124
>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 293
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 10/70 (14%)
Query: 110 AAQFGSADLRKAVHVKENFRRAN----------FTSADMRESDFSGSKFNGAYLEKAVAY 159
+A A+L A+ ++ N ++AN FT AD+ E D S ++ NG L +A+
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222
Query: 160 KANFTVDEIC 169
A IC
Sbjct: 223 GAKLRGVSIC 232
>gi|448684742|ref|ZP_21692829.1| pentapeptide repeat-containing protein [Haloarcula japonica DSM
6131]
gi|445782673|gb|EMA33514.1| pentapeptide repeat-containing protein [Haloarcula japonica DSM
6131]
Length = 710
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 19/47 (40%), Positives = 26/47 (55%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
AQFG+ D A + +FR A F +A + FSG FNG ++AV
Sbjct: 230 AQFGTGDFYHATFDEADFRWAEFGTARFYGATFSGGYFNGTSYDEAV 276
>gi|258612055|ref|ZP_05243959.2| phage protein [Listeria monocytogenes FSL R2-503]
gi|258608006|gb|EEW20614.1| phage protein [Listeria monocytogenes FSL R2-503]
Length = 187
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 31/61 (50%)
Query: 96 KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
K+ + GE A ADLR A N RRA+ + AD+ +D +G+ NGA L +
Sbjct: 15 KWLRDGYGERANLRGANLRGADLRGADLSYANLRRADLSRADLNGADLNGADLNGADLSR 74
Query: 156 A 156
A
Sbjct: 75 A 75
>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 517
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYLEKA 156
AD+RES + FNGA L A
Sbjct: 139 ADLRESKLQQTNFNGANLSGA 159
>gi|186683437|ref|YP_001866633.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186465889|gb|ACC81690.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 176
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 4/119 (3%)
Query: 41 SSKTESDGQFPDCSNNQCAGPY--AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYE 98
SS E+D D +N +G + + + A+ A ++ ++ L + N E
Sbjct: 55 SSLIEADLNGADLTNANLSGSNLSGAILDGAILDGAAMEGANLSQADLTVAKLIETNLSE 114
Query: 99 AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
A+ + I AA ADL A + +AN T AD+ +++ SG+ +GA +E +
Sbjct: 115 ADLQEASLI--AANLDGADLSGADLTVADLSQANLTQADLNQTNLSGANLDGANIEGTI 171
>gi|428213860|ref|YP_007087004.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002241|gb|AFY83084.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 331
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 47/122 (38%), Gaps = 14/122 (11%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR----- 130
L A + + N++ L D N +A+ RG LR A NFR
Sbjct: 92 LQGADLRKANLNLANLLDANLSDADLRG-------TTLSGVCLRGACLRGANFREERRIY 144
Query: 131 --ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAGFC 188
AN AD+R +D G +GA L KA AN T + L MA + GF
Sbjct: 145 SAANLRGADLRGADLRGVNLSGADLTKADLSGANLTETNLRGANLERAKMALAIVNGGFL 204
Query: 189 AP 190
+
Sbjct: 205 SD 206
>gi|381207646|ref|ZP_09914717.1| hypothetical protein SclubJA_18738 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 219
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 27/55 (49%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+A ADL +A K + R AN AD+RES+ G GA L+ A AN
Sbjct: 126 QSADLSEADLYRADLEKSDLRDANLYKADLRESNLQGVNLQGANLQGADLEGANL 180
>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
Length = 517
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A++ + N++ L + EA+ I A+L +A K NF +AN
Sbjct: 86 LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138
Query: 136 ADMRESDFSGSKFNGAYLEKA 156
AD+RES + FNGA L A
Sbjct: 139 ADLRESKLQQTNFNGANLSGA 159
>gi|430900982|ref|ZP_19484783.1| LPXTG-domain-containing protein cell wall anchor domain
[Enterococcus faecium E1575]
gi|430554860|gb|ELA94429.1| LPXTG-domain-containing protein cell wall anchor domain
[Enterococcus faecium E1575]
Length = 1074
Score = 37.7 bits (86), Expect = 2.4, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQF 113
+KLKNWRV V + ++AS S+I AD+N E + E+G+G+ +F
Sbjct: 4 SKLKNWRVAVVVVMIIQLLASFVSSIIVHADINHPE-QVSIEYGVGTGYRF 53
>gi|428774386|ref|YP_007166174.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanobacterium stanieri PCC 7202]
gi|428688665|gb|AFZ48525.1| serine/threonine protein kinase with pentapeptide repeats
[Cyanobacterium stanieri PCC 7202]
Length = 506
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A F AD +A V+ N +A+ A+++ +DF + GA LE A YKAN
Sbjct: 420 ANFYHADFSRARLVRANLTKAHLFKAELQYADFRNANLTGANLEGANLYKANL 472
>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
Length = 243
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 9/63 (14%)
Query: 94 LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
LNKY+ R F S LR+ + N + NF SAD+R+S S FNGA L
Sbjct: 7 LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57
Query: 154 EKA 156
++A
Sbjct: 58 KQA 60
>gi|385679319|ref|ZP_10053247.1| pentapeptide repeat-containing protein [Amycolatopsis sp. ATCC
39116]
Length = 194
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 18/119 (15%)
Query: 53 CSNNQCAGPYAKLKNWR---------VFVSTALAAAVVASCSSNISALADLNKYEAETRG 103
C+ ++C A L R VF T LA + ++CS S+ D R
Sbjct: 40 CTFDECDFSGADLGESRHQASAFRSCVFDRTVLADSTWSACSLLGSSFVDGGLRGMSVRD 99
Query: 104 -EFGIGSAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKA 156
+F S A F A+LR+ FR A+F T AD+R+SDF G++ GA L A
Sbjct: 100 SDF---SLANFSRANLRRRSLSGLRFREASFVDANLTEADLRDSDFRGARLGGADLTGA 155
>gi|288960397|ref|YP_003450737.1| pentapeptide repeat protein [Azospirillum sp. B510]
gi|288912705|dbj|BAI74193.1| pentapeptide repeat protein [Azospirillum sp. B510]
Length = 431
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 29/57 (50%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
F A +R V+ R ANFT ++M +D SG+ GA L AV A T+ ++
Sbjct: 167 DFSDAVMRGCKLVRATMRGANFTGSNMEGADLSGADLRGACLRGAVLTGATMTMTDL 223
>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 319
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
Q ADLR +FR + + A++RE DF+G+ AYL +A NFT
Sbjct: 25 QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFT 77
>gi|389874428|ref|YP_006373784.1| pentapeptide repeat-containing protein [Tistrella mobilis
KA081020-065]
gi|388531608|gb|AFK56802.1| pentapeptide repeat-containing protein [Tistrella mobilis
KA081020-065]
Length = 178
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYK 160
G AA F ADL+ A + RA+FT AD+R +D G+ F GA L A Y
Sbjct: 95 GKAEAAIFAEADLQSADFTRSKAARADFTGADLRRARFYRADLRGADFTGANLTGADLYD 154
Query: 161 ANF 163
A+
Sbjct: 155 ADL 157
>gi|428299369|ref|YP_007137675.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428235913|gb|AFZ01703.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 255
Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 40/86 (46%), Gaps = 10/86 (11%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTV 165
A ADL +A N R AN +A + E++ S GA L+KA AN +T
Sbjct: 160 ANLAEADLFRANLRSANLRGANLQNAGLVEANLQSSNLAGAKLQKATLNGANLKDAKYTS 219
Query: 166 D----EICLPLLVSLPMATPVFPAGF 187
+ E+C L VS P T VF GF
Sbjct: 220 ENASPELCKSLSVSYPCPT-VFLEGF 244
>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
Length = 273
Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 8/103 (7%)
Query: 69 RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVK 125
R +T L +AV + S N + ++ R IG+ A+ ++DL +A +
Sbjct: 131 RFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAVFALAKLENSDLSEADCQQ 190
Query: 126 ENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
NF+RAN F D RE++F+ + GA L+K+ ANF
Sbjct: 191 TNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLGGANF 233
>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
Length = 171
Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 29/52 (55%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
F A+LR + + R +F +A+M E++ G+ F GA L+ A KAN T
Sbjct: 60 FNKANLRNSNFTNADLRGVSFFAANMEEANLEGANFTGATLDLARMMKANLT 111
>gi|336250332|ref|YP_004594042.1| hypothetical protein EAE_19280 [Enterobacter aerogenes KCTC 2190]
gi|334736388|gb|AEG98763.1| hypothetical protein EAE_19280 [Enterobacter aerogenes KCTC 2190]
Length = 846
Score = 37.7 bits (86), Expect = 2.9, Method: Composition-based stats.
Identities = 18/53 (33%), Positives = 27/53 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ AD R A ++ N + F D RE+DF+ + GA L+K+ ANF
Sbjct: 754 SDLSEADCRDASFIRANLVGSLFVRTDFREADFTDANLMGALLQKSQLAGANF 806
>gi|444351422|ref|YP_007387566.1| pentapeptide repeat [Enterobacter aerogenes EA1509E]
gi|443902252|emb|CCG30026.1| pentapeptide repeat [Enterobacter aerogenes EA1509E]
Length = 846
Score = 37.4 bits (85), Expect = 2.9, Method: Composition-based stats.
Identities = 18/53 (33%), Positives = 27/53 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ AD R A ++ N + F D RE+DF+ + GA L+K+ ANF
Sbjct: 754 SDLSEADCRDASFIRANLVGSLFVRTDFREADFTDANLMGALLQKSQLAGANF 806
>gi|300865105|ref|ZP_07109930.1| serine/threonine protein kinase [Oscillatoria sp. PCC 6506]
gi|300336876|emb|CBN55080.1| serine/threonine protein kinase [Oscillatoria sp. PCC 6506]
Length = 540
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 37/79 (46%), Gaps = 9/79 (11%)
Query: 91 LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFS 144
LA N YEA TR A A+L A V+ N R AN T +A+++ +D
Sbjct: 430 LAGANFYEARLTRANL---QGADLSEANLGHARLVEANLRDANLTQAYCSTANLQSADLR 486
Query: 145 GSKFNGAYLEKAVAYKANF 163
G+ GAYL KA AN
Sbjct: 487 GANLAGAYLSKANLRGANL 505
>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 189
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A A+L++A N RAN + AD+ +D SG+ GA L A +AN T
Sbjct: 40 ADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMRANLT 93
>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
NIES-39]
gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 390
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 33/54 (61%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+A ADL +A+ +K NF +A+ +SA++ +S+ + F AYL KA +A+
Sbjct: 111 SAHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADL 164
>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 171
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 29/52 (55%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
F A+LR + + R +F +A+M E++F G+ GA L+ A KAN T
Sbjct: 60 FNKANLRNSNFTNADLRGVSFFAANMEEANFEGANLTGATLDLARMMKANLT 111
>gi|428201834|ref|YP_007080423.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
gi|427979266|gb|AFY76866.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
Length = 143
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
SAA ADLR+A N AN T A++ +D +G+ GA L KA A F
Sbjct: 49 SAAHLIGADLREA-----NLSGANLTEANLEGADLTGANLQGANLTKAFVTNATF 98
>gi|425439840|ref|ZP_18820154.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9717]
gi|389719844|emb|CCH96379.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9717]
Length = 225
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++AV + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 113 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 172
Query: 169 C 169
C
Sbjct: 173 C 173
>gi|268325885|emb|CBH39473.1| conserved hypothetical protein [uncultured archaeon]
Length = 358
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ A L KA K + R A AD+R +D S +K NGA L A Y A+ + ++
Sbjct: 258 TGASLNGGKLYKAKLRKADLRGAKMYKADLRWADLSSTKLNGADLTDADLYGADLSGAKL 317
Query: 169 CLPLLVSLPMATPVFPA 185
C L + F
Sbjct: 318 CEADLRKTDLRGTTFDG 334
>gi|448736468|ref|ZP_21718581.1| Ion transport 2 domain-containing protein [Halococcus thailandensis
JCM 13552]
gi|445806103|gb|EMA56272.1| Ion transport 2 domain-containing protein [Halococcus thailandensis
JCM 13552]
Length = 345
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 27/55 (49%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A ADLR+A + + RRA F AD+ + F + A L +A Y+ FT
Sbjct: 90 GADLSGADLRRATFDRVDARRARFDGADVEGATFENADLRDASLNRAKLYRTGFT 144
>gi|297170923|gb|ADI21940.1| uncharacterized low-complexity proteins [uncultured nuHF2 cluster
bacterium HF0130_29D04]
Length = 695
Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 29/54 (53%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A F ADLR+A V + R ANF A+++ + + GA LE+A Y A+
Sbjct: 139 GANFRGADLREAKLVGADLREANFRGANLQTAYLIKADLKGANLEEASLYGADL 192
>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
Length = 402
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 26/117 (22%)
Query: 71 FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
F L A++ A+ I A ADL K +A G A F A L +A+ + NF
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIGANF 315
Query: 129 RRA---------------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
+A N T AD+ E+D +G+ AYL KA+ +A ++E+ L
Sbjct: 316 EKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEA--ILEEVIL 370
Score = 35.8 bits (81), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR--ESDFSGSKFNGAYLEKAVAYKANF 163
A F A L +A+ + NF A AD+ +++F+G+ F GA L +A+ ANF
Sbjct: 261 ANFTRAILTEAILIGANFEEAILAGADLTKAKANFTGANFTGAILTEAILIGANF 315
>gi|254421888|ref|ZP_05035606.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
gi|196189377|gb|EDX84341.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
Length = 194
Score = 37.4 bits (85), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 59/149 (39%), Gaps = 12/149 (8%)
Query: 21 SKGPYQLHALSKPLWVACQIS-SKTESDGQFPDCSNNQCAGPYAKLK--NWRV--FVSTA 75
S G L + P W Q +D + P C N+ A+L N +V
Sbjct: 18 SIGLIGLLGFAAPSWAYLQEDVDMLMNDNECPVCILNEADLVGAQLNHANLKVASLTGAN 77
Query: 76 LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
L A ++ + +S L N A G AQ A L+ AV + AN T
Sbjct: 78 LTGADLSETNLMLSELIGTNLTNASLAG-------AQMNGAQLKDAVLKGADLSGANLTQ 130
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A++ +++F G+K + AV ANFT
Sbjct: 131 ANLEDANFVGAKLINTEMTAAVVGVANFT 159
>gi|390440421|ref|ZP_10228750.1| membrane hypothetical protein [Microcystis sp. T1-4]
gi|389836163|emb|CCI32876.1| membrane hypothetical protein [Microcystis sp. T1-4]
Length = 904
Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 7/67 (10%)
Query: 91 LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
L D+N E++ G A +A+L+KA + R +FT+AD+ ++D +G+ G
Sbjct: 534 LKDINFTESDLSG-------ALLRNANLKKANLTRTILNRVDFTNADLSDADLTGASVKG 586
Query: 151 AYLEKAV 157
A + A+
Sbjct: 587 AKFDNAI 593
>gi|428317459|ref|YP_007115341.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
gi|428241139|gb|AFZ06925.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
Length = 197
Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 8/75 (10%)
Query: 89 SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
S LAD N +A G A A+L++AV ++ N R A+ + AD+R +DF +
Sbjct: 29 SDLADANLSQANLSG-------ANLVGANLQRAV-LRANLRGADLSGADLRGADFRNADL 80
Query: 149 NGAYLEKAVAYKANF 163
GA A+ A+F
Sbjct: 81 RGASFANALVRDASF 95
>gi|307154970|ref|YP_003890354.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306985198|gb|ADN17079.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
Length = 231
Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 28/47 (59%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
DLR+A + N AN AD+ +++ SG+ + A+LEKA+ AN
Sbjct: 55 DLREANLTQANLNWANLHKADLTQANLSGANLSQAFLEKAILIAANL 101
>gi|359457318|ref|ZP_09245881.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
5410]
Length = 510
Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 13/97 (13%)
Query: 91 LADLNKYEAE-TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
L D+N A +R + S A +DL A + N R NF+ AD+ ++D S ++
Sbjct: 41 LQDINLINANLSRANLSLANLSGAFLAGSDLSNAFLSEANLSRVNFSRADLTKADLSFAR 100
Query: 148 FNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFP 184
GA L +A Y+A +L+ M +FP
Sbjct: 101 LQGATLIEANLYQA----------ILIEACMVQVIFP 127
>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
Length = 261
Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 28/53 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A F S L A K + NFT AD++ +DFSG++ N A L A+ A F
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATF 167
>gi|381204293|ref|ZP_09911364.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 156
Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 9/102 (8%)
Query: 71 FVSTALAAAVVASCSSNISALADL-NKYEAETRG----EFGIGSAAQFGS----ADLRKA 121
V+T L A A ++ L D N + + RG EF + + S ADLRKA
Sbjct: 20 IVATLLTADASAYKQEDLDKLQDTYNCVKCDLRGAILREFNLTGTNLYKSDLRKADLRKA 79
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
N N T A +RE++ +G+ +GA+L +A AN
Sbjct: 80 DLRDTNLGDTNLTGAVLREANLTGANMSGAHLWEANLTGANL 121
>gi|428318454|ref|YP_007116336.1| serine/threonine protein kinase with pentapeptide repeats
[Oscillatoria nigro-viridis PCC 7112]
gi|428242134|gb|AFZ07920.1| serine/threonine protein kinase with pentapeptide repeats
[Oscillatoria nigro-viridis PCC 7112]
Length = 543
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 29/60 (48%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
AA A+L A ++ N R AN T A ++F G+ F GA L A KAN +C
Sbjct: 450 AADLSGANLGHARLIQANLRDANLTEAYCSTANFEGADFRGADLTGAYLTKANLRGANLC 509
>gi|428224166|ref|YP_007108263.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984067|gb|AFY65211.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 583
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 25/46 (54%)
Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
RAN T A + ++ + N A L++AV A+ T E+CL LL
Sbjct: 52 LNRANLTEASLHHANLRNASLNSALLDRAVLSGADLTKAELCLALL 97
>gi|217977179|ref|YP_002361326.1| pentapeptide repeat-containing protein [Methylocella silvestris
BL2]
gi|217502555|gb|ACK49964.1| pentapeptide repeat protein [Methylocella silvestris BL2]
Length = 260
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
NF +A M ++FSG K +GA L++A A KANF
Sbjct: 79 NFRAARMNNTNFSGGKLDGAVLDQAWALKANF 110
>gi|158340181|ref|YP_001521351.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158310422|gb|ABW32037.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 535
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 12/106 (11%)
Query: 71 FVSTALAAAVVASCSSNIS-----ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
V+ L+ A++ S N S L + N A+ + I ++ ADLR A +K
Sbjct: 387 LVNADLSKAILKSAELNKSYLTFAKLQEANLTNAQLTEAYLISTS--LREADLRSANLLK 444
Query: 126 ENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTVD 166
+ R A+ ++D+R E+ SG++ GA L+ A KAN + D
Sbjct: 445 ADLRWADLINSDLRGANLRETKLSGARLYGANLKDADLSKANLSAD 490
>gi|16126499|ref|NP_421063.1| pentapeptide repeat-containing protein [Caulobacter crescentus
CB15]
gi|221235279|ref|YP_002517716.1| hypothetical protein CCNA_02343 [Caulobacter crescentus NA1000]
gi|13423771|gb|AAK24231.1| pentapeptide repeat family protein [Caulobacter crescentus CB15]
gi|220964452|gb|ACL95808.1| hypothetical protein with pentapeptide repeats [Caulobacter
crescentus NA1000]
Length = 419
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 29/51 (56%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
+ + A F A L+ A V+ N ++ANF A++ +D SG+ GA L AV
Sbjct: 166 VATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLAGADLRNAV 216
>gi|301029833|ref|ZP_07192875.1| pentapeptide repeat protein [Escherichia coli MS 196-1]
gi|126812|sp|P05530.1|MCBG_ECOLX RecName: Full=Protein McbG
gi|41983|emb|CAA30724.1| unnamed protein product [Escherichia coli]
gi|299877321|gb|EFI85532.1| pentapeptide repeat protein [Escherichia coli MS 196-1]
Length = 187
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 30/57 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDE 167
F S L+K++ + FR F D+R+SDF+GS+FN + +F++ E
Sbjct: 97 VDFISLRLQKSIFLSCRFRDCLFEETDLRKSDFTGSEFNNTEFRHSDLSHCDFSMTE 153
>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
Length = 268
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
N AN +AD+ E++ ++ NGAYL KA YKAN
Sbjct: 160 NLIEANLINADLSEANLYEAQLNGAYLYKANFYKANL 196
>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
platensis str. Paraca]
Length = 220
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 48/112 (42%), Gaps = 5/112 (4%)
Query: 56 NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
N+ YA+ R F +L AA+ + N L+ N EA IG S +Q
Sbjct: 10 NKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
ADL AV + N A+ T + ++D SG+ +GA L + N T
Sbjct: 68 LSYADLSMAVLIDANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLT 119
>gi|409989360|ref|ZP_11272974.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
gi|409939778|gb|EKN80828.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
Paraca]
Length = 333
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 39/87 (44%), Gaps = 7/87 (8%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
F+ L A + C + L++ N A RG A A+LR A N
Sbjct: 242 FIKANLMKADLEECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 294
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
AN +AD+R++ F + NGA L+ A+
Sbjct: 295 ANLENADLRDASFRHATLNGAMLQDAI 321
>gi|443478905|ref|ZP_21068593.1| serine/threonine protein kinase with pentapeptide repeats
[Pseudanabaena biceps PCC 7429]
gi|443015732|gb|ELS30565.1| serine/threonine protein kinase with pentapeptide repeats
[Pseudanabaena biceps PCC 7429]
Length = 545
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 29/54 (53%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
ADL A + + R AN SA M ++D SG+ +GA L+ A +AN +C
Sbjct: 459 ADLGSASMILADMREANLQSAYMSKADLSGANLSGANLKGAYLSQANLNGTNLC 512
>gi|425458309|ref|ZP_18837797.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9808]
gi|389827863|emb|CCI20729.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 9808]
Length = 220
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++AV + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFRGADLNYANLSGSLLYRANFADCRL 167
Query: 169 C 169
C
Sbjct: 168 C 168
>gi|39997499|ref|NP_953450.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
PCA]
gi|39984390|gb|AAR35777.1| pentapeptide repeat domain protein [Geobacter sulfurreducens PCA]
Length = 254
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
A+F A+L A K N + NF+ A++ ++FSG+K A L AV NF+ ++
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176
Query: 171 PLLVSLPMATPVF 183
L SL + F
Sbjct: 177 TDLGSLDLEGANF 189
>gi|409912856|ref|YP_006891321.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
KN400]
gi|298506440|gb|ADI85163.1| pentapeptide repeat domain protein [Geobacter sulfurreducens KN400]
Length = 259
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
A+F A+L A K N + NF+ A++ ++FSG+K A L AV NF+ ++
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176
Query: 171 PLLVSLPMATPVF 183
L SL + F
Sbjct: 177 TDLGSLDLEGANF 189
>gi|67924929|ref|ZP_00518320.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
gi|67853235|gb|EAM48603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
Length = 366
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 5/70 (7%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
A + +L A NFR AN T D+ E S FSG+ +GAYL A +A+F
Sbjct: 246 ATELSGIELSGANLTHSNFRGANLTDVDLSEAILSYSRFSGADLSGAYLGNANLQQADFY 305
Query: 165 VDEICLPLLV 174
+ L L+
Sbjct: 306 RSSLALANLI 315
>gi|113476307|ref|YP_722368.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110167355|gb|ABG51895.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 225
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 27/53 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
A F +L+ A N NF AD+ ++ SG+ GA LEKA Y+A+
Sbjct: 52 ANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADI 104
>gi|428314067|ref|YP_007125044.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428255679|gb|AFZ21638.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 745
Score = 37.0 bits (84), Expect = 4.3, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 5/56 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S+A+ SADLR+ V A+ T AD+ E+ F+ + +GA L K A +++FT
Sbjct: 564 SSAKLISADLRQGV-----LENASLTGADLGEAKFARANLHGARLGKVKAVRSDFT 614
>gi|428307960|ref|YP_007144785.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428249495|gb|AFZ15275.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 201
Score = 37.0 bits (84), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 11/84 (13%)
Query: 84 CSSNISALADL---NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
C +N+ ADL + +EA G IG A+ ADL A NFR AN AD+ E
Sbjct: 93 CEANLGG-ADLIEADLFEANLTGANLIG--AKLIGADLTGA-----NFREANLMGADLFE 144
Query: 141 SDFSGSKFNGAYLEKAVAYKANFT 164
++ SG+ +GA L A AN +
Sbjct: 145 ANLSGANLSGANLSGANLTLANLS 168
>gi|416406325|ref|ZP_11688097.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
gi|357261078|gb|EHJ10386.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
Length = 366
Score = 37.0 bits (84), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 5/70 (7%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
A + +L A NFR AN T D+ E S FSG+ +GAYL A +A+F
Sbjct: 246 ATELSGIELSGANLTHSNFRGANLTDVDLSEAILSYSRFSGADLSGAYLGNANLQQADFY 305
Query: 165 VDEICLPLLV 174
+ L L+
Sbjct: 306 RSSLALANLI 315
>gi|218440380|ref|YP_002378709.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
gi|218173108|gb|ACK71841.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
Length = 206
Score = 37.0 bits (84), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 1/54 (1%)
Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ADLR A+ + AN AD+R F G+ GA L A+ K N +DEI
Sbjct: 127 NADLRGAIFEGTSLVNANLCFADLRRCQFDGANLEGATLTNAI-LKDNQKIDEI 179
>gi|381204405|ref|ZP_09911476.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 135
Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 31/61 (50%), Gaps = 3/61 (4%)
Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
G+F A DLR+A ++AN ++ADM E++ S + GA LE A +AN
Sbjct: 18 GDF---EGANLSGMDLRRANLSGAALKKANLSNADMTEANLSVADLTGAKLENAKLRQAN 74
Query: 163 F 163
Sbjct: 75 L 75
>gi|334119964|ref|ZP_08494048.1| serine/threonine protein kinase with pentapeptide repeats
[Microcoleus vaginatus FGP-2]
gi|333457605|gb|EGK86228.1| serine/threonine protein kinase with pentapeptide repeats
[Microcoleus vaginatus FGP-2]
Length = 543
Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 29/60 (48%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
AA A+L A ++ N R AN T A ++F G+ F GA L A KAN +C
Sbjct: 450 AADLSGANLGHARLIQANLRDANLTEAYCSTANFEGADFRGADLTGAYLTKANLRGANLC 509
>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
IMS101]
gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
Length = 710
Score = 37.0 bits (84), Expect = 4.4, Method: Composition-based stats.
Identities = 28/105 (26%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 68 WRVFVSTA-LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVK 125
+R +S A + + + + + + L + N ++A T F + A GSADL KA +
Sbjct: 510 FRATLSKAIMPGSTITQANFSSAKLIETNLHQANLTEATF---TGADLGSADLSKANLYR 566
Query: 126 ENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
N + F +D+RES++ G+ +GA +A KA+ ++
Sbjct: 567 ANLSKVKAEGTTFQLSDLRESNWQGANLSGANFSRANLKKADLSL 611
>gi|16330983|ref|NP_441711.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
gi|383322725|ref|YP_005383578.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr. GT-I]
gi|383325894|ref|YP_005386747.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
PCC-P]
gi|383491778|ref|YP_005409454.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
PCC-N]
gi|384437045|ref|YP_005651769.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803]
gi|451815141|ref|YP_007451593.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
gi|15214308|sp|P74297.1|SPKB_SYNY3 RecName: Full=Serine/threonine-protein kinase B
gi|1653478|dbj|BAA18391.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
gi|11022717|dbj|BAB17034.1| Ser/Thr protein kinase SpkB [Synechocystis sp. PCC 6803]
gi|339274077|dbj|BAK50564.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803]
gi|359272044|dbj|BAL29563.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr. GT-I]
gi|359275214|dbj|BAL32732.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
PCC-N]
gi|359278384|dbj|BAL35901.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
PCC-P]
gi|407961651|dbj|BAM54891.1| eukariotic protein kinase [Bacillus subtilis BEST7613]
gi|451781110|gb|AGF52079.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
Length = 574
Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 40/94 (42%), Gaps = 7/94 (7%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
V LA A V + + L + N +AE + A FG A L+ + N
Sbjct: 456 LVGIVLAKAFVPGINCYQANLTNANFEQAEL-------TRADFGKARLKNVIFKGANLSD 508
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A F AD+R +D G+ NG + A ANF+
Sbjct: 509 AYFGYADLRGADLRGANLNGVNFKYANLQGANFS 542
>gi|376007502|ref|ZP_09784697.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
gi|375324138|emb|CCE20450.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
Length = 179
Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 29/62 (46%)
Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
RGE+ G AD+ R+AN T+A+M + DF+G+ F + L +
Sbjct: 40 RGEYSSCQGCNLGGADMSNQSRRNAQLRQANLTNANMSDGDFTGAFFTCSNLSNSNLSGG 99
Query: 162 NF 163
NF
Sbjct: 100 NF 101
>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
Length = 739
Score = 37.0 bits (84), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 29/53 (54%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ F SADL ++ N RAN ++A+++ DF+ ++ GA L A Y A
Sbjct: 608 SDFSSADLSQSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKL 660
>gi|194336259|ref|YP_002018053.1| pentapeptide repeat-containing protein [Pelodictyon
phaeoclathratiforme BU-1]
gi|194308736|gb|ACF43436.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 180
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 11/129 (8%)
Query: 35 WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
++ C +S S + DC C AKLKN ++L +C +D
Sbjct: 21 FIHCNFNSADLSGVRMIDCRFEGCDLSLAKLKN------SSLQKVKFVNCKLLGVLFSDC 74
Query: 95 NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
K+ + + I + F L+ + + + A+F+ AD+ SG+KF+G+ L
Sbjct: 75 RKFMLDLDFDRCILKLSLFAGLKLKNTRFINCDLQEADFSEADL-----SGAKFDGSDLL 129
Query: 155 KAVAYKANF 163
+ + + +N
Sbjct: 130 QTIFFHSNL 138
>gi|428226949|ref|YP_007111046.1| hypothetical protein GEI7407_3527 [Geitlerinema sp. PCC 7407]
gi|427986850|gb|AFY67994.1| Tetratricopeptide TPR_1 repeat-containing protein [Geitlerinema sp.
PCC 7407]
Length = 575
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 50/123 (40%), Gaps = 21/123 (17%)
Query: 68 WRVFVSTALAAAVVASCSSNISALADLNKYEAETR---------GEFGIGS--------- 109
WR + AL A + + ++ + K ETR G++G +
Sbjct: 15 WRSLAALALVVAPMVGTDAALAEKPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNS 74
Query: 110 ---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVD 166
A F SADL++ N RRA+ AD+ +DF + NGA L + AN +
Sbjct: 75 NLENADFESADLQRTDFSSANLRRADLERADLERADFQSAILNGADLSNSDLSYANLSNS 134
Query: 167 EIC 169
++
Sbjct: 135 DLS 137
>gi|425452623|ref|ZP_18832440.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 7941]
gi|389765493|emb|CCI08619.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
aeruginosa PCC 7941]
Length = 220
Score = 37.0 bits (84), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++AV + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFANCRL 167
Query: 169 C 169
C
Sbjct: 168 C 168
>gi|172039549|ref|YP_001806050.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
51142]
gi|354552189|ref|ZP_08971497.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
gi|171701003|gb|ACB53984.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
gi|353555511|gb|EHC24899.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
Length = 367
Score = 37.0 bits (84), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 5/70 (7%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
+ +L A NFR AN T AD+ E + FSG+ +GAYL A +A+F
Sbjct: 245 GTELSGIELSGANLTHSNFRGANLTDADLSEAILSYTRFSGADLSGAYLGNANLQQADFY 304
Query: 165 VDEICLPLLV 174
+ L L+
Sbjct: 305 RSSLALANLI 314
>gi|427736321|ref|YP_007055865.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427371362|gb|AFY55318.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 642
Score = 37.0 bits (84), Expect = 4.9, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 9/68 (13%)
Query: 110 AAQFGSADLRKAVHVKENFRRA---------NFTSADMRESDFSGSKFNGAYLEKAVAYK 160
A F A+L+K V + N A NF AD+ ++DFS +K A L+KA K
Sbjct: 421 GANFCKANLKKTVFIAANLTEAIESEEVIVTNFEEADLEKADFSCAKLIRANLQKANLVK 480
Query: 161 ANFTVDEI 168
AN ++
Sbjct: 481 ANLKAADL 488
>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 169
Score = 37.0 bits (84), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 30/54 (55%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
AQF A+LR + N + +F +A+M +++F G+ GA L+ A K N T
Sbjct: 57 AQFTKANLRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLT 110
>gi|220910319|ref|YP_002485630.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
gi|219866930|gb|ACL47269.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
Length = 165
Score = 37.0 bits (84), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 3/66 (4%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF---TV 165
+ AQ ADLRKAV N AN A +R ++F+G+ +GA L A A ++ +
Sbjct: 93 TGAQLPKADLRKAVLSGANLAGANLRDAKLRGANFAGADLHGADLFGAEALRSELLEGIL 152
Query: 166 DEICLP 171
++ +P
Sbjct: 153 NQTIMP 158
>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 171
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKA 161
+G A F ADL A+ + F A+F+ AD M +DF+G+ A L +A +
Sbjct: 66 VGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNLRDAVLTGIIASGS 125
Query: 162 NFTVDEIC 169
+F+ +I
Sbjct: 126 SFSDAQIA 133
>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 114
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 29/56 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S+ F ADLR A N R A E DF+G+ + A +++AV KANFT
Sbjct: 1 SSQNFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFT 56
>gi|406987204|gb|EKE07615.1| hypothetical protein ACD_18C00027G0001, partial [uncultured
bacterium]
Length = 406
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 28/63 (44%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ F ++DLR N NFT++D+R +DF G+ F G E A F
Sbjct: 337 TLTNFTNSDLRNVNFRDANLTWTNFTNSDLRNADFRGASFTGTIFENTNLEGAKFDKKNK 396
Query: 169 CLP 171
LP
Sbjct: 397 NLP 399
>gi|186684326|ref|YP_001867522.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
73102]
gi|186466778|gb|ACC82579.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
Length = 413
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 163
S A A+L KA+ V N + NFT A++ E+D S GS F A L KA +AN
Sbjct: 216 SNADLTEANLSKAIFVGANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKATLPEANL 275
>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
BS1]
gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
Length = 300
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-FNGAYLEKAVAYKANF 163
AA F ADLR A + N R A+ T AD+R + S S+ G+ L A+ + AN
Sbjct: 200 AANFSGADLRDADLSEVNLRNADLTGADLRGARLSFSQNMTGSTLNNAILHSANL 254
>gi|427734465|ref|YP_007054009.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427369506|gb|AFY53462.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 269
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 28/56 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A G ADLR+A N AN T A + ++ SGS +GA L A AN T
Sbjct: 68 SEADLGEADLREANLKGANLTGANLTGATLMNANLSGSNLSGACLSGAKLSGANLT 123
>gi|329850490|ref|ZP_08265335.1| pentapeptide repeat 8 copies family protein [Asticcacaulis
biprosthecum C19]
gi|328840805|gb|EGF90376.1| pentapeptide repeat 8 copies family protein [Asticcacaulis
biprosthecum C19]
Length = 163
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 30/63 (47%), Gaps = 7/63 (11%)
Query: 104 EFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
+ GIG S F ADLR F RA+F ADM ++F GAYLE A A
Sbjct: 64 DLGIGIFSGTNFSGADLRDVNGSAALFGRASFAGADMTNANFV-----GAYLEHANFRGA 118
Query: 162 NFT 164
N T
Sbjct: 119 NLT 121
>gi|428224795|ref|YP_007108892.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
gi|427984696|gb|AFY65840.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
Length = 284
Score = 36.6 bits (83), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 26/52 (50%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
A ADL +A N RRANFT+A MR + S GA + + Y+A
Sbjct: 184 GANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRA 235
>gi|443475902|ref|ZP_21065833.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
gi|443019187|gb|ELS33316.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
Length = 133
Score = 36.6 bits (83), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 8/111 (7%)
Query: 79 AVVASCSSNISALADLNKYEA--ETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTS 135
A + S S+ SALAD N+ +TR G S + A+LR A N R AN S
Sbjct: 16 AAITSISAIESALADPNQIRQVLQTRECAGCNLSREKLSFANLRGA-----NLRNANLFS 70
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAG 186
AD++ +D + GA L+KA A+ T ++ + + + P G
Sbjct: 71 ADLKLADLREANLIGAILDKADLRGADLTGADLTGAYMSETNLCGAIMPDG 121
>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
Length = 166
Score = 36.6 bits (83), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 52/125 (41%), Gaps = 22/125 (17%)
Query: 72 VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 123
++T L A +V C + ALA KY AE +G+ F LR A
Sbjct: 6 LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56
Query: 124 VKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDEICLPLLVSLPM 178
N R NFT AD+R + FS S +GA L A+ + +FT ++ +L M
Sbjct: 57 SNANLERTNFTDADLRGTIFSASVMTHANLHGADLSNAMIDQVSFTNADLSDAVLTESIM 116
Query: 179 ATPVF 183
F
Sbjct: 117 LRSTF 121
>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
Length = 340
Score = 36.6 bits (83), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 12/109 (11%)
Query: 66 KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 116
NWR VF S L+AA ++S + +++ L +N A ++ S A G A
Sbjct: 18 NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
+L +A + N ANF AD+ + S S + A L AVA ANF +
Sbjct: 75 NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANFIM 123
>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
MBIC11017]
gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
Length = 521
Score = 36.6 bits (83), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
A G ADL A N RANF A ++E+D + + +GA+L A AN +
Sbjct: 88 AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLS 141
>gi|427719675|ref|YP_007067669.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
gi|427352111|gb|AFY34835.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
Length = 291
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 63 AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
AKL + + +AA ++A+ L++ N YEAE G + A A+L KA
Sbjct: 172 AKLMRANLSFANLIAANLIATD------LSEANLYEAEVMGAYLY--QADLYKANLSKAH 223
Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
RAN T AD+R ++ S S +GA L A +AN T
Sbjct: 224 LSSAYLFRANLTKADLRGANLSWSNLSGANLAGADLCRANLT 265
>gi|83309857|ref|YP_420121.1| hypothetical protein amb0758 [Magnetospirillum magneticum AMB-1]
gi|82944698|dbj|BAE49562.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
AMB-1]
Length = 164
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 23/103 (22%)
Query: 63 AKLKNWRV-FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
AK+ +RV F+S AL A R + G+ S A F ADL A
Sbjct: 69 AKVDGYRVRFISAALVGA----------------------RLDDGVFSEADFTKADLGGA 106
Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ + RRA F A +R +D +G++ GA L A A +T
Sbjct: 107 SLARADLRRARFYHASLRGADLTGARTLGAELLNADLSGARWT 149
>gi|374293141|ref|YP_005040176.1| hypothetical protein AZOLI_2775 [Azospirillum lipoferum 4B]
gi|357425080|emb|CBS87961.1| Conserved protein of unknown function; pentapeptide repeat domains
[Azospirillum lipoferum 4B]
Length = 425
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
AA F + L A + + R ANF+ AD+R +D +GS GA L+ A
Sbjct: 166 AADFTNTRLAGARLDRTDLRDANFSGADLRGADLNGSDLRGAILDGA 212
>gi|218711080|ref|YP_002418700.1| Microcin immunity mcbG [Escherichia coli ED1a]
gi|218349863|emb|CAQ87265.1| Microcin immunity mcbG [Escherichia coli ED1a]
Length = 187
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 32/142 (22%), Positives = 54/142 (38%), Gaps = 10/142 (7%)
Query: 30 LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
LS + C + +F DC +C +KN + L + C
Sbjct: 18 LSGVNYYNCIFERIQLDNFKFRDCEFEKCRFVNCSIKNLK------LNFFKLIDCEFKDC 71
Query: 90 ALADLNKYEAETRGEFGIGSA----AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
L +N + F + S F L+K++ + +FR F D+R+SDF+G
Sbjct: 72 LLQGVNAADIMFPCTFSLVSCDLRFVDFIGLRLQKSIFLSSHFRDCLFEETDLRKSDFTG 131
Query: 146 SKFNGAYLEKAVAYKANFTVDE 167
S FN + +F++ E
Sbjct: 132 SAFNNTEFRHSDLSHCDFSMTE 153
>gi|319793574|ref|YP_004155214.1| pentapeptide repeat-containing protein [Variovorax paradoxus EPS]
gi|315596037|gb|ADU37103.1| pentapeptide repeat protein [Variovorax paradoxus EPS]
Length = 372
Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 35/72 (48%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
F DL + ++ +F AN + A+++ S F+ +GA LE+AV + NF + C
Sbjct: 25 DFRGLDLSGGMFIESDFTGANMSGANLKGSIFATCGLSGATLERAVLDRCNFHRVDACAA 84
Query: 172 LLVSLPMATPVF 183
++ M F
Sbjct: 85 MMAGATMHGTSF 96
>gi|298241513|ref|ZP_06965320.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
gi|297554567|gb|EFH88431.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
Length = 413
Score = 36.6 bits (83), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 98 EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
E E +G GS Q ADLRKA +F AN AD+ +++ G+ F GA LE A
Sbjct: 256 EVEAKGANFTGS--QLAGADLRKANLQGASFLGANLRGADLSQANLEGAVFVGAQLEGA 312
>gi|153871558|ref|ZP_02000700.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
gi|152071976|gb|EDN69300.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
Length = 179
Score = 36.6 bits (83), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A ADL + + N AN SAD+ E+D SG+ +GA L + +AN
Sbjct: 104 SGADLRWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLSRVDLSEANL 158
>gi|418068212|ref|ZP_12705516.1| pentapeptide repeat protein, partial [Geobacter metallireducens
RCH3]
gi|373557348|gb|EHP83782.1| pentapeptide repeat protein, partial [Geobacter metallireducens
RCH3]
Length = 153
Score = 36.6 bits (83), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 5/75 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ A F AD++K N + NFT A++ + F+G+K A + AV A+F+ ++
Sbjct: 12 NGANFTGADMKKV-----NIEKGNFTDANLTNASFTGAKLRYATFKGAVLKGADFSFADL 66
Query: 169 CLPLLVSLPMATPVF 183
L SL + F
Sbjct: 67 SYTDLSSLDLGGANF 81
>gi|434397472|ref|YP_007131476.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
gi|428268569|gb|AFZ34510.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
Length = 455
Score = 36.6 bits (83), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
ADL +A K NF+RAN T AD E++ + F GA L +A
Sbjct: 189 ADLSEANLNKANFQRANLTEADFVEANLVQTNFKGANLSRA 229
>gi|108762763|ref|YP_635370.1| pentapeptide repeat-containing protein [Myxococcus xanthus DK 1622]
gi|108466643|gb|ABF91828.1| pentapeptide repeat domain protein [Myxococcus xanthus DK 1622]
Length = 203
Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
S A A LR A+ + N RA+F AD++ +D G+ GAYL A AN
Sbjct: 87 SKANLDYALLRGAILTQVNALRASFGEADLQGADLQGADLQGAYLVSANLASANL 141
>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
Length = 346
Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 2/98 (2%)
Query: 67 NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
NW L+ A +A+ + + L+ N A+ + IG+ S DLR+A
Sbjct: 95 NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152
Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
+ +A+ T A++R++D +G+K + L A AN T
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLT 190
>gi|254456441|ref|ZP_05069870.1| Pentapeptide repeat protein [Candidatus Pelagibacter sp. HTCC7211]
gi|207083443|gb|EDZ60869.1| Pentapeptide repeat protein [Candidatus Pelagibacter sp. HTCC7211]
Length = 169
Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 32/64 (50%)
Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
FG + F A+L + V + NF + NF+ +++ DF G+ A + + +ANFT
Sbjct: 74 FGTFPESTFVRANLYETVSIGANFEKTNFSGSNLTRVDFMGATLIEANFQNSNLMEANFT 133
Query: 165 VDEI 168
I
Sbjct: 134 SSNI 137
>gi|87311950|ref|ZP_01094060.1| hypothetical protein DSM3645_13340 [Blastopirellula marina DSM
3645]
gi|87285312|gb|EAQ77236.1| hypothetical protein DSM3645_13340 [Blastopirellula marina DSM
3645]
Length = 586
Score = 36.6 bits (83), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 28/55 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
AQ DLR+ + NF+ NF AD+ SDF+G++ A L A A + +
Sbjct: 128 AQLPGCDLREVSGKQANFQDVNFARADLSRSDFTGAQLAEADLSGVTAVAAQWKL 182
>gi|417305110|ref|ZP_12092092.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
baltica WH47]
gi|327538543|gb|EGF25205.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
baltica WH47]
Length = 349
Score = 36.6 bits (83), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
A F S +LR+A NFR AN +++ ++ SD G+ F GA LE A
Sbjct: 244 ADFRSCNLRQA-----NFRDANLSNSKLQRSDLQGANFTGADLEGA 284
>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
Length = 378
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 29/56 (51%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
S A A L+ A ++ N A+ + AD+R +D SG+ A L KA +AN T
Sbjct: 43 SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLT 98
>gi|428303610|ref|YP_007113059.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
gi|428238815|gb|AFZ04603.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
Length = 490
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%)
Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
ADL +A K + R A AD+RE++ G+ GA L A + A+ T ++C
Sbjct: 182 ADLERASFKKADLRNAILEGADLREANLEGADLRGADLRGANLWGADLTGVDLC 235
>gi|440713213|ref|ZP_20893815.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
baltica SWK14]
gi|436442020|gb|ELP35204.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
baltica SWK14]
Length = 365
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
A F S +LR+A NFR AN +++ ++ SD G+ F GA LE A
Sbjct: 260 ADFRSCNLRQA-----NFRDANLSNSKLQRSDLQGANFTGADLEGA 300
>gi|428305676|ref|YP_007142501.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
9333]
gi|428247211|gb|AFZ12991.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
Length = 330
Score = 36.2 bits (82), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 24/46 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
A+ ADLR A N AN AD+R++D SG+ GA L A
Sbjct: 90 ARLQGADLRGADITLANLLDANLMEADLRDADLSGANLTGACLRGA 135
>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
Length = 1263
Score = 36.2 bits (82), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 24/50 (48%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
A F DL A+ N R ANFT A + +DFSGS GA + Y
Sbjct: 577 GADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGY 626
>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 443
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 109 SAAQFGSADLRKAVHVKEN-----FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
++ +F ADLR+A V N F AN + ++ +D SG+ +GAYL A Y A+
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378
Query: 164 T 164
+
Sbjct: 379 S 379
>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
Length = 268
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 35/83 (42%), Gaps = 30/83 (36%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF---------------------- 148
A F A+LR A+ + N +F+SAD+R++D +G+K
Sbjct: 114 ADFSGANLRGAIVTEANLIGTDFSSADLRDADLAGAKLIRSNLCFANLIAANFIAVDFSE 173
Query: 149 --------NGAYLEKAVAYKANF 163
GAYL KA YKAN
Sbjct: 174 ANLYQAEVMGAYLYKANFYKANL 196
>gi|166362955|ref|YP_001655228.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
gi|166085328|dbj|BAG00036.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
Length = 186
Score = 36.2 bits (82), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 21/82 (25%), Positives = 38/82 (46%), Gaps = 5/82 (6%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 163
+ F +L+ A N + +NF+SAD+R + F+G+ F+GA L +AY + F
Sbjct: 61 TGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTF 120
Query: 164 TVDEICLPLLVSLPMATPVFPA 185
++ + M +F
Sbjct: 121 KNSDLSDAIFAEAIMLRTIFEG 142
>gi|422301609|ref|ZP_16388976.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
gi|389789327|emb|CCI14609.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9806]
Length = 169
Score = 36.2 bits (82), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
F +L+ A N + +NF+SAD+R + F+G+ F+GA L +AY + F +
Sbjct: 48 FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107
Query: 168 ICLPLLVSLPMATPVFPA 185
+ + M +F
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125
>gi|254501374|ref|ZP_05113525.1| Pentapeptide repeat protein [Labrenzia alexandrii DFL-11]
gi|222437445|gb|EEE44124.1| Pentapeptide repeat protein [Labrenzia alexandrii DFL-11]
Length = 296
Score = 36.2 bits (82), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 26/47 (55%)
Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
DL++AV + NF R++F + +DFS S F GA + K+N
Sbjct: 92 DLKEAVMPRSNFERSDFRRTEAERADFSASDFAGASMRAVDLEKSNL 138
>gi|209525619|ref|ZP_03274157.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|423065193|ref|ZP_17053983.1| hypothetical protein SPLC1_S230580 [Arthrospira platensis C1]
gi|209493952|gb|EDZ94269.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
gi|406713325|gb|EKD08496.1| hypothetical protein SPLC1_S230580 [Arthrospira platensis C1]
Length = 333
Score = 36.2 bits (82), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 7/87 (8%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
F+ L A + C + L++ N A RG A A+LR A N
Sbjct: 242 FIKANLMKADLQECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 294
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
AN +AD+R++ F + NGA L A+
Sbjct: 295 ANLENADLRDASFRDATLNGAILNGAI 321
>gi|443666115|ref|ZP_21133744.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
gi|159030126|emb|CAO91018.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331286|gb|ELS45952.1| pentapeptide repeats family protein [Microcystis aeruginosa
DIANCHI905]
Length = 169
Score = 36.2 bits (82), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
F +L+ A N + +NF+SAD+R + F+G+ F+GA L +AY + F +
Sbjct: 48 FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107
Query: 168 ICLPLLVSLPMATPVFPA 185
+ + M +F
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125
>gi|68171987|ref|ZP_00545289.1| Pentapeptide repeat [Ehrlichia chaffeensis str. Sapulpa]
gi|67998589|gb|EAM85340.1| Pentapeptide repeat [Ehrlichia chaffeensis str. Sapulpa]
Length = 435
Score = 36.2 bits (82), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 102 RGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
+ EFG S A F DLR +V N ANFT A++ S F S GA A K
Sbjct: 83 KKEFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSK 142
Query: 161 AN 162
+N
Sbjct: 143 SN 144
>gi|390438199|ref|ZP_10226689.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|425441109|ref|ZP_18821396.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|425454770|ref|ZP_18834496.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|425466166|ref|ZP_18845469.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|425468563|ref|ZP_18847571.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389718271|emb|CCH97753.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389804467|emb|CCI16499.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|389831470|emb|CCI25816.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389838386|emb|CCI30813.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|389884775|emb|CCI34954.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 169
Score = 36.2 bits (82), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
F +L+ A N + +NF+SAD+R + F+G+ F+GA L +AY + F +
Sbjct: 48 FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107
Query: 168 ICLPLLVSLPMATPVFPA 185
+ + M +F
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125
>gi|427739890|ref|YP_007059434.1| putative low-complexity protein [Rivularia sp. PCC 7116]
gi|427374931|gb|AFY58887.1| putative low-complexity protein [Rivularia sp. PCC 7116]
Length = 447
Score = 35.8 bits (81), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 6/79 (7%)
Query: 92 ADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSG 145
A L+K G G A A+LR+A ++A NF A +RE+D SG
Sbjct: 327 AKLDKARMHETGLIGANLQQANLNGANLRQANLNAARLQQAEVFFANFAEASLREADLSG 386
Query: 146 SKFNGAYLEKAVAYKANFT 164
+ G +KAV Y+ N +
Sbjct: 387 ANLMGTDFQKAVLYETNLS 405
>gi|170076886|ref|YP_001733524.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
gi|169884555|gb|ACA98268.1| pentapeptide repeats protein [Synechococcus sp. PCC 7002]
Length = 324
Score = 35.8 bits (81), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 30/65 (46%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
A+ S D R A + +FR AN A+ +D G+KF GA L + A F D
Sbjct: 260 AELVSTDFRHAQLQRADFRGANLWGANFARADLRGAKFQGAKLNQTNFQGAVFEFDPRTQ 319
Query: 171 PLLVS 175
LL S
Sbjct: 320 TLLAS 324
>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
29413]
gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
Length = 576
Score = 35.8 bits (81), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 28/51 (54%)
Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
GI S A ADL AV + +F AN SA++ S+ SG+ NGA L A
Sbjct: 475 GILSEADLTGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLSSA 525
>gi|284008627|emb|CBA75237.1| conserved hypothetical protein [Arsenophonus nasoniae]
Length = 823
Score = 35.8 bits (81), Expect = 8.8, Method: Composition-based stats.
Identities = 19/58 (32%), Positives = 29/58 (50%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
A ADL KA K F +AN T D ES+ KF+ ++++ + N VD++
Sbjct: 748 ADLTDADLTKANCQKAKFSKANLTRTDFTESNLQDVKFSKHHIKREFLQEINVAVDKL 805
>gi|376004329|ref|ZP_09782046.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375327291|emb|CCE17799.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 340
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 7/87 (8%)
Query: 71 FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
F+ L A + C + L++ N A RG A A+LR A N
Sbjct: 249 FIKANLMKADLQECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 301
Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
AN +AD+R++ F + NGA L A+
Sbjct: 302 ANLENADLRDASFRDATLNGAILNGAI 328
>gi|427702733|ref|YP_007045955.1| low-complexity protein [Cyanobium gracile PCC 6307]
gi|427345901|gb|AFY28614.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
Length = 247
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 5/49 (10%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
+AA F ADLR A NF A+ T AD+R + G++F+GA L + +
Sbjct: 187 TAADFRGADLRGA-----NFSGADLTQADLRGALLDGARFHGAVLSRTL 230
>gi|409437276|ref|ZP_11264395.1| putative pentapeptide repeat protein [Rhizobium mesoamericanum
STM3625]
gi|408751000|emb|CCM75551.1| putative pentapeptide repeat protein [Rhizobium mesoamericanum
STM3625]
Length = 234
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 109 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
+ AQ G+A+ K + F A+F A+++ ++F+G+K GA EKA +ANF
Sbjct: 92 TGAQAGNANFSKIEAYRSGFESVFAEGASFAGAELQRANFNGAKLTGANFEKAELGRANF 151
Query: 164 T 164
+
Sbjct: 152 S 152
>gi|425445790|ref|ZP_18825810.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389734131|emb|CCI02174.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 169
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
F +L+ A N + +NF+SAD+R + F+G+ F+GA L +AY + F +
Sbjct: 48 FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107
Query: 168 ICLPLLVSLPMATPVFPA 185
+ + M +F
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125
>gi|334145352|ref|YP_004538562.1| pentapeptide repeat-containing protein [Novosphingobium sp. PP1Y]
gi|333937236|emb|CCA90595.1| pentapeptide repeat-containing protein [Novosphingobium sp. PP1Y]
Length = 228
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 53/131 (40%), Gaps = 26/131 (19%)
Query: 30 LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWR----VFVSTALAAAVVASCS 85
L +VAC ++ T +C A+L R F T L A++A S
Sbjct: 85 LGDARFVACDFNNATFKRANLQSARFERCKLTGAELSELRGIDIAFEETLLVNAILAGHS 144
Query: 86 SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
A+L + + F ADLRK +FR+A+FT +RE+ G
Sbjct: 145 FR---RANLKRTD--------------FSQADLRKC-----DFRQAHFTECSLREASMEG 182
Query: 146 SKFNGAYLEKA 156
++F GA L A
Sbjct: 183 ARFEGADLRGA 193
>gi|225631183|ref|ZP_03787884.1| pentapeptide repeat domain protein [Wolbachia endosymbiont of
Muscidifurax uniraptor]
gi|225591121|gb|EEH12302.1| pentapeptide repeat domain protein [Wolbachia endosymbiont of
Muscidifurax uniraptor]
Length = 601
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
+ AN A+++ESDF+GS + AYL ++ +NF DE L
Sbjct: 231 KNANIYGAELKESDFTGSNLSAAYLNSSIIINSNF--DETNLS 271
>gi|227498348|ref|ZP_03928498.1| pentapeptide repeat protein [Acidaminococcus sp. D21]
gi|352685375|ref|YP_004897360.1| pentapeptide repeat-containing protein [Acidaminococcus intestini
RyC-MR95]
gi|226903810|gb|EEH89728.1| pentapeptide repeat protein [Acidaminococcus sp. D21]
gi|350280030|gb|AEQ23220.1| pentapeptide repeat protein [Acidaminococcus intestini RyC-MR95]
Length = 250
Score = 35.8 bits (81), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 8/73 (10%)
Query: 95 NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
N Y A+ R ++ G+ A F SA+L ++ + NF ANFTSA++ + ++F A
Sbjct: 138 NLYTADLRESNFDYASGAMANFYSANLARSWFFRSNFMSANFTSANLYD-----ARFRRA 192
Query: 152 YLEKAVAYKANFT 164
L +A+ AN T
Sbjct: 193 NLSEALLRSANLT 205
>gi|17232102|ref|NP_488650.1| hypothetical protein alr4610 [Nostoc sp. PCC 7120]
gi|17133747|dbj|BAB76309.1| alr4610 [Nostoc sp. PCC 7120]
Length = 164
Score = 35.8 bits (81), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 24/132 (18%)
Query: 65 LKNWRVFVSTALAAAV-------VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
+K+WRV VS LA + A+ SS+I+ A + G+ IGS +F + D
Sbjct: 1 MKDWRVVVSFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58
Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTVDEICLPL 172
L EN ANF++AD+R F+G+ G L + +AY ANF ++ +
Sbjct: 59 L-------EN---ANFSNADLRGGVFNGTVLEGVNLHGVDFSEGIAYLANFKNADLSDAI 108
Query: 173 LVSLPMATPVFP 184
L + M +F
Sbjct: 109 LTNAMMLRSIFD 120
>gi|88658408|ref|YP_507868.1| pentapeptide repeat-containing protein [Ehrlichia chaffeensis str.
Arkansas]
gi|88599865|gb|ABD45334.1| pentapeptide repeat protein [Ehrlichia chaffeensis str. Arkansas]
Length = 607
Score = 35.8 bits (81), Expect = 9.5, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 102 RGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
+ EFG S A F DLR +V N ANFT A++ S F S GA A K
Sbjct: 64 KKEFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSK 123
Query: 161 AN 162
+N
Sbjct: 124 SN 125
>gi|390442100|ref|ZP_10230118.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
sp. T1-4]
gi|389834544|emb|CCI34244.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
sp. T1-4]
Length = 220
Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/61 (27%), Positives = 34/61 (55%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
+ SA+L++A+ + +FR + D+ +++F G+ N A L ++ Y+ANF +
Sbjct: 108 NGVNLNSANLQQALLIDADFRSTSDQRTDLGKTNFRGADLNYANLSGSLLYRANFADCRL 167
Query: 169 C 169
C
Sbjct: 168 C 168
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.130 0.394
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,018,727,694
Number of Sequences: 23463169
Number of extensions: 109804791
Number of successful extensions: 264004
Number of sequences better than 100.0: 437
Number of HSP's better than 100.0 without gapping: 254
Number of HSP's successfully gapped in prelim test: 183
Number of HSP's that attempted gapping in prelim test: 260489
Number of HSP's gapped (non-prelim): 3224
length of query: 198
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 63
effective length of database: 9,191,667,552
effective search space: 579075055776
effective search space used: 579075055776
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)