BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038044
(203 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224101247|ref|XP_002312200.1| predicted protein [Populus trichocarpa]
gi|222852020|gb|EEE89567.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 288 bits (736), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/205 (73%), Positives = 165/205 (80%), Gaps = 2/205 (0%)
Query: 1 MGALPLHSSLLILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLM 60
+GA L S L ++ +V E + I +A A Q +EPS SGQ SKIG K F F +
Sbjct: 9 VGARMLQSGLSVMVNEVSERTCITIATCAMQNRNSEPSFSGQASKIGIKFVPFQKTRFQL 68
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGPD-LGGTRLGRIVSAGGRQLLEKLNIARKNL 119
RR GW+IAFALDTGG+ G+GGQE++NGD P LGGTRLGRIVSAGGRQLLEKLN ARKN
Sbjct: 69 RRQGWRIAFALDTGGLPGNGGQENVNGDSPPGLGGTRLGRIVSAGGRQLLEKLNSARKNF 128
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGRLQS 178
PMKIFLLLLGFYTANALATILGQTGDWDVLVAG+VVAAIEGIGMLMYKKP S ST RLQ
Sbjct: 129 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSLSTRRLQY 188
Query: 179 FVVMMNYWKAGVCLGLFVDAFKLGS 203
FV MMNYWKAGVCLGLFVDAFKLGS
Sbjct: 189 FVAMMNYWKAGVCLGLFVDAFKLGS 213
>gi|359495755|ref|XP_002267846.2| PREDICTED: ycf20-like protein [Vitis vinifera]
gi|298205056|emb|CBI38352.3| unnamed protein product [Vitis vinifera]
Length = 225
Score = 279 bits (714), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 155/205 (75%), Positives = 167/205 (81%), Gaps = 2/205 (0%)
Query: 1 MGALPLHSSLLILERDVFESSYIRLAVSAN-QKWKNEPSTSGQTSKIGSKSACFLGKSFL 59
MG + L S+L + E + + SYI Q W NEPS SGQ SK+G KSA F KSF
Sbjct: 21 MGGIMLQSTLSVPESESSDCSYITFTTCCTIQNWNNEPSFSGQISKLGMKSAPFPRKSFQ 80
Query: 60 MRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNL 119
RRHGWK AFALDTGGI +GGQE+LNG+GPDLG TRLGRIVSAGGRQLL KLN ARKN
Sbjct: 81 TRRHGWKTAFALDTGGIPDNGGQENLNGNGPDLGQTRLGRIVSAGGRQLLAKLNSARKNF 140
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGRLQS 178
PMK+FLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGML+YKK S STGRLQS
Sbjct: 141 PMKVFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLVYKKSSSQSTGRLQS 200
Query: 179 FVVMMNYWKAGVCLGLFVDAFKLGS 203
VVM+NYWKAGVCLGLFVDAFKLGS
Sbjct: 201 LVVMINYWKAGVCLGLFVDAFKLGS 225
>gi|255547482|ref|XP_002514798.1| conserved hypothetical protein [Ricinus communis]
gi|223545849|gb|EEF47352.1| conserved hypothetical protein [Ricinus communis]
Length = 201
Score = 270 bits (691), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/199 (75%), Positives = 164/199 (82%), Gaps = 1/199 (0%)
Query: 6 LHSSLLILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLMRRHGW 65
L SS+ ++E +V E S I L + A Q W EP+ S Q KIG KSA FL K F + W
Sbjct: 3 LQSSVSVMEGEVSERSCITLTICAMQNWSVEPTFSRQIPKIGLKSASFLRKRFKLSGQDW 62
Query: 66 KIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFL 125
+I FALDTGGISG+GG+E+L+GD LG TRLGRIVSAGGRQLL+KLN ARKN PMKIFL
Sbjct: 63 RIPFALDTGGISGNGGEENLDGDSSGLGSTRLGRIVSAGGRQLLQKLNSARKNFPMKIFL 122
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGRLQSFVVMMN 184
LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP S STGRLQSFVVMMN
Sbjct: 123 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPTSPSTGRLQSFVVMMN 182
Query: 185 YWKAGVCLGLFVDAFKLGS 203
YWKAGVCLGLFVDAFKLGS
Sbjct: 183 YWKAGVCLGLFVDAFKLGS 201
>gi|351725487|ref|NP_001237606.1| uncharacterized protein LOC100499897 [Glycine max]
gi|255627495|gb|ACU14092.1| unknown [Glycine max]
Length = 208
Score = 244 bits (624), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/204 (66%), Positives = 161/204 (78%), Gaps = 1/204 (0%)
Query: 1 MGALPLHSSLLILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLM 60
MG + L+ L I E + E S + LA S EP+ +G T K KS+ F FL
Sbjct: 5 MGTMTLNWRLSITENESSEKSCVNLATSRIHSCCREPNFNGLTLKACMKSSPFAQSCFLG 64
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLP 120
RR GWKIAFAL+TGG+SG+G Q+S N +LGGTRLGRI++AGGRQLLEKLN ARKN+P
Sbjct: 65 RRAGWKIAFALNTGGVSGNGDQQSFNEASSNLGGTRLGRILTAGGRQLLEKLNSARKNIP 124
Query: 121 MKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKK-PPSSTGRLQSF 179
+K+FLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGML+Y+K P + TGRLQSF
Sbjct: 125 LKVFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYRKLPTARTGRLQSF 184
Query: 180 VVMMNYWKAGVCLGLFVDAFKLGS 203
+V+++YWKAG+CLGLFVDAFKLGS
Sbjct: 185 LVLVDYWKAGICLGLFVDAFKLGS 208
>gi|388511147|gb|AFK43635.1| unknown [Medicago truncatula]
Length = 201
Score = 240 bits (613), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/204 (66%), Positives = 157/204 (76%), Gaps = 8/204 (3%)
Query: 1 MGALPLHSSLLILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLM 60
MG++ L S L I E + + S+I +P S T +G K FL FL
Sbjct: 5 MGSMNLPSKLSITENEGSKRSFIHSCCC-------QPRFSALTPNVGMKPLPFLQSRFLR 57
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLP 120
RR GW +AFAL+T G+SG+G Q+SLN G LGGTRLGRI+SAGGRQLL+KLN ARKN P
Sbjct: 58 RRPGWTVAFALNTDGLSGNGEQQSLNDSGSSLGGTRLGRILSAGGRQLLDKLNSARKNFP 117
Query: 121 MKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSS-TGRLQSF 179
K+FLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGML+YKKPP++ TGRLQSF
Sbjct: 118 TKVFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYKKPPTTRTGRLQSF 177
Query: 180 VVMMNYWKAGVCLGLFVDAFKLGS 203
+VM+NYWKAG+CLGLFVDAFKLGS
Sbjct: 178 LVMVNYWKAGICLGLFVDAFKLGS 201
>gi|359807488|ref|NP_001241142.1| uncharacterized protein LOC100814758 [Glycine max]
gi|255639749|gb|ACU20168.1| unknown [Glycine max]
Length = 207
Score = 237 bits (604), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 137/204 (67%), Positives = 160/204 (78%), Gaps = 2/204 (0%)
Query: 1 MGALPLHSSLLILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLM 60
MG+ L L I + E+S I LA S E + + T G+KS+ F FL
Sbjct: 5 MGSTILKWRLSITDYKSSENSCIALATSRIHCCCCETNFNRLTLN-GTKSSPFSQSCFLG 63
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLP 120
RR GWKIAFAL+TGG+SG+G Q+S N +LGGTRLGRI+SAGGRQLLEKLN ARKN+P
Sbjct: 64 RRAGWKIAFALNTGGVSGNGDQQSFNEASSNLGGTRLGRILSAGGRQLLEKLNAARKNIP 123
Query: 121 MKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSS-TGRLQSF 179
+K+FLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGML+Y+KPP+ TGRLQSF
Sbjct: 124 LKVFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYRKPPTEKTGRLQSF 183
Query: 180 VVMMNYWKAGVCLGLFVDAFKLGS 203
+V++NYWKAG+CLGLFVDAFKLGS
Sbjct: 184 LVLVNYWKAGICLGLFVDAFKLGS 207
>gi|21593789|gb|AAM65756.1| antigen receptor, putative [Arabidopsis thaliana]
Length = 197
Score = 224 bits (572), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 121/166 (72%), Positives = 133/166 (80%), Gaps = 4/166 (2%)
Query: 42 QTSKIGSKSACFLGKSFLMRRHGWKIAFALDTGG-ISGSGGQ-ESLNGDGPDLGGTRLGR 99
+ S I S + KSF +RR KIAFALDTG I G G+ + +NGD LG TRLGR
Sbjct: 32 KVSSITLTSFPYPMKSFQIRRPNRKIAFALDTGSSIPGDSGEGQEMNGDRTGLGSTRLGR 91
Query: 100 IVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIE 159
I AGG+QLL K+N ARKN PMKIFLLLLGFYTANALATILGQTGDWDVLVAG+VVAAIE
Sbjct: 92 IAIAGGKQLLAKINSARKNFPMKIFLLLLGFYTANALATILGQTGDWDVLVAGIVVAAIE 151
Query: 160 GIGMLMYKKPPSS--TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
GIGMLMYKKP SS +G+LQSFVV MN+WKAGVCLGLFVDAFKLGS
Sbjct: 152 GIGMLMYKKPSSSMFSGKLQSFVVFMNFWKAGVCLGLFVDAFKLGS 197
>gi|357476287|ref|XP_003608429.1| Ycf20-like protein [Medicago truncatula]
gi|355509484|gb|AES90626.1| Ycf20-like protein [Medicago truncatula]
Length = 209
Score = 224 bits (572), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 120/148 (81%), Positives = 135/148 (91%), Gaps = 1/148 (0%)
Query: 57 SFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIAR 116
+FL RR GW +AFAL+T G+SG+G Q+SLN G LGGTRLGRI+SAGGRQLL+KLN AR
Sbjct: 62 TFLRRRPGWTVAFALNTDGLSGNGEQQSLNDSGSSLGGTRLGRILSAGGRQLLDKLNSAR 121
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSS-TGR 175
KN P K+FLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGML+YKKPP++ TGR
Sbjct: 122 KNFPTKVFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYKKPPTTRTGR 181
Query: 176 LQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
LQSF+VM+NYWKAG+CLGLFVDAFKLGS
Sbjct: 182 LQSFLVMVNYWKAGICLGLFVDAFKLGS 209
>gi|297841047|ref|XP_002888405.1| hypothetical protein ARALYDRAFT_894093 [Arabidopsis lyrata subsp.
lyrata]
gi|297334246|gb|EFH64664.1| hypothetical protein ARALYDRAFT_894093 [Arabidopsis lyrata subsp.
lyrata]
Length = 198
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/168 (73%), Positives = 134/168 (79%), Gaps = 7/168 (4%)
Query: 42 QTSKIGSKSACFLGKSFLMRRHGWKIAFALDTG----GISGSGGQESLNGDGPDLGGTRL 97
+ S I S + KSF +RR KIAFALDTG G +G GGQE +NGD LG TRL
Sbjct: 32 RASSITLTSFPYPMKSFQLRRRNRKIAFALDTGSSIPGDNGGGGQE-MNGDRTGLGSTRL 90
Query: 98 GRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAA 157
GRI AGG+ LL K+N ARKN PMKIFLLLLGFYTANALATILGQTGDWDVLVAG+VVAA
Sbjct: 91 GRIAVAGGKLLLGKINSARKNFPMKIFLLLLGFYTANALATILGQTGDWDVLVAGIVVAA 150
Query: 158 IEGIGMLMYKKPPSS--TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
IEGIGMLMYKKP SS +G+LQSFVV MN+WKAGVCLGLFVDAFKLGS
Sbjct: 151 IEGIGMLMYKKPSSSMFSGKLQSFVVFMNFWKAGVCLGLFVDAFKLGS 198
>gi|18408276|ref|NP_564851.1| Ycf20-like protein [Arabidopsis thaliana]
gi|209572697|sp|O80813.2|YC20L_ARATH RecName: Full=Ycf20-like protein
gi|107738233|gb|ABF83666.1| At1g65420 [Arabidopsis thaliana]
gi|332196250|gb|AEE34371.1| Ycf20-like protein [Arabidopsis thaliana]
Length = 197
Score = 223 bits (569), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 121/166 (72%), Positives = 133/166 (80%), Gaps = 4/166 (2%)
Query: 42 QTSKIGSKSACFLGKSFLMRRHGWKIAFALDTGG-ISGSGGQ-ESLNGDGPDLGGTRLGR 99
+ S I S + KSF +RR KIAFALDTG I G G+ + +NGD LG TRLGR
Sbjct: 32 KASSITLTSFPYPMKSFQIRRPNRKIAFALDTGSSIPGDSGEGQEMNGDRTGLGSTRLGR 91
Query: 100 IVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIE 159
I AGG+QLL K+N ARKN PMKIFLLLLGFYTANALATILGQTGDWDVLVAG+VVAAIE
Sbjct: 92 IAIAGGKQLLGKINSARKNFPMKIFLLLLGFYTANALATILGQTGDWDVLVAGIVVAAIE 151
Query: 160 GIGMLMYKKPPSS--TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
GIGMLMYKKP SS +G+LQSFVV MN+WKAGVCLGLFVDAFKLGS
Sbjct: 152 GIGMLMYKKPSSSMFSGKLQSFVVFMNFWKAGVCLGLFVDAFKLGS 197
>gi|3335348|gb|AAC27150.1|AAC27150 T8F5.20 [Arabidopsis thaliana]
Length = 121
Score = 199 bits (506), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 101/121 (83%), Positives = 108/121 (89%), Gaps = 2/121 (1%)
Query: 85 LNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTG 144
+NGD LG TRLGRI AGG+QLL K+N ARKN PMKIFLLLLGFYTANALATILGQTG
Sbjct: 1 MNGDRTGLGSTRLGRIAIAGGKQLLGKINSARKNFPMKIFLLLLGFYTANALATILGQTG 60
Query: 145 DWDVLVAGVVVAAIEGIGMLMYKKPPSS--TGRLQSFVVMMNYWKAGVCLGLFVDAFKLG 202
DWDVLVAG+VVAAIEGIGMLMYKKP SS +G+LQSFVV MN+WKAGVCLGLFVDAFKLG
Sbjct: 61 DWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFMNFWKAGVCLGLFVDAFKLG 120
Query: 203 S 203
S
Sbjct: 121 S 121
>gi|449442657|ref|XP_004139097.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 212
Score = 195 bits (495), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 130/205 (63%), Positives = 150/205 (73%), Gaps = 2/205 (0%)
Query: 1 MGALPLHSSLLILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLM 60
MGA+ LHS LL + S + W E +G++S KSA F KS
Sbjct: 8 MGAVILHSRLLTTANYSPQVSCLTNMPCLIHYWGYEAIFNGRSSSFLLKSAPFPRKSLQT 67
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGP-DLGGTRLGRIVSAGGRQLLEKLNIARKNL 119
RH K+AFALDTGG+S SGG ++++ +LG TRLGR+VS+G RQLLEKLN ARKN
Sbjct: 68 SRHVNKLAFALDTGGVSSSGGGDNIDDGDESNLGRTRLGRLVSSGARQLLEKLNSARKNF 127
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGRLQS 178
P KIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIE IG+LMY+KPP T RL+S
Sbjct: 128 PTKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEWIGILMYRKPPPLLTRRLKS 187
Query: 179 FVVMMNYWKAGVCLGLFVDAFKLGS 203
F+VM+NYWKAGVCLGLFVDAFKLGS
Sbjct: 188 FIVMVNYWKAGVCLGLFVDAFKLGS 212
>gi|70671583|gb|AAZ06217.1| putative antigen receptor [Oryza sativa Indica Group]
gi|70671616|gb|AAZ06247.1| putative antigen receptor [Oryza sativa Indica Group]
gi|125563084|gb|EAZ08464.1| hypothetical protein OsI_30729 [Oryza sativa Indica Group]
Length = 197
Score = 192 bits (488), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 99/151 (65%), Positives = 113/151 (74%), Gaps = 2/151 (1%)
Query: 54 LGKSFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLN 113
LG S M+R WK FAL+TGG S + Q+ DG LG TRLGR++ A R+LLEKLN
Sbjct: 48 LGLSCQMKRTRWKPVFALETGGPSNADNQD-FEDDGGFLGRTRLGRLIQAAARELLEKLN 106
Query: 114 IARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-S 172
AR P KIFL+LLGFYTANALAT+LGQTGDWDV VA +VVA IEGIGMLMY+KP S
Sbjct: 107 SARNKSPTKIFLVLLGFYTANALATVLGQTGDWDVFVAAIVVATIEGIGMLMYRKPASRP 166
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
GR S + M+NYWKAGVCLG FVDAFK+GS
Sbjct: 167 PGRFWSMITMVNYWKAGVCLGFFVDAFKVGS 197
>gi|115478380|ref|NP_001062785.1| Os09g0287500 [Oryza sativa Japonica Group]
gi|50251186|dbj|BAD29674.1| antigen receptor-like protein [Oryza sativa Japonica Group]
gi|50252920|dbj|BAD29174.1| antigen receptor-like protein [Oryza sativa Japonica Group]
gi|113631018|dbj|BAF24699.1| Os09g0287500 [Oryza sativa Japonica Group]
gi|215765531|dbj|BAG87228.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641230|gb|EEE69362.1| hypothetical protein OsJ_28696 [Oryza sativa Japonica Group]
Length = 196
Score = 192 bits (488), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 99/151 (65%), Positives = 113/151 (74%), Gaps = 2/151 (1%)
Query: 54 LGKSFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLN 113
LG S M+R WK FAL+TGG S + Q+ DG LG TRLGR++ A R+LLEKLN
Sbjct: 47 LGLSCQMKRTRWKPVFALETGGPSNADNQD-FEDDGGFLGRTRLGRLIQAAARELLEKLN 105
Query: 114 IARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-S 172
AR P KIFL+LLGFYTANALAT+LGQTGDWDV VA +VVA IEGIGMLMY+KP S
Sbjct: 106 SARNKSPTKIFLVLLGFYTANALATVLGQTGDWDVFVAAIVVATIEGIGMLMYRKPASRP 165
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
GR S + M+NYWKAGVCLG FVDAFK+GS
Sbjct: 166 PGRFWSMITMVNYWKAGVCLGFFVDAFKVGS 196
>gi|50251788|dbj|BAD27720.1| antigen receptor-like [Oryza sativa Japonica Group]
Length = 217
Score = 183 bits (465), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 105/148 (70%), Positives = 117/148 (79%), Gaps = 2/148 (1%)
Query: 57 SFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIAR 116
S+ M+ WK FAL+TGG S + Q+ DG LG TRLGR++ A GR+LLEKLN AR
Sbjct: 71 SYQMKNSRWKPVFALETGGPSNADSQD-FEDDGGFLGRTRLGRLIQAAGRELLEKLNSAR 129
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGR 175
N P KIFL+L GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP S GR
Sbjct: 130 SNSPTKIFLVLFGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGR 189
Query: 176 LQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
QS + M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 190 FQSLIAMVNYWKAGVCLGLFVDAFKLGS 217
>gi|222622484|gb|EEE56616.1| hypothetical protein OsJ_05998 [Oryza sativa Japonica Group]
Length = 191
Score = 182 bits (462), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 105/148 (70%), Positives = 117/148 (79%), Gaps = 2/148 (1%)
Query: 57 SFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIAR 116
S+ M+ WK FAL+TGG S + Q+ DG LG TRLGR++ A GR+LLEKLN AR
Sbjct: 45 SYQMKNSRWKPVFALETGGPSNADSQD-FEDDGGFLGRTRLGRLIQAAGRELLEKLNSAR 103
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGR 175
N P KIFL+L GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP S GR
Sbjct: 104 SNSPTKIFLVLFGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGR 163
Query: 176 LQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
QS + M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 164 FQSLIAMVNYWKAGVCLGLFVDAFKLGS 191
>gi|218190363|gb|EEC72790.1| hypothetical protein OsI_06472 [Oryza sativa Indica Group]
Length = 191
Score = 182 bits (462), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 105/148 (70%), Positives = 117/148 (79%), Gaps = 2/148 (1%)
Query: 57 SFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIAR 116
S+ M+ WK FAL+TGG S + Q+ DG LG TRLGR++ A GR+LLEKLN AR
Sbjct: 45 SYQMKNSRWKPVFALETGGPSNADSQD-FEDDGGFLGRTRLGRLIQAAGRELLEKLNSAR 103
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGR 175
N P KIFL+L GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP S GR
Sbjct: 104 SNSPTKIFLVLFGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGR 163
Query: 176 LQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
QS + M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 164 FQSLIAMVNYWKAGVCLGLFVDAFKLGS 191
>gi|357157876|ref|XP_003577943.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
Length = 144
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/145 (71%), Positives = 115/145 (79%), Gaps = 2/145 (1%)
Query: 60 MRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNL 119
M+ W+ FAL+TGG + GQ+ + D LG TRLGR+V A RQLLEKLN AR N
Sbjct: 1 MKNSRWRPVFALETGGSPNTDGQD-FDEDSGFLGRTRLGRLVQAAARQLLEKLNSARSNS 59
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGRLQS 178
P KIFL+LLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP S GR QS
Sbjct: 60 PTKIFLVLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPVSRPPGRFQS 119
Query: 179 FVVMMNYWKAGVCLGLFVDAFKLGS 203
+ M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 120 LIAMVNYWKAGVCLGLFVDAFKLGS 144
>gi|212275874|ref|NP_001131010.1| uncharacterized protein LOC100192115 [Zea mays]
gi|194690700|gb|ACF79434.1| unknown [Zea mays]
gi|194695206|gb|ACF81687.1| unknown [Zea mays]
gi|195654383|gb|ACG46659.1| hypothetical protein [Zea mays]
Length = 232
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/149 (67%), Positives = 116/149 (77%), Gaps = 2/149 (1%)
Query: 56 KSFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIA 115
K + ++ W+ FAL+T G S + GQ+ + D LG TRLGR++ A R+LL+KLN A
Sbjct: 85 KQYQVKNCRWRPTFALETDGPSNTDGQD-FDEDSGFLGRTRLGRLIQAAARELLDKLNSA 143
Query: 116 RKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP-PSSTG 174
R N P KIFL+LLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP G
Sbjct: 144 RTNSPTKIFLVLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPIARPPG 203
Query: 175 RLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R QS + M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 204 RFQSLISMVNYWKAGVCLGLFVDAFKLGS 232
>gi|449526593|ref|XP_004170298.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 169
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/144 (77%), Positives = 125/144 (86%), Gaps = 2/144 (1%)
Query: 62 RHGWKIAFALDTGGISGSGGQESLNGDGP-DLGGTRLGRIVSAGGRQLLEKLNIARKNLP 120
RH K+AFALDTGG+S SGG ++++ +LG TRLGR+VS+G RQLLEKLN ARKN P
Sbjct: 26 RHANKLAFALDTGGVSSSGGGDNIDDGDESNLGRTRLGRLVSSGARQLLEKLNSARKNFP 85
Query: 121 MKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGRLQSF 179
KIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIE IG+LMY+KPP T RL+SF
Sbjct: 86 TKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEWIGILMYRKPPPLLTRRLKSF 145
Query: 180 VVMMNYWKAGVCLGLFVDAFKLGS 203
+VM+NYWKAGVCLGLFVDAFKLGS
Sbjct: 146 IVMVNYWKAGVCLGLFVDAFKLGS 169
>gi|242048754|ref|XP_002462123.1| hypothetical protein SORBIDRAFT_02g019530 [Sorghum bicolor]
gi|241925500|gb|EER98644.1| hypothetical protein SORBIDRAFT_02g019530 [Sorghum bicolor]
Length = 201
Score = 176 bits (447), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 104/165 (63%), Positives = 122/165 (73%), Gaps = 11/165 (6%)
Query: 49 KSACFLGKS---------FLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGR 99
+ ACFL + + M+ W+ FAL+TGG S + GQ+ + D LG TRLG
Sbjct: 38 RRACFLRPTKEKSGGALGYQMKNCRWRPTFALETGGPSNTDGQD-FDEDSGFLGRTRLGS 96
Query: 100 IVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIE 159
++ A R+LL+KLN AR N P KIFL+LLGFYTANALATILGQTGDWDVLVAGVVVAAIE
Sbjct: 97 LIQAAARELLDKLNSARTNSPTKIFLVLLGFYTANALATILGQTGDWDVLVAGVVVAAIE 156
Query: 160 GIGMLMYKKPPS-STGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
GIGMLMY+KP + GR QS + M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 157 GIGMLMYRKPITRPPGRFQSLISMVNYWKAGVCLGLFVDAFKLGS 201
>gi|195615956|gb|ACG29808.1| hypothetical protein [Zea mays]
Length = 231
Score = 172 bits (435), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 99/149 (66%), Positives = 115/149 (77%), Gaps = 3/149 (2%)
Query: 56 KSFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIA 115
K + ++ W+ FAL+T G S + GQ+ + D LG TRLGR++ A R+LL+KLN A
Sbjct: 85 KQYQVKNCRWRPTFALETDGPSNTDGQD-FDEDSGFLGRTRLGRLIQAAARELLDKLNSA 143
Query: 116 RKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP-PSSTG 174
R N P KIFL+LL FYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP G
Sbjct: 144 RTNSPTKIFLVLL-FYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPIARPPG 202
Query: 175 RLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R QS + M+NYWKAGVCLGLFVDAFKLGS
Sbjct: 203 RFQSLISMVNYWKAGVCLGLFVDAFKLGS 231
>gi|297721015|ref|NP_001172870.1| Os02g0233300 [Oryza sativa Japonica Group]
gi|255670745|dbj|BAH91599.1| Os02g0233300 [Oryza sativa Japonica Group]
Length = 340
Score = 156 bits (394), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 94/143 (65%), Positives = 106/143 (74%), Gaps = 7/143 (4%)
Query: 57 SFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIAR 116
S+ M+ WK FAL+TGG S + Q+ DG LG TRLGR++ A GR+LLEKLN AR
Sbjct: 93 SYQMKNSRWKPVFALETGGPSNADSQD-FEDDGGFLGRTRLGRLIQAAGRELLEKLNSAR 151
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-STGR 175
N P KIFL+L GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY+KP S GR
Sbjct: 152 SNSPTKIFLVLFGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGR 211
Query: 176 LQSFVVMMNYWKAG-----VCLG 193
QS + M+NYWKAG CLG
Sbjct: 212 FQSLIAMVNYWKAGEAKVRECLG 234
>gi|28394796|gb|AAO42475.1| putative antigen receptor [Arabidopsis lyrata]
Length = 92
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/90 (85%), Positives = 83/90 (92%), Gaps = 2/90 (2%)
Query: 103 AGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIG 162
AGG+QLL K+N ARKN PMKIFLLLLGFYTANALATILGQTGDWDVLVAG+VVAAIEGIG
Sbjct: 3 AGGKQLLGKINSARKNFPMKIFLLLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIG 62
Query: 163 MLMYKKPPSS--TGRLQSFVVMMNYWKAGV 190
MLMYKKP SS +G+LQSFVV MN+WKAGV
Sbjct: 63 MLMYKKPSSSMFSGKLQSFVVFMNFWKAGV 92
>gi|28436400|gb|AAO43267.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436402|gb|AAO43268.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436404|gb|AAO43269.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436406|gb|AAO43270.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436408|gb|AAO43271.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436410|gb|AAO43272.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436412|gb|AAO43273.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436414|gb|AAO43274.1| putative antigen receptor [Arabidopsis thaliana]
gi|28436416|gb|AAO43275.1| putative antigen receptor [Arabidopsis thaliana]
Length = 92
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/90 (85%), Positives = 83/90 (92%), Gaps = 2/90 (2%)
Query: 103 AGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIG 162
AGG+QLL K+N ARKN PMKIFLLLLGFYTANALATILGQTGDWDVLVAG+VVAAIEGIG
Sbjct: 3 AGGKQLLGKINSARKNFPMKIFLLLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIG 62
Query: 163 MLMYKKPPSS--TGRLQSFVVMMNYWKAGV 190
MLMYKKP SS +G+LQSFVV MN+WKAGV
Sbjct: 63 MLMYKKPSSSMFSGKLQSFVVFMNFWKAGV 92
>gi|168010839|ref|XP_001758111.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690567|gb|EDQ76933.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 129
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 69/128 (53%), Positives = 91/128 (71%), Gaps = 1/128 (0%)
Query: 77 SGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANAL 136
SG E ++ LG TRLGR++S R+LL++ N R+N P KIFLLL GF++ANAL
Sbjct: 2 SGGDPGEPVDQGASSLGSTRLGRLISVRRRELLQRWNTVRRNFPAKIFLLLFGFFSANAL 61
Query: 137 ATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTG-RLQSFVVMMNYWKAGVCLGLF 195
ATILGQTGDWDV+ AG++VA IEG+G L+Y + P+ G R + + ++NYWK G LF
Sbjct: 62 ATILGQTGDWDVIAAGILVAIIEGVGHLVYNRMPAFLGERGKPIIELVNYWKIGFEFALF 121
Query: 196 VDAFKLGS 203
VDAFK+GS
Sbjct: 122 VDAFKVGS 129
>gi|302809637|ref|XP_002986511.1| hypothetical protein SELMODRAFT_48834 [Selaginella moellendorffii]
gi|300145694|gb|EFJ12368.1| hypothetical protein SELMODRAFT_48834 [Selaginella moellendorffii]
Length = 109
Score = 129 bits (323), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 73/109 (66%), Positives = 88/109 (80%), Gaps = 1/109 (0%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRLG+++ A LLE+ R+NLP+K+FLLLLGFY+ANALAT+LGQTGDWDVLVA ++
Sbjct: 2 TRLGQLLEARRIDLLERFREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDVLVAAII 61
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA IEGIG LMYK PP T +L+ V M+NYWKAG+ GLFVDAFKLGS
Sbjct: 62 VAVIEGIGFLMYKMPP-VTEKLRFLVRMLNYWKAGLSFGLFVDAFKLGS 109
>gi|255553945|ref|XP_002518013.1| conserved hypothetical protein [Ricinus communis]
gi|223542995|gb|EEF44531.1| conserved hypothetical protein [Ricinus communis]
Length = 238
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 62/141 (43%), Positives = 87/141 (61%), Gaps = 7/141 (4%)
Query: 60 MRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNL 119
RR W+I + D+ G+ S G GGTRL R + A +++ ++ RKNL
Sbjct: 80 FRRITWQIRSSADSSGLDPSSS-------GRTRGGTRLLRAIQALQTKIVARIQEIRKNL 132
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSF 179
P+K+ L+GFY A A AT++GQTGDWD+L A + VA +EGIG LMY+ ++S
Sbjct: 133 PIKLLFFLVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMYRASFPLFNNMRSL 192
Query: 180 VVMMNYWKAGVCLGLFVDAFK 200
+ M NYWKAG+ LGLF+D+FK
Sbjct: 193 ITMFNYWKAGLSLGLFLDSFK 213
>gi|302763017|ref|XP_002964930.1| hypothetical protein SELMODRAFT_68854 [Selaginella moellendorffii]
gi|300167163|gb|EFJ33768.1| hypothetical protein SELMODRAFT_68854 [Selaginella moellendorffii]
Length = 109
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/109 (66%), Positives = 87/109 (79%), Gaps = 1/109 (0%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRLG+++ A LLE+ R+NLP+K+FLLLLGFY+ANALAT+LGQTGDWDVLVA ++
Sbjct: 2 TRLGQLLEARRIDLLERFREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDVLVAAII 61
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA EGIG LMYK PP T +L+ V M+NYWKAG+ GLFVDAFKLGS
Sbjct: 62 VAVTEGIGFLMYKMPP-VTEKLRFLVRMLNYWKAGLSFGLFVDAFKLGS 109
>gi|302790716|ref|XP_002977125.1| hypothetical protein SELMODRAFT_59700 [Selaginella moellendorffii]
gi|300155101|gb|EFJ21734.1| hypothetical protein SELMODRAFT_59700 [Selaginella moellendorffii]
Length = 109
Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 71/109 (65%), Positives = 88/109 (80%), Gaps = 1/109 (0%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRLG+++ A LLE+ R+NLP+K+FLLLLGFY+ANALAT+LGQTGDWD+LVA ++
Sbjct: 2 TRLGQLLEARRIDLLERFREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDMLVAAII 61
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA IEGIG LMYK PP T +L+ V M+NYWKAG+ GLFVDAFK+GS
Sbjct: 62 VAVIEGIGFLMYKMPP-VTEKLRFLVRMLNYWKAGLSFGLFVDAFKVGS 109
>gi|413932398|gb|AFW66949.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
gi|413932399|gb|AFW66950.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
gi|413932400|gb|AFW66951.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
Length = 212
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/183 (42%), Positives = 108/183 (59%), Gaps = 14/183 (7%)
Query: 19 ESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFL-GKSFLMRRHGWKIAFALDTGGIS 77
E S RLA A ++ E + S + K G+ C L KS +RRH K+ +A+ T +S
Sbjct: 18 ECSGSRLA--AARRVGREAAVSCRWEKPGT--LCLLRAKSPSLRRHSRKVQWAIRT--MS 71
Query: 78 GSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALA 137
G +S N TRL + + QL KL AR+ LP+KI L+GFY A A A
Sbjct: 72 DDSGDQSGNS-------TRLFSAIRSFWNQLSAKLKKARRGLPVKILFFLIGFYCATAFA 124
Query: 138 TILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVD 197
T++GQTGDWD+L AG+ VA +E IG LMY+ + GR+ + + + NYWKAG+ LGLF+D
Sbjct: 125 TVIGQTGDWDILSAGLAVAIVEVIGALMYRASFAFLGRIGNMITVFNYWKAGLTLGLFLD 184
Query: 198 AFK 200
+FK
Sbjct: 185 SFK 187
>gi|242037377|ref|XP_002466083.1| hypothetical protein SORBIDRAFT_01g000870 [Sorghum bicolor]
gi|241919937|gb|EER93081.1| hypothetical protein SORBIDRAFT_01g000870 [Sorghum bicolor]
Length = 204
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/150 (44%), Positives = 93/150 (62%), Gaps = 10/150 (6%)
Query: 52 CFL-GKSFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLE 110
C L KS + RH K+ +A+ T +S G +S N TRL + + +L
Sbjct: 40 CLLRAKSPSLPRHSRKVEWAIRT--MSDDSGDQSSNS-------TRLFSAIRSFWNKLSA 90
Query: 111 KLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPP 170
KL ARK LP+KI L+GFY A A AT++GQTGDWD+L AG+ VA +E IG LMY+
Sbjct: 91 KLKKARKGLPVKILFFLIGFYCATAFATVIGQTGDWDILSAGLAVAVVEVIGALMYRASF 150
Query: 171 SSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
+ GR+++ + + NYWKAG+ LGLF+D+FK
Sbjct: 151 ALLGRIRNMITIFNYWKAGLTLGLFLDSFK 180
>gi|449454528|ref|XP_004145006.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
gi|449474824|ref|XP_004154295.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 221
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 75/109 (68%)
Query: 93 GGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAG 152
G TRL RI+ A +L ++N RKNLP+KI L+GFY+A A AT++GQTGDWD+L A
Sbjct: 89 GRTRLIRIIQAIQTKLNARINELRKNLPLKILFFLVGFYSATAFATVIGQTGDWDILSAA 148
Query: 153 VVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKL 201
+ V +EGIG LMY+ +++ + M NYWKAG+ +GLF+D+FK
Sbjct: 149 LAVVVVEGIGALMYRASLPLLNKIKDLITMFNYWKAGLSMGLFLDSFKF 197
>gi|449498939|ref|XP_004160676.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 221
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 75/109 (68%)
Query: 93 GGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAG 152
G TRL RI+ A +L ++N RKNLP+KI L+GFY+A A AT++GQTGDWD+L A
Sbjct: 89 GRTRLIRIIQAIQTKLNARINELRKNLPLKILFFLVGFYSATAFATVIGQTGDWDILSAA 148
Query: 153 VVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKL 201
+ V +EGIG LMY+ +++ + M NYWKAG+ +GLF+D+FK
Sbjct: 149 LAVVVVEGIGALMYRASLPLLNKIKDLITMFNYWKAGLSMGLFLDSFKF 197
>gi|115456649|ref|NP_001051925.1| Os03g0852600 [Oryza sativa Japonica Group]
gi|113550396|dbj|BAF13839.1| Os03g0852600, partial [Oryza sativa Japonica Group]
Length = 206
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 79/121 (65%), Gaps = 3/121 (2%)
Query: 83 ESLNGDGPDLGG---TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATI 139
E++ D D G TRL + + +L KL RK LP+KI L+GFY A A AT+
Sbjct: 63 EAMTDDSADQSGDNNTRLFSAIQSFLSKLYGKLKKLRKGLPLKILFFLIGFYCATAFATV 122
Query: 140 LGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAF 199
+GQTGDWD+L AG+ VA +EGIG LMY+ + GR++ + + NYWKAG+ LGLF+D+F
Sbjct: 123 IGQTGDWDILSAGLAVAIVEGIGALMYRASFAFLGRIRKMITIFNYWKAGLTLGLFLDSF 182
Query: 200 K 200
K
Sbjct: 183 K 183
>gi|356496360|ref|XP_003517036.1| PREDICTED: uncharacterized protein LOC100306594 [Glycine max]
Length = 197
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 84/141 (59%), Gaps = 7/141 (4%)
Query: 60 MRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNL 119
+R W + +L+ S S NG G TR+ R++ +L K+ +KNL
Sbjct: 63 FKRMIWSVRSSLNDSSFSPSTS----NGSN---GRTRIIRVIQEFQTKLGSKIQEVKKNL 115
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSF 179
PMK+ L+GFY A A AT++GQTGDWD+L A + VA +EGIG LMY+ + +S
Sbjct: 116 PMKLLFFLVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMYRASLPLVSKSRSL 175
Query: 180 VVMMNYWKAGVCLGLFVDAFK 200
+ + NYWKAG+ LGLF+D+FK
Sbjct: 176 ISLFNYWKAGLTLGLFLDSFK 196
>gi|356506255|ref|XP_003521902.1| PREDICTED: ycf20-like protein-like [Glycine max]
Length = 197
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 74/108 (68%)
Query: 93 GGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAG 152
G TR+ R++ +L K+ +KNLP+K+F L+GFY A A AT++GQTGDWD+L A
Sbjct: 89 GRTRIIRVIQEFQTKLGSKIQEVKKNLPLKLFFFLVGFYCATAFATVIGQTGDWDILSAA 148
Query: 153 VVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
+ VA +EGIG LMY+ + +S + + NYWKAG+ LGLF+D+FK
Sbjct: 149 LAVAVVEGIGALMYRASLPLVRKSRSLISLFNYWKAGLTLGLFLDSFK 196
>gi|224063751|ref|XP_002301276.1| predicted protein [Populus trichocarpa]
gi|222843002|gb|EEE80549.1| predicted protein [Populus trichocarpa]
Length = 152
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 89/148 (60%), Gaps = 8/148 (5%)
Query: 53 FLGKSFLMRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKL 112
+L K+F RR W+I ++D G+ S G GTRL R + A +L ++
Sbjct: 12 YLAKNF--RRRAWQIKSSVDGSGLDPSSSSSGGGG------GTRLVRAIQAFQTRLGVRI 63
Query: 113 NIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSS 172
RKNLPMK+ L+GFY A A AT++GQTGDWD+L A + V +EGIG LMY+
Sbjct: 64 REIRKNLPMKLLFFLVGFYCATAFATVIGQTGDWDILSAALAVVVVEGIGALMYRASFPF 123
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
++S V + NYWKAG+ LGLF+D+FK
Sbjct: 124 VKSIRSIVSVFNYWKAGLTLGLFLDSFK 151
>gi|296089191|emb|CBI38894.3| unnamed protein product [Vitis vinifera]
Length = 243
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 84/140 (60%), Gaps = 7/140 (5%)
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLP 120
+R W + ++D + + +G D G TRL R++ +L ++ RK+LP
Sbjct: 87 KRMVWSVRSSVDGNSLDPTPT------NGTD-GRTRLVRVIQGIQIKLSTRILELRKDLP 139
Query: 121 MKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFV 180
MKI L+GFY A A AT++GQTGDWD+L A + V +E IG LMY+ +++ +
Sbjct: 140 MKILFFLVGFYCATAFATVIGQTGDWDILSAALAVFVVELIGALMYRASIHLFDKVRRLI 199
Query: 181 VMMNYWKAGVCLGLFVDAFK 200
M NYWKAG+CLGLF+D+FK
Sbjct: 200 TMFNYWKAGLCLGLFLDSFK 219
>gi|302763747|ref|XP_002965295.1| hypothetical protein SELMODRAFT_7627 [Selaginella moellendorffii]
gi|300167528|gb|EFJ34133.1| hypothetical protein SELMODRAFT_7627 [Selaginella moellendorffii]
Length = 105
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 66/106 (62%), Positives = 80/106 (75%), Gaps = 1/106 (0%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRLG+++ A LLE+ R+NLP+K+FLLLLGFY+ NALAT+LGQTGDWDVLV +V
Sbjct: 1 TRLGQLLEARRIDLLERFRETRRNLPLKVFLLLLGFYSGNALATVLGQTGDWDVLVGAIV 60
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
VA IEGIG LMYK PP T +L+ V M+NYWK G+ GLFVD K
Sbjct: 61 VAVIEGIGFLMYKMPP-VTEKLRFLVKMLNYWKVGLSFGLFVDGSK 105
>gi|225453999|ref|XP_002274644.1| PREDICTED: ycf20-like protein-like [Vitis vinifera]
Length = 217
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 84/140 (60%), Gaps = 7/140 (5%)
Query: 61 RRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLP 120
+R W + ++D + + +G D G TRL R++ +L ++ RK+LP
Sbjct: 61 KRMVWSVRSSVDGNSLDPTPT------NGTD-GRTRLVRVIQGIQIKLSTRILELRKDLP 113
Query: 121 MKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFV 180
MKI L+GFY A A AT++GQTGDWD+L A + V +E IG LMY+ +++ +
Sbjct: 114 MKILFFLVGFYCATAFATVIGQTGDWDILSAALAVFVVELIGALMYRASIHLFDKVRRLI 173
Query: 181 VMMNYWKAGVCLGLFVDAFK 200
M NYWKAG+CLGLF+D+FK
Sbjct: 174 TMFNYWKAGLCLGLFLDSFK 193
>gi|302809861|ref|XP_002986623.1| hypothetical protein SELMODRAFT_48840 [Selaginella moellendorffii]
gi|300145806|gb|EFJ12480.1| hypothetical protein SELMODRAFT_48840 [Selaginella moellendorffii]
Length = 106
Score = 112 bits (280), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 65/106 (61%), Positives = 80/106 (75%), Gaps = 1/106 (0%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRLG+++ A LLE+ R+NLP+K+FLLLLGFY+ NALAT+LGQTGDWDVLV ++
Sbjct: 2 TRLGQLLEAKRIDLLERFREIRRNLPLKVFLLLLGFYSGNALATVLGQTGDWDVLVGAII 61
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
VA IEGIG LMYK PP T +L+ V M+NYWK G+ GLFVD K
Sbjct: 62 VAVIEGIGFLMYKIPP-VTEKLRFLVKMLNYWKVGLSFGLFVDGSK 106
>gi|357118772|ref|XP_003561123.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
Length = 219
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 78/120 (65%), Gaps = 2/120 (1%)
Query: 83 ESLNGDGPDLGG--TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATIL 140
+++ D D G TRL + + +L KL R+ P+KI L+GFY A A AT++
Sbjct: 76 KTMADDNADNSGNSTRLFTAIQSFWNKLSGKLKKLRRGFPVKILFFLIGFYCATAFATVI 135
Query: 141 GQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
GQTGDWD+L AG+ VA +E IG LMY+ + GR+++ + + NYWKAG+ LGLF+D+FK
Sbjct: 136 GQTGDWDILSAGLAVAIVEFIGALMYRASFAFFGRIKNMITIFNYWKAGLTLGLFLDSFK 195
>gi|79315375|ref|NP_001030875.1| uncharacterized protein [Arabidopsis thaliana]
gi|145332877|ref|NP_001078304.1| uncharacterized protein [Arabidopsis thaliana]
gi|9663002|emb|CAC00746.1| putative protein [Arabidopsis thaliana]
gi|45752730|gb|AAS76263.1| At3g56830 [Arabidopsis thaliana]
gi|110740493|dbj|BAE98352.1| hypothetical protein [Arabidopsis thaliana]
gi|332646050|gb|AEE79571.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646051|gb|AEE79572.1| uncharacterized protein [Arabidopsis thaliana]
Length = 188
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 72/108 (66%)
Query: 94 GTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGV 153
G RL + + +LL K+ +K+LP K+F LL+GFY+A A +T +GQTGDWDVL AG+
Sbjct: 81 GLRLIKAIQVLRSKLLVKIQEIKKDLPKKLFFLLVGFYSATAFSTFIGQTGDWDVLSAGL 140
Query: 154 VVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKL 201
V +E IG LMY+ +++S + M NYWK G+ LGLF+D+FK
Sbjct: 141 AVLVVECIGALMYRASIPLINKMRSTITMFNYWKTGLALGLFLDSFKF 188
>gi|42565978|ref|NP_191242.2| uncharacterized protein [Arabidopsis thaliana]
gi|332646049|gb|AEE79570.1| uncharacterized protein [Arabidopsis thaliana]
Length = 230
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 51/107 (47%), Positives = 72/107 (67%)
Query: 94 GTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGV 153
G RL + + +LL K+ +K+LP K+F LL+GFY+A A +T +GQTGDWDVL AG+
Sbjct: 81 GLRLIKAIQVLRSKLLVKIQEIKKDLPKKLFFLLVGFYSATAFSTFIGQTGDWDVLSAGL 140
Query: 154 VVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
V +E IG LMY+ +++S + M NYWK G+ LGLF+D+FK
Sbjct: 141 AVLVVECIGALMYRASIPLINKMRSTITMFNYWKTGLALGLFLDSFK 187
>gi|297820458|ref|XP_002878112.1| hypothetical protein ARALYDRAFT_486123 [Arabidopsis lyrata subsp.
lyrata]
gi|297323950|gb|EFH54371.1| hypothetical protein ARALYDRAFT_486123 [Arabidopsis lyrata subsp.
lyrata]
Length = 188
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 70/108 (64%)
Query: 94 GTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGV 153
G RL + + +LL K +K+LP K+F LL+GFY+A A +T +GQTGDWDVL AGV
Sbjct: 81 GLRLIKALQVLRTKLLVKFQEIKKDLPKKLFFLLVGFYSATAFSTFIGQTGDWDVLSAGV 140
Query: 154 VVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKL 201
V +E IG LMY+ +++ + M NYWK G+ LGLF+D+FK
Sbjct: 141 AVLVVECIGALMYRASIPLINKMRGTITMFNYWKTGLALGLFLDSFKF 188
>gi|302820910|ref|XP_002992120.1| hypothetical protein SELMODRAFT_73099 [Selaginella moellendorffii]
gi|300140046|gb|EFJ06775.1| hypothetical protein SELMODRAFT_73099 [Selaginella moellendorffii]
Length = 73
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/62 (67%), Positives = 52/62 (83%), Gaps = 1/62 (1%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
GFY+ANALAT+LGQTGDWDVLVA ++VA IEGIG LM++ PP T +L+ V M+NYWKA
Sbjct: 13 GFYSANALATVLGQTGDWDVLVAAIIVAVIEGIGFLMHRMPP-VTEKLRFLVRMLNYWKA 71
Query: 189 GV 190
G+
Sbjct: 72 GL 73
>gi|255629007|gb|ACU14848.1| unknown [Glycine max]
Length = 171
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 63/108 (58%), Gaps = 7/108 (6%)
Query: 60 MRRHGWKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNL 119
+R W + +L+ S S NG G TR+ R++ +L K+ +KNL
Sbjct: 63 FKRMIWSVRSSLNDSSFSPSTS----NGSN---GRTRIIRVIQEFQTKLGSKIQEVKKNL 115
Query: 120 PMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYK 167
PMK+ L+GFY A A AT++GQTGDWD+L A + VA +EGIG LMY+
Sbjct: 116 PMKLLFFLVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMYR 163
>gi|108773201|ref|YP_635793.1| hypothetical protein RF20 [Chara vulgaris]
gi|108773301|ref|YP_635808.1| hypothetical protein RF20 [Chara vulgaris]
gi|77157857|gb|ABA61898.1| hypothetical protein RF20 [Chara vulgaris]
gi|77157957|gb|ABA61998.1| hypothetical protein RF20 [Chara vulgaris]
Length = 137
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 66/115 (57%), Gaps = 6/115 (5%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL + + L+ L + +KIF+LLLGF+ AN L+TIL Q DWD+L+A ++
Sbjct: 23 TRLEQFIKIKATNLISYLENNSYSFCLKIFVLLLGFFHANTLSTILAQVADWDILIAVIL 82
Query: 155 VAAIEGIGMLMYKKPPS------STGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+E IG+L+Y+ P R + + ++N WK G LFVD+FK+GS
Sbjct: 83 SVMVESIGLLIYQIPSRILLFTICKERRKKSLFLLNVWKIGFTYALFVDSFKVGS 137
>gi|11466439|ref|NP_038445.1| hypothetical chloroplast RF20 [Mesostigma viride]
gi|12230812|sp|Q9MUL5.1|YCF20_MESVI RecName: Full=Uncharacterized protein ycf20; Short=RF20
gi|7259582|gb|AAF43883.1|AF166114_95 hypothetical chloroplast RF20 (chloroplast) [Mesostigma viride]
Length = 116
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 73/116 (62%), Gaps = 4/116 (3%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + + L K+ N P+KI LLLLGF+ A LAT+ GQTGDWDVLVA
Sbjct: 1 MSNTRLTLFFTNIFQLLDSKIQNYSINFPIKILLLLLGFFIATVLATVFGQTGDWDVLVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRL--QSFVVM--MNYWKAGVCLGLFVDAFKLGS 203
G++VA IE +G MY K S ++ SF+ + +NY K G+ GLFVDAFKLGS
Sbjct: 61 GILVAMIEILGNKMYSKKYISKKQVFDISFLSLIGINYIKIGLIFGLFVDAFKLGS 116
>gi|90994406|ref|YP_536896.1| hypothetical chloroplast protein 20 [Pyropia yezoensis]
gi|122194735|sp|Q1XDS2.1|YCF20_PORYE RecName: Full=Uncharacterized protein ycf20
gi|90818970|dbj|BAE92339.1| unnamed protein product [Pyropia yezoensis]
Length = 108
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 4/109 (3%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL + + L KL + L + LLLGF+ + L+TI GQTGDW ++ A ++
Sbjct: 4 TRLSTFFAYLIKNLNNKLYYSLSELTTGLISLLLGFFISTGLSTIPGQTGDWGIIAASLI 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VAA E ++Y SS +L + + N +K G+ GLFVDAFKLGS
Sbjct: 64 VAATELTSKIVY----SSHKQLNIKINLFNNFKIGITYGLFVDAFKLGS 108
>gi|11465680|ref|NP_053824.1| ORF20 [Porphyra purpurea]
gi|1723324|sp|P51214.1|YCF20_PORPU RecName: Full=Uncharacterized protein ycf20
gi|1276680|gb|AAC08100.1| hypothetical chloroplast ORF 20 (chloroplast) [Porphyra purpurea]
Length = 108
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
T+L S + L +L + L + LLLGF+ + L+TI GQTGDW ++ A ++
Sbjct: 4 TKLSIFFSYFVQNLSSRLYYSLNELTAGLISLLLGFFISTGLSTIPGQTGDWGIIAASLI 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VAAIE + ++Y R+ ++N K G+ GLFVDAFKLGS
Sbjct: 64 VAAIELVSKIVYSNKKKYGVRIN----LLNNLKIGITYGLFVDAFKLGS 108
>gi|334118318|ref|ZP_08492408.1| protein of unknown function DUF565 [Microcoleus vaginatus FGP-2]
gi|333460303|gb|EGK88913.1| protein of unknown function DUF565 [Microcoleus vaginatus FGP-2]
Length = 110
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TR+ R++ G + L+ + + + I LL G + A++TI GQ+ DWD++ AG++
Sbjct: 4 TRINRLIDVLGDRFRGWLSNPWRRISLLIISLLFGTFLGTAISTIAGQSADWDIIAAGLL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
V E + L+Y P +TG L +V M+N K G+ LFV+AFKLGS
Sbjct: 64 VLLTELVNWLVYGGPRPATGSL--WVEMLNALKIGLTYNLFVEAFKLGS 110
>gi|378787253|gb|AFC39884.1| Ycf20 [Porphyra umbilicalis]
Length = 108
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 62/109 (56%), Gaps = 4/109 (3%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
T+L + S + +L + +L + LLLGF+ + +L+TI GQTGDW ++ A ++
Sbjct: 4 TKLSKFFSYFLNNISSRLYYSLSDLASGLISLLLGFFISTSLSTIPGQTGDWGIVAASLI 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA E + ++Y ++ ++N +K G+ GLFVDAFKLGS
Sbjct: 64 VAGTELVSKIVYSNKEKCVIKIN----LVNNFKIGITYGLFVDAFKLGS 108
>gi|354568968|ref|ZP_08988128.1| protein of unknown function DUF565 [Fischerella sp. JSC-11]
gi|353539180|gb|EHC08672.1| protein of unknown function DUF565 [Fischerella sp. JSC-11]
Length = 110
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 63/112 (56%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + + R+L E + + + + I LLGF+ +A+AT GQT +WD++VA
Sbjct: 1 MQNTRLNNLFNVLTRRLAEWILNPWRRISILIISFLLGFFLGSAVATTAGQTAEWDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G +V A E + Y + S R +V +NY K G+ LF++AFKLGS
Sbjct: 61 GTLVLATEIASRIFYNR--SVLARQVFWVQAINYLKVGLTYSLFLEAFKLGS 110
>gi|283794933|ref|YP_003359286.1| hypothetical chloroplast RF20 [Cryptomonas paramecium]
gi|253981905|gb|ACT46822.1| hypothetical chloroplast RF20 [Cryptomonas paramecium]
Length = 105
Score = 66.6 bits (161), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 55/94 (58%), Gaps = 8/94 (8%)
Query: 118 NLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIG--------MLMYKKP 169
L + L+L G + A L+T LGQTGDW +L++G++V +E I ++M
Sbjct: 12 QLLINFILILFGCFIATTLSTTLGQTGDWGMLISGIIVTFLEIISKRTYTIKKLIMILPK 71
Query: 170 PSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ +++ FVV++N K GV G F++AFKLGS
Sbjct: 72 NYQSHKIKYFVVLLNDIKIGVLYGFFIEAFKLGS 105
>gi|81299804|ref|YP_400012.1| hypothetical protein Synpcc7942_0995 [Synechococcus elongatus PCC
7942]
gi|81168685|gb|ABB57025.1| conserved hypothetical protein YCF20 [Synechococcus elongatus PCC
7942]
Length = 131
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 61/114 (53%), Gaps = 2/114 (1%)
Query: 91 DLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLV 150
D+ TRL ++VS G ++L+ + L ++I LL GF+ N L+ + GQ+G WD
Sbjct: 19 DMQNTRLNQLVSEAGGRVLQWFQQPWRRLSLQIIFLLFGFWLTNTLSLVAGQSGVWDPSF 78
Query: 151 AGVVVAAIEGIGMLMYKKPPS-STGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
A V+ ++E L Y+ P + R F+++ N K G GL ++ KLGS
Sbjct: 79 AAVITISVETCSWLYYRGPGAGKPDRSLPFLLLQNL-KLGALYGLALETLKLGS 131
>gi|209527452|ref|ZP_03275957.1| protein of unknown function DUF565 [Arthrospira maxima CS-328]
gi|376003361|ref|ZP_09781173.1| putative (predicted) allophycocyanin linker protein [Arthrospira
sp. PCC 8005]
gi|423066926|ref|ZP_17055716.1| hypothetical protein SPLC1_S530040 [Arthrospira platensis C1]
gi|209492125|gb|EDZ92475.1| protein of unknown function DUF565 [Arthrospira maxima CS-328]
gi|375328283|emb|CCE16926.1| putative (predicted) allophycocyanin linker protein [Arthrospira
sp. PCC 8005]
gi|406711212|gb|EKD06413.1| hypothetical protein SPLC1_S530040 [Arthrospira platensis C1]
Length = 111
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 59/112 (52%), Gaps = 1/112 (0%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL R A +Q L+ + + + + LL GF+ A+ATI+GQ G WDV VA
Sbjct: 1 MQNTRLNRFTDAIAQQTLQWFRNPWRRISVLLISLLFGFFLGVAIATIVGQAGSWDVTVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ + E I L Y+K P S ++ ++N K G+ LF+ AF LGS
Sbjct: 61 FITLLVTEWISWLSYRKVPRSEIN-SLWLDLLNSLKIGMTYSLFLLAFLLGS 111
>gi|124112127|ref|YP_001019166.1| hypothetical chloroplast RF20 [Chlorokybus atmophyticus]
gi|124012234|gb|ABM87970.1| hypothetical chloroplast RF20 [Chlorokybus atmophyticus]
Length = 127
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 13/102 (12%)
Query: 115 ARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTG 174
A + + IF LL+GF+ A ATILG +W+VL A +V IE G +Y K ++
Sbjct: 26 ASFHFSLAIFCLLMGFFMATFFATILGFPKNWNVLNATFLVILIEVTGFFLYGKTSATFN 85
Query: 175 -------------RLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ +V +NY K G+ GLFVDAFKLGS
Sbjct: 86 LYTNKNLHSNKKKNINLWVANINYLKIGLIFGLFVDAFKLGS 127
>gi|443316319|ref|ZP_21045768.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 6406]
gi|442784083|gb|ELR93974.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 6406]
Length = 112
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 61/109 (55%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V Q + + L + I LL G + A+A+I GQT D DV+V+ +
Sbjct: 4 TRLATLVDTLANQFGRWVFNPWRRLSLVIIGLLFGNFFGIAIASIAGQTADLDVVVSAFL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+AA+E ++Y++P G+ + ++N +K G+ LFV+AFKLGS
Sbjct: 64 LAAVEITSWIVYRRPRPKEGQSSLLLEVLNAFKIGITYALFVEAFKLGS 112
>gi|443312485|ref|ZP_21042102.1| Protein of unknown function (DUF565) [Synechocystis sp. PCC 7509]
gi|442777463|gb|ELR87739.1| Protein of unknown function (DUF565) [Synechocystis sp. PCC 7509]
Length = 109
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 59/109 (54%), Gaps = 3/109 (2%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL + A G QL L + L + LL GF+ A++T GQ DWD++ AG++
Sbjct: 4 TRLNNLFGAIGGQLSRWLFNPWRRLSLLAISLLFGFFLGTAVSTTAGQAADWDIIAAGIL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA E + + Y++ +G +V +N K G+ LF++AFKLGS
Sbjct: 64 VALSEIVDRIYYRR---RSGDRSFWVESLNALKIGLTYSLFIEAFKLGS 109
>gi|16329594|ref|NP_440322.1| hypothetical protein sll1509 [Synechocystis sp. PCC 6803]
gi|383321335|ref|YP_005382188.1| hypothetical protein SYNGTI_0426 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324505|ref|YP_005385358.1| hypothetical protein SYNPCCP_0426 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490389|ref|YP_005408065.1| hypothetical protein SYNPCCN_0426 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435655|ref|YP_005650379.1| hypothetical protein SYNGTS_0426 [Synechocystis sp. PCC 6803]
gi|451813753|ref|YP_007450205.1| YCF20 protein [Synechocystis sp. PCC 6803]
gi|6136503|sp|P72983.1|YC20L_SYNY3 RecName: Full=Ycf20-like protein
gi|1652077|dbj|BAA17002.1| ycf20 [Synechocystis sp. PCC 6803]
gi|339272687|dbj|BAK49174.1| hypothetical protein SYNGTS_0426 [Synechocystis sp. PCC 6803]
gi|359270654|dbj|BAL28173.1| hypothetical protein SYNGTI_0426 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359273825|dbj|BAL31343.1| hypothetical protein SYNPCCN_0426 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359276995|dbj|BAL34512.1| hypothetical protein SYNPCCP_0426 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957475|dbj|BAM50715.1| hypothetical protein BEST7613_1784 [Bacillus subtilis BEST7613]
gi|451779722|gb|AGF50691.1| YCF20 protein [Synechocystis sp. PCC 6803]
Length = 109
Score = 63.5 bits (153), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 57/109 (52%), Gaps = 3/109 (2%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL IV G+QL + + + + + L GF+ A+AT GQ WDV+ A +
Sbjct: 4 TRLNTIVEVRGQQLSQFFRNPWRRISLSLLSFLFGFFVGTAVATTAGQNSQWDVVCAAFI 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E + Y++ G LQ+ V +N +K GV LF++AFKLGS
Sbjct: 64 LLFCELVNRWFYRRGV-KMGDLQAEV--LNIFKMGVSYSLFLEAFKLGS 109
>gi|56750558|ref|YP_171259.1| hypothetical protein syc0549_c [Synechococcus elongatus PCC 6301]
gi|56685517|dbj|BAD78739.1| hypothetical protein YCF20 [Synechococcus elongatus PCC 6301]
Length = 112
Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 59/110 (53%), Gaps = 2/110 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL ++VS G ++L+ + L ++I LL GF+ N L+ + GQ+G WD A V+
Sbjct: 4 TRLNQLVSEAGGRVLQWFQQPWRRLSLQIIFLLFGFWLTNTLSLVAGQSGVWDPSFAAVI 63
Query: 155 VAAIEGIGMLMYKKPPS-STGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
++E L Y+ P + R F+++ N K G GL ++ KLGS
Sbjct: 64 TISVETCSWLYYRGPGAGKPDRSLPFLLLQNL-KLGALYGLALETLKLGS 112
>gi|428223860|ref|YP_007107957.1| hypothetical protein GEI7407_0404 [Geitlerinema sp. PCC 7407]
gi|427983761|gb|AFY64905.1| hypothetical protein GEI7407_0404 [Geitlerinema sp. PCC 7407]
Length = 110
Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 62/113 (54%), Gaps = 4/113 (3%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ QL L + + + + LL GF+T NAL+T GQ+GD DVLVA
Sbjct: 1 MQNTRLSSLIDVLLAQLRRWLLNPWRRVSLVVISLLFGFFTGNALSTTTGQSGDIDVLVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVV-MMNYWKAGVCLGLFVDAFKLGS 203
+ V+ E + +Y + G Q ++ +N +K G+ LF++AFKLGS
Sbjct: 61 AITVSLTEAVSWFVYGR---RWGERQPLLIDSLNAFKLGLIYSLFLEAFKLGS 110
>gi|428201383|ref|YP_007079972.1| hypothetical protein Ple7327_0997 [Pleurocapsa sp. PCC 7327]
gi|427978815|gb|AFY76415.1| Protein of unknown function (DUF565) [Pleurocapsa sp. PCC 7327]
Length = 107
Score = 62.4 bits (150), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 49/78 (62%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
+L GF+ +AL+T GQ +WD++ AG V+ +E + + Y++ S F+ ++N+
Sbjct: 35 VLFGFFAGSALSTTAGQAAEWDIIAAGFVLLFVEFVSRIFYRRERRSL-----FLEVLNF 89
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G+ LF++AFKLGS
Sbjct: 90 FKIGMIYSLFLEAFKLGS 107
>gi|428316563|ref|YP_007114445.1| protein of unknown function DUF565 [Oscillatoria nigro-viridis PCC
7112]
gi|428240243|gb|AFZ06029.1| protein of unknown function DUF565 [Oscillatoria nigro-viridis PCC
7112]
Length = 110
Score = 62.4 bits (150), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 62/113 (54%), Gaps = 4/113 (3%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TR+ R++ G + L+ + + + I LL G + A++TI GQ+ DWD++ A
Sbjct: 1 MQNTRINRLIDVLGDRFRGWLSNPWRRISLLIISLLFGTFLGTAISTIAGQSADWDIIAA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSF-VVMMNYWKAGVCLGLFVDAFKLGS 203
G++V E L+Y S +SF V M+N K G+ LFV+AFKLGS
Sbjct: 61 GLLVLLTEFANRLVYGSLRSDD---RSFWVQMLNALKIGLTYNLFVEAFKLGS 110
>gi|300868146|ref|ZP_07112779.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300333849|emb|CBN57959.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 110
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 60/109 (55%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL R++ QL + L + + + I LL G + A++TI GQ +WD+ AG++
Sbjct: 4 TRLNRLIDVLSEQLGDWLQNPWRRISLLIISLLFGSFLGGAISTIAGQAAEWDIFAAGLL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA E I ++Y+ L+ + ++N K G+ LFV+AFKLGS
Sbjct: 64 VALTEFINWIVYRGTRLVGRSLR--IDILNCLKIGLIYSLFVEAFKLGS 110
>gi|218437362|ref|YP_002375691.1| hypothetical protein PCC7424_0355 [Cyanothece sp. PCC 7424]
gi|218170090|gb|ACK68823.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7424]
Length = 116
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 12/117 (10%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGF----YTANALATILGQTGDWDVLV 150
TRL +V ++ KL +N +I +L++GF + A AL+T GQ DWD+++
Sbjct: 4 TRLSTLVDT----IVFKLQRFYRNPWRRISVLVIGFLGGIFVAQALSTTGGQEADWDIVM 59
Query: 151 AGVVVAAIEGIGMLMYKKPPSSTG----RLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
A V+ EG+ + +Y+ P + R F +N +K G+ +F++AFKLGS
Sbjct: 60 ATFVLLFTEGVSIFVYRLPRKQSSNLDIRYSLFTESLNMFKIGITYSMFLEAFKLGS 116
>gi|427420155|ref|ZP_18910338.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 7375]
gi|425762868|gb|EKV03721.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 7375]
Length = 110
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 62/109 (56%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V G++ + + + + + + LL G + A A++ GQT + D+L++ ++
Sbjct: 4 TRLSTLVDQAGQRFSQWVFNPWRRISLAVISLLFGNFFATAISATAGQTAEIDILISAIL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
VA +E I L ++ S G+ ++N +K G+ GLFV+AFKLGS
Sbjct: 64 VAMVEFISWLYHRS--RSNGQRTILPELLNSFKLGMVYGLFVEAFKLGS 110
>gi|307153416|ref|YP_003888800.1| hypothetical protein Cyan7822_3584 [Cyanothece sp. PCC 7822]
gi|306983644|gb|ADN15525.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7822]
Length = 116
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 64/113 (56%), Gaps = 4/113 (3%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V A ++ L+ + + + + L GF+ A A++T GQ D DV++A ++
Sbjct: 4 TRLNTLVDATAFRIQRFLSNPWRRISVLLIGFLGGFFAAQAISTTGGQKADLDVVMAALI 63
Query: 155 VAAIEGIGMLMYKKP--PSSTGRLQS--FVVMMNYWKAGVCLGLFVDAFKLGS 203
+ EG + +Y+ PS G ++ FV +N +K G+ GLF++AFKLGS
Sbjct: 64 LLFTEGASIFVYRFSGRPSKAGEVRQWLFVETLNLFKMGMIYGLFLEAFKLGS 116
>gi|409992567|ref|ZP_11275749.1| hypothetical protein APPUASWS_15812 [Arthrospira platensis str.
Paraca]
gi|409936561|gb|EKN78043.1| hypothetical protein APPUASWS_15812 [Arthrospira platensis str.
Paraca]
Length = 111
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 59/113 (52%), Gaps = 3/113 (2%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL R A +Q L+ + + + + LL GF+ A+ATI+GQ G WDV VA
Sbjct: 1 MQNTRLNRFTDAIAQQTLQWFRNPWRRISVLLISLLFGFFLGVAIATIVGQVGSWDVSVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSF-VVMMNYWKAGVCLGLFVDAFKLGS 203
+ + E I L Y+K S + S + ++N K G+ LF+ AF LGS
Sbjct: 61 FITLLVTEWISWLSYRKVSRS--EINSLGLDLLNSLKIGMTYSLFLLAFLLGS 111
>gi|291566840|dbj|BAI89112.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 111
Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 1/112 (0%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL R A +Q L+ + + + + L GF+ A+ATI+GQ G WDV VA
Sbjct: 1 MQNTRLNRFTDAIAQQTLQWFRNPWRRISVLLISLFFGFFLGVAIATIVGQVGSWDVSVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ + E I L Y+K S ++ ++N K G+ LF+ AF LGS
Sbjct: 61 FITLLVTEWISWLSYRKVSRSEIN-SLWLDLLNSMKIGMTYSLFLLAFLLGS 111
>gi|11465590|ref|NP_045069.1| hypothetical protein CycaCp052 [Cyanidium caldarium]
gi|14548001|sp|Q9MVP1.1|YCF20_CYACA RecName: Full=Uncharacterized protein ycf20
gi|6715544|gb|AAF26453.1| unknown [Cyanidium caldarium]
Length = 101
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 47/80 (58%)
Query: 124 FLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMM 183
F+ L GF+ ++A +TIL QT +W +L A ++++ +E L +K + + +
Sbjct: 22 FIFLFGFFLSSATSTILIQTNEWSILTAAILISIVELFNYLKHKFQFNDRKSGYNCFFFI 81
Query: 184 NYWKAGVCLGLFVDAFKLGS 203
N K G+ GLF+DAFKLGS
Sbjct: 82 NLAKLGLLYGLFIDAFKLGS 101
>gi|428773704|ref|YP_007165492.1| hypothetical protein Cyast_1888 [Cyanobacterium stanieri PCC 7202]
gi|428687983|gb|AFZ47843.1| protein of unknown function (DUF565) [Cyanobacterium stanieri PCC
7202]
Length = 109
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL ++++ G Q+++ + + L + I LLGF+ A+ A I GQ G WDV A
Sbjct: 4 TRLNILLTSIGNQIIDFFSNPWRRLSLNIIAFLLGFFLASTTAAIAGQAGKWDVTYAFFF 63
Query: 155 VAAIEGIGMLMYK--KPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ EG +++YK P R +N +K G L+++ FKLGS
Sbjct: 64 LVFTEGSNIIIYKNRNPNKPLWR-----TTLNAFKIGFAYCLYLEGFKLGS 109
>gi|427730935|ref|YP_007077172.1| hypothetical protein Nos7524_3796 [Nostoc sp. PCC 7524]
gi|427366854|gb|AFY49575.1| Protein of unknown function (DUF565) [Nostoc sp. PCC 7524]
Length = 110
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 59/113 (52%), Gaps = 4/113 (3%)
Query: 92 LGGTRLGRIVSAGGRQL-LEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLV 150
+ TRL + A R L L LN R+ L + + L GF+ A++TI GQ G D+++
Sbjct: 1 MQNTRLNNLFDAIARNLGLWFLNPWRR-LSLLVISFLFGFFLGTAISTIAGQRGTLDIMI 59
Query: 151 AGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
AG +V E + Y + S R + +V +N K G LF++AFKLGS
Sbjct: 60 AGFLVVLTEVTSRIFYSR--SFFARRELWVESLNLLKVGFIYSLFIEAFKLGS 110
>gi|427720360|ref|YP_007068354.1| hypothetical protein Cal7507_5179 [Calothrix sp. PCC 7507]
gi|427352796|gb|AFY35520.1| protein of unknown function DUF565 [Calothrix sp. PCC 7507]
Length = 110
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + A R+L E + L + + L GF+ A++T GQ DWD++VA
Sbjct: 1 MQNTRLNNLFDAIARRLGEWFTNPWRRLSLLVISFLFGFFLGTAVSTTAGQKADWDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+V E + Y + S L +V +N K G LF++A KLGS
Sbjct: 61 AFLVFLTEISSRIFYNRSFLSKRSL--WVEALNLLKVGFTYSLFIEALKLGS 110
>gi|166363096|ref|YP_001655369.1| hypothetical protein MAE_03550 [Microcystis aeruginosa NIES-843]
gi|166085469|dbj|BAG00177.1| hypothetical protein MAE_03550 [Microcystis aeruginosa NIES-843]
Length = 111
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG++V E I ++Y+K P+
Sbjct: 26 RRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLVLFTEAISRIVYRKSSQAKPAP 85
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 86 ILR-ESF----NLLKIGITYSLFLEAFKIGS 111
>gi|425442753|ref|ZP_18822990.1| Ycf20-like protein [Microcystis aeruginosa PCC 9717]
gi|425466273|ref|ZP_18845576.1| Ycf20-like protein [Microcystis aeruginosa PCC 9809]
gi|389716121|emb|CCH99606.1| Ycf20-like protein [Microcystis aeruginosa PCC 9717]
gi|389831302|emb|CCI26073.1| Ycf20-like protein [Microcystis aeruginosa PCC 9809]
Length = 113
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG++V E I ++Y+K P+
Sbjct: 28 RRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLVLFTEAISRIVYRKSSQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 88 ILR-ESF----NLLKIGITYSLFLEAFKIGS 113
>gi|153805609|ref|YP_001382192.1| hypothetical chloroplast RF20 [Leptosira terrestris]
gi|134270140|gb|ABO69329.1| hypothetical chloroplast RF20 [Leptosira terrestris]
Length = 127
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 5/96 (5%)
Query: 113 NIARKNLPMKIFLLLLGFYTANALATILGQTGD---WDVLVAGVVVAAIEGIGMLMYKKP 169
NI K P+ +F +L GF N T L + + WDVL+ G ++ E I L+Y K
Sbjct: 32 NIISKRFPITLFAILSGFIIGNVFGTFLTKLREFICWDVLILGFILIFCEFINFLIYTKI 91
Query: 170 PSSTGR-LQSFVV-MMNYWKAGVCLGLFVDAFKLGS 203
+ L F + ++N +K G+ G FVDAFK+GS
Sbjct: 92 FNLKAYFLTRFSIKLLNSFKIGLLFGFFVDAFKVGS 127
>gi|428208508|ref|YP_007092861.1| hypothetical protein Chro_3535 [Chroococcidiopsis thermalis PCC
7203]
gi|428010429|gb|AFY88992.1| hypothetical protein Chro_3535 [Chroococcidiopsis thermalis PCC
7203]
Length = 110
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 6/107 (5%)
Query: 97 LGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVA 156
LG I +A R + L + I LL GF+ +A +TI GQ DWD++ AGV+VA
Sbjct: 10 LGAIAAATQRWFFNPW----RRLSVLIVSLLFGFFLGSAFSTIAGQAADWDIIGAGVLVA 65
Query: 157 AIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
E I + Y + + R ++ +N K G+ LF++AFKLGS
Sbjct: 66 GSEIIDRIYYNRIWQA--RRGLWLEALNALKIGLTYSLFLEAFKLGS 110
>gi|434397744|ref|YP_007131748.1| hypothetical protein Sta7437_1212 [Stanieria cyanosphaera PCC 7437]
gi|428268841|gb|AFZ34782.1| hypothetical protein Sta7437_1212 [Stanieria cyanosphaera PCC 7437]
Length = 116
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 59/113 (52%), Gaps = 4/113 (3%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V G ++ + + + I +L GF+ +A+AT GQ WDV+ A ++
Sbjct: 4 TRLNSLVDVVGARIELLFSNPWRRFALSIISILFGFFMGSAIATTAGQDASWDVIGAALI 63
Query: 155 VAAIEGIGMLMYKKP----PSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E + +Y++P + R V ++N +K G+ LF++AFKLGS
Sbjct: 64 LFFTELLSNFVYRRPSRDRTETNLRRPLLVDVVNLFKIGLTYSLFLEAFKLGS 116
>gi|428306677|ref|YP_007143502.1| hypothetical protein Cri9333_3154 [Crinalium epipsammum PCC 9333]
gi|428248212|gb|AFZ13992.1| protein of unknown function DUF565 [Crinalium epipsammum PCC 9333]
Length = 118
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY---KKPPSSTGRLQS--FVV-M 182
G + A+ T GQT +WDV+ AGV+V E + +Y ++PP++ L FV +
Sbjct: 38 GIFLGTAIPTTSGQTAEWDVVSAGVLVIFTEAVSRFVYGRNRRPPANEEALSRGLFVAEL 97
Query: 183 MNYWKAGVCLGLFVDAFKLGS 203
+N +K G+ LFV+AFKLGS
Sbjct: 98 LNSFKIGLIYSLFVEAFKLGS 118
>gi|434393240|ref|YP_007128187.1| hypothetical protein Glo7428_2520 [Gloeocapsa sp. PCC 7428]
gi|428265081|gb|AFZ31027.1| hypothetical protein Glo7428_2520 [Gloeocapsa sp. PCC 7428]
Length = 110
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 61/111 (54%), Gaps = 6/111 (5%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL + ++ QL + L + + +L GF+ +A++TI GQ DWD++ AGV+
Sbjct: 4 TRLNDLFNSMAGQLGRWFFNPWRKLSLLVISVLFGFFLGSAISTIAGQAADWDIIGAGVL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQS--FVVMMNYWKAGVCLGLFVDAFKLGS 203
VA E + + Y S T R Q ++ +N K G+ LF++AFKLGS
Sbjct: 64 VAWSEIVDRIYY----SRTWRKQRGLWLEALNAGKIGLTYSLFLEAFKLGS 110
>gi|422304172|ref|ZP_16391521.1| Ycf20-like protein [Microcystis aeruginosa PCC 9806]
gi|389790773|emb|CCI13392.1| Ycf20-like protein [Microcystis aeruginosa PCC 9806]
Length = 113
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG++V E I ++Y+K P+
Sbjct: 28 RRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLVLFTEVISRIVYRKSSQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 88 ILR-ESF----NLLKIGITYSLFLEAFKIGS 113
>gi|1351745|sp|P48409.1|YCF20_GALSU RecName: Full=Uncharacterized protein ycf20
gi|282698|pir||S25307 probable allophycocyanin linker protein - red alga (Cyanidium
caldarium) chloroplast
gi|17971|emb|CAA40532.1| apcB L9.5 [Cyanidium caldarium]
gi|256563|gb|AAB50632.1| putative 9.5 kda allophycocyanin linker protein [Cyanidium
caldarium]
Length = 83
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Query: 127 LLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYW 186
LLG +++N L TI QTGDW + + ++A E I + Y + T + + + N +
Sbjct: 9 LLGIFSSNLLCTIYTQTGDWSLYLTSCIIALYEIISYISYNQFIKKTHK--NIINCFNGF 66
Query: 187 KAGVCLGLFVDAFKLGS 203
K G+ GL++DAFKLGS
Sbjct: 67 KIGLIYGLYLDAFKLGS 83
>gi|330040530|ref|XP_003239945.1| hypothetical protein CPARA_3gp389 [Cryptomonas paramecium]
gi|327206871|gb|AEA39047.1| hypothetical protein CPARA_3gp389 [Cryptomonas paramecium]
Length = 137
Score = 56.6 bits (135), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 43/81 (53%), Gaps = 5/81 (6%)
Query: 123 IFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVM 182
IF L GF+ AN+ TI+G DWD L A +++ IE Y +S + +
Sbjct: 62 IFFELFGFFLANSAITIVGSVADWDPLAAAIMLCWIEFFNCKFYSSDFNSI-----YFNL 116
Query: 183 MNYWKAGVCLGLFVDAFKLGS 203
+N +K G+ G+FVDA KL S
Sbjct: 117 VNNFKIGIIFGIFVDALKLTS 137
>gi|425446456|ref|ZP_18826460.1| Ycf20-like protein [Microcystis aeruginosa PCC 9443]
gi|425456084|ref|ZP_18835795.1| Ycf20-like protein [Microcystis aeruginosa PCC 9807]
gi|389733318|emb|CCI02893.1| Ycf20-like protein [Microcystis aeruginosa PCC 9443]
gi|389802905|emb|CCI18105.1| Ycf20-like protein [Microcystis aeruginosa PCC 9807]
Length = 113
Score = 56.2 bits (134), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKK----PPSS 172
+ + ++I +L G ++ A+ T GQT WDV AG+++ E I ++Y+K P+
Sbjct: 28 RRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLLLFTEAISRIVYRKSSQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 88 ILR-ESF----NLLKIGITYSLFLEAFKIGS 113
>gi|390437653|ref|ZP_10226185.1| Ycf20-like protein [Microcystis sp. T1-4]
gi|389838936|emb|CCI30307.1| Ycf20-like protein [Microcystis sp. T1-4]
Length = 113
Score = 56.2 bits (134), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG+++ E I ++Y+K P+
Sbjct: 28 RRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLLLFTEAISRIVYRKSAQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 88 ILR-ESF----NLLKIGITYSLFLEAFKIGS 113
>gi|411119550|ref|ZP_11391930.1| Protein of unknown function (DUF565) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711413|gb|EKQ68920.1| Protein of unknown function (DUF565) [Oscillatoriales
cyanobacterium JSC-12]
Length = 110
Score = 55.8 bits (133), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + + L L + + I LL G + A++TI GQ D D+LVA
Sbjct: 1 MQNTRLNTLFDTATQNLSNWLQNPWRRWSVLIISLLFGNFLGTAISTISGQQADLDILVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G++ E + + Y ++ L V +N K GV LFV+AFKLGS
Sbjct: 61 GILTGVTEVVNRIAYGTSAATRRSLP--VQFLNALKVGVVYSLFVEAFKLGS 110
>gi|434389388|ref|YP_007099999.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
gi|428020378|gb|AFY96472.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
Length = 121
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 60/121 (49%), Gaps = 9/121 (7%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + RQ + + L + I LL G + A+A+AT GQ + DV +
Sbjct: 1 MQNTRLNTLFDRLTRQFGRWVENPWRRLSIIIISLLFGNFLASAIATSTGQRANLDVFAS 60
Query: 152 GVVVAAIEGIGMLMYKK---PPSSTGRLQSF------VVMMNYWKAGVCLGLFVDAFKLG 202
+VA +E IG L Y S+ GR Q + ++N K G+ GLFV+AFKLG
Sbjct: 61 LTLVAIVESIGWLTYGTIFGRRSTEGRDQIIGQRPISIAILNSLKLGLMYGLFVEAFKLG 120
Query: 203 S 203
S
Sbjct: 121 S 121
>gi|186685288|ref|YP_001868484.1| hypothetical protein Npun_R5211 [Nostoc punctiforme PCC 73102]
gi|186467740|gb|ACC83541.1| protein of unknown function DUF565 [Nostoc punctiforme PCC 73102]
Length = 110
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ A +L + + L + I L GF+ NA++T GQ + D++VA
Sbjct: 1 MQNTRLNNLLDAIATRLGQWFLNPWRRLSLLIISFLFGFFLGNAVSTTAGQRAELDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G +V E + Y + S R FV +N K G LF++AFKLGS
Sbjct: 61 GFLVVLTEVTSRIFYSQ--SFFTRRSLFVDSLNLLKVGFTYSLFLEAFKLGS 110
>gi|160331763|ref|XP_001712588.1| hypothetical protein HAN_3g461 [Hemiselmis andersenii]
gi|159766037|gb|ABW98263.1| hypothetical protein HAN_3g461 [Hemiselmis andersenii]
Length = 146
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 132 TANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVC 191
+A + TI+G DWD L A V++ IE L Y S+ +L +F+ ++N +K G+
Sbjct: 80 SATSAITIIGSVADWDPLAAAVLLCWIELFTKLFY-----SSEKLTNFLKLLNSFKIGII 134
Query: 192 LGLFVDAFKL 201
LG+FVDAFK+
Sbjct: 135 LGMFVDAFKI 144
>gi|425436198|ref|ZP_18816636.1| Ycf20-like protein [Microcystis aeruginosa PCC 9432]
gi|425449828|ref|ZP_18829661.1| Ycf20-like protein [Microcystis aeruginosa PCC 7941]
gi|159028644|emb|CAO88115.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|389679122|emb|CCH92059.1| Ycf20-like protein [Microcystis aeruginosa PCC 9432]
gi|389769599|emb|CCI05595.1| Ycf20-like protein [Microcystis aeruginosa PCC 7941]
Length = 113
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 50/91 (54%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG+++ E ++Y+K P+
Sbjct: 28 RRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLLLFTEATSRIVYRKSSQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 88 ILR-ESF----NLLKIGITYSLFLEAFKIGS 113
>gi|254417601|ref|ZP_05031338.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196175623|gb|EDX70650.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 118
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 47/79 (59%), Gaps = 6/79 (7%)
Query: 131 YTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY----KKPP--SSTGRLQSFVVMMN 184
+ +A++T GQT DWDV+ AG+++ E + ++Y ++ P S RL F+ +N
Sbjct: 40 FLGSAISTTAGQTADWDVIAAGILIIFTETVSRIVYGRNRQRLPSLSRASRLSWFLETVN 99
Query: 185 YWKAGVCLGLFVDAFKLGS 203
K G+ +F++AFKLGS
Sbjct: 100 ALKIGLVYSMFLEAFKLGS 118
>gi|440752489|ref|ZP_20931692.1| putative protein Ycf20 [Microcystis aeruginosa TAIHU98]
gi|443648037|ref|ZP_21129867.1| putative protein Ycf20 [Microcystis aeruginosa DIANCHI905]
gi|440176982|gb|ELP56255.1| putative protein Ycf20 [Microcystis aeruginosa TAIHU98]
gi|443335303|gb|ELS49778.1| putative protein Ycf20 [Microcystis aeruginosa DIANCHI905]
Length = 111
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 50/91 (54%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG+++ E ++Y+K P+
Sbjct: 26 RRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLLLFTEATSRIVYRKSSQAKPAP 85
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +SF N K G+ LF++AFK+GS
Sbjct: 86 ILR-ESF----NLLKIGITYSLFLEAFKIGS 111
>gi|443328441|ref|ZP_21057038.1| Protein of unknown function (DUF565) [Xenococcus sp. PCC 7305]
gi|442791895|gb|ELS01385.1| Protein of unknown function (DUF565) [Xenococcus sp. PCC 7305]
Length = 114
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 2/111 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V Q+ N + + + I LL+GF+ +A+ + GQ WD+ A V+
Sbjct: 4 TRLNELVDTAAGQIELLFNNPWRRISLNIIALLMGFFMGSAIVSSAGQDAVWDITGAAVL 63
Query: 155 VAAIEGIGMLMY--KKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E +Y K S+ R F+ ++N +K G+ LF++AFKL S
Sbjct: 64 LIFAELSSRWIYGQNKKSSAIIRQSLFLDILNKFKIGITYSLFLEAFKLNS 114
>gi|255072633|ref|XP_002499991.1| predicted protein [Micromonas sp. RCC299]
gi|226515253|gb|ACO61249.1| predicted protein [Micromonas sp. RCC299]
Length = 181
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 95 TRLGRIVSAGGRQLLEKLNIAR-KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGV 153
TRLG I++ ++L + R + L KI GFY+AN L+ G G DV+
Sbjct: 77 TRLGSILTTIPQRLDLYFRVRRGRKLLWKIIFAATGFYSANVLSLTFGVLGINDVVAGAF 136
Query: 154 VVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+A E + YKK ++ G+ F +N++K G+C L DAFKLGS
Sbjct: 137 CLAFAEIVTRSFYKK--TNPGKYWDF---LNWFKLGLCYALITDAFKLGS 181
>gi|425462004|ref|ZP_18841478.1| Ycf20-like protein [Microcystis aeruginosa PCC 9808]
gi|389825104|emb|CCI25466.1| Ycf20-like protein [Microcystis aeruginosa PCC 9808]
Length = 113
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + ++I +L G ++ A+ T GQT WDV AG+++ E ++Y+K P+
Sbjct: 28 RRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLLLFTEATSRIVYRKSSQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +N K G+ LF++AFK+GS
Sbjct: 88 ILRES-----LNLLKIGITYSLFLEAFKIGS 113
>gi|443475153|ref|ZP_21065112.1| protein of unknown function DUF565 [Pseudanabaena biceps PCC 7429]
gi|443020076|gb|ELS34079.1| protein of unknown function DUF565 [Pseudanabaena biceps PCC 7429]
Length = 117
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 5/117 (4%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL +IV+ R+L + + + + LLLGF+ A+ ++T G + D+L A
Sbjct: 1 MQNTRLSQIVNINIRRLSQWSSNPWRRTSLIAISLLLGFFLASVISTSTGAKSELDILAA 60
Query: 152 GVVVAAIEGIGMLMY--KKPPSSTGRLQSFVV---MMNYWKAGVCLGLFVDAFKLGS 203
+ V +E +Y K+ S G + + M+N K G+ GLF++AFKLGS
Sbjct: 61 AITVFFVELTNRFVYSQKRLTRSDGTITPRPLTSEMLNSLKLGITYGLFLEAFKLGS 117
>gi|218246762|ref|YP_002372133.1| hypothetical protein PCC8801_1939 [Cyanothece sp. PCC 8801]
gi|257059804|ref|YP_003137692.1| hypothetical protein Cyan8802_1966 [Cyanothece sp. PCC 8802]
gi|218167240|gb|ACK65977.1| protein of unknown function DUF565 [Cyanothece sp. PCC 8801]
gi|256589970|gb|ACV00857.1| protein of unknown function DUF565 [Cyanothece sp. PCC 8802]
Length = 111
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 1/75 (1%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
GF+ A+ T +GQ+ WD+ VAG + E I Y + + RL ++ M+N +K
Sbjct: 38 GFFVGQAVTTSVGQSAYWDITVAGFFLLFTEFISRTFYSRLSINNKRL-FWLSMLNTFKM 96
Query: 189 GVCLGLFVDAFKLGS 203
GV GL+++A KLGS
Sbjct: 97 GVIYGLYLEALKLGS 111
>gi|67920532|ref|ZP_00514052.1| Protein of unknown function DUF565 [Crocosphaera watsonii WH 8501]
gi|67858016|gb|EAM53255.1| Protein of unknown function DUF565 [Crocosphaera watsonii WH 8501]
Length = 110
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V + E + + + + + +LLGF+ A++ GQ WD+ V +
Sbjct: 4 TRLNTLVEVTQTKFNETFSNPWRRISLSLISVLLGFFVGQAVSITAGQQTYWDITVGIFL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ EGI + Y + GR S + +N +K GV L+V+A KLGS
Sbjct: 64 LIFTEGISRITYSRSKKE-GRSLSLDI-LNLFKIGVTYSLYVEALKLGS 110
>gi|428779733|ref|YP_007171519.1| hypothetical protein Dacsa_1477 [Dactylococcopsis salina PCC 8305]
gi|428694012|gb|AFZ50162.1| Protein of unknown function (DUF565) [Dactylococcopsis salina PCC
8305]
Length = 112
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 43/75 (57%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
G + A A+ T GQ G D++VA V++ EGI +Y++ S+ R F+ ++N +K
Sbjct: 38 GIFIAVAIVTSAGQNGRLDIIVAAVLLIFTEGISWFVYRRYSSTDERNLGFLEVLNTFKI 97
Query: 189 GVCLGLFVDAFKLGS 203
G L++ AF LGS
Sbjct: 98 GFTYSLYIQAFILGS 112
>gi|119512830|ref|ZP_01631897.1| hypothetical protein N9414_02411 [Nodularia spumigena CCY9414]
gi|119462503|gb|EAW43473.1| hypothetical protein N9414_02411 [Nodularia spumigena CCY9414]
Length = 110
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + + R+L + + L + + GF+ A++TI GQ G D+++A
Sbjct: 1 MQNTRLTNLFDSIARRLGQWFLNPWRRLSLLLINFFFGFFLGTAISTIAGQRGLLDIVIA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G++V E + Y PS + V +N K G +FV++FKLGS
Sbjct: 61 GILVVLTEVTSRIFYS--PSFLSKRSLLVESLNILKVGFIYSMFVESFKLGS 110
>gi|425469331|ref|ZP_18848277.1| Ycf20-like protein [Microcystis aeruginosa PCC 9701]
gi|389881508|emb|CCI37952.1| Ycf20-like protein [Microcystis aeruginosa PCC 9701]
Length = 113
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 9/91 (9%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP----PSS 172
+ + +++ +L G ++ A+ T GQT WDV AG+++ E ++Y+K P+
Sbjct: 28 RRISLQLICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLLLFTEATSRIVYRKSSQAKPAP 87
Query: 173 TGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
R +N K G+ LF++AFK+GS
Sbjct: 88 ILRES-----LNLLKIGITYSLFLEAFKIGS 113
>gi|449020096|dbj|BAM83498.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 206
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 43/77 (55%), Gaps = 5/77 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL G ++A +++TI+G +WD + V+V +E I Y++P R + + ++N
Sbjct: 134 LLFGAFSATSISTIVGALAEWDPIAGAVIVFIMEWIATQYYRRP-----RPSNIIRILNA 188
Query: 186 WKAGVCLGLFVDAFKLG 202
++ G+ G +DA KL
Sbjct: 189 YQVGLTFGFILDALKLA 205
>gi|119485209|ref|ZP_01619594.1| hypothetical protein L8106_07164 [Lyngbya sp. PCC 8106]
gi|119457437|gb|EAW38562.1| hypothetical protein L8106_07164 [Lyngbya sp. PCC 8106]
Length = 110
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 56/112 (50%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL R+ + +L L + + + +L G ++ ++TI GQ +WDV A
Sbjct: 1 MQNTRLNRLTNVLSERLGRWLRNPWRRISLLTISVLFGSFSGIVISTIAGQQAEWDVTAA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+++ EGI L Y+ P R ++ +N +K GV LF+ AF LGS
Sbjct: 61 FILLLITEGINWLTYRSKP--LIRQPLWIENLNAFKIGVVYSLFLLAFLLGS 110
>gi|434405265|ref|YP_007148150.1| Protein of unknown function (DUF565) [Cylindrospermum stagnale PCC
7417]
gi|428259520|gb|AFZ25470.1| Protein of unknown function (DUF565) [Cylindrospermum stagnale PCC
7417]
Length = 110
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + RQL + + L + + L GF+ A++T GQ + D++VA
Sbjct: 1 MQNTRLNNLFDTIARQLGQWFINPWRRLSLLVISFLFGFFLGTAVSTTAGQKAELDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G +V E + Y + R +V +N K G LF++AFKLGS
Sbjct: 61 GFLVLLTEVTSRIFYSR--GFLARRSLWVESLNLLKVGFIYSLFIEAFKLGS 110
>gi|416377790|ref|ZP_11683661.1| hypothetical protein CWATWH0003_0506 [Crocosphaera watsonii WH
0003]
gi|357266150|gb|EHJ14818.1| hypothetical protein CWATWH0003_0506 [Crocosphaera watsonii WH
0003]
Length = 110
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V + E + + + + + +LLGF+ A++ GQ WD+ V +
Sbjct: 4 TRLNTLVEVTQTKFNETFSNPWRRISLSLISVLLGFFVGQAVSITAGQKTYWDINVGIFL 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ EGI + Y + GR S + +N +K GV L+V+A KLGS
Sbjct: 64 LIFTEGISRITYSRSKKE-GRSLSLDI-LNLFKIGVTYSLYVEALKLGS 110
>gi|11467646|ref|NP_050698.1| hypothetical chloroplast RF20 [Guillardia theta]
gi|6136609|sp|O78445.1|YCF20_GUITH RecName: Full=Uncharacterized protein ycf20
gi|3602971|gb|AAC35632.1| hypothetical chloroplast RF20 (chloroplast) [Guillardia theta]
Length = 64
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 31/49 (63%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTG 174
+L GF+ + L+TIL QTGDW +L A ++VA IE I YKK P G
Sbjct: 11 ILFGFFLSTLLSTILSQTGDWSILAASILVATIELINKNKYKKKPLIIG 59
>gi|434389053|ref|YP_007099664.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
gi|428020043|gb|AFY96137.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
Length = 120
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY--------KK 168
+ + + I LL G + A ++T GQ D D+ VA ++V +E + + Y
Sbjct: 26 RRISLVIISLLSGNFLATTVSTTTGQKADLDITVALILVTIVETVSWIAYGSNLGPRRPD 85
Query: 169 PPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
P + G+ ++ ++N K G+ GLFV+AFKLGS
Sbjct: 86 PEAILGQRPLWIAILNSLKLGLIYGLFVEAFKLGS 120
>gi|427707469|ref|YP_007049846.1| hypothetical protein Nos7107_2073 [Nostoc sp. PCC 7107]
gi|427359974|gb|AFY42696.1| hypothetical protein Nos7107_2073 [Nostoc sp. PCC 7107]
Length = 110
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TR+ ++ A L + + L + I L GF+ A++TI GQ + D+ VA
Sbjct: 1 MQNTRINNLLDAIASSLGQLFRNPWRRLSILIISFLFGFFLGTAISTIAGQKAELDIFVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+V E I + Y + + L ++ +N K G LFV+AFKLGS
Sbjct: 61 AFLVLLTELINRIFYSRNLFAKRPL--WLECLNTLKVGFTYSLFVEAFKLGS 110
>gi|440682864|ref|YP_007157659.1| protein of unknown function DUF565 [Anabaena cylindrica PCC 7122]
gi|428679983|gb|AFZ58749.1| protein of unknown function DUF565 [Anabaena cylindrica PCC 7122]
Length = 110
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ R L + + L + + L GF+ +A++T GQ + D++VA
Sbjct: 1 MQNTRLNNLLDTIARLLGQWFVNPWRRLSLLVINFLFGFFLGSAVSTTTGQKAELDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
V+V E + Y + + L +V +N+ K G LF++AFKLGS
Sbjct: 61 AVLVLLTEITSRIFYSRTFFAKQAL--WVEALNFLKVGFIYSLFLEAFKLGS 110
>gi|465500|sp|P34813.1|YCF20_AGLNE RecName: Full=Uncharacterized protein ycf20
gi|304023|gb|AAA18512.1| unknown [Aglaothamnion neglectum]
Length = 100
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 105 GRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGML 164
G+ L +N NL +++ L LG++ + +I +TGDW ++ A ++V E + L
Sbjct: 9 GQYLYYHIN----NLSLQLISLFLGYFFCTIICSIPKETGDWGLITALLIVGINETMSAL 64
Query: 165 MYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+Y + V +N K G+ GLFVD+FKLGS
Sbjct: 65 IYSYKKYENNII---VKTINGIKIGIIYGLFVDSFKLGS 100
>gi|427736638|ref|YP_007056182.1| hypothetical protein Riv7116_3162 [Rivularia sp. PCC 7116]
gi|427371679|gb|AFY55635.1| Protein of unknown function (DUF565) [Rivularia sp. PCC 7116]
Length = 110
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 4/113 (3%)
Query: 92 LGGTRLGRIVSAGGRQL-LEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLV 150
+ TRL + R L L LN R+ + + + L GF+ AT GQ D++V
Sbjct: 1 MQNTRLNSLFDLIVRNLALWFLNPWRR-ISLLMISFLFGFFMGTVAATTAGQRAVLDIVV 59
Query: 151 AGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
A +V E ++Y++PP R +V +N+ K G LFV+AFKLGS
Sbjct: 60 AAFLVFLTEIGSRIVYRRPPKE--RRLLWVESLNFLKVGFIYSLFVEAFKLGS 110
>gi|414078888|ref|YP_006998206.1| hypothetical protein ANA_C13740 [Anabaena sp. 90]
gi|413972304|gb|AFW96393.1| hypothetical protein ANA_C13740 [Anabaena sp. 90]
Length = 110
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 55/112 (49%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ L E + L + + L GF+ +A++T GQ + D++VA
Sbjct: 1 MQNTRLNNLLETIFSSLSEWFLNPWRRLSLILISFLFGFFLGSAVSTTAGQKAELDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G +V E + Y + S R +V ++NY K G LF++A KLGS
Sbjct: 61 GFLVLLTEITSRIFYSR--SFFVRQAFWVEILNYLKVGFIYSLFLEALKLGS 110
>gi|220909685|ref|YP_002484996.1| hypothetical protein Cyan7425_4325 [Cyanothece sp. PCC 7425]
gi|219866296|gb|ACL46635.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7425]
Length = 110
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL ++ G Q ++L + L + + LL G Y ALA+ GQ G+ D++ + +V
Sbjct: 4 TRLSTLLDRLGTQFRQQLRNPWRRLSVLVMSLLFGVYLGTALASTAGQLGEIDIVASALV 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
E I L Y + L F ++N K GV GLF+ AF LGS
Sbjct: 64 ALVAEVISWLFYSDRWNFKATL--FGEILNAVKFGVLYGLFLIAFLLGS 110
>gi|158337933|ref|YP_001519109.1| hypothetical protein AM1_4820 [Acaryochloris marina MBIC11017]
gi|359463702|ref|ZP_09252265.1| hypothetical protein ACCM5_33604 [Acaryochloris sp. CCMEE 5410]
gi|158308174|gb|ABW29791.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 110
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 56/109 (51%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +VS G+QL ++L + + + + LL G Y + A I GQ D+ VA +V
Sbjct: 4 TRLNTLVSGLGQQLKQELRNPWRRISVLVIALLFGIYIGVSFAAIAGQLAYLDITVAALV 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E + + ++ ++ L +N +K G+ GLFV AF LGS
Sbjct: 64 LLFTEVVNWIFHRNRGNARKALWG--EALNTFKLGLTYGLFVIAFMLGS 110
>gi|308807413|ref|XP_003081017.1| unnamed protein product [Ostreococcus tauri]
gi|116059479|emb|CAL55186.1| unnamed protein product [Ostreococcus tauri]
Length = 178
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 58/112 (51%), Gaps = 10/112 (8%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFL---LLLGFYTANALATILGQTGDWDVLVA 151
TRL ++++ R+L +L AR++ + ++ + GFY AN + G G DV+
Sbjct: 74 TRLVSVLTSLPRRL--ELYFARRSYRVTLWRAISVFFGFYLANVMTLSFGALGINDVVAG 131
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ VA E + + Y ST + ++ +N++K G C L DAFKLGS
Sbjct: 132 ALSVAFYEIVTRIYY-----STLQQNKWLEFVNWFKVGYCYSLIADAFKLGS 178
>gi|409052509|ref|YP_006666467.1| hypothetical chloroplast RF20 (chloroplast) [Trebouxiophyceae sp.
MX-AZ01]
gi|402746127|gb|AFQ93822.1| hypothetical chloroplast RF20 (chloroplast) [Trebouxiophyceae sp.
MX-AZ01]
Length = 127
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 16/123 (13%)
Query: 96 RLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILG---QTGDWDVLVAG 152
RL I+ L ++ +R + +F LLLGF AN T LG + WD ++
Sbjct: 6 RLETILRQASLLLQRQIGRSRLSFSAVLFSLLLGFVAANLFGTALGMLRKAASWDGMILL 65
Query: 153 VVVAAIEGIGMLMYKKPPSSTG------------RLQSFVVMMNYWKAGVCLGLFVDAFK 200
++ IE L Y PPS+ G R + + N +K GV LG FV+AFK
Sbjct: 66 GLIVLIEATNHLAYC-PPSAQGGTSEVAPLPLLTRPIAASRLANSFKVGVLLGFFVEAFK 124
Query: 201 LGS 203
+GS
Sbjct: 125 VGS 127
>gi|224122468|ref|XP_002318844.1| predicted protein [Populus trichocarpa]
gi|222859517|gb|EEE97064.1| predicted protein [Populus trichocarpa]
Length = 155
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A V V E + Y +P ++ + ++N
Sbjct: 83 LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSRP-----KVTFPIALLNN 137
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 138 FKMGFTYGLFIDAFKLAS 155
>gi|113477257|ref|YP_723318.1| hypothetical protein Tery_3800 [Trichodesmium erythraeum IMS101]
gi|110168305|gb|ABG52845.1| protein of unknown function DUF565 [Trichodesmium erythraeum
IMS101]
Length = 110
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 61/114 (53%), Gaps = 6/114 (5%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL R+ + ++ + L + + + + LL G + A+A++TI GQ G DVL
Sbjct: 1 MQSTRLNRLFNVILKRFEQWLVNPWRRISILLISLLFGNFAASAVSTIAGQEGYLDVLYT 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVV--MMNYWKAGVCLGLFVDAFKLGS 203
+ E + L+Y + G++ F++ ++N K G GLF++AFKLGS
Sbjct: 61 LACLFITEALNWLVYGRR----GKVARFLIIDILNSLKIGFIYGLFLEAFKLGS 110
>gi|212274997|ref|NP_001130729.1| hypothetical protein [Zea mays]
gi|194689966|gb|ACF79067.1| unknown [Zea mays]
gi|195626582|gb|ACG35121.1| hypothetical protein [Zea mays]
gi|223950487|gb|ACN29327.1| unknown [Zea mays]
gi|414586642|tpg|DAA37213.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586643|tpg|DAA37214.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586644|tpg|DAA37215.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586645|tpg|DAA37216.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586646|tpg|DAA37217.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
Length = 155
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A V V E + Y +P + + ++N
Sbjct: 83 LLGGFYVAQTISLSFGTLGVNDVIAAVVCVLLTEYVTKFYYSRPKVTFP-----IALLNN 137
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 138 FKMGFTYGLFIDAFKLAS 155
>gi|449451787|ref|XP_004143642.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus]
gi|449530953|ref|XP_004172456.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus]
Length = 156
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A V V E + Y +P ++ + ++N
Sbjct: 84 LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSRP-----KVTFPIALLNN 138
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 139 FKMGFTYGLFIDAFKLAS 156
>gi|255539575|ref|XP_002510852.1| conserved hypothetical protein [Ricinus communis]
gi|223549967|gb|EEF51454.1| conserved hypothetical protein [Ricinus communis]
Length = 155
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A + V E + Y +P ++ V ++N
Sbjct: 83 LLGGFYVAQTISLSFGALGVNDVIAAVICVLLTEYVTRFYYSRP-----KVTFPVALLNN 137
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 138 FKMGFTYGLFIDAFKLAS 155
>gi|229915469|gb|ACQ90814.1| hypothetical chloroplast RF20 [Oocystis solitaria]
Length = 124
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATIL---GQTGDWDVLVA 151
TR R + L EK + N IF + GF N T L WD +
Sbjct: 5 TRFFRFLKENF-LLNEKFFFKKINFSSAIFFIFSGFLFGNLFCTFLICVRSIFIWDGFII 63
Query: 152 GVVVAAIEGIGMLMYKK-----------PPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFK 200
+ IE I ++Y K P S +L F+ NY+K G LG F+DAFK
Sbjct: 64 LNFILFIEIINYIIYHKKGRFFLLFWNFPFSFQKKL--FLKNFNYFKIGFLLGFFIDAFK 121
Query: 201 LGS 203
+GS
Sbjct: 122 VGS 124
>gi|399949854|gb|AFP65511.1| hypothetical protein CMESO_344 [Chroomonas mesostigmatica CCMP1168]
Length = 152
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 40/70 (57%), Gaps = 5/70 (7%)
Query: 132 TANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVC 191
+A + TI+G DWD L A V++ IE + Y +T + + ++N +K G+
Sbjct: 86 SATSAITIIGSVADWDPLAAAVMLCWIELFTKIFY-----ATEKSTLYFRLINSFKIGIN 140
Query: 192 LGLFVDAFKL 201
LG+FVDAFK+
Sbjct: 141 LGMFVDAFKI 150
>gi|356566265|ref|XP_003551354.1| PREDICTED: uncharacterized protein ycf20-like [Glycine max]
Length = 151
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A + V E + Y +P ++ V ++N
Sbjct: 79 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTKFYYSRP-----KVTFPVALLNN 133
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 134 FKMGFTYGLFIDAFKLAS 151
>gi|162606480|ref|XP_001713270.1| hypothetical protein GTHECHR2163 [Guillardia theta]
gi|12580736|emb|CAC27054.1| hypothetical protein [Guillardia theta]
Length = 125
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 5/69 (7%)
Query: 133 ANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCL 192
A + TI+G DWD L A V++ IE + YK + +N +K G+ L
Sbjct: 60 ATSAITIIGSVADWDPLAAAVMLCWIEVFNKIFYKNISK-----KKIFNKINIFKIGINL 114
Query: 193 GLFVDAFKL 201
G+FVDAFKL
Sbjct: 115 GIFVDAFKL 123
>gi|71842309|ref|YP_277397.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
gi|60101552|gb|AAX13896.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
gi|336286220|gb|AEI29556.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
Length = 90
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 46/90 (51%), Gaps = 5/90 (5%)
Query: 119 LPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYK-----KPPSST 173
+ +KI GF A+ T +G+ DW +L A + VA++E + Y + SS
Sbjct: 1 MLLKITCFFGGFLLASIFDTTMGEFQDWSILGAALTVASVETASKIYYSFISRVQLLSSK 60
Query: 174 GRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ ++N +K G+ GL VDAFKLGS
Sbjct: 61 TYYSETLFLLNSFKLGLIYGLIVDAFKLGS 90
>gi|284928790|ref|YP_003421312.1| hypothetical protein UCYN_02050 [cyanobacterium UCYN-A]
gi|284809249|gb|ADB94954.1| Protein of unknown function (DUF565) [cyanobacterium UCYN-A]
Length = 110
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 6/111 (5%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL + L + +NL + +L+GF+ A+A I GQ W++ V ++
Sbjct: 4 TRLNTLYETIYNNLSRTFSNPWRNLFLNFTSILVGFFIGQAIALIAGQESYWEITVGIIL 63
Query: 155 VAAIEGIGMLMY--KKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E ++Y K+ L++F N++K G+ L+++A KLGS
Sbjct: 64 LVFTEVSSKIIYGRKRTKKKILWLETF----NFFKMGIVYSLYIEALKLGS 110
>gi|51210016|ref|YP_063680.1| conserved hypothetical plastid protein [Gracilaria tenuistipitata
var. liui]
gi|50657770|gb|AAT79755.1| conserved hypothetical plastid protein [Gracilaria tenuistipitata
var. liui]
Length = 70
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 2/71 (2%)
Query: 133 ANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCL 192
+ +T+ QTGDW ++ ++VA+ E I +YK ++ + V N + G+
Sbjct: 2 STVFSTLPSQTGDWGIISGSIIVASNEIISKNIYKYMKNNYTFM--IVHTFNCIRVGIIY 59
Query: 193 GLFVDAFKLGS 203
GLFVDAFKLGS
Sbjct: 60 GLFVDAFKLGS 70
>gi|115459106|ref|NP_001053153.1| Os04g0488600 [Oryza sativa Japonica Group]
gi|38344242|emb|CAD41335.2| OJ991113_30.19 [Oryza sativa Japonica Group]
gi|113564724|dbj|BAF15067.1| Os04g0488600 [Oryza sativa Japonica Group]
gi|215707021|dbj|BAG93481.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737413|dbj|BAG96543.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 155
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A V V E + Y +P ++ ++N
Sbjct: 83 LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTKFYYSRP-----KVTFPFALLNN 137
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 138 FKMGFTYGLFIDAFKLAS 155
>gi|428220739|ref|YP_007104909.1| hypothetical protein Syn7502_00625 [Synechococcus sp. PCC 7502]
gi|427994079|gb|AFY72774.1| Protein of unknown function (DUF565) [Synechococcus sp. PCC 7502]
Length = 109
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 3/112 (2%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL I + +L + + + + I LLGF+ A+ ++T+ G +D VA
Sbjct: 1 MQNTRLTVITNTSLSRLERWFSNPWRRISLYIISPLLGFFLASIISTVSGAKSAFDPYVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
++ E I + Y++ + + ++ ++N K G+ G+F++AFKLGS
Sbjct: 61 AALLLVTELISFVAYRR---TKEKRSLYLEILNLLKIGLIYGMFLEAFKLGS 109
>gi|357164342|ref|XP_003580023.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
Length = 155
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G DV+ A V V E + Y +P ++ V ++N
Sbjct: 83 LLGGFYVAQTISLSFGALAVNDVIAAVVCVLLTEYVTKFYYSRP-----KVTFPVALLNN 137
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 138 FKMGFTYGLFIDAFKLAS 155
>gi|125548804|gb|EAY94626.1| hypothetical protein OsI_16403 [Oryza sativa Indica Group]
gi|125590821|gb|EAZ31171.1| hypothetical protein OsJ_15270 [Oryza sativa Japonica Group]
Length = 190
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A V V E + Y +P ++ ++N
Sbjct: 118 LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTKFYYSRP-----KVTFPFALLNN 172
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 173 FKMGFTYGLFIDAFKLAS 190
>gi|225469306|ref|XP_002269823.1| PREDICTED: ycf20-like protein isoform 2 [Vitis vinifera]
gi|225469308|ref|XP_002269782.1| PREDICTED: ycf20-like protein isoform 1 [Vitis vinifera]
gi|297741124|emb|CBI31855.3| unnamed protein product [Vitis vinifera]
Length = 155
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A + V E + Y +P ++ + ++N
Sbjct: 83 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLITEYVTRFYYSRP-----KVTFPLALLNN 137
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 138 FKMGFTYGLFIDAFKLAS 155
>gi|452824347|gb|EME31350.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 203
Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 7/92 (7%)
Query: 113 NIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKK-PPS 171
N+ +K L + +L GF A++L I G D L+AGV+ E Y++ PS
Sbjct: 118 NLPKKQLLSGLMAMLFGFTCASSLTLIFGALSFLDPLIAGVLAVCTELYTRYYYQESSPS 177
Query: 172 STGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ RL +N +K GV GL +DAFK+ S
Sbjct: 178 FSLRL------VNAFKIGVMTGLILDAFKISS 203
>gi|428298335|ref|YP_007136641.1| hypothetical protein Cal6303_1627 [Calothrix sp. PCC 6303]
gi|428234879|gb|AFZ00669.1| protein of unknown function DUF565 [Calothrix sp. PCC 6303]
Length = 109
Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 55/114 (48%), Gaps = 7/114 (6%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + A R+ E + L + + L GF+ A++T GQ + D++ A
Sbjct: 1 MQNTRLNNLFDAIARRTQEVFVNPWRRLSLLVISFLSGFFLGTAVSTTAGQKAELDIVAA 60
Query: 152 GVVVAAIEGIGMLMYKK--PPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
++ E + + Y++ +S G + ++N K G+ LF++A KLGS
Sbjct: 61 AFILIITETVSRIYYRRGNGTNSLG-----LEVLNTLKIGLMYSLFIEALKLGS 109
>gi|298493111|ref|YP_003723288.1| hypothetical protein Aazo_5069 ['Nostoc azollae' 0708]
gi|298235029|gb|ADI66165.1| protein of unknown function DUF565 ['Nostoc azollae' 0708]
Length = 110
Score = 45.1 bits (105), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 53/112 (47%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ R L + + + + L GF+ +A++T GQ + D++VA
Sbjct: 1 MQNTRLNNLLDTIARSLGQWFVNPWRRFSLLLISWLFGFFMGSAVSTTAGQKAELDIVVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+V E + Y + + R +V +N K G LF++AFKLGS
Sbjct: 61 AFLVLLTEITSRIFYSR--NFFARQAIWVESLNLLKIGFIYSLFLEAFKLGS 110
>gi|145350586|ref|XP_001419683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579915|gb|ABO97976.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 104
Score = 44.7 bits (104), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 53/111 (47%), Gaps = 10/111 (9%)
Query: 96 RLGRIVSAGGRQLLEKLNIARKNLPMKIFLLL---LGFYTANALATILGQTGDWDVLVAG 152
RL ++++ R+ L R+N + ++ + GFY AN + G G DV+
Sbjct: 1 RLVSVLTSLPRRF--NLYFQRRNYRVTLWRAISAYFGFYLANVMTLSFGAIGINDVVAGA 58
Query: 153 VVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ VA E + + Y + T + F +N++K G C L DAFKLGS
Sbjct: 59 LSVAFFEVVTRIYY--ASAETNKWLEF---LNWFKVGYCYSLIADAFKLGS 104
>gi|412990920|emb|CCO18292.1| predicted protein [Bathycoccus prasinos]
Length = 253
Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 53/112 (47%), Gaps = 10/112 (8%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPM---KIFLLLLGFYTANALATILGQTGDWDVLVA 151
TR+ ++++ ++L L R++ M K + +GFY A+ + G G DV+
Sbjct: 149 TRIASVLTSLPQRL--DLYFQRRSYRMLLWKTIFIFVGFYLASVITLSFGALGINDVVAG 206
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ V E I YK S GR FV N +K G+ L DAFKLGS
Sbjct: 207 ALCVFVYETITRWYYK--TSRPGRKLEFV---NCFKLGLVYSLIADAFKLGS 253
>gi|428211916|ref|YP_007085060.1| hypothetical protein Oscil6304_1433 [Oscillatoria acuminata PCC
6304]
gi|428000297|gb|AFY81140.1| Protein of unknown function (DUF565) [Oscillatoria acuminata PCC
6304]
Length = 110
Score = 44.3 bits (103), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL + +L + L + + + + LLG + A+ TI GQ + DV+VA
Sbjct: 1 MQNTRLNSLFDGTVSELRQSLRNPWRRISVVLISFLLGNFVGTAITTIAGQNANLDVMVA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
V++ E Y+ L + +N +K G+ LFV AF LGS
Sbjct: 61 AVLIVLTEVYSYFFYRIHDQMKRSLLA--QGINSFKIGMIYNLFVLAFTLGS 110
>gi|427713331|ref|YP_007061955.1| hypothetical protein Syn6312_2297 [Synechococcus sp. PCC 6312]
gi|427377460|gb|AFY61412.1| Protein of unknown function (DUF565) [Synechococcus sp. PCC 6312]
Length = 110
Score = 44.3 bits (103), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 6/111 (5%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +++ G L ++L + L + + LL G Y ALA+ GQ G D++ + +V
Sbjct: 4 TRLTTLLNRTGDNLQQQLRNPWRRLALMMIALLFGIYLGAALASTAGQVGYIDIVASALV 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVV--MMNYWKAGVCLGLFVDAFKLGS 203
A E + + Y + Q ++ +N K GV GLF+ AF LGS
Sbjct: 64 AIASEVLSAVFY----GNRWNFQKTILGETLNAIKVGVLYGLFLIAFLLGS 110
>gi|15239166|ref|NP_199119.1| uncharacterized protein [Arabidopsis thaliana]
gi|9757831|dbj|BAB08268.1| unnamed protein product [Arabidopsis thaliana]
gi|88900426|gb|ABD57525.1| At5g43050 [Arabidopsis thaliana]
gi|332007521|gb|AED94904.1| uncharacterized protein [Arabidopsis thaliana]
Length = 158
Score = 43.9 bits (102), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A + V E + Y + + + ++N
Sbjct: 86 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRTTVTFP-----IALLNN 140
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 141 FKMGFTYGLFIDAFKLAS 158
>gi|443323738|ref|ZP_21052741.1| Protein of unknown function (DUF565) [Gloeocapsa sp. PCC 73106]
gi|442786524|gb|ELR96254.1| Protein of unknown function (DUF565) [Gloeocapsa sp. PCC 73106]
Length = 107
Score = 43.9 bits (102), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 9/111 (8%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL +V+A G +L L +N+ + + + GF++ A+ + GQ DV A +
Sbjct: 4 TRLDNLVNALGLRLQNFLANPWRNIVLSLTTVFFGFFSGTAIISTAGQRSLMDVSSAASL 63
Query: 155 VAAIEGIGMLMY--KKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E I +Y KKP T ++N +K GV +F+ AF +GS
Sbjct: 64 LVVFELINRWVYQSKKPRFWTN-------ILNIFKIGVTYSIFLQAFIVGS 107
>gi|297795157|ref|XP_002865463.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311298|gb|EFH41722.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp.
lyrata]
Length = 159
Score = 43.5 bits (101), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A + V E + Y + + + ++N
Sbjct: 87 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRTTVTFP-----IALLNN 141
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 142 FKMGFTYGLFIDAFKLAS 159
>gi|428768705|ref|YP_007160495.1| hypothetical protein Cyan10605_0303 [Cyanobacterium aponinum PCC
10605]
gi|428682984|gb|AFZ52451.1| protein of unknown function DUF565 [Cyanobacterium aponinum PCC
10605]
Length = 110
Score = 43.5 bits (101), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 52/109 (47%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL ++ Q+ LN + L I L G+ T++ L+T L Q G WDV +A
Sbjct: 4 TRLNLLIGNFTNQIKSFLNNPWRKLSFVIIGFLSGYITSDLLSTSLAQAGKWDVPMATTY 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ E M++Y + + + ++N +K G+ GL++ A L S
Sbjct: 64 LIFTEITSMIVYGGSKNRNKIM--WADLLNSFKIGLAFGLYLSAMTLAS 110
>gi|126654810|ref|ZP_01726344.1| hypothetical protein CY0110_10252 [Cyanothece sp. CCY0110]
gi|126623545|gb|EAZ94249.1| hypothetical protein CY0110_10252 [Cyanothece sp. CCY0110]
Length = 110
Score = 43.1 bits (100), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
GF+ +++ GQ WD+ V ++ EGI + Y + L + ++N +K
Sbjct: 38 GFFVGQSVSITAGQEAYWDITVGIFLLIFTEGISRIAYSRSSKKARSLS--LDILNLFKI 95
Query: 189 GVCLGLFVDAFKLGS 203
G+ L+V+A KLGS
Sbjct: 96 GITYSLYVEALKLGS 110
>gi|162134394|gb|ABX82660.1| hypothetical chloroplast RF20 [Trebouxia aggregata]
Length = 114
Score = 42.7 bits (99), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 50/115 (43%), Gaps = 26/115 (22%)
Query: 111 KLNIARKN------LPMKIFLLLLGFYTANALATILGQTGD---WDVLVAGVVVAAIEGI 161
K+ +++KN +F L GF N T L WD L+ +++ +E +
Sbjct: 4 KVYLSKKNSFFKEVFSFALFTLFFGFLIGNLFGTFLNTIRSFIFWDGLIIFIILCIMELL 63
Query: 162 GMLMYKK-------------PPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
++Y K P++ +S +NY K G+ G F+DAFK+GS
Sbjct: 64 NFILYNKAKRFFVIYKKISYEPNNQIIWKS----LNYLKIGIMFGFFIDAFKVGS 114
>gi|428776926|ref|YP_007168713.1| hypothetical protein PCC7418_2350 [Halothece sp. PCC 7418]
gi|428691205|gb|AFZ44499.1| hypothetical protein PCC7418_2350 [Halothece sp. PCC 7418]
Length = 112
Score = 42.7 bits (99), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 39/75 (52%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
G + A+ T GQ G D++VA +++ E + +Y++ + + +MN +K
Sbjct: 38 GIFMGIAIVTSAGQNGRLDIIVAALLLILTEVLSWFVYRQNLEQNEQRSGLIEVMNTFKI 97
Query: 189 GVCLGLFVDAFKLGS 203
G+ L++ AF LGS
Sbjct: 98 GLVYSLYIQAFVLGS 112
>gi|302816212|ref|XP_002989785.1| hypothetical protein SELMODRAFT_49272 [Selaginella moellendorffii]
gi|302816919|ref|XP_002990137.1| hypothetical protein SELMODRAFT_49273 [Selaginella moellendorffii]
gi|300142150|gb|EFJ08854.1| hypothetical protein SELMODRAFT_49273 [Selaginella moellendorffii]
gi|300142351|gb|EFJ09052.1| hypothetical protein SELMODRAFT_49272 [Selaginella moellendorffii]
Length = 104
Score = 42.7 bits (99), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GF+ A ++ G G DV+ + + V E + + + S G V ++N
Sbjct: 32 LLGGFFVAQTISLSFGAVGVNDVIASALCVLLTEYMTRFYFTRQRPSLG-----VALLNN 86
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL S
Sbjct: 87 FKMGFTYGLFIDAFKLAS 104
>gi|294460167|gb|ADE75666.1| unknown [Picea sitchensis]
Length = 200
Score = 42.7 bits (99), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 126 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNY 185
LL GFY A ++ G G DV+ A + V E Y + ++ V ++N
Sbjct: 128 LLGGFYVAQTISLSFGALGVNDVIAAVLCVIVTEYATKFYYTR-----SKVTFPVALLNN 182
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G GLF+DAFKL +
Sbjct: 183 FKMGFTYGLFIDAFKLAN 200
>gi|172037433|ref|YP_001803934.1| hypothetical protein cce_2520 [Cyanothece sp. ATCC 51142]
gi|354553685|ref|ZP_08972991.1| protein of unknown function DUF565 [Cyanothece sp. ATCC 51472]
gi|171698887|gb|ACB51868.1| hypothetical protein cce_2520 [Cyanothece sp. ATCC 51142]
gi|353554402|gb|EHC23792.1| protein of unknown function DUF565 [Cyanothece sp. ATCC 51472]
Length = 110
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 8/78 (10%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY---KKPPSSTGRLQSFVVMMNY 185
GF+ +++ GQ WD+ V ++ EGI + Y KK S G + ++N
Sbjct: 38 GFFVGQSVSITAGQEAYWDITVGIFLLIFTEGISRIAYSQNKKKGRSLG-----LDILNL 92
Query: 186 WKAGVCLGLFVDAFKLGS 203
+K G+ GL+++A KLGS
Sbjct: 93 FKIGMTYGLYIEALKLGS 110
>gi|384248674|gb|EIE22157.1| hypothetical protein COCSUDRAFT_47768 [Coccomyxa subellipsoidea
C-169]
Length = 169
Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 47/111 (42%), Gaps = 6/111 (5%)
Query: 93 GGTRLG-RIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
G TRL RI + R L + + + G+Y AN ++ G DV+ A
Sbjct: 63 GRTRLATRITTVPRRVDLYFADQPARRILWACIAFACGYYCANTVSLSFGALAINDVVAA 122
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLG 202
+ VA E + L Y P + ++ + +K G+ L DAFKLG
Sbjct: 123 ALTVAFCEVVSSLYYAAPKQTLK-----LLFLQCFKIGIICALIADAFKLG 168
>gi|282897978|ref|ZP_06305973.1| Protein of unknown function DUF565 [Raphidiopsis brookii D9]
gi|281197122|gb|EFA72023.1| Protein of unknown function DUF565 [Raphidiopsis brookii D9]
Length = 120
Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ + + + L + + L GF+ + ++T GQ D+ A
Sbjct: 11 MQNTRLNNLLDTIATNVAQWFVNPWRRLCLLLISWLFGFFLGSVVSTTAGQQAQLDIWAA 70
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
V+V E L Y + S + +V +N+ K G+ LF++AFKLGS
Sbjct: 71 AVLVLLTEVASRLFYSRGFFSQRAI--WVESLNWLKIGLIYSLFLEAFKLGS 120
>gi|282899121|ref|ZP_06307102.1| protein of unknown function DUF565 [Cylindrospermopsis raciborskii
CS-505]
gi|281196037|gb|EFA70953.1| protein of unknown function DUF565 [Cylindrospermopsis raciborskii
CS-505]
Length = 110
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 2/112 (1%)
Query: 92 LGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVA 151
+ TRL ++ + + + L + + L GF+ + ++T GQ D+ A
Sbjct: 1 MQNTRLNNLLDTIATNVAQWFINPWRRLCLLLISWLFGFFLGSVVSTTAGQQAQLDIWAA 60
Query: 152 GVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
V+V E L Y + S + +V +N+ K G+ LF++AFKLGS
Sbjct: 61 AVLVLLTEVASRLFYSRGFFSQRAI--WVESLNWLKIGLIYSLFLEAFKLGS 110
>gi|326332371|ref|ZP_08198650.1| glycolate oxidase, subunit GlcD [Nocardioidaceae bacterium Broad-1]
gi|325949860|gb|EGD41921.1| glycolate oxidase, subunit GlcD [Nocardioidaceae bacterium Broad-1]
Length = 488
Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 9/150 (6%)
Query: 12 ILERDVFESSYIRLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLMRRHGWKIAFAL 71
++E V R A + +PS S Q IG A G + ++ +G+ L
Sbjct: 111 VVEPGVINLDVTRAATPEGYYYAPDPS-SQQVCSIGGNVAENSGGAHCLK-YGFTTNHVL 168
Query: 72 DTGGISGSGGQESLNGDGPDLGGTR-LGRIV-SAGGRQLLEKLNIARKNLPMKIFLLLLG 129
++ G + L GD PD G LG +V S G ++ ++ + LP K+ LL+G
Sbjct: 169 AAEFVAADGDRVRLGGDAPDSPGYDLLGAVVGSEGTLGIVTEVTVRLVRLPEKVVTLLVG 228
Query: 130 FYTANALATILGQTGDWDVLVAGVVVAAIE 159
F + + + ++ AG+V AAIE
Sbjct: 229 FDSTDRAGAAVSA-----IIAAGIVPAAIE 253
>gi|167999037|ref|XP_001752224.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696619|gb|EDQ82957.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 103
Score = 40.4 bits (93), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 7/76 (9%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVV-MMNYWK 187
GFY A ++ G G DV+ A + V E I + T + SF V ++N +K
Sbjct: 34 GFYVAQTISLSFGALGVNDVIAAAMCVLFTEYITRF------TRTRKKISFTVALVNNFK 87
Query: 188 AGVCLGLFVDAFKLGS 203
G GLF+DAFKL S
Sbjct: 88 MGFTYGLFIDAFKLAS 103
>gi|303271965|ref|XP_003055344.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463318|gb|EEH60596.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 200
Score = 40.4 bits (93), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 89 GPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDV 148
G D G TRLG + R +++ LP+ + LL+GF + A D
Sbjct: 84 GGDPGDTRLGATYADIARSFRDQVKKNFNYLPVALICLLVGF---SMCAYFPHPESPGDA 140
Query: 149 LVAGVVVAAIEGIGMLMYKKPPSSTGRLQ-----SFVVM-MNYWKAGVCLGLFVDAFKLG 202
++ ++ E + ++Y P G L+ SF+ + +N +K GV GL DAFK+G
Sbjct: 141 VICLTIMFLSEFLSSVLYS-PKRPRGMLKWLREGSFIPLALNCFKIGVLYGLCCDAFKVG 199
Query: 203 S 203
S
Sbjct: 200 S 200
>gi|428313684|ref|YP_007124661.1| hypothetical protein Mic7113_5625 [Microcoleus sp. PCC 7113]
gi|428255296|gb|AFZ21255.1| Protein of unknown function (DUF565) [Microcoleus sp. PCC 7113]
Length = 118
Score = 39.7 bits (91), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPP------SSTGRLQSFVVM 182
G + +A+ GQ D++ AGV++ E I ++Y K P ++ L +
Sbjct: 38 GTFLGSAIPATAGQAAKLDLVGAGVLILFTEMISWIVYAKSPQGRPLTENSRGLSLLTQI 97
Query: 183 MNYWKAGVCLGLFVDAFKLGS 203
+N K G+ +FV+AFKLGS
Sbjct: 98 LNAMKIGMTYSMFVEAFKLGS 118
>gi|303271617|ref|XP_003055170.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463144|gb|EEH60422.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 194
Score = 39.3 bits (90), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 44/97 (45%), Gaps = 6/97 (6%)
Query: 107 QLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMY 166
+L ++ R+ L K GFYTAN ++ G G DV+ V E + Y
Sbjct: 104 ELYFRVRKGRRTL-WKAISAFAGFYTANVISLTFGVLGINDVVAGAFCVGFFELVTRAYY 162
Query: 167 KKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
K G+ F +N++K G+ L DAFKLGS
Sbjct: 163 NK--IKPGKYWGF---LNWFKLGLIYALVTDAFKLGS 194
>gi|75907255|ref|YP_321551.1| hypothetical protein Ava_1032 [Anabaena variabilis ATCC 29413]
gi|75700980|gb|ABA20656.1| Protein of unknown function DUF565 [Anabaena variabilis ATCC 29413]
Length = 110
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
G + A++T GQ + D+++A +V E + Y + R V +N K
Sbjct: 38 GIFLGTAISTTAGQRSELDIVIAAFLVFLTEVTSRIFYMQ--GFFARRSLLVESLNLLKV 95
Query: 189 GVCLGLFVDAFKLGS 203
G LF++AFKLGS
Sbjct: 96 GFTYSLFIEAFKLGS 110
>gi|332707378|ref|ZP_08427428.1| protein of unknown function, DUF565 [Moorea producens 3L]
gi|332353869|gb|EGJ33359.1| protein of unknown function, DUF565 [Moorea producens 3L]
Length = 116
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 8/93 (8%)
Query: 117 KNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYK-KPP----- 170
+ L M + L GF AL T G +W +++G+++ E + L+Y K P
Sbjct: 26 RRLSMVLISFLFGFLVGQALCTTAGTGQEW--ILSGMLIVFTEAVNRLIYSGKFPKGQLN 83
Query: 171 SSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
+ R+ ++N +K G+ ++++AFKLGS
Sbjct: 84 QTVDRVSWLADVVNAFKIGLVYSMYLEAFKLGS 116
>gi|17230364|ref|NP_486912.1| hypothetical protein alr2872 [Nostoc sp. PCC 7120]
gi|17131966|dbj|BAB74571.1| alr2872 [Nostoc sp. PCC 7120]
Length = 110
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 129 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKA 188
G + A++T GQ + D+++A +V E + Y + R V +N K
Sbjct: 38 GIFLGTAISTTAGQRSELDIVIAAFLVFLTEVTSRIFYMQ--GFFARRSLLVESLNLLKV 95
Query: 189 GVCLGLFVDAFKLGS 203
G LF++AFKLGS
Sbjct: 96 GFIYSLFIEAFKLGS 110
>gi|22298711|ref|NP_681958.1| hypothetical protein tlr1168 [Thermosynechococcus elongatus BP-1]
gi|22294892|dbj|BAC08720.1| ycf20 [Thermosynechococcus elongatus BP-1]
Length = 110
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 95 TRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATILGQTGDWDVLVAGVV 154
TRL ++ G + E+L + L L G + A+++ GQ G D++ + +V
Sbjct: 4 TRLNTLLDRLGAGIQEQLKNPWRRLATLSIAFLFGVFLGLAISSSAGQLGYLDIIASSMV 63
Query: 155 VAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
A E I L Y R F M+N K G+ GLF+ AF LGS
Sbjct: 64 AIAAEVISALFYSDRWKL--RQTLFGEMLNALKFGLLYGLFLVAFLLGS 110
>gi|376403651|ref|YP_005089926.1| ycf20 gene product (chloroplast) [Pedinomonas minor]
gi|229915558|gb|ACQ90902.1| hypothetical chloroplast RF20 [Pedinomonas minor]
Length = 104
Score = 36.6 bits (83), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 7/60 (11%)
Query: 144 GDWDVLVAGVVVAAIEGIGMLMYKKPPSSTGRLQSFVVMMNYWKAGVCLGLFVDAFKLGS 203
G WD + V+ E I + +Y G ++ +NY+K G L LF+DAFK+GS
Sbjct: 52 GFWDGFIILFVLLVCELINLFVY---VYKLGIFKT----INYFKIGFLLALFIDAFKVGS 104
>gi|308802359|ref|XP_003078493.1| unnamed protein product [Ostreococcus tauri]
gi|116056945|emb|CAL53234.1| unnamed protein product [Ostreococcus tauri]
Length = 130
Score = 36.2 bits (82), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 53/130 (40%), Gaps = 12/130 (9%)
Query: 81 GQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIFLLLLGFYTANALATIL 140
E D TRL +I+ + ++ + LP+ + LL+GF + A I
Sbjct: 6 ASEDEEDDDARRNRTRLQKILDSLIKKFFGRAKTNLVKLPVALISLLVGF---SMCALIP 62
Query: 141 GQTGDWDVLVAGVVVAAIEGIGMLMYKKPPS-------STGRLQSFVVMMNYWKAGVCLG 193
D ++ ++ E + ++Y G L +V N +K GV G
Sbjct: 63 HPESPQDAFISFTIILLSEFVSSVLYGVDNQGGLLRWLCEGPLAPMIV--NCFKIGVLYG 120
Query: 194 LFVDAFKLGS 203
LF DAFK+GS
Sbjct: 121 LFCDAFKVGS 130
>gi|108796965|ref|YP_636265.1| hypothetical protein RF20 [Pseudendoclonium akinetum]
gi|56159663|gb|AAV80687.1| hypothetical protein RF20 [Pseudendoclonium akinetum]
Length = 198
Score = 36.2 bits (82), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 13/26 (50%), Positives = 22/26 (84%)
Query: 178 SFVVMMNYWKAGVCLGLFVDAFKLGS 203
+F+ ++N++K G+ LG F+DAFK+GS
Sbjct: 173 NFIKILNFYKIGLLLGFFIDAFKVGS 198
>gi|404213551|ref|YP_006667745.1| Non-ribosomal peptide synthetase modules-related protein [Gordonia
sp. KTR9]
gi|403644350|gb|AFR47590.1| Non-ribosomal peptide synthetase modules-related protein [Gordonia
sp. KTR9]
Length = 5004
Score = 36.2 bits (82), Expect = 8.2, Method: Composition-based stats.
Identities = 41/134 (30%), Positives = 59/134 (44%), Gaps = 17/134 (12%)
Query: 24 RLAVSANQKWKNEPSTSGQTSKIGSKSACFLGKSFLMRRHGWKIAFALDTGGISGSGGQE 83
RL+V A++ + T Q +++G S + +R W A D G+ G
Sbjct: 3318 RLSVPADEAFTYADFTQWQIARLGDPSDPHSRYARELRF--WASTLA-DAPGLLRLPGSV 3374
Query: 84 SLNGDGPDLGGTRLGRIVSAGGRQL-----LEKLNIARKNLPMKIFLLLLGFYTANALAT 138
L+GD PDL G A GR + L+ K L IF + ALAT
Sbjct: 3375 DLDGDDPDLHEAARG---GATGRLSTTIVGTDALHQTAKALSASIFQV-----AHAALAT 3426
Query: 139 ILGQ-TGDWDVLVA 151
+LGQ TG+WD+L+
Sbjct: 3427 VLGQWTGEWDLLIG 3440
>gi|325001780|ref|ZP_08122892.1| FAD/FMN-dependent dehydrogenase [Pseudonocardia sp. P1]
Length = 484
Score = 35.8 bits (81), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 20/170 (11%)
Query: 15 RDVFESSYIRLAVSANQK-----WKNEPSTSGQTSKIGSKSACFLGKSFLMR-----RHG 64
R V E L ++A + +PS S Q IG A G + ++ H
Sbjct: 107 RAVLEPGVTNLEITAAASPHGLYYAPDPS-SQQVCTIGGNVAENSGGAHCLKYGFTTNHV 165
Query: 65 WKIAFALDTGGISGSGGQESLNGDGPDLGGTRLGRIVSAGGRQLLEKLNIARKNLPMKIF 124
I L G + + G +S+ GPDL G LG S G ++ K+ + P +
Sbjct: 166 LSIEVVLPDGSVV-TLGSDSMEQQGPDLRGVFLG---SEGTLGVVTKITVRLLRTPEVMR 221
Query: 125 LLLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKPPSSTG 174
LL F + A ++ D++ AG+V AA+E + L + +TG
Sbjct: 222 TLLADFPSVAAGGQVVS-----DIVAAGIVPAAVEMMDALAIEAAEGATG 266
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.138 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,246,641,715
Number of Sequences: 23463169
Number of extensions: 137167757
Number of successful extensions: 394511
Number of sequences better than 100.0: 184
Number of HSP's better than 100.0 without gapping: 121
Number of HSP's successfully gapped in prelim test: 63
Number of HSP's that attempted gapping in prelim test: 394231
Number of HSP's gapped (non-prelim): 186
length of query: 203
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 68
effective length of database: 9,191,667,552
effective search space: 625033393536
effective search space used: 625033393536
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)