BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031119
(165 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255539324|ref|XP_002510727.1| conserved hypothetical protein [Ricinus communis]
gi|223551428|gb|EEF52914.1| conserved hypothetical protein [Ricinus communis]
Length = 264
Score = 270 bits (691), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/185 (71%), Positives = 150/185 (81%), Gaps = 20/185 (10%)
Query: 1 MFQLAFTA-CSFPAIVSYALA-------------------ANDWQLGAGEPNGFKSITAF 40
+F +AFT +FP ++S A A DW++GAGEPNGFKS+TAF
Sbjct: 80 LFSVAFTTGFTFPGLISNAFAEIDDFRLYTDDANKFQISIPQDWRVGAGEPNGFKSVTAF 139
Query: 41 YPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKA 100
YP+EAS SSVSVVITGLGPDFTRMESFGKVEAFA+TLVSGLDRSW+RPPGVAAKLIDCKA
Sbjct: 140 YPEEASGSSVSVVITGLGPDFTRMESFGKVEAFAETLVSGLDRSWQRPPGVAAKLIDCKA 199
Query: 101 SKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVA 160
+KG YYIEYTLQNPGE RKHLFSA+GMA NGWYNRLYTVTGQFVEEESE YGS ++K V+
Sbjct: 200 TKGIYYIEYTLQNPGEGRKHLFSALGMAFNGWYNRLYTVTGQFVEEESENYGSKVQKVVS 259
Query: 161 SFRFI 165
SF+FI
Sbjct: 260 SFKFI 264
>gi|449522159|ref|XP_004168095.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
[Cucumis sativus]
Length = 188
Score = 264 bits (675), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 128/188 (68%), Positives = 146/188 (77%), Gaps = 23/188 (12%)
Query: 1 MFQLAFTACSFPAIVSYALAAN-----------------------DWQLGAGEPNGFKSI 37
M + FTA SF +VS ALA + DWQ+G GEPNGFKS+
Sbjct: 1 MLGIGFTAFSFQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSV 60
Query: 38 TAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLID 97
TAF+PQE S+S+VSVVI+GLGPD+TRMESFGKVE FADTLVSGLDRSW+RPPGVAAKLID
Sbjct: 61 TAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLID 120
Query: 98 CKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEK 157
C++SKG YYIEYTLQNPGESRKHL+SAIGM+SNGWYNRLYT+TGQ+ +EESE Y S IEK
Sbjct: 121 CRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEK 180
Query: 158 AVASFRFI 165
V SF FI
Sbjct: 181 VVNSFAFI 188
>gi|449458059|ref|XP_004146765.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
[Cucumis sativus]
Length = 257
Score = 262 bits (670), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 127/188 (67%), Positives = 145/188 (77%), Gaps = 23/188 (12%)
Query: 1 MFQLAFTACSFPAIVSYALAAN-----------------------DWQLGAGEPNGFKSI 37
M + FTA SF + S ALA + DWQ+G GEPNGFKS+
Sbjct: 70 MLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSV 129
Query: 38 TAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLID 97
TAF+PQE S+S+VSVVI+GLGPD+TRMESFGKVE FADTLVSGLDRSW+RPPGVAAKLID
Sbjct: 130 TAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLID 189
Query: 98 CKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEK 157
C++SKG YYIEYTLQNPGESRKHL+SAIGM+SNGWYNRLYT+TGQ+ +EESE Y S IEK
Sbjct: 190 CRSSKGIYYIEYTLQNPGESRKHLYSAIGMSSNGWYNRLYTITGQYADEESESYSSKIEK 249
Query: 158 AVASFRFI 165
V SF FI
Sbjct: 250 VVNSFAFI 257
>gi|188509930|gb|ACD56619.1| unknown predicted protein [Gossypioides kirkii]
Length = 274
Score = 261 bits (667), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 120/136 (88%), Positives = 130/136 (95%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
DWQ+GAGEPN FKSITAFYP+E ++S+VSV ITGLGPDFTRMESFGKV+AFADTLVSGL
Sbjct: 109 QDWQVGAGEPNNFKSITAFYPEEEANSNVSVAITGLGPDFTRMESFGKVDAFADTLVSGL 168
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW+RPPGVAAKLIDCKA+ GFYYIEYTLQNPGESR+HLFSAIGMASNGWYNRLYTVTG
Sbjct: 169 DRSWQRPPGVAAKLIDCKAANGFYYIEYTLQNPGESRRHLFSAIGMASNGWYNRLYTVTG 228
Query: 142 QFVEEESEKYGSNIEK 157
QFVEEE+EKYGS IEK
Sbjct: 229 QFVEEEAEKYGSRIEK 244
>gi|225457650|ref|XP_002273332.1| PREDICTED: psbP domain-containing protein 3, chloroplastic [Vitis
vinifera]
gi|297745601|emb|CBI40766.3| unnamed protein product [Vitis vinifera]
Length = 253
Score = 258 bits (658), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 124/187 (66%), Positives = 147/187 (78%), Gaps = 22/187 (11%)
Query: 1 MFQLAFTACSFPAIVSYALA----------------------ANDWQLGAGEPNGFKSIT 38
+ Q+ F+ S AIV ALA DWQ+G+GEP+GFKS+T
Sbjct: 67 VLQMVFSTISLQAIVPNALADTEVPEDFKVYSDEVNKFKIQIPQDWQVGSGEPSGFKSVT 126
Query: 39 AFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDC 98
AFYP+EAS S+VSVVITGLG DFTR+ESFGKV+AFA+ LV+GLDRSW+RPPG+AAKLIDC
Sbjct: 127 AFYPEEASGSNVSVVITGLGADFTRLESFGKVDAFAENLVNGLDRSWQRPPGIAAKLIDC 186
Query: 99 KASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKA 158
+A+ GFYYIEY LQNPGESR+HLFSA+GMA+NGWYNRLYTVTGQ++EEESEK+ S IEK
Sbjct: 187 RAANGFYYIEYWLQNPGESRRHLFSAVGMANNGWYNRLYTVTGQYLEEESEKFSSKIEKV 246
Query: 159 VASFRFI 165
VASFRFI
Sbjct: 247 VASFRFI 253
>gi|21537211|gb|AAM61552.1| thylakoid lumen protein, chloroplast precursor [Arabidopsis
thaliana]
Length = 247
Score = 253 bits (646), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 115/144 (79%), Positives = 129/144 (89%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
DWQ+G EPNGFKSITAFYPQE S+S+VS+ ITGLGPDFTRMESFGKVEAFA+TLVSGL
Sbjct: 104 QDWQVGQAEPNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETLVSGL 163
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW++P GV AKLID +ASKGFYYIEYTLQNPGE+RKHL+SAIGMA+NGWYNRLYTVTG
Sbjct: 164 DRSWQKPVGVTAKLIDSRASKGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLYTVTG 223
Query: 142 QFVEEESEKYGSNIEKAVASFRFI 165
QF +EES + S I+K V SFRFI
Sbjct: 224 QFTDEESSEQSSKIQKTVKSFRFI 247
>gi|18411110|ref|NP_565131.1| PsbP domain-containing protein 3 [Arabidopsis thaliana]
gi|18203439|sp|Q9S720.2|PPD3_ARATH RecName: Full=PsbP domain-containing protein 3, chloroplastic;
AltName: Full=OEC23-like protein 2; Flags: Precursor
gi|32815945|gb|AAP88357.1| At1g76450 [Arabidopsis thaliana]
gi|110742837|dbj|BAE99317.1| hypothetical protein [Arabidopsis thaliana]
gi|332197723|gb|AEE35844.1| PsbP domain-containing protein 3 [Arabidopsis thaliana]
Length = 247
Score = 253 bits (645), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 115/144 (79%), Positives = 129/144 (89%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
DWQ+G EPNGFKSITAFYPQE S+S+VS+ ITGLGPDFTRMESFGKVEAFA+TLVSGL
Sbjct: 104 QDWQVGQAEPNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETLVSGL 163
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW++P GV AKLID +ASKGFYYIEYTLQNPGE+RKHL+SAIGMA+NGWYNRLYTVTG
Sbjct: 164 DRSWQKPVGVTAKLIDSRASKGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLYTVTG 223
Query: 142 QFVEEESEKYGSNIEKAVASFRFI 165
QF +EES + S I+K V SFRFI
Sbjct: 224 QFTDEESAEQSSKIQKTVKSFRFI 247
>gi|297839519|ref|XP_002887641.1| hypothetical protein ARALYDRAFT_476802 [Arabidopsis lyrata subsp.
lyrata]
gi|297333482|gb|EFH63900.1| hypothetical protein ARALYDRAFT_476802 [Arabidopsis lyrata subsp.
lyrata]
Length = 225
Score = 252 bits (643), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 114/148 (77%), Positives = 131/148 (88%)
Query: 18 ALAANDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTL 77
++ DWQ+G EPNGFKSITAFYPQE S+S+VS+ ITGLGPDFTRMESFGKVEAFA+TL
Sbjct: 78 SIPQEDWQVGQAEPNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFAETL 137
Query: 78 VSGLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLY 137
VSGLDRSW++P GV AKLID ++SKGFYYIEYTLQNPGE+RKHL+SAIGMA+NGWYNRLY
Sbjct: 138 VSGLDRSWQKPAGVTAKLIDSRSSKGFYYIEYTLQNPGEARKHLYSAIGMATNGWYNRLY 197
Query: 138 TVTGQFVEEESEKYGSNIEKAVASFRFI 165
TVTGQF +EES + S I+K V SFRFI
Sbjct: 198 TVTGQFTDEESAEQSSKIQKTVKSFRFI 225
>gi|224083366|ref|XP_002306998.1| predicted protein [Populus trichocarpa]
gi|222856447|gb|EEE93994.1| predicted protein [Populus trichocarpa]
Length = 193
Score = 250 bits (639), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 121/179 (67%), Positives = 141/179 (78%), Gaps = 22/179 (12%)
Query: 1 MFQLAFTACSFPAIVSYALAA----------------------NDWQLGAGEPNGFKSIT 38
+F + FTA +FPAI S ALAA WQ+GAGEP+G+KS+T
Sbjct: 8 LFNMVFTAFTFPAIASTALAATGVAEDSRVYTDDANKFKISIPQGWQVGAGEPSGYKSVT 67
Query: 39 AFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDC 98
AFYP+EAS+SSVSVVITGLGPDFTR+ESFGKV+AFA+TLV GLDRSW+RPPGVAAKLID
Sbjct: 68 AFYPEEASNSSVSVVITGLGPDFTRLESFGKVDAFAETLVGGLDRSWQRPPGVAAKLIDS 127
Query: 99 KASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEK 157
KA+ G YYIEYTLQNPGESR+HL SA+G+ NGWYNRLYTVTGQFV+EESEK+G+ I K
Sbjct: 128 KAANGLYYIEYTLQNPGESRRHLLSALGVTFNGWYNRLYTVTGQFVDEESEKFGTEIRK 186
>gi|357467179|ref|XP_003603874.1| PsbP domain-containing protein [Medicago truncatula]
gi|355492922|gb|AES74125.1| PsbP domain-containing protein [Medicago truncatula]
Length = 241
Score = 249 bits (635), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 112/144 (77%), Positives = 132/144 (91%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
+WQ+G GE GFKS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGL
Sbjct: 98 QEWQIGTGESAGFKSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGL 157
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW++PPGVAAKLIDCK+SKGFY+IEYTLQ+PGE RKHL+SAIGM +NGWYNRLYTVTG
Sbjct: 158 DRSWKKPPGVAAKLIDCKSSKGFYFIEYTLQSPGEGRKHLYSAIGMLTNGWYNRLYTVTG 217
Query: 142 QFVEEESEKYGSNIEKAVASFRFI 165
Q+ EEE++KY S I+KAV SF+FI
Sbjct: 218 QYGEEETDKYASKIQKAVRSFKFI 241
>gi|388521577|gb|AFK48850.1| unknown [Lotus japonicus]
Length = 250
Score = 248 bits (634), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 112/144 (77%), Positives = 133/144 (92%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
DW++G G+ +GFKS+TAFYP+EASSS+VSV ITGLGPDFT+MESFGKV+ FA+TLVSGL
Sbjct: 107 QDWEVGTGDSSGFKSLTAFYPKEASSSNVSVAITGLGPDFTKMESFGKVDEFAETLVSGL 166
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSWR+PPGVAAKLI+ K SKG YYIEY+LQNPGESR+HL+SAIGMA+NGWYNRLYTVTG
Sbjct: 167 DRSWRKPPGVAAKLINSKPSKGIYYIEYSLQNPGESRRHLYSAIGMATNGWYNRLYTVTG 226
Query: 142 QFVEEESEKYGSNIEKAVASFRFI 165
Q+VEEE++KY S I+KAV SF+FI
Sbjct: 227 QYVEEETDKYASEIQKAVTSFKFI 250
>gi|388507848|gb|AFK41990.1| unknown [Medicago truncatula]
Length = 241
Score = 248 bits (632), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 111/144 (77%), Positives = 131/144 (90%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
+WQ+G GE GFKS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGL
Sbjct: 98 QEWQIGTGESAGFKSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGL 157
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW++PPGVAAKLIDCK+SKGFY+IEYTLQ+PGE RKHL+SAIGM +NGWYNRLYTVTG
Sbjct: 158 DRSWKKPPGVAAKLIDCKSSKGFYFIEYTLQSPGEGRKHLYSAIGMLTNGWYNRLYTVTG 217
Query: 142 QFVEEESEKYGSNIEKAVASFRFI 165
Q+ EEE++KY S I+K V SF+FI
Sbjct: 218 QYGEEETDKYASKIQKTVRSFKFI 241
>gi|388515155|gb|AFK45639.1| unknown [Medicago truncatula]
Length = 241
Score = 246 bits (629), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 111/144 (77%), Positives = 131/144 (90%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
+WQ+G GE GFKS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGL
Sbjct: 98 QEWQIGTGESAGFKSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGL 157
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW++PPGVAAKLIDCK+SKGFY+IEYTLQ+PGE RKHL+SAIGM +NGWYNRLYTVTG
Sbjct: 158 DRSWKKPPGVAAKLIDCKSSKGFYFIEYTLQSPGEGRKHLYSAIGMLTNGWYNRLYTVTG 217
Query: 142 QFVEEESEKYGSNIEKAVASFRFI 165
Q+ EEE++KY S I+KAV SF+ I
Sbjct: 218 QYGEEETDKYASKIQKAVRSFKLI 241
>gi|356508481|ref|XP_003522985.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
[Glycine max]
Length = 244
Score = 246 bits (629), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 115/146 (78%), Positives = 132/146 (90%), Gaps = 2/146 (1%)
Query: 22 NDWQLGAG--EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVS 79
+WQ+G G E +GFKSITAFYP AS+S+VSVVITGLGPDFTRMESFGKV+ FA TLVS
Sbjct: 99 QEWQVGTGDGESSGFKSITAFYPTVASNSNVSVVITGLGPDFTRMESFGKVDEFAQTLVS 158
Query: 80 GLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTV 139
GLDRSWR+PPGVAAKLIDCK+S G YYIEY+LQNPGESR++L+SAIGMASNGWYNRLYTV
Sbjct: 159 GLDRSWRKPPGVAAKLIDCKSSDGIYYIEYSLQNPGESRRYLYSAIGMASNGWYNRLYTV 218
Query: 140 TGQFVEEESEKYGSNIEKAVASFRFI 165
TGQ+VEEE++KY S ++K VASFRFI
Sbjct: 219 TGQYVEEETDKYASKVQKVVASFRFI 244
>gi|356517170|ref|XP_003527262.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
[Glycine max]
Length = 248
Score = 245 bits (625), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 114/145 (78%), Positives = 132/145 (91%), Gaps = 2/145 (1%)
Query: 23 DWQLGAG--EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSG 80
+WQ+G G E +GFKSITAFYP +AS+S+VSVVITGLGPDFTRMESFGKV+ FA TLVSG
Sbjct: 104 EWQVGTGDGESSGFKSITAFYPTQASNSNVSVVITGLGPDFTRMESFGKVDEFAQTLVSG 163
Query: 81 LDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVT 140
LDRSWR+PPGVAAKLIDCK+S G YYIEY LQNPGESR++L+SAIGMASNGWYNRLYTVT
Sbjct: 164 LDRSWRKPPGVAAKLIDCKSSNGIYYIEYLLQNPGESRRYLYSAIGMASNGWYNRLYTVT 223
Query: 141 GQFVEEESEKYGSNIEKAVASFRFI 165
GQ+VEE+++KY S ++K VASFRFI
Sbjct: 224 GQYVEEDTDKYASKVQKVVASFRFI 248
>gi|147809819|emb|CAN64877.1| hypothetical protein VITISV_030793 [Vitis vinifera]
Length = 247
Score = 244 bits (623), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 117/179 (65%), Positives = 140/179 (78%), Gaps = 22/179 (12%)
Query: 1 MFQLAFTACSFPAIVSYALA----------------------ANDWQLGAGEPNGFKSIT 38
+ Q+ F+ S AIV ALA DWQ+G+GEP+GFKS+T
Sbjct: 67 VLQIVFSTISLQAIVPNALADTEVPEDFKVYSDEVNKFKIQIPQDWQVGSGEPSGFKSVT 126
Query: 39 AFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDC 98
AFYP+EAS S+VSVVITGLG DFTR+ESFGKV+AFA+ LV+GLDRSW+RPPG+AAKLIDC
Sbjct: 127 AFYPEEASGSNVSVVITGLGADFTRLESFGKVDAFAENLVNGLDRSWQRPPGIAAKLIDC 186
Query: 99 KASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEK 157
+A+ GFYYIEY LQNPGESR+HLFSA+GMA+NGWYNRLYTVTGQ++EEESEK+ S IEK
Sbjct: 187 RAANGFYYIEYWLQNPGESRRHLFSAVGMANNGWYNRLYTVTGQYLEEESEKFSSKIEK 245
>gi|255646871|gb|ACU23906.1| unknown [Glycine max]
Length = 244
Score = 243 bits (621), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 114/146 (78%), Positives = 131/146 (89%), Gaps = 2/146 (1%)
Query: 22 NDWQLGAG--EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVS 79
+WQ+G G E +GFKSITAFYP AS+S+VSVVITG GPDFTRMESFGKV+ FA TLVS
Sbjct: 99 QEWQVGTGDGESSGFKSITAFYPTVASNSNVSVVITGPGPDFTRMESFGKVDEFAQTLVS 158
Query: 80 GLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTV 139
GLDRSWR+PPGVAAKLIDCK+S G YYIEY+LQNPGESR++L+SAIGMASNGWYNRLYTV
Sbjct: 159 GLDRSWRKPPGVAAKLIDCKSSDGIYYIEYSLQNPGESRRYLYSAIGMASNGWYNRLYTV 218
Query: 140 TGQFVEEESEKYGSNIEKAVASFRFI 165
TGQ+VEEE++KY S ++K VASFRFI
Sbjct: 219 TGQYVEEETDKYASKVQKVVASFRFI 244
>gi|357467177|ref|XP_003603873.1| PsbP domain-containing protein [Medicago truncatula]
gi|355492921|gb|AES74124.1| PsbP domain-containing protein [Medicago truncatula]
Length = 253
Score = 240 bits (612), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 112/155 (72%), Positives = 132/155 (85%), Gaps = 12/155 (7%)
Query: 23 DWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLD 82
+WQ+G GE GFKS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGLD
Sbjct: 99 EWQIGTGESAGFKSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGLD 158
Query: 83 RSWRRPPGVAAKLIDCKASK------------GFYYIEYTLQNPGESRKHLFSAIGMASN 130
RSW++PPGVAAKLIDCK+SK GFY+IEYTLQ+PGE RKHL+SAIGM +N
Sbjct: 159 RSWKKPPGVAAKLIDCKSSKVLTASLPRKNGSGFYFIEYTLQSPGEGRKHLYSAIGMLTN 218
Query: 131 GWYNRLYTVTGQFVEEESEKYGSNIEKAVASFRFI 165
GWYNRLYTVTGQ+ EEE++KY S I+KAV SF+FI
Sbjct: 219 GWYNRLYTVTGQYGEEETDKYASKIQKAVRSFKFI 253
>gi|115477246|ref|NP_001062219.1| Os08g0512500 [Oryza sativa Japonica Group]
gi|42408803|dbj|BAD10064.1| thylakoid lumen protein, chloroplast precursor-like [Oryza sativa
Japonica Group]
gi|113624188|dbj|BAF24133.1| Os08g0512500 [Oryza sativa Japonica Group]
gi|215686998|dbj|BAG90812.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218201441|gb|EEC83868.1| hypothetical protein OsI_29855 [Oryza sativa Indica Group]
gi|222640852|gb|EEE68984.1| hypothetical protein OsJ_27910 [Oryza sativa Japonica Group]
Length = 242
Score = 221 bits (562), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 100/145 (68%), Positives = 124/145 (85%), Gaps = 2/145 (1%)
Query: 22 NDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGL 81
DW +GAGE +GFKS+TAFYP + + S+VSV ITG+GPDFT ++SFG V+AFA+TLV+GL
Sbjct: 97 QDWLIGAGEVSGFKSVTAFYPDQVADSNVSVAITGIGPDFTSLKSFGDVDAFAETLVNGL 156
Query: 82 DRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
DRSW+RPPGVAAKLI+ +A+ GFYYIEYTLQNPGE R+H+ SAIGMA NGWYNRLYTVTG
Sbjct: 157 DRSWKRPPGVAAKLINSRAANGFYYIEYTLQNPGEQRRHIVSAIGMAFNGWYNRLYTVTG 216
Query: 142 QFVEEES--EKYGSNIEKAVASFRF 164
Q+++E+ +KY + IEK V SFRF
Sbjct: 217 QYIDEDGDVDKYRAQIEKCVQSFRF 241
>gi|357141874|ref|XP_003572377.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
[Brachypodium distachyon]
Length = 352
Score = 215 bits (547), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 99/143 (69%), Positives = 122/143 (85%), Gaps = 2/143 (1%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
WQ+GAGE +GFK++TAF+P +A+ SSVSVVITG+GPDFT ++SFG V+AFA+ LV+GLDR
Sbjct: 209 WQMGAGEGSGFKNVTAFFPDKAADSSVSVVITGIGPDFTSLKSFGDVDAFAENLVTGLDR 268
Query: 84 SWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQF 143
SW+RP GV AKLID +AS G YYIEYTLQNPG+ R+H+ SAIGMA NGWYNRLYTVTGQ+
Sbjct: 269 SWQRPAGVTAKLIDSRASNGMYYIEYTLQNPGDKRRHIVSAIGMAFNGWYNRLYTVTGQY 328
Query: 144 VE--EESEKYGSNIEKAVASFRF 164
+E EES K+ IEK+V SF+F
Sbjct: 329 IEDDEESVKFKPQIEKSVKSFKF 351
>gi|226496587|ref|NP_001147730.1| LOC100281340 [Zea mays]
gi|195613364|gb|ACG28512.1| thylakoid lumen protein [Zea mays]
gi|224034069|gb|ACN36110.1| unknown [Zea mays]
gi|414869459|tpg|DAA48016.1| TPA: Thylakoid lumen protein isoform 1 [Zea mays]
gi|414869460|tpg|DAA48017.1| TPA: Thylakoid lumen protein isoform 2 [Zea mays]
Length = 256
Score = 212 bits (539), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 97/143 (67%), Positives = 123/143 (86%), Gaps = 2/143 (1%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLD 82
W +GAGE +G KS+TAFYP++A++ S+VSV ITG+GPDFT ++SFG V+AFA+ LV+GLD
Sbjct: 113 WLVGAGEASGIKSVTAFYPEQAATDSNVSVAITGIGPDFTSLKSFGDVDAFAEGLVNGLD 172
Query: 83 RSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQ 142
RSW+RPPG+AAKLID +A+ G YY+EYTLQNPGE R+H+ SAIGMA NGWYNRLYTVTGQ
Sbjct: 173 RSWQRPPGLAAKLIDSRAANGLYYLEYTLQNPGERRRHIVSAIGMAFNGWYNRLYTVTGQ 232
Query: 143 FV-EEESEKYGSNIEKAVASFRF 164
++ +++SEKY IEKAV SFR
Sbjct: 233 YIDDDDSEKYRPQIEKAVGSFRL 255
>gi|25373339|pir||C96792 unknown protein F14G6.5 [imported] - Arabidopsis thaliana
gi|6554474|gb|AAF16656.1|AC012394_5 unknown protein; 20843-19352 [Arabidopsis thaliana]
gi|12323974|gb|AAG51945.1|AC015450_6 unknown protein; 20920-22411 [Arabidopsis thaliana]
Length = 220
Score = 210 bits (534), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/169 (62%), Positives = 124/169 (73%), Gaps = 13/169 (7%)
Query: 1 MFQLAFTACSFPAIVSYALAAND----WQLGAGEPNGFKSITAFYPQEASSSSVSVVITG 56
M Q+A + P +S A A + +++ E N F E S VS+ ITG
Sbjct: 61 MLQIASSVFFLPLAISPAFAETNASEAFRVYTDETNKF---------EISIPQVSIAITG 111
Query: 57 LGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGE 116
LGPDFTRMESFGKVEAFA+TLVSGLDRSW++P GV AKLID +ASKGFYYIEYTLQNPGE
Sbjct: 112 LGPDFTRMESFGKVEAFAETLVSGLDRSWQKPVGVTAKLIDSRASKGFYYIEYTLQNPGE 171
Query: 117 SRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASFRFI 165
+RKHL+SAIGMA+NGWYNRLYTVTGQF +EES + S I+K V SFRFI
Sbjct: 172 ARKHLYSAIGMATNGWYNRLYTVTGQFTDEESAEQSSKIQKTVKSFRFI 220
>gi|326518660|dbj|BAJ88359.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 287
Score = 209 bits (532), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 95/143 (66%), Positives = 120/143 (83%), Gaps = 2/143 (1%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
WQ+GAGE +GFK++TAF+P++ +SSVSVVITG+GPDFT ++SFG V+ FA+ LV+GLDR
Sbjct: 144 WQVGAGERSGFKNVTAFFPEQNPNSSVSVVITGIGPDFTSLKSFGNVDEFAENLVTGLDR 203
Query: 84 SWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQF 143
SW+RP G+ AKLID KA+ G YYIEYTLQNPGE R+H+ SAIGMA NGWYNRLYTVTGQ+
Sbjct: 204 SWQRPAGLTAKLIDSKAANGLYYIEYTLQNPGEKRRHIVSAIGMAFNGWYNRLYTVTGQY 263
Query: 144 V--EEESEKYGSNIEKAVASFRF 164
+ +E+S Y IEK+V SF+F
Sbjct: 264 IDDDEDSAIYKPEIEKSVKSFKF 286
>gi|242082241|ref|XP_002445889.1| hypothetical protein SORBIDRAFT_07g027500 [Sorghum bicolor]
gi|241942239|gb|EES15384.1| hypothetical protein SORBIDRAFT_07g027500 [Sorghum bicolor]
Length = 253
Score = 209 bits (531), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 96/144 (66%), Positives = 123/144 (85%), Gaps = 3/144 (2%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLD 82
W +GAGE +G KS+TAFYP++A++ S+VSV ITG+GPDFT ++SFG V++FA+ LV+GLD
Sbjct: 109 WLVGAGEASGIKSVTAFYPEQAAADSNVSVAITGIGPDFTSLKSFGDVDSFAEGLVNGLD 168
Query: 83 RSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQ 142
RSW+RPPG+AAKLID +A+ G YY+EYTLQNPGE R+H+ SAIGMA NGWYNRLYTVTGQ
Sbjct: 169 RSWQRPPGLAAKLIDSRAANGLYYLEYTLQNPGERRRHIVSAIGMAFNGWYNRLYTVTGQ 228
Query: 143 FV--EEESEKYGSNIEKAVASFRF 164
++ +++SEKY IEKAV SFR
Sbjct: 229 YIDDDDDSEKYRPQIEKAVRSFRL 252
>gi|326512572|dbj|BAJ99641.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 244
Score = 208 bits (529), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 95/143 (66%), Positives = 120/143 (83%), Gaps = 2/143 (1%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
WQ+GAGE +GFK++TAF+P++ +SSVSVVITG+GPDFT ++SFG V+ FA+ LV+GLDR
Sbjct: 101 WQVGAGERSGFKNVTAFFPEQNPNSSVSVVITGIGPDFTSLKSFGNVDEFAENLVTGLDR 160
Query: 84 SWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQF 143
SW+RP G+ AKLID KA+ G YYIEYTLQNPGE R+H+ SAIGMA NGWYNRLYTVTGQ+
Sbjct: 161 SWQRPAGLTAKLIDSKAANGLYYIEYTLQNPGEKRRHIVSAIGMAFNGWYNRLYTVTGQY 220
Query: 144 V--EEESEKYGSNIEKAVASFRF 164
+ +E+S Y IEK+V SF+F
Sbjct: 221 IDDDEDSAIYKPEIEKSVKSFKF 243
>gi|326501564|dbj|BAK02571.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 167
Score = 207 bits (527), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 95/143 (66%), Positives = 120/143 (83%), Gaps = 2/143 (1%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
WQ+GAGE +GFK++TAF+P++ +SSVSVVITG+GPDFT ++SFG V+ FA+ LV+GLDR
Sbjct: 24 WQVGAGERSGFKNVTAFFPEQNPNSSVSVVITGIGPDFTSLKSFGNVDEFAENLVTGLDR 83
Query: 84 SWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQF 143
SW+RP G+ AKLID KA+ G YYIEYTLQNPGE R+H+ SAIGMA NGWYNRLYTVTGQ+
Sbjct: 84 SWQRPAGLTAKLIDSKAANGLYYIEYTLQNPGEKRRHIVSAIGMAFNGWYNRLYTVTGQY 143
Query: 144 V--EEESEKYGSNIEKAVASFRF 164
+ +E+S Y IEK+V SF+F
Sbjct: 144 IDDDEDSAIYKPEIEKSVKSFKF 166
>gi|116784465|gb|ABK23352.1| unknown [Picea sitchensis]
Length = 256
Score = 205 bits (521), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 90/147 (61%), Positives = 119/147 (80%)
Query: 19 LAANDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLV 78
L DW G G+ +G +++TAFYP+ S+V+V+ITGL D+T+MESFG V+AFA+TLV
Sbjct: 110 LVPRDWIKGEGKTDGQRAVTAFYPESGIVSNVNVIITGLSADYTKMESFGTVDAFAETLV 169
Query: 79 SGLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYT 138
+ LDRSW+RPPG AAKL++ K+ G YYIEY+LQ PGES+ HL SAIGMA NGWYNRLYT
Sbjct: 170 NSLDRSWKRPPGQAAKLLNAKSKNGLYYIEYSLQKPGESKIHLLSAIGMAMNGWYNRLYT 229
Query: 139 VTGQFVEEESEKYGSNIEKAVASFRFI 165
VTGQ++E+++ KYGS IEK+++SFR +
Sbjct: 230 VTGQYLEDDAGKYGSKIEKSISSFRLV 256
>gi|215274041|gb|ACJ64918.1| unknown [Arachis diogoi]
Length = 110
Score = 203 bits (517), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 92/110 (83%), Positives = 105/110 (95%)
Query: 56 GLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPG 115
G+GPDFT+MESFGKVE FA+TL+ GLDRSW+RPPGVAAKLIDCK+SKGFYYIEY+LQNPG
Sbjct: 1 GVGPDFTKMESFGKVEEFAETLIGGLDRSWQRPPGVAAKLIDCKSSKGFYYIEYSLQNPG 60
Query: 116 ESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASFRFI 165
ESR+ L+SAIGMASNGWYNRLYTVTGQFVEEE++KY S ++KAVASFRFI
Sbjct: 61 ESRRTLYSAIGMASNGWYNRLYTVTGQFVEEETDKYASKVKKAVASFRFI 110
>gi|168003549|ref|XP_001754475.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694577|gb|EDQ80925.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 255
Score = 172 bits (435), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 105/147 (71%)
Query: 19 LAANDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLV 78
L W G G+ +G + +TAFYP + ++V++VITGLG DFT + SFG + FA+ LV
Sbjct: 109 LVPQAWNRGEGKTSGQRKVTAFYPADGGLTNVNIVITGLGADFTSLGSFGTADNFAENLV 168
Query: 79 SGLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYT 138
+ LDRSW++PPG A+L+DCK+ YY+EYT+Q GE ++HL S +G+ +NGW NRLYT
Sbjct: 169 NSLDRSWQKPPGQKARLVDCKSRADKYYVEYTIQRLGEQQRHLVSVVGIGNNGWVNRLYT 228
Query: 139 VTGQFVEEESEKYGSNIEKAVASFRFI 165
VTGQ+ EE+S KY +I K ++SF+ +
Sbjct: 229 VTGQYFEEDSAKYKQDINKIISSFKIL 255
>gi|302818180|ref|XP_002990764.1| hypothetical protein SELMODRAFT_448162 [Selaginella moellendorffii]
gi|300141502|gb|EFJ08213.1| hypothetical protein SELMODRAFT_448162 [Selaginella moellendorffii]
Length = 221
Score = 135 bits (341), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 98/143 (68%), Gaps = 1/143 (0%)
Query: 23 DWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLD 82
DW G G+ G + + AF+P +A+ +V+V+IT LG DFT + S G V+AFA ++V +D
Sbjct: 78 DWIKGEGK-AGSRRVVAFHPSKATFPNVNVIITNLGADFTGIGSLGSVDAFAASVVGSMD 136
Query: 83 RSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQ 142
RS++RPPG AA+L++ + G YY++YT+Q PGE+++H FS G+ +Y +LYT TGQ
Sbjct: 137 RSYKRPPGTAARLVNAVSRNGMYYLDYTVQTPGEAQRHFFSVAGVGETQFYKQLYTATGQ 196
Query: 143 FVEEESEKYGSNIEKAVASFRFI 165
+ E + ++ +++A+ SFR +
Sbjct: 197 YWEADGDRDRKALQEAIESFRIV 219
>gi|116782591|gb|ABK22563.1| unknown [Picea sitchensis]
Length = 208
Score = 122 bits (305), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 53/94 (56%), Positives = 72/94 (76%)
Query: 19 LAANDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLV 78
L DW G G+ +G +++TAFYP+ S+V+V+ITGL D+T+MES G V+AFA+TLV
Sbjct: 110 LVPRDWIKGEGKTDGQRAVTAFYPESGIVSNVNVIITGLSADYTKMESLGTVDAFAETLV 169
Query: 79 SGLDRSWRRPPGVAAKLIDCKASKGFYYIEYTLQ 112
+ LDRSW+RPPG AAKL++ K+ G YYIEY+LQ
Sbjct: 170 NSLDRSWKRPPGQAAKLLNAKSKNGLYYIEYSLQ 203
>gi|413953068|gb|AFW85717.1| hypothetical protein ZEAMMB73_055926 [Zea mays]
Length = 277
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 62/120 (51%), Positives = 77/120 (64%), Gaps = 25/120 (20%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLD 82
W +GAGE +G KS+TAFYP++A++ S+VSV I +GPDFT ++SFG V AFA+ LV
Sbjct: 145 WLVGAGEASGIKSVTAFYPEQAATDSNVSVAIIEIGPDFTSLKSFGDVAAFAEGLVR--- 201
Query: 83 RSWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQ 142
YY+EYT QNPGE R+H+ SAIGMA NGWYNRLY TGQ
Sbjct: 202 ---------------------LYYLEYTSQNPGERRRHIVSAIGMAFNGWYNRLYIATGQ 240
>gi|356577616|ref|XP_003556920.1| PREDICTED: LOW QUALITY PROTEIN: psbP domain-containing protein 3,
chloroplastic-like [Glycine max]
Length = 113
Score = 116 bits (291), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/79 (74%), Positives = 66/79 (83%), Gaps = 2/79 (2%)
Query: 23 DWQLGA--GEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSG 80
+WQ+G GE +GFKSITAFYP AS+S+VSVVITGLGP+FTRMESFGKV FA TLVSG
Sbjct: 35 EWQVGTRDGESSGFKSITAFYPTVASNSNVSVVITGLGPNFTRMESFGKVGEFAQTLVSG 94
Query: 81 LDRSWRRPPGVAAKLIDCK 99
LD S R+P GVAAKLI CK
Sbjct: 95 LDGSXRKPSGVAAKLIHCK 113
>gi|384245846|gb|EIE19338.1| hypothetical protein COCSUDRAFT_58632 [Coccomyxa subellipsoidea
C-169]
Length = 191
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 82/137 (59%), Gaps = 6/137 (4%)
Query: 33 GFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSW-----RR 87
G + A+YP+ + ++V+V++T +G D+T + SFG ++F + L++ LDRS+
Sbjct: 49 GTRRALAWYPEGGTDTNVTVLVTNVGADYTALGSFGSADSFGENLIASLDRSFLLRGGMN 108
Query: 88 PPGVAAKLIDCKASKGFYYIEYTLQNPGE-SRKHLFSAIGMASNGWYNRLYTVTGQFVEE 146
P AKL+D K+ G Y++EYT+Q P E +H SA+ + NG YNR YT+T Q E
Sbjct: 109 RPQSRAKLLDAKSRSGMYFLEYTVQKPEEIEPRHFLSAVALGFNGRYNRFYTLTAQCPEA 168
Query: 147 ESEKYGSNIEKAVASFR 163
+ Y + +E V SFR
Sbjct: 169 DLSTYRATLEGVVNSFR 185
>gi|357467181|ref|XP_003603875.1| PsbP domain-containing protein [Medicago truncatula]
gi|355492923|gb|AES74126.1| PsbP domain-containing protein [Medicago truncatula]
Length = 174
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 45/62 (72%), Positives = 55/62 (88%)
Query: 23 DWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLD 82
+WQ+G GE GFKS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLV LD
Sbjct: 99 EWQIGTGESAGFKSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVLFLD 158
Query: 83 RS 84
++
Sbjct: 159 QN 160
>gi|307103403|gb|EFN51663.1| hypothetical protein CHLNCDRAFT_139877 [Chlorella variabilis]
Length = 295
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 77/142 (54%), Gaps = 11/142 (7%)
Query: 33 GFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRS-------W 85
G + + AF P++ SV+V I LG D+T + SFG + FA +V+ +D S W
Sbjct: 148 GLRRVVAFLPEDKPEVSVAVTIQFLGADYTGLGSFGTAQDFATGVVNKMDNSYILKLPEW 207
Query: 86 RR---PPGVAAKLIDCK-ASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTG 141
RR PP AKL+D K K Y YTL GE+++ ++ A+ + +NG NR YTV
Sbjct: 208 RRAKEPPVQVAKLLDVKDVDKRQYMFAYTLAKEGEAQRTVYQAVAVGNNGRTNRFYTVNA 267
Query: 142 QFVEEESEKYGSNIEKAVASFR 163
+ E+YG ++K V SFR
Sbjct: 268 SCAGADVEQYGPVLQKIVESFR 289
>gi|159480798|ref|XP_001698469.1| hypothetical protein CHLREDRAFT_205923 [Chlamydomonas reinhardtii]
gi|158282209|gb|EDP07962.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 85/161 (52%), Gaps = 21/161 (13%)
Query: 23 DWQLGAGEPNGFKSIT---------AFYPQEASSSSVSVVIT--GLGPDFTRMESFGKVE 71
+W G G+ +G S + A++P+ S V+V IT + +FT++ SFG
Sbjct: 91 NWGFGEGQLSGNSSFSGASGARRTLAWFPENVSPRDVNVTITVTNVSVEFTKLGSFGTPL 150
Query: 72 AFADTLVSGLDRSW---------RRPPGVAAKLIDCKASKGFYYIEYTLQN-PGESRKHL 121
FA LV+ DRS+ R P + AKLID Y++EYTLQ P E ++HL
Sbjct: 151 QFASNLVNSQDRSYMLRGPEWARRNEPIMVAKLIDAGEISNKYFLEYTLQKVPDEPKRHL 210
Query: 122 FSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASF 162
++A+ + NG YNRLY+VT Q +EE +Y + + V S
Sbjct: 211 YTAVALGYNGTYNRLYSVTAQSLEELKPQYEATLLAMVKSL 251
>gi|255083562|ref|XP_002504767.1| oxygen-evolving enhancer PsbP [Micromonas sp. RCC299]
gi|226520035|gb|ACO66025.1| oxygen-evolving enhancer PsbP [Micromonas sp. RCC299]
Length = 248
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 83/153 (54%), Gaps = 7/153 (4%)
Query: 17 YALAA-NDWQLGAGE----PNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVE 71
Y+LA +DW+ G+ P + + AFYP +V+VV T LG D+ +M SFG +
Sbjct: 88 YSLAVPSDWERALGDTSPNPQSTRKVIAFYPPGQPEINVNVVATALGADYPKMGSFGSPD 147
Query: 72 AFADTLVSGLDRSW-RRPPGVAAKLIDCKASKGFYYIEYTLQNPGES-RKHLFSAIGMAS 129
FA + +G+ R R+ P + ++D K + Y IEYT++ P E+ +HL SA+G+
Sbjct: 148 EFAYGVAAGMTRPKPRQGPKQFSYVVDAKNAGDRYVIEYTVERPEENFYQHLLSAVGIGY 207
Query: 130 NGWYNRLYTVTGQFVEEESEKYGSNIEKAVASF 162
NG NRL+T+T E+ + K A+F
Sbjct: 208 NGRVNRLFTLTAVCPEDRYAEMEPMFRKIFATF 240
>gi|414869461|tpg|DAA48018.1| TPA: hypothetical protein ZEAMMB73_646546 [Zea mays]
gi|414869462|tpg|DAA48019.1| TPA: hypothetical protein ZEAMMB73_646546 [Zea mays]
Length = 178
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 35/56 (62%), Positives = 48/56 (85%), Gaps = 1/56 (1%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDFTRMESFGKVEAFADTLV 78
W +GAGE +G KS+TAFYP++A++ S+VSV ITG+GPDFT ++SFG V+AFA+ LV
Sbjct: 113 WLVGAGEASGIKSVTAFYPEQAATDSNVSVAITGIGPDFTSLKSFGDVDAFAEGLV 168
>gi|303283586|ref|XP_003061084.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457435|gb|EEH54734.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 265
Score = 72.0 bits (175), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 74/140 (52%), Gaps = 6/140 (4%)
Query: 23 DWQLGAG----EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLV 78
DW+ G P + +TA++P +V+VV T LG D+ +M SFG + FA +
Sbjct: 112 DWEQAIGTTDDNPQSSRRVTAYFPPGDPDINVNVVCTALGADYPKMGSFGSPDEFAFGVA 171
Query: 79 SGLDRSWRRP-PGVAAKLIDCKASKGFYYIEYTLQNPGES-RKHLFSAIGMASNGWYNRL 136
+G+ R + P + +++ K+ Y+IEYT++ P E +HL S +G+ NG +RL
Sbjct: 172 AGMTRPKPKSGPKQFSYVLNAKSKGDKYFIEYTVERPSEDFYQHLMSVVGVGYNGRVSRL 231
Query: 137 YTVTGQFVEEESEKYGSNIE 156
T T E++ + + +E
Sbjct: 232 ITATAVCPEDKFAENKAKLE 251
>gi|302849364|ref|XP_002956212.1| hypothetical protein VOLCADRAFT_97118 [Volvox carteri f.
nagariensis]
gi|300258515|gb|EFJ42751.1| hypothetical protein VOLCADRAFT_97118 [Volvox carteri f.
nagariensis]
Length = 205
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 44/72 (61%)
Query: 91 VAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEK 150
V AKL+D G Y++EYT+Q E ++HL+S + + NG YNRLYT+T Q +E + +
Sbjct: 89 VVAKLVDATELSGRYFVEYTVQKLPEPQRHLYSLLALGYNGMYNRLYTITAQTLEGDRPQ 148
Query: 151 YGSNIEKAVASF 162
Y + + V S
Sbjct: 149 YEAALLSMVKSL 160
>gi|308799735|ref|XP_003074648.1| thylakoid lumen protein, chloroplast prec (ISS) [Ostreococcus
tauri]
gi|116000819|emb|CAL50499.1| thylakoid lumen protein, chloroplast prec (ISS) [Ostreococcus
tauri]
Length = 195
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 58/128 (45%), Gaps = 5/128 (3%)
Query: 42 PQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR----SWRRPPGVAAKLID 97
P +V+VV T G DFT+M S G F LV+ L R R A+L+D
Sbjct: 62 PGRTRDVNVTVVATPAGADFTKMGSLGDAYGFGMGLVAPLHRPKLKKGREDRVQRAELVD 121
Query: 98 CKASKGFYYIEYTLQNPGESRKHLFSAI-GMASNGWYNRLYTVTGQFVEEESEKYGSNIE 156
+Y +EYT + P +F + G+ +G LYT T QF E K+ + IE
Sbjct: 122 AYGKGDYYKVEYTFERPAADIDSVFLVLAGLGYDGRVGHLYTTTAQFPRAEESKWRAQIE 181
Query: 157 KAVASFRF 164
V S ++
Sbjct: 182 AIVDSVKY 189
>gi|412985511|emb|CCO18957.1| predicted protein [Bathycoccus prasinos]
Length = 319
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 16 SYALAANDWQL------GAGEPNGFKSITAFYPQEASSSSV--SVVITGLGPDFTRMESF 67
S + + WQL G+G+ +G + + AF P+ A+ V S+V T +G D+ +M SF
Sbjct: 152 SLSYPTDSWQLLVGETGGSGDRSGSRQVIAFAPKNANPKDVNISLVATPVGADYPKMGSF 211
Query: 68 GKVEAFADTLVSGLDR----SWRRPPGVAAKLIDCKASKGF-YYIEYTLQNP--GESRKH 120
G F +V+ +++ R ++LI+ SKG Y++EY L P G K
Sbjct: 212 GSPFEFGYNMVNPMNKPKAKKGREDEVQYSELIEA-GSKGLNYFVEYELTRPSTGIDSKQ 270
Query: 121 LFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASF 162
L A G+ +G + LY+ T Q + E EK+ IEK + S
Sbjct: 271 LVVA-GIGYDGRVSHLYSTTAQMPKGEEEKWRPEIEKILESL 311
>gi|414869463|tpg|DAA48020.1| TPA: hypothetical protein ZEAMMB73_646546 [Zea mays]
Length = 40
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 32/39 (82%), Gaps = 1/39 (2%)
Query: 127 MASNGWYNRLYTVTGQFV-EEESEKYGSNIEKAVASFRF 164
MA NGWYNRLYTVTGQ++ +++SEKY IEKAV SFR
Sbjct: 1 MAFNGWYNRLYTVTGQYIDDDDSEKYRPQIEKAVGSFRL 39
>gi|145341129|ref|XP_001415667.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575890|gb|ABO93959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 192
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 68/159 (42%), Gaps = 17/159 (10%)
Query: 22 NDWQLGAGEP--------NGFKSITAFYPQEASSSSVSVVI--TGLGPDFTRMESFGKVE 71
+DW GE G + + P A++ V+V I T G DFT+M S G
Sbjct: 29 SDWTTSTGETPASEEILGGGARDLFTISPPGANARDVNVTIVATPAGADFTKMGSLGDAY 88
Query: 72 AFADTLVSGLDRSWRRPPGVA-----AKLIDCKASKGFYYIEYTLQNPGESRKHLFSAI- 125
F LV L++ R G A+LID +Y +EY P +F +
Sbjct: 89 GFGYGLVVPLNKP-RAKKGQEDRIQFAELIDSVGKGDYYKVEYKFAKPSTGINSIFFVLA 147
Query: 126 GMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASFRF 164
G+ +G + LYT T Q+ E +K+ + +E + S +
Sbjct: 148 GLGYDGRVSHLYTATAQYPRAEEDKWRAQVEAIIDSVVY 186
>gi|255542694|ref|XP_002512410.1| conserved hypothetical protein [Ricinus communis]
gi|223548371|gb|EEF49862.1| conserved hypothetical protein [Ricinus communis]
Length = 268
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 77/161 (47%), Gaps = 19/161 (11%)
Query: 12 PAIVSYALAANDWQLGAGEPNGFKSITAFYPQEASSS-SVSVVITGLGPDFTRMESFGKV 70
P ++ A +++ + EP + AF P +S +VSV+++ + PDF+ +E+FG
Sbjct: 119 PPPLNNAKSSDRRRKNINEP-----VVAFGPPGSSGELNVSVIVSPVPPDFS-IEAFGGP 172
Query: 71 EAFADTLVSGLDRSWRRPPGVAAKLI------DCKASKGFYYIEYTLQNPGESRKHLFSA 124
E + +V + S RR P V L+ D + +Y +E+ +++P R ++ A
Sbjct: 173 EEVGEAVVKTITASGRR-PDVKGSLVQSRLREDSLRNVNYYELEFRVESPSFQRHNV--A 229
Query: 125 IGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASFRFI 165
+ +G RLYT+ Q E +K + + SF +
Sbjct: 230 VCCTRDG---RLYTLNAQAPESAWQKVSPDFSRIAESFTLL 267
>gi|226531512|ref|NP_001142529.1| uncharacterized protein LOC100274771 [Zea mays]
gi|195605962|gb|ACG24811.1| hypothetical protein [Zea mays]
Length = 282
Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 69/127 (54%), Gaps = 17/127 (13%)
Query: 29 GEPNGFKSIT----AFYPQEASSS-SVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
G P+G ++I+ AF P ++ +VSV+++ + DF+ +E+FG + + ++ + R
Sbjct: 140 GSPSGPRNISEPVAAFGPPGSNGELNVSVIVSTVPRDFS-IETFGSPKDVGEVVLRRIAR 198
Query: 84 SWRRPPGVAAKLIDCKASKG-----FYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYT 138
+ RR PG+ A LID + +Y +E+ +++P R ++ A+ +G RLYT
Sbjct: 199 T-RRSPGINATLIDAAVREDAGRVKYYKLEFRVESPSFRRHNV--AVCCVRDG---RLYT 252
Query: 139 VTGQFVE 145
+ Q E
Sbjct: 253 MNAQAPE 259
>gi|414878822|tpg|DAA55953.1| TPA: hypothetical protein ZEAMMB73_857058 [Zea mays]
Length = 283
Score = 42.0 bits (97), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 68/127 (53%), Gaps = 17/127 (13%)
Query: 29 GEPNGFKSIT----AFYPQEASSS-SVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
G P+G ++I+ AF P ++ +VSV+++ + DF+ +E+FG + + ++ + R
Sbjct: 141 GSPSGPRNISEPVAAFGPPGSNGELNVSVIVSTVPRDFS-IETFGSPKDVGEVVLRRIAR 199
Query: 84 SWRRPPGVAAKLIDCKASKG-----FYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYT 138
+ RR P + A LID + +Y +E+ +++P R ++ A+ +G RLYT
Sbjct: 200 T-RRSPDINATLIDAAVREDAGRVKYYKLEFRVESPSFRRHNV--AVCCVRDG---RLYT 253
Query: 139 VTGQFVE 145
+ Q E
Sbjct: 254 MNAQAPE 260
>gi|298709150|emb|CBJ31094.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 230
Score = 41.6 bits (96), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 60/139 (43%), Gaps = 11/139 (7%)
Query: 24 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
W+ E +G + I + ++ V T DFT + SFG ++ + T++
Sbjct: 97 WERNDAEISGRRKIVVYTSPTTPGANAFVAYTPARGDFTTLGSFGTLDEVSKTVLP---- 152
Query: 84 SWRRPPGVAAKLIDCKASKGFYYIEYTLQNPGESRKHLFSA-IGMASNGWYNRLYTVTGQ 142
GV++++++ K Y +Y + G KH+ + + G +L T+T Q
Sbjct: 153 ---EASGVSSRMLESYTYKNSYVYDYIVDQEGRPEKHIKTMWVLFPEQG---QLATITAQ 206
Query: 143 FVEEESEKYGSNIEKAVAS 161
E + K G I+ +AS
Sbjct: 207 CNESDYAKVGKTIDSLIAS 225
>gi|242055511|ref|XP_002456901.1| hypothetical protein SORBIDRAFT_03g045040 [Sorghum bicolor]
gi|241928876|gb|EES02021.1| hypothetical protein SORBIDRAFT_03g045040 [Sorghum bicolor]
Length = 279
Score = 41.6 bits (96), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 65/125 (52%), Gaps = 13/125 (10%)
Query: 27 GAGEPNGFKSITAFYPQEASSS-SVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSW 85
G N + + AF P +S +VSV+++ + DF+ +E+FG + + ++ + R+
Sbjct: 139 GKSPRNISEPVAAFGPPGSSGELNVSVIVSPVPRDFS-IEAFGSPKDVGEVVLRRIART- 196
Query: 86 RRPPGVAAKLIDCKA-----SKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVT 140
RR P + A LID S +Y +E+ +++P R ++ A+ A +G +LYT+
Sbjct: 197 RRSPDINATLIDAALREDADSVKYYKLEFRVESPSFQRHNV--AVCCARDG---KLYTMN 251
Query: 141 GQFVE 145
Q E
Sbjct: 252 AQAPE 256
>gi|259490126|ref|NP_001159068.1| uncharacterized protein LOC100304114 [Zea mays]
gi|195650741|gb|ACG44838.1| hypothetical protein [Zea mays]
Length = 282
Score = 40.0 bits (92), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 67/127 (52%), Gaps = 17/127 (13%)
Query: 29 GEPNGFKSIT----AFYPQEASSS-SVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 83
G P+G ++I+ AF P ++ +VSV+++ + DF+ +E+FG + + ++ + R
Sbjct: 140 GSPSGPRNISEPVAAFGPPGSNGELNVSVIVSTVPRDFS-IETFGSPKDVGEVVLRRIAR 198
Query: 84 SWRRPPGVAAKLIDCKASKG-----FYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYT 138
+ RR P + A LID + +Y +E+ +++P R ++ A+ +G RLY
Sbjct: 199 T-RRSPDINATLIDAXVREDAGRVKYYKLEFRVESPSFRRHNV--AVCCVRDG---RLYX 252
Query: 139 VTGQFVE 145
+ Q E
Sbjct: 253 MNAQAPE 259
>gi|33240547|ref|NP_875489.1| photosystem II oxygen evolving complex protein PsbP
[Prochlorococcus marinus subsp. marinus str. CCMP1375]
gi|33238075|gb|AAQ00142.1| Photosystem II protein P PsbP [Prochlorococcus marinus subsp.
marinus str. CCMP1375]
Length = 185
Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 68/161 (42%), Gaps = 23/161 (14%)
Query: 18 ALAANDWQLGAGEPNGFKSIT-------AFYPQEASSSSVSVVITGLGPDFTRMESFGKV 70
A +ND + G P G+ IT F+ S ++S+VI+ + D +E+ G
Sbjct: 34 AFQSNDGRYGFFYPTGWTRITLKGGPEVVFHDLINSDETLSLVISDISAD-VELENMGSP 92
Query: 71 EAFADTLVSGLDRSWRRPPG--VAAKLIDCKA----SKGFYYIEYTLQNPGESRKHLFSA 124
+ L++ L P G A+L+D K+ + FY IEY + P + R L +
Sbjct: 93 SEVGEKLMNNL----LAPNGGEREAELLDAKSRVADNHTFYDIEYLIHLPDKDRHELATV 148
Query: 125 IGMASNGWYNRLYTVTGQFVEEESEKYGSNIEKAVASFRFI 165
+ G LYT + G E+ ++SF F+
Sbjct: 149 V--VDRG---SLYTFAAGTNDSRWNTVGDLFERVISSFVFL 184
>gi|159115643|ref|XP_001708044.1| Hypothetical protein GL50803_9283 [Giardia lamblia ATCC 50803]
gi|157436153|gb|EDO80370.1| hypothetical protein GL50803_9283 [Giardia lamblia ATCC 50803]
Length = 2413
Score = 36.2 bits (82), Expect = 4.3, Method: Composition-based stats.
Identities = 31/125 (24%), Positives = 51/125 (40%), Gaps = 13/125 (10%)
Query: 38 TAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPP--GVAAKL 95
T P ++ V IT LGP + + VEA ++ V+G + S R P +
Sbjct: 969 TTVVPNITRDDAIDVAITELGPQPEELSTELMVEALSNISVTGHNESARTAPLLQIGTPF 1028
Query: 96 IDCKASKGFYYIEYTLQNPGESRKHLFSAIGMASNGWYNRLYTVTGQFVEEESEKYGSNI 155
+ K F++ Y + E K + S L +T Q +E+E Y ++
Sbjct: 1029 LAFYEGKAFHFEPYFARLKKEKLKDILS----------ENLQAITAQ-LEDERRAYEKSL 1077
Query: 156 EKAVA 160
E+ VA
Sbjct: 1078 EQPVA 1082
>gi|443716926|gb|ELU08219.1| hypothetical protein CAPTEDRAFT_197539 [Capitella teleta]
Length = 457
Score = 35.4 bits (80), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 25/42 (59%)
Query: 35 KSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADT 76
K + + P AS S + ++T +GPD TR S G+ AF+DT
Sbjct: 198 KLVLGYLPSPASDSGAASILTLVGPDSTRRMSRGRYSAFSDT 239
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.133 0.399
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,605,749,414
Number of Sequences: 23463169
Number of extensions: 102228856
Number of successful extensions: 228212
Number of sequences better than 100.0: 58
Number of HSP's better than 100.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 228134
Number of HSP's gapped (non-prelim): 59
length of query: 165
length of database: 8,064,228,071
effective HSP length: 126
effective length of query: 39
effective length of database: 9,402,836,073
effective search space: 366710606847
effective search space used: 366710606847
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)