BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028350
         (210 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255539324|ref|XP_002510727.1| conserved hypothetical protein [Ricinus communis]
 gi|223551428|gb|EEF52914.1| conserved hypothetical protein [Ricinus communis]
          Length = 264

 Score =  220 bits (560), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 115/192 (59%), Positives = 143/192 (74%), Gaps = 11/192 (5%)

Query: 15  SPLHTWSQRPHHASFTAFSNNKGTNQYKKQFVFCCKKQEQEDDAR--TLNRFRIEEQDDD 72
           +PLH    +P+  +F + S +   N  +KQ + CC     ++D       RFR++E    
Sbjct: 20  TPLHLL--KPN-LNFLSASPSFRFNSCRKQHLPCCTNSFPDEDQHQPLFCRFRLQEP--- 73

Query: 73  SRTKRREVMFQLAFT-ACSFPAIVSYALAANEDLRVYTDELNKFEISIPQDWQLGAGEPN 131
               RRE +F +AFT   +FP ++S A A  +D R+YTD+ NKF+ISIPQDW++GAGEPN
Sbjct: 74  --YGRREALFSVAFTTGFTFPGLISNAFAEIDDFRLYTDDANKFQISIPQDWRVGAGEPN 131

Query: 132 GFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVA 191
           GFKS+TAFYP+EAS SSVSVVITGLGPDFTRMESFGKVEAFA+TLVSGLDRSW+RPPGVA
Sbjct: 132 GFKSVTAFYPEEASGSSVSVVITGLGPDFTRMESFGKVEAFAETLVSGLDRSWQRPPGVA 191

Query: 192 AKLIDCKASKGI 203
           AKLIDCKA+KGI
Sbjct: 192 AKLIDCKATKGI 203


>gi|224083366|ref|XP_002306998.1| predicted protein [Populus trichocarpa]
 gi|222856447|gb|EEE93994.1| predicted protein [Populus trichocarpa]
          Length = 193

 Score =  210 bits (535), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 101/140 (72%), Positives = 121/140 (86%), Gaps = 4/140 (2%)

Query: 75  TKRREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPN 131
           TKRRE +F + FTA +FPAI S ALAA    ED RVYTD+ NKF+ISIPQ WQ+GAGEP+
Sbjct: 2   TKRREALFNMVFTAFTFPAIASTALAATGVAEDSRVYTDDANKFKISIPQGWQVGAGEPS 61

Query: 132 GFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVA 191
           G+KS+TAFYP+EAS+SSVSVVITGLGPDFTR+ESFGKV+AFA+TLV GLDRSW+RPPGVA
Sbjct: 62  GYKSVTAFYPEEASNSSVSVVITGLGPDFTRLESFGKVDAFAETLVGGLDRSWQRPPGVA 121

Query: 192 AKLIDCKASKGI-CIFFSIQ 210
           AKLID KA+ G+  I +++Q
Sbjct: 122 AKLIDSKAANGLYYIEYTLQ 141


>gi|147809819|emb|CAN64877.1| hypothetical protein VITISV_030793 [Vitis vinifera]
          Length = 247

 Score =  206 bits (525), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 112/208 (53%), Positives = 144/208 (69%), Gaps = 20/208 (9%)

Query: 11  MASISPLHTWSQRPHHASFTAFSNNKGTNQY-----KKQFVFCCKKQEQEDDARTLNRFR 65
           MA   PL     R     F+A+S+ KG         KKQ V  CK + +E++  +   F 
Sbjct: 1   MALYFPLPL---RSGSCDFSAYSSKKGCGSRTGKCGKKQRVVFCKNENKEEEKTS---FG 54

Query: 66  IEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA---ANEDLRVYTDELNKFEISIPQD 122
           I+EQ    R   REV+ Q+ F+  S  AIV  ALA     ED +VY+DE+NKF+I IPQD
Sbjct: 55  IKEQHGGGR---REVVLQIVFSTISLQAIVPNALADTEVPEDFKVYSDEVNKFKIQIPQD 111

Query: 123 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 182
           WQ+G+GEP+GFKS+TAFYP+EAS S+VSVVITGLG DFTR+ESFGKV+AFA+ LV+GLDR
Sbjct: 112 WQVGSGEPSGFKSVTAFYPEEASGSNVSVVITGLGADFTRLESFGKVDAFAENLVNGLDR 171

Query: 183 SWRRPPGVAAKLIDCKASKGICIFFSIQ 210
           SW+RPPG+AAKLIDC+A+ G   F+ I+
Sbjct: 172 SWQRPPGIAAKLIDCRAANG---FYYIE 196


>gi|225457650|ref|XP_002273332.1| PREDICTED: psbP domain-containing protein 3, chloroplastic [Vitis
           vinifera]
 gi|297745601|emb|CBI40766.3| unnamed protein product [Vitis vinifera]
          Length = 253

 Score =  206 bits (525), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 112/208 (53%), Positives = 144/208 (69%), Gaps = 20/208 (9%)

Query: 11  MASISPLHTWSQRPHHASFTAFSNNKGTNQY-----KKQFVFCCKKQEQEDDARTLNRFR 65
           MA   PL     R     F+A+S+ KG         KKQ V  CK + +E++  +   F 
Sbjct: 1   MALYFPLPL---RSGSCDFSAYSSKKGYGSRTGKCGKKQRVVFCKNENKEEEKTS---FG 54

Query: 66  IEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA---ANEDLRVYTDELNKFEISIPQD 122
           I+EQ    R   REV+ Q+ F+  S  AIV  ALA     ED +VY+DE+NKF+I IPQD
Sbjct: 55  IKEQHGGGR---REVVLQMVFSTISLQAIVPNALADTEVPEDFKVYSDEVNKFKIQIPQD 111

Query: 123 WQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 182
           WQ+G+GEP+GFKS+TAFYP+EAS S+VSVVITGLG DFTR+ESFGKV+AFA+ LV+GLDR
Sbjct: 112 WQVGSGEPSGFKSVTAFYPEEASGSNVSVVITGLGADFTRLESFGKVDAFAENLVNGLDR 171

Query: 183 SWRRPPGVAAKLIDCKASKGICIFFSIQ 210
           SW+RPPG+AAKLIDC+A+ G   F+ I+
Sbjct: 172 SWQRPPGIAAKLIDCRAANG---FYYIE 196


>gi|449458059|ref|XP_004146765.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
           [Cucumis sativus]
          Length = 257

 Score =  204 bits (518), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 96/135 (71%), Positives = 111/135 (82%), Gaps = 4/135 (2%)

Query: 73  SRTKRREVMFQLAFTACSFPAIVSYALAAN----EDLRVYTDELNKFEISIPQDWQLGAG 128
           S   RRE+M  + FTA SF  + S ALA +    ED R YTDE NKF + IPQDWQ+G G
Sbjct: 62  SGVNRREIMLGIGFTAFSFQEVGSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNG 121

Query: 129 EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPP 188
           EPNGFKS+TAF+PQE S+S+VSVVI+GLGPD+TRMESFGKVE FADTLVSGLDRSW+RPP
Sbjct: 122 EPNGFKSVTAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPP 181

Query: 189 GVAAKLIDCKASKGI 203
           GVAAKLIDC++SKGI
Sbjct: 182 GVAAKLIDCRSSKGI 196


>gi|449522159|ref|XP_004168095.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
           [Cucumis sativus]
          Length = 188

 Score =  198 bits (503), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 93/127 (73%), Positives = 107/127 (84%), Gaps = 4/127 (3%)

Query: 81  MFQLAFTACSFPAIVSYALAAN----EDLRVYTDELNKFEISIPQDWQLGAGEPNGFKSI 136
           M  + FTA SF  +VS ALA +    ED R YTDE NKF + IPQDWQ+G GEPNGFKS+
Sbjct: 1   MLGIGFTAFSFQEVVSNALAESVVVAEDYRTYTDEANKFSLVIPQDWQVGNGEPNGFKSV 60

Query: 137 TAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLID 196
           TAF+PQE S+S+VSVVI+GLGPD+TRMESFGKVE FADTLVSGLDRSW+RPPGVAAKLID
Sbjct: 61  TAFFPQETSTSNVSVVISGLGPDYTRMESFGKVEEFADTLVSGLDRSWKRPPGVAAKLID 120

Query: 197 CKASKGI 203
           C++SKGI
Sbjct: 121 CRSSKGI 127


>gi|188509930|gb|ACD56619.1| unknown predicted protein [Gossypioides kirkii]
          Length = 274

 Score =  197 bits (500), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 101/153 (66%), Positives = 119/153 (77%), Gaps = 12/153 (7%)

Query: 67  EEQDDDSR---TKRREVMFQLAFTACSFP-AIVSYALAANE-----DLRVYTDELNKFEI 117
           E ++DD     T+RR+ + Q    A SFP  I+S A+A  E     D RVYTDE NKF+I
Sbjct: 46  EMKEDDGHATSTRRRQFILQAPLIAFSFPQLIMSTAIAVAEIDVPQDFRVYTDEQNKFKI 105

Query: 118 SIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLV 177
            IPQDWQ+GAGEPN FKSITAFYP+E ++S+VSV ITGLGPDFTRMESFGKV+AFADTLV
Sbjct: 106 FIPQDWQVGAGEPNNFKSITAFYPEEEANSNVSVAITGLGPDFTRMESFGKVDAFADTLV 165

Query: 178 SGLDRSWRRPPGVAAKLIDCKASKGICIFFSIQ 210
           SGLDRSW+RPPGVAAKLIDCKA+ G   F+ I+
Sbjct: 166 SGLDRSWQRPPGVAAKLIDCKAANG---FYYIE 195


>gi|21537211|gb|AAM61552.1| thylakoid lumen protein, chloroplast precursor [Arabidopsis
           thaliana]
          Length = 247

 Score =  196 bits (497), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/149 (65%), Positives = 114/149 (76%), Gaps = 4/149 (2%)

Query: 57  DARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA---ANEDLRVYTDELN 113
           D+    + RI  +D     KRR+VM Q+A +    P  +S A A   A+E  RVYTDE N
Sbjct: 38  DSSNEEQHRISSRDHVG-MKRRDVMLQIASSVFFLPLAISPAFAETNASEAFRVYTDETN 96

Query: 114 KFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFA 173
           KFEISIPQDWQ+G  EPNGFKSITAFYPQE S+S+VS+ ITGLGPDFTRMESFGKVEAFA
Sbjct: 97  KFEISIPQDWQVGQAEPNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFA 156

Query: 174 DTLVSGLDRSWRRPPGVAAKLIDCKASKG 202
           +TLVSGLDRSW++P GV AKLID +ASKG
Sbjct: 157 ETLVSGLDRSWQKPVGVTAKLIDSRASKG 185


>gi|18411110|ref|NP_565131.1| PsbP domain-containing protein 3 [Arabidopsis thaliana]
 gi|18203439|sp|Q9S720.2|PPD3_ARATH RecName: Full=PsbP domain-containing protein 3, chloroplastic;
           AltName: Full=OEC23-like protein 2; Flags: Precursor
 gi|32815945|gb|AAP88357.1| At1g76450 [Arabidopsis thaliana]
 gi|110742837|dbj|BAE99317.1| hypothetical protein [Arabidopsis thaliana]
 gi|332197723|gb|AEE35844.1| PsbP domain-containing protein 3 [Arabidopsis thaliana]
          Length = 247

 Score =  196 bits (497), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/149 (65%), Positives = 114/149 (76%), Gaps = 4/149 (2%)

Query: 57  DARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA---ANEDLRVYTDELN 113
           D+    + RI  +D     KRR+VM Q+A +    P  +S A A   A+E  RVYTDE N
Sbjct: 38  DSSNEEQHRISSRDHVG-MKRRDVMLQIASSVFFLPLAISPAFAETNASEAFRVYTDETN 96

Query: 114 KFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFA 173
           KFEISIPQDWQ+G  EPNGFKSITAFYPQE S+S+VS+ ITGLGPDFTRMESFGKVEAFA
Sbjct: 97  KFEISIPQDWQVGQAEPNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAFA 156

Query: 174 DTLVSGLDRSWRRPPGVAAKLIDCKASKG 202
           +TLVSGLDRSW++P GV AKLID +ASKG
Sbjct: 157 ETLVSGLDRSWQKPVGVTAKLIDSRASKG 185


>gi|297839519|ref|XP_002887641.1| hypothetical protein ARALYDRAFT_476802 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333482|gb|EFH63900.1| hypothetical protein ARALYDRAFT_476802 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 225

 Score =  190 bits (483), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 98/150 (65%), Positives = 113/150 (75%), Gaps = 5/150 (3%)

Query: 57  DARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA---ANEDLRVYTDELN 113
           D+    + RI  +D     KRRE M Q+A +    P  VS A A   A+E  RVYTDE N
Sbjct: 15  DSSNEEQHRISSRDHVG-IKRREAMLQIASSVFFLPLAVSPAFAETNASEAFRVYTDEAN 73

Query: 114 KFEISIPQ-DWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAF 172
           KFEISIPQ DWQ+G  EPNGFKSITAFYPQE S+S+VS+ ITGLGPDFTRMESFGKVEAF
Sbjct: 74  KFEISIPQEDWQVGQAEPNGFKSITAFYPQETSTSNVSIAITGLGPDFTRMESFGKVEAF 133

Query: 173 ADTLVSGLDRSWRRPPGVAAKLIDCKASKG 202
           A+TLVSGLDRSW++P GV AKLID ++SKG
Sbjct: 134 AETLVSGLDRSWQKPAGVTAKLIDSRSSKG 163


>gi|388507848|gb|AFK41990.1| unknown [Medicago truncatula]
          Length = 241

 Score =  181 bits (458), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 85/129 (65%), Positives = 105/129 (81%), Gaps = 3/129 (2%)

Query: 77  RREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPNGF 133
           RR+++   +  A +  +  S ALA N   ED  +YTD+ NKF+I IPQ+WQ+G GE  GF
Sbjct: 51  RRQLILHTSTAAIAASSTDSNALALNDVSEDFSIYTDDENKFKIDIPQEWQIGTGESAGF 110

Query: 134 KSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAK 193
           KS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGLDRSW++PPGVAAK
Sbjct: 111 KSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGLDRSWKKPPGVAAK 170

Query: 194 LIDCKASKG 202
           LIDCK+SKG
Sbjct: 171 LIDCKSSKG 179


>gi|357467179|ref|XP_003603874.1| PsbP domain-containing protein [Medicago truncatula]
 gi|355492922|gb|AES74125.1| PsbP domain-containing protein [Medicago truncatula]
          Length = 241

 Score =  179 bits (455), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 85/129 (65%), Positives = 105/129 (81%), Gaps = 3/129 (2%)

Query: 77  RREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPNGF 133
           RR+++   +  A +  +  S ALA N   ED  +YTD+ NKF+I IPQ+WQ+G GE  GF
Sbjct: 51  RRQLILYTSTAAIAASSTDSNALALNDVSEDFSIYTDDENKFKIDIPQEWQIGTGESAGF 110

Query: 134 KSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAK 193
           KS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGLDRSW++PPGVAAK
Sbjct: 111 KSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGLDRSWKKPPGVAAK 170

Query: 194 LIDCKASKG 202
           LIDCK+SKG
Sbjct: 171 LIDCKSSKG 179


>gi|388521577|gb|AFK48850.1| unknown [Lotus japonicus]
          Length = 250

 Score =  179 bits (454), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 89/139 (64%), Positives = 113/139 (81%), Gaps = 5/139 (3%)

Query: 77  RREVMFQLAFTA-CSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPNG 132
           RR+++F  +  A  + P+ V  ALA N   ED+R+Y D+ NKF+I IPQDW++G G+ +G
Sbjct: 59  RRQLVFYTSLAAFAAAPSTVLKALALNDVVEDVRIYIDDENKFKIEIPQDWEVGTGDSSG 118

Query: 133 FKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAA 192
           FKS+TAFYP+EASSS+VSV ITGLGPDFT+MESFGKV+ FA+TLVSGLDRSWR+PPGVAA
Sbjct: 119 FKSLTAFYPKEASSSNVSVAITGLGPDFTKMESFGKVDEFAETLVSGLDRSWRKPPGVAA 178

Query: 193 KLIDCKASKGI-CIFFSIQ 210
           KLI+ K SKGI  I +S+Q
Sbjct: 179 KLINSKPSKGIYYIEYSLQ 197


>gi|388515155|gb|AFK45639.1| unknown [Medicago truncatula]
          Length = 241

 Score =  177 bits (449), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 84/129 (65%), Positives = 104/129 (80%), Gaps = 3/129 (2%)

Query: 77  RREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPNGF 133
           RR+++   +  A +  +  S ALA N   ED  +Y D+ NKF+I IPQ+WQ+G GE  GF
Sbjct: 51  RRQLILYTSTAAIAASSTDSNALALNDVSEDFSIYADDENKFKIDIPQEWQIGTGESAGF 110

Query: 134 KSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAK 193
           KS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGLDRSW++PPGVAAK
Sbjct: 111 KSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGLDRSWKKPPGVAAK 170

Query: 194 LIDCKASKG 202
           LIDCK+SKG
Sbjct: 171 LIDCKSSKG 179


>gi|357467177|ref|XP_003603873.1| PsbP domain-containing protein [Medicago truncatula]
 gi|355492921|gb|AES74124.1| PsbP domain-containing protein [Medicago truncatula]
          Length = 253

 Score =  177 bits (448), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 84/128 (65%), Positives = 104/128 (81%), Gaps = 3/128 (2%)

Query: 77  RREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPNGF 133
           RR+++   +  A +  +  S ALA N   ED  +YTD+ NKF+I IPQ+WQ+G GE  GF
Sbjct: 51  RRQLILYTSTAAIAASSTDSNALALNDVSEDFSIYTDDENKFKIDIPQEWQIGTGESAGF 110

Query: 134 KSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAK 193
           KS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLVSGLDRSW++PPGVAAK
Sbjct: 111 KSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVSGLDRSWKKPPGVAAK 170

Query: 194 LIDCKASK 201
           LIDCK+SK
Sbjct: 171 LIDCKSSK 178


>gi|356508481|ref|XP_003522985.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
           [Glycine max]
          Length = 244

 Score =  176 bits (445), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 92/142 (64%), Positives = 109/142 (76%), Gaps = 8/142 (5%)

Query: 75  TKRREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAG--E 129
             RR+++        +F   V  ALA N   ED+RVYTD+ NKF+I IPQ+WQ+G G  E
Sbjct: 52  VNRRQLILHTPVAVAAFA--VPNALALNDVSEDVRVYTDDENKFKIEIPQEWQVGTGDGE 109

Query: 130 PNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPG 189
            +GFKSITAFYP  AS+S+VSVVITGLGPDFTRMESFGKV+ FA TLVSGLDRSWR+PPG
Sbjct: 110 SSGFKSITAFYPTVASNSNVSVVITGLGPDFTRMESFGKVDEFAQTLVSGLDRSWRKPPG 169

Query: 190 VAAKLIDCKASKGI-CIFFSIQ 210
           VAAKLIDCK+S GI  I +S+Q
Sbjct: 170 VAAKLIDCKSSDGIYYIEYSLQ 191


>gi|356517170|ref|XP_003527262.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
           [Glycine max]
          Length = 248

 Score =  172 bits (437), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 88/141 (62%), Positives = 111/141 (78%), Gaps = 3/141 (2%)

Query: 66  IEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA-ANEDLRVYTDELNKFEISIPQDWQ 124
           IEE++  +  +R+ ++      A +F    + AL   +ED+RVYTD+ NKF+I IP++WQ
Sbjct: 47  IEEEEGRAVNRRQLILHTPVAAAAAFAVPNALALNDVSEDVRVYTDDENKFKIEIPEEWQ 106

Query: 125 LGAG--EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDR 182
           +G G  E +GFKSITAFYP +AS+S+VSVVITGLGPDFTRMESFGKV+ FA TLVSGLDR
Sbjct: 107 VGTGDGESSGFKSITAFYPTQASNSNVSVVITGLGPDFTRMESFGKVDEFAQTLVSGLDR 166

Query: 183 SWRRPPGVAAKLIDCKASKGI 203
           SWR+PPGVAAKLIDCK+S GI
Sbjct: 167 SWRKPPGVAAKLIDCKSSNGI 187


>gi|255646871|gb|ACU23906.1| unknown [Glycine max]
          Length = 244

 Score =  172 bits (437), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 91/142 (64%), Positives = 108/142 (76%), Gaps = 8/142 (5%)

Query: 75  TKRREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAG--E 129
             RR+++        +F   V  ALA N   ED+RVYTD+ NKF+I IPQ+WQ+G G  E
Sbjct: 52  VNRRQLILHTPVAVAAF--AVPNALALNDVSEDVRVYTDDENKFKIEIPQEWQVGTGDGE 109

Query: 130 PNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPG 189
            +GFKSITAFYP  AS+S+VSVVITG GPDFTRMESFGKV+ FA TLVSGLDRSWR+PPG
Sbjct: 110 SSGFKSITAFYPTVASNSNVSVVITGPGPDFTRMESFGKVDEFAQTLVSGLDRSWRKPPG 169

Query: 190 VAAKLIDCKASKGI-CIFFSIQ 210
           VAAKLIDCK+S GI  I +S+Q
Sbjct: 170 VAAKLIDCKSSDGIYYIEYSLQ 191


>gi|115477246|ref|NP_001062219.1| Os08g0512500 [Oryza sativa Japonica Group]
 gi|42408803|dbj|BAD10064.1| thylakoid lumen protein, chloroplast precursor-like [Oryza sativa
           Japonica Group]
 gi|113624188|dbj|BAF24133.1| Os08g0512500 [Oryza sativa Japonica Group]
 gi|215686998|dbj|BAG90812.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218201441|gb|EEC83868.1| hypothetical protein OsI_29855 [Oryza sativa Indica Group]
 gi|222640852|gb|EEE68984.1| hypothetical protein OsJ_27910 [Oryza sativa Japonica Group]
          Length = 242

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 69/108 (63%), Positives = 88/108 (81%), Gaps = 3/108 (2%)

Query: 103 EDLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTR 162
           E  R Y DE NKF I+IPQDW +GAGE +GFKS+TAFYP + + S+VSV ITG+GPDFT 
Sbjct: 79  EGFRTYEDEANKFSIAIPQDWLIGAGEVSGFKSVTAFYPDQVADSNVSVAITGIGPDFTS 138

Query: 163 MESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGICIFFSIQ 210
           ++SFG V+AFA+TLV+GLDRSW+RPPGVAAKLI+ +A+ G   F+ I+
Sbjct: 139 LKSFGDVDAFAETLVNGLDRSWKRPPGVAAKLINSRAANG---FYYIE 183


>gi|357141874|ref|XP_003572377.1| PREDICTED: psbP domain-containing protein 3, chloroplastic-like
           [Brachypodium distachyon]
          Length = 352

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/143 (51%), Positives = 100/143 (69%), Gaps = 8/143 (5%)

Query: 76  KRREVMFQLAFTACSFPAIVS-------YALAANEDLRVYTDELNKFEISIPQDWQLGAG 128
           +RR+ +  + F+A +   + S        A    E    Y DE NKF I +PQ WQ+GAG
Sbjct: 155 RRRDALLGVVFSAGTATLLASPAGALAEAAAEVQEGFSEYQDEANKFSIVVPQGWQMGAG 214

Query: 129 EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPP 188
           E +GFK++TAF+P +A+ SSVSVVITG+GPDFT ++SFG V+AFA+ LV+GLDRSW+RP 
Sbjct: 215 EGSGFKNVTAFFPDKAADSSVSVVITGIGPDFTSLKSFGDVDAFAENLVTGLDRSWQRPA 274

Query: 189 GVAAKLIDCKASKGI-CIFFSIQ 210
           GV AKLID +AS G+  I +++Q
Sbjct: 275 GVTAKLIDSRASNGMYYIEYTLQ 297


>gi|356577616|ref|XP_003556920.1| PREDICTED: LOW QUALITY PROTEIN: psbP domain-containing protein 3,
           chloroplastic-like [Glycine max]
          Length = 113

 Score =  146 bits (369), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 72/102 (70%), Positives = 84/102 (82%), Gaps = 2/102 (1%)

Query: 99  LAANEDLRVYTDELNKFEISIPQDWQLGA--GEPNGFKSITAFYPQEASSSSVSVVITGL 156
           +  +ED+RVYT+  NKF+I IPQ+WQ+G   GE +GFKSITAFYP  AS+S+VSVVITGL
Sbjct: 12  IDVSEDVRVYTENENKFKIEIPQEWQVGTRDGESSGFKSITAFYPTVASNSNVSVVITGL 71

Query: 157 GPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCK 198
           GP+FTRMESFGKV  FA TLVSGLD S R+P GVAAKLI CK
Sbjct: 72  GPNFTRMESFGKVGEFAQTLVSGLDGSXRKPSGVAAKLIHCK 113


>gi|226496587|ref|NP_001147730.1| LOC100281340 [Zea mays]
 gi|195613364|gb|ACG28512.1| thylakoid lumen protein [Zea mays]
 gi|224034069|gb|ACN36110.1| unknown [Zea mays]
 gi|414869459|tpg|DAA48016.1| TPA: Thylakoid lumen protein isoform 1 [Zea mays]
 gi|414869460|tpg|DAA48017.1| TPA: Thylakoid lumen protein isoform 2 [Zea mays]
          Length = 256

 Score =  145 bits (367), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 74/161 (45%), Positives = 109/161 (67%), Gaps = 8/161 (4%)

Query: 51  KQEQEDDARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIV-SYALA------ANE 103
           ++ Q+  A T+   R    ++    +R  ++  + F+A + P +  + ALA      + E
Sbjct: 34  RRLQQAAATTVCHCRSARVEEGLLGRRDALLLGIVFSAATPPLLAPAGALADEATAESQE 93

Query: 104 DLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDFTR 162
               Y DE NKF I +PQ W +GAGE +G KS+TAFYP++A++ S+VSV ITG+GPDFT 
Sbjct: 94  GFTTYEDEANKFSIQVPQGWLVGAGEASGIKSVTAFYPEQAATDSNVSVAITGIGPDFTS 153

Query: 163 MESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGI 203
           ++SFG V+AFA+ LV+GLDRSW+RPPG+AAKLID +A+ G+
Sbjct: 154 LKSFGDVDAFAEGLVNGLDRSWQRPPGLAAKLIDSRAANGL 194


>gi|326501564|dbj|BAK02571.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 167

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 65/111 (58%), Positives = 88/111 (79%), Gaps = 1/111 (0%)

Query: 101 ANEDLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDF 160
           A E    Y DE NKF ++IPQ WQ+GAGE +GFK++TAF+P++  +SSVSVVITG+GPDF
Sbjct: 2   AQEGFTAYEDEANKFTLAIPQGWQVGAGERSGFKNVTAFFPEQNPNSSVSVVITGIGPDF 61

Query: 161 TRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGI-CIFFSIQ 210
           T ++SFG V+ FA+ LV+GLDRSW+RP G+ AKLID KA+ G+  I +++Q
Sbjct: 62  TSLKSFGNVDEFAENLVTGLDRSWQRPAGLTAKLIDSKAANGLYYIEYTLQ 112


>gi|326518660|dbj|BAJ88359.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 287

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 74/144 (51%), Positives = 102/144 (70%), Gaps = 9/144 (6%)

Query: 76  KRREVMFQL---AFTACSFPAIV-----SYALAANEDLRVYTDELNKFEISIPQDWQLGA 127
           KRR+ +  +   A  A S P +V     + A  A E    Y DE NKF ++IPQ WQ+GA
Sbjct: 89  KRRDALLGVLLSATAASSAPLLVPAEAFAEAAEAQEGFTAYEDEANKFTLAIPQGWQVGA 148

Query: 128 GEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRP 187
           GE +GFK++TAF+P++  +SSVSVVITG+GPDFT ++SFG V+ FA+ LV+GLDRSW+RP
Sbjct: 149 GERSGFKNVTAFFPEQNPNSSVSVVITGIGPDFTSLKSFGNVDEFAENLVTGLDRSWQRP 208

Query: 188 PGVAAKLIDCKASKGI-CIFFSIQ 210
            G+ AKLID KA+ G+  I +++Q
Sbjct: 209 AGLTAKLIDSKAANGLYYIEYTLQ 232


>gi|326512572|dbj|BAJ99641.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 244

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 74/144 (51%), Positives = 102/144 (70%), Gaps = 9/144 (6%)

Query: 76  KRREVMFQL---AFTACSFPAIV-----SYALAANEDLRVYTDELNKFEISIPQDWQLGA 127
           KRR+ +  +   A  A S P +V     + A  A E    Y DE NKF ++IPQ WQ+GA
Sbjct: 46  KRRDALLGVLLSATAASSAPLLVPAEAFAEAAEAQEGFTAYEDEANKFTLAIPQGWQVGA 105

Query: 128 GEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRP 187
           GE +GFK++TAF+P++  +SSVSVVITG+GPDFT ++SFG V+ FA+ LV+GLDRSW+RP
Sbjct: 106 GERSGFKNVTAFFPEQNPNSSVSVVITGIGPDFTSLKSFGNVDEFAENLVTGLDRSWQRP 165

Query: 188 PGVAAKLIDCKASKGI-CIFFSIQ 210
            G+ AKLID KA+ G+  I +++Q
Sbjct: 166 AGLTAKLIDSKAANGLYYIEYTLQ 189


>gi|116784465|gb|ABK23352.1| unknown [Picea sitchensis]
          Length = 256

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 69/143 (48%), Positives = 99/143 (69%), Gaps = 5/143 (3%)

Query: 73  SRTKRREVMFQLAFTACSFPAIVSYALAA----NEDLRVYTDELNKFEISIPQDWQLGAG 128
           SR  RR+ + Q+A    + P  VS A AA     ED  +Y DE +KF + +P+DW  G G
Sbjct: 61  SRNGRRQAICQIAALFSAIPCTVSAARAAETELQEDYELYKDETDKFSLLVPRDWIKGEG 120

Query: 129 EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPP 188
           + +G +++TAFYP+    S+V+V+ITGL  D+T+MESFG V+AFA+TLV+ LDRSW+RPP
Sbjct: 121 KTDGQRAVTAFYPESGIVSNVNVIITGLSADYTKMESFGTVDAFAETLVNSLDRSWKRPP 180

Query: 189 GVAAKLIDCKASKGI-CIFFSIQ 210
           G AAKL++ K+  G+  I +S+Q
Sbjct: 181 GQAAKLLNAKSKNGLYYIEYSLQ 203


>gi|242082241|ref|XP_002445889.1| hypothetical protein SORBIDRAFT_07g027500 [Sorghum bicolor]
 gi|241942239|gb|EES15384.1| hypothetical protein SORBIDRAFT_07g027500 [Sorghum bicolor]
          Length = 253

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 62/104 (59%), Positives = 84/104 (80%), Gaps = 1/104 (0%)

Query: 101 ANEDLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPD 159
           + E    Y DE NKF I +PQ W +GAGE +G KS+TAFYP++A++ S+VSV ITG+GPD
Sbjct: 87  SQEGFTTYEDEANKFSIQVPQGWLVGAGEASGIKSVTAFYPEQAAADSNVSVAITGIGPD 146

Query: 160 FTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGI 203
           FT ++SFG V++FA+ LV+GLDRSW+RPPG+AAKLID +A+ G+
Sbjct: 147 FTSLKSFGDVDSFAEGLVNGLDRSWQRPPGLAAKLIDSRAANGL 190


>gi|116782591|gb|ABK22563.1| unknown [Picea sitchensis]
          Length = 208

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 68/143 (47%), Positives = 98/143 (68%), Gaps = 5/143 (3%)

Query: 73  SRTKRREVMFQLAFTACSFPAIVSYALAA----NEDLRVYTDELNKFEISIPQDWQLGAG 128
           SR  RR+ + Q+A    + P  VS A AA     ED  +Y DE +KF + +P+DW  G G
Sbjct: 61  SRNGRRQAICQIAALFSAIPCTVSAARAAETELQEDYELYKDETDKFSLLVPRDWIKGEG 120

Query: 129 EPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPP 188
           + +G +++TAFYP+    S+V+V+ITGL  D+T+MES G V+AFA+TLV+ LDRSW+RPP
Sbjct: 121 KTDGQRAVTAFYPESGIVSNVNVIITGLSADYTKMESLGTVDAFAETLVNSLDRSWKRPP 180

Query: 189 GVAAKLIDCKASKGI-CIFFSIQ 210
           G AAKL++ K+  G+  I +S+Q
Sbjct: 181 GQAAKLLNAKSKNGLYYIEYSLQ 203


>gi|25373339|pir||C96792 unknown protein F14G6.5 [imported] - Arabidopsis thaliana
 gi|6554474|gb|AAF16656.1|AC012394_5 unknown protein; 20843-19352 [Arabidopsis thaliana]
 gi|12323974|gb|AAG51945.1|AC015450_6 unknown protein; 20920-22411 [Arabidopsis thaliana]
          Length = 220

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 77/149 (51%), Positives = 90/149 (60%), Gaps = 31/149 (20%)

Query: 57  DARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALA---ANEDLRVYTDELN 113
           D+    + RI  +D     KRR+VM Q+A +    P  +S A A   A+E  RVYTDE N
Sbjct: 38  DSSNEEQHRISSRDHVG-MKRRDVMLQIASSVFFLPLAISPAFAETNASEAFRVYTDETN 96

Query: 114 KFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFA 173
           KFEISIPQ                           VS+ ITGLGPDFTRMESFGKVEAFA
Sbjct: 97  KFEISIPQ---------------------------VSIAITGLGPDFTRMESFGKVEAFA 129

Query: 174 DTLVSGLDRSWRRPPGVAAKLIDCKASKG 202
           +TLVSGLDRSW++P GV AKLID +ASKG
Sbjct: 130 ETLVSGLDRSWQKPVGVTAKLIDSRASKG 158


>gi|357467181|ref|XP_003603875.1| PsbP domain-containing protein [Medicago truncatula]
 gi|355492923|gb|AES74126.1| PsbP domain-containing protein [Medicago truncatula]
          Length = 174

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 65/111 (58%), Positives = 84/111 (75%), Gaps = 3/111 (2%)

Query: 76  KRREVMFQLAFTACSFPAIVSYALAAN---EDLRVYTDELNKFEISIPQDWQLGAGEPNG 132
            RR+++   +  A +  +  S ALA N   ED  +YTD+ NKF+I IPQ+WQ+G GE  G
Sbjct: 50  NRRQLILYTSTAAIAASSTDSNALALNDVSEDFSIYTDDENKFKIDIPQEWQIGTGESAG 109

Query: 133 FKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRS 183
           FKS+TAFYP+E S+S+VSVVITG+GPDFT+MESFGKVE FADTLV  LD++
Sbjct: 110 FKSLTAFYPKEQSNSNVSVVITGVGPDFTKMESFGKVEEFADTLVLFLDQN 160


>gi|168003549|ref|XP_001754475.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694577|gb|EDQ80925.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 255

 Score =  113 bits (283), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 82/127 (64%), Gaps = 3/127 (2%)

Query: 76  KRREVMFQ-LAFTACSF--PAIVSYALAANEDLRVYTDELNKFEISIPQDWQLGAGEPNG 132
           +RREV+ Q L   + SF  PA  +    A  +   Y D  +KF + +PQ W  G G+ +G
Sbjct: 64  QRREVIGQALLAMSMSFAPPARSATDTDAATEFTTYEDAADKFTLLVPQAWNRGEGKTSG 123

Query: 133 FKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAA 192
            + +TAFYP +   ++V++VITGLG DFT + SFG  + FA+ LV+ LDRSW++PPG  A
Sbjct: 124 QRKVTAFYPADGGLTNVNIVITGLGADFTSLGSFGTADNFAENLVNSLDRSWQKPPGQKA 183

Query: 193 KLIDCKA 199
           +L+DCK+
Sbjct: 184 RLVDCKS 190


>gi|414869461|tpg|DAA48018.1| TPA: hypothetical protein ZEAMMB73_646546 [Zea mays]
 gi|414869462|tpg|DAA48019.1| TPA: hypothetical protein ZEAMMB73_646546 [Zea mays]
          Length = 178

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/135 (41%), Positives = 85/135 (62%), Gaps = 8/135 (5%)

Query: 51  KQEQEDDARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIV-SYALA------ANE 103
           ++ Q+  A T+   R    ++    +R  ++  + F+A + P +  + ALA      + E
Sbjct: 34  RRLQQAAATTVCHCRSARVEEGLLGRRDALLLGIVFSAATPPLLAPAGALADEATAESQE 93

Query: 104 DLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDFTR 162
               Y DE NKF I +PQ W +GAGE +G KS+TAFYP++A++ S+VSV ITG+GPDFT 
Sbjct: 94  GFTTYEDEANKFSIQVPQGWLVGAGEASGIKSVTAFYPEQAATDSNVSVAITGIGPDFTS 153

Query: 163 MESFGKVEAFADTLV 177
           ++SFG V+AFA+ LV
Sbjct: 154 LKSFGDVDAFAEGLV 168


>gi|215274041|gb|ACJ64918.1| unknown [Arachis diogoi]
          Length = 110

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/57 (75%), Positives = 51/57 (89%), Gaps = 1/57 (1%)

Query: 155 GLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGI-CIFFSIQ 210
           G+GPDFT+MESFGKVE FA+TL+ GLDRSW+RPPGVAAKLIDCK+SKG   I +S+Q
Sbjct: 1   GVGPDFTKMESFGKVEEFAETLIGGLDRSWQRPPGVAAKLIDCKSSKGFYYIEYSLQ 57


>gi|302818180|ref|XP_002990764.1| hypothetical protein SELMODRAFT_448162 [Selaginella moellendorffii]
 gi|300141502|gb|EFJ08213.1| hypothetical protein SELMODRAFT_448162 [Selaginella moellendorffii]
          Length = 221

 Score = 93.2 bits (230), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 77/111 (69%), Gaps = 3/111 (2%)

Query: 101 ANEDLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPDF 160
           A ++ +VY +E +KF +++P+DW  G G+  G + + AF+P +A+  +V+V+IT LG DF
Sbjct: 58  AADEYQVY-EEQDKFSLTVPKDWIKGEGK-AGSRRVVAFHPSKATFPNVNVIITNLGADF 115

Query: 161 TRMESFGKVEAFADTLVSGLDRSWRRPPGVAAKLIDCKASKGICIF-FSIQ 210
           T + S G V+AFA ++V  +DRS++RPPG AA+L++  +  G+    +++Q
Sbjct: 116 TGIGSLGSVDAFAASVVGSMDRSYKRPPGTAARLVNAVSRNGMYYLDYTVQ 166


>gi|413953068|gb|AFW85717.1| hypothetical protein ZEAMMB73_055926 [Zea mays]
          Length = 277

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/77 (55%), Positives = 57/77 (74%), Gaps = 1/77 (1%)

Query: 102 NEDLRVYTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASS-SSVSVVITGLGPDF 160
           +E    Y DE NKF I +PQ W +GAGE +G KS+TAFYP++A++ S+VSV I  +GPDF
Sbjct: 124 DEGFTTYEDEANKFNIQVPQGWLVGAGEASGIKSVTAFYPEQAATDSNVSVAIIEIGPDF 183

Query: 161 TRMESFGKVEAFADTLV 177
           T ++SFG V AFA+ LV
Sbjct: 184 TSLKSFGDVAAFAEGLV 200


>gi|255083562|ref|XP_002504767.1| oxygen-evolving enhancer PsbP [Micromonas sp. RCC299]
 gi|226520035|gb|ACO66025.1| oxygen-evolving enhancer PsbP [Micromonas sp. RCC299]
          Length = 248

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 83/162 (51%), Gaps = 16/162 (9%)

Query: 61  LNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALAANED------LRVYTDELNK 114
           ++R R E   D S  +RRE+   LA  A S  +I ++   A  D       +VY DE +K
Sbjct: 32  VSRTRAEASADGS-LERREL---LAGAATSLVSIAAFPRVARADEPATTLTQVYNDEEDK 87

Query: 115 FEISIPQDWQLGAGE----PNGFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVE 170
           + +++P DW+   G+    P   + + AFYP      +V+VV T LG D+ +M SFG  +
Sbjct: 88  YSLAVPSDWERALGDTSPNPQSTRKVIAFYPPGQPEINVNVVATALGADYPKMGSFGSPD 147

Query: 171 AFADTLVSGLDRSW-RRPPGVAAKLIDCK-ASKGICIFFSIQ 210
            FA  + +G+ R   R+ P   + ++D K A     I ++++
Sbjct: 148 EFAYGVAAGMTRPKPRQGPKQFSYVVDAKNAGDRYVIEYTVE 189


>gi|307103403|gb|EFN51663.1| hypothetical protein CHLNCDRAFT_139877 [Chlorella variabilis]
          Length = 295

 Score = 66.6 bits (161), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 41/118 (34%), Positives = 55/118 (46%), Gaps = 26/118 (22%)

Query: 107 VYTDELNKFEISIPQDWQLGAG----------------EPNGFKSITAFYPQEASSSSVS 150
           VYTD  +KF I +P  W L  G                   G + + AF P++    SV+
Sbjct: 107 VYTDPEDKFSIEVPSGWALATGAFGEEGTLTTNQARFSNAAGLRRVVAFLPEDKPEVSVA 166

Query: 151 VVITGLGPDFTRMESFGKVEAFADTLVSGLDRS-------WRR---PPGVAAKLIDCK 198
           V I  LG D+T + SFG  + FA  +V+ +D S       WRR   PP   AKL+D K
Sbjct: 167 VTIQFLGADYTGLGSFGTAQDFATGVVNKMDNSYILKLPEWRRAKEPPVQVAKLLDVK 224


>gi|303283586|ref|XP_003061084.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457435|gb|EEH54734.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 265

 Score = 56.6 bits (135), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)

Query: 100 AANEDLRVYTDELNKFEISIPQDWQLGAG----EPNGFKSITAFYPQEASSSSVSVVITG 155
           A   + + Y D  +K+  S+P DW+   G     P   + +TA++P      +V+VV T 
Sbjct: 90  AEGAETKRYVDAEDKYSFSVPADWEQAIGTTDDNPQSSRRVTAYFPPGDPDINVNVVCTA 149

Query: 156 LGPDFTRMESFGKVEAFADTLVSGLDR 182
           LG D+ +M SFG  + FA  + +G+ R
Sbjct: 150 LGADYPKMGSFGSPDEFAFGVAAGMTR 176


>gi|412985511|emb|CCO18957.1| predicted protein [Bathycoccus prasinos]
          Length = 319

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 67/140 (47%), Gaps = 33/140 (23%)

Query: 76  KRREVMFQLAFTACSFPAIV----------SYALAANED--------------LRVYTDE 111
           +R+ +M     TA +F ++V           YA AA ED                 YTDE
Sbjct: 88  RRKAMMVSATTTAFAFASMVLSMDDTNKNNYYAFAAEEDESSTKMMNTNTNKNTTTYTDE 147

Query: 112 LNKFEISIPQD-WQL------GAGEPNGFKSITAFYPQEASSSSV--SVVITGLGPDFTR 162
           + KF +S P D WQL      G+G+ +G + + AF P+ A+   V  S+V T +G D+ +
Sbjct: 148 VLKFSLSYPTDSWQLLVGETGGSGDRSGSRQVIAFAPKNANPKDVNISLVATPVGADYPK 207

Query: 163 MESFGKVEAFADTLVSGLDR 182
           M SFG    F   +V+ +++
Sbjct: 208 MGSFGSPFEFGYNMVNPMNK 227


>gi|159480798|ref|XP_001698469.1| hypothetical protein CHLREDRAFT_205923 [Chlamydomonas reinhardtii]
 gi|158282209|gb|EDP07962.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 258

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 20/110 (18%)

Query: 108 YTDELNKFEISIPQDWQLGAGEPNGFKSIT---------AFYPQEASSSSVSVVIT--GL 156
           YTD  + F + IP +W  G G+ +G  S +         A++P+  S   V+V IT   +
Sbjct: 77  YTDPQDAFTLKIPANWGFGEGQLSGNSSFSGASGARRTLAWFPENVSPRDVNVTITVTNV 136

Query: 157 GPDFTRMESFGKVEAFADTLVSGLDRSW---------RRPPGVAAKLIDC 197
             +FT++ SFG    FA  LV+  DRS+         R  P + AKLID 
Sbjct: 137 SVEFTKLGSFGTPLQFASNLVNSQDRSYMLRGPEWARRNEPIMVAKLIDA 186


>gi|384245846|gb|EIE19338.1| hypothetical protein COCSUDRAFT_58632 [Coccomyxa subellipsoidea
           C-169]
          Length = 191

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 52/85 (61%), Gaps = 6/85 (7%)

Query: 132 GFKSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADTLVSGLDRSWRRPPGVA 191
           G +   A+YP+  + ++V+V++T +G D+T + SFG  ++F + L++ LDRS+    G+ 
Sbjct: 49  GTRRALAWYPEGGTDTNVTVLVTNVGADYTALGSFGSADSFGENLIASLDRSFLLRGGMN 108

Query: 192 -----AKLIDCKASKGICIF-FSIQ 210
                AKL+D K+  G+    +++Q
Sbjct: 109 RPQSRAKLLDAKSRSGMYFLEYTVQ 133


>gi|308799735|ref|XP_003074648.1| thylakoid lumen protein, chloroplast prec (ISS) [Ostreococcus
           tauri]
 gi|116000819|emb|CAL50499.1| thylakoid lumen protein, chloroplast prec (ISS) [Ostreococcus
           tauri]
          Length = 195

 Score = 46.2 bits (108), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 41/91 (45%), Gaps = 9/91 (9%)

Query: 101 ANEDLRVYTDELNKFEISIPQDWQLGAG------EPNGFKS---ITAFYPQEASSSSVSV 151
           A+E LR Y D+  +F +  P DW +  G       P G  +    T   P      +V+V
Sbjct: 13  ADESLRTYADDELRFTLEYPSDWVVATGSLERSENPMGGGARDVWTISPPGRTRDVNVTV 72

Query: 152 VITGLGPDFTRMESFGKVEAFADTLVSGLDR 182
           V T  G DFT+M S G    F   LV+ L R
Sbjct: 73  VATPAGADFTKMGSLGDAYGFGMGLVAPLHR 103


>gi|335046123|ref|ZP_08539146.1| putative membrane protein [Oribacterium sp. oral taxon 108 str.
           F0425]
 gi|333759909|gb|EGL37466.1| putative membrane protein [Oribacterium sp. oral taxon 108 str.
           F0425]
          Length = 343

 Score = 45.4 bits (106), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 8/112 (7%)

Query: 51  KQEQEDDARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALAANEDLRV--- 107
           K   E++ R    + +  QDD SR K+RE +F + F  CSF A++ +A   N  + +   
Sbjct: 4   KDSTEENRRMPENYHL--QDDASRKKKRERIFTVLFLFCSFCALMIFAFYVNRHIHIKAL 61

Query: 108 YTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPD 159
           Y D+L  F     QD+      P G  +   F P   + S + +V  G  P+
Sbjct: 62  YMDDLYLFSFFREQDF-FTFSFPIG--NAVRFRPVYWAISYIEMVFVGNDPN 110


>gi|363896506|ref|ZP_09323057.1| hypothetical protein HMPREF9624_01801 [Oribacterium sp. ACB7]
 gi|361960792|gb|EHL14025.1| hypothetical protein HMPREF9624_01801 [Oribacterium sp. ACB7]
          Length = 613

 Score = 45.1 bits (105), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 8/112 (7%)

Query: 51  KQEQEDDARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALAANEDLRV--- 107
           K   E++ R    + +  QDD SR K+RE +F + F  CSF A++ +A   N  + +   
Sbjct: 4   KDSTEENRRMPENYHL--QDDASRKKKRERIFTVLFLFCSFCALMIFAFYVNRHIHIKAL 61

Query: 108 YTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPD 159
           Y D+L  F     QD+      P G  +   F P   + S + +V  G  P+
Sbjct: 62  YMDDLYLFSFFREQDF-FTFSFPIG--NAVRFRPVYWALSYIEMVFVGNDPN 110


>gi|145341129|ref|XP_001415667.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575890|gb|ABO93959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 192

 Score = 42.0 bits (97), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 10/88 (11%)

Query: 105 LRVYTDELNKFEISIPQDWQLGAGEP--------NGFKSITAFYPQEASSSSVSVVI--T 154
           L  Y D+  KF++  P DW    GE          G + +    P  A++  V+V I  T
Sbjct: 13  LTTYEDDKLKFKVQYPSDWTTSTGETPASEEILGGGARDLFTISPPGANARDVNVTIVAT 72

Query: 155 GLGPDFTRMESFGKVEAFADTLVSGLDR 182
             G DFT+M S G    F   LV  L++
Sbjct: 73  PAGADFTKMGSLGDAYGFGYGLVVPLNK 100


>gi|363899965|ref|ZP_09326471.1| hypothetical protein HMPREF9625_01131 [Oribacterium sp. ACB1]
 gi|395207515|ref|ZP_10397039.1| putative membrane protein [Oribacterium sp. ACB8]
 gi|361956819|gb|EHL10131.1| hypothetical protein HMPREF9625_01131 [Oribacterium sp. ACB1]
 gi|394706592|gb|EJF14099.1| putative membrane protein [Oribacterium sp. ACB8]
          Length = 618

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 53/116 (45%), Gaps = 9/116 (7%)

Query: 49  CKKQ--EQEDDARTLNRFRIEEQDDDSRTKRREVMFQLAFTACSFPAIVSYALAANEDLR 106
            KKQ  + ED     N  ++EE+ +  R KR+  +F+L F  CSF  ++ +A   N  + 
Sbjct: 29  VKKQSLQDEDYVEKKNSLQVEEKRE-KRQKRQGNIFRLLFFLCSFCLLMLFAFYVNRHIH 87

Query: 107 V---YTDELNKFEISIPQDWQLGAGEPNGFKSITAFYPQEASSSSVSVVITGLGPD 159
           +   Y D+L  F     QD+ L    P G      F P   + S + +V  G  P+
Sbjct: 88  IKALYMDDLYLFSFFREQDF-LTFSFPIG--EAVRFRPVYWALSYIEMVFVGNDPN 140


>gi|428171810|gb|EKX40724.1| hypothetical protein GUITHDRAFT_113254 [Guillardia theta CCMP2712]
          Length = 222

 Score = 40.4 bits (93), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 6/77 (7%)

Query: 108 YTDELNKFEISIPQDWQLGAGE-PNGFKS-----ITAFYPQEASSSSVSVVITGLGPDFT 161
           Y D+   FE  +PQ W  G  E P G ++     I +F    +   ++++V   + PD++
Sbjct: 64  YADDARGFEFQVPQGWIQGEAEFPGGNRNPARPKIISFRSPTSDDFNIALVSYSIQPDYS 123

Query: 162 RMESFGKVEAFADTLVS 178
           ++ SFG +E  A  ++ 
Sbjct: 124 KLGSFGTIEDVAKNIIG 140


>gi|443716926|gb|ELU08219.1| hypothetical protein CAPTEDRAFT_197539 [Capitella teleta]
          Length = 457

 Score = 37.0 bits (84), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 25/42 (59%)

Query: 134 KSITAFYPQEASSSSVSVVITGLGPDFTRMESFGKVEAFADT 175
           K +  + P  AS S  + ++T +GPD TR  S G+  AF+DT
Sbjct: 198 KLVLGYLPSPASDSGAASILTLVGPDSTRRMSRGRYSAFSDT 239


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.130    0.388 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,186,195,522
Number of Sequences: 23463169
Number of extensions: 124859424
Number of successful extensions: 282704
Number of sequences better than 100.0: 51
Number of HSP's better than 100.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 282626
Number of HSP's gapped (non-prelim): 52
length of query: 210
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 74
effective length of database: 9,168,204,383
effective search space: 678447124342
effective search space used: 678447124342
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)