BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 021746
         (308 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225443239|ref|XP_002271931.1| PREDICTED: uncharacterized protein LOC100263834 [Vitis vinifera]
          Length = 301

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 227/284 (79%), Positives = 245/284 (86%), Gaps = 6/284 (2%)

Query: 31  AKPTPVSAFAM------ASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD 84
           +KPTP   F        +S   TQVAPA IVGGGRVG AL+ MG G DLLVKRG+ VPLD
Sbjct: 18  SKPTPPIKFRFRFRVRASSAMATQVAPAVIVGGGRVGRALQGMGSGDDLLVKRGDSVPLD 77

Query: 85  FEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSK 144
           F GPI VCTRNDDL+AVLE+ P+SRW+DLVFFQNGM+EPWL+SKGL DA+QVLAYFAVSK
Sbjct: 78  FAGPILVCTRNDDLDAVLESTPKSRWSDLVFFQNGMLEPWLQSKGLSDADQVLAYFAVSK 137

Query: 145 LGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISA 204
           LGE PIDGKTDTNPEGLTAAYGKWAS VA RL  GGLSCKVLDKEAFQKQMLEKLIWISA
Sbjct: 138 LGEPPIDGKTDTNPEGLTAAYGKWASAVAGRLHAGGLSCKVLDKEAFQKQMLEKLIWISA 197

Query: 205 FMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVAN 264
           FMLVGARH GATVGVVEKE+RSEVS+LI ELA AAA EKGI FD  MEDRLCAYSRAV++
Sbjct: 198 FMLVGARHPGATVGVVEKEFRSEVSSLIGELASAAAVEKGIVFDEGMEDRLCAYSRAVSH 257

Query: 265 FPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           FPTA KEFKWRNGWFYSLS+KA AEGKPDPCPLHT WLKE+K+V
Sbjct: 258 FPTAAKEFKWRNGWFYSLSQKAIAEGKPDPCPLHTEWLKELKIV 301


>gi|356526300|ref|XP_003531756.1| PREDICTED: uncharacterized protein LOC100810662 [Glycine max]
          Length = 294

 Score =  459 bits (1180), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 215/286 (75%), Positives = 250/286 (87%), Gaps = 3/286 (1%)

Query: 23  PTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVP 82
           PT S  RF      +  A  +   T+VAPA IVGGGRVG AL++MG GQDLLV+RGE VP
Sbjct: 12  PTTSAFRFKASAMATTIAAPA---TKVAPAVIVGGGRVGRALQDMGTGQDLLVRRGESVP 68

Query: 83  LDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
           L+FEGPIFVCTRNDDLE+VL++ P SRW DLVFFQNGM+EPWLESKGL+DANQVLAYFAV
Sbjct: 69  LNFEGPIFVCTRNDDLESVLQSTPSSRWGDLVFFQNGMLEPWLESKGLEDANQVLAYFAV 128

Query: 143 SKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWI 202
           SK+GE PIDG+TD NPEGLTAAYGKWAS+VA RL+ GGLSCKVLDKE FQKQMLEKLIWI
Sbjct: 129 SKIGETPIDGRTDINPEGLTAAYGKWASIVAARLNAGGLSCKVLDKEVFQKQMLEKLIWI 188

Query: 203 SAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAV 262
            + MLVGARH G +VGVV+KE+R+E+S+LI ELA AA++EKG+TF+ AME+RLCAYSRAV
Sbjct: 189 CSVMLVGARHGGVSVGVVDKEFRTELSSLITELASAASSEKGLTFEEAMEERLCAYSRAV 248

Query: 263 ANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           A+FPTAVKEFKWRNGWFYSLSEKA+A+GKPDPCPLH+ WLKE+++V
Sbjct: 249 AHFPTAVKEFKWRNGWFYSLSEKATAQGKPDPCPLHSQWLKELRIV 294


>gi|224115808|ref|XP_002332062.1| predicted protein [Populus trichocarpa]
 gi|222831948|gb|EEE70425.1| predicted protein [Populus trichocarpa]
          Length = 269

 Score =  456 bits (1172), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 217/261 (83%), Positives = 238/261 (91%)

Query: 48  QVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPR 107
           +V PA IVG GRVG AL+EMG GQDLLVKRGE VPLDFEGPI VCTRNDDL+AVLEA P+
Sbjct: 9   KVTPAVIVGAGRVGRALQEMGSGQDLLVKRGEPVPLDFEGPILVCTRNDDLDAVLEATPK 68

Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
           +RW+DLVFFQNGM+EPW +SKGL DA+ VLAYFAVSKLGE P DGKTDTNPEGLTAAYGK
Sbjct: 69  ARWSDLVFFQNGMLEPWFQSKGLGDADHVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGK 128

Query: 168 WASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSE 227
           WAS VA RL+ GGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARH GATVGV+EKE+ SE
Sbjct: 129 WASAVAARLNAGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHPGATVGVMEKEFHSE 188

Query: 228 VSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKAS 287
           VS+LI ELA AAAAEKGI F+ A+E+RLCAY+RAVA+FPTAVKEFKWRNGWFYSLSEKA 
Sbjct: 189 VSSLITELASAAAAEKGIVFEEAIEERLCAYARAVAHFPTAVKEFKWRNGWFYSLSEKAV 248

Query: 288 AEGKPDPCPLHTAWLKEIKVV 308
           +EGKPDPCPLHT+WLKE+KVV
Sbjct: 249 SEGKPDPCPLHTSWLKELKVV 269


>gi|118484329|gb|ABK94042.1| unknown [Populus trichocarpa]
          Length = 309

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 232/309 (75%), Positives = 257/309 (83%), Gaps = 1/309 (0%)

Query: 1   MAANALTRQLSSTSTLWPPLKKPTFSKPRFAKPTPVSAFAMASF-TTTQVAPAAIVGGGR 59
           MA + LT  L ++S +    K   F      +       AMA+  T  +VAPA IVGGGR
Sbjct: 1   MATHTLTATLCNSSLIKSNSKFTFFGTRIITRTQKTQTLAMATAPTAAKVAPAVIVGGGR 60

Query: 60  VGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNG 119
           VG AL+EMG GQDLLVKRG+ VPLDFEGPI VCTRNDDL+AVLE  P+SRW+DLVFFQNG
Sbjct: 61  VGRALQEMGSGQDLLVKRGDPVPLDFEGPILVCTRNDDLDAVLEVTPKSRWSDLVFFQNG 120

Query: 120 MIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVG 179
           M+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDT+PEGLTAAYGKWAS VA RL  G
Sbjct: 121 MLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTDPEGLTAAYGKWASAVAARLHAG 180

Query: 180 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAA 239
           GLSCKVLDKE FQKQMLEKLIWISAFML GARH GATVG VEKE+RSEVS+LIAELA AA
Sbjct: 181 GLSCKVLDKETFQKQMLEKLIWISAFMLAGARHPGATVGAVEKEFRSEVSSLIAELASAA 240

Query: 240 AAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHT 299
           AAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLS+KA AEGKPDPCPLHT
Sbjct: 241 AAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSDKAVAEGKPDPCPLHT 300

Query: 300 AWLKEIKVV 308
           AW+KE+KVV
Sbjct: 301 AWIKELKVV 309


>gi|356523119|ref|XP_003530189.1| PREDICTED: uncharacterized protein LOC100795565 [Glycine max]
          Length = 298

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 211/264 (79%), Positives = 242/264 (91%), Gaps = 1/264 (0%)

Query: 46  TTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAA 105
           +T+VAPA IVGGGRVG AL++MG GQDLLV+RGE VP +FEGPIFVCTRNDDLE+VL++ 
Sbjct: 35  STKVAPAVIVGGGRVGRALQDMGTGQDLLVRRGESVPPNFEGPIFVCTRNDDLESVLQST 94

Query: 106 PRSRWN-DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
           P SRW  DLVFFQNGM+EPWLESKGL+DANQVLAYFAVSK+GE PIDG+TDTNPEGLTAA
Sbjct: 95  PSSRWRADLVFFQNGMMEPWLESKGLEDANQVLAYFAVSKIGESPIDGRTDTNPEGLTAA 154

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
           YGKWASVVA RL+ GGLSCKVLDKEAFQKQMLEKLIWI + MLVGARH G +VGVVEKE+
Sbjct: 155 YGKWASVVAARLNAGGLSCKVLDKEAFQKQMLEKLIWICSVMLVGARHGGVSVGVVEKEF 214

Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
           R+E+S+LI ELA AAA+EKG+ F+ AME+RLCAYSRAVA+FPTAVKEFKWRNGWF SLSE
Sbjct: 215 RTELSSLITELASAAASEKGLKFEEAMEERLCAYSRAVAHFPTAVKEFKWRNGWFCSLSE 274

Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
           KA+A+GKPDPCPLH+ WLKE+++V
Sbjct: 275 KANAQGKPDPCPLHSQWLKELRIV 298


>gi|224108611|ref|XP_002333371.1| predicted protein [Populus trichocarpa]
 gi|222836353|gb|EEE74760.1| predicted protein [Populus trichocarpa]
          Length = 269

 Score =  449 bits (1156), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 224/264 (84%), Positives = 241/264 (91%)

Query: 45  TTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
           T  +VAPA IVGGGRVG AL+EMG GQDLLVKRG+ VPLDFEGPI VCTRNDDL+AVLE 
Sbjct: 6   TAAKVAPAVIVGGGRVGRALQEMGSGQDLLVKRGDPVPLDFEGPILVCTRNDDLDAVLEV 65

Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
            P+SRW+DLVFFQNGM+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDTNPEGLTAA
Sbjct: 66  TPKSRWSDLVFFQNGMLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTNPEGLTAA 125

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
           YGKWAS VA RL  GGLSCKVLDKE FQKQMLEKLIWISAFMLVGARH GATVG VEKE+
Sbjct: 126 YGKWASAVAARLHAGGLSCKVLDKETFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEF 185

Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
           RSEVS+LIAELA AAAAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLS+
Sbjct: 186 RSEVSSLIAELASAAAAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSD 245

Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
           KA AEGKPDPCPLHTAWLKE+KVV
Sbjct: 246 KAVAEGKPDPCPLHTAWLKELKVV 269


>gi|224072767|ref|XP_002303871.1| predicted protein [Populus trichocarpa]
 gi|222841303|gb|EEE78850.1| predicted protein [Populus trichocarpa]
          Length = 269

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 222/264 (84%), Positives = 240/264 (90%)

Query: 45  TTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
           T  +V PA IVGGGRVG AL+EMG GQDLLVKRG+ VPLDFEGP+ VCTRNDDL+AVLE 
Sbjct: 6   TAAKVVPAVIVGGGRVGRALQEMGSGQDLLVKRGDPVPLDFEGPVLVCTRNDDLDAVLEV 65

Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
            P+SRW+DLVFFQNGM+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDTNPEGLTAA
Sbjct: 66  TPKSRWSDLVFFQNGMLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTNPEGLTAA 125

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
           YGKWAS VA RL  GGLSCKVLDKE FQKQMLEKLIWISAFMLVGARH GATVG VEKE+
Sbjct: 126 YGKWASAVAARLHAGGLSCKVLDKETFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEF 185

Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
           RSEVS+LIAELA AAAAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLS+
Sbjct: 186 RSEVSSLIAELASAAAAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSD 245

Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
           KA AEGKPDPCPLHTAWLKE+KVV
Sbjct: 246 KAVAEGKPDPCPLHTAWLKELKVV 269


>gi|449436211|ref|XP_004135887.1| PREDICTED: uncharacterized protein LOC101216543 [Cucumis sativus]
 gi|449509154|ref|XP_004163511.1| PREDICTED: uncharacterized protein LOC101231909 [Cucumis sativus]
          Length = 300

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 227/293 (77%), Positives = 246/293 (83%), Gaps = 8/293 (2%)

Query: 24  TFSKPRFAK--PTPVSAFA------MASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLV 75
           T S P        PVS F       MA+ T T+V PA IVG GRVG AL +MG G+D+LV
Sbjct: 8   TISNPSLCSQLSKPVSRFGARRLRIMAAATETKVVPAVIVGSGRVGRALLDMGNGEDVLV 67

Query: 76  KRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQ 135
           KRGE VPLDF GPI VCTRNDDLEAVLEA PRSRWNDLVFFQNGM++PW ESKGLKDANQ
Sbjct: 68  KRGESVPLDFSGPILVCTRNDDLEAVLEATPRSRWNDLVFFQNGMLDPWYESKGLKDANQ 127

Query: 136 VLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQM 195
           VLAYFA+SKLGE P+DG TDTNPEGLTAAYGKWAS VA RLS  GLSCKVL KEAF+KQM
Sbjct: 128 VLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLSAAGLSCKVLGKEAFEKQM 187

Query: 196 LEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRL 255
           LEKLIWISAFMLVGARH GATVG VEK+YRSEVS+LIAELA AAAAE+ + F+  +E+RL
Sbjct: 188 LEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFEEGIEERL 247

Query: 256 CAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           CAYSRAVA+FPTAVKEFKWRNGWFYSLSEKA A GKPDPCPLHTAWLKE+KVV
Sbjct: 248 CAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLKELKVV 300


>gi|297844530|ref|XP_002890146.1| hypothetical protein ARALYDRAFT_471810 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335988|gb|EFH66405.1| hypothetical protein ARALYDRAFT_471810 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 313

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 214/258 (82%), Positives = 231/258 (89%)

Query: 51  PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRW 110
           PA IVGGGRVG AL+EMG G+DLLVKRGE VP+DFEGPI VCTRNDDL+AVLEA P+SRW
Sbjct: 56  PAVIVGGGRVGRALQEMGNGEDLLVKRGEAVPVDFEGPILVCTRNDDLDAVLEATPQSRW 115

Query: 111 NDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWAS 170
            DLVFFQNGM+EPW ESKGL D +QVLAYFAVSKLGE P+DGKTDTNPEGLTAAYGKWAS
Sbjct: 116 KDLVFFQNGMMEPWFESKGLGDTDQVLAYFAVSKLGEPPVDGKTDTNPEGLTAAYGKWAS 175

Query: 171 VVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSA 230
            +A RL  GGLSCKVLDKEAFQKQMLEKLIWI AFMLVGARH GA+VG VEKEYR EVS 
Sbjct: 176 EIAARLQSGGLSCKVLDKEAFQKQMLEKLIWICAFMLVGARHPGASVGTVEKEYRDEVSI 235

Query: 231 LIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEG 290
           LI ELA AAAAEKG+TF+  M +RLCAYSRAV++FPTAVKEFKWRNGWFYSLSEKA AEG
Sbjct: 236 LIQELAAAAAAEKGLTFEENMVERLCAYSRAVSHFPTAVKEFKWRNGWFYSLSEKAIAEG 295

Query: 291 KPDPCPLHTAWLKEIKVV 308
           KPDPCPLHT WLKE+KV+
Sbjct: 296 KPDPCPLHTEWLKELKVI 313


>gi|18394322|ref|NP_563991.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6587814|gb|AAF18505.1|AC010924_18 ESTs gb|AI992412, gb|AI994629 come from this gene [Arabidopsis
           thaliana]
 gi|12083328|gb|AAG48823.1|AF332460_1 unknown protein [Arabidopsis thaliana]
 gi|18176355|gb|AAL60029.1| unknown protein [Arabidopsis thaliana]
 gi|332191287|gb|AEE29408.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 313

 Score =  442 bits (1138), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 223/287 (77%), Positives = 248/287 (86%), Gaps = 2/287 (0%)

Query: 22  KPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELV 81
           KP+ S+ RFA P   +  A ++ T  ++APA IVGGGRVG AL+EMG G+DLLVKRGE V
Sbjct: 29  KPSVSRARFAVPM--AMAAASAATAKKLAPAVIVGGGRVGRALQEMGNGEDLLVKRGEAV 86

Query: 82  PLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFA 141
           P+DFEGPI VCTRNDDL+AVLEA P+SRW DLVFFQNGM+EPW ESKGL D +QVLAYFA
Sbjct: 87  PVDFEGPILVCTRNDDLDAVLEATPQSRWKDLVFFQNGMMEPWFESKGLGDTDQVLAYFA 146

Query: 142 VSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIW 201
           VSKLGE P+DGKTDTNPEGLTAAYGKWAS +A RL  GGLSCKVLDKEAFQKQMLEKLIW
Sbjct: 147 VSKLGEPPVDGKTDTNPEGLTAAYGKWASEIAARLQSGGLSCKVLDKEAFQKQMLEKLIW 206

Query: 202 ISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRA 261
           I AFMLVGARH GA+VG VEKEYR EVS LI ELA AAAAEKG+TF+  M +RLCAYSRA
Sbjct: 207 ICAFMLVGARHPGASVGTVEKEYRDEVSRLIQELAAAAAAEKGLTFEENMVERLCAYSRA 266

Query: 262 VANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           V++FPTAVKEFKWRNGWFYSLSEKA AEG+PDPCPLHT WLKE+KV+
Sbjct: 267 VSHFPTAVKEFKWRNGWFYSLSEKAIAEGQPDPCPLHTEWLKELKVI 313


>gi|224072759|ref|XP_002303867.1| predicted protein [Populus trichocarpa]
 gi|222841299|gb|EEE78846.1| predicted protein [Populus trichocarpa]
          Length = 276

 Score =  442 bits (1138), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 224/271 (82%), Positives = 241/271 (88%), Gaps = 7/271 (2%)

Query: 45  TTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
           T  +VAPA IVGGGRVG AL+EMG GQDLLVKRG+ VPLDFEGPI VCTRNDDL+AVLE 
Sbjct: 6   TAAKVAPAVIVGGGRVGRALQEMGSGQDLLVKRGDPVPLDFEGPILVCTRNDDLDAVLEV 65

Query: 105 APRSRWN-------DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTN 157
            P+SRW+       DLVFFQNGM+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDTN
Sbjct: 66  TPKSRWSVDVDNLADLVFFQNGMLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTN 125

Query: 158 PEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATV 217
           PEGLTAAYGKWAS VA RL  GGLSCKVLDKE FQKQMLEKLIWISAFMLVGARH GATV
Sbjct: 126 PEGLTAAYGKWASAVAARLHAGGLSCKVLDKETFQKQMLEKLIWISAFMLVGARHPGATV 185

Query: 218 GVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNG 277
           G VEKE+RSEVS+LIAELA AAAAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNG
Sbjct: 186 GAVEKEFRSEVSSLIAELASAAAAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNG 245

Query: 278 WFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           WFYSLS+KA AEGKPDPCPLHTAWLKE+KVV
Sbjct: 246 WFYSLSDKAVAEGKPDPCPLHTAWLKELKVV 276


>gi|298204760|emb|CBI25258.3| unnamed protein product [Vitis vinifera]
          Length = 242

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 205/242 (84%), Positives = 220/242 (90%)

Query: 67  MGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLE 126
           MG G DLLVKRG+ VPLDF GPI VCTRNDDL+AVLE+ P+SRW+DLVFFQNGM+EPWL+
Sbjct: 1   MGSGDDLLVKRGDSVPLDFAGPILVCTRNDDLDAVLESTPKSRWSDLVFFQNGMLEPWLQ 60

Query: 127 SKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVL 186
           SKGL DA+QVLAYFAVSKLGE PIDGKTDTNPEGLTAAYGKWAS VA RL  GGLSCKVL
Sbjct: 61  SKGLSDADQVLAYFAVSKLGEPPIDGKTDTNPEGLTAAYGKWASAVAGRLHAGGLSCKVL 120

Query: 187 DKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGIT 246
           DKEAFQKQMLEKLIWISAFMLVGARH GATVGVVEKE+RSEVS+LI ELA AAA EKGI 
Sbjct: 121 DKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKEFRSEVSSLIGELASAAAVEKGIV 180

Query: 247 FDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIK 306
           FD  MEDRLCAYSRAV++FPTA KEFKWRNGWFYSLS+KA AEGKPDPCPLHT WLKE+K
Sbjct: 181 FDEGMEDRLCAYSRAVSHFPTAAKEFKWRNGWFYSLSQKAIAEGKPDPCPLHTEWLKELK 240

Query: 307 VV 308
           +V
Sbjct: 241 IV 242


>gi|148908977|gb|ABR17592.1| unknown [Picea sitchensis]
          Length = 311

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 200/290 (68%), Positives = 237/290 (81%), Gaps = 5/290 (1%)

Query: 22  KPTFSKPRFAKPTP---VSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRG 78
            P F+K   +   P   VSA AMA+   + V P+ I+G GRVG AL+ MG G+D++V+RG
Sbjct: 24  NPKFNKSLCSANYPRIRVSA-AMAAVQES-VIPSVIIGAGRVGQALQNMGTGKDIIVRRG 81

Query: 79  ELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLA 138
           E VP +F GPIFVCTRNDDL+AVLE  PR RW DLVFFQNGM+EPWL+SKGL +A+QVLA
Sbjct: 82  EKVPENFPGPIFVCTRNDDLDAVLEITPRPRWEDLVFFQNGMLEPWLQSKGLVNASQVLA 141

Query: 139 YFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEK 198
           YFAV+KLGE PIDGKT+TNPEGLTA  GKWAS +A RL + GLSCKVLDK  F+K MLEK
Sbjct: 142 YFAVAKLGEAPIDGKTETNPEGLTACSGKWASAIAARLHLAGLSCKVLDKGNFEKSMLEK 201

Query: 199 LIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAY 258
           LIWISAFMLVGARH G+TVG VEKEYR+EV +LI ELA AAAAEK I FD  +E+RLCAY
Sbjct: 202 LIWISAFMLVGARHQGSTVGTVEKEYRNEVVSLIKELASAAAAEKNIAFDQGLEERLCAY 261

Query: 259 SRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           SR VA+FPTA+KEFKWRNGWFY LS+KA+A+G+PDPCP+HT+WLKE+  +
Sbjct: 262 SRTVAHFPTALKEFKWRNGWFYLLSQKATADGRPDPCPIHTSWLKELGAI 311


>gi|294463883|gb|ADE77464.1| unknown [Picea sitchensis]
          Length = 267

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 190/260 (73%), Positives = 223/260 (85%)

Query: 49  VAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRS 108
           V P+ I+G GRVG AL+ MG G+D++V+RGE VP +F GPIFVCTRNDDL+AVLE  PR 
Sbjct: 8   VIPSVIIGAGRVGQALQNMGTGKDIIVRRGEKVPENFPGPIFVCTRNDDLDAVLEITPRP 67

Query: 109 RWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKW 168
           RW DLVFFQNGM+EPWL+SKGL +A+QVLAYFAV+KLGE PIDGKT+TNPEGLTA  GKW
Sbjct: 68  RWEDLVFFQNGMLEPWLQSKGLVNASQVLAYFAVAKLGEAPIDGKTETNPEGLTACSGKW 127

Query: 169 ASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEV 228
           AS +A RL + GLSCKVLDK  F+K MLEKLIWISAFMLVGARH G+TVG VEKEYR+EV
Sbjct: 128 ASAIAARLHLAGLSCKVLDKGNFEKSMLEKLIWISAFMLVGARHQGSTVGTVEKEYRNEV 187

Query: 229 SALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASA 288
            +LI ELA AAAAEK I FD  +E+RLCAYSR VA+FPTA+KEFKWRNGWFY LS+KA+A
Sbjct: 188 VSLIKELASAAAAEKNIAFDQGLEERLCAYSRTVAHFPTALKEFKWRNGWFYLLSQKATA 247

Query: 289 EGKPDPCPLHTAWLKEIKVV 308
           +G+PDPCP+HT+WLKE+  +
Sbjct: 248 DGRPDPCPIHTSWLKELGAI 267


>gi|255543391|ref|XP_002512758.1| conserved hypothetical protein [Ricinus communis]
 gi|223547769|gb|EEF49261.1| conserved hypothetical protein [Ricinus communis]
          Length = 294

 Score =  410 bits (1053), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 218/303 (71%), Positives = 239/303 (78%), Gaps = 12/303 (3%)

Query: 6   LTRQLSSTSTLWPPLKKPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALK 65
           +T  LSS   + P  K    S     KP  +S  AMA+     V+PA IVGGGRVG AL+
Sbjct: 4   ITLSLSSCFIIKPTSKFTFLSTKIITKPQKISILAMATTPAATVSPAVIVGGGRVGRALQ 63

Query: 66  EMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWL 125
           EMG GQDLLVKRGE VPLDFEGPI VCT    L   L+              NGM+EPWL
Sbjct: 64  EMGNGQDLLVKRGEPVPLDFEGPILVCTXXLKLLLSLDGT------------NGMLEPWL 111

Query: 126 ESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKV 185
           ESKGL+DA+QVLAYFAV+KLGE PIDGKTDTNPEGLTAAYGKWA+ VA RL  GGLSCKV
Sbjct: 112 ESKGLEDADQVLAYFAVAKLGEPPIDGKTDTNPEGLTAAYGKWAAAVASRLHAGGLSCKV 171

Query: 186 LDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGI 245
           LDK+AFQKQMLEKLIWI+AFMLVGARH GATVG VEKEYRSEVS+LIAELA AAAAEKGI
Sbjct: 172 LDKDAFQKQMLEKLIWIAAFMLVGARHPGATVGAVEKEYRSEVSSLIAELASAAAAEKGI 231

Query: 246 TFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEI 305
            F+  +E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLSEKA AEGKPDPCPLHTAWLKE+
Sbjct: 232 VFENGVEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAEGKPDPCPLHTAWLKEL 291

Query: 306 KVV 308
           KV+
Sbjct: 292 KVI 294


>gi|413922060|gb|AFW61992.1| hypothetical protein ZEAMMB73_116806 [Zea mays]
          Length = 307

 Score =  392 bits (1008), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 216/298 (72%), Positives = 241/298 (80%), Gaps = 13/298 (4%)

Query: 18  PPLKKPTFSKPRFAKPTPVSAFAM------ASFTTTQVAPAAIVGGGRVGTALKEMGK-G 70
           PPL++   S      P+P SA A       A+  TT+V PA +VGGGRVG AL  MG  G
Sbjct: 16  PPLRRLLLS------PSPSSAAARRIARVSAAMATTEVHPAVVVGGGRVGQALVSMGPPG 69

Query: 71  QDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGL 130
            D+LV RGE VP D  GPI VCTRNDDL+AVLEA P+SRW DLVFFQNGM++PW + KGL
Sbjct: 70  GDVLVGRGEKVPDDAPGPILVCTRNDDLDAVLEATPKSRWRDLVFFQNGMLDPWFKIKGL 129

Query: 131 KDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEA 190
             ANQVLAYFAVSK+GE P+DG TDTNPEGLTAA+G WA  VA RL  GGL+CKVL+KEA
Sbjct: 130 VGANQVLAYFAVSKVGEPPVDGITDTNPEGLTAAFGSWAPAVAARLQNGGLTCKVLEKEA 189

Query: 191 FQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPA 250
           FQKQMLEKLIWISAFMLVGARH GATVGVVEK YRSEV++LIAELA AAAAE+ +TF   
Sbjct: 190 FQKQMLEKLIWISAFMLVGARHPGATVGVVEKNYRSEVASLIAELASAAAAEEQLTFAEG 249

Query: 251 MEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           MEDRLCAYSRAVA+FPTAVKEFKWRNGWFYSL+EKA A GKPDPCPLHTAWLKEI+V+
Sbjct: 250 MEDRLCAYSRAVAHFPTAVKEFKWRNGWFYSLTEKALAAGKPDPCPLHTAWLKEIRVI 307


>gi|242078737|ref|XP_002444137.1| hypothetical protein SORBIDRAFT_07g009470 [Sorghum bicolor]
 gi|241940487|gb|EES13632.1| hypothetical protein SORBIDRAFT_07g009470 [Sorghum bicolor]
          Length = 309

 Score =  392 bits (1008), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/269 (78%), Positives = 231/269 (85%), Gaps = 2/269 (0%)

Query: 42  ASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLEA 100
           A+  TT V PA +VGGGRVG AL  MG  G D+LV RGE VP D  GPI VCTRNDDL+ 
Sbjct: 41  AAMATTAVQPAVVVGGGRVGQALLSMGPPGGDVLVGRGEKVPDDAPGPILVCTRNDDLDG 100

Query: 101 VLEAAPRSRWN-DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
           VLEA P+SRW  DLVFFQNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPE
Sbjct: 101 VLEATPKSRWRADLVFFQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPE 160

Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
           GLTAA+G WA  VA RL  GGL+CKVL KEAFQKQMLEKLIWISAFMLVGARH GATVGV
Sbjct: 161 GLTAAFGSWAPAVAARLQNGGLTCKVLQKEAFQKQMLEKLIWISAFMLVGARHPGATVGV 220

Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
           VEK+YRSEV++LIAELA AAAAE+G+TFD  +E+RLCAYSRAVA+FPTAVKEFKWRNGWF
Sbjct: 221 VEKDYRSEVASLIAELASAAAAERGLTFDEGIEERLCAYSRAVAHFPTAVKEFKWRNGWF 280

Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           YSL+EKA A G+PDPCPLHTAWLKEIKV+
Sbjct: 281 YSLTEKALAAGRPDPCPLHTAWLKEIKVI 309


>gi|218200803|gb|EEC83230.1| hypothetical protein OsI_28519 [Oryza sativa Indica Group]
          Length = 271

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 207/271 (76%), Positives = 231/271 (85%), Gaps = 3/271 (1%)

Query: 41  MASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
           MA+   T V PA +VGGGRVG AL  MG  G D+LV RGE VP D  GPI VCTRNDDL+
Sbjct: 1   MAAAAKTAVQPAVVVGGGRVGQALLSMGPPGADVLVGRGEKVPEDAPGPILVCTRNDDLD 60

Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
           AVL+A PRSRW DLVFFQNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPE
Sbjct: 61  AVLDATPRSRWRDLVFFQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPE 120

Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
           GLTAA+G WA  VA RL  GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG 
Sbjct: 121 GLTAAFGNWAPAVATRLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGA 180

Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAV--KEFKWRNG 277
           VEKEYRSEV++LIAELA AAA E+G+TFD  +E+RLCAYSRAVA+FPTA+  ++FKWRNG
Sbjct: 181 VEKEYRSEVASLIAELASAAAEERGLTFDAGIEERLCAYSRAVAHFPTALSMEQFKWRNG 240

Query: 278 WFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           WFYSL+EKA A+GKPDPCP HTAWLKEIKV+
Sbjct: 241 WFYSLTEKALAQGKPDPCPFHTAWLKEIKVI 271


>gi|388506366|gb|AFK41249.1| unknown [Medicago truncatula]
          Length = 270

 Score =  389 bits (999), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 197/254 (77%), Positives = 219/254 (86%), Gaps = 5/254 (1%)

Query: 31  AKPTP-----VSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDF 85
           A P P     VS+   ++ TTT+VAPA IVGGGRVG AL+ MG GQDLLVKRG+ +PLDF
Sbjct: 17  ATPLPKFRFRVSSVMASTVTTTKVAPAVIVGGGRVGKALEGMGDGQDLLVKRGDSIPLDF 76

Query: 86  EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKL 145
           EGPI VCTRNDDL++VL+  P SRW DLVFFQNGM+EPWLESKGLKD NQVLAYFAVSKL
Sbjct: 77  EGPILVCTRNDDLDSVLQTTPLSRWKDLVFFQNGMLEPWLESKGLKDGNQVLAYFAVSKL 136

Query: 146 GERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAF 205
           GE PIDGKTDTNPEGLTAAYGKW+S V+ERL  GGLSCKVLDKEAFQKQMLEKLIWI + 
Sbjct: 137 GESPIDGKTDTNPEGLTAAYGKWSSAVSERLQAGGLSCKVLDKEAFQKQMLEKLIWICSV 196

Query: 206 MLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANF 265
           MLVGARH G +VGVVEKE+  E+ +LIAELA AAA+EKG+TF+ AMEDRLCAYSRAVA+F
Sbjct: 197 MLVGARHGGVSVGVVEKEFHLELCSLIAELASAAASEKGLTFEQAMEDRLCAYSRAVAHF 256

Query: 266 PTAVKEFKWRNGWF 279
           PTAVKEFKWRNGWF
Sbjct: 257 PTAVKEFKWRNGWF 270


>gi|168042101|ref|XP_001773528.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675230|gb|EDQ61728.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 269

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/269 (69%), Positives = 212/269 (78%)

Query: 40  AMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
           AMA+ T  +VA   IVG GRVG A ++MG G+D++V+RGE VP D  GPIFVCTRND L 
Sbjct: 1   AMATATAGEVAAGVIVGAGRVGQAFEKMGGGRDIVVRRGETVPEDSAGPIFVCTRNDVLG 60

Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
            +++A P SR  DLVF QNGM+EPWLESKGL DA QVL YFAV+KLG+ P+DGKTDTNPE
Sbjct: 61  TIVDATPASRRGDLVFIQNGMLEPWLESKGLGDATQVLVYFAVAKLGDPPLDGKTDTNPE 120

Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
           GLTAA GKWA   A RL   GLSCKVLD E F+K MLEKLIWI AFMLVGARH GATVGV
Sbjct: 121 GLTAATGKWAKATAARLHSAGLSCKVLDAEEFKKPMLEKLIWICAFMLVGARHPGATVGV 180

Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
           VEKE+R EV  LI ELA AA AEK + FD  + DRLCAY+R+VA+FPTAVKEF+WRNGWF
Sbjct: 181 VEKEHRKEVEDLINELAGAAEAEKQLKFDAGIVDRLCAYARSVAHFPTAVKEFEWRNGWF 240

Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           Y +S +A   GK DPCPLHTAWLKEI VV
Sbjct: 241 YEISRRALESGKSDPCPLHTAWLKEIGVV 269


>gi|147765908|emb|CAN73372.1| hypothetical protein VITISV_037533 [Vitis vinifera]
          Length = 306

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 189/239 (79%), Positives = 202/239 (84%), Gaps = 12/239 (5%)

Query: 47  TQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAP 106
           TQV PA IVGGGRVG AL+ MG G DLLVKRG+ VPLDF GPI VCTRNDDL+AVLE+ P
Sbjct: 3   TQVVPAVIVGGGRVGRALQGMGSGDDLLVKRGDSVPLDFAGPILVCTRNDDLDAVLESTP 62

Query: 107 RSRWN------------DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKT 154
           +SRW+            DLVFFQNGM+EPWL+SKGL DA+QVLAYFAVSKLGE PIDGKT
Sbjct: 63  KSRWSVHDFVYDLFNLADLVFFQNGMLEPWLQSKGLSDADQVLAYFAVSKLGEPPIDGKT 122

Query: 155 DTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTG 214
           DTNPEGLTAAYGKWAS VA RL  GGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARH G
Sbjct: 123 DTNPEGLTAAYGKWASAVAGRLHAGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHPG 182

Query: 215 ATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFK 273
           ATVGVVEKE+RSEVS+LI ELA AAA EKGI FD  MEDRLCAYSRAV++FPTA KE K
Sbjct: 183 ATVGVVEKEFRSEVSSLIGELASAAAVEKGIVFDEGMEDRLCAYSRAVSHFPTAAKEVK 241


>gi|357145388|ref|XP_003573626.1| PREDICTED: uncharacterized protein LOC100834027 [Brachypodium
           distachyon]
          Length = 310

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 207/264 (78%), Positives = 226/264 (85%), Gaps = 2/264 (0%)

Query: 47  TQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDF-EGPIFVCTRNDDLEAVLEA 104
           T V PA +VGGGRVG AL  MG  G D+LV RGE VP     GPI VCTRNDDL+ VLEA
Sbjct: 47  TAVQPAVVVGGGRVGQALLGMGPLGGDVLVGRGEKVPDGAPAGPILVCTRNDDLDGVLEA 106

Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
            P+SRW DLVFFQNGM++PW ESKGL  ANQVLAYFAVSK+GE PIDG TDTNPEGLTAA
Sbjct: 107 TPKSRWRDLVFFQNGMLDPWFESKGLVGANQVLAYFAVSKVGEPPIDGITDTNPEGLTAA 166

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
           +G WA  VA RL  GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVGVVEKEY
Sbjct: 167 FGSWAPAVAARLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKEY 226

Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
           RSEVS+LIAELA AAAAE+G+ FD  +E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSL+E
Sbjct: 227 RSEVSSLIAELASAAAAERGLMFDEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLTE 286

Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
           KA A+GK DPCPLHTAWLKEIKV+
Sbjct: 287 KALAQGKTDPCPLHTAWLKEIKVI 310


>gi|326497007|dbj|BAK02088.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 322

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 203/270 (75%), Positives = 226/270 (83%), Gaps = 3/270 (1%)

Query: 42  ASFTTTQVAPAAIVGGGRVGTALKEMG--KGQDLLVKRGELVPLDF-EGPIFVCTRNDDL 98
           A+   T V PA +VGGGRVG AL  MG   G DLL++RGE +P     GPI VCTRNDDL
Sbjct: 53  AAVMATAVQPAVVVGGGRVGQALLSMGPSSGGDLLLRRGEALPPSAPAGPILVCTRNDDL 112

Query: 99  EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
           + VLEA P+SRW DLVFFQNGM++PWLESKGL  ANQVLAYFAVSKLGE P+DG TD NP
Sbjct: 113 DGVLEATPKSRWQDLVFFQNGMLDPWLESKGLAGANQVLAYFAVSKLGEPPVDGITDANP 172

Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
           EGLTAA+G WA  VA RL  GGL+CKVL K AF+KQMLEKLIWISAFMLVGARH GATVG
Sbjct: 173 EGLTAAFGNWAPAVAARLHNGGLTCKVLHKGAFEKQMLEKLIWISAFMLVGARHPGATVG 232

Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGW 278
            VE+EYRSEVS+LIAELA AAAAE+G++FD  ME+RLCAYS AVA+FPTAVKEFKWRNGW
Sbjct: 233 AVEEEYRSEVSSLIAELASAAAAERGLSFDEGMEERLCAYSTAVAHFPTAVKEFKWRNGW 292

Query: 279 FYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           FYSL+EKA A+ KPDPCPLHTAWLKEIKV+
Sbjct: 293 FYSLTEKALAQDKPDPCPLHTAWLKEIKVI 322


>gi|326511878|dbj|BAJ92083.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 203/270 (75%), Positives = 226/270 (83%), Gaps = 3/270 (1%)

Query: 42  ASFTTTQVAPAAIVGGGRVGTALKEMG--KGQDLLVKRGELVPLDF-EGPIFVCTRNDDL 98
           A+   T V PA +VGGGRVG AL  MG   G DLL++RGE +P     GPI VCTRNDDL
Sbjct: 37  AAVMATAVQPAVVVGGGRVGQALLSMGPSSGGDLLLRRGEALPPSAPAGPILVCTRNDDL 96

Query: 99  EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
           + VLEA P+SRW DLVFFQNGM++PWLESKGL  ANQVLAYFAVSKLGE P+DG TD NP
Sbjct: 97  DGVLEATPKSRWQDLVFFQNGMLDPWLESKGLAGANQVLAYFAVSKLGEPPVDGITDANP 156

Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
           EGLTAA+G WA  VA RL  GGL+CKVL K AF+KQMLEKLIWISAFMLVGARH GATVG
Sbjct: 157 EGLTAAFGNWAPAVAARLHNGGLTCKVLHKGAFEKQMLEKLIWISAFMLVGARHPGATVG 216

Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGW 278
            VE+EYRSEVS+LIAELA AAAAE+G++FD  ME+RLCAYS AVA+FPTAVKEFKWRNGW
Sbjct: 217 AVEEEYRSEVSSLIAELASAAAAERGLSFDEGMEERLCAYSTAVAHFPTAVKEFKWRNGW 276

Query: 279 FYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           FYSL+EKA A+ KPDPCPLHTAWLKEIKV+
Sbjct: 277 FYSLTEKALAQDKPDPCPLHTAWLKEIKVI 306


>gi|326489475|dbj|BAK01718.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 202/270 (74%), Positives = 226/270 (83%), Gaps = 3/270 (1%)

Query: 42  ASFTTTQVAPAAIVGGGRVGTALKEMG--KGQDLLVKRGELVPLDF-EGPIFVCTRNDDL 98
           A+   T V PA +VGGGRVG AL  MG   G DLL++RGE +P     GPI VCTRNDDL
Sbjct: 37  AAVMATAVQPAVVVGGGRVGQALLSMGPSSGGDLLLRRGEALPPSAPAGPILVCTRNDDL 96

Query: 99  EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
           + VLEA P+SRW DLVFFQNGM++PWLESKGL  ANQVLAYFAVSKLGE P+DG TD NP
Sbjct: 97  DGVLEATPKSRWQDLVFFQNGMLDPWLESKGLAGANQVLAYFAVSKLGEPPVDGITDANP 156

Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
           EGLTAA+G WA  VA RL  GGL+CKVL K AF+KQMLEKLIWISAFMLVGARH GATVG
Sbjct: 157 EGLTAAFGNWAPAVAARLHNGGLTCKVLHKGAFEKQMLEKLIWISAFMLVGARHPGATVG 216

Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGW 278
            VE+EYRSEVS+LIAELA AAAAE+G++FD  ME+RLCAYS AVA+FPTAVKEFKWR+GW
Sbjct: 217 AVEEEYRSEVSSLIAELASAAAAERGLSFDEGMEERLCAYSTAVAHFPTAVKEFKWRSGW 276

Query: 279 FYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           FYSL+EKA A+ KPDPCPLHTAWLKEIKV+
Sbjct: 277 FYSLTEKALAQDKPDPCPLHTAWLKEIKVI 306


>gi|226504428|ref|NP_001145052.1| uncharacterized protein LOC100278239 [Zea mays]
 gi|195650351|gb|ACG44643.1| hypothetical protein [Zea mays]
          Length = 261

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 202/259 (77%), Positives = 222/259 (85%), Gaps = 1/259 (0%)

Query: 46  TTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
           TT+V PA +VGGGRVG AL  MG  G D+LV RGE VP D  GPI VCTRNDDL+AVLEA
Sbjct: 3   TTEVHPAVVVGGGRVGQALVSMGPPGGDVLVGRGEKVPDDAPGPILVCTRNDDLDAVLEA 62

Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
            P+SRW DLVFFQNGM++PW + KGL  ANQVLAYFAVSK+GE P+DG TDTNPEGLTAA
Sbjct: 63  TPKSRWRDLVFFQNGMLDPWFKIKGLVGANQVLAYFAVSKVGEPPVDGITDTNPEGLTAA 122

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
           +G WA  VA RL  GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVGVVEK Y
Sbjct: 123 FGSWAPAVAARLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKNY 182

Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
           RSEV++LIAELA AAAAE+ +TF   MEDRLCAYSRAVA+FPTAVKEFKWRNGWFYSL+E
Sbjct: 183 RSEVASLIAELASAAAAEEQLTFAEGMEDRLCAYSRAVAHFPTAVKEFKWRNGWFYSLTE 242

Query: 285 KASAEGKPDPCPLHTAWLK 303
           KA A GKPDPCPLHTAWL+
Sbjct: 243 KALAAGKPDPCPLHTAWLE 261


>gi|227204173|dbj|BAH56938.1| AT1G16080 [Arabidopsis thaliana]
          Length = 309

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 192/250 (76%), Positives = 214/250 (85%), Gaps = 2/250 (0%)

Query: 22  KPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELV 81
           KP+ S+ RFA P  ++A + A  T  ++APA IVGGGRVG AL+EMG G+DLLVKRGE V
Sbjct: 29  KPSVSRARFAVPMAMAAASAA--TAKKLAPAVIVGGGRVGRALQEMGNGEDLLVKRGEAV 86

Query: 82  PLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFA 141
           P+DFEGPI VCTRNDDL+AVLEA P+SRW DLVFFQNGM+EPW ESKGL D +QVLAYFA
Sbjct: 87  PVDFEGPILVCTRNDDLDAVLEATPQSRWKDLVFFQNGMMEPWFESKGLGDTDQVLAYFA 146

Query: 142 VSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIW 201
           VSKLGE P+DGKTDTNPEGLTAAYGKWAS +A RL  GGLSCKVLDKEAFQKQMLEKLIW
Sbjct: 147 VSKLGEPPVDGKTDTNPEGLTAAYGKWASEIAARLQSGGLSCKVLDKEAFQKQMLEKLIW 206

Query: 202 ISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRA 261
           I AFMLVGARH GA+VG VEKEYR EVS LI ELA AAAAEKG+TF+  M +RLCAYSRA
Sbjct: 207 ICAFMLVGARHPGASVGTVEKEYRDEVSRLIQELAAAAAAEKGLTFEENMVERLCAYSRA 266

Query: 262 VANFPTAVKE 271
           V++FPTAVKE
Sbjct: 267 VSHFPTAVKE 276


>gi|302769095|ref|XP_002967967.1| hypothetical protein SELMODRAFT_169446 [Selaginella moellendorffii]
 gi|300164705|gb|EFJ31314.1| hypothetical protein SELMODRAFT_169446 [Selaginella moellendorffii]
          Length = 269

 Score =  363 bits (932), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 173/259 (66%), Positives = 209/259 (80%), Gaps = 1/259 (0%)

Query: 51  PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE-GPIFVCTRNDDLEAVLEAAPRSR 109
           PA IVG GR+G+AL+++G G+D+++ RG+  P     GPIFVCTRN+DL+AV++A P  R
Sbjct: 5   PAVIVGRGRIGSALEKLGHGRDVVLLRGQPFPSSLSSGPIFVCTRNNDLDAVVDATPAHR 64

Query: 110 WNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWA 169
             DLVFFQNGM+EPWLE +GL +A QVL YFAV+KLG+ P+DGKTD NPEGLTAA GKWA
Sbjct: 65  REDLVFFQNGMLEPWLEKRGLAEATQVLVYFAVAKLGDEPVDGKTDANPEGLTAACGKWA 124

Query: 170 SVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVS 229
           + VA RL  GGLSCKVLD   F K MLEKLIWISAFMLVGARH GATVG VEK YR EVS
Sbjct: 125 AAVAARLHAGGLSCKVLDAREFAKPMLEKLIWISAFMLVGARHPGATVGDVEKRYRDEVS 184

Query: 230 ALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAE 289
            LI ELA AA++EK + FDP +E+RLC+Y+R+VA FPTAVKEF+WRNGWFY +SE+A A 
Sbjct: 185 ELITELAAAASSEKSVAFDPGLEERLCSYARSVAYFPTAVKEFEWRNGWFYGISERAIAS 244

Query: 290 GKPDPCPLHTAWLKEIKVV 308
           G  DPCPLH+ WL+E++ V
Sbjct: 245 GTTDPCPLHSRWLQEVEAV 263


>gi|302761062|ref|XP_002963953.1| hypothetical protein SELMODRAFT_166437 [Selaginella moellendorffii]
 gi|300167682|gb|EFJ34286.1| hypothetical protein SELMODRAFT_166437 [Selaginella moellendorffii]
          Length = 269

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 173/259 (66%), Positives = 208/259 (80%), Gaps = 1/259 (0%)

Query: 51  PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE-GPIFVCTRNDDLEAVLEAAPRSR 109
           PA IVG GR+G+AL+++G G+D+++ RG+  P     GPIFVCTRN+DL+AV++A P  R
Sbjct: 5   PAVIVGRGRIGSALEKLGHGRDVVLLRGQPFPSSLSSGPIFVCTRNNDLDAVVDATPAHR 64

Query: 110 WNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWA 169
             DLVFFQNGM+EPWLE +GL DA QVL YFAV+KLG+ P+DGKTD NPEGLTAA GKWA
Sbjct: 65  REDLVFFQNGMLEPWLEKRGLADATQVLVYFAVAKLGDEPVDGKTDANPEGLTAACGKWA 124

Query: 170 SVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVS 229
           + VA RL  GGLSCKVLD   F K MLEKLIWISAFMLVGARH GATVG VEK YR EVS
Sbjct: 125 AAVAARLHAGGLSCKVLDAREFAKPMLEKLIWISAFMLVGARHPGATVGDVEKRYRDEVS 184

Query: 230 ALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAE 289
            LI ELA AA++EK   F P +E+RLC+Y+R+VA+FPTAVKEF+WRNGWFY +SE+A A 
Sbjct: 185 ELITELAAAASSEKSAAFGPGLEERLCSYARSVAHFPTAVKEFEWRNGWFYGISERAIAS 244

Query: 290 GKPDPCPLHTAWLKEIKVV 308
           G  DPCPLH+ WL+E++ V
Sbjct: 245 GTTDPCPLHSRWLQEVEAV 263


>gi|224053056|ref|XP_002297685.1| predicted protein [Populus trichocarpa]
 gi|222844943|gb|EEE82490.1| predicted protein [Populus trichocarpa]
          Length = 207

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 165/197 (83%), Positives = 179/197 (90%), Gaps = 3/197 (1%)

Query: 112 DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASV 171
           DLVFFQNGM+EPW +SKGL DA+ VLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS 
Sbjct: 14  DLVFFQNGMLEPWFQSKGLGDADHVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGKWASA 73

Query: 172 VAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSAL 231
           V   L  GGLSCK+LDKEAFQKQMLEKLIWISAFMLVGARH GATVGVVEKE+R EVS+L
Sbjct: 74  V---LHAGGLSCKLLDKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKEFRYEVSSL 130

Query: 232 IAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGK 291
           I ELA AAAAEKGI F+ A+E+RLCAY+RAVA+FPTAVKEFKWRNGWFYSLSEKA +EGK
Sbjct: 131 ITELASAAAAEKGIVFEEAIEERLCAYARAVAHFPTAVKEFKWRNGWFYSLSEKAVSEGK 190

Query: 292 PDPCPLHTAWLKEIKVV 308
           PDPCPLHT+WLKE+KVV
Sbjct: 191 PDPCPLHTSWLKELKVV 207


>gi|224143703|ref|XP_002336073.1| predicted protein [Populus trichocarpa]
 gi|222869914|gb|EEF07045.1| predicted protein [Populus trichocarpa]
          Length = 189

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 160/189 (84%), Positives = 175/189 (92%)

Query: 120 MIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVG 179
           M+EPW +SKGL DA+QVLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS VA RL+ G
Sbjct: 1   MLEPWFQSKGLGDADQVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGKWASAVAARLNAG 60

Query: 180 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAA 239
           GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARH GATVGV+EKE+R EVS+LI ELA AA
Sbjct: 61  GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHPGATVGVMEKEFRYEVSSLITELASAA 120

Query: 240 AAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHT 299
           AAEKGI F+ A+E+RLCAY+RAVA+FPTAVKEFKWRNGWFYSLSEKA +EGKPDPCPLHT
Sbjct: 121 AAEKGIVFEEAIEERLCAYARAVAHFPTAVKEFKWRNGWFYSLSEKAVSEGKPDPCPLHT 180

Query: 300 AWLKEIKVV 308
           +WLKE+KVV
Sbjct: 181 SWLKELKVV 189


>gi|222640230|gb|EEE68362.1| hypothetical protein OsJ_26666 [Oryza sativa Japonica Group]
          Length = 242

 Score =  323 bits (829), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 177/232 (76%), Positives = 196/232 (84%), Gaps = 1/232 (0%)

Query: 41  MASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
           MA+   T V PA +VGGGRVG AL  MG  G D+LV RGE VP D  GPI VCTRNDDL+
Sbjct: 1   MAAAAKTAVQPAVVVGGGRVGQALLSMGPPGADVLVGRGEKVPEDAPGPILVCTRNDDLD 60

Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
           AVL+A PRSRW DLVFFQNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPE
Sbjct: 61  AVLDATPRSRWRDLVFFQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPE 120

Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
           GLTAA+G WA  VA RL  GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG 
Sbjct: 121 GLTAAFGNWAPAVATRLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGA 180

Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKE 271
           VEKEY+SEV++LIAELA AAA E+G+TFD  +E+RLCAYSRAVA+FPTAVKE
Sbjct: 181 VEKEYQSEVASLIAELASAAAEERGLTFDAGIEERLCAYSRAVAHFPTAVKE 232


>gi|297608302|ref|NP_001061403.2| Os08g0266300 [Oryza sativa Japonica Group]
 gi|255678306|dbj|BAF23317.2| Os08g0266300 [Oryza sativa Japonica Group]
          Length = 339

 Score =  322 bits (825), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 197/323 (60%), Positives = 221/323 (68%), Gaps = 50/323 (15%)

Query: 19  PLKKPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKR 77
           PL++ + S    A    VSA AMA+   T V PA +VGGGRVG AL  MG  G D+LV R
Sbjct: 15  PLRRLSSSTTTRAFSFRVSAAAMAAAAKTAVQPAVVVGGGRVGQALLSMGPPGADVLVGR 74

Query: 78  GELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVF---------------------- 115
           GE VP D  GPI VCTRNDDL+AVL+A PRSRW                           
Sbjct: 75  GEKVPEDAPGPILVCTRNDDLDAVLDATPRSRWRGTFVPLPALLSSPLRLPPGSRAVSCS 134

Query: 116 ----------FQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAY 165
                      QNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPEGLTAA+
Sbjct: 135 ALLLLVSCSAIQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPEGLTAAF 194

Query: 166 GKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYR 225
           G WA  VA RL  GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG VEKEY+
Sbjct: 195 GNWAPAVATRLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEYQ 254

Query: 226 SEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEK 285
           SEV++LIAELA AAA E+G+TFD  +E+RLCAYSRAVA+FPTA                 
Sbjct: 255 SEVASLIAELASAAAEERGLTFDAGIEERLCAYSRAVAHFPTA----------------- 297

Query: 286 ASAEGKPDPCPLHTAWLKEIKVV 308
           A A+GKPDPCPLHTAWLKEIKV+
Sbjct: 298 ALAQGKPDPCPLHTAWLKEIKVI 320


>gi|307105110|gb|EFN53361.1| hypothetical protein CHLNCDRAFT_137108 [Chlorella variabilis]
          Length = 270

 Score =  320 bits (819), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 163/267 (61%), Positives = 194/267 (72%), Gaps = 10/267 (3%)

Query: 51  PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRW 110
           P  IVGGGRVG AL +MG GQD+LV+RGE V     GPI VCTRNDDL+AV++A P  R 
Sbjct: 5   PFVIVGGGRVGQALADMGAGQDVLVRRGEAVS-GPPGPIVVCTRNDDLQAVVDATPAERR 63

Query: 111 N---------DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGL 161
                      LV+ QNGM++PWL+ +GL +  QVL YFAV+K G+ P DGKTD NPEGL
Sbjct: 64  QGKQAAREAGHLVWIQNGMLQPWLDERGLGETTQVLVYFAVAKKGDPPTDGKTDVNPEGL 123

Query: 162 TAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVE 221
           TAAYG  A+ VA RL   GLSCKVLDK  F K MLEKL+WI AFMLVGARH G TVG VE
Sbjct: 124 TAAYGPHAAAVAARLHSAGLSCKVLDKGEFAKCMLEKLVWICAFMLVGARHGGCTVGEVE 183

Query: 222 KEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYS 281
            ++  EVS LI +LA A +A  GIT D  + +RL AY+R+VA+FPTAVKEF+WRNG+FY 
Sbjct: 184 SQHTGEVSQLIGQLAAAGSAALGITLDSGVPERLLAYARSVAHFPTAVKEFQWRNGYFYG 243

Query: 282 LSEKASAEGKPDPCPLHTAWLKEIKVV 308
           LS++A A G+PDPCP HTAWLKE   +
Sbjct: 244 LSQQAQAAGQPDPCPTHTAWLKEAGAI 270


>gi|384245300|gb|EIE18795.1| hypothetical protein COCSUDRAFT_26256 [Coccomyxa subellipsoidea
           C-169]
          Length = 266

 Score =  317 bits (811), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 156/254 (61%), Positives = 188/254 (74%), Gaps = 2/254 (0%)

Query: 54  IVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDL 113
           IVG GRVG AL EMGK  D+LV RG  +P   +GP+ VCTRND L+ V+   P  R  DL
Sbjct: 14  IVGRGRVGAALAEMGK-DDVLVGRGGRIPEGPQGPVIVCTRNDALDDVVSMTPPDRKKDL 72

Query: 114 VFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVA 173
           VF QNGM++PWL+ +GL D  QVL YFAV+K GE P+DGKTD NPEGLTAA+G  A  VA
Sbjct: 73  VFIQNGMLQPWLDGQGLGDNTQVLVYFAVAKKGEAPLDGKTDLNPEGLTAAWGIHAEAVA 132

Query: 174 ERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIA 233
           ERL   GLSC   D+  F K MLEKLIWISA+MLVGARH+ A +G VE+++RSEV +LI 
Sbjct: 133 ERLKANGLSCSTPDRPEFDKAMLEKLIWISAYMLVGARHS-ANIGDVERQHRSEVGSLIE 191

Query: 234 ELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPD 293
           EL    AAE G+T D    +RL AYSR+VA+FPTAVKEF+WRNGWF+ +S+KA AEG+ D
Sbjct: 192 ELGAGGAAELGVTLDSNFAERLHAYSRSVAHFPTAVKEFQWRNGWFHDISKKALAEGRAD 251

Query: 294 PCPLHTAWLKEIKV 307
           P P HTAWLK++ V
Sbjct: 252 PFPTHTAWLKDLGV 265


>gi|159464353|ref|XP_001690406.1| hypothetical protein CHLREDRAFT_182934 [Chlamydomonas reinhardtii]
 gi|158279906|gb|EDP05665.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 290

 Score =  312 bits (799), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 158/278 (56%), Positives = 193/278 (69%), Gaps = 2/278 (0%)

Query: 31  AKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIF 90
           A PT + A  M    + Q A   IVGGGRVG AL +MG G D++V RG+ +      PI 
Sbjct: 15  ASPTQLRAARMTCNASKQQA-WTIVGGGRVGLALADMGPGGDVVVGRGQKIEGPAGSPIV 73

Query: 91  VCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPI 150
           VCTRNDDL+AV++     R  DLVF QNGM++PWL+++GL    QVL YFAV+K G++P 
Sbjct: 74  VCTRNDDLQAVVDLTAPERRKDLVFIQNGMLQPWLDARGLGGNTQVLVYFAVAKKGDKPT 133

Query: 151 DGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGA 210
           DGKTD NPEGLTAA+G  A  VA+RL  GGLSCKVL    FQ+ MLEKL+WISAFM VGA
Sbjct: 134 DGKTDVNPEGLTAAFGPHAQAVADRLHAGGLSCKVLGAADFQRAMLEKLVWISAFMAVGA 193

Query: 211 RHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVK 270
           +H  A VG VE  + +EVS +I EL  A AAE G+        RL AY+R+VA+FPTAVK
Sbjct: 194 KHK-ANVGQVESAHTAEVSDIIQELCAAGAAELGVGLAGGEVARLLAYARSVAHFPTAVK 252

Query: 271 EFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           EF WRNGWFY LS+KA A G+PDPCP HTA L+E+  +
Sbjct: 253 EFPWRNGWFYGLSQKALAAGQPDPCPKHTALLREVGAI 290


>gi|302828238|ref|XP_002945686.1| hypothetical protein VOLCADRAFT_54899 [Volvox carteri f.
           nagariensis]
 gi|300268501|gb|EFJ52681.1| hypothetical protein VOLCADRAFT_54899 [Volvox carteri f.
           nagariensis]
          Length = 268

 Score =  308 bits (790), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 156/269 (57%), Positives = 191/269 (71%), Gaps = 5/269 (1%)

Query: 40  AMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
            ++  TT+QV    IVGGGRVG A+ +MG G D++V RG+ V     GPI VCTRNDDL+
Sbjct: 5   CLSGLTTSQVF--TIVGGGRVGQAIADMGPGGDVVVTRGQRVE-GPPGPIVVCTRNDDLQ 61

Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
            V    PR     LVF QNGM++PWL+ +GL +  QVL YFAV+K G+RP DGKTD NPE
Sbjct: 62  -VSRRTPRVTPPHLVFIQNGMLQPWLDERGLGENTQVLVYFAVAKKGDRPTDGKTDVNPE 120

Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
           GLTAAYG+ A  VA+RL  GGLSCKVL K+ F + MLEKL+WISA+MLVGA+H   +VG 
Sbjct: 121 GLTAAYGRHAQAVADRLHSGGLSCKVLGKKDFTRAMLEKLVWISAYMLVGAKHK-VSVGQ 179

Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
           VE ++  EVS LI EL  A AAE G+       +RL AY+R+VA+FPTAVKEF WRNGWF
Sbjct: 180 VEAQHTGEVSELIRELCTAGAAELGVELAGGEVERLNAYARSVAHFPTAVKEFSWRNGWF 239

Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           Y L++KA   G+PDPCPLHT  L E+  V
Sbjct: 240 YGLTQKALTAGRPDPCPLHTQLLTEVGAV 268


>gi|255081100|ref|XP_002504116.1| predicted protein [Micromonas sp. RCC299]
 gi|226519383|gb|ACO65374.1| predicted protein [Micromonas sp. RCC299]
          Length = 265

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 152/262 (58%), Positives = 193/262 (73%), Gaps = 2/262 (0%)

Query: 49  VAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAAPR 107
           VAP+ IVGGGRVG AL +MG   D+++KRG+  P +   GPI+VCTRND L+ V++  P 
Sbjct: 4   VAPSIIVGGGRVGQALADMGVPGDVIMKRGDAFPAEPSTGPIYVCTRNDALQGVIDQTPP 63

Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
            R  DLVF QNGM++PWL+++GL D  Q L YFAV+K GE+P DG TD NPEGLTAA GK
Sbjct: 64  ERRADLVFLQNGMLQPWLDARGLGDNTQALIYFAVAKFGEKPTDGITDVNPEGLTAANGK 123

Query: 168 WASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSE 227
            A+ +  RL+ GG+ C VLD +AF+  M EKLIWI AFM VGA H GATVG VE  +R+E
Sbjct: 124 HAAALQARLAAGGMRCHVLDDDAFKASMFEKLIWICAFMAVGAAHPGATVGDVESTHRAE 183

Query: 228 VSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKAS 287
           V  LI EL   +AAEKG+ F+P   +RLCAY+RAVA+FPTAVKEF+WRNG+FY +S++A 
Sbjct: 184 VVELIEELLEGSAAEKGVKFEPGSVERLCAYARAVAHFPTAVKEFEWRNGFFYGISQEAI 243

Query: 288 AEG-KPDPCPLHTAWLKEIKVV 308
             G +PDP P HTA LK++  +
Sbjct: 244 TAGDRPDPFPKHTALLKQVGAI 265


>gi|303280695|ref|XP_003059640.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459476|gb|EEH56772.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 273

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 145/258 (56%), Positives = 183/258 (70%), Gaps = 1/258 (0%)

Query: 49  VAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAAPR 107
           V P+ I+GGGRVG AL +MG   D++VKRG+  P     GPI+VCTRND L+ V++A P 
Sbjct: 12  VMPSIIIGGGRVGQALFDMGVDGDVIVKRGDAFPSSPATGPIYVCTRNDALQGVVDATPE 71

Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
           +R  DLVF QNGM++PWL+++GL    Q L YFAV+K+ E+P DG TD NPEGLTAA G+
Sbjct: 72  ARRKDLVFLQNGMLQPWLDARGLGGNTQALVYFAVAKMNEKPTDGITDANPEGLTAANGE 131

Query: 168 WASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSE 227
           WA   A RL  GGL C VLD +AF+  M EKLIWI AFM VGA H GATVG VE  ++ E
Sbjct: 132 WADAFAARLRSGGLRCHVLDDDAFRASMFEKLIWICAFMGVGAAHPGATVGDVESAHKEE 191

Query: 228 VSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKAS 287
           V ALIAEL   + + KG+ F      RL AY+RAV++FPTAVKEF+WRNG+F  +SE+  
Sbjct: 192 VEALIAELVAGSESAKGVRFAEGTIPRLAAYARAVSHFPTAVKEFEWRNGFFCGISEREL 251

Query: 288 AEGKPDPCPLHTAWLKEI 305
           A G+ DP P HTA +K++
Sbjct: 252 AAGRLDPFPTHTALMKQV 269


>gi|356562062|ref|XP_003549294.1| PREDICTED: uncharacterized protein LOC100803380 [Glycine max]
          Length = 275

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 138/189 (73%), Positives = 162/189 (85%)

Query: 102 LEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGL 161
           +++ P SRW DLVFFQNGM+EPWLESKGL+DANQVLAYF VSK+GE PIDG+TDTNPE L
Sbjct: 41  VQSTPFSRWGDLVFFQNGMMEPWLESKGLEDANQVLAYFVVSKIGETPIDGRTDTNPERL 100

Query: 162 TAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVE 221
            AAYGKWAS+VA RL+VGGLSCKVLDKE FQKQMLEK IWI + MLVGA H G +VG V+
Sbjct: 101 IAAYGKWASIVAARLNVGGLSCKVLDKEVFQKQMLEKRIWICSVMLVGATHGGVSVGAVD 160

Query: 222 KEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYS 281
            E+ +E+S LI ELA  A++EKG+TF+ AME+RLCAYSRAVA+FPTAV+EFKWRNGWFYS
Sbjct: 161 IEFHTELSNLITELASTASSEKGLTFEEAMEERLCAYSRAVAHFPTAVREFKWRNGWFYS 220

Query: 282 LSEKASAEG 290
           L ++ S  G
Sbjct: 221 LFDEISDHG 229


>gi|299116682|emb|CBN74827.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 315

 Score =  295 bits (756), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 143/259 (55%), Positives = 179/259 (69%), Gaps = 2/259 (0%)

Query: 47  TQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAP 106
           T    AAIVGGGR+G AL +MG G D LV RG+++P D +GPI+VCTRNDDLE V+   P
Sbjct: 51  TATTMAAIVGGGRIGCALYDMGGGGDTLVGRGDIIP-DGDGPIYVCTRNDDLEDVISRTP 109

Query: 107 RSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYG 166
            +R  DLVF QNG++ P LE  GL    Q L YFAVSK GE PIDG TD NPEGLT+  G
Sbjct: 110 PARREDLVFLQNGVLGPLLEKHGLAKNTQALIYFAVSKKGEPPIDGITDANPEGLTSTCG 169

Query: 167 KWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
           KWA  + ERL+ GGL+C  L ++ +  +M EK IWI AFM+ GA H    +G V+ ++  
Sbjct: 170 KWAGELKERLAKGGLTCHELAEDRYTARMYEKHIWICAFMVAGAAHK-CNMGQVDADHAD 228

Query: 227 EVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKA 286
           EV  LI EL  A   E G+ F+  ++DRLCAYSR+VAN+P AVKEF WRNG+F+ +SEKA
Sbjct: 229 EVRTLIGELQKAIEVEYGVAFEDGVKDRLCAYSRSVANYPAAVKEFSWRNGFFWDISEKA 288

Query: 287 SAEGKPDPCPLHTAWLKEI 305
             EG  DPCPLHT  L+++
Sbjct: 289 RKEGTEDPCPLHTKLLRQV 307


>gi|412991433|emb|CCO16278.1| predicted protein [Bathycoccus prasinos]
          Length = 320

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 149/281 (53%), Positives = 188/281 (66%), Gaps = 2/281 (0%)

Query: 29  RFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEG 87
           R ++   V+A  MAS ++ +V PA I+GGGRVG AL +MG   D++VKR +  P     G
Sbjct: 41  RTSRGRKVTATVMAS-SSNKVEPAIIIGGGRVGQALFDMGVEGDVIVKRDDEFPSSPSSG 99

Query: 88  PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGE 147
           PIFVCTRND LE V+E  P +R  DL+F QNG ++ +L+SKGL+   Q L YFAV+K GE
Sbjct: 100 PIFVCTRNDVLETVIEKTPANRREDLIFMQNGYLDSFLQSKGLEKNTQALIYFAVAKFGE 159

Query: 148 RPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFML 207
           +P DG T  NPEGLTA  GKW+  VA R   G LSC V D   ++  MLEKLIWISAFML
Sbjct: 160 KPTDGVTSVNPEGLTAVTGKWSDQVAARFHNGDLSCHVKDAAKYRASMLEKLIWISAFML 219

Query: 208 VGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPT 267
           VGA+H GATVG V   ++SEV  LI EL    A    +TF+    +RL AY+ +VA+FPT
Sbjct: 220 VGAQHPGATVGDVATTHKSEVEGLIKELVQGVADINDVTFEAGTLERLLAYADSVAHFPT 279

Query: 268 AVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           AVKEF+WRNG+FY+LS +   + K DP P HTA LK I  +
Sbjct: 280 AVKEFEWRNGFFYNLSTQTMQQSKDDPFPKHTAMLKAINAI 320


>gi|145349172|ref|XP_001419014.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579244|gb|ABO97307.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 269

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 146/269 (54%), Positives = 180/269 (66%), Gaps = 1/269 (0%)

Query: 41  MASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE-GPIFVCTRNDDLE 99
           M + T  +V P+ I+GGGRVG AL++MG   D++V+RGE  P + + GPI+V TRND LE
Sbjct: 1   MTNATKGEVMPSIIIGGGRVGQALRDMGVEGDVVVRRGEAFPSEPKTGPIYVATRNDVLE 60

Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
            V++  P  R  DLVF QNGM++ +L+++GL    Q L YFAV+ LG++P DG T+ NPE
Sbjct: 61  DVIKNCPADRREDLVFMQNGMLQQFLDAQGLGANTQALIYFAVAALGDKPTDGVTELNPE 120

Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
           GLTAA GKWA  VA R   G LSC VLD   ++  M EKLIWI A MLVGA+H GA VG 
Sbjct: 121 GLTAATGKWADAVAARFHNGDLSCHVLDAPTYEGSMYEKLIWICAMMLVGAQHEGAKVGD 180

Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
           V   Y+ EV ALI EL     A  G  F P   DRL AY++ VA+FP  VKEF WRNG+F
Sbjct: 181 VATTYKDEVVALIEELLSGTTACTGAKFAPGAVDRLIAYAKTVAHFPCGVKEFPWRNGYF 240

Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           Y LSEKA A+GK DP P HTA LK I  +
Sbjct: 241 YGLSEKAIADGKSDPFPTHTALLKNIGAI 269


>gi|308806395|ref|XP_003080509.1| unnamed protein product [Ostreococcus tauri]
 gi|116058969|emb|CAL54676.1| unnamed protein product [Ostreococcus tauri]
          Length = 267

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 144/263 (54%), Positives = 174/263 (66%), Gaps = 1/263 (0%)

Query: 47  TQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAA 105
           T VAPA I+GGGRVG AL+EMG   D++V+RGE  P +   GPIFV TRND LE V+   
Sbjct: 5   TSVAPAIIIGGGRVGQALREMGPEGDVVVRRGEAFPSEPSSGPIFVATRNDVLEDVIANT 64

Query: 106 PRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAY 165
           P +R  DLVF QNGM++ +L+++GL    Q L YFAV+K G+ P DG T+ NPEGLTA  
Sbjct: 65  PANRREDLVFMQNGMLQTFLDAQGLGANTQALIYFAVAKAGDAPTDGVTELNPEGLTAVT 124

Query: 166 GKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYR 225
           GKWA  VA R   G LSC VLD   +Q  M EKLIWI A MLVGA+H GA VG V   ++
Sbjct: 125 GKWADAVAARFHAGNLSCHVLDASTYQGSMYEKLIWICAMMLVGAQHEGAKVGDVATTHK 184

Query: 226 SEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEK 285
            EV ALI EL        G  F P   DRL AY++ VA+FP  VKEF WRNG+F+ LS +
Sbjct: 185 DEVVALIEELLSGTTTCTGAQFAPGAVDRLIAYAKTVAHFPCGVKEFPWRNGYFHGLSAE 244

Query: 286 ASAEGKPDPCPLHTAWLKEIKVV 308
           A AEGKPDP P+HT  L+ I  +
Sbjct: 245 AIAEGKPDPFPIHTKLLQNIGAI 267


>gi|323455643|gb|EGB11511.1| hypothetical protein AURANDRAFT_21633, partial [Aureococcus
           anophagefferens]
          Length = 281

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 139/264 (52%), Positives = 172/264 (65%), Gaps = 6/264 (2%)

Query: 46  TTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDF--EGPIFVCTRNDDLEAVLE 103
           T    PA IVG GR+G+ LK  G   D ++      P D    GPI+VCTRND L +V+ 
Sbjct: 13  TEHDVPAVIVGSGRIGSMLKSDG---DAVMTSSSGWPADAPESGPIYVCTRNDALASVVA 69

Query: 104 AAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTA 163
           A P  R  DL F QNGM+ P+LE +GL DA QVL Y AV+KLGE P DG TD NPEGLTA
Sbjct: 70  AVPEDRRADLCFLQNGMLGPFLEGQGLGDATQVLVYLAVAKLGETPTDGVTDANPEGLTA 129

Query: 164 AYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKE 223
           A GKWA   A RL++  L+C V + +AF   MLEK +WI AFMLVGA H G +VG VE++
Sbjct: 130 ATGKWADAFASRLALKELTCHVKEGDAFAVAMLEKHVWICAFMLVGATH-GCSVGDVERK 188

Query: 224 YRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLS 283
           + +E  AL AEL  A A   G+       +RL AY+R VA+FPTAVKEF+WRNGWFY ++
Sbjct: 189 HAAEFDALAAELLDAGAEALGVAVPAGAPERLKAYARTVAHFPTAVKEFEWRNGWFYKIT 248

Query: 284 EKASAEGKPDPCPLHTAWLKEIKV 307
             A A GK DP PLHT  L ++++
Sbjct: 249 NDAVAAGKEDPMPLHTQKLYDLEL 272


>gi|428179462|gb|EKX48333.1| hypothetical protein GUITHDRAFT_157525 [Guillardia theta CCMP2712]
          Length = 274

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 129/261 (49%), Positives = 174/261 (66%), Gaps = 5/261 (1%)

Query: 52  AAIVGGGRVGTALKEMGKG---QDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRS 108
             I+G GR+GTALKEMGK     D++V+RGE +P +  GPI+V TRNDDL+ V+ A P S
Sbjct: 9   CTIIGAGRIGTALKEMGKASGFDDVVVRRGEKIPAEGSGPIYVTTRNDDLKGVISACPPS 68

Query: 109 RWNDLVFF-QNGMIEPWL-ESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYG 166
           R  DLVF+ QNG IE +L E K   D+ +VL Y AV+KLG++P DG TD NPEGLTAA G
Sbjct: 69  RKGDLVFYGQNGYIEDFLKEEKVFDDSTKVLVYMAVAKLGDKPTDGITDLNPEGLTAATG 128

Query: 167 KWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
           KWA  VA RL   GL+C VL+++ F+  M EK +WI  FML+G  H G TVG VE +++ 
Sbjct: 129 KWAEQVAARLRGSGLTCNVLNEDEFRACMFEKHMWICTFMLLGVHHGGITVGEVESKHKE 188

Query: 227 EVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKA 286
           +V  ++ E+      E+GI F   + +RLCAY+R+VA+FPTA+KEF WRN ++  ++   
Sbjct: 189 DVFKMVEEMKKCLEVERGIKFKDGVPERLCAYARSVAHFPTALKEFSWRNKFWLDITRAR 248

Query: 287 SAEGKPDPCPLHTAWLKEIKV 307
             +G  D  P HT  L ++ V
Sbjct: 249 QGKGAEDLTPFHTKLLLKLPV 269


>gi|224006582|ref|XP_002292251.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971893|gb|EED90226.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 303

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 121/300 (40%), Positives = 166/300 (55%), Gaps = 21/300 (7%)

Query: 14  STLWPPL---KKPTFSKPRFAKPTPVSAFAMASFTTTQVA--PAAIVGGGRVGTALKEMG 68
           ST +P L      T S   F+   P S    +   TTQ++     ++GGGR+G+ +    
Sbjct: 4   STAFPALLAFSMYTTSAQGFSSSAPSSCTTSSCRQTTQLSATDTIVIGGGRIGSLISNDA 63

Query: 69  KGQDLLVKRGELVPLDF----EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPW 124
           K    L+ R + V        EGPIF+ TRND LE+++E  P SR  DLVF QNG ++ +
Sbjct: 64  K----LLGRNDSVASSIDPNGEGPIFIATRNDVLESIVEDCPESRRKDLVFLQNGYLDNF 119

Query: 125 LESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCK 184
           L SK L    Q L Y +V+  G  P+DG T  NPEGLTAA G  A   A+RL+  GL C 
Sbjct: 120 LASKDLLSNTQALLYLSVTAKGVDPVDGITTVNPEGLTAATGVHAQAFADRLATLGLKCN 179

Query: 185 VLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKG 244
           V+  E ++  M EKLIWIS +MLVG      +VG    +++  V  +I+EL  A +A++G
Sbjct: 180 VVSTEDYRPAMFEKLIWISTYMLVGTAKNCKSVGQAGSDHKHLVENIISELVAAVSAKEG 239

Query: 245 ITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKE 304
           I F     +RL AY+  V +FP  VKEF+WRN +FY L ++A        CPLH   L+E
Sbjct: 240 IAFQEGTIERLAAYTDVVTDFPCGVKEFEWRNKYFYDLGDEA--------CPLHNGLLRE 291


>gi|219116430|ref|XP_002179010.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409777|gb|EEC49708.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 231

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 101/233 (43%), Positives = 143/233 (61%), Gaps = 2/233 (0%)

Query: 54  IVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDL 113
           I+G GR+G  L E GK +  +  R + +  + +G I + TRND LE+++E  P +R  DL
Sbjct: 1   IIGSGRIGALLAEAGKCK--VSGREDSIDSEGKGSILIATRNDSLESIVEKCPENRRKDL 58

Query: 114 VFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVA 173
           VF QNG ++ +L+SKGL    QVL Y + + +  + +DG T  NPEGLTAA G  A   A
Sbjct: 59  VFMQNGYLDKFLDSKGLLGNTQVLLYLSKASMEAKAVDGVTSVNPEGLTAATGIHAQAFA 118

Query: 174 ERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIA 233
           +RL+   L C V+  ++++  M EKLIWIS +MLVG      +VG   +EYR  V A++ 
Sbjct: 119 DRLASLDLKCNVVSPKSYRPAMFEKLIWISTYMLVGTAKECKSVGQASEEYRDIVEAVVN 178

Query: 234 ELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKA 286
           EL  A  A++GI F      RL AY+  VA+FP AVKEF+WRN +F+ L + A
Sbjct: 179 ELTAAVCAKEGIEFPDGTMARLAAYTDVVADFPCAVKEFQWRNQYFFDLGDDA 231


>gi|388504328|gb|AFK40230.1| unknown [Lotus japonicus]
          Length = 114

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 87/114 (76%), Positives = 102/114 (89%)

Query: 195 MLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDR 254
           MLEKLIWI + MLVGARH G +VG VEKE+  E+ +LI+ELALAAA+EKG+TF+  MEDR
Sbjct: 1   MLEKLIWICSVMLVGARHGGVSVGNVEKEFHLELCSLISELALAAASEKGLTFEEGMEDR 60

Query: 255 LCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
           +CAYSRAVA+FPTAVKEFKWRNGWFYSLSEKA+A+GK DPCPLHT WLKE+K+V
Sbjct: 61  MCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKATAQGKQDPCPLHTQWLKELKIV 114


>gi|428171530|gb|EKX40446.1| hypothetical protein GUITHDRAFT_75611 [Guillardia theta CCMP2712]
          Length = 276

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 98/266 (36%), Positives = 155/266 (58%), Gaps = 15/266 (5%)

Query: 48  QVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE---------GPIFVCTRNDDL 98
           +++PA IVG GRVG  +K  G   D +V+    +    +         GPI++C  N  L
Sbjct: 2   ELSPAVIVGDGRVGHMMKNRGDPGDAMVRSTSELRFQLKEAPGGGPKSGPIYLCVPNSAL 61

Query: 99  EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
           + ++EA P  R  DLVF  +G + P+L+  GL+   Q L YF + KLG  P+DGKT  +P
Sbjct: 62  KELVEACPPERRQDLVFMGSGNLGPFLKQFGLEGNTQALIYFVIDKLGHDPVDGKTSLDP 121

Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
           +GLTA  GKW+  +A+RL +  LSC+++D   F+   LE+L+W  A  LV   +    VG
Sbjct: 122 DGLTAVTGKWSGDLAKRLKMVNLSCRIMDSNKFEISRLERLVWTCATHLVAKANKFTKVG 181

Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAME-----DRLCAYSRAVANFPTAVKEFK 273
            V + + SEV  ++ EL +    E+ ++    +E     +R+  Y+ ++ +F T VK+ +
Sbjct: 182 QVVESHWSEVEEVMHEL-VRVLNEQAVSGSVQLEASQFYERMKEYTLSIGHFNTVVKDLE 240

Query: 274 WRNGWFYSLSEKASAEGKPDPCPLHT 299
           +RNG+FY +S++  A+G PDPCP+HT
Sbjct: 241 FRNGYFYKISQERLAKGLPDPCPMHT 266


>gi|397642896|gb|EJK75525.1| hypothetical protein THAOC_02750 [Thalassiosira oceanica]
          Length = 298

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 110/282 (39%), Positives = 156/282 (55%), Gaps = 28/282 (9%)

Query: 27  KPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE 86
           + R    T +SA A A+  T+ V    ++GGGR+G+ L E       L+ R + +    +
Sbjct: 29  QSRTTATTSLSA-ANANVATSDV----VIGGGRIGSILAEDAT----LLGREDSISSSID 79

Query: 87  ----GPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
               GPI+V TRND LE +++  P  R  DLVF QNG ++ +LE KGL    Q L Y +V
Sbjct: 80  PAGTGPIYVATRNDVLEGIVDDCPAGRRKDLVFLQNGYLDSFLERKGLLQNTQALLYLSV 139

Query: 143 SKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWI 202
              G  P+DG T  NPEGLTAA G+WA   A++L+   L   V+  E ++  M EKLIWI
Sbjct: 140 PAKGADPVDGITSVNPEGLTAATGEWAQAFADKLATLDLKMNVVTPEQYRPAMFEKLIWI 199

Query: 203 SAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAV 262
           S +MLVG     ++VG    E++  V  +IAEL  A +A++GI+F     +RL AY+  V
Sbjct: 200 STYMLVGTVKDCSSVGQAGNEHKQLVEDVIAELVAAVSAKEGISFTSGTIERLAAYTDVV 259

Query: 263 ANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKE 304
           A+FP  VK       +FY L ++A        CP+H   L+E
Sbjct: 260 ADFPCGVK-------YFYDLGDEA--------CPIHNRLLRE 286


>gi|37806393|dbj|BAC99931.1| unknown protein [Oryza sativa Japonica Group]
          Length = 142

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 90/113 (79%), Positives = 99/113 (87%)

Query: 120 MIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVG 179
           M++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPEGLTAA+G WA  VA RL  G
Sbjct: 1   MLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPEGLTAAFGNWAPAVATRLQNG 60

Query: 180 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALI 232
           GL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG VEKEY+SE + +I
Sbjct: 61  GLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEYQSEFTLVI 113


>gi|452823237|gb|EME30249.1| hypothetical protein Gasu_24020 [Galdieria sulphuraria]
          Length = 305

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 98/262 (37%), Positives = 140/262 (53%), Gaps = 20/262 (7%)

Query: 46  TTQVAPAAIVGGGRVGTAL-KEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
           T Q     ++G GR+G+ L +  G  + ++V RGE +P D  GPI+VCTRNDDL++++  
Sbjct: 58  TGQSFQEVVIGNGRIGSLLARAPGMKEPVVVTRGESIPADGYGPIYVCTRNDDLDSIVAR 117

Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
            P  R  DLVF QNGMI+ +LE + +++  Q L YFAV+  G+ P+DG       G T  
Sbjct: 118 TPSDRRPDLVFIQNGMIQTFLEERNIQNNTQALIYFAVASKGDSPVDG-------GGTVV 170

Query: 165 YGKWASVVAERLSVGGLSCKVL-DKEAFQKQMLEKLIWISAFM-LVGARHTGATVGVVEK 222
           YGKWA   A+R+   G  C V+ DK  F K+M+EKL+WI     L    H      V  +
Sbjct: 171 YGKWAEAFAQRVCSLGCKCSVVEDKTEFIKKMVEKLLWICVMGPLCATSHFTCGQVVANE 230

Query: 223 EYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSL 282
               ++  L+ EL   A     I  D  +  R+  YS  V+++   VKE  WRNGWF  L
Sbjct: 231 TCLRDLEDLVNELKPIAEEVLRIKLDDGVVSRIVEYSNKVSDYRAGVKEIAWRNGWF--L 288

Query: 283 SEKASAEGKPDPCPLHTAWLKE 304
             K +        PLH +++K 
Sbjct: 289 ERKKT--------PLHVSYMKN 302


>gi|428181809|gb|EKX50672.1| hypothetical protein GUITHDRAFT_151218 [Guillardia theta CCMP2712]
          Length = 357

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 98/274 (35%), Positives = 151/274 (55%), Gaps = 21/274 (7%)

Query: 50  APAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEG----PIFVCTRNDDLEAVLEAA 105
            P+ IVGGGRVG  L+  G G D++V RG  +P   EG    P++VC    +LE V+ + 
Sbjct: 72  TPSVIVGGGRVGEMLRGAGGGDDVVVGRGGQIPEVHEGLESFPVYVCVPTSELENVIRSC 131

Query: 106 PRSRWNDLVFFQNGMIEPWLESKGL--KDANQVLAYFAVSKLGERPIDGKTDTNPE---- 159
           P+S+ +DLVF Q+GMIEP L+   L   +  Q+L YF +   G RP D   D   +    
Sbjct: 132 PQSKLDDLVFVQDGMIEPILKKYALCGNENTQMLPYFTLFSKGGRPQDSLVDLGTDAYGE 191

Query: 160 ----GLTAAYGKWASVVAERLSVGGLSCKVLDKEAF----QKQMLEKLIWISAFMLVGAR 211
               G T+  GKWA  + +RL      C    +  F    ++ MLEK+I+ S F LVGA 
Sbjct: 192 PKYAGETSICGKWAGALKQRLDRLNF-CHRHSRTMFYRDWRRAMLEKVIFDSVFPLVGAL 250

Query: 212 HTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRA--VANFPTAV 269
           + G ++G V + +  E   +I E+A +      +T    +E+RL AYS+   V   P  +
Sbjct: 251 NKGISIGEVGQSHTEECQEMITEIARSLPGTLAVTLMYGVEERLFAYSQHDNVEFRPCKI 310

Query: 270 KEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLK 303
           +++ +RNG++Y+ S++A   G PDP P+H+ +L+
Sbjct: 311 EQYPFRNGFWYTFSQQAKQIGFPDPTPMHSEYLE 344


>gi|449016404|dbj|BAM79806.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 312

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/238 (38%), Positives = 130/238 (54%), Gaps = 18/238 (7%)

Query: 52  AAIVGGGRVGTALKEMGKGQDL----LVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPR 107
           A +VG GR+GT L    KG +L    L++R + +    +GPI+VCTRNDDL  V+   P 
Sbjct: 59  ATVVGAGRIGTLLAS-AKGDELAPVRLLRRQDSIEGGRKGPIYVCTRNDDLAEVVAKCPP 117

Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
            R  DLVF QNGM+  +L    L+ A Q L +FAV ++G+ P DG       G T  +G 
Sbjct: 118 VRRGDLVFLQNGMLRSFLLEHNLEHATQALVFFAVPRVGDPPADG-------GGTVVWGP 170

Query: 168 WASVVAERLSVGGLSCKVL-DKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
           WA+  A R+   G  C VL DK+ F  +M+EKL+WI  F ++     G  VG + ++   
Sbjct: 171 WAAAFAARVRSCGCQCTVLEDKQEFDCRMIEKLLWICIFGVLSESQNGMPVGEIAEKQPD 230

Query: 227 EVSALIAELALAAAAEKGITFDPAME-----DRLCAYSRAVANFPTAVKEFKWRNGWF 279
            V  L+ ELA  A  E G+    A++       L  YS+ + +F    KE++WRNGWF
Sbjct: 231 RVRQLVDELAPIAEREVGLEGKHALDRGQLLQSLLDYSKKIPDFVAGFKEYRWRNGWF 288


>gi|299117349|emb|CBN75305.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 322

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 96/284 (33%), Positives = 144/284 (50%), Gaps = 31/284 (10%)

Query: 51  PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDF--------EG--------PIFVCTR 94
           P+ I+G G++GTAL  MG G+D+++ RGE +P           +G        PI+VC  
Sbjct: 29  PSVIIGRGKLGTALASMGMGEDVVLGRGEAIPDSLPCSSGKGEDGEPAMLQNFPIYVCVP 88

Query: 95  NDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLA--YFAVSKLGERPI-- 150
            +++ AV+++ P  + +DLVF  +G +EP L+S+G+    Q  A  YF+V K+G R +  
Sbjct: 89  EEEVAAVIDSCPDDKKDDLVFMSSGCLEPLLKSRGMCRGEQTQATLYFSVDKVGGRAVDA 148

Query: 151 ------DGKTDTNPEGLTAAYGKWASVVAERL--SVGGLSCKVLDKEAFQKQMLEKLIWI 202
                 DG+ +    G TA  GKW    A RL  S  GL C       +++ M EKLI+ 
Sbjct: 149 MTVIGTDGRGEPKLAGETAVCGKWKGAFAMRLQQSPFGLHCSTEYYLEWRRVMFEKLIFD 208

Query: 203 SAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAV 262
            A  LVGA H   TVG V + +  E+  ++ EL     +   ++      +RL AY   V
Sbjct: 209 CAVNLVGALHKFPTVGYVVQYHAEELVDMLFELKNTLRSTLAVSLMNGYPERLIAYGDQV 268

Query: 263 ANFPTA---VKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLK 303
            +  T     + FKW N +FY +S  A      DPCP+HT +L 
Sbjct: 269 GDRQTTRMDARTFKWYNEFFYDISNTALKAKFGDPCPMHTEYLN 312


>gi|149918918|ref|ZP_01907404.1| hypothetical protein PPSIR1_16645 [Plesiocystis pacifica SIR-1]
 gi|149820292|gb|EDM79709.1| hypothetical protein PPSIR1_16645 [Plesiocystis pacifica SIR-1]
          Length = 260

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/264 (38%), Positives = 138/264 (52%), Gaps = 25/264 (9%)

Query: 54  IVGGGRVGTALKEMGKGQDLLVKR-------GELVPLDFEGPIFVCTRNDDLEAVLEAAP 106
           +VG GRVGTAL  +   +  LV R       G   P     PI +  RND L AVLE  P
Sbjct: 3   VVGAGRVGTALAALDPERVTLVDRVANWAFVGAEAPGRPGEPIVLAVRNDALPAVLERIP 62

Query: 107 RSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYG 166
           ++R  DLVF QNGM+ P+L+++GL +A + L +FAV   G           P G +   G
Sbjct: 63  KARHPDLVFVQNGMLRPYLDARGLGEATRGLLFFAVPTRG-------AALEPGGESPFCG 115

Query: 167 KWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
             A  V E L   G+   V+D   F    LEKLIW SAF L+        VG V +E+  
Sbjct: 116 PKAEAVVEDLLAMGVPAAVVDAPKFAAVELEKLIWNSAFGLL-CEARACDVGTVVREHGE 174

Query: 227 EVSALIAELALAAAAEKGITFDPAME-----DRLCAYSRAVANFPTAVKEFKWRNGWFYS 281
            + AL+AE+      E G+  + A+E     +RL +YS ++A++  AVKE+ WRNGWF  
Sbjct: 175 ALRALVAEMLAVGGPELGLVGEAALELEALVERLASYSLSIASYRGAVKEWTWRNGWFV- 233

Query: 282 LSEKASAEGKPDPCPLHTAWLKEI 305
             E+A A G+   CP+H   L E+
Sbjct: 234 --ERARARGR--ACPVHDGLLAEV 253


>gi|302848804|ref|XP_002955933.1| hypothetical protein VOLCADRAFT_96870 [Volvox carteri f.
           nagariensis]
 gi|300258659|gb|EFJ42893.1| hypothetical protein VOLCADRAFT_96870 [Volvox carteri f.
           nagariensis]
          Length = 353

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/277 (35%), Positives = 137/277 (49%), Gaps = 43/277 (15%)

Query: 54  IVGGGRVGTALKEMGKGQDL---LVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRW 110
           +VG GRVG A  +M  G DL   + +R   + LD +GPI+VCT ND L+ VL+  P +R 
Sbjct: 90  LVGYGRVGKAFAKM-LGDDLAATVSRRNGSISLDGKGPIYVCTTNDALDDVLQRTPPARR 148

Query: 111 NDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWAS 170
            DLV  QNG + PWL+  GL+D   V  Y A S  G    DG+        T A G+W  
Sbjct: 149 RDLVLLQNGWLLPWLDRNGLEDVTLVALYMAASP-GGTATDGRR-------TVASGRWGD 200

Query: 171 VVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSA 230
            VA  L+ GG+SC V+ ++A  K ++EK +W S F ++     GA VG +   +R +V A
Sbjct: 201 HVASTLNRGGVSCAVVSRDALYKALVEKALWASIFWMLSCALGGAKVGDIAVHHRLDVKA 260

Query: 231 LIAEL------ALAAAAEK--------------GITFDPAMEDRLCAYSRAVANFPTAVK 270
           L+ EL      +L AAA K              G   D A+ D   +  +AV +   A+ 
Sbjct: 261 LVDELLPMMRQSLEAAAAKRPGADAAAAALRDHGTVLD-ALMDYSMSIGQAVPSREMALA 319

Query: 271 EFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKV 307
           EF WRNG    +             PLH +WL+   V
Sbjct: 320 EFSWRNGALLEMGLT----------PLHCSWLQRAGV 346


>gi|95113962|gb|ABF55515.1| chloroplast At1g16080 protein [Solanum lycopersicum]
          Length = 75

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 62/73 (84%), Positives = 65/73 (89%)

Query: 112 DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASV 171
           +LVFFQNGM+EPW +SKGL DANQVLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS 
Sbjct: 3   NLVFFQNGMLEPWFQSKGLGDANQVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGKWASA 62

Query: 172 VAERLSVGGLSCK 184
           VA RL  GGLSCK
Sbjct: 63  VAARLQNGGLSCK 75


>gi|95113968|gb|ABF55518.1| chloroplast At1g16080 protein [Lactuca sativa]
          Length = 72

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 59/72 (81%), Positives = 64/72 (88%)

Query: 113 LVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVV 172
           LVFFQNGM+EPW ESKGL DA+QVLAYFAV+KLGE PIDG TDTNPEGLTAA+GKW S V
Sbjct: 1   LVFFQNGMLEPWFESKGLGDADQVLAYFAVAKLGEPPIDGITDTNPEGLTAAFGKWGSAV 60

Query: 173 AERLSVGGLSCK 184
           A+RL  GGLSCK
Sbjct: 61  ADRLRAGGLSCK 72


>gi|95113973|gb|ABF55520.1| chloroplast At1g16080 protein [Coffea canephora]
          Length = 69

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 56/69 (81%), Positives = 59/69 (85%)

Query: 116 FQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAER 175
           FQNGM+E W +SKGL D +QVLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS VA R
Sbjct: 1   FQNGMLETWXQSKGLSDPDQVLAYFAVSKLGESPTDGKTDTNPEGLTAAYGKWASAVAGR 60

Query: 176 LSVGGLSCK 184
           L  GGLSCK
Sbjct: 61  LRAGGLSCK 69


>gi|255080954|ref|XP_002504043.1| predicted protein [Micromonas sp. RCC299]
 gi|226519310|gb|ACO65301.1| predicted protein [Micromonas sp. RCC299]
          Length = 326

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 138/305 (45%), Gaps = 43/305 (14%)

Query: 29  RFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKG---QDLLVKRGELVPLDF 85
           R ++ +P    A AS +        IVG GRVGT + +M +    +   V RG +     
Sbjct: 20  RASRMSPARCRADASSSGADTV--TIVGRGRVGTTIGKMCESLGVRHAFVTRG-MASFPP 76

Query: 86  EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMI-EPWLESKGLKD-ANQVLAYFAVS 143
            GPI+V T   DL+ VL   P  R NDLV  Q G++ + +L+ +GL   A QV  Y + S
Sbjct: 77  SGPIYVATHASDLDDVLALVPTDRANDLVLLQGGLLRDDFLQRRGLAGVATQVALYMSAS 136

Query: 144 KLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGG-LSCKVLDKEAFQKQMLEKLIWI 202
             G       T  +  G T A G  A+ V E L+ GG + C V++++ F+   + KL W 
Sbjct: 137 GDG-------TARDGGGGTCACGPRAADVRELLTRGGNVRCAVVNEDDFRVASVGKLCWT 189

Query: 203 SAFMLVGARHTGA--------TVG-VVEKEYRSEVSALIAELALAAAAEKGITFDP---- 249
           SAF L+  R   A        TVG VV+ E+ +     +A   L  A   G   D     
Sbjct: 190 SAFWLL-CRSVAAERGLPRALTVGEVVDSEHGAAAVRTLARELLDCAEASGELDDTAGDE 248

Query: 250 ---AMEDRLCAYSR----AVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWL 302
              AM   +  YSR    AV +   A+KE  +RNGWF +    +S      P   H   L
Sbjct: 249 AREAMTRAMFEYSRSIPSAVPSLEMALKEGGFRNGWFLARRTVSS------PQRTHETHL 302

Query: 303 KEIKV 307
           + I V
Sbjct: 303 RRIGV 307


>gi|159491118|ref|XP_001703520.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280444|gb|EDP06202.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 349

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/274 (27%), Positives = 111/274 (40%), Gaps = 71/274 (25%)

Query: 86  EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLK-------------- 131
           +GPI+VCT ND L+ VLE  PR+R  DLVFFQNG + PWL    L+              
Sbjct: 86  QGPIYVCTTNDALDWVLEQTPRNRRCDLVFFQNGWLAPWLAHNSLRLGPGSAAGAAAEGG 145

Query: 132 ----------------------DANQVLAYFAVSKLGERPIDGKTDTNPEGL-TAAYGKW 168
                                 +A+   +   V+ +      G   T  +GL T A G+W
Sbjct: 146 AAAAAGREAEAARGDGGGAGGAEASGPGSAVEVTLVALYMAAGPDGTAKDGLRTVASGRW 205

Query: 169 ASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEV 228
           A  V+  LS G + C+ +        ++EKL+W   F ++ A   G  VG +   +R +V
Sbjct: 206 AQHVSRTLSRGAVRCRAVGGADMYGAVVEKLLWACVFWMMSAALGGMNVGDIVARHRGDV 265

Query: 229 SALIAELALAAAAE-KGITFD-----------------------PAMEDRLCAYSRAVAN 264
           +AL AEL      + +GI                           A+ +   + S AV +
Sbjct: 266 AALTAELLPPLCRQLRGIAAAGAGPEAAEAAAALEAPAGAERVVAALVEYSVSISSAVPS 325

Query: 265 FPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLH 298
              A+ EF+WRNG    +           P PLH
Sbjct: 326 RDMALAEFRWRNGALLDMG----------PTPLH 349


>gi|303271083|ref|XP_003054903.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462877|gb|EEH60155.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 376

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 129/311 (41%), Gaps = 69/311 (22%)

Query: 54  IVGGGRVGTAL-KEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAAPRSRWN 111
           + G GRV + L + +G  + +LV R    P D   GPI+V T   DL AVL   P SR  
Sbjct: 63  VFGRGRVASVLARALGISRCVLVGRYNSFPADDIGGPIYVATHASDLAAVLAKVPASRRR 122

Query: 112 DLVFFQNGMI-EPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWAS 170
           D+V  Q G++   WLE++G  DA QV  + +  + G+  +DG       G T A G  A+
Sbjct: 123 DVVLMQGGLLRRAWLEARGAADATQVALFLSARRDGD-VVDGG------GRTVACGPHAA 175

Query: 171 VVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGA-----TVG--VVEKE 223
            V E L+  G+ C  +D   F     EKL+W S F L+ A   GA     TVG  V + +
Sbjct: 176 HVVETLARVGVRCAEVDATEFGTAATEKLLWASIFWLMCATRGGAGGDAVTVGEIVNDVD 235

Query: 224 YRSEVSALIAEL-----------ALAAAAEKGIT-------------------------- 246
             +   AL  EL             A AAE G +                          
Sbjct: 236 ASARAEALAVELMRVVVAAGELDGGADAAEDGTSTVAGSPPAPALPLLDAEALEYEGTGF 295

Query: 247 -FDPAMEDRLCAYS--RAVANFP-------TAVKEFKWRNGWFYSLSEKASAEGKPDPCP 296
             D   E  L      R   + P        A +E  +RNGWF + +E+ + E      P
Sbjct: 296 GTDAEFEQGLIVIGLFRYCDDIPDAAPSAEMATREGAFRNGWFLAAAERYALE-----TP 350

Query: 297 LHTAWLKEIKV 307
           LH   L++  V
Sbjct: 351 LHERLLRDAGV 361


>gi|224053048|ref|XP_002297681.1| predicted protein [Populus trichocarpa]
 gi|222844939|gb|EEE82486.1| predicted protein [Populus trichocarpa]
          Length = 60

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/74 (63%), Positives = 54/74 (72%), Gaps = 14/74 (18%)

Query: 235 LALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDP 294
           LA AAAAEKGI F  A+E+RLCAY+ AVA+FPTAVKE              A +EGKPDP
Sbjct: 1   LASAAAAEKGIVFKEAIEERLCAYAGAVAHFPTAVKE--------------AVSEGKPDP 46

Query: 295 CPLHTAWLKEIKVV 308
           CPLHT+WLKE+KVV
Sbjct: 47  CPLHTSWLKELKVV 60


>gi|95113971|gb|ABF55519.1| chloroplast At1g16080 protein [Solanum tuberosum]
          Length = 81

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 45/75 (60%), Positives = 50/75 (66%)

Query: 110 WNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWA 169
           W+DLVFFQNGM EP  +SK L++A   LAY AVSKLG+ P DGKTDT PEG   AYGKW 
Sbjct: 7   WSDLVFFQNGMQEPGFQSKALEEAEPSLAYLAVSKLGKPPTDGKTDTIPEGWMPAYGKWP 66

Query: 170 SVVAERLSVGGLSCK 184
           S        GG S K
Sbjct: 67  SPGVPGCRNGGFSSK 81


>gi|422293651|gb|EKU20951.1| hypothetical protein NGA_0116101 [Nannochloropsis gaditana CCMP526]
 gi|422295684|gb|EKU22983.1| hypothetical protein NGA_0116102 [Nannochloropsis gaditana CCMP526]
          Length = 222

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 64/214 (29%), Positives = 100/214 (46%), Gaps = 28/214 (13%)

Query: 116 FQNGMIEPWLESKGL--KDANQVLAYFAVSKLGERPIDG-------KTDTNPE--GLTAA 164
            Q+  +E  L+  GL  K   Q L YF++S  G  P++G       KTD  P+  G T  
Sbjct: 1   MQSACLETLLKRYGLANKKTTQCLPYFSISPFG-LPVEGRVQYDVSKTDGEPKFAGQTVV 59

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGA---TVGVVE 221
            G W   VAERL    L  +V+D   ++K MLE+L++ S F L+   H  +   T G + 
Sbjct: 60  CGTWRGAVAERLRKQDLHVEVMDYTYWRKHMLERLLFDSLFGLLACVHADSGEKTYGELA 119

Query: 222 KEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPT---------AVKEF 272
           + +  E+  ++ EL+ A A    +     +E R+    R +A  P            +E+
Sbjct: 120 QNHWEEIDDMVWELSRAIANIMAVVLLTHVERRV----RGLAEGPHWRDRKPQMPTCREY 175

Query: 273 KWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIK 306
            +RNG   SLS   ++ G P P P H  +L  ++
Sbjct: 176 PFRNGMLRSLSRSFTSRGFPSPMPTHDEYLDFVR 209


>gi|218200804|gb|EEC83231.1| hypothetical protein OsI_28521 [Oryza sativa Indica Group]
          Length = 162

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 33/39 (84%), Positives = 38/39 (97%)

Query: 269 VKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKV 307
           +++FKWRNGWFYSL+EKA A+GKPDPCPLHTAWLKEIKV
Sbjct: 1   MEQFKWRNGWFYSLTEKALAQGKPDPCPLHTAWLKEIKV 39


>gi|95113966|gb|ABF55517.1| chloroplast At1g16080 protein [Helianthus annuus]
          Length = 48

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 38/48 (79%), Positives = 40/48 (83%)

Query: 137 LAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCK 184
             YFAVS LGE P+DG TD NPEGLTAAYGKWASVVA RL+ GGLSCK
Sbjct: 1   FGYFAVSXLGEPPVDGITDANPEGLTAAYGKWASVVATRLNAGGLSCK 48


>gi|255543393|ref|XP_002512759.1| hypothetical protein RCOM_1442650 [Ricinus communis]
 gi|223547770|gb|EEF49262.1| hypothetical protein RCOM_1442650 [Ricinus communis]
          Length = 76

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/48 (72%), Positives = 41/48 (85%), Gaps = 2/48 (4%)

Query: 259 SRAVANFPTAVKE--FKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKE 304
           SRAV + P AV+E  FKWRNGW YSLS+KA AEGKPDPCPL+T+WLK+
Sbjct: 4   SRAVGHLPAAVREAKFKWRNGWSYSLSQKAIAEGKPDPCPLYTSWLKD 51


>gi|397645963|gb|EJK77058.1| hypothetical protein THAOC_01133, partial [Thalassiosira oceanica]
          Length = 147

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/128 (36%), Positives = 68/128 (53%), Gaps = 13/128 (10%)

Query: 27  KPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE 86
           + R    T +SA A A+  T+ V    ++GGGR+G+ L E       L+ R + +    +
Sbjct: 29  QSRTTATTSLSA-ANANVATSDV----VIGGGRIGSILAEDAT----LLGREDSISSSID 79

Query: 87  ----GPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
               GPI+V TRND LE +++  P  R  DLVF QNG ++ +LE KGL    Q L Y +V
Sbjct: 80  PAGTGPIYVATRNDVLEGIVDDWPAGRRKDLVFLQNGYLDSFLERKGLLQNTQALLYLSV 139

Query: 143 SKLGERPI 150
              G  P+
Sbjct: 140 PAKGADPV 147


>gi|95113964|gb|ABF55516.1| chloroplast At1g16080 protein [Helianthus annuus]
          Length = 49

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 35/49 (71%), Positives = 37/49 (75%)

Query: 136 VLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCK 184
           VLAYFAVSKLGE  +DG TD NPEGLTAA    ASV   RL+ GGLSCK
Sbjct: 1   VLAYFAVSKLGEPHVDGITDANPEGLTAADDTCASVAYSRLNAGGLSCK 49


>gi|219113737|ref|XP_002186452.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583302|gb|ACI65922.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 326

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/234 (29%), Positives = 100/234 (42%), Gaps = 30/234 (12%)

Query: 84  DFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVS 143
           D   PIFV T       V   +  +R  DLV+  NG+  P  E   +     V+ +F V 
Sbjct: 101 DDGNPIFVATPAHSWRDVYTYSKPARRTDLVWVGNGLPLPAFERSTV-----VVPHFGVL 155

Query: 144 KLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDK-EAFQKQMLEKLIWI 202
           ++GE P+   + ++P   T  YGK    V       G+  KV+D           K++W 
Sbjct: 156 QVGESPV--TSISSPP--TYVYGKHTDAVCTISQKEGVFVKVIDSWHDLYAITARKVLWA 211

Query: 203 SAFMLVGARHTGA---TVGVVEKEYRSEVSALIAELALAAAAEKGIT-FDPAMED---RL 255
           S   LV  RH+     TV    K+ R  +  L++EL     A  G++ +    ED    L
Sbjct: 212 SVMWLV--RHSETPPLTVAAAHKQKRQLLIDLVSELFAPLQAIVGLSAYTNEYEDTFIYL 269

Query: 256 CAYS----RAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEI 305
             YS     A+ N   A+ E K RNG + SLS +        P  LH A ++E+
Sbjct: 270 ELYSMSMPYAIPNKQLAIDEIKERNGIWLSLSAQF-------PQELHQALVREV 316


>gi|223992611|ref|XP_002285989.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977304|gb|EED95630.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 409

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 97/222 (43%), Gaps = 29/222 (13%)

Query: 63  ALKEMGKGQDLLVKRGELVPLDF----EGPIFVCTRNDDLEAVLEAAP---RSRWNDLVF 115
           A+  +GKG DLL+   E +P         PI++     +L   ++      ++R  D VF
Sbjct: 110 AIVGLGKGNDLLLTPNEPLPPAIGETTSYPIYITLPPYELREFIQQLDEEWKARAEDFVF 169

Query: 116 FQNG----MIEPWLESKGL-KDA-NQVLAYFAVSKLG-------ERPIDGKT----DTNP 158
           F  G    ++EP L   G+ +DA  QV   F++ K G        +P D       D N 
Sbjct: 170 FSGGKICGVVEPTLREFGMCRDAMTQVTVGFSLPKPGTGLGGTTRKPEDMSCNIGADVNG 229

Query: 159 E----GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGA-RHT 213
           E    G + A GKW   +A RLS   + CK +    +++ M E+ ++ + + +VGA R  
Sbjct: 230 ENKWAGESVACGKWNGAIAGRLSENAIRCKTVFYREWRRAMWERALYDAVWNVVGAVRDE 289

Query: 214 GATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRL 255
                 V   +  E S ++ E+A        +T     EDRL
Sbjct: 290 PTDHSDVAMFFELEASDMMWEIANHLRGGLAVTLIYGFEDRL 331


>gi|428178288|gb|EKX47164.1| hypothetical protein GUITHDRAFT_152174 [Guillardia theta CCMP2712]
          Length = 123

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 21/125 (16%)

Query: 54  IVGG-GRVGTALKEMGKGQDLLVKRGELVPLDFEG-------PIFVCTRNDDLEAVLEAA 105
           IVGG GR+G+A++    G DL   R     LD  G       PIFVCT  ++LE V+   
Sbjct: 5   IVGGNGRIGSAIRRFA-GDDLYCTR----KLDKIGVNSPTNTPIFVCTGVENLEEVVSKT 59

Query: 106 PRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAY 165
             SR  DLVF QNG++   L     +D    + YF+V +      +G       G +   
Sbjct: 60  EPSRRRDLVFMQNGLVRSLLVDLE-EDQTIAVLYFSVLERNGPAKEG-------GCSYVQ 111

Query: 166 GKWAS 170
           G+WA 
Sbjct: 112 GRWAQ 116


>gi|397641392|gb|EJK74625.1| hypothetical protein THAOC_03687 [Thalassiosira oceanica]
          Length = 381

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 101/242 (41%), Gaps = 37/242 (15%)

Query: 63  ALKEMGKGQDLLVKRGELVPLDFEG----PIFVCT---RNDDLEAVLEAAPRSRWNDLVF 115
           A+  +GKG+D++    + +P    G    PI++        D    LE   ++R  D VF
Sbjct: 85  AIVALGKGEDIVRSPRDPLPDTMGGRDKYPIYIALPPYELRDFLQNLEGDWKNRTEDFVF 144

Query: 116 FQNG----MIEPWLESKGL-KDA-NQVLAYFAVSKLGERPIDGKT--------------D 155
              G    ++EP L   G+ +D+  Q+L  F++     RP+ G                D
Sbjct: 145 ISGGRSCGVVEPILREFGMCRDSMTQLLVGFSLP----RPLTGSVNARKPEDLSVRLGED 200

Query: 156 TNPE----GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGAR 211
            N E    G + A GKW   VA RL    + CK      +++ M E+ ++   + +VG+ 
Sbjct: 201 ANGEAKWAGESQACGKWNGAVAHRLKENAIRCKQCFYREWRRAMWEQAMYDCVWNVVGSV 260

Query: 212 HTGATVGV-VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVA-NFPTAV 269
               T  + V   Y  E S ++ ELA        +T     EDRL +++     + P  +
Sbjct: 261 REEKTDHLDVALYYELESSDMMWELANGLRGGLAVTLIYGFEDRLFSFAEQRGKDMPCEI 320

Query: 270 KE 271
            E
Sbjct: 321 DE 322


>gi|317051822|ref|YP_004112938.1| 2-dehydropantoate 2-reductase [Desulfurispirillum indicum S5]
 gi|316946906|gb|ADU66382.1| 2-dehydropantoate 2-reductase [Desulfurispirillum indicum S5]
          Length = 318

 Score = 45.4 bits (106), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 79/167 (47%), Gaps = 17/167 (10%)

Query: 113 LVFFQNGM-IE-PWLESKGLKDANQVLAYFAVSKLGERPIDGKT------DTNPEGLTAA 164
           +V  QNG+ IE P+ ++    +   VLA+  VS+L    I  +          PEG T  
Sbjct: 107 IVLIQNGIDIEVPYRQAYPEHEIISVLAFVCVSRLSPGKILHQDYGKLALGNYPEGKTP- 165

Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG--VVEK 222
              +   + E L +GG++C+V D+   Q+    KL+W ++F  +     G      + ++
Sbjct: 166 ---YLEALVELLRLGGVTCEVTDR--VQEARWRKLVWNASFNTLSVLGGGVDTAQLLADE 220

Query: 223 EYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAV 269
           E    V  ++ E+ L AAAE G    P++ D    ++R +  + T++
Sbjct: 221 ESCQLVRQVMEEVCLVAAAE-GHRLVPSIVDEQIEHTRVMTPYKTSM 266


>gi|223995543|ref|XP_002287445.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976561|gb|EED94888.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 800

 Score = 44.7 bits (104), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 65/154 (42%), Gaps = 6/154 (3%)

Query: 88  PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGE 147
           PI+ C  +  +E+V  A    R  DLVF  N +    L     +D +  + +F V  + +
Sbjct: 92  PIYACIPSSSIESVYYATIPQRRKDLVFLCNCIPSRHLAFASGEDVSIAILHFGVPTMKD 151

Query: 148 RPIDGKTDTNPEG-LTAAYGKWASVVAERLSVGGLSCKVLDKEA-FQKQMLEKLIWISAF 205
                + +T+PE   T  YG+ A+ +   L   G++  V    A  Q    +K+ W S  
Sbjct: 152 ADSYPQANTSPESPPTVVYGRHATALVRLLHNDGIASVVASTPAEVQAAAAKKIAWSSLM 211

Query: 206 ML----VGARHTGATVGVVEKEYRSEVSALIAEL 235
            L    VG      TV  V K    ++  L+ E+
Sbjct: 212 WLLCHDVGNEEDPLTVTEVHKYKSEQLQCLVEEI 245


>gi|397623999|gb|EJK67231.1| hypothetical protein THAOC_11766 [Thalassiosira oceanica]
          Length = 800

 Score = 44.3 bits (103), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 62/240 (25%), Positives = 100/240 (41%), Gaps = 30/240 (12%)

Query: 88  PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQV-LAYFAVS--- 143
           PI+ C  +  L +V  A    R  D+V+  N +    L      D   V + +F +S   
Sbjct: 124 PIYACVPSTALHSVWTATAPYRRRDIVYLCNCIPSRHLAFVNSSDDVTVGILHFGISFDD 183

Query: 144 KLGERPIDGKTDTNPEG-LTAAYGKWASVVAERLSVGGLSCKVLDK-EAFQKQMLEKLIW 201
            + ERP + K +T+P    T  +G+ A  VA  L   G+  KV  +    Q    +K+ W
Sbjct: 184 SVNERP-NPKLNTSPGSPETILHGRHALAVARLLEDDGIPVKVASEWNEVQSAAAKKIAW 242

Query: 202 ISAFMLVGARH--------TGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAME- 252
            S   L+   H          AT   + K+   E+   +  LA  +   +G + D   E 
Sbjct: 243 SSLMWLLCHEHADPIPVKEVHATKAGLLKQLVEEIHPALTILANESGTVEGSSMDTVAEV 302

Query: 253 -DRLCAYSRAVAN---FPT---AVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEI 305
            D L +YS ++      P+   A+ E   RNG   SL    S EG+      H++ L+++
Sbjct: 303 MDYLESYSMSINGGNVVPSKRLALDELHERNGLLLSL---VSDEGQGR----HSSLLRQV 355


>gi|422295724|gb|EKU23023.1| hypothetical protein NGA_0718300 [Nannochloropsis gaditana CCMP526]
          Length = 124

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 2/61 (3%)

Query: 88  PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGL--KDANQVLAYFAVSKL 145
           PI VC    ++E+V+   P+ + NDLVF Q+  +E  L+  G   K   Q L  F++S  
Sbjct: 30  PIDVCVPEWEVESVILGCPKEKPNDLVFMQSACLETLLKRYGWANKKTTQCLPDFSISPF 89

Query: 146 G 146
           G
Sbjct: 90  G 90


>gi|408532212|emb|CCK30386.1| 2-dehydropantoate 2-reductase [Streptomyces davawensis JCM 4913]
          Length = 298

 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 86/198 (43%), Gaps = 19/198 (9%)

Query: 88  PIFVC---TRNDDLEAVLEAAPRSRWND--LVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
           P+ VC    ++  L++ LE  P     D  LV F NG+  P +     +D     A   V
Sbjct: 69  PVDVCLVGVKHTSLDSALERIPAEALGDALLVPFLNGVEHPAVLRAHHRDDRVAPAVIRV 128

Query: 143 --SKLGERPIDGKTDTNPEGLTAAYGKWASV--VAERLSVGGLSCKVLDKEAFQKQMLEK 198
             +++G   I+  +      LTA       V  + E L   G   +VLD E+    +  K
Sbjct: 129 ESTRVGPGVIEHGSPFAEIDLTADSVPRERVEGLGEVLGAAGPVTRVLDGES--GVLWAK 186

Query: 199 LIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAY 258
           + +++ F L+  R+ G  +G V  +YR E+ AL+ E+A  + A  G   DP       A 
Sbjct: 187 MSFLAPFALLTTRY-GLPLGEVRTQYRDELVALVEEVAAVSQACGG-PGDPGQ-----AL 239

Query: 259 SRAVANFPTAVKEFKWRN 276
            R  A FP A K    R+
Sbjct: 240 GRYDA-FPAATKSSMQRD 256


>gi|422010884|ref|ZP_16357793.1| peptidase, S8/S53 family [Actinomyces georgiae F0490]
 gi|394767135|gb|EJF47951.1| peptidase, S8/S53 family [Actinomyces georgiae F0490]
          Length = 766

 Score = 37.4 bits (85), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 238 AAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRN---GWFYSLSE 284
           A    KG   DPA +D +   ++AV  F TAVK  +W N   GWFY  S+
Sbjct: 554 ATVTAKGANNDPAADDDI---TQAVGFFATAVKTGRWMNTGKGWFYQYSD 600


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.132    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,861,057,079
Number of Sequences: 23463169
Number of extensions: 196665070
Number of successful extensions: 504682
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 74
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 504517
Number of HSP's gapped (non-prelim): 91
length of query: 308
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 166
effective length of database: 9,027,425,369
effective search space: 1498552611254
effective search space used: 1498552611254
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)