BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 020850
         (320 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255540995|ref|XP_002511562.1| conserved hypothetical protein [Ricinus communis]
 gi|223550677|gb|EEF52164.1| conserved hypothetical protein [Ricinus communis]
          Length = 314

 Score =  473 bits (1218), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 237/307 (77%), Positives = 263/307 (85%), Gaps = 6/307 (1%)

Query: 13  HWWSHTKTVTLIWFISFVLFYSFFHMALQNS-SQNYPSSGGITSHSHIPNGERRARLYDK 71
           +WWSH+KTVTLIW    ++  S   +AL+NS SQNYPSS   +      N E+R++LYDK
Sbjct: 9   NWWSHSKTVTLIWSFCVIISISILQLALKNSPSQNYPSSSSDSLS----NMEQRSKLYDK 64

Query: 72  MARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVP 131
           MARDLDD+G +FLK G+TSQSL LSDIFTLKDGSVTPV KAANPPVRANVLYLS +YSV 
Sbjct: 65  MARDLDDNGPSFLKHGQTSQSLSLSDIFTLKDGSVTPVLKAANPPVRANVLYLSSEYSVR 124

Query: 132 ISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDL 191
           IS+AVKRIFSP+FDK IWFQNSS+YHFSMFHASHHI PVPATEDE+EAEA+AV +VAE L
Sbjct: 125 ISEAVKRIFSPYFDKAIWFQNSSVYHFSMFHASHHIGPVPATEDEVEAEASAVGSVAEGL 184

Query: 192 CPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFA 251
           CPL+IVLDRV+LTSTGVLLGCWQVISG DP+TIRAKLRTALP+APEKQLYD AILHTSFA
Sbjct: 185 CPLEIVLDRVVLTSTGVLLGCWQVISGPDPITIRAKLRTALPHAPEKQLYDAAILHTSFA 244

Query: 252 RLLGHPRASPTEP-HQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKV 310
           RLLGHP+ SP EP  +LR  HELV  LNNQIRG  AVVSELWYVEEYDVLALALDGRMKV
Sbjct: 245 RLLGHPKDSPAEPSEELRLFHELVARLNNQIRGFVAVVSELWYVEEYDVLALALDGRMKV 304

Query: 311 RKFPLEC 317
           RKF L C
Sbjct: 305 RKFQLGC 311


>gi|225456739|ref|XP_002275470.1| PREDICTED: uncharacterized protein LOC100250065 isoform 1 [Vitis
           vinifera]
 gi|297733993|emb|CBI15240.3| unnamed protein product [Vitis vinifera]
          Length = 324

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 230/310 (74%), Positives = 255/310 (82%), Gaps = 9/310 (2%)

Query: 13  HWWSHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKM 72
           +WWSH+KTV LIW  S +LFY      L+NSS NYPS+    S S     ERR++LYDKM
Sbjct: 6   NWWSHSKTVVLIWLTSALLFYFGLQFVLRNSSLNYPSAVSTMSDS-----ERRSKLYDKM 60

Query: 73  ARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPI 132
             DL++HGAAFL+ GETSQSL LSD+FTLKDG VTPV KAA PPVRANVLYLS ++SVPI
Sbjct: 61  ESDLNEHGAAFLEHGETSQSLSLSDLFTLKDGIVTPVLKAAKPPVRANVLYLSTEFSVPI 120

Query: 133 SDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLC 192
           S+AV+ IFSP+FDKVIWFQNSSLYHFSMFHASHHI+PVPATE EIE EATAV AVA+ LC
Sbjct: 121 SEAVRSIFSPYFDKVIWFQNSSLYHFSMFHASHHITPVPATEVEIEGEATAVEAVAKALC 180

Query: 193 PLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFAR 252
           PLKIVLDRV+LT TGVLLGCWQV SGTDP+TIRAKLRTALP AP+ QLYD AILHTSFAR
Sbjct: 181 PLKIVLDRVLLTPTGVLLGCWQVASGTDPVTIRAKLRTALPRAPKNQLYDNAILHTSFAR 240

Query: 253 LLGHPRASPTEPH----QLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRM 308
           LLG P+ SPTE H    QL+F HELV  LNN IRG EA VSELWYVEEYDVLALAL+GRM
Sbjct: 241 LLGPPKFSPTELHKASDQLQFFHELVTRLNNIIRGFEADVSELWYVEEYDVLALALNGRM 300

Query: 309 KVRKFPLECS 318
           KVR+F L  S
Sbjct: 301 KVRRFKLAAS 310


>gi|356516786|ref|XP_003527074.1| PREDICTED: uncharacterized protein LOC100791859 [Glycine max]
          Length = 315

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 227/310 (73%), Positives = 257/310 (82%), Gaps = 12/310 (3%)

Query: 16  SHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARD 75
           SH+KTV LIW IS   F++ F MAL NSSQ  PSS  + S +   N  +R++LYDKM RD
Sbjct: 9   SHSKTVVLIWLISAFCFFTLFQMALYNSSQT-PSSTSLDSPNS--NTGQRSKLYDKMGRD 65

Query: 76  LDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDA 135
           LD+HGAAFLK GETSQSL LSDIFTLKDGSVTPV K ANPPVRANVLYLS ++SVPI++A
Sbjct: 66  LDEHGAAFLKHGETSQSLSLSDIFTLKDGSVTPVLKPANPPVRANVLYLSTEFSVPIAEA 125

Query: 136 VKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLK 195
           VK +F+P+FDK IWFQNSSLYHFSMFHASHHI PVPAT++EIEAEA++V AV   LCPLK
Sbjct: 126 VKNVFNPYFDKAIWFQNSSLYHFSMFHASHHIVPVPATKEEIEAEASSVEAVVATLCPLK 185

Query: 196 IVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLG 255
           IVLDRV+LTSTGVLLGCWQV SGTDP+TIRAKL+ ALP+APEKQLYD AILHTSFARLLG
Sbjct: 186 IVLDRVVLTSTGVLLGCWQVTSGTDPITIRAKLKNALPHAPEKQLYDAAILHTSFARLLG 245

Query: 256 HPRA-SPTEPHQLR------FLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRM 308
            P+A SP+E  QLR      F H+LVD LN+QI G +AVVSELWYVEEYDVLALAL+GRM
Sbjct: 246 PPKALSPSE--QLRMSDDVQFFHDLVDRLNSQIHGFKAVVSELWYVEEYDVLALALNGRM 303

Query: 309 KVRKFPLECS 318
             RKF L CS
Sbjct: 304 NTRKFRLRCS 313


>gi|255645528|gb|ACU23259.1| unknown [Glycine max]
          Length = 315

 Score =  442 bits (1137), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 225/310 (72%), Positives = 256/310 (82%), Gaps = 12/310 (3%)

Query: 16  SHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARD 75
           SH+KTV LIW IS   F++ F MAL NSSQ  PSS  + S +   N  +R++LYDKM RD
Sbjct: 9   SHSKTVVLIWLISAFCFFTLFQMALYNSSQT-PSSTSLDSPNS--NTGQRSKLYDKMGRD 65

Query: 76  LDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDA 135
           LD+HGAAFLK GETSQSL LSDIFTLKDGSVTPV K ANPPVRANVLYLS ++SVPI++A
Sbjct: 66  LDEHGAAFLKHGETSQSLSLSDIFTLKDGSVTPVLKPANPPVRANVLYLSTEFSVPIAEA 125

Query: 136 VKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLK 195
           VK +F+P+FDK IWFQNSSLYHFSMFHASHHI PVPAT++EIEAEA++V AV   LCPLK
Sbjct: 126 VKNVFNPYFDKAIWFQNSSLYHFSMFHASHHIVPVPATKEEIEAEASSVEAVVATLCPLK 185

Query: 196 IVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLG 255
           IVLDRV+LTSTGVLLGCWQV SGTDP+TIRAKL+ ALP+APEKQLYD AILHTSFARLLG
Sbjct: 186 IVLDRVVLTSTGVLLGCWQVTSGTDPITIRAKLKNALPHAPEKQLYDAAILHTSFARLLG 245

Query: 256 HPRA-SPTEPHQLR------FLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRM 308
            P+A SP+E  QLR      F H+LVD LN+QI G +AVVSELWYVEE+ VLALAL+GRM
Sbjct: 246 PPKALSPSE--QLRMSDDVQFFHDLVDRLNSQIHGFKAVVSELWYVEEHGVLALALNGRM 303

Query: 309 KVRKFPLECS 318
             RKF L CS
Sbjct: 304 NTRKFRLRCS 313


>gi|449477262|ref|XP_004154975.1| PREDICTED: uncharacterized protein LOC101228237 [Cucumis sativus]
          Length = 308

 Score =  439 bits (1129), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 222/307 (72%), Positives = 250/307 (81%), Gaps = 11/307 (3%)

Query: 16  SHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARD 75
           +H+ TV LIW  S VLF+  F MAL NS+ +  SS   +        E R++LYDKM RD
Sbjct: 9   AHSTTVALIWLTSAVLFFFLFQMALHNSTSSSDSSVSNS--------ELRSKLYDKMERD 60

Query: 76  LDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDA 135
           LD+ GA FLK GETSQSL LSDIFT+KDG+VTPV KAANPPVRANVLYLS +YSVPI +A
Sbjct: 61  LDEKGAVFLKGGETSQSLSLSDIFTIKDGTVTPVLKAANPPVRANVLYLSTEYSVPIFEA 120

Query: 136 VKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLK 195
           VK IF P+FD+ IWFQNSSLYHFSMFHASHHI+P+PA+ DEIEAEA+AV++  E +C LK
Sbjct: 121 VKSIFDPYFDEAIWFQNSSLYHFSMFHASHHITPIPASNDEIEAEASAVKSATEHMCHLK 180

Query: 196 IVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLG 255
           IVLDRVILTSTGVLLGCWQVISGTDP+TIRAKLRTALP+APEKQLYD AILHTSFARLLG
Sbjct: 181 IVLDRVILTSTGVLLGCWQVISGTDPVTIRAKLRTALPHAPEKQLYDAAILHTSFARLLG 240

Query: 256 HPRASPT---EPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRK 312
           HP+ S T      +L+  HELV  LN QIRG EAVVSELWYVEEYDVLALAL+GRMKVRK
Sbjct: 241 HPKISQTLDRSSDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKVRK 300

Query: 313 FPLECSS 319
           FPL C S
Sbjct: 301 FPLGCDS 307


>gi|449440782|ref|XP_004138163.1| PREDICTED: uncharacterized protein LOC101220816 [Cucumis sativus]
          Length = 308

 Score =  437 bits (1124), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 221/307 (71%), Positives = 249/307 (81%), Gaps = 11/307 (3%)

Query: 16  SHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARD 75
           +H+ TV LIW  S VLF+  F MAL NS+ +  SS   +        E R++LYDKM RD
Sbjct: 9   AHSTTVALIWLTSAVLFFFLFQMALHNSTSSSDSSVSNS--------ELRSKLYDKMERD 60

Query: 76  LDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDA 135
           LD+ GA FLK GETSQSL LSDIFT+KDG+VTPV KAANPPVRANVLYLS +YSVPI +A
Sbjct: 61  LDEKGAVFLKGGETSQSLSLSDIFTIKDGTVTPVLKAANPPVRANVLYLSTEYSVPIFEA 120

Query: 136 VKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLK 195
           VK IF P+FD+ IWFQNSSLYHFSMFHASHHI+P+PA+ DEIEAEA+AV++  E +C LK
Sbjct: 121 VKSIFDPYFDEAIWFQNSSLYHFSMFHASHHITPIPASNDEIEAEASAVKSATEHMCHLK 180

Query: 196 IVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLG 255
           IVLDRVILTSTGVLLGCWQVISGTDP+TIRAKLRTALP+APEKQLYD AILHTSFARLLG
Sbjct: 181 IVLDRVILTSTGVLLGCWQVISGTDPVTIRAKLRTALPHAPEKQLYDAAILHTSFARLLG 240

Query: 256 HPRASPTE---PHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRK 312
           HP+ S T     +  +  HELV  LN QIRG EAVVSELWYVEEYDVLALAL+GRMKVRK
Sbjct: 241 HPKISQTVLSLTNVFQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKVRK 300

Query: 313 FPLECSS 319
           FPL C S
Sbjct: 301 FPLGCDS 307


>gi|224141705|ref|XP_002324205.1| predicted protein [Populus trichocarpa]
 gi|222865639|gb|EEF02770.1| predicted protein [Populus trichocarpa]
          Length = 249

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 204/248 (82%), Positives = 221/248 (89%), Gaps = 2/248 (0%)

Query: 72  MARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVP 131
           M RDLD+ GAAFLK GETSQSLLLSDIFTLKDGSVTPV KAANPPVRANVLYLSP+YSVP
Sbjct: 1   MERDLDERGAAFLKHGETSQSLLLSDIFTLKDGSVTPVLKAANPPVRANVLYLSPEYSVP 60

Query: 132 ISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDL 191
           ISD VK  FS +FDKV WFQNSS+YH SMFHASHHI PVP TEDEIEAE  AV+AVA+ L
Sbjct: 61  ISDNVKNTFSSYFDKV-WFQNSSVYHSSMFHASHHIEPVPVTEDEIEAEVNAVKAVADFL 119

Query: 192 CPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFA 251
           CPLKIVLDRV+LTSTGVLLGCWQVISGTDP+TIRAKL+TALP+AP+KQLYD AILHTSFA
Sbjct: 120 CPLKIVLDRVVLTSTGVLLGCWQVISGTDPLTIRAKLKTALPHAPKKQLYDDAILHTSFA 179

Query: 252 RLLGHPRASPTEP-HQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKV 310
           RLLGHP++ P EP  +LRF HELV  LN++IRG EAVVSELWYVEEYDVLALALDGRMKV
Sbjct: 180 RLLGHPKSPPMEPLDELRFFHELVARLNDKIRGFEAVVSELWYVEEYDVLALALDGRMKV 239

Query: 311 RKFPLECS 318
            +F L CS
Sbjct: 240 SRFKLGCS 247


>gi|356507143|ref|XP_003522330.1| PREDICTED: uncharacterized protein LOC100789685, partial [Glycine
           max]
          Length = 309

 Score =  410 bits (1054), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 216/309 (69%), Positives = 246/309 (79%), Gaps = 15/309 (4%)

Query: 21  VTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARDLDDHG 80
           V LIWFIS   F++ F MAL NSSQ  PSS  + S +   N  +R++LYDKM RDLD+HG
Sbjct: 3   VVLIWFISAFCFFTLFKMALYNSSQT-PSSTSLDSPN--SNTGQRSKLYDKMGRDLDEHG 59

Query: 81  AAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDAVKRIF 140
           AAFLK GETSQSL LSDIFTLKDGSVTP  K ANPPVRANVLYLS ++SVPI++AVK +F
Sbjct: 60  AAFLKHGETSQSLSLSDIFTLKDGSVTPALKPANPPVRANVLYLSTEFSVPIAEAVKNVF 119

Query: 141 SPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLKIVLDR 200
           +P+FDK IWFQNSSLYHFSMFHASHHI PVPAT++EIEAEA++V  VA  LCPLKIVLDR
Sbjct: 120 NPYFDKAIWFQNSSLYHFSMFHASHHIVPVPATKEEIEAEASSVEVVAATLCPLKIVLDR 179

Query: 201 VILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPRA- 259
           V LTSTGVLLGCWQVI+GTDP+TIRA L+ ALP+A EKQLYD AILH SFARLLG P+A 
Sbjct: 180 V-LTSTGVLLGCWQVITGTDPITIRANLKNALPHASEKQLYDAAILHLSFARLLGLPKAL 238

Query: 260 SPT---EPHQLR------FLHELVDSLNNQIRGSE-AVVSELWYVEEYDVLALALDGRMK 309
           SP+      QLR      F H+LV  LN+QI G + +VVSEL YVEEYDVLALAL+GRM 
Sbjct: 239 SPSIICSTEQLRMSDGVHFFHDLVYRLNSQIHGFKVSVVSELLYVEEYDVLALALNGRMN 298

Query: 310 VRKFPLECS 318
            RKF L CS
Sbjct: 299 TRKFRLGCS 307


>gi|359491718|ref|XP_003634311.1| PREDICTED: uncharacterized protein LOC100250065 isoform 2 [Vitis
           vinifera]
          Length = 310

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 211/310 (68%), Positives = 238/310 (76%), Gaps = 27/310 (8%)

Query: 13  HWWSHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKM 72
           +WWSH+KTV LIW  S +LFY      L+NSS NYPS       S + + ERR++LYDKM
Sbjct: 6   NWWSHSKTVVLIWLTSALLFYFGLQFVLRNSSLNYPSV------STMSDSERRSKLYDKM 59

Query: 73  ARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPI 132
             DL++HGAAFL+ GETSQSL LSD+FTLKDG VTPV KAA PPVRANVLYLS ++SVPI
Sbjct: 60  ESDLNEHGAAFLEHGETSQSLSLSDLFTLKDGIVTPVLKAAKPPVRANVLYLSTEFSVPI 119

Query: 133 SDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLC 192
           S+AV+ IFSP+FDKVIWFQNSSLYHFSMFHASHHI+PVPATE EIE EATAV AVA+ LC
Sbjct: 120 SEAVRSIFSPYFDKVIWFQNSSLYHFSMFHASHHITPVPATEVEIEGEATAVEAVAKALC 179

Query: 193 PLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFAR 252
           PLKIVLDRV+LT TGVLLGCWQV SGTDP+TIRAKLRTALP AP+ QLYD AILHTSFAR
Sbjct: 180 PLKIVLDRVLLTPTGVLLGCWQVASGTDPVTIRAKLRTALPRAPKNQLYDNAILHTSFAR 239

Query: 253 LLGHPRASPTEPH----QLRFLHELVDSLNNQIRGSE--------------AVVSELWYV 294
           LLG P+ SPTE H    QL+F HELV  LNN IRG E              +VV EL   
Sbjct: 240 LLGPPKFSPTELHKASDQLQFFHELVTRLNNIIRGFEVFAFCFIVENRESCSVVMELM-- 297

Query: 295 EEYDVLALAL 304
             YDVL++ L
Sbjct: 298 -TYDVLSILL 306


>gi|297842203|ref|XP_002888983.1| hypothetical protein ARALYDRAFT_476589 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334824|gb|EFH65242.1| hypothetical protein ARALYDRAFT_476589 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 198/318 (62%), Positives = 241/318 (75%), Gaps = 12/318 (3%)

Query: 1   METASARISTRCHWWSHTKTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIP 60
           M TA+       +W    +TV L W IS  +FYS F M L+NS  +  SS    S++   
Sbjct: 1   MATATTSAIRAPNW----RTVVLFWTISLTIFYSLFQMGLRNSPSSSSSSDSFISYA--- 53

Query: 61  NGERRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRAN 120
             E+  RLYDKM RD+ ++G  F KQGETSQSL LSD+FTLKDG ++PV K ANPPVRAN
Sbjct: 54  --EQSTRLYDKMERDIQENGPLFFKQGETSQSLSLSDLFTLKDGKISPVLKVANPPVRAN 111

Query: 121 VLYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAE 180
           VL+LS +YSVP+S AV+ +FSP+F+  IWFQ+S +YHFSMFHAS+HI  VPATEDE+E E
Sbjct: 112 VLHLSTEYSVPVSKAVENVFSPYFENTIWFQDSKMYHFSMFHASNHIFSVPATEDEVEGE 171

Query: 181 ATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQL 240
           A AV+AVA  LCPL+I+LDRV+LTSTGVLLGCW+V SG DP+TIR+KLR+ LP APEKQL
Sbjct: 172 AAAVKAVANKLCPLEIILDRVLLTSTGVLLGCWKVYSGDDPITIRSKLRSVLPRAPEKQL 231

Query: 241 YDPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVL 300
           YD AILHTS ARLLG P +S     QL+ +H+L+  LNNQIRG  A+VSELWYVEE+D+L
Sbjct: 232 YDAAILHTSLARLLGPPASS---DDQLQRIHDLITRLNNQIRGFRAIVSELWYVEEFDLL 288

Query: 301 ALALDGRMKVRKFPLECS 318
           ALAL GRMKVR FPL C+
Sbjct: 289 ALALGGRMKVRSFPLGCA 306


>gi|242078325|ref|XP_002443931.1| hypothetical protein SORBIDRAFT_07g004570 [Sorghum bicolor]
 gi|241940281|gb|EES13426.1| hypothetical protein SORBIDRAFT_07g004570 [Sorghum bicolor]
          Length = 333

 Score =  361 bits (927), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 175/260 (67%), Positives = 209/260 (80%), Gaps = 5/260 (1%)

Query: 64  RRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIF-TLKDGSVTPVHKAANPPVRANVL 122
           +RA LY KMARDLD+ GAAFLK GETSQSL LSD+F T  DG+V P  KAA+PPVRANVL
Sbjct: 70  QRAELYSKMARDLDERGAAFLKGGETSQSLTLSDLFDTAADGAVVPRLKAADPPVRANVL 129

Query: 123 YLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEAT 182
           Y+ PK++  IS AVK +F P+FD+ IWFQN+S+YHFSMFHASHH+ P+ ATEDEIEAE  
Sbjct: 130 YMDPKFAAVISKAVKDVFLPYFDQAIWFQNTSMYHFSMFHASHHLEPIVATEDEIEAEVE 189

Query: 183 AVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYD 242
           AV+  AE+ CPLKIVLDRV+LTSTGVLLG WQV SGTDP  IR++LR ALP AP+KQLYD
Sbjct: 190 AVKRAAENFCPLKIVLDRVVLTSTGVLLGLWQVESGTDPAEIRSRLREALPRAPQKQLYD 249

Query: 243 PAILHTSFARLLGHPR----ASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYD 298
           P +LHTSFAR+LGHP+     S +    ++F HELV  +N +IRG +A VSELWYVEEYD
Sbjct: 250 PVLLHTSFARILGHPKLPEEQSASSFDHVKFFHELVVQVNEKIRGFQATVSELWYVEEYD 309

Query: 299 VLALALDGRMKVRKFPLECS 318
           VLALAL+G+MKVR+  L C+
Sbjct: 310 VLALALNGKMKVRRLHLGCN 329


>gi|79321279|ref|NP_001031281.1| uncharacterized protein [Arabidopsis thaliana]
 gi|110738043|dbj|BAF00956.1| hypothetical protein [Arabidopsis thaliana]
 gi|332197485|gb|AEE35606.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 314

 Score =  360 bits (924), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 193/303 (63%), Positives = 230/303 (75%), Gaps = 8/303 (2%)

Query: 19  KTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARDLDD 78
           +T  L W IS  +FYS F M ++NS      S            E+  RLY+KM +DL +
Sbjct: 15  RTAVLFWTISLTIFYSLFQMGIRNSPSTSSPSSDSFVSY----AEQSTRLYNKMEQDLQE 70

Query: 79  HGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDAVKR 138
           +G  FLKQGETSQSL LSD+FTLKDG + PV K ANPPVRANVL+LS +YSVP+ + VK 
Sbjct: 71  NGPVFLKQGETSQSLSLSDLFTLKDGKIAPVLKVANPPVRANVLHLSTEYSVPVLEVVKN 130

Query: 139 IFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLKIVL 198
           +FSP+F+  IWFQ+S +YHFSMFHAS+HI  VPATE E+EAEA AV+AVA++LCPL+I+L
Sbjct: 131 VFSPYFENTIWFQDSKMYHFSMFHASNHIFSVPATEVEVEAEAAAVKAVAKELCPLEIIL 190

Query: 199 DRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPR 258
           DRV+LTSTGVLLGCW+V SG DP+TIR KLR+ LP APEKQLYD AILHTS ARLLG P 
Sbjct: 191 DRVLLTSTGVLLGCWKVNSGDDPITIRLKLRSVLPRAPEKQLYDAAILHTSLARLLGPP- 249

Query: 259 ASPTEP---HQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRKFPL 315
            SPTE      L+ LHELV  LNNQIRG +A+VSELWYVEE+D+LALAL GRM VR FPL
Sbjct: 250 ISPTEAGSDDHLQLLHELVTRLNNQIRGFKAIVSELWYVEEFDLLALALGGRMNVRSFPL 309

Query: 316 ECS 318
            C+
Sbjct: 310 GCA 312


>gi|212275338|ref|NP_001130121.1| uncharacterized protein LOC100191215 [Zea mays]
 gi|194688344|gb|ACF78256.1| unknown [Zea mays]
 gi|195641352|gb|ACG40144.1| hypothetical protein [Zea mays]
 gi|413917245|gb|AFW57177.1| hypothetical protein ZEAMMB73_835412 [Zea mays]
          Length = 320

 Score =  360 bits (924), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 174/259 (67%), Positives = 208/259 (80%), Gaps = 5/259 (1%)

Query: 65  RARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIF-TLKDGSVTPVHKAANPPVRANVLY 123
           RA LY KMARDLD+ GAAFLK GETSQSL LSD+F T  DG+V P  KAA+PPVRANVLY
Sbjct: 58  RAELYSKMARDLDERGAAFLKGGETSQSLTLSDLFDTADDGAVVPRLKAADPPVRANVLY 117

Query: 124 LSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATA 183
           + PK++  IS AVK +F P+FD+ IWFQN+S+YHFSMFHASHH+ P+ ATEDEIEAE  A
Sbjct: 118 MDPKFAAVISKAVKDVFLPYFDQAIWFQNTSMYHFSMFHASHHLEPIVATEDEIEAEVEA 177

Query: 184 VRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDP 243
           V+ V ++LCPLKIVLDRV+LTSTGVLLG WQV SGTDP  IR++LR ALP AP+KQLYDP
Sbjct: 178 VKGVTKNLCPLKIVLDRVVLTSTGVLLGLWQVESGTDPAEIRSRLREALPRAPQKQLYDP 237

Query: 244 AILHTSFARLLGHPR----ASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDV 299
            +LHTSFAR+LGHPR     S +    ++F HELV  +N +I G +A +SELWYVEEYDV
Sbjct: 238 VLLHTSFARILGHPRLPKEQSASSFDNVKFFHELVAQINEKICGFQATISELWYVEEYDV 297

Query: 300 LALALDGRMKVRKFPLECS 318
           LALAL+G+MKVR+  L C+
Sbjct: 298 LALALNGKMKVRRLHLGCN 316


>gi|115474997|ref|NP_001061095.1| Os08g0171000 [Oryza sativa Japonica Group]
 gi|40253774|dbj|BAD05713.1| unknown protein [Oryza sativa Japonica Group]
 gi|40253860|dbj|BAD05795.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623064|dbj|BAF23009.1| Os08g0171000 [Oryza sativa Japonica Group]
 gi|215700986|dbj|BAG92410.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 323

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 168/267 (62%), Positives = 209/267 (78%), Gaps = 9/267 (3%)

Query: 59  IPNGE---RRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANP 115
           IP G    RRA LY +MARDLD+ GAAFL  GETSQSL LS++F  +DG+V P+ KAANP
Sbjct: 49  IPGGSVSSRRAELYGRMARDLDERGAAFLDGGETSQSLTLSELFDTRDGAVVPMLKAANP 108

Query: 116 PVRANVLYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATED 175
           PVRANVLYL P+++  IS AVK +F P+F++ IWFQN S+YHFSMFH SHH+ P+ ATED
Sbjct: 109 PVRANVLYLDPEFASVISKAVKEVFLPYFNQAIWFQNMSIYHFSMFHTSHHLEPIVATED 168

Query: 176 EIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNA 235
           EIEAE  AV+ V +D+CPLKI+LD+V+LTSTGVLLG WQV SGTDP  IR++LR ALP A
Sbjct: 169 EIEAEVDAVKRVTDDVCPLKIILDQVVLTSTGVLLGLWQVESGTDPADIRSRLREALPRA 228

Query: 236 PEKQLYDPAILHTSFARLLGHPRASPTEPH-----QLRFLHELVDSLNNQIRGSEAVVSE 290
           P+KQLYDP +LHTS AR+LGHP+  P E +      ++F H+LV  +N++IRG +A V E
Sbjct: 229 PQKQLYDPVMLHTSLARILGHPKL-PQEGNAQSLDHVKFFHDLVAQVNSKIRGFQATVKE 287

Query: 291 LWYVEEYDVLALALDGRMKVRKFPLEC 317
           LW+VEEYDVLALAL+G+MKVR+    C
Sbjct: 288 LWFVEEYDVLALALNGKMKVRRLQFGC 314


>gi|222641666|gb|EEE69798.1| hypothetical protein OsJ_29524 [Oryza sativa Japonica Group]
          Length = 323

 Score =  352 bits (903), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 168/267 (62%), Positives = 209/267 (78%), Gaps = 9/267 (3%)

Query: 59  IPNG---ERRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANP 115
           IP G    RRA LY +MARDLD+ GAAFL  GETSQSL LS++F  +DG+V P+ KAANP
Sbjct: 49  IPGGPVSSRRAELYGRMARDLDERGAAFLDGGETSQSLTLSELFDTRDGAVVPMLKAANP 108

Query: 116 PVRANVLYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATED 175
           PVRANVLYL P+++  IS AVK +F P+F++ IWFQN S+YHFSMFH SHH+ P+ ATED
Sbjct: 109 PVRANVLYLDPEFASVISKAVKEVFLPYFNQAIWFQNMSIYHFSMFHTSHHLEPIVATED 168

Query: 176 EIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNA 235
           EIEAE  AV+ V +D+CPLKI+LD+V+LTSTGVLLG WQV SGTDP  IR++LR ALP A
Sbjct: 169 EIEAEVDAVKRVTDDVCPLKIILDQVVLTSTGVLLGLWQVESGTDPADIRSRLREALPRA 228

Query: 236 PEKQLYDPAILHTSFARLLGHPRASPTEPH-----QLRFLHELVDSLNNQIRGSEAVVSE 290
           P+KQLYDP +LHTS AR+LGHP+  P E +      ++F H+LV  +N++IRG +A V E
Sbjct: 229 PQKQLYDPVMLHTSLARILGHPKL-PQEGNAQSLDHVKFFHDLVAQVNSKIRGFQATVKE 287

Query: 291 LWYVEEYDVLALALDGRMKVRKFPLEC 317
           LW+VEEYDVLALAL+G+MKVR+    C
Sbjct: 288 LWFVEEYDVLALALNGKMKVRRLQFGC 314


>gi|116794450|gb|ABK27147.1| unknown [Picea sitchensis]
          Length = 329

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 169/273 (61%), Positives = 207/273 (75%), Gaps = 17/273 (6%)

Query: 62  GERRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANV 121
            ++R  LY+KM+RDL +HGAAFL  GETSQSL LSD+FTL +G VTPV K A+PPVRANV
Sbjct: 54  SKQRTELYEKMSRDLHEHGAAFLAGGETSQSLSLSDLFTLNNGLVTPVLKVADPPVRANV 113

Query: 122 LYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDE----- 176
           L+LSPK+S PIS  VKR+F PHFDK IWFQ+ +LYHFSMFHASHH+ PV AT +E     
Sbjct: 114 LHLSPKFSHPISQIVKRVFLPHFDKAIWFQDENLYHFSMFHASHHLEPVSATAEECCCYN 173

Query: 177 -IEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNA 235
            ++ EA+AV  VA   CPLKI L+RV+LTSTGVLLGCWQ I+GT+P  IR+KL+TALP A
Sbjct: 174 QVDLEASAVEEVARTSCPLKIRLERVVLTSTGVLLGCWQFINGTEPAIIRSKLKTALPRA 233

Query: 236 PEKQLYDPAILHTSFARLLGHPR-----------ASPTEPHQLRFLHELVDSLNNQIRGS 284
           P+ QLY+P +LHTSFAR+LG PR           A  T    LRF HELV  +N +++G 
Sbjct: 234 PKNQLYNPVMLHTSFARVLGCPRVMDGCQSINTKAMHTSEDMLRFFHELVSQMNKELQGF 293

Query: 285 EAVVSELWYVEEYDVLALALDGRMKVRKFPLEC 317
           EA VSE+W+VEE+DVLALAL+GR+K RKF  +C
Sbjct: 294 EATVSEIWFVEEFDVLALALNGRIKSRKFHFKC 326


>gi|357139735|ref|XP_003571433.1| PREDICTED: uncharacterized protein LOC100839923 [Brachypodium
           distachyon]
          Length = 319

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 168/251 (66%), Positives = 203/251 (80%), Gaps = 4/251 (1%)

Query: 64  RRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLY 123
           RRA LY +MA DLD+ GAAFLK GETSQSL LSD+F ++DG+V P  KAA+PPVRANVLY
Sbjct: 50  RRAELYGRMAGDLDERGAAFLKGGETSQSLTLSDLFDVRDGAVVPKLKAADPPVRANVLY 109

Query: 124 LSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATA 183
           L P+++  IS AVK +F PHFD+VIWFQNSS+YHFSMFHASHH+ P+ ATEDEIEAEA A
Sbjct: 110 LDPEFAAVISKAVKEVFLPHFDQVIWFQNSSMYHFSMFHASHHLEPILATEDEIEAEADA 169

Query: 184 VRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDP 243
           V+ V E +CPLKIVLDRV+LTSTGVLLG WQV SGTDP  IR+KLR ALP AP+KQLYDP
Sbjct: 170 VKRVTEAVCPLKIVLDRVVLTSTGVLLGLWQVESGTDPADIRSKLREALPRAPQKQLYDP 229

Query: 244 AILHTSFARLLGHPRASPTEP----HQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDV 299
            +LHTSFAR+LG P+    E       ++F H+LV  +N +I G +A +SELWYVEE+DV
Sbjct: 230 VLLHTSFARVLGPPKLPQEENTSSLDHVKFFHDLVAQVNKKIHGFQAKISELWYVEEHDV 289

Query: 300 LALALDGRMKV 310
           LALAL+G+M++
Sbjct: 290 LALALNGKMRL 300


>gi|12324803|gb|AAG52365.1|AC011765_17 unknown protein; 68364-66043 [Arabidopsis thaliana]
          Length = 318

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 192/307 (62%), Positives = 232/307 (75%), Gaps = 12/307 (3%)

Query: 19  KTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARDLDD 78
           +T  L W IS  +FYS F M ++NS     S+   +S S +   E+  RLY+KM +DL +
Sbjct: 15  RTAVLFWTISLTIFYSLFQMGIRNSP----STSSPSSDSFVSYAEQSTRLYNKMEQDLQE 70

Query: 79  HGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDAVKR 138
           +G  FLKQGETSQSL LSD+FTLKDG + PV K ANPPVRANVL+LS +YSVP+ + VK 
Sbjct: 71  NGPVFLKQGETSQSLSLSDLFTLKDGKIAPVLKVANPPVRANVLHLSTEYSVPVLEVVKN 130

Query: 139 IFSPHFDK----VIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPL 194
           +FSP+F+      I   +S +YHFSMFHAS+HI  VPATE E+EAEA AV+AVA++LCPL
Sbjct: 131 VFSPYFENSKPAAITIYDSKMYHFSMFHASNHIFSVPATEVEVEAEAAAVKAVAKELCPL 190

Query: 195 KIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLL 254
           +I+LDRV+LTSTGVLLGCW+V SG DP+TIR KLR+ LP APEKQLYD AILHTS ARLL
Sbjct: 191 EIILDRVLLTSTGVLLGCWKVNSGDDPITIRLKLRSVLPRAPEKQLYDAAILHTSLARLL 250

Query: 255 GHPRASPTEP---HQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVR 311
           G P  SPTE      L+ LHELV  LNNQIRG +A+VSELWYVEE+D+LALAL GRM VR
Sbjct: 251 GPP-ISPTEAGSDDHLQLLHELVTRLNNQIRGFKAIVSELWYVEEFDLLALALGGRMNVR 309

Query: 312 KFPLECS 318
            FPL C+
Sbjct: 310 SFPLGCA 316


>gi|218200548|gb|EEC82975.1| hypothetical protein OsI_27982 [Oryza sativa Indica Group]
          Length = 354

 Score =  343 bits (881), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 165/259 (63%), Positives = 204/259 (78%), Gaps = 9/259 (3%)

Query: 59  IPNGE---RRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANP 115
           IP G    RRA LY +MARDLD+ GAAFL  GETSQSL LS++F  +DG+V P  KAANP
Sbjct: 49  IPGGSVSSRRAELYGRMARDLDERGAAFLDGGETSQSLTLSELFDTRDGAVVPRLKAANP 108

Query: 116 PVRANVLYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATED 175
           PVRANVLYL P+++  IS AVK +F P+F++ IWFQN S+YHFSMFH SHH+ P+ ATED
Sbjct: 109 PVRANVLYLDPEFASVISKAVKEVFLPYFNQAIWFQNMSIYHFSMFHTSHHLEPIVATED 168

Query: 176 EIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNA 235
           EIEAE  AV+ V +D+CPLKI+LD+V+LTSTGVLLG WQV SGTDP  IR++LR ALP A
Sbjct: 169 EIEAEVDAVKRVTDDVCPLKIILDQVVLTSTGVLLGLWQVESGTDPADIRSRLREALPRA 228

Query: 236 PEKQLYDPAILHTSFARLLGHPRASPTEPH-----QLRFLHELVDSLNNQIRGSEAVVSE 290
           P+KQLYDP +LHTS AR+LGHP+  P E +      ++F H+LV  +N++IRG +A V E
Sbjct: 229 PQKQLYDPVMLHTSLARILGHPKL-PQEGNAQSLDHVKFFHDLVAQVNSKIRGFQATVKE 287

Query: 291 LWYVEEYDVLALALDGRMK 309
           LW+VEEYDVLALAL+G+MK
Sbjct: 288 LWFVEEYDVLALALNGKMK 306


>gi|79380240|ref|NP_177593.2| uncharacterized protein [Arabidopsis thaliana]
 gi|56461760|gb|AAV91336.1| At1g74530 [Arabidopsis thaliana]
 gi|58331809|gb|AAW70402.1| At1g74530 [Arabidopsis thaliana]
 gi|332197483|gb|AEE35604.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 289

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 168/268 (62%), Positives = 200/268 (74%), Gaps = 8/268 (2%)

Query: 19  KTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARDLDD 78
           +T  L W IS  +FYS F M ++NS      S            E+  RLY+KM +DL +
Sbjct: 15  RTAVLFWTISLTIFYSLFQMGIRNSPSTSSPSSDSFVSY----AEQSTRLYNKMEQDLQE 70

Query: 79  HGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDAVKR 138
           +G  FLKQGETSQSL LSD+FTLKDG + PV K ANPPVRANVL+LS +YSVP+ + VK 
Sbjct: 71  NGPVFLKQGETSQSLSLSDLFTLKDGKIAPVLKVANPPVRANVLHLSTEYSVPVLEVVKN 130

Query: 139 IFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLKIVL 198
           +FSP+F+  IWFQ+S +YHFSMFHAS+HI  VPATE E+EAEA AV+AVA++LCPL+I+L
Sbjct: 131 VFSPYFENTIWFQDSKMYHFSMFHASNHIFSVPATEVEVEAEAAAVKAVAKELCPLEIIL 190

Query: 199 DRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPR 258
           DRV+LTSTGVLLGCW+V SG DP+TIR KLR+ LP APEKQLYD AILHTS ARLLG P 
Sbjct: 191 DRVLLTSTGVLLGCWKVNSGDDPITIRLKLRSVLPRAPEKQLYDAAILHTSLARLLGPP- 249

Query: 259 ASPTEP---HQLRFLHELVDSLNNQIRG 283
            SPTE      L+ LHELV  LNNQIRG
Sbjct: 250 ISPTEAGSDDHLQLLHELVTRLNNQIRG 277


>gi|30699025|ref|NP_849885.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332197484|gb|AEE35605.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 262

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 154/249 (61%), Positives = 188/249 (75%), Gaps = 9/249 (3%)

Query: 19  KTVTLIWFISFVLFYSFFHMALQNSSQNYPSSGGITSHSHIPNGERRARLYDKMARDLDD 78
           +T  L W IS  +FYS F M ++NS     S+   +S S +   E+  RLY+KM +DL +
Sbjct: 15  RTAVLFWTISLTIFYSLFQMGIRNSP----STSSPSSDSFVSYAEQSTRLYNKMEQDLQE 70

Query: 79  HGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSPKYSVPISDAVKR 138
           +G  FLKQGETSQSL LSD+FTLKDG + PV K ANPPVRANVL+LS +YSVP+ + VK 
Sbjct: 71  NGPVFLKQGETSQSLSLSDLFTLKDGKIAPVLKVANPPVRANVLHLSTEYSVPVLEVVKN 130

Query: 139 IFSPHFDK----VIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPL 194
           +FSP+F+      I   +S +YHFSMFHAS+HI  VPATE E+EAEA AV+AVA++LCPL
Sbjct: 131 VFSPYFENSKPAAITIYDSKMYHFSMFHASNHIFSVPATEVEVEAEAAAVKAVAKELCPL 190

Query: 195 KIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLL 254
           +I+LDRV+LTSTGVLLGCW+V SG DP+TIR KLR+ LP APEKQLYD AILHTS ARLL
Sbjct: 191 EIILDRVLLTSTGVLLGCWKVNSGDDPITIRLKLRSVLPRAPEKQLYDAAILHTSLARLL 250

Query: 255 GHPRASPTE 263
           G P  SPTE
Sbjct: 251 GPP-ISPTE 258


>gi|413917243|gb|AFW57175.1| hypothetical protein ZEAMMB73_835412 [Zea mays]
          Length = 199

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 125/188 (66%), Positives = 152/188 (80%), Gaps = 4/188 (2%)

Query: 135 AVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPL 194
           AVK +F P+FD+ IWFQN+S+YHFSMFHASHH+ P+ ATEDEIEAE  AV+ V ++LCPL
Sbjct: 8   AVKDVFLPYFDQAIWFQNTSMYHFSMFHASHHLEPIVATEDEIEAEVEAVKGVTKNLCPL 67

Query: 195 KIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLL 254
           KIVLDRV+LTSTGVLLG WQV SGTDP  IR++LR ALP AP+KQLYDP +LHTSFAR+L
Sbjct: 68  KIVLDRVVLTSTGVLLGLWQVESGTDPAEIRSRLREALPRAPQKQLYDPVLLHTSFARIL 127

Query: 255 GHPR----ASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKV 310
           GHPR     S +    ++F HELV  +N +I G +A +SELWYVEEYDVLALAL+G+MKV
Sbjct: 128 GHPRLPKEQSASSFDNVKFFHELVAQINEKICGFQATISELWYVEEYDVLALALNGKMKV 187

Query: 311 RKFPLECS 318
           R+  L C+
Sbjct: 188 RRLHLGCN 195


>gi|302808959|ref|XP_002986173.1| hypothetical protein SELMODRAFT_123721 [Selaginella moellendorffii]
 gi|300146032|gb|EFJ12704.1| hypothetical protein SELMODRAFT_123721 [Selaginella moellendorffii]
          Length = 267

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 132/257 (51%), Positives = 171/257 (66%), Gaps = 10/257 (3%)

Query: 64  RRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLY 123
           +R+ +Y KMA    D G AFL  GET+QSL +SDIF+  +G + P+HK A PPVRA VLY
Sbjct: 14  QRSTIYQKMA----DEGRAFLAGGETTQSLKISDIFSFDNGKIHPIHKLAEPPVRAAVLY 69

Query: 124 LSPKYSVPISDAVKRIFSPHFDKV--IWFQNSSLYHFSMFHASHHISPVPATEDEIEAEA 181
           L  K+S  IS+ +      H      IW+QN  +YHFS++HASHH+ PVPA++ EI  E 
Sbjct: 70  LPQKFSRVISEIITETLGSHSSGKSPIWYQNPEMYHFSLYHASHHLEPVPASKSEIAREL 129

Query: 182 TAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLY 241
            AV  VA    PL+I L+RV+LT TG L+GCWQV+ GTDP+ IR +L+  LPNAP+KQLY
Sbjct: 130 EAVDQVARSCSPLRIQLERVVLTCTGALIGCWQVLEGTDPLVIRERLKNKLPNAPKKQLY 189

Query: 242 DPAILHTSFARLLGHPRASPTEPHQ---LRFLHELVDSLNNQIRGSEAVVSELWYVEEYD 298
           D  ILH+SFAR+L  P A+ T+         L +LV  LN  ++  EA + ELW+VEE D
Sbjct: 190 DQLILHSSFARILP-PTANETKKDNRMTTELLEQLVMDLNKALQNFEADIKELWFVEELD 248

Query: 299 VLALALDGRMKVRKFPL 315
           +LALAL GR+K  KF L
Sbjct: 249 ILALALQGRVKTHKFSL 265


>gi|302806469|ref|XP_002984984.1| hypothetical protein SELMODRAFT_157018 [Selaginella moellendorffii]
 gi|300147194|gb|EFJ13859.1| hypothetical protein SELMODRAFT_157018 [Selaginella moellendorffii]
          Length = 262

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 132/258 (51%), Positives = 167/258 (64%), Gaps = 10/258 (3%)

Query: 64  RRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLY 123
           +R+ +Y KMA    D G AFL  GET+QSL +SDIF+  +G + P+HK A PPVRA VLY
Sbjct: 7   QRSTIYQKMA----DEGRAFLAGGETTQSLKISDIFSFDNGKIHPIHKLAEPPVRAAVLY 62

Query: 124 LSPKYSVPISDAVKRIFSPHFDKV--IWFQNSSLYHFSMFHASHHISPVPATEDEIEAEA 181
           L  K+S  IS+ V      H      IW+QN  +YHFS++HASHH+ PVPA++ EI  E 
Sbjct: 63  LPQKFSRVISEIVTETLGSHSSGKSPIWYQNPEMYHFSLYHASHHLEPVPASKSEIAREL 122

Query: 182 TAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLY 241
            AV  VA    PL+I L+RV+LT TG L+GCWQV+ GTDP+ IR +L+  LPNAP+KQLY
Sbjct: 123 EAVDQVARSCSPLRIQLERVVLTCTGALIGCWQVLEGTDPLVIRERLKNKLPNAPKKQLY 182

Query: 242 DPAILHTSFARLLGHPRASPTEPHQLR----FLHELVDSLNNQIRGSEAVVSELWYVEEY 297
           D  ILH SFAR+L        E    R     L +LV  LN  ++  EA + ELW+VEE 
Sbjct: 183 DKLILHCSFARILPPTANETVETKDNRMTTELLEQLVMDLNKALQNFEADIKELWFVEEL 242

Query: 298 DVLALALDGRMKVRKFPL 315
           D+LALAL GR+K  KF L
Sbjct: 243 DILALALQGRVKTHKFSL 260


>gi|147866002|emb|CAN83053.1| hypothetical protein VITISV_000958 [Vitis vinifera]
          Length = 186

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 124/185 (67%), Positives = 139/185 (75%), Gaps = 26/185 (14%)

Query: 61  NGERRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRAN 120
           + ERR++LYDKM  DL++HGAAFL+ GETSQSL LSD+FTLKDG VTPV KAA PPVRAN
Sbjct: 3   DSERRSKLYDKMESDLNEHGAAFLEHGETSQSLSLSDLFTLKDGIVTPVLKAAKPPVRAN 62

Query: 121 VLYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAE 180
           VLYLS ++SVPIS                          MFHASHHI+PVPATE EIE E
Sbjct: 63  VLYLSTEFSVPIS--------------------------MFHASHHITPVPATEVEIEGE 96

Query: 181 ATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQL 240
           ATAV AVA+ LCPLKIVLDRV+LT TGVLLGCWQV SGTDP+TIRAKLRTALP AP+ QL
Sbjct: 97  ATAVEAVAKALCPLKIVLDRVLLTPTGVLLGCWQVASGTDPVTIRAKLRTALPRAPKNQL 156

Query: 241 YDPAI 245
             P +
Sbjct: 157 KSPQV 161


>gi|388518873|gb|AFK47498.1| unknown [Lotus japonicus]
          Length = 162

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 108/141 (76%), Positives = 129/141 (91%)

Query: 61  NGERRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPPVRAN 120
           N E+R++LYDKMARDLD+HGAAF+K GETSQSL +SDIFT+KDG VTPV KAANPPVRAN
Sbjct: 21  NTEQRSKLYDKMARDLDEHGAAFMKHGETSQSLTISDIFTVKDGYVTPVLKAANPPVRAN 80

Query: 121 VLYLSPKYSVPISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAE 180
           +LYLS ++SVPI++AVK IFSPHF+K IWFQNS++YHFSMFHASHHI PVPA+++EIE E
Sbjct: 81  ILYLSTEFSVPIAEAVKSIFSPHFEKAIWFQNSTMYHFSMFHASHHIVPVPASKEEIEVE 140

Query: 181 ATAVRAVAEDLCPLKIVLDRV 201
           A++V+AVA  LCPLKIVLDRV
Sbjct: 141 ASSVKAVAAMLCPLKIVLDRV 161


>gi|384246785|gb|EIE20274.1| hypothetical protein COCSUDRAFT_67560 [Coccomyxa subellipsoidea
           C-169]
          Length = 394

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 114/282 (40%), Positives = 153/282 (54%), Gaps = 27/282 (9%)

Query: 57  SHIPNGERRARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKAANPP 116
           +H  N  RR  LYD +   L   G    +  E +Q L    +F  +DG V P  +    P
Sbjct: 91  AHQMNVVRRG-LYDVLEAQLGHTGLMLGR--EQTQGLDARSLFRKRDGRVEPALQPLQVP 147

Query: 117 VRANVLYLS-PKYSVPISDAVKRIFSP-HFDKVIWFQNSSLYHFSMFHASHHISPVPATE 174
           VRA VL+L  P  +  +S AV R  SP   D  +W Q+ + YH ++FHAS H  PV A+ 
Sbjct: 148 VRAVVLHLGDPAAARALSAAVARHMSPLAQDAGLWLQDPAKYHATLFHASTHEFPVEASS 207

Query: 175 DEIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRA-------- 226
            E+E E  AV   A  LCP+++VL+RVI T TG +L CWQ++ G+DP  +R         
Sbjct: 208 AEVETEVKAVERSAVHLCPIEVVLERVIATPTGNILACWQILGGSDPAAVRRHGSPCLPS 267

Query: 227 ---KLRTALPNAP---EKQLYDPAILHTSFARLLGHPRASPTE------PHQLRFLHELV 274
               LR ALP AP   ++ ++DPAILHT+ ARLL  PRAS         P     L   V
Sbjct: 268 CQWALRGALPRAPPAEQQTVHDPAILHTTLARLLRLPRASAGRVLRSGAPDGAELLRSAV 327

Query: 275 DSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRKFPLE 316
             ++  + G  A +  LWYVEE D+LALAL+GR  VRK P++
Sbjct: 328 ARISADLCGLRATLPTLWYVEEEDLLALALNGR--VRKHPVQ 367


>gi|307104810|gb|EFN53062.1| hypothetical protein CHLNCDRAFT_54034 [Chlorella variabilis]
          Length = 266

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 84/246 (34%), Positives = 132/246 (53%), Gaps = 17/246 (6%)

Query: 90  SQSLLLSDIFTLKDGSVTPVHKAANPPVRANVLYLSP-KYSVPISDAVKRIFSPHFD-KV 147
           +Q + L D+F ++ G + PV      PVRA V+ LS  + ++ + DAV+R   P      
Sbjct: 18  TQGMALGDMFKIEGGRLRPVVTVLPIPVRAIVMPLSDIQAALRLEDAVQRHLDPLMPPNG 77

Query: 148 IWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLKIVLDRVILTSTG 207
           IW Q+  LYH ++FHAS H+ PV A   +++ E   ++ V    CPL + L+R++ T +G
Sbjct: 78  IWKQHHRLYHSTIFHASTHMDPVRAAPRDVDMEEYLIKDVGHKACPLHLTLERIVATPSG 137

Query: 208 VLLGCWQVISGTDPMTIRAKLRTALPNAPEKQLY-DPAILHTSFARLLGHPR-------- 258
            +L CWQ+++GTD + IR  L   LP+  ++Q+  D  ILHT+ ARL+  P+        
Sbjct: 138 TVLACWQIMNGTDVLDIRNWLEEILPHHSKQQVVQDRTILHTTLARLVAPPKLDGLAAAA 197

Query: 259 ------ASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRK 312
                          R L    D +   + G   V+ ++W+VEEY+ LALAL G  + R+
Sbjct: 198 TDGGQGGGGAAADAARVLQAAADRMTTDLCGLRTVIDKMWFVEEYEKLALALRGTFRQRE 257

Query: 313 FPLECS 318
            P+EC 
Sbjct: 258 VPMECE 263


>gi|255078854|ref|XP_002503007.1| predicted protein [Micromonas sp. RCC299]
 gi|226518273|gb|ACO64265.1| predicted protein [Micromonas sp. RCC299]
          Length = 378

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 142/278 (51%), Gaps = 23/278 (8%)

Query: 65  RARLYDKMARDLDDHGAAFLKQG---ETSQSLLLSDIFTL--KDGSVTPVHKAANPPVRA 119
           R+RLY + A  L+  G     +     T+Q L LSD+F    + G VT      + PVRA
Sbjct: 94  RSRLYGEFASGLEARGLDVFSRAAAASTTQGLGLSDLFDFHPRSGRVTARLHRLDVPVRA 153

Query: 120 NVLYL--SPKYSVPISDAVKRIFSPHF------------DKVIWFQNSSLYHFSMFHASH 165
            VL L  + + +  I    +R+   HF               +W+Q+ + YHFS++HASH
Sbjct: 154 VVLPLPAASRAAARIHRTTRRVLGEHFPGYGSKDGLAGHGDSVWYQDPTAYHFSVYHASH 213

Query: 166 HISPVPATEDEIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIR 225
           H++ V  T +E   EA A+      +CP++ V++RV++  +G +L  W V +GT+P  +R
Sbjct: 214 HLAEVRVTPEEERLEAAAIAGACASMCPIRAVIERVVVAPSGAVLALWNVAAGTEPSDLR 273

Query: 226 AKLRTALPNAPEKQLY-DPAILHTSFARLLG---HPRASPTEPHQLRFLHELVDSLNNQI 281
           A LR ALPN+P  Q+  D AILHT+ ARLL     P  +           +   ++   +
Sbjct: 274 AALRAALPNSPRAQIVKDRAILHTTVARLLRPPLGPMGAVVRDGGAGAARKAAAAMTASL 333

Query: 282 RGSEAVVSELWYVEEYDVLALALDGRMKVRKFPLECSS 319
            G+EA +   W+V EYD LALAL G  +    P  C +
Sbjct: 334 CGTEATLPVAWFVREYDKLALALGGEYEPETMPFACEA 371


>gi|308806588|ref|XP_003080605.1| unnamed protein product [Ostreococcus tauri]
 gi|116059066|emb|CAL54773.1| unnamed protein product [Ostreococcus tauri]
          Length = 336

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 94/271 (34%), Positives = 132/271 (48%), Gaps = 21/271 (7%)

Query: 65  RARLYDKMARDLDDHG-AAFLKQGETSQSLLLSDIFTLK-DGSVTPVHKAANPPVRANVL 122
           R+  Y  M R+  + G AAF      +Q L + D+FT   DG + P     + PVRA V+
Sbjct: 70  RSVSYANMGREARERGIAAFDVNARATQGLRVDDMFTRDGDGKIKPNLLHLDVPVRAIVI 129

Query: 123 YLSPKYSVPISDAVKRIFSPHFDKV-------IWFQNSSLYHFSMFHASHHISPVPATED 175
              P     +S  + R    H  K        ++ QN  ++H SMFHASHHI     +  
Sbjct: 130 ---PFPVGVVSAMLHRGTLKHLAKFGFQEDVDVYVQNEEMFHASMFHASHHIEEHAVSAS 186

Query: 176 EIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGT------DPMTIRAKLR 229
           E   E   +R++    CP+  VL+RV +TS GV+L  WQV   T      +P   RA LR
Sbjct: 187 EALEEERRIRSITTKFCPINAVLERVTVTSGGVVLAGWQVARDTWNSKSGEPSEFRAALR 246

Query: 230 TALPNAPEKQLY-DPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVV 288
            ALP +P KQL  D  I+HT+ ARL   P+    +    R   +L  +L +++ G E  +
Sbjct: 247 DALPKSPTKQLVSDVNIIHTTLARLTRPPKGRRGDDG--RLAKDLAFALTDELCGLEITL 304

Query: 289 SELWYVEEYDVLALALDGRMKVRKFPLECSS 319
              W+V E+  LALAL G +     P EC S
Sbjct: 305 DRAWFVREHHKLALALRGAVDKVDMPFECHS 335


>gi|302832954|ref|XP_002948041.1| hypothetical protein VOLCADRAFT_88255 [Volvox carteri f.
           nagariensis]
 gi|300266843|gb|EFJ51029.1| hypothetical protein VOLCADRAFT_88255 [Volvox carteri f.
           nagariensis]
          Length = 395

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 77/205 (37%), Positives = 116/205 (56%), Gaps = 14/205 (6%)

Query: 65  RARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGS------VTPVHKAANPPVR 118
           R R+Y+ M       G      G  +Q L  +D+F +   S      +TP+      PVR
Sbjct: 34  RRRVYEAMGAGSRRKGLEL--GGVITQGLTQADLFKVTPDSRSNRILITPILHPLEVPVR 91

Query: 119 ANVLYLSPKYSV-PISDAVKRIFSPHF-DKVIWFQNSSLYHFSMFHASHHISPVPATEDE 176
           A +L L    +   I DAV++   P   +  IW Q+ +LYH +++HAS H+ P+ A+ + 
Sbjct: 92  AMILPLRDNVAAGKIRDAVRQFLLPVLPEGSIWLQDDALYHATLYHASSHVKPIHASAEV 151

Query: 177 IEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVI-SGTDPMTIRAKLRTALPNA 235
           +  E  ++RAV    CP+  VLDRV++TSTGV++ CWQV+  G +P  +RA L  ALPNA
Sbjct: 152 VAEEERSIRAVCGSTCPINAVLDRVVITSTGVVVACWQVLPEGGEPALLRAALGAALPNA 211

Query: 236 PEKQ---LYDPAILHTSFARLLGHP 257
           P ++   + +PA+LHT+ ARLL  P
Sbjct: 212 PPREAQMVKEPAMLHTTVARLLKPP 236



 Score = 43.9 bits (102), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 29/46 (63%)

Query: 274 VDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRKFPLECSS 319
           VD+L+  + G  A   E+W++EE D+LALAL+GR      PL C S
Sbjct: 341 VDALSATLCGLRAQFGEVWFIEEQDLLALALNGRYVRHVAPLTCPS 386


>gi|412991204|emb|CCO16049.1| predicted protein [Bathycoccus prasinos]
          Length = 424

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/263 (32%), Positives = 127/263 (48%), Gaps = 29/263 (11%)

Query: 65  RARLYDKMARDLDDHG---------AAFLKQGETSQSLLLSDIF---TLKDGSVTPVHKA 112
           R+ LY KM R   D              ++    +  L L  +F   T KD  VTP+ + 
Sbjct: 155 RSELYAKMFRSTSDTAVLDIFNPSKTVIVQTANNNNDLTLDKVFVIDTEKD-EVTPILEK 213

Query: 113 ANPPVRANVLYL--SPKYSVPISDAVKRIFSPH-FDK--VIWFQNSSLYHFSMFHASHHI 167
              PVRA VL    + + +  I    + + + + F K   +W QN+ LYH S++HASHH+
Sbjct: 214 LEIPVRAIVLPFPSTSEGAKKIGKITREVLAKNGFSKETSVWLQNAELYHASVYHASHHL 273

Query: 168 SPVPATEDEIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTI--- 224
               A  +EI  E   V+  A+ +CP+KI L+R+++TS+G L+  W   +  D   I   
Sbjct: 274 DAHKAKGNEIREEVRVVKESAKQICPIKITLERIVITSSGALVSVWNTRNTRDGGEISEF 333

Query: 225 RAKLRTALPNAPEKQLY-DPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRG 283
           R  L   LPNAP  Q+  + +I+H + AR LG    S    +  R        L N + G
Sbjct: 334 RQLLHDNLPNAPVNQIVSNKSIIHATLARFLGTTGTSENAKNVAR-------ELTNVLCG 386

Query: 284 SEAVVSELWYVEEYDVLALALDG 306
            E  +   W+VEE   LALAL+G
Sbjct: 387 LEMTLPMAWFVEERHTLALALEG 409


>gi|303278894|ref|XP_003058740.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459900|gb|EEH57195.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 360

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/277 (33%), Positives = 135/277 (48%), Gaps = 20/277 (7%)

Query: 44  SQNYPSSGGITSHSHIPNGERRARLYDKMARDLDDHGAAFLKQ--GETSQSLLLSDIFTL 101
           S +  ++ G  + SH+ +   R++LY  M  +L + G        G  +Q   + D+F  
Sbjct: 74  SGSVDAASGSWALSHVAS---RSKLYAAMDAELTERGLTAFDDAGGARTQGSGVDDLFAF 130

Query: 102 KDGSVTPVHKAA----NPPVRANVLYLSPKYS------VPISDAVKRIFSPHFDKVIWFQ 151
               VT + +A     +  VRA VL L P+ S         + AV      H  +  W Q
Sbjct: 131 D--RVTGLARAKLTRLDVAVRAIVLPL-PRRSRAAFRMRRATRAVVAAAGYHDGRDAWLQ 187

Query: 152 NSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLG 211
           +   YHFSMFHASHH++P PA+  E+EAE  AV  V    C +   ++RV+ T +G ++ 
Sbjct: 188 DPDAYHFSMFHASHHLAPTPASAAEVEAEMRAVARVVRGACAMTATIERVVATPSGSVVA 247

Query: 212 CWQVISGTDPMTIRAKLRTALPNAPEKQLY-DPAILHTSFARLLG-HPRASPTEPHQLRF 269
            W +  G++P   R  LR ALPNAP  Q+  D  I+HT+ ARLL   P A          
Sbjct: 248 LWNLAGGSEPSAFRDALRAALPNAPAAQIVSDEHIMHTTLARLLRPPPGAGGDAAAATAA 307

Query: 270 LHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDG 306
              +   L  ++ G E  +   W+V+E D LALAL G
Sbjct: 308 AAAIARQLTERLCGVEVTLPIAWFVQERDKLALALGG 344


>gi|145349260|ref|XP_001419055.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579286|gb|ABO97348.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 258

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/238 (35%), Positives = 121/238 (50%), Gaps = 12/238 (5%)

Query: 90  SQSLLLSDIFTLK-DGSVTPVHKAANPPVRANVLYLSPKYSVPI--SDAVK--RIFSPHF 144
           +Q L L D+F +  DG ++P       PVRA V+         I     VK  R F    
Sbjct: 23  TQGLRLDDMFKIGVDGKISPNLMVLEVPVRAIVIPFPVGAVSQILHQGTVKHLRRFGFVD 82

Query: 145 DKVIWFQNSSLYHFSMFHASHHISPVPATEDEIEAEATAVRAVAEDLCPLKIVLDRVILT 204
           +  ++ QN  ++HFS+FHASHH+     +E     E +A+R+V    CP++ VL+RV +T
Sbjct: 83  ETNVYLQNPEMFHFSVFHASHHLEEHAVSEVGFREEVSAIRSVTRKFCPIRAVLERVTVT 142

Query: 205 STGVLLGCWQV--ISGTDPMTIRAKLRTALPNAPEKQLY-DPAILHTSFARLLGHPRASP 261
           S GV++  W V   S  +P  +RA+LR  LP AP KQL  D  ILHT+ ARL+  P  + 
Sbjct: 143 SGGVVVAGWNVERTSLGEPSDLRARLREVLPRAPTKQLVSDAFILHTTLARLVRPPDGTV 202

Query: 262 TEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRKFPLECSS 319
            E         +   L ++  G E  +   W+V E+  LALAL G +     P +C S
Sbjct: 203 AET----LADAMSRDLTDEFCGLELTLDRAWFVSEHHKLALALKGAVDKLDMPFDCHS 256


>gi|159489992|ref|XP_001702973.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270880|gb|EDO96711.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 438

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 75/205 (36%), Positives = 115/205 (56%), Gaps = 14/205 (6%)

Query: 65  RARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGS------VTPVHKAANPPVR 118
           R R+Y+ M   +   G +    G  +Q L   D++ +   +      +TPV  A   PVR
Sbjct: 63  RRRVYEAMGAGVRSKGLSL--GGVVTQGLKQDDLYRITHDTRSDKMLITPVLHALEVPVR 120

Query: 119 ANVLYLSPKYSV-PISDAVKRIFSPHF-DKVIWFQNSSLYHFSMFHASHHISPVPATEDE 176
           A +L L    +   I DAV+++  P   +  IW Q+  LYH +++HAS H  P+ A  + 
Sbjct: 121 AMILPLRDHVAASKIRDAVQQLLMPVLPEGSIWLQDDELYHATLYHASSHAKPIHARGEV 180

Query: 177 IEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVIS-GTDPMTIRAKLRTALPNA 235
           I  E  A++ V    CP+  VLD+V++T+TGV++ CWQV+  G +P  +RA L  ALPNA
Sbjct: 181 IAEEERAIKRVCGSTCPISAVLDKVVVTTTGVVVACWQVLPIGGEPALLRAALGVALPNA 240

Query: 236 PEKQ---LYDPAILHTSFARLLGHP 257
           P ++   + +PA+LHT+ ARLL  P
Sbjct: 241 PPREAQMVKEPAMLHTTIARLLRPP 265



 Score = 41.2 bits (95), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 27/44 (61%)

Query: 274 VDSLNNQIRGSEAVVSELWYVEEYDVLALALDGRMKVRKFPLEC 317
           VD L+  + G  A   E+W++EE D+LALAL+GR      PL C
Sbjct: 378 VDLLSATLCGLRAQFGEVWFIEEQDLLALALNGRYVRHLAPLTC 421


>gi|413917246|gb|AFW57178.1| hypothetical protein ZEAMMB73_835412 [Zea mays]
          Length = 130

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 49/71 (69%), Positives = 57/71 (80%), Gaps = 1/71 (1%)

Query: 65  RARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIF-TLKDGSVTPVHKAANPPVRANVLY 123
           RA LY KMARDLD+ GAAFLK GETSQSL LSD+F T  DG+V P  KAA+PPVRANVLY
Sbjct: 58  RAELYSKMARDLDERGAAFLKGGETSQSLTLSDLFDTADDGAVVPRLKAADPPVRANVLY 117

Query: 124 LSPKYSVPISD 134
           + PK++  IS+
Sbjct: 118 MDPKFAAVISE 128


>gi|224032785|gb|ACN35468.1| unknown [Zea mays]
          Length = 127

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 49/70 (70%), Positives = 56/70 (80%), Gaps = 1/70 (1%)

Query: 65  RARLYDKMARDLDDHGAAFLKQGETSQSLLLSDIF-TLKDGSVTPVHKAANPPVRANVLY 123
           RA LY KMARDLD+ GAAFLK GETSQSL LSD+F T  DG+V P  KAA+PPVRANVLY
Sbjct: 58  RAELYSKMARDLDERGAAFLKGGETSQSLTLSDLFDTADDGAVVPRLKAADPPVRANVLY 117

Query: 124 LSPKYSVPIS 133
           + PK++  IS
Sbjct: 118 MDPKFAAVIS 127


>gi|413917244|gb|AFW57176.1| hypothetical protein ZEAMMB73_835412 [Zea mays]
          Length = 63

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 44/63 (69%), Positives = 51/63 (80%), Gaps = 1/63 (1%)

Query: 72  MARDLDDHGAAFLKQGETSQSLLLSDIF-TLKDGSVTPVHKAANPPVRANVLYLSPKYSV 130
           MARDLD+ GAAFLK GETSQSL LSD+F T  DG+V P  KAA+PPVRANVLY+ PK++ 
Sbjct: 1   MARDLDERGAAFLKGGETSQSLTLSDLFDTADDGAVVPRLKAADPPVRANVLYMDPKFAA 60

Query: 131 PIS 133
            IS
Sbjct: 61  VIS 63


>gi|384251067|gb|EIE24545.1| hypothetical protein COCSUDRAFT_52988 [Coccomyxa subellipsoidea
           C-169]
          Length = 269

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 83/186 (44%), Gaps = 30/186 (16%)

Query: 132 ISDAVKRIFSPHFDKVIWFQNSSLYHFSMFHASHHISPVPA------------------- 172
           +++A+K +  P   +V +       H ++FH S    P P+                   
Sbjct: 73  VTEAIKALL-PAGTQV-YVNAPEKLHITVFHMSRPDDPRPSSMSAADDAITAVPITDRRG 130

Query: 173 -TEDEIEAEATAVRAVAEDLCPLK-IVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRT 230
            T++EI +E  A+ ++ + + PL+ + +  ++L  +G LL C  +    D   +R  LR 
Sbjct: 131 PTDEEIASELAAMTSIVQQMQPLEHLEVSSIVLADSGTLLLC-SIDRRADITKLRQSLRA 189

Query: 231 ALPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVVSE 290
           A P AP+KQ    +I+HT+  R+L  P   P  P  +  +         ++RG+   + +
Sbjct: 190 AYPGAPKKQT---SIIHTTLMRILT-PEQLP--PASVAAIQRQCALWTERLRGTRLTIKK 243

Query: 291 LWYVEE 296
            WY  E
Sbjct: 244 AWYTVE 249


>gi|384250172|gb|EIE23652.1| hypothetical protein COCSUDRAFT_63177 [Coccomyxa subellipsoidea
           C-169]
          Length = 297

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 62/130 (47%), Gaps = 8/130 (6%)

Query: 170 VPATEDEI-EAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKL 228
           +PA +  +  AE  A R +A       + +  ++  ++G LL C   ISG     +R K+
Sbjct: 156 IPAPQPPVLTAEKDAFRTLAAQTSAPILEVHSIVFAASGTLLLCCVDISGVL-AGLRTKI 214

Query: 229 RTALPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVV 288
           R   P  P KQ    +I H +  R+L  P+A   +  Q+  +H        +++G++   
Sbjct: 215 RRVFPGGPMKQ---SSIFHVTLMRIL-EPQAMSRD--QVEAIHTACQQETERLKGTQFTA 268

Query: 289 SELWYVEEYD 298
           S LWYV+E+ 
Sbjct: 269 SSLWYVQEWQ 278


>gi|412985412|emb|CCO18858.1| unnamed protein product [Bathycoccus prasinos]
          Length = 593

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 30/141 (21%), Positives = 67/141 (47%), Gaps = 11/141 (7%)

Query: 164 SHHISPVPATEDEIEA--------EATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQV 215
           +H  +P+ +T+ + +         EAT  + V +   P+++V+DRV+L ++G LL C+  
Sbjct: 439 AHASTPISSTQSKFKRSASGAILDEATKAQEVLQTCEPVELVVDRVVLATSGALLLCFDD 498

Query: 216 ISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRFLHELVD 275
           +   D    + +   A      + +   +  H + AR+L +         +L+ + +L+ 
Sbjct: 499 V---DDSLKKCRASLASNAVGVRGVQQTSTAHCTLARILPNAGDERLSDFELKQIDQLLT 555

Query: 276 SLNNQIRGSEAVVSELWYVEE 296
               +++G++    + WYV E
Sbjct: 556 KWTKKLKGAKMTCPKAWYVRE 576


>gi|302844073|ref|XP_002953577.1| hypothetical protein VOLCADRAFT_106011 [Volvox carteri f.
           nagariensis]
 gi|300260986|gb|EFJ45201.1| hypothetical protein VOLCADRAFT_106011 [Volvox carteri f.
           nagariensis]
          Length = 375

 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 132/359 (36%), Gaps = 94/359 (26%)

Query: 27  ISFVLFYSFFHMALQNSSQNYPSSG--GITSHSHIPNGERR---------------ARLY 69
           +  VL  S + +   +  +N  ++G  G  SH H  NG+ +               A LY
Sbjct: 15  VGAVLAASTWFLRHSSRQRNVLATGRSGEESHQHT-NGDAKVTEDAGPVTQEAKDWAALY 73

Query: 70  DKMARDLDDHGAAFLKQGETSQSLLLSDIFTLKDGSVTPVHKA---ANPPVRANVLYLSP 126
           D+MA   D+  +  L         L S    ++D   TP  K      PP++   + L  
Sbjct: 74  DRMA---DEFASGDLNPKAIEP--LKSWFHGVEDPEETPWPKKYTELEPPLQFGAVRL-- 126

Query: 127 KYSVPISDA--VKRIFS----------PHFDKVIWFQNS-SLYHFSMFHASHHISPVP-- 171
              VP+ D+  V RI +          P   KV  F N+   YH ++FH S    P P  
Sbjct: 127 -LVVPLDDSPVVLRIAASVTADVMRLLPDGTKV--FANARGNYHCTVFHTSQPTDPRPDP 183

Query: 172 -----ATEDEIEAEATAVRAVAEDLCPLKIV----------------LDRVILTSTGVLL 210
                 T+  +E     +   AE    L+IV                L+RV+  STGVLL
Sbjct: 184 TRPDGGTDPTLEPSRRRLPTDAEWRRELEIVRQLVAATPQPLLRPPQLERVVQASTGVLL 243

Query: 211 GCW-QVISGTDPMTIRAKLRTALPNA-----------------------PEKQLYDPAIL 246
             W +V  G     +R +LR A P A                       P K      I+
Sbjct: 244 LTWTEVGQGAVVSDLRRRLREAFPGASTKQTLPRSTHNKVRTPPKNSLSPTKIFVKATII 303

Query: 247 HTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVEEYDVLALALD 305
           H+S  R++    + P  P  +  + E       ++RG+      LW++ E +   +  D
Sbjct: 304 HSSLLRIVS---SRPLGPAAVDAVSEACRHWTAKLRGTLYTPRTLWFIREMEFSTIEGD 359


>gi|125600144|gb|EAZ39720.1| hypothetical protein OsJ_24159 [Oryza sativa Japonica Group]
          Length = 132

 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/33 (60%), Positives = 22/33 (66%)

Query: 64  RRARLYDKMARDLDDHGAAFLKQGETSQSLLLS 96
           RR  LY  M RDLD+HGA FL+  ETSQS   S
Sbjct: 75  RRVELYGWMTRDLDEHGATFLEGSETSQSRCRS 107


>gi|145355516|ref|XP_001422007.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582246|gb|ABP00301.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 340

 Score = 42.0 bits (97), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 30/125 (24%), Positives = 53/125 (42%), Gaps = 4/125 (3%)

Query: 172 ATEDEIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTA 231
           A ED +  E +AV          ++ +DRV L  +G L+ C+  + G +  TIR +L  +
Sbjct: 204 AREDAVAYEESAVAEALRGCGECELEVDRVCLAPSGCLILCFNDVHG-NVQTIRERLARS 262

Query: 232 LPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVVSEL 291
              A   Q      +H + ARL         +P + R +  +  ++ + +RG+       
Sbjct: 263 AVGAARAQ---NDTVHCTLARLFARADGFQMDPERARAVQAMCATVTSALRGARFKARSA 319

Query: 292 WYVEE 296
            Y  E
Sbjct: 320 TYCVE 324


>gi|301100434|ref|XP_002899307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104224|gb|EEY62276.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 287

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 51/256 (19%), Positives = 107/256 (41%), Gaps = 33/256 (12%)

Query: 63  ERRARLYDKM----ARDLDDHGAAFLKQGETSQSLLLSD--IFTLKDGSVTPVHKAANPP 116
           ++RA+LY+++     +DL+  G    +Q  T+   L S   +F+ ++  V  V      P
Sbjct: 15  QQRAKLYEELYEGGKKDLNREGVRKTRQNFTTGRTLESKPYVFSSENDGVGLVCA----P 70

Query: 117 VRANVLYLSPKYSVPISDAVKRIFSPHFDK-VIWFQNSSLYHFSMFHASHHISPVPATED 175
           +  ++          ++D + R   P  D  V       L H ++FH SH     P  + 
Sbjct: 71  IPLDMAERCHSIGRELTDLLPR--EPETDNAVACVTPQGLMHVTLFHTSHPGDLAPNAKV 128

Query: 176 EIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMT------------ 223
               +   ++ +   + P ++   +V++ S+G ++  +Q +   + ++            
Sbjct: 129 RFPQDVAQLKTMCTRISPFRMEPVKVLMASSGAIIMLFQCLPANEKLSDDVVNAHTEFSV 188

Query: 224 --IRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRF-LHELVDSLNNQ 280
             IR+  +     AP+       I+H +  R+L  P     + ++LR    E+   L   
Sbjct: 189 DYIRSVAKETFTYAPKTDTR--VIIHATLGRVLS-PDVHDIDLNRLRTRCDEITSEL--A 243

Query: 281 IRGSEAVVSELWYVEE 296
           +    A+ ++LW+VEE
Sbjct: 244 VDPQPALFNKLWFVEE 259


>gi|308813193|ref|XP_003083903.1| unnamed protein product [Ostreococcus tauri]
 gi|116055785|emb|CAL57870.1| unnamed protein product [Ostreococcus tauri]
          Length = 392

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 62/138 (44%), Gaps = 7/138 (5%)

Query: 162 HASHHISPVPAT---EDEIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISG 218
           +AS     +P T   E+ I  E +A+    +     ++ +DRV L  +G LL C++ ++G
Sbjct: 244 NASGTTRTLPRTLPIENAIAFEESAIADAVKGCGTCELEVDRVCLAPSGCLLLCFKDVNG 303

Query: 219 TDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLN 278
              + +R +LR  +  A  KQ      LH + ARL      S  +   +  ++ +   + 
Sbjct: 304 GSQV-VRERLRDKVVGAATKQ---NDTLHCTLARLFPKTEDSQLDDTTVAAINVMCQKVT 359

Query: 279 NQIRGSEAVVSELWYVEE 296
           + +R +      L YV E
Sbjct: 360 SSLRHARFTARNLTYVVE 377


>gi|255089589|ref|XP_002506716.1| predicted protein [Micromonas sp. RCC299]
 gi|226521989|gb|ACO67974.1| predicted protein [Micromonas sp. RCC299]
          Length = 368

 Score = 38.5 bits (88), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 35/144 (24%), Positives = 64/144 (44%), Gaps = 7/144 (4%)

Query: 156 YHFSMFHASHHISPVPATED-EIEAEATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQ 214
           YH ++FH  H   P P  +  ++  E      +     P+ + +DRV++ S+GVLL   +
Sbjct: 204 YHSTVFHTGHPTDPRPPRDAYDLAFEVREATQLVGGTTPIAVEVDRVVIASSGVLLALLR 263

Query: 215 VISGTDPMTIRAKLRTALPNAPEKQLYDPAILHTSFARLLGHPRASPTEPHQLRFLHEL- 273
              G +      + R         Q     ++H S  R+L   RA+P +    R+   L 
Sbjct: 264 HPGGGESPVDGLRARCRRRWPRAPQRQATHVMHVSLCRVL---RATP-QCDGARWGEVLV 319

Query: 274 -VDSLNNQIRGSEAVVSELWYVEE 296
            V  ++ ++ G  A +  +W+V+E
Sbjct: 320 RVREVSVELAGMRAQLGTVWHVQE 343


>gi|307108826|gb|EFN57065.1| hypothetical protein CHLNCDRAFT_143817 [Chlorella variabilis]
          Length = 388

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 7/116 (6%)

Query: 180 EATAVRAVAEDLCPLKIVLDRVILTSTGVLLGCWQVISGTDPMTIRAKLRTALPNAPEKQ 239
           E  A R  A      +  + R+++  +G LL C  V    +   +R +LR A P  P KQ
Sbjct: 247 EIEAFRVAAAGTDAPRFEVHRLVMADSGTLLLC-SVDHTANLAQLRKRLRHAFPGGPPKQ 305

Query: 240 LYDPAILHTSFARLLGHPRASPTEPHQLRFLHELVDSLNNQIRGSEAVVSELWYVE 295
                I+H S ARLL  P+    E  Q+  +  + D+ + ++RG       L++++
Sbjct: 306 ---STIVHASIARLL-TPKQLTRE--QIDRVQAVCDAWSERLRGMRFDPDRLYHIQ 355


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.133    0.407 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,026,137,945
Number of Sequences: 23463169
Number of extensions: 206368053
Number of successful extensions: 503631
Number of sequences better than 100.0: 55
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 503508
Number of HSP's gapped (non-prelim): 59
length of query: 320
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 178
effective length of database: 9,027,425,369
effective search space: 1606881715682
effective search space used: 1606881715682
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)