BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019981
         (333 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255537217|ref|XP_002509675.1| conserved hypothetical protein [Ricinus communis]
 gi|223549574|gb|EEF51062.1| conserved hypothetical protein [Ricinus communis]
          Length = 320

 Score =  492 bits (1267), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 249/334 (74%), Positives = 278/334 (83%), Gaps = 15/334 (4%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLP 60
           MA K+A  F+    RV++ P+ KP  +       S +    ++  +G+  +VRRR  +LP
Sbjct: 1   MASKLA--FSLTFPRVYSAPILKPLIS------LSSSSRIHSSHLHGKHLSVRRRILLLP 52

Query: 61  SKATTDQQ-GQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSN 119
            KAT DQQ G+VEGDEVVD KILQYCSIDKK+KKS+GE+EQEFLQALQAFYYEGKA+MSN
Sbjct: 53  IKATADQQQGKVEGDEVVDGKILQYCSIDKKDKKSIGEMEQEFLQALQAFYYEGKAIMSN 112

Query: 120 EEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEI 179
           EEFDNLKEELMWEGSSVVMLS  EQKFLEASMAYV+G PIMSDEE+D+LK +LK EGSEI
Sbjct: 113 EEFDNLKEELMWEGSSVVMLSCDEQKFLEASMAYVSGNPIMSDEEFDQLKLRLKQEGSEI 172

Query: 180 VVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEP 239
           VVEGPRCSLRSRKVYSDLSVDYLKM LLNVPA VVALGLFFFLDD+TGFEITYLLELPEP
Sbjct: 173 VVEGPRCSLRSRKVYSDLSVDYLKMFLLNVPAAVVALGLFFFLDDLTGFEITYLLELPEP 232

Query: 240 FSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTN 299
           FSFIFTWFAAVP+IV+LS S T  IV++ LILKGPCPNCGTEN SFFGTILSISSGGTTN
Sbjct: 233 FSFIFTWFAAVPVIVWLSLSFTNAIVKDFLILKGPCPNCGTENQSFFGTILSISSGGTTN 292

Query: 300 TINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           ++ CSN      CGT MVYDS TRLITLPEGS A
Sbjct: 293 SVKCSN------CGTDMVYDSKTRLITLPEGSNA 320


>gi|225426724|ref|XP_002282120.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Vitis
           vinifera]
          Length = 331

 Score =  491 bits (1265), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 247/333 (74%), Positives = 277/333 (83%), Gaps = 7/333 (2%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLP 60
           MA K+    T  A R+FA   +KP  + A   S+S      + Q  GRQ  + RR F++ 
Sbjct: 6   MATKLGFALT-TAPRLFASSSRKPLISPASLSSASSCSRVESFQLVGRQLRLSRRLFIIS 64

Query: 61  SKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNE 120
            KAT DQQGQV+GD+VVDSKIL YCSIDKKEK+S+GE+EQ+FLQALQ+FYYEGKA+MSNE
Sbjct: 65  PKATADQQGQVQGDDVVDSKILPYCSIDKKEKRSIGEMEQDFLQALQSFYYEGKAIMSNE 124

Query: 121 EFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIV 180
           EFDNLKEELMWEGSSVVMLSS EQKFLEASMAYVAG PIMSDEE+D LK KLK+EGSEIV
Sbjct: 125 EFDNLKEELMWEGSSVVMLSSDEQKFLEASMAYVAGNPIMSDEEFDDLKIKLKIEGSEIV 184

Query: 181 VEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPF 240
           VEGPRCSLRS+KVYSDLSVDYLKM LLNVPA VVAL LFFFLDD+TGFEIT+LLEL EPF
Sbjct: 185 VEGPRCSLRSKKVYSDLSVDYLKMFLLNVPAAVVALSLFFFLDDVTGFEITFLLELQEPF 244

Query: 241 SFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNT 300
           SFIFTWFAAVPLI +LSQ++TK I+++ LILKGPCPNCGTENVSFFGTILSIS+GGTTN 
Sbjct: 245 SFIFTWFAAVPLIFWLSQTITKFILKDFLILKGPCPNCGTENVSFFGTILSISNGGTTNN 304

Query: 301 INCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           + CSN      C T MVYDS TRLITLPEGSEA
Sbjct: 305 LKCSN------CETQMVYDSKTRLITLPEGSEA 331


>gi|297742627|emb|CBI34776.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score =  491 bits (1264), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 247/333 (74%), Positives = 277/333 (83%), Gaps = 7/333 (2%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLP 60
           MA K+    T  A R+FA   +KP  + A   S+S      + Q  GRQ  + RR F++ 
Sbjct: 1   MATKLGFALT-TAPRLFASSSRKPLISPASLSSASSCSRVESFQLVGRQLRLSRRLFIIS 59

Query: 61  SKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNE 120
            KAT DQQGQV+GD+VVDSKIL YCSIDKKEK+S+GE+EQ+FLQALQ+FYYEGKA+MSNE
Sbjct: 60  PKATADQQGQVQGDDVVDSKILPYCSIDKKEKRSIGEMEQDFLQALQSFYYEGKAIMSNE 119

Query: 121 EFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIV 180
           EFDNLKEELMWEGSSVVMLSS EQKFLEASMAYVAG PIMSDEE+D LK KLK+EGSEIV
Sbjct: 120 EFDNLKEELMWEGSSVVMLSSDEQKFLEASMAYVAGNPIMSDEEFDDLKIKLKIEGSEIV 179

Query: 181 VEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPF 240
           VEGPRCSLRS+KVYSDLSVDYLKM LLNVPA VVAL LFFFLDD+TGFEIT+LLEL EPF
Sbjct: 180 VEGPRCSLRSKKVYSDLSVDYLKMFLLNVPAAVVALSLFFFLDDVTGFEITFLLELQEPF 239

Query: 241 SFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNT 300
           SFIFTWFAAVPLI +LSQ++TK I+++ LILKGPCPNCGTENVSFFGTILSIS+GGTTN 
Sbjct: 240 SFIFTWFAAVPLIFWLSQTITKFILKDFLILKGPCPNCGTENVSFFGTILSISNGGTTNN 299

Query: 301 INCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           + CSN      C T MVYDS TRLITLPEGSEA
Sbjct: 300 LKCSN------CETQMVYDSKTRLITLPEGSEA 326


>gi|224071670|ref|XP_002303553.1| predicted protein [Populus trichocarpa]
 gi|222840985|gb|EEE78532.1| predicted protein [Populus trichocarpa]
          Length = 327

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 248/335 (74%), Positives = 281/335 (83%), Gaps = 10/335 (2%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPA-AALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVL 59
           MA K+A  F   + RVF  P+QKP  ++ +  PS S    ++  Q NG+QF++R R   L
Sbjct: 1   MASKLA--FNLTSPRVFTAPIQKPIISSSSSLPSLSSPSCSTRVQLNGKQFSLRGRMLFL 58

Query: 60  PSKATTDQQ-GQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMS 118
           P+KAT DQQ  QV+ D++ D KILQYCSIDKK KKSLGE+EQ+FLQALQAFYYEGKA+MS
Sbjct: 59  PTKATADQQTDQVQEDDMDDGKILQYCSIDKKGKKSLGEMEQDFLQALQAFYYEGKAIMS 118

Query: 119 NEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSE 178
           NEEFDNLKEELMW+GSSVVMLSS EQKFLEAS+AYV+G PIM+DEE+DKLK KLK EGSE
Sbjct: 119 NEEFDNLKEELMWQGSSVVMLSSDEQKFLEASLAYVSGNPIMNDEEFDKLKIKLKTEGSE 178

Query: 179 IVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPE 238
           IVVEGPRCSLRSRKVYSDLSVDYLKM LLNVPATVVALGLFFFLDD+TGFEITYLLELPE
Sbjct: 179 IVVEGPRCSLRSRKVYSDLSVDYLKMFLLNVPATVVALGLFFFLDDLTGFEITYLLELPE 238

Query: 239 PFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTT 298
           PFSF+FTWFAAVPLIV+L+ +LT  IV++ LILKGPCPNCGTEN SFFGTILSISSGG +
Sbjct: 239 PFSFLFTWFAAVPLIVWLALTLTNAIVKDFLILKGPCPNCGTENGSFFGTILSISSGGAS 298

Query: 299 NTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           NT+ CSN      C T +VYDS TRLITLPEGSEA
Sbjct: 299 NTLKCSN------CSTELVYDSKTRLITLPEGSEA 327


>gi|118487360|gb|ABK95508.1| unknown [Populus trichocarpa]
          Length = 323

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 249/334 (74%), Positives = 282/334 (84%), Gaps = 12/334 (3%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLP 60
           MA K+A  F   + RVF  P+QKP  + +   SS  + S++  Q NG+QF++R R   LP
Sbjct: 1   MASKLA--FNLTSPRVFTAPIQKPIISSS---SSLPSLSSTRVQLNGKQFSLRGRMLFLP 55

Query: 61  SKATTDQQ-GQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSN 119
           +KAT DQQ  QV+ D++ D KILQYCSIDKK KKSLGE+EQ+FLQALQAFYYEGKA+MSN
Sbjct: 56  TKATADQQTDQVQEDDMDDGKILQYCSIDKKGKKSLGEMEQDFLQALQAFYYEGKAIMSN 115

Query: 120 EEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEI 179
           EEFDNLKEELMW+GSSVVMLSS EQKFLEAS+AYV+G PIM+DEE+DKLK KLK EGSEI
Sbjct: 116 EEFDNLKEELMWQGSSVVMLSSDEQKFLEASLAYVSGNPIMNDEEFDKLKIKLKTEGSEI 175

Query: 180 VVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEP 239
           VVEGPRCSLRSRKVYSDLSVDYLKM LLNVPATVVALGLFFFLDD+TGFEITYLLELPEP
Sbjct: 176 VVEGPRCSLRSRKVYSDLSVDYLKMFLLNVPATVVALGLFFFLDDLTGFEITYLLELPEP 235

Query: 240 FSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTN 299
           FSF+FTWFAAVPLIV+L+ +LT  IV++ LILKGPCPNCGTEN SFFGTILSISSGGT+N
Sbjct: 236 FSFLFTWFAAVPLIVWLALTLTNAIVKDFLILKGPCPNCGTENGSFFGTILSISSGGTSN 295

Query: 300 TINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           T+ CSN      C T +VYDS TRLITLPEGSEA
Sbjct: 296 TLKCSN------CSTELVYDSKTRLITLPEGSEA 323


>gi|356514103|ref|XP_003525746.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Glycine max]
          Length = 321

 Score =  486 bits (1252), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 245/329 (74%), Positives = 270/329 (82%), Gaps = 13/329 (3%)

Query: 9   FTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQ 68
           FT         P +   A L P  S+S  H     QFNGR   +R+R F+L   AT DQQ
Sbjct: 2   FTASKLAFTLTPPRPCTAPLTPISSASGVHL---IQFNGRHLCLRQRLFLLSPMATADQQ 58

Query: 69  GQVEGDE----VVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDN 124
           G+VE  E     VDSKIL YCSIDKKEKKS+GELEQEFLQALQAFYYEGKA+MSNEEFDN
Sbjct: 59  GKVEEFEDDSNTVDSKILPYCSIDKKEKKSIGELEQEFLQALQAFYYEGKAIMSNEEFDN 118

Query: 125 LKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGP 184
           LKEELMWEGSSVVMLSS EQKFLEASMAYV+GKPIMSD+E+D+LK +LK+EGSEIV EGP
Sbjct: 119 LKEELMWEGSSVVMLSSDEQKFLEASMAYVSGKPIMSDKEFDELKLRLKIEGSEIVAEGP 178

Query: 185 RCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIF 244
           RCSLRSRKVYSDLSVDYLKMLLLNVPATV+ALGLFFFLDD+TGFEITYLLELPEPFSFIF
Sbjct: 179 RCSLRSRKVYSDLSVDYLKMLLLNVPATVIALGLFFFLDDLTGFEITYLLELPEPFSFIF 238

Query: 245 TWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCS 304
           TWFAAVPLIV+++ SLT  IV++ +ILKGPCPNCGTEN SFFGTILSIS+GG+TNT+ CS
Sbjct: 239 TWFAAVPLIVWIALSLTNAIVKDFVILKGPCPNCGTENTSFFGTILSISNGGSTNTVKCS 298

Query: 305 NLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           N      CGT M YDS TRLITLPEGS A
Sbjct: 299 N------CGTAMEYDSTTRLITLPEGSNA 321


>gi|255645443|gb|ACU23217.1| unknown [Glycine max]
          Length = 321

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 244/329 (74%), Positives = 269/329 (81%), Gaps = 13/329 (3%)

Query: 9   FTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQ 68
           FT         P +   A L P  S+S  H     QFNGR   +R+R F+L   AT DQQ
Sbjct: 2   FTASKLAFTLTPPRPCTAPLTPISSASGVHL---IQFNGRHLCLRQRLFLLSPMATADQQ 58

Query: 69  GQVEGDE----VVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDN 124
           G+VE  E     VDSKIL YCSIDKKEKKS+GELEQEFLQALQAFYYEGKA+MSNEEFDN
Sbjct: 59  GKVEEFEDDSNTVDSKILPYCSIDKKEKKSIGELEQEFLQALQAFYYEGKAIMSNEEFDN 118

Query: 125 LKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGP 184
           LKEELMWEGSSVVMLSS EQKFLEASMAYV+GKPIMSD+E+D+LK +LK+EGSEIV EGP
Sbjct: 119 LKEELMWEGSSVVMLSSDEQKFLEASMAYVSGKPIMSDKEFDELKLRLKIEGSEIVAEGP 178

Query: 185 RCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIF 244
           RCSLRSRKVYSDLSVDYLKMLLLNVPATV+ALGLFFFLDD+TGFEITYLLELPEPFSFIF
Sbjct: 179 RCSLRSRKVYSDLSVDYLKMLLLNVPATVIALGLFFFLDDLTGFEITYLLELPEPFSFIF 238

Query: 245 TWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCS 304
           TWF AVPLIV+++ SLT  IV++ +ILKGPCPNCGTEN SFFGTILSIS+GG+TNT+ CS
Sbjct: 239 TWFVAVPLIVWIALSLTNAIVKDFVILKGPCPNCGTENTSFFGTILSISNGGSTNTVKCS 298

Query: 305 NLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           N      CGT M YDS TRLITLPEGS A
Sbjct: 299 N------CGTAMEYDSTTRLITLPEGSNA 321


>gi|224058627|ref|XP_002299573.1| predicted protein [Populus trichocarpa]
 gi|222846831|gb|EEE84378.1| predicted protein [Populus trichocarpa]
          Length = 327

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 243/338 (71%), Positives = 275/338 (81%), Gaps = 16/338 (4%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLP 60
           MAGK+A  FT  + RV   P+QKP  + +         S++   FN +QF++RRR  + P
Sbjct: 1   MAGKLA--FTLTSPRVLIAPIQKPFISSSSSLPPPSCSSSTRVHFNVKQFSLRRRMLLPP 58

Query: 61  SKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNE 120
           +KAT DQQ +   DE+VD KILQYCSIDK+ KKS+GE+EQEFLQALQAFYYEGKA+MSNE
Sbjct: 59  TKATADQQAE---DEMVDGKILQYCSIDKRGKKSIGEMEQEFLQALQAFYYEGKAIMSNE 115

Query: 121 EFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEG---- 176
           EFDNLKEELMWEGSSVVMLSS EQKFLEASMAYV+  PIMSD+EYD+LK KLK+      
Sbjct: 116 EFDNLKEELMWEGSSVVMLSSDEQKFLEASMAYVSENPIMSDKEYDELKMKLKVSAWLFS 175

Query: 177 -SEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLE 235
            SEIVVEGPRCSLRSRKVYSDL VDYLKM LLNVPATV ALGLFFFLDD+TGFEITYLLE
Sbjct: 176 RSEIVVEGPRCSLRSRKVYSDLYVDYLKMFLLNVPATVTALGLFFFLDDLTGFEITYLLE 235

Query: 236 LPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSG 295
           LPEPFSF+FTWFAAVPLIV+L+ +LT  IV++ LILKGPCPNCGTEN SFFGTILSISSG
Sbjct: 236 LPEPFSFLFTWFAAVPLIVWLALTLTNAIVKDFLILKGPCPNCGTENGSFFGTILSISSG 295

Query: 296 GTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           GTTNT+ CSN      C T +VY+S TRLITLPEGSEA
Sbjct: 296 GTTNTLKCSN------CSTELVYNSKTRLITLPEGSEA 327


>gi|449460457|ref|XP_004147962.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Cucumis
           sativus]
 gi|449494267|ref|XP_004159497.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Cucumis
           sativus]
          Length = 321

 Score =  472 bits (1215), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 235/309 (76%), Positives = 263/309 (85%), Gaps = 9/309 (2%)

Query: 22  QKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKI 81
           QKPAA+L     SS   S+   + + R+  +RR+  + P +ATTDQ GQVEGDEVVDS +
Sbjct: 22  QKPAASLL---PSSSRSSSQLIRVSSRRLFLRRKLVLPPPRATTDQPGQVEGDEVVDSNV 78

Query: 82  LQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSS 141
           LQYCSIDKKEKK++GELEQEFLQALQAFYYEGKA+MSNEEFDNLKEELMWEGSSVVMLSS
Sbjct: 79  LQYCSIDKKEKKTIGELEQEFLQALQAFYYEGKAIMSNEEFDNLKEELMWEGSSVVMLSS 138

Query: 142 AEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDY 201
            EQKFLEASMAYV+G PI+SD+E+D LK +LK EGSEIVVEGPRCSLRSRKVYSDLSVDY
Sbjct: 139 DEQKFLEASMAYVSGNPILSDKEFDDLKLRLKTEGSEIVVEGPRCSLRSRKVYSDLSVDY 198

Query: 202 LKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLT 261
            KM LLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVP +V+L+ +LT
Sbjct: 199 FKMFLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPFLVWLALTLT 258

Query: 262 KLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSN 321
             +VR+SLILKGPCPNCGTENVSFFGTILS+SSGG  N I C+N      C T +VY+S 
Sbjct: 259 NAVVRDSLILKGPCPNCGTENVSFFGTILSVSSGGNKNNIKCTN------CATELVYNSK 312

Query: 322 TRLITLPEG 330
           TRLITLPE 
Sbjct: 313 TRLITLPEA 321


>gi|18416029|ref|NP_567672.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|30685973|ref|NP_849422.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|30685977|ref|NP_849423.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|75151862|sp|Q8H112.1|PGL1A_ARATH RecName: Full=PGR5-like protein 1A, chloroplastic; Flags: Precursor
 gi|24030260|gb|AAN41305.1| unknown protein [Arabidopsis thaliana]
 gi|332659274|gb|AEE84674.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|332659275|gb|AEE84675.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|332659276|gb|AEE84676.1| PGR5-like protein 1A [Arabidopsis thaliana]
          Length = 324

 Score =  470 bits (1209), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 222/287 (77%), Positives = 258/287 (89%), Gaps = 7/287 (2%)

Query: 47  GRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQAL 106
           G+  ++RRR F+LP+KATT+Q G V GD V DS +L YCSI+K EKK++GE+EQEFLQAL
Sbjct: 45  GKSISLRRRVFLLPAKATTEQSGPVGGDNV-DSNVLPYCSINKAEKKTIGEMEQEFLQAL 103

Query: 107 QAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYD 166
           Q+FYY+GKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMAYV+G PI++DEEYD
Sbjct: 104 QSFYYDGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMAYVSGNPILNDEEYD 163

Query: 167 KLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDIT 226
           KLK KLK++GS+IV EGPRCSLRS+KVYSDL+VDY KMLLLNVPATVVALGLFFFLDDIT
Sbjct: 164 KLKLKLKIDGSDIVSEGPRCSLRSKKVYSDLAVDYFKMLLLNVPATVVALGLFFFLDDIT 223

Query: 227 GFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFF 286
           GFEITY++ELPEP+SFIFTWFAAVP+IVYL+ S+TKLI+++ LILKGPCPNCGTEN SFF
Sbjct: 224 GFEITYIMELPEPYSFIFTWFAAVPVIVYLALSITKLIIKDFLILKGPCPNCGTENTSFF 283

Query: 287 GTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           GTILSISSGG TNT+ C+N      CGT MVYDS +RLITLPEGS+A
Sbjct: 284 GTILSISSGGKTNTVKCTN------CGTAMVYDSGSRLITLPEGSQA 324


>gi|357481805|ref|XP_003611188.1| PGR5-like protein 1A [Medicago truncatula]
 gi|355512523|gb|AES94146.1| PGR5-like protein 1A [Medicago truncatula]
          Length = 326

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 227/298 (76%), Positives = 257/298 (86%), Gaps = 8/298 (2%)

Query: 44  QFN--GRQFTVRRRSFVLPSKATTDQQ-GQVEGDEVVDSKILQYCSIDKKEKKSLGELEQ 100
            FN  GR  ++RR  F+L  KAT+DQQ G+VE D VVDSKILQYCSID+KEKKSLGELEQ
Sbjct: 29  HFNAGGRHSSLRRGLFLLSPKATSDQQAGKVEEDAVVDSKILQYCSIDQKEKKSLGELEQ 88

Query: 101 EFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIM 160
           +FLQALQAFYYEGKA MSNEEFDNLKEELMWEGSSVVMLSS+EQKFLEAS+AYV+GKP+M
Sbjct: 89  DFLQALQAFYYEGKATMSNEEFDNLKEELMWEGSSVVMLSSSEQKFLEASIAYVSGKPLM 148

Query: 161 SDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFF 220
           SD+E+D+LK KLKMEGSEIV EGPRCSLRSRKVYSDL+VDYLK LLL VPATV+ALGLFF
Sbjct: 149 SDKEFDELKLKLKMEGSEIVAEGPRCSLRSRKVYSDLTVDYLKALLLKVPATVIALGLFF 208

Query: 221 FLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGT 280
           FLDD+TGFEI YL+ +PEPFSFI TWFAAVP I++ +QS+T  I+++ LILKGPCPNCGT
Sbjct: 209 FLDDVTGFEINYLIAIPEPFSFILTWFAAVPFILWFAQSITNAIIKDFLILKGPCPNCGT 268

Query: 281 ENVSFFGTILSISSGGTTNTINC-----SNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           EN SFFGTILSISSG +TN + C     +N   C SCGT M+YDS TRLITLPEGS A
Sbjct: 269 ENTSFFGTILSISSGDSTNKVKCEKYKYNNPNLCLSCGTAMLYDSTTRLITLPEGSNA 326


>gi|145333783|ref|NP_001078431.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|332659277|gb|AEE84677.1| PGR5-like protein 1A [Arabidopsis thaliana]
          Length = 321

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 221/287 (77%), Positives = 257/287 (89%), Gaps = 10/287 (3%)

Query: 47  GRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQAL 106
           G+  ++RRR F+LP+KATT+Q G   GD V DS +L YCSI+K EKK++GE+EQEFLQAL
Sbjct: 45  GKSISLRRRVFLLPAKATTEQSG---GDNV-DSNVLPYCSINKAEKKTIGEMEQEFLQAL 100

Query: 107 QAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYD 166
           Q+FYY+GKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMAYV+G PI++DEEYD
Sbjct: 101 QSFYYDGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMAYVSGNPILNDEEYD 160

Query: 167 KLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDIT 226
           KLK KLK++GS+IV EGPRCSLRS+KVYSDL+VDY KMLLLNVPATVVALGLFFFLDDIT
Sbjct: 161 KLKLKLKIDGSDIVSEGPRCSLRSKKVYSDLAVDYFKMLLLNVPATVVALGLFFFLDDIT 220

Query: 227 GFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFF 286
           GFEITY++ELPEP+SFIFTWFAAVP+IVYL+ S+TKLI+++ LILKGPCPNCGTEN SFF
Sbjct: 221 GFEITYIMELPEPYSFIFTWFAAVPVIVYLALSITKLIIKDFLILKGPCPNCGTENTSFF 280

Query: 287 GTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           GTILSISSGG TNT+ C+N      CGT MVYDS +RLITLPEGS+A
Sbjct: 281 GTILSISSGGKTNTVKCTN------CGTAMVYDSGSRLITLPEGSQA 321


>gi|145333797|ref|NP_001078432.1| PGR5-like protein 1A [Arabidopsis thaliana]
 gi|332659278|gb|AEE84678.1| PGR5-like protein 1A [Arabidopsis thaliana]
          Length = 322

 Score =  463 bits (1192), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 221/287 (77%), Positives = 257/287 (89%), Gaps = 9/287 (3%)

Query: 47  GRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQAL 106
           G+  ++RRR F+LP+KATT+Q   V GD V DS +L YCSI+K EKK++GE+EQEFLQAL
Sbjct: 45  GKSISLRRRVFLLPAKATTEQS--VGGDNV-DSNVLPYCSINKAEKKTIGEMEQEFLQAL 101

Query: 107 QAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYD 166
           Q+FYY+GKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMAYV+G PI++DEEYD
Sbjct: 102 QSFYYDGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMAYVSGNPILNDEEYD 161

Query: 167 KLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDIT 226
           KLK KLK++GS+IV EGPRCSLRS+KVYSDL+VDY KMLLLNVPATVVALGLFFFLDDIT
Sbjct: 162 KLKLKLKIDGSDIVSEGPRCSLRSKKVYSDLAVDYFKMLLLNVPATVVALGLFFFLDDIT 221

Query: 227 GFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFF 286
           GFEITY++ELPEP+SFIFTWFAAVP+IVYL+ S+TKLI+++ LILKGPCPNCGTEN SFF
Sbjct: 222 GFEITYIMELPEPYSFIFTWFAAVPVIVYLALSITKLIIKDFLILKGPCPNCGTENTSFF 281

Query: 287 GTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           GTILSISSGG TNT+ C+N      CGT MVYDS +RLITLPEGS+A
Sbjct: 282 GTILSISSGGKTNTVKCTN------CGTAMVYDSGSRLITLPEGSQA 322


>gi|356497195|ref|XP_003517448.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Glycine max]
          Length = 316

 Score =  463 bits (1191), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 222/290 (76%), Positives = 250/290 (86%), Gaps = 6/290 (2%)

Query: 44  QFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFL 103
           QFNGR   +R R F+    AT DQ    E   VVDSKILQYCSIDKKEKKS+GE+EQEFL
Sbjct: 33  QFNGRHICLRPRLFLFSPMATADQDKVEEDAAVVDSKILQYCSIDKKEKKSVGEMEQEFL 92

Query: 104 QALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDE 163
           QALQAFYYEGKA+MSNEEFDNLKEELMWEGS+VVMLSS EQKFLEASMAYV+GKPI+SD+
Sbjct: 93  QALQAFYYEGKAIMSNEEFDNLKEELMWEGSTVVMLSSDEQKFLEASMAYVSGKPILSDK 152

Query: 164 EYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLD 223
           E+D+LK +LKMEGSEIV EGPRCSLRSRKVYSDLSVDYLKM LLNVPATVVALGLFFFLD
Sbjct: 153 EFDELKLRLKMEGSEIVAEGPRCSLRSRKVYSDLSVDYLKMFLLNVPATVVALGLFFFLD 212

Query: 224 DITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENV 283
           D+TGFEI+YL+++PEPFSFI TWFAA+P I++L+QS+T+ IV++ LILKGPCPNCGTEN 
Sbjct: 213 DVTGFEISYLIKIPEPFSFILTWFAAIPFILWLAQSITRAIVQDFLILKGPCPNCGTENT 272

Query: 284 SFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           SFFGTILS+SSG +TN + C N      C T MVYDS TRLITLPEGS A
Sbjct: 273 SFFGTILSVSSGDSTNKVKCEN------CETKMVYDSKTRLITLPEGSNA 316


>gi|297803832|ref|XP_002869800.1| hypothetical protein ARALYDRAFT_492575 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315636|gb|EFH46059.1| hypothetical protein ARALYDRAFT_492575 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 320

 Score =  463 bits (1191), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 228/325 (70%), Positives = 277/325 (85%), Gaps = 11/325 (3%)

Query: 13  AQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFN----GRQFTVRRRSFVLPSKATTDQQ 68
           ++ +F++   +  +A++  P+SS + S+S TQ+      +  ++RRR F+LP+KATT+Q 
Sbjct: 3   SKMLFSLTSPRLFSAVSLKPTSSFSPSSSRTQWAQLTPAKSISLRRRVFLLPAKATTEQS 62

Query: 69  GQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEE 128
           G   GD V DS +L YCSI+K EKK++GE+EQEFLQA+Q+FYY+GKA+MSNEEFDNLK+E
Sbjct: 63  GPGGGDNV-DSNVLPYCSINKAEKKTIGEMEQEFLQAMQSFYYDGKAIMSNEEFDNLKDE 121

Query: 129 LMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSL 188
           LMWEGSSVVMLSS EQ+FLEASMAYV+GKPI++DEEYDKLK KLK++GSEIV EGPRCSL
Sbjct: 122 LMWEGSSVVMLSSDEQRFLEASMAYVSGKPILNDEEYDKLKLKLKIDGSEIVCEGPRCSL 181

Query: 189 RSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFA 248
           RS+KVYSDL+VDY KMLLLNVPATVVALGLFFFLDDITGFEITY++EL EP+SFIFTWFA
Sbjct: 182 RSKKVYSDLAVDYFKMLLLNVPATVVALGLFFFLDDITGFEITYIMELSEPYSFIFTWFA 241

Query: 249 AVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTF 308
           AVP+IVYL+ S+TKLI+++ LILKGPCPNCGTEN SFFGTILSISSGG TN + CSN   
Sbjct: 242 AVPVIVYLALSITKLIIKDFLILKGPCPNCGTENTSFFGTILSISSGGKTNAVKCSN--- 298

Query: 309 CFSCGTTMVYDSNTRLITLPEGSEA 333
              CGT MVYDS +RLITLPEGS+A
Sbjct: 299 ---CGTAMVYDSGSRLITLPEGSQA 320


>gi|356563242|ref|XP_003549873.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Glycine max]
          Length = 332

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 235/294 (79%), Positives = 258/294 (87%), Gaps = 10/294 (3%)

Query: 44  QFNGRQFTVRRRSFVLPSKATTDQQGQVEGDE----VVDSKILQYCSIDKKEKKSLGELE 99
           QFNGR   +RRR F+L  KAT DQQG+VE  E    VVDSKIL YCSIDKKEKKS+GELE
Sbjct: 45  QFNGRHLCLRRRLFLLSPKATADQQGKVEEFEDDSNVVDSKILPYCSIDKKEKKSIGELE 104

Query: 100 QEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPI 159
           QEFLQALQAFYYEGKA+MSNEEFDNLKEELMWEGSSVVMLSS EQKFLEASMAYV+G PI
Sbjct: 105 QEFLQALQAFYYEGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQKFLEASMAYVSGNPI 164

Query: 160 MSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLF 219
           MSD+E+D+LK +LK EGSEIV EGPRCSLRS+KVYSDLSVDYLKM LLNVPATV+ALGLF
Sbjct: 165 MSDKEFDELKLRLKKEGSEIVAEGPRCSLRSKKVYSDLSVDYLKMFLLNVPATVIALGLF 224

Query: 220 FFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCG 279
           FFLDD+TGFEITYLLELPEPFSFIFTWFAAVPLIV+++ SLT  IV++ +ILKGPCPNCG
Sbjct: 225 FFLDDLTGFEITYLLELPEPFSFIFTWFAAVPLIVWIALSLTNAIVKDFVILKGPCPNCG 284

Query: 280 TENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           TEN SFFGTILSIS+GG+TNT+ CSN      CGT M YDS +RLITLPEGS A
Sbjct: 285 TENTSFFGTILSISNGGSTNTVKCSN------CGTAMEYDSTSRLITLPEGSNA 332


>gi|388519373|gb|AFK47748.1| unknown [Lotus japonicus]
          Length = 327

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 231/292 (79%), Positives = 258/292 (88%), Gaps = 8/292 (2%)

Query: 44  QFNGRQFTVRRRSFVLPSKATTDQQGQVEGDE--VVDSKILQYCSIDKKEKKSLGELEQE 101
           QFNGRQ  +RRR F+L  KAT DQQG+VE  E   VDSKI  YCSIDKK+KKS+GE+EQ+
Sbjct: 42  QFNGRQLCLRRRLFLLSPKATADQQGKVEEFEESAVDSKIFPYCSIDKKDKKSVGEMEQD 101

Query: 102 FLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMS 161
           FLQALQ+FYYEGKA+MSNEEFDNLKEELMWEGSSVVMLSS EQKFLEASMAYV+GKPI+S
Sbjct: 102 FLQALQSFYYEGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQKFLEASMAYVSGKPILS 161

Query: 162 DEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFF 221
           D+EYD LK +LKM GSEIVVEGPRCSLRSRKVYSDLSVDYLKM LLNVPATV+ALGLFFF
Sbjct: 162 DKEYDDLKMRLKMNGSEIVVEGPRCSLRSRKVYSDLSVDYLKMFLLNVPATVIALGLFFF 221

Query: 222 LDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTE 281
           LDD+TGFEITYLLELPEPFSFIFTWFAAVPLIV+L+ ++T +I+++ LILKGPCPNCGTE
Sbjct: 222 LDDLTGFEITYLLELPEPFSFIFTWFAAVPLIVWLALTITNVIIKDFLILKGPCPNCGTE 281

Query: 282 NVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           N SFFGTILSI+SGG++N + CSN      C T MVYDS TRLITLPEGS A
Sbjct: 282 NTSFFGTILSIASGGSSNKVKCSN------CATEMVYDSATRLITLPEGSNA 327


>gi|388506422|gb|AFK41277.1| unknown [Medicago truncatula]
          Length = 315

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 223/293 (76%), Positives = 252/293 (86%), Gaps = 9/293 (3%)

Query: 44  QFN--GRQFTVRRRSFVLPSKATTDQQ-GQVEGDEVVDSKILQYCSIDKKEKKSLGELEQ 100
            FN  GR  ++RR  F+L  KAT+DQQ G+VE D VV SKILQYCSID+KEKKSLGELEQ
Sbjct: 29  HFNAGGRHSSLRRGLFLLSPKATSDQQAGKVEEDAVVYSKILQYCSIDQKEKKSLGELEQ 88

Query: 101 EFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIM 160
           +FLQALQA YYEGKA MSNEEFDNLKEELMWEGSSVVMLSS+EQKFLEAS+AYV+GKP+M
Sbjct: 89  DFLQALQASYYEGKATMSNEEFDNLKEELMWEGSSVVMLSSSEQKFLEASIAYVSGKPLM 148

Query: 161 SDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFF 220
           SD+E+D+LK KLKMEGSEIV EGPRCSLRSRKVYSDL+VDYLK LLL VPATV+ALGLFF
Sbjct: 149 SDKEFDELKLKLKMEGSEIVAEGPRCSLRSRKVYSDLTVDYLKALLLKVPATVIALGLFF 208

Query: 221 FLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGT 280
           FLDD+TGFEI YL+ +PEPFSFI TWFAAVP I++ +QS+T  I+++ LILKGPCPNCGT
Sbjct: 209 FLDDVTGFEINYLIAIPEPFSFILTWFAAVPFILWFAQSITNAIIKDFLILKGPCPNCGT 268

Query: 281 ENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           EN SFFGTILSISSG +TN + C N      CGT M+YDS TRLITLPEGS A
Sbjct: 269 ENPSFFGTILSISSGDSTNKVKCEN------CGTAMLYDSTTRLITLPEGSNA 315


>gi|3292814|emb|CAA19804.1| putative protein [Arabidopsis thaliana]
 gi|7269136|emb|CAB79244.1| putative protein [Arabidopsis thaliana]
          Length = 340

 Score =  452 bits (1163), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 221/305 (72%), Positives = 257/305 (84%), Gaps = 27/305 (8%)

Query: 47  GRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQAL 106
           G+  ++RRR F+LP+KATT+Q   V GD V DS +L YCSI+K EKK++GE+EQEFLQAL
Sbjct: 45  GKSISLRRRVFLLPAKATTEQS--VGGDNV-DSNVLPYCSINKAEKKTIGEMEQEFLQAL 101

Query: 107 QAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYD 166
           Q+FYY+GKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMAYV+G PI++DEEYD
Sbjct: 102 QSFYYDGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMAYVSGNPILNDEEYD 161

Query: 167 KLKQKLK------------------MEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLN 208
           KLK KLK                  ++GS+IV EGPRCSLRS+KVYSDL+VDY KMLLLN
Sbjct: 162 KLKLKLKVSIHFLCFSENRHTLIRLIDGSDIVSEGPRCSLRSKKVYSDLAVDYFKMLLLN 221

Query: 209 VPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRES 268
           VPATVVALGLFFFLDDITGFEITY++ELPEP+SFIFTWFAAVP+IVYL+ S+TKLI+++ 
Sbjct: 222 VPATVVALGLFFFLDDITGFEITYIMELPEPYSFIFTWFAAVPVIVYLALSITKLIIKDF 281

Query: 269 LILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLP 328
           LILKGPCPNCGTEN SFFGTILSISSGG TNT+ C+N      CGT MVYDS +RLITLP
Sbjct: 282 LILKGPCPNCGTENTSFFGTILSISSGGKTNTVKCTN------CGTAMVYDSGSRLITLP 335

Query: 329 EGSEA 333
           EGS+A
Sbjct: 336 EGSQA 340


>gi|42566515|ref|NP_192933.2| PGR5-like protein 1B [Arabidopsis thaliana]
 gi|75151402|sp|Q8GYC7.1|PGL1B_ARATH RecName: Full=PGR5-like protein 1B, chloroplastic; Flags: Precursor
 gi|26450517|dbj|BAC42372.1| unknown protein [Arabidopsis thaliana]
 gi|28950891|gb|AAO63369.1| At4g11960 [Arabidopsis thaliana]
 gi|332657674|gb|AEE83074.1| PGR5-like protein 1B [Arabidopsis thaliana]
          Length = 313

 Score =  450 bits (1158), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 224/288 (77%), Positives = 253/288 (87%), Gaps = 7/288 (2%)

Query: 46  NGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQA 105
           +GR  ++RRR  +LP KA+TDQ GQV G+EV DSKIL YCSI+K EK+++GE+EQEFLQA
Sbjct: 33  HGRSISLRRRLTLLPLKASTDQSGQVGGEEV-DSKILPYCSINKNEKRTIGEMEQEFLQA 91

Query: 106 LQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEY 165
           +Q+FYYEGKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMAYV+G PI+SDEEY
Sbjct: 92  MQSFYYEGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMAYVSGNPILSDEEY 151

Query: 166 DKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDI 225
           DKLK KLKM+GSEIV EGPRCSLRS+KVYSDL++DY KM LLNVPATVVALGLFFFLDDI
Sbjct: 152 DKLKMKLKMDGSEIVCEGPRCSLRSKKVYSDLAIDYFKMFLLNVPATVVALGLFFFLDDI 211

Query: 226 TGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
           TGFEITYLLELPEPFSFIFTWFAAVP IVYL+ SLTKLI+++ LILKGPCPNCGTENVSF
Sbjct: 212 TGFEITYLLELPEPFSFIFTWFAAVPAIVYLALSLTKLILKDFLILKGPCPNCGTENVSF 271

Query: 286 FGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           FGTILSI +   TN + CS       CGT MVYDS +RLITLPEG +A
Sbjct: 272 FGTILSIPNDSNTNNVKCSG------CGTEMVYDSGSRLITLPEGGKA 313


>gi|297813705|ref|XP_002874736.1| hypothetical protein ARALYDRAFT_490014 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320573|gb|EFH50995.1| hypothetical protein ARALYDRAFT_490014 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 310

 Score =  449 bits (1155), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 226/301 (75%), Positives = 257/301 (85%), Gaps = 12/301 (3%)

Query: 33  SSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEK 92
           SS  AH       +GR  ++RRR  +LP KA+TDQ GQV G+EV DSKIL YCSI+K EK
Sbjct: 22  SSKTAHFT-----HGRSISLRRRLLLLPLKASTDQSGQVGGEEV-DSKILPYCSINKSEK 75

Query: 93  KSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMA 152
           +++GE+EQEFLQA+Q+FYYEGKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMA
Sbjct: 76  RTIGEMEQEFLQAMQSFYYEGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMA 135

Query: 153 YVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPAT 212
           YV+G PI+SDEEYDKLK KLKM+GSEIV EGPRCSLRS+KVYSDL++DY KM LLNVPAT
Sbjct: 136 YVSGNPILSDEEYDKLKMKLKMDGSEIVCEGPRCSLRSKKVYSDLAIDYFKMFLLNVPAT 195

Query: 213 VVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILK 272
           VVALGLFFFLDDITGFEIT+LLELPEPFSFIFTWFAAVP IVYL+ SLTKLI+++ LILK
Sbjct: 196 VVALGLFFFLDDITGFEITHLLELPEPFSFIFTWFAAVPAIVYLALSLTKLILKDFLILK 255

Query: 273 GPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSE 332
           GPCPNCGTENVSFFGTILSI +   TN + CS       CGT M+YDS +RLITLPEG +
Sbjct: 256 GPCPNCGTENVSFFGTILSIPNDSNTNNVKCSG------CGTEMIYDSGSRLITLPEGGK 309

Query: 333 A 333
           A
Sbjct: 310 A 310


>gi|388503800|gb|AFK39966.1| unknown [Lotus japonicus]
          Length = 309

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 231/335 (68%), Positives = 262/335 (78%), Gaps = 28/335 (8%)

Query: 1   MAGKVASGFTPQAQRVFAVPVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLP 60
           MA K+AS  T   QR F    Q P +++    S+ RAH     QFNGR+    RR  +L 
Sbjct: 1   MASKLASTLT--YQRPFTATTQ-PTSSI----STPRAHL---VQFNGRRNVCLRRRRLLL 50

Query: 61  S--KATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMS 118
              KA+ DQQG+VE D  VDS          KEKKS+GELEQ+FLQALQAFYYEGKA+MS
Sbjct: 51  LSPKASADQQGKVEKD-AVDSI---------KEKKSIGELEQDFLQALQAFYYEGKAIMS 100

Query: 119 NEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSE 178
           NEEFDNLKEELMWEGSSVVMLSS EQKFLEA+M YV+GKPIMSD+EYD+LK +LKMEGSE
Sbjct: 101 NEEFDNLKEELMWEGSSVVMLSSDEQKFLEAAMGYVSGKPIMSDKEYDELKLRLKMEGSE 160

Query: 179 IVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPE 238
           IV EGPRCS+RSRKVYSDLS+DYLKM LLNVPATVVALGLFFFLDD+TGFEI  L+E+PE
Sbjct: 161 IVAEGPRCSIRSRKVYSDLSIDYLKMFLLNVPATVVALGLFFFLDDVTGFEINSLIEIPE 220

Query: 239 PFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTT 298
           P+SFI TWFAAVP IV+L+QS+T  I+ + LILKGPCPNCGTEN SFFGTILSISSG +T
Sbjct: 221 PYSFILTWFAAVPFIVWLAQSITNAIINDFLILKGPCPNCGTENTSFFGTILSISSGNST 280

Query: 299 NTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           N + C N      CGT MVYDS+TRLITLPEGS A
Sbjct: 281 NKVKCVN------CGTAMVYDSSTRLITLPEGSNA 309


>gi|4586101|emb|CAB40937.1| putative protein [Arabidopsis thaliana]
 gi|7267897|emb|CAB78239.1| putative protein [Arabidopsis thaliana]
          Length = 338

 Score =  436 bits (1122), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 224/313 (71%), Positives = 253/313 (80%), Gaps = 32/313 (10%)

Query: 46  NGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQA 105
           +GR  ++RRR  +LP KA+TDQ GQV G+EV DSKIL YCSI+K EK+++GE+EQEFLQA
Sbjct: 33  HGRSISLRRRLTLLPLKASTDQSGQVGGEEV-DSKILPYCSINKNEKRTIGEMEQEFLQA 91

Query: 106 LQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEY 165
           +Q+FYYEGKA+MSNEEFDNLKEELMWEGSSVVMLSS EQ+FLEASMAYV+G PI+SDEEY
Sbjct: 92  MQSFYYEGKAIMSNEEFDNLKEELMWEGSSVVMLSSDEQRFLEASMAYVSGNPILSDEEY 151

Query: 166 DKLKQKLKMEGSEIVVEGPRCSLRSRK-------------------------VYSDLSVD 200
           DKLK KLKM+GSEIV EGPRCSLRS+K                         VYSDL++D
Sbjct: 152 DKLKMKLKMDGSEIVCEGPRCSLRSKKNITLDSLFGTNSRDVIDLWTSLIMQVYSDLAID 211

Query: 201 YLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSL 260
           Y KM LLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVP IVYL+ SL
Sbjct: 212 YFKMFLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPAIVYLALSL 271

Query: 261 TKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDS 320
           TKLI+++ LILKGPCPNCGTENVSFFGTILSI +   TN + CS       CGT MVYDS
Sbjct: 272 TKLILKDFLILKGPCPNCGTENVSFFGTILSIPNDSNTNNVKCSG------CGTEMVYDS 325

Query: 321 NTRLITLPEGSEA 333
            +RLITLPEG +A
Sbjct: 326 GSRLITLPEGGKA 338


>gi|334186444|ref|NP_001190702.1| PGR5-like protein 1B [Arabidopsis thaliana]
 gi|332657675|gb|AEE83075.1| PGR5-like protein 1B [Arabidopsis thaliana]
          Length = 296

 Score =  433 bits (1114), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 211/264 (79%), Positives = 235/264 (89%), Gaps = 7/264 (2%)

Query: 70  QVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEEL 129
           QV G+EV DSKIL YCSI+K EK+++GE+EQEFLQA+Q+FYYEGKA+MSNEEFDNLKEEL
Sbjct: 40  QVGGEEV-DSKILPYCSINKNEKRTIGEMEQEFLQAMQSFYYEGKAIMSNEEFDNLKEEL 98

Query: 130 MWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLR 189
           MWEGSSVVMLSS EQ+FLEASMAYV+G PI+SDEEYDKLK KLKM+GSEIV EGPRCSLR
Sbjct: 99  MWEGSSVVMLSSDEQRFLEASMAYVSGNPILSDEEYDKLKMKLKMDGSEIVCEGPRCSLR 158

Query: 190 SRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAA 249
           S+KVYSDL++DY KM LLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAA
Sbjct: 159 SKKVYSDLAIDYFKMFLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAA 218

Query: 250 VPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFC 309
           VP IVYL+ SLTKLI+++ LILKGPCPNCGTENVSFFGTILSI +   TN + CS     
Sbjct: 219 VPAIVYLALSLTKLILKDFLILKGPCPNCGTENVSFFGTILSIPNDSNTNNVKCS----- 273

Query: 310 FSCGTTMVYDSNTRLITLPEGSEA 333
             CGT MVYDS +RLITLPEG +A
Sbjct: 274 -GCGTEMVYDSGSRLITLPEGGKA 296


>gi|414884313|tpg|DAA60327.1| TPA: hypothetical protein ZEAMMB73_572664 [Zea mays]
          Length = 329

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 200/267 (74%), Positives = 236/267 (88%), Gaps = 6/267 (2%)

Query: 67  QQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLK 126
           QQ + E D+VVDS +L YCSI++KEKKS+GE+EQEFLQA+QAFYYEGKA+MSNEEFDNLK
Sbjct: 68  QQQEAEADQVVDSNMLPYCSINRKEKKSIGEMEQEFLQAMQAFYYEGKAIMSNEEFDNLK 127

Query: 127 EELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRC 186
           EELMWEGSSVVMLS  EQK LEA+MAYV+G PIM+D+E+D+LK +LK EGS+IV EGPRC
Sbjct: 128 EELMWEGSSVVMLSPDEQKLLEAAMAYVSGNPIMTDDEFDQLKLRLKKEGSDIVQEGPRC 187

Query: 187 SLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTW 246
           SLRSRKVYSDL+VDYLKMLLLNVPA VVAL LFFFLDD TGFEITYLLELPEPFSFIFTW
Sbjct: 188 SLRSRKVYSDLTVDYLKMLLLNVPAAVVALALFFFLDDFTGFEITYLLELPEPFSFIFTW 247

Query: 247 FAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNL 306
           FAA+PLI +++Q++T +IV++ LILKGPCPNCG EN+SFFGTILS+ SGG+ N++ C+  
Sbjct: 248 FAALPLIFWVAQAITNVIVKDFLILKGPCPNCGNENLSFFGTILSVPSGGSKNSVKCA-- 305

Query: 307 TFCFSCGTTMVYDSNTRLITLPEGSEA 333
               SCGT + YDS+TRLITLPE +EA
Sbjct: 306 ----SCGTELEYDSSTRLITLPEPAEA 328


>gi|357110990|ref|XP_003557298.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Brachypodium
           distachyon]
          Length = 322

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 199/279 (71%), Positives = 236/279 (84%), Gaps = 7/279 (2%)

Query: 53  RRRSFVLPSKATTDQQGQVEG-DEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYY 111
           R  S + P  A  ++QG+++  D+VVDS +L YC++D+KEKKS+GE+EQEFLQALQ+FYY
Sbjct: 50  RGGSRLRPRAAEAERQGRMQREDDVVDSNVLPYCNLDRKEKKSIGEMEQEFLQALQSFYY 109

Query: 112 EGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQK 171
           + KA+MSNEEFDNLKEELMWEGSSVVMLS  EQ+ LEASMAYVAGKPIM+D E+D+LK +
Sbjct: 110 DKKAIMSNEEFDNLKEELMWEGSSVVMLSPDEQRLLEASMAYVAGKPIMTDIEFDELKLR 169

Query: 172 LKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEIT 231
           LK EGSEIV EGPRCSLRSRKVYSDL+VDY KM LLNVPA VVAL LFFFLDD TGFEIT
Sbjct: 170 LKKEGSEIVQEGPRCSLRSRKVYSDLTVDYFKMFLLNVPAAVVALTLFFFLDDFTGFEIT 229

Query: 232 YLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILS 291
           YLLELPEPFSFIFTWFAA+PLI +++Q++T  IV++ LILKGPCPNCG EN+SFFGTILS
Sbjct: 230 YLLELPEPFSFIFTWFAALPLIFWVAQAITNAIVKDFLILKGPCPNCGNENLSFFGTILS 289

Query: 292 ISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEG 330
           + SGG  N++ C+N      CGT +VYDS +RLITLPE 
Sbjct: 290 VPSGGAKNSVKCAN------CGTALVYDSGSRLITLPEA 322


>gi|115477403|ref|NP_001062297.1| Os08g0526300 [Oryza sativa Japonica Group]
 gi|42407629|dbj|BAD08743.1| unknown protein [Oryza sativa Japonica Group]
 gi|42761407|dbj|BAD11572.1| unknown protein [Oryza sativa Japonica Group]
 gi|113624266|dbj|BAF24211.1| Os08g0526300 [Oryza sativa Japonica Group]
          Length = 329

 Score =  409 bits (1051), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 201/303 (66%), Positives = 241/303 (79%), Gaps = 11/303 (3%)

Query: 34  SSRAHSNSNTQ---FNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKK 90
           SS AH  SN Q    + R +  +        +    Q+   +G  VVDS +L YCSI++K
Sbjct: 35  SSSAHGWSNCQGWRLHHRVWAAQAADQQGGVQQQQQQEENEDG--VVDSNVLPYCSINRK 92

Query: 91  EKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEAS 150
           EKK++GE+EQEFLQALQAFYY+ KAVMSNEEFDNLKEELMWEGSSVVMLS  EQ+ LEAS
Sbjct: 93  EKKTIGEMEQEFLQALQAFYYDKKAVMSNEEFDNLKEELMWEGSSVVMLSPDEQRLLEAS 152

Query: 151 MAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVP 210
           MAYVAG PIM+D E+D+LK +L+ EGSEIV EGPRCSLRSRKVYSDL+VDY KM LLNVP
Sbjct: 153 MAYVAGNPIMTDAEFDELKLRLRKEGSEIVQEGPRCSLRSRKVYSDLTVDYFKMFLLNVP 212

Query: 211 ATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLI 270
           A V+AL LFFFLDD+TGFEITYLLELPEPFSFIFTWFAA+PLI +++Q++T  IV++ LI
Sbjct: 213 AAVLALTLFFFLDDLTGFEITYLLELPEPFSFIFTWFAALPLIFWVAQAITSAIVKDFLI 272

Query: 271 LKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEG 330
           LKGPCPNCG EN+SFFGTILS+ SGG  N++ C+N      C +++VYDS +RLITLPE 
Sbjct: 273 LKGPCPNCGNENLSFFGTILSVPSGGARNSVKCAN------CSSSLVYDSASRLITLPET 326

Query: 331 SEA 333
           +EA
Sbjct: 327 AEA 329


>gi|239050005|ref|NP_001141772.2| uncharacterized protein LOC100273908 [Zea mays]
 gi|194702532|gb|ACF85350.1| unknown [Zea mays]
 gi|238908946|gb|ACF87023.2| unknown [Zea mays]
 gi|414874022|tpg|DAA52579.1| TPA: hypothetical protein ZEAMMB73_712670 [Zea mays]
 gi|414874023|tpg|DAA52580.1| TPA: hypothetical protein ZEAMMB73_712670 [Zea mays]
 gi|414874024|tpg|DAA52581.1| TPA: hypothetical protein ZEAMMB73_712670 [Zea mays]
 gi|414874025|tpg|DAA52582.1| TPA: hypothetical protein ZEAMMB73_712670 [Zea mays]
          Length = 322

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/271 (72%), Positives = 231/271 (85%), Gaps = 9/271 (3%)

Query: 65  TDQQGQVE--GDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEF 122
           T+QQGQV+   DEVVD  IL YCSID+K+KK+LGE+EQEFLQALQAFYY+ KA+MSNEEF
Sbjct: 59  TEQQGQVQEQDDEVVDRNILPYCSIDRKQKKTLGEMEQEFLQALQAFYYDQKAIMSNEEF 118

Query: 123 DNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVE 182
           DNLKEELMWEGSSVVML+  EQK LEAS+AY +G PIMSD E+D+LK +LK +GS IV E
Sbjct: 119 DNLKEELMWEGSSVVMLNEDEQKLLEASLAYTSGNPIMSDAEFDELKLRLKTDGSVIVKE 178

Query: 183 GPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSF 242
           GPRCSLR+ KVYSDL+VDYLKM LLNVPAT VALGLFFF+D++TGFEI  + +LPEPF F
Sbjct: 179 GPRCSLRTHKVYSDLNVDYLKMFLLNVPATTVALGLFFFIDELTGFEIN-VFQLPEPFGF 237

Query: 243 IFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTIN 302
           IFTWFAA+PLI++L+QSLTK IV++ LILKGPCPNCGTEN+SFFGTILS+SSGGTTN + 
Sbjct: 238 IFTWFAALPLILFLAQSLTKAIVQDFLILKGPCPNCGTENLSFFGTILSVSSGGTTNKVK 297

Query: 303 CSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           C+N      C T + YDS +R+ITLPE S A
Sbjct: 298 CAN------CSTELEYDSKSRVITLPEASNA 322


>gi|357115252|ref|XP_003559404.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Brachypodium
           distachyon]
          Length = 318

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 195/279 (69%), Positives = 239/279 (85%), Gaps = 9/279 (3%)

Query: 53  RRRSFVLPSKATTDQQGQVEG--DEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFY 110
           RR + +L     TDQQGQV+   DEVVDS +L YCS+D+KEKK++GE+EQEFL+ALQ+FY
Sbjct: 46  RRAALLLLRPRATDQQGQVQQQEDEVVDSNVLPYCSLDRKEKKTIGEMEQEFLRALQSFY 105

Query: 111 YEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQ 170
           Y+ KA+MSNEEFDNLKEELMWEGSSVVMLS+ EQ+ LEASMAY+AG PIM+D E+D+LK 
Sbjct: 106 YDQKAIMSNEEFDNLKEELMWEGSSVVMLSADEQRLLEASMAYIAGNPIMTDAEFDQLKL 165

Query: 171 KLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEI 230
           +LK +GS+IV EGPRCSLRSRKVYSDLSVDYL+M LLNVPATVVALGLFFF+D++TGFE+
Sbjct: 166 RLKQDGSDIVTEGPRCSLRSRKVYSDLSVDYLRMYLLNVPATVVALGLFFFIDELTGFEV 225

Query: 231 TYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTIL 290
             + +LPEPF FIFTWFAA+PLI+ ++QS+T  I+++ LILKGPCPNCGTEN+SF+GTIL
Sbjct: 226 N-VFQLPEPFGFIFTWFAALPLILLIAQSVTNAIIKDVLILKGPCPNCGTENLSFYGTIL 284

Query: 291 SISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPE 329
           SI+SGGTTN + C+N      C T +VY+S TRLITLPE
Sbjct: 285 SIASGGTTNNVKCAN------CKTVLVYNSKTRLITLPE 317


>gi|222640898|gb|EEE69030.1| hypothetical protein OsJ_28011 [Oryza sativa Japonica Group]
          Length = 334

 Score =  403 bits (1035), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 201/308 (65%), Positives = 241/308 (78%), Gaps = 16/308 (5%)

Query: 34  SSRAHSNSNTQ---FNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKK 90
           SS AH  SN Q    + R +  +        +    Q+   +G  VVDS +L YCSI++K
Sbjct: 35  SSSAHGWSNCQGWRLHHRVWAAQAADQQGGVQQQQQQEENEDG--VVDSNVLPYCSINRK 92

Query: 91  EKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEAS 150
           EKK++GE+EQEFLQALQAFYY+ KAVMSNEEFDNLKEELMWEGSSVVMLS  EQ+ LEAS
Sbjct: 93  EKKTIGEMEQEFLQALQAFYYDKKAVMSNEEFDNLKEELMWEGSSVVMLSPDEQRLLEAS 152

Query: 151 MAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVP 210
           MAYVAG PIM+D E+D+LK +L+ EGSEIV EGPRCSLRSRKVYSDL+VDY KM LLNVP
Sbjct: 153 MAYVAGNPIMTDAEFDELKLRLRKEGSEIVQEGPRCSLRSRKVYSDLTVDYFKMFLLNVP 212

Query: 211 ATVVALGL-----FFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIV 265
           A V+AL L     FFFLDD+TGFEITYLLELPEPFSFIFTWFAA+PLI +++Q++T  IV
Sbjct: 213 AAVLALTLFRLHRFFFLDDLTGFEITYLLELPEPFSFIFTWFAALPLIFWVAQAITSAIV 272

Query: 266 RESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLI 325
           ++ LILKGPCPNCG EN+SFFGTILS+ SGG  N++ C+N      C +++VYDS +RLI
Sbjct: 273 KDFLILKGPCPNCGNENLSFFGTILSVPSGGARNSVKCAN------CSSSLVYDSASRLI 326

Query: 326 TLPEGSEA 333
           TLPE +EA
Sbjct: 327 TLPETAEA 334


>gi|218201488|gb|EEC83915.1| hypothetical protein OsI_29970 [Oryza sativa Indica Group]
          Length = 326

 Score =  403 bits (1035), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 193/265 (72%), Positives = 226/265 (85%), Gaps = 11/265 (4%)

Query: 74  DEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEG 133
           D VVDS +L YCSI++KEKK++GE+EQEFLQALQAFYY+ KAVMSNEEFDNLKEELMWEG
Sbjct: 68  DGVVDSNVLPYCSINRKEKKTIGEMEQEFLQALQAFYYDKKAVMSNEEFDNLKEELMWEG 127

Query: 134 SSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           SSVVMLS  EQ+ LEASMAYVAG PIM+D E+D+LK +L+ EGSEIV EGPRCSLRSRKV
Sbjct: 128 SSVVMLSPDEQRLLEASMAYVAGNPIMTDAEFDELKLRLRKEGSEIVQEGPRCSLRSRKV 187

Query: 194 YSDLSVDYLKMLLLNVPATVVALGL-----FFFLDDITGFEITYLLELPEPFSFIFTWFA 248
           YSDL+VDY KM LLNVPA VVAL L     FFFLDD+TGFEITYLLELPEPFSFIFTWFA
Sbjct: 188 YSDLTVDYFKMFLLNVPAAVVALTLFRLHRFFFLDDLTGFEITYLLELPEPFSFIFTWFA 247

Query: 249 AVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTF 308
           A+PLI +++Q++T  IV++ LILKGPCPNCG EN+SFFGTILS+ SGG  N++ C+N   
Sbjct: 248 ALPLIFWVAQAITSAIVKDFLILKGPCPNCGNENLSFFGTILSVPSGGARNSVKCAN--- 304

Query: 309 CFSCGTTMVYDSNTRLITLPEGSEA 333
              C +++VYDS +RLITLPE +EA
Sbjct: 305 ---CSSSLVYDSASRLITLPETAEA 326


>gi|218194147|gb|EEC76574.1| hypothetical protein OsI_14406 [Oryza sativa Indica Group]
 gi|218194148|gb|EEC76575.1| hypothetical protein OsI_14407 [Oryza sativa Indica Group]
 gi|222626205|gb|EEE60337.1| hypothetical protein OsJ_13437 [Oryza sativa Japonica Group]
          Length = 322

 Score =  396 bits (1018), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 190/262 (72%), Positives = 222/262 (84%), Gaps = 11/262 (4%)

Query: 75  EVVDSKILQYCSID----KKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELM 130
           EVVDS +LQYCSID    K EK+SLGE+EQEFLQALQAFYY+ KA+MSNEEFDNLKEELM
Sbjct: 68  EVVDSNVLQYCSIDGKGKKAEKRSLGEMEQEFLQALQAFYYDQKAIMSNEEFDNLKEELM 127

Query: 131 WEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRS 190
           WEGSSVVMLS  EQ+ LEASMAY AG PIMSD E+D+LK +LK +GS+IV EGPRCSLRS
Sbjct: 128 WEGSSVVMLSPDEQRLLEASMAYAAGNPIMSDAEFDQLKLRLKKDGSDIVTEGPRCSLRS 187

Query: 191 RKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAV 250
           RKVYSDL+VDYLKM LLNVPAT +ALGLFFF+D++TGFEI  + +LPEPF FIFTWFAA+
Sbjct: 188 RKVYSDLTVDYLKMFLLNVPATTLALGLFFFIDELTGFEIN-IFQLPEPFGFIFTWFAAL 246

Query: 251 PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCF 310
           PLI++++QS+T  IV + LILKGPCPNCG EN+SFFGTILS+SSGG TN + C++     
Sbjct: 247 PLILFIAQSITNAIVNDFLILKGPCPNCGAENLSFFGTILSVSSGGETNKVKCAD----- 301

Query: 311 SCGTTMVYDSNTRLITLPEGSE 332
            C T MVYDS +RLITLPE S+
Sbjct: 302 -CSTEMVYDSKSRLITLPEPSQ 322


>gi|115456709|ref|NP_001051955.1| Os03g0857400 [Oryza sativa Japonica Group]
 gi|30102983|gb|AAP21396.1| unknown protein [Oryza sativa Japonica Group]
 gi|108712199|gb|ABF99994.1| expressed protein [Oryza sativa Japonica Group]
 gi|113550426|dbj|BAF13869.1| Os03g0857400 [Oryza sativa Japonica Group]
 gi|215704395|dbj|BAG93829.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 324

 Score =  396 bits (1017), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 190/262 (72%), Positives = 222/262 (84%), Gaps = 11/262 (4%)

Query: 75  EVVDSKILQYCSID----KKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELM 130
           EVVDS +LQYCSID    K EK+SLGE+EQEFLQALQAFYY+ KA+MSNEEFDNLKEELM
Sbjct: 70  EVVDSNVLQYCSIDGKGKKAEKRSLGEMEQEFLQALQAFYYDQKAIMSNEEFDNLKEELM 129

Query: 131 WEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRS 190
           WEGSSVVMLS  EQ+ LEASMAY AG PIMSD E+D+LK +LK +GS+IV EGPRCSLRS
Sbjct: 130 WEGSSVVMLSPDEQRLLEASMAYAAGNPIMSDAEFDQLKLRLKKDGSDIVTEGPRCSLRS 189

Query: 191 RKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAV 250
           RKVYSDL+VDYLKM LLNVPAT +ALGLFFF+D++TGFEI  + +LPEPF FIFTWFAA+
Sbjct: 190 RKVYSDLTVDYLKMFLLNVPATTLALGLFFFIDELTGFEIN-IFQLPEPFGFIFTWFAAL 248

Query: 251 PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCF 310
           PLI++++QS+T  IV + LILKGPCPNCG EN+SFFGTILS+SSGG TN + C++     
Sbjct: 249 PLILFIAQSITNAIVNDFLILKGPCPNCGAENLSFFGTILSVSSGGETNKVKCAD----- 303

Query: 311 SCGTTMVYDSNTRLITLPEGSE 332
            C T MVYDS +RLITLPE S+
Sbjct: 304 -CSTEMVYDSKSRLITLPEPSQ 324


>gi|116788866|gb|ABK25031.1| unknown [Picea sitchensis]
          Length = 350

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/290 (62%), Positives = 226/290 (77%), Gaps = 9/290 (3%)

Query: 46  NGRQFTV--RRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKK-EKKSLGELEQEF 102
           +GR +T   +R   ++       QQGQV+ +E+VD+ IL YCSIDKK EK++LGELEQ+F
Sbjct: 49  SGRCYTTTTKRPPLIVMQATEQQQQGQVKEEEIVDNNILPYCSIDKKSEKRTLGELEQDF 108

Query: 103 LQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSD 162
           L+ALQ+FY+E K+ M+NEEFD+LKEELMWEGSSVVMLS  EQKF+EAS+AY  GKPIM+D
Sbjct: 109 LEALQSFYFEKKSTMTNEEFDHLKEELMWEGSSVVMLSPDEQKFMEASLAYAVGKPIMTD 168

Query: 163 EEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFL 222
            EYD LK +LK   S++ +EGPRCSLRSR VYSDL+VDY KM LLNVPA VV L L FFL
Sbjct: 169 AEYDDLKFRLKKASSKVALEGPRCSLRSRNVYSDLTVDYFKMFLLNVPAAVVTLTLVFFL 228

Query: 223 DDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTEN 282
           DD+TGFEITYLLELPEP+SFIFTWF  +P    +++++T  I+++ LILKG CPNCGTEN
Sbjct: 229 DDLTGFEITYLLELPEPYSFIFTWFIVLPATYLIARAITNAILKDFLILKGTCPNCGTEN 288

Query: 283 VSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSE 332
           +SFFGTIL++ SGG  N + C N      CGT ++YDS+TRLITL E  +
Sbjct: 289 ISFFGTILTVPSGGPLNEVKCEN------CGTALMYDSDTRLITLAEKKQ 332


>gi|168039121|ref|XP_001772047.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676648|gb|EDQ63128.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 287

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 159/254 (62%), Positives = 209/254 (82%), Gaps = 6/254 (2%)

Query: 74  DEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEG 133
           D  VD K+L YC I+KK+KK+LGE+EQ+FL+ALQ+FY++ K +MSNEEFD LKEEL WEG
Sbjct: 13  DSEVDDKVLPYCDINKKQKKTLGEMEQDFLEALQSFYFDSKPIMSNEEFDLLKEELTWEG 72

Query: 134 SSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           SSVV+LSS EQ+FLEAS++Y AGKPI+SD+ +D+LK KLK +GS++ + GPRCSLRS+KV
Sbjct: 73  SSVVILSSDEQRFLEASLSYAAGKPILSDQAFDELKLKLKQKGSKVAMAGPRCSLRSKKV 132

Query: 194 YSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLI 253
            SD SVDY+KM LLN+PA ++ALGL FFLDDITGFEITYLLELPEP+SF+FTWF  +P  
Sbjct: 133 VSDASVDYVKMTLLNLPAALIALGLVFFLDDITGFEITYLLELPEPYSFLFTWFVVLPTT 192

Query: 254 VYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCG 313
             ++QSLT ++++++LIL GPCPNCG    S+FG+IL+I SGG +N + C       +CG
Sbjct: 193 FLMAQSLTNIVLKDALILNGPCPNCGAGVNSYFGSILTIPSGGPSNNVKCE------ACG 246

Query: 314 TTMVYDSNTRLITL 327
           ++M++D +TRLITL
Sbjct: 247 SSMIFDKDTRLITL 260


>gi|302787374|ref|XP_002975457.1| hypothetical protein SELMODRAFT_232511 [Selaginella moellendorffii]
 gi|300157031|gb|EFJ23658.1| hypothetical protein SELMODRAFT_232511 [Selaginella moellendorffii]
          Length = 323

 Score =  323 bits (828), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 170/296 (57%), Positives = 224/296 (75%), Gaps = 14/296 (4%)

Query: 33  SSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEK 92
           SS  AHS+     NG+    R R+  +   A+ +Q   V+ +E+ D+KIL YCSIDKK+ 
Sbjct: 27  SSFVAHSHLR---NGQS---RSRNARIVMNASENQTA-VDQEELSDNKILPYCSIDKKKD 79

Query: 93  K-SLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASM 151
           K S+GELEQEFLQALQ+FYY+ + +M+NEEFDNLKEEL+WEGS+VV+LS  EQ+F+EAS+
Sbjct: 80  KKSVGELEQEFLQALQSFYYDKEPMMTNEEFDNLKEELLWEGSNVVILSPDEQRFMEASL 139

Query: 152 AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPA 211
           AY AGK +MSDEEYD+LK KLK   S++ +EGPRCSLRS+KVYSD +VDYLKM LLN+PA
Sbjct: 140 AYAAGKRLMSDEEYDRLKLKLKKSNSKVAIEGPRCSLRSKKVYSDCTVDYLKMTLLNLPA 199

Query: 212 TVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLIL 271
            +++L + FFLDD+TGFEITYLLELPEP+ FIFTW   +P   + ++ +T L+ +++LIL
Sbjct: 200 VLISLLVVFFLDDLTGFEITYLLELPEPYGFIFTWSVVLPATFFAARFITNLVFKDALIL 259

Query: 272 KGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITL 327
           KGPC NCG+E  S+FGTIL ++SGG+TN + C        C   + +DS TRLITL
Sbjct: 260 KGPCANCGSETNSYFGTILGVASGGSTNDVKCEQ------CKALLKFDSETRLITL 309


>gi|302761284|ref|XP_002964064.1| hypothetical protein SELMODRAFT_66389 [Selaginella moellendorffii]
 gi|300167793|gb|EFJ34397.1| hypothetical protein SELMODRAFT_66389 [Selaginella moellendorffii]
          Length = 250

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 158/256 (61%), Positives = 202/256 (78%), Gaps = 12/256 (4%)

Query: 78  DSKILQYCSIDKKEKK-SLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSV 136
           D+KIL YCSIDKK+ K S+GELEQEFLQALQ+FYY+ K +M+NEEFDNLKEEL+WEGS+V
Sbjct: 1   DNKILPYCSIDKKKDKKSVGELEQEFLQALQSFYYDKKPMMTNEEFDNLKEELLWEGSNV 60

Query: 137 VMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLK-----MEGSEIVVEGPRCSLRSR 191
           V+LS  EQ+F+EAS+AY AGK +MSDEEYD+LK KLK        S++ +EGPRCSLRS+
Sbjct: 61  VILSPDEQRFMEASLAYAAGKRLMSDEEYDRLKLKLKSNCWQKSNSKVAIEGPRCSLRSK 120

Query: 192 KVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVP 251
           KVYSD +VDYLKM LLN+PA +++L + FFLDD+TGFEITYLLELPEP+ FIFTW   +P
Sbjct: 121 KVYSDCTVDYLKMTLLNLPAVLISLLVVFFLDDLTGFEITYLLELPEPYGFIFTWSVVLP 180

Query: 252 LIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFS 311
              + ++ +T L+ +++LILKGPC NCG+E  S+FGTIL ++SGG+TN + C        
Sbjct: 181 ATFFAARFITNLVFKDALILKGPCANCGSETNSYFGTILGVASGGSTNDVKCEQ------ 234

Query: 312 CGTTMVYDSNTRLITL 327
           C   + +DS TRLITL
Sbjct: 235 CKALLKFDSETRLITL 250


>gi|384252004|gb|EIE25481.1| hypothetical protein COCSUDRAFT_28046 [Coccomyxa subellipsoidea
           C-169]
          Length = 349

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/294 (48%), Positives = 195/294 (66%), Gaps = 25/294 (8%)

Query: 53  RRRSFVLPSKATTDQQGQVEG------------DEVVDSKILQYCSIDK-----KEKKSL 95
           +R+SF++  +A  D++  +              +E+VD  IL YC++DK     + KKSL
Sbjct: 44  QRQSFIV--RAADDKKSGISAVVEKVKEALPALEEIVDENILDYCTLDKTGARPRSKKSL 101

Query: 96  GELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVA 155
           GE EQ+FL+AL+AFYYE    MSNEEFDNLKEEL+W GS V +LSS EQ+FLEAS+A+ A
Sbjct: 102 GEKEQDFLEALRAFYYEEAPTMSNEEFDNLKEELLWAGSKVAILSSTEQRFLEASLAFQA 161

Query: 156 GKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVA 215
           GK I+SDEEYD LK +L+ + S++V +GPRCS+RSR++YSD   DYLKM LLN+PA +  
Sbjct: 162 GKSIVSDEEYDSLKDQLRKKNSKVVQQGPRCSIRSRRIYSDAVPDYLKMTLLNLPAAIFT 221

Query: 216 LGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPC 275
           L + F +DD+TGFEIT L+ELPEPF     W   +PL   +S SLT L +++ LILKGPC
Sbjct: 222 LIVLFSIDDLTGFEITKLVELPEPFGIFAVWGFVLPLTYIVSSSLTNLALKDFLILKGPC 281

Query: 276 PNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPE 329
           PNC TE  ++FG IL++     TN + C N      C   +  D+  R + + E
Sbjct: 282 PNCLTEVQTYFGDILTVKGSRDTNLVQCGN------CKAKITVDATKRSMIISE 329


>gi|215687350|dbj|BAG91915.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215692659|dbj|BAG88079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737686|dbj|BAG96816.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 177

 Score =  280 bits (715), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 130/183 (71%), Positives = 156/183 (85%), Gaps = 6/183 (3%)

Query: 151 MAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVP 210
           MAYVAG PIM+D E+D+LK +L+ EGSEIV EGPRCSLRSRKVYSDL+VDY KM LLNVP
Sbjct: 1   MAYVAGNPIMTDAEFDELKLRLRKEGSEIVQEGPRCSLRSRKVYSDLTVDYFKMFLLNVP 60

Query: 211 ATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLI 270
           A V+AL LFFFLDD+TGFEITYLLELPEPFSFIFTWFAA+PLI +++Q++T  IV++ LI
Sbjct: 61  AAVLALTLFFFLDDLTGFEITYLLELPEPFSFIFTWFAALPLIFWVAQAITSAIVKDFLI 120

Query: 271 LKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEG 330
           LKGPCPNCG EN+SFFGTILS+ SGG  N++ C+N      C +++VYDS +RLITLPE 
Sbjct: 121 LKGPCPNCGNENLSFFGTILSVPSGGARNSVKCAN------CSSSLVYDSASRLITLPET 174

Query: 331 SEA 333
           +EA
Sbjct: 175 AEA 177


>gi|307110119|gb|EFN58356.1| hypothetical protein CHLNCDRAFT_34513 [Chlorella variabilis]
          Length = 305

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 130/252 (51%), Positives = 179/252 (71%), Gaps = 10/252 (3%)

Query: 76  VVDSKILQYCSIDKKEKK----SLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMW 131
           V D+ I Q+CS+D   K+    +LGE EQ FL+AL AFYY+G+  +++EEFD LK+EL+W
Sbjct: 46  VDDANINQFCSLDADGKRIAPMTLGEKEQVFLEALSAFYYDGRPAITDEEFDMLKDELVW 105

Query: 132 EGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSR 191
            GS V +LS+ EQ+FLEA ++Y  GKP+MSD ++D LK KL+  GS I  EGPRCSLRSR
Sbjct: 106 SGSKVAVLSTDEQRFLEAQISYNKGKPVMSDTDFDALKLKLRQSGSIITAEGPRCSLRSR 165

Query: 192 KVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVP 251
            +YS+   DYL+M  +N+PA +V LG  F +DD+TGFEIT L+ELPEP   +  W   +P
Sbjct: 166 TMYSNADPDYLRMTAINIPAALVVLGGVFSVDDLTGFEITKLIELPEPGGIVALWGFLLP 225

Query: 252 LIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFS 311
           ++  L+ S+T +I R++LILK PCPNCGTE V++FG IL+++    TN I CS      S
Sbjct: 226 IVYVLASSITNVIFRDALILKAPCPNCGTEAVTYFGDILTVAGNRKTNEIKCS------S 279

Query: 312 CGTTMVYDSNTR 323
           C +++V+DS TR
Sbjct: 280 CQSSLVFDSTTR 291


>gi|159468702|ref|XP_001692513.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278226|gb|EDP03991.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 333

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 125/274 (45%), Positives = 181/274 (66%), Gaps = 9/274 (3%)

Query: 62  KATTDQQGQVEGDEVVDSKILQYCSID---KKEKKSLGELEQEFLQALQAFYYEGKAVMS 118
           ++     G +  DE + +    YC+ID   KK K+SLGE+EQEFL A+ ++YYEGK  MS
Sbjct: 40  RSAKKDDGYISEDEGLGNVAADYCAIDGAGKKAKRSLGEMEQEFLAAMTSWYYEGKPTMS 99

Query: 119 NEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSE 178
           +EEF  LKEEL+W GS V +LSS EQ+FLEASMAY  GKPIM+DE+YD LK +L+ + S 
Sbjct: 100 DEEFSLLKEELIWSGSMVAVLSSDEQRFLEASMAYAKGKPIMTDEDYDALKAELRNKSSI 159

Query: 179 IVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPE 238
           +  +GPRCS+RS+K+Y+D   DYL+M  LN+P  +  LGL F +D  TGF +T L+ELP 
Sbjct: 160 VTAQGPRCSIRSKKMYADAEPDYLRMTALNLPGVLFVLGLVFAVDYSTGFGVTKLVELPA 219

Query: 239 PFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTT 298
           P+  I  W   +P +  ++ +LT++  +++LILK PCP+CG+EN S+FG + +++     
Sbjct: 220 PYGPILLWGLLLPSLFTVAYALTQVGFKDNLILKAPCPSCGSENFSYFGDVFTVAGARGQ 279

Query: 299 NTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSE 332
           N + C N      C   M++D   R++ + E SE
Sbjct: 280 NLVECPN------CKADMIFDEYKRVVVVAETSE 307


>gi|414874028|tpg|DAA52585.1| TPA: hypothetical protein ZEAMMB73_384055 [Zea mays]
          Length = 221

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 130/211 (61%), Positives = 152/211 (72%), Gaps = 39/211 (18%)

Query: 68  QGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKE 127
           Q Q + DEVVD  IL YCSID+K+KK+LGE+EQEFLQALQAFYY+ KA+MSNEEFDNLKE
Sbjct: 10  QVQEQDDEVVDRNILPYCSIDRKQKKTLGEMEQEFLQALQAFYYDQKAIMSNEEFDNLKE 69

Query: 128 ELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCS 187
           ELMWEGSSVV+                                  K   S IV EGPRCS
Sbjct: 70  ELMWEGSSVVI---------------------------------AKQTNSVIVKEGPRCS 96

Query: 188 LRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEI------TYLLELPEPFS 241
           LRS KVYSDL+VDY+KM LLNVPAT VALGLFFF+D++TGFEI       +L++LPEPF 
Sbjct: 97  LRSHKVYSDLNVDYIKMFLLNVPATTVALGLFFFIDELTGFEINVFQVSVHLIQLPEPFG 156

Query: 242 FIFTWFAAVPLIVYLSQSLTKLIVRESLILK 272
           FIFTWFAA+PLI++L+QSLTK IV++ LILK
Sbjct: 157 FIFTWFAALPLILFLAQSLTKAIVQDFLILK 187


>gi|302840317|ref|XP_002951714.1| hypothetical protein VOLCADRAFT_109811 [Volvox carteri f.
           nagariensis]
 gi|300262962|gb|EFJ47165.1| hypothetical protein VOLCADRAFT_109811 [Volvox carteri f.
           nagariensis]
          Length = 339

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 120/275 (43%), Positives = 181/275 (65%), Gaps = 10/275 (3%)

Query: 62  KATTDQQGQVEGDEVVDSKILQYCSIDKKEKK----SLGELEQEFLQALQAFYYEGKAVM 117
           +++ +  G +  DE   +    YC++D K KK    SLGE+EQ+FL A+ A+YYEGK  M
Sbjct: 42  QSSNNGDGFISDDEGTYNVAADYCALDDKGKKASKRSLGEMEQDFLAAMTAWYYEGKPTM 101

Query: 118 SNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGS 177
           S+EEF+ LK+EL+W GS+V +LSS EQ+FLEASMAY  GKPIMSDEEYD LK +L+ + S
Sbjct: 102 SDEEFNLLKDELIWSGSTVAVLSSDEQRFLEASMAYAKGKPIMSDEEYDLLKAELRNKSS 161

Query: 178 EIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELP 237
            +  +GPRCS+RS+K+Y+D   DYL+M  L++P  +  LGL F  D  +GF +T ++ELP
Sbjct: 162 IVTAQGPRCSIRSKKMYADAEPDYLRMTALSLPGVMFVLGLVFVADYSSGFGLTKVIELP 221

Query: 238 EPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGT 297
            P+  +  W   +P +  ++ +LT++  +++LILK PCP+CG EN ++FG + ++S    
Sbjct: 222 APYGVVLLWGLLLPSLFVIAYALTQVGFKDNLILKAPCPSCGAENFTYFGDVFTVSGNRG 281

Query: 298 TNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSE 332
            NT+ C N      C   + +D   R++ + E +E
Sbjct: 282 QNTLECHN------CKADLTFDEYKRIVVVAETAE 310


>gi|356541890|ref|XP_003539405.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Glycine max]
          Length = 246

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 111/161 (68%), Positives = 127/161 (78%), Gaps = 7/161 (4%)

Query: 174 MEGSEIVVEGPRCSLRSRKVYSDLSV-DYLKMLLLNVPATVVALGLFFFLDDITGFEITY 232
           MEGSEIV EGPRCSLRSRKV     V DYLKM LLNVP TVVALGLFFFLDD+TGFEI+Y
Sbjct: 92  MEGSEIVAEGPRCSLRSRKVCIQCPVCDYLKMFLLNVPETVVALGLFFFLDDVTGFEISY 151

Query: 233 LLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSI 292
           L+++PEPFSFI TW AA+P  ++L+QS+T+  V++  ILKGPCPN GTEN SFF TILS+
Sbjct: 152 LIKIPEPFSFILTWSAAIPFNLWLAQSITRATVQDFFILKGPCPNYGTENTSFFKTILSV 211

Query: 293 SSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           SSG +TN + C N      C T MVYDS TRLITLPEGS A
Sbjct: 212 SSGDSTNKVKCEN------CETKMVYDSKTRLITLPEGSNA 246


>gi|34015203|gb|AAQ56398.1| hypothetical protein OSJNBa0003M24.13 [Oryza sativa Japonica Group]
          Length = 203

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 102/167 (61%), Positives = 131/167 (78%), Gaps = 6/167 (3%)

Query: 167 KLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDIT 226
           K+ +      +E+V+     +L+  +VYSDL+VDY KM +LNVPA VVAL LFFFLDD+T
Sbjct: 43  KIPRAFHRLKTELVIYPGITNLQFSQVYSDLTVDYFKMFMLNVPAAVVALTLFFFLDDLT 102

Query: 227 GFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFF 286
           GFEITYLLELPEPFSFIFTWFAA+PLI +++Q++T  IV++ LILKGPC NCG EN+SFF
Sbjct: 103 GFEITYLLELPEPFSFIFTWFAALPLIFWVAQAITSAIVKDFLILKGPCLNCGNENLSFF 162

Query: 287 GTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
           GTILS+ SGG  N++ C+N      CG+++VYDS +RLITLPE +EA
Sbjct: 163 GTILSVPSGGARNSVKCAN------CGSSLVYDSASRLITLPETAEA 203


>gi|194708506|gb|ACF88337.1| unknown [Zea mays]
          Length = 123

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 89/130 (68%), Positives = 107/130 (82%), Gaps = 7/130 (5%)

Query: 204 MLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKL 263
           M LLNVPAT VALGLFFF+D++TGFEI  + +LPEPF FIFTWFAA+PLI++L+QSLTK 
Sbjct: 1   MFLLNVPATTVALGLFFFIDELTGFEIN-VFQLPEPFGFIFTWFAALPLILFLAQSLTKA 59

Query: 264 IVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTR 323
           IV++ LILKGPCPNCGTEN+SFFGTILS+SSGGTTN + C+N      C   + YDS +R
Sbjct: 60  IVQDFLILKGPCPNCGTENLSFFGTILSVSSGGTTNKVKCAN------CSAELEYDSKSR 113

Query: 324 LITLPEGSEA 333
           +ITLPE S A
Sbjct: 114 VITLPEASNA 123


>gi|307104343|gb|EFN52597.1| hypothetical protein CHLNCDRAFT_59756 [Chlorella variabilis]
          Length = 304

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 103/270 (38%), Positives = 154/270 (57%), Gaps = 27/270 (10%)

Query: 77  VDSKILQYCSIDKKEKKSLGEL-----EQEFLQALQAFYYEGKAVMSNEEFDNLKEELMW 131
           VD  I +YCS+DK  ++   EL     EQ FL+A+ AFYY+ K V+S+ EFDNLK+EL W
Sbjct: 44  VDENIGEYCSLDKDGRRPDHELTTAEKEQLFLEAMAAFYYDEKPVLSDTEFDNLKQELQW 103

Query: 132 EGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVV---EGPRCSL 188
           EGS VV+L+  E++ LEA +A+  GKPIMSDE+Y+ LK  L   G+ +     EGP C+L
Sbjct: 104 EGSKVVVLTKEEKRLLEAKLAFKKGKPIMSDEQYEALKASLA--GASVFALPREGPSCTL 161

Query: 189 ------RSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSF 242
                 R +K+ +    D+LKM  L VP  +V  GL    D +TG     LL LP     
Sbjct: 162 GKPDTPRGQKL-AQAQADWLKMAALAVPPPLVIAGLLLGADLLTG---ANLLHLPGTVGI 217

Query: 243 IFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENV-SFFGTILSISSGGTTNTI 301
           +      VP +  L+ +++  + ++++ILK  CPNCG  N+ S+FG IL+++     NT+
Sbjct: 218 VVWGGLVVPTVYVLANAVSSFMFKDAVILKANCPNCGEPNLSSYFGDILTVAGSRDKNTV 277

Query: 302 NCSNLTFCFSCGTTMVYDSNTRLITLPEGS 331
                  C  C + + +D+  R + + E +
Sbjct: 278 ------VCPCCASKLEFDAKNRSVLVAEAA 301


>gi|449019859|dbj|BAM83261.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 328

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 98/276 (35%), Positives = 153/276 (55%), Gaps = 23/276 (8%)

Query: 62  KATTDQQGQVEG-DEVVDSKILQYCSIDKK----EKKSLGELEQEFLQALQAFYYEGKAV 116
           K T   QG V G D   D  +  Y S D +     + SLGE E+ +L+A+ +FY  G+ +
Sbjct: 52  KETEKVQGSVNGSDADADGGLQGYKSTDPRTGASRELSLGEKERLYLEAVYSFY-NGEPI 110

Query: 117 MSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEG 176
           +SNEEF+ LKE+L W GS V++++  EQ+FL A+ A+  GKPIMSDE++D LK +L+ +G
Sbjct: 111 LSNEEFNVLKEDLQWNGSQVILMNRDEQRFLNAAQAFYNGKPIMSDEDFDALKDRLRKQG 170

Query: 177 SEIVV-EGPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLE 235
           S+I + +GPRC++ +   YSD  VD L+  ++  PA  +   L+  L     FE+T L  
Sbjct: 171 SQIALQQGPRCNIETGVCYSDCQVDKLRQAVIYAPAAGIGALLWAGLS----FELTPLRH 226

Query: 236 LPEPFSFIFTWFAAVPLIVYLSQSLTKLIVR-ESLILKGPCPNCGTENVSFFGTILSISS 294
           +    + ++  F   P+I   +Q LT ++++ +  I+ G CP+CG+E   FFG IL +  
Sbjct: 227 V----NPVWNLFIGAPVIYACAQLLTGIVLQNDPQIVVGECPSCGSEQRVFFGDILGVEG 282

Query: 295 GGTTNTINCSNLTFCFSCGTTMVYD-SNTRLITLPE 329
                 I C        C  ++  D    RL  +P 
Sbjct: 283 YRDVAEIKCER------CKASLRLDRKRMRLEQIPR 312


>gi|452824246|gb|EME31250.1| hypothetical protein Gasu_14910 [Galdieria sulphuraria]
          Length = 346

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 92/257 (35%), Positives = 139/257 (54%), Gaps = 18/257 (7%)

Query: 61  SKATTDQQGQVEGDEVVDSKILQYCSID----KKEKKSLGELEQEFLQALQAFYYEGKAV 116
           SK++    G  +   +VDS IL++C++D    +K++ +LGE E  F+ A+ + YY G+ +
Sbjct: 63  SKSSATLPGDQKEPILVDSNILEFCTVDPNTGEKKEMTLGEKELLFMDAISS-YYRGEPI 121

Query: 117 MSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEG 176
           +SN +FD L+EEL W+GS +VMLS  E +FLEA  AY  G PIMSD+E+D+LK KL+  G
Sbjct: 122 LSNNDFDILQEELAWQGSKMVMLSRDELRFLEAYRAYSNGTPIMSDQEFDELKAKLRKSG 181

Query: 177 SEIVVE-GPRCSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLE 235
           S I ++ GPRCS+ +   Y +   D  +ML L  PA  +A  ++        FE T L  
Sbjct: 182 SSIALQKGPRCSIVTETCYCNCVPDRTRMLFLYFPAAGIAALIY----ATVTFEFTPLRH 237

Query: 236 LPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSG 295
           +    + +       P+I   ++  T+L     LIL G CP+C     ++FG    I   
Sbjct: 238 I----TPVLNLLIGTPVIAAAAKIATELFFPSPLILVGECPSCTAPVRAYFGDAFGIKGN 293

Query: 296 GTTNTINC----SNLTF 308
                  C    +NL F
Sbjct: 294 KREAFTQCERCRANLKF 310


>gi|219110655|ref|XP_002177079.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411614|gb|EEC51542.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 326

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 91/257 (35%), Positives = 138/257 (53%), Gaps = 26/257 (10%)

Query: 83  QYCSIDKKE----KKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVM 138
           ++C  D       + ++ E E+ FL ALQ++Y  G+ ++S++EFD LKE+L W GS +V+
Sbjct: 84  EFCITDNDSGSMIRLTIEEKERIFLDALQSYYNSGRKMLSDDEFDLLKEDLTWNGSPMVV 143

Query: 139 LSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEG-PRCSLRSRKVYSDL 197
           ++  E  +L A  AY+ G PIMSD E+D LKQ LK EGS+  V+  P+C + +      L
Sbjct: 144 MNRKEAAYLNAMQAYLKGDPIMSDSEFDSLKQDLKDEGSKFAVQTEPKCYIDTGICKVTL 203

Query: 198 SVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPF---SFIFTWFAAVPLIV 254
             D  +  LL +PATV+ +  +          I +  E+ EPF   + +       PLI 
Sbjct: 204 QEDKFRSNLLYLPATVIGVVAW----------IGFGFEVLEPFIRINPVVLTLMGTPLIY 253

Query: 255 YLSQSLTKLIVRESLILK-GPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCG 313
             +++LT+  + E+  L  GPCP+C  EN  +FG IL +   G T+  N      C +C 
Sbjct: 254 LGARTLTEEFIFENKFLAYGPCPSCQAENRVYFGNILGVE--GFTDVANAK----CPNCK 307

Query: 314 TTMVYDSNT-RLITLPE 329
           TT     NT R  TLP+
Sbjct: 308 TTFTVQRNTLRASTLPK 324


>gi|223999427|ref|XP_002289386.1| hypothetical protein THAPSDRAFT_268713 [Thalassiosira pseudonana
           CCMP1335]
 gi|220974594|gb|EED92923.1| hypothetical protein THAPSDRAFT_268713 [Thalassiosira pseudonana
           CCMP1335]
          Length = 327

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 17/292 (5%)

Query: 20  PVQKPAAALAPFPSSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDS 79
           PV    +A+  F S S     S    +G   T  R+     S      +  ++ ++ V  
Sbjct: 24  PVLTGNSAVKSFRSQSSGALFSEEPKDGAMITSGRKELSYDSATGRFFETDLDPEDCVPD 83

Query: 80  KILQYCSIDKKE----KKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSS 135
              +YC +DK      + ++ E E+ FL ALQ++Y+ G+ V+++ EFD LKE+L W GS 
Sbjct: 84  D--EYCVVDKASGELVRLTIEEKERIFLDALQSYYFSGRQVLNDAEFDLLKEDLSWNGSP 141

Query: 136 VVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIV-VEGPRCSLRSRKVY 194
           VV L+  E K+L A  AY+ G P +SD E+D+LK +LK EGS+    + P+C + +    
Sbjct: 142 VVNLNRKEAKYLAAVQAYLKGSPTLSDREFDQLKAELKEEGSQFASSKEPKCYIDTGICT 201

Query: 195 SDLSVDYLKMLLLNVP-ATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLI 253
                D  +  LL +P   ++A+G   F     GFE+   +    P   I     A P I
Sbjct: 202 VTYQNDNFRNNLLYLPVGAILAIGWLGF-----GFEVIEPIVRLNP--LILAAIGA-PFI 253

Query: 254 VYLSQSLTK-LIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCS 304
              S SLT   I   + I+ GPCP C TE   +FG IL +        + CS
Sbjct: 254 YNGSISLTDSFIFPNNKIVYGPCPVCETETRVYFGDILGVEGFSDVAGVKCS 305


>gi|397567808|gb|EJK45793.1| hypothetical protein THAOC_35578 [Thalassiosira oceanica]
          Length = 695

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 127/230 (55%), Gaps = 18/230 (7%)

Query: 83  QYCSIDKKE----KKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVM 138
           +YC +DK+     + +L E E+ F+ ALQ++Y  G+ ++++ EFD LKE+L W+GSS+V 
Sbjct: 470 EYCIVDKETGNYIRLTLEEKERIFMDALQSYYVSGRQLLNDAEFDMLKEDLAWQGSSLVN 529

Query: 139 LSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEG--PRCSLRSRKVYSD 196
           ++  E K+L A  AY+ G+PI+SDEE+D LK +L ME          P+C + S      
Sbjct: 530 MNRQETKYLAAVQAYLKGEPIVSDEEFDALKTEL-MESKSAFASSKEPKCYIDSGICTVT 588

Query: 197 LSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYL 256
              D  +  LL +PA  + LG+ +      GFE+   L    P   I     + PL++  
Sbjct: 589 FEDDKFRNNLLYLPAGAI-LGILWL---GLGFEVLGPLVRLNP--LILAALGS-PLVING 641

Query: 257 SQSLTK-LIVRESLILKGPCPNCGTENVSFFGTILSI---SSGGTTNTIN 302
           S+ +T   I +++ I  GPCP+CGTE   +FG IL +   S G   N+ +
Sbjct: 642 SKYITDTFIFQDNQIGYGPCPSCGTEMRIYFGNILGVEGFSDGAAENSTS 691


>gi|397598461|gb|EJK57224.1| hypothetical protein THAOC_22761 [Thalassiosira oceanica]
          Length = 321

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 142/308 (46%), Gaps = 39/308 (12%)

Query: 29  APFPSSSRAHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVVDSKILQYCSID 88
            P P +    ++   QF+G +F                 +  +EGD+ +  +  ++C ID
Sbjct: 45  GPPPGTKIVSNSKEVQFDGIRF----------------YETGIEGDDCIPKE--EFCVID 86

Query: 89  KKEKK----SLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQ 144
            +  +    ++ E E+ FL ALQ++Y  G+ +M + EFD LKE+LMW GS +V L+  E 
Sbjct: 87  PENDEPIRLTVEEKERMFLDALQSYYVSGRQIMDDAEFDALKEDLMWNGSELVNLNRREI 146

Query: 145 KFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEG-PRCSLRSRKVYSDLSVDYLK 203
            +LEA   +  G P MSD E+D L+++L+   S + VE  P C + +     +   D  +
Sbjct: 147 AYLEAMQGFNKGAPTMSDAEFDALREELRDAKSVVAVEKEPTCYIETGICTIEWKKDNFR 206

Query: 204 MLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLT-K 262
             L+ +PA   A+  F +L        + L+      + + T     P+I   +Q++T  
Sbjct: 207 NNLIYLPAA--AIMTFIWLGS------SALIVGLNQLNPLLTLIVGSPVIYIAAQAITDN 258

Query: 263 LIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCG-TTMVYDSN 321
            I   +LI  GPCPNC      +FG IL +          C       SC    +V   +
Sbjct: 259 FIFANNLIAYGPCPNCEYPERVYFGNILGVEGFDNQAAKKCK------SCKEEILVQRRS 312

Query: 322 TRLITLPE 329
            R  TLP+
Sbjct: 313 LRAATLPK 320


>gi|428167073|gb|EKX36038.1| hypothetical protein GUITHDRAFT_90093 [Guillardia theta CCMP2712]
          Length = 253

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 117/224 (52%), Gaps = 23/224 (10%)

Query: 92  KKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASM 151
           K +L   E+ +L A  AF+ +GK+++S+ E+D LK +L +EGS V ++S  E KF+ A+ 
Sbjct: 22  KLTLAAKEKLYLDATSAFHNDGKSILSDAEYDKLKSDLAFEGSYVGLMSREEVKFMVAAN 81

Query: 152 AYVAGKPIMSDEEYDKLKQKLKMEGSEIVV-EGPRCSLRSRKVYSDLSVDYLKMLLLNVP 210
            Y  GKPIMSDEE+D L++KLK + S  V+ E P C +  +   SDL  D  K  +L  P
Sbjct: 82  RYQEGKPIMSDEEFDSLRRKLKQQNSRAVIHEVPTCKVDGQTCKSDLIPDTTKNAVLYFP 141

Query: 211 A----TVVALGLFFFLDDITGFEITYL--LELPEPFSFIFTWFAAVPLIVYLSQSLTK-L 263
           A    T+V   L ++   + G     L  L +  PF   FTW            ++T  L
Sbjct: 142 ALIVVTIVWSELAYWFTTLQGGSPNPLTSLVINSPFIAGFTW------------AITNFL 189

Query: 264 IVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTI--NCSN 305
           + ++  +    CP C T    +FG IL + SG T + +   C N
Sbjct: 190 LFQKPFVTSVQCPRCQTYQNIYFGDILFV-SGKTEDVVETQCVN 232


>gi|298707075|emb|CBJ29877.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 347

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 77/258 (29%), Positives = 127/258 (49%), Gaps = 27/258 (10%)

Query: 83  QYCSIDKKEKKSL----GELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVM 138
           +YC+ID+   K +    GE E+ F  A+Q+FYY  + ++ +++FD LKE+L WEGS VV+
Sbjct: 105 EYCAIDETTGKPMLLTTGEKERIFTDAIQSFYYNQRQMLDDDDFDKLKEDLAWEGSPVVL 164

Query: 139 LSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEG-PRCSLRSRKVYSDL 197
           ++  EQ F+ A  AY  G  I+ DEE+D+LK  L+  GS + V   P+C + +       
Sbjct: 165 MNRDEQLFMSAMAAYSRGDKIIDDEEFDRLKASLRESGSVVAVSTEPKCYIDTGICTVTF 224

Query: 198 SVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFI---FTWFAAVPLIV 254
             D  +     +PA  +  GL F         +    ++ EP  ++    T        +
Sbjct: 225 QEDLFRRFATYLPANFI-FGLAF---------LGASWQIIEPIRYLNPLATLLLLTAPTI 274

Query: 255 YLSQSLTKLIVRE--SLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSC 312
             S+  T L + +    +  GPCP+CG +N  FFG++L +      +   C N      C
Sbjct: 275 AASKKFTDLCLYQDNGTVASGPCPSCGAQNRVFFGSMLGVKGNIDDSEFKCDN------C 328

Query: 313 GTTM-VYDSNTRLITLPE 329
           G  + +  ++ R  TLP+
Sbjct: 329 GEIIKIRRADLRARTLPK 346


>gi|297608385|ref|NP_001061513.2| Os08g0310500 [Oryza sativa Japonica Group]
 gi|255678348|dbj|BAF23427.2| Os08g0310500 [Oryza sativa Japonica Group]
          Length = 199

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 54/108 (50%), Positives = 71/108 (65%), Gaps = 13/108 (12%)

Query: 74  DEVVDSKILQYCSIDKKEKKSLGELEQEFL--QALQAFYYEGKAVMSNEEFDNLKEELMW 131
           D+VVDS +L YCSI++KEKK++GE+ QEFL  +   AFYY+ KA++SNE FDNLKEELMW
Sbjct: 71  DDVVDSNVLPYCSINRKEKKTIGEMAQEFLSRRCRPAFYYDKKAIISNEVFDNLKEELMW 130

Query: 132 EGSSVVMLS-----------SAEQKFLEASMAYVAGKPIMSDEEYDKL 168
           EGSSVVML+            A  +   AS     GK   +D+++  L
Sbjct: 131 EGSSVVMLTLSSWPALWIAARAATRLAPASPMSTGGKLTNNDDKHQTL 178


>gi|34015197|gb|AAQ56392.1| hypothetical protein OSJNBa0003M24.11 [Oryza sativa Japonica Group]
 gi|35210546|dbj|BAC92661.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 176

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/72 (66%), Positives = 61/72 (84%), Gaps = 2/72 (2%)

Query: 74  DEVVDSKILQYCSIDKKEKKSLGELEQEFL--QALQAFYYEGKAVMSNEEFDNLKEELMW 131
           D+VVDS +L YCSI++KEKK++GE+ QEFL  +   AFYY+ KA++SNE FDNLKEELMW
Sbjct: 71  DDVVDSNVLPYCSINRKEKKTIGEMAQEFLSRRCRPAFYYDKKAIISNEVFDNLKEELMW 130

Query: 132 EGSSVVMLSSAE 143
           EGSSVVML+ +E
Sbjct: 131 EGSSVVMLNDSE 142


>gi|303284627|ref|XP_003061604.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456934|gb|EEH54234.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 329

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 129/291 (44%), Gaps = 68/291 (23%)

Query: 85  CSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQ 144
           CSID     SL +LE  ++ AL  FY  G   +S+E++D L+EEL W+GS    L   E 
Sbjct: 64  CSIDDPSSCSLADLEMLYIDALWNFYNGGSFTLSDEQYDRLREELNWQGSGFPTLRRYEV 123

Query: 145 KFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGS--------------------------- 177
           +F+EAS+AY  G+P+++DEEY+ LK+K+K EG                            
Sbjct: 124 QFVEASIAYARGEPVVTDEEYEDLKRKVKAEGKRDDVTALLLYTKGQELLDPAQFLRLKD 183

Query: 178 -------EIVVEGPRCSLR--SRKVYSDLSVDYLKMLLLNVPATVVALGLF-FFLDDITG 227
                  ++ + G  C++   S K+ SD S   +  + L + A    LGL  + L  I G
Sbjct: 184 EMAKLDVDVGLRGATCTMSKTSDKLTSDSSA--VTKMYLAIGAVPTVLGLTPYVLGSIFG 241

Query: 228 FEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIV-----RESLILKGPCPNCGTEN 282
           F+I        PF+      A V   + +   LT  IV     + + I+KG CP C  E 
Sbjct: 242 FDI--------PFT------AGVGFGLTVGALLTASIVNYTGLQNAEIVKGECPCCENEI 287

Query: 283 VSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEGSEA 333
             FFG           N+++      C  CGTT   +  T  I    G +A
Sbjct: 288 KQFFG------GESPENSVDYK----CGVCGTTSKLNRTTMKIDQAGGIKA 328


>gi|125551392|gb|EAY97101.1| hypothetical protein OsI_19024 [Oryza sativa Indica Group]
          Length = 176

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 46/72 (63%), Positives = 60/72 (83%), Gaps = 2/72 (2%)

Query: 74  DEVVDSKILQYCSIDKKEKKSLGELEQEFL--QALQAFYYEGKAVMSNEEFDNLKEELMW 131
           D+VVDS +L Y SI++KEKK++GE+ +EFL  +   AFYY+ KAV+SNE FDNLKEELMW
Sbjct: 71  DDVVDSNVLPYYSINRKEKKTIGEMAEEFLSRRCRPAFYYDKKAVISNEVFDNLKEELMW 130

Query: 132 EGSSVVMLSSAE 143
           EGS+VVML+ +E
Sbjct: 131 EGSTVVMLNDSE 142


>gi|428177901|gb|EKX46779.1| hypothetical protein GUITHDRAFT_107552 [Guillardia theta CCMP2712]
          Length = 195

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 74/238 (31%), Positives = 111/238 (46%), Gaps = 55/238 (23%)

Query: 94  SLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAY 153
           SL   EQ +L    A+  + ++++S+E+++ LK +L +EGS V+++S  E KF+ A+  Y
Sbjct: 2   SLFAKEQLYLDCCAAYNVDAESLLSDEDYEKLKTDLTFEGSQVILMSREEIKFMVAANRY 61

Query: 154 VAGKPIMSDEEYDKLKQKLKMEGSEIVV-EGPRCSL--RSRKV-YSDLSVDYLKMLLLNV 209
             GKPIMSDEE+D L++KLK +GS  V  E   C L    RKV  +D+  D  K  LL  
Sbjct: 62  NKGKPIMSDEEFDTLRKKLKNKGSLAVKHEAASCKLDESGRKVCKADIFPDEGKNALLYT 121

Query: 210 PATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESL 269
           PA V+   LF                    ++F   WF                      
Sbjct: 122 PALVLTALLF------------------NEWAF---WF---------------------- 138

Query: 270 ILKGPCPNCGTENVSFFGTILSISSGGTTNTI--NCSNLTFCFSCGTTMVYDSNTRLI 325
             K  CP+CGT    +FG IL +S     +++   C N     +CG  +V D    ++
Sbjct: 139 --KATCPSCGTPQNIYFGDILWVSGKPVESSVQTQCCNK----ACGVELVADKKRMIV 190


>gi|412990023|emb|CCO20665.1| predicted protein [Bathycoccus prasinos]
          Length = 336

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 87/300 (29%), Positives = 131/300 (43%), Gaps = 57/300 (19%)

Query: 60  PSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSN 119
           PSK+  D       D   +  +   CSID+ E  SL +LE  ++ AL  +Y +GK  +S+
Sbjct: 52  PSKSMAD-------DTTPNVSVGPSCSIDEPETCSLADLEILYVDALWNYYNDGKFTLSD 104

Query: 120 EEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKME---- 175
            ++D ++EEL W+GS    L   E +F++A+MAY  G+ IM D+EY+ LK+K+K E    
Sbjct: 105 GQYDRIREELNWQGSGFPTLRRDEIQFVQAAMAYSRGEKIMGDDEYEDLKRKIKNEAGKR 164

Query: 176 -----------GSEIVVEG--PRCSLRSRKVYSDLSVDYLKMLLLNVPATV--------- 213
                      G +++ E      S    K+  D+ +      L   P T+         
Sbjct: 165 KDVTALLLYTKGQQLLDEAQFELMSEEMMKLGIDVGMKGATCTLSQTPTTLENDASAVAK 224

Query: 214 --VALGLFFFLDDITGFEITYL-----LELPEPFSFIF-TWFAAVPLIVYLSQSLTKLIV 265
              AL L  FL    G   T +     L +P   +  F T  AA+  + Y     T+L  
Sbjct: 225 MYAALALVPFL---VGLAPTIVFDVLGLHVPNGLALGFATTLAAI--LTYKLAEYTEL-- 277

Query: 266 RESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLI 325
             + IL G CP C       F  + S  +GG T   +      C  CGTT   +   +LI
Sbjct: 278 NNAQILTGQCPCCEEP----FKQLFSGENGGETEMEH-----KCKVCGTTCSLNREKKLI 328


>gi|145352121|ref|XP_001420406.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580640|gb|ABO98699.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 281

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 79/287 (27%), Positives = 126/287 (43%), Gaps = 50/287 (17%)

Query: 77  VDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSV 136
           V+  I   CSID     SL +LE  ++ AL  +Y  GK  +S+E++D L+EEL W+GS  
Sbjct: 8   VNPDIGVSCSIDDPSSCSLADLELMYVDALWNYYNGGKFTLSDEDYDRLREELNWQGSGF 67

Query: 137 VMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSD 196
             L   E +F++A+++Y  G+ ++ D++Y++LK+++K EG    V       + +++  +
Sbjct: 68  PTLRRYEIEFVQAAISYSRGESVVDDDKYEELKRRVKAEGKRTDVTALLLYTKGKELLDE 127

Query: 197 LSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWFAA---VPLI 253
              + L        A +  LG+   +   T        EL    S +FT +AA   VPL+
Sbjct: 128 AEFELLG-------AEMKKLGIEVGMKGATCTLSQTSTELISDTSGVFTMYAALATVPLL 180

Query: 254 VYLSQSLT--------------------------KLI----VRESLILKGPCPNCGTENV 283
           + LS S+                           KL+    ++ + IL G CP C     
Sbjct: 181 LGLSPSILLGFFGVHVPSGLSLGFAATLCAGLTYKLVNYTNLQNAEILIGACPCCEMPIK 240

Query: 284 SFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITLPEG 330
            FFG      S   T    CS      +CGT    +   RLI    G
Sbjct: 241 QFFGG----ESPAETFEHKCS------ACGTKCTLNRPKRLIETAGG 277


>gi|255086363|ref|XP_002509148.1| predicted protein [Micromonas sp. RCC299]
 gi|226524426|gb|ACO70406.1| predicted protein [Micromonas sp. RCC299]
          Length = 318

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 61/92 (66%)

Query: 85  CSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQ 144
           CSID     SL +LE  ++ AL  +Y  G   +++E++D L+EEL W+GS    L   E 
Sbjct: 53  CSIDDPSSCSLADLEMMYIDALWNYYNGGDFTLTDEQYDRLREELNWQGSGFPTLRRYEV 112

Query: 145 KFLEASMAYVAGKPIMSDEEYDKLKQKLKMEG 176
           +F+EA+++Y  G+P +SD+EY++LK+K++  G
Sbjct: 113 QFVEAAISYARGEPKVSDKEYEELKRKVRAAG 144


>gi|308809053|ref|XP_003081836.1| putative heme activated protein (ISS) [Ostreococcus tauri]
 gi|116060303|emb|CAL55639.1| putative heme activated protein (ISS) [Ostreococcus tauri]
          Length = 651

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 76/277 (27%), Positives = 119/277 (42%), Gaps = 56/277 (20%)

Query: 85  CSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQ 144
           CSID     SL +LE  ++ AL  +Y  GK  +++E++D L+EEL W+GS    L   E 
Sbjct: 386 CSIDDPASCSLADLELMYVDALWNYYNGGKFTLTDEQYDRLREELNWQGSGFPTLRRYEI 445

Query: 145 KFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL-- 202
           +F++A+++Y  G+ ++ D +Y++LK+K+K  G    V       + +++  +   + L  
Sbjct: 446 EFVQAAISYSRGEAVVDDAKYEELKRKVKAAGKRTDVTALLLYAKGKELLDEAEFELLSG 505

Query: 203 --KML-------------------LLNVPATVV----ALGLFFFLDDITGFEITYLL--- 234
             K L                   L N PA+V     AL +  F     G   + LL   
Sbjct: 506 EMKKLGIDVGMRGATCTLSQTSAELSNDPASVASMYSALAIAPF---AVGLAPSILLGFV 562

Query: 235 --ELPEPFS--FIFTWFAAV--PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGT 288
              LP   S  F  T  AA+   L+VY +       ++ + IL G CP C      FFG 
Sbjct: 563 GVHLPASLSLGFALTVCAALTYKLVVYTN-------LQNAEILLGACPCCEMPIKQFFGG 615

Query: 289 ILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLI 325
              +                C +CGT    +   RLI
Sbjct: 616 EAPLEKFEHK----------CSACGTVCTLNRPQRLI 642


>gi|224104421|ref|XP_002313431.1| predicted protein [Populus trichocarpa]
 gi|222849839|gb|EEE87386.1| predicted protein [Populus trichocarpa]
          Length = 286

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 104/223 (46%), Gaps = 42/223 (18%)

Query: 115 AVMSNEEFDNLKEELMWEGSSVVM---LSSAEQKFLEASM-----AYVAGKPIMSDEEYD 166
           A+ SN      K  L  EG S +    + +A Q+ LEA       AY +GKP++ D+ +D
Sbjct: 38  AMSSNGVSRTRKVALAPEGPSCLFVGPIETASQETLEALYRQARDAYYSGKPLIIDDMFD 97

Query: 167 KLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDY--------LKMLLLNVPATVVALGL 218
           +++ KL+  GS+ VV+ PRCS+R +  YSD   D         + +L L V  +  AL +
Sbjct: 98  RVELKLRCYGSKCVVKYPRCSIRRQSTYSDAEADISQAFALASIWILFLTVGCSACALPI 157

Query: 219 FFFL----DDITGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQS--------------L 260
            + +     D  G  I++  + P     I  + A V  I++++ S              L
Sbjct: 158 IYTIGLAYQDAFGSVISHGSQTP-----IIGFLATVNGILFMAVSALIGYPIASASVKVL 212

Query: 261 TKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINC 303
             L   + + LKG CPNCG E  +F   + S  S  +T+  +C
Sbjct: 213 QGLWRNDLVALKGACPNCGEEVFAF---VKSDQSNNSTHRADC 252


>gi|168021127|ref|XP_001763093.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685576|gb|EDQ71970.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 227

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 18/158 (11%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  +Y +G+P++ D+ +DK++ +L+  GS++V++ PRCSLR    Y+D  VD  
Sbjct: 25  EALYQQARDSYYSGQPLVVDDMFDKVELQLRWHGSKLVLKYPRCSLRRFTAYADAEVDPS 84

Query: 203 KMLLLNVPATVVALGLFFFLDDITGFEI---TYL-LELPEPFSFIF-----------TWF 247
           +M  L   ATV +L L   L    G  +   TY+     EP +F+            ++ 
Sbjct: 85  QMRAL---ATVWSLLLAVGLGVAVGLPVLIATYMEFNSHEPGTFVLLYTNGAFLSLASFL 141

Query: 248 AAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
              P+ +     L  L   + + LKG CPNCG E  +F
Sbjct: 142 VGAPIALAAMNQLQALGRGDLVALKGCCPNCGEEVYTF 179


>gi|449019450|dbj|BAM82852.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 340

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 107/213 (50%), Gaps = 21/213 (9%)

Query: 96  GELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVA 155
           G LE  F++AL++ YYEG  V+++ EF  L++EL   GSS + L+  E+ +++A+     
Sbjct: 86  GALENLFIEALRS-YYEGNPVLTDGEFKTLRDELEHLGSSSLRLNDLEKVWIQAAQQRDM 144

Query: 156 GKPIMSD-----EEYDKLKQKLKMEGSEIV----VEGPRCSLRSRKVY---SDLSVDYLK 203
            + ++++     EE D LK KL    +  +    +     S+  R +Y    D   D  K
Sbjct: 145 DRRLLNELRLSPEELDGLKAKLGASRTSTLDMQSILKSDSSMDKRLMYLLFGDAVEDRFK 204

Query: 204 MLLLNVPATVVALG--LFFFLDDIT---GFEITYLLELPEPFSFIFTWFAAVPLIV--YL 256
           +LLL  PA ++ L   L F L D+      ++T+  ++    S +F   A V      YL
Sbjct: 205 LLLLYAPAVLLCLSSILGFALLDLICLGHIQLTFDDQVAVRASVLFVVIAVVTAWFSNYL 264

Query: 257 S-QSLTKLIVRESLILKGPCPNCGTENVSFFGT 288
           + + L+ L +    +++GPCPNCG +    F +
Sbjct: 265 TPRMLSYLDLGSPEVVRGPCPNCGNDVTCLFSS 297


>gi|224009986|ref|XP_002293951.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970623|gb|EED88960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 292

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 48/79 (60%)

Query: 102 FLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASMAYVAGKPIMS 161
           +  +L ++Y  GK ++ N  +D L E L WEGSSV  +S+ E +F+    A   G+P+M 
Sbjct: 191 YFDSLHSYYKTGKPLLDNANYDELHENLTWEGSSVATMSAQEAQFVSTVAASKRGEPLMD 250

Query: 162 DEEYDKLKQKLKMEGSEIV 180
           D+EY  LK  LK +GS +V
Sbjct: 251 DDEYKALKADLKQKGSWVV 269


>gi|145346020|ref|XP_001417495.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577722|gb|ABO95788.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 306

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 54/187 (28%), Positives = 79/187 (42%), Gaps = 18/187 (9%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  ++     Y  G P +SDE +D L+ KL   GS +V + PRCS+R + VYSD  VD  
Sbjct: 94  ETLYVACKEKYFTGAPAVSDEYFDALEAKLSYAGSGVVKKYPRCSVRGKAVYSDCVVDEA 153

Query: 203 KMLLLNVP-ATVVALGLFF----FLDDITG-FEITYLLELPE--PFSFIFTWFAAVPLIV 254
           +M  L      ++ALG  F    F DD+   F +      P    F         + LI 
Sbjct: 154 QMRALQTSYLAILALGTLFALVDFGDDLRAVFSVVLGDHGPNVPQFRVPIVGLVGLGLIS 213

Query: 255 YLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGT 314
              + L      E+L + G CP C  +  +F    L +  G           T C  CG 
Sbjct: 214 RGLEKLNAASTGETLAMTGECPACHEDVYAF----LPVRRG------QARERTECHVCGR 263

Query: 315 TMVYDSN 321
            +V+++ 
Sbjct: 264 KIVFETQ 270


>gi|449455102|ref|XP_004145292.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Cucumis
           sativus]
 gi|449473542|ref|XP_004153911.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Cucumis
           sativus]
 gi|449515667|ref|XP_004164870.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Cucumis
           sativus]
          Length = 291

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 85/183 (46%), Gaps = 26/183 (14%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVD-- 200
           E  +L+A  AY +G+P++ D+ +D+++ KL+  GS+ VV+ PRCSLR +  YSD   D  
Sbjct: 71  EALYLQARDAYYSGQPLILDDMFDRVELKLRWYGSKSVVKYPRCSLRRQSTYSDAEEDLS 130

Query: 201 --------YLKMLLLNVPATVVALGLF--FFLDDITGFEITYLLELPEPFSF-------I 243
                   +L  L L   A +V L         D     ++Y      PF F       +
Sbjct: 131 QVLALAGIWLLFLALGCSACLVPLICIADLIFKDPLSLGLSYDTH-GSPFGFLSAINAIL 189

Query: 244 FTWFAAV---PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTTNT 300
           F  F ++   P+       L  L   + + LKG CPNCG E  +F   + S  + G+ + 
Sbjct: 190 FMAFGSLIGYPITTASVGVLQGLWRNDLVALKGACPNCGEEVFAF---VRSDRANGSPHR 246

Query: 301 INC 303
            +C
Sbjct: 247 SDC 249


>gi|224059484|ref|XP_002299869.1| predicted protein [Populus trichocarpa]
 gi|222847127|gb|EEE84674.1| predicted protein [Populus trichocarpa]
          Length = 294

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 81/169 (47%), Gaps = 31/169 (18%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDY- 201
           E  + +A  AY +GKP++ D+ +D+++ KL+  GS+ VV+ PRCSLR +  YSD   D  
Sbjct: 76  EALYCQARDAYYSGKPLIVDDMFDRVELKLRWYGSKSVVKYPRCSLRRQSTYSDAEEDIS 135

Query: 202 -------LKMLLLNVPATVVALGLFFFL----DDITGFEITYLLELPEPFSFIFTWFAAV 250
                  + +L L + ++  A  + + +     D  G  I +  + P     I  + A V
Sbjct: 136 QAFALASIWVLFLTIGSSACAFPIIYTVGLAYQDAFGSGIAHGSQAP-----IIGFLATV 190

Query: 251 PLIVYLS--------------QSLTKLIVRESLILKGPCPNCGTENVSF 285
             I++++              + L  L   + + LKG CPNCG E  +F
Sbjct: 191 NGILFMAVGSLIGYPIASASVKVLQGLWRNDLVALKGACPNCGEEVFAF 239


>gi|14719338|gb|AAK73156.1|AC079022_29 unknown protein [Oryza sativa]
 gi|218195959|gb|EEC78386.1| hypothetical protein OsI_18164 [Oryza sativa Indica Group]
 gi|222629941|gb|EEE62073.1| hypothetical protein OsJ_16857 [Oryza sativa Japonica Group]
          Length = 272

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 55/209 (26%), Positives = 89/209 (42%), Gaps = 34/209 (16%)

Query: 139 LSSAEQKFLEASM-----AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           + +A Q+ LEA       +Y +G+P++ D+ +DK++ KL++ GS  VV+ PRCSL+ +  
Sbjct: 41  IETASQEMLEALYHQARDSYYSGQPLIVDDMFDKVELKLRVYGSPSVVKYPRCSLKRQST 100

Query: 194 YSD----------LSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITYLLELPEPFSFI 243
           Y+D          LS  ++ +LL    A +V       L     F   YLL   + F  I
Sbjct: 101 YADAEEDKSMFMALSSIWMLLLLFGTSAFLVPSLCILSLTFGDAFGARYLLYGAKSFDVI 160

Query: 244 ----------FTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSIS 293
                       +    P+      +L  L+    + LKG CPNCG +  +F        
Sbjct: 161 TRVNDMVLIGLGYLIGYPISSASVGALQGLLTNNLVALKGSCPNCGEQVFAFV------- 213

Query: 294 SGGTTNTINCSNLTFCFSCGTTMVYDSNT 322
              T N+I   +   C  C   + Y +  
Sbjct: 214 --KTDNSIKAPHRAECHVCSCPLEYRTKV 240


>gi|168039960|ref|XP_001772464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676261|gb|EDQ62746.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 240

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 79/157 (50%), Gaps = 16/157 (10%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  +Y +G+P++ D+ +DK++ KL+   S +VV+ PRCSL+    Y+D   D  
Sbjct: 26  EALYQQARDSYYSGQPLVVDDMFDKVEVKLRFHKSNLVVKYPRCSLKRYTAYADAEFDPS 85

Query: 203 KM-LLLNVPATVVALGLFFFLDDITGFEITYLLELP--EPFSFIF-----------TWFA 248
           +M  L  V   + ALGL   L  +    IT  + +   EP +F+            ++  
Sbjct: 86  QMRALATVWGFLFALGL--GLAVVLPVLITRYMSISAYEPGAFVLSDMNRVLFSVVSFLV 143

Query: 249 AVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
             P+ +  ++ L  L   + + LKG CPNCG E  +F
Sbjct: 144 GAPVAMSAAKQLQALGQGDVMALKGCCPNCGAEVYTF 180


>gi|357134989|ref|XP_003569096.1| PREDICTED: PGR5-like protein 1A, chloroplastic-like [Brachypodium
           distachyon]
          Length = 276

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 85/181 (46%), Gaps = 25/181 (13%)

Query: 139 LSSAEQKFLEASM-----AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           + +A Q+ LEA       +Y +G+P++ D+ +DK++ KL++ GS  VV+ PRCSL+ +  
Sbjct: 45  IETASQEMLEALYRQARDSYYSGQPLIVDDMFDKVELKLRVYGSPSVVKYPRCSLKRQST 104

Query: 194 YSDLSVDY--------LKMLLLNVPATVVALGLFFFLDDITG--FEITYLLELPEPFSFI 243
           Y+D   D+        + MLLL   ++   +  F+ L    G  F   +LL   +    I
Sbjct: 105 YADAEEDHSMFMALSSIWMLLLMFGSSAFLVPSFYALSMTFGDAFGARFLLYGAKSLDGI 164

Query: 244 ----------FTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFGTILSIS 293
                       +    P+      +L  L+    + LKG CPNCG +  +F  T  SI 
Sbjct: 165 TRVNDMVLIGLGYLVGYPIASASVGALQGLLTNNLVALKGSCPNCGEQVFAFVKTDKSIK 224

Query: 294 S 294
           +
Sbjct: 225 A 225


>gi|21554137|gb|AAM63217.1| unknown [Arabidopsis thaliana]
          Length = 299

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 77/169 (45%), Gaps = 29/169 (17%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY  GKP++ D+ +D+++ KL+  GS+ VV+ PRCSL  +  Y+D   D  
Sbjct: 75  EALYRQAKDAYYNGKPLIVDDMFDRVELKLRWYGSKSVVKYPRCSLLRQSTYADAEDDAS 134

Query: 203 KMLLLN--------------VPATVVALGLFFFLDDI-TGFEITYLLELPEPFSFIFTWF 247
           ++LLL               V  T+  +GL +  D   +G   +  L    P   I + F
Sbjct: 135 QVLLLATVWIMIFLFGSSACVLPTIYGVGLVYGGDPFDSGLVYSNQLSSSVP---ILSKF 191

Query: 248 AAV-----------PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
             +           P+     ++L  L   +   LKG CPNCG E  +F
Sbjct: 192 NGILLSVLGPAFGYPIASSAVRALKGLWRNDLTALKGDCPNCGEEVFAF 240


>gi|255076371|ref|XP_002501860.1| hypothetical protein MICPUN_58040 [Micromonas sp. RCC299]
 gi|226517124|gb|ACO63118.1| hypothetical protein MICPUN_58040 [Micromonas sp. RCC299]
          Length = 346

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 86/191 (45%), Gaps = 24/191 (12%)

Query: 119 NEEFDNLKEELMW-----EGSSVVMLSSAEQKFLE--------ASMAYVAGKPIMSDEEY 165
            E++D + E  MW     EG S + L   E   +E        A   Y +G+P++ D  +
Sbjct: 77  GEKYD-MAEMGMWDVPSIEGPSCIRLPPIEAATVEELEALYAQAKNTYFSGQPVVDDAMF 135

Query: 166 DKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYLKM-LLLNVPATVVALGLFFFLDD 224
           D ++++L+  GS+  V+ PRCS R  +VYSD S D  ++  L      +VALG    + D
Sbjct: 136 DTVERRLRYLGSDAAVKYPRCSRRDMRVYSDASFDVEQLDSLATTWLALVALGCAMVVLD 195

Query: 225 I-----TGFEITYLLELPEPFSFIFTWFAAVPLIVYLS----QSLTKLIVRESLILKGPC 275
                  G       ++  P    F       L V+L+    + L +L    ++ +KG C
Sbjct: 196 FGDAMRAGVGAVAGGDIGGPHGSSFRPPVIGLLGVFLANGGWERLGRLRDGNTVAVKGEC 255

Query: 276 PNCGTENVSFF 286
           P+C  E  +F 
Sbjct: 256 PSCAEEVYAFL 266


>gi|297793495|ref|XP_002864632.1| hypothetical protein ARALYDRAFT_496067 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310467|gb|EFH40891.1| hypothetical protein ARALYDRAFT_496067 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 74/166 (44%), Gaps = 23/166 (13%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY  GKP++ D+ +D+++ KL+  GS+ VV+ PRCSL  +  Y+D   D  
Sbjct: 75  EALYRQAKDAYYNGKPLIVDDMFDRVELKLRWYGSKSVVKYPRCSLLRQSTYADAEDDAS 134

Query: 203 KMLLLN--------------VPATVVALGLFFFLDDI-TGFEITYLLELPEPFSFIFTWF 247
           ++LLL               V  T+  LGL +  D + +G   +  L    P    F   
Sbjct: 135 QVLLLATIWILIFLFGSSACVLPTMYGLGLVYGGDPVDSGLVYSGQLSSSVPLLSKFNGI 194

Query: 248 A--------AVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
                      P+     + L  L   +   LKG CPNCG E  +F
Sbjct: 195 LLAVLGPAFGYPIASSAVRVLKGLWRNDLTALKGDCPNCGEEVFAF 240


>gi|356547257|ref|XP_003542032.1| PREDICTED: PGR5-like protein 1B, chloroplastic-like [Glycine max]
          Length = 281

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 81/177 (45%), Gaps = 27/177 (15%)

Query: 139 LSSAEQKFLEASM-----AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           L +A Q+ LEA       AY +G+P++ D+ +D+L+ KLK  GS+ VV+ PRCS+R    
Sbjct: 52  LHTATQETLEALYSQARDAYYSGEPLILDDMFDRLELKLKWYGSKSVVKYPRCSIRRHST 111

Query: 194 YSDLSVDY-LKMLLLNVPATVVALG-------LFFFLDDITGFEITYLLELPEPFSFIFT 245
           Y+D   D  + + L ++ +  +ALG       +F+ +           L    P S    
Sbjct: 112 YADADEDLSMAIALASLWSLFLALGCSACVSPIFYTVSTAYHRAFDSGLSYGSPSSSGLG 171

Query: 246 WFAAVPLIVYLS--------------QSLTKLIVRESLILKGPCPNCGTENVSFFGT 288
               V  I++++              + L  L   +   LKG CPNCG E  +F  T
Sbjct: 172 LLFVVNSIIFMALGFVIGYPVASASVKVLQGLWRNDLAALKGSCPNCGEEVFAFVRT 228


>gi|326527371|dbj|BAK04627.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 281

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 84/189 (44%), Gaps = 41/189 (21%)

Query: 139 LSSAEQKFLEASM-----AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           + +A Q+ LEA       +Y +G+P++ D+ +DK++ KL++ GS  VV+ PRCSL+ +  
Sbjct: 50  IETASQEMLEALYRQARDSYYSGQPLIVDDMFDKVELKLRVYGSPSVVKYPRCSLKRQSA 109

Query: 194 YSD----------LSVDYLKMLLLNVPATVV----ALGLFFF--------------LDDI 225
           Y+D          LS  +  +LL    A +V     L L F               LD I
Sbjct: 110 YADAEEDHSMFMALSSIWTLLLLFGTSAFLVPSFYTLSLAFGDAFGARSLFSGAKSLDGI 169

Query: 226 TGFEITYLLELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
           T      L+ L         +    P+      +L  L+    + LKG CPNCG +  +F
Sbjct: 170 TRVNHMVLIGL--------GYLIGYPVASASVGALQGLLTNNVVALKGSCPNCGEQVFAF 221

Query: 286 FGTILSISS 294
             T  SI +
Sbjct: 222 VKTDKSIKA 230


>gi|22327971|ref|NP_568906.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332009798|gb|AED97181.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 301

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 76/169 (44%), Gaps = 29/169 (17%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY  GKP++ D+ +D+++ KL+  GS+ VV+ PRCSL  +  Y+D   D  
Sbjct: 75  EALYRQAKDAYYNGKPLIVDDMFDRVELKLRWYGSKSVVKYPRCSLLRQSTYADAEDDAS 134

Query: 203 KMLLLN--------------VPATVVALGLFFFLDDI-TGFEITYLLELPEPFSFIFTWF 247
           ++LLL               V  T+  +GL +  D   +G   +  L    P   I + F
Sbjct: 135 QVLLLATVWIMIFLFGSSACVLPTIYGVGLVYGGDPFDSGLVYSSQLSSSVP---ILSKF 191

Query: 248 AAV-----------PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
             +           P+     + L  L   +   LKG CPNCG E  +F
Sbjct: 192 NGILLSVLGPAFGYPIASSAVRVLKGLWRNDLTALKGDCPNCGEEVFAF 240


>gi|30697213|ref|NP_851220.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332009797|gb|AED97180.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 299

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 76/169 (44%), Gaps = 29/169 (17%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY  GKP++ D+ +D+++ KL+  GS+ VV+ PRCSL  +  Y+D   D  
Sbjct: 75  EALYRQAKDAYYNGKPLIVDDMFDRVELKLRWYGSKSVVKYPRCSLLRQSTYADAEDDAS 134

Query: 203 KMLLLN--------------VPATVVALGLFFFLDDI-TGFEITYLLELPEPFSFIFTWF 247
           ++LLL               V  T+  +GL +  D   +G   +  L    P   I + F
Sbjct: 135 QVLLLATVWIMIFLFGSSACVLPTIYGVGLVYGGDPFDSGLVYSSQLSSSVP---ILSKF 191

Query: 248 AAV-----------PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
             +           P+     + L  L   +   LKG CPNCG E  +F
Sbjct: 192 NGILLSVLGPAFGYPIASSAVRVLKGLWRNDLTALKGDCPNCGEEVFAF 240


>gi|8885546|dbj|BAA97476.1| unnamed protein product [Arabidopsis thaliana]
          Length = 282

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 76/169 (44%), Gaps = 29/169 (17%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY  GKP++ D+ +D+++ KL+  GS+ VV+ PRCSL  +  Y+D   D  
Sbjct: 75  EALYRQAKDAYYNGKPLIVDDMFDRVELKLRWYGSKSVVKYPRCSLLRQSTYADAEDDAS 134

Query: 203 KMLLLN--------------VPATVVALGLFFFLDDI-TGFEITYLLELPEPFSFIFTWF 247
           ++LLL               V  T+  +GL +  D   +G   +  L    P   I + F
Sbjct: 135 QVLLLATVWIMIFLFGSSACVLPTIYGVGLVYGGDPFDSGLVYSSQLSSSVP---ILSKF 191

Query: 248 AAV-----------PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
             +           P+     + L  L   +   LKG CPNCG E  +F
Sbjct: 192 NGILLSVLGPAFGYPIASSAVRVLKGLWRNDLTALKGDCPNCGEEVFAF 240


>gi|255584751|ref|XP_002533094.1| conserved hypothetical protein [Ricinus communis]
 gi|223527106|gb|EEF29286.1| conserved hypothetical protein [Ricinus communis]
          Length = 287

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/185 (28%), Positives = 90/185 (48%), Gaps = 21/185 (11%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY +G+P++ D+ +D+++ KL+  GS+ VV+ PRCS+R +  Y+D   D  
Sbjct: 69  EALYRQARDAYYSGQPLIVDDMFDRVELKLRWYGSKSVVKYPRCSIRRQSTYADAEDDIS 128

Query: 203 KML-LLNVPATVVALGLFFFLDDIT-GFEITYLLELPEPFS---------FIFT-----W 246
           ++  L ++    + +G    +  I   F + Y   +    S         F+ T     +
Sbjct: 129 QVFALASIWIVFLTIGSSLCVGPIIYSFVLAYQDAISSKISQGSQASMVQFLATVNGILF 188

Query: 247 FAAVPLIVY-LSQSLTKL---IVRESLI-LKGPCPNCGTENVSFFGTILSISSGGTTNTI 301
            A   LI Y ++ S  K+   + R  L+ LKG CPNCG E  +F  +  S +S    +  
Sbjct: 189 MAVGTLIGYPIASSSVKVLQGLWRNDLVALKGACPNCGEEVFAFVKSDQSNNSPHRADCH 248

Query: 302 NCSNL 306
            C +L
Sbjct: 249 VCESL 253


>gi|357454157|ref|XP_003597359.1| PGR5-like protein [Medicago truncatula]
 gi|355486407|gb|AES67610.1| PGR5-like protein [Medicago truncatula]
          Length = 286

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 82/181 (45%), Gaps = 36/181 (19%)

Query: 139 LSSAEQKFLEASM-----AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           L +A Q+ LEA       AY  G+P++ D+ +D+++ KLK  GS+ VV+ PRCS+R +  
Sbjct: 58  LQTATQETLEALYSQARDAYYRGEPLIVDDMFDRVELKLKWYGSKSVVKYPRCSIRRQST 117

Query: 194 YSDLSVDYLKMLLLNVPATVVALGLFF-----------FLDDITGFEITYLL-------- 234
           Y+D   D L M+     A   A  LFF           F      ++  + L        
Sbjct: 118 YADAEED-LSMVF----ALASAWTLFFAFGSSAMVGPMFYSISLAYQNAFNLGLSYGSQA 172

Query: 235 -ELPEPF---SFIFTWFAAV---PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSFFG 287
            EL   F   + +FT    V   P+     ++L  L   +   LKG CPNCG E  +F  
Sbjct: 173 SELSPLFMVNTILFTALGFVIGYPVASASVKTLQGLWRNDLAALKGSCPNCGEEVFAFVR 232

Query: 288 T 288
           T
Sbjct: 233 T 233


>gi|356557427|ref|XP_003547017.1| PREDICTED: PGR5-like protein 1B, chloroplastic-like [Glycine max]
          Length = 270

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 87/195 (44%), Gaps = 28/195 (14%)

Query: 139 LSSAEQKFLEASM-----AYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKV 193
           L +A Q+ LEA       AY +G P++ D+ +D+++ KLK  GS+ VV+ PRCS+R    
Sbjct: 42  LHTATQETLEALYSQARDAYYSGDPLILDDMFDRVELKLKWYGSKSVVKYPRCSIRRHST 101

Query: 194 YSDLSVDY-LKMLLLNVPATVVALG-------LFFFLDDITGFEITYLLELPEPFSFIFT 245
           Y+D   D  + + L  + +  +ALG       +++ +           L    P S +  
Sbjct: 102 YADADEDLSMAIALAGLWSLFLALGCSACVWPIYYTVSTAYQKAFDSGLSYDSPASVLGL 161

Query: 246 WFAAVPLIVYLS--------------QSLTKLIVRESLILKGPCPNCGTENVSFFGTILS 291
            F  V  I++++              + L  L   +   LKG CPNCG E  +F  T  +
Sbjct: 162 LF-VVNSIIFMTLGLAIGYPVASASVKVLQGLWRNDLAALKGSCPNCGEEVFAFVRTDKA 220

Query: 292 ISSGGTTNTINCSNL 306
            +S    +   C  L
Sbjct: 221 NNSSHRADCHVCECL 235


>gi|356542431|ref|XP_003539670.1| PREDICTED: uncharacterized protein LOC100810752 [Glycine max]
          Length = 280

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 23/165 (13%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E  + +A  AY +G+P++ D+ +D+++ +L+  GS+ VV+ PRCS+R +  ++D   D L
Sbjct: 61  EALYCQARDAYYSGQPLIVDDMFDRVELRLRWFGSKSVVKYPRCSIRRQSTFADAEED-L 119

Query: 203 KML--LLNVPATVVALGLFFFLDDIT-----------------GFEITYLLELPEPFSFI 243
            M+  L +  A  +A G    +  ++                 G +   L  L    S I
Sbjct: 120 SMVFALASTWAMFLAFGSLACVGPVSYTVGMAYQNAFDSGLSLGSQTPGLGFLAVVNSLI 179

Query: 244 FTWFAAV---PLIVYLSQSLTKLIVRESLILKGPCPNCGTENVSF 285
           F     V   P+    ++ L  L   + + LKG CPNCG E  +F
Sbjct: 180 FVGLGFVIGYPVASASAKVLQGLWRNDLVALKGACPNCGEEVFAF 224


>gi|219120965|ref|XP_002185714.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582563|gb|ACI65184.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 246

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 3/109 (2%)

Query: 74  DEVVDSKILQYCSIDKKEKKSLGEL---EQEFLQALQAFYYEGKAVMSNEEFDNLKEELM 130
           D+ ++        ++    K+ G L   E  +  AL  +Y  G  +++N +++ L++ L 
Sbjct: 114 DDTIEEDDFVKAIVNNSYWKAAGSLVVKELIYFDALHTYYRAGTPLLNNSDYETLRDNLT 173

Query: 131 WEGSSVVMLSSAEQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEI 179
            EGS VV + + E  F+ A  +   G P+M D +Y  LK++LK +GS +
Sbjct: 174 GEGSVVVNMKAKETIFVMAVASSRRGDPVMDDLQYTTLKRELKAQGSWV 222


>gi|147818033|emb|CAN64889.1| hypothetical protein VITISV_021080 [Vitis vinifera]
          Length = 1059

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/215 (25%), Positives = 91/215 (42%), Gaps = 48/215 (22%)

Query: 143  EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
            E  + +A  AY +G+P++ D+ +D+++ KL+  GS+ V++ PRCSLR    Y+D   D  
Sbjct: 839  EALYCQARDAYYSGEPLIVDDMFDRVELKLRWYGSKSVLKYPRCSLRRHYTYADAEED-- 896

Query: 203  KMLLLNVPATVVALGLFFFL---------------------DDITGFEITY------LLE 235
                   P+ V AL   + L                      D     ++Y      L  
Sbjct: 897  -------PSQVFALASIWILFLAFGGSACLVPIIYTVGQAYQDAFNSGLSYSSQASALQL 949

Query: 236  LPEPFSFIFTWFAAVPLIVYLSQSLTKL--IVRESLI-LKGPCPNCGTENVSFFGTILSI 292
            L      +F    +V      S S+  L  + R  L+ LKG CPNCG E  +F      +
Sbjct: 950  LATANGILFMVLGSVIGYPVASASVGVLQGLWRNDLVALKGACPNCGEEVFAF------V 1003

Query: 293  SSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITL 327
             SG   ++ + ++   C  C +++ + +     TL
Sbjct: 1004 KSGQPNSSPHSAD---CHVCESSLEFSTKVEQSTL 1035


>gi|225434915|ref|XP_002280802.1| PREDICTED: PGR5-like protein 1B, chloroplastic [Vitis vinifera]
 gi|297746060|emb|CBI16116.3| unnamed protein product [Vitis vinifera]
          Length = 293

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 21/164 (12%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVD-- 200
           E  + +A  AY +G+P++ D+ +D+++ KL+  GS+ V++ PRCSLR    Y+D   D  
Sbjct: 73  EALYCQARDAYYSGEPLIVDDMFDRVELKLRWYGSKSVLKYPRCSLRQHYTYADAEEDPS 132

Query: 201 ------YLKMLLLNVPATVVALGLFFFL----DDITGFEITY------LLELPEPFSFIF 244
                  + +L L    +   + + + +     D     ++Y      L  L      +F
Sbjct: 133 QVFALASIWILFLAFGGSACLVPIIYTVGQAYQDAFNSGLSYSSQASALQLLATANGILF 192

Query: 245 TWFAAVPLIVYLSQSLTKL--IVRESLI-LKGPCPNCGTENVSF 285
               +V      S S+  L  + R  L+ LKG CPNCG E  +F
Sbjct: 193 MVLGSVIGYPVASASVGVLQGLWRNDLVALKGACPNCGEEVFAF 236


>gi|303276759|ref|XP_003057673.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460330|gb|EEH57624.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 270

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 84/205 (40%), Gaps = 32/205 (15%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSDLSVDYL 202
           E+ +L+A   Y +G+PI+ D  +D ++ +LK  GS++  + PRCS R   V+ D   DY 
Sbjct: 27  EEVYLQAKNTYFSGQPIVDDAFFDAIETRLKHLGSDVARKYPRCSRRDMTVFGDAEADYE 86

Query: 203 KMLLLNVP-ATVVALGLFFFLDDITGFEITYLLELPEPFSFIFTWF-------------- 247
           +M  L    A  V LG+     D+        LE    F                     
Sbjct: 87  QMGALAATWAGFVFLGIALMGLDVKSAVDAGALEGVFGFGGGGGDGPGLGLAETAGTTRP 146

Query: 248 -AAVPLIV----YLSQS----LTKLIVRESLILKGPCPNCGTENVSFFGTILSISSGGTT 298
            A  P++     +L+Q+    L  L    ++ + G CPNC     +F    L  +S   T
Sbjct: 147 AARAPILAALGGFLAQTGCGKLASLRDGNTIAMSGDCPNCAERVYAF----LPANSAAKT 202

Query: 299 NTINCSNLTFCFSCGTTMVYDSNTR 323
             +     + C  CG  + + ++ +
Sbjct: 203 VKLK----SDCHVCGRGLTFAADVQ 223


>gi|452821325|gb|EME28357.1| hypothetical protein Gasu_41960 [Galdieria sulphuraria]
          Length = 373

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 67/279 (24%), Positives = 111/279 (39%), Gaps = 58/279 (20%)

Query: 92  KKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSSVVMLSSAEQKFLEASM 151
           ++ L  LE  FL+ L + YYEG  + S EEF  L++EL   GS+   L + E  +++AS 
Sbjct: 97  EEELAALETLFLEGLLS-YYEGTPMFSEEEFHTLRDELDHLGSTSTRLHNLEVMWVQASQ 155

Query: 152 AYVAGKPI-----MSDEEYDKLKQKL--------------------KMEGSEIVVEGPR- 185
                +       +S  E   LK+KL                    K++   +V    R 
Sbjct: 156 QRDFDRNFQNEFELSQSEMKSLKEKLLEERKARPPLAEDLALATKRKIQQRSLVRRAARE 215

Query: 186 ------CSLRSRKVYSDLSVDYLKMLLLNVPATVVA------LGLFFFLDDITGFEITYL 233
                  SLR   ++ D ++D  K+L+L  PA ++A      + + FFL D    E+   
Sbjct: 216 SEAALDNSLRY-LLFGDATLDRFKLLVLYAPAALLAGIMSSFMAISFFLFD---GEVEVR 271

Query: 234 LELPEPFSFIFTWFAAVPLIVYLSQSLTKLIVR-----ESLILKGPCPNCGTENVSFFGT 288
           +       F       V  + + +  +T  I+R       L+LKG CP CG+     F  
Sbjct: 272 ISAIGRLRFFLLLAVIVYGVGWFTNFVTPRILRYLDLGHPLLLKGHCPQCGSMVSCLF-- 329

Query: 289 ILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLITL 327
                   T++     +   C  C   + ++   R + L
Sbjct: 330 --------TSSETRIRDERVCSKCNALVGFNHRWRKVYL 360


>gi|242086603|ref|XP_002439134.1| hypothetical protein SORBIDRAFT_09g001176 [Sorghum bicolor]
 gi|241944419|gb|EES17564.1| hypothetical protein SORBIDRAFT_09g001176 [Sorghum bicolor]
          Length = 113

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 38/54 (70%)

Query: 143 EQKFLEASMAYVAGKPIMSDEEYDKLKQKLKMEGSEIVVEGPRCSLRSRKVYSD 196
           E  + +A  +Y +G+P++ D+ +DK++ KL++ GS+ VV+ PRCS+  +  Y+D
Sbjct: 59  EALYHQARDSYYSGQPLIVDDMFDKVELKLRLYGSKSVVKYPRCSIIRQSTYAD 112


>gi|145516164|ref|XP_001443976.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411376|emb|CAK76579.1| unnamed protein product [Paramecium tetraurelia]
          Length = 315

 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 49/217 (22%), Positives = 92/217 (42%), Gaps = 41/217 (18%)

Query: 129 LMWEGSSVVMLSSAEQKFL-EASMAYVAGKPIMSDEEYDKLKQK--LKMEGSEIVVEGPR 185
           ++W+ +  +   S + KF  +  + Y A K I  +++ +  K +  ++M+ +  +   P+
Sbjct: 107 IIWKDAEYMQCPSCKMKFCRKCQLEYHADKGISCEQQKELHKDQFYIEMKKNLKICRCPK 166

Query: 186 CSLRSRKVYSDLSVDYLKMLLLNVPATVVALGLFFFLDDITGFEITY--LLELPEPFS-- 241
           C+    K+ S  +  Y +               F FL D+   E  +    +  +P++  
Sbjct: 167 CNNMCEKI-SGCNFMYCR-----------CKTNFCFLCDVELTEAYHSSHFQKNDPYNSP 214

Query: 242 ----FIFTWFA--AVPLI--VYLSQSLTKLIVRESLILKGPCPNCGTEN-----VSFFGT 288
               +  TW     VP +  V     + ++I  E    K PCPNCG++N     + FF  
Sbjct: 215 CRVWYNGTWVDPDKVPKVEPVKEEAKVVQVIPVEEKENKFPCPNCGSKNKDVSQLQFFDK 274

Query: 289 ILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRLI 325
           ++   S      + C N TFC SC  ++   SN  +I
Sbjct: 275 VVQCQS------VKCQNSTFCISCQKSV---SNNDII 302


>gi|902888|gb|AAA87943.1| plasma phospholipid transfer protein [Mus musculus]
          Length = 491

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 61/257 (23%), Positives = 112/257 (43%), Gaps = 39/257 (15%)

Query: 37  AHSNSNTQFNGRQFTVRRRSFVLPSKATTDQQGQVEGDEVV----------DSKILQYCS 86
           ++ N + +F G  F +RR ++ LP++A    + Q+E DE +          DS +  Y  
Sbjct: 224 SNGNLDMEFRGAFFPLRRNNWSLPNRAV---EPQLEDDERMVYVAFSEFFFDSAMESYFQ 280

Query: 87  IDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELMWEGSS---VVMLSSAE 143
               +   +G+     L  L    Y G  V+ +    N   +LM E +S     +  S  
Sbjct: 281 AGALQLTLVGDKVPSDLDMLLRATYFGSIVLLSPTVINSPLKLMLEATSPPRCTIKPSGT 340

Query: 144 QKFLEASMAYVAGKPIMSDEEYDK------LKQKLKMEGSEIVVEGPRCSLRSRKVYSDL 197
              + AS+      P++ + E  K      L  KL + G  + V   +  LR  ++YS+ 
Sbjct: 341 TISITASVTITLAPPMLPEVELSKMIMEGRLSAKLTLRGKALRV---KLDLRRFQIYSNQ 397

Query: 198 S-VDYLKMLLLNVP-ATVVALGLFFFLDDIT--GFEITYLLELPEPFSFI------FTWF 247
           S ++ L ++ L  P  T++ +G+   L++ T  G +I     LPE  +F+         F
Sbjct: 398 SALESLALIPLQAPLKTLLQIGVMPLLNERTWRGVQI----PLPEGINFVREVVTNHAGF 453

Query: 248 AAVPLIVYLSQSLTKLI 264
             V   ++L++ L ++I
Sbjct: 454 VTVGADLHLAKGLREVI 470


>gi|218667573|ref|YP_002425666.1| NAD-dependent DNA ligase [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|218519786|gb|ACK80372.1| DNA ligase, NAD-dependent, putative [Acidithiobacillus ferrooxidans
           ATCC 23270]
          Length = 609

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 144 QKFLEASMAYVAGKPIMSDEEYDKLKQKLKME 175
            K  EA+ AY AG PIMSD EYD L ++L+ E
Sbjct: 5   HKLEEANAAYRAGAPIMSDAEYDALVERLRRE 36


>gi|357611448|gb|EHJ67494.1| hypothetical protein KGM_19952 [Danaus plexippus]
          Length = 1142

 Score = 40.0 bits (92), Expect = 1.7,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 23/50 (46%), Gaps = 5/50 (10%)

Query: 275 CPNCGTENVSFFGTILSISSGGTTNTINCSNLTFCFSCGTTMVYDSNTRL 324
           CPNC TE   FFGT+L        N  +  N   C  CG T   D N R+
Sbjct: 946 CPNCPTEVFRFFGTLLK-----HVNNKHTDNNIICVYCGQTFRRDQNLRV 990


>gi|338718006|ref|XP_001496317.3| PREDICTED: LOW QUALITY PROTEIN: protein kintoun-like [Equus
           caballus]
          Length = 416

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 49/108 (45%), Gaps = 8/108 (7%)

Query: 26  AALAPFPSSSRAHSNSNTQFNGRQFTVRR---RSFVLPSKATTDQQGQV---EGDEVVDS 79
           A L    +S + H N  T   G   T +     S   PS+ + D  G V   E D    +
Sbjct: 247 AKLQECSNSEQLHENEETVNEGSHLTEKESTEHSTSFPSE-SADSSGAVQVLETDSCGSA 305

Query: 80  KILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKE 127
             LQ  S+D   K SLG+ EQ   +    F  E +AV SNEE DNLKE
Sbjct: 306 AHLQQESLDIS-KMSLGKSEQSESKMEPEFIKEKRAVYSNEEKDNLKE 352


>gi|448475427|ref|ZP_21603082.1| Mur ligase family CapB protein [Halorubrum aidingense JCM 13560]
 gi|445816419|gb|EMA66316.1| Mur ligase family CapB protein [Halorubrum aidingense JCM 13560]
          Length = 402

 Score = 38.1 bits (87), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 28/137 (20%), Positives = 64/137 (46%), Gaps = 13/137 (9%)

Query: 56  SFVLPSKATTDQQGQVEGDEVVDSKILQYCSIDKKEKKSLGELEQEFLQALQAFYYEGKA 115
           + V+ ++A T+   ++  +  +   +L +C++ +    +LGE  Q+  ++      EG  
Sbjct: 124 AIVMENQAITEYTMRMVNERFLRPNVLLFCNVRQDHNDTLGETRQDIARSFARSVPEGCH 183

Query: 116 VMSNEE----FDNLKEELMWEGSSV---------VMLSSAEQKFLEASMAYVAGKPIMSD 162
           V+S+E+     D LKEE+   G+++           L+ AE   +   +    G+P + D
Sbjct: 184 VISDEQHGAIHDYLKEEIEARGATIEQVDVPERHRHLTGAETVHMLNHVLDHLGEPRIDD 243

Query: 163 EEYDKLKQKLKMEGSEI 179
            E   +   ++ E +++
Sbjct: 244 AELHSMLDAIQPEWTDL 260


>gi|331086070|ref|ZP_08335153.1| DNA ligase [Lachnospiraceae bacterium 9_1_43BFAA]
 gi|330406993|gb|EGG86498.1| DNA ligase [Lachnospiraceae bacterium 9_1_43BFAA]
          Length = 653

 Score = 37.4 bits (85), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 29/42 (69%)

Query: 89  KKEKKSLGELEQEFLQALQAFYYEGKAVMSNEEFDNLKEELM 130
           +++K+ + EL Q   QA +A+Y + K +MSN E+D L +EL+
Sbjct: 2   EQKKQRMQELVQLLNQAGKAYYQDAKEIMSNHEYDRLYDELL 43


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.133    0.379 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,895,798,464
Number of Sequences: 23463169
Number of extensions: 195806891
Number of successful extensions: 573072
Number of sequences better than 100.0: 124
Number of HSP's better than 100.0 without gapping: 93
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 572693
Number of HSP's gapped (non-prelim): 176
length of query: 333
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 190
effective length of database: 9,003,962,200
effective search space: 1710752818000
effective search space used: 1710752818000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)