BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022528
         (295 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225427403|ref|XP_002263777.1| PREDICTED: uncharacterized protein LOC100265501 [Vitis vinifera]
 gi|296088391|emb|CBI37382.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 204/290 (70%), Positives = 229/290 (78%), Gaps = 9/290 (3%)

Query: 9   IALPTVPI------PVPSIPRRQA-NCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNS 61
           + L T+PI      P+PS+   +  +CR F +K  + +F G +I  F    R      NS
Sbjct: 1   MTLSTIPIASRISIPIPSLQNPKVLSCRSFQVK-KDGSFCGPKIAAFK-MSRNLEFKANS 58

Query: 62  VSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
           VS D + S+  +VPFPSDYSE+L+QAK A ELA+KD  +LMEIEFPTAGL+SVPGD EGG
Sbjct: 59  VSGDSSASVGFNVPFPSDYSEILEQAKEATELALKDKKQLMEIEFPTAGLESVPGDGEGG 118

Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
           IEMTGSM+LI EFCD+F+ PEK TRTRIFFPEANEVKFAR+S F GASFKLDYLTKPS F
Sbjct: 119 IEMTGSMQLIREFCDIFINPEKATRTRIFFPEANEVKFARQSAFGGASFKLDYLTKPSLF 178

Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
           EDFGF  KVKMADRVK EDELFLVAYPYFNVNEMLVVEELY EAV NTA KLIIFNGELD
Sbjct: 179 EDFGFVTKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYNEAVVNTARKLIIFNGELD 238

Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
           RIRSGYYP FFYPKLAAL+K+L P MET+YYIHNFKGR GGTLFR   GP
Sbjct: 239 RIRSGYYPPFFYPKLAALTKSLLPKMETVYYIHNFKGRKGGTLFRCYPGP 288


>gi|356496430|ref|XP_003517071.1| PREDICTED: uncharacterized protein LOC100805878 [Glycine max]
          Length = 324

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 200/288 (69%), Positives = 226/288 (78%), Gaps = 4/288 (1%)

Query: 6   NSAIALPTVPIPVPSIPRRQ-ANCRKFSIKCSNE-NFSGQRIITFSPYRRKHSCLTNSVS 63
           +S + L   PI  PS+P    A    FS+K        G    + +P  RK +  T SVS
Sbjct: 4   SSTMILSNSPIASPSLPTSTGAKLETFSLKNDGVIRIRGATASSVAPRIRKTA--TCSVS 61

Query: 64  SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
            DGN S+  DVPFP+DYSELL+QA++AA+LA+KD  +LMEIEFPTAGL SVPGD EGGIE
Sbjct: 62  KDGNASVETDVPFPADYSELLEQARVAADLAIKDNRQLMEIEFPTAGLGSVPGDGEGGIE 121

Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
           MT SM+LI EFCD F++ EK TRTRIFFPEA+EV FAR+SVF G SFKLDYLTKPSFFED
Sbjct: 122 MTESMQLIREFCDRFISSEKATRTRIFFPEASEVDFARQSVFSGCSFKLDYLTKPSFFED 181

Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
           FGF EK+KM+DRVK  DELFLV YPYFNVNE+LVVEELYKEAV NT  KLIIFNGELDRI
Sbjct: 182 FGFVEKIKMSDRVKTGDELFLVGYPYFNVNEILVVEELYKEAVLNTERKLIIFNGELDRI 241

Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
           RSGYYPSFFYPKLAAL+KT  P+MET+YYIHNFKGRNGGTLFR   GP
Sbjct: 242 RSGYYPSFFYPKLAALTKTFLPMMETVYYIHNFKGRNGGTLFRCYPGP 289


>gi|224071439|ref|XP_002303460.1| predicted protein [Populus trichocarpa]
 gi|222840892|gb|EEE78439.1| predicted protein [Populus trichocarpa]
          Length = 260

 Score =  382 bits (981), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/227 (82%), Positives = 201/227 (88%)

Query: 67  NNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTG 126
           ++S+  DVPFP DY ELLDQAK A ELA +D  +LMEIEFPTAGL+SVPGD EGGIEMTG
Sbjct: 2   SSSVEFDVPFPRDYEELLDQAKKATELAWEDNKQLMEIEFPTAGLESVPGDGEGGIEMTG 61

Query: 127 SMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGF 186
           SM+LI EFCD FV+PEK TRTRIFFPEANEVKFAR+S FEG+S KLDYLTKPSFFEDFGF
Sbjct: 62  SMQLIREFCDRFVSPEKTTRTRIFFPEANEVKFARQSAFEGSSLKLDYLTKPSFFEDFGF 121

Query: 187 TEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
            EKVKM DRVK EDELFLVAYPYFNVNEMLVVEELYKEAV  TA KLIIFNGELDRIRSG
Sbjct: 122 VEKVKMTDRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVETARKLIIFNGELDRIRSG 181

Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           YYPSFFYPKLA+L KTLFP+MET+YYIHNFKGRNGGTLFR   GP +
Sbjct: 182 YYPSFFYPKLASLLKTLFPLMETVYYIHNFKGRNGGTLFRCYPGPWQ 228


>gi|255557645|ref|XP_002519852.1| conserved hypothetical protein [Ricinus communis]
 gi|223540898|gb|EEF42456.1| conserved hypothetical protein [Ricinus communis]
          Length = 316

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/231 (78%), Positives = 199/231 (86%), Gaps = 1/231 (0%)

Query: 62  VSSDGNNS-INVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
           VS +G++S +  DVP P DY ELL QAK A +LA+KDG +LMEIEFPTAGL+SVPGD EG
Sbjct: 52  VSRNGSSSSVESDVPLPRDYEELLVQAKKATDLALKDGKQLMEIEFPTAGLESVPGDGEG 111

Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
           GIEMT SM+LI +FCD FV+PEK  RTR+FFPEANEVKFAR+S F G+S KLDYLTKPSF
Sbjct: 112 GIEMTESMQLIRQFCDRFVSPEKAARTRVFFPEANEVKFARESAFGGSSLKLDYLTKPSF 171

Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
           FEDFGF EK+KM DRVK EDELFLVAYPYFNVNEMLVVEELY EAV NT  K+IIFNGEL
Sbjct: 172 FEDFGFVEKIKMTDRVKPEDELFLVAYPYFNVNEMLVVEELYNEAVVNTTRKMIIFNGEL 231

Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
           DRIRSGYYPSFFYPKLA+L KTLFPVMET+YYIHNFKGR GGTLFR   GP
Sbjct: 232 DRIRSGYYPSFFYPKLASLLKTLFPVMETVYYIHNFKGRKGGTLFRCYPGP 282


>gi|449456759|ref|XP_004146116.1| PREDICTED: uncharacterized protein LOC101209709 [Cucumis sativus]
 gi|449509516|ref|XP_004163611.1| PREDICTED: uncharacterized LOC101209709 [Cucumis sativus]
          Length = 336

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/301 (64%), Positives = 221/301 (73%), Gaps = 20/301 (6%)

Query: 4   SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSC---LTN 60
           +S+S IA   +P+P P +      C + S + ++ +     +  F P R        L+N
Sbjct: 9   ASSSTIATAVLPLPSPKLA-----CFRISHRRTHRSSVSSSMFEFMPRRHLRVLPPNLSN 63

Query: 61  SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
             SS  N S ++DVPFP DYS+LL+QAK A E A+ D  +LMEIEFPTAGL+SVPGD EG
Sbjct: 64  RQSS--NASTDLDVPFPRDYSDLLNQAKKATEAALIDNKQLMEIEFPTAGLESVPGDGEG 121

Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRI----------FFPEANEVKFARKSVFEGASF 170
           GIEMT SM+LI +FCD F+ P K TRTR+          FFPEANEVKFAR + FEG SF
Sbjct: 122 GIEMTESMQLIRQFCDCFIDPLKATRTRVTVSIKENHIQFFPEANEVKFARNTAFEGVSF 181

Query: 171 KLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTA 230
           KLDYLTKPSFFEDFGF EKVKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT 
Sbjct: 182 KLDYLTKPSFFEDFGFVEKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVQNTT 241

Query: 231 WKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEG 290
            KLIIFNGELDRIRSGYYP FFYPKLAAL KTLFP MET+YYIHNFKG+ GG LFR   G
Sbjct: 242 RKLIIFNGELDRIRSGYYPPFFYPKLAALMKTLFPEMETVYYIHNFKGQKGGVLFRSYPG 301

Query: 291 P 291
           P
Sbjct: 302 P 302


>gi|109289908|gb|AAP45177.2| hypothetical protein SBB1_14t00013 [Solanum bulbocastanum]
          Length = 338

 Score =  360 bits (923), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 187/259 (72%), Positives = 199/259 (76%), Gaps = 29/259 (11%)

Query: 61  SVSSDGNNSINVDVPFPSDYSELLDQ----------------------------AKMAAE 92
           S S D   SI  DVPFP DY+ELL Q                            AK A E
Sbjct: 47  SCSGDRAASIGFDVPFPKDYTELLQQVFILFAFSPLKIGGRGSGNGGGITREIKAKEATE 106

Query: 93  LAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFP 152
           LA+KD  +LMEIEFPTAGL SVPGD EGGIEMTGS++LI EFCDL V PEK T+TRIFFP
Sbjct: 107 LALKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTGSIQLIREFCDLLVIPEKATKTRIFFP 166

Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNV 212
           EANEVKFAR+S+F GASFKLDYLTKPSFFEDFGFTEKVKMADRVK EDELF+VAYPYFNV
Sbjct: 167 EANEVKFARQSIFGGASFKLDYLTKPSFFEDFGFTEKVKMADRVKPEDELFIVAYPYFNV 226

Query: 213 NEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY 272
           NEMLVVEELY+ AV NT+ KLIIFNGELDRIRS  YP FFYPKLAALSKTLFP MET+YY
Sbjct: 227 NEMLVVEELYQAAVLNTSRKLIIFNGELDRIRSD-YPPFFYPKLAALSKTLFPKMETVYY 285

Query: 273 IHNFKGRNGGTLFRFLEGP 291
           IHNFKGRNGG LFR   GP
Sbjct: 286 IHNFKGRNGGVLFRCYPGP 304


>gi|357146418|ref|XP_003573985.1| PREDICTED: uncharacterized protein LOC100843789 [Brachypodium
           distachyon]
          Length = 322

 Score =  358 bits (920), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 191/292 (65%), Positives = 211/292 (72%), Gaps = 5/292 (1%)

Query: 1   MPLSSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRK-HSCLT 59
           M ++++ +I+ PT      S+P +Q     F I  S+    G   +     R   H C  
Sbjct: 1   MAMATSYSISNPTF-TSKSSLPNKQVPNWIFPIISSDNGSGGMFTLARRSLRAGFHVC-- 57

Query: 60  NSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
            +V+ D N        FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL SVPGD E
Sbjct: 58  -AVTGDQNTRNVFSANFPSDYTELLLQAKDAAESAFKDGKQLLEIEFPTAGLQSVPGDGE 116

Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
           GGIEMTGSM LI EFCD FV  EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS
Sbjct: 117 GGIEMTGSMLLIREFCDRFVPAEKTTRTRIFFPEANEVTFARQSAFEGCSLKLDYLTKPS 176

Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
            FEDFGFT KVKMADRV+ EDE+FLVAYPYFNVNEMLVVEELYKEAV NT  K+IIFNGE
Sbjct: 177 LFEDFGFTTKVKMADRVQPEDEIFLVAYPYFNVNEMLVVEELYKEAVVNTDRKMIIFNGE 236

Query: 240 LDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
           LDRIRSGYYP FFYPKLA LSKT  P MET+YYIHNFKG  GG LFR   GP
Sbjct: 237 LDRIRSGYYPPFFYPKLAELSKTFLPKMETVYYIHNFKGSKGGALFRCYPGP 288


>gi|113208412|gb|ABI34553.1| hypothetical protein SBB1_21t00009 [Solanum bulbocastanum]
          Length = 338

 Score =  358 bits (920), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 187/261 (71%), Positives = 200/261 (76%), Gaps = 29/261 (11%)

Query: 61  SVSSDGNNSINVDVPFPSDYSELLDQ----------------------------AKMAAE 92
           S S D   SI  DVPFP DY+ELL Q                            AK A E
Sbjct: 47  SCSGDRAASIGFDVPFPKDYTELLQQVFILFAFSPLKIGGWGSRNRGGITREIKAKEATE 106

Query: 93  LAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFP 152
           LA+KD  +LMEIEFPTAGL SVPGD EGGIEMTGS++LI EFCDL V PEK T+TRIFFP
Sbjct: 107 LALKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTGSIQLIREFCDLLVIPEKATKTRIFFP 166

Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNV 212
           EANEVKFAR+S+F GASFKLDYLTKPSFFEDFGFTEKVKMADRVK EDELF+VAYPYFNV
Sbjct: 167 EANEVKFARQSIFGGASFKLDYLTKPSFFEDFGFTEKVKMADRVKPEDELFIVAYPYFNV 226

Query: 213 NEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY 272
           NEMLVVEELY+ AV NT+ KLIIFNGELDRIRS  YP FFYPKLAALSKTLFP MET+YY
Sbjct: 227 NEMLVVEELYQAAVLNTSRKLIIFNGELDRIRSD-YPPFFYPKLAALSKTLFPKMETVYY 285

Query: 273 IHNFKGRNGGTLFRFLEGPQE 293
           IHNFKGRNGG LFR   GP +
Sbjct: 286 IHNFKGRNGGVLFRCYPGPWK 306


>gi|224034407|gb|ACN36279.1| unknown [Zea mays]
 gi|413926746|gb|AFW66678.1| hypothetical protein ZEAMMB73_267474 [Zea mays]
          Length = 324

 Score =  352 bits (904), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 183/283 (64%), Positives = 213/283 (75%), Gaps = 3/283 (1%)

Query: 4   SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVS 63
           +S  ++A P +    P + ++ +N    +I  SN N +G  + T +  + ++     +V+
Sbjct: 5   TSYGSMANPPITSRTPFLSKQASNWIPATI--SNGNGTGG-MFTVASRKSRNGFQFCAVT 61

Query: 64  SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
            D  +    DV FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL +VPGD EGG E
Sbjct: 62  GDPGSRNVSDVNFPSDYTELLTQAKEAAESAFKDGKQLLEIEFPTAGLQTVPGDGEGGNE 121

Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
           MTGSM LI EFCD FV  EK TRTR+FFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 122 MTGSMLLIREFCDRFVPAEKATRTRVFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 181

Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
           FGFT KVKMADRVK +DE FLVAYPYFNVNEMLVVEELYKEAV  T+ KLIIFNGELDRI
Sbjct: 182 FGFTTKVKMADRVKPQDETFLVAYPYFNVNEMLVVEELYKEAVVGTSRKLIIFNGELDRI 241

Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
           RSGYYP+FFYPKLA LSKT  P ++T+YYIHNFKG  GGTLFR
Sbjct: 242 RSGYYPAFFYPKLAELSKTFLPKLDTVYYIHNFKGAKGGTLFR 284


>gi|18422955|ref|NP_568702.1| uncharacterized protein [Arabidopsis thaliana]
 gi|14326508|gb|AAK60299.1|AF385707_1 AT5g48790/K24G6_12 [Arabidopsis thaliana]
 gi|18700216|gb|AAL77718.1| AT5g48790/K24G6_12 [Arabidopsis thaliana]
 gi|332008342|gb|AED95725.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 316

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 174/234 (74%), Positives = 192/234 (82%), Gaps = 1/234 (0%)

Query: 61  SVSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
           SVS   NN+ +VD VPFP DY EL++QAK A E+A+KD  +LMEIEFPT+GL SVPGD E
Sbjct: 51  SVSGGYNNNTSVDNVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGE 110

Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
           G  EMT S+ +I EFCD  + PEK   TRIFFPEANEVKFA+K+VF G  FKLDYLTKPS
Sbjct: 111 GATEMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPS 170

Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
            FEDFGF E+VKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT  KLIIFNGE
Sbjct: 171 LFEDFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGE 230

Query: 240 LDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           LDRIRSGYYP FFYPKLAAL+KTL P MET+YYIHNFKG+ GG LFR   GP +
Sbjct: 231 LDRIRSGYYPKFFYPKLAALTKTLLPKMETVYYIHNFKGQKGGVLFRCYPGPWQ 284


>gi|326523775|dbj|BAJ93058.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 322

 Score =  350 bits (897), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 184/275 (66%), Positives = 199/275 (72%), Gaps = 6/275 (2%)

Query: 19  PSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSIN--VDVPF 76
           PS P +Q    +F    S +   G  I   S  RR      N  +  G+ S        F
Sbjct: 18  PSAPHKQVPNWRFPTINSGDG--GGSIFAIS--RRNLRTWFNVCAVTGDQSTRDVFSADF 73

Query: 77  PSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCD 136
           PSDY+EL+ QAK A E A KDG +L+EIEFPTAGL SVPGD EGGIEMTGSM LI EFCD
Sbjct: 74  PSDYTELIVQAKEATESAFKDGKQLLEIEFPTAGLQSVPGDGEGGIEMTGSMLLIREFCD 133

Query: 137 LFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRV 196
            FV  EKVTRTRIFFPEA EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRV
Sbjct: 134 RFVPAEKVTRTRIFFPEAKEVTFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRV 193

Query: 197 KLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKL 256
           + EDE+FLVAYPYFNVNEMLVVEELYKEAV NT  K+IIFNGELDRIRSGYYP FFYPKL
Sbjct: 194 RPEDEIFLVAYPYFNVNEMLVVEELYKEAVLNTERKMIIFNGELDRIRSGYYPPFFYPKL 253

Query: 257 AALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
             LSKT  P +ET+YYIHNFKG  GG LFR   GP
Sbjct: 254 GELSKTFLPKLETVYYIHNFKGSKGGVLFRCYPGP 288


>gi|413926747|gb|AFW66679.1| hypothetical protein ZEAMMB73_267474 [Zea mays]
          Length = 310

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 177/251 (70%), Positives = 198/251 (78%), Gaps = 1/251 (0%)

Query: 36  SNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAV 95
           SN N +G  + T +  + ++     +V+ D  +    DV FPSDY+ELL QAK AAE A 
Sbjct: 21  SNGNGTGG-MFTVASRKSRNGFQFCAVTGDPGSRNVSDVNFPSDYTELLTQAKEAAESAF 79

Query: 96  KDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEAN 155
           KDG +L+EIEFPTAGL +VPGD EGG EMTGSM LI EFCD FV  EK TRTR+FFPEAN
Sbjct: 80  KDGKQLLEIEFPTAGLQTVPGDGEGGNEMTGSMLLIREFCDRFVPAEKATRTRVFFPEAN 139

Query: 156 EVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEM 215
           EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRVK +DE FLVAYPYFNVNEM
Sbjct: 140 EVSFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRVKPQDETFLVAYPYFNVNEM 199

Query: 216 LVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN 275
           LVVEELYKEAV  T+ KLIIFNGELDRIRSGYYP+FFYPKLA LSKT  P ++T+YYIHN
Sbjct: 200 LVVEELYKEAVVGTSRKLIIFNGELDRIRSGYYPAFFYPKLAELSKTFLPKLDTVYYIHN 259

Query: 276 FKGRNGGTLFR 286
           FKG  GGTLFR
Sbjct: 260 FKGAKGGTLFR 270


>gi|226494690|ref|NP_001145598.1| uncharacterized protein LOC100279074 [Zea mays]
 gi|195658649|gb|ACG48792.1| hypothetical protein [Zea mays]
          Length = 310

 Score =  348 bits (892), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 176/251 (70%), Positives = 198/251 (78%), Gaps = 1/251 (0%)

Query: 36  SNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAV 95
           SN N +G  + T +  + ++     +V+ D  +    DV FPSDY+ELL QAK AAE A 
Sbjct: 21  SNGNGTGG-MFTVASRKSRNGFQFCAVTGDPGSRNVSDVNFPSDYTELLTQAKEAAESAF 79

Query: 96  KDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEAN 155
           KDG +L+EIEFPTAGL +VPGD EGG EMTGSM LI EFCD FV  EK TRTR+FFPEAN
Sbjct: 80  KDGKQLLEIEFPTAGLQTVPGDGEGGNEMTGSMLLIREFCDRFVPAEKATRTRVFFPEAN 139

Query: 156 EVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEM 215
           EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRVK +DE FLVAYPYFNVNEM
Sbjct: 140 EVSFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRVKPQDETFLVAYPYFNVNEM 199

Query: 216 LVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN 275
           LVVEELYKEAV  T+ KLIIFNGELDRIRSGYYP+FFYPKLA LS+T  P ++T+YYIHN
Sbjct: 200 LVVEELYKEAVVGTSRKLIIFNGELDRIRSGYYPAFFYPKLAELSRTFLPKLDTVYYIHN 259

Query: 276 FKGRNGGTLFR 286
           FKG  GGTLFR
Sbjct: 260 FKGAKGGTLFR 270


>gi|297795571|ref|XP_002865670.1| hypothetical protein ARALYDRAFT_494942 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311505|gb|EFH41929.1| hypothetical protein ARALYDRAFT_494942 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 315

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 172/233 (73%), Positives = 191/233 (81%)

Query: 61  SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
           SVS   NN+   +VPFP DY EL++QAK A ELA+KD  +LMEIEFPT+GL SVPGDSEG
Sbjct: 51  SVSGGYNNTSVDNVPFPRDYFELINQAKEAVELAMKDEKQLMEIEFPTSGLASVPGDSEG 110

Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
             EMT S+ +I EFCD  + PEK   TRIFFPEANEVKFA+K+VF G  FKLDYLTKPS 
Sbjct: 111 ATEMTESINMIREFCDRLLAPEKARTTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPSL 170

Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
           FEDFGF E+VKM+DRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT  KLIIFNGEL
Sbjct: 171 FEDFGFFERVKMSDRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGEL 230

Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           DRIRSGYYP FFYPKLAAL+KTL P M+T+YYIHNFKG+ GG LFR   GP +
Sbjct: 231 DRIRSGYYPKFFYPKLAALTKTLLPKMDTVYYIHNFKGQKGGVLFRCYPGPWQ 283


>gi|357484699|ref|XP_003612637.1| hypothetical protein MTR_5g027220 [Medicago truncatula]
 gi|355513972|gb|AES95595.1| hypothetical protein MTR_5g027220 [Medicago truncatula]
          Length = 365

 Score =  345 bits (886), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 181/289 (62%), Positives = 203/289 (70%), Gaps = 49/289 (16%)

Query: 50  PYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKL-------- 101
           P  RK +  + S S DGN S+  D+PFP DYSELL+QAK+A EL ++  MK+        
Sbjct: 44  PTSRKLARCSVSASGDGNASVQTDIPFPFDYSELLEQAKVAVELQLR--MKVAHSKLRSE 101

Query: 102 ------------MEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRI 149
                        EIEFPTAGL+SVPGD EGGIEMTGSM+LI EFCDL ++ EK+TRTRI
Sbjct: 102 STIEPIMNNDLKQEIEFPTAGLESVPGDGEGGIEMTGSMQLIREFCDLSISAEKITRTRI 161

Query: 150 ---------------------------FFPEANEVKFARKSVFEGASFKLDYLTKPSFFE 182
                                      FFPEANEV FAR+S F GASFKLDYLTKPSFF+
Sbjct: 162 MVMRKNLLCNTSSSLPVEIDVQPCKNQFFPEANEVDFARQSAFSGASFKLDYLTKPSFFQ 221

Query: 183 DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR 242
           DFGF EKVKM+DRVK EDELF+VAYPYFNVNEMLVVEELYKEAV NT  KLIIFNGELDR
Sbjct: 222 DFGFVEKVKMSDRVKAEDELFVVAYPYFNVNEMLVVEELYKEAVVNTERKLIIFNGELDR 281

Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
           IRSGYYP FFYPKLA L+K+  P MET+YYIHNFKGR+ G LFR   GP
Sbjct: 282 IRSGYYPPFFYPKLAGLTKSFLPSMETVYYIHNFKGRDRGILFRCYPGP 330


>gi|242063910|ref|XP_002453244.1| hypothetical protein SORBIDRAFT_04g002440 [Sorghum bicolor]
 gi|241933075|gb|EES06220.1| hypothetical protein SORBIDRAFT_04g002440 [Sorghum bicolor]
          Length = 322

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 179/283 (63%), Positives = 207/283 (73%), Gaps = 5/283 (1%)

Query: 4   SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVS 63
           +S  ++  P +    P + ++ +N     I  +  N +G  + T +    ++     +V+
Sbjct: 5   TSCGSMTKPPITFKTPFVNKQASNW----IPATISNGTGG-MFTVASRNSRNGFQVRAVT 59

Query: 64  SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
            D  +    DV FP+DY++LL QAK AAE A KDG +L+EIEFPTAGL +VPGD EGG E
Sbjct: 60  GDPGSRNASDVKFPTDYTQLLMQAKEAAESAFKDGKQLLEIEFPTAGLQTVPGDGEGGNE 119

Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
           MTGSM LI EFCD FV  EK TRTR+FFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 120 MTGSMLLIREFCDRFVPAEKSTRTRVFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 179

Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
           FGFT KVKMADRVK EDE FLVAYPYFNVNEMLVVEELY EAV  T  KLIIFNGELDRI
Sbjct: 180 FGFTTKVKMADRVKPEDETFLVAYPYFNVNEMLVVEELYNEAVVGTNRKLIIFNGELDRI 239

Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
           RSGYYPSFFYPKLA LSKT  P ++T+YYIHNFKG  GGTLFR
Sbjct: 240 RSGYYPSFFYPKLAELSKTFLPKLDTVYYIHNFKGVKGGTLFR 282


>gi|125580675|gb|EAZ21606.1| hypothetical protein OsJ_05234 [Oryza sativa Japonica Group]
 gi|218189983|gb|EEC72410.1| hypothetical protein OsI_05707 [Oryza sativa Indica Group]
          Length = 338

 Score =  335 bits (859), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 186/307 (60%), Positives = 206/307 (67%), Gaps = 24/307 (7%)

Query: 5   SNSAIALPTVPIPVPSIPRRQANCRKF-SIKCSNENFSGQRIITFSPYRRK--HSCLTNS 61
           + S  ++   P+   S P +Q       +I     N++G    T     R   H C  N 
Sbjct: 2   ATSYCSISNPPLSKTSFPNKQVPGWVLRAISKGKGNYTGGIYTTTKRNLRTGFHVCAVNG 61

Query: 62  VSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
               G  + NV    FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL SVPGDSEG
Sbjct: 62  ----GQGTRNVSGAEFPSDYTELLAQAKEAAESAFKDGKQLLEIEFPTAGLQSVPGDSEG 117

Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
           GIEMTGSM LI EFCD FV  EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS 
Sbjct: 118 GIEMTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYLTKPSL 177

Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
           FEDFGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T  KLIIFNGEL
Sbjct: 178 FEDFGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLIIFNGEL 237

Query: 241 DRIR----------------SGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTL 284
           DRIR                   YP FFYPKLA LSKT  P +ET+YYIHNFKG  GGTL
Sbjct: 238 DRIRMLVTFLNKREAALMMFENNYPPFFYPKLAELSKTFLPKLETVYYIHNFKGLKGGTL 297

Query: 285 FRFLEGP 291
           FR   GP
Sbjct: 298 FRCYPGP 304


>gi|116793457|gb|ABK26754.1| unknown [Picea sitchensis]
          Length = 337

 Score =  324 bits (831), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 156/223 (69%), Positives = 181/223 (81%)

Query: 71  NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
           ++DV FP DYSELL Q K+A + A+ D   L+EIEFPTAGLDSV GD+EGGIEM  SM L
Sbjct: 83  DIDVEFPGDYSELLQQVKVATQSALMDSKYLLEIEFPTAGLDSVSGDAEGGIEMNSSMTL 142

Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
           I EFC  F+ PE+ TRTRIFFPEA EV+FA+K+VFEG +FK+DYLTKPS  EDFGF  KV
Sbjct: 143 IREFCRRFLKPEEATRTRIFFPEAKEVEFAKKTVFEGVAFKMDYLTKPSLLEDFGFGTKV 202

Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
           KMA+RV+  DE+FLVAYPYFNV+EMLVVEELYK+AV +T  KLIIFNGELDRIRSGYYP 
Sbjct: 203 KMAERVQPTDEIFLVAYPYFNVDEMLVVEELYKDAVVHTDRKLIIFNGELDRIRSGYYPP 262

Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           FFYPK+ AL++   P +ET YYIHNFKGR GGTLFR   GP +
Sbjct: 263 FFYPKIGALARNFLPKLETAYYIHNFKGRVGGTLFRSYPGPWQ 305


>gi|356522807|ref|XP_003530035.1| PREDICTED: uncharacterized protein LOC100802995 [Glycine max]
          Length = 323

 Score =  307 bits (786), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 150/190 (78%), Positives = 167/190 (87%), Gaps = 1/190 (0%)

Query: 71  NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
           + DVPFP+DYSELL+QA++AA+LA+KD  +LMEIEFPTAGL SVPGD EGGIEMT SM+L
Sbjct: 23  HTDVPFPADYSELLEQARVAADLAIKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTESMQL 82

Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
           I EFCD F++ EK TRTRIFFPEA+EV FAR+SVF G SFKLDYLT PSFFEDFGF EK+
Sbjct: 83  IREFCDRFISSEKATRTRIFFPEASEVDFARQSVFSGCSFKLDYLTNPSFFEDFGFVEKI 142

Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
           KM DRVK  DELFLV+YPYFN NE+LVVEELYKE V NT  KLIIFNGELDRIRSGYYPS
Sbjct: 143 KMLDRVKTGDELFLVSYPYFNANEILVVEELYKE-VLNTERKLIIFNGELDRIRSGYYPS 201

Query: 251 FFYPKLAALS 260
           FFYPKLAAL+
Sbjct: 202 FFYPKLAALT 211


>gi|9758878|dbj|BAB09432.1| unnamed protein product [Arabidopsis thaliana]
          Length = 248

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 140/187 (74%), Positives = 154/187 (82%), Gaps = 1/187 (0%)

Query: 61  SVSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
           SVS   NN+ +VD VPFP DY EL++QAK A E+A+KD  +LMEIEFPT+GL SVPGD E
Sbjct: 51  SVSGGYNNNTSVDNVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGE 110

Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
           G  EMT S+ +I EFCD  + PEK   TRIFFPEANEVKFA+K+VF G  FKLDYLTKPS
Sbjct: 111 GATEMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPS 170

Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
            FEDFGF E+VKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT  KLIIFNGE
Sbjct: 171 LFEDFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGE 230

Query: 240 LDRIRSG 246
           LDRIRSG
Sbjct: 231 LDRIRSG 237


>gi|168020280|ref|XP_001762671.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686079|gb|EDQ72470.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 280

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 139/241 (57%), Positives = 177/241 (73%), Gaps = 2/241 (0%)

Query: 53  RKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLD 112
           R     + S S   +  IN  V FP DY+EL++QA+ AA+ A+KD   L+E+EFPTAGLD
Sbjct: 9   RSFVVRSRSGSDPKSKIINKSVDFPKDYNELVNQARRAAQAALKDDKTLLEVEFPTAGLD 68

Query: 113 SVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKL 172
           +VPGD EGGIEM  S+ L+ EFC +F   ++   TRIFFP+A +++ A+ S+F+G SFKL
Sbjct: 69  TVPGDEEGGIEMNTSIVLMKEFCTIF--KDEAPTTRIFFPDAKDMELAKTSIFDGTSFKL 126

Query: 173 DYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
           DYLTKP+  EDFGF  KVKMADRV+  D +F+VAYPYFNVNEM+ VEELYK +   +   
Sbjct: 127 DYLTKPNGLEDFGFGSKVKMADRVQSSDTVFVVAYPYFNVNEMIAVEELYKGSAAASNRP 186

Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQ 292
           +I+FNGELDRIRSGYYPSFFYPKL +++K   P  ET+YYIHNFKGR+ G LFR   GP 
Sbjct: 187 IIVFNGELDRIRSGYYPSFFYPKLGSIAKEFLPKFETVYYIHNFKGRSRGVLFRMYPGPW 246

Query: 293 E 293
           +
Sbjct: 247 Q 247


>gi|302761398|ref|XP_002964121.1| hypothetical protein SELMODRAFT_166751 [Selaginella moellendorffii]
 gi|300167850|gb|EFJ34454.1| hypothetical protein SELMODRAFT_166751 [Selaginella moellendorffii]
          Length = 303

 Score =  263 bits (673), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 129/240 (53%), Positives = 166/240 (69%), Gaps = 3/240 (1%)

Query: 54  KHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDS 113
           K S      S DGN      VPFPSDY E++ QA+ A + A+ D  KL+E+E P AGL++
Sbjct: 28  KSSWRILRASRDGNVG---SVPFPSDYIEMVKQAQDACQAALDDSKKLLEVEVPPAGLNT 84

Query: 114 VPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLD 173
           V GD EGGIEM  SM ++ +FC    T EK  RTR+FFPE  E+  A+  VF+G+ FKLD
Sbjct: 85  VSGDEEGGIEMNISMEIVQKFCAGMFTGEKAPRTRVFFPELAEMNIAKSGVFDGSMFKLD 144

Query: 174 YLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKL 233
           YLTKPS ++D G  +KVKM++R +  D  F+VAYP+FN NEML VEELY+++   +   +
Sbjct: 145 YLTKPSPWDDIGLGKKVKMSERARPTDATFVVAYPFFNPNEMLAVEELYRDSAKESGCPI 204

Query: 234 IIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           I+ NG+LD+IR+GYYP FFYPKL AL+KT  P  ET+YYIHNFKGR  GTLFR   GP +
Sbjct: 205 IVINGDLDKIRNGYYPPFFYPKLGALAKTFLPDFETVYYIHNFKGRFAGTLFRAYPGPWQ 264


>gi|302820762|ref|XP_002992047.1| hypothetical protein SELMODRAFT_134592 [Selaginella moellendorffii]
 gi|300140169|gb|EFJ06896.1| hypothetical protein SELMODRAFT_134592 [Selaginella moellendorffii]
          Length = 303

 Score =  263 bits (673), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 127/232 (54%), Positives = 164/232 (70%), Gaps = 3/232 (1%)

Query: 62  VSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
            S DGN      VPFPSDY E++ QA+ A + A+ D  KL+E+E P AGL++V GD EGG
Sbjct: 36  ASRDGNVG---SVPFPSDYIEMVKQAQDACQAALDDSKKLLEVEVPPAGLNTVSGDEEGG 92

Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
           IEM  SM ++ +FC    T EK  RTR+FFPE  E+  A+  VF+G+ +KLDYLTKPS +
Sbjct: 93  IEMNISMEIVQKFCAGMFTGEKAPRTRVFFPELAEMNIAKSGVFDGSMYKLDYLTKPSPW 152

Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
           +D G  +KVKM++R +  D  F+VAYP+FN NEML VEELY+E+   +   +I+ NG+LD
Sbjct: 153 DDIGLGKKVKMSERTRPTDATFVVAYPFFNPNEMLAVEELYRESAKESGCPIIVINGDLD 212

Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           +IR+GYYP FFYPKL AL+KT  P  ET+YYIHNFKGR  GTLFR   GP +
Sbjct: 213 KIRNGYYPPFFYPKLGALAKTFLPDFETVYYIHNFKGRFAGTLFRAYPGPWQ 264


>gi|440583726|emb|CCH47228.1| hypothetical protein [Lupinus angustifolius]
          Length = 283

 Score =  231 bits (590), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 137/248 (55%), Positives = 153/248 (61%), Gaps = 46/248 (18%)

Query: 42  GQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKL 101
           G    +  P  RK   LT    SD + S   +VPFP+DY+ELL+QA++A ELA+KD  +L
Sbjct: 27  GGVTASLVPRNRK---LTGCSVSDVSASTETNVPFPTDYTELLEQARVAVELAMKDNRQL 83

Query: 102 MEIEFPTAGLDSVPG-------------------------------------DSEGGIEM 124
           MEIEFPTAGL SVPG                                     D EGGIEM
Sbjct: 84  MEIEFPTAGLASVPGSPYFTFLLFNFSFWIEFHCTLVPIYFQTDSYHSMISGDGEGGIEM 143

Query: 125 T----GSMRLICEFCDLFVTPEK--VTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKP 178
           T      M    +   L   P    +     FFPEA+EV FAR+SVF GASFKLDYLTKP
Sbjct: 144 TEIKTSVMINTPKILSLVTAPNSSGLLIYVQFFPEASEVDFARQSVFSGASFKLDYLTKP 203

Query: 179 SFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNG 238
           SFF+DFGF EKVKM+DRVK  DELFLVAYPYFNVNEMLVVEELYKEAV NT  KLIIFNG
Sbjct: 204 SFFQDFGFVEKVKMSDRVKAGDELFLVAYPYFNVNEMLVVEELYKEAVLNTERKLIIFNG 263

Query: 239 ELDRIRSG 246
           ELDRIRSG
Sbjct: 264 ELDRIRSG 271


>gi|115443993|ref|NP_001045776.1| Os02g0129300 [Oryza sativa Japonica Group]
 gi|113535307|dbj|BAF07690.1| Os02g0129300, partial [Oryza sativa Japonica Group]
          Length = 161

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 113/144 (78%), Positives = 121/144 (84%)

Query: 116 GDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYL 175
           GDSEGGIEMTGSM LI EFCD FV  EK TRTRIFFPEANEV FAR+S FEG S KLDYL
Sbjct: 7   GDSEGGIEMTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYL 66

Query: 176 TKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLII 235
           TKPS FEDFGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T  KLII
Sbjct: 67  TKPSLFEDFGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLII 126

Query: 236 FNGELDRIRSGYYPSFFYPKLAAL 259
           FNGELDRIRSG   +F   + AAL
Sbjct: 127 FNGELDRIRSGLLVTFLNKREAAL 150


>gi|215686777|dbj|BAG89627.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 147

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 105/136 (77%), Positives = 113/136 (83%)

Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
           MTGSM LI EFCD FV  EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 1   MTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 60

Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
           FGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T  KLIIFNGELDRI
Sbjct: 61  FGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLIIFNGELDRI 120

Query: 244 RSGYYPSFFYPKLAAL 259
           RSG   +F   + AAL
Sbjct: 121 RSGLLVTFLNKREAAL 136


>gi|307111351|gb|EFN59585.1| hypothetical protein CHLNCDRAFT_56449 [Chlorella variabilis]
          Length = 336

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 101/237 (42%), Positives = 139/237 (58%), Gaps = 22/237 (9%)

Query: 75  PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEF 134
           PFP DY++ + QA+ AA  A+ DG  L+E+EFPTA L +V GD+EG  EMT S++ + +F
Sbjct: 60  PFPGDYNQAVRQAQGAAAAALADGASLVEVEFPTASLVAVAGDAEGANEMTYSLQHLRQF 119

Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKS--------------VFEGASFKLDYLTKPSF 180
              +   ++   TRIFFP+  E+K A K               VFEG +FK  YL KP+ 
Sbjct: 120 MRGWK--DQAGTTRIFFPDPTELKVALKGKAMDPNAGSWTIDPVFEGTAFKFGYLMKPNP 177

Query: 181 FEDFGFT-EKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK-----LI 234
           F D G T  K+  AD++   ++L ++AYP+FN  EML V  L++        +     +I
Sbjct: 178 FLDMGITVGKINAADQLDGREQLLVMAYPHFNPQEMLEVAALHEYLAAQAGGREGATPII 237

Query: 235 IFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
            FN ELDRIR+GYYP FFYP +  ++K+L P   T YYI NFKG  GG +FR    P
Sbjct: 238 TFNAELDRIRTGYYPPFFYPAIGKIAKSLLPQFTTAYYIKNFKGATGGCIFRCYPSP 294


>gi|255080176|ref|XP_002503668.1| predicted protein [Micromonas sp. RCC299]
 gi|226518935|gb|ACO64926.1| predicted protein [Micromonas sp. RCC299]
          Length = 369

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 83/246 (33%), Positives = 126/246 (51%), Gaps = 26/246 (10%)

Query: 74  VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            PFP DY++++ Q + A +  + DG+ LMEI+FP  GL++ PGD EG +E   +++ +  
Sbjct: 90  TPFPKDYAQMVSQCQKALQHGLDDGLGLMEIQFPPGGLETAPGDVEGNMESNLTVQHLRG 149

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFAR------------------KSVFEGASF--KLD 173
            C  F   +    TR+FFP+  E K AR                  ++ F   ++   +D
Sbjct: 150 ICAQFERNKTAKTTRVFFPDPIEAKLARTGTNASPDGVRAPSNSETRAWFAPNNWPGPVD 209

Query: 174 YLTKPSFFEDFG----FTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAV--F 227
           +L  PSF    G      ++V   ++ K  D  F+VAYP  NV+E+    ELY+  +   
Sbjct: 210 FLESPSFLSVSGLDKVLNKRVSTWNKAKANDTAFVVAYPVSNVSELTCTRELYEGELGRG 269

Query: 228 NTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRF 287
             A  +++ NGEL+R R+ YYP F+     A  +    V E IY+IHNFKG N   LFR 
Sbjct: 270 TGARPIVVCNGELERTRTNYYPPFWNAGEMAPLREFVKVFEQIYFIHNFKGSNPAVLFRC 329

Query: 288 LEGPQE 293
             GP +
Sbjct: 330 YPGPWQ 335


>gi|145344528|ref|XP_001416783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577009|gb|ABO95076.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 277

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 78/241 (32%), Positives = 121/241 (50%), Gaps = 22/241 (9%)

Query: 75  PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEF 134
           PFP DY+EL  QA+ + +   KDG++L+E++FP  GL+   GD EG +E   +   +   
Sbjct: 1   PFPRDYAELERQARESVKRCAKDGVELVELQFPPGGLELASGDLEGNVECNLTTERLRGI 60

Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFA------------------RKSVFEGASFKLDYLT 176
           CD FV     + TR+ FP+  E++ A                   +  F      LDY+ 
Sbjct: 61  CDAFVANGTASTTRVLFPDPTEMRLATTGANAAPDGIRAPEQSDTRGWFADWKGTLDYVD 120

Query: 177 KPSFFEDFGFTE----KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
            PSF    GF +    K  +++R++ ++  ++VAYP  N++E+    +LY+  V  T   
Sbjct: 121 DPSFMSVSGFDKIFGGKKNISERMRGDETAYVVAYPSANISELANTRDLYEGCVRGTGKS 180

Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQ 292
           L++ NGEL+R RS YYP F+        ++     E  Y I+NFKG N   LFR    P 
Sbjct: 181 LVVCNGELERTRSNYYPPFWNAGEMGPLRSFCRKFEGAYVIYNFKGSNPAVLFRVYPEPW 240

Query: 293 E 293
           +
Sbjct: 241 Q 241


>gi|159481297|ref|XP_001698718.1| hypothetical protein CHLREDRAFT_205904 [Chlamydomonas reinhardtii]
 gi|158273612|gb|EDO99400.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 364

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 81/219 (36%), Positives = 117/219 (53%), Gaps = 25/219 (11%)

Query: 65  DGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEM 124
           +   ++    PFP+ Y   + QA+ A + A+ DG KL+E+EFP+  L SV GD EG  EM
Sbjct: 41  EAATAVQTPAPFPTSYVMAMRQAQEAVKAALADGAKLVEVEFPSTTLSSVSGDGEGQNEM 100

Query: 125 TGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKS--------------VFEGASF 170
             SM  +  F   F +  +   TR+FFP+  E+  AR                 F   +F
Sbjct: 101 NASMGYLRTFLGGFRS--RAASTRVFFPDNVELAVARSGQTEDPSAGRKALDPQFADVTF 158

Query: 171 KLDYLTKP-SFFEDFGFTEK----VKMADRVKLEDELFLVAYPYFNVNEML-VVEELYKE 224
           +L YLT+  + +  FGF +     VK+   VK  D+L +VAYP FN  E L  V ELY++
Sbjct: 159 QLGYLTEQNAAWAMFGFYKSAFDPVKL---VKDTDDLLVVAYPSFNPREELSAVYELYQQ 215

Query: 225 AVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTL 263
                   ++IFNGELDR+R GYYPS F+P++A   + +
Sbjct: 216 KAKARGMPIVIFNGELDRVRGGYYPSVFFPEIAVRQRAV 254


>gi|424513544|emb|CCO66166.1| predicted protein [Bathycoccus prasinos]
          Length = 423

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 132/284 (46%), Gaps = 42/284 (14%)

Query: 40  FSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGM 99
           F G+++    P  R       ++S + N       PFP+DY  ++ QA+ A + A +DG+
Sbjct: 103 FGGKKVEVVLPPSRALGSGRTTISKNTNGGRQY--PFPADYDVMVQQARQALQKAREDGV 160

Query: 100 KLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKF 159
            L EI+FP  GLD  PGD EG +E T +  ++ +        EK+T   + FP+  E+K 
Sbjct: 161 DLGEIQFPPGGLDLAPGDLEGNVECTLTATVLRKILRGMKEEEKIT---VLFPDPTELKL 217

Query: 160 ARKS--------------------VFEGASFKLDYLTKPSFFEDFG----FTEKVKMADR 195
           A++                     +FE    +L+YL  P+ F   G    F +   + DR
Sbjct: 218 AKRGQTGMCAPDGVAPPEVFQTDPLFEDWRGELNYLDDPNAFSVSGLDKIFGKSATVNDR 277

Query: 196 VKL-EDELFLVAYPYFNVNEMLVVEELYKE-----------AVFNTAWK-LIIFNGELDR 242
           V + E  +F+ AYP  N+ E+     LY+            +   T  K L++ NGELDR
Sbjct: 278 VDINEGNMFVCAYPSGNIAELTQTRLLYENIREENESDAPASKIKTKRKSLVVVNGELDR 337

Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
            RS YYP F+        +      E IY+IHNFKG N   LFR
Sbjct: 338 TRSNYYPWFWNKNEMEPLREFSQSFEGIYFIHNFKGTNPAVLFR 381


>gi|303272213|ref|XP_003055468.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463442|gb|EEH60720.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 252

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 77/220 (35%), Positives = 114/220 (51%), Gaps = 12/220 (5%)

Query: 86  QAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVT 145
           QA+ + + A+ DG++L+EI+FP+ GLD+ PGD EG +E   ++  +   C  F       
Sbjct: 1   QAQASLQAALDDGVELLEIQFPSGGLDTAPGDVEGNVENNLTVAHLRGICSQFERNGTAK 60

Query: 146 RTRIFFPEANEVKFARKSVFEG----ASF--KLDYLTKPSFFEDFGFTE----KVKMADR 195
            TR+FFP+  E   A           ASF   +DYL +P F    G  +    +  +A R
Sbjct: 61  TTRVFFPDPIERSLALTGAAPSPDGFASFPGPIDYLEQPDFLSVSGLDKMLGTRKTVAMR 120

Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKE--AVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
           V   D  F+VAYP  NV+E++   EL +   A    A  +++ NGEL+R RS YYPSF+ 
Sbjct: 121 VPESDTAFVVAYPCTNVSELVCTRELREGELARAGPARPIVMCNGELERTRSEYYPSFWN 180

Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
                  +      E +Y++HN+KG N   LFR   GP +
Sbjct: 181 VGEMKPLRGFAREFEGVYFVHNYKGSNPAVLFRAYPGPWQ 220


>gi|308802235|ref|XP_003078431.1| unnamed protein product [Ostreococcus tauri]
 gi|116056883|emb|CAL53172.1| unnamed protein product [Ostreococcus tauri]
          Length = 267

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 72/233 (30%), Positives = 115/233 (49%), Gaps = 22/233 (9%)

Query: 83  LLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPE 142
           ++ Q K A + A+ DG +L+E++FP  GL+   GD EG +E   +   +   CD F    
Sbjct: 1   MVRQCKEAMKRAIVDGTELIELQFPPGGLELASGDLEGNVECNLTTERLRGICDGFRELG 60

Query: 143 KVTRTRIFFPEANEVKFA------------------RKSVFEGASFKLDYLTKPSFFEDF 184
              +TR+ FP+  E + A                   +++F     ++DYL  PSF    
Sbjct: 61  MAEKTRVLFPDPTETRLALTGSSPTPDGIRAPEQSETRAMFGDWVGRVDYLDDPSFMSVS 120

Query: 185 GFTE----KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
           G  +    K  +A+R+  +D  F+VAYP  N++E+    +LY++AV  +   L++ NGE+
Sbjct: 121 GLDKILGTKKSIAERMGADDAAFVVAYPSANISELANTRDLYEDAVRGSGRPLVVCNGEM 180

Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           +R RS YYP F+        +      E +Y I+NFKG N   LFR    P +
Sbjct: 181 ERTRSNYYPPFWNAGEMGPLREFARKFEGVYVIYNFKGSNPAVLFRVYPEPWQ 233


>gi|302852030|ref|XP_002957537.1| hypothetical protein VOLCADRAFT_107711 [Volvox carteri f.
           nagariensis]
 gi|300257179|gb|EFJ41431.1| hypothetical protein VOLCADRAFT_107711 [Volvox carteri f.
           nagariensis]
          Length = 271

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 72/222 (32%), Positives = 104/222 (46%), Gaps = 61/222 (27%)

Query: 70  INVDVPFPSDYSELLDQ-------------AKMAAELAVKDGMKLMEIEFPTAGLDSVPG 116
           +    PFP  Y + + Q             A+ A + A+ DG  L+E+EFP+  L SV G
Sbjct: 43  LQAPAPFPVSYDQAMRQLLPRFPAPLFQHSAQEAVKAALADGAPLVEVEFPSTTLSSVSG 102

Query: 117 DSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFA-----------RKSV- 164
           D EG  EM  SM  + +F   F +  +   TR+FFP+  E+  A           RKS+ 
Sbjct: 103 DGEGQNEMNASMGFLRQFLGAFRS--RAASTRVFFPDNVELAVARSGQTEDPAAGRKSLD 160

Query: 165 --FEGASFKLDYLTKP-SFFEDFGFT----EKVKMADRVKLEDELFLVAYPYFNVNEMLV 217
             F  A F+L YLT+  + +  FGF     + VK+   VK  D++ ++AYP FN      
Sbjct: 161 PKFGDAVFQLGYLTQQNAAWAVFGFYKSGFDPVKL---VKDTDDMLVIAYPSFNP----- 212

Query: 218 VEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAAL 259
                               GELDR+R GYYP+ F+P++A L
Sbjct: 213 -------------------RGELDRVRGGYYPALFFPEIAKL 235


>gi|428164159|gb|EKX33196.1| hypothetical protein GUITHDRAFT_156132 [Guillardia theta CCMP2712]
          Length = 215

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 78/149 (52%), Gaps = 19/149 (12%)

Query: 149 IFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYP 208
           + FP+ +E + A +       F L  L+KP            +  DRV      +LV +P
Sbjct: 1   MVFPDPSEARIAFEEYGSQVPFSLSSLSKPK-----------QQEDRVNK----YLVMHP 45

Query: 209 YFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG----YYPSFFYPKLAALSKTLF 264
            F+V E + ++ELY   V      +IIFNG+L ++RSG    YYP FF+PKLA + +   
Sbjct: 46  VFDVREYIQMDELYMSEVAPKDAAMIIFNGDLFKMRSGGIGGYYPDFFFPKLAQVRRRFM 105

Query: 265 PVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
           P++ET YY+  F+G   G L+R   GP +
Sbjct: 106 PMVETAYYLRVFRGPPVGALYREYPGPWQ 134


>gi|255087178|ref|XP_002505512.1| predicted protein [Micromonas sp. RCC299]
 gi|226520782|gb|ACO66770.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 106/247 (42%), Gaps = 38/247 (15%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPG----DSEGGIEMTGSMRLI 131
            P D S+LL +   + + A+ DG  L+++E P    D V G    DS    E    M ++
Sbjct: 64  LPEDESDLLARIHTSIQAALSDGKVLLDVEVPVQYFDGVVGVGGQDSIAISEFNACMSVL 123

Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVF---------EGASFK-----LDYLTK 177
            +   LF    +    R+FFP+A E   A K            + A+F      +DYL +
Sbjct: 124 RKIVRLFEWLGQAESVRVFFPDAAECSIALKGAGLNPVSGQWEQAATFHDWPGAVDYLLR 183

Query: 178 PSFFED-----FGFTE-------KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEA 225
             F        +G+ +       K  +    ++ D L++V YPY N  EM  V  L++E 
Sbjct: 184 DDFVSQTSRKAYGYADLPDFLAGKRDVEQTAEVADRLYVVGYPYDNTGEMEQVMRLWEE- 242

Query: 226 VFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNF-KGRNGGTL 284
               A  +++FNG LD +R+ + P   + K   L     P   T +Y+H F  G   G L
Sbjct: 243 ---HARPILVFNGNLDGVRTSFAP---FGKAKKLKHEFVPKFTTAFYVHKFAAGAAPGLL 296

Query: 285 FRFLEGP 291
           +R    P
Sbjct: 297 YRQYPSP 303


>gi|449018586|dbj|BAM81988.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 247

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 95/223 (42%), Gaps = 25/223 (11%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMK---LMEIEFPTAGLDSVPGDSEGGIEMTGSMR-LI 131
            P D + L  Q + A   A +   +   L E+ FP A  D+    S      T   R +I
Sbjct: 10  LPKDTASLHRQVQNALSKATETKTRSPALYEVSFP-AVRDTTAALSRILDANTSHAREII 68

Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
             F   F       R  + FP+  E K A K    G+S     L+    +E   F ++V+
Sbjct: 69  KPFAASFRK-----RLHLVFPDVAEAKIAEKVY--GSSEHTFTLSALPLYERPAFLQQVE 121

Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSF 251
                     L  V  P FN++E L +E   + A+      +++ NG +DR+RS YYP  
Sbjct: 122 AP-------ALVFVVQPGFNIDEWLQLE---RPALLYPDASIVVLNGNMDRLRSNYYPPL 171

Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQES 294
           FYP+L AL K      E IYY+   K    G LFR    P ++
Sbjct: 172 FYPRLTALRKRYLEQFEPIYYL---KPLPNGLLFRVFPEPWQT 211


>gi|298715350|emb|CBJ27978.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 314

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 123/285 (43%), Gaps = 51/285 (17%)

Query: 5   SNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVSS 64
           +++A+A   +P P+P  PR       +  +  +      R     P R++  CLT     
Sbjct: 25  ASTALAFVALPSPLPRSPR-------YHQRLYDAAAPRPRREKPRPQRQQVQCLTK---- 73

Query: 65  DGNNSINVDVPFPSD-YSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGG 121
                    +P   D Y+ +  Q   A + A+  G+KL+E+EFP     LD   G++   
Sbjct: 74  ---------IPSGKDPYAAVKKQTAEATQDAINAGIKLIELEFPPVRGKLDISLGET--- 121

Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
             +  +     E    F +        + FP+  E + A ++ + G +F++         
Sbjct: 122 --LDANRSFARELARSF-SARMGKALWLVFPDDAEAELA-QNTYGGTTFRV--------- 168

Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
              G    +K  D    E ++ +V  P F+VNE +V++ L +  V      +++ NG LD
Sbjct: 169 --VGINSAIK--DLKDEECQMQIVVNPGFDVNEWIVLDSLVRPDV-----PMVMLNGNLD 219

Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
           ++R GYYP  F+P L    +      ET+YY+   K   GG +FR
Sbjct: 220 KLRGGYYPRIFFPGLYNAKERFLKKFETVYYL---KALPGGWIFR 261


>gi|452824537|gb|EME31539.1| hypothetical protein Gasu_12130 [Galdieria sulphuraria]
          Length = 273

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 91/221 (41%), Gaps = 38/221 (17%)

Query: 74  VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP------TAGLDSVPGDSEGGIEMTGS 127
           +  P    +L+   + + + A+ DG+KL+E++FP      +A L+ V         M  +
Sbjct: 46  IRLPESNVQLVQDIQESCKSAICDGLKLLEVQFPPLKNIGSAALNQV---------MDAN 96

Query: 128 MRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFT 187
                     F        T + FP+  E K AR+          D+ T  S F      
Sbjct: 97  RTFAKSVVQRFPHVSGNGTTFVVFPDDAESKLARED--------RDFRTLDSVF------ 142

Query: 188 EKVKMADRVKLED-ELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
               +   + L+D  L ++  P F V E   VE      V      +I+FN +LD++R G
Sbjct: 143 -ITSLQRDIDLQDASLVVILNPGFQVQEWFEVERFCNYQV-----PVILFNADLDKLRGG 196

Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRF 287
           YYP F YPKL A         E +YY+  F   NG  + R+
Sbjct: 197 YYPRFLYPKLYATKDKCLTKFEPVYYVRFFV--NGALIRRY 235


>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 249

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P+  SE ++QAK AA  A+ DG KL+++E  FP              IE+  +  +  +
Sbjct: 4   LPTTLSEAIEQAKQAATAALDDGYKLIQVELVFPE-------------IELQ-AQSIASQ 49

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
           F      P+  T  ++FFP+A     AR+          D+   P    D G T +  + 
Sbjct: 50  FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 96

Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
            R++ +D  FLV  P    VN+   VE L+K A   +   +++ N  L+    I  GY  
Sbjct: 97  TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 150

Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
                    L +    ++E+ YY+   K  +G  LFR   G  E
Sbjct: 151 R-------QLRERFLNIIESCYYL---KPLDGAALFRCYPGTWE 184


>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
 gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
          Length = 262

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P+  SE ++QAK AA  A+ DG KL+++E  FP              IE+  +  +  +
Sbjct: 17  LPTTLSEAIEQAKQAATAALDDGYKLIQVELVFPE-------------IELQ-AQSIASQ 62

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
           F      P+  T  ++FFP+A     AR+          D+   P    D G T +  + 
Sbjct: 63  FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 109

Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
            R++ +D  FLV  P    VN+   VE L+K A   +   +++ N  L+    I  GY  
Sbjct: 110 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 163

Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
                    L +    ++E+ YY+   K  +G  LFR   G  E
Sbjct: 164 R-------QLRERFLNIIESCYYL---KPLDGAALFRCYPGTWE 197


>gi|224013206|ref|XP_002295255.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969217|gb|EED87559.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 391

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNG-GTLFRFLEGP 291
           +++ NG LD++R G+YP+ F+PKLAA     +   E+++Y+  F  +   G L+R    P
Sbjct: 286 MVVINGALDKVRGGFYPAIFFPKLAATVDRFWKRFESVFYLKPFSDKGVYGWLYRVYPEP 345

Query: 292 QE 293
            +
Sbjct: 346 WQ 347


>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 249

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P+  SE ++QAK AA  A++DG KL+++E  FP              IE+  +  +  +
Sbjct: 4   LPTTLSEAIEQAKQAATAALEDGYKLIQVELVFPE-------------IELQ-AQSIASQ 49

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
           F      P+  T  ++FFP+A     AR+          D+   P    D G T +  + 
Sbjct: 50  FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 96

Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
            R++ +D  FLV  P    VN+   VE L+K A   +   +++ N  L+    I  GY  
Sbjct: 97  TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 150

Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
                    L +     +E+ YY+   K  +G  LFR   G  E
Sbjct: 151 R-------QLRERFLNTIESCYYL---KPLDGAALFRCYPGTWE 184


>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
           Paraca]
 gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
           Paraca]
          Length = 262

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P+  SE ++QAK AA  A++DG KL+++E  FP              IE+  +  +  +
Sbjct: 17  LPTTLSEAIEQAKQAATAALEDGYKLIQVELVFPE-------------IELQ-AQSIASQ 62

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
           F      P+  T  ++FFP+A     AR+          D+   P    D G T +  + 
Sbjct: 63  FIPALEKPD--TLLKVFFPDAGAAALARR----------DWGETPFRVTDIG-TSRSPVE 109

Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
            R++ +D  FLV  P    VN+   VE L+K A   +   +++ N  L+    I  GY  
Sbjct: 110 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYTA 163

Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
                    L +     +E+ YY+   K  +G  LFR   G  E
Sbjct: 164 R-------QLRERFLNTIESCYYL---KPLDGAALFRCYPGTWE 197


>gi|397566319|gb|EJK45002.1| hypothetical protein THAOC_36416 [Thalassiosira oceanica]
          Length = 370

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 59/258 (22%), Positives = 102/258 (39%), Gaps = 53/258 (20%)

Query: 83  LLDQAKMAAELAVKDGMKLMEIEFP----TAGLDSVPGDSEGGIEMTGSMRLICEFCDLF 138
           L   AK+A + A+ DG+  +E+EFP     A   S   D +   E+  +     +   +F
Sbjct: 75  LRKTAKLAIDSAIADGVSKIEVEFPPLLGGARSKSQFDDFDNVQELDSNKEWTMQLAPMF 134

Query: 139 VTPE--KVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDF------------ 184
              +  K  RT + FP+  E + A+K  F G  ++    T      +F            
Sbjct: 135 AGDKTYKDGRTWLVFPDLKECELAKKD-FPGQRYQEATFTTIEAVTNFMSSSGSPGSSEE 193

Query: 185 -----------GFTEKV--KMADRVKLEDE-------------LFLVAYPYFN--VNEML 216
                      G +  +  K  D   L D+             L+LV  P     V + +
Sbjct: 194 YAAPWGASLMSGLSSMMGGKDGDAGLLGDQSSLDSLNVDSPANLWLVVQPGNGGPVEDWV 253

Query: 217 VVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNF 276
             E+++  ++      +++ NG LD++R G+Y   F+P LAA  +  +   ET  Y+  F
Sbjct: 254 NCEKMHSPSI-----PMVVVNGALDKVRGGFYAPIFFPALAATVERFWKKFETGLYLKPF 308

Query: 277 KGRNG-GTLFRFLEGPQE 293
             +   G L+R    P +
Sbjct: 309 SDKGVYGWLWRVYPEPWQ 326


>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
           [Oscillatoria nigro-viridis PCC 7112]
 gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
           [Oscillatoria nigro-viridis PCC 7112]
          Length = 248

 Score = 44.3 bits (103), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 50/223 (22%), Positives = 92/223 (41%), Gaps = 47/223 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P D +E + Q+++A   A+ DG  L+++E  FP   L +           + +++ + E
Sbjct: 4   LPKDLNEAIAQSRIATAAALSDGKTLLQVELVFPEIALQA----------QSITLQFLPE 53

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
           F +++         ++FFP+      AR+          D+   P    D G + +  + 
Sbjct: 54  FEEIY------PGVKVFFPDTGAAALARR----------DWGETPFKVTDLG-SSRTPVE 96

Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
           D++  ED+LFL+  P     E+  VE++Y  A       +I+ N  L+ + +   GY   
Sbjct: 97  DKIAPEDQLFLLINP--AAVEVAQVEKIYIAAAGR---PVILLNPRLEDVATIGIGYAGR 151

Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
                   L       +E+ YYI      +   LFR    P +
Sbjct: 152 -------QLRDRFLNKIESCYYIRPL---DTAALFRCYPQPWQ 184


>gi|219125569|ref|XP_002183049.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405324|gb|EEC45267.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 356

 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 88/206 (42%), Gaps = 43/206 (20%)

Query: 87  AKMAAELAVKDGMKLMEIEFP--TAGLDSVPG-DSEGGIEMTGSMRLICEFCDLFVTPEK 143
           A  A + A++DG K +EI+FP  T G  S    D    ++   + R  C    + + P  
Sbjct: 76  AAPALQKALRDGWKQLEIDFPPLTGGDQSKTQFDDFDNVQELNANRDWC----VQLAPAI 131

Query: 144 VTRTR-IFF--PEANEVKFARKSVFEGASFK------------LDYLTKPSFFEDFGFTE 188
            ++ R ++F  P+  E + A++  + G  F+            L    +  + + +G T 
Sbjct: 132 ASKNREVWFILPDDKECELAKEE-WTGQRFRQAAKFTSVRAAVLKTSGESQYSKAWGSTI 190

Query: 189 KVKM----------ADRVKLED-----ELFLVAYPYFN--VNEMLVVEELYKEAVFNTAW 231
              M          AD   L+D        LV  P     V + + VE L+K    + + 
Sbjct: 191 ASTMNKLTGGDGILADSSTLDDLGSGDRFHLVCQPGNGGPVEDWINVERLHKA---DPSQ 247

Query: 232 KLIIFNGELDRIRSGYYPSFFYPKLA 257
              + NG LD++R GYYP+ F+P LA
Sbjct: 248 PTCVVNGALDKVRDGYYPAVFFPALA 273


>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
           vaginatus FGP-2]
 gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
           vaginatus FGP-2]
          Length = 248

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 91/216 (42%), Gaps = 47/216 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P D +E + Q+++A   A+ DG  L+++E  FP   L +           + + + + E
Sbjct: 4   LPKDLNEAIAQSRIATAAALSDGKTLLQVELVFPEIALQA----------QSITEQFLPE 53

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
             +++         ++FFP+A     AR+          D+   P    D G + +  + 
Sbjct: 54  LEEIY------PGVKVFFPDAGAAALARR----------DWGETPFKVTDLG-SSRSPVE 96

Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
           D++  ED+LFL+  P     E+  VE LY  A       +I+ N  L+ + +   GY   
Sbjct: 97  DKIAPEDQLFLLINP--AAVEVAQVERLYIAAAGR---PVILLNPRLEDVATIGIGYAGR 151

Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
               +   LSK     +E+ YY+      +   LFR
Sbjct: 152 QLRDRF--LSK-----IESCYYVRPL---DAAALFR 177


>gi|402308913|ref|ZP_10827915.1| deoxyribose-phosphate aldolase [Prevotella sp. MSX73]
 gi|400374492|gb|EJP27410.1| deoxyribose-phosphate aldolase [Prevotella sp. MSX73]
          Length = 298

 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 57/129 (44%), Gaps = 11/129 (8%)

Query: 44  RIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMA-AELAVKDGMKLM 102
            + T   Y      ++ S+  DG   +NV   FPS  S+   + K+A A LAVKDG   +
Sbjct: 86  HVATICTYPNFAKLVSESLEVDGVQVVNVSGSFPS--SQTFIEVKVAEASLAVKDGATEI 143

Query: 103 EIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR- 161
           +I  P     S  GD EG  +  G  +  C  C     P KV       P  ++VK A  
Sbjct: 144 DIVMPVGKYLS--GDYEGVADEIGEQKQACGEC-----PMKVILETGCLPSMSDVKKASI 196

Query: 162 KSVFEGASF 170
            +++ GA +
Sbjct: 197 IAMYAGADY 205


>gi|288925941|ref|ZP_06419871.1| deoxyribose-phosphate aldolase [Prevotella buccae D17]
 gi|315606905|ref|ZP_07881912.1| deoxyribose-phosphate aldolase [Prevotella buccae ATCC 33574]
 gi|288337365|gb|EFC75721.1| deoxyribose-phosphate aldolase [Prevotella buccae D17]
 gi|315251413|gb|EFU31395.1| deoxyribose-phosphate aldolase [Prevotella buccae ATCC 33574]
          Length = 298

 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 57/129 (44%), Gaps = 11/129 (8%)

Query: 44  RIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMA-AELAVKDGMKLM 102
            + T   Y      ++ S+  DG   +NV   FPS  S+   + K+A A LAVKDG   +
Sbjct: 86  HVATICTYPNFAKLVSESLEVDGVQVVNVSGSFPS--SQTFIEVKVAEASLAVKDGATEI 143

Query: 103 EIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR- 161
           +I  P     S  GD EG  +  G  +  C  C     P KV       P  ++VK A  
Sbjct: 144 DIVMPVGKYLS--GDYEGVADEIGEQKQACGEC-----PMKVILETGCLPSMSDVKKASI 196

Query: 162 KSVFEGASF 170
            +++ GA +
Sbjct: 197 IAMYAGADY 205


>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
 gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 253

 Score = 41.6 bits (96), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 93/218 (42%), Gaps = 37/218 (16%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
            P+D +E + Q   A + A++DG   ++IE        VP      IE+  +  L  +F 
Sbjct: 4   LPNDINEAIVQGMEATKAALQDGYTRVQIEI------VVP-----DIELQ-AQSLAKQFI 51

Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
              +  E  T+ ++FFP++     AR+  ++ A+FK+         ED G T +  +  +
Sbjct: 52  PALL--ETSTKLKVFFPDSGAAALARRD-WQDATFKI---------EDLG-TSRSPVDKK 98

Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
           V+ ED+ FL+  P  +  E+   E+L   A       LI    ++  +  GY        
Sbjct: 99  VEPEDQCFLLIAP--SAIEVAQTEKLSNLAGDRPVIMLIPKLEDVSIVGIGYAAR----- 151

Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
              L +     +E+ YYI +     G  L+R    P +
Sbjct: 152 --QLRERFIKTIESCYYIRSL---GGAALYRCYPSPWQ 184


>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
 gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
          Length = 249

 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 83/208 (39%), Gaps = 44/208 (21%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
            P    E ++QAK A + A+ DG KL+++E  FP              IE+  +  +  +
Sbjct: 4   LPKSIEEAVEQAKQATQAALDDGYKLVQVELVFPE-------------IELQ-AQAIAQQ 49

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
           F  +    E  T  ++FFP+A     AR+          D+   P    D G + +  + 
Sbjct: 50  F--IPAIEESGTVLKVFFPDAGAAALARR----------DWGEIPFKISDLG-SSRSPID 96

Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIR---SGYYPS 250
            RVK +D  FLV  P       + VE++ K +        I+ N  L+ I     GY   
Sbjct: 97  SRVKDDDGRFLVVSPT-----PVEVEQVEKLSQLAGDRVTILLNPRLEDIAIIGIGYAAR 151

Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKG 278
                  AL       +E+ YY+   +G
Sbjct: 152 -------ALRDRFISTIESCYYLRPLEG 172


>gi|219113845|ref|XP_002186506.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583356|gb|ACI65976.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 379

 Score = 40.8 bits (94), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 5/92 (5%)

Query: 74  VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMT-GSMRLIC 132
           +P PS + EL    + A  LA KDG KL+E+EFP      +  D     ++   +++L  
Sbjct: 67  LPPPSSFFELQQDCQRAVRLARKDGHKLLEVEFPPLPAAVLEMDDVSAYDVVQANLKLAL 126

Query: 133 EFCDLFVTPEK----VTRTRIFFPEANEVKFA 160
           +F    +  E+    + +  + FP+  E  FA
Sbjct: 127 DFSKGLLAGERDGSSLKKIALLFPDQAEADFA 158


>gi|384251129|gb|EIE24607.1| hypothetical protein COCSUDRAFT_14109 [Coccomyxa subellipsoidea
           C-169]
          Length = 394

 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 85/193 (44%), Gaps = 22/193 (11%)

Query: 77  PSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGDSEGGIEMTGSMRLICEF 134
           PS + EL++ A  +   A+ DG+  +E+EFP     +D   G S+  I+   +++L    
Sbjct: 92  PSSFQELVNDATASVRAAIGDGLTRLEVEFPALPGNIDGYKGASDWFID--SNIQLAIAA 149

Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGA-----SFKLDYLTKPS--FFEDFGFT 187
             + V  E   R  I  P+  E   + K +F+GA        + +L + S   F  F F 
Sbjct: 150 SRILVK-ESGKRVHILVPDGGEYNRSYK-MFKGALDLADGISMGHLKENSKGVFSSFNFF 207

Query: 188 EKVKMADRVKLED-----ELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR 242
             V  AD   L       ++F+V     +  E+  +E   +E V      L+++N E+D 
Sbjct: 208 GSVPDADAETLSQAARKADVFIVVNA--STIELPDLERYIEEIVGERP--LVLWNLEVDT 263

Query: 243 IRSGYYPSFFYPK 255
           +R+      F PK
Sbjct: 264 LRADLGLLGFPPK 276


>gi|255526474|ref|ZP_05393385.1| UvrD/REP helicase [Clostridium carboxidivorans P7]
 gi|296184847|ref|ZP_06853258.1| putative ATP-dependent DNA helicase PcrA [Clostridium
           carboxidivorans P7]
 gi|255509856|gb|EET86185.1| UvrD/REP helicase [Clostridium carboxidivorans P7]
 gi|296050629|gb|EFG90052.1| putative ATP-dependent DNA helicase PcrA [Clostridium
           carboxidivorans P7]
          Length = 754

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 37  NENFSGQRII---TFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAEL 93
           N+N + +  I   +FS Y  + + +TNSV+S  N S+N  +P     S L    + A  L
Sbjct: 640 NDNMTARNTIKSKSFSTYSNRSTTITNSVTSHMNRSVNNSMPSSFGESSLNKNKENANSL 699

Query: 94  AVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
            ++D    ++++    G+ ++ G S+ G
Sbjct: 700 KIEDIKAGLKVKHDKFGIGTIVGVSKSG 727


>gi|299469765|emb|CBN76619.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 322

 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 3/87 (3%)

Query: 75  PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEM-TGSMRLICE 133
           P PS + + + QA+ A E A +DG  L+E+EFP    D +        ++ + ++RL   
Sbjct: 21  PAPSTFEQCIRQAQGAVEDAFEDGFNLVEVEFPPLQQDYLEDSGSSAYDVSSANVRLASR 80

Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFA 160
           F   F    K     I  P+  E+  A
Sbjct: 81  FAQSFAAEGK--EVSILLPDEAELDQA 105


>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
 gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
          Length = 243

 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 34/149 (22%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
            P    E + QA++A + A+ DG+  +++E     L+ +P                    
Sbjct: 4   LPKTLGEAVSQARIATQNAIADGLNRLQVEILLPELNPMP------------------VA 45

Query: 136 DLFVTPEKVTR---TRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
           + F++ E+       +IFFP+A     AR+   E     LD  TK           +V +
Sbjct: 46  ERFLSDEEGISHNFNKIFFPDAGAAALARRDWGEVPFELLDISTK-----------RVSV 94

Query: 193 ADRVKLEDELFLVAYPYFNVNEMLVVEEL 221
            ++++ EDE  L   P     E+L +E+L
Sbjct: 95  EEQIQPEDEAILCIAP--TAQEVLQIEKL 121


>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
 gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
          Length = 244

 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 31/133 (23%), Positives = 57/133 (42%), Gaps = 26/133 (19%)

Query: 76  FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
            P+   + + Q+++A + A+ DG   ++++F    L  +P              +  +F 
Sbjct: 4   LPNSLEQAIAQSRIATQAALADGYTRLQVDFLFPELKLMP--------------VAEQFL 49

Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
            LF   E  +R +IFFP+A     A +  + G  FK+          D G      +  +
Sbjct: 50  SLFT--EYDSRLKIFFPDAGGAALANRD-WAGTPFKI---------LDIGTGRVASIQSK 97

Query: 196 VKLEDELFLVAYP 208
           ++ EDE+FL   P
Sbjct: 98  IQPEDEIFLFIAP 110


>gi|218189920|gb|EEC72347.1| hypothetical protein OsI_05588 [Oryza sativa Indica Group]
          Length = 377

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 62/254 (24%), Positives = 107/254 (42%), Gaps = 42/254 (16%)

Query: 71  NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSM 128
            V V  P  Y  L+  A  +   A+ +G   +EIEFP   + + S  G S+  I+    +
Sbjct: 65  GVSVYKPRSYDVLVSDAARSLACAMDEGKTRLEIEFPPLPSNISSYKGSSDEFIDANIQL 124

Query: 129 RLICEFCDLFVTPEKVTRTRIFFPEANEVKFARK-------SVFEGASFKLDYL-TKP-- 178
            L        +   K TR+ I FP+  E + A +       S+       LD + T P  
Sbjct: 125 ALAVA---RKLKELKGTRSCIVFPDLPEKRRASQLFGTALDSIETATISSLDEVSTGPVN 181

Query: 179 SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
           +FF       DF F + V+  DR K ++   L  +   +  ++  +E+  ++  F ++  
Sbjct: 182 TFFRSMRDTLDFDFADDVE--DRWKSDEPPSLYIFINCSTRDLSTIEKYVEQ--FASSVP 237

Query: 233 LIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGR 279
            ++FN ELD +RS     G+ P   + +  +    +F + +  Y        YI N+   
Sbjct: 238 ALLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNY--- 294

Query: 280 NGGTLFRFLEGPQE 293
             G +FR   GP +
Sbjct: 295 -SGAVFRQYPGPWQ 307


>gi|115443809|ref|NP_001045684.1| Os02g0117100 [Oryza sativa Japonica Group]
 gi|41052833|dbj|BAD07724.1| unknown protein [Oryza sativa Japonica Group]
 gi|113535215|dbj|BAF07598.1| Os02g0117100 [Oryza sativa Japonica Group]
 gi|125580571|gb|EAZ21502.1| hypothetical protein OsJ_05126 [Oryza sativa Japonica Group]
          Length = 377

 Score = 37.7 bits (86), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 62/254 (24%), Positives = 107/254 (42%), Gaps = 42/254 (16%)

Query: 71  NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSM 128
            V V  P  Y  L+  A  +   A+ +G   +EIEFP   + + S  G S+  I+    +
Sbjct: 65  GVSVYKPRSYDVLVSDAARSLACAMDEGKTRLEIEFPPLPSNISSYKGSSDEFIDANIQL 124

Query: 129 RLICEFCDLFVTPEKVTRTRIFFPEANEVKFARK-------SVFEGASFKLDYL-TKP-- 178
            L        +   K TR+ I FP+  E + A +       S+       LD + T P  
Sbjct: 125 ALAVA---RKLKELKGTRSCIVFPDLPEKRRASQLFGTALDSIETATISSLDEVSTGPVN 181

Query: 179 SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
           +FF       DF F + V+  DR K ++   L  +   +  ++  +E+  ++  F ++  
Sbjct: 182 TFFRSMRDTLDFDFADDVE--DRWKSDEPPSLYIFINCSTRDLSTIEKYVEQ--FASSVP 237

Query: 233 LIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGR 279
            ++FN ELD +RS     G+ P   + +  +    +F + +  Y        YI N+   
Sbjct: 238 ALLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNYS-- 295

Query: 280 NGGTLFRFLEGPQE 293
             G +FR   GP +
Sbjct: 296 --GAVFRQYPGPWQ 307


>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
          Length = 325

 Score = 37.0 bits (84), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 48/196 (24%), Positives = 87/196 (44%), Gaps = 35/196 (17%)

Query: 74  VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE-MTGSMRLIC 132
            P P  +   ++QA ++A+ A++DG KL+EIEFP     ++  ++ G    +   ++   
Sbjct: 16  TPPPKSFRMCVEQAYLSAKQAIEDGHKLIEIEFPPLPQSAMDNEAIGADTILKAQIQHST 75

Query: 133 EFCDLF-------VTPEKVTRTRIFFPEAN--------EVKF-ARKSVFEGASFKLDYLT 176
           +F  LF       V  + V R R    E +         ++F A K  F+G+  +  ++ 
Sbjct: 76  DFAKLFKNKKTAIVFADIVERNRFIDDETSSNPQSWRGNIRFTALKGGFKGSLIERVWIN 135

Query: 177 KPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIF 236
           K     DF           V+ +D++F++     +  E+  V EL K A       +I+F
Sbjct: 136 K-----DF--------VSEVQEDDDMFIIIGA--SAQELPDVRELCKAAGDRP---VILF 177

Query: 237 NGELDRIRSGYYPSFF 252
           N +L  +R  +   FF
Sbjct: 178 NLKLQVLRGDFGLPFF 193


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.138    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,695,826,378
Number of Sequences: 23463169
Number of extensions: 193834205
Number of successful extensions: 413903
Number of sequences better than 100.0: 85
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 43
Number of HSP's that attempted gapping in prelim test: 413800
Number of HSP's gapped (non-prelim): 91
length of query: 295
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 154
effective length of database: 9,050,888,538
effective search space: 1393836834852
effective search space used: 1393836834852
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)