BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022528
(295 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225427403|ref|XP_002263777.1| PREDICTED: uncharacterized protein LOC100265501 [Vitis vinifera]
gi|296088391|emb|CBI37382.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 204/290 (70%), Positives = 229/290 (78%), Gaps = 9/290 (3%)
Query: 9 IALPTVPI------PVPSIPRRQA-NCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNS 61
+ L T+PI P+PS+ + +CR F +K + +F G +I F R NS
Sbjct: 1 MTLSTIPIASRISIPIPSLQNPKVLSCRSFQVK-KDGSFCGPKIAAFK-MSRNLEFKANS 58
Query: 62 VSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
VS D + S+ +VPFPSDYSE+L+QAK A ELA+KD +LMEIEFPTAGL+SVPGD EGG
Sbjct: 59 VSGDSSASVGFNVPFPSDYSEILEQAKEATELALKDKKQLMEIEFPTAGLESVPGDGEGG 118
Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
IEMTGSM+LI EFCD+F+ PEK TRTRIFFPEANEVKFAR+S F GASFKLDYLTKPS F
Sbjct: 119 IEMTGSMQLIREFCDIFINPEKATRTRIFFPEANEVKFARQSAFGGASFKLDYLTKPSLF 178
Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
EDFGF KVKMADRVK EDELFLVAYPYFNVNEMLVVEELY EAV NTA KLIIFNGELD
Sbjct: 179 EDFGFVTKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYNEAVVNTARKLIIFNGELD 238
Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
RIRSGYYP FFYPKLAAL+K+L P MET+YYIHNFKGR GGTLFR GP
Sbjct: 239 RIRSGYYPPFFYPKLAALTKSLLPKMETVYYIHNFKGRKGGTLFRCYPGP 288
>gi|356496430|ref|XP_003517071.1| PREDICTED: uncharacterized protein LOC100805878 [Glycine max]
Length = 324
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/288 (69%), Positives = 226/288 (78%), Gaps = 4/288 (1%)
Query: 6 NSAIALPTVPIPVPSIPRRQ-ANCRKFSIKCSNE-NFSGQRIITFSPYRRKHSCLTNSVS 63
+S + L PI PS+P A FS+K G + +P RK + T SVS
Sbjct: 4 SSTMILSNSPIASPSLPTSTGAKLETFSLKNDGVIRIRGATASSVAPRIRKTA--TCSVS 61
Query: 64 SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
DGN S+ DVPFP+DYSELL+QA++AA+LA+KD +LMEIEFPTAGL SVPGD EGGIE
Sbjct: 62 KDGNASVETDVPFPADYSELLEQARVAADLAIKDNRQLMEIEFPTAGLGSVPGDGEGGIE 121
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MT SM+LI EFCD F++ EK TRTRIFFPEA+EV FAR+SVF G SFKLDYLTKPSFFED
Sbjct: 122 MTESMQLIREFCDRFISSEKATRTRIFFPEASEVDFARQSVFSGCSFKLDYLTKPSFFED 181
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGF EK+KM+DRVK DELFLV YPYFNVNE+LVVEELYKEAV NT KLIIFNGELDRI
Sbjct: 182 FGFVEKIKMSDRVKTGDELFLVGYPYFNVNEILVVEELYKEAVLNTERKLIIFNGELDRI 241
Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
RSGYYPSFFYPKLAAL+KT P+MET+YYIHNFKGRNGGTLFR GP
Sbjct: 242 RSGYYPSFFYPKLAALTKTFLPMMETVYYIHNFKGRNGGTLFRCYPGP 289
>gi|224071439|ref|XP_002303460.1| predicted protein [Populus trichocarpa]
gi|222840892|gb|EEE78439.1| predicted protein [Populus trichocarpa]
Length = 260
Score = 382 bits (981), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/227 (82%), Positives = 201/227 (88%)
Query: 67 NNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTG 126
++S+ DVPFP DY ELLDQAK A ELA +D +LMEIEFPTAGL+SVPGD EGGIEMTG
Sbjct: 2 SSSVEFDVPFPRDYEELLDQAKKATELAWEDNKQLMEIEFPTAGLESVPGDGEGGIEMTG 61
Query: 127 SMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGF 186
SM+LI EFCD FV+PEK TRTRIFFPEANEVKFAR+S FEG+S KLDYLTKPSFFEDFGF
Sbjct: 62 SMQLIREFCDRFVSPEKTTRTRIFFPEANEVKFARQSAFEGSSLKLDYLTKPSFFEDFGF 121
Query: 187 TEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
EKVKM DRVK EDELFLVAYPYFNVNEMLVVEELYKEAV TA KLIIFNGELDRIRSG
Sbjct: 122 VEKVKMTDRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVETARKLIIFNGELDRIRSG 181
Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
YYPSFFYPKLA+L KTLFP+MET+YYIHNFKGRNGGTLFR GP +
Sbjct: 182 YYPSFFYPKLASLLKTLFPLMETVYYIHNFKGRNGGTLFRCYPGPWQ 228
>gi|255557645|ref|XP_002519852.1| conserved hypothetical protein [Ricinus communis]
gi|223540898|gb|EEF42456.1| conserved hypothetical protein [Ricinus communis]
Length = 316
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/231 (78%), Positives = 199/231 (86%), Gaps = 1/231 (0%)
Query: 62 VSSDGNNS-INVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
VS +G++S + DVP P DY ELL QAK A +LA+KDG +LMEIEFPTAGL+SVPGD EG
Sbjct: 52 VSRNGSSSSVESDVPLPRDYEELLVQAKKATDLALKDGKQLMEIEFPTAGLESVPGDGEG 111
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
GIEMT SM+LI +FCD FV+PEK RTR+FFPEANEVKFAR+S F G+S KLDYLTKPSF
Sbjct: 112 GIEMTESMQLIRQFCDRFVSPEKAARTRVFFPEANEVKFARESAFGGSSLKLDYLTKPSF 171
Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
FEDFGF EK+KM DRVK EDELFLVAYPYFNVNEMLVVEELY EAV NT K+IIFNGEL
Sbjct: 172 FEDFGFVEKIKMTDRVKPEDELFLVAYPYFNVNEMLVVEELYNEAVVNTTRKMIIFNGEL 231
Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
DRIRSGYYPSFFYPKLA+L KTLFPVMET+YYIHNFKGR GGTLFR GP
Sbjct: 232 DRIRSGYYPSFFYPKLASLLKTLFPVMETVYYIHNFKGRKGGTLFRCYPGP 282
>gi|449456759|ref|XP_004146116.1| PREDICTED: uncharacterized protein LOC101209709 [Cucumis sativus]
gi|449509516|ref|XP_004163611.1| PREDICTED: uncharacterized LOC101209709 [Cucumis sativus]
Length = 336
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/301 (64%), Positives = 221/301 (73%), Gaps = 20/301 (6%)
Query: 4 SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSC---LTN 60
+S+S IA +P+P P + C + S + ++ + + F P R L+N
Sbjct: 9 ASSSTIATAVLPLPSPKLA-----CFRISHRRTHRSSVSSSMFEFMPRRHLRVLPPNLSN 63
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
SS N S ++DVPFP DYS+LL+QAK A E A+ D +LMEIEFPTAGL+SVPGD EG
Sbjct: 64 RQSS--NASTDLDVPFPRDYSDLLNQAKKATEAALIDNKQLMEIEFPTAGLESVPGDGEG 121
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRI----------FFPEANEVKFARKSVFEGASF 170
GIEMT SM+LI +FCD F+ P K TRTR+ FFPEANEVKFAR + FEG SF
Sbjct: 122 GIEMTESMQLIRQFCDCFIDPLKATRTRVTVSIKENHIQFFPEANEVKFARNTAFEGVSF 181
Query: 171 KLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTA 230
KLDYLTKPSFFEDFGF EKVKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT
Sbjct: 182 KLDYLTKPSFFEDFGFVEKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVQNTT 241
Query: 231 WKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEG 290
KLIIFNGELDRIRSGYYP FFYPKLAAL KTLFP MET+YYIHNFKG+ GG LFR G
Sbjct: 242 RKLIIFNGELDRIRSGYYPPFFYPKLAALMKTLFPEMETVYYIHNFKGQKGGVLFRSYPG 301
Query: 291 P 291
P
Sbjct: 302 P 302
>gi|109289908|gb|AAP45177.2| hypothetical protein SBB1_14t00013 [Solanum bulbocastanum]
Length = 338
Score = 360 bits (923), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 187/259 (72%), Positives = 199/259 (76%), Gaps = 29/259 (11%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQ----------------------------AKMAAE 92
S S D SI DVPFP DY+ELL Q AK A E
Sbjct: 47 SCSGDRAASIGFDVPFPKDYTELLQQVFILFAFSPLKIGGRGSGNGGGITREIKAKEATE 106
Query: 93 LAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFP 152
LA+KD +LMEIEFPTAGL SVPGD EGGIEMTGS++LI EFCDL V PEK T+TRIFFP
Sbjct: 107 LALKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTGSIQLIREFCDLLVIPEKATKTRIFFP 166
Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNV 212
EANEVKFAR+S+F GASFKLDYLTKPSFFEDFGFTEKVKMADRVK EDELF+VAYPYFNV
Sbjct: 167 EANEVKFARQSIFGGASFKLDYLTKPSFFEDFGFTEKVKMADRVKPEDELFIVAYPYFNV 226
Query: 213 NEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY 272
NEMLVVEELY+ AV NT+ KLIIFNGELDRIRS YP FFYPKLAALSKTLFP MET+YY
Sbjct: 227 NEMLVVEELYQAAVLNTSRKLIIFNGELDRIRSD-YPPFFYPKLAALSKTLFPKMETVYY 285
Query: 273 IHNFKGRNGGTLFRFLEGP 291
IHNFKGRNGG LFR GP
Sbjct: 286 IHNFKGRNGGVLFRCYPGP 304
>gi|357146418|ref|XP_003573985.1| PREDICTED: uncharacterized protein LOC100843789 [Brachypodium
distachyon]
Length = 322
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 191/292 (65%), Positives = 211/292 (72%), Gaps = 5/292 (1%)
Query: 1 MPLSSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRK-HSCLT 59
M ++++ +I+ PT S+P +Q F I S+ G + R H C
Sbjct: 1 MAMATSYSISNPTF-TSKSSLPNKQVPNWIFPIISSDNGSGGMFTLARRSLRAGFHVC-- 57
Query: 60 NSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
+V+ D N FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL SVPGD E
Sbjct: 58 -AVTGDQNTRNVFSANFPSDYTELLLQAKDAAESAFKDGKQLLEIEFPTAGLQSVPGDGE 116
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
GGIEMTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS
Sbjct: 117 GGIEMTGSMLLIREFCDRFVPAEKTTRTRIFFPEANEVTFARQSAFEGCSLKLDYLTKPS 176
Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
FEDFGFT KVKMADRV+ EDE+FLVAYPYFNVNEMLVVEELYKEAV NT K+IIFNGE
Sbjct: 177 LFEDFGFTTKVKMADRVQPEDEIFLVAYPYFNVNEMLVVEELYKEAVVNTDRKMIIFNGE 236
Query: 240 LDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
LDRIRSGYYP FFYPKLA LSKT P MET+YYIHNFKG GG LFR GP
Sbjct: 237 LDRIRSGYYPPFFYPKLAELSKTFLPKMETVYYIHNFKGSKGGALFRCYPGP 288
>gi|113208412|gb|ABI34553.1| hypothetical protein SBB1_21t00009 [Solanum bulbocastanum]
Length = 338
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 187/261 (71%), Positives = 200/261 (76%), Gaps = 29/261 (11%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQ----------------------------AKMAAE 92
S S D SI DVPFP DY+ELL Q AK A E
Sbjct: 47 SCSGDRAASIGFDVPFPKDYTELLQQVFILFAFSPLKIGGWGSRNRGGITREIKAKEATE 106
Query: 93 LAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFP 152
LA+KD +LMEIEFPTAGL SVPGD EGGIEMTGS++LI EFCDL V PEK T+TRIFFP
Sbjct: 107 LALKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTGSIQLIREFCDLLVIPEKATKTRIFFP 166
Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNV 212
EANEVKFAR+S+F GASFKLDYLTKPSFFEDFGFTEKVKMADRVK EDELF+VAYPYFNV
Sbjct: 167 EANEVKFARQSIFGGASFKLDYLTKPSFFEDFGFTEKVKMADRVKPEDELFIVAYPYFNV 226
Query: 213 NEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY 272
NEMLVVEELY+ AV NT+ KLIIFNGELDRIRS YP FFYPKLAALSKTLFP MET+YY
Sbjct: 227 NEMLVVEELYQAAVLNTSRKLIIFNGELDRIRSD-YPPFFYPKLAALSKTLFPKMETVYY 285
Query: 273 IHNFKGRNGGTLFRFLEGPQE 293
IHNFKGRNGG LFR GP +
Sbjct: 286 IHNFKGRNGGVLFRCYPGPWK 306
>gi|224034407|gb|ACN36279.1| unknown [Zea mays]
gi|413926746|gb|AFW66678.1| hypothetical protein ZEAMMB73_267474 [Zea mays]
Length = 324
Score = 352 bits (904), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 183/283 (64%), Positives = 213/283 (75%), Gaps = 3/283 (1%)
Query: 4 SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVS 63
+S ++A P + P + ++ +N +I SN N +G + T + + ++ +V+
Sbjct: 5 TSYGSMANPPITSRTPFLSKQASNWIPATI--SNGNGTGG-MFTVASRKSRNGFQFCAVT 61
Query: 64 SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
D + DV FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL +VPGD EGG E
Sbjct: 62 GDPGSRNVSDVNFPSDYTELLTQAKEAAESAFKDGKQLLEIEFPTAGLQTVPGDGEGGNE 121
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MTGSM LI EFCD FV EK TRTR+FFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 122 MTGSMLLIREFCDRFVPAEKATRTRVFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 181
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGFT KVKMADRVK +DE FLVAYPYFNVNEMLVVEELYKEAV T+ KLIIFNGELDRI
Sbjct: 182 FGFTTKVKMADRVKPQDETFLVAYPYFNVNEMLVVEELYKEAVVGTSRKLIIFNGELDRI 241
Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
RSGYYP+FFYPKLA LSKT P ++T+YYIHNFKG GGTLFR
Sbjct: 242 RSGYYPAFFYPKLAELSKTFLPKLDTVYYIHNFKGAKGGTLFR 284
>gi|18422955|ref|NP_568702.1| uncharacterized protein [Arabidopsis thaliana]
gi|14326508|gb|AAK60299.1|AF385707_1 AT5g48790/K24G6_12 [Arabidopsis thaliana]
gi|18700216|gb|AAL77718.1| AT5g48790/K24G6_12 [Arabidopsis thaliana]
gi|332008342|gb|AED95725.1| uncharacterized protein [Arabidopsis thaliana]
Length = 316
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 174/234 (74%), Positives = 192/234 (82%), Gaps = 1/234 (0%)
Query: 61 SVSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
SVS NN+ +VD VPFP DY EL++QAK A E+A+KD +LMEIEFPT+GL SVPGD E
Sbjct: 51 SVSGGYNNNTSVDNVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGE 110
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
G EMT S+ +I EFCD + PEK TRIFFPEANEVKFA+K+VF G FKLDYLTKPS
Sbjct: 111 GATEMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPS 170
Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
FEDFGF E+VKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNGE
Sbjct: 171 LFEDFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGE 230
Query: 240 LDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
LDRIRSGYYP FFYPKLAAL+KTL P MET+YYIHNFKG+ GG LFR GP +
Sbjct: 231 LDRIRSGYYPKFFYPKLAALTKTLLPKMETVYYIHNFKGQKGGVLFRCYPGPWQ 284
>gi|326523775|dbj|BAJ93058.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 322
Score = 350 bits (897), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 184/275 (66%), Positives = 199/275 (72%), Gaps = 6/275 (2%)
Query: 19 PSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSIN--VDVPF 76
PS P +Q +F S + G I S RR N + G+ S F
Sbjct: 18 PSAPHKQVPNWRFPTINSGDG--GGSIFAIS--RRNLRTWFNVCAVTGDQSTRDVFSADF 73
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCD 136
PSDY+EL+ QAK A E A KDG +L+EIEFPTAGL SVPGD EGGIEMTGSM LI EFCD
Sbjct: 74 PSDYTELIVQAKEATESAFKDGKQLLEIEFPTAGLQSVPGDGEGGIEMTGSMLLIREFCD 133
Query: 137 LFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRV 196
FV EKVTRTRIFFPEA EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRV
Sbjct: 134 RFVPAEKVTRTRIFFPEAKEVTFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRV 193
Query: 197 KLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKL 256
+ EDE+FLVAYPYFNVNEMLVVEELYKEAV NT K+IIFNGELDRIRSGYYP FFYPKL
Sbjct: 194 RPEDEIFLVAYPYFNVNEMLVVEELYKEAVLNTERKMIIFNGELDRIRSGYYPPFFYPKL 253
Query: 257 AALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
LSKT P +ET+YYIHNFKG GG LFR GP
Sbjct: 254 GELSKTFLPKLETVYYIHNFKGSKGGVLFRCYPGP 288
>gi|413926747|gb|AFW66679.1| hypothetical protein ZEAMMB73_267474 [Zea mays]
Length = 310
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 177/251 (70%), Positives = 198/251 (78%), Gaps = 1/251 (0%)
Query: 36 SNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAV 95
SN N +G + T + + ++ +V+ D + DV FPSDY+ELL QAK AAE A
Sbjct: 21 SNGNGTGG-MFTVASRKSRNGFQFCAVTGDPGSRNVSDVNFPSDYTELLTQAKEAAESAF 79
Query: 96 KDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEAN 155
KDG +L+EIEFPTAGL +VPGD EGG EMTGSM LI EFCD FV EK TRTR+FFPEAN
Sbjct: 80 KDGKQLLEIEFPTAGLQTVPGDGEGGNEMTGSMLLIREFCDRFVPAEKATRTRVFFPEAN 139
Query: 156 EVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEM 215
EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRVK +DE FLVAYPYFNVNEM
Sbjct: 140 EVSFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRVKPQDETFLVAYPYFNVNEM 199
Query: 216 LVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN 275
LVVEELYKEAV T+ KLIIFNGELDRIRSGYYP+FFYPKLA LSKT P ++T+YYIHN
Sbjct: 200 LVVEELYKEAVVGTSRKLIIFNGELDRIRSGYYPAFFYPKLAELSKTFLPKLDTVYYIHN 259
Query: 276 FKGRNGGTLFR 286
FKG GGTLFR
Sbjct: 260 FKGAKGGTLFR 270
>gi|226494690|ref|NP_001145598.1| uncharacterized protein LOC100279074 [Zea mays]
gi|195658649|gb|ACG48792.1| hypothetical protein [Zea mays]
Length = 310
Score = 348 bits (892), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 176/251 (70%), Positives = 198/251 (78%), Gaps = 1/251 (0%)
Query: 36 SNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAV 95
SN N +G + T + + ++ +V+ D + DV FPSDY+ELL QAK AAE A
Sbjct: 21 SNGNGTGG-MFTVASRKSRNGFQFCAVTGDPGSRNVSDVNFPSDYTELLTQAKEAAESAF 79
Query: 96 KDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEAN 155
KDG +L+EIEFPTAGL +VPGD EGG EMTGSM LI EFCD FV EK TRTR+FFPEAN
Sbjct: 80 KDGKQLLEIEFPTAGLQTVPGDGEGGNEMTGSMLLIREFCDRFVPAEKATRTRVFFPEAN 139
Query: 156 EVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEM 215
EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRVK +DE FLVAYPYFNVNEM
Sbjct: 140 EVSFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRVKPQDETFLVAYPYFNVNEM 199
Query: 216 LVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN 275
LVVEELYKEAV T+ KLIIFNGELDRIRSGYYP+FFYPKLA LS+T P ++T+YYIHN
Sbjct: 200 LVVEELYKEAVVGTSRKLIIFNGELDRIRSGYYPAFFYPKLAELSRTFLPKLDTVYYIHN 259
Query: 276 FKGRNGGTLFR 286
FKG GGTLFR
Sbjct: 260 FKGAKGGTLFR 270
>gi|297795571|ref|XP_002865670.1| hypothetical protein ARALYDRAFT_494942 [Arabidopsis lyrata subsp.
lyrata]
gi|297311505|gb|EFH41929.1| hypothetical protein ARALYDRAFT_494942 [Arabidopsis lyrata subsp.
lyrata]
Length = 315
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 172/233 (73%), Positives = 191/233 (81%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
SVS NN+ +VPFP DY EL++QAK A ELA+KD +LMEIEFPT+GL SVPGDSEG
Sbjct: 51 SVSGGYNNTSVDNVPFPRDYFELINQAKEAVELAMKDEKQLMEIEFPTSGLASVPGDSEG 110
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
EMT S+ +I EFCD + PEK TRIFFPEANEVKFA+K+VF G FKLDYLTKPS
Sbjct: 111 ATEMTESINMIREFCDRLLAPEKARTTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPSL 170
Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
FEDFGF E+VKM+DRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNGEL
Sbjct: 171 FEDFGFFERVKMSDRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGEL 230
Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
DRIRSGYYP FFYPKLAAL+KTL P M+T+YYIHNFKG+ GG LFR GP +
Sbjct: 231 DRIRSGYYPKFFYPKLAALTKTLLPKMDTVYYIHNFKGQKGGVLFRCYPGPWQ 283
>gi|357484699|ref|XP_003612637.1| hypothetical protein MTR_5g027220 [Medicago truncatula]
gi|355513972|gb|AES95595.1| hypothetical protein MTR_5g027220 [Medicago truncatula]
Length = 365
Score = 345 bits (886), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 181/289 (62%), Positives = 203/289 (70%), Gaps = 49/289 (16%)
Query: 50 PYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKL-------- 101
P RK + + S S DGN S+ D+PFP DYSELL+QAK+A EL ++ MK+
Sbjct: 44 PTSRKLARCSVSASGDGNASVQTDIPFPFDYSELLEQAKVAVELQLR--MKVAHSKLRSE 101
Query: 102 ------------MEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRI 149
EIEFPTAGL+SVPGD EGGIEMTGSM+LI EFCDL ++ EK+TRTRI
Sbjct: 102 STIEPIMNNDLKQEIEFPTAGLESVPGDGEGGIEMTGSMQLIREFCDLSISAEKITRTRI 161
Query: 150 ---------------------------FFPEANEVKFARKSVFEGASFKLDYLTKPSFFE 182
FFPEANEV FAR+S F GASFKLDYLTKPSFF+
Sbjct: 162 MVMRKNLLCNTSSSLPVEIDVQPCKNQFFPEANEVDFARQSAFSGASFKLDYLTKPSFFQ 221
Query: 183 DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR 242
DFGF EKVKM+DRVK EDELF+VAYPYFNVNEMLVVEELYKEAV NT KLIIFNGELDR
Sbjct: 222 DFGFVEKVKMSDRVKAEDELFVVAYPYFNVNEMLVVEELYKEAVVNTERKLIIFNGELDR 281
Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
IRSGYYP FFYPKLA L+K+ P MET+YYIHNFKGR+ G LFR GP
Sbjct: 282 IRSGYYPPFFYPKLAGLTKSFLPSMETVYYIHNFKGRDRGILFRCYPGP 330
>gi|242063910|ref|XP_002453244.1| hypothetical protein SORBIDRAFT_04g002440 [Sorghum bicolor]
gi|241933075|gb|EES06220.1| hypothetical protein SORBIDRAFT_04g002440 [Sorghum bicolor]
Length = 322
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 179/283 (63%), Positives = 207/283 (73%), Gaps = 5/283 (1%)
Query: 4 SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVS 63
+S ++ P + P + ++ +N I + N +G + T + ++ +V+
Sbjct: 5 TSCGSMTKPPITFKTPFVNKQASNW----IPATISNGTGG-MFTVASRNSRNGFQVRAVT 59
Query: 64 SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
D + DV FP+DY++LL QAK AAE A KDG +L+EIEFPTAGL +VPGD EGG E
Sbjct: 60 GDPGSRNASDVKFPTDYTQLLMQAKEAAESAFKDGKQLLEIEFPTAGLQTVPGDGEGGNE 119
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MTGSM LI EFCD FV EK TRTR+FFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 120 MTGSMLLIREFCDRFVPAEKSTRTRVFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 179
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGFT KVKMADRVK EDE FLVAYPYFNVNEMLVVEELY EAV T KLIIFNGELDRI
Sbjct: 180 FGFTTKVKMADRVKPEDETFLVAYPYFNVNEMLVVEELYNEAVVGTNRKLIIFNGELDRI 239
Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
RSGYYPSFFYPKLA LSKT P ++T+YYIHNFKG GGTLFR
Sbjct: 240 RSGYYPSFFYPKLAELSKTFLPKLDTVYYIHNFKGVKGGTLFR 282
>gi|125580675|gb|EAZ21606.1| hypothetical protein OsJ_05234 [Oryza sativa Japonica Group]
gi|218189983|gb|EEC72410.1| hypothetical protein OsI_05707 [Oryza sativa Indica Group]
Length = 338
Score = 335 bits (859), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 186/307 (60%), Positives = 206/307 (67%), Gaps = 24/307 (7%)
Query: 5 SNSAIALPTVPIPVPSIPRRQANCRKF-SIKCSNENFSGQRIITFSPYRRK--HSCLTNS 61
+ S ++ P+ S P +Q +I N++G T R H C N
Sbjct: 2 ATSYCSISNPPLSKTSFPNKQVPGWVLRAISKGKGNYTGGIYTTTKRNLRTGFHVCAVNG 61
Query: 62 VSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
G + NV FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL SVPGDSEG
Sbjct: 62 ----GQGTRNVSGAEFPSDYTELLAQAKEAAESAFKDGKQLLEIEFPTAGLQSVPGDSEG 117
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
GIEMTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS
Sbjct: 118 GIEMTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYLTKPSL 177
Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
FEDFGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T KLIIFNGEL
Sbjct: 178 FEDFGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLIIFNGEL 237
Query: 241 DRIR----------------SGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTL 284
DRIR YP FFYPKLA LSKT P +ET+YYIHNFKG GGTL
Sbjct: 238 DRIRMLVTFLNKREAALMMFENNYPPFFYPKLAELSKTFLPKLETVYYIHNFKGLKGGTL 297
Query: 285 FRFLEGP 291
FR GP
Sbjct: 298 FRCYPGP 304
>gi|116793457|gb|ABK26754.1| unknown [Picea sitchensis]
Length = 337
Score = 324 bits (831), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 156/223 (69%), Positives = 181/223 (81%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
++DV FP DYSELL Q K+A + A+ D L+EIEFPTAGLDSV GD+EGGIEM SM L
Sbjct: 83 DIDVEFPGDYSELLQQVKVATQSALMDSKYLLEIEFPTAGLDSVSGDAEGGIEMNSSMTL 142
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
I EFC F+ PE+ TRTRIFFPEA EV+FA+K+VFEG +FK+DYLTKPS EDFGF KV
Sbjct: 143 IREFCRRFLKPEEATRTRIFFPEAKEVEFAKKTVFEGVAFKMDYLTKPSLLEDFGFGTKV 202
Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
KMA+RV+ DE+FLVAYPYFNV+EMLVVEELYK+AV +T KLIIFNGELDRIRSGYYP
Sbjct: 203 KMAERVQPTDEIFLVAYPYFNVDEMLVVEELYKDAVVHTDRKLIIFNGELDRIRSGYYPP 262
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
FFYPK+ AL++ P +ET YYIHNFKGR GGTLFR GP +
Sbjct: 263 FFYPKIGALARNFLPKLETAYYIHNFKGRVGGTLFRSYPGPWQ 305
>gi|356522807|ref|XP_003530035.1| PREDICTED: uncharacterized protein LOC100802995 [Glycine max]
Length = 323
Score = 307 bits (786), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 150/190 (78%), Positives = 167/190 (87%), Gaps = 1/190 (0%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
+ DVPFP+DYSELL+QA++AA+LA+KD +LMEIEFPTAGL SVPGD EGGIEMT SM+L
Sbjct: 23 HTDVPFPADYSELLEQARVAADLAIKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTESMQL 82
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
I EFCD F++ EK TRTRIFFPEA+EV FAR+SVF G SFKLDYLT PSFFEDFGF EK+
Sbjct: 83 IREFCDRFISSEKATRTRIFFPEASEVDFARQSVFSGCSFKLDYLTNPSFFEDFGFVEKI 142
Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
KM DRVK DELFLV+YPYFN NE+LVVEELYKE V NT KLIIFNGELDRIRSGYYPS
Sbjct: 143 KMLDRVKTGDELFLVSYPYFNANEILVVEELYKE-VLNTERKLIIFNGELDRIRSGYYPS 201
Query: 251 FFYPKLAALS 260
FFYPKLAAL+
Sbjct: 202 FFYPKLAALT 211
>gi|9758878|dbj|BAB09432.1| unnamed protein product [Arabidopsis thaliana]
Length = 248
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 140/187 (74%), Positives = 154/187 (82%), Gaps = 1/187 (0%)
Query: 61 SVSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
SVS NN+ +VD VPFP DY EL++QAK A E+A+KD +LMEIEFPT+GL SVPGD E
Sbjct: 51 SVSGGYNNNTSVDNVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGE 110
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
G EMT S+ +I EFCD + PEK TRIFFPEANEVKFA+K+VF G FKLDYLTKPS
Sbjct: 111 GATEMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPS 170
Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
FEDFGF E+VKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNGE
Sbjct: 171 LFEDFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGE 230
Query: 240 LDRIRSG 246
LDRIRSG
Sbjct: 231 LDRIRSG 237
>gi|168020280|ref|XP_001762671.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686079|gb|EDQ72470.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 280
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/241 (57%), Positives = 177/241 (73%), Gaps = 2/241 (0%)
Query: 53 RKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLD 112
R + S S + IN V FP DY+EL++QA+ AA+ A+KD L+E+EFPTAGLD
Sbjct: 9 RSFVVRSRSGSDPKSKIINKSVDFPKDYNELVNQARRAAQAALKDDKTLLEVEFPTAGLD 68
Query: 113 SVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKL 172
+VPGD EGGIEM S+ L+ EFC +F ++ TRIFFP+A +++ A+ S+F+G SFKL
Sbjct: 69 TVPGDEEGGIEMNTSIVLMKEFCTIF--KDEAPTTRIFFPDAKDMELAKTSIFDGTSFKL 126
Query: 173 DYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
DYLTKP+ EDFGF KVKMADRV+ D +F+VAYPYFNVNEM+ VEELYK + +
Sbjct: 127 DYLTKPNGLEDFGFGSKVKMADRVQSSDTVFVVAYPYFNVNEMIAVEELYKGSAAASNRP 186
Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQ 292
+I+FNGELDRIRSGYYPSFFYPKL +++K P ET+YYIHNFKGR+ G LFR GP
Sbjct: 187 IIVFNGELDRIRSGYYPSFFYPKLGSIAKEFLPKFETVYYIHNFKGRSRGVLFRMYPGPW 246
Query: 293 E 293
+
Sbjct: 247 Q 247
>gi|302761398|ref|XP_002964121.1| hypothetical protein SELMODRAFT_166751 [Selaginella moellendorffii]
gi|300167850|gb|EFJ34454.1| hypothetical protein SELMODRAFT_166751 [Selaginella moellendorffii]
Length = 303
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 129/240 (53%), Positives = 166/240 (69%), Gaps = 3/240 (1%)
Query: 54 KHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDS 113
K S S DGN VPFPSDY E++ QA+ A + A+ D KL+E+E P AGL++
Sbjct: 28 KSSWRILRASRDGNVG---SVPFPSDYIEMVKQAQDACQAALDDSKKLLEVEVPPAGLNT 84
Query: 114 VPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLD 173
V GD EGGIEM SM ++ +FC T EK RTR+FFPE E+ A+ VF+G+ FKLD
Sbjct: 85 VSGDEEGGIEMNISMEIVQKFCAGMFTGEKAPRTRVFFPELAEMNIAKSGVFDGSMFKLD 144
Query: 174 YLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKL 233
YLTKPS ++D G +KVKM++R + D F+VAYP+FN NEML VEELY+++ + +
Sbjct: 145 YLTKPSPWDDIGLGKKVKMSERARPTDATFVVAYPFFNPNEMLAVEELYRDSAKESGCPI 204
Query: 234 IIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
I+ NG+LD+IR+GYYP FFYPKL AL+KT P ET+YYIHNFKGR GTLFR GP +
Sbjct: 205 IVINGDLDKIRNGYYPPFFYPKLGALAKTFLPDFETVYYIHNFKGRFAGTLFRAYPGPWQ 264
>gi|302820762|ref|XP_002992047.1| hypothetical protein SELMODRAFT_134592 [Selaginella moellendorffii]
gi|300140169|gb|EFJ06896.1| hypothetical protein SELMODRAFT_134592 [Selaginella moellendorffii]
Length = 303
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 127/232 (54%), Positives = 164/232 (70%), Gaps = 3/232 (1%)
Query: 62 VSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
S DGN VPFPSDY E++ QA+ A + A+ D KL+E+E P AGL++V GD EGG
Sbjct: 36 ASRDGNVG---SVPFPSDYIEMVKQAQDACQAALDDSKKLLEVEVPPAGLNTVSGDEEGG 92
Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
IEM SM ++ +FC T EK RTR+FFPE E+ A+ VF+G+ +KLDYLTKPS +
Sbjct: 93 IEMNISMEIVQKFCAGMFTGEKAPRTRVFFPELAEMNIAKSGVFDGSMYKLDYLTKPSPW 152
Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
+D G +KVKM++R + D F+VAYP+FN NEML VEELY+E+ + +I+ NG+LD
Sbjct: 153 DDIGLGKKVKMSERTRPTDATFVVAYPFFNPNEMLAVEELYRESAKESGCPIIVINGDLD 212
Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
+IR+GYYP FFYPKL AL+KT P ET+YYIHNFKGR GTLFR GP +
Sbjct: 213 KIRNGYYPPFFYPKLGALAKTFLPDFETVYYIHNFKGRFAGTLFRAYPGPWQ 264
>gi|440583726|emb|CCH47228.1| hypothetical protein [Lupinus angustifolius]
Length = 283
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 137/248 (55%), Positives = 153/248 (61%), Gaps = 46/248 (18%)
Query: 42 GQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKL 101
G + P RK LT SD + S +VPFP+DY+ELL+QA++A ELA+KD +L
Sbjct: 27 GGVTASLVPRNRK---LTGCSVSDVSASTETNVPFPTDYTELLEQARVAVELAMKDNRQL 83
Query: 102 MEIEFPTAGLDSVPG-------------------------------------DSEGGIEM 124
MEIEFPTAGL SVPG D EGGIEM
Sbjct: 84 MEIEFPTAGLASVPGSPYFTFLLFNFSFWIEFHCTLVPIYFQTDSYHSMISGDGEGGIEM 143
Query: 125 T----GSMRLICEFCDLFVTPEK--VTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKP 178
T M + L P + FFPEA+EV FAR+SVF GASFKLDYLTKP
Sbjct: 144 TEIKTSVMINTPKILSLVTAPNSSGLLIYVQFFPEASEVDFARQSVFSGASFKLDYLTKP 203
Query: 179 SFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNG 238
SFF+DFGF EKVKM+DRVK DELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNG
Sbjct: 204 SFFQDFGFVEKVKMSDRVKAGDELFLVAYPYFNVNEMLVVEELYKEAVLNTERKLIIFNG 263
Query: 239 ELDRIRSG 246
ELDRIRSG
Sbjct: 264 ELDRIRSG 271
>gi|115443993|ref|NP_001045776.1| Os02g0129300 [Oryza sativa Japonica Group]
gi|113535307|dbj|BAF07690.1| Os02g0129300, partial [Oryza sativa Japonica Group]
Length = 161
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 121/144 (84%)
Query: 116 GDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYL 175
GDSEGGIEMTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYL
Sbjct: 7 GDSEGGIEMTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYL 66
Query: 176 TKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLII 235
TKPS FEDFGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T KLII
Sbjct: 67 TKPSLFEDFGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLII 126
Query: 236 FNGELDRIRSGYYPSFFYPKLAAL 259
FNGELDRIRSG +F + AAL
Sbjct: 127 FNGELDRIRSGLLVTFLNKREAAL 150
>gi|215686777|dbj|BAG89627.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 147
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/136 (77%), Positives = 113/136 (83%)
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 1 MTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 60
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T KLIIFNGELDRI
Sbjct: 61 FGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLIIFNGELDRI 120
Query: 244 RSGYYPSFFYPKLAAL 259
RSG +F + AAL
Sbjct: 121 RSGLLVTFLNKREAAL 136
>gi|307111351|gb|EFN59585.1| hypothetical protein CHLNCDRAFT_56449 [Chlorella variabilis]
Length = 336
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 139/237 (58%), Gaps = 22/237 (9%)
Query: 75 PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEF 134
PFP DY++ + QA+ AA A+ DG L+E+EFPTA L +V GD+EG EMT S++ + +F
Sbjct: 60 PFPGDYNQAVRQAQGAAAAALADGASLVEVEFPTASLVAVAGDAEGANEMTYSLQHLRQF 119
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKS--------------VFEGASFKLDYLTKPSF 180
+ ++ TRIFFP+ E+K A K VFEG +FK YL KP+
Sbjct: 120 MRGWK--DQAGTTRIFFPDPTELKVALKGKAMDPNAGSWTIDPVFEGTAFKFGYLMKPNP 177
Query: 181 FEDFGFT-EKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK-----LI 234
F D G T K+ AD++ ++L ++AYP+FN EML V L++ + +I
Sbjct: 178 FLDMGITVGKINAADQLDGREQLLVMAYPHFNPQEMLEVAALHEYLAAQAGGREGATPII 237
Query: 235 IFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGP 291
FN ELDRIR+GYYP FFYP + ++K+L P T YYI NFKG GG +FR P
Sbjct: 238 TFNAELDRIRTGYYPPFFYPAIGKIAKSLLPQFTTAYYIKNFKGATGGCIFRCYPSP 294
>gi|255080176|ref|XP_002503668.1| predicted protein [Micromonas sp. RCC299]
gi|226518935|gb|ACO64926.1| predicted protein [Micromonas sp. RCC299]
Length = 369
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/246 (33%), Positives = 126/246 (51%), Gaps = 26/246 (10%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICE 133
PFP DY++++ Q + A + + DG+ LMEI+FP GL++ PGD EG +E +++ +
Sbjct: 90 TPFPKDYAQMVSQCQKALQHGLDDGLGLMEIQFPPGGLETAPGDVEGNMESNLTVQHLRG 149
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFAR------------------KSVFEGASF--KLD 173
C F + TR+FFP+ E K AR ++ F ++ +D
Sbjct: 150 ICAQFERNKTAKTTRVFFPDPIEAKLARTGTNASPDGVRAPSNSETRAWFAPNNWPGPVD 209
Query: 174 YLTKPSFFEDFG----FTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAV--F 227
+L PSF G ++V ++ K D F+VAYP NV+E+ ELY+ +
Sbjct: 210 FLESPSFLSVSGLDKVLNKRVSTWNKAKANDTAFVVAYPVSNVSELTCTRELYEGELGRG 269
Query: 228 NTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRF 287
A +++ NGEL+R R+ YYP F+ A + V E IY+IHNFKG N LFR
Sbjct: 270 TGARPIVVCNGELERTRTNYYPPFWNAGEMAPLREFVKVFEQIYFIHNFKGSNPAVLFRC 329
Query: 288 LEGPQE 293
GP +
Sbjct: 330 YPGPWQ 335
>gi|145344528|ref|XP_001416783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577009|gb|ABO95076.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 277
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 121/241 (50%), Gaps = 22/241 (9%)
Query: 75 PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEF 134
PFP DY+EL QA+ + + KDG++L+E++FP GL+ GD EG +E + +
Sbjct: 1 PFPRDYAELERQARESVKRCAKDGVELVELQFPPGGLELASGDLEGNVECNLTTERLRGI 60
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFA------------------RKSVFEGASFKLDYLT 176
CD FV + TR+ FP+ E++ A + F LDY+
Sbjct: 61 CDAFVANGTASTTRVLFPDPTEMRLATTGANAAPDGIRAPEQSDTRGWFADWKGTLDYVD 120
Query: 177 KPSFFEDFGFTE----KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
PSF GF + K +++R++ ++ ++VAYP N++E+ +LY+ V T
Sbjct: 121 DPSFMSVSGFDKIFGGKKNISERMRGDETAYVVAYPSANISELANTRDLYEGCVRGTGKS 180
Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQ 292
L++ NGEL+R RS YYP F+ ++ E Y I+NFKG N LFR P
Sbjct: 181 LVVCNGELERTRSNYYPPFWNAGEMGPLRSFCRKFEGAYVIYNFKGSNPAVLFRVYPEPW 240
Query: 293 E 293
+
Sbjct: 241 Q 241
>gi|159481297|ref|XP_001698718.1| hypothetical protein CHLREDRAFT_205904 [Chlamydomonas reinhardtii]
gi|158273612|gb|EDO99400.1| predicted protein [Chlamydomonas reinhardtii]
Length = 364
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 81/219 (36%), Positives = 117/219 (53%), Gaps = 25/219 (11%)
Query: 65 DGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEM 124
+ ++ PFP+ Y + QA+ A + A+ DG KL+E+EFP+ L SV GD EG EM
Sbjct: 41 EAATAVQTPAPFPTSYVMAMRQAQEAVKAALADGAKLVEVEFPSTTLSSVSGDGEGQNEM 100
Query: 125 TGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKS--------------VFEGASF 170
SM + F F + + TR+FFP+ E+ AR F +F
Sbjct: 101 NASMGYLRTFLGGFRS--RAASTRVFFPDNVELAVARSGQTEDPSAGRKALDPQFADVTF 158
Query: 171 KLDYLTKP-SFFEDFGFTEK----VKMADRVKLEDELFLVAYPYFNVNEML-VVEELYKE 224
+L YLT+ + + FGF + VK+ VK D+L +VAYP FN E L V ELY++
Sbjct: 159 QLGYLTEQNAAWAMFGFYKSAFDPVKL---VKDTDDLLVVAYPSFNPREELSAVYELYQQ 215
Query: 225 AVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTL 263
++IFNGELDR+R GYYPS F+P++A + +
Sbjct: 216 KAKARGMPIVIFNGELDRVRGGYYPSVFFPEIAVRQRAV 254
>gi|424513544|emb|CCO66166.1| predicted protein [Bathycoccus prasinos]
Length = 423
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 132/284 (46%), Gaps = 42/284 (14%)
Query: 40 FSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGM 99
F G+++ P R ++S + N PFP+DY ++ QA+ A + A +DG+
Sbjct: 103 FGGKKVEVVLPPSRALGSGRTTISKNTNGGRQY--PFPADYDVMVQQARQALQKAREDGV 160
Query: 100 KLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKF 159
L EI+FP GLD PGD EG +E T + ++ + EK+T + FP+ E+K
Sbjct: 161 DLGEIQFPPGGLDLAPGDLEGNVECTLTATVLRKILRGMKEEEKIT---VLFPDPTELKL 217
Query: 160 ARKS--------------------VFEGASFKLDYLTKPSFFEDFG----FTEKVKMADR 195
A++ +FE +L+YL P+ F G F + + DR
Sbjct: 218 AKRGQTGMCAPDGVAPPEVFQTDPLFEDWRGELNYLDDPNAFSVSGLDKIFGKSATVNDR 277
Query: 196 VKL-EDELFLVAYPYFNVNEMLVVEELYKE-----------AVFNTAWK-LIIFNGELDR 242
V + E +F+ AYP N+ E+ LY+ + T K L++ NGELDR
Sbjct: 278 VDINEGNMFVCAYPSGNIAELTQTRLLYENIREENESDAPASKIKTKRKSLVVVNGELDR 337
Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
RS YYP F+ + E IY+IHNFKG N LFR
Sbjct: 338 TRSNYYPWFWNKNEMEPLREFSQSFEGIYFIHNFKGTNPAVLFR 381
>gi|303272213|ref|XP_003055468.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463442|gb|EEH60720.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 252
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/220 (35%), Positives = 114/220 (51%), Gaps = 12/220 (5%)
Query: 86 QAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVT 145
QA+ + + A+ DG++L+EI+FP+ GLD+ PGD EG +E ++ + C F
Sbjct: 1 QAQASLQAALDDGVELLEIQFPSGGLDTAPGDVEGNVENNLTVAHLRGICSQFERNGTAK 60
Query: 146 RTRIFFPEANEVKFARKSVFEG----ASF--KLDYLTKPSFFEDFGFTE----KVKMADR 195
TR+FFP+ E A ASF +DYL +P F G + + +A R
Sbjct: 61 TTRVFFPDPIERSLALTGAAPSPDGFASFPGPIDYLEQPDFLSVSGLDKMLGTRKTVAMR 120
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKE--AVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
V D F+VAYP NV+E++ EL + A A +++ NGEL+R RS YYPSF+
Sbjct: 121 VPESDTAFVVAYPCTNVSELVCTRELREGELARAGPARPIVMCNGELERTRSEYYPSFWN 180
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
+ E +Y++HN+KG N LFR GP +
Sbjct: 181 VGEMKPLRGFAREFEGVYFVHNYKGSNPAVLFRAYPGPWQ 220
>gi|308802235|ref|XP_003078431.1| unnamed protein product [Ostreococcus tauri]
gi|116056883|emb|CAL53172.1| unnamed protein product [Ostreococcus tauri]
Length = 267
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 115/233 (49%), Gaps = 22/233 (9%)
Query: 83 LLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPE 142
++ Q K A + A+ DG +L+E++FP GL+ GD EG +E + + CD F
Sbjct: 1 MVRQCKEAMKRAIVDGTELIELQFPPGGLELASGDLEGNVECNLTTERLRGICDGFRELG 60
Query: 143 KVTRTRIFFPEANEVKFA------------------RKSVFEGASFKLDYLTKPSFFEDF 184
+TR+ FP+ E + A +++F ++DYL PSF
Sbjct: 61 MAEKTRVLFPDPTETRLALTGSSPTPDGIRAPEQSETRAMFGDWVGRVDYLDDPSFMSVS 120
Query: 185 GFTE----KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
G + K +A+R+ +D F+VAYP N++E+ +LY++AV + L++ NGE+
Sbjct: 121 GLDKILGTKKSIAERMGADDAAFVVAYPSANISELANTRDLYEDAVRGSGRPLVVCNGEM 180
Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
+R RS YYP F+ + E +Y I+NFKG N LFR P +
Sbjct: 181 ERTRSNYYPPFWNAGEMGPLREFARKFEGVYVIYNFKGSNPAVLFRVYPEPWQ 233
>gi|302852030|ref|XP_002957537.1| hypothetical protein VOLCADRAFT_107711 [Volvox carteri f.
nagariensis]
gi|300257179|gb|EFJ41431.1| hypothetical protein VOLCADRAFT_107711 [Volvox carteri f.
nagariensis]
Length = 271
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 104/222 (46%), Gaps = 61/222 (27%)
Query: 70 INVDVPFPSDYSELLDQ-------------AKMAAELAVKDGMKLMEIEFPTAGLDSVPG 116
+ PFP Y + + Q A+ A + A+ DG L+E+EFP+ L SV G
Sbjct: 43 LQAPAPFPVSYDQAMRQLLPRFPAPLFQHSAQEAVKAALADGAPLVEVEFPSTTLSSVSG 102
Query: 117 DSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFA-----------RKSV- 164
D EG EM SM + +F F + + TR+FFP+ E+ A RKS+
Sbjct: 103 DGEGQNEMNASMGFLRQFLGAFRS--RAASTRVFFPDNVELAVARSGQTEDPAAGRKSLD 160
Query: 165 --FEGASFKLDYLTKP-SFFEDFGFT----EKVKMADRVKLEDELFLVAYPYFNVNEMLV 217
F A F+L YLT+ + + FGF + VK+ VK D++ ++AYP FN
Sbjct: 161 PKFGDAVFQLGYLTQQNAAWAVFGFYKSGFDPVKL---VKDTDDMLVIAYPSFNP----- 212
Query: 218 VEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAAL 259
GELDR+R GYYP+ F+P++A L
Sbjct: 213 -------------------RGELDRVRGGYYPALFFPEIAKL 235
>gi|428164159|gb|EKX33196.1| hypothetical protein GUITHDRAFT_156132 [Guillardia theta CCMP2712]
Length = 215
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 78/149 (52%), Gaps = 19/149 (12%)
Query: 149 IFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYP 208
+ FP+ +E + A + F L L+KP + DRV +LV +P
Sbjct: 1 MVFPDPSEARIAFEEYGSQVPFSLSSLSKPK-----------QQEDRVNK----YLVMHP 45
Query: 209 YFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG----YYPSFFYPKLAALSKTLF 264
F+V E + ++ELY V +IIFNG+L ++RSG YYP FF+PKLA + +
Sbjct: 46 VFDVREYIQMDELYMSEVAPKDAAMIIFNGDLFKMRSGGIGGYYPDFFFPKLAQVRRRFM 105
Query: 265 PVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
P++ET YY+ F+G G L+R GP +
Sbjct: 106 PMVETAYYLRVFRGPPVGALYREYPGPWQ 134
>gi|255087178|ref|XP_002505512.1| predicted protein [Micromonas sp. RCC299]
gi|226520782|gb|ACO66770.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 106/247 (42%), Gaps = 38/247 (15%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPG----DSEGGIEMTGSMRLI 131
P D S+LL + + + A+ DG L+++E P D V G DS E M ++
Sbjct: 64 LPEDESDLLARIHTSIQAALSDGKVLLDVEVPVQYFDGVVGVGGQDSIAISEFNACMSVL 123
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVF---------EGASFK-----LDYLTK 177
+ LF + R+FFP+A E A K + A+F +DYL +
Sbjct: 124 RKIVRLFEWLGQAESVRVFFPDAAECSIALKGAGLNPVSGQWEQAATFHDWPGAVDYLLR 183
Query: 178 PSFFED-----FGFTE-------KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEA 225
F +G+ + K + ++ D L++V YPY N EM V L++E
Sbjct: 184 DDFVSQTSRKAYGYADLPDFLAGKRDVEQTAEVADRLYVVGYPYDNTGEMEQVMRLWEE- 242
Query: 226 VFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNF-KGRNGGTL 284
A +++FNG LD +R+ + P + K L P T +Y+H F G G L
Sbjct: 243 ---HARPILVFNGNLDGVRTSFAP---FGKAKKLKHEFVPKFTTAFYVHKFAAGAAPGLL 296
Query: 285 FRFLEGP 291
+R P
Sbjct: 297 YRQYPSP 303
>gi|449018586|dbj|BAM81988.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 247
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 95/223 (42%), Gaps = 25/223 (11%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMK---LMEIEFPTAGLDSVPGDSEGGIEMTGSMR-LI 131
P D + L Q + A A + + L E+ FP A D+ S T R +I
Sbjct: 10 LPKDTASLHRQVQNALSKATETKTRSPALYEVSFP-AVRDTTAALSRILDANTSHAREII 68
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
F F R + FP+ E K A K G+S L+ +E F ++V+
Sbjct: 69 KPFAASFRK-----RLHLVFPDVAEAKIAEKVY--GSSEHTFTLSALPLYERPAFLQQVE 121
Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSF 251
L V P FN++E L +E + A+ +++ NG +DR+RS YYP
Sbjct: 122 AP-------ALVFVVQPGFNIDEWLQLE---RPALLYPDASIVVLNGNMDRLRSNYYPPL 171
Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQES 294
FYP+L AL K E IYY+ K G LFR P ++
Sbjct: 172 FYPRLTALRKRYLEQFEPIYYL---KPLPNGLLFRVFPEPWQT 211
>gi|298715350|emb|CBJ27978.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 314
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 123/285 (43%), Gaps = 51/285 (17%)
Query: 5 SNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVSS 64
+++A+A +P P+P PR + + + R P R++ CLT
Sbjct: 25 ASTALAFVALPSPLPRSPR-------YHQRLYDAAAPRPRREKPRPQRQQVQCLTK---- 73
Query: 65 DGNNSINVDVPFPSD-YSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGG 121
+P D Y+ + Q A + A+ G+KL+E+EFP LD G++
Sbjct: 74 ---------IPSGKDPYAAVKKQTAEATQDAINAGIKLIELEFPPVRGKLDISLGET--- 121
Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
+ + E F + + FP+ E + A ++ + G +F++
Sbjct: 122 --LDANRSFARELARSF-SARMGKALWLVFPDDAEAELA-QNTYGGTTFRV--------- 168
Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
G +K D E ++ +V P F+VNE +V++ L + V +++ NG LD
Sbjct: 169 --VGINSAIK--DLKDEECQMQIVVNPGFDVNEWIVLDSLVRPDV-----PMVMLNGNLD 219
Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
++R GYYP F+P L + ET+YY+ K GG +FR
Sbjct: 220 KLRGGYYPRIFFPGLYNAKERFLKKFETVYYL---KALPGGWIFR 261
>gi|452824537|gb|EME31539.1| hypothetical protein Gasu_12130 [Galdieria sulphuraria]
Length = 273
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 91/221 (41%), Gaps = 38/221 (17%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP------TAGLDSVPGDSEGGIEMTGS 127
+ P +L+ + + + A+ DG+KL+E++FP +A L+ V M +
Sbjct: 46 IRLPESNVQLVQDIQESCKSAICDGLKLLEVQFPPLKNIGSAALNQV---------MDAN 96
Query: 128 MRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFT 187
F T + FP+ E K AR+ D+ T S F
Sbjct: 97 RTFAKSVVQRFPHVSGNGTTFVVFPDDAESKLARED--------RDFRTLDSVF------ 142
Query: 188 EKVKMADRVKLED-ELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
+ + L+D L ++ P F V E VE V +I+FN +LD++R G
Sbjct: 143 -ITSLQRDIDLQDASLVVILNPGFQVQEWFEVERFCNYQV-----PVILFNADLDKLRGG 196
Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRF 287
YYP F YPKL A E +YY+ F NG + R+
Sbjct: 197 YYPRFLYPKLYATKDKCLTKFEPVYYVRFFV--NGALIRRY 235
>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 249
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A+ DG KL+++E FP IE+ + + +
Sbjct: 4 LPTTLSEAIEQAKQAATAALDDGYKLIQVELVFPE-------------IELQ-AQSIASQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 50 FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 96
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 97 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 150
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
L + ++E+ YY+ K +G LFR G E
Sbjct: 151 R-------QLRERFLNIIESCYYL---KPLDGAALFRCYPGTWE 184
>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
Length = 262
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A+ DG KL+++E FP IE+ + + +
Sbjct: 17 LPTTLSEAIEQAKQAATAALDDGYKLIQVELVFPE-------------IELQ-AQSIASQ 62
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 63 FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 109
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 110 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 163
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
L + ++E+ YY+ K +G LFR G E
Sbjct: 164 R-------QLRERFLNIIESCYYL---KPLDGAALFRCYPGTWE 197
>gi|224013206|ref|XP_002295255.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969217|gb|EED87559.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 391
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNG-GTLFRFLEGP 291
+++ NG LD++R G+YP+ F+PKLAA + E+++Y+ F + G L+R P
Sbjct: 286 MVVINGALDKVRGGFYPAIFFPKLAATVDRFWKRFESVFYLKPFSDKGVYGWLYRVYPEP 345
Query: 292 QE 293
+
Sbjct: 346 WQ 347
>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 249
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A++DG KL+++E FP IE+ + + +
Sbjct: 4 LPTTLSEAIEQAKQAATAALEDGYKLIQVELVFPE-------------IELQ-AQSIASQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 50 FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 96
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 97 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 150
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
L + +E+ YY+ K +G LFR G E
Sbjct: 151 R-------QLRERFLNTIESCYYL---KPLDGAALFRCYPGTWE 184
>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
Length = 262
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 49/224 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A++DG KL+++E FP IE+ + + +
Sbjct: 17 LPTTLSEAIEQAKQAATAALEDGYKLIQVELVFPE-------------IELQ-AQSIASQ 62
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 63 FIPALEKPD--TLLKVFFPDAGAAALARR----------DWGETPFRVTDIG-TSRSPVE 109
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 110 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYTA 163
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
L + +E+ YY+ K +G LFR G E
Sbjct: 164 R-------QLRERFLNTIESCYYL---KPLDGAALFRCYPGTWE 197
>gi|397566319|gb|EJK45002.1| hypothetical protein THAOC_36416 [Thalassiosira oceanica]
Length = 370
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/258 (22%), Positives = 102/258 (39%), Gaps = 53/258 (20%)
Query: 83 LLDQAKMAAELAVKDGMKLMEIEFP----TAGLDSVPGDSEGGIEMTGSMRLICEFCDLF 138
L AK+A + A+ DG+ +E+EFP A S D + E+ + + +F
Sbjct: 75 LRKTAKLAIDSAIADGVSKIEVEFPPLLGGARSKSQFDDFDNVQELDSNKEWTMQLAPMF 134
Query: 139 VTPE--KVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDF------------ 184
+ K RT + FP+ E + A+K F G ++ T +F
Sbjct: 135 AGDKTYKDGRTWLVFPDLKECELAKKD-FPGQRYQEATFTTIEAVTNFMSSSGSPGSSEE 193
Query: 185 -----------GFTEKV--KMADRVKLEDE-------------LFLVAYPYFN--VNEML 216
G + + K D L D+ L+LV P V + +
Sbjct: 194 YAAPWGASLMSGLSSMMGGKDGDAGLLGDQSSLDSLNVDSPANLWLVVQPGNGGPVEDWV 253
Query: 217 VVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNF 276
E+++ ++ +++ NG LD++R G+Y F+P LAA + + ET Y+ F
Sbjct: 254 NCEKMHSPSI-----PMVVVNGALDKVRGGFYAPIFFPALAATVERFWKKFETGLYLKPF 308
Query: 277 KGRNG-GTLFRFLEGPQE 293
+ G L+R P +
Sbjct: 309 SDKGVYGWLWRVYPEPWQ 326
>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 248
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 50/223 (22%), Positives = 92/223 (41%), Gaps = 47/223 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P D +E + Q+++A A+ DG L+++E FP L + + +++ + E
Sbjct: 4 LPKDLNEAIAQSRIATAAALSDGKTLLQVELVFPEIALQA----------QSITLQFLPE 53
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +++ ++FFP+ AR+ D+ P D G + + +
Sbjct: 54 FEEIY------PGVKVFFPDTGAAALARR----------DWGETPFKVTDLG-SSRTPVE 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
D++ ED+LFL+ P E+ VE++Y A +I+ N L+ + + GY
Sbjct: 97 DKIAPEDQLFLLINP--AAVEVAQVEKIYIAAAGR---PVILLNPRLEDVATIGIGYAGR 151
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
L +E+ YYI + LFR P +
Sbjct: 152 -------QLRDRFLNKIESCYYIRPL---DTAALFRCYPQPWQ 184
>gi|219125569|ref|XP_002183049.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405324|gb|EEC45267.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 356
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 88/206 (42%), Gaps = 43/206 (20%)
Query: 87 AKMAAELAVKDGMKLMEIEFP--TAGLDSVPG-DSEGGIEMTGSMRLICEFCDLFVTPEK 143
A A + A++DG K +EI+FP T G S D ++ + R C + + P
Sbjct: 76 AAPALQKALRDGWKQLEIDFPPLTGGDQSKTQFDDFDNVQELNANRDWC----VQLAPAI 131
Query: 144 VTRTR-IFF--PEANEVKFARKSVFEGASFK------------LDYLTKPSFFEDFGFTE 188
++ R ++F P+ E + A++ + G F+ L + + + +G T
Sbjct: 132 ASKNREVWFILPDDKECELAKEE-WTGQRFRQAAKFTSVRAAVLKTSGESQYSKAWGSTI 190
Query: 189 KVKM----------ADRVKLED-----ELFLVAYPYFN--VNEMLVVEELYKEAVFNTAW 231
M AD L+D LV P V + + VE L+K + +
Sbjct: 191 ASTMNKLTGGDGILADSSTLDDLGSGDRFHLVCQPGNGGPVEDWINVERLHKA---DPSQ 247
Query: 232 KLIIFNGELDRIRSGYYPSFFYPKLA 257
+ NG LD++R GYYP+ F+P LA
Sbjct: 248 PTCVVNGALDKVRDGYYPAVFFPALA 273
>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
Length = 248
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 91/216 (42%), Gaps = 47/216 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P D +E + Q+++A A+ DG L+++E FP L + + + + + E
Sbjct: 4 LPKDLNEAIAQSRIATAAALSDGKTLLQVELVFPEIALQA----------QSITEQFLPE 53
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
+++ ++FFP+A AR+ D+ P D G + + +
Sbjct: 54 LEEIY------PGVKVFFPDAGAAALARR----------DWGETPFKVTDLG-SSRSPVE 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
D++ ED+LFL+ P E+ VE LY A +I+ N L+ + + GY
Sbjct: 97 DKIAPEDQLFLLINP--AAVEVAQVERLYIAAAGR---PVILLNPRLEDVATIGIGYAGR 151
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFR 286
+ LSK +E+ YY+ + LFR
Sbjct: 152 QLRDRF--LSK-----IESCYYVRPL---DAAALFR 177
>gi|402308913|ref|ZP_10827915.1| deoxyribose-phosphate aldolase [Prevotella sp. MSX73]
gi|400374492|gb|EJP27410.1| deoxyribose-phosphate aldolase [Prevotella sp. MSX73]
Length = 298
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 44 RIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMA-AELAVKDGMKLM 102
+ T Y ++ S+ DG +NV FPS S+ + K+A A LAVKDG +
Sbjct: 86 HVATICTYPNFAKLVSESLEVDGVQVVNVSGSFPS--SQTFIEVKVAEASLAVKDGATEI 143
Query: 103 EIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR- 161
+I P S GD EG + G + C C P KV P ++VK A
Sbjct: 144 DIVMPVGKYLS--GDYEGVADEIGEQKQACGEC-----PMKVILETGCLPSMSDVKKASI 196
Query: 162 KSVFEGASF 170
+++ GA +
Sbjct: 197 IAMYAGADY 205
>gi|288925941|ref|ZP_06419871.1| deoxyribose-phosphate aldolase [Prevotella buccae D17]
gi|315606905|ref|ZP_07881912.1| deoxyribose-phosphate aldolase [Prevotella buccae ATCC 33574]
gi|288337365|gb|EFC75721.1| deoxyribose-phosphate aldolase [Prevotella buccae D17]
gi|315251413|gb|EFU31395.1| deoxyribose-phosphate aldolase [Prevotella buccae ATCC 33574]
Length = 298
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 44 RIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMA-AELAVKDGMKLM 102
+ T Y ++ S+ DG +NV FPS S+ + K+A A LAVKDG +
Sbjct: 86 HVATICTYPNFAKLVSESLEVDGVQVVNVSGSFPS--SQTFIEVKVAEASLAVKDGATEI 143
Query: 103 EIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR- 161
+I P S GD EG + G + C C P KV P ++VK A
Sbjct: 144 DIVMPVGKYLS--GDYEGVADEIGEQKQACGEC-----PMKVILETGCLPSMSDVKKASI 196
Query: 162 KSVFEGASF 170
+++ GA +
Sbjct: 197 IAMYAGADY 205
>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 253
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 93/218 (42%), Gaps = 37/218 (16%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+D +E + Q A + A++DG ++IE VP IE+ + L +F
Sbjct: 4 LPNDINEAIVQGMEATKAALQDGYTRVQIEI------VVP-----DIELQ-AQSLAKQFI 51
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
+ E T+ ++FFP++ AR+ ++ A+FK+ ED G T + + +
Sbjct: 52 PALL--ETSTKLKVFFPDSGAAALARRD-WQDATFKI---------EDLG-TSRSPVDKK 98
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
V+ ED+ FL+ P + E+ E+L A LI ++ + GY
Sbjct: 99 VEPEDQCFLLIAP--SAIEVAQTEKLSNLAGDRPVIMLIPKLEDVSIVGIGYAAR----- 151
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRFLEGPQE 293
L + +E+ YYI + G L+R P +
Sbjct: 152 --QLRERFIKTIESCYYIRSL---GGAALYRCYPSPWQ 184
>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
Length = 249
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 83/208 (39%), Gaps = 44/208 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P E ++QAK A + A+ DG KL+++E FP IE+ + + +
Sbjct: 4 LPKSIEEAVEQAKQATQAALDDGYKLVQVELVFPE-------------IELQ-AQAIAQQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F + E T ++FFP+A AR+ D+ P D G + + +
Sbjct: 50 F--IPAIEESGTVLKVFFPDAGAAALARR----------DWGEIPFKISDLG-SSRSPID 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIR---SGYYPS 250
RVK +D FLV P + VE++ K + I+ N L+ I GY
Sbjct: 97 SRVKDDDGRFLVVSPT-----PVEVEQVEKLSQLAGDRVTILLNPRLEDIAIIGIGYAAR 151
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKG 278
AL +E+ YY+ +G
Sbjct: 152 -------ALRDRFISTIESCYYLRPLEG 172
>gi|219113845|ref|XP_002186506.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583356|gb|ACI65976.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 379
Score = 40.8 bits (94), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 5/92 (5%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMT-GSMRLIC 132
+P PS + EL + A LA KDG KL+E+EFP + D ++ +++L
Sbjct: 67 LPPPSSFFELQQDCQRAVRLARKDGHKLLEVEFPPLPAAVLEMDDVSAYDVVQANLKLAL 126
Query: 133 EFCDLFVTPEK----VTRTRIFFPEANEVKFA 160
+F + E+ + + + FP+ E FA
Sbjct: 127 DFSKGLLAGERDGSSLKKIALLFPDQAEADFA 158
>gi|384251129|gb|EIE24607.1| hypothetical protein COCSUDRAFT_14109 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 85/193 (44%), Gaps = 22/193 (11%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGDSEGGIEMTGSMRLICEF 134
PS + EL++ A + A+ DG+ +E+EFP +D G S+ I+ +++L
Sbjct: 92 PSSFQELVNDATASVRAAIGDGLTRLEVEFPALPGNIDGYKGASDWFID--SNIQLAIAA 149
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGA-----SFKLDYLTKPS--FFEDFGFT 187
+ V E R I P+ E + K +F+GA + +L + S F F F
Sbjct: 150 SRILVK-ESGKRVHILVPDGGEYNRSYK-MFKGALDLADGISMGHLKENSKGVFSSFNFF 207
Query: 188 EKVKMADRVKLED-----ELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR 242
V AD L ++F+V + E+ +E +E V L+++N E+D
Sbjct: 208 GSVPDADAETLSQAARKADVFIVVNA--STIELPDLERYIEEIVGERP--LVLWNLEVDT 263
Query: 243 IRSGYYPSFFYPK 255
+R+ F PK
Sbjct: 264 LRADLGLLGFPPK 276
>gi|255526474|ref|ZP_05393385.1| UvrD/REP helicase [Clostridium carboxidivorans P7]
gi|296184847|ref|ZP_06853258.1| putative ATP-dependent DNA helicase PcrA [Clostridium
carboxidivorans P7]
gi|255509856|gb|EET86185.1| UvrD/REP helicase [Clostridium carboxidivorans P7]
gi|296050629|gb|EFG90052.1| putative ATP-dependent DNA helicase PcrA [Clostridium
carboxidivorans P7]
Length = 754
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 3/88 (3%)
Query: 37 NENFSGQRII---TFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAEL 93
N+N + + I +FS Y + + +TNSV+S N S+N +P S L + A L
Sbjct: 640 NDNMTARNTIKSKSFSTYSNRSTTITNSVTSHMNRSVNNSMPSSFGESSLNKNKENANSL 699
Query: 94 AVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
++D ++++ G+ ++ G S+ G
Sbjct: 700 KIEDIKAGLKVKHDKFGIGTIVGVSKSG 727
>gi|299469765|emb|CBN76619.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 322
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
Query: 75 PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEM-TGSMRLICE 133
P PS + + + QA+ A E A +DG L+E+EFP D + ++ + ++RL
Sbjct: 21 PAPSTFEQCIRQAQGAVEDAFEDGFNLVEVEFPPLQQDYLEDSGSSAYDVSSANVRLASR 80
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFA 160
F F K I P+ E+ A
Sbjct: 81 FAQSFAAEGK--EVSILLPDEAELDQA 105
>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
Length = 243
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 34/149 (22%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P E + QA++A + A+ DG+ +++E L+ +P
Sbjct: 4 LPKTLGEAVSQARIATQNAIADGLNRLQVEILLPELNPMP------------------VA 45
Query: 136 DLFVTPEKVTR---TRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
+ F++ E+ +IFFP+A AR+ E LD TK +V +
Sbjct: 46 ERFLSDEEGISHNFNKIFFPDAGAAALARRDWGEVPFELLDISTK-----------RVSV 94
Query: 193 ADRVKLEDELFLVAYPYFNVNEMLVVEEL 221
++++ EDE L P E+L +E+L
Sbjct: 95 EEQIQPEDEAILCIAP--TAQEVLQIEKL 121
>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
Length = 244
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 57/133 (42%), Gaps = 26/133 (19%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+ + + Q+++A + A+ DG ++++F L +P + +F
Sbjct: 4 LPNSLEQAIAQSRIATQAALADGYTRLQVDFLFPELKLMP--------------VAEQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R +IFFP+A A + + G FK+ D G + +
Sbjct: 50 SLFT--EYDSRLKIFFPDAGGAALANRD-WAGTPFKI---------LDIGTGRVASIQSK 97
Query: 196 VKLEDELFLVAYP 208
++ EDE+FL P
Sbjct: 98 IQPEDEIFLFIAP 110
>gi|218189920|gb|EEC72347.1| hypothetical protein OsI_05588 [Oryza sativa Indica Group]
Length = 377
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 62/254 (24%), Positives = 107/254 (42%), Gaps = 42/254 (16%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSM 128
V V P Y L+ A + A+ +G +EIEFP + + S G S+ I+ +
Sbjct: 65 GVSVYKPRSYDVLVSDAARSLACAMDEGKTRLEIEFPPLPSNISSYKGSSDEFIDANIQL 124
Query: 129 RLICEFCDLFVTPEKVTRTRIFFPEANEVKFARK-------SVFEGASFKLDYL-TKP-- 178
L + K TR+ I FP+ E + A + S+ LD + T P
Sbjct: 125 ALAVA---RKLKELKGTRSCIVFPDLPEKRRASQLFGTALDSIETATISSLDEVSTGPVN 181
Query: 179 SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
+FF DF F + V+ DR K ++ L + + ++ +E+ ++ F ++
Sbjct: 182 TFFRSMRDTLDFDFADDVE--DRWKSDEPPSLYIFINCSTRDLSTIEKYVEQ--FASSVP 237
Query: 233 LIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGR 279
++FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 238 ALLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNY--- 294
Query: 280 NGGTLFRFLEGPQE 293
G +FR GP +
Sbjct: 295 -SGAVFRQYPGPWQ 307
>gi|115443809|ref|NP_001045684.1| Os02g0117100 [Oryza sativa Japonica Group]
gi|41052833|dbj|BAD07724.1| unknown protein [Oryza sativa Japonica Group]
gi|113535215|dbj|BAF07598.1| Os02g0117100 [Oryza sativa Japonica Group]
gi|125580571|gb|EAZ21502.1| hypothetical protein OsJ_05126 [Oryza sativa Japonica Group]
Length = 377
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 62/254 (24%), Positives = 107/254 (42%), Gaps = 42/254 (16%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSM 128
V V P Y L+ A + A+ +G +EIEFP + + S G S+ I+ +
Sbjct: 65 GVSVYKPRSYDVLVSDAARSLACAMDEGKTRLEIEFPPLPSNISSYKGSSDEFIDANIQL 124
Query: 129 RLICEFCDLFVTPEKVTRTRIFFPEANEVKFARK-------SVFEGASFKLDYL-TKP-- 178
L + K TR+ I FP+ E + A + S+ LD + T P
Sbjct: 125 ALAVA---RKLKELKGTRSCIVFPDLPEKRRASQLFGTALDSIETATISSLDEVSTGPVN 181
Query: 179 SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
+FF DF F + V+ DR K ++ L + + ++ +E+ ++ F ++
Sbjct: 182 TFFRSMRDTLDFDFADDVE--DRWKSDEPPSLYIFINCSTRDLSTIEKYVEQ--FASSVP 237
Query: 233 LIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGR 279
++FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 238 ALLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNYS-- 295
Query: 280 NGGTLFRFLEGPQE 293
G +FR GP +
Sbjct: 296 --GAVFRQYPGPWQ 307
>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
Length = 325
Score = 37.0 bits (84), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 87/196 (44%), Gaps = 35/196 (17%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE-MTGSMRLIC 132
P P + ++QA ++A+ A++DG KL+EIEFP ++ ++ G + ++
Sbjct: 16 TPPPKSFRMCVEQAYLSAKQAIEDGHKLIEIEFPPLPQSAMDNEAIGADTILKAQIQHST 75
Query: 133 EFCDLF-------VTPEKVTRTRIFFPEAN--------EVKF-ARKSVFEGASFKLDYLT 176
+F LF V + V R R E + ++F A K F+G+ + ++
Sbjct: 76 DFAKLFKNKKTAIVFADIVERNRFIDDETSSNPQSWRGNIRFTALKGGFKGSLIERVWIN 135
Query: 177 KPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIF 236
K DF V+ +D++F++ + E+ V EL K A +I+F
Sbjct: 136 K-----DF--------VSEVQEDDDMFIIIGA--SAQELPDVRELCKAAGDRP---VILF 177
Query: 237 NGELDRIRSGYYPSFF 252
N +L +R + FF
Sbjct: 178 NLKLQVLRGDFGLPFF 193
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,695,826,378
Number of Sequences: 23463169
Number of extensions: 193834205
Number of successful extensions: 413903
Number of sequences better than 100.0: 85
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 43
Number of HSP's that attempted gapping in prelim test: 413800
Number of HSP's gapped (non-prelim): 91
length of query: 295
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 154
effective length of database: 9,050,888,538
effective search space: 1393836834852
effective search space used: 1393836834852
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)