BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020482
(325 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225427403|ref|XP_002263777.1| PREDICTED: uncharacterized protein LOC100265501 [Vitis vinifera]
gi|296088391|emb|CBI37382.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 237/324 (73%), Positives = 264/324 (81%), Gaps = 9/324 (2%)
Query: 9 IALPTVPI------PVPSIPRRQA-NCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNS 61
+ L T+PI P+PS+ + +CR F +K + +F G +I F R NS
Sbjct: 1 MTLSTIPIASRISIPIPSLQNPKVLSCRSFQVK-KDGSFCGPKIAAFK-MSRNLEFKANS 58
Query: 62 VSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
VS D + S+ +VPFPSDYSE+L+QAK A ELA+KD +LMEIEFPTAGL+SVPGD EGG
Sbjct: 59 VSGDSSASVGFNVPFPSDYSEILEQAKEATELALKDKKQLMEIEFPTAGLESVPGDGEGG 118
Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
IEMTGSM+LI EFCD+F+ PEK TRTRIFFPEANEVKFAR+S F GASFKLDYLTKPS F
Sbjct: 119 IEMTGSMQLIREFCDIFINPEKATRTRIFFPEANEVKFARQSAFGGASFKLDYLTKPSLF 178
Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
EDFGF KVKMADRVK EDELFLVAYPYFNVNEMLVVEELY EAV NTA KLIIFNGELD
Sbjct: 179 EDFGFVTKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYNEAVVNTARKLIIFNGELD 238
Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNK 301
RIRSGYYP FFYPKLAAL+K+L P MET+YYIHNFKGR GGTLFRCYPG WKVL+KV N+
Sbjct: 239 RIRSGYYPPFFYPKLAALTKSLLPKMETVYYIHNFKGRKGGTLFRCYPGPWKVLRKVRNE 298
Query: 302 YICLHQQEVMPSLKEVALDILPSA 325
YICLHQQEVMPSLKEVALDILPSA
Sbjct: 299 YICLHQQEVMPSLKEVALDILPSA 322
>gi|356496430|ref|XP_003517071.1| PREDICTED: uncharacterized protein LOC100805878 [Glycine max]
Length = 324
Score = 459 bits (1181), Expect = e-127, Method: Compositional matrix adjust.
Identities = 229/323 (70%), Positives = 259/323 (80%), Gaps = 5/323 (1%)
Query: 6 NSAIALPTVPIPVPSIPRRQ-ANCRKFSIKCSNE-NFSGQRIITFSPYRRKHSCLTNSVS 63
+S + L PI PS+P A FS+K G + +P RK + T SVS
Sbjct: 4 SSTMILSNSPIASPSLPTSTGAKLETFSLKNDGVIRIRGATASSVAPRIRKTA--TCSVS 61
Query: 64 SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
DGN S+ DVPFP+DYSELL+QA++AA+LA+KD +LMEIEFPTAGL SVPGD EGGIE
Sbjct: 62 KDGNASVETDVPFPADYSELLEQARVAADLAIKDNRQLMEIEFPTAGLGSVPGDGEGGIE 121
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MT SM+LI EFCD F++ EK TRTRIFFPEA+EV FAR+SVF G SFKLDYLTKPSFFED
Sbjct: 122 MTESMQLIREFCDRFISSEKATRTRIFFPEASEVDFARQSVFSGCSFKLDYLTKPSFFED 181
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGF EK+KM+DRVK DELFLV YPYFNVNE+LVVEELYKEAV NT KLIIFNGELDRI
Sbjct: 182 FGFVEKIKMSDRVKTGDELFLVGYPYFNVNEILVVEELYKEAVLNTERKLIIFNGELDRI 241
Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN-KY 302
RSGYYPSFFYPKLAAL+KT P+MET+YYIHNFKGRNGGTLFRCYPG WKVL++V N KY
Sbjct: 242 RSGYYPSFFYPKLAALTKTFLPMMETVYYIHNFKGRNGGTLFRCYPGPWKVLRRVGNRKY 301
Query: 303 ICLHQQEVMPSLKEVALDILPSA 325
+CLHQQ MPSLKEVAL+ILPSA
Sbjct: 302 VCLHQQNSMPSLKEVALEILPSA 324
>gi|224071439|ref|XP_002303460.1| predicted protein [Populus trichocarpa]
gi|222840892|gb|EEE78439.1| predicted protein [Populus trichocarpa]
Length = 260
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/259 (83%), Positives = 232/259 (89%)
Query: 67 NNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTG 126
++S+ DVPFP DY ELLDQAK A ELA +D +LMEIEFPTAGL+SVPGD EGGIEMTG
Sbjct: 2 SSSVEFDVPFPRDYEELLDQAKKATELAWEDNKQLMEIEFPTAGLESVPGDGEGGIEMTG 61
Query: 127 SMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGF 186
SM+LI EFCD FV+PEK TRTRIFFPEANEVKFAR+S FEG+S KLDYLTKPSFFEDFGF
Sbjct: 62 SMQLIREFCDRFVSPEKTTRTRIFFPEANEVKFARQSAFEGSSLKLDYLTKPSFFEDFGF 121
Query: 187 TEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
EKVKM DRVK EDELFLVAYPYFNVNEMLVVEELYKEAV TA KLIIFNGELDRIRSG
Sbjct: 122 VEKVKMTDRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVETARKLIIFNGELDRIRSG 181
Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLH 306
YYPSFFYPKLA+L KTLFP+MET+YYIHNFKGRNGGTLFRCYPG W+VL+KV N YICLH
Sbjct: 182 YYPSFFYPKLASLLKTLFPLMETVYYIHNFKGRNGGTLFRCYPGPWQVLRKVRNAYICLH 241
Query: 307 QQEVMPSLKEVALDILPSA 325
QQE MPSLKEVALDILPS
Sbjct: 242 QQEAMPSLKEVALDILPSV 260
>gi|255557645|ref|XP_002519852.1| conserved hypothetical protein [Ricinus communis]
gi|223540898|gb|EEF42456.1| conserved hypothetical protein [Ricinus communis]
Length = 316
Score = 439 bits (1129), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/265 (79%), Positives = 231/265 (87%), Gaps = 1/265 (0%)
Query: 62 VSSDGNNS-INVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
VS +G++S + DVP P DY ELL QAK A +LA+KDG +LMEIEFPTAGL+SVPGD EG
Sbjct: 52 VSRNGSSSSVESDVPLPRDYEELLVQAKKATDLALKDGKQLMEIEFPTAGLESVPGDGEG 111
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
GIEMT SM+LI +FCD FV+PEK RTR+FFPEANEVKFAR+S F G+S KLDYLTKPSF
Sbjct: 112 GIEMTESMQLIRQFCDRFVSPEKAARTRVFFPEANEVKFARESAFGGSSLKLDYLTKPSF 171
Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
FEDFGF EK+KM DRVK EDELFLVAYPYFNVNEMLVVEELY EAV NT K+IIFNGEL
Sbjct: 172 FEDFGFVEKIKMTDRVKPEDELFLVAYPYFNVNEMLVVEELYNEAVVNTTRKMIIFNGEL 231
Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN 300
DRIRSGYYPSFFYPKLA+L KTLFPVMET+YYIHNFKGR GGTLFRCYPG WKVL+KV
Sbjct: 232 DRIRSGYYPSFFYPKLASLLKTLFPVMETVYYIHNFKGRKGGTLFRCYPGPWKVLRKVKK 291
Query: 301 KYICLHQQEVMPSLKEVALDILPSA 325
+ ICLHQQE MPSLKEVALDILPSA
Sbjct: 292 ESICLHQQESMPSLKEVALDILPSA 316
>gi|449456759|ref|XP_004146116.1| PREDICTED: uncharacterized protein LOC101209709 [Cucumis sativus]
gi|449509516|ref|XP_004163611.1| PREDICTED: uncharacterized LOC101209709 [Cucumis sativus]
Length = 336
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 222/336 (66%), Positives = 254/336 (75%), Gaps = 20/336 (5%)
Query: 3 LSSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSC---LT 59
+S+S IA +P+P P + C + S + ++ + + F P R L+
Sbjct: 8 FASSSTIATAVLPLPSPKLA-----CFRISHRRTHRSSVSSSMFEFMPRRHLRVLPPNLS 62
Query: 60 NSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
N SS N S ++DVPFP DYS+LL+QAK A E A+ D +LMEIEFPTAGL+SVPGD E
Sbjct: 63 NRQSS--NASTDLDVPFPRDYSDLLNQAKKATEAALIDNKQLMEIEFPTAGLESVPGDGE 120
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRI----------FFPEANEVKFARKSVFEGAS 169
GGIEMT SM+LI +FCD F+ P K TRTR+ FFPEANEVKFAR + FEG S
Sbjct: 121 GGIEMTESMQLIRQFCDCFIDPLKATRTRVTVSIKENHIQFFPEANEVKFARNTAFEGVS 180
Query: 170 FKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNT 229
FKLDYLTKPSFFEDFGF EKVKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT
Sbjct: 181 FKLDYLTKPSFFEDFGFVEKVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVQNT 240
Query: 230 AWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYP 289
KLIIFNGELDRIRSGYYP FFYPKLAAL KTLFP MET+YYIHNFKG+ GG LFR YP
Sbjct: 241 TRKLIIFNGELDRIRSGYYPPFFYPKLAALMKTLFPEMETVYYIHNFKGQKGGVLFRSYP 300
Query: 290 GSWKVLKKVSNKYICLHQQEVMPSLKEVALDILPSA 325
G WKVL+KV NK++C+HQQE MPSLKEVAL+ILPS+
Sbjct: 301 GPWKVLRKVRNKFVCVHQQEEMPSLKEVALNILPSS 336
>gi|109289908|gb|AAP45177.2| hypothetical protein SBB1_14t00013 [Solanum bulbocastanum]
Length = 338
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/293 (73%), Positives = 230/293 (78%), Gaps = 29/293 (9%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQ----------------------------AKMAAE 92
S S D SI DVPFP DY+ELL Q AK A E
Sbjct: 47 SCSGDRAASIGFDVPFPKDYTELLQQVFILFAFSPLKIGGRGSGNGGGITREIKAKEATE 106
Query: 93 LAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFP 152
LA+KD +LMEIEFPTAGL SVPGD EGGIEMTGS++LI EFCDL V PEK T+TRIFFP
Sbjct: 107 LALKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTGSIQLIREFCDLLVIPEKATKTRIFFP 166
Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNV 212
EANEVKFAR+S+F GASFKLDYLTKPSFFEDFGFTEKVKMADRVK EDELF+VAYPYFNV
Sbjct: 167 EANEVKFARQSIFGGASFKLDYLTKPSFFEDFGFTEKVKMADRVKPEDELFIVAYPYFNV 226
Query: 213 NEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY 272
NEMLVVEELY+ AV NT+ KLIIFNGELDRIRS YP FFYPKLAALSKTLFP MET+YY
Sbjct: 227 NEMLVVEELYQAAVLNTSRKLIIFNGELDRIRSD-YPPFFYPKLAALSKTLFPKMETVYY 285
Query: 273 IHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLKEVALDILPSA 325
IHNFKGRNGG LFRCYPG WKV ++V + ICLHQQE MPSLKEVALDILPSA
Sbjct: 286 IHNFKGRNGGVLFRCYPGPWKVFRRVGSTNICLHQQESMPSLKEVALDILPSA 338
>gi|113208412|gb|ABI34553.1| hypothetical protein SBB1_21t00009 [Solanum bulbocastanum]
Length = 338
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/293 (73%), Positives = 230/293 (78%), Gaps = 29/293 (9%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQ----------------------------AKMAAE 92
S S D SI DVPFP DY+ELL Q AK A E
Sbjct: 47 SCSGDRAASIGFDVPFPKDYTELLQQVFILFAFSPLKIGGWGSRNRGGITREIKAKEATE 106
Query: 93 LAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFP 152
LA+KD +LMEIEFPTAGL SVPGD EGGIEMTGS++LI EFCDL V PEK T+TRIFFP
Sbjct: 107 LALKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTGSIQLIREFCDLLVIPEKATKTRIFFP 166
Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNV 212
EANEVKFAR+S+F GASFKLDYLTKPSFFEDFGFTEKVKMADRVK EDELF+VAYPYFNV
Sbjct: 167 EANEVKFARQSIFGGASFKLDYLTKPSFFEDFGFTEKVKMADRVKPEDELFIVAYPYFNV 226
Query: 213 NEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY 272
NEMLVVEELY+ AV NT+ KLIIFNGELDRIRS YP FFYPKLAALSKTLFP MET+YY
Sbjct: 227 NEMLVVEELYQAAVLNTSRKLIIFNGELDRIRSD-YPPFFYPKLAALSKTLFPKMETVYY 285
Query: 273 IHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLKEVALDILPSA 325
IHNFKGRNGG LFRCYPG WKV ++V + ICLHQQE MPSLKEVALDILPSA
Sbjct: 286 IHNFKGRNGGVLFRCYPGPWKVFRRVGSTNICLHQQESMPSLKEVALDILPSA 338
>gi|357146418|ref|XP_003573985.1| PREDICTED: uncharacterized protein LOC100843789 [Brachypodium
distachyon]
Length = 322
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 218/325 (67%), Positives = 241/325 (74%), Gaps = 5/325 (1%)
Query: 1 MPLSSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRK-HSCLT 59
M ++++ +I+ PT S+P +Q F I S+ G + R H C
Sbjct: 1 MAMATSYSISNPTF-TSKSSLPNKQVPNWIFPIISSDNGSGGMFTLARRSLRAGFHVC-- 57
Query: 60 NSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
+V+ D N FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL SVPGD E
Sbjct: 58 -AVTGDQNTRNVFSANFPSDYTELLLQAKDAAESAFKDGKQLLEIEFPTAGLQSVPGDGE 116
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
GGIEMTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS
Sbjct: 117 GGIEMTGSMLLIREFCDRFVPAEKTTRTRIFFPEANEVTFARQSAFEGCSLKLDYLTKPS 176
Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
FEDFGFT KVKMADRV+ EDE+FLVAYPYFNVNEMLVVEELYKEAV NT K+IIFNGE
Sbjct: 177 LFEDFGFTTKVKMADRVQPEDEIFLVAYPYFNVNEMLVVEELYKEAVVNTDRKMIIFNGE 236
Query: 240 LDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVS 299
LDRIRSGYYP FFYPKLA LSKT P MET+YYIHNFKG GG LFRCYPG WKVL+KV
Sbjct: 237 LDRIRSGYYPPFFYPKLAELSKTFLPKMETVYYIHNFKGSKGGALFRCYPGPWKVLRKVG 296
Query: 300 NKYICLHQQEVMPSLKEVALDILPS 324
+ CLH+QE MPSLKEVALDILPS
Sbjct: 297 GSFACLHEQEEMPSLKEVALDILPS 321
>gi|18422955|ref|NP_568702.1| uncharacterized protein [Arabidopsis thaliana]
gi|14326508|gb|AAK60299.1|AF385707_1 AT5g48790/K24G6_12 [Arabidopsis thaliana]
gi|18700216|gb|AAL77718.1| AT5g48790/K24G6_12 [Arabidopsis thaliana]
gi|332008342|gb|AED95725.1| uncharacterized protein [Arabidopsis thaliana]
Length = 316
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 202/266 (75%), Positives = 223/266 (83%), Gaps = 1/266 (0%)
Query: 61 SVSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
SVS NN+ +VD VPFP DY EL++QAK A E+A+KD +LMEIEFPT+GL SVPGD E
Sbjct: 51 SVSGGYNNNTSVDNVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGE 110
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
G EMT S+ +I EFCD + PEK TRIFFPEANEVKFA+K+VF G FKLDYLTKPS
Sbjct: 111 GATEMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPS 170
Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
FEDFGF E+VKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNGE
Sbjct: 171 LFEDFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGE 230
Query: 240 LDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVS 299
LDRIRSGYYP FFYPKLAAL+KTL P MET+YYIHNFKG+ GG LFRCYPG W+VL++
Sbjct: 231 LDRIRSGYYPKFFYPKLAALTKTLLPKMETVYYIHNFKGQKGGVLFRCYPGPWQVLRRTR 290
Query: 300 NKYICLHQQEVMPSLKEVALDILPSA 325
NKYIC+HQQE MPSLKEVALDIL SA
Sbjct: 291 NKYICVHQQESMPSLKEVALDILASA 316
>gi|224034407|gb|ACN36279.1| unknown [Zea mays]
gi|413926746|gb|AFW66678.1| hypothetical protein ZEAMMB73_267474 [Zea mays]
Length = 324
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 213/322 (66%), Positives = 246/322 (76%), Gaps = 4/322 (1%)
Query: 4 SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVS 63
+S ++A P + P + ++ +N +I SN N +G + T + + ++ +V+
Sbjct: 5 TSYGSMANPPITSRTPFLSKQASNWIPATI--SNGNGTGG-MFTVASRKSRNGFQFCAVT 61
Query: 64 SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
D + DV FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL +VPGD EGG E
Sbjct: 62 GDPGSRNVSDVNFPSDYTELLTQAKEAAESAFKDGKQLLEIEFPTAGLQTVPGDGEGGNE 121
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MTGSM LI EFCD FV EK TRTR+FFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 122 MTGSMLLIREFCDRFVPAEKATRTRVFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 181
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGFT KVKMADRVK +DE FLVAYPYFNVNEMLVVEELYKEAV T+ KLIIFNGELDRI
Sbjct: 182 FGFTTKVKMADRVKPQDETFLVAYPYFNVNEMLVVEELYKEAVVGTSRKLIIFNGELDRI 241
Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN-KY 302
RSGYYP+FFYPKLA LSKT P ++T+YYIHNFKG GGTLFRCYP WKVL+K S+ Y
Sbjct: 242 RSGYYPAFFYPKLAELSKTFLPKLDTVYYIHNFKGAKGGTLFRCYPEPWKVLRKASSGNY 301
Query: 303 ICLHQQEVMPSLKEVALDILPS 324
+CLHQQE MPSLKEVALDILPS
Sbjct: 302 VCLHQQEEMPSLKEVALDILPS 323
>gi|326523775|dbj|BAJ93058.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 322
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 211/309 (68%), Positives = 231/309 (74%), Gaps = 6/309 (1%)
Query: 19 PSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSIN--VDVPF 76
PS P +Q +F S + G I S RR N + G+ S F
Sbjct: 18 PSAPHKQVPNWRFPTINSGDG--GGSIFAIS--RRNLRTWFNVCAVTGDQSTRDVFSADF 73
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCD 136
PSDY+EL+ QAK A E A KDG +L+EIEFPTAGL SVPGD EGGIEMTGSM LI EFCD
Sbjct: 74 PSDYTELIVQAKEATESAFKDGKQLLEIEFPTAGLQSVPGDGEGGIEMTGSMLLIREFCD 133
Query: 137 LFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRV 196
FV EKVTRTRIFFPEA EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRV
Sbjct: 134 RFVPAEKVTRTRIFFPEAKEVTFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRV 193
Query: 197 KLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKL 256
+ EDE+FLVAYPYFNVNEMLVVEELYKEAV NT K+IIFNGELDRIRSGYYP FFYPKL
Sbjct: 194 RPEDEIFLVAYPYFNVNEMLVVEELYKEAVLNTERKMIIFNGELDRIRSGYYPPFFYPKL 253
Query: 257 AALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLKE 316
LSKT P +ET+YYIHNFKG GG LFRCYPG WKVL+KV ++CLH+QE MPSLKE
Sbjct: 254 GELSKTFLPKLETVYYIHNFKGSKGGVLFRCYPGPWKVLRKVGGSFVCLHEQEEMPSLKE 313
Query: 317 VALDILPSA 325
VAL+ILPSA
Sbjct: 314 VALNILPSA 322
>gi|413926747|gb|AFW66679.1| hypothetical protein ZEAMMB73_267474 [Zea mays]
Length = 310
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/290 (71%), Positives = 231/290 (79%), Gaps = 2/290 (0%)
Query: 36 SNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAV 95
SN N +G + T + + ++ +V+ D + DV FPSDY+ELL QAK AAE A
Sbjct: 21 SNGNGTGG-MFTVASRKSRNGFQFCAVTGDPGSRNVSDVNFPSDYTELLTQAKEAAESAF 79
Query: 96 KDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEAN 155
KDG +L+EIEFPTAGL +VPGD EGG EMTGSM LI EFCD FV EK TRTR+FFPEAN
Sbjct: 80 KDGKQLLEIEFPTAGLQTVPGDGEGGNEMTGSMLLIREFCDRFVPAEKATRTRVFFPEAN 139
Query: 156 EVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEM 215
EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRVK +DE FLVAYPYFNVNEM
Sbjct: 140 EVSFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRVKPQDETFLVAYPYFNVNEM 199
Query: 216 LVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN 275
LVVEELYKEAV T+ KLIIFNGELDRIRSGYYP+FFYPKLA LSKT P ++T+YYIHN
Sbjct: 200 LVVEELYKEAVVGTSRKLIIFNGELDRIRSGYYPAFFYPKLAELSKTFLPKLDTVYYIHN 259
Query: 276 FKGRNGGTLFRCYPGSWKVLKKVSN-KYICLHQQEVMPSLKEVALDILPS 324
FKG GGTLFRCYP WKVL+K S+ Y+CLHQQE MPSLKEVALDILPS
Sbjct: 260 FKGAKGGTLFRCYPEPWKVLRKASSGNYVCLHQQEEMPSLKEVALDILPS 309
>gi|226494690|ref|NP_001145598.1| uncharacterized protein LOC100279074 [Zea mays]
gi|195658649|gb|ACG48792.1| hypothetical protein [Zea mays]
Length = 310
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 206/290 (71%), Positives = 231/290 (79%), Gaps = 2/290 (0%)
Query: 36 SNENFSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAV 95
SN N +G + T + + ++ +V+ D + DV FPSDY+ELL QAK AAE A
Sbjct: 21 SNGNGTGG-MFTVASRKSRNGFQFCAVTGDPGSRNVSDVNFPSDYTELLTQAKEAAESAF 79
Query: 96 KDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEAN 155
KDG +L+EIEFPTAGL +VPGD EGG EMTGSM LI EFCD FV EK TRTR+FFPEAN
Sbjct: 80 KDGKQLLEIEFPTAGLQTVPGDGEGGNEMTGSMLLIREFCDRFVPAEKATRTRVFFPEAN 139
Query: 156 EVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEM 215
EV FAR+S FEG S KLDYLTKPS FEDFGFT KVKMADRVK +DE FLVAYPYFNVNEM
Sbjct: 140 EVSFARQSAFEGCSLKLDYLTKPSLFEDFGFTTKVKMADRVKPQDETFLVAYPYFNVNEM 199
Query: 216 LVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN 275
LVVEELYKEAV T+ KLIIFNGELDRIRSGYYP+FFYPKLA LS+T P ++T+YYIHN
Sbjct: 200 LVVEELYKEAVVGTSRKLIIFNGELDRIRSGYYPAFFYPKLAELSRTFLPKLDTVYYIHN 259
Query: 276 FKGRNGGTLFRCYPGSWKVLKKVSN-KYICLHQQEVMPSLKEVALDILPS 324
FKG GGTLFRCYP WKVL+K S+ Y+CLHQQE MPSLKEVALDILPS
Sbjct: 260 FKGAKGGTLFRCYPEPWKVLRKASSGSYVCLHQQEEMPSLKEVALDILPS 309
>gi|297795571|ref|XP_002865670.1| hypothetical protein ARALYDRAFT_494942 [Arabidopsis lyrata subsp.
lyrata]
gi|297311505|gb|EFH41929.1| hypothetical protein ARALYDRAFT_494942 [Arabidopsis lyrata subsp.
lyrata]
Length = 315
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/265 (75%), Positives = 221/265 (83%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
SVS NN+ +VPFP DY EL++QAK A ELA+KD +LMEIEFPT+GL SVPGDSEG
Sbjct: 51 SVSGGYNNTSVDNVPFPRDYFELINQAKEAVELAMKDEKQLMEIEFPTSGLASVPGDSEG 110
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
EMT S+ +I EFCD + PEK TRIFFPEANEVKFA+K+VF G FKLDYLTKPS
Sbjct: 111 ATEMTESINMIREFCDRLLAPEKARTTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPSL 170
Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
FEDFGF E+VKM+DRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNGEL
Sbjct: 171 FEDFGFFERVKMSDRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGEL 230
Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN 300
DRIRSGYYP FFYPKLAAL+KTL P M+T+YYIHNFKG+ GG LFRCYPG W+VL++ N
Sbjct: 231 DRIRSGYYPKFFYPKLAALTKTLLPKMDTVYYIHNFKGQKGGVLFRCYPGPWQVLRRTRN 290
Query: 301 KYICLHQQEVMPSLKEVALDILPSA 325
YIC+HQQE MPSLKEVALDIL SA
Sbjct: 291 SYICVHQQESMPSLKEVALDILASA 315
>gi|242063910|ref|XP_002453244.1| hypothetical protein SORBIDRAFT_04g002440 [Sorghum bicolor]
gi|241933075|gb|EES06220.1| hypothetical protein SORBIDRAFT_04g002440 [Sorghum bicolor]
Length = 322
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 210/322 (65%), Positives = 240/322 (74%), Gaps = 6/322 (1%)
Query: 4 SSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVS 63
+S ++ P + P + ++ +N I + N +G + T + ++ +V+
Sbjct: 5 TSCGSMTKPPITFKTPFVNKQASNW----IPATISNGTGG-MFTVASRNSRNGFQVRAVT 59
Query: 64 SDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE 123
D + DV FP+DY++LL QAK AAE A KDG +L+EIEFPTAGL +VPGD EGG E
Sbjct: 60 GDPGSRNASDVKFPTDYTQLLMQAKEAAESAFKDGKQLLEIEFPTAGLQTVPGDGEGGNE 119
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MTGSM LI EFCD FV EK TRTR+FFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 120 MTGSMLLIREFCDRFVPAEKSTRTRVFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 179
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGFT KVKMADRVK EDE FLVAYPYFNVNEMLVVEELY EAV T KLIIFNGELDRI
Sbjct: 180 FGFTTKVKMADRVKPEDETFLVAYPYFNVNEMLVVEELYNEAVVGTNRKLIIFNGELDRI 239
Query: 244 RSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN-KY 302
RSGYYPSFFYPKLA LSKT P ++T+YYIHNFKG GGTLFRCYP WKVL+K S+ Y
Sbjct: 240 RSGYYPSFFYPKLAELSKTFLPKLDTVYYIHNFKGVKGGTLFRCYPEPWKVLRKASSGSY 299
Query: 303 ICLHQQEVMPSLKEVALDILPS 324
ICLHQQE MPSLKEVALDILPS
Sbjct: 300 ICLHQQEEMPSLKEVALDILPS 321
>gi|357484699|ref|XP_003612637.1| hypothetical protein MTR_5g027220 [Medicago truncatula]
gi|355513972|gb|AES95595.1| hypothetical protein MTR_5g027220 [Medicago truncatula]
Length = 365
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/322 (64%), Positives = 236/322 (73%), Gaps = 50/322 (15%)
Query: 50 PYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKL-------- 101
P RK + + S S DGN S+ D+PFP DYSELL+QAK+A EL ++ MK+
Sbjct: 44 PTSRKLARCSVSASGDGNASVQTDIPFPFDYSELLEQAKVAVELQLR--MKVAHSKLRSE 101
Query: 102 ------------MEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRI 149
EIEFPTAGL+SVPGD EGGIEMTGSM+LI EFCDL ++ EK+TRTRI
Sbjct: 102 STIEPIMNNDLKQEIEFPTAGLESVPGDGEGGIEMTGSMQLIREFCDLSISAEKITRTRI 161
Query: 150 ---------------------------FFPEANEVKFARKSVFEGASFKLDYLTKPSFFE 182
FFPEANEV FAR+S F GASFKLDYLTKPSFF+
Sbjct: 162 MVMRKNLLCNTSSSLPVEIDVQPCKNQFFPEANEVDFARQSAFSGASFKLDYLTKPSFFQ 221
Query: 183 DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR 242
DFGF EKVKM+DRVK EDELF+VAYPYFNVNEMLVVEELYKEAV NT KLIIFNGELDR
Sbjct: 222 DFGFVEKVKMSDRVKAEDELFVVAYPYFNVNEMLVVEELYKEAVVNTERKLIIFNGELDR 281
Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKV-SNK 301
IRSGYYP FFYPKLA L+K+ P MET+YYIHNFKGR+ G LFRCYPG WKVL++V S+K
Sbjct: 282 IRSGYYPPFFYPKLAGLTKSFLPSMETVYYIHNFKGRDRGILFRCYPGPWKVLRRVGSSK 341
Query: 302 YICLHQQEVMPSLKEVALDILP 323
++CLHQQ+ MPSLKEVALDILP
Sbjct: 342 FVCLHQQDTMPSLKEVALDILP 363
>gi|125580675|gb|EAZ21606.1| hypothetical protein OsJ_05234 [Oryza sativa Japonica Group]
gi|218189983|gb|EEC72410.1| hypothetical protein OsI_05707 [Oryza sativa Indica Group]
Length = 338
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 212/341 (62%), Positives = 236/341 (69%), Gaps = 24/341 (7%)
Query: 5 SNSAIALPTVPIPVPSIPRRQANCRKF-SIKCSNENFSGQRIITFSPYRRK--HSCLTNS 61
+ S ++ P+ S P +Q +I N++G T R H C N
Sbjct: 2 ATSYCSISNPPLSKTSFPNKQVPGWVLRAISKGKGNYTGGIYTTTKRNLRTGFHVCAVNG 61
Query: 62 VSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEG 120
G + NV FPSDY+ELL QAK AAE A KDG +L+EIEFPTAGL SVPGDSEG
Sbjct: 62 ----GQGTRNVSGAEFPSDYTELLAQAKEAAESAFKDGKQLLEIEFPTAGLQSVPGDSEG 117
Query: 121 GIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSF 180
GIEMTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS
Sbjct: 118 GIEMTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYLTKPSL 177
Query: 181 FEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
FEDFGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T KLIIFNGEL
Sbjct: 178 FEDFGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLIIFNGEL 237
Query: 241 DRIR----------------SGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTL 284
DRIR YP FFYPKLA LSKT P +ET+YYIHNFKG GGTL
Sbjct: 238 DRIRMLVTFLNKREAALMMFENNYPPFFYPKLAELSKTFLPKLETVYYIHNFKGLKGGTL 297
Query: 285 FRCYPGSWKVLKKVSNKYICLHQQEVMPSLKEVALDILPSA 325
FRCYPG WKVL+ + + CLH+QE MPSLKEVALDILPSA
Sbjct: 298 FRCYPGPWKVLRNIGGSFFCLHEQEEMPSLKEVALDILPSA 338
>gi|116793457|gb|ABK26754.1| unknown [Picea sitchensis]
Length = 337
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 182/255 (71%), Positives = 212/255 (83%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
++DV FP DYSELL Q K+A + A+ D L+EIEFPTAGLDSV GD+EGGIEM SM L
Sbjct: 83 DIDVEFPGDYSELLQQVKVATQSALMDSKYLLEIEFPTAGLDSVSGDAEGGIEMNSSMTL 142
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
I EFC F+ PE+ TRTRIFFPEA EV+FA+K+VFEG +FK+DYLTKPS EDFGF KV
Sbjct: 143 IREFCRRFLKPEEATRTRIFFPEAKEVEFAKKTVFEGVAFKMDYLTKPSLLEDFGFGTKV 202
Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
KMA+RV+ DE+FLVAYPYFNV+EMLVVEELYK+AV +T KLIIFNGELDRIRSGYYP
Sbjct: 203 KMAERVQPTDEIFLVAYPYFNVDEMLVVEELYKDAVVHTDRKLIIFNGELDRIRSGYYPP 262
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEV 310
FFYPK+ AL++ P +ET YYIHNFKGR GGTLFR YPG W+VL+KV +K++C+HQQE
Sbjct: 263 FFYPKIGALARNFLPKLETAYYIHNFKGRVGGTLFRSYPGPWQVLRKVGDKHVCIHQQET 322
Query: 311 MPSLKEVALDILPSA 325
MPSLKEVAL ILPSA
Sbjct: 323 MPSLKEVALSILPSA 337
>gi|168020280|ref|XP_001762671.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686079|gb|EDQ72470.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 280
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/274 (58%), Positives = 207/274 (75%), Gaps = 3/274 (1%)
Query: 53 RKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLD 112
R + S S + IN V FP DY+EL++QA+ AA+ A+KD L+E+EFPTAGLD
Sbjct: 9 RSFVVRSRSGSDPKSKIINKSVDFPKDYNELVNQARRAAQAALKDDKTLLEVEFPTAGLD 68
Query: 113 SVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKL 172
+VPGD EGGIEM S+ L+ EFC +F ++ TRIFFP+A +++ A+ S+F+G SFKL
Sbjct: 69 TVPGDEEGGIEMNTSIVLMKEFCTIF--KDEAPTTRIFFPDAKDMELAKTSIFDGTSFKL 126
Query: 173 DYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
DYLTKP+ EDFGF KVKMADRV+ D +F+VAYPYFNVNEM+ VEELYK + +
Sbjct: 127 DYLTKPNGLEDFGFGSKVKMADRVQSSDTVFVVAYPYFNVNEMIAVEELYKGSAAASNRP 186
Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSW 292
+I+FNGELDRIRSGYYPSFFYPKL +++K P ET+YYIHNFKGR+ G LFR YPG W
Sbjct: 187 IIVFNGELDRIRSGYYPSFFYPKLGSIAKEFLPKFETVYYIHNFKGRSRGVLFRMYPGPW 246
Query: 293 KVLKKVSN-KYICLHQQEVMPSLKEVALDILPSA 325
+VL++V + K++ LH+Q MPSLKEVAL+ILP+A
Sbjct: 247 QVLQRVGDHKFVLLHEQASMPSLKEVALNILPTA 280
>gi|302820762|ref|XP_002992047.1| hypothetical protein SELMODRAFT_134592 [Selaginella moellendorffii]
gi|300140169|gb|EFJ06896.1| hypothetical protein SELMODRAFT_134592 [Selaginella moellendorffii]
Length = 303
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 147/262 (56%), Positives = 189/262 (72%), Gaps = 3/262 (1%)
Query: 62 VSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
S DGN VPFPSDY E++ QA+ A + A+ D KL+E+E P AGL++V GD EGG
Sbjct: 36 ASRDGNVG---SVPFPSDYIEMVKQAQDACQAALDDSKKLLEVEVPPAGLNTVSGDEEGG 92
Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
IEM SM ++ +FC T EK RTR+FFPE E+ A+ VF+G+ +KLDYLTKPS +
Sbjct: 93 IEMNISMEIVQKFCAGMFTGEKAPRTRVFFPELAEMNIAKSGVFDGSMYKLDYLTKPSPW 152
Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
+D G +KVKM++R + D F+VAYP+FN NEML VEELY+E+ + +I+ NG+LD
Sbjct: 153 DDIGLGKKVKMSERTRPTDATFVVAYPFFNPNEMLAVEELYRESAKESGCPIIVINGDLD 212
Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNK 301
+IR+GYYP FFYPKL AL+KT P ET+YYIHNFKGR GTLFR YPG W+VL+ V +
Sbjct: 213 KIRNGYYPPFFYPKLGALAKTFLPDFETVYYIHNFKGRFAGTLFRAYPGPWQVLRSVEGE 272
Query: 302 YICLHQQEVMPSLKEVALDILP 323
+C+H QE MPSLK VAL+ILP
Sbjct: 273 MVCIHSQETMPSLKTVALEILP 294
>gi|302761398|ref|XP_002964121.1| hypothetical protein SELMODRAFT_166751 [Selaginella moellendorffii]
gi|300167850|gb|EFJ34454.1| hypothetical protein SELMODRAFT_166751 [Selaginella moellendorffii]
Length = 303
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 149/270 (55%), Positives = 191/270 (70%), Gaps = 3/270 (1%)
Query: 54 KHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDS 113
K S S DGN VPFPSDY E++ QA+ A + A+ D KL+E+E P AGL++
Sbjct: 28 KSSWRILRASRDGNVG---SVPFPSDYIEMVKQAQDACQAALDDSKKLLEVEVPPAGLNT 84
Query: 114 VPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLD 173
V GD EGGIEM SM ++ +FC T EK RTR+FFPE E+ A+ VF+G+ FKLD
Sbjct: 85 VSGDEEGGIEMNISMEIVQKFCAGMFTGEKAPRTRVFFPELAEMNIAKSGVFDGSMFKLD 144
Query: 174 YLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKL 233
YLTKPS ++D G +KVKM++R + D F+VAYP+FN NEML VEELY+++ + +
Sbjct: 145 YLTKPSPWDDIGLGKKVKMSERARPTDATFVVAYPFFNPNEMLAVEELYRDSAKESGCPI 204
Query: 234 IIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWK 293
I+ NG+LD+IR+GYYP FFYPKL AL+KT P ET+YYIHNFKGR GTLFR YPG W+
Sbjct: 205 IVINGDLDKIRNGYYPPFFYPKLGALAKTFLPDFETVYYIHNFKGRFAGTLFRAYPGPWQ 264
Query: 294 VLKKVSNKYICLHQQEVMPSLKEVALDILP 323
VL+ V + +C+H QE MPSLK VAL+ILP
Sbjct: 265 VLRSVEGEMVCIHSQETMPSLKTVALEILP 294
>gi|356522807|ref|XP_003530035.1| PREDICTED: uncharacterized protein LOC100802995 [Glycine max]
Length = 323
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 150/190 (78%), Positives = 167/190 (87%), Gaps = 1/190 (0%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
+ DVPFP+DYSELL+QA++AA+LA+KD +LMEIEFPTAGL SVPGD EGGIEMT SM+L
Sbjct: 23 HTDVPFPADYSELLEQARVAADLAIKDNRQLMEIEFPTAGLGSVPGDGEGGIEMTESMQL 82
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
I EFCD F++ EK TRTRIFFPEA+EV FAR+SVF G SFKLDYLT PSFFEDFGF EK+
Sbjct: 83 IREFCDRFISSEKATRTRIFFPEASEVDFARQSVFSGCSFKLDYLTNPSFFEDFGFVEKI 142
Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
KM DRVK DELFLV+YPYFN NE+LVVEELYKE V NT KLIIFNGELDRIRSGYYPS
Sbjct: 143 KMLDRVKTGDELFLVSYPYFNANEILVVEELYKE-VLNTERKLIIFNGELDRIRSGYYPS 201
Query: 251 FFYPKLAALS 260
FFYPKLAAL+
Sbjct: 202 FFYPKLAALT 211
>gi|9758878|dbj|BAB09432.1| unnamed protein product [Arabidopsis thaliana]
Length = 248
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 140/187 (74%), Positives = 154/187 (82%), Gaps = 1/187 (0%)
Query: 61 SVSSDGNNSINVD-VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSE 119
SVS NN+ +VD VPFP DY EL++QAK A E+A+KD +LMEIEFPT+GL SVPGD E
Sbjct: 51 SVSGGYNNNTSVDNVPFPRDYVELINQAKEAVEMALKDEKQLMEIEFPTSGLASVPGDGE 110
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS 179
G EMT S+ +I EFCD + PEK TRIFFPEANEVKFA+K+VF G FKLDYLTKPS
Sbjct: 111 GATEMTESINMIREFCDRLLAPEKARSTRIFFPEANEVKFAQKTVFGGTYFKLDYLTKPS 170
Query: 180 FFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGE 239
FEDFGF E+VKMADRVK EDELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNGE
Sbjct: 171 LFEDFGFFERVKMADRVKPEDELFLVAYPYFNVNEMLVVEELYKEAVVNTDRKLIIFNGE 230
Query: 240 LDRIRSG 246
LDRIRSG
Sbjct: 231 LDRIRSG 237
>gi|440583726|emb|CCH47228.1| hypothetical protein [Lupinus angustifolius]
Length = 283
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 138/248 (55%), Positives = 155/248 (62%), Gaps = 46/248 (18%)
Query: 42 GQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKL 101
G + P RK LT SD + S +VPFP+DY+ELL+QA++A ELA+KD +L
Sbjct: 27 GGVTASLVPRNRK---LTGCSVSDVSASTETNVPFPTDYTELLEQARVAVELAMKDNRQL 83
Query: 102 MEIEFPTAGLDSVPG-------------------------------------DSEGGIEM 124
MEIEFPTAGL SVPG D EGGIEM
Sbjct: 84 MEIEFPTAGLASVPGSPYFTFLLFNFSFWIEFHCTLVPIYFQTDSYHSMISGDGEGGIEM 143
Query: 125 T---GSMRLICEFCDLFVTPEKVTRTRI---FFPEANEVKFARKSVFEGASFKLDYLTKP 178
T S+ + VT + I FFPEA+EV FAR+SVF GASFKLDYLTKP
Sbjct: 144 TEIKTSVMINTPKILSLVTAPNSSGLLIYVQFFPEASEVDFARQSVFSGASFKLDYLTKP 203
Query: 179 SFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNG 238
SFF+DFGF EKVKM+DRVK DELFLVAYPYFNVNEMLVVEELYKEAV NT KLIIFNG
Sbjct: 204 SFFQDFGFVEKVKMSDRVKAGDELFLVAYPYFNVNEMLVVEELYKEAVLNTERKLIIFNG 263
Query: 239 ELDRIRSG 246
ELDRIRSG
Sbjct: 264 ELDRIRSG 271
>gi|115443993|ref|NP_001045776.1| Os02g0129300 [Oryza sativa Japonica Group]
gi|113535307|dbj|BAF07690.1| Os02g0129300, partial [Oryza sativa Japonica Group]
Length = 161
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 113/144 (78%), Positives = 121/144 (84%)
Query: 116 GDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYL 175
GDSEGGIEMTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYL
Sbjct: 7 GDSEGGIEMTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYL 66
Query: 176 TKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLII 235
TKPS FEDFGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T KLII
Sbjct: 67 TKPSLFEDFGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLII 126
Query: 236 FNGELDRIRSGYYPSFFYPKLAAL 259
FNGELDRIRSG +F + AAL
Sbjct: 127 FNGELDRIRSGLLVTFLNKREAAL 150
>gi|215686777|dbj|BAG89627.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 147
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/136 (77%), Positives = 113/136 (83%)
Query: 124 MTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFED 183
MTGSM LI EFCD FV EK TRTRIFFPEANEV FAR+S FEG S KLDYLTKPS FED
Sbjct: 1 MTGSMLLIREFCDRFVPAEKATRTRIFFPEANEVSFARQSAFEGCSLKLDYLTKPSLFED 60
Query: 184 FGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRI 243
FGFT KVKM+DRV+ EDE+FLVAYPYFNVNEMLVVEELYKEA+ +T KLIIFNGELDRI
Sbjct: 61 FGFTTKVKMSDRVRPEDEIFLVAYPYFNVNEMLVVEELYKEAIVSTDRKLIIFNGELDRI 120
Query: 244 RSGYYPSFFYPKLAAL 259
RSG +F + AAL
Sbjct: 121 RSGLLVTFLNKREAAL 136
>gi|307111351|gb|EFN59585.1| hypothetical protein CHLNCDRAFT_56449 [Chlorella variabilis]
Length = 336
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 112/269 (41%), Positives = 162/269 (60%), Gaps = 22/269 (8%)
Query: 75 PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEF 134
PFP DY++ + QA+ AA A+ DG L+E+EFPTA L +V GD+EG EMT S++ + +F
Sbjct: 60 PFPGDYNQAVRQAQGAAAAALADGASLVEVEFPTASLVAVAGDAEGANEMTYSLQHLRQF 119
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKS--------------VFEGASFKLDYLTKPSF 180
+ ++ TRIFFP+ E+K A K VFEG +FK YL KP+
Sbjct: 120 MRGWK--DQAGTTRIFFPDPTELKVALKGKAMDPNAGSWTIDPVFEGTAFKFGYLMKPNP 177
Query: 181 FEDFGFT-EKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK-----LI 234
F D G T K+ AD++ ++L ++AYP+FN EML V L++ + +I
Sbjct: 178 FLDMGITVGKINAADQLDGREQLLVMAYPHFNPQEMLEVAALHEYLAAQAGGREGATPII 237
Query: 235 IFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV 294
FN ELDRIR+GYYP FFYP + ++K+L P T YYI NFKG GG +FRCYP +++
Sbjct: 238 TFNAELDRIRTGYYPPFFYPAIGKIAKSLLPQFTTAYYIKNFKGATGGCIFRCYPSPFQI 297
Query: 295 LKKVSNKYICLHQQEVMPSLKEVALDILP 323
+ + + ++E MP+ +EVAL++LP
Sbjct: 298 WSRTRAGFSLVEEREEMPNQREVALEVLP 326
>gi|255080176|ref|XP_002503668.1| predicted protein [Micromonas sp. RCC299]
gi|226518935|gb|ACO64926.1| predicted protein [Micromonas sp. RCC299]
Length = 369
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 102/301 (33%), Positives = 157/301 (52%), Gaps = 29/301 (9%)
Query: 50 PYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTA 109
P RR T+S SS + PFP DY++++ Q + A + + DG+ LMEI+FP
Sbjct: 68 PARRGADEGTSSKSS--LKAGGALTPFPKDYAQMVSQCQKALQHGLDDGLGLMEIQFPPG 125
Query: 110 GLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR-------- 161
GL++ PGD EG +E +++ + C F + TR+FFP+ E K AR
Sbjct: 126 GLETAPGDVEGNMESNLTVQHLRGICAQFERNKTAKTTRVFFPDPIEAKLARTGTNASPD 185
Query: 162 ----------KSVFEGASF--KLDYLTKPSFFEDFG----FTEKVKMADRVKLEDELFLV 205
++ F ++ +D+L PSF G ++V ++ K D F+V
Sbjct: 186 GVRAPSNSETRAWFAPNNWPGPVDFLESPSFLSVSGLDKVLNKRVSTWNKAKANDTAFVV 245
Query: 206 AYPYFNVNEMLVVEELYKEAV--FNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTL 263
AYP NV+E+ ELY+ + A +++ NGEL+R R+ YYP F+ A +
Sbjct: 246 AYPVSNVSELTCTRELYEGELGRGTGARPIVVCNGELERTRTNYYPPFWNAGEMAPLREF 305
Query: 264 FPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYI-CLHQQEVMPSLKEVALDIL 322
V E IY+IHNFKG N LFRCYPG W+V+++ + + + E P +++VAL+IL
Sbjct: 306 VKVFEQIYFIHNFKGSNPAVLFRCYPGPWQVMRRRRDDSLEVVWTGEEYPGVQKVALEIL 365
Query: 323 P 323
P
Sbjct: 366 P 366
>gi|424513544|emb|CCO66166.1| predicted protein [Bathycoccus prasinos]
Length = 423
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 156/323 (48%), Gaps = 44/323 (13%)
Query: 40 FSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGM 99
F G+++ P R ++S + N PFP+DY ++ QA+ A + A +DG+
Sbjct: 103 FGGKKVEVVLPPSRALGSGRTTISKNTNGGRQY--PFPADYDVMVQQARQALQKAREDGV 160
Query: 100 KLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKF 159
L EI+FP GLD PGD EG +E T + ++ + EK+T + FP+ E+K
Sbjct: 161 DLGEIQFPPGGLDLAPGDLEGNVECTLTATVLRKILRGMKEEEKIT---VLFPDPTELKL 217
Query: 160 ARKS--------------------VFEGASFKLDYLTKPSFFEDFG----FTEKVKMADR 195
A++ +FE +L+YL P+ F G F + + DR
Sbjct: 218 AKRGQTGMCAPDGVAPPEVFQTDPLFEDWRGELNYLDDPNAFSVSGLDKIFGKSATVNDR 277
Query: 196 VKL-EDELFLVAYPYFNVNEMLVVEELYKE-----------AVFNTAWK-LIIFNGELDR 242
V + E +F+ AYP N+ E+ LY+ + T K L++ NGELDR
Sbjct: 278 VDINEGNMFVCAYPSGNIAELTQTRLLYENIREENESDAPASKIKTKRKSLVVVNGELDR 337
Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKY 302
RS YYP F+ + E IY+IHNFKG N LFRCYP W+V ++ N
Sbjct: 338 TRSNYYPWFWNKNEMEPLREFSQSFEGIYFIHNFKGTNPAVLFRCYPDDWRVFRRRPNDA 397
Query: 303 I-CLHQQEVMP-SLKEVALDILP 323
+ C+ P SLKE+ALD+LP
Sbjct: 398 VECVWSSSTRPKSLKEIALDVLP 420
>gi|145344528|ref|XP_001416783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577009|gb|ABO95076.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 277
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 139/277 (50%), Gaps = 28/277 (10%)
Query: 75 PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEF 134
PFP DY+EL QA+ + + KDG++L+E++FP GL+ GD EG +E + +
Sbjct: 1 PFPRDYAELERQARESVKRCAKDGVELVELQFPPGGLELASGDLEGNVECNLTTERLRGI 60
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFA------------------RKSVFEGASFKLDYLT 176
CD FV + TR+ FP+ E++ A + F LDY+
Sbjct: 61 CDAFVANGTASTTRVLFPDPTEMRLATTGANAAPDGIRAPEQSDTRGWFADWKGTLDYVD 120
Query: 177 KPSFFEDFGFTE----KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
PSF GF + K +++R++ ++ ++VAYP N++E+ +LY+ V T
Sbjct: 121 DPSFMSVSGFDKIFGGKKNISERMRGDETAYVVAYPSANISELANTRDLYEGCVRGTGKS 180
Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSW 292
L++ NGEL+R RS YYP F+ ++ E Y I+NFKG N LFR YP W
Sbjct: 181 LVVCNGELERTRSNYYPPFWNAGEMGPLRSFCRKFEGAYVIYNFKGSNPAVLFRVYPEPW 240
Query: 293 KVLKKVSNKYICLHQQEVM------PSLKEVALDILP 323
+V ++ L E + S+K VALD+LP
Sbjct: 241 QVFRRRKGGRDGLETLERVCTFEKFESVKSVALDVLP 277
>gi|303272213|ref|XP_003055468.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463442|gb|EEH60720.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 252
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 92/252 (36%), Positives = 137/252 (54%), Gaps = 14/252 (5%)
Query: 86 QAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVT 145
QA+ + + A+ DG++L+EI+FP+ GLD+ PGD EG +E ++ + C F
Sbjct: 1 QAQASLQAALDDGVELLEIQFPSGGLDTAPGDVEGNVENNLTVAHLRGICSQFERNGTAK 60
Query: 146 RTRIFFPEANEVKFARKSVFEG----ASF--KLDYLTKPSFFEDFGFTE----KVKMADR 195
TR+FFP+ E A ASF +DYL +P F G + + +A R
Sbjct: 61 TTRVFFPDPIERSLALTGAAPSPDGFASFPGPIDYLEQPDFLSVSGLDKMLGTRKTVAMR 120
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKE--AVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
V D F+VAYP NV+E++ EL + A A +++ NGEL+R RS YYPSF+
Sbjct: 121 VPESDTAFVVAYPCTNVSELVCTRELREGELARAGPARPIVMCNGELERTRSEYYPSFWN 180
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKV--SNKYICLHQQEVM 311
+ E +Y++HN+KG N LFR YPG W+VL++ ++ Y ++ E
Sbjct: 181 VGEMKPLRGFAREFEGVYFVHNYKGSNPAVLFRAYPGPWQVLRRRRDTDTYDIVYTCEEY 240
Query: 312 PSLKEVALDILP 323
P L++VALDILP
Sbjct: 241 PGLQKVALDILP 252
>gi|308802235|ref|XP_003078431.1| unnamed protein product [Ostreococcus tauri]
gi|116056883|emb|CAL53172.1| unnamed protein product [Ostreococcus tauri]
Length = 267
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 136/265 (51%), Gaps = 23/265 (8%)
Query: 83 LLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPE 142
++ Q K A + A+ DG +L+E++FP GL+ GD EG +E + + CD F
Sbjct: 1 MVRQCKEAMKRAIVDGTELIELQFPPGGLELASGDLEGNVECNLTTERLRGICDGFRELG 60
Query: 143 KVTRTRIFFPEANEVKFA------------------RKSVFEGASFKLDYLTKPSFFEDF 184
+TR+ FP+ E + A +++F ++DYL PSF
Sbjct: 61 MAEKTRVLFPDPTETRLALTGSSPTPDGIRAPEQSETRAMFGDWVGRVDYLDDPSFMSVS 120
Query: 185 GFTE----KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL 240
G + K +A+R+ +D F+VAYP N++E+ +LY++AV + L++ NGE+
Sbjct: 121 GLDKILGTKKSIAERMGADDAAFVVAYPSANISELANTRDLYEDAVRGSGRPLVVCNGEM 180
Query: 241 DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN 300
+R RS YYP F+ + E +Y I+NFKG N LFR YP W+VL++ +
Sbjct: 181 ERTRSNYYPPFWNAGEMGPLREFARKFEGVYVIYNFKGSNPAVLFRVYPEPWQVLRRRRD 240
Query: 301 KYI-CLHQQEVMPSLKEVALDILPS 324
+ + E S+K VAL+ILP+
Sbjct: 241 GSLERVATFEKFESVKAVALEILPT 265
>gi|159481297|ref|XP_001698718.1| hypothetical protein CHLREDRAFT_205904 [Chlamydomonas reinhardtii]
gi|158273612|gb|EDO99400.1| predicted protein [Chlamydomonas reinhardtii]
Length = 364
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 81/219 (36%), Positives = 117/219 (53%), Gaps = 25/219 (11%)
Query: 65 DGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEM 124
+ ++ PFP+ Y + QA+ A + A+ DG KL+E+EFP+ L SV GD EG EM
Sbjct: 41 EAATAVQTPAPFPTSYVMAMRQAQEAVKAALADGAKLVEVEFPSTTLSSVSGDGEGQNEM 100
Query: 125 TGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKS--------------VFEGASF 170
SM + F F + + TR+FFP+ E+ AR F +F
Sbjct: 101 NASMGYLRTFLGGFRS--RAASTRVFFPDNVELAVARSGQTEDPSAGRKALDPQFADVTF 158
Query: 171 KLDYLTKP-SFFEDFGFTEK----VKMADRVKLEDELFLVAYPYFNVNEML-VVEELYKE 224
+L YLT+ + + FGF + VK+ VK D+L +VAYP FN E L V ELY++
Sbjct: 159 QLGYLTEQNAAWAMFGFYKSAFDPVKL---VKDTDDLLVVAYPSFNPREELSAVYELYQQ 215
Query: 225 AVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTL 263
++IFNGELDR+R GYYPS F+P++A + +
Sbjct: 216 KAKARGMPIVIFNGELDRVRGGYYPSVFFPEIAVRQRAV 254
>gi|255087178|ref|XP_002505512.1| predicted protein [Micromonas sp. RCC299]
gi|226520782|gb|ACO66770.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 124/276 (44%), Gaps = 39/276 (14%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPG----DSEGGIEMTGSMRLI 131
P D S+LL + + + A+ DG L+++E P D V G DS E M ++
Sbjct: 64 LPEDESDLLARIHTSIQAALSDGKVLLDVEVPVQYFDGVVGVGGQDSIAISEFNACMSVL 123
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVF---------EGASFK-----LDYLTK 177
+ LF + R+FFP+A E A K + A+F +DYL +
Sbjct: 124 RKIVRLFEWLGQAESVRVFFPDAAECSIALKGAGLNPVSGQWEQAATFHDWPGAVDYLLR 183
Query: 178 PSFFED-----FGFTE-------KVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEA 225
F +G+ + K + ++ D L++V YPY N EM V L++E
Sbjct: 184 DDFVSQTSRKAYGYADLPDFLAGKRDVEQTAEVADRLYVVGYPYDNTGEMEQVMRLWEE- 242
Query: 226 VFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNF-KGRNGGTL 284
A +++FNG LD +R+ + P + K L P T +Y+H F G G L
Sbjct: 243 ---HARPILVFNGNLDGVRTSFAP---FGKAKKLKHEFVPKFTTAFYVHKFAAGAAPGLL 296
Query: 285 FRCYPGSWKVLKKVSNKYI-CLHQQEVMPSLKEVAL 319
+R YP W+V + V + C+ + + P L++VA+
Sbjct: 297 YRQYPSPWRVYRAVKGGGMECVAEYDERPELRDVAM 332
>gi|302852030|ref|XP_002957537.1| hypothetical protein VOLCADRAFT_107711 [Volvox carteri f.
nagariensis]
gi|300257179|gb|EFJ41431.1| hypothetical protein VOLCADRAFT_107711 [Volvox carteri f.
nagariensis]
Length = 271
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 123/287 (42%), Gaps = 93/287 (32%)
Query: 70 INVDVPFPSDYSELLDQ-------------AKMAAELAVKDGMKLMEIEFPTAGLDSVPG 116
+ PFP Y + + Q A+ A + A+ DG L+E+EFP+ L SV G
Sbjct: 43 LQAPAPFPVSYDQAMRQLLPRFPAPLFQHSAQEAVKAALADGAPLVEVEFPSTTLSSVSG 102
Query: 117 DSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFA-----------RKSV- 164
D EG EM SM + +F F + + TR+FFP+ E+ A RKS+
Sbjct: 103 DGEGQNEMNASMGFLRQFLGAFRS--RAASTRVFFPDNVELAVARSGQTEDPAAGRKSLD 160
Query: 165 --FEGASFKLDYLTKP-SFFEDFGFT----EKVKMADRVKLEDELFLVAYPYFNVNEMLV 217
F A F+L YLT+ + + FGF + VK+ VK D++ ++AYP FN
Sbjct: 161 PKFGDAVFQLGYLTQQNAAWAVFGFYKSGFDPVKL---VKDTDDMLVIAYPSFNP----- 212
Query: 218 VEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFK 277
GELDR+R GYYP+ F+P++A L
Sbjct: 213 -------------------RGELDRVRGGYYPALFFPEIAKL------------------ 235
Query: 278 GRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLKEVALDILPS 324
L R N L + MPSLKEVAL+ILP+
Sbjct: 236 ------LLRN--------PLDENDVRVLWTSDKMPSLKEVALEILPA 268
>gi|428164159|gb|EKX33196.1| hypothetical protein GUITHDRAFT_156132 [Guillardia theta CCMP2712]
Length = 215
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 85/156 (54%), Gaps = 19/156 (12%)
Query: 149 IFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYP 208
+ FP+ +E + A + F L L+KP + DRV +LV +P
Sbjct: 1 MVFPDPSEARIAFEEYGSQVPFSLSSLSKPK-----------QQEDRVNK----YLVMHP 45
Query: 209 YFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG----YYPSFFYPKLAALSKTLF 264
F+V E + ++ELY V +IIFNG+L ++RSG YYP FF+PKLA + +
Sbjct: 46 VFDVREYIQMDELYMSEVAPKDAAMIIFNGDLFKMRSGGIGGYYPDFFFPKLAQVRRRFM 105
Query: 265 PVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN 300
P++ET YY+ F+G G L+R YPG W++L+ V+
Sbjct: 106 PMVETAYYLRVFRGPPVGALYREYPGPWQILRAVNG 141
>gi|452824537|gb|EME31539.1| hypothetical protein Gasu_12130 [Galdieria sulphuraria]
Length = 273
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 103/251 (41%), Gaps = 39/251 (15%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP------TAGLDSVPGDSEGGIEMTGS 127
+ P +L+ + + + A+ DG+KL+E++FP +A L+ V M +
Sbjct: 46 IRLPESNVQLVQDIQESCKSAICDGLKLLEVQFPPLKNIGSAALNQV---------MDAN 96
Query: 128 MRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFT 187
F T + FP+ E K AR+ D+ T S F
Sbjct: 97 RTFAKSVVQRFPHVSGNGTTFVVFPDDAESKLARED--------RDFRTLDSVF------ 142
Query: 188 EKVKMADRVKLED-ELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
+ + L+D L ++ P F V E VE V +I+FN +LD++R G
Sbjct: 143 -ITSLQRDIDLQDASLVVILNPGFQVQEWFEVERFCNYQV-----PVILFNADLDKLRGG 196
Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLH 306
YYP F YPKL A E +YY+ F G L R YP W+++ + C+
Sbjct: 197 YYPRFLYPKLYATKDKCLTKFEPVYYVRFFV---NGALIRRYPNPWQIVYEEEGCLYCIL 253
Query: 307 QQEVMPSLKEV 317
++ P + V
Sbjct: 254 ERNERPDFQTV 264
>gi|449018586|dbj|BAM81988.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 247
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 96/223 (43%), Gaps = 25/223 (11%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMK---LMEIEFPTAGLDSVPGDSEGGIEMTGSMR-LI 131
P D + L Q + A A + + L E+ FP A D+ S T R +I
Sbjct: 10 LPKDTASLHRQVQNALSKATETKTRSPALYEVSFP-AVRDTTAALSRILDANTSHAREII 68
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
F F R + FP+ E K A K G+S L+ +E F ++V+
Sbjct: 69 KPFAASFRK-----RLHLVFPDVAEAKIAEKVY--GSSEHTFTLSALPLYERPAFLQQVE 121
Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSF 251
L V P FN++E L +E + A+ +++ NG +DR+RS YYP
Sbjct: 122 AP-------ALVFVVQPGFNIDEWLQLE---RPALLYPDASIVVLNGNMDRLRSNYYPPL 171
Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV 294
FYP+L AL K E IYY+ K G LFR +P W+
Sbjct: 172 FYPRLTALRKRYLEQFEPIYYL---KPLPNGLLFRVFPEPWQT 211
>gi|298715350|emb|CBJ27978.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 314
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/294 (25%), Positives = 128/294 (43%), Gaps = 51/294 (17%)
Query: 5 SNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRIITFSPYRRKHSCLTNSVSS 64
+++A+A +P P+P PR + + + R P R++ CLT
Sbjct: 25 ASTALAFVALPSPLPRSPR-------YHQRLYDAAAPRPRREKPRPQRQQVQCLTK---- 73
Query: 65 DGNNSINVDVPFPSD-YSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGG 121
+P D Y+ + Q A + A+ G+KL+E+EFP LD G++
Sbjct: 74 ---------IPSGKDPYAAVKKQTAEATQDAINAGIKLIELEFPPVRGKLDISLGET--- 121
Query: 122 IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFF 181
+ + E F + + FP+ E + A ++ + G +F++
Sbjct: 122 --LDANRSFARELARSF-SARMGKALWLVFPDDAEAELA-QNTYGGTTFRV--------- 168
Query: 182 EDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD 241
G +K D E ++ +V P F+VNE +V++ L + V +++ NG LD
Sbjct: 169 --VGINSAIK--DLKDEECQMQIVVNPGFDVNEWIVLDSLVRPDV-----PMVMLNGNLD 219
Query: 242 RIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL 295
++R GYYP F+P L + ET+YY+ K GG +FR P W+V+
Sbjct: 220 KLRGGYYPRIFFPGLYNAKERFLKKFETVYYL---KALPGGWIFRRAPEDWQVV 270
>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 249
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 110/243 (45%), Gaps = 49/243 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A+ DG KL+++E FP IE+ + + +
Sbjct: 4 LPTTLSEAIEQAKQAATAALDDGYKLIQVELVFPE-------------IELQ-AQSIASQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 50 FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 96
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 97 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 150
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQE 309
L + ++E+ YY+ K +G LFRCYPG+W+V ++ +Y + +Q
Sbjct: 151 R-------QLRERFLNIIESCYYL---KPLDGAALFRCYPGTWEVWLEIEGEYQKITEQS 200
Query: 310 VMP 312
P
Sbjct: 201 TKP 203
>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
Length = 262
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 110/243 (45%), Gaps = 49/243 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A+ DG KL+++E FP IE+ + + +
Sbjct: 17 LPTTLSEAIEQAKQAATAALDDGYKLIQVELVFPE-------------IELQ-AQSIASQ 62
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 63 FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 109
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 110 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 163
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQE 309
L + ++E+ YY+ K +G LFRCYPG+W+V ++ +Y + +Q
Sbjct: 164 R-------QLRERFLNIIESCYYL---KPLDGAALFRCYPGTWEVWLEIEGEYQKITEQS 213
Query: 310 VMP 312
P
Sbjct: 214 TKP 216
>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 249
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 110/243 (45%), Gaps = 49/243 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A++DG KL+++E FP IE+ + + +
Sbjct: 4 LPTTLSEAIEQAKQAATAALEDGYKLIQVELVFPE-------------IELQ-AQSIASQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 50 FIPALEKPD--TLLKVFFPDAGSAALARR----------DWGETPFRVTDIG-TSRSPVE 96
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 97 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYAA 150
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQE 309
L + +E+ YY+ K +G LFRCYPG+W+V ++ +Y + +Q
Sbjct: 151 R-------QLRERFLNTIESCYYL---KPLDGAALFRCYPGTWEVWLEIDGEYQKITEQS 200
Query: 310 VMP 312
P
Sbjct: 201 TKP 203
>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
Length = 262
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 110/243 (45%), Gaps = 49/243 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ SE ++QAK AA A++DG KL+++E FP IE+ + + +
Sbjct: 17 LPTTLSEAIEQAKQAATAALEDGYKLIQVELVFPE-------------IELQ-AQSIASQ 62
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F P+ T ++FFP+A AR+ D+ P D G T + +
Sbjct: 63 FIPALEKPD--TLLKVFFPDAGAAALARR----------DWGETPFRVTDIG-TSRSPVE 109
Query: 194 DRVKLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYP 249
R++ +D FLV P VN+ VE L+K A + +++ N L+ I GY
Sbjct: 110 TRLQPDDGQFLVVSPSPVEVNQ---VENLHKLAGDRS---VVLLNPRLEDVAIIGIGYTA 163
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQE 309
L + +E+ YY+ K +G LFRCYPG+W+V ++ +Y + +Q
Sbjct: 164 R-------QLRERFLNTIESCYYL---KPLDGAALFRCYPGTWEVWLEIDGEYQKITEQS 213
Query: 310 VMP 312
P
Sbjct: 214 TKP 216
>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
Length = 249
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 102/242 (42%), Gaps = 46/242 (19%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P E ++QAK A + A+ DG KL+++E FP IE+ + + +
Sbjct: 4 LPKSIEEAVEQAKQATQAALDDGYKLVQVELVFPE-------------IELQ-AQAIAQQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F + E T ++FFP+A AR+ D+ P D G + + +
Sbjct: 50 F--IPAIEESGTVLKVFFPDAGAAALARR----------DWGEIPFKISDLG-SSRSPID 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
RVK +D FLV P + VE++ K + I+ N L+ I GY
Sbjct: 97 SRVKDDDGRFLVVSPT-----PVEVEQVEKLSQLAGDRVTILLNPRLEDIAIIGIGYAAR 151
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEV 310
AL +E+ YY+ +G L+RCYP W+V +++ +Y L Q++
Sbjct: 152 -------ALRDRFISTIESCYYLRPLEG--DAALYRCYPSLWEVWQEIDGEYTLLAQEQT 202
Query: 311 MP 312
P
Sbjct: 203 KP 204
>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
Length = 248
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 66/258 (25%), Positives = 114/258 (44%), Gaps = 50/258 (19%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P D +E + Q+++A A+ DG L+++E FP L + + + + + E
Sbjct: 4 LPKDLNEAIAQSRIATAAALSDGKTLLQVELVFPEIALQA----------QSITEQFLPE 53
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
+++ ++FFP+A AR+ D+ P D G + + +
Sbjct: 54 LEEIY------PGVKVFFPDAGAAALARR----------DWGETPFKVTDLG-SSRSPVE 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
D++ ED+LFL+ P E+ VE LY A +I+ N L+ + + GY
Sbjct: 97 DKIAPEDQLFLLINP--AAVEVAQVERLYIAAAGR---PVILLNPRLEDVATIGIGYAGR 151
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLH---Q 307
+ LSK +E+ YY+ + LFRCYP SW+V + +N+Y + Q
Sbjct: 152 QLRDRF--LSK-----IESCYYVRPL---DAAALFRCYPQSWQVWLERNNQYELISETAQ 201
Query: 308 QEVMPSLKEVALDILPSA 325
+ V L+ + LPSA
Sbjct: 202 KPVGDDLERILAASLPSA 219
>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 248
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 112/258 (43%), Gaps = 50/258 (19%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P D +E + Q+++A A+ DG L+++E FP L + + +++ + E
Sbjct: 4 LPKDLNEAIAQSRIATAAALSDGKTLLQVELVFPEIALQA----------QSITLQFLPE 53
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +++ ++FFP+ AR+ D+ P D G + + +
Sbjct: 54 FEEIY------PGVKVFFPDTGAAALARR----------DWGETPFKVTDLG-SSRTPVE 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
D++ ED+LFL+ P E+ VE++Y A +I+ N L+ + + GY
Sbjct: 97 DKIAPEDQLFLLINP--AAVEVAQVEKIYIAAAGR---PVILLNPRLEDVATIGIGYAGR 151
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLH---Q 307
L +E+ YYI + LFRCYP W+V + +++Y + Q
Sbjct: 152 -------QLRDRFLNKIESCYYIRPL---DTAALFRCYPQPWQVWLETNDEYELISETAQ 201
Query: 308 QEVMPSLKEVALDILPSA 325
+ V L+ + LPSA
Sbjct: 202 KPVGDDLERILAASLPSA 219
>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 253
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 108/244 (44%), Gaps = 37/244 (15%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+D +E + Q A + A++DG ++IE VP IE+ + L +F
Sbjct: 4 LPNDINEAIVQGMEATKAALQDGYTRVQIEI------VVPD-----IELQ-AQSLAKQFI 51
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
+ E T+ ++FFP++ AR+ ++ A+FK+ ED G T + + +
Sbjct: 52 PALL--ETSTKLKVFFPDSGAAALARRD-WQDATFKI---------EDLG-TSRSPVDKK 98
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
V+ ED+ FL+ P + E+ E+L A LI ++ + GY
Sbjct: 99 VEPEDQCFLLIAP--SAIEVAQTEKLSNLAGDRPVIMLIPKLEDVSIVGIGYAAR----- 151
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLK 315
L + +E+ YYI + G L+RCYP W+V + + +Y + +Q P
Sbjct: 152 --QLRERFIKTIESCYYIRSL---GGAALYRCYPSPWQVWLEENGQYKLIAEQPEKPVGD 206
Query: 316 EVAL 319
EV +
Sbjct: 207 EVDM 210
>gi|224013206|ref|XP_002295255.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969217|gb|EED87559.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 391
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 41/64 (64%), Gaps = 1/64 (1%)
Query: 233 LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNG-GTLFRCYPGS 291
+++ NG LD++R G+YP+ F+PKLAA + E+++Y+ F + G L+R YP
Sbjct: 286 MVVINGALDKVRGGFYPAIFFPKLAATVDRFWKRFESVFYLKPFSDKGVYGWLYRVYPEP 345
Query: 292 WKVL 295
W+V+
Sbjct: 346 WQVV 349
>gi|397566319|gb|EJK45002.1| hypothetical protein THAOC_36416 [Thalassiosira oceanica]
Length = 370
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 120/299 (40%), Gaps = 62/299 (20%)
Query: 83 LLDQAKMAAELAVKDGMKLMEIEFP----TAGLDSVPGDSEGGIEMTGSMRLICEFCDLF 138
L AK+A + A+ DG+ +E+EFP A S D + E+ + + +F
Sbjct: 75 LRKTAKLAIDSAIADGVSKIEVEFPPLLGGARSKSQFDDFDNVQELDSNKEWTMQLAPMF 134
Query: 139 VTPE--KVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDF------------ 184
+ K RT + FP+ E + A+K F G ++ T +F
Sbjct: 135 AGDKTYKDGRTWLVFPDLKECELAKKD-FPGQRYQEATFTTIEAVTNFMSSSGSPGSSEE 193
Query: 185 -----------GFTEKV--KMADRVKLEDE-------------LFLVAYPYFN--VNEML 216
G + + K D L D+ L+LV P V + +
Sbjct: 194 YAAPWGASLMSGLSSMMGGKDGDAGLLGDQSSLDSLNVDSPANLWLVVQPGNGGPVEDWV 253
Query: 217 VVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNF 276
E+++ ++ +++ NG LD++R G+Y F+P LAA + + ET Y+ F
Sbjct: 254 NCEKMHSPSI-----PMVVVNGALDKVRGGFYAPIFFPALAATVERFWKKFETGLYLKPF 308
Query: 277 KGRNG-GTLFRCYPGSWKVL------KKVSNKYICLHQQEVM---PSLKEVALDILPSA 325
+ G L+R YP W+V+ K N + EVM PS +V +L ++
Sbjct: 309 SDKGVYGWLWRVYPEPWQVVYEKVTPDKNGNAQVEYQTVEVMTTKPSYNQVVSILLKAS 367
>gi|384251129|gb|EIE24607.1| hypothetical protein COCSUDRAFT_14109 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 118/274 (43%), Gaps = 36/274 (13%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGDSEGGIEMTGSMRLICEF 134
PS + EL++ A + A+ DG+ +E+EFP +D G S+ I+ +++L
Sbjct: 92 PSSFQELVNDATASVRAAIGDGLTRLEVEFPALPGNIDGYKGASDWFID--SNIQLAIAA 149
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGA-----SFKLDYLTKPS--FFEDFGFT 187
+ V E R I P+ E + K +F+GA + +L + S F F F
Sbjct: 150 SRILVK-ESGKRVHILVPDGGEYNRSYK-MFKGALDLADGISMGHLKENSKGVFSSFNFF 207
Query: 188 EKVKMADRVKLED-----ELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR 242
V AD L ++F+V + E+ +E +E V L+++N E+D
Sbjct: 208 GSVPDADAETLSQAARKADVFIVVNA--STIELPDLERYIEEIVGERP--LVLWNLEVDT 263
Query: 243 IRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN-----------FKGRNGGTLFRCYPGS 291
+R+ F PK L ++YI F G +FR YPG
Sbjct: 264 LRADLGLLGFPPK--ELQYRFLSQFTPVFYIRQRDYSKSVAVSPFIINYSGCIFREYPGP 321
Query: 292 WKV-LKKVSNKYICLHQQEVMPSLKEVALDILPS 324
W+V L++ + +Y C+ + E +L E +++ +
Sbjct: 322 WQVMLRQDNGQYACIAEDERRYNLGEAKEEMMAA 355
>gi|218189920|gb|EEC72347.1| hypothetical protein OsI_05588 [Oryza sativa Indica Group]
Length = 377
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 123/284 (43%), Gaps = 43/284 (15%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSM 128
V V P Y L+ A + A+ +G +EIEFP + + S G S+ I+ +
Sbjct: 65 GVSVYKPRSYDVLVSDAARSLACAMDEGKTRLEIEFPPLPSNISSYKGSSDEFIDANIQL 124
Query: 129 RLICEFCDLFVTPEKVTRTRIFFPEANEVKFARK-------SVFEGASFKLDYL-TKP-- 178
L + K TR+ I FP+ E + A + S+ LD + T P
Sbjct: 125 ALAVA---RKLKELKGTRSCIVFPDLPEKRRASQLFGTALDSIETATISSLDEVSTGPVN 181
Query: 179 SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
+FF DF F + V+ DR K ++ L + + ++ +E+ ++ F ++
Sbjct: 182 TFFRSMRDTLDFDFADDVE--DRWKSDEPPSLYIFINCSTRDLSTIEKYVEQ--FASSVP 237
Query: 233 LIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGR 279
++FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 238 ALLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNYS-- 295
Query: 280 NGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YPG W+V LK+ Y C+ + +L + ++L
Sbjct: 296 --GAVFRQYPGPWQVMLKQADGSYACVAESAARFTLGQAKEELL 337
>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
Length = 243
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 55/233 (23%), Positives = 98/233 (42%), Gaps = 50/233 (21%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P E + QA++A + A+ DG+ +++E L+ +P
Sbjct: 4 LPKTLGEAVSQARIATQNAIADGLNRLQVEILLPELNPMP------------------VA 45
Query: 136 DLFVTPEKVTR---TRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
+ F++ E+ +IFFP+A AR+ E LD TK +V +
Sbjct: 46 ERFLSDEEGISHNFNKIFFPDAGAAALARRDWGEVPFELLDISTK-----------RVSV 94
Query: 193 ADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIR---SGYYP 249
++++ EDE L P E+L +E+L A+ + +++ N L+ + GY
Sbjct: 95 EEQIQPEDEAILCIAP--TAQEVLQIEKLCN-AMGDRP--VVLLNPRLEDVSIVGIGYAG 149
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKY 302
L +++ YY+ + GTLFRCYPG W+V + ++ Y
Sbjct: 150 R-------QLRDRFLKNIQSCYYLRPLE---EGTLFRCYPGLWQVWCETTDGY 192
>gi|449528829|ref|XP_004171405.1| PREDICTED: uncharacterized LOC101213889 [Cucumis sativus]
Length = 388
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 122/283 (43%), Gaps = 53/283 (18%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGDSEGGIEMTGSMRLICEF 134
P Y L+ A + A++DG +EI+FP + + S G S+ I+ + L
Sbjct: 82 PKTYEVLVSDAANSLAYALEDGKMRLEIDFPPLPSNISSYKGSSDDFIDANIQLALAVAR 141
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS-----------FFE- 182
+ ++ R+ I FP+ E + A + +F+ A +D +T S FF
Sbjct: 142 N---LQEKRGIRSCIVFPDKPEKRRASQ-LFKTALDSIDGITVSSLDDVPAGAVTSFFRS 197
Query: 183 -----DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFN 237
DF F + A R D L + + E+ ++E+ Y E F ++ ++FN
Sbjct: 198 VRNTLDFDFEDD--NAGRWTSSDPPSLYIFINCSTRELGLIEK-YVE-TFASSIPALLFN 253
Query: 238 GELDRIRS-----GYYPS------------FFYPKLAALSKTLFPVMETIYYIHNFKGRN 280
EL+ +R+ G+ P FY ++ SKT + YI N+
Sbjct: 254 LELETLRADLGLLGFPPKDLHYRFLSQFIPVFYIRIREYSKT----VAVAPYIVNY---- 305
Query: 281 GGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G LFR YPG W+V LK+ N Y C+ + E +L E ++L
Sbjct: 306 SGALFRQYPGPWQVMLKQSDNSYACVAESETRFTLGETKDELL 348
>gi|115443809|ref|NP_001045684.1| Os02g0117100 [Oryza sativa Japonica Group]
gi|41052833|dbj|BAD07724.1| unknown protein [Oryza sativa Japonica Group]
gi|113535215|dbj|BAF07598.1| Os02g0117100 [Oryza sativa Japonica Group]
gi|125580571|gb|EAZ21502.1| hypothetical protein OsJ_05126 [Oryza sativa Japonica Group]
Length = 377
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 123/284 (43%), Gaps = 43/284 (15%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSM 128
V V P Y L+ A + A+ +G +EIEFP + + S G S+ I+ +
Sbjct: 65 GVSVYKPRSYDVLVSDAARSLACAMDEGKTRLEIEFPPLPSNISSYKGSSDEFIDANIQL 124
Query: 129 RLICEFCDLFVTPEKVTRTRIFFPEANEVKFARK-------SVFEGASFKLDYL-TKP-- 178
L + K TR+ I FP+ E + A + S+ LD + T P
Sbjct: 125 ALAVA---RKLKELKGTRSCIVFPDLPEKRRASQLFGTALDSIETATISSLDEVSTGPVN 181
Query: 179 SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK 232
+FF DF F + V+ DR K ++ L + + ++ +E+ ++ F ++
Sbjct: 182 TFFRSMRDTLDFDFADDVE--DRWKSDEPPSLYIFINCSTRDLSTIEKYVEQ--FASSVP 237
Query: 233 LIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGR 279
++FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 238 ALLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNYS-- 295
Query: 280 NGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YPG W+V LK+ Y C+ + +L + ++L
Sbjct: 296 --GAVFRQYPGPWQVMLKQADGSYACVAESAARFTLGQAKEELL 337
>gi|427715914|ref|YP_007063908.1| hypothetical protein Cal7507_0584 [Calothrix sp. PCC 7507]
gi|427348350|gb|AFY31074.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 7507]
Length = 243
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 101/250 (40%), Gaps = 44/250 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P E + Q++ A + A+ DG +++EF L +P + ++
Sbjct: 4 LPKSLEEAIAQSRTATQAALADGYTRLQVEFLFPELKPMP--------------VAEQYL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
L E +R ++FF + AR+ P D G ++D+
Sbjct: 50 PLLADYE--SRLKVFFADTGAAALARRD-----------WDVPFTISDIGTGRATSVSDK 96
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDR---IRSGYYPSFF 252
++ E+E+FL P + E+ +E+L+ E A I+ N L+ + GY
Sbjct: 97 IQPEEEIFLFIAP--SSVEISQLEKLFAEIGDRPA---ILLNPRLEDAAIVGIGYAAR-- 149
Query: 253 YPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
+ + +ET YY+ + +FRCYPG W+V + + +Y + + P
Sbjct: 150 -----QIRERFINTIETCYYLRPVDDQTA--VFRCYPGLWEVWVETNGEYQKIAELPKRP 202
Query: 313 SLKEVALDIL 322
S E+ L +L
Sbjct: 203 SGDEIDLILL 212
>gi|255543323|ref|XP_002512724.1| conserved hypothetical protein [Ricinus communis]
gi|223547735|gb|EEF49227.1| conserved hypothetical protein [Ricinus communis]
Length = 385
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 150/360 (41%), Gaps = 58/360 (16%)
Query: 1 MPLSSNSAIALPTVPIPVPSIPRRQANCRKFSIKCSNENFSGQRI-ITFSPYRRKHSCLT 59
M ++S++ ++ P+ P PR FS+ S + I I S ++
Sbjct: 6 MAMASSALPSISRTPLFFPHSPR----TLLFSVSPSLQKLPYPTIRIQCSNTSKQQEESQ 61
Query: 60 NSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGD 117
+ +S+ N V V P Y L + A A++DG +EI+FP + + S G
Sbjct: 62 SQSTSNLNPRKGVSVYKPKSYDVLANDAANCLAYALQDGKTRLEIDFPPLPSNISSYKGS 121
Query: 118 SEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTK 177
S+ I+ + L + +K TR I FP+ E + A + +F+ A +D +T
Sbjct: 122 SDEFIDANIQLALAI---IRKLQEKKETRACIVFPDKPEKRRASE-LFKAALDSIDGITI 177
Query: 178 PS-----------FFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEE 220
S FF+ DF F + + R + ++ L + + E+ V+E+
Sbjct: 178 GSLDDVPSGPVSNFFKSVRNTLDFDFEDDNE--GRWQSDEPPSLYVFINCSTRELSVIEK 235
Query: 221 LYKEAVFNTAWKLIIFNGELDRIRSGY----YPS-------------FFYPKLAALSKTL 263
+ F + ++FN ELD +R+ +P+ FY ++ SKT
Sbjct: 236 YVEN--FAGSTPALLFNLELDTLRADLGLLGFPTKDLHYRFLSQFIPVFYIRIREYSKT- 292
Query: 264 FPVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
+ YI N+ G LFR YPG W+V LK+ Y C+ + +L E ++L
Sbjct: 293 ---VAVAPYIVNY----SGALFRQYPGPWQVMLKQSDGSYACVAESATRFTLGETKEELL 345
>gi|242060200|ref|XP_002451389.1| hypothetical protein SORBIDRAFT_04g001270 [Sorghum bicolor]
gi|241931220|gb|EES04365.1| hypothetical protein SORBIDRAFT_04g001270 [Sorghum bicolor]
Length = 385
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 120/282 (42%), Gaps = 39/282 (13%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
V V P Y L+ A + A+ DG +EIEFP + +++L
Sbjct: 73 GVSVYKPRSYDVLVTDAARSLACAIDDGKTRLEIEFPPLPSSISSYKGSSDEFIDANIQL 132
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASFKL--DYLTKP--SF 180
+ K T++ I FP+ E + A + EG + D T P SF
Sbjct: 133 ALVVARK-LKELKGTKSCIVFPDQPEKRRASQLFRTAIDTIEGVTVSSLDDVPTDPVNSF 191
Query: 181 FE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLI 234
F+ DF F++ + DR K ++ L + + ++ +E+ ++ F T+ +
Sbjct: 192 FKSIRNTLDFDFSDDNE--DRWKSDEPPSLYIFINSSTRDLASIEKYVEK--FATSVPAL 247
Query: 235 IFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGRNG 281
+FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 248 LFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVAPYIVNY----S 303
Query: 282 GTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YPG W+V LK++ Y C+ + E +L + ++L
Sbjct: 304 GAVFRRYPGPWQVMLKQLDGSYACVAESEARFTLGQAKEELL 345
>gi|219125569|ref|XP_002183049.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405324|gb|EEC45267.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 356
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 126/289 (43%), Gaps = 61/289 (21%)
Query: 87 AKMAAELAVKDGMKLMEIEFP--TAGLDS-VPGDSEGGIEMTGSMRLICEFCDLFVTPEK 143
A A + A++DG K +EI+FP T G S D ++ + R C + + P
Sbjct: 76 AAPALQKALRDGWKQLEIDFPPLTGGDQSKTQFDDFDNVQELNANRDWC----VQLAPAI 131
Query: 144 VTRTR-IFF--PEANEVKFARKSVFEGASFK------------LDYLTKPSFFEDFGFTE 188
++ R ++F P+ E + A++ + G F+ L + + + +G T
Sbjct: 132 ASKNREVWFILPDDKECELAKEE-WTGQRFRQAAKFTSVRAAVLKTSGESQYSKAWGSTI 190
Query: 189 KVKM----------ADRVKLED-----ELFLVAYPYFN--VNEMLVVEELYKEAVFNTAW 231
M AD L+D LV P V + + VE L+K + +
Sbjct: 191 ASTMNKLTGGDGILADSSTLDDLGSGDRFHLVCQPGNGGPVEDWINVERLHKA---DPSQ 247
Query: 232 KLIIFNGELDRIRSGYYPSFFYPKLAALSKTL--FPVMETIYYIHNFKGRNG-GTLFRCY 288
+ NG LD++R GYYP+ F+P AL++T+ + E ++ + + G L+R Y
Sbjct: 248 PTCVVNGALDKVRDGYYPAVFFP---ALARTIPFYREFEAVFILKPVSDKGVYGWLYRVY 304
Query: 289 PGSWKVL------KKVSNKYICLHQ------QEVMPSLKEVALDILPSA 325
P W+V+ K+ ++ + + + +E PS K++ +L +A
Sbjct: 305 PEPWQVVLQEPQRKQRGDQTVVVVEDKVALVRETRPSYKDIVDALLVTA 353
>gi|357138473|ref|XP_003570816.1| PREDICTED: uncharacterized protein LOC100838483 [Brachypodium
distachyon]
Length = 378
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 121/279 (43%), Gaps = 45/279 (16%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSMRLICEF 134
P Y L+ A + A+ DG +EIEFP + + S G S+ I+ + L
Sbjct: 72 PRSYEVLVADAARSLACAIDDGRTRLEIEFPPLPSNISSYKGSSDEFIDANVQLVLAVAR 131
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS-----------FFE- 182
+ + TR+ I FP+ E + A + +F A ++ +T S FF+
Sbjct: 132 N---LKELRGTRSCIVFPDQPEKRRASQ-LFRTAIDSIEGVTVSSLDDLPSGPINNFFKS 187
Query: 183 -----DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFN 237
DF F++ + DR K ++ L + + ++ +E+ Y E F + ++FN
Sbjct: 188 IVSTLDFDFSDDNE--DRWKSDEPPSLYIFINSSTRDLSSIEK-YVE-TFAPSTPSVLFN 243
Query: 238 GELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGRNGGTL 284
ELD +RS G+ P + + + +F + + Y YI N+ G +
Sbjct: 244 LELDTLRSDLGILGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVNY----SGAV 299
Query: 285 FRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
FR YPG W+V LK+ Y C+ + +L + ++L
Sbjct: 300 FRQYPGPWQVMLKQADGSYACVAESASRFTLGQAKEELL 338
>gi|449436191|ref|XP_004135877.1| PREDICTED: uncharacterized protein LOC101213889 [Cucumis sativus]
Length = 388
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 121/283 (42%), Gaps = 53/283 (18%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSMRLICEF 134
P Y L+ A + A++DG +EI+FP + + S G S+ I+ + L
Sbjct: 82 PKTYEVLVSDAANSLAYALEDGKMRLEIDFPPLPSNISSYKGSSDDFIDANIQLALAVAR 141
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKP-----------SFFE- 182
+ ++ R+ I FP+ E + A + +F+ A +D +T SFF
Sbjct: 142 N---LQEKRGIRSCIVFPDKPEKRRASQ-LFKTALDSIDGITVSSLDDVPAGAVTSFFRS 197
Query: 183 -----DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFN 237
DF F + A R D L + + E+ ++E+ Y E F ++ ++FN
Sbjct: 198 VRNTLDFDFED--DNAGRWTSSDPPSLYIFINCSTRELGLIEK-YVE-TFASSIPALLFN 253
Query: 238 GELDRIRS-----GYYPS------------FFYPKLAALSKTLFPVMETIYYIHNFKGRN 280
EL+ +R+ G+ P FY ++ SKT + YI N+
Sbjct: 254 LELETLRADLGLLGFPPKDLHYRFLSQFIPVFYIRIREYSKT----VAVAPYIVNY---- 305
Query: 281 GGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G LFR Y G W+V LK+ N Y C+ + E +L E ++L
Sbjct: 306 SGALFRQYAGPWQVMLKQSDNSYACVAESETRFTLGETKDELL 348
>gi|300865956|ref|ZP_07110692.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336022|emb|CBN55850.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 248
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 106/257 (41%), Gaps = 48/257 (18%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLI 131
+ P E + Q++ A + A+ D ++IE FP L + + + + I
Sbjct: 2 IELPKTLEEAIAQSREAVKAALNDAQTRLQIELVFPEIALQA----------QSITQQFI 51
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
+F +L ++FFP+ AR+ D+ T P D G + +
Sbjct: 52 TDFEEL------ANGLKVFFPDTGSAALARR----------DWGTVPFKITDLG-SSRSP 94
Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD---RIRSGYY 248
+A++++ ED++FL+ + E+ VE+L A + +I+ N L+ I GY
Sbjct: 95 VAEKIEPEDQIFLLVNA--SSIEVAQVEQLCNAA---DSRPVILLNPRLEDAATIGIGYA 149
Query: 249 PSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQ 307
L +++ YYI LFRCYP SW+V L++ +Y + +
Sbjct: 150 GR-------QLRDRFLNTLQSCYYIRPLPT---AALFRCYPQSWQVWLEETEGEYKLISE 199
Query: 308 QEVMPSLKEVALDILPS 324
P E+ I P+
Sbjct: 200 TAQKPVGDELERIIAPT 216
>gi|363807938|ref|NP_001242453.1| uncharacterized protein LOC100803725 [Glycine max]
gi|255642243|gb|ACU21386.1| unknown [Glycine max]
Length = 381
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 72/283 (25%), Positives = 119/283 (42%), Gaps = 53/283 (18%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSMRLICEF 134
P Y L+ A + A++DG +EI+FP + + S G S+ I+ + L
Sbjct: 75 PKSYEVLVSDAANSLSYALQDGKLRLEIDFPPLPSNISSYKGSSDEFIDANIQLALAVV- 133
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLT-----------KPSFFE- 182
+ +K TR I FP+ E + A + +F+ A +D +T SFF
Sbjct: 134 --RKLKEKKETRACIVFPDKPEKRRACQ-LFKAALDSIDGITIGSLDDVPTGPMTSFFRS 190
Query: 183 -----DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFN 237
DF F + + R + + L + + E+ +E+ ++ F T+ ++FN
Sbjct: 191 VRNTLDFDFEDDNE--GRWQSSEPPSLYIFINCSTRELAYIEKYVEK--FATSTPTLLFN 246
Query: 238 GELDRIRSGY-YPSF----------------FYPKLAALSKTLFPVMETIYYIHNFKGRN 280
ELD +R+ P F FY ++ SKT + YI N+
Sbjct: 247 LELDTLRADLGLPGFSAKDLHYRFLSQFTPVFYIRIREYSKT----VAIAPYIVNY---- 298
Query: 281 GGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YPG W+V LK+ Y C+ + SL E ++L
Sbjct: 299 SGAVFRQYPGPWQVMLKQADGSYACIAESANRFSLGEAKEELL 341
>gi|357467949|ref|XP_003604259.1| hypothetical protein MTR_4g007190 [Medicago truncatula]
gi|355505314|gb|AES86456.1| hypothetical protein MTR_4g007190 [Medicago truncatula]
Length = 375
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 127/301 (42%), Gaps = 57/301 (18%)
Query: 63 SSDGNNSIN----VDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPG 116
SSD N + V V P Y L A + A++DG +EI+FP + + S G
Sbjct: 51 SSDPNRGFDPKSGVSVYKPKSYEVLATDAANSLNFALQDGKLRIEIDFPPLPSNISSYKG 110
Query: 117 DSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLT 176
S+ I+ +++L+ + +K TR + FP+ E K +F+ A +D LT
Sbjct: 111 SSDDFID--ANIQLVLAVVKK-LQEKKETRACVVFPDKPE-KLRASQLFKAALDSVDGLT 166
Query: 177 -----------KPSFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVE 219
SFF DF F ++ + R + + L + + E+ +E
Sbjct: 167 IGSLDDIPAGPVASFFRSVRNTLDFDFEDENE--GRWQSSEPPSLYIFINCSTRELGYIE 224
Query: 220 ELYKEAVFNTAWKLIIFNGELDRIRS-----GYYPS------------FFYPKLAALSKT 262
+ ++ F + ++FN ELD +R+ G+ P FY ++ SKT
Sbjct: 225 KYVEK--FAPSTPTLLFNLELDTLRADLGLLGFPPKDLQYRFLSQFTPVFYIRIRDYSKT 282
Query: 263 LFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDI 321
+ YI N+ G +FR YPG W+V LK+ Y C+ + +L E ++
Sbjct: 283 I----AVAPYIVNY----SGAVFRQYPGPWQVMLKQADGSYACVAESATRFTLGEAKEEL 334
Query: 322 L 322
L
Sbjct: 335 L 335
>gi|298204679|emb|CBI25177.3| unnamed protein product [Vitis vinifera]
Length = 374
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 72/299 (24%), Positives = 127/299 (42%), Gaps = 53/299 (17%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDS 118
S +D + + V V P Y L+ A + A+ DG +EI+FP + + S G S
Sbjct: 52 SKKADFDPKVGVSVYKPKSYEVLVTDAANSLAYALDDGKTRLEIDFPPLPSNMSSYKGSS 111
Query: 119 EGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKP 178
+ I+ +++L+ + K T+ I FP+ E + A + +F+ A +D ++
Sbjct: 112 DEFID--ANIQLVLAVVRK-LQERKETKACIVFPDKPEKRRASQ-IFKTALDSIDGISIG 167
Query: 179 S-----------FFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEEL 221
S FF DF F + + K L++ + + E+ +E+
Sbjct: 168 SLDDVPSGPVATFFRSIRDTLDFDFEDDNEGRWESKEAPSLYI--FINCSTRELAAIEKF 225
Query: 222 YKEAVFNTAWKLIIFNGELDRIRSGY----YPS-------------FFYPKLAALSKTLF 264
++ F + ++FN ELD +R+ +P+ FY ++ SKT
Sbjct: 226 VEK--FAPSTPTLLFNLELDTLRADLGLLGFPTKDLHYRFLSQFVPVFYIRIREYSKT-- 281
Query: 265 PVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
+ YI N+ G LFR YPG W+V LK+ Y C+ + +L E ++L
Sbjct: 282 --VAVAPYIVNY----SGALFRQYPGPWQVMLKQADGSYACVAESATRFTLGETKEELL 334
>gi|225443166|ref|XP_002264352.1| PREDICTED: uncharacterized protein LOC100263772 [Vitis vinifera]
Length = 378
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 72/299 (24%), Positives = 127/299 (42%), Gaps = 53/299 (17%)
Query: 61 SVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDS 118
S +D + + V V P Y L+ A + A+ DG +EI+FP + + S G S
Sbjct: 56 SKKADFDPKVGVSVYKPKSYEVLVTDAANSLAYALDDGKTRLEIDFPPLPSNMSSYKGSS 115
Query: 119 EGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKP 178
+ I+ +++L+ + K T+ I FP+ E + A + +F+ A +D ++
Sbjct: 116 DEFID--ANIQLVLAVVRK-LQERKETKACIVFPDKPEKRRASQ-IFKTALDSIDGISIG 171
Query: 179 S-----------FFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEEL 221
S FF DF F + + K L++ + + E+ +E+
Sbjct: 172 SLDDVPSGPVATFFRSIRDTLDFDFEDDNEGRWESKEAPSLYI--FINCSTRELAAIEKF 229
Query: 222 YKEAVFNTAWKLIIFNGELDRIRSGY----YPS-------------FFYPKLAALSKTLF 264
++ F + ++FN ELD +R+ +P+ FY ++ SKT
Sbjct: 230 VEK--FAPSTPTLLFNLELDTLRADLGLLGFPTKDLHYRFLSQFVPVFYIRIREYSKT-- 285
Query: 265 PVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
+ YI N+ G LFR YPG W+V LK+ Y C+ + +L E ++L
Sbjct: 286 --VAVAPYIVNY----SGALFRQYPGPWQVMLKQADGSYACVAESATRFTLGETKEELL 338
>gi|428311732|ref|YP_007122709.1| hypothetical protein Mic7113_3579 [Microcoleus sp. PCC 7113]
gi|428253344|gb|AFZ19303.1| protein of unknown function (DUF1995) [Microcoleus sp. PCC 7113]
Length = 249
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 98/244 (40%), Gaps = 49/244 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P E + QAK A + A+ DG +++E FP L + + +
Sbjct: 4 LPKTLEEAITQAKEATQSALNDGRTRLQVELVFPEIALQA--------------QSIAQQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F LF E + ++ FP+ AR+ D+ P D G T + +
Sbjct: 50 FIPLF--EEYGSGLKVLFPDTGAAALARR----------DWGEVPFKISDVG-TSRSPIT 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK--LIIFNGELDRIRSGYYPSF 251
++++ ED+ FL+ P VE E + N A ++ N +L+ I S
Sbjct: 97 NKIQAEDKAFLLVAPS-------AVEVAQVETLCNLAGDRPCVLLNPQLEDI------SI 143
Query: 252 FYPKLAA--LSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQE 309
+AA L + L +E YY+ + + R YPG W+V ++ ++Y + ++
Sbjct: 144 VGIGMAARKLRERLLSTIEPCYYLRPI---DPAAILRSYPGLWQVWLEIDDEYQLIAEEP 200
Query: 310 VMPS 313
P+
Sbjct: 201 QRPA 204
>gi|326530656|dbj|BAK01126.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 419
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 120/288 (41%), Gaps = 41/288 (14%)
Query: 66 GNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMT 125
G V V P Y L+ A + A+ DG +EIEFP +
Sbjct: 102 GELRAGVSVYKPRSYEVLVSDAARSLAAAIDDGRTRLEIEFPPLPSSISSYKGSSDEFID 161
Query: 126 GSMRLICEFC-DLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASF-KLDYL-T 176
+++L DL K TR+ I FP+ E + A + EG S LD L T
Sbjct: 162 ANVQLALAVVRDL--KKLKGTRSCIVFPDQPEKRRAAQIFKTAIDQIEGISIGSLDDLPT 219
Query: 177 KP--SFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFN 228
P +FF+ DF F++ + DR K ++ L + + ++ +E+ + F
Sbjct: 220 GPVDTFFKSIRITLDFDFSDDNE--DRWKSDEPPQLYIFINSSTRDLASIEKYVDQ--FA 275
Query: 229 TAWKLIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHN 275
+ ++FN ELD +RS G+ P + + + +F + + Y YI N
Sbjct: 276 ASVPAVLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVN 335
Query: 276 FKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
+ G +FR YPG W+V LK+ Y C+ + +L + ++L
Sbjct: 336 YS----GAVFRQYPGPWQVMLKQADGSYACVAESASRFTLGQAKDELL 379
>gi|254410487|ref|ZP_05024266.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196182693|gb|EDX77678.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 245
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/239 (22%), Positives = 92/239 (38%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P E + QA+ A + A+ DG +++E FP L + + +
Sbjct: 4 LPQTLDEAIAQAREATQAALNDGYTRLQVELVFPEIALQA--------------QSIAWQ 49
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + ++ FP+A AR+ D+ P D G T + +
Sbjct: 50 FTPIF--EQYGSGLKVLFPDAGAAALARR----------DWGETPFKVTDIG-TSRSPVE 96
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
R++ EDE FL+ P + E+ VE+L A L +L + GY
Sbjct: 97 SRIQPEDEAFLIVSP--SAVEVSQVEKLCNLAGDRPCVLLTPQLEDLKVVGIGYAAR--- 151
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + + + YY+ +G L R YPG W++ + N Y + ++ P
Sbjct: 152 ----QLRERFLSTLTSCYYVQPL---DGAALLRVYPGLWQIWLEKENAYQLIAEEPQKP 203
>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
Length = 244
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/247 (22%), Positives = 108/247 (43%), Gaps = 39/247 (15%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+ + + Q+++A + A+ DG ++++F L +P + +F
Sbjct: 4 LPNSLEQAIAQSRIATQAALADGYTRLQVDFLFPELKLMP--------------VAEQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R +IFFP+A A + + G FK+ D G + +
Sbjct: 50 SLFT--EYDSRLKIFFPDAGGAALANRD-WAGTPFKI---------LDIGTGRVASIQSK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
++ EDE+FL P E+ VE+L E + + + ++ N L+ SG + +
Sbjct: 98 IQPEDEIFLFIAP--TSVEVPQVEKLC-ENIGDRPF--VMLNPRLE--DSGVVGIGYTAR 150
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLK 315
+ +E+ YY+ + +FRCYPG W+V +++ +Y + + P+
Sbjct: 151 --QTRQRFISTLESCYYLRPVD--DTTAVFRCYPGLWEVWVEINGEYQKVAELPKRPTGD 206
Query: 316 EVALDIL 322
E LD++
Sbjct: 207 E--LDVI 211
>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
Length = 325
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/279 (23%), Positives = 119/279 (42%), Gaps = 57/279 (20%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIE-MTGSMRLIC 132
P P + ++QA ++A+ A++DG KL+EIEFP ++ ++ G + ++
Sbjct: 16 TPPPKSFRMCVEQAYLSAKQAIEDGHKLIEIEFPPLPQSAMDNEAIGADTILKAQIQHST 75
Query: 133 EFCDLF-------VTPEKVTRTRIFFPEAN--------EVKF-ARKSVFEGASFKLDYLT 176
+F LF V + V R R E + ++F A K F+G+ + ++
Sbjct: 76 DFAKLFKNKKTAIVFADIVERNRFIDDETSSNPQSWRGNIRFTALKGGFKGSLIERVWIN 135
Query: 177 KPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIF 236
K DF V+ +D++F++ + E+ V EL K A +I+F
Sbjct: 136 K-----DF--------VSEVQEDDDMFIIIGA--SAQELPDVRELCKAAGDRP---VILF 177
Query: 237 NGELDRIRSGYYPSFF-------------YPKLAALSKTLFPVMETIYYIHNFKGRNGGT 283
N +L +R + FF P L K+ + ++ N+ G
Sbjct: 178 NLKLQVLRGDFGLPFFPSKSLHNDWLCEALPAYFMLPKSYTKTIAGPPFLINY----SGA 233
Query: 284 LFRCYPGSWKVLKKVSN-----KYICLHQQEVMPSLKEV 317
LFR YPG W++L +V + +Y + + P+L +V
Sbjct: 234 LFRTYPGKWQMLLEVPDEDGGGRYQRVRMLDKRPALSDV 272
>gi|224115852|ref|XP_002332073.1| predicted protein [Populus trichocarpa]
gi|222831959|gb|EEE70436.1| predicted protein [Populus trichocarpa]
Length = 381
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/283 (24%), Positives = 122/283 (43%), Gaps = 53/283 (18%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGDSEGGIEMTGSMRLICEF 134
P Y L+ A + +++DG +EI+FP + + S G S+ I+ + L
Sbjct: 75 PKSYEVLVTDAANSLAFSLQDGKIRLEIDFPPLPSNISSYKGSSDEFIDANIQLALAV-- 132
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPS-----------FFE- 182
+ ++ TR + FP+ E+ A + +F+ A +D +T S FF+
Sbjct: 133 -IRKLQEKRETRACVVFPDKPEMLRACR-IFKTALDSIDGITIGSLDDIPSGPVTTFFKS 190
Query: 183 -----DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFN 237
DF F + + R + + L + + E+ V+E+ ++ F T+ ++FN
Sbjct: 191 VRNTLDFDFEDDSE--GRWQSNEPPSLYVFINCSTRELSVIEKYVEK--FATSTPTLLFN 246
Query: 238 GELDRIRSGY----YPS-------------FFYPKLAALSKTLFPVMETIYYIHNFKGRN 280
ELD +R+ +P+ FY ++ SKT+ YI N+
Sbjct: 247 LELDTLRADLGLLGFPTKDLHYRFLSQFIPVFYIRIREYSKTI----GVAPYIVNY---- 298
Query: 281 GGTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G LFR YPG W+V LK+ Y C+ + +L E ++L
Sbjct: 299 SGALFRQYPGPWQVMLKQADGSYACVAESATRFTLGETKEELL 341
>gi|254422515|ref|ZP_05036233.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
gi|196190004|gb|EDX84968.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
Length = 248
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/226 (22%), Positives = 91/226 (40%), Gaps = 51/226 (22%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+ E QA++A + A+ G+ L++IE L P + +F
Sbjct: 3 IPTTIPEATKQAEIATQAALDAGISLLQIEMAIPELKHQP--------------IAQQFV 48
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
+LF + ++ FP+A AR+ P DF ++A
Sbjct: 49 ELF----NDRQFKVLFPDAGAAALARRD-----------WNNP----DFVIRGLGELAQT 89
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPSFF 252
V ED+L+L+ P + E+ VE L EA+ +++ N +L+ + GY
Sbjct: 90 VDPEDDLYLIVNP--SAVEVDKVEALCNEALDQP---VVLLNPQLEDVAVVGIGYAAR-- 142
Query: 253 YPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKV 298
L +ET YY+ + G ++R YPG W++ +++
Sbjct: 143 -----QLRDRFLSQIETCYYVRPI---DQGVVYRAYPGPWQIWREI 180
>gi|443320726|ref|ZP_21049809.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
gi|442789566|gb|ELR99216.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
Length = 241
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 92/242 (38%), Gaps = 49/242 (20%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICEF 134
P E + AK A A+ DG + +++E P L + L EF
Sbjct: 4 PQSLEEAIASAKSAVTNALNDGYRRLQVELVIPEIALQA--------------QALALEF 49
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMAD 194
LF E + ++F P+ AR+ D+ P D G T + +
Sbjct: 50 TTLFT--EYGSGLKVFLPDTGAAALARR----------DWGETPFAVTDLG-TNRSPIET 96
Query: 195 RVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWK--LIIFNGELDRIRSGYYPSFF 252
RV D++FLV P VE E + N A +++ +L+ + S
Sbjct: 97 RVNDTDQIFLVVSPS-------AVEVTQVEKLCNLAGDRPVVLLIPQLEDV------SIV 143
Query: 253 YPKLAA--LSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEV 310
LAA L + +E+ YY FK G + + YP W+V ++ N Y L +Q
Sbjct: 144 GIGLAARQLRERFLSTLESCYY---FKPLEGAAVLKNYPSPWQVWRESGNDYQLLCEQGE 200
Query: 311 MP 312
P
Sbjct: 201 KP 202
>gi|413935256|gb|AFW69807.1| hypothetical protein ZEAMMB73_081024 [Zea mays]
Length = 379
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 117/282 (41%), Gaps = 39/282 (13%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
V V P Y L+ A + A+ DG +EIEFP + +++L
Sbjct: 67 GVSVYKPRSYDVLVTDAARSLACAIDDGKTRLEIEFPXLPSSISSYKGSSDEFIDANIQL 126
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASFKL--DYLTKP--SF 180
+ K TR+ I FP+ E + A + EG + D T P SF
Sbjct: 127 ALVVARK-LKELKGTRSCIVFPDQPEKRRASELFKTAIDTIEGVTISSLDDVPTDPVNSF 185
Query: 181 FE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLI 234
F+ DF F++ + R K ++ L + + ++ +E+ ++ F T+ +
Sbjct: 186 FKSIRNTLDFDFSDDNE--GRWKSDEPPSLYIFINSSTRDLASIEKYVEK--FATSVPAL 241
Query: 235 IFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGRNG 281
+FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 242 LFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVAPYIVNY----S 297
Query: 282 GTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YP W+V LK+ Y C+ + E +L + ++L
Sbjct: 298 GAVFRQYPAPWQVMLKQADGSYACVAESEARFTLGQAKEELL 339
>gi|413935257|gb|AFW69808.1| hypothetical protein ZEAMMB73_081024 [Zea mays]
Length = 378
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 117/282 (41%), Gaps = 39/282 (13%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
V V P Y L+ A + A+ DG +EIEFP + +++L
Sbjct: 67 GVSVYKPRSYDVLVTDAARSLACAIDDGKTRLEIEFPXLPSSISSYKGSSDEFIDANIQL 126
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASFKL--DYLTKP--SF 180
+ K TR+ I FP+ E + A + EG + D T P SF
Sbjct: 127 ALVVARK-LKELKGTRSCIVFPDQPEKRRASELFKTAIDTIEGVTISSLDDVPTDPVNSF 185
Query: 181 FE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLI 234
F+ DF F++ + R K ++ L + + ++ +E+ ++ F T+ +
Sbjct: 186 FKSIRNTLDFDFSDDNE--GRWKSDEPPSLYIFINSSTRDLASIEKYVEK--FATSVPAL 241
Query: 235 IFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGRNG 281
+FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 242 LFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVAPYIVNY----S 297
Query: 282 GTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YP W+V LK+ Y C+ + E +L + ++L
Sbjct: 298 GAVFRQYPAPWQVMLKQADGSYACVAESEARFTLGQAKEELL 339
>gi|225450962|ref|XP_002284712.1| PREDICTED: uncharacterized protein LOC100264831 [Vitis vinifera]
gi|296088326|emb|CBI36771.3| unnamed protein product [Vitis vinifera]
Length = 353
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/254 (24%), Positives = 102/254 (40%), Gaps = 41/254 (16%)
Query: 73 DVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLIC 132
D PSD + Q+ A+ V G +E L + D EG + I
Sbjct: 99 DSCLPSDLEGAVRQSGQASASFVSSGGMRAIVELLIPQLQFL--DDEGAQAELWELSRI- 155
Query: 133 EFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
F D + R + FP+A GA+ L Y + F GF +
Sbjct: 156 -FIDTLIEETGCQRVKAIFPDA------------GAAALLKYRWGDAAF---GFA---SL 196
Query: 193 ADR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL--DRIRSGYY 248
+DR V+ EDEL ++ P + + + VE + + LI++N L + + G+
Sbjct: 197 SDRKPVEAEDELVVMVVPDYQM--LAYVERIASNLSDDPPRPLIMWNPRLISEDVGVGF- 253
Query: 249 PSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL---KKVSNKYICL 305
+ L ++ + T +++ + G +FRCYPGSWKV K N+Y+
Sbjct: 254 ------NVRKLRRSF---LSTFTVVYSMRPLPAGAVFRCYPGSWKVFYDDKDRPNRYLLA 304
Query: 306 HQQEVMPSLKEVAL 319
+Q P +E+ +
Sbjct: 305 KEQISRPDAEELEI 318
>gi|304570679|ref|YP_001806112.2| hypothetical protein cce_4698 [Cyanothece sp. ATCC 51142]
gi|354552131|ref|ZP_08971439.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
gi|353555453|gb|EHC24841.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
Length = 246
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 100/251 (39%), Gaps = 43/251 (17%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMK--LMEIEFPTAGLDSVPGDSEGGIEMTGSMRLI 131
V P+ + + QAK AA+LA++ G+ +EI P L + P L
Sbjct: 2 VEVPNSLEQAVTQAKEAAKLALEAGVGRICLEIVIPEIALQAQP--------------LA 47
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
EF LF E + ++ FP+ AR+ D+ P D G +
Sbjct: 48 LEFTTLFKGQE--SGLKVIFPDTGAAALARR----------DWGETPFQVTDLG-SRATP 94
Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSF 251
+ ++ DE FL+ P + E+ +VE+L A LI ++ + GY
Sbjct: 95 IDKKISEADEAFLLVSP--SSVEVEIVEKLCNLAGHRPVILLIPQLEDVSIVGIGYAAR- 151
Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVM 311
L + +E++YY F+ +G + R YP W V + Y + +Q
Sbjct: 152 ------QLRERFISTLESVYY---FRPLDGVVVLRSYPSPWLVYLEKEEGYELIAEQGQK 202
Query: 312 PSLKEVALDIL 322
P + ALDIL
Sbjct: 203 PMGE--ALDIL 211
>gi|126656438|ref|ZP_01727699.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
gi|126622124|gb|EAZ92831.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
Length = 246
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 100/251 (39%), Gaps = 43/251 (17%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMK--LMEIEFPTAGLDSVPGDSEGGIEMTGSMRLI 131
V P + + QAK AA+LA++ G++ +EI P L + P L
Sbjct: 2 VEVPDSLEQAVTQAKEAAKLALEAGVERICLEIVIPEIALQAQP--------------LA 47
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
EF LF E + ++ FP+ AR+ D+ P D G +
Sbjct: 48 LEFTTLFEGQE--SGLKVIFPDTGAAALARR----------DWGETPFQVTDLG-SRATS 94
Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSF 251
+ ++ DE FL+ P + E+ +VE+L A + LI ++ + GY
Sbjct: 95 IEKKISEADEAFLLVSP--SSVEVEIVEKLCNLAGHRSVILLIPQLEDVSIVGIGYAAR- 151
Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVM 311
L + +E++YY F+ +G + R YP W V + Y + +Q
Sbjct: 152 ------QLRERFISTLESVYY---FRPLDGVVVLRSYPSPWLVYLEKEEGYELIAEQGQK 202
Query: 312 PSLKEVALDIL 322
P + ALD L
Sbjct: 203 PMGE--ALDTL 211
>gi|194700390|gb|ACF84279.1| unknown [Zea mays]
Length = 378
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 117/282 (41%), Gaps = 39/282 (13%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
V V P Y L+ A + A+ DG +EIEFP + +++L
Sbjct: 67 GVSVYKPRSYDVLVTDAARSLACAIDDGKTRLEIEFPPLPSSISSYKGSSDEFIDANIQL 126
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASFKL--DYLTKP--SF 180
+ K TR+ I FP+ E + A + EG + D T P SF
Sbjct: 127 ALVVARK-LKELKGTRSCIVFPDQPEKRRASELFKTAIDTIEGVTISSLDDVPTDPVNSF 185
Query: 181 FE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLI 234
F+ DF F++ + R K ++ L + + ++ +E+ ++ F T+ +
Sbjct: 186 FKSIRNTLDFDFSDDNE--GRWKSDEPPSLYIFINSSTRDLASIEKYVEK--FATSVPAL 241
Query: 235 IFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGRNG 281
+FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 242 LFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVAPYIVNY----S 297
Query: 282 GTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YP W+V LK+ Y C+ + E +L + ++L
Sbjct: 298 GAVFRQYPAPWQVMLKQADGSYACVAESEARFTLGQAKEELL 339
>gi|326533176|dbj|BAJ93560.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 112/273 (41%), Gaps = 41/273 (15%)
Query: 66 GNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMT 125
G V V P Y L+ A + A+ DG +EIEFP +
Sbjct: 56 GELRAGVSVYKPRSYEVLVSDAARSLAAAIDDGRTRLEIEFPPLPSSISSYKGSSDEFID 115
Query: 126 GSMRLICEFC-DLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASF-KLDYLTK 177
+++L DL K TR+ I FP+ E + A + EG S LD L
Sbjct: 116 ANVQLALAVVRDL--KKLKGTRSCIVFPDQPEKRRAAQIFKTAIDQIEGISIGSLDDLPA 173
Query: 178 ---PSFFE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFN 228
+FF+ DF F++ + DR K ++ L + + ++ +E+ + F
Sbjct: 174 GPVDTFFKSIRITLDFDFSDDNE--DRWKSDEPPQLYIFINSSTRDLASIEKYVDQ--FA 229
Query: 229 TAWKLIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHN 275
+ ++FN ELD +RS G+ P + + + +F + + Y YI N
Sbjct: 230 ASVPAVLFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVTPYIVN 289
Query: 276 FKGRNGGTLFRCYPGSWKV-LKKVSNKYICLHQ 307
+ G +FR YPG W+V LK+ Y C+ +
Sbjct: 290 Y----SGAVFRQYPGPWQVMLKQADGSYACVAE 318
>gi|302776844|ref|XP_002971564.1| hypothetical protein SELMODRAFT_172340 [Selaginella moellendorffii]
gi|300160696|gb|EFJ27313.1| hypothetical protein SELMODRAFT_172340 [Selaginella moellendorffii]
Length = 381
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 123/277 (44%), Gaps = 39/277 (14%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFP--TAGLDSVPGDSEGGIEMTGSMRLICEF 134
P+ Y L++ A A A+ +G +EIEFP + S G S+ I+ +++L F
Sbjct: 77 PASYDVLVNDAVDATFFALDEGNNRLEIEFPPLPNEISSYKGSSDDFID--ANIQLALAF 134
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDF----GFTEKV 190
+ + T+I FP+ E + A + VF A F L +D GF + +
Sbjct: 135 ANKLNAARGIV-TKIVFPDNVEKRRASR-VFRSA-FDLSKGISLGCLDDVPGGNGFLKAL 191
Query: 191 KMADRVKLEDEL---FLVAYP---YFNVN---EMLVVEELYKEAVFNTAWKLIIFNGELD 241
+ A + ++++ + + P Y VN L + Y +AV +A +++FN +LD
Sbjct: 192 RGAFELDFQEDVSGKWQTSSPPSMYVVVNCSGNELPDLQKYMDAVVGSA-SIVLFNLQLD 250
Query: 242 RIRS-----GY--------YPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCY 288
++RS G+ + S F P ++ + +I N+ G L R Y
Sbjct: 251 KLRSDLGLFGFPGKDLQYEFLSQFLPAFYIRTRDYSKNVPFAPFIVNYS----GALLRRY 306
Query: 289 PGSWKVLKKVSNK-YICLHQQEVMPSLKEVALDILPS 324
PG W+V+ K +N Y C+ + +L + ++L S
Sbjct: 307 PGPWQVMIKQANGVYACVAENRQRFTLGQAKEELLRS 343
>gi|195650641|gb|ACG44788.1| hypothetical protein [Zea mays]
Length = 379
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 116/282 (41%), Gaps = 39/282 (13%)
Query: 71 NVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRL 130
V V P Y L+ A + A+ DG +EIEFP + +++L
Sbjct: 67 GVSVYKPRSYDVLVTDAARSLACAIDDGKTRLEIEFPPLPSSISSYKGSSDEFIDANIQL 126
Query: 131 ICEFCDLFVTPEKVTRTRIFFPEANEVKFARK------SVFEGASFKL--DYLTKP--SF 180
+ K TR+ I FP+ E + A + EG + D T P SF
Sbjct: 127 ALVVARK-LKELKGTRSCIVFPDQPEKRRASELFKTAIDTIEGVTISSLDDVPTDPVNSF 185
Query: 181 FE------DFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLI 234
F+ DF F++ + R K + L + + ++ +E+ ++ F T+ +
Sbjct: 186 FKSIRNTLDFDFSDDNE--GRWKSDQPPSLYIFINSSTRDLASIEKYVEK--FATSVPAL 241
Query: 235 IFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKGRNG 281
+FN ELD +RS G+ P + + + +F + + Y YI N+
Sbjct: 242 LFNLELDTLRSDLGLLGFPPKDLHYRFLSQFTPVFYIRQRDYSKTIAVAPYIVNY----S 297
Query: 282 GTLFRCYPGSWKV-LKKVSNKYICLHQQEVMPSLKEVALDIL 322
G +FR YP W+V LK+ Y C+ + E +L + ++L
Sbjct: 298 GAVFRQYPAPWQVMLKQADGSYACVAESEARFTLGQAKEELL 339
>gi|219113845|ref|XP_002186506.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583356|gb|ACI65976.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 379
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 121/285 (42%), Gaps = 43/285 (15%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMT-GSMRLIC 132
+P PS + EL + A LA KDG KL+E+EFP + D ++ +++L
Sbjct: 67 LPPPSSFFELQQDCQRAVRLARKDGHKLLEVEFPPLPAAVLEMDDVSAYDVVQANLKLAL 126
Query: 133 EFCDLFVTPEK----VTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTE 188
+F + E+ + + + FP+ E FA + G+ + S G +
Sbjct: 127 DFSKGLLAGERDGSSLKKIALLFPDQAEADFAVEKA--GSINPYPGVVISSLLSSEGIDD 184
Query: 189 KVKMADRVKL--------------EDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLI 234
+ +++ L + +L+++ + E+ VE L+K+ T ++
Sbjct: 185 RYFKPEQIFLNLLGKREGSVKPVPDTDLYIILTA--SAQELPDVEALHKQEPDKT---IV 239
Query: 235 IFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN-----------FKGRNGGT 283
FN +LD +R G + + +PK + L V + +YY+ F G
Sbjct: 240 FFNLKLDVLR-GDFGAPAFPKKEFQDRFLSRV-KPVYYLRTRQYTRSTPKPPFMVNFQGC 297
Query: 284 LFRCYPGSWK-VLKKVSNKYICLHQQEVMPSL---KEVALDILPS 324
LFR YPG ++ +L + +Y L ++ P+L KE D L S
Sbjct: 298 LFRAYPGQYQTLLDTGTGRYRRLVGSDIRPALGAFKEQLTDDLKS 342
>gi|224123156|ref|XP_002319008.1| predicted protein [Populus trichocarpa]
gi|118487454|gb|ABK95555.1| unknown [Populus trichocarpa]
gi|222857384|gb|EEE94931.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 101/252 (40%), Gaps = 37/252 (14%)
Query: 73 DVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLIC 132
D PSD + Q+ A+ L V G +E L + D EG + I
Sbjct: 85 DSCLPSDLEGAVRQSSQASALFVSSGGMRAIVELLIPQLQFL--DDEGAQAELWELSRI- 141
Query: 133 EFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
F D + R + FP+A GA+ L YL K + F GF+ +
Sbjct: 142 -FLDTLIEETGSQRIKAIFPDA------------GAAALLKYLWKDAAF---GFS---SL 182
Query: 193 ADR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
+DR V++ED++ ++ P + + E V+++ + + LI++N L G +
Sbjct: 183 SDRKPVQIEDDIVVMVVPDYQMLEK--VQKIVSDLSDDPPRPLIMWNPRLISEDVGVGIN 240
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL---KKVSNKYICLHQ 307
+ L + T+Y + G +FRCYP WKV K N+Y+ +
Sbjct: 241 -----VRNLRRYFLSTFTTVYSMRPLPA---GAVFRCYPEMWKVFYDDKDRPNRYLLAQE 292
Query: 308 QEVMPSLKEVAL 319
P +E+ +
Sbjct: 293 SVRRPDAEEIEI 304
>gi|356576779|ref|XP_003556507.1| PREDICTED: uncharacterized protein LOC100782973 isoform 1 [Glycine
max]
Length = 340
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 109/271 (40%), Gaps = 49/271 (18%)
Query: 59 TNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDS 118
T + DG D PSD + Q+ AA + V G +E L + D
Sbjct: 76 TETSVEDGEED---DSCLPSDLEGAVRQSSEAAAIFVSSGGIRAIVELLIPQLQFL--DD 130
Query: 119 EGG-IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTK 177
EG +E+ R+ F D + K R + FP+A GA+ L Y K
Sbjct: 131 EGAQVELWELSRI---FLDTLIEKTKFQRVKAIFPDA------------GAAALLKYRWK 175
Query: 178 PSFFEDFGFTEKVKMADR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTA--WKL 233
+ F ++DR V+ +DE+ ++ P + +ML E + N L
Sbjct: 176 DALFR------FASLSDRKPVESDDEVVVMIVPDY---QMLEYVERIASNLSNIPEPTPL 226
Query: 234 IIFNGEL--DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGS 291
I++N L + + G+ + L + T+Y++ G +FRCYPG
Sbjct: 227 IMWNPRLISEDVGVGF-------NVRKLRRVFLSTFTTVYFMRPMPF---GAIFRCYPGL 276
Query: 292 WKVL---KKVSNKYICLHQQEVMPSLKEVAL 319
WKV K+ ++Y+ + E+ P +++ L
Sbjct: 277 WKVFSDDKERPDRYLLAKEFEIRPDAEDIEL 307
>gi|323447575|gb|EGB03491.1| hypothetical protein AURANDRAFT_34008 [Aureococcus anophagefferens]
Length = 300
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 103/254 (40%), Gaps = 30/254 (11%)
Query: 95 VKDGMKLMEIEFPTAGLDS--VPGDSEGGIEMTGS-MRLICEFCDLFVTPEKVTRTRIFF 151
V DG +ME+EFP D+ G S+ G +++ + +L +F F + R I +
Sbjct: 16 VDDGDVIMEVEFPPLPADTRAAKGCSDLGRDVSAANTKLAVKFAAAFAE-RRGKRVAIMY 74
Query: 152 PEANEVKFARKSVFE---GASFKLDYLTKP-----SFFEDF-GFTEKVKMADRVKLEDEL 202
P+ E++ A + KL L KP S + F GF K K + L D+
Sbjct: 75 PDTAELERAVEDSGTDEPAPGVKLHSLRKPFNEAESLDQAFLGFFGKGKKNIKA-LPDDA 133
Query: 203 FLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKLAALSKT 262
+ F+ E+ VE L + F +I+FN +LD R F PK L
Sbjct: 134 DVYVCLTFSAQELPDVEYLCELESFGKP--VILFNLKLDTQRGDLGLPAFPPK--DLQWR 189
Query: 263 LFPVMETIYYIHN-----------FKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVM 311
++ +YY+ F G +FRCYPG ++ L Y +
Sbjct: 190 FLSRVKPVYYLRTRQYSLSLPQPPFVVNYQGAIFRCYPGKYQCLLDTGKTYRAVDVSARR 249
Query: 312 PSLKEVALDILPSA 325
P+L E DIL A
Sbjct: 250 PALGEFK-DILTDA 262
>gi|302760013|ref|XP_002963429.1| hypothetical protein SELMODRAFT_166238 [Selaginella moellendorffii]
gi|300168697|gb|EFJ35300.1| hypothetical protein SELMODRAFT_166238 [Selaginella moellendorffii]
Length = 383
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 122/277 (44%), Gaps = 39/277 (14%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPT--AGLDSVPGDSEGGIEMTGSMRLICEF 134
P+ Y L++ A A+ +G +EIEFP + S G S+ I+ +++L F
Sbjct: 79 PASYDVLVNDVVDATFFALDEGNNRLEIEFPPLPNEISSYKGSSDDFID--ANIQLALAF 136
Query: 135 CDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDF----GFTEKV 190
+ + T+I FP+ E + A + VF A F L +D GF + +
Sbjct: 137 ANKLNAARGIV-TKIVFPDNVEKRRASR-VFRSA-FDLSKGISLGCLDDVPGGNGFLKAL 193
Query: 191 KMADRVKLEDEL---FLVAYP---YFNVN---EMLVVEELYKEAVFNTAWKLIIFNGELD 241
+ A + ++++ + + P Y VN L + Y +AV +A +++FN +LD
Sbjct: 194 RGAFELDFQEDVSGKWQTSSPPSMYVVVNCSGNELPDLQKYMDAVVGSA-SIVLFNLQLD 252
Query: 242 RIRS-----GY--------YPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCY 288
++RS G+ + S F P ++ + +I N+ G L R Y
Sbjct: 253 KLRSDLGLFGFPGKDLQYEFLSQFLPAFYIRTRDYSKNVPFAPFIVNYSG----ALLRRY 308
Query: 289 PGSWKVLKKVSNK-YICLHQQEVMPSLKEVALDILPS 324
PG W+V+ K +N Y C+ + +L + ++L S
Sbjct: 309 PGPWQVMIKQANGVYACVAENRQRFTLGQAKEELLRS 345
>gi|434395506|ref|YP_007130453.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
gi|428267347|gb|AFZ33293.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
Length = 243
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 80/179 (44%), Gaps = 30/179 (16%)
Query: 145 TRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFL 204
+ ++FFP+A AR+ D+ + P D G T +V + +V EDE FL
Sbjct: 57 NQLKVFFPDAGAAALARR----------DWGSVPFKILDIG-TGRVPVDQQVLPEDEAFL 105
Query: 205 VAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPSFFYPKLAALSK 261
+ P + E+ VE+L+ +AV + I+ N L+ + GY +
Sbjct: 106 LVAP--SAVEVAQVEKLH-QAVGERPF--ILLNPRLEDVSIVGIGYAGRQLRARF----- 155
Query: 262 TLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN-KYICLHQQEVMPSLKEVAL 319
+ TI ++ + +G +FRCYP W+V ++ + +Y + +Q P EV L
Sbjct: 156 -----LNTIESCYHLRPLDGAAVFRCYPSPWQVWQENKDGEYQLIAEQPKKPMGDEVDL 209
>gi|356576781|ref|XP_003556508.1| PREDICTED: uncharacterized protein LOC100782973 isoform 2 [Glycine
max]
Length = 326
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 64/269 (23%), Positives = 108/269 (40%), Gaps = 49/269 (18%)
Query: 59 TNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDS 118
T + DG D PSD + Q+ AA + V G +E L + D
Sbjct: 76 TETSVEDGEED---DSCLPSDLEGAVRQSSEAAAIFVSSGGIRAIVELLIPQLQFL--DD 130
Query: 119 EGG-IEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTK 177
EG +E+ R+ F D + K R + FP+A GA+ L Y K
Sbjct: 131 EGAQVELWELSRI---FLDTLIEKTKFQRVKAIFPDA------------GAAALLKYRWK 175
Query: 178 PSFFEDFGFTEKVKMADR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTA--WKL 233
+ F ++DR V+ +DE+ ++ P + +ML E + N L
Sbjct: 176 DALFR------FASLSDRKPVESDDEVVVMIVPDY---QMLEYVERIASNLSNIPEPTPL 226
Query: 234 IIFNGEL--DRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGS 291
I++N L + + G+ + L + T+Y++ G +FRCYPG
Sbjct: 227 IMWNPRLISEDVGVGF-------NVRKLRRVFLSTFTTVYFMRPMPF---GAIFRCYPGL 276
Query: 292 WKVL---KKVSNKYICLHQQEVMPSLKEV 317
WKV K+ ++Y+ + E+ P +++
Sbjct: 277 WKVFSDDKERPDRYLLAKEFEIRPDAEDI 305
>gi|427705929|ref|YP_007048306.1| hypothetical protein Nos7107_0483 [Nostoc sp. PCC 7107]
gi|427358434|gb|AFY41156.1| protein of unknown function DUF1995-containing protein [Nostoc sp.
PCC 7107]
Length = 244
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 49/220 (22%), Positives = 89/220 (40%), Gaps = 37/220 (16%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P + + QA+ A + A+ DG ++++ L +P + +F
Sbjct: 4 LPDTLEDAIAQAREATKAALADGYTRVQVDLLFPELKQMP--------------VAEQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R ++FF +A AR+ + A+F++ D G + +
Sbjct: 50 PLFAEYE--SRLKVFFADAGGAALARRDWVD-AAFQI---------LDIGTGRAASIQSK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
+K EDE+FL P + + +L K L++ N L+ SG + +
Sbjct: 98 IKPEDEIFLFVSP-----SAVEIPQLEKVCEIIGDRPLVMLNPRLED--SGTVGIGYAAR 150
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL 295
+ + +E+ YY+ +FRCYPG W+VL
Sbjct: 151 --QIRERFLNTIESCYYLRPVD--ENTAVFRCYPGQWEVL 186
>gi|332705285|ref|ZP_08425366.1| protein of unknown function, DUF1995 [Moorea producens 3L]
gi|332356028|gb|EGJ35487.1| protein of unknown function, DUF1995 [Moorea producens 3L]
Length = 251
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 44/178 (24%), Positives = 74/178 (41%), Gaps = 35/178 (19%)
Query: 148 RIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAY 207
+IFFP+ A + D+ P D G T + R++ +D++FL+
Sbjct: 62 KIFFPDTGASALALR----------DWGEVPFKITDVG-TSRSPAETRIEDDDQVFLLVG 110
Query: 208 PYFNVNEMLVVEELYKEAVFNTAWK--LIIFNGELDRIRS---GYYPSFFYPKLAALSKT 262
P VE E + N A +I N +L+ + GY L
Sbjct: 111 PS-------AVEVAQVEKICNLAGDRPCVILNPQLEDVSIVGIGYAAR-------QLRDR 156
Query: 263 LFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLKEVALD 320
+E+ YY+ F G G L+RCYP W+V ++ ++Y + ++ P+ + ALD
Sbjct: 157 FLKTLESCYYLRPFPG---GALWRCYPSMWQVWLEIDDEYQLVTEEPSKPTAE--ALD 209
>gi|298489954|ref|YP_003720131.1| hypothetical protein Aazo_0482 ['Nostoc azollae' 0708]
gi|298231872|gb|ADI63008.1| Domain of unknown function DUF1995 ['Nostoc azollae' 0708]
Length = 244
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 52/247 (21%), Positives = 100/247 (40%), Gaps = 37/247 (14%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P E + Q++ A + A+ DG+ ++++F L +P + +F
Sbjct: 4 LPKTLEEAITQSREAVKSALADGVTRIQVDFLFPELKFMP--------------VAEQFV 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R ++FF + ++ FK+ ED G + +
Sbjct: 50 PLFA--EYESRVKVFF-ADAGAAALARRDWQNVPFKV---------EDIGTGRAASLQTK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
++ EDE+FL P + V +L K +++ N L+ SG + +
Sbjct: 98 IQPEDEIFLFIAPT-----PVEVPQLEKMCEIIDTRPIVLLNPRLED--SGVVGIGYAAR 150
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLK 315
+ +E+ YY+ + LFRCYPG W+V + +N+Y + + PS
Sbjct: 151 --ETRRRFISTIESCYYLRPVDDESA--LFRCYPGQWEVWLESNNEYEKIAELPKRPSGD 206
Query: 316 EVALDIL 322
E+ + ++
Sbjct: 207 EIDMILM 213
>gi|425449243|ref|ZP_18829085.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389764162|emb|CCI09454.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 247
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPIE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE+FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEVFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|56752173|ref|YP_172874.1| hypothetical protein syc2164_c [Synechococcus elongatus PCC 6301]
gi|81300739|ref|YP_400947.1| hypothetical protein Synpcc7942_1930 [Synechococcus elongatus PCC
7942]
gi|56687132|dbj|BAD80354.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169620|gb|ABB57960.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 247
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 102/241 (42%), Gaps = 49/241 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
FP+D ++QAK AA+ A+ G+ ++++ L P + +F
Sbjct: 4 FPADLETAVEQAKEAAQRALAAGLTRIQVDLAIPELKIQP--------------IAWQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
F+ ++ + R+FFP+ AR+ D+ P F G E++K A
Sbjct: 50 AAFL--DRGAKLRVFFPDPGAAALARR----------DWGELP--FSVRGM-EELKAA-- 92
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPSFF 252
V+ E+ELF+ P + E+ +E+L +E +I+ N L+ + + GY
Sbjct: 93 VQPEEELFIFIEP--SSVEVQRLEQLCQEIGDRP---VILLNPRLEDVATIGIGYAAR-- 145
Query: 253 YPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++ YY+ + G +F+ YP W+V ++ Y L + + P
Sbjct: 146 -----QLRERFLNQWQSAYYLSPLE---GAAIFQAYPQRWQVWQETDTGYELLQEYDQRP 197
Query: 313 S 313
+
Sbjct: 198 N 198
>gi|299469765|emb|CBN76619.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 322
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 97/257 (37%), Gaps = 45/257 (17%)
Query: 75 PFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEM-TGSMRLICE 133
P PS + + + QA+ A E A +DG L+E+EFP D + ++ + ++RL
Sbjct: 21 PAPSTFEQCIRQAQGAVEDAFEDGFNLVEVEFPPLQQDYLEDSGSSAYDVSSANVRLASR 80
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKS-----VFEGASFK------------LDYLT 176
F F K I P+ E+ A + +G + + LD L
Sbjct: 81 FAQSFAAEGK--EVSILLPDEAELDQAADDEGGVEISKGVTLRTLRSSGKRTAATLDALF 138
Query: 177 KPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIF 236
G E ++ D VA F+ E+ ++EL K K++ F
Sbjct: 139 MSFVGRGTGVIEPIEGTD--------IYVAL-VFSCQELPDLQELNK---LVPDAKIVFF 186
Query: 237 NGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHN-----------FKGRNGGTLF 285
N LD +R F PK +L ++ +Y + F G F
Sbjct: 187 NLRLDTLRGDLGLPAFPPK--SLHYDFLSQIKPVYLLRTRAYSRTISKKPFLVNYQGAQF 244
Query: 286 RCYPGSWKVLKKVSNKY 302
R YPG ++ L V ++Y
Sbjct: 245 RVYPGEYQCLLDVGSRY 261
>gi|425455386|ref|ZP_18835106.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389803734|emb|CCI17368.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 247
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 97/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLAIESGARRVQVELVIPEIALQ--------------AQALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F LF + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSLFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCYLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|434407903|ref|YP_007150788.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
gi|428262158|gb|AFZ28108.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
Length = 244
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 55/247 (22%), Positives = 100/247 (40%), Gaps = 39/247 (15%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P + + Q++ A + A+ DG K ++I+F L +P + +F
Sbjct: 4 LPKTLEDAIAQSREATQAALADGCKRIQIDFLFPELKILP--------------VAEQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
+F E +R ++FF +A AR+ + A+F + D G + +
Sbjct: 50 PVFA--EYDSRLKVFFADAGGAALARRD-WADAAFSI---------VDIGTGRTASLQSK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
++ EDE+FL P + V +L K ++ N L+ SG + +
Sbjct: 98 IQPEDEIFLFIAPT-----SVEVPQLEKLCEIIGDRPVVFLNPRLED--SGVVGIGYTAR 150
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLK 315
+ +E+ YY+ +FR YPG W+V + +++Y + + PS
Sbjct: 151 --ETRRRFISTIESCYYLRPVDDETA--VFRSYPGQWEVWVETNDEYQRIAELPKRPSGD 206
Query: 316 EVALDIL 322
E LDIL
Sbjct: 207 E--LDIL 211
>gi|425446838|ref|ZP_18826837.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389732775|emb|CCI03345.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 247
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 97/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRLQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|443314784|ref|ZP_21044317.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
gi|442785626|gb|ELR95433.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
Length = 246
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 93/228 (40%), Gaps = 51/228 (22%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCD 136
PS ++QA+ A A++ G+ + ++ L P + +F
Sbjct: 7 PSSLEAAVNQARQATRAAIEAGVSRIVVDIAIPELKIQP--------------VAEQFYP 52
Query: 137 LFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRV 196
LF E + +++FP+A A++ D+ DF +M R+
Sbjct: 53 LF--EEMGLQFKVYFPDAGAAALAKR----------DWGNP-----DFSIRGINEMKGRL 95
Query: 197 KLEDELFLVAYPY-FNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPSFF 252
+ +DE FL+ P VN+ VE EA T +++ N +L+ I + GY
Sbjct: 96 EADDEAFLIIEPSSVEVND---VESFCNEA---TGRFVVMLNPKLEDIATIGIGYTGR-- 147
Query: 253 YPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSN 300
L + +ETIYY+ +G T+ R YPG W+V + ++
Sbjct: 148 -----QLRERFLSTLETIYYLQPLEG---ATILRAYPGLWQVWGETTD 187
>gi|425459538|ref|ZP_18839024.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389822715|emb|CCI29585.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 247
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 97/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|425472740|ref|ZP_18851581.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881081|emb|CCI38316.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 247
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE+FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEVFLVVCP--SSVEINSVEKLCYLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|440753103|ref|ZP_20932306.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
gi|440177596|gb|ELP56869.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
Length = 247
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 97/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|434386365|ref|YP_007096976.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
gi|428017355|gb|AFY93449.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
Length = 240
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 82/194 (42%), Gaps = 28/194 (14%)
Query: 127 SMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGF 186
SM L +F + F ++ ++ F + A++ ++G +FK+ ED G
Sbjct: 41 SMGLARQFLESF--SDRGNTLKVLFTDTGAAALAKRD-WQGVNFKI---------EDLG- 87
Query: 187 TEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSG 246
+ + D+++ EDEL++ P + E+ VE+L A +I+F L ++
Sbjct: 88 NSRSPITDKIQAEDELYIAIDP--SAVEVEQVEKLCNAAGDR---PVILF---LPKLEDA 139
Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLH 306
Y L + YYI K L+RCYP W+V ++ + YI L
Sbjct: 140 AIVGIGYAA-RQLRDRFLTTLTCAYYI---KPLEASALYRCYPAQWQVWQEQDDDYILLA 195
Query: 307 ---QQEVMPSLKEV 317
Q+ V L E+
Sbjct: 196 ECPQKPVGDELDEI 209
>gi|16329818|ref|NP_440546.1| hypothetical protein slr1702 [Synechocystis sp. PCC 6803]
gi|383321560|ref|YP_005382413.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324730|ref|YP_005385583.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490614|ref|YP_005408290.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435880|ref|YP_005650604.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|451813978|ref|YP_007450430.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
gi|1652303|dbj|BAA17226.1| slr1702 [Synechocystis sp. PCC 6803]
gi|339272912|dbj|BAK49399.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|359270879|dbj|BAL28398.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274050|dbj|BAL31568.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277220|dbj|BAL34737.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957706|dbj|BAM50946.1| hypothetical protein BEST7613_2015 [Synechocystis sp. PCC 6803]
gi|451779947|gb|AGF50916.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
Length = 251
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 51/237 (21%), Positives = 96/237 (40%), Gaps = 39/237 (16%)
Query: 72 VDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLI 131
+ +P PS E + +A A + A+ DG + +++E +VP E+ + I
Sbjct: 1 MSLPVPSTIDEAISEAIAATQRALDDGYRRIQVEL------AVP-------EIALQAQAI 47
Query: 132 C-EFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKV 190
EF LF ++ RI FP+ A++ D+ P D G +
Sbjct: 48 ALEFSQLFA--DQGLGLRIIFPDTGAAALAKR----------DWGEVPFQIGDLG-SRYT 94
Query: 191 KMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
+ ++ D++FLV P + E+ VE+L + A LI ++ + GY
Sbjct: 95 PIGRKIAESDQVFLVVSP--SAVEVQSVEKLCELAGDRPVVLLIPQLEDVSIVGIGYAAR 152
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQ 307
+ + T++ + F+ +G + R +P W+VL + +Y L +
Sbjct: 153 QLRQRF----------LSTLFSAYYFRPLDGAVVLRSHPSRWQVLLEKGEEYELLTE 199
>gi|425435402|ref|ZP_18815857.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389680066|emb|CCH91215.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 247
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 96/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIF--DGYGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSMWQVWLEKEDGYELISEQSTKP 202
>gi|443658098|ref|ZP_21132025.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
gi|159027702|emb|CAO89569.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333038|gb|ELS47616.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
Length = 247
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 97/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPVE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKDEDEAFLVVCP--SSVEINSVEKLCYLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + ++E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSILESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|425442993|ref|ZP_18823225.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715818|emb|CCH99873.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 247
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 96/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FASIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPIE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCYLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + +E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSTLESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|422304988|ref|ZP_16392325.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389789763|emb|CCI14274.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 247
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 95/239 (39%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FASIF--DGYGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPIE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + +E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSTLESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|440681570|ref|YP_007156365.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
gi|428678689|gb|AFZ57455.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
Length = 244
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 53/242 (21%), Positives = 96/242 (39%), Gaps = 37/242 (15%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P + + Q++ A + A+ DGM ++++F L +P + +F
Sbjct: 4 LPKTLEDAIAQSREAVKSALADGMTRIQVDFLFPELKFMP--------------IAEQFI 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R +IFF + + + FK+ ED G + +
Sbjct: 50 PLFA--EYESRLKIFF-ADAGAAALARRDWADSPFKV---------EDIGTGRTASLETK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
++ EDE+FL P + V +L K I+ N L+ SG + +
Sbjct: 98 IQPEDEIFLFIAPT-----SVEVPQLEKLCEIIGTRPFILLNPRLED--SGVVGIGYAAR 150
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLK 315
+ +E+ YY+ + LFRCYPG W+V + +++Y + + PS
Sbjct: 151 --ETRRRFISTIESCYYLRPVDDESA--LFRCYPGDWEVWLETNDEYQKIAELPKRPSGD 206
Query: 316 EV 317
E+
Sbjct: 207 EI 208
>gi|166367819|ref|YP_001660092.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|425464571|ref|ZP_18843881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|166090192|dbj|BAG04900.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|389833386|emb|CCI22146.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 247
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 96/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGDRRIQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPIE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + +E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSTLESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|356535083|ref|XP_003536078.1| PREDICTED: uncharacterized protein LOC100803954 [Glycine max]
Length = 344
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 99/256 (38%), Gaps = 44/256 (17%)
Query: 73 DVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLIC 132
D PSD + Q+ AA V G +E L + D EG + I
Sbjct: 91 DSCLPSDLEGAVRQSSEAAATFVSSGGMRAIVELLIPQLQFL--DDEGAQGELWELSRI- 147
Query: 133 EFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
F D + K R + FP+A GA+ L Y K + F +
Sbjct: 148 -FLDTLIEKTKSQRVKAIFPDA------------GAAALLKYRWKDALFR------FASL 188
Query: 193 ADR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTA--WKLIIFNGEL--DRIRSG 246
+DR V+ +DE+ ++ P + +ML E + N LI++N L + + G
Sbjct: 189 SDRKPVESDDEIVVMIVPDY---QMLEYVERIASNLTNVPEPTPLIMWNPRLISEDVGVG 245
Query: 247 YYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL---KKVSNKYI 303
+ + L + T+Y++ G +FRCYPG WKV K+ ++Y+
Sbjct: 246 F-------NVRKLRRFFLSTFTTVYFMRPMPF---GAIFRCYPGLWKVFSDDKERPDRYL 295
Query: 304 CLHQQEVMPSLKEVAL 319
+ E P ++ L
Sbjct: 296 LAKEFESRPDADDIEL 311
>gi|186682862|ref|YP_001866058.1| hypothetical protein Npun_R2561 [Nostoc punctiforme PCC 73102]
gi|186465314|gb|ACC81115.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 244
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 97/238 (40%), Gaps = 37/238 (15%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+ + + Q++ A + A+ DG +++EF L +P + +F
Sbjct: 4 LPNTLEDAIAQSRTAVQAALADGCTRIQVEFLFPELKFMP--------------VAEQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R +IFF +A AR+ ++ P D G + +
Sbjct: 50 PLFT--EYDSRLKIFFADAGAAALARR----------NWTDAPFQILDIGTGRTASLQTK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
++ EDE+FL P E+ VE+L +E ++ N L+ SG + +
Sbjct: 98 IQPEDEIFLFIAP--TSVEVPQVEKLCQEIGDR---PVVFLNPRLED--SGTVGIGYAAR 150
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPS 313
L T +E+ YY+ ++ L RCYPG W+V + +Y + + PS
Sbjct: 151 QTRLRFT--NTIESCYYLRPIDEQSA--LSRCYPGQWEVWLETDGEYQRIAELPTKPS 204
>gi|255542632|ref|XP_002512379.1| conserved hypothetical protein [Ricinus communis]
gi|223548340|gb|EEF49831.1| conserved hypothetical protein [Ricinus communis]
Length = 343
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 98/252 (38%), Gaps = 37/252 (14%)
Query: 73 DVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLIC 132
D PSD + Q+ A+ L V G +E L + D EG + I
Sbjct: 88 DSCLPSDLEGAVRQSSEASALFVSSGGMKAIVELLIPQLQFL--DEEGAQAELWELSRI- 144
Query: 133 EFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKM 192
F D + + + FP+A GA+ L Y K + FGF +
Sbjct: 145 -FLDTLIEETGCQKVKAIFPDA------------GAAALLKYQWKDA---TFGF---ASL 185
Query: 193 ADR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPS 250
+DR V+ EDE+ ++ P + + + VE++ + LI++N L G +
Sbjct: 186 SDRKPVENEDEIVVMVVPDYQM--LGYVEKIASNLSDDPPRPLIMWNPRLISEDVGVGIN 243
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL---KKVSNKYICLHQ 307
+ L + T+Y + G +FRCYPG WKV K N+Y+ +
Sbjct: 244 -----VRNLRRYFLSAFTTVYSMRPLPS---GAVFRCYPGMWKVFYDDKDRPNRYLLAKE 295
Query: 308 QEVMPSLKEVAL 319
P +E+ +
Sbjct: 296 LIRRPDAEELEI 307
>gi|168045792|ref|XP_001775360.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673305|gb|EDQ59830.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 404
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 104/270 (38%), Gaps = 43/270 (15%)
Query: 70 INVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMR 129
+ V V P+ Y L+ A + ++DG+K +EI+FP SV G E +
Sbjct: 91 LGVSVYKPASYETLVADAAKSLSYGLEDGLKRLEIDFPPLP-SSVSGYKGASDEFINANI 149
Query: 130 LICEFCDLFVTPEKVTRTRIFFPEANEVKFARKS------------------VFEGASFK 171
+ V + R+ FP+ E + A +S V GA
Sbjct: 150 QLALALARKVHELRGISCRLVFPDKPEKRKAVRSFGSAIEMTGCVSVGCLDDVPGGAGKS 209
Query: 172 LDYLTKPSFFEDFGFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAW 231
L + +F DF F E V+ E L +V + E+ VEE F
Sbjct: 210 LWGSVRNAF--DFDFGEDVEGKFESSQEPGLCIVL--NCSTAELPAVEEYVN--CFCKDT 263
Query: 232 KLIIFNGELDRIRS-----GYYPSFFYPKLAALSKTLFPVMETIY--------YIHNFKG 278
+++FN E D +R+ G+ P + + A +F V Y +I N+
Sbjct: 264 PVVLFNLETDTLRADLGLLGFPPKDLHYRFLAQFLPVFYVRIRDYSKSVNVAPFILNYS- 322
Query: 279 RNGGTLFRCYPGSWKV-LKKVSNKYICLHQ 307
G L R YPG W+V LK+ Y C+ +
Sbjct: 323 ---GALLRMYPGPWQVMLKQTDGSYACVAE 349
>gi|390438390|ref|ZP_10226864.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389838196|emb|CCI30988.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 247
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 96/239 (40%), Gaps = 41/239 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEF--PTAGLDSVPGDSEGGIEMTGSMRLICE 133
P+ E + QAK A +LA++ G + +++E P L + L +
Sbjct: 3 LPNSLEETILQAKAATQLALESGARCVQVELVIPEIALQA--------------QALALD 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F +F + + R+ FP+ AR+ E F+L D G + + +
Sbjct: 49 FTSIFDS--YGSGLRVIFPDTGAAMLARRDWGETV-FQLG---------DLG-SRFIPIE 95
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFY 253
+++K EDE FLV P + E+ VE+L A LI ++ + GY
Sbjct: 96 NKIKPEDEAFLVVCP--SSVEINSVEKLCNLAEDRPVVLLIPQLEDVSVVGIGYAAR--- 150
Query: 254 PKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + +E+ YY F+ ++R YP W+V + + Y + +Q P
Sbjct: 151 ----QLRERFLSTLESCYY---FRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKP 202
>gi|297812961|ref|XP_002874364.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
gi|297320201|gb|EFH50623.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
Length = 340
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 69/166 (41%), Gaps = 34/166 (20%)
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F D + R + FP+A GA+ L Y K + FGF ++
Sbjct: 145 FLDTLIKETGCERVKAVFPDA------------GAAALLKYRWKDA---TFGFA---SLS 186
Query: 194 DR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL--DRIRSGYYP 249
DR V+ EDE+ ++ P + + E VE++ K + LI++N L + + G+
Sbjct: 187 DRKPVEKEDEIIVMVVPDYQMLEY--VEKIAKGLADDPPRPLIMWNPRLISEEVGVGF-- 242
Query: 250 SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL 295
+ L + T+Y + G +FRCYPG WKV
Sbjct: 243 -----NVRKLRRYFLSSFTTVYSMRPLAA---GAVFRCYPGKWKVF 280
>gi|428306245|ref|YP_007143070.1| hypothetical protein Cri9333_2705 [Crinalium epipsammum PCC 9333]
gi|428247780|gb|AFZ13560.1| protein of unknown function DUF1995-containing protein [Crinalium
epipsammum PCC 9333]
Length = 248
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 51/237 (21%), Positives = 95/237 (40%), Gaps = 39/237 (16%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P E + QA+ A + A+ DG + +E L ++P S+ E+
Sbjct: 4 LPKTLEEAIAQAREATKTAIADGHTRLMVELVFPELKTMPVASQFIPELE---------- 53
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
E + ++FF +A AR+ D+ P D G + + + ++
Sbjct: 54 ------EYGSHLKVFFTDAGAAALARR----------DWGEVPFKISDLG-SSRSSVQNK 96
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPK 255
++ ED++FL+ P + E+ VE+L A L+ + + GY +
Sbjct: 97 IEEEDQIFLLIEP--SAVEIAQVEQLCNAAGDRPVILLVPRLEDAAVVGIGYAARQLRDR 154
Query: 256 LAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
KTL+ + YYI +G LFR +P W+V + ++ Y + ++ P
Sbjct: 155 FI---KTLY----SSYYIRPLEG---AALFRSHPSPWQVWLETNDDYNLIAEETQKP 201
>gi|427731149|ref|YP_007077386.1| hypothetical protein Nos7524_4017 [Nostoc sp. PCC 7524]
gi|427367068|gb|AFY49789.1| protein of unknown function (DUF1995) [Nostoc sp. PCC 7524]
Length = 246
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 96/245 (39%), Gaps = 43/245 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+ + + QA+ A + A+ DG +++E L +P + +F
Sbjct: 4 LPNTLEDAIAQAREATKAALADGYTRLQVELLFPELKFMP--------------VAEQFL 49
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
LF E +R +IFF +A AR+ D+ P D G + +
Sbjct: 50 PLFS--EYESRLKIFFADAGASALARR----------DWADVPFQILDIGTGRIASIQSK 97
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELD---RIRSGYYPSFF 252
V+ EDE+FL P E+ VE++ E + + L++ N L+ + GY
Sbjct: 98 VQPEDEIFLFVAP--TSVEVPQVEKIC-EIIGDRP--LVMLNPRLEDPGTVGIGY----- 147
Query: 253 YPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
+ +E+ YY+ +FRCYPG W++ + S + + P
Sbjct: 148 --AARQTRQRFISKIESCYYLRPVDDETA--VFRCYPGLWELWVENSGTWQKFAELPKKP 203
Query: 313 SLKEV 317
S E+
Sbjct: 204 SGDEI 208
>gi|303274516|ref|XP_003056577.1| hypothetical protein MICPUCDRAFT_55736 [Micromonas pusilla
CCMP1545]
gi|226462661|gb|EEH59953.1| hypothetical protein MICPUCDRAFT_55736 [Micromonas pusilla
CCMP1545]
Length = 371
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 72/290 (24%), Positives = 119/290 (41%), Gaps = 48/290 (16%)
Query: 67 NNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGD-------SE 119
N S V P+ Y ++ A + DG LME+EFP SVPG+ S+
Sbjct: 56 NRSDGRPVYSPNSYQDICHHAYQCVVDGLTDGYSLMEVEFP-----SVPGEDANYKAASD 110
Query: 120 GGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVF-------EGASFK- 171
I++ L F +++ K I P+ E + A K+VF EG +
Sbjct: 111 VYIDLNIQYALTI-FSEVYKETGKT--CEILLPDGTEYRRA-KNVFSNMLELSEGCTLNT 166
Query: 172 LDYLTK---PSFFEDF--GFTEKVKMADR-VKLEDELFLVAYPYFNVNEMLVVEELYKEA 225
LD +FFE+ G + + A+ + LE + A + ++ E+
Sbjct: 167 LDGKKTENVSTFFENLVEGAGLRTRAAEEDLNLEHHADIFAIVNLSTIDLPAAEQF---C 223
Query: 226 VFNTAWK-LIIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYY-----------I 273
+ T K L+ N ELD +R+ F +P + L + + IYY +
Sbjct: 224 ITKTCGKPLVFLNNELDTLRAD-LGLFSFPDKDTHYRFLSKI-KPIYYLRPRAYSRTISV 281
Query: 274 HNFKGRNGGTLFRCYPGSWKVL-KKVSNKYICLHQQEVMPSLKEVALDIL 322
F G LFR YP W+V+ K+ + + +C+ + E +L E ++L
Sbjct: 282 SPFVLNYSGALFREYPAPWQVMIKQNTGELVCVAEDEDRFTLGEAKEEML 331
>gi|307150883|ref|YP_003886267.1| hypothetical protein Cyan7822_0982 [Cyanothece sp. PCC 7822]
gi|306981111|gb|ADN12992.1| Domain of unknown function DUF1995 [Cyanothece sp. PCC 7822]
Length = 250
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 94/236 (39%), Gaps = 37/236 (15%)
Query: 77 PSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCD 136
P+ E + +AK A ++A++ G+ +++E +P E +E G L EF
Sbjct: 5 PASLEEAVVKAKEATQIALEAGLGRLQVEL------VIP---EIALEAQG---LALEFTS 52
Query: 137 LFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRV 196
LF + + ++ FP+ AR+ D+ P D G + + + +V
Sbjct: 53 LF--EQYGSGLKVLFPDTGAAALARR----------DWGETPFKISDLG-SRFIPVEMKV 99
Query: 197 KLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSFFYPKL 256
EDE FLV P E+ V +L + A LI ++ + GY
Sbjct: 100 TPEDEAFLVVCP--ASIEIQSVAKLCEIAEQRPVLLLIPQLEDVSIVGIGYTAR------ 151
Query: 257 AALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L +E+ YY F+ G + R YPG W+V + Y + ++ P
Sbjct: 152 -QLRDNFLSTLESCYY---FRPLEGAAVVRSYPGLWQVWLEKEQGYELIAEESQKP 203
>gi|428225033|ref|YP_007109130.1| hypothetical protein GEI7407_1587 [Geitlerinema sp. PCC 7407]
gi|427984934|gb|AFY66078.1| protein of unknown function DUF1995 [Geitlerinema sp. PCC 7407]
Length = 244
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 53/242 (21%), Positives = 98/242 (40%), Gaps = 50/242 (20%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIE--FPTAGLDSVPGDSEGGIEMTGSMRLICE 133
P D E + Q++ A + A+ DG +++E FP L ++ I E
Sbjct: 4 LPQDLDEAIAQSRTATQAALDDGHSRLQVELVFPEIDLQAIS---------------IAE 48
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
R +I FP+A A+ E KP +E G +E +
Sbjct: 49 LFAADFEARYGDRFKILFPDAGAAALAKHRWGE----------KP--YEIRGISE---LK 93
Query: 194 DRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPS 250
+++ +++ FL+ P + E+ VE+ +EA + +++ N L+ + + GY
Sbjct: 94 AQIQPDEQAFLMIAP--SSVEVGPVEKFCEEA---SDRPVVMVNPRLEDVATIGIGYAGR 148
Query: 251 FFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEV 310
L + + + YY+ F+G G L R YPG W+V + + Y + ++
Sbjct: 149 -------QLRERFLSTLLSCYYLRPFEG---GALRRSYPGPWEVWLETESGYEKVAEESQ 198
Query: 311 MP 312
P
Sbjct: 199 KP 200
>gi|402308913|ref|ZP_10827915.1| deoxyribose-phosphate aldolase [Prevotella sp. MSX73]
gi|400374492|gb|EJP27410.1| deoxyribose-phosphate aldolase [Prevotella sp. MSX73]
Length = 298
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 44 RIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMA-AELAVKDGMKLM 102
+ T Y ++ S+ DG +NV FPS S+ + K+A A LAVKDG +
Sbjct: 86 HVATICTYPNFAKLVSESLEVDGVQVVNVSGSFPS--SQTFIEVKVAEASLAVKDGATEI 143
Query: 103 EIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR- 161
+I P + GD EG + G + C C P KV P ++VK A
Sbjct: 144 DIVMPVGKY--LSGDYEGVADEIGEQKQACGEC-----PMKVILETGCLPSMSDVKKASI 196
Query: 162 KSVFEGASF 170
+++ GA +
Sbjct: 197 IAMYAGADY 205
>gi|67922775|ref|ZP_00516276.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|416392960|ref|ZP_11685949.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
gi|67855391|gb|EAM50649.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|357263546|gb|EHJ12537.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
Length = 246
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 99/251 (39%), Gaps = 43/251 (17%)
Query: 74 VPFPSDYSELLDQAKMAAELAVKDGMK--LMEIEFPTAGLDSVPGDSEGGIEMTGSMRLI 131
V P+ + + QAK AA++A++ G+ +EI P + + L
Sbjct: 2 VEVPNSLEQAVSQAKEAAKIALESGVGRICLEIVIPEIAIQA--------------QALA 47
Query: 132 CEFCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVK 191
EF LF + + ++ FP+ AR+ D+ P D G +
Sbjct: 48 LEFTTLFEGQQ--SGLKVIFPDTGAAALARR----------DWGETPFRVTDLG-SRATG 94
Query: 192 MADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRSGYYPSF 251
+ ++ DE+FL+ P + E+ VE+L + A LI ++ + GY
Sbjct: 95 IERKISEADEIFLLVCP--SSVEVETVEKLCELAGDRPVILLIPQLEDVSIVGIGYAAR- 151
Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVM 311
L +E++YY F+ + + R YP W V + + Y + +Q
Sbjct: 152 ------QLRDRFISTLESVYY---FRPLDDVVVLRSYPSPWLVFLEKEDGYELIAEQGQK 202
Query: 312 PSLKEVALDIL 322
P + ALDIL
Sbjct: 203 PMGE--ALDIL 211
>gi|288925941|ref|ZP_06419871.1| deoxyribose-phosphate aldolase [Prevotella buccae D17]
gi|315606905|ref|ZP_07881912.1| deoxyribose-phosphate aldolase [Prevotella buccae ATCC 33574]
gi|288337365|gb|EFC75721.1| deoxyribose-phosphate aldolase [Prevotella buccae D17]
gi|315251413|gb|EFU31395.1| deoxyribose-phosphate aldolase [Prevotella buccae ATCC 33574]
Length = 298
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 44 RIITFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMA-AELAVKDGMKLM 102
+ T Y ++ S+ DG +NV FPS S+ + K+A A LAVKDG +
Sbjct: 86 HVATICTYPNFAKLVSESLEVDGVQVVNVSGSFPS--SQTFIEVKVAEASLAVKDGATEI 143
Query: 103 EIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKVTRTRIFFPEANEVKFAR- 161
+I P + GD EG + G + C C P KV P ++VK A
Sbjct: 144 DIVMPVGKY--LSGDYEGVADEIGEQKQACGEC-----PMKVILETGCLPSMSDVKKASI 196
Query: 162 KSVFEGASF 170
+++ GA +
Sbjct: 197 IAMYAGADY 205
>gi|411117915|ref|ZP_11390296.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711639|gb|EKQ69145.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
Length = 241
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 43/185 (23%), Positives = 79/185 (42%), Gaps = 27/185 (14%)
Query: 137 LFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRV 196
L V E + ++FF +A AR+ + G+ + + + D T + +++
Sbjct: 49 LSVFEEYGSHLKVFFTDAGAAALARRE-WAGSPYNIQSI-------DIQGTRQTTPVEQL 100
Query: 197 KL-EDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPSFF 252
L ED +F+ P + E+ VVE++ A +I+FN ++ + GY
Sbjct: 101 VLSEDRVFVFVAP--SAVEVGVVEQIANAAGDR---PVILFNPRMEDVSVVGIGY----- 150
Query: 253 YPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMP 312
L + +E YY+ K G L RCYP W+V + S Y + ++ P
Sbjct: 151 --AARKLRERFLNTIEPCYYL---KPLEGSALIRCYPSLWQVWAETSEGYTLIAEETQKP 205
Query: 313 SLKEV 317
+L+ +
Sbjct: 206 TLERL 210
>gi|147798216|emb|CAN60540.1| hypothetical protein VITISV_018288 [Vitis vinifera]
Length = 367
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 267 METIYYIHNFKGRNGGTLFRCYPGSWKVL---KKVSNKYICLHQQEVMPSLKEVAL 319
+ T +++ + G +FRCYPGSWKV K N+Y+ +Q P +E+ +
Sbjct: 255 LSTFTVVYSMRPLPAGAVFRCYPGSWKVFYDDKDRPNRYLLAKEQISRPDAEELEI 310
>gi|255636951|gb|ACU18808.1| unknown [Glycine max]
Length = 198
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 80/193 (41%), Gaps = 40/193 (20%)
Query: 134 FCDLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMA 193
F D + K R + FP+A GA+ L Y K + F ++
Sbjct: 16 FLDTLIEKTKFQRVKAIFPDA------------GAAALLKYRWKDALFR------FASLS 57
Query: 194 DR--VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTA--WKLIIFNGEL--DRIRSGY 247
DR V+ +DE+ ++ P + +ML E + N LI++N L + + G+
Sbjct: 58 DRKPVESDDEVVVMIVPDY---QMLEYVERIASNLSNIPEPTPLIMWNPRLISEDVGVGF 114
Query: 248 YPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVL---KKVSNKYIC 304
+ L + T+Y++ G +FRCYPG WKV K+ ++Y+
Sbjct: 115 -------NVRKLRRVFLSTFTTVYFMRPMPF---GAIFRCYPGLWKVFSDDKERPDRYLL 164
Query: 305 LHQQEVMPSLKEV 317
+ E+ P +++
Sbjct: 165 AKEFEIRPDAEDI 177
>gi|18421131|ref|NP_568497.1| uncharacterized protein [Arabidopsis thaliana]
gi|13877993|gb|AAK44074.1|AF370259_1 unknown protein [Arabidopsis thaliana]
gi|17104721|gb|AAL34249.1| unknown protein [Arabidopsis thaliana]
gi|332006318|gb|AED93701.1| uncharacterized protein [Arabidopsis thaliana]
Length = 341
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 86/223 (38%), Gaps = 38/223 (17%)
Query: 76 FPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFC 135
P+D + Q+ A V G +E L + D EG + + F
Sbjct: 92 LPNDLEGAVRQSGEAGAAFVNSGGNRAIVELLIPQLQFL--DDEGAQAELWDLSRV--FL 147
Query: 136 DLFVTPEKVTRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADR 195
+ + R + FP+A GA+ L Y K + FGF ++DR
Sbjct: 148 ETLIKETGCERVKAVFPDA------------GAAALLKYRWKDA---TFGFA---SLSDR 189
Query: 196 --VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGEL--DRIRSGYYPSF 251
V+ EDE+ ++ P + + E VE++ + LI++N L + + G+
Sbjct: 190 KPVEKEDEIIVMVVPDYQMLEY--VEKIANGLADDPPRPLIMWNPRLISEEVGVGF---- 243
Query: 252 FYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV 294
+ L + T+Y + G +FRCYPG WKV
Sbjct: 244 ---NVRKLRRYFLSSFTTVYSMRPLAA---GAVFRCYPGKWKV 280
>gi|397642849|gb|EJK75496.1| hypothetical protein THAOC_02778, partial [Thalassiosira oceanica]
Length = 1317
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 15/120 (12%)
Query: 176 TKPSFFEDF--GFTEKVKMADRVKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKL 233
TK SFFEDF E ++ + K L + + Y+ + + +V + VF W
Sbjct: 518 TKASFFEDFKKNHGEPMRFFNDYKTLASLDGIDHTYYGLIDTVVAS---RGIVFAGTW-F 573
Query: 234 IIFNGELDRIRSGYYPSFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLF-RCYPGSW 292
F+G + R+R Y S +Y T F +E Y+HN+ G+L+ R YP W
Sbjct: 574 STFSGYITRLRGYYGMSKYY--------TYFSWLERKRYMHNWMDIWEGSLYAREYPVGW 625
>gi|18410256|ref|NP_565054.1| low PSII accumulation 3 protein [Arabidopsis thaliana]
gi|25082946|gb|AAN72020.1| Unknown protein [Arabidopsis thaliana]
gi|31711852|gb|AAP68282.1| At1g73060 [Arabidopsis thaliana]
gi|332197288|gb|AEE35409.1| low PSII accumulation 3 protein [Arabidopsis thaliana]
Length = 358
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 124/319 (38%), Gaps = 48/319 (15%)
Query: 40 FSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVD----VPF--PSDYSELLDQAKMAAEL 93
FS I+T R + NS + + S N D VP P Y L A +
Sbjct: 12 FSSPSILT---SRIRCGATANSGGAISSTSSNSDPRRGVPLYKPKSYEVLATDAANSLAF 68
Query: 94 AVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKV-TRTRIFFP 152
A++D +EI+FP + +++L EK+ TR I FP
Sbjct: 69 ALQDSKSRLEIDFPPLPSSISSYKGSSDDFIDANIQLAVTVVRKL--QEKIETRACIVFP 126
Query: 153 EANEVKFARKSVFEGASFKLDYLTKPSFFEDFG-----FTEKVKMADRVKLEDELFLVAY 207
+ E + A + F+ A +D ++ S + G F ++ EDE
Sbjct: 127 DKPEKRRASQR-FKAAFDSVDGISIGSLDDIPGTSVTNFFRSIRSTLDFDFEDENEGTWE 185
Query: 208 P------YFNVN----EMLVVEELYKEAVFNTAWKLIIFNGELDRIRS-----GYYP--- 249
P Y +N E+ +E+ + F ++ ++FN ELD +R+ G+ P
Sbjct: 186 PKEPPTLYIFINCSTRELSFIEKFVE--TFASSTPALLFNLELDTLRADLGLLGFPPKDL 243
Query: 250 -----SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNKYI 303
S F P ++ + ++ N+ G LFR YPG W+V LK+ +
Sbjct: 244 HYRFLSQFIPVFYIRTREYSKTVAVAPFVLNYNG----ALFRQYPGPWQVMLKQTDGSFA 299
Query: 304 CLHQQEVMPSLKEVALDIL 322
C+ + +L E ++L
Sbjct: 300 CVAESPTRFTLGETKEELL 318
>gi|255074893|ref|XP_002501121.1| predicted protein [Micromonas sp. RCC299]
gi|226516384|gb|ACO62379.1| predicted protein [Micromonas sp. RCC299]
Length = 553
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 117/308 (37%), Gaps = 95/308 (30%)
Query: 67 NNSINVDVPFPSDYSELLDQAKMAAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTG 126
+ S DV P +++++ A A A+ DG L+E++ P+ +V D+ +
Sbjct: 168 SQSPGRDVYEPESFAQMVAHAADAVRAAISDGQDLLEVQLPSTAA-TVDSDATQAV---- 222
Query: 127 SMRLICEFCDLFV---TPEKVT--RTRIFFPEANEVKFARKSVFEGASF-KLDYLTKPSF 180
++RL F D FV P RT + P+ E + AR ++FE +F K D T S
Sbjct: 223 NLRLAAAFGDDFVRRGNPRTGLPWRTHVLVPDRTEYERAR-AMFESEAFTKEDSGTAASS 281
Query: 181 FED-------FGFTEKVKMADRVKLEDEL------------------FLVAYPYFNVNEM 215
G +V + +LE L LVA +V E+
Sbjct: 282 VRGGVRGRVTIGTLAEVDTSLAGRLEQTLAGTLGAEEESLQNAMQADLLVAVNCSSV-EL 340
Query: 216 LVVEELYK----------------------EAVFNTAWK---LIIFNGELDRIRS----- 245
L + E YK +AV T+ + L++FN +LD +R
Sbjct: 341 LQI-EAYKATLLEGDGGRNEGPRDAYYSEEDAVARTSARVRPLVVFNCDLDDLRGDLGLV 399
Query: 246 GYYPS------------FFYPKLAALSKTLF------PVMETIYYIHNFKGRNGGTLFRC 287
G+ P FY + +KT + +YY GG LFR
Sbjct: 400 GFPPKALHARFLSRILPAFYVRRREYNKTFLGGKDGGGGVRQVYY--------GGALFRE 451
Query: 288 YPGSWKVL 295
YPG W+V+
Sbjct: 452 YPGPWQVM 459
>gi|255526474|ref|ZP_05393385.1| UvrD/REP helicase [Clostridium carboxidivorans P7]
gi|296184847|ref|ZP_06853258.1| putative ATP-dependent DNA helicase PcrA [Clostridium
carboxidivorans P7]
gi|255509856|gb|EET86185.1| UvrD/REP helicase [Clostridium carboxidivorans P7]
gi|296050629|gb|EFG90052.1| putative ATP-dependent DNA helicase PcrA [Clostridium
carboxidivorans P7]
Length = 754
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 3/88 (3%)
Query: 37 NENFSGQRII---TFSPYRRKHSCLTNSVSSDGNNSINVDVPFPSDYSELLDQAKMAAEL 93
N+N + + I +FS Y + + +TNSV+S N S+N +P S L + A L
Sbjct: 640 NDNMTARNTIKSKSFSTYSNRSTTITNSVTSHMNRSVNNSMPSSFGESSLNKNKENANSL 699
Query: 94 AVKDGMKLMEIEFPTAGLDSVPGDSEGG 121
++D ++++ G+ ++ G S+ G
Sbjct: 700 KIEDIKAGLKVKHDKFGIGTIVGVSKSG 727
>gi|428208370|ref|YP_007092723.1| hypothetical protein Chro_3395 [Chroococcidiopsis thermalis PCC
7203]
gi|428010291|gb|AFY88854.1| protein of unknown function DUF1995 [Chroococcidiopsis thermalis
PCC 7203]
Length = 272
Score = 38.5 bits (88), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 38/176 (21%), Positives = 75/176 (42%), Gaps = 31/176 (17%)
Query: 145 TRTRIFFPEANEVKFARKSVFEGASFKLDYLTKPSFFEDFGFTEKVKMADRVKLEDELFL 204
++ ++FFP+A AR+ D+ P D G + + ++++ EDE L
Sbjct: 85 SQLKVFFPDAGAAALARR----------DWGEVPFKIVDIG--TRSPLEEKIEPEDEALL 132
Query: 205 VAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS---GYYPSFFYPKLAALSK 261
P + E+ VE+L + A T ++ N L+ + GY L
Sbjct: 133 FISP--SAVEVERVEKLCELATCPT----VMLNPRLEDVAIVGIGYAGR-------QLRT 179
Query: 262 TLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKVLKKVSNKYICLHQQEVMPSLKEV 317
+E+ YY+ + ++FR YPG W++ +++ ++ + +Q P E+
Sbjct: 180 RFLNNIESCYYLRPIENI---SVFRSYPGEWQIWREIEEEFQLITEQPTKPMGDEI 232
>gi|397611168|gb|EJK61207.1| hypothetical protein THAOC_18346, partial [Thalassiosira oceanica]
Length = 336
Score = 37.7 bits (86), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 61/269 (22%), Positives = 115/269 (42%), Gaps = 42/269 (15%)
Query: 84 LDQAKM-AAELAVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMT-GSMRLICEFCDLFVTP 141
L QA + AA++A+ DG L+E+EFP + + D ++ ++ L +F F T
Sbjct: 23 LQQASVKAAQMAMDDGFGLLEVEFPPLPANVLEMDDVSAYDVAKANVNLALDFAKAFATT 82
Query: 142 EKVTRTRIFFPEANEVKFARKSV------FEGASFKLDYLTKPSFFEDFGFTEK---VKM 192
I P+ +E + R+ + F G + L L + +D F + + +
Sbjct: 83 GPKNNVAILLPDESECQIMREDLEMDSNPFPGVT--LTSLRRSEEGDDRVFKPENVLIGL 140
Query: 193 ADR-----VKLEDELFLVAYPYFNVNEMLVVEELY------KEAVFNTAWKLIIFNGELD 241
R VK ++ + + E+ VEELY K+ + ++ +N +LD
Sbjct: 141 LGRGSGGTVKPIEDTSMYIIIGASAQELPDVEELYEQIKDQKDEETGKSPVIVFYNLKLD 200
Query: 242 RIRSGY-YPSF----FYPKLAALSKTLFPVMETIY--------YIHNFKGRNGGTLFRCY 288
+R P+F F + + K ++ + Y +I NF+ G +FR Y
Sbjct: 201 ILRGDLGAPAFPSKEFQDRFLSRVKPVYYLRTRQYSRSISQPPFILNFQ----GCIFRSY 256
Query: 289 PGSWK-VLKKVSNKYICLHQQEVMPSLKE 316
PG ++ +L + +Y + ++ P+L E
Sbjct: 257 PGHYQTLLDTGTGRYRKVVGNDLRPALGE 285
>gi|21537091|gb|AAM61432.1| unknown [Arabidopsis thaliana]
Length = 358
Score = 37.4 bits (85), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 125/321 (38%), Gaps = 52/321 (16%)
Query: 40 FSGQRIITFSPYRRKHSCLTNSVSSDGNNSINVD----VPF--PSDYSELLDQAKMAAEL 93
FS I+T R + NS + + S N D VP P Y L A +
Sbjct: 12 FSSPSILT---SRIRCGATANSGGAISSTSSNSDPRRGVPLYKPKSYEVLATDAANSLAF 68
Query: 94 AVKDGMKLMEIEFPTAGLDSVPGDSEGGIEMTGSMRLICEFCDLFVTPEKV-TRTRIFFP 152
A++D +EI+FP + +++L EK+ TR I FP
Sbjct: 69 ALQDSKSRLEIDFPPLPSSISSYKGSSDDFIDANIQLAVTVVRKL--QEKIETRACIVFP 126
Query: 153 EANEVKFARKSVFEGASFKLDYLTKPS-----------FFE------DFGFTEKVKMADR 195
+ E + A + F+ A +D ++ S FF DF F + +
Sbjct: 127 DKPEKRRASQR-FKAAFDSVDGISIGSLDDIPGTSVTNFFRSIRSTLDFDFENENEGTWE 185
Query: 196 VKLEDELFLVAYPYFNVNEMLVVEELYKEAVFNTAWKLIIFNGELDRIRS-----GYYP- 249
K L++ + + E+ +E+ + F ++ ++FN ELD +R+ G+ P
Sbjct: 186 PKEPPTLYI--FINCSTRELSFIEKFVE--TFASSTPALLFNLELDTLRADLGLLGFPPK 241
Query: 250 -------SFFYPKLAALSKTLFPVMETIYYIHNFKGRNGGTLFRCYPGSWKV-LKKVSNK 301
S F P ++ + ++ N+ G LFR YPG W+V LK+
Sbjct: 242 DLHYRFLSQFIPVFYIRTREYSKTVAVAPFVLNYNG----ALFRQYPGPWQVMLKQTDGS 297
Query: 302 YICLHQQEVMPSLKEVALDIL 322
+ C+ + +L E ++L
Sbjct: 298 FACVAESPTRFTLGETKEELL 318
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.137 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,222,770,028
Number of Sequences: 23463169
Number of extensions: 217823793
Number of successful extensions: 473581
Number of sequences better than 100.0: 146
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 102
Number of HSP's that attempted gapping in prelim test: 473374
Number of HSP's gapped (non-prelim): 161
length of query: 325
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 183
effective length of database: 9,027,425,369
effective search space: 1652018842527
effective search space used: 1652018842527
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)