BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047235
(276 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449436415|ref|XP_004135988.1| PREDICTED: uncharacterized protein LOC101220175 [Cucumis sativus]
Length = 267
Score = 364 bits (935), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 171/262 (65%), Positives = 219/262 (83%), Gaps = 13/262 (4%)
Query: 26 LHPQLQSP---RFSVLRSSIQPQP--------QAPPIKRESDTSRTEYKPGVLDDLFLSS 74
LHP SP RF+ +S ++ +P Q+ IK + +S+ EYKPG+LDD FL+
Sbjct: 8 LHPPTISPAPLRFTSSKSHLRNRPFILSCSALQSGSIK-DGASSKAEYKPGILDDFFLNV 66
Query: 75 FRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYK 134
FR+K+VQEVG DSEKPGYDGLIE+ + L M GK++S+ +A+ +RIL++LFPPL+LKLY+
Sbjct: 67 FRSKMVQEVGWDSEKPGYDGLIEVASRLTMTGKTNSETIEAS-VRILIALFPPLLLKLYR 125
Query: 135 ILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESK 194
IL+SP+AGGK+AA+MVARVTALTCQWLMG CTVNS++LPDG+SCQSGVFVE+CKYLEESK
Sbjct: 126 ILVSPIAGGKVAAIMVARVTALTCQWLMGTCTVNSIELPDGSSCQSGVFVEKCKYLEESK 185
Query: 195 CVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
C+G+CINTCKLPTQ+FFKD MG+PLLMEPNF+DYSCQFKFG+LPPLP++D+ LKEPCL+I
Sbjct: 186 CIGICINTCKLPTQSFFKDQMGIPLLMEPNFTDYSCQFKFGVLPPLPEEDSILKEPCLEI 245
Query: 255 CPTSSRRKEVAMNSNVEQCPKA 276
CP ++RR+EV+ + QCPKA
Sbjct: 246 CPNATRRREVSGKISAAQCPKA 267
>gi|449507826|ref|XP_004163139.1| PREDICTED: uncharacterized LOC101220175 [Cucumis sativus]
Length = 267
Score = 362 bits (930), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 170/262 (64%), Positives = 218/262 (83%), Gaps = 13/262 (4%)
Query: 26 LHPQLQSPR---FSVLRSSIQPQP--------QAPPIKRESDTSRTEYKPGVLDDLFLSS 74
LHP SP F+ +S ++ +P Q+ IK + +S+ EYKPG+LDD FL+
Sbjct: 8 LHPPTISPAPLCFTSSKSHLRNRPFILSCSALQSGSIK-DGASSKAEYKPGILDDFFLNV 66
Query: 75 FRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYK 134
FR+K+VQEVG DSEKPGYDGLIE+ + L M GK++S+ +A+ +RIL++LFPPL+LKLY+
Sbjct: 67 FRSKMVQEVGWDSEKPGYDGLIEVASRLTMTGKTNSETIEAS-VRILIALFPPLLLKLYR 125
Query: 135 ILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESK 194
IL+SP+AGGK+AA+MVARVTALTCQWLMG CTVNS++LPDG+SCQSGVFVE+CKYLEESK
Sbjct: 126 ILVSPIAGGKVAAIMVARVTALTCQWLMGTCTVNSIELPDGSSCQSGVFVEKCKYLEESK 185
Query: 195 CVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
C+G+CINTCKLPTQ+FFKD MG+PLLMEPNF+DYSCQFKFG+LPPLP++D+ LKEPCL+I
Sbjct: 186 CIGICINTCKLPTQSFFKDQMGIPLLMEPNFTDYSCQFKFGVLPPLPEEDSILKEPCLEI 245
Query: 255 CPTSSRRKEVAMNSNVEQCPKA 276
CP ++RR+EV+ + QCPKA
Sbjct: 246 CPNATRRREVSGKISAAQCPKA 267
>gi|225437593|ref|XP_002271003.1| PREDICTED: uncharacterized protein LOC100253777 [Vitis vinifera]
gi|297743993|emb|CBI36963.3| unnamed protein product [Vitis vinifera]
Length = 261
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 172/274 (62%), Positives = 216/274 (78%), Gaps = 22/274 (8%)
Query: 6 LRPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPG 65
LRP ++FS SPP+S +++P F + SS Q + EYKPG
Sbjct: 7 LRP--LTFSPSPPQSKL------SVKNPSFRISFSSAQ----------SNAVEAGEYKPG 48
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLF 125
V DDLFL+ FR+++V+EVG DSEKPGYDGLI++ N LMMK KS+S ++AA +RIL+SLF
Sbjct: 49 VFDDLFLNLFRSRMVKEVGWDSEKPGYDGLIDVANQLMMKSKSNSKVKEAA-VRILISLF 107
Query: 126 PPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVE 185
PP +L LY++L++P+ GGK+AAMMVARVTAL+CQWLMG CTVNSV+LPDG+SC SGVFVE
Sbjct: 108 PPFLLDLYRMLVAPIGGGKVAAMMVARVTALSCQWLMGPCTVNSVNLPDGSSCSSGVFVE 167
Query: 186 RCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDT 245
RCKYLEESKCVG+CINTCKLPTQTFFKDYMGVPL MEP+F++YSCQF FG+LPP P++D+
Sbjct: 168 RCKYLEESKCVGICINTCKLPTQTFFKDYMGVPLAMEPDFTNYSCQFSFGVLPPRPEEDS 227
Query: 246 TLKEPCLDICPTSSRRKEVAMNSN---VEQCPKA 276
TLKEPCL+ICP ++RRKE+ N + + QCPKA
Sbjct: 228 TLKEPCLEICPNATRRKEINRNMDNKELVQCPKA 261
>gi|357442215|ref|XP_003591385.1| hypothetical protein MTR_1g086840 [Medicago truncatula]
gi|355480433|gb|AES61636.1| hypothetical protein MTR_1g086840 [Medicago truncatula]
Length = 260
Score = 340 bits (873), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 156/217 (71%), Positives = 187/217 (86%), Gaps = 1/217 (0%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
++EYKPGV+DDLFL+ FR KLVQEVG +S+KPGYDGLIE+ N LMMKG ++SD +A +
Sbjct: 44 KSEYKPGVIDDLFLNLFRTKLVQEVGWESKKPGYDGLIEVANRLMMKGTTNSDTIEAT-V 102
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
RIL SLFPP +L+LYK+LI+P+ GGK+AA+MVARVTALTCQWLMG C VNSV+LP+GTS
Sbjct: 103 RILRSLFPPFLLELYKMLIAPIGGGKVAAIMVARVTALTCQWLMGPCKVNSVELPNGTSW 162
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV VERCKYLEESKCVG+C+NTCK PTQTFFKD+MGVPLLM+PNF+DYSCQFKFG+LP
Sbjct: 163 NSGVHVERCKYLEESKCVGICLNTCKFPTQTFFKDHMGVPLLMKPNFADYSCQFKFGVLP 222
Query: 239 PLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPK 275
PLP+DDT LKEPCL+ CP +S R+ + N V CPK
Sbjct: 223 PLPEDDTVLKEPCLEACPNASLRRMASRNKGVTACPK 259
>gi|363807932|ref|NP_001242708.1| uncharacterized protein LOC100788939 [Glycine max]
gi|255647168|gb|ACU24052.1| unknown [Glycine max]
Length = 264
Score = 339 bits (869), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 158/217 (72%), Positives = 185/217 (85%), Gaps = 3/217 (1%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
++EYKPGV DDLFL+ FRNKLVQEVG DSEKPGYDGLIE+ N LMMKG +++ +AA +
Sbjct: 50 KSEYKPGVFDDLFLNLFRNKLVQEVGWDSEKPGYDGLIEVANRLMMKGTTNTATVEAA-V 108
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
RIL SLFPP +L+LYK+LI P+ GGKIAAMMVARVT LTCQWLMG C +NSVDLPDG SC
Sbjct: 109 RILRSLFPPYLLELYKMLIVPIGGGKIAAMMVARVTVLTCQWLMGPCKLNSVDLPDGISC 168
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV+VERCKYLEESKCVG+C NTCK PTQ+FFKD+MGVPLLMEPNF DYSCQFKFG+LP
Sbjct: 169 SSGVYVERCKYLEESKCVGICTNTCKFPTQSFFKDHMGVPLLMEPNFGDYSCQFKFGVLP 228
Query: 239 PLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPK 275
PL DDT +K+PCL+ CP +S+R+ VA N ++ CPK
Sbjct: 229 PL--DDTIVKDPCLEACPNASQRRTVARNIDITACPK 263
>gi|255548419|ref|XP_002515266.1| conserved hypothetical protein [Ricinus communis]
gi|223545746|gb|EEF47250.1| conserved hypothetical protein [Ricinus communis]
Length = 282
Score = 337 bits (863), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 179/271 (66%), Positives = 216/271 (79%), Gaps = 14/271 (5%)
Query: 6 LRPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPG 65
LRP +S S SPP +P+L + F + SS+Q +P+ K E +R+EYKPG
Sbjct: 26 LRP--LSISVSPPSRLKLPRLFNR----SFRISCSSLQSEPE----KTEDVGTRSEYKPG 75
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLF 125
DD FL+ FRNK+V EVG DSEK GYDGLIE+ N LM+ G S++D RDAA +RIL SLF
Sbjct: 76 FFDDFFLTLFRNKMVAEVGWDSEKAGYDGLIEVANRLMLTGTSNADTRDAA-VRILRSLF 134
Query: 126 PPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVE 185
PPL+L LYK+LISPL GK+AA+MVARVTA+TCQWLMG CTVNS+DLPDG+SC+SGVFVE
Sbjct: 135 PPLLLDLYKLLISPLGEGKVAAIMVARVTAITCQWLMGTCTVNSIDLPDGSSCESGVFVE 194
Query: 186 RCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDT 245
RCKYLEESKCVG+C+NTCKLPTQ FFKDYMGVPLLMEPNF+DYSCQFKFG+LPP P+DD+
Sbjct: 195 RCKYLEESKCVGICVNTCKLPTQAFFKDYMGVPLLMEPNFTDYSCQFKFGVLPPQPEDDS 254
Query: 246 TLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
TLKEPCL+ CP +SRR+ ++ N+ CPKA
Sbjct: 255 TLKEPCLEACPIASRRQ---VSLNIAHCPKA 282
>gi|224068582|ref|XP_002302776.1| predicted protein [Populus trichocarpa]
gi|222844502|gb|EEE82049.1| predicted protein [Populus trichocarpa]
Length = 200
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 147/202 (72%), Positives = 175/202 (86%), Gaps = 2/202 (0%)
Query: 61 EYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRI 120
EY+ DD FL FRNK+V+EVG DSEK GYDGLIE+ + LM++ ++ SD DAA +RI
Sbjct: 1 EYRYQFYDDWFLDLFRNKMVKEVGWDSEKAGYDGLIEVASRLMLR-RTPSDTTDAA-VRI 58
Query: 121 LVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQS 180
L SLFPP +L LYK L+SP+ GGK+AAMMVARVT +TCQWLMG C VNSVDLPDG+S +S
Sbjct: 59 LRSLFPPFLLHLYKSLVSPIGGGKLAAMMVARVTVITCQWLMGICKVNSVDLPDGSSWES 118
Query: 181 GVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPL 240
GVFVERCKYLEESKCVG+C+NTCKLPTQTFFKDYMG+PLLMEPNF+DYSCQFKFG+LPPL
Sbjct: 119 GVFVERCKYLEESKCVGICVNTCKLPTQTFFKDYMGIPLLMEPNFNDYSCQFKFGVLPPL 178
Query: 241 PKDDTTLKEPCLDICPTSSRRK 262
P+DD TLKEPCL++CP +S+R+
Sbjct: 179 PEDDGTLKEPCLEVCPIASKRR 200
>gi|351720764|ref|NP_001237955.1| uncharacterized protein LOC100499918 [Glycine max]
gi|255627671|gb|ACU14180.1| unknown [Glycine max]
Length = 265
Score = 314 bits (804), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 150/217 (69%), Positives = 174/217 (80%), Gaps = 3/217 (1%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+++YKPGV DDLFL FRNKLVQEVG DS+K GYDGLIE+ N LMMKG ++SD +AA +
Sbjct: 51 KSDYKPGVFDDLFLKLFRNKLVQEVGWDSKKAGYDGLIEVANRLMMKGTTNSDTVEAA-V 109
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
RIL SLFPP +L+LYK+LI+P+ GGKIAAMMVARVT LTCQWLMG C VNSVDLPDGTSC
Sbjct: 110 RILRSLFPPYLLELYKMLIAPIGGGKIAAMMVARVTVLTCQWLMGPCKVNSVDLPDGTSC 169
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV+VERCKYLEESKCVG+C +TCK PTQTFFKD+MGVPLLMEPNF+DYSCQFKFG+LP
Sbjct: 170 SSGVYVERCKYLEESKCVGICTHTCKFPTQTFFKDHMGVPLLMEPNFADYSCQFKFGVLP 229
Query: 239 PLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPK 275
P+DDT +K L N ++ CPK
Sbjct: 230 --PRDDTIVKGTLLGSMSKCKTTTNGCQNIDITACPK 264
>gi|242089393|ref|XP_002440529.1| hypothetical protein SORBIDRAFT_09g002570 [Sorghum bicolor]
gi|241945814|gb|EES18959.1| hypothetical protein SORBIDRAFT_09g002570 [Sorghum bicolor]
Length = 277
Score = 300 bits (769), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 141/217 (64%), Positives = 174/217 (80%), Gaps = 3/217 (1%)
Query: 61 EYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRI 120
EY+P DDL L+ FR+K+V+EVG DSEKPGY GL+E+ N LM+KGKS+ + AA +R+
Sbjct: 61 EYRPSFADDLLLAFFRSKMVEEVGWDSEKPGYAGLMEVANRLMVKGKSAMETEQAA-VRV 119
Query: 121 LVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQS 180
L SLFPP++L LYK L+SP+A G++AAMM+AR TAL+CQWLMG C+VNSV LPDG S S
Sbjct: 120 LQSLFPPVLLVLYKALLSPIANGQLAAMMLARATALSCQWLMGPCSVNSVTLPDGKSWSS 179
Query: 181 GVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPL 240
GVFVE+CKYLEESKC+G+CINTCKLPTQTFFKD+MGV L MEPNF DYSCQF FG+ PP
Sbjct: 180 GVFVEKCKYLEESKCLGICINTCKLPTQTFFKDHMGVDLYMEPNFEDYSCQFNFGVPPPP 239
Query: 241 PKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQ--CPK 275
D LKEPCLDIC + RR+E+ NS+ ++ CP+
Sbjct: 240 LDTDKALKEPCLDICTNARRRRELGRNSSPDELSCPQ 276
>gi|297814129|ref|XP_002874948.1| hypothetical protein ARALYDRAFT_912030 [Arabidopsis lyrata subsp.
lyrata]
gi|297320785|gb|EFH51207.1| hypothetical protein ARALYDRAFT_912030 [Arabidopsis lyrata subsp.
lyrata]
Length = 263
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 145/209 (69%), Positives = 173/209 (82%), Gaps = 3/209 (1%)
Query: 54 ESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDAR 113
+ S+ EYKPG+LDD F+ SFRNKLV+EVG DSEKPGY GLIELV L++KG++ S+
Sbjct: 49 DEGASKLEYKPGLLDDFFMQSFRNKLVEEVGSDSEKPGYVGLIELVKLLLLKGRTRSETS 108
Query: 114 DAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLP 173
DAA +RIL SLFPPL+L+LYK+LI+P+A GK+AA+MVARVT LTCQWLMG VN +DLP
Sbjct: 109 DAA-VRILKSLFPPLILELYKLLIAPIAQGKLAALMVARVTVLTCQWLMGPSKVNIIDLP 167
Query: 174 DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFK 233
+G S SGVFVE+C+YLEESKCVGVCINTCKLPTQTFFKDYMGVPL+MEPNF DYSCQFK
Sbjct: 168 NGESWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFKDYMGVPLVMEPNFKDYSCQFK 227
Query: 234 FGILPPLPKDDTTLKEPCLDICPTSSRRK 262
FG+ P P+DD + EPC + C + RRK
Sbjct: 228 FGVAP--PEDDGNVNEPCFETCSIAGRRK 254
>gi|22328234|ref|NP_680560.1| uncharacterized protein [Arabidopsis thaliana]
gi|17065174|gb|AAL32741.1| Unknown protein [Arabidopsis thaliana]
gi|20259952|gb|AAM13323.1| unknown protein [Arabidopsis thaliana]
gi|332656709|gb|AEE82109.1| uncharacterized protein [Arabidopsis thaliana]
Length = 258
Score = 293 bits (750), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 145/212 (68%), Positives = 173/212 (81%), Gaps = 3/212 (1%)
Query: 51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSS 110
+K + + EYKPG LDD F+ SFRNKLV+EVG DSEKPGY GLIELV L++KG++ S
Sbjct: 41 VKSDEGAPKLEYKPGPLDDFFMQSFRNKLVEEVGSDSEKPGYVGLIELVKLLLLKGRTRS 100
Query: 111 DARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV 170
+ DAA +RIL SLFPPL+L+LYK+LI+P+A GK+AA+MVARVT LTCQWLMG VN +
Sbjct: 101 ETSDAA-VRILKSLFPPLILELYKLLIAPIAQGKLAALMVARVTVLTCQWLMGPSKVNII 159
Query: 171 DLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSC 230
DLP+G S SGVFVE+C+YLEESKCVGVCINTCKLPTQTFFKDYMGVPL+MEPNF DYSC
Sbjct: 160 DLPNGESWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFKDYMGVPLVMEPNFKDYSC 219
Query: 231 QFKFGILPPLPKDDTTLKEPCLDICPTSSRRK 262
QFKFG+ P P+DD + EPC + C + RRK
Sbjct: 220 QFKFGVAP--PEDDGNVNEPCFETCSIAGRRK 249
>gi|224036007|gb|ACN37079.1| unknown [Zea mays]
gi|413917538|gb|AFW57470.1| hypothetical protein ZEAMMB73_233894 [Zea mays]
Length = 277
Score = 285 bits (729), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 140/217 (64%), Positives = 174/217 (80%), Gaps = 3/217 (1%)
Query: 61 EYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRI 120
EY+P DDL L+ FR+K+V+EVG DSEKPGY GL+E+ N LM+KGKS+ + AA +R+
Sbjct: 61 EYRPSFADDLLLAFFRSKMVKEVGWDSEKPGYAGLMEVANRLMVKGKSALETEQAA-VRV 119
Query: 121 LVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQS 180
L SLFPPL+L LYK L++P+A G++AAMM+AR TA++CQWLMG C+VNSV LPDG S S
Sbjct: 120 LQSLFPPLLLVLYKALLAPIANGQLAAMMLARATAISCQWLMGSCSVNSVTLPDGKSWSS 179
Query: 181 GVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPL 240
GVFVE+CKYLEESKC+G+CINTCKLPTQTFFKD+MGV L MEPNF DYSCQF FG+ PP
Sbjct: 180 GVFVEKCKYLEESKCLGICINTCKLPTQTFFKDHMGVDLYMEPNFEDYSCQFNFGVPPPP 239
Query: 241 PKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQ--CPK 275
D LKEPCLDIC + RR+E+ NS+ ++ CP+
Sbjct: 240 LDTDKALKEPCLDICTNARRRRELGRNSSPDELSCPQ 276
>gi|115461907|ref|NP_001054553.1| Os05g0131100 [Oryza sativa Japonica Group]
gi|52353659|gb|AAU44225.1| unknown protein [Oryza sativa Japonica Group]
gi|113578104|dbj|BAF16467.1| Os05g0131100 [Oryza sativa Japonica Group]
gi|215695516|dbj|BAG90707.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 270
Score = 270 bits (691), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 145/245 (59%), Positives = 179/245 (73%), Gaps = 7/245 (2%)
Query: 33 PRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGY 92
P S+LR S A P S EY+P DD L+ FR K+V+EVG DSEKPGY
Sbjct: 30 PTTSLLRCSSPSADAASP----SGEGGREYEPSFADDFLLAFFRAKMVEEVGWDSEKPGY 85
Query: 93 DGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVAR 152
+GLIE+ N LM+KGKS+ + +A +R+L SLFPPL+L L+K L++P+A G++A+MMVAR
Sbjct: 86 NGLIEVANRLMIKGKSALETEQSA-VRVLRSLFPPLLLVLFKALLAPIANGQLASMMVAR 144
Query: 153 VTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFK 212
TAL+CQWLMG C +NS+ L +G S SGVFVE+CKYLEESKC+GVCINTCKLPTQTFFK
Sbjct: 145 ATALSCQWLMGPCLLNSITLSNGKSLSSGVFVEKCKYLEESKCLGVCINTCKLPTQTFFK 204
Query: 213 DYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVE- 271
D+MGV L MEPNF DYSCQF FG+ PP D LKEPCLDIC + RRKE+ S+ +
Sbjct: 205 DHMGVDLYMEPNFEDYSCQFNFGVSPPPLDTDKALKEPCLDICTNARRRKELGTGSSTDG 264
Query: 272 -QCPK 275
QCP+
Sbjct: 265 LQCPQ 269
>gi|326534116|dbj|BAJ89408.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 278
Score = 270 bits (689), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 141/249 (56%), Positives = 184/249 (73%), Gaps = 2/249 (0%)
Query: 28 PQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDS 87
P +Q PR S + +PP++ +Y+P DDL L+ FR K+V+EVG DS
Sbjct: 30 PSVQGPRRRPPASPSRLYCSSPPVEAPPSGKGGDYRPSFADDLLLAFFRAKMVEEVGWDS 89
Query: 88 EKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAA 147
+KPGY+GLIE+ N LM+KGKS+S+ +A +R+L +LFPPL+L L+K L++P+A G++A+
Sbjct: 90 QKPGYEGLIEVANRLMIKGKSASETEQSA-VRVLQALFPPLLLVLFKALLAPIANGQLAS 148
Query: 148 MMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPT 207
MMVAR TAL+CQWLMG +VNSV LP G S SGVFVE+CKYLEESKC+G+CINTCKLPT
Sbjct: 149 MMVARATALSCQWLMGTSSVNSVTLPSGKSLSSGVFVEKCKYLEESKCLGICINTCKLPT 208
Query: 208 QTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMN 267
QTFFKD+MGV L MEPNF DYSCQF FG+ PP D LKEPCLDIC ++ RR+E+ +
Sbjct: 209 QTFFKDHMGVDLYMEPNFEDYSCQFNFGVPPPPIDTDKALKEPCLDICTSARRRRELGSS 268
Query: 268 SNVEQ-CPK 275
+ + CP+
Sbjct: 269 GSPDGLCPQ 277
>gi|116783256|gb|ABK22858.1| unknown [Picea sitchensis]
Length = 276
Score = 266 bits (680), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 133/223 (59%), Positives = 168/223 (75%), Gaps = 1/223 (0%)
Query: 54 ESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDAR 113
E+ ++ Y+PG LD++FL FR K+ +EVG DS KPGYDGLIE+ N LM K ++ D
Sbjct: 55 ENGATQLNYEPGPLDNIFLFLFRKKMAKEVGWDSNKPGYDGLIEVANCLMTKYRNKLDT- 113
Query: 114 DAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLP 173
+ A +RIL SLFPP +L L++ LI+PLA GK+AA+M ARVTA TCQWLMG TVN +DLP
Sbjct: 114 EQATVRILRSLFPPFLLLLFRKLITPLAEGKLAAIMTARVTAATCQWLMGRSTVNCIDLP 173
Query: 174 DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFK 233
DG+SC SGV VE+C+YLE SKC G+CI+TCKLPTQTF K+YMG+PLLMEPNF+D+SCQFK
Sbjct: 174 DGSSCNSGVLVEKCQYLEASKCAGICIHTCKLPTQTFIKEYMGIPLLMEPNFNDFSCQFK 233
Query: 234 FGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
FG+ DD +L PCL+ICP +RK +V+QCPK
Sbjct: 234 FGVEALPTCDDKSLHVPCLEICPNDVKRKGYQNRLDVQQCPKV 276
>gi|302783414|ref|XP_002973480.1| hypothetical protein SELMODRAFT_99098 [Selaginella moellendorffii]
gi|300159233|gb|EFJ25854.1| hypothetical protein SELMODRAFT_99098 [Selaginella moellendorffii]
Length = 254
Score = 252 bits (643), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 123/222 (55%), Positives = 160/222 (72%), Gaps = 7/222 (3%)
Query: 55 SDTSRTEYKPG-VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDAR 113
S + Y+ G +LD FLS FRNKL QEVG D+++PGYDGLI+L LM K K+ SD
Sbjct: 5 SREAAASYREGPLLDAAFLSLFRNKLAQEVGRDADRPGYDGLIQLSQLLMAKYKAKSDV- 63
Query: 114 DAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLP 173
+ A +RIL S+FP +L+L++ ++ P+ GK+AA++ ARVT TCQWLMG C++ SV+L
Sbjct: 64 EQATVRILNSMFPQSLLRLFRAVVLPINKGKLAAILSARVTQATCQWLMGTCSIGSVELS 123
Query: 174 DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFK 233
DGTS SGV VE+CKYLE SKC G+CI+TCKLPTQ F +GVPLLMEPNF+D SCQFK
Sbjct: 124 DGTSIPSGVLVEKCKYLEHSKCAGICIHTCKLPTQAFISKELGVPLLMEPNFADLSCQFK 183
Query: 234 FGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPK 275
FG+ P P+DD ++ PCL++CPT+ RK S+ CPK
Sbjct: 184 FGVEAPSPEDDPSVSTPCLEMCPTAIARK-----SSTPLCPK 220
>gi|302809968|ref|XP_002986676.1| hypothetical protein SELMODRAFT_47788 [Selaginella moellendorffii]
gi|300145564|gb|EFJ12239.1| hypothetical protein SELMODRAFT_47788 [Selaginella moellendorffii]
Length = 208
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 117/202 (57%), Positives = 153/202 (75%), Gaps = 2/202 (0%)
Query: 62 YKPG-VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRI 120
Y+ G +LD FLS RNKL QEVG D+++PGYDGLI+L LM K K+ SD + A +RI
Sbjct: 1 YREGPLLDAAFLSLLRNKLAQEVGRDADRPGYDGLIQLSQLLMAKYKAKSDV-EQATVRI 59
Query: 121 LVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQS 180
L S+FP +L+L++ ++ P+ GK+AA++ ARVT TCQWLMG C+++SV+L DGTS S
Sbjct: 60 LNSMFPQSLLRLFRAVVLPINKGKLAAILSARVTQATCQWLMGTCSISSVELSDGTSIPS 119
Query: 181 GVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPL 240
GV VE+CKYLE SKC G+CI+TCKLPTQ F +GVPLLMEPNF+D SCQFKFG+ P
Sbjct: 120 GVLVEKCKYLEHSKCAGICIHTCKLPTQAFISKELGVPLLMEPNFADLSCQFKFGVEAPS 179
Query: 241 PKDDTTLKEPCLDICPTSSRRK 262
P+DD ++ PCL++CPT+ RK
Sbjct: 180 PEDDPSVSTPCLEMCPTAIARK 201
>gi|168005042|ref|XP_001755220.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693813|gb|EDQ80164.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 231
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 108/214 (50%), Positives = 149/214 (69%), Gaps = 1/214 (0%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y PG LDD+FL FR+K+ +EVG DS KPGYDGL+++ L+++ +S + + A +R+L
Sbjct: 16 YVPGPLDDIFLKIFRSKMAEEVGWDSPKPGYDGLVDIAKKLLLQYRSGEET-ERATVRVL 74
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSG 181
SLFP +L L+K +++P GK AAM+ A+VT TCQWLMG CT+ V+L DG+ SG
Sbjct: 75 RSLFPSWLLPLFKQIVAPFGDGKPAAMLCAQVTIATCQWLMGKCTITEVELADGSKIPSG 134
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
V V++CKYL+E+KC G+CI+TCKLPTQ F MGV L MEPN+ ++SCQF FG+ PP
Sbjct: 135 VLVQKCKYLDETKCAGICIHTCKLPTQAFMNGDMGVRLTMEPNYENFSCQFNFGVDPPPA 194
Query: 242 KDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPK 275
+D LK PCL +CPT++ R + QCP+
Sbjct: 195 AEDPALKTPCLAVCPTAASRGSIRGVPIESQCPQ 228
>gi|226496275|ref|NP_001146228.1| uncharacterized protein LOC100279799 [Zea mays]
gi|219886283|gb|ACL53516.1| unknown [Zea mays]
gi|413917539|gb|AFW57471.1| hypothetical protein ZEAMMB73_233894 [Zea mays]
Length = 182
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/182 (64%), Positives = 146/182 (80%), Gaps = 3/182 (1%)
Query: 96 IELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTA 155
+E+ N LM+KGKS+ + AA +R+L SLFPPL+L LYK L++P+A G++AAMM+AR TA
Sbjct: 1 MEVANRLMVKGKSALETEQAA-VRVLQSLFPPLLLVLYKALLAPIANGQLAAMMLARATA 59
Query: 156 LTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYM 215
++CQWLMG C+VNSV LPDG S SGVFVE+CKYLEESKC+G+CINTCKLPTQTFFKD+M
Sbjct: 60 ISCQWLMGSCSVNSVTLPDGKSWSSGVFVEKCKYLEESKCLGICINTCKLPTQTFFKDHM 119
Query: 216 GVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQ--C 273
GV L MEPNF DYSCQF FG+ PP D LKEPCLDIC + RR+E+ NS+ ++ C
Sbjct: 120 GVDLYMEPNFEDYSCQFNFGVPPPPLDTDKALKEPCLDICTNARRRRELGRNSSPDELSC 179
Query: 274 PK 275
P+
Sbjct: 180 PQ 181
>gi|218196032|gb|EEC78459.1| hypothetical protein OsI_18326 [Oryza sativa Indica Group]
Length = 229
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 105/203 (51%), Positives = 137/203 (67%), Gaps = 14/203 (6%)
Query: 33 PRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGY 92
P S+LR S P A + R EY+P DD L+ FR K+V+EVG DSEKPGY
Sbjct: 30 PTTSLLRCS---SPSADTASSSWEGGR-EYEPSFADDFLLAFFRAKMVEEVGWDSEKPGY 85
Query: 93 DGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVAR 152
GLIE+ N M+KGKS+ + +A +R+L SLFPPL+L L+K L+ P+A G++A+MMV
Sbjct: 86 TGLIEVANRPMVKGKSALEIEQSA-VRVLRSLFPPLLLVLFKALLVPIANGQLASMMVGE 144
Query: 153 VTALTCQW----LMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQ 208
T +T + M +V +++P + VFVE+CKYLEESKC+G+CINTCKLPTQ
Sbjct: 145 FTRVTFFFEIIQKMLLTSVQVLEMP-----LTSVFVEKCKYLEESKCLGMCINTCKLPTQ 199
Query: 209 TFFKDYMGVPLLMEPNFSDYSCQ 231
TFFKD++GV L MEPNF DYSCQ
Sbjct: 200 TFFKDHIGVDLYMEPNFEDYSCQ 222
>gi|159489146|ref|XP_001702558.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280580|gb|EDP06337.1| predicted protein [Chlamydomonas reinhardtii]
Length = 280
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 90/232 (38%), Positives = 134/232 (57%), Gaps = 7/232 (3%)
Query: 47 QAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKG 106
QA P+ D + +P L+ + ++ FR K+V+ +G DS+ GYD +I+L L K
Sbjct: 50 QAGPVDAAPDYKPIDSQP--LNIIVMALFRRKMVEALGSDSKLSGYDAIIDLTRKLNTKF 107
Query: 107 KSSSDARDAAQIRILVSLFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHC 165
+++++ ++A + IL +LFP + +K++ S PL + + + A TA+TCQWLMG C
Sbjct: 108 RTAAETQEATR-GILNALFPSWLPGAFKVMFSRPLP--EFSCRLNALATAMTCQWLMGPC 164
Query: 166 TVNSVDLPDG-TSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPN 224
VN V++ G GV VERC+YLE++ C VCIN+CK+PTQ+FF+ MG+PL M PN
Sbjct: 165 KVNDVEIDGGKVGTGHGVLVERCRYLEQAGCASVCINSCKVPTQSFFEKDMGLPLTMTPN 224
Query: 225 FSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
+ D+SCQF FG P D PC CP+ R V + P A
Sbjct: 225 YDDFSCQFSFGKTPDPVDRDPAFATPCFTQCPSKRSRTPVCGGIELPSMPGA 276
>gi|384249478|gb|EIE22959.1| hypothetical protein COCSUDRAFT_15824 [Coccomyxa subellipsoidea
C-169]
Length = 234
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 127/215 (59%), Gaps = 18/215 (8%)
Query: 67 LDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIR---ILVS 123
L+ +S FR K+V +G +S+K GYD +++L L K SS+ RD Q++ IL+S
Sbjct: 25 LNRAIMSLFRQKMVAAIGKNSDKEGYDAIVDLTRLLNSK---SSNPRDT-QVKTRQILLS 80
Query: 124 LFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDG-TSCQSG 181
LFP + +K++ S P+ +++ + A VT LTCQWLMG C VN V+L G G
Sbjct: 81 LFPSWLPPAFKVMFSKPMP--EVSCQLNAWVTMLTCQWLMGPCKVNDVELDGGRIGSGQG 138
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
V VERC+YLEE+ C VC+N+CK+PTQ FF MG+PL M PN+ D+SCQF FG+ P
Sbjct: 139 VLVERCRYLEETGCASVCLNSCKIPTQEFFAKDMGLPLTMTPNYEDFSCQFSFGLTPKPA 198
Query: 242 KDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
D PC CP+ R + ++CP A
Sbjct: 199 AIDEAFATPCFSQCPSKQRHRG-------DRCPGA 226
>gi|307107180|gb|EFN55424.1| hypothetical protein CHLNCDRAFT_134573 [Chlorella variabilis]
Length = 255
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 115/204 (56%), Gaps = 18/204 (8%)
Query: 56 DTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDA 115
D S + +P L+ S FR ++VQ +G DS++ GY +I+L L + SS
Sbjct: 40 DYSAIDTQP--LNRAVYSLFRGRMVQAIGSDSQQEGYAAIIDLTRRLNAQ-HSSPRGTQE 96
Query: 116 AQIRILVSLFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPD 174
A + IL SLFP + + ++ S P+ G L+CQWLMG C VN V++
Sbjct: 97 ATVGILRSLFPGWLPPAFAVMFSKPMPG-------------LSCQWLMGECEVNDVEIDG 143
Query: 175 G-TSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFK 233
G GV V+RC+YLEE+ C VCIN+CK+PTQ FF+ +MG+PL M+PN+ D+SCQF
Sbjct: 144 GRMGAGHGVLVKRCRYLEEAACASVCINSCKVPTQEFFQRHMGLPLEMKPNYEDFSCQFS 203
Query: 234 FGILPPLPKDDTTLKEPCLDICPT 257
FG PP D PC CPT
Sbjct: 204 FGKTPPPEAQDAAFSTPCFQKCPT 227
>gi|428771295|ref|YP_007163085.1| hypothetical protein Cyan10605_2981 [Cyanobacterium aponinum PCC
10605]
gi|428685574|gb|AFZ55041.1| hypothetical protein Cyan10605_2981 [Cyanobacterium aponinum PCC
10605]
Length = 216
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/208 (38%), Positives = 121/208 (58%), Gaps = 10/208 (4%)
Query: 57 TSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAA 116
T +TEYK D LF++ F K+ + VG S+K GY+G ++L +M +G++S ++
Sbjct: 4 TDKTEYKDNWFDRLFIALFSRKMAKAVGKKSQKKGYEGFVDLSMQIM-EGRNSQQQQELV 62
Query: 117 QIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDL---- 172
I +L SL P VL L + L SP K A + +WL+G + ++
Sbjct: 63 AI-VLQSLVPSPVLFLIRNLFSPT---KWVCESNAWFATVLFEWLVGESEIREAEIVTED 118
Query: 173 PDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQF 232
T +SGV++++C+YLE S CVG+C+N CKLPTQ FF G+PL M PNF D+SC+
Sbjct: 119 NQVTILKSGVYIKKCRYLEASGCVGMCVNMCKLPTQEFFTKSFGIPLTMTPNFDDFSCEM 178
Query: 233 KFGILPPLPKDDTTLKEPCLD-ICPTSS 259
FG +PP +D+ ++ CL ICPT+S
Sbjct: 179 VFGQVPPAFEDEEASRQSCLKHICPTAS 206
>gi|443315784|ref|ZP_21045258.1| hypothetical protein Lep6406DRAFT_00035130 [Leptolyngbya sp. PCC
6406]
gi|442784621|gb|ELR94487.1| hypothetical protein Lep6406DRAFT_00035130 [Leptolyngbya sp. PCC
6406]
Length = 229
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 76/194 (39%), Positives = 116/194 (59%), Gaps = 8/194 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y+ G +D +F+ F K+ + +G + GYDG ++L +M +G+++ + + I +L
Sbjct: 23 YQDGFVDRVFIWLFSRKMSRALGQSTNLQGYDGFVDLSKKIM-QGRNAQEQQALVAI-VL 80
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQ-- 179
SL P VL L + + SP ++ + A A +WL+G C V +V++P TS +
Sbjct: 81 KSLVPSPVLWLIRTVFSPT---RLVCELNAWFAARLFEWLVGPCEVTAVEVPGQTSQRTQ 137
Query: 180 -SGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV +ERC+YLE+S+CVG+CIN CKLPTQ FF G+PL M PNF D+SC+ FG P
Sbjct: 138 RSGVHIERCRYLEQSRCVGMCINMCKLPTQDFFTQEFGIPLTMTPNFEDFSCEMVFGQPP 197
Query: 239 PLPKDDTTLKEPCL 252
P + + +PCL
Sbjct: 198 PPLETEDAYHQPCL 211
>gi|395146561|gb|AFN53713.1| omega-6 desaturase [Linum usitatissimum]
Length = 769
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 119/226 (52%), Gaps = 73/226 (32%)
Query: 51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSS 110
++ + + R++YKPGV DD+FL FR+++V+
Sbjct: 617 VQTKPEVLRSDYKPGVPDDVFLGLFRSRMVKV---------------------------- 648
Query: 111 DARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV 170
A D A + L P L + I+ L + RVT L+CQWLMG C+VN+V
Sbjct: 649 -ADDFALL-----LLPIL---YFFTDINQLVENR-------RVTQLSCQWLMGKCSVNTV 692
Query: 171 DLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSC 230
DLPDGTS +SG TFF DYMGVPLLMEPNF+DYSC
Sbjct: 693 DLPDGTSWESG---------------------------TFFNDYMGVPLLMEPNFTDYSC 725
Query: 231 QFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
QFKFG+ P P+DD +KEPCL ICPT+SRR+ + +S V QCPKA
Sbjct: 726 QFKFGMAAPQPEDDVAVKEPCLAICPTASRRR-IVPDSTV-QCPKA 769
>gi|299470735|emb|CBN79781.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 318
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 115/195 (58%), Gaps = 7/195 (3%)
Query: 67 LDDLFLSSFRNKLVQEVGLDSE--KPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSL 124
LD + L FR+KL +EVG D + +PGYDGL++++ L K S ++A++ R+L SL
Sbjct: 101 LDRILLGLFRSKLAEEVGGDGDAFEPGYDGLMDMIKVLNEKFPSKRKTQEASR-RVLKSL 159
Query: 125 FPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGT-SCQSGV 182
FP + + ++ S P ++ + A VT + QWLMG +N +++ DGT G+
Sbjct: 160 FPSWLPASFAVMFSKPFPA--FSSRLNAWVTLVASQWLMGPSKLNDIEIDDGTVGVGHGL 217
Query: 183 FVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPK 242
VERC++LE + C VC+NTCK+PT+ FF MG+ L M PN+ D+SCQF F P
Sbjct: 218 LVERCRFLEAAGCASVCMNTCKVPTEEFFAKDMGLALEMTPNYEDFSCQFSFNKTPLARD 277
Query: 243 DDTTLKEPCLDICPT 257
D + C + CP+
Sbjct: 278 MDEAFRVACFEQCPS 292
>gi|422293819|gb|EKU21119.1| hypothetical protein NGA_2097710, partial [Nannochloropsis gaditana
CCMP526]
gi|422293944|gb|EKU21244.1| hypothetical protein NGA_2097720, partial [Nannochloropsis gaditana
CCMP526]
Length = 284
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/197 (40%), Positives = 113/197 (57%), Gaps = 8/197 (4%)
Query: 67 LDDLFLSSFRNKLVQEVG-LDSEKPGYDGLIELVNHLMMKGKSSS--DARDAAQIRILVS 123
D L + FR KLVQ++G D+ + GLIEL+ L + +S + AAQ IL S
Sbjct: 76 FDGLLYAFFRAKLVQQLGGSDTASKDFAGLIELIRKLNTQFPASGKVGTQKAAQ-NILRS 134
Query: 124 LFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGT-SCQSG 181
LFP + + ++ S P +A M A +T LT WLMG + VD+ G+ G
Sbjct: 135 LFPSWLPAAFAVMFSKPFPA--FSARMNAIITGLTTYWLMGESEIIDVDVDGGSVGVGQG 192
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
+ V+RC+YLEE+ C +C+NTCK+PTQ FF MG+PL M P++ D+SC F FG+ PP
Sbjct: 193 LLVKRCRYLEEAGCASICVNTCKIPTQNFFCQDMGLPLTMTPDYDDFSCTFAFGLTPPPV 252
Query: 242 KDDTTLKEPCLDICPTS 258
DD ++ C CPT+
Sbjct: 253 FDDEAMRVACFSQCPTA 269
>gi|254421834|ref|ZP_05035552.1| hypothetical protein S7335_1984 [Synechococcus sp. PCC 7335]
gi|196189323|gb|EDX84287.1| hypothetical protein S7335_1984 [Synechococcus sp. PCC 7335]
Length = 214
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 118/202 (58%), Gaps = 9/202 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
+K +LD LF+ F K+ +G + GY+G ++L +M +G+++ + + AA R+L
Sbjct: 7 HKDNLLDRLFIWLFSRKMANAIGSTTAATGYEGFVDLSKQIM-QGRNAQE-QQAAVARVL 64
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLP--DGT--S 177
SL P VL + + + SP ++ ++ A +WL+G C V ++ DG S
Sbjct: 65 QSLVPAPVLWVIRTVFSPT---RLVCVLNAWFATQMFEWLVGPCEVAQAEVKGLDGEVRS 121
Query: 178 CQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGIL 237
S V +++C+YLEES+CVG+C+N CKLPTQTFF + G+PL M PNF D SC+ FG +
Sbjct: 122 QPSAVHIKKCRYLEESQCVGMCVNMCKLPTQTFFTEKFGIPLTMIPNFEDLSCEMVFGRV 181
Query: 238 PPLPKDDTTLKEPCLDICPTSS 259
PP +D + + CL C T +
Sbjct: 182 PPAADEDEVMTQSCLSECSTGT 203
>gi|168028991|ref|XP_001767010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681752|gb|EDQ68176.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 213
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/194 (39%), Positives = 113/194 (58%), Gaps = 7/194 (3%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T YK LD+ LS +L G+ + K GYDG +EL +M S + A+ +
Sbjct: 19 KTHYKDSWLDNTILSICMRRLGNVTGVSTTKKGYDGFVELTRKVM--ETRSPLLQRASSM 76
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
R+L S PP +LK+ + L + A A T L +WL+G C V V++ +GT
Sbjct: 77 RVLHSAIPPWLLKIIRRF---LPNNQKTAETFAAAT-LYAEWLVGPCEVKEVEV-NGTMQ 131
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
+SGV +++C+YLE S CVG+C+N CK+PTQ FF + +GVPL M PNF D SC+ +G P
Sbjct: 132 KSGVLIKKCRYLESSNCVGMCVNLCKIPTQDFFTNSLGVPLTMTPNFEDMSCEMIYGQTP 191
Query: 239 PLPKDDTTLKEPCL 252
P ++D L++PC
Sbjct: 192 PSIEEDPALQQPCF 205
>gi|323451409|gb|EGB07286.1| hypothetical protein AURANDRAFT_64975 [Aureococcus anophagefferens]
Length = 283
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 116/228 (50%), Gaps = 19/228 (8%)
Query: 49 PPIKRESDTSRTEY--KPGVLDDLFLSSFRNKLVQEVGLDSEKPG--YDGLIELVNHLMM 104
PP+K+ + T + G+LD F+ F ++ +E+G D+ ++GLIE V L
Sbjct: 22 PPLKKTGPQTPTPFIDDSGLLDRFFMRVFTARVREELGGDARHGSDDFEGLIEEVRVLNA 81
Query: 105 KGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGK-----------IAAMMVARV 153
+G S A+ AA RIL FPP+V + PL +A + A V
Sbjct: 82 RGPSPRAAQ-AAGRRILRRCFPPMVSPANGFRVEPLYDAYRQLFAHEVFRPYSAKLNAWV 140
Query: 154 TALTCQWLMGHCTVNSVDLPD---GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTF 210
T WLMG TV V+ PD G + VERC++LE C C+N CK+PTQ F
Sbjct: 141 TRACSYWLMGASTVGDVETPDATWGDGAHQKLVVERCRFLEAGGCASACVNLCKIPTQRF 200
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
F + MG+PLLMEP++ D+SC F FG+ PP D L PC CP +
Sbjct: 201 FGEDMGLPLLMEPDYEDFSCTFSFGVAPPPLAVDEALDTPCFSQCPVA 248
>gi|302787921|ref|XP_002975730.1| hypothetical protein SELMODRAFT_103856 [Selaginella moellendorffii]
gi|300156731|gb|EFJ23359.1| hypothetical protein SELMODRAFT_103856 [Selaginella moellendorffii]
Length = 230
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 79/217 (36%), Positives = 122/217 (56%), Gaps = 6/217 (2%)
Query: 39 RSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIEL 98
RS I+ + P K +T YK + D F+S F K+ G S+K GYDG ++
Sbjct: 9 RSLIRCEIAEPSGKPAPMGQKTRYKDSIFDRAFMSLFARKMENATGRASKKTGYDGFVD- 67
Query: 99 VNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTC 158
V+ +++G++ + R + +L+S+ PP + ++ L P K A A +T
Sbjct: 68 VSRGVLQGRNPVEQRALVR-EVLLSIMPPGAPETFRKLFPPT---KWACEFNAAITVPFF 123
Query: 159 QWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVP 218
QWL+G C V++ +G SGV + +C+YLE S CVG+C+N CK+PTQ FF + G+P
Sbjct: 124 QWLVGPCERFEVEV-NGVKQNSGVKILKCRYLENSNCVGMCVNMCKIPTQDFFTNDFGLP 182
Query: 219 LLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDIC 255
L M PNF D SC+ +G+ P ++D LK+PCL +C
Sbjct: 183 LTMTPNFEDMSCEMIYGLQPTSLEEDPALKQPCLQLC 219
>gi|371779167|emb|CBZ39517.1| td6ITP3 protein, partial [Triticum durum]
Length = 113
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 66/96 (68%), Positives = 75/96 (78%)
Query: 170 VDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYS 229
V LP+G SGVFVE+CKYLEESKC+G+CINTCKLPTQTFF D+MGV L MEPNF DYS
Sbjct: 1 VALPNGKPLSSGVFVEKCKYLEESKCLGICINTCKLPTQTFFNDHMGVDLYMEPNFEDYS 60
Query: 230 CQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVA 265
CQF FG+ PP D LKEPCLDIC + RR+E+
Sbjct: 61 CQFNFGVPPPPIDTDKALKEPCLDICTNARRRRELG 96
>gi|428163798|gb|EKX32851.1| hypothetical protein GUITHDRAFT_56235, partial [Guillardia theta
CCMP2712]
Length = 213
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 111/207 (53%), Gaps = 17/207 (8%)
Query: 64 PGVLDDLF-----LSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
P D+L+ LS FR +V+ +G D + GYDG++ L LM+ + A
Sbjct: 10 PSYKDELWWDRTALSMFRGAMVKNLGQDVMEQGYDGVMRLA--LMLNQRYEPLETRARTR 67
Query: 119 RILVSLFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHCTVNSV------- 170
IL SLFP ++KL+ ++ + P +A + A +T++TC WLMG + +
Sbjct: 68 SILRSLFPVFIIKLFPLMFARPFPA--FSAKLNAYITSVTCSWLMGPMKLFDLKAEEMED 125
Query: 171 DLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSC 230
D D G+ VERC++LEES C VCINTCK+PTQ FF MG+PL MEPN+ + C
Sbjct: 126 DWGDDAGKSQGILVERCRFLEESGCASVCINTCKVPTQEFFIKDMGIPLSMEPNYDTFEC 185
Query: 231 QFKFGILPPLPKDDTTLKEPCLDICPT 257
+FKFG P D PC CP+
Sbjct: 186 EFKFGKRPLQQDTDEIFTTPCFQQCPS 212
>gi|302783805|ref|XP_002973675.1| hypothetical protein SELMODRAFT_173467 [Selaginella moellendorffii]
gi|300158713|gb|EFJ25335.1| hypothetical protein SELMODRAFT_173467 [Selaginella moellendorffii]
Length = 220
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/216 (36%), Positives = 122/216 (56%), Gaps = 6/216 (2%)
Query: 39 RSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIEL 98
RS I+ + P K +T YK + D F+S F K+ G S+K GYDG ++
Sbjct: 11 RSLIRCEIAEPSGKPAPMGQKTRYKDSIFDRAFMSLFARKMENATGRASKKTGYDGFVD- 69
Query: 99 VNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTC 158
V+ +++G++ + R + +L+S+ PP + ++ L P K A A +T
Sbjct: 70 VSRGVLQGRNPVEQRALVR-EVLLSIMPPGAPETFRKLFPPT---KWACEFNAAITVPFF 125
Query: 159 QWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVP 218
QWL+G C V++ +G +SGV + +C+YLE S CVG+C+N CK+PTQ FF + G+P
Sbjct: 126 QWLVGPCERFEVEV-NGVKQKSGVKILKCRYLENSNCVGMCVNMCKIPTQDFFTNDFGLP 184
Query: 219 LLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
L M PNF D SC+ +G+ P ++D LK+PCL +
Sbjct: 185 LTMTPNFEDMSCEMIYGLQPTSLEEDPALKQPCLQL 220
>gi|168000160|ref|XP_001752784.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695947|gb|EDQ82288.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 269
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 119/198 (60%), Gaps = 9/198 (4%)
Query: 60 TEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIR 119
T Y +LD F++ FR K+ +G S+ GY+G ++ V+ +M+G+++ + R A +R
Sbjct: 54 TRYNDNILDKAFIALFRRKMEANLGKTSKMQGYEGFVD-VSKKIMQGRTAVEQR--AVVR 110
Query: 120 -ILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
+L+SL PP ++ L P K +A A VT QWL+G + +++ +G
Sbjct: 111 DVLLSLLPPGAPAQFRKLFPPT---KWSAEFNAAVTVPFFQWLVGPAELMEIEV-NGVKQ 166
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV + +C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+ +G P
Sbjct: 167 MSGVKITKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMTPNFEDMSCEMFYGQSP 226
Query: 239 PLPKDDTTLKEPC-LDIC 255
P ++D LK+PC L++C
Sbjct: 227 PPIEEDPALKQPCFLNMC 244
>gi|302787573|ref|XP_002975556.1| hypothetical protein SELMODRAFT_103815 [Selaginella moellendorffii]
gi|300156557|gb|EFJ23185.1| hypothetical protein SELMODRAFT_103815 [Selaginella moellendorffii]
Length = 275
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 127/247 (51%), Gaps = 27/247 (10%)
Query: 30 LQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLD--- 86
L +P S RS I+ + P K +T YK + D F+S F K+ G+D
Sbjct: 3 LGAPAISFHRSLIRCEIAEPSGKPAPMGQKTRYKDSIFDRAFMSLFSRKMESATGMDKIL 62
Query: 87 ------------------SEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPL 128
+ KPGYDG ++ V+ ++KG++ + R + ++ +S+ PP
Sbjct: 63 TWFISSLYDLKKSDVGRATNKPGYDGFVD-VSRGVLKGRTPVEQRALVR-QVFLSIMPPG 120
Query: 129 VLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCK 188
+ ++ L P K A A +T QWL+G C V++ +G SGV + +C+
Sbjct: 121 APETFRKLFPPT---KWACEFNAAITVPFFQWLVGPCETFEVEV-NGVKQNSGVKILKCR 176
Query: 189 YLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLK 248
YLE S C G+C+N CK+PTQ F + G+PL M PNF D SC+ +G+ PP ++D LK
Sbjct: 177 YLENSSCAGMCVNVCKIPTQDLFTNDFGLPLTMTPNFEDMSCEMIYGLQPPSLEEDPALK 236
Query: 249 EPCLDIC 255
+PCL+ C
Sbjct: 237 QPCLERC 243
>gi|302783505|ref|XP_002973525.1| hypothetical protein SELMODRAFT_99956 [Selaginella moellendorffii]
gi|300158563|gb|EFJ25185.1| hypothetical protein SELMODRAFT_99956 [Selaginella moellendorffii]
Length = 285
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 128/247 (51%), Gaps = 27/247 (10%)
Query: 30 LQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLD--- 86
L +P S RS I+ + P K +T YK + D F+S F K+ G+D
Sbjct: 3 LGAPAISFHRSLIRCKIAEPSGKPAPMGQKTRYKDSIFDRAFMSLFSRKMESATGMDKIL 62
Query: 87 ------------------SEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPL 128
+ KPGYDG ++ V+ ++KG++ + R + ++ +S+ PP
Sbjct: 63 TWFISSLYDLKKSDVGRATNKPGYDGFVD-VSRGVLKGRTPVEQRALVR-QVFLSIMPPG 120
Query: 129 VLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCK 188
+ ++ L P K A A +T QWL+G C V++ +G SGV + +C+
Sbjct: 121 APETFRKLFPPT---KWACEFNAAITVPFFQWLVGPCERFEVEV-NGVKQNSGVKILKCR 176
Query: 189 YLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLK 248
YLE S C G+C+N CK+PTQ F + G+PL M PNF D SC+ +G+ PP ++D LK
Sbjct: 177 YLENSSCAGMCVNVCKIPTQDLFTNDFGLPLTMTPNFEDMSCEMIYGLQPPSLEEDPALK 236
Query: 249 EPCLDIC 255
+PCL++C
Sbjct: 237 QPCLELC 243
>gi|428773769|ref|YP_007165557.1| hypothetical protein Cyast_1955 [Cyanobacterium stanieri PCC 7202]
gi|428688048|gb|AFZ47908.1| hypothetical protein Cyast_1955 [Cyanobacterium stanieri PCC 7202]
Length = 229
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 125/230 (54%), Gaps = 13/230 (5%)
Query: 51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSS 110
++ ++ T++T YK ++D LF++ F K+ + +G + GYDG ++L + +MKG++
Sbjct: 3 LREKNTTTKTIYKDNIIDRLFIALFCRKMEKALGAKTNLKGYDGFVDL-SQKIMKGRNPQ 61
Query: 111 DARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV 170
+D + IL SL P VL L + + K A + WL+G C + V
Sbjct: 62 QQQDLVAV-ILKSLVPSPVLYLTRTFV---PANKWVCEANAWFAKVLFPWLVGICELREV 117
Query: 171 DLP----DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFS 226
++ T SGV +++C+YLE S CV +CIN CKLPTQ FF + G+P+ M PNF
Sbjct: 118 EIETENNQKTIQNSGVHIKKCRYLENSGCVAMCINMCKLPTQKFFTESFGIPVTMTPNFE 177
Query: 227 DYSCQFKFGILPPLPKDDTTLKEPCL-DICPTSSRRKEVAMNSNVEQCPK 275
D+SC+ FG PP + ++PCL +IC T+ + N+ CPK
Sbjct: 178 DFSCEMVFGQNPPPLNQEECSRQPCLQEICDTAVTSSTI---KNLAPCPK 224
>gi|385763980|gb|AFI78793.1| putative D27 family protein [Chaetosphaeridium globosum]
Length = 273
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 81/247 (32%), Positives = 137/247 (55%), Gaps = 12/247 (4%)
Query: 7 RPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSI-QPQPQAPPIKRESDTSRTEYKPG 65
+P++ + P R+ +P+ S R +R +I +P + P+ + T+Y
Sbjct: 19 QPTSSLRTRQPLRTTPLPRNAQSAGSRRRGTVRCAIAEPSGKPAPMGQ-----ITKYNDN 73
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLF 125
D LF+S F K+ E G + GY+G ++ ++ +M+G+S ++ + A+ R+L+S+
Sbjct: 74 WFDLLFMSLFAKKMEIETGKKTRLTGYEGFVD-ISKRVMQGRSPAE-QQASVRRVLLSML 131
Query: 126 PPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVE 185
PP ++ L P K++A + A +T WL+G + V++ +G SGV +E
Sbjct: 132 PPEAPASFRKLFPPT---KLSAEINAWITVPFFAWLVGPAKLYEVEV-NGVKQWSGVKIE 187
Query: 186 RCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDT 245
+C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+ +G L P ++D
Sbjct: 188 KCRYLENSGCVGMCVNMCKVPTQDFFTNEFGLPLTMTPNFEDMSCEMVYGQLAPPVEEDP 247
Query: 246 TLKEPCL 252
K+PC
Sbjct: 248 AYKQPCF 254
>gi|427417077|ref|ZP_18907260.1| hypothetical protein Lepto7375DRAFT_2774 [Leptolyngbya sp. PCC
7375]
gi|425759790|gb|EKV00643.1| hypothetical protein Lepto7375DRAFT_2774 [Leptolyngbya sp. PCC
7375]
Length = 217
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 112/206 (54%), Gaps = 9/206 (4%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T Y D LF+ F +K+ VG S +PGYDG +EL + +M+G+SS +
Sbjct: 8 KTTYHDSFFDQLFIRLFASKMSNAVGECSSRPGYDGFVEL-SQKIMQGRSSQQQQ-QLVA 65
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVN--SVDLPDGT 176
+L SL P VL + SP ++ + A +WL+G CTV V G
Sbjct: 66 VVLQSLVPAPVLWGIRTFFSPT---QLVCELNAWFATQLFEWLVGPCTVQLAEVTTASGE 122
Query: 177 SCQ--SGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
+ Q S V +E+C+YLE+S CVG+C+N CKLPTQ FF + G+PL M PNF D SC F
Sbjct: 123 TRQQKSAVHIEKCRYLEQSGCVGMCVNMCKLPTQQFFTEKFGIPLTMTPNFEDLSCDMVF 182
Query: 235 GILPPLPKDDTTLKEPCLDICPTSSR 260
G +PP + + ++PCL C +S
Sbjct: 183 GQMPPPLETEDAYQQPCLQDCAVASE 208
>gi|385763996|gb|AFI78801.1| putative D27 family protein, partial [Spirogyra pratensis]
Length = 237
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 113/193 (58%), Gaps = 6/193 (3%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T+Y + D F++ F K+ G S+ GY+G ++ + +M+G+++ R+A
Sbjct: 27 KTKYNDSIFDRAFMALFAAKMATVTGKRSDIGGYEGFVD-TSRKVMQGRNAQGQREAV-A 84
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
++L+SL PP ++ + P K +A M A +T QWL+G + V++ +G
Sbjct: 85 KVLLSLLPPNAPAQFRKIFPPT---KWSAEMNAAITVPFFQWLVGPAELKEVEV-NGVKQ 140
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV +++C+YLE S CVG+C+N CKLPTQ FF + G+PL M PNF D SC+ FG L
Sbjct: 141 MSGVQIKKCRYLEYSGCVGMCVNMCKLPTQDFFTNEFGLPLTMNPNFEDMSCEMIFGQLS 200
Query: 239 PLPKDDTTLKEPC 251
++D LK+PC
Sbjct: 201 QPLEEDPALKQPC 213
>gi|219109820|ref|XP_002176663.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411198|gb|EEC51126.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 330
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 114/220 (51%), Gaps = 17/220 (7%)
Query: 48 APPIKRESDTSRTEYKP--GVLDDLFLSSFRNKLVQEVG-LDSEKPGYD--GLIEL---V 99
P + + D +T P +D LFL FR KL VG DS + D G+I+L +
Sbjct: 82 GPSLATKPDYEKTAIGPLGRWMDLLFLRVFRRKLAGHVGGADSSRNVTDFMGIIDLAAAM 141
Query: 100 NHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTC 158
N +GK S A+ ++L LFP + Y +L S P ++ M A T +
Sbjct: 142 NRRFSQGKIHSAAQ-----QVLRELFPSWMPGSYAVLFSKPFPA--FSSRMNAWATKVAG 194
Query: 159 QWLMGHCTVNSVDLPDGTSCQS-GVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGV 217
WLMG C +N V + G + G+ V+RC++LEES C +C+N+CK+PTQ FF MG+
Sbjct: 195 TWLMGECEINDVVVDGGEVGEGQGLLVKRCRFLEESGCASICVNSCKIPTQNFFAQDMGL 254
Query: 218 PLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPT 257
PL MEPN+ + CQF FG P + PCL CPT
Sbjct: 255 PLTMEPNYETFECQFSFGRTPDSSTELDAKSTPCLSRCPT 294
>gi|385763986|gb|AFI78796.1| putative D27 protein [Klebsormidium flaccidum]
Length = 328
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 117/194 (60%), Gaps = 6/194 (3%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T+YK G++D L ++ FR K+ G +++ GYD ++ V+ +M+GKS+ + + AA
Sbjct: 113 KTQYKDGLIDRLAMNLFRRKMQTVTGARTKETGYDAFVD-VSKALMRGKSAQE-QQAAVS 170
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
R+L+SL P + + + P +++ + A T WL+G V V++ +G
Sbjct: 171 RVLLSLIPRHLPYIIRTFFKPT---RLSLELNALFTPSIFSWLVGPAEVVEVEV-NGVKQ 226
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
++GV +++C+YLE S CVG+C+N CK+PTQ FF G+PL M PNF D SC+ FG +P
Sbjct: 227 KTGVKIKKCRYLEASGCVGMCVNVCKVPTQDFFTKEFGLPLTMNPNFDDMSCEMVFGQVP 286
Query: 239 PLPKDDTTLKEPCL 252
P ++D ++PC
Sbjct: 287 PPIEEDKAFQQPCF 300
>gi|443476031|ref|ZP_21065956.1| hypothetical protein Pse7429DRAFT_1458 [Pseudanabaena biceps PCC
7429]
gi|443019039|gb|ELS33194.1| hypothetical protein Pse7429DRAFT_1458 [Pseudanabaena biceps PCC
7429]
Length = 221
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 116/207 (56%), Gaps = 10/207 (4%)
Query: 58 SRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQ 117
+R EY +D +F+ F K+ + +G + GY+G +EL +M +G+++ + +
Sbjct: 10 ARDEYNDNFIDRMFIWLFSRKMSEALGKGTTIGGYEGFVELSKQIM-QGRNAQEQQILVA 68
Query: 118 IRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV--NSVDLPDG 175
++L SL P L + SP ++ ++ A A +WL+G C V ++L DG
Sbjct: 69 -KVLQSLVPSPALWAIRTFFSPT---RLVCVLNAWFAAQMFEWLVGPCEVIEAEINLEDG 124
Query: 176 T--SCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFK 233
T S S V +++C+YL +S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+
Sbjct: 125 TLRSQPSAVHIKKCRYLVDSGCVGMCVNMCKVPTQVFFTEKFGIPLTMTPNFEDLSCKMI 184
Query: 234 FGILPPLPKDDTTLKEPCLD-ICPTSS 259
FG +P P+ D + CL CPT+S
Sbjct: 185 FGQMPTDPELDEAFTQSCLKHQCPTAS 211
>gi|298713846|emb|CBJ33737.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 299
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 71/197 (36%), Positives = 116/197 (58%), Gaps = 13/197 (6%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLI-ELVNHLMMKGKSSSDARDAAQIRI 120
Y +LD + L+ FR + +E+G SE+PGY GLI E N+++++G S D +D +R+
Sbjct: 97 YSESLLDKVALALFRVLVQKEIGYKSEEPGYAGLIDEAQNYMVIQGASVEDQQDMV-VRV 155
Query: 121 LVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV--DLPDGTSC 178
L ++ P V +YK+ ++P A + A T ++L+G +++ D P
Sbjct: 156 LTTIAGPAVPPVYKLFMAPW---PWAPFLTAFFTPPFFKFLVGPNKLDARKDDTP----- 207
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
GVFVERC++LEE+ C G+C N CK+PT+ FF++ +G+ + MEP+F Y C+ FG+
Sbjct: 208 -GGVFVERCRFLEETNCKGLCTNMCKIPTERFFEETLGLTMAMEPDFDTYECRLSFGLES 266
Query: 239 PLPKDDTTLKEPCLDIC 255
P ++D T+ CL C
Sbjct: 267 PAMEEDDTVPRGCLSGC 283
>gi|384250243|gb|EIE23723.1| hypothetical protein COCSUDRAFT_33175 [Coccomyxa subellipsoidea
C-169]
Length = 246
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 8/217 (3%)
Query: 39 RSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIEL 98
R S Q P A K + + Y LD +S F ++ Q++G PGY+G +EL
Sbjct: 4 RCSAQTTPAAATPKVDPFAQKETYNDSPLDKFMVSYFAGRMSQQLGGREYVPGYEGFVEL 63
Query: 99 VNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTC 158
MMKG++S + A +L SL PP + ++ P++ K +A A +T L
Sbjct: 64 SRE-MMKGRNSKQQQQAVS-GVLGSLMPPQASERFRKWF-PVS--KWSAETNALITVLGF 118
Query: 159 QWLMGHCTVNSVDLP---DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYM 215
+WL+G V++ + +SGV +++C+YLE+S C+G+C+N CK+PT+ FF +
Sbjct: 119 KWLVGPLETKEVEVEFEGEKQKWKSGVQIKKCRYLEQSGCIGMCVNMCKIPTEDFFTNQF 178
Query: 216 GVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCL 252
G+PL M PNF D SC+ FG P + D +PC
Sbjct: 179 GLPLTMNPNFEDLSCEMIFGQKAPPIEQDPLYNQPCF 215
>gi|298714305|emb|CBJ33899.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 220
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 115/191 (60%), Gaps = 9/191 (4%)
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLI-ELVNHLMMKGKSSSDARDAAQIRILVSL 124
+LD + L+ FR + +E+G SE+PGY GLI E N+++++G S D +D A +R+L ++
Sbjct: 22 LLDKVALALFRVLVQKEIGYKSEEPGYAGLIDEAQNYMVIQGASVEDQQDMA-VRVLTTI 80
Query: 125 FPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFV 184
P V +YK+ ++P A + A T ++L+G N +D + GVFV
Sbjct: 81 AGPAVPPVYKLFMAPW---PWAPFLTAFFTPPFFKFLVGP---NKLDARKDDT-PGGVFV 133
Query: 185 ERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDD 244
ERC++LEE+ C G+C N CK+PT+ FF++ +G+ + MEP+F Y C+ FG+ P ++D
Sbjct: 134 ERCRFLEETNCKGLCTNMCKIPTERFFEETLGLTMAMEPDFDTYECRLSFGLESPAMEED 193
Query: 245 TTLKEPCLDIC 255
T+ CL C
Sbjct: 194 DTVPRGCLSGC 204
>gi|255555763|ref|XP_002518917.1| conserved hypothetical protein [Ricinus communis]
gi|223541904|gb|EEF43450.1| conserved hypothetical protein [Ricinus communis]
Length = 277
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 135/245 (55%), Gaps = 24/245 (9%)
Query: 24 PKLHPQLQ--SPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQ 81
P+ P LQ SPR + +P + P+ + +T+Y G+ + +F+S F K+ +
Sbjct: 14 PRRGPCLQRCSPRIFIRCRIAEPSGEPAPLGQ-----KTKYTDGLFEKVFMSLFARKMEK 68
Query: 82 -----EVGLDSEKPG-----YDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLK 131
+ G DS+K G Y+ ++ V+ +M+G++ ++ + +L+S+ PP +
Sbjct: 69 FAAPVKNGNDSKKKGWLDSDYETFVD-VSRRVMQGRNRLQQQEVVR-EVLLSMLPPGAPE 126
Query: 132 LYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLE 191
+K L P + AA A +T QWL+G V V++ +G +SGV +++C+YLE
Sbjct: 127 QFKKLFPPT---RWAAEFNAALTVPFFQWLVGPSEVIEVEV-NGVKQKSGVRIKKCRYLE 182
Query: 192 ESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPC 251
S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+ +G PP +D K+PC
Sbjct: 183 NSGCVGMCVNMCKIPTQDFFTNEFGLPLTMIPNFEDMSCEMVYGQAPPPFDEDPASKQPC 242
Query: 252 L-DIC 255
DIC
Sbjct: 243 YADIC 247
>gi|326495048|dbj|BAJ85620.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516196|dbj|BAJ88121.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 264
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 115/206 (55%), Gaps = 19/206 (9%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSE-KPG------------YDGLIELVNHLMMK 105
+TEY+ G L+ F+ F K+ + G + PG Y+ ++ V+ +M
Sbjct: 45 KTEYRDGPLERAFMGLFARKMEKFAGRKKKPDPGGEEEKKAVWEWDYESFVD-VSRRVMV 103
Query: 106 GKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHC 165
G+S + ++A + +L+S+ PP + +K L P + A A +T WL+G
Sbjct: 104 GRSRAQQQEAVR-EVLLSMLPPGAPEQFKKLFPPT---RWACEFNAALTVPFFHWLVGPS 159
Query: 166 TVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNF 225
V V++ DG +SGV +++C+YLE S CVG+C+N CK+PTQ+FF D G+PL M PNF
Sbjct: 160 EVIEVEV-DGVKQRSGVLIKKCRYLENSGCVGMCVNMCKIPTQSFFTDEFGLPLTMNPNF 218
Query: 226 SDYSCQFKFGILPPLPKDDTTLKEPC 251
D SC+ +G +PP ++D K+PC
Sbjct: 219 EDMSCEMIYGQVPPPLEEDPVSKQPC 244
>gi|397590835|gb|EJK55179.1| hypothetical protein THAOC_25114 [Thalassiosira oceanica]
Length = 314
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 110/205 (53%), Gaps = 16/205 (7%)
Query: 65 GVLDDLFLSSFRNKLVQEVGLDSEK----------PGYDGLIELVNHLMMKGKSSSDARD 114
G +D L LS FR K+ + + ++ +DG+I L + + + + ++
Sbjct: 71 GTVDRLLLSYFRIKMAERLARPKDEVKISDSSLAVDDFDGIISLTSSMNALYNNRTKVQE 130
Query: 115 AAQIRILVSLFPPLVLKLY-KILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLP 173
AAQ +LVSLFP +L Y P +A M A T WLMG C VN V++
Sbjct: 131 AAQ-DVLVSLFPRFILDRYPSWFARPFP--TFSARMCAAATTAGGTWLMGECEVNDVEI- 186
Query: 174 DGTSCQS-GVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQF 232
DGT + GV V+RC++L+ES C +C+N+CK+PT+ FF + MG+ L M P++ CQF
Sbjct: 187 DGTLARGQGVHVKRCRFLDESSCASICVNSCKVPTERFFAEDMGLALTMTPDYETGECQF 246
Query: 233 KFGILPPLPKDDTTLKEPCLDICPT 257
FG +P + + + PCL CP+
Sbjct: 247 AFGKMPSEEELLLSKETPCLRRCPS 271
>gi|297839973|ref|XP_002887868.1| hypothetical protein ARALYDRAFT_474875 [Arabidopsis lyrata subsp.
lyrata]
gi|297333709|gb|EFH64127.1| hypothetical protein ARALYDRAFT_474875 [Arabidopsis lyrata subsp.
lyrata]
Length = 250
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 116/209 (55%), Gaps = 13/209 (6%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIEL-------VNHLMMKGKSSSD 111
+T Y G+++ +F+ F K+ + ++ G E V+ +M+G+S
Sbjct: 36 KTRYDDGLVERVFMGLFARKMDKFGSKKKKETKEKGFWEYDYESFVEVSKRVMQGRSRVQ 95
Query: 112 ARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVD 171
++A + +L+S+ PP + ++ L P K AA A +T WL+G V V+
Sbjct: 96 QQEAVR-EVLLSMLPPGAPQQFRKLFPPT---KWAAEFNAALTVPFFHWLVGPSQVIEVE 151
Query: 172 LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQ 231
+ +G +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+
Sbjct: 152 V-NGVKQRSGVRIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFEDMSCE 210
Query: 232 FKFGILPPLPKDDTTLKEPCL-DICPTSS 259
+G PP ++D K+PCL DIC S+
Sbjct: 211 MIYGQAPPAFEEDVATKQPCLADICSMST 239
>gi|18408106|ref|NP_564838.1| uncharacterized protein [Arabidopsis thaliana]
gi|6633822|gb|AAF19681.1|AC009519_15 F1N19.25 [Arabidopsis thaliana]
gi|33589794|gb|AAQ22663.1| At1g64680 [Arabidopsis thaliana]
gi|110740704|dbj|BAE98453.1| hypothetical protein [Arabidopsis thaliana]
gi|332196152|gb|AEE34273.1| uncharacterized protein [Arabidopsis thaliana]
Length = 250
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 116/209 (55%), Gaps = 13/209 (6%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIEL-------VNHLMMKGKSSSD 111
+T Y+ G+++ +F+ F K+ + + G E V+ +M+G+S
Sbjct: 36 KTRYEDGLVERVFMGLFARKMDKFGSKKKKDTKEKGFWEYDYESFVEVSKRVMQGRSRVQ 95
Query: 112 ARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVD 171
++A + +L+S+ PP + ++ L P K AA A +T WL+G V V+
Sbjct: 96 QQEAVR-EVLLSMLPPGAPEQFRKLFPPT---KWAAEFNAALTVPFFHWLVGPSQVIEVE 151
Query: 172 LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQ 231
+ +G +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PN+ D SC+
Sbjct: 152 V-NGVKQRSGVRIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNYEDMSCE 210
Query: 232 FKFGILPPLPKDDTTLKEPCL-DICPTSS 259
+G PP ++D K+PCL DIC S+
Sbjct: 211 MIYGQAPPAFEEDVATKQPCLADICSMSN 239
>gi|168021494|ref|XP_001763276.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685411|gb|EDQ71806.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 175
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 103/176 (58%), Gaps = 11/176 (6%)
Query: 91 GYDGLIELVNHLMMKGKSSSDARDAA-QIRILVSLFPPLVLKLYKILISPLAGGKIAAMM 149
GY+G++E V+H + + K++++ + A ++R + + P KL+ A +
Sbjct: 4 GYEGMVE-VSHALARNKNAAEQQAAVLRVRHNLPILPDWFRKLFPY-------SDWGAEL 55
Query: 150 VARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQT 209
AR+T L WL+G C V V + D +SGV +++C+YLE S C G+C+N+CK+PTQ
Sbjct: 56 NARITPLFFSWLVGPCEVVEVSVNDKPM-KSGVQIQKCRYLETSGCTGLCVNSCKMPTQY 114
Query: 210 FFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEP-CLDICPTSSRRKEV 264
FF +G+PL MEPNF D SC FG PP +DD K+ C CPTSS+ EV
Sbjct: 115 FFTKELGMPLTMEPNFEDMSCLMIFGQTPPAFEDDLVFKQKCCTTYCPTSSQASEV 170
>gi|158334065|ref|YP_001515237.1| hypothetical protein AM1_0881 [Acaryochloris marina MBIC11017]
gi|158304306|gb|ABW25923.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 214
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/198 (35%), Positives = 108/198 (54%), Gaps = 9/198 (4%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T YK D F+ F K+ Q G SE GY+GL++L +M +G+++ ++A
Sbjct: 2 KTVYKDNWFDRAFIWLFSEKMAQVAGQKSELAGYEGLVDLSVQIM-RGRNAKQQQEALAT 60
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDL--PDGT 176
+L SL P VL + L +P K A + WL+G C + V++ +G
Sbjct: 61 -VLRSLIPSFVLLGIRTLFNPT---KRILEWNAWFASRMFTWLVGPCDLTEVEVVGENGQ 116
Query: 177 --SCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
+ +SG+ +++C+YLEES CVG+C+N CKLPTQ FF G PL + PNF D SC+ F
Sbjct: 117 LRTQRSGLHIQKCRYLEESGCVGMCVNMCKLPTQDFFAKEFGFPLTLTPNFEDMSCEMVF 176
Query: 235 GILPPLPKDDTTLKEPCL 252
G P +++ +PCL
Sbjct: 177 GHPAPPIEEEAVYNQPCL 194
>gi|359457440|ref|ZP_09246003.1| hypothetical protein ACCM5_01849 [Acaryochloris sp. CCMEE 5410]
Length = 214
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/199 (35%), Positives = 110/199 (55%), Gaps = 11/199 (5%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T YK D F+ F K+ Q G SE GY+GL++L +M +G+++ ++A
Sbjct: 2 KTVYKDNWFDRAFIWLFSEKMAQVAGQKSELAGYEGLVDLSVQIM-RGRNAKQQQEALAT 60
Query: 119 RILVSLFPPLVLKLYKILISPLAGG-KIAAMMVARVTALTCQWLMGHCTVNSVDL--PDG 175
+L SL P VL + L +P + A +R+ WL+G C + V++ +G
Sbjct: 61 -VLRSLIPSFVLLGIRTLFNPTQRILEWNAWFASRMFT----WLVGPCDLTEVEVVGENG 115
Query: 176 T--SCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFK 233
+ +SG+ +++C+YLEES CVG+C+N CKLPTQ FF G PL + PNF D SC+
Sbjct: 116 QLRTQRSGLHIQKCRYLEESGCVGMCVNMCKLPTQDFFAKEFGFPLTLTPNFEDMSCEMV 175
Query: 234 FGILPPLPKDDTTLKEPCL 252
FG P +++ +PCL
Sbjct: 176 FGHSAPPIEEEAVYNQPCL 194
>gi|385763978|gb|AFI78792.1| putative D27 protein [Chlorokybus atmophyticus]
Length = 187
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 108/189 (57%), Gaps = 19/189 (10%)
Query: 87 SEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIA 146
S K GY G++E V+H +M+ K++ +A ++ FP + K+ A K
Sbjct: 16 SFKEGYLGMVE-VSHSLMRNKAAKQQHEA-----VLQGFPKVPEWFRKVF----AYTKWG 65
Query: 147 AMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLP 206
A + A VT +WL+G V VD+ +G + +S V +++C+YLE S CVG+C+N CK P
Sbjct: 66 AELNAWVTPTFFKWLVGPMEVRDVDI-NGVTQRSQVHIKKCRYLETSGCVGMCVNLCKFP 124
Query: 207 TQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAM 266
TQ FF + MG+PL M+PNF D SC+ FG +PP ++D +PC CPT+ + VA
Sbjct: 125 TQKFFTEEMGMPLTMKPNFDDLSCEMIFGQVPPPIEEDEARAQPCFATCPTA---RTVAP 181
Query: 267 NSNVEQCPK 275
N CPK
Sbjct: 182 N-----CPK 185
>gi|385763988|gb|AFI78797.1| putative D27 family protein [Nitella hyalina]
Length = 239
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/218 (36%), Positives = 121/218 (55%), Gaps = 15/218 (6%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
+T YK ++D +F F K+ Q G + GYD ++ ++ +M G+S ++ +
Sbjct: 35 KTRYKDSLIDRIFQWLFSRKMAQITGRKAGFNGYDEFVD-ISRAVMNGRSPKKTQEVVR- 92
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
+L+SL PP + ++ L P + +A + A +T WL+G V V++ +G
Sbjct: 93 EVLMSLLPPNAPQTFRKLFPPT---QKSAELNALITTYFFAWLVGPSKVIEVEV-EGRKQ 148
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
SGV +E+C+YLE S CVG+CIN CKLPTQ FF + G+PL M P++ D SC+ FG P
Sbjct: 149 MSGVKIEKCRYLENSGCVGMCINMCKLPTQDFFTNDFGLPLTMNPDYEDMSCEMIFGQAP 208
Query: 239 PLPKDDTTLKEPCL-DICPTSSRRKEVAMNSNVEQCPK 275
P P++D LK+PC IC T+ +V CPK
Sbjct: 209 PPPEEDPALKQPCYAAICSTAV--------PDVAYCPK 238
>gi|326524313|dbj|BAK00540.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 264
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 114/206 (55%), Gaps = 19/206 (9%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSE-KPG------------YDGLIELVNHLMMK 105
+TEY+ G L+ F+ F K+ + G + PG Y+ ++ V+ +M
Sbjct: 45 KTEYRDGPLERAFMGLFARKMEKFAGRKKKPDPGGEEEKKAVWEWDYESFVD-VSRRVMV 103
Query: 106 GKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHC 165
G+S + ++A + +L+S+ PP + +K L P + A A +T WL+
Sbjct: 104 GRSRAQQQEAVR-EVLLSMLPPGAPEQFKKLFPPT---RWACEFNAALTVPFFHWLVDPS 159
Query: 166 TVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNF 225
V V++ DG +SGV +++C+YLE S CVG+C+N CK+PTQ+FF D G+PL M PNF
Sbjct: 160 EVIEVEV-DGVKQRSGVLIKKCRYLENSGCVGMCVNMCKIPTQSFFTDEFGLPLTMNPNF 218
Query: 226 SDYSCQFKFGILPPLPKDDTTLKEPC 251
D SC+ +G +PP ++D K+PC
Sbjct: 219 EDMSCEMIYGQVPPPLEEDPVSKQPC 244
>gi|449455260|ref|XP_004145371.1| PREDICTED: uncharacterized protein LOC101219340 [Cucumis sativus]
gi|449472846|ref|XP_004153712.1| PREDICTED: uncharacterized protein LOC101218896 [Cucumis sativus]
gi|449509343|ref|XP_004163561.1| PREDICTED: uncharacterized protein LOC101223880 [Cucumis sativus]
Length = 253
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 72/232 (31%), Positives = 112/232 (48%), Gaps = 6/232 (2%)
Query: 29 QLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSE 88
+++ R S+L + +P + E+ + T Y D + + + G S+
Sbjct: 21 KMKLRRKSILCFGVLTRPAEGELIEETRKTNTVYTDNWFDKIAIDHLSQAVQATSGWRSK 80
Query: 89 KPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAM 148
K GY+ L+E+ M + + I+ L FP +L L K L L K+A
Sbjct: 81 KSGYESLVEVTT--MASRNFNHIKQKEVVIQALGMAFPKPILSLIKAL---LPQSKLARE 135
Query: 149 MVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQ 208
A T + WL+G C V + G ++ V + +C++LE++ C G+CIN CK P Q
Sbjct: 136 YFAAFTTVFFAWLVGPCEVKESEF-KGKREKNVVQIHKCRFLEQTNCAGMCINLCKFPCQ 194
Query: 209 TFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSR 260
F KD +G+P+ M PNF D SC+ FG PP DD LK+PC +C T +
Sbjct: 195 DFIKDSLGMPVTMVPNFDDMSCEMIFGKEPPASIDDPALKQPCYKLCKTKEK 246
>gi|242080297|ref|XP_002444917.1| hypothetical protein SORBIDRAFT_07g001450 [Sorghum bicolor]
gi|241941267|gb|EES14412.1| hypothetical protein SORBIDRAFT_07g001450 [Sorghum bicolor]
Length = 262
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 118/217 (54%), Gaps = 21/217 (9%)
Query: 58 SRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPG--------------YDGLIELVNHLM 103
+TEY+ G L+ F+ F K+ + + P Y+ +++ +M
Sbjct: 41 EKTEYRDGPLERAFMGLFARKMEKYATKKKQPPSPEPEEKKKAVWDWDYESFVDVSRRVM 100
Query: 104 MKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMG 163
+ G++ + ++A + +L+S+ PP + ++ L P + A A +T +WL+G
Sbjct: 101 V-GRTHAQQQEAVR-EVLLSMLPPGAPEQFRKLFPPT---RWACEFNAALTVPFFRWLVG 155
Query: 164 HCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEP 223
V V++ DG +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M P
Sbjct: 156 PSEVIEVEV-DGVKQRSGVLIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNP 214
Query: 224 NFSDYSCQFKFGILPPLPKDDTTLKEPCL-DICPTSS 259
NF D SC+ +G +PP ++D K+PC ++C S+
Sbjct: 215 NFEDMSCEMIYGQVPPPLEEDPVSKQPCYPNLCSMST 251
>gi|224057988|ref|XP_002299424.1| predicted protein [Populus trichocarpa]
gi|222846682|gb|EEE84229.1| predicted protein [Populus trichocarpa]
Length = 237
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 120/212 (56%), Gaps = 17/212 (8%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQ-----EVGLDSEKPG-----YDGLIELVNHLMMKGKS 108
+T+Y G + F++ F K+ + + G S++ G Y+ ++ V+ +M+G++
Sbjct: 21 KTKYMDGFFEKAFMTLFARKMEKFAAPAKNGSASKEKGWFDYDYESFVD-VSKRVMQGRN 79
Query: 109 SSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVN 168
++ + +L+S+ PP + +K L P K AA A +T QWL+G +
Sbjct: 80 RKQQQEVVR-EVLLSMLPPGAPEQFKKLFPPT---KWAAEFNAALTVPFFQWLVGP-SEV 134
Query: 169 SVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDY 228
+G +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D
Sbjct: 135 VEVEVNGEKQKSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMIPNFEDM 194
Query: 229 SCQFKFGILPPLPKDDTTLKEPCL-DICPTSS 259
SC+ +G +PP ++D +K+PCL DIC +S
Sbjct: 195 SCEMVYGQVPPPFEEDPVVKQPCLADICTIAS 226
>gi|356500156|ref|XP_003518899.1| PREDICTED: uncharacterized protein LOC100782912 [Glycine max]
Length = 249
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 114/209 (54%), Gaps = 8/209 (3%)
Query: 48 APPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEV-GLDSEKPGYDGLIELVNHLMMKG 106
A I E+ + YK G+ D + ++ + +K VQE GL + K GY+ L+E +
Sbjct: 34 ADDISGEARKTNHVYKDGLFDRIAIN-YLSKCVQEATGLKNSKSGYESLVEAAT--LASQ 90
Query: 107 KSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCT 166
+ S + I+ L FP +L L + L+ P K A + A T L WL+G
Sbjct: 91 RFSPIEQHQLVIQSLDRAFPKPMLLLIRTLLPP---SKFARKLFAIFTTLFFAWLVGPSE 147
Query: 167 VNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFS 226
V ++ +G ++ V +++C++LEE+ CVG+CIN CKLP+Q+F KD +G+ + M PNF
Sbjct: 148 VRESEV-EGRRERNVVHIKKCRFLEETNCVGMCINLCKLPSQSFIKDSLGMSVNMVPNFD 206
Query: 227 DYSCQFKFGILPPLPKDDTTLKEPCLDIC 255
D SC+ FG PP DD L +PC +C
Sbjct: 207 DMSCEMIFGEDPPESTDDPALNQPCFKLC 235
>gi|449495159|ref|XP_004159751.1| PREDICTED: uncharacterized LOC101210861 [Cucumis sativus]
Length = 263
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 132/261 (50%), Gaps = 27/261 (10%)
Query: 6 LRPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPG 65
L+ +I F ++PP+ K+ + R + +S +P P +T+Y G
Sbjct: 4 LKLQSIQFFTAPPKEIRNRKIKSRFI--RCGIAEASGEPAPLG---------QKTKYNDG 52
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGL----------IELVNHLMMKGKSSSDARDA 115
+ +F++ F K+ + ++ +GL V+ +M+GK+ +
Sbjct: 53 PFEKVFMTLFARKMEKFANAKEQRKKKEGLWWDFLYDYERFVDVSKRVMQGKNRMQQQIV 112
Query: 116 AQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDG 175
+ +L+S+ PP ++ L P K A A +T QWL+G V V++ +G
Sbjct: 113 VR-EVLLSMLPPGAPAQFRKLFPPT---KWACEFNALITVPFFQWLVGPSEVVEVEV-NG 167
Query: 176 TSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFG 235
+SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+ +G
Sbjct: 168 IKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFEDMSCEMIYG 227
Query: 236 ILPPLPKDDTTLKEPCL-DIC 255
+PP ++D ++PC DIC
Sbjct: 228 QVPPPFEEDPVSEQPCYKDIC 248
>gi|385763984|gb|AFI78795.1| putative D27 protein [Klebsormidium flaccidum]
Length = 165
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 79/130 (60%), Gaps = 9/130 (6%)
Query: 146 AAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKL 205
A + AR+T WL+G + V++ DG +SGV +ERC+YLEES C G+C+N CK
Sbjct: 43 GAELNARITPAFFTWLVGPMEIFEVEI-DGVKQRSGVQIERCRYLEESGCTGMCVNLCKF 101
Query: 206 PTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVA 265
PTQTFF + +G+PL MEP F D SCQ FG PP +DD +K+PC +CPT+
Sbjct: 102 PTQTFFTEELGMPLSMEPKFEDLSCQMIFGKKPPDIEDDEVMKQPCFALCPTA------- 154
Query: 266 MNSNVEQCPK 275
N CPK
Sbjct: 155 -NVQAPACPK 163
>gi|116784951|gb|ABK23534.1| unknown [Picea sitchensis]
Length = 275
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 135/269 (50%), Gaps = 30/269 (11%)
Query: 18 PRSHHIPKLHPQLQSPRFSVLRSSI------QPQPQAPPIKRESDTSRTEYKPGVLDDLF 71
PR ++ Q ++ F + R I +P Q P+ + +T Y + D +F
Sbjct: 17 PRHNYSSSYKFQRKNHSFGLRRKMIIECGIAEPSGQPAPMGQ-----KTRYNDNLFDKVF 71
Query: 72 LSSFRNKLVQEVGLDS--EKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLV 129
++ F K+ G S + GY+ +E +M+ G++ ++A + ++L+S+ PP
Sbjct: 72 MALFARKMNNIAGGKSTGREEGYERFVETSRSVML-GRTPKQQQEAVR-QVLLSMLPPGA 129
Query: 130 LKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKY 189
+ ++ L P K AA A VTA WL+G + +G +SGV +++C+Y
Sbjct: 130 PERFRKLFPPT---KWAAEFNAAVTAPFFHWLVGP-SEVVEVEVNGVKQKSGVHIKKCRY 185
Query: 190 LEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKE 249
LE S CVG+C+N CKLPTQ FF + G+PL M PNF D SC +G PP P++D K+
Sbjct: 186 LENSGCVGMCVNMCKLPTQDFFTNEFGLPLTMTPNFEDMSCDMVYGQPPPPPEEDPAFKQ 245
Query: 250 PCL-----------DICPTSSRRKEVAMN 267
PC + CP S RK + M+
Sbjct: 246 PCYAAFCSMAQPDSEACPKLSVRKRLDMS 274
>gi|384250929|gb|EIE24407.1| hypothetical protein COCSUDRAFT_61832 [Coccomyxa subellipsoidea
C-169]
Length = 165
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/163 (38%), Positives = 92/163 (56%), Gaps = 15/163 (9%)
Query: 99 VNHLMMKGKSSSDARDAAQIRILVSLFP---PLVLKLYKILISPLAGGKIAAMMVARVTA 155
V+ +MKG+S++ R+A +++ FP P K + K A + AR+T
Sbjct: 4 VSRALMKGRSAAQQREA-----VIAGFPSVPPWFRKAFPY-------SKWGAGLNARITP 51
Query: 156 LTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYM 215
WL+G L DGT +SGV +ERC+YL ESKC G+C+N CK P QTFF + +
Sbjct: 52 AFFTWLVGPMQTVEATLSDGTVQKSGVHIERCRYLAESKCAGMCVNLCKAPVQTFFTEEL 111
Query: 216 GVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
G+PL M+PNF D+SC+ FG+ P ++D ++ CL C T
Sbjct: 112 GMPLTMKPNFEDFSCEMVFGLTPAPLQEDEVMQAACLKECATG 154
>gi|385763990|gb|AFI78798.1| putative D27 family protein, partial [Penium margaritaceum]
Length = 198
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 106/190 (55%), Gaps = 10/190 (5%)
Query: 73 SSFRNKLVQEVG--LDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVL 130
+S++ K+ G + S+ GYD L++ +M +G++ R +L+S+ PP
Sbjct: 1 ASYQRKMEYFTGSKVSSKLEGYDALVDAARRVM-QGRTPEQQRQVV-ANVLMSMLPPNAP 58
Query: 131 KLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYL 190
++ L P K++A + A +T QWL+G + V++ +G SGV + +C+YL
Sbjct: 59 ATFRRLFPPT---KLSAEINAAITVPLFQWLVGPAKLTEVEV-NGVKQWSGVKITKCRYL 114
Query: 191 EESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKE- 249
E S CVG+C+N CKLPTQ FF + G+PL M PNF D SC+ FG +PP DD L+
Sbjct: 115 ESSGCVGMCVNMCKLPTQDFFTNDFGLPLTMTPNFEDMSCEMVFGQMPPALADDPALQNT 174
Query: 250 PCL-DICPTS 258
C D CP +
Sbjct: 175 XCFKDTCPMA 184
>gi|385763994|gb|AFI78800.1| putative D27 family protein, partial [Penium margaritaceum]
Length = 188
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 101/178 (56%), Gaps = 9/178 (5%)
Query: 84 GLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGG 143
G+ S+ GYD L++ +M +G++ R +L+S+ PP L+ L P
Sbjct: 3 GVSSKLEGYDALVDAARRVM-QGRTPEQQRQVVA-NVLMSMLPPNAPPLFXRLFPPT--- 57
Query: 144 KIAAMMVARVTA-LTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINT 202
K++A + A +T L QWL+G + V++ +G SGV + +C+YLE S CVG+C+N
Sbjct: 58 KLSAEINAAITVPLLSQWLVGPAKLTEVEV-NGVKQWSGVKITKCRYLESSGCVGMCVNM 116
Query: 203 CKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKE-PCL-DICPTS 258
CKLPTQ FF + G+PL M PNF D SC+ FG +PP DD L+ C D CP +
Sbjct: 117 CKLPTQDFFTNDFGLPLTMTPNFEDMSCEMVFGQMPPALADDPALQNTTCFKDTCPMA 174
>gi|307108787|gb|EFN57026.1| hypothetical protein CHLNCDRAFT_57404 [Chlorella variabilis]
Length = 277
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 106/197 (53%), Gaps = 8/197 (4%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQI 118
RT Y+ VLD + F + + +++G +DG ++L +M +G++S++ ++
Sbjct: 35 RTVYRDNVLDRAMIYYFSSVMSKQLGGKPFDGSWDGFVDLSREIM-RGRNSAEQQETV-A 92
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
+L L PP + ++ PL K A A +T L WL+G + V++
Sbjct: 93 GVLAGLLPPQAPERFRRWF-PL--NKFNAETNAFITVLGFAWLVGASELKEVEVEFEGRT 149
Query: 179 Q---SGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFG 235
Q SGV +++C+YLE S CVG+C N CKLPTQ FF + G+PL M+PNF D SC+ FG
Sbjct: 150 QKWMSGVKIKKCRYLESSGCVGMCTNMCKLPTQKFFTETFGLPLTMDPNFEDLSCEMVFG 209
Query: 236 ILPPLPKDDTTLKEPCL 252
PP + D +PC
Sbjct: 210 RAPPPVELDKVYSQPCF 226
>gi|356503568|ref|XP_003520579.1| PREDICTED: uncharacterized protein LOC100813404 [Glycine max]
Length = 253
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 106/201 (52%), Gaps = 7/201 (3%)
Query: 56 DTSRTE-YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARD 114
+T +T YK + D L + + + GL + K GY+ L+E + K K +
Sbjct: 46 ETRKTNAYKDNLFDRLAIHHLSKSVQEATGLGNNKSGYESLVEAAT--VAKMKFDPIQQQ 103
Query: 115 AAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPD 174
I+ L FP +L L K L+ P K++ A T L WL+G V ++ +
Sbjct: 104 EVIIQALHRAFPKPILSLIKTLLPP---SKLSREYFAVFTTLFFAWLVGPSEVRESEV-N 159
Query: 175 GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
G ++ + C++LEE+ CVG+CIN CK+P+Q+F KD +G+P+ M PNF D SC+ F
Sbjct: 160 GRREKNLLNNNLCRFLEETNCVGMCINLCKMPSQSFIKDTLGMPVNMVPNFDDMSCEMIF 219
Query: 235 GILPPLPKDDTTLKEPCLDIC 255
G PP DD LK+PC +C
Sbjct: 220 GQEPPASTDDPALKQPCYKLC 240
>gi|449456933|ref|XP_004146203.1| PREDICTED: uncharacterized protein LOC101210861 [Cucumis sativus]
Length = 263
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 131/261 (50%), Gaps = 27/261 (10%)
Query: 6 LRPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPG 65
L+ +I F ++PP+ K+ + R + +S +P P +T+Y G
Sbjct: 4 LKLQSIQFFTAPPKEIRNRKIKSRFI--RCGIAEASGEPAPLG---------QKTKYNDG 52
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGL----------IELVNHLMMKGKSSSDARDA 115
+ +F++ F K+ + ++ +GL V+ +M+GK+ +
Sbjct: 53 PFEKVFMTLFARKMEKFANAKEQRKKKEGLWWDFLYDYERFVDVSKRVMQGKTRMQQQIV 112
Query: 116 AQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDG 175
+ +L+S+ PP ++ L P K A A +T QWL+G V V++ +G
Sbjct: 113 VR-EVLLSMLPPGAPAQFRKLFPPT---KWACEFNALITVPFFQWLVGPSEVVEVEV-NG 167
Query: 176 TSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFG 235
+SGV +++ +YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+ +G
Sbjct: 168 IKQRSGVHIKKLRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFEDMSCEMIYG 227
Query: 236 ILPPLPKDDTTLKEPCL-DIC 255
+PP ++D K+PC DIC
Sbjct: 228 QVPPPFEEDPVSKQPCYKDIC 248
>gi|361064616|gb|AEW07379.1| dwarf 27 [Medicago truncatula]
Length = 252
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 105/205 (51%), Gaps = 6/205 (2%)
Query: 51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSS 110
I E+ YK D L ++ + G+ + K G+D L+E + K ++
Sbjct: 42 ISEETLRKTNVYKDNWFDKLAINHLSKSVQAATGISNNKSGFDSLVEAAT--VASQKFNT 99
Query: 111 DARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV 170
+ + L FP +L + + ++ P K+A A T + WL+G V
Sbjct: 100 TQQQGIILDALDRAFPKPILSVIRRVMPP---SKLAREYFAVFTTIFFAWLLGPSEVRES 156
Query: 171 DLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSC 230
++ +G ++ V +++C++LEE+ CVG+CIN CK+P+Q F KD G+P+ M PNF D SC
Sbjct: 157 EI-NGRREKNIVHIKKCRFLEETNCVGMCINLCKMPSQLFIKDSFGMPVNMVPNFDDMSC 215
Query: 231 QFKFGILPPLPKDDTTLKEPCLDIC 255
+ FG PP DD LK+PC +C
Sbjct: 216 EMIFGQEPPASTDDPALKQPCYKLC 240
>gi|223994801|ref|XP_002287084.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978399|gb|EED96725.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 166
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 94/169 (55%), Gaps = 4/169 (2%)
Query: 92 YDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLY-KILISPLAGGKIAAMMV 150
+ G+IE+ + + + + + AQ +LVSLFP +L Y P + +A M
Sbjct: 1 FMGIIEIAARMNSQYSNRTQVQTIAQ-DVLVSLFPTFILDRYPSWFAKPFP--EFSAKMC 57
Query: 151 ARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTF 210
A T + WLMG +VN++ + GV V+RC++LEES+C +C+N+CK+PTQ F
Sbjct: 58 AWATCVGGTWLMGESSVNNIPNMEIGGENMGVLVQRCRFLEESQCASICVNSCKIPTQNF 117
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSS 259
F+D MG+ L M P++ CQF FG LP ++ PCL CP+S
Sbjct: 118 FRDNMGLALTMTPDYETGECQFAFGKLPTEEEETLAKDTPCLMRCPSSG 166
>gi|388491274|gb|AFK33703.1| unknown [Medicago truncatula]
Length = 266
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 118/234 (50%), Gaps = 32/234 (13%)
Query: 59 RTEYKPGVLDDLFLSSFRNKL--------------VQEVGL-DSEKPGYDGLIELVNHLM 103
+T Y + + +F++ F K+ ++ GL D + Y+ +++ +M
Sbjct: 45 KTRYNDSIFEKVFMTLFARKMEPFAEPVIGNAKKKKEKKGLLDVWEYDYESFVDVSKRVM 104
Query: 104 MKGKSSSDARDAAQIR-ILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLM 162
++ S R +R +L+S+ PP ++ L P + AA A +T WL+
Sbjct: 105 LR---RSRLRQQQVVREVLLSMLPPGAPAQFRKLFPPT---RWAAEFNAALTVPFFHWLV 158
Query: 163 GHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLME 222
G V V++ +G +SGV +++C+YLE S CVG C+N CK+PTQ FF + G+PL M
Sbjct: 159 GPSEVIEVEI-NGVKQKSGVHIKKCRYLENSGCVGQCVNMCKIPTQDFFTNEFGLPLTMI 217
Query: 223 PNFSDYSCQFKFGILPPLPKDDTTLKEPCL-DICPTSSRRKEVAMNSNVEQCPK 275
PNF D SC +G PP +DD K+PC DICP + N N CPK
Sbjct: 218 PNFEDMSCDMVYGQTPPSFEDDPVSKQPCYADICPVA--------NPNSSICPK 263
>gi|326492644|dbj|BAJ90178.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 275
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 105/203 (51%), Gaps = 8/203 (3%)
Query: 50 PIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSS 109
P++ + + T Y D+L + KL + G+ + K GY GLIE + +
Sbjct: 67 PVRETAAATTTVYHDTWFDNLAIGYLSRKLQEASGIKNGKHGYQGLIEAAVAISRIFRLD 126
Query: 110 SDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNS 169
+ A L P ++ + K+++ P + + A T + WL+G C V
Sbjct: 127 TQCEIVAGA--LERAMPSYIVTMIKVMMPP---SRFSREYFAAFTTIFFPWLVGPCEVRE 181
Query: 170 VDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYS 229
++ DGT ++ V++ +C++LE + CVG+C N CK+P+Q F +D +GV + M PNF D S
Sbjct: 182 SEV-DGTREKNVVYIPKCRFLESTNCVGMCTNLCKIPSQKFMQDSLGVSVYMSPNFEDMS 240
Query: 230 CQFKFGILPPLPKDDTTLKEPCL 252
C+ FG P P+DD LK+PC
Sbjct: 241 CEMIFGQQP--PEDDPALKQPCF 261
>gi|225426574|ref|XP_002279815.1| PREDICTED: uncharacterized protein LOC100256431 [Vitis vinifera]
Length = 266
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 114/209 (54%), Gaps = 18/209 (8%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQ-----EVGLDSEKP------GYDGLIELVNHLMMKGK 107
+T Y G + +F++ F K+ + + G+++EK Y+ ++ V+ +M+G+
Sbjct: 49 KTRYNDGFFEKVFMTLFARKMGRFAAPAKSGIEAEKKRSWWDCDYERFVD-VSKRVMQGR 107
Query: 108 SSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV 167
S ++ + +L+S+ PP ++ L P + AA A T WL+G +
Sbjct: 108 SRMQQQEVVR-EVLLSMLPPGAPDQFRKLFPPT---RWAAEFNAAFTVPFFAWLVGP-SE 162
Query: 168 NSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSD 227
+G +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D
Sbjct: 163 VVEVEVNGVKQRSGVLIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMTPNFED 222
Query: 228 YSCQFKFGILPPLPKDDTTLKEPCL-DIC 255
SC+ +G +PP ++D K+PC DIC
Sbjct: 223 MSCEMVYGQVPPPFEEDPVSKQPCFSDIC 251
>gi|452821691|gb|EME28718.1| hypothetical protein Gasu_37700 [Galdieria sulphuraria]
Length = 248
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 116/217 (53%), Gaps = 14/217 (6%)
Query: 65 GVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSL 124
G L + FR + +E G S+ GYDGL+E +L K +S + R AA RI+ SL
Sbjct: 38 GPLSSTAIHLFRKAIERETGRVSKHRGYDGLVEDCKYLQ-KYRSPVEQR-AAVCRIISSL 95
Query: 125 F-PPLVLKLYKIL--ISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSG 181
F P+ ++L++ L I P+ A + A T + QWL+G C +++ + + G
Sbjct: 96 FCAPVGIQLFRSLLGIMPVTW---AYHLSAIFTQVFFQWLVGPCQAHAIH---NETFKRG 149
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
VF+ +C++LEES+C G+C+N CK+PTQ FF + +G P MEPN+ SCQ FG PLP
Sbjct: 150 VFISKCRFLEESRCRGMCVNLCKIPTQQFFNNTLGFPFTMEPNYETGSCQITFG-KSPLP 208
Query: 242 KD-DTTLKEPCLDIC-PTSSRRKEVAMNSNVEQCPKA 276
D D ++ C C P + +S C K
Sbjct: 209 LDQDIAVQIRCSGNCLPWHPEKTNSTASSQTHDCIKT 245
>gi|159479726|ref|XP_001697941.1| hypothetical protein CHLREDRAFT_205884 [Chlamydomonas reinhardtii]
gi|158274039|gb|EDO99824.1| predicted protein [Chlamydomonas reinhardtii]
Length = 290
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 118/227 (51%), Gaps = 17/227 (7%)
Query: 37 VLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLV------QEVGLDSEKP 90
V+ S+ +P + K++ +T Y LD LF+ + K+ Q V + E+P
Sbjct: 25 VVASATPAKPVSDGPKKDPFAEKTVYNDNWLDLLFIKLYSKKMADCLPASQGVHV-PEQP 83
Query: 91 GYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMV 150
YD + + N +M +G+ S + R + +L SL P +++ L P K +A
Sbjct: 84 VYDDFVRISNEIM-RGRGSKEQRLVVR-DVLNSLMPKEAPPVFRALFPPT---KFSAEFN 138
Query: 151 ARVTALTCQWLMGHCTVNSVDL---PDGTS--CQSGVFVERCKYLEESKCVGVCINTCKL 205
A + +L+ WL+G + D+ PDG +S V +++C+YLE S CVG+C+N CK+
Sbjct: 139 ALIASLSFFWLVGASELKEEDVVVGPDGEKRRQRSVVHIKKCRYLEASGCVGMCVNMCKV 198
Query: 206 PTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCL 252
PTQT+F + G+PL M PNF D SC FG +PP D +PC
Sbjct: 199 PTQTYFTEEFGLPLTMNPNFEDLSCDMIFGQMPPPVHLDPVYTQPCF 245
>gi|428172601|gb|EKX41509.1| hypothetical protein GUITHDRAFT_164374 [Guillardia theta CCMP2712]
Length = 398
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 102/202 (50%), Gaps = 10/202 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y LD + LS FR+ + +E G SEK G GL+E +++ +++A++ L
Sbjct: 187 YTESPLDKVLLSIFRSLVAKETGFQSEKEGILGLLEQGREYLLRPGQTAEAQNRMVYNTL 246
Query: 122 VSLFPPLVLKLYKILISPLAGGK------IAAMMVARVTALTCQWLMGHCTVNSVDLPDG 175
L P++ YK+ +S + GK AA + + VT +L+G N DG
Sbjct: 247 AGLLTPVMPPFYKVFMSGIIAGKQYGPWPWAAWLTSFVTPTFFGFLVGPSRPNRRK--DG 304
Query: 176 TSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFG 235
G+ VE+CK+L+ES C +CIN CKLP Q FF +G+PL + PNF CQ+ FG
Sbjct: 305 Q--LGGLVVEKCKFLQESGCKSLCINQCKLPAQQFFSQELGLPLTVTPNFETQECQWSFG 362
Query: 236 ILPPLPKDDTTLKEPCLDICPT 257
P D + CL CPT
Sbjct: 363 EHPIDVDKDPRIPRGCLSECPT 384
>gi|357462337|ref|XP_003601450.1| hypothetical protein MTR_3g080840 [Medicago truncatula]
gi|357517075|ref|XP_003628826.1| hypothetical protein MTR_8g067370 [Medicago truncatula]
gi|355490498|gb|AES71701.1| hypothetical protein MTR_3g080840 [Medicago truncatula]
gi|355522848|gb|AET03302.1| hypothetical protein MTR_8g067370 [Medicago truncatula]
Length = 266
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 116/233 (49%), Gaps = 30/233 (12%)
Query: 59 RTEYKPGVLDDLFLSSFRNKL--------------VQEVGL-DSEKPGYDGLIELVNHLM 103
+T Y + + +F++ F K+ ++ GL D + Y+ +++ +M
Sbjct: 45 KTRYNDSIFEKVFMTLFARKMEPFAEPVIGNAKKKKEKKGLLDVWEYDYESFVDVSKRVM 104
Query: 104 MKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMG 163
++ S + +L+S+ PP ++ L P + AA A +T WL+G
Sbjct: 105 LR--RSRLQQQQVVREVLLSMLPPGAPAQFRKLFPPT---RWAAEFNAALTVPFFHWLVG 159
Query: 164 HCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEP 223
V V++ +G +SGV +++C+YLE S CVG C+N CK+PTQ FF + G+PL M P
Sbjct: 160 PSEVIEVEI-NGVKQKSGVHIKKCRYLENSGCVGQCVNMCKIPTQDFFTNEFGLPLTMIP 218
Query: 224 NFSDYSCQFKFGILPPLPKDDTTLKEPCL-DICPTSSRRKEVAMNSNVEQCPK 275
NF D SC +G PP +DD K+PC DICP + N N CPK
Sbjct: 219 NFEDMSCDMVYGQTPPSFEDDPVSKQPCYADICPVA--------NPNSSICPK 263
>gi|357156309|ref|XP_003577412.1| PREDICTED: uncharacterized protein LOC100825245 [Brachypodium
distachyon]
Length = 277
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 101/195 (51%), Gaps = 16/195 (8%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y D+L + L G+ + KPGY+GLIE + S R Q I+
Sbjct: 81 YHDSWFDNLAIGYLSRALQNASGIRNRKPGYEGLIEAAVAI------SRIFRLDTQCEIV 134
Query: 122 VSLF----PPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTS 177
S P ++ + K+++ P + + A T + WL+G C V ++ DGT
Sbjct: 135 ASALEQAMPSYIITMIKVMMPP---SRFSREYFAAFTTIFFPWLVGPCEVRESEV-DGTR 190
Query: 178 CQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGIL 237
++ V++ +C++LE + CVG+C N CK+P+Q F +D +GV + M PNF D SC+ FG
Sbjct: 191 EKNVVYIPKCRFLESTNCVGMCTNLCKIPSQKFMQDSLGVSVYMSPNFDDMSCEMIFGQQ 250
Query: 238 PPLPKDDTTLKEPCL 252
P P+DD LK+PC
Sbjct: 251 P--PEDDPALKQPCF 263
>gi|356536794|ref|XP_003536919.1| PREDICTED: uncharacterized protein LOC100814646 [Glycine max]
Length = 274
Score = 116 bits (291), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 112/205 (54%), Gaps = 8/205 (3%)
Query: 51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEV-GLDSEKPGYDGLIELVNHLMMKGKSS 109
I E+ + YK G+ D + ++ + +K VQE GL + K GY+ L++ + + S
Sbjct: 38 ISGEARKTNHVYKDGLFDRITIN-YLSKCVQEATGLRNNKSGYESLVDAAT--VASQRFS 94
Query: 110 SDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNS 169
+ I+ L FP +L L + L+ P K A + A T L WL+G V
Sbjct: 95 PVEQHQLVIQSLDRAFPKPMLLLIRKLLPP---SKFARKLFAVFTTLFFAWLVGPSEVRE 151
Query: 170 VDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYS 229
++ +G ++ V +++C++LE + CVG+CIN CKLP+Q+F KD +G+ + M PNF D S
Sbjct: 152 SEV-EGRRERNVVHIKKCRFLEGTNCVGMCINLCKLPSQSFIKDSLGISVNMVPNFDDMS 210
Query: 230 CQFKFGILPPLPKDDTTLKEPCLDI 254
C+ FG PP DD LK+PC +
Sbjct: 211 CEMIFGQDPPESTDDPALKQPCFKL 235
>gi|18379048|ref|NP_563673.1| uncharacterized protein [Arabidopsis thaliana]
gi|33589716|gb|AAQ22624.1| At1g03051 [Arabidopsis thaliana]
gi|332189400|gb|AEE27521.1| uncharacterized protein [Arabidopsis thaliana]
Length = 264
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/238 (30%), Positives = 128/238 (53%), Gaps = 27/238 (11%)
Query: 39 RSSIQPQPQAPPI-----KRESDTSRTEY---KPGVLDDLFLS----SFRNKLVQE---V 83
RSSI P + P+ K +T+R E K ++D F S ++ +K +Q+ +
Sbjct: 29 RSSISPTLCSKPVYSGKLKAAKETARIETSNTKNASIEDSFFSKIAINYLSKNLQDAAGI 88
Query: 84 GLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSL---FPPLVLKLYKILISPL 140
S+ YD L++ + + D + + +L SL P ++ L K+ P
Sbjct: 89 SSSSKSTDYDRLVDTATRV----SRNFDTKQQHEF-VLSSLDRALPTVISSLIKMAFPP- 142
Query: 141 AGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCI 200
K++ + A T ++ WL+G V ++ +G +S V++E+C++LE+S CVG+C
Sbjct: 143 --SKVSRELFALFTTISFAWLVGPSEVRETEV-NGRKEKSVVYIEKCRFLEQSNCVGMCT 199
Query: 201 NTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
+ CK+P+Q F K+ +G+P+ MEP+F+D SC+ FG PP +DD +K+PC + C ++
Sbjct: 200 HICKIPSQIFIKNSLGMPIYMEPDFNDLSCKMMFGREPPEIEDDPAMKQPCFEFCKSN 257
>gi|297848528|ref|XP_002892145.1| hypothetical protein ARALYDRAFT_470283 [Arabidopsis lyrata subsp.
lyrata]
gi|297337987|gb|EFH68404.1| hypothetical protein ARALYDRAFT_470283 [Arabidopsis lyrata subsp.
lyrata]
Length = 264
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/238 (30%), Positives = 127/238 (53%), Gaps = 27/238 (11%)
Query: 39 RSSIQPQPQAPPI-----KRESDTSRTE---YKPGVLDDLFLS----SFRNKLVQE---V 83
RSSI P + P+ K +T+R E K + D F S ++ +K +Q+ +
Sbjct: 29 RSSISPTLSSKPVYSGELKAAKETARIEPSNTKNASIQDSFFSKIAINYLSKNLQDAAGI 88
Query: 84 GLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSL---FPPLVLKLYKILISPL 140
S+ YD L++ + + D + + +L SL P ++ L K+ P
Sbjct: 89 SSSSKSTDYDRLVDTATRVA----RNFDTKQQHEF-VLSSLDRALPTVISSLIKMAFPP- 142
Query: 141 AGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCI 200
K++ + A T ++ WL+G V ++ +G +S V++E+C++LE+S CVG+C
Sbjct: 143 --SKLSRELFALFTTISFVWLVGPSEVRETEV-NGRKEKSVVYIEKCRFLEQSNCVGMCT 199
Query: 201 NTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
+ CK+P+Q F K+ +G+P+ MEP+F+D SC+ FG PP +DD +K+PC + C ++
Sbjct: 200 HICKIPSQIFIKNSLGMPIYMEPDFNDLSCKMMFGREPPEIEDDPAMKQPCFEFCKSN 257
>gi|356547509|ref|XP_003542154.1| PREDICTED: uncharacterized protein LOC100780474 [Glycine max]
Length = 266
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 77/245 (31%), Positives = 122/245 (49%), Gaps = 27/245 (11%)
Query: 27 HPQLQSPRFS-VLR---SSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQ- 81
HP L S R + V+R +P + P+ + +T Y G+ + F++ F K+ +
Sbjct: 18 HPSLCSGRAAGVIRIRCGIAEPSGEPAPLGQ-----KTRYHDGIFEKAFMTLFARKMEKF 72
Query: 82 ---EVGLDSEKPG-------YDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLK 131
G E G Y+ +++ +M + + ++ L+S+ PP
Sbjct: 73 SDPPAGKARENKGWWDWGYDYESFVDVSRRVMQRRSRIQQQQVVREV--LLSMLPPGAPA 130
Query: 132 LYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLE 191
++ L P K AA A +T WL+G V V++ +G +SGV +++C+YLE
Sbjct: 131 QFRKLFPPT---KWAAEFNAALTVPFFDWLVGPSEVVEVEI-NGVKQKSGVHIKKCRYLE 186
Query: 192 ESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPC 251
S CVG+C+N CK+PTQ FF + G+PL M PNF D SC +G PP ++D K+PC
Sbjct: 187 NSGCVGMCVNMCKIPTQDFFTNEFGLPLTMTPNFEDMSCDMVYGQSPPTFEEDPVSKQPC 246
Query: 252 L-DIC 255
DIC
Sbjct: 247 YADIC 251
>gi|414591638|tpg|DAA42209.1| TPA: hypothetical protein ZEAMMB73_458126 [Zea mays]
Length = 216
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 96/191 (50%), Gaps = 8/191 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y+ D L + L + G+ + K GY+GLIE L + D + L
Sbjct: 18 YRDNWFDKLAIGYLSRNLQEASGMKNRKDGYEGLIEAA--LAISALFRVDQQWDTVASAL 75
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSG 181
FP +L + K+++ P + + A T + WL+G C V + DG ++
Sbjct: 76 QRAFPSYILTMIKVMMPP---SRFSREYFAAFTTVFFPWLVGPCEVRESQV-DGREERNV 131
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
V++ +C++LE + CVG+C N CK+P Q F +D +G + M PNF D SC+ FG P P
Sbjct: 132 VYIPKCRFLESTNCVGMCTNLCKIPCQRFIQDSLGTAVYMSPNFEDMSCEMIFGQQP--P 189
Query: 242 KDDTTLKEPCL 252
+DD LK+PC
Sbjct: 190 EDDPALKQPCF 200
>gi|302836227|ref|XP_002949674.1| hypothetical protein VOLCADRAFT_80804 [Volvox carteri f.
nagariensis]
gi|300265033|gb|EFJ49226.1| hypothetical protein VOLCADRAFT_80804 [Volvox carteri f.
nagariensis]
Length = 210
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 103/194 (53%), Gaps = 14/194 (7%)
Query: 88 EKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAA 147
EKP YD + + + +M KG++S R +L+SL PP ++ L P + +A
Sbjct: 12 EKPTYDDFVRISSEIM-KGRNSVQQR-VVVRDVLMSLLPPEAPPAFRKLFPPT---QFSA 66
Query: 148 MMVARVTALTCQWLMGHCTVNSVDL---PDGTS--CQSGVFVERCKYLEESKCVGVCINT 202
A + +L WL+G V D+ P+G +S V +++C+YLE S CVG+C+N
Sbjct: 67 EFNALIASLGFYWLVGESEVKEDDVVVGPNGEKRRQRSVVQIKKCRYLESSGCVGMCVNM 126
Query: 203 CKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI-CPTSSRR 261
CK+PTQTFF D G+PL M PNF D SC FG PP +D +PC + C ++ R
Sbjct: 127 CKIPTQTFFTDEFGLPLTMNPNFEDLSCSMIFGQAPPPMTEDPAYTQPCFAVQCSIAAGR 186
Query: 262 KEVAMNSNVEQCPK 275
+ S CPK
Sbjct: 187 TD---GSTPPPCPK 197
>gi|226501302|ref|NP_001144840.1| uncharacterized protein LOC100277925 precursor [Zea mays]
gi|195647726|gb|ACG43331.1| hypothetical protein [Zea mays]
gi|414591649|tpg|DAA42220.1| TPA: hypothetical protein ZEAMMB73_436579 [Zea mays]
Length = 262
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 96/191 (50%), Gaps = 8/191 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y+ D L + L + G+ + K GY+GLIE L + D + L
Sbjct: 64 YRDNWFDKLAIGYLSRNLQEASGMKNGKDGYEGLIEAA--LAISALFRVDQQWDTVASAL 121
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSG 181
FP +L + K+++ P + + A T + WL+G C V + DG ++
Sbjct: 122 QRAFPSYILTMIKVMMPP---SRFSREYFAAFTTVFFPWLVGPCEVRESQV-DGREERNV 177
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
V++ +C++LE + CVG+C N CK+P Q F +D +G + M PNF D SC+ FG P P
Sbjct: 178 VYIPKCRFLESTNCVGMCTNLCKIPCQRFIQDSLGTAVYMSPNFEDMSCEMIFGQQP--P 235
Query: 242 KDDTTLKEPCL 252
+DD LK+PC
Sbjct: 236 EDDPALKQPCF 246
>gi|254946546|gb|ACT91266.1| DWARF27 [Oryza sativa Japonica Group]
Length = 278
Score = 113 bits (282), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 109/217 (50%), Gaps = 15/217 (6%)
Query: 36 SVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGL 95
+ + S++Q + A P T Y+ D L + L + GL +EK GY+ L
Sbjct: 61 AAMMSTVQTETAAAP-------PATVYRDSWFDKLAIGYLSRNLQEASGLKNEKDGYESL 113
Query: 96 IELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTA 155
I+ L + S D + + L P +L + K+++ P + + A T
Sbjct: 114 IDAA--LAISRIFSLDKQSEIVTQALERALPSYILTMIKVMMPP---SRFSREYFAAFTT 168
Query: 156 LTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYM 215
+ WL+G C V ++ +G ++ V++ +C++LE + CVG+C N CK+P Q F +D +
Sbjct: 169 IFFPWLVGPCEVMESEV-EGRKEKNVVYIPKCRFLESTNCVGMCTNLCKIPCQKFIQDSL 227
Query: 216 GVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCL 252
G+ + M PNF D SC+ FG P P+DD LK+PC
Sbjct: 228 GMKVYMSPNFEDMSCEMIFGQQP--PEDDPALKQPCF 262
>gi|77551663|gb|ABA94460.1| hypothetical protein LOC_Os11g37650 [Oryza sativa Japonica Group]
gi|125577746|gb|EAZ18968.1| hypothetical protein OsJ_34503 [Oryza sativa Japonica Group]
Length = 236
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 109/217 (50%), Gaps = 15/217 (6%)
Query: 36 SVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGL 95
+ + S++Q + A P T Y+ D L + L + GL +EK GY+ L
Sbjct: 19 AAMMSTVQTETAAAP-------PATVYRDSWFDKLAIGYLSRNLQEASGLKNEKDGYESL 71
Query: 96 IELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTA 155
I+ L + S D + + L P +L + K+++ P + + A T
Sbjct: 72 IDAA--LAISRIFSLDKQSEIVTQALERALPSYILTMIKVMMPP---SRFSREYFAAFTT 126
Query: 156 LTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYM 215
+ WL+G C V ++ +G ++ V++ +C++LE + CVG+C N CK+P Q F +D +
Sbjct: 127 IFFPWLVGPCEVMESEV-EGRKEKNVVYIPKCRFLESTNCVGMCTNLCKIPCQKFIQDSL 185
Query: 216 GVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCL 252
G+ + M PNF D SC+ FG P P+DD LK+PC
Sbjct: 186 GMKVYMSPNFEDMSCEMIFGQQP--PEDDPALKQPCF 220
>gi|356499600|ref|XP_003518626.1| PREDICTED: uncharacterized protein LOC100815863 [Glycine max]
Length = 270
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 107/211 (50%), Gaps = 20/211 (9%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQ------EVGLDSEKPG-------YDGLIELVNHLMMK 105
+T Y G+ + F++ F K+ + G E G Y+ +++ +M +
Sbjct: 51 KTRYNDGIFEKAFMTLFARKMEKFADPPAPAGKARENKGWWDWGYDYESFVDVSRRVMQR 110
Query: 106 GKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHC 165
+ ++ L+S+ PP ++ L P K AA A +T WL+G
Sbjct: 111 RSRIQQQQVVREV--LLSMLPPGAPAQFRKLFPPT---KWAAEFNAALTVPFFDWLVGPS 165
Query: 166 TVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNF 225
V V++ +G +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF
Sbjct: 166 EVMEVEI-NGVKQKSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMTPNF 224
Query: 226 SDYSCQFKFGILPPLPKDDTTLKEPCL-DIC 255
D SC +G +PP ++D K+ C +IC
Sbjct: 225 EDMSCDMVYGQVPPTFEEDPVSKQACYANIC 255
>gi|115474501|ref|NP_001060847.1| Os08g0114100 [Oryza sativa Japonica Group]
gi|42409291|dbj|BAD10553.1| unknown protein [Oryza sativa Japonica Group]
gi|113622816|dbj|BAF22761.1| Os08g0114100 [Oryza sativa Japonica Group]
gi|125559935|gb|EAZ05383.1| hypothetical protein OsI_27588 [Oryza sativa Indica Group]
Length = 261
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 115/219 (52%), Gaps = 24/219 (10%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSE-----------------KPGYDGLIELVNH 101
+TEY+ G ++ F+ F K+ + + S + Y+ ++ V+
Sbjct: 38 KTEYRDGPVERAFMGLFARKMEKYAVVSSSGGKGKEKKKEKSSRSVWEWDYESFVD-VSR 96
Query: 102 LMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWL 161
+M G++ + ++A + +L+S+ PP + +K L P + A A +T WL
Sbjct: 97 RVMVGRTRAQQQEAVR-EVLLSMLPPGAPEQFKKLFPPT---RWACEFNAALTVPFFHWL 152
Query: 162 MGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLM 221
+G + +G +SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M
Sbjct: 153 VGP-SEVVEVEVNGVKQKSGVLIKKCRYLENSGCVGMCVNMCKIPTQNFFTNEFGLPLTM 211
Query: 222 EPNFSDYSCQFKFGILPPLPKDDTTLKEPCL-DICPTSS 259
PNF D SC+ +G +PP ++D K+PC ++C S+
Sbjct: 212 NPNFEDMSCEMIYGQVPPPLEEDPASKQPCYANLCSIST 250
>gi|307111727|gb|EFN59961.1| hypothetical protein CHLNCDRAFT_133062 [Chlorella variabilis]
Length = 164
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 90/160 (56%), Gaps = 10/160 (6%)
Query: 99 VNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTC 158
V+ MMKG+S+++ R+A ++ FP + ++ P + K A + A++T
Sbjct: 4 VSRSMMKGRSAAEQREA-----VIQGFPEVPEWFRRVF--PYS--KWGAELNAKITPAFF 54
Query: 159 QWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVP 218
WL+G ++ DG S V +ERC+YL ES C G+CIN CK PTQ FF + +G+P
Sbjct: 55 TWLVGPMQTAVTEV-DGQQQMSAVKIERCRYLAESGCTGMCINLCKSPTQAFFTEQLGMP 113
Query: 219 LLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
L M PNF D SC+ FG PP +D ++PCL C T+
Sbjct: 114 LTMTPNFEDLSCEMVFGKRPPPLSEDPAAQQPCLASCATA 153
>gi|218186055|gb|EEC68482.1| hypothetical protein OsI_36734 [Oryza sativa Indica Group]
Length = 274
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 99/191 (51%), Gaps = 8/191 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y+ D L + L + GL +EK GY+ LI+ L + S D + + L
Sbjct: 76 YRDNWFDKLAIGYLSRNLQEASGLKNEKDGYESLIDAA--LAISRIFSLDKQSEIVTQAL 133
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSG 181
P +L + K+++ P + + A T + WL+G C V ++ +G ++
Sbjct: 134 ERALPSYILTMIKVMMPP---SRFSREYFAAFTTIFFPWLVGPCEVMESEV-EGRKEKNV 189
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLP 241
V++ +C++LE + CVG+C N CK+P Q F +D +G+ + M PNF D SC+ FG P P
Sbjct: 190 VYIPKCRFLESTNCVGMCTNLCKIPCQKFIQDSLGMKVYMSPNFEDMSCEMIFGQQP--P 247
Query: 242 KDDTTLKEPCL 252
+DD LK+PC
Sbjct: 248 EDDPALKQPCF 258
>gi|359495440|ref|XP_003634993.1| PREDICTED: uncharacterized protein LOC100853223 [Vitis vinifera]
Length = 247
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 81/241 (33%), Positives = 120/241 (49%), Gaps = 19/241 (7%)
Query: 25 KLHPQLQSPRFSVLRSSIQPQPQAPPI----KRESDT-SRTEYKPGVLDDLFLSSFR-NK 78
KL Q +S ++ R +P+ +PPI R +D + + P +L D + N
Sbjct: 4 KLVTQHRSHVWAGKRGMHKPR-SSPPILAVLARPADNLTLVKETPSLLTDNWFDRIAINH 62
Query: 79 LVQEV----GLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYK 134
L Q V GL + K GY+ L+E M+ + I L FP +L L +
Sbjct: 63 LSQSVQATTGLRNSKSGYESLVEAA--AMVSRNFDPIQQCELVIEALNKAFPSPILSLIR 120
Query: 135 ILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESK 194
L + K A T L WL+G C V ++ +G ++ V +++C++LEES
Sbjct: 121 TL---MPQSKFTREYFAAFTTLFFAWLVGPCKVIESEI-NGRREKNVVHIKKCRFLEESN 176
Query: 195 CVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
CVG+C+N CK P+Q F KD +G+P+ M PNF D SCQ FG P P DD L++PC +
Sbjct: 177 CVGMCLNLCKNPSQKFIKDSLGMPVNMVPNFDDMSCQMIFGQDP--PGDDPVLRQPCYKL 234
Query: 255 C 255
C
Sbjct: 235 C 235
>gi|296084501|emb|CBI25060.3| unnamed protein product [Vitis vinifera]
Length = 279
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 81/241 (33%), Positives = 120/241 (49%), Gaps = 19/241 (7%)
Query: 25 KLHPQLQSPRFSVLRSSIQPQPQAPPI----KRESDT-SRTEYKPGVLDDLFLSSFR-NK 78
KL Q +S ++ R +P+ +PPI R +D + + P +L D + N
Sbjct: 36 KLVTQHRSHVWAGKRGMHKPR-SSPPILAVLARPADNLTLVKETPSLLTDNWFDRIAINH 94
Query: 79 LVQEV----GLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYK 134
L Q V GL + K GY+ L+E M+ + I L FP +L L +
Sbjct: 95 LSQSVQATTGLRNSKSGYESLVEAA--AMVSRNFDPIQQCELVIEALNKAFPSPILSLIR 152
Query: 135 ILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESK 194
L + K A T L WL+G C V ++ +G ++ V +++C++LEES
Sbjct: 153 TL---MPQSKFTREYFAAFTTLFFAWLVGPCKVIESEI-NGRREKNVVHIKKCRFLEESN 208
Query: 195 CVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
CVG+C+N CK P+Q F KD +G+P+ M PNF D SCQ FG P P DD L++PC +
Sbjct: 209 CVGMCLNLCKNPSQKFIKDSLGMPVNMVPNFDDMSCQMIFGQDP--PGDDPVLRQPCYKL 266
Query: 255 C 255
C
Sbjct: 267 C 267
>gi|357144459|ref|XP_003573300.1| PREDICTED: uncharacterized protein LOC100837900 [Brachypodium
distachyon]
Length = 275
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 15/203 (7%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQ----------EVGLDSEKPGYDGLIELVNHLMMKGKS 108
+T YK G L+ F+ F K+ + + + Y+ +++ +M+
Sbjct: 58 KTVYKDGPLERAFMGLFARKMSKFATKTPNPNPNISRAVWEWDYESFVDVSRRVMVSC-G 116
Query: 109 SSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVN 168
+ + + AA +L+S+ P ++ L P + A A +T WL+G +
Sbjct: 117 TRERQQAAVREVLLSMLPAGAPAQFRKLFPPT---RWACEFNAALTVPFFHWLVGP-SEV 172
Query: 169 SVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDY 228
G +SGV +++C+YLE S CVG+C+N CK+PTQ+FF D G+PL M PNF D
Sbjct: 173 VEVEVAGVKQRSGVLIKKCRYLENSGCVGMCVNMCKIPTQSFFTDEFGLPLTMNPNFEDM 232
Query: 229 SCQFKFGILPPLPKDDTTLKEPC 251
SC+ +G +PP ++D K+PC
Sbjct: 233 SCEMIYGQVPPPLEEDPASKQPC 255
>gi|303284273|ref|XP_003061427.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456757|gb|EEH54057.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 377
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 120/286 (41%), Gaps = 48/286 (16%)
Query: 15 SSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRES-DTSRTEYKPGVLDDLFLS 73
+ PPRS + + + S R S P P A +KR T + Y +D FL
Sbjct: 73 ADPPRS--VAPWRRRYRDDAASSRRISHSPPPTA--LKRAGPSTPKPHYDDSPVDRFFLK 128
Query: 74 SFRNKLVQEVGLDSEKP---GYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVL 130
F +L E+G D YD ++ L+ + ++++A+ A R+L SL PP
Sbjct: 129 LFNARLAAELGDDVRGAVTGDYDDVVARCVRLVSEAPNAAEAK-ARGGRVLRSLLPPGTA 187
Query: 131 KLYKILISPLAGGKIAAMMVARVTALTCQWLMG---------HCTVNSVDLPDGTSCQS- 180
+ L G A A VT WL+G V+ P G + ++
Sbjct: 188 PALRFAFG-LFPGWFVARHAAAVTPYLLPWLVGKGRVIDAPEDLMVDDASSPPGNAFEAL 246
Query: 181 ----------------------------GVFVERCKYLEESKCVGVCINTCKLPTQTFFK 212
GV +ERC+ LEES C GVC+N CKLPTQ F
Sbjct: 247 KRMNGERADDRKNRRNQKNQNVPPGYKQGVLLERCRVLEESGCAGVCLNVCKLPTQQFLG 306
Query: 213 DYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
+ +G+P+ + P++ + C+F FG PP P+ D PC CP +
Sbjct: 307 EELGLPVTLAPDYETFECRFLFGKTPPPPESDPAFDTPCFGQCPVA 352
>gi|308812113|ref|XP_003083364.1| F1N19.25 (ISS) [Ostreococcus tauri]
gi|116055244|emb|CAL57640.1| F1N19.25 (ISS) [Ostreococcus tauri]
Length = 265
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 105/205 (51%), Gaps = 11/205 (5%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPG---YDGLIELVNHLMMKGKSSSDARDA 115
R Y+ G LD ++ F K+ +++ PG YD I L L MKG+ + D
Sbjct: 52 RVRYEDGALDFAVMAWFMRKI--GTAINAPPPGEISYDAFIALC-FLQMKGRDAKGMTDV 108
Query: 116 AQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV-NSVDLPD 174
++ SL PP ++K L PL K + + A +T + W++G TV ++D
Sbjct: 109 TS-GVIRSLVPPGGSAVFKTLF-PL--NKFSCELNATITKIVFAWMVGPMTVETTMDNDL 164
Query: 175 GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
G S V +++C++L+ES C +C+N CK TQ F D G+PL ++PNF D SC F F
Sbjct: 165 GIEMASKVHIKKCRWLQESGCTAMCVNMCKCATQEVFTDDFGLPLTIKPNFEDKSCDFYF 224
Query: 235 GILPPLPKDDTTLKEPCLDICPTSS 259
G+ PP + D L C C T++
Sbjct: 225 GLTPPPVEKDEALLFGCNAACATAA 249
>gi|308806576|ref|XP_003080599.1| unnamed protein product [Ostreococcus tauri]
gi|116059060|emb|CAL54767.1| unnamed protein product [Ostreococcus tauri]
Length = 364
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 112/237 (47%), Gaps = 32/237 (13%)
Query: 31 QSPRFSVLRSSIQPQPQAPPIKRESDTSR---TEYKPGVLDDLFLSSFRNKL-----VQE 82
++ RF +LR S S T R TEY+ GVLD + + F KL +E
Sbjct: 134 ETVRFGILRGS-------------SATYRDDVTEYEDGVLDAVAIELFNAKLNAVVGARE 180
Query: 83 VGLDSEK-PGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLA 141
G D E+ G+ L+ L + L G+ + RDA R L+S+ P V +K LI P
Sbjct: 181 EGEDGERLRGFARLVRLADKLS-DGRGVVEQRDAV-TRALLSIIPAPVRWAFKRLIEP-- 236
Query: 142 GGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCIN 201
M A+VT WL+G C + + DG + V +++C+YLE+ C C N
Sbjct: 237 -ATWVDEMNAKVTREAFAWLVGPCEIVPRE-SDGA--MASVKLKKCRYLEQCGCAASCTN 292
Query: 202 TCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
CK+PTQ FFK+ GV ++PN D SC FG+ P + D PC C S
Sbjct: 293 FCKIPTQRFFKEAFGVDARLDPNHEDGSCMMTFGVKPDVV--DAAFAAPCYATCAKS 347
>gi|303290226|ref|XP_003064400.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453998|gb|EEH51305.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 326
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 103/212 (48%), Gaps = 8/212 (3%)
Query: 50 PIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVG--LDSEKPGYDGLIELVNHLMMKGK 107
P + T++ Y LD ++ F K+ + YD I L L M+G+
Sbjct: 103 PADWRTPTAKVTYADSPLDLALMAWFMRKIAMAIDAPFSPADVSYDAFIALC-FLQMRGR 161
Query: 108 SSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV 167
RD + I+ SL PP K+++ L K + + A + + W++G TV
Sbjct: 162 DPDGQRDIT-LGIMRSLMPPGGDKVFRKLFPT---NKFSLELNATICKVVFAWMVGPMTV 217
Query: 168 NSVDLPD-GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFS 226
D D S V +++C++L+ES C G+C+N CK TQ FF+D G+PL ++PNF
Sbjct: 218 EKTDENDLREEMASRVHIKKCRWLQESGCTGMCVNMCKTATQDFFRDDFGLPLTIKPNFE 277
Query: 227 DYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
D SC F FG+ PP ++D L C C T+
Sbjct: 278 DKSCDFYFGLTPPPIEEDEALTFGCNATCGTA 309
>gi|427712666|ref|YP_007061290.1| hypothetical protein Syn6312_1586 [Synechococcus sp. PCC 6312]
gi|427376795|gb|AFY60747.1| hypothetical protein Syn6312_1586 [Synechococcus sp. PCC 6312]
Length = 214
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 97/195 (49%), Gaps = 9/195 (4%)
Query: 62 YKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRIL 121
Y D L L K+ Q + L P Y + L +M +G+++ + + A +L
Sbjct: 11 YTDNAFDRLALGLINRKIAQALDLTPPSPTYANFVWLSQQVM-QGRTAQE-QQALIAEVL 68
Query: 122 VSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNS--VDLPDGT--S 177
S+ P VL + SP + + A QWL+G C S V PD
Sbjct: 69 ASVIPRWVLWGIRNFFSP---APLVCELNAWFATRLFQWLVGPCEWQSTLVAGPDQAFRW 125
Query: 178 CQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGIL 237
+S V +++C+YLEES CVG+C+N CKLPTQ FF + G+PL M P+F D SC FG +
Sbjct: 126 QRSRVQIKKCRYLEESGCVGMCVNLCKLPTQKFFTEQFGIPLTMTPDFQDLSCAMVFGQM 185
Query: 238 PPLPKDDTTLKEPCL 252
P ++ ++PCL
Sbjct: 186 PLPFTEEEAAQQPCL 200
>gi|385763982|gb|AFI78794.1| putative D27 family protein, partial [Chaetosphaeridium globosum]
Length = 133
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 72/118 (61%)
Query: 144 KIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTC 203
+ A + A +T +WL+G V +V+L G +SGV +ERC+YL S C G+C+N C
Sbjct: 7 EWGAEVNATITPAFFKWLVGPAKVVAVELEPGKELRSGVQIERCRYLATSGCAGMCVNLC 66
Query: 204 KLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRR 261
KLP QTFF + +G+PL MEPN+ D SC FG LPP ++D PCL C ++ +
Sbjct: 67 KLPCQTFFTEELGMPLTMEPNYQDQSCLMVFGKLPPPREEDPAFASPCLSACSAATSQ 124
>gi|145349690|ref|XP_001419261.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579492|gb|ABO97554.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 187
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 100/197 (50%), Gaps = 15/197 (7%)
Query: 60 TEYKPGVLDDLFLSSFRNKLVQEVGLDS-----EKPGYDGLIELVNHLMMKGKSSSDARD 114
T Y G LD L +S F KL VG ++ EK G++ L+ L + L G++ + R
Sbjct: 1 TTYADGALDALAISLFNAKLAAVVGEEASASANEKRGFERLVALADALA-DGRTVVEQR- 58
Query: 115 AAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPD 174
AA R L+S+ P V L+K +I P M A +T WL+G C V + D
Sbjct: 59 AAVTRALLSIIPAPVRFLFKKMIKP---APWVDEMNAYITREAFAWLVGPCEVMPRE-SD 114
Query: 175 GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
G Q V + +C+YLE+S C C N CK+PTQ FFK+ GV ++PN D SC F
Sbjct: 115 GVMAQ--VKLRKCRYLEQSGCSASCANFCKIPTQRFFKEAFGVDARLDPNHEDGSCVMTF 172
Query: 235 GILPPLPKDDTTLKEPC 251
G+ P + +D PC
Sbjct: 173 GVKPDI--NDAAFAAPC 187
>gi|242068981|ref|XP_002449767.1| hypothetical protein SORBIDRAFT_05g022855 [Sorghum bicolor]
gi|241935610|gb|EES08755.1| hypothetical protein SORBIDRAFT_05g022855 [Sorghum bicolor]
Length = 301
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 100/212 (47%), Gaps = 24/212 (11%)
Query: 60 TEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIR 119
T Y D L + L + G+ + K GY+GLIE L + D + +
Sbjct: 92 TTYHDSWFDKLAIGYLSRNLQEASGMKNGKDGYEGLIEAA--LAISALFRVDQQLETVAK 149
Query: 120 ILVSLFPPLVLKLY--KILISPLAG--------GKIAAMM---------VARVTALTCQW 160
L FP +L + K +S + G KI MM A T + W
Sbjct: 150 ALEQAFPSYILTMARDKHKVSIIDGLESRLFFEYKIKIMMPPSRFSREYFAAFTTIFFPW 209
Query: 161 LMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLL 220
L+G C V ++ DG ++ V++ +C++LE + CVG+C N CK+P Q F +D +G +
Sbjct: 210 LVGPCEVRESEV-DGRKEKNVVYIPKCRFLESTNCVGMCTNLCKIPCQKFIQDSLGTAVY 268
Query: 221 MEPNFSDYSCQFKFGILPPLPKDDTTLKEPCL 252
M PNF D SC+ FG P P+DD LK+PC
Sbjct: 269 MSPNFEDMSCEMIFGQQP--PEDDPALKQPCF 298
>gi|302756951|ref|XP_002961899.1| hypothetical protein SELMODRAFT_67943 [Selaginella moellendorffii]
gi|300170558|gb|EFJ37159.1| hypothetical protein SELMODRAFT_67943 [Selaginella moellendorffii]
Length = 160
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 95/161 (59%), Gaps = 7/161 (4%)
Query: 92 YDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVA 151
+DG ++ V MM+G++ + +R+L SL P + + + ++ P++ + A A
Sbjct: 1 FDGFVD-VARKMMQGRTPVQQHEMV-LRVLESLMPWWIGAMVRTIL-PVS--RATAEFYA 55
Query: 152 RVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKY-LEESKCVGVCINTCKLPTQTF 210
T L WL+G V V++ DG +SGV +++C++ LE S CVG+C N CK+P+Q F
Sbjct: 56 HGTTLFTSWLIGPSEVIEVEV-DGVKQKSGVHIQKCRHILERSACVGLCTNMCKVPSQRF 114
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPC 251
F +GVP+ M PNF D SC F +G PP ++D+ ++PC
Sbjct: 115 FAKELGVPMTMVPNFEDMSCDFIYGQTPPPLEEDSASRQPC 155
>gi|308809211|ref|XP_003081915.1| unnamed protein product [Ostreococcus tauri]
gi|116060382|emb|CAL55718.1| unnamed protein product [Ostreococcus tauri]
Length = 287
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 97/225 (43%), Gaps = 26/225 (11%)
Query: 63 KPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILV 122
K LDD +S F L +E D ++G+ E L+ S+ + R A +RIL
Sbjct: 49 KQTFLDDFAISLFAAALARETRTDVGGD-FNGVREACLDLVRTSTSARETR-ARAMRILK 106
Query: 123 SLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVN----SVDLPDGTSC 178
+L P + ++ L A A VT + WL+G VN V DG
Sbjct: 107 TLAPKWAFGAFAAILR-LFPDWFKARHAAAVTPVLMGWLVGDAEVNDGGEGVTYEDGEKA 165
Query: 179 -------------------QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPL 219
+ GV ++RC+ LEE+ C VC N CK PTQ FF + +G+ +
Sbjct: 166 PTSAWAALDPNGERAPAGYKQGVLLKRCRVLEETGCAAVCANVCKHPTQKFFTEDIGLAM 225
Query: 220 LMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEV 264
M PN+ Y CQF +G P +D K PC C S+ KE
Sbjct: 226 TMTPNYETYECQFTYGAKPKEVGEDEAFKTPCFRQCTASTTMKEA 270
>gi|302817115|ref|XP_002990234.1| hypothetical protein SELMODRAFT_47927 [Selaginella moellendorffii]
gi|300141943|gb|EFJ08649.1| hypothetical protein SELMODRAFT_47927 [Selaginella moellendorffii]
Length = 160
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 91/161 (56%), Gaps = 7/161 (4%)
Query: 92 YDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVA 151
+DG ++ V MM+G++ + +R+L SL P + + + + L + A A
Sbjct: 1 FDGFVD-VARKMMQGRTPVQQHEMV-LRVLESLMPWWIGAMIRTI---LPASRATAEFYA 55
Query: 152 RVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKY-LEESKCVGVCINTCKLPTQTF 210
T L WL+G V V++ DG ++GV +++C++ LE S CVG+C N CK+P+Q F
Sbjct: 56 HGTTLFTSWLIGPSEVIEVEV-DGVKQKTGVHIQKCRHILESSACVGLCTNMCKVPSQRF 114
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPC 251
F +GVP+ M PNF D SC F +G PP ++D ++PC
Sbjct: 115 FAKELGVPMTMVPNFEDMSCDFIYGQTPPPLEEDPASRQPC 155
>gi|222630086|gb|EEE62218.1| hypothetical protein OsJ_17005 [Oryza sativa Japonica Group]
Length = 198
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 107/199 (53%), Gaps = 37/199 (18%)
Query: 33 PRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGY 92
P S+LR S P A + R EY+P DD + FR K+V+EVG DSEKPGY
Sbjct: 30 PTTSLLRCS---SPSADTASSSGEGGR-EYEPSFADDFLHAFFRAKMVEEVGWDSEKPGY 85
Query: 93 DGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVAR 152
GLIE+ N M+KGKS+ + +A +R+L SL PPL+L L K L+ P+A G++A+MMV
Sbjct: 86 TGLIEVANRPMVKGKSALEIEQSA-VRVLRSLIPPLLLVLLKALVVPIANGQLASMMV-- 142
Query: 153 VTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFK 212
+ GH + +V S +S +R L TFFK
Sbjct: 143 --------VHGHYVMETVR----CSLRSANIWKRANDL------------------TFFK 172
Query: 213 DYMGVPLLMEPNFSDYSCQ 231
D++GV L MEPNF DYSCQ
Sbjct: 173 DHIGVDLYMEPNFEDYSCQ 191
>gi|303278996|ref|XP_003058791.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459951|gb|EEH57246.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 193
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/200 (34%), Positives = 103/200 (51%), Gaps = 17/200 (8%)
Query: 60 TEYKPGVLDDLFLSSFRNKL---VQEVGLDSE-----KPGYDGLIELVNHLMMKGKSSSD 111
T Y+ G LD+L ++ F KL +++ +D + K G+D L+ L + L+ G+S S
Sbjct: 1 TRYEDGALDELAIALFNRKLESALRDDAIDVKETTLPKRGFDRLVALAD-LISVGRSPSR 59
Query: 112 ARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVD 171
R A + L+ L PP V +K +I P + M A +T WL+G C + +
Sbjct: 60 QR-AVVLTTLLGLIPPWVRARFKEIIRP--EWRWVDEMNAVITVNAFAWLVGPCEIIPRE 116
Query: 172 LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQ 231
DG S V + +C+YLE+ C C+N CK+PTQ FF++ GV + P+ SD SC
Sbjct: 117 -SDGV--MSAVKLRKCRYLEQCGCTASCVNFCKMPTQAFFREAFGVDAHLAPDHSDGSCV 173
Query: 232 FKFGILPPLPKDDTTLKEPC 251
FG PP P D + PC
Sbjct: 174 MTFGAKPPSP--DPAFEAPC 191
>gi|297742444|emb|CBI34593.3| unnamed protein product [Vitis vinifera]
Length = 157
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 79/138 (57%), Gaps = 5/138 (3%)
Query: 119 RILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSC 178
+L+S+ PP ++ L P + AA A T WL+G + +G
Sbjct: 9 EVLLSMLPPGAPDQFRKLFPPT---RWAAEFNAAFTVPFFAWLVGP-SEVVEVEVNGVKQ 64
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
+SGV +++C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D SC+ +G +P
Sbjct: 65 RSGVLIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMTPNFEDMSCEMVYGQVP 124
Query: 239 PLPKDDTTLKEPCL-DIC 255
P ++D K+PC DIC
Sbjct: 125 PPFEEDPVSKQPCFSDIC 142
>gi|219118645|ref|XP_002180091.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408348|gb|EEC48282.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 451
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 109/227 (48%), Gaps = 20/227 (8%)
Query: 61 EYKPGVLDDLFLSSFRNKLVQEVG-LDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIR 119
+Y LD L LS FRN + + G + S K G GL++ M+K + +A+
Sbjct: 230 KYNESPLDKLLLSIFRNLVTKNTGGVTSPKEGILGLVDQGRTFMLKPGQTPEAQHKMVSD 289
Query: 120 ILVSLFPPLVLKLYKILISPLA-------GGK------IAAMMVARVTALTCQWLMGHCT 166
L L P++ Y I +S + GK A + + VT +L+G
Sbjct: 290 TLAGLMTPVLPPFYGIFMSGIVPKIGTEFDGKQFGPWFYAPWLTSVVTPTFFGFLVGPSR 349
Query: 167 VNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFS 226
N DG G+ VE+CK+L++S C G+C++ CKLP Q FFKD +G+PL + PNF
Sbjct: 350 PNHRK--DGQ--LGGLVVEKCKFLQKSGCKGLCLHQCKLPAQQFFKDELGLPLTVSPNFV 405
Query: 227 DYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQC 273
CQ+ FG P +D + CL C SR+ VA S V+ C
Sbjct: 406 TQECQWSFGESPLPASEDPSFPTGCLVGC--ESRKTLVATGSRVDAC 450
>gi|298706224|emb|CBJ29265.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 199
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 26/185 (14%)
Query: 92 YDGLIELVNHLMMKGKSSSDARDAAQIR-ILVSLFPPLVLKLYKILISPLAGGKIAAMMV 150
Y+ + L L ++ + R +R +L S+FP Y++L P +
Sbjct: 7 YEDYVALATGL----QAGAPERQREVVRGVLRSVFPAWFPAFYRMLFPP-------SKFS 55
Query: 151 ARVTALTC----QWLMG--HCTVNSVDL--------PDGTSCQSGVFVERCKYLEESKCV 196
A V A C WL+G T VD+ P ++ V VERC+YLE SKC
Sbjct: 56 AEVNAFMCPPLFGWLVGKSELTEGVVDVKAAKEGEGPRQEVWRNTVKVERCRYLEASKCK 115
Query: 197 GVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICP 256
G C+N CKLPT+ FF++ +G+PL M PNF D SC+F FG ++D ++EPC C
Sbjct: 116 GTCMNLCKLPTEAFFREDLGMPLRMTPNFEDLSCEFAFGQDALPAEEDPLMREPCWTECL 175
Query: 257 TSSRR 261
+S ++
Sbjct: 176 SSDKQ 180
>gi|413941681|gb|AFW74330.1| hypothetical protein ZEAMMB73_058801 [Zea mays]
Length = 305
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 111/212 (52%), Gaps = 17/212 (8%)
Query: 59 RTEYKPGVLDDLFLSSFRNKL------VQEVGLDSEKP----GYDGLIELVNHLMMKGKS 108
+TEY+ G L+ F+ F K+ E+ Y+ +++ +M+ G++
Sbjct: 90 KTEYRDGPLERAFMGLFARKMEKYAAKKPAAQAKEERAVWEWDYESFVDVSRRVML-GRT 148
Query: 109 SSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVN 168
+ + A + +L+S+ PP ++ L P + A A +T +WL+G V
Sbjct: 149 RAQQQQAVR-EVLLSMLPPGAPAQFRRLFPPT---RWACEFNAALTVPFFRWLVGPSEVV 204
Query: 169 SVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDY 228
V++ +SGV +E+C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D
Sbjct: 205 EVEVGGVRQ-RSGVRIEKCRYLESSGCVGMCVNMCKVPTQDFFTNEFGLPLTMNPNFEDM 263
Query: 229 SCQFKFGILPPLPKDDTTLKEPCL-DICPTSS 259
SC+ +G +PP ++D K+ C +C S+
Sbjct: 264 SCEMIYGQVPPPLEEDPASKQACYPSLCSMST 295
>gi|195609902|gb|ACG26781.1| hypothetical protein [Zea mays]
Length = 265
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 111/213 (52%), Gaps = 17/213 (7%)
Query: 58 SRTEYKPGVLDDLFLSSFRNKLVQEVGLDSE----------KPGYDGLIELVNHLMMKGK 107
+TEY+ G L+ F+ F K+ + + Y+ +++ +M+ G+
Sbjct: 49 EKTEYRDGPLERAFMGLFARKMEKYAAKKPAAQAKEERAVWEWDYESFVDVSRRVML-GR 107
Query: 108 SSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV 167
+ + + A + +L+S+ PP ++ L P + A A +T +WL+G V
Sbjct: 108 TRAQQQQAVR-EVLLSMLPPGAPAQFRRLFPPT---RWACEFNAALTVPFFRWLVGPSEV 163
Query: 168 NSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSD 227
V++ +SGV +E+C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D
Sbjct: 164 VEVEVGGVRQ-RSGVRIEKCRYLESSGCVGMCVNMCKVPTQDFFTNEFGLPLTMNPNFED 222
Query: 228 YSCQFKFGILPPLPKDDTTLKEPCL-DICPTSS 259
SC+ +G +PP ++D K+ C +C S+
Sbjct: 223 MSCEMIYGQVPPPLEEDPASKQACYPSLCSMST 255
>gi|145354075|ref|XP_001421321.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581558|gb|ABO99614.1| predicted protein, partial [Ostreococcus lucimarinus CCE9901]
Length = 202
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 101/205 (49%), Gaps = 11/205 (5%)
Query: 59 RTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKP---GYDGLIELVNHLMMKGKSSSDARDA 115
+ Y+ LD ++ F K+ +D+ KP YD I L L MKG+ + D
Sbjct: 2 KVRYEDSALDLALMAWFMAKI--GAAIDAPKPKEISYDEFIALC-FLQMKGRDAVGMGDV 58
Query: 116 AQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPD- 174
++ SL PP ++ L P + + + A +T + W++G V + D
Sbjct: 59 TA-GVIRSLVPPGGNAAFRALFPP---NRFSCELNATITKIVFAWMVGPMEVETTTENDL 114
Query: 175 GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
G S V +++C++L+ES C +C+N CK TQ F D G+PL ++PNF + SC F F
Sbjct: 115 GIEMASKVHIKKCRWLQESGCTAMCVNMCKCATQEVFTDDFGLPLTIKPNFENKSCDFYF 174
Query: 235 GILPPLPKDDTTLKEPCLDICPTSS 259
G+ PP + D L C +C T++
Sbjct: 175 GLTPPPIEKDEALLFGCNALCATAA 199
>gi|226501660|ref|NP_001143054.1| uncharacterized protein LOC100275523 [Zea mays]
gi|195613584|gb|ACG28622.1| hypothetical protein [Zea mays]
Length = 265
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 111/213 (52%), Gaps = 17/213 (7%)
Query: 58 SRTEYKPGVLDDLFLSSFRNKLVQEVGLDSE----------KPGYDGLIELVNHLMMKGK 107
+TEY+ G L+ F+ F K+ + + Y+ +++ +M+ G+
Sbjct: 49 EKTEYRDGPLERAFMGLFARKMEKYAAKKPAAQAKEERAVWEWDYESFVDVSRRVML-GR 107
Query: 108 SSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV 167
+ + + A + +L+S+ PP ++ L P + A A +T +WL+G V
Sbjct: 108 TRAQQQQAVR-EVLLSMLPPGAPAQFRRLFPPT---RWACEFNAALTVPFFRWLVGPSEV 163
Query: 168 NSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSD 227
V++ +SGV +E+C+YLE S CVG+C+N CK+PTQ FF + G+PL M PNF D
Sbjct: 164 VEVEVGGVRQ-RSGVRIEKCRYLESSGCVGMCVNMCKVPTQDFFTNEFGLPLTMNPNFED 222
Query: 228 YSCQFKFGILPPLPKDDTTLKEPCL-DICPTSS 259
SC+ +G +PP ++D K+ C +C S+
Sbjct: 223 MSCEMIYGQVPPPLEEDPASKQACYPSLCSMST 255
>gi|168021666|ref|XP_001763362.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685497|gb|EDQ71892.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 183
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 59/87 (67%), Gaps = 1/87 (1%)
Query: 179 QSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238
+SGV +++C+YLE S C G+C+N+CK+PTQ FF +G+PL MEPNF D SC FG P
Sbjct: 92 KSGVQIQKCRYLETSGCTGLCVNSCKMPTQYFFTKELGMPLTMEPNFEDMSCLMIFGQTP 151
Query: 239 PLPKDDTTLKEPCLDI-CPTSSRRKEV 264
P +DD K+ C CPTSS+ EV
Sbjct: 152 PAFEDDLVFKQKCCTTYCPTSSQASEV 178
>gi|255078754|ref|XP_002502957.1| predicted protein [Micromonas sp. RCC299]
gi|226518223|gb|ACO64215.1| predicted protein [Micromonas sp. RCC299]
Length = 310
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/222 (31%), Positives = 99/222 (44%), Gaps = 19/222 (8%)
Query: 50 PIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEV--GLDSEK----------PGYDGLIE 97
P+ R T Y G LD + F KL Q V G D E+ G+D L+
Sbjct: 75 PLLRADTEPTTRYIDGPLDAAAIWLFNLKLEQAVNDGADDEERRRNAKGLPAGGFDRLVA 134
Query: 98 LVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALT 157
L + + G++ + R A + L+ L PP V +K LI P M A +T
Sbjct: 135 LADRIASSGRTPPEQR-AVVLATLLGLIPPWVRTQFKRLIDPRWA--WVDKMNALITVNA 191
Query: 158 CQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGV 217
WL+G C + D DG + V + +C+YLE+ C C N CK PT+ FF++ GV
Sbjct: 192 FAWLVGPCEIVPRD-DDGE--LAAVKLRKCRYLEQCGCTASCANFCKRPTEGFFREAFGV 248
Query: 218 PLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSS 259
+ PN D SC FG PP +D +PC C ++
Sbjct: 249 DAHLAPNHEDGSCVMTFGRKPP-EFNDPAFSQPCYSSCAKAA 289
>gi|224013208|ref|XP_002295256.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969218|gb|EED87560.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 455
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 113/234 (48%), Gaps = 30/234 (12%)
Query: 62 YKPGVLDDLFLSSFRNKLVQE--VGLDSEKPGYDGLIELVNHLMMK----GKSSSDARDA 115
Y LD + LS FR KLV E G+ ++ PG GL+ M K G S D A
Sbjct: 229 YNDSSLDKVLLSIFR-KLVAENTGGIQNDTPGIKGLLIQGRQFMTKELPEGVSYEDHTIA 287
Query: 116 AQIRI---LVSLFPPLVLKLYKILISPLA-------GGK------IAAMMVARVTALTCQ 159
+ L L P++ Y+I +S + GK A + + VT +
Sbjct: 288 QHTMVKNTLGGLMTPVLPPFYRIFMSGIVPKLGTEFDGKQLGPWFYAPFLTSMVTPIFFG 347
Query: 160 WLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPL 219
+L+G N DG + G+ VE+CK+L+ES C G+C++ CK+P Q FFK+ +G+ L
Sbjct: 348 FLVGPSRPNR--RADGQ--RGGLVVEKCKFLQESGCKGLCLHQCKIPAQEFFKEELGLDL 403
Query: 220 LMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQC 273
++PNF CQ+ FG P P++D + CL C + RK++A N C
Sbjct: 404 TVKPNFVTQECQWSFGETPLPPEEDPSFPRGCLVGCES---RKDMAGRKNEALC 454
>gi|255075293|ref|XP_002501321.1| predicted protein [Micromonas sp. RCC299]
gi|226516585|gb|ACO62579.1| predicted protein [Micromonas sp. RCC299]
Length = 380
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 109/237 (45%), Gaps = 25/237 (10%)
Query: 56 DTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKP----GYDGLIELVNHLMMKGKSSSD 111
D +R + P + + ++ R LV+EVG + P + G++ V + ++ D
Sbjct: 71 DYARIDASP--ISKVLTATIRKLLVEEVGGVDKDPRPWTEFGGIMNAVREVNDMDGTARD 128
Query: 112 ARDAAQIRILVSLFPPLVLK----LYKILISPLAGGKIAAMMVARVTALTCQWLMGHCT- 166
+ A+ R+ + P L + ++K I P A + V WLMG
Sbjct: 129 VQIRAK-RVFAGILPALGIGWVPPIWKKFIHPNAPDWFSNWAFVLVFTNLFPWLMGPMEG 187
Query: 167 VNSVDLPDGTSCQ-------------SGVFVERCKYLEESKCVGVCINTCKLPTQTFFKD 213
V+ VD+P + V ERC++LE S+C VC+NTCK P+Q + +
Sbjct: 188 VDHVDVPTPAWIRKTFANFPKTFRVPQAVKAERCRFLEMSQCASVCVNTCKAPSQEWLSE 247
Query: 214 YMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNV 270
G+ L ++PN+ D+SCQ+KF + PP +D + PC C + + ++ A V
Sbjct: 248 DFGMDLHIQPNYDDFSCQWKFSVKPPPLYEDAAVMVPCFSKCDSEHKGEKDAFRQQV 304
>gi|255074079|ref|XP_002500714.1| predicted protein [Micromonas sp. RCC299]
gi|226515977|gb|ACO61972.1| predicted protein, partial [Micromonas sp. RCC299]
Length = 152
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 84/157 (53%), Gaps = 6/157 (3%)
Query: 92 YDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVA 151
YD I L L M+G+ RD + I+ SL PP K+++ L K + + A
Sbjct: 1 YDEFIALC-FLQMQGRDPDGQRDIT-LGIMRSLMPPGGDKIFRKLFPT---NKFSLELNA 55
Query: 152 RVTALTCQWLMGHCTVNSVDLPD-GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTF 210
+ + W++G TV S D G S V +++C++L+ES C G+C+N CK TQ F
Sbjct: 56 VICKIVFAWMVGPMTVESTTENDLGEMIASKVHIKKCRWLQESGCTGMCVNMCKTATQDF 115
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTL 247
F + G+PL ++PNF D SC F FG+ PP + D L
Sbjct: 116 FVNDFGLPLTIKPNFEDKSCDFYFGLTPPPIEKDEAL 152
>gi|302848583|ref|XP_002955823.1| hypothetical protein VOLCADRAFT_106966 [Volvox carteri f.
nagariensis]
gi|300258791|gb|EFJ43024.1| hypothetical protein VOLCADRAFT_106966 [Volvox carteri f.
nagariensis]
Length = 235
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/82 (58%), Positives = 58/82 (70%), Gaps = 1/82 (1%)
Query: 151 ARVTALTCQWLMGHCTVNSVDL-PDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQT 209
A TALTCQWLMG C VN V++ GV VERC+YLE++ C VCIN+CK+PTQT
Sbjct: 31 ALATALTCQWLMGPCKVNDVEIDGGVVGKGHGVLVERCRYLEQAGCASVCINSCKIPTQT 90
Query: 210 FFKDYMGVPLLMEPNFSDYSCQ 231
FF MG+PL M PN+ D+SCQ
Sbjct: 91 FFAKDMGLPLTMTPNYDDFSCQ 112
>gi|22299485|ref|NP_682732.1| hypothetical protein tll1942 [Thermosynechococcus elongatus BP-1]
gi|22295668|dbj|BAC09494.1| tll1942 [Thermosynechococcus elongatus BP-1]
Length = 218
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 89/176 (50%), Gaps = 15/176 (8%)
Query: 67 LDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFP 126
L+ L+ + +G+ ++ Y G IE+ L +G+S + + A + L P
Sbjct: 25 LERWCLARLIRAIASAIGVAPQRWDYVGFIEITRQLQ-RGRSPAQ-QQAIVATVFDRLIP 82
Query: 127 PLVLKLYKILISPLAG-GKIAAMMVARVTALTCQWLMGHCTVNSVD------LPDGTSCQ 179
P++ L + L P + A R+T WL+G V+ LP
Sbjct: 83 PMMSTLIRKLFRPSRWVCEWNAWFATRLTG----WLVGASDRYWVEVIPPNQLPQWQ--H 136
Query: 180 SGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFG 235
SGV +++C+YL ES+C+ +C+N CK PT+ FF+ +G+PL M PNF DYSC+ FG
Sbjct: 137 SGVRIQKCRYLAESQCMALCMNLCKKPTEQFFRQRLGIPLTMTPNFKDYSCEMVFG 192
>gi|424513329|emb|CCO65951.1| predicted protein [Bathycoccus prasinos]
Length = 442
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 108/222 (48%), Gaps = 24/222 (10%)
Query: 67 LDDLFLSSFRNKLVQEVGLDSE---KPGYDGLIELVNHLM-MKGKSSSDARDAAQIRILV 122
+ + S+ R LVQEVG D++ ++ L+ V + MKG ++ D + A+ R+
Sbjct: 143 ISKVLTSTIRKLLVQEVGKDTDPRDHANFEALMTSVREVNDMKG-TAKDVQTRAK-RVFK 200
Query: 123 SLFPPLVLK----LYKILISPLAGGKIAAMMVARVTALTCQWLMG------HCTVNSVD- 171
+ P L + L+K + P A + V + WLMG H V +
Sbjct: 201 GILPALYIGWIPPLWKKFVDPNAPKWVTGFSFHLVFIVLFPWLMGPMEGAEHEDVKVPEK 260
Query: 172 -------LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPN 224
LP+ S + ERC++LE S C VC+N+CK+P+Q + ++ G+ L ++PN
Sbjct: 261 LRKTFPFLPEVVSVPQAIKAERCRFLETSSCASVCVNSCKVPSQEWLREDFGMNLHIQPN 320
Query: 225 FSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAM 266
+ D+SC + F PP ++D + PC C + + ++ A+
Sbjct: 321 YDDFSCVWSFNKAPPPLEEDAAILVPCFSNCNSEFKGEKDAL 362
>gi|308810094|ref|XP_003082356.1| unnamed protein product [Ostreococcus tauri]
gi|116060824|emb|CAL57302.1| unnamed protein product [Ostreococcus tauri]
Length = 372
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 99/218 (45%), Gaps = 22/218 (10%)
Query: 70 LFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIR---ILVSLFP 126
+ ++ R LV+EVG D + + ++ + ARD QIR + + P
Sbjct: 76 VLTATIRKLLVEEVGRDVDGRPWTDFAAIMPAVREVNDMDGTARDV-QIRAKRVFAGILP 134
Query: 127 PLVLK----LYKILISPLAGGKIAAMMVARVTALTCQWLMG------HCTVNS------- 169
L + ++K +I P A + V WLMG H V +
Sbjct: 135 ALGIGWVPPIWKKVIHPNAPEWFSNWAFVLVFTNLFPWLMGPMEGVDHVEVPTPEWLRKT 194
Query: 170 -VDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDY 228
+ P V ERC++LE S+C VC+NTCK P+Q + K+ G+ L ++PN+ D+
Sbjct: 195 FANAPKTFRVPQSVKAERCRFLETSQCASVCVNTCKAPSQEWLKEDFGMDLHIQPNYDDF 254
Query: 229 SCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAM 266
SCQ+KF + PP +D + PC C + + ++ A
Sbjct: 255 SCQWKFSVTPPPLYEDAAVMVPCFSKCDSEHKGEKDAF 292
>gi|255086733|ref|XP_002509333.1| predicted protein [Micromonas sp. RCC299]
gi|226524611|gb|ACO70591.1| predicted protein [Micromonas sp. RCC299]
Length = 374
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 115/283 (40%), Gaps = 49/283 (17%)
Query: 32 SPRFSVLRSSIQPQPQAPPIKRESDTS-RTEYKPGVLDDLFLSSFRNKLVQEVGLD--SE 88
SPR + R S + A +KR T+ + Y G +D L +F ++ E+G D +
Sbjct: 49 SPRLVLRRGSRRGDVSAFALKRAGPTTPKPTYNDGPVDRELLRAFHARVASELGEDPSAV 108
Query: 89 KPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAM 148
YD + L+ ++ A+ + R+L SL P +++ I+ L A
Sbjct: 109 TGDYDATMRACVRLVSSARTPDQAQARGE-RVLRSLLPRWFPGFFRLFIA-LFPRWFVAR 166
Query: 149 MVARVTALTCQWLMGHCTVNSVDLPD---------------------------------- 174
A VT + WL+G V +D PD
Sbjct: 167 HAAAVTPMILPWLVGPARV--IDAPDDLPVDDRDRPPANALDSLLTSTSFLSSNVGGGDG 224
Query: 175 -------GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSD 227
+ GV +ERC+ LEE C VC+N CK+PTQ FF D +G+ + + P++
Sbjct: 225 GEDAKGQAPGYRQGVLLERCRVLEEGGCASVCLNVCKVPTQNFFSD-VGLDVELRPDYET 283
Query: 228 YSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNV 270
+ C+F +G PP +D PC C S R A ++
Sbjct: 284 FECRFVYGKKPPPAGEDPAFDTPCFAQCSISKSRGGGAGATDA 326
>gi|224085551|ref|XP_002307617.1| predicted protein [Populus trichocarpa]
gi|222857066|gb|EEE94613.1| predicted protein [Populus trichocarpa]
Length = 124
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/109 (41%), Positives = 66/109 (60%), Gaps = 1/109 (0%)
Query: 151 ARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTF 210
A T L WL+G V + +G ++ V +++C++LEE+ CVG+C N CK+P+QTF
Sbjct: 17 AAFTTLFFAWLVGPSEVRESEF-NGKKEKNVVHIKKCRFLEETNCVGMCTNLCKIPSQTF 75
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSS 259
K +G+P+ M PNF D SC+ FG PP +D K+PC +C SS
Sbjct: 76 IKHSLGMPVDMVPNFDDMSCEMIFGQEPPAITEDPAFKQPCYKLCNNSS 124
>gi|224062291|ref|XP_002300811.1| predicted protein [Populus trichocarpa]
gi|222842537|gb|EEE80084.1| predicted protein [Populus trichocarpa]
Length = 189
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 1/105 (0%)
Query: 151 ARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTF 210
A T L WL+G C V D +G ++ V +++C++LEE+ C+G+C N CK+P+QTF
Sbjct: 70 AAFTTLFFVWLVGPCEVRESDF-NGRKEKNVVHIKKCRFLEETDCIGMCTNLCKVPSQTF 128
Query: 211 FKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDIC 255
K G+P+ M PNF D SC+ +G PP +D K+PC +C
Sbjct: 129 IKHSFGMPVNMVPNFDDMSCEMIYGQEPPAITEDPAFKQPCYKLC 173
>gi|397616815|gb|EJK64149.1| hypothetical protein THAOC_15145, partial [Thalassiosira oceanica]
Length = 498
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 109/223 (48%), Gaps = 29/223 (13%)
Query: 67 LDDLFLSSFRNKLVQEVG-LDSEKPGYDGLIELVNHLMMKG-------KSSSDARDAAQI 118
LD + LS FR+++ + G + S+ PG GL+ M K S ++
Sbjct: 277 LDKILLSIFRSQVTENTGGVTSDIPGIKGLLAQGREYMTKELPEGVTYAEHSKEQNTMVK 336
Query: 119 RILVSLFPPLVLKLYKILISPLA--------GGKI-----AAMMVARVTALTCQWLMGHC 165
+ L +L P++ Y+I +S + G +I A + VT + +L+G
Sbjct: 337 KTLAALMTPVLPPFYRIFMSGIVPKLGTEWDGKQIGPWFYAPWLTTIVTPIFFGFLVGPS 396
Query: 166 TVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNF 225
N DG + G+ VE+CK+L+ES C G+C++ CK+P Q FF++ +G+ L ++PNF
Sbjct: 397 RPNRRS--DGQ--RGGLVVEKCKFLQESGCKGLCLHQCKIPAQDFFREELGLDLTVKPNF 452
Query: 226 SDYSCQFKFGILPPLPKDDTTLKEPCLDICPT----SSRRKEV 264
CQ+ FG P P +D + CL C + + R+ EV
Sbjct: 453 VTQECQWSFGEKPLTPDEDPSFPNGCLVGCESRKAMADRKGEV 495
>gi|218196029|gb|EEC78456.1| hypothetical protein OsI_18323 [Oryza sativa Indica Group]
Length = 125
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 92/175 (52%), Gaps = 53/175 (30%)
Query: 103 MMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLM 162
M+KGKS+ + +A +R+L SLFPPL+L L+K L++P+A G++A+MMVAR TAL+CQWLM
Sbjct: 1 MVKGKSALETEQSA-VRVLRSLFPPLLLVLFKALLAPIANGQLASMMVARATALSCQWLM 59
Query: 163 GHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLME 222
G C+VNSV L +G S SG
Sbjct: 60 GPCSVNSVILSNGKSLSSG----------------------------------------- 78
Query: 223 PNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVE--QCPK 275
F FG+ PP D LKEPCLDIC + RRKE+ S+ + QCP+
Sbjct: 79 ---------FNFGVSPPPLDTDKALKEPCLDICTNARRRKELGTGSSTDGLQCPQ 124
>gi|357134928|ref|XP_003569066.1| PREDICTED: uncharacterized protein LOC100840311 [Brachypodium
distachyon]
Length = 157
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 82/117 (70%), Gaps = 9/117 (7%)
Query: 48 APPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGK 107
+PP+ D EY+P DDL L+ FR K+V+EVG DS+KPGYDGLIE+ N LM+KGK
Sbjct: 44 SPPV----DVPSGEYRPSFADDLLLAFFRAKMVEEVGWDSQKPGYDGLIEVANRLMIKGK 99
Query: 108 SSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTA----LTCQW 160
S+ + +A +R+L +LFPPL+L L+K L++PLA G++A++MV + A + QW
Sbjct: 100 SALETEQSA-VRVLRALFPPLLLVLFKALLAPLANGQLASLMVGKFIANLKFMQSQW 155
>gi|412986800|emb|CCO15226.1| predicted protein [Bathycoccus prasinos]
Length = 295
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 101/212 (47%), Gaps = 8/212 (3%)
Query: 50 PIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVG--LDSEKPGYDGLIELVNHLMMKGK 107
P + + +Y LD + F K+ +G D+ YD I L L MKG+
Sbjct: 67 PENWDEPNKKIKYADSPLDLFIMGWFMRKISMALGAPFDASNISYDAFIALC-FLQMKGR 125
Query: 108 SSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTV 167
D + + +L SL PP K+++ L PL K + + A++ + W++G TV
Sbjct: 126 EP-DGQRQITMDVLTSLMPPGGEKVFQKLF-PL--NKFSLELNAKICQIVFAWMVGPMTV 181
Query: 168 NSVDLPD-GTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFS 226
+ D S V + +C++L+ES C G+C+N CK TQ FF D +PL ++PNF
Sbjct: 182 ETTTENDLNEPIASKVQITKCRWLQESGCTGMCVNMCKTTTQDFFTDTFNMPLTIKPNFE 241
Query: 227 DYSCQFKFGILPPLPKDDTTLKEPCLDICPTS 258
D SC F FG PP + D L C C T
Sbjct: 242 DKSCAFYFGQTPPPIEKDEALLFGCNQSCSTG 273
>gi|145352213|ref|XP_001420448.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580682|gb|ABO98741.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 228
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 91/203 (44%), Gaps = 25/203 (12%)
Query: 93 DGLIELVNHLMMKGKSSSDARD--AAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMV 150
D ++ + + + SS AR+ A + I+ + PP + + L AA
Sbjct: 17 DDFADVRDACLRLVRESSGARETRAKGLMIIRNCAPPGFAGAFGGFLR-LFPKWFAARHA 75
Query: 151 ARVTALTCQWLMGHCTVN------SVDLPDGTSC----------------QSGVFVERCK 188
A VT + WL+G VN ++D+ D S + GV ++RC+
Sbjct: 76 AAVTPMLLPWLVGEAEVNDAPEDVALDVSDDVSTSVFAAMIGAPKVRAGYKQGVLLKRCR 135
Query: 189 YLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLK 248
LEE+ C VC N CK PTQ FF + +G+P+ + PN+ + CQF +G PP + D
Sbjct: 136 VLEETGCAAVCANVCKHPTQKFFTEEIGLPVTLTPNYETFECQFTYGATPPSVEADPAFA 195
Query: 249 EPCLDICPTSSRRKEVAMNSNVE 271
PC C S ++ + +E
Sbjct: 196 SPCFRQCGVSETLRDAECGNALE 218
>gi|145353006|ref|XP_001420823.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581058|gb|ABO99116.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 268
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 79/159 (49%), Gaps = 19/159 (11%)
Query: 112 ARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVD 171
A + A + + +LFP L+ P+ G V V T WL T +
Sbjct: 53 AANYAFVLVFTNLFP--------WLMGPMEG-------VDHVEVPTPAWL----TKTFKN 93
Query: 172 LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQ 231
+P V ERC++LE S+C VC+NTCK P+Q + K+ G+ L ++PN+ D+SCQ
Sbjct: 94 VPKFVRVPQAVKAERCRFLETSQCASVCVNTCKAPSQEWLKEDFGMDLHIQPNYDDFSCQ 153
Query: 232 FKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNV 270
+KF ++PP +D + PC C + + ++ A V
Sbjct: 154 WKFNVVPPPLYEDAAVMVPCFSKCDSEFKGEKDAFRQRV 192
>gi|303291405|ref|XP_003064988.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453536|gb|EEH50846.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 281
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 85/170 (50%), Gaps = 15/170 (8%)
Query: 92 YDGLIELVNHLMMKG-----KSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIA 146
YD L+E LM++ K SD + + +I S PP L + L+ +I
Sbjct: 59 YDALVEAA--LMLRDATPPEKLRSDIGNLMRKQI-TSGMPPFALGAMQALVP----SEIL 111
Query: 147 AMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLP 206
M A V + +W+ G T ++ G + V V++C+YLE S C VC+N CKLP
Sbjct: 112 REMNATVASEAAEWMFGPTTRETLAGQGGKRV-TVVNVKKCRYLEASGCASVCVNQCKLP 170
Query: 207 TQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKD-DTTLKEPCLDIC 255
Q + G P+ M+PNF+D SC+ FG PLP+ D +K CL+ C
Sbjct: 171 AQDVMRSEFGTPVYMQPNFNDCSCRMFFG-QEPLPESIDPAIKASCLEGC 219
>gi|255559899|ref|XP_002520968.1| hypothetical protein RCOM_0991210 [Ricinus communis]
gi|223539805|gb|EEF41385.1| hypothetical protein RCOM_0991210 [Ricinus communis]
Length = 244
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 63/116 (54%), Gaps = 3/116 (2%)
Query: 140 LAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVC 199
L + A T L WL+G C V + +G ++ V +++C++LEE+ CVG+C
Sbjct: 101 LPQSRFTREYFAAFTTLFFVWLIGPCQVRESEF-NGRKEKNVVHIKKCRFLEETNCVGMC 159
Query: 200 INTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDIC 255
N CK+PTQTF K +G+P+ M P S Y PP+P +D ++PC +C
Sbjct: 160 TNLCKVPTQTFIKQSLGMPVNMVP--SKYPRSTLLKQDPPIPTEDPAFRQPCYKLC 213
>gi|412991212|emb|CCO16057.1| predicted protein [Bathycoccus prasinos]
Length = 297
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 98/215 (45%), Gaps = 23/215 (10%)
Query: 55 SDTSRTEYKPGVLDDLFLSSFRNKLVQEVG------LDSEKP-----GYDGLIELVNHLM 103
+ + T Y V D ++ F KL ++ L SE Y L+ L N +
Sbjct: 64 ASSGETSYSDSVYDKFAIALFNAKLAGKIMESTEGVLSSEAKQFSMFSYQRLVYLANEVG 123
Query: 104 MKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMG 163
+ K + DA+ L+ L P ++ L+K LI P + + +T WL+G
Sbjct: 124 VVFKGT-DAQRKVVTETLLELIPEIIRILFKKLIKP---SNWVDELNSFITVQFFGWLVG 179
Query: 164 HCTVNSVDLP-DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLME 222
S +P + + V +++C+YLE+S CVG C+N CK+PT+ FFK+ GV ++
Sbjct: 180 ----PSERVPRESDGVLAAVKLKKCRYLEQSGCVGSCMNFCKVPTENFFKEAFGVDAHLQ 235
Query: 223 PNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPT 257
PN +D SC FG D+ PC C +
Sbjct: 236 PNHADGSCVLTFG---EKLSDEEIFSAPCYLTCKS 267
>gi|297723681|ref|NP_001174204.1| Os05g0131300 [Oryza sativa Japonica Group]
gi|255675993|dbj|BAH92932.1| Os05g0131300 [Oryza sativa Japonica Group]
Length = 131
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 207 TQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAM 266
+ TFFKD++GV L MEPNF DYSCQF FG+ PP D LKEPCLDIC + R + +
Sbjct: 60 SATFFKDHIGVDLYMEPNFEDYSCQFNFGVPPPPLDTDKALKEPCLDICTNAGRWRVLGT 119
Query: 267 NSNVE--QCPK 275
S+ + QCP+
Sbjct: 120 GSSTDSLQCPQ 130
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 26/35 (74%)
Query: 82 EVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAA 116
EVG DSEKPGY GLIE+ N M+KGKS+ + +A
Sbjct: 27 EVGWDSEKPGYTGLIEVANRPMVKGKSALEIEQSA 61
>gi|422295511|gb|EKU22810.1| hypothetical protein NGA_0357502 [Nannochloropsis gaditana CCMP526]
Length = 298
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 94/192 (48%), Gaps = 25/192 (13%)
Query: 40 SSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPG--YDGLIE 97
SS++P+P AP +Y + +++ F K+ VG S + Y+ +
Sbjct: 127 SSLKPKPFAP-------ERMVKYNDDIFAKIWIFLFTGKIAAVVGAPSPRNFIPYEEYVR 179
Query: 98 LVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALT 157
L + L+++G S+ +A + +L S+ P +++ L ++A ARVT +
Sbjct: 180 L-SRLLLRGGSTKA--KSAVLSVLRSIAFPGFSSIFRALFP--GSSRLACEFNARVTPIF 234
Query: 158 CQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGV 217
WL+G V + ++ + + YLE +C G C++ CKLPTQTFF + +G+
Sbjct: 235 FSWLVGPAKVETSEMLNPVT-----------YLESVQCKGACVSICKLPTQTFFTEDLGM 283
Query: 218 PLLMEPNFSDYS 229
P+ M PNF D S
Sbjct: 284 PVTMTPNFEDLS 295
>gi|255078836|ref|XP_002502998.1| predicted protein [Micromonas sp. RCC299]
gi|226518264|gb|ACO64256.1| predicted protein [Micromonas sp. RCC299]
Length = 417
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 58/117 (49%), Gaps = 1/117 (0%)
Query: 143 GKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINT 202
G + + + +W+ G T ++ P+G V V++C+YLE + C GVC+N
Sbjct: 246 GAVLRETTGTIASEMAEWMFGPTTRETMAGPNGRDVTV-VNVKKCRYLEATGCAGVCVNM 304
Query: 203 CKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSS 259
CKLP Q ++ GV L + PNF SC+ FG P + D L CL C ++
Sbjct: 305 CKLPAQDVMREEFGVGLYVAPNFETCSCKMYFGQEPLPEQIDPALSRGCLSRCGVAA 361
>gi|412987891|emb|CCO19287.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 65/142 (45%), Gaps = 27/142 (19%)
Query: 120 ILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPD----- 174
I+ S P V K + + L A A VT L WL+G VN V L
Sbjct: 166 IIESFIPTKVAKAFGWFLR-LFPDWFARRHAAFVTPLILPWLVGDAEVNDVPLEARARGS 224
Query: 175 ----------------GTSCQS-----GVFVERCKYLEESKCVGVCINTCKLPTQTFFKD 213
GT Q GV V+RC+ LEES CV VC N CK+PT+TFF +
Sbjct: 225 FEESKVPANAFEAVFAGTKSQQEGYKQGVLVKRCRVLEESGCVSVCKNVCKIPTETFFTE 284
Query: 214 YMGVPLLMEPNFSDYSCQFKFG 235
+G+P+ + PN+ CQF +G
Sbjct: 285 KVGLPVTLIPNYETLECQFCYG 306
>gi|218196030|gb|EEC78457.1| hypothetical protein OsI_18324 [Oryza sativa Indica Group]
Length = 556
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 9/59 (15%)
Query: 222 EPNFSD---------YSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVE 271
EP+F+D +F FG+ PP D LKEPCLDIC + RRKE+ S+ +
Sbjct: 54 EPSFADDFLLAFFRAKMVEFNFGVSPPPLDTDKALKEPCLDICTNARRRKELGTGSSTD 112
>gi|388498484|gb|AFK37308.1| unknown [Lotus japonicus]
Length = 140
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 27/36 (75%)
Query: 174 DGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQT 209
+G +SGV +++C+YLE S CVG+C+N CK PT T
Sbjct: 99 NGVKQKSGVHIKKCRYLENSGCVGMCVNMCKTPTTT 134
>gi|52353660|gb|AAU44226.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 491
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 46/98 (46%), Gaps = 29/98 (29%)
Query: 134 KILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEES 193
+ L+ P+A G++A+MMV + GH + +V S +S +R L
Sbjct: 259 EALVVPIANGQLASMMVDFT-------VHGHYVMETVRC----SLRSANIWKRANDL--- 304
Query: 194 KCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQ 231
TFFKD++GV L MEPNF DYSCQ
Sbjct: 305 ---------------TFFKDHIGVDLYMEPNFEDYSCQ 327
>gi|224072337|ref|XP_002303694.1| predicted protein [Populus trichocarpa]
gi|222841126|gb|EEE78673.1| predicted protein [Populus trichocarpa]
Length = 59
Score = 43.1 bits (100), Expect = 0.12, Method: Composition-based stats.
Identities = 18/47 (38%), Positives = 29/47 (61%)
Query: 198 VCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDD 244
+C+N CK+P Q FF + G+PL M P+ D + +G +PP ++D
Sbjct: 1 MCLNMCKIPAQDFFANEFGLPLTMIPDLVDMGFEMVYGQVPPPFEED 47
>gi|386385932|ref|ZP_10071156.1| hypothetical protein STSU_22325 [Streptomyces tsukubaensis
NRRL18488]
gi|385666602|gb|EIF90121.1| hypothetical protein STSU_22325 [Streptomyces tsukubaensis
NRRL18488]
Length = 170
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/173 (20%), Positives = 71/173 (41%), Gaps = 20/173 (11%)
Query: 66 VLDDLFLSSFRNKLVQEVGLDSEKPGYDGLI---ELVNHLMMKGKSSSDARDAAQIRILV 122
+ D L L F + +G +S G+D + E G+ + +A+ + +
Sbjct: 1 MFDSLLLKKFNRIRARNLGYESPLTGWDAFVDSSEYEERTFRDGEELNKVVEASYVDFMG 60
Query: 123 SLFPPLVLKLYKILIS-PLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSG 181
P V + + P G +I +++ + +WL+G P +
Sbjct: 61 G--PRTVAAMGRFARRFPRTGTRILSLLTPHL----FRWLVG---------PMERTGPDR 105
Query: 182 VFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKF 234
+ + C +L S G+C CK+PT+ +F + + +PL + P+ +C+ F
Sbjct: 106 MRITNCTFLT-STSPGMCHRLCKVPTEKYFTEKVFIPLTLVPDVKAATCEVTF 157
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.135 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,394,732,083
Number of Sequences: 23463169
Number of extensions: 181234109
Number of successful extensions: 434225
Number of sequences better than 100.0: 166
Number of HSP's better than 100.0 without gapping: 152
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 433925
Number of HSP's gapped (non-prelim): 177
length of query: 276
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 136
effective length of database: 9,074,351,707
effective search space: 1234111832152
effective search space used: 1234111832152
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)