BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 024782
(262 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225459888|ref|XP_002262923.1| PREDICTED: uncharacterized protein LOC100267182 [Vitis vinifera]
Length = 259
Score = 305 bits (782), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 153/210 (72%), Positives = 177/210 (84%), Gaps = 12/210 (5%)
Query: 53 VALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFS 112
+ VGK +T+LRV+ + ++E+ +PEDLE ++Q++RVLELLRKNRDM+FS
Sbjct: 62 LVFVGKEETELRVSSD------------QQEDPSPEDLEYVSQIQRVLELLRKNRDMIFS 109
Query: 113 EVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMV 172
EVKLTIMIED REVERRRLLGIED +APTR+DL EALE+VNEGKVP+NR+ALRMLAEEM
Sbjct: 110 EVKLTIMIEDQREVERRRLLGIEDPDAPTREDLVEALEEVNEGKVPENRIALRMLAEEMT 169
Query: 173 QWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSA 232
QWPNLEVEA K+K SKSLYAK TDTG+DP AAKRLN+DWDSAAEIED D+ EVP A
Sbjct: 170 QWPNLEVEAPKKKVSKSLYAKATDTGVDPREAAKRLNMDWDSAAEIEDPDVSDETEVPPA 229
Query: 233 LGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+GYGALYLVTAFPVIIGISVVLILFYNSLQ
Sbjct: 230 VGYGALYLVTAFPVIIGISVVLILFYNSLQ 259
>gi|297734719|emb|CBI16953.3| unnamed protein product [Vitis vinifera]
Length = 254
Score = 305 bits (781), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 153/210 (72%), Positives = 177/210 (84%), Gaps = 12/210 (5%)
Query: 53 VALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFS 112
+ VGK +T+LRV+ + ++E+ +PEDLE ++Q++RVLELLRKNRDM+FS
Sbjct: 57 LVFVGKEETELRVSSD------------QQEDPSPEDLEYVSQIQRVLELLRKNRDMIFS 104
Query: 113 EVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMV 172
EVKLTIMIED REVERRRLLGIED +APTR+DL EALE+VNEGKVP+NR+ALRMLAEEM
Sbjct: 105 EVKLTIMIEDQREVERRRLLGIEDPDAPTREDLVEALEEVNEGKVPENRIALRMLAEEMT 164
Query: 173 QWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSA 232
QWPNLEVEA K+K SKSLYAK TDTG+DP AAKRLN+DWDSAAEIED D+ EVP A
Sbjct: 165 QWPNLEVEAPKKKVSKSLYAKATDTGVDPREAAKRLNMDWDSAAEIEDPDVSDETEVPPA 224
Query: 233 LGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+GYGALYLVTAFPVIIGISVVLILFYNSLQ
Sbjct: 225 VGYGALYLVTAFPVIIGISVVLILFYNSLQ 254
>gi|147844656|emb|CAN82138.1| hypothetical protein VITISV_019177 [Vitis vinifera]
Length = 259
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 152/209 (72%), Positives = 175/209 (83%), Gaps = 12/209 (5%)
Query: 54 ALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSE 113
VGK +T+LRV+ + ++E+ +PEDLE ++Q++RVLELLRKNRDM+FSE
Sbjct: 63 VFVGKEETELRVSSD------------QQEDPSPEDLEYVSQIQRVLELLRKNRDMIFSE 110
Query: 114 VKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQ 173
VKLTIMIED REVERRRLLGIED +APTR+DL EALE+VNEGKVP+NR+ALRMLAEEM Q
Sbjct: 111 VKLTIMIEDQREVERRRLLGIEDPDAPTREDLVEALEEVNEGKVPENRIALRMLAEEMTQ 170
Query: 174 WPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSAL 233
WPN EVEA K+K SKSLYAK TDTG+DP AAKRLN+DWDSAAEIED D+ EVP A+
Sbjct: 171 WPNXEVEAPKKKVSKSLYAKATDTGVDPREAAKRLNMDWDSAAEIEDPDVSDETEVPPAV 230
Query: 234 GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
GYGALYLVTAFPVIIGISVVLILFYNSLQ
Sbjct: 231 GYGALYLVTAFPVIIGISVVLILFYNSLQ 259
>gi|449433453|ref|XP_004134512.1| PREDICTED: uncharacterized protein LOC101220256 [Cucumis sativus]
gi|449515591|ref|XP_004164832.1| PREDICTED: uncharacterized protein LOC101223339 [Cucumis sativus]
Length = 254
Score = 299 bits (766), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 163/263 (61%), Positives = 192/263 (73%), Gaps = 13/263 (4%)
Query: 1 MAATFPQMPLA-SPIISRHGDNQINQHHPLFAATSWRASKLCLCCSRRSSQRVVALVGKG 59
MA PL+ SP S + I++ S ++ L ++R +A VGK
Sbjct: 4 MALQLRHFPLSTSPSSSIEHNYDISRRFRYLPLPS-HGTRPHLPVPSSRNRRSLACVGKE 62
Query: 60 DTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIM 119
DT LR Q +E+ EA +DLE I Q++RVLELL+KNRDMLF+EVKLT+M
Sbjct: 63 DTQLR---------QPSSTTDEQPEA--QDLEYIRQIQRVLELLKKNRDMLFNEVKLTVM 111
Query: 120 IEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEV 179
IEDPREVERRRLLGI++ +APTRDDLA LE+VNEGK PKNRVAL+MLAEEM WPNLE
Sbjct: 112 IEDPREVERRRLLGIDEDDAPTRDDLAATLEEVNEGKFPKNRVALQMLAEEMTNWPNLEA 171
Query: 180 EATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALY 239
EA K+K SKSLYAK TDTG++P AAKRLNIDWD+AAEIEDA DDPEVP+A+GYGALY
Sbjct: 172 EAPKKKRSKSLYAKATDTGVNPREAAKRLNIDWDTAAEIEDADLSDDPEVPAAVGYGALY 231
Query: 240 LVTAFPVIIGISVVLILFYNSLQ 262
+VTAFPVIIGISVVLILFYNSLQ
Sbjct: 232 IVTAFPVIIGISVVLILFYNSLQ 254
>gi|359806797|ref|NP_001241562.1| uncharacterized protein LOC100816881 [Glycine max]
gi|255636679|gb|ACU18676.1| unknown [Glycine max]
Length = 261
Score = 296 bits (759), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 154/220 (70%), Positives = 185/220 (84%), Gaps = 6/220 (2%)
Query: 46 RRSSQRV--VALVGKGDTDLRVAQENDLQDQD-QEDEEEEEEATPEDLENIAQVKRVLEL 102
RR+S + V VGK DTDLRV+ LQ+Q ED++ ++E TP+D+E +AQ+KRVLEL
Sbjct: 45 RRNSNTLSHVPSVGKEDTDLRVSS---LQEQQLGEDDDGDDEPTPQDIEYVAQIKRVLEL 101
Query: 103 LRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRV 162
LRKNRDMLF EVKLTIMIEDPREVERRRLLGIED + PTR+DL +AL+++NEGK+PK+R
Sbjct: 102 LRKNRDMLFGEVKLTIMIEDPREVERRRLLGIEDLDGPTREDLVDALDEINEGKIPKDRA 161
Query: 163 ALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAG 222
+L+MLAEE+ WPNLE K+KPSKSLYAK TDTGIDPE+AAK+LNIDWDSAAEIE+
Sbjct: 162 SLQMLAEELASWPNLEDAVPKKKPSKSLYAKATDTGIDPELAAKKLNIDWDSAAEIEETN 221
Query: 223 GGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
D+ EVPS +GYGALYLV+AFP+IIGISVVLILFYNSLQ
Sbjct: 222 ADDETEVPSVVGYGALYLVSAFPIIIGISVVLILFYNSLQ 261
>gi|356526691|ref|XP_003531950.1| PREDICTED: uncharacterized protein LOC100779639 isoform 1 [Glycine
max]
Length = 257
Score = 292 bits (748), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 156/240 (65%), Positives = 188/240 (78%), Gaps = 6/240 (2%)
Query: 25 QHHPLFAATSWRASKLCLCCSRRSSQRV--VALVGKGDTDLRVAQENDLQDQDQEDEEEE 82
HHP T ++L RR+S + V VGK DTDLRV+ + Q D + ++ +
Sbjct: 22 HHHPSVFCTGVAFTRLR---HRRTSNTIFLVPSVGKEDTDLRVSSLQEQQLDDGD-DDND 77
Query: 83 EEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTR 142
EE TP+DLE +AQ+KRVLELLRKNRDM+F EVKLTIMIED REVERRRLLGIED + PTR
Sbjct: 78 EEPTPQDLEYVAQIKRVLELLRKNRDMIFGEVKLTIMIEDAREVERRRLLGIEDLDGPTR 137
Query: 143 DDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPE 202
+DL +AL+++NEGK+PK+R AL+MLAEE+ WPNLE K+KPSKSLYAK TDTGIDPE
Sbjct: 138 EDLVDALDEINEGKIPKDRAALQMLAEELASWPNLEDALPKKKPSKSLYAKATDTGIDPE 197
Query: 203 MAAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+AAK+LNIDWDSAAEIE+ D+ EVP +GYGALYLV+AFP+IIGISVVLILFYNSLQ
Sbjct: 198 LAAKKLNIDWDSAAEIEEQNADDETEVPPVVGYGALYLVSAFPIIIGISVVLILFYNSLQ 257
>gi|356526693|ref|XP_003531951.1| PREDICTED: uncharacterized protein LOC100779639 isoform 2 [Glycine
max]
Length = 258
Score = 283 bits (725), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 155/241 (64%), Positives = 187/241 (77%), Gaps = 7/241 (2%)
Query: 25 QHHPLFAATSWRASKLCLCCSRRSSQRV--VALVGKGDTDLRVAQENDLQDQDQEDEEEE 82
HHP T ++L RR+S + V VGK DTDLRV+ + Q D + ++ +
Sbjct: 22 HHHPSVFCTGVAFTRLR---HRRTSNTIFLVPSVGKEDTDLRVSSLQEQQLDDGD-DDND 77
Query: 83 EEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTR 142
EE TP+DLE +AQ+KRVLELLRKNRDM+F EVKLTIMIED REVERRRLLGIED + PTR
Sbjct: 78 EEPTPQDLEYVAQIKRVLELLRKNRDMIFGEVKLTIMIEDAREVERRRLLGIEDLDGPTR 137
Query: 143 DDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPE 202
+DL +AL+++NEGK+PK+R AL+MLAEE+ WPNLE K+KPSKSLYAK TDTGIDPE
Sbjct: 138 EDLVDALDEINEGKIPKDRAALQMLAEELASWPNLEDALPKKKPSKSLYAKATDTGIDPE 197
Query: 203 MAAKRLNIDWDSAAEIEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSL 261
+AAK+LNIDWDSAAEIE+ D+ E+ L GYGALYLV+AFP+IIGISVVLILFYNSL
Sbjct: 198 LAAKKLNIDWDSAAEIEEQNADDETEMFFLLQGYGALYLVSAFPIIIGISVVLILFYNSL 257
Query: 262 Q 262
Q
Sbjct: 258 Q 258
>gi|224053647|ref|XP_002297910.1| predicted protein [Populus trichocarpa]
gi|222845168|gb|EEE82715.1| predicted protein [Populus trichocarpa]
Length = 215
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 135/178 (75%), Positives = 158/178 (88%), Gaps = 1/178 (0%)
Query: 86 TPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDL 145
+P+DL+ + +KRVLELLRKNRDM+FSEVKLT+MIEDPREVERRRLLGI+D++ PTR+DL
Sbjct: 38 SPQDLQYVQDIKRVLELLRKNRDMIFSEVKLTVMIEDPREVERRRLLGIDDADTPTREDL 97
Query: 146 AEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAA 205
AEALEQVNEGK+PK R AL+MLAEEM+ WPNLEVEATK KPSKSLYAK TDTGI+ + AA
Sbjct: 98 AEALEQVNEGKIPKKREALQMLAEEMINWPNLEVEATKTKPSKSLYAKATDTGINLKEAA 157
Query: 206 KRLNIDWDSAAEIEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+RL IDWDSAAEIEDA D+ +V + L GYGALYLVT FP+IIG++VVLILFYNSLQ
Sbjct: 158 RRLKIDWDSAAEIEDADMNDETDVVTLLQGYGALYLVTGFPIIIGVAVVLILFYNSLQ 215
>gi|357517361|ref|XP_003628969.1| hypothetical protein MTR_8g070680 [Medicago truncatula]
gi|355522991|gb|AET03445.1| hypothetical protein MTR_8g070680 [Medicago truncatula]
gi|388514717|gb|AFK45420.1| unknown [Medicago truncatula]
Length = 267
Score = 276 bits (706), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 159/239 (66%), Positives = 186/239 (77%), Gaps = 7/239 (2%)
Query: 27 HPLFA-ATSWRASKLCLCCSRRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEA 85
H LF TS+ L + R V G D DL V+ L D D D++EEEE
Sbjct: 33 HSLFTNGTSFTK----LSTTHRRILSFVPSAGNEDADLSVSSTQQLDDDDVVDDDEEEEE 88
Query: 86 -TPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDD 144
TP+DLEN+A++KRVLELL++NRDMLF EVKLTIMIEDPR+VER+RLLGIED +APTR+D
Sbjct: 89 PTPQDLENVAEIKRVLELLKRNRDMLFGEVKLTIMIEDPRDVERKRLLGIEDLDAPTRED 148
Query: 145 LAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVE-ATKQKPSKSLYAKVTDTGIDPEM 203
L ALE+VNEGK+PK++ L+MLAEEM WPNLE E + K+KP KSLYAK TDTGIDP++
Sbjct: 149 LVAALEEVNEGKIPKDKAVLQMLAEEMTAWPNLEAELSPKKKPKKSLYAKTTDTGIDPQV 208
Query: 204 AAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
AAKRLNIDWDSAAEIE+A D+ EVPS LGYGALYLV+AFPVIIGISVVLILFYNSLQ
Sbjct: 209 AAKRLNIDWDSAAEIEEAKTDDETEVPSVLGYGALYLVSAFPVIIGISVVLILFYNSLQ 267
>gi|21554237|gb|AAM63312.1| unknown [Arabidopsis thaliana]
Length = 280
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 151/286 (52%), Positives = 199/286 (69%), Gaps = 30/286 (10%)
Query: 1 MAATFPQMPLASPIISRHGDNQINQHHPLFAATSWRASKLCLCCS-----------RRSS 49
M Q+PL S Q N + A+S S + +C S R+++
Sbjct: 1 MTTQIFQLPLKYCASSFSSTGQRN-----YGASS-SPSPIVICKSNGISDGLWVKRRKNN 54
Query: 50 QRVVALV---GKGD-TDLRVAQENDLQDQDQEDEEEEEEAT--------PEDLENIAQVK 97
+R +L+ KGD T++RV+ L+ Q++E + ++E PEDL+ + ++K
Sbjct: 55 RRFGSLIVKQEKGDVTEIRVSVPLTLEQQEKEKQNRDDEEDEIDEGDVDPEDLKYVNEIK 114
Query: 98 RVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKV 157
RV+ELLR+NRDM+FSEVKLTIMIEDPRE+ERRRLLGIED++ P+RDDLAEALEQVN+GK+
Sbjct: 115 RVIELLRRNRDMIFSEVKLTIMIEDPRELERRRLLGIEDADTPSRDDLAEALEQVNDGKI 174
Query: 158 PKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAE 217
PK+R LRML EEM++WPNLEVE +K++ KS+YAK TDTGIDP+ AAKRLN++WDSAA
Sbjct: 175 PKDRATLRMLHEEMIRWPNLEVEVSKKQRGKSMYAKSTDTGIDPKEAAKRLNVEWDSAAA 234
Query: 218 IEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
IE+ D+ V + + GYGALY V+A PVIIGISVVLILFYNSLQ
Sbjct: 235 IEEVDVDDEQGVVTKVAGYGALYFVSALPVIIGISVVLILFYNSLQ 280
>gi|18422466|ref|NP_568637.1| uncharacterized protein [Arabidopsis thaliana]
gi|8953763|dbj|BAA98118.1| unnamed protein product [Arabidopsis thaliana]
gi|110740965|dbj|BAE98577.1| hypothetical protein [Arabidopsis thaliana]
gi|332007760|gb|AED95143.1| uncharacterized protein [Arabidopsis thaliana]
Length = 280
Score = 267 bits (683), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 151/286 (52%), Positives = 198/286 (69%), Gaps = 30/286 (10%)
Query: 1 MAATFPQMPLASPIISRHGDNQINQHHPLFAATSWRASKLCLCCS-----------RRSS 49
M Q+PL S Q N + A+S S + +C S R+++
Sbjct: 1 MTTQIFQLPLKYCASSFSSTGQRN-----YGASS-SPSPIVICKSNGISDGLWVKRRKNN 54
Query: 50 QRVVALV---GKGD-TDLRVAQENDLQDQDQEDEEEEEEAT--------PEDLENIAQVK 97
+R +L+ KGD T++RV L+ Q++E + ++E PEDL+ + ++K
Sbjct: 55 RRFGSLIVKQEKGDVTEIRVPVPLTLEQQEKEKQNRDDEEDEIDEGDVDPEDLKYVNEIK 114
Query: 98 RVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKV 157
RV+ELLR+NRDM+FSEVKLTIMIEDPRE+ERRRLLGIED++ P+RDDLAEALEQVN+GK+
Sbjct: 115 RVIELLRRNRDMIFSEVKLTIMIEDPRELERRRLLGIEDADTPSRDDLAEALEQVNDGKI 174
Query: 158 PKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAE 217
PK+R LRML EEM++WPNLEVE +K++ KS+YAK TDTGIDP+ AAKRLN++WDSAA
Sbjct: 175 PKDRATLRMLHEEMIRWPNLEVEVSKKQRGKSMYAKSTDTGIDPKEAAKRLNVEWDSAAA 234
Query: 218 IEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
IE+ D+ V + + GYGALY V+A PVIIGISVVLILFYNSLQ
Sbjct: 235 IEEVDVDDEQGVVTKVAGYGALYFVSALPVIIGISVVLILFYNSLQ 280
>gi|255586744|ref|XP_002533992.1| conserved hypothetical protein [Ricinus communis]
gi|223526012|gb|EEF28389.1| conserved hypothetical protein [Ricinus communis]
Length = 269
Score = 267 bits (682), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 158/264 (59%), Positives = 193/264 (73%), Gaps = 17/264 (6%)
Query: 1 MAATFPQMPLASPIISRHGDNQINQHHPLF----------AATSWRASKLCLC-----CS 45
MA Q+P+ASP+ + H LF + + R + CL S
Sbjct: 1 MALQIYQLPVASPLSVSSSSSSSLHTHFLFLYHNHQQQFQSCSRIRLASFCLHQNTSNVS 60
Query: 46 RRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEA-TPEDLENIAQVKRVLELLR 104
R+ + + LVGK D ++RV E + +++D+ED+EEE +P+DL+ I ++K+VL+LL+
Sbjct: 61 WRNRKNGL-LVGKEDLEMRVQDEENEEEEDEEDDEEEAPPPSPQDLQYIQEIKKVLDLLK 119
Query: 105 KNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVAL 164
+NRDMLF+EVKLTIMIEDPREVERRRLLGIED +APTR+DLAEALEQVNEGKVPKNRVAL
Sbjct: 120 RNRDMLFNEVKLTIMIEDPREVERRRLLGIEDLDAPTREDLAEALEQVNEGKVPKNRVAL 179
Query: 165 RMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGG 224
++LAEEM+ WPNLEVEATK P KSLYAK TDTGIDP AAKRL IDWDSAAEIED
Sbjct: 180 QILAEEMMNWPNLEVEATKTAPKKSLYAKATDTGIDPREAAKRLKIDWDSAAEIEDVDSS 239
Query: 225 DDPEVPSALGYGALYLVTAFPVII 248
D+ EVP A+GYGALYLVTAFPVII
Sbjct: 240 DETEVPPAVGYGALYLVTAFPVII 263
>gi|297794895|ref|XP_002865332.1| hypothetical protein ARALYDRAFT_494519 [Arabidopsis lyrata subsp.
lyrata]
gi|297311167|gb|EFH41591.1| hypothetical protein ARALYDRAFT_494519 [Arabidopsis lyrata subsp.
lyrata]
Length = 278
Score = 262 bits (669), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 150/278 (53%), Positives = 190/278 (68%), Gaps = 16/278 (5%)
Query: 1 MAATFPQMPLASPIISRHGDNQINQH-----HPLFAATSWRASKLCLCCSRRSSQRVVAL 55
M Q+PL + S Q N P S S RR +QR ++
Sbjct: 1 MVTQIFQLPLQYCVSSFSSAGQRNYGVSSSPTPFVICKSNGISDDLWVKRRRKNQRFGSV 60
Query: 56 V---GKGD-TDLRVAQENDLQDQDQEDEEEEEEATPED------LENIAQVKRVLELLRK 105
+ KGD T++RV L+ Q++E + ++E E L+ + ++KRVLELLR+
Sbjct: 61 IVKQEKGDVTEIRVPVPLTLEQQEKEKQNRDDEEDEEGEVDPEDLKYVNEIKRVLELLRR 120
Query: 106 NRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALR 165
NRDM+FSEVKLTIMIEDPREVERRRLLGIED++ P+R+DLAEALEQVN+GK+PK+R LR
Sbjct: 121 NRDMIFSEVKLTIMIEDPREVERRRLLGIEDADTPSREDLAEALEQVNDGKIPKDRATLR 180
Query: 166 MLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGD 225
ML EEM++WPNLEVE + ++ KS+YAK TDTGIDP+ AAKRLNI+WDSAA IE+A D
Sbjct: 181 MLHEEMIRWPNLEVEVSNKQRGKSMYAKSTDTGIDPKEAAKRLNIEWDSAAAIEEADVDD 240
Query: 226 DPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+ V + + GYGALY V+A PVIIGISVVLILFYNSLQ
Sbjct: 241 EQGVVTKVAGYGALYFVSALPVIIGISVVLILFYNSLQ 278
>gi|218189215|gb|EEC71642.1| hypothetical protein OsI_04075 [Oryza sativa Indica Group]
Length = 292
Score = 253 bits (646), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 143/217 (65%), Positives = 173/217 (79%), Gaps = 5/217 (2%)
Query: 49 SQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRD 108
S VA G+GD++ + +D D E+ EA+PEDLE+I QVKRVLELL+KNRD
Sbjct: 78 SDTAVAPEGEGDSEPPAS--SDGAAGDIASSAEQPEASPEDLEDIRQVKRVLELLQKNRD 135
Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
M F EVKLTIMIEDPR++ER+RLLGIED + TRDDLA+AL +VNEG++P+NRVAL++LA
Sbjct: 136 MTFGEVKLTIMIEDPRDIERKRLLGIEDPDEITRDDLADALVEVNEGRIPENRVALQLLA 195
Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
+EM +WP+LE+EA K KP KS+YAK TDTGIDPE AAKRLNIDWDSAA ++ D
Sbjct: 196 KEMTEWPDLEMEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNIDWDSAADLDDEEEEDD 255
Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+ EVPSA+GY ALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 256 ETEVPSAVGYSALYLLTAFPVIIGISVVLILFYNSLQ 292
>gi|115440489|ref|NP_001044524.1| Os01g0798500 [Oryza sativa Japonica Group]
gi|55297680|dbj|BAD68270.1| unknown protein [Oryza sativa Japonica Group]
gi|113534055|dbj|BAF06438.1| Os01g0798500 [Oryza sativa Japonica Group]
gi|215686962|dbj|BAG90808.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619394|gb|EEE55526.1| hypothetical protein OsJ_03754 [Oryza sativa Japonica Group]
Length = 292
Score = 253 bits (645), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 143/217 (65%), Positives = 173/217 (79%), Gaps = 5/217 (2%)
Query: 49 SQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRD 108
S VA G+GD++ + +D D E+ EA+PEDLE+I QVKRVLELL+KNRD
Sbjct: 78 SDTAVAPEGEGDSEPPAS--SDGAAGDIAASAEQPEASPEDLEDIRQVKRVLELLQKNRD 135
Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
M F EVKLTIMIEDPR++ER+RLLGIED + TRDDLA+AL +VNEG++P+NRVAL++LA
Sbjct: 136 MTFGEVKLTIMIEDPRDIERKRLLGIEDPDEITRDDLADALVEVNEGRIPENRVALQLLA 195
Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
+EM +WP+LE+EA K KP KS+YAK TDTGIDPE AAKRLNIDWDSAA ++ D
Sbjct: 196 KEMTEWPDLEMEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNIDWDSAADLDDEEEEDD 255
Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+ EVPSA+GY ALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 256 ETEVPSAVGYSALYLLTAFPVIIGISVVLILFYNSLQ 292
>gi|414880128|tpg|DAA57259.1| TPA: hypothetical protein ZEAMMB73_612794 [Zea mays]
Length = 294
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/182 (72%), Positives = 157/182 (86%), Gaps = 3/182 (1%)
Query: 84 EATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRD 143
EA+PEDLENI ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED + TRD
Sbjct: 113 EASPEDLENIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRD 172
Query: 144 DLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDP 201
DLA+AL +VNEGK+P+NRVAL++LA+EM +WP++E+EA K KP KS+YAK TDTGIDP
Sbjct: 173 DLADALAEVNEGKIPENRVALQLLAKEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDP 232
Query: 202 EMAAKRLNIDWDSAA-EIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNS 260
E AAKRLN+DWDSAA + D+ EVPSA+GYGALYL+TAFPVIIGISVVLILFYNS
Sbjct: 233 ETAAKRLNLDWDSAADIDGEEEDDDETEVPSAVGYGALYLLTAFPVIIGISVVLILFYNS 292
Query: 261 LQ 262
LQ
Sbjct: 293 LQ 294
>gi|226498212|ref|NP_001142698.1| uncharacterized protein LOC100275005 [Zea mays]
gi|195608354|gb|ACG26007.1| hypothetical protein [Zea mays]
Length = 291
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/182 (72%), Positives = 157/182 (86%), Gaps = 3/182 (1%)
Query: 84 EATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRD 143
EA+PEDLENI ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED + TRD
Sbjct: 110 EASPEDLENIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRD 169
Query: 144 DLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDP 201
DLA+AL +VNEGK+P+NRVAL++LA+EM +WP++E+EA K KP KS+YAK TDTGIDP
Sbjct: 170 DLADALAEVNEGKIPENRVALQLLAKEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDP 229
Query: 202 EMAAKRLNIDWDSAA-EIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNS 260
E AAKRLN+DWDSAA + D+ EVPSA+GYGALYL+TAFPVIIGISVVLILFYNS
Sbjct: 230 ETAAKRLNLDWDSAADIDGEEEDDDETEVPSAVGYGALYLLTAFPVIIGISVVLILFYNS 289
Query: 261 LQ 262
LQ
Sbjct: 290 LQ 291
>gi|357125434|ref|XP_003564399.1| PREDICTED: uncharacterized protein LOC100835197 [Brachypodium
distachyon]
Length = 290
Score = 240 bits (613), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 128/185 (69%), Positives = 155/185 (83%), Gaps = 3/185 (1%)
Query: 81 EEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAP 140
E EA+PEDLE+I +VKRVLELL+KNRDM F EVKLTIMIEDPR+VER+RLLGIED +
Sbjct: 106 EPAEASPEDLEDIREVKRVLELLQKNRDMTFGEVKLTIMIEDPRDVERKRLLGIEDPDEL 165
Query: 141 TRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKP--SKSLYAKVTDTG 198
TRDDLA+AL +VNEG++P+NR ALR+LA+EM +WP++E+EA K K KS+YAK TDTG
Sbjct: 166 TRDDLADALVEVNEGRIPENRDALRLLAKEMAEWPDVEIEAPKSKGMFGKSIYAKATDTG 225
Query: 199 IDPEMAAKRLNIDW-DSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILF 257
IDP AAKRLNIDW +A ++A D+ EVPS +GYGALYL++AFPVIIG+SVVLILF
Sbjct: 226 IDPVAAAKRLNIDWDSAADIDDEADDDDEAEVPSVVGYGALYLLSAFPVIIGVSVVLILF 285
Query: 258 YNSLQ 262
YNSLQ
Sbjct: 286 YNSLQ 290
>gi|255646421|gb|ACU23689.1| unknown [Glycine max]
Length = 233
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 125/198 (63%), Positives = 154/198 (77%), Gaps = 6/198 (3%)
Query: 25 QHHPLFAATSWRASKLCLCCSRRSSQRV--VALVGKGDTDLRVAQENDLQDQDQEDEEEE 82
HHP + T ++L RR+S + V VGK DTDLRV+ + Q D + ++ +
Sbjct: 22 HHHPSVSCTGVAFTRLR---HRRTSNTIFLVPSVGKEDTDLRVSSLQEQQLDDGD-DDND 77
Query: 83 EEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTR 142
EE TP+DLE +AQ+KRVLELLRKNRDM+F EVKLTIMIED REVERRRLLGIED + PTR
Sbjct: 78 EEPTPQDLEYVAQIKRVLELLRKNRDMIFGEVKLTIMIEDAREVERRRLLGIEDLDGPTR 137
Query: 143 DDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPE 202
+DL +AL+++NEGK+PK+R AL+MLAEE+ WPNLE K+KPSKSLYAK TDTGIDPE
Sbjct: 138 EDLVDALDEINEGKIPKDRAALQMLAEELTSWPNLEDALPKKKPSKSLYAKATDTGIDPE 197
Query: 203 MAAKRLNIDWDSAAEIED 220
+AAK+LNIDWDSAAEIE+
Sbjct: 198 LAAKKLNIDWDSAAEIEE 215
>gi|414880129|tpg|DAA57260.1| TPA: hypothetical protein ZEAMMB73_612794 [Zea mays]
Length = 530
Score = 217 bits (553), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 115/165 (69%), Positives = 140/165 (84%), Gaps = 3/165 (1%)
Query: 84 EATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRD 143
EA+PEDLENI ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED + TRD
Sbjct: 113 EASPEDLENIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRD 172
Query: 144 DLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDP 201
DLA+AL +VNEGK+P+NRVAL++LA+EM +WP++E+EA K KP KS+YAK TDTGIDP
Sbjct: 173 DLADALAEVNEGKIPENRVALQLLAKEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDP 232
Query: 202 EMAAKRLNIDWDSAA-EIEDAGGGDDPEVPSALGYGALYLVTAFP 245
E AAKRLN+DWDSAA + D+ EVPSA+GYGALYL+TAFP
Sbjct: 233 ETAAKRLNLDWDSAADIDGEEEDDDETEVPSAVGYGALYLLTAFP 277
>gi|55297679|dbj|BAD68269.1| unknown protein [Oryza sativa Japonica Group]
Length = 270
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/217 (58%), Positives = 154/217 (70%), Gaps = 27/217 (12%)
Query: 49 SQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRD 108
S VA G+GD++ + +D D E+ EA+PEDLE
Sbjct: 78 SDTAVAPEGEGDSEPPAS--SDGAAGDIAASAEQPEASPEDLE----------------- 118
Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
VKLTIMIEDPR++ER+RLLGIED + TRDDLA+AL +VNEG++P+NRVAL++LA
Sbjct: 119 -----VKLTIMIEDPRDIERKRLLGIEDPDEITRDDLADALVEVNEGRIPENRVALQLLA 173
Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
+EM +WP+LE+EA K KP KS+YAK TDTGIDPE AAKRLNIDWDSAA ++ D
Sbjct: 174 KEMTEWPDLEMEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNIDWDSAADLDDEEEEDD 233
Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+ EVPSA+GY ALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 234 ETEVPSAVGYSALYLLTAFPVIIGISVVLILFYNSLQ 270
>gi|414880127|tpg|DAA57258.1| TPA: hypothetical protein ZEAMMB73_612794 [Zea mays]
Length = 157
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 112/157 (71%), Positives = 133/157 (84%), Gaps = 3/157 (1%)
Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
M F EVKLTIMIEDPR++ER+R LGIED + TRDDLA+AL +VNEGK+P+NRVAL++LA
Sbjct: 1 MTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRDDLADALAEVNEGKIPENRVALQLLA 60
Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
+EM +WP++E+EA K KP KS+YAK TDTGIDPE AAKRLN+DWDSAA + D
Sbjct: 61 KEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNLDWDSAADIDGEEEDDD 120
Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+ EVPSA+GYGALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 121 ETEVPSAVGYGALYLLTAFPVIIGISVVLILFYNSLQ 157
>gi|242054671|ref|XP_002456481.1| hypothetical protein SORBIDRAFT_03g037100 [Sorghum bicolor]
gi|241928456|gb|EES01601.1| hypothetical protein SORBIDRAFT_03g037100 [Sorghum bicolor]
Length = 460
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/139 (69%), Positives = 121/139 (87%), Gaps = 2/139 (1%)
Query: 80 EEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNA 139
+E EA+PEDLE+I ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED +
Sbjct: 86 DEPAEASPEDLESIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDE 145
Query: 140 PTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDT 197
TRDDLA+AL +VNEG++P+NRVAL++LA+EM +WP++E+EA K KP KS+YAK TDT
Sbjct: 146 ITRDDLADALAEVNEGRIPENRVALQLLAKEMSEWPDIEIEAPKKKSKPGKSVYAKATDT 205
Query: 198 GIDPEMAAKRLNIDWDSAA 216
GIDPE AAKRLN+DWDSAA
Sbjct: 206 GIDPETAAKRLNLDWDSAA 224
>gi|168045661|ref|XP_001775295.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673376|gb|EDQ59900.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 172
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 95/174 (54%), Positives = 132/174 (75%), Gaps = 9/174 (5%)
Query: 95 QVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQV-N 153
+V +VL+LL+K RDM F+EV+LTIMIEDPREVERRR LGIED +++D+ AL++V
Sbjct: 2 RVLQVLDLLKKKRDMTFNEVRLTIMIEDPREVERRRQLGIEDERGCSKEDMGIALQEVVY 61
Query: 154 EGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWD 213
EG++P++R+ LR L +EM+ WPNLE E ++ P S YAKVT TG+DP++AA+R +DWD
Sbjct: 62 EGRLPEDRLVLRELTKEMLAWPNLEDEISEVNPLASPYAKVTPTGVDPKVAAQRAKVDWD 121
Query: 214 SAAEIEDAGGGDDPE-----VPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+AAEI+ G++P+ VP +G+ LYLV+ PVII ++VVLILF+NSLQ
Sbjct: 122 AAAEIQ---PGEEPKDLSDMVPPVVGFSFLYLVSFIPVIIVVAVVLILFFNSLQ 172
>gi|311223468|gb|ADP76805.1| Y3IP1 [Nicotiana tabacum]
Length = 92
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/91 (79%), Positives = 81/91 (89%), Gaps = 1/91 (1%)
Query: 173 QWPNLEVEATKQ-KPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPS 231
WPNLEVEATKQ KP +SLYAK TDTGIDP+ AAKRL IDWDSAAEI+++ D+P+VP
Sbjct: 2 SWPNLEVEATKQNKPGRSLYAKATDTGIDPKEAAKRLKIDWDSAAEIDESAESDEPDVPP 61
Query: 232 ALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
ALGYGALYLV+AFP+IIGISVVLILFYNSLQ
Sbjct: 62 ALGYGALYLVSAFPIIIGISVVLILFYNSLQ 92
>gi|307111552|gb|EFN59786.1| hypothetical protein CHLNCDRAFT_133479 [Chlorella variabilis]
Length = 231
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 77/193 (39%), Positives = 109/193 (56%), Gaps = 13/193 (6%)
Query: 78 DEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDS 137
DE EEEA +V +L L++ RDM +EV+LT+ IEDPR ERR L+G+EDS
Sbjct: 44 DEVSEEEAK----RMCREVVGILRTLKEQRDMSLNEVRLTVAIEDPRARERR-LMGMEDS 98
Query: 138 NAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDT 197
+ +R+++A AL +V+EG++P +R+ALR L +EM QWP L+ A + K+ A
Sbjct: 99 SGISREEMAAALMEVSEGRIPADRIALRELCKEMSQWPALDEPARVRPWLKTPAAAEGGA 158
Query: 198 GIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSAL--------GYGALYLVTAFPVIIG 249
G AA+ W E+ G D E P + GYGALYLV+ PV+I
Sbjct: 159 GSALTPAAEDAAAAWRQQGEVRPPMGRDPTEQPRGMADQLPDWVGYGALYLVSIAPVLIA 218
Query: 250 ISVVLILFYNSLQ 262
I + +LF NSL+
Sbjct: 219 IGAITVLFLNSLR 231
>gi|159474404|ref|XP_001695315.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275798|gb|EDP01573.1| predicted protein [Chlamydomonas reinhardtii]
Length = 268
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 125/261 (47%), Gaps = 43/261 (16%)
Query: 34 SWRASKLCLCCSRRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENI 93
S R + +RR R V + + DL V E ++ Q E E P+DL I
Sbjct: 19 SARVCRRAFAGARRPLPRPVRATSE-EVDLEVQVEQFMKRQ-AEIESGAAFVRPKDLATI 76
Query: 94 ---------------AQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSN 138
A + VL+ L++ RDM F+EVKL + IEDPR ERR IED
Sbjct: 77 IGGDVVSEEMAQRYCADIFEVLKTLKRTRDMSFNEVKLIVSIEDPRARERR-AQDIEDER 135
Query: 139 APTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSL-------- 190
+RD++A+AL +V EG+VP +R+AL+ L +EM+ WP LEV K
Sbjct: 136 GVSRDEMAQALIEVGEGRVPNDRIALKCLHDEMLGWPFLEVATEAPAAPKGTVAAAPAAP 195
Query: 191 -------YAKVT--DTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALYLV 241
YA + DT + P + L + + + D +P +GYG LY V
Sbjct: 196 AKPSASDYASLMGGDTVVKPYVMGDNLR-EGEKPQNLTDM-------LPGWVGYGVLYGV 247
Query: 242 TAFPVIIGISVVLILFYNSLQ 262
+A PV++ I +LILFYNSL+
Sbjct: 248 SAIPVLLVIGTILILFYNSLK 268
>gi|308802908|ref|XP_003078767.1| unnamed protein product [Ostreococcus tauri]
gi|116057220|emb|CAL51647.1| unnamed protein product [Ostreococcus tauri]
Length = 226
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 92/172 (53%), Gaps = 10/172 (5%)
Query: 93 IAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQV 152
+ ++K L+LL R+M F+EVKLTI IEDPR E+R GIED + + ++ E L+ +
Sbjct: 63 VVEIKDWLKLLYVKREMSFNEVKLTIGIEDPRLAEQRERYGIEDESGVSAEEKMETLDMI 122
Query: 153 NEGKVPKNRVALRMLAEEMVQWPNL--EVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNI 210
G+ P + A+ + EE WP L E++ T+ S Y +++ + A R
Sbjct: 123 ERGERPTDARAVATVLEEFRNWPGLDAEIDTTQYDEGPSRYEEISMRAAGIKTANPRRVR 182
Query: 211 DWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
+ AE + G + G+ LYLV+A P+ I I+ V I+F+NSLQ
Sbjct: 183 AKEEPAEEQKEG--------TIFGFLPLYLVSAVPIFITIAAVSIMFFNSLQ 226
>gi|311223466|gb|ADP76804.1| Y3IP1 [Nicotiana tabacum]
Length = 65
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/62 (72%), Positives = 53/62 (85%), Gaps = 1/62 (1%)
Query: 174 WPNLEVEATKQ-KPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSA 232
WPNLEVEATKQ KP +SLYAK TDTGI+P+ AAKRL IDWDSAAEI+++ D+P+VP A
Sbjct: 3 WPNLEVEATKQNKPGRSLYAKATDTGINPKEAAKRLKIDWDSAAEIDESAESDEPDVPPA 62
Query: 233 LG 234
LG
Sbjct: 63 LG 64
>gi|384251217|gb|EIE24695.1| hypothetical protein COCSUDRAFT_62118 [Coccomyxa subellipsoidea
C-169]
Length = 1671
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 80/142 (56%), Gaps = 10/142 (7%)
Query: 95 QVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNE 154
+V RVL+ L++ RDM SEVKLT+ IED E++ +GIED +RD++A AL +V +
Sbjct: 66 EVVRVLKELKETRDMSVSEVKLTLAIEDASVREQKEYMGIEDERGASRDEMAAALVEVAD 125
Query: 155 GKVPKNRVALRMLAEEMVQWP---NLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNID 211
G+VPK+R+ALR L +EM+QWP N E A S S Y +TDTG K
Sbjct: 126 GRVPKDRIALRELYKEMIQWPFVGNAEAAAPTAS-SSSPYEAITDTGT------KSAAAG 178
Query: 212 WDSAAEIEDAGGGDDPEVPSAL 233
W A G D E P +L
Sbjct: 179 WRPAQARPKMMGRDKSEAPQSL 200
>gi|412985941|emb|CCO17141.1| predicted protein [Bathycoccus prasinos]
Length = 290
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 94/174 (54%), Gaps = 14/174 (8%)
Query: 95 QVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNE 154
++K L+LL R+M F+EVKL I IEDPR +RR GIED + + ++ + LE V+E
Sbjct: 125 EIKDWLKLLYVKREMSFNEVKLIIGIEDPRLADRRERFGIEDESGCSAEEKIQVLEDVDE 184
Query: 155 GKVPKNRVALRMLAEEMVQWPNLEV-----EATKQKPSKSLYAKVTDTGIDPEM-AAKRL 208
G+VP + VA + E+ +WP L+ E + PS+ + GI KR+
Sbjct: 185 GRVPTDIVACEFVLNELREWPGLDSDIELPERRETGPSRYEEIAMDAAGIKGRTRNPKRM 244
Query: 209 NIDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
D+ +E+ D PE + LG+ LY V++ P+ I I + I+F NSLQ
Sbjct: 245 LDDY-----LEE---DDKPEEKNLLGFLPLYAVSSAPIFITIFALGIMFVNSLQ 290
>gi|145345491|ref|XP_001417242.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577469|gb|ABO95535.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 163
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/173 (31%), Positives = 87/173 (50%), Gaps = 14/173 (8%)
Query: 93 IAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQV 152
+ ++K L+LL R+M F+E+KLT+ IEDPR E R GIED + + D+ E LE +
Sbjct: 2 VTEIKDWLKLLYVKREMSFNEIKLTVGIEDPRLAENRERYGIEDESGVSADEKVETLEMI 61
Query: 153 NEGKVPKNRVALRMLAEEMVQWPNLEV---EATKQKPSKSLYAKVTDTGIDPEMAAKRLN 209
G+ P + A+ L EE WP L+ ++ PS+ E+A +
Sbjct: 62 ERGETPTDARAVSTLLEEFRNWPGLDAVVDQSVDAGPSRY-----------EEIAMRSAG 110
Query: 210 IDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
I + + ++ + + G+ LYLV+A P+ I + V I+F NSLQ
Sbjct: 111 IKTSNPRRVRAKEAEEEKKEGTIFGFLPLYLVSAVPIFITVFAVGIMFVNSLQ 163
>gi|302854206|ref|XP_002958613.1| hypothetical protein VOLCADRAFT_121729 [Volvox carteri f.
nagariensis]
gi|300256074|gb|EFJ40350.1| hypothetical protein VOLCADRAFT_121729 [Volvox carteri f.
nagariensis]
Length = 196
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 57/81 (70%), Gaps = 1/81 (1%)
Query: 99 VLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVP 158
VL+ L++ RDM F+EVKL + IEDPR ERR +ED +RD++A AL + EG++P
Sbjct: 95 VLKTLKRTRDMSFNEVKLIVSIEDPRARERR-AADMEDERGVSRDEMAAALVDIGEGRIP 153
Query: 159 KNRVALRMLAEEMVQWPNLEV 179
+R+AL+ L +EM+ WP LEV
Sbjct: 154 NDRIALKCLHDEMLNWPFLEV 174
>gi|224075389|ref|XP_002304613.1| predicted protein [Populus trichocarpa]
gi|222842045|gb|EEE79592.1| predicted protein [Populus trichocarpa]
Length = 84
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/57 (59%), Positives = 39/57 (68%)
Query: 178 EVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSALG 234
EVEA PSK LYAK DTG DP+ AAKRL I+WDSAAEIE G D+P + +L
Sbjct: 27 EVEAPVTNPSKFLYAKAIDTGKDPKKAAKRLKIEWDSAAEIEYTGLNDEPAMVCSLS 83
>gi|386812234|ref|ZP_10099459.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386404504|dbj|GAB62340.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 439
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 60/125 (48%), Gaps = 19/125 (15%)
Query: 59 GDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTI 118
G+T+L +A + +D + E N+ ++K ++LL N D + + L I
Sbjct: 257 GETNLLMAHSIIISKEDTAEPE-----------NVKRLKEQVKLL--NPDAIIVDAALPI 303
Query: 119 MIEDPREVERRRLLGIEDSNAPTRDDLAE-----ALEQVNEGKVPKNR-VALRMLAEEMV 172
+E+P ++ +R+L IED T +A A E+ G++ R A+R +AE
Sbjct: 304 TVEEPELIKGKRVLVIEDGPTLTHGGMAFGAGILAAEKYGAGEIIDPRPFAIRSIAETFR 363
Query: 173 QWPNL 177
+P++
Sbjct: 364 NYPHI 368
>gi|341820470|emb|CCC56750.1| stage 0 DNA-binding protein [Weissella thailandensis fsh4-2]
Length = 301
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 100/219 (45%), Gaps = 32/219 (14%)
Query: 1 MAATFPQMPLASPIISRHGDNQINQHHPLFAATSWRASKLCLCCSRRSSQRVVALVGKGD 60
+A + Q + P+I RH +Q+ L WRAS++ + + A+V K D
Sbjct: 65 LAESIKQNGVLQPVIVRHNPKDSSQYEILAGERRWRASQIANIDT------IPAIVRKLD 118
Query: 61 TDLRVAQENDLQDQDQEDEEEEEEATPE----DLENIAQVKRVLELLRKNRDMLFSEVKL 116
D ++ Q L++ +ED EEA D N+ Q + V + L K R ++ + ++L
Sbjct: 119 DD-QMLQAAILENLQREDLTPLEEAQAYRDMMDGLNLTQAQ-VAKRLGKARSVVANFLRL 176
Query: 117 TIMIEDPREV---------ERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRML 167
+ ++ +E+ + R LLG+ + R + A V EG + +R L
Sbjct: 177 LNLPDEVKELLNAGQLSMGQARTLLGLHNK----RRIVPVAKRAVAEG------MTVRQL 226
Query: 168 AEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAK 206
E++V N + TK++PS S Y K + T ++ + K
Sbjct: 227 -EQLVNKLNEQSAPTKEQPSLSPYIKASTTALENKFGTK 264
>gi|397904251|ref|ZP_10505170.1| V-type ATP synthase subunit I [Caloramator australicus RC3]
gi|343178998|emb|CCC58069.1| V-type ATP synthase subunit I [Caloramator australicus RC3]
Length = 630
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 75/162 (46%), Gaps = 24/162 (14%)
Query: 36 RASKLCLCCSRRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQ 95
+ KL L C++ ++++ L+ K L V Q DL+D+ ++ + E+E+ + E A+
Sbjct: 5 KMKKLNLACAKEDKEKILKLLQK----LGVVQIVDLKDKLEDTDYEKEKPSNEVEYEFAK 60
Query: 96 VKRVLELLRKNRDMLFSEVKLTIMIEDPREV----ERRRLLGIEDSNAPTRDDLAEAL-- 149
VK E L++ +SE K I P+ V E RL DS+ + A
Sbjct: 61 VKFTYEFLKQ-----YSEEKKGIF--TPKNVLSFEEFERL----DSHLDWEEYYKAAKFI 109
Query: 150 -EQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSL 190
E +N + K +V L ++ W NL+V + K K++
Sbjct: 110 EESMNTNRTKKGKVL--SLIDQYSNWVNLDVSEAELKSLKNV 149
>gi|402470870|gb|EJW04907.1| hypothetical protein EDEG_00127 [Edhazardia aedis USNM 41457]
Length = 823
Score = 37.0 bits (84), Expect = 8.5, Method: Composition-based stats.
Identities = 30/140 (21%), Positives = 72/140 (51%), Gaps = 11/140 (7%)
Query: 56 VGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVK 115
+ K +++ Q DLQD+++ ++E EE E L++ ++++ E L+K ++ + +
Sbjct: 223 IRKQREEIKKYQAKDLQDREKNFKDENEELIGEILKHTEEIRKQREELKKQQEEIRKQQ- 281
Query: 116 LTIMIEDPREVERRRLLGIE----DSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAE-E 170
E+ R+ + + L IE D N + ++A+ E++ + + K + + L + E
Sbjct: 282 -----EEIRKQQEKDLQRIEKNFKDENEKLKAEIAKHTEEIRKQEEEKKKYQAKDLQDRE 336
Query: 171 MVQWPNLEVEATKQKPSKSL 190
+++ E+E ++K + L
Sbjct: 337 KIKFLKNEIEKLEKKDIQGL 356
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.373
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,064,094,009
Number of Sequences: 23463169
Number of extensions: 168876449
Number of successful extensions: 1262354
Number of sequences better than 100.0: 622
Number of HSP's better than 100.0 without gapping: 75
Number of HSP's successfully gapped in prelim test: 547
Number of HSP's that attempted gapping in prelim test: 1255383
Number of HSP's gapped (non-prelim): 4540
length of query: 262
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 122
effective length of database: 9,074,351,707
effective search space: 1107070908254
effective search space used: 1107070908254
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 75 (33.5 bits)