BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031254
(163 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118482173|gb|ABK93016.1| unknown [Populus trichocarpa]
Length = 185
Score = 268 bits (686), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 126/157 (80%), Positives = 141/157 (89%)
Query: 5 KEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAII 64
K E VKEGVASIALLP+GSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL I+
Sbjct: 20 KVEVVKEGVASIALLPSGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLTIV 79
Query: 65 DYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAE 124
D+N KE V+VEC+GHDAARF+N++ AHGWEKD+ GMVE++ K KI VSFECETLKA+
Sbjct: 80 DFNSGKEQAVVVECKGHDAARFHNVDQAHGWEKDIVGMVEEKHGKKKIHVSFECETLKAD 139
Query: 125 KVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
K AEDHIKQ+MPKLAGLDAVVNIG+M ISGLD EAE+
Sbjct: 140 KAAEDHIKQFMPKLAGLDAVVNIGRMSISGLDFEAED 176
>gi|224141463|ref|XP_002324091.1| predicted protein [Populus trichocarpa]
gi|222867093|gb|EEF04224.1| predicted protein [Populus trichocarpa]
Length = 185
Score = 268 bits (686), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 126/157 (80%), Positives = 141/157 (89%)
Query: 5 KEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAII 64
K E VKEGVASIALLP+GSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL I+
Sbjct: 20 KVEVVKEGVASIALLPSGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLTIV 79
Query: 65 DYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAE 124
D+N KE V+VEC+GHDAARF+N++ AHGWEKD+ GMVE++ K KI VSFECETLKA+
Sbjct: 80 DFNSGKEQAVVVECKGHDAARFHNVDQAHGWEKDIVGMVEEKHGKKKIHVSFECETLKAD 139
Query: 125 KVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
K AEDHIKQ+MPKLAGLDAVVNIG+M ISGLD EAE+
Sbjct: 140 KAAEDHIKQFMPKLAGLDAVVNIGRMSISGLDFEAED 176
>gi|296082981|emb|CBI22282.3| unnamed protein product [Vitis vinifera]
Length = 188
Score = 264 bits (675), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 124/160 (77%), Positives = 141/160 (88%)
Query: 2 ERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL 61
E KE V+EGVASIALLP GS+SGHFIQLPH+IC+GLHGTELACERECSRGEDYRLIKL
Sbjct: 21 EERKEGVVREGVASIALLPCGSLSGHFIQLPHAICFGLHGTELACERECSRGEDYRLIKL 80
Query: 62 AIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETL 121
I D++ K+E VIVECRGHDAARF++I+ AHGWE DV GMVEQ+ KNKI VSFECETL
Sbjct: 81 TITDFSSKREQTVIVECRGHDAARFHSIDRAHGWEMDVVGMVEQKHGKNKIVVSFECETL 140
Query: 122 KAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
KA+K AEDHI+Q+MPKLAGLDAVVNIG+M I+GLD+EA E
Sbjct: 141 KADKAAEDHIRQFMPKLAGLDAVVNIGRMSITGLDLEAAE 180
>gi|225452946|ref|XP_002279000.1| PREDICTED: uncharacterized protein LOC100243877 [Vitis vinifera]
gi|147842425|emb|CAN69515.1| hypothetical protein VITISV_009952 [Vitis vinifera]
Length = 172
Score = 263 bits (673), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 124/160 (77%), Positives = 141/160 (88%)
Query: 2 ERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL 61
E KE V+EGVASIALLP GS+SGHFIQLPH+IC+GLHGTELACERECSRGEDYRLIKL
Sbjct: 5 EERKEGVVREGVASIALLPCGSLSGHFIQLPHAICFGLHGTELACERECSRGEDYRLIKL 64
Query: 62 AIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETL 121
I D++ K+E VIVECRGHDAARF++I+ AHGWE DV GMVEQ+ KNKI VSFECETL
Sbjct: 65 TITDFSSKREQTVIVECRGHDAARFHSIDRAHGWEMDVVGMVEQKHGKNKIVVSFECETL 124
Query: 122 KAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
KA+K AEDHI+Q+MPKLAGLDAVVNIG+M I+GLD+EA E
Sbjct: 125 KADKAAEDHIRQFMPKLAGLDAVVNIGRMSITGLDLEAAE 164
>gi|255570226|ref|XP_002526073.1| conserved hypothetical protein [Ricinus communis]
gi|223534570|gb|EEF36267.1| conserved hypothetical protein [Ricinus communis]
Length = 175
Score = 263 bits (671), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 125/154 (81%), Positives = 138/154 (89%)
Query: 8 EVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYN 67
EVKEG+ASIALLP GSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL IIDYN
Sbjct: 14 EVKEGMASIALLPCGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLTIIDYN 73
Query: 68 KKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVA 127
KE V+VECRGHDAARF+NI+HAHGWEKDV M E++ K KI VSF CETLKA+K A
Sbjct: 74 SGKEQAVVVECRGHDAARFHNIDHAHGWEKDVVAMAEEKHGKKKIFVSFVCETLKADKAA 133
Query: 128 EDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
E+HI+++MPKLAGLDAVVNIGKMRISGLD EA++
Sbjct: 134 EEHIRKFMPKLAGLDAVVNIGKMRISGLDFEADD 167
>gi|145339062|ref|NP_189574.2| uncharacterized protein [Arabidopsis thaliana]
gi|332644041|gb|AEE77562.1| uncharacterized protein [Arabidopsis thaliana]
Length = 178
Score = 248 bits (633), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 114/151 (75%), Positives = 134/151 (88%)
Query: 12 GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
GVASIALLP G+ISGHFIQ+P SICYGLHGTELACE ECSRGEDYRLIKL IIDYN+KKE
Sbjct: 24 GVASIALLPCGTISGHFIQMPSSICYGLHGTELACETECSRGEDYRLIKLTIIDYNRKKE 83
Query: 72 HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
V+VEC+GHDAAR N++ HAHGWE+DV G+VE++ K K++VSFECETLKA++ AEDHI
Sbjct: 84 QTVVVECKGHDAARINDVEHAHGWEEDVIGLVEEKHGKKKVSVSFECETLKADEAAEDHI 143
Query: 132 KQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
+Q+MPKLAGLDAV+NIG M+ISGLD A E+
Sbjct: 144 RQFMPKLAGLDAVINIGPMKISGLDFAAVEE 174
>gi|351726218|ref|NP_001236863.1| uncharacterized protein LOC100306056 [Glycine max]
gi|255627405|gb|ACU14047.1| unknown [Glycine max]
Length = 170
Score = 247 bits (630), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 114/148 (77%), Positives = 131/148 (88%)
Query: 14 ASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKEHD 73
ASIALLP GSISGHFIQLPHSICYGL GTELACERECSRGEDYRLIKL I D+N KKE
Sbjct: 18 ASIALLPCGSISGHFIQLPHSICYGLCGTELACERECSRGEDYRLIKLTITDFNTKKEQA 77
Query: 74 VIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQ 133
+VEC+GHDAARF++I+HAHGW+KD++GM+E + K +I+VSFECETLKA K AEDHI++
Sbjct: 78 TVVECKGHDAARFHSIDHAHGWDKDITGMIEPKDGKKRISVSFECETLKAGKAAEDHIRK 137
Query: 134 YMPKLAGLDAVVNIGKMRISGLDIEAEE 161
+MPKLAGLDAVVNIGKM ISGLD AE+
Sbjct: 138 FMPKLAGLDAVVNIGKMTISGLDFGAED 165
>gi|449464250|ref|XP_004149842.1| PREDICTED: uncharacterized protein LOC101206322 [Cucumis sativus]
Length = 158
Score = 246 bits (628), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 115/151 (76%), Positives = 130/151 (86%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
EGVASIALLP GS+SGHFIQLPHSICYGLHGTEL CERECSRGEDYRLIKL I +YN K+
Sbjct: 4 EGVASIALLPCGSVSGHFIQLPHSICYGLHGTELECERECSRGEDYRLIKLTITNYNNKQ 63
Query: 71 EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
E V+VECRGHDAARF++I HAHGWE+DV MVE++ KN+I VSFEC+ LK+EK AEDH
Sbjct: 64 ERTVVVECRGHDAARFHSIVHAHGWEEDVVSMVEKKHGKNEIMVSFECQMLKSEKAAEDH 123
Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
I+Q+MPKL G DAVVN+G M I GLD EAEE
Sbjct: 124 IRQFMPKLVGQDAVVNVGPMSIGGLDFEAEE 154
>gi|356573522|ref|XP_003554907.1| PREDICTED: uncharacterized protein LOC100795132 [Glycine max]
Length = 169
Score = 245 bits (625), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 116/151 (76%), Positives = 132/151 (87%), Gaps = 1/151 (0%)
Query: 12 GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
GVASIALLP GSISGHFIQLPH ICYGL GTELACERECSRGEDYRLIKL I D+N KKE
Sbjct: 20 GVASIALLPCGSISGHFIQLPHYICYGLCGTELACERECSRGEDYRLIKLTITDFNAKKE 79
Query: 72 HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
+VEC+GHDAA F++I+HAHGW+KD++GM+E + K +I+VSFECETLKAEK AEDHI
Sbjct: 80 QATVVECKGHDAAWFHSIDHAHGWDKDITGMIEPKDGKKRISVSFECETLKAEKAAEDHI 139
Query: 132 KQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
+++MPKLAGLDAVVNIGKM ISGLD EED
Sbjct: 140 RKFMPKLAGLDAVVNIGKMTISGLDF-GEED 169
>gi|297818568|ref|XP_002877167.1| hypothetical protein ARALYDRAFT_484699 [Arabidopsis lyrata subsp.
lyrata]
gi|297323005|gb|EFH53426.1| hypothetical protein ARALYDRAFT_484699 [Arabidopsis lyrata subsp.
lyrata]
Length = 177
Score = 244 bits (624), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 112/149 (75%), Positives = 131/149 (87%)
Query: 14 ASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKEHD 73
ASIALLP G+ISGHFIQ+P SICYGLHGTELACE ECSRGEDYRLIKL IIDYN+KKE
Sbjct: 25 ASIALLPCGTISGHFIQMPSSICYGLHGTELACETECSRGEDYRLIKLTIIDYNRKKEQT 84
Query: 74 VIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQ 133
V+VEC+GHDAAR N++ HAHGWE+DV G+VE++ K K++VSFECETLKA+K AEDHI+Q
Sbjct: 85 VVVECKGHDAARINDVEHAHGWEEDVIGLVEEKHGKKKVSVSFECETLKADKAAEDHIRQ 144
Query: 134 YMPKLAGLDAVVNIGKMRISGLDIEAEED 162
+MPKLAGLDAV+NIG M+ISGLD E+
Sbjct: 145 FMPKLAGLDAVINIGPMKISGLDFAVVEE 173
>gi|449526016|ref|XP_004170011.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101206322
[Cucumis sativus]
Length = 158
Score = 244 bits (623), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 114/151 (75%), Positives = 129/151 (85%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
EGVASIALLP GS+SGHFIQLPHSICYGLHGTEL CERECSRGEDYRLIKL I +YN K+
Sbjct: 4 EGVASIALLPCGSVSGHFIQLPHSICYGLHGTELECERECSRGEDYRLIKLTITNYNNKQ 63
Query: 71 EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
E V+VECRGHDAARF++I HAHGWE+DV MV ++ KN+I VSFEC+ LK+EK AEDH
Sbjct: 64 ERTVVVECRGHDAARFHSIVHAHGWEEDVVSMVXKKHGKNEIMVSFECQMLKSEKAAEDH 123
Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
I+Q+MPKL G DAVVN+G M I GLD EAEE
Sbjct: 124 IRQFMPKLVGQDAVVNVGPMSIGGLDFEAEE 154
>gi|414865904|tpg|DAA44461.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
Length = 253
Score = 233 bits (593), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 105/148 (70%), Positives = 125/148 (84%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
EGVASIALLP G+ISGHFI+LP SICYGLHGT ++CERECSRGEDYRLIKL IID+ K+
Sbjct: 97 EGVASIALLPTGAISGHFIRLPDSICYGLHGTPISCERECSRGEDYRLIKLTIIDFKSKR 156
Query: 71 EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
E ++VECRGHDAAR NI+H HGWE D+ G+VE+E K+ +SFECETLKA+K AEDH
Sbjct: 157 EKVLVVECRGHDAARLQNIDHLHGWEDDIVGLVEKEHGNQKVLLSFECETLKADKDAEDH 216
Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIE 158
I +YMP L GLDAVVNIGKM ISG++++
Sbjct: 217 IAKYMPNLCGLDAVVNIGKMSISGINLD 244
>gi|218192310|gb|EEC74737.1| hypothetical protein OsI_10475 [Oryza sativa Indica Group]
Length = 199
Score = 231 bits (590), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 105/157 (66%), Positives = 131/157 (83%)
Query: 6 EEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIID 65
E E EGVASIALLP+G+ISGHFI+LP S+CYGLHGT ++CERECSRGEDYRLIKL+IID
Sbjct: 38 ESETMEGVASIALLPSGAISGHFIRLPDSVCYGLHGTPISCERECSRGEDYRLIKLSIID 97
Query: 66 YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
+ K E V+VECRGHDAAR NI+H HGWE D+ G+VE++ K ++SFECETLKA+K
Sbjct: 98 FKNKHEKVVVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSLSFECETLKADK 157
Query: 126 VAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
AE+HI +YMP L GLDAVVNIGKM ISG++++ +++
Sbjct: 158 AAEEHISKYMPNLKGLDAVVNIGKMSISGINLDEDDE 194
>gi|223945361|gb|ACN26764.1| unknown [Zea mays]
gi|414865902|tpg|DAA44459.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
Length = 218
Score = 231 bits (589), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 105/148 (70%), Positives = 125/148 (84%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
EGVASIALLP G+ISGHFI+LP SICYGLHGT ++CERECSRGEDYRLIKL IID+ K+
Sbjct: 62 EGVASIALLPTGAISGHFIRLPDSICYGLHGTPISCERECSRGEDYRLIKLTIIDFKSKR 121
Query: 71 EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
E ++VECRGHDAAR NI+H HGWE D+ G+VE+E K+ +SFECETLKA+K AEDH
Sbjct: 122 EKVLVVECRGHDAARLQNIDHLHGWEDDIVGLVEKEHGNQKVLLSFECETLKADKDAEDH 181
Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIE 158
I +YMP L GLDAVVNIGKM ISG++++
Sbjct: 182 IAKYMPNLCGLDAVVNIGKMSISGINLD 209
>gi|108707258|gb|ABF95053.1| expressed protein [Oryza sativa Japonica Group]
gi|222624590|gb|EEE58722.1| hypothetical protein OsJ_10184 [Oryza sativa Japonica Group]
Length = 193
Score = 230 bits (586), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 101/162 (62%), Positives = 133/162 (82%)
Query: 1 MERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIK 60
+E E EGVASIALLP+G+ISGHFI+LP S+CYGLHGT ++CERECSRGEDYRLIK
Sbjct: 27 VEEADAGETMEGVASIALLPSGAISGHFIRLPDSVCYGLHGTPISCERECSRGEDYRLIK 86
Query: 61 LAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECET 120
L+IID+ K+E +VECRGHDAAR NI+H HGWE D+ G+VE++ K ++SFECET
Sbjct: 87 LSIIDFKNKREKVAVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSISFECET 146
Query: 121 LKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
LKA+K A++HI +YMP L+G+DA+VNIGKM ISG++++ +++
Sbjct: 147 LKADKAADEHISKYMPNLSGMDAIVNIGKMSISGINLDEDDE 188
>gi|125585350|gb|EAZ26014.1| hypothetical protein OsJ_09867 [Oryza sativa Japonica Group]
Length = 177
Score = 229 bits (585), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 104/157 (66%), Positives = 130/157 (82%)
Query: 6 EEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIID 65
E E EGVASIALLP+G+ISGHF +LP S+CYGLHGT ++CERECSRGEDYRLIKL+IID
Sbjct: 16 ESETMEGVASIALLPSGAISGHFTRLPDSVCYGLHGTPISCERECSRGEDYRLIKLSIID 75
Query: 66 YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
+ K E V+VECRGHDAAR NI+H HGWE D+ G+VE++ K ++SFECETLKA+K
Sbjct: 76 FKNKHEKVVVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSLSFECETLKADK 135
Query: 126 VAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
AE+HI +YMP L GLDAVVNIGKM ISG++++ +++
Sbjct: 136 AAEEHISKYMPNLKGLDAVVNIGKMSISGINLDEDDE 172
>gi|115452001|ref|NP_001049601.1| Os03g0257000 [Oryza sativa Japonica Group]
gi|113548072|dbj|BAF11515.1| Os03g0257000 [Oryza sativa Japonica Group]
gi|218192467|gb|EEC74894.1| hypothetical protein OsI_10809 [Oryza sativa Indica Group]
Length = 174
Score = 229 bits (584), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 101/162 (62%), Positives = 133/162 (82%)
Query: 1 MERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIK 60
+E E EGVASIALLP+G+ISGHFI+LP S+CYGLHGT ++CERECSRGEDYRLIK
Sbjct: 8 VEEADAGETMEGVASIALLPSGAISGHFIRLPDSVCYGLHGTPISCERECSRGEDYRLIK 67
Query: 61 LAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECET 120
L+IID+ K+E +VECRGHDAAR NI+H HGWE D+ G+VE++ K ++SFECET
Sbjct: 68 LSIIDFKNKREKVAVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSISFECET 127
Query: 121 LKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
LKA+K A++HI +YMP L+G+DA+VNIGKM ISG++++ +++
Sbjct: 128 LKADKAADEHISKYMPNLSGMDAIVNIGKMSISGINLDEDDE 169
>gi|357113045|ref|XP_003558315.1| PREDICTED: uncharacterized protein LOC100829129 [Brachypodium
distachyon]
Length = 194
Score = 223 bits (568), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 98/147 (66%), Positives = 124/147 (84%)
Query: 12 GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
GVASIALLP+G+ISGHFI+LP S+CYGLHGT + CERECSRG+DYRLIKL++I++ K E
Sbjct: 39 GVASIALLPSGAISGHFIRLPDSVCYGLHGTPIPCERECSRGDDYRLIKLSVINFKTKTE 98
Query: 72 HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
V+VECRGHDAAR N++H HGWE D+ G+V+++ K+ VSFECETLKA+K AE+HI
Sbjct: 99 KVVVVECRGHDAARLQNVDHLHGWEDDIVGLVQKKHGNKKVFVSFECETLKADKAAEEHI 158
Query: 132 KQYMPKLAGLDAVVNIGKMRISGLDIE 158
++YMP L GLDAVVN GKM ISG+++E
Sbjct: 159 RKYMPNLCGLDAVVNTGKMSISGINLE 185
>gi|115451473|ref|NP_001049337.1| Os03g0209700 [Oryza sativa Japonica Group]
gi|108706785|gb|ABF94580.1| hypothetical protein LOC_Os03g11130 [Oryza sativa Japonica Group]
gi|113547808|dbj|BAF11251.1| Os03g0209700 [Oryza sativa Japonica Group]
Length = 234
Score = 211 bits (537), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 95/139 (68%), Positives = 114/139 (82%)
Query: 6 EEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIID 65
E E EGVASIALLP+G+ISGHF +LP S+CYGLHGT ++CERECSRGEDYRLIKL+IID
Sbjct: 16 ESETMEGVASIALLPSGAISGHFTRLPDSVCYGLHGTPISCERECSRGEDYRLIKLSIID 75
Query: 66 YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
+ K E V+VECRGHDAAR NI+H HGWE D+ G+VE++ K ++SFECETLKA+K
Sbjct: 76 FKNKHEKVVVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSLSFECETLKADK 135
Query: 126 VAEDHIKQYMPKLAGLDAV 144
AE+HI +YMP L GLDAV
Sbjct: 136 AAEEHISKYMPNLKGLDAV 154
>gi|302794220|ref|XP_002978874.1| hypothetical protein SELMODRAFT_37399 [Selaginella moellendorffii]
gi|302813579|ref|XP_002988475.1| hypothetical protein SELMODRAFT_27215 [Selaginella moellendorffii]
gi|300143877|gb|EFJ10565.1| hypothetical protein SELMODRAFT_27215 [Selaginella moellendorffii]
gi|300153192|gb|EFJ19831.1| hypothetical protein SELMODRAFT_37399 [Selaginella moellendorffii]
Length = 152
Score = 194 bits (492), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 87/148 (58%), Positives = 112/148 (75%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
+G+ASIALLP+GSISGHF+ P S C GL+GTE+ C RECSR EDYRLI L ++DY KK
Sbjct: 4 QGIASIALLPSGSISGHFVHPPSSSCLGLYGTEIPCARECSRAEDYRLINLTLLDYRTKK 63
Query: 71 EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
E VIVECRG DA+RF+N++ HGWE+DV + N + +SFECETLKA++ AE H
Sbjct: 64 ETSVIVECRGQDASRFHNVDDVHGWEEDVIKAAKASDPNNHLQISFECETLKADEDAERH 123
Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIE 158
+++++P LA AVVNIG M+I GLD+E
Sbjct: 124 LQRFLPALANKGAVVNIGTMKIKGLDLE 151
>gi|11994768|dbj|BAB03124.1| unnamed protein product [Arabidopsis thaliana]
Length = 116
Score = 192 bits (487), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 86/114 (75%), Positives = 102/114 (89%)
Query: 31 LPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNIN 90
+P SICYGLHGTELACE ECSRGEDYRLIKL IIDYN+KKE V+VEC+GHDAAR N++
Sbjct: 1 MPSSICYGLHGTELACETECSRGEDYRLIKLTIIDYNRKKEQTVVVECKGHDAARINDVE 60
Query: 91 HAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQYMPKLAGLDAV 144
HAHGWE+DV G+VE++ K K++VSFECETLKA++ AEDHI+Q+MPKLAGLDAV
Sbjct: 61 HAHGWEEDVIGLVEEKHGKKKVSVSFECETLKADEAAEDHIRQFMPKLAGLDAV 114
>gi|357496671|ref|XP_003618624.1| HAT family dimerization domain containing protein [Medicago
truncatula]
gi|355493639|gb|AES74842.1| HAT family dimerization domain containing protein [Medicago
truncatula]
Length = 724
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 84/119 (70%), Positives = 102/119 (85%)
Query: 43 ELACERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGM 102
EL CERECS+GEDYRLIKL I D+N KKE +VEC+GHDAARF+ I+HAHGW+K+++GM
Sbjct: 601 ELECERECSKGEDYRLIKLTITDFNTKKEQATVVECKGHDAARFHCIDHAHGWDKEITGM 660
Query: 103 VEQEQEKNKIAVSFECETLKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
VEQ+ K +I VSFECETLKAE AEDHI+++MPKLAG+DAVVN+GKM ISGLD A+E
Sbjct: 661 VEQKDGKKRITVSFECETLKAEDAAEDHIRKFMPKLAGMDAVVNVGKMTISGLDFGADE 719
>gi|326530722|dbj|BAK01159.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 113
Score = 148 bits (374), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 68/111 (61%), Positives = 86/111 (77%)
Query: 44 LACERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMV 103
+ CERECSRG+DYRLIKL+II++ +K E V+VECRGHDAAR NI+H HGWE DV G+V
Sbjct: 2 IPCERECSRGDDYRLIKLSIINFKRKTEKVVVVECRGHDAARLQNIDHLHGWEDDVVGLV 61
Query: 104 EQEQEKNKIAVSFECETLKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISG 154
+++ K+ VSFECETLKA+K AE+HI +YMP L GLDAV G ++ G
Sbjct: 62 QKKHGNKKVLVSFECETLKADKAAEEHISKYMPNLCGLDAVGKHGILKFLG 112
>gi|168028007|ref|XP_001766520.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682165|gb|EDQ68585.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 154
Score = 148 bits (374), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 74/149 (49%), Positives = 96/149 (64%), Gaps = 12/149 (8%)
Query: 7 EEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDY 66
EEV EG+ASIAL P+GS+SGHF+ S C GL GTE+ACERECSRGEDYRL+ L II++
Sbjct: 6 EEVLEGIASIALFPSGSLSGHFLHTASSTCLGLFGTEIACERECSRGEDYRLLNLTIINF 65
Query: 67 NKKKEHDVIVECRGHDAARFNNINHAHG----------WEKDVSGMVEQEQEKNKIA--V 114
KKE V+VE +G DAAR + HG W+++V A +
Sbjct: 66 KTKKETTVVVERKGEDAARLSTTCDLHGSFDESLKLFRWDEEVVQEANNRHRSGGTAPVI 125
Query: 115 SFECETLKAEKVAEDHIKQYMPKLAGLDA 143
+FECETLKA + AE+H+K+++P L G DA
Sbjct: 126 TFECETLKAGREAEEHLKKFLPSLVGRDA 154
>gi|414865903|tpg|DAA44460.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
Length = 124
Score = 129 bits (323), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 58/90 (64%), Positives = 73/90 (81%)
Query: 69 KKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAE 128
K+E ++VECRGHDAAR NI+H HGWE D+ G+VE+E K+ +SFECETLKA+K AE
Sbjct: 26 KREKVLVVECRGHDAARLQNIDHLHGWEDDIVGLVEKEHGNQKVLLSFECETLKADKDAE 85
Query: 129 DHIKQYMPKLAGLDAVVNIGKMRISGLDIE 158
DHI +YMP L GLDAVVNIGKM ISG++++
Sbjct: 86 DHIAKYMPNLCGLDAVVNIGKMSISGINLD 115
>gi|384246351|gb|EIE19841.1| hypothetical protein COCSUDRAFT_58078 [Coccomyxa subellipsoidea
C-169]
Length = 317
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 73/143 (51%), Gaps = 3/143 (2%)
Query: 12 GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
G + LP+G +SGH I + G+ G E+ +REC+R +++L+ + +++ + E
Sbjct: 155 GTGYLIWLPDGRLSGHLIS--SGVTLGIMGREILSQRECTRDAEWKLMHVTLLNLISEAE 212
Query: 72 HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
+ E G D+AR HG E+ + G + + + + F CET+ A+ AE ++
Sbjct: 213 VGCLAEKTGEDSARLRAPFDLHGSEEHLFGETDLPND-TTLQLEFVCETILADAAAEKYL 271
Query: 132 KQYMPKLAGLDAVVNIGKMRISG 154
++ P L G A VN+G G
Sbjct: 272 ARFSPSLRGRGAFVNVGHFTCGG 294
>gi|307106456|gb|EFN54702.1| hypothetical protein CHLNCDRAFT_135354 [Chlorella variabilis]
Length = 409
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 74/161 (45%), Gaps = 19/161 (11%)
Query: 12 GVASIALLPNGSISGHFIQLPHSIC------YGLHGTELACERECSRGEDYRLIKLAIID 65
G A +ALLPNGS++GH + P + +G+ G REC+R D+ ++++ ++
Sbjct: 182 GTAYLALLPNGSVTGHLVCAPSAFAPARRRSFGVMGEVAVAGRECTRDADWGVLRMTLVS 241
Query: 66 Y--------NKKKEHDVIVECRGHDAARFNNINHA--HGWEKDVSGMVEQE---QEKNKI 112
+ E + E DA R + G E D ++ + +
Sbjct: 242 FAGGGGGGAAGGGEVSFLAERTEGDAMRLRRVAPVPWQGIELDDLPALDAACTCAAASGL 301
Query: 113 AVSFECETLKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRIS 153
++ F C+ L + A+ ++ P LAG A++N+G MR++
Sbjct: 302 SLRFHCQQLLSGAAADALLRSLQPSLAGRGAMLNVGLMRVA 342
>gi|212721684|ref|NP_001132488.1| uncharacterized protein LOC100193947 [Zea mays]
gi|194694520|gb|ACF81344.1| unknown [Zea mays]
gi|414865905|tpg|DAA44462.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
Length = 131
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 30/32 (93%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGT 42
EGVASIALLP G+ISGHFI+LP SICYGLHGT
Sbjct: 97 EGVASIALLPTGAISGHFIRLPDSICYGLHGT 128
>gi|414865901|tpg|DAA44458.1| TPA: hypothetical protein ZEAMMB73_493817, partial [Zea mays]
Length = 117
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/34 (82%), Positives = 31/34 (91%)
Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTEL 44
EGVASIALLP G+ISGHFI+LP SICYGLHGT +
Sbjct: 62 EGVASIALLPTGAISGHFIRLPDSICYGLHGTRI 95
>gi|357496617|ref|XP_003618597.1| HAT family dimerization domain containing protein [Medicago
truncatula]
gi|355493612|gb|AES74815.1| HAT family dimerization domain containing protein [Medicago
truncatula]
Length = 819
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 40/51 (78%), Gaps = 1/51 (1%)
Query: 96 EKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQ-YMPKLAGLDAVV 145
+K ++GMVEQ+ K + VS +CETLKAE A+DHI+Q ++PKLAG+D ++
Sbjct: 14 DKLITGMVEQKHGKKRTTVSSKCETLKAENAADDHIRQLFLPKLAGIDMLI 64
>gi|357496619|ref|XP_003618598.1| hypothetical protein MTR_6g013550 [Medicago truncatula]
gi|355493613|gb|AES74816.1| hypothetical protein MTR_6g013550 [Medicago truncatula]
Length = 751
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/31 (64%), Positives = 25/31 (80%), Gaps = 1/31 (3%)
Query: 115 SFECETLKAEKVAEDHIKQ-YMPKLAGLDAV 144
SF CETL AE A++HI+Q +MPKLAG+D V
Sbjct: 3 SFRCETLTAENAADNHIRQLFMPKLAGIDGV 33
>gi|440791071|gb|ELR12325.1| hypothetical protein ACA1_373920 [Acanthamoeba castellanii str.
Neff]
Length = 248
Score = 39.3 bits (90), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 18 LLPNGSIS---GHFIQLPHSICYGLHGTEL---ACERE--CSRGEDY-RLIKLAIIDYNK 68
L +G S GHFI L C+G+ TE A E + C +G+D +LAIID+
Sbjct: 21 LFQDGGTSLFQGHFIDLGSRQCFGVTATETEVPAAEAQCCCCQGDDAPTWYRLAIIDFKT 80
Query: 69 KKEHDVIVECRGHDAA 84
+E +V RG D A
Sbjct: 81 GREFTEVVSKRGGDYA 96
>gi|386817151|ref|ZP_10104369.1| permease YjgP/YjgQ family protein [Thiothrix nivea DSM 5205]
gi|386421727|gb|EIJ35562.1| permease YjgP/YjgQ family protein [Thiothrix nivea DSM 5205]
Length = 355
Score = 37.7 bits (86), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 66 YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
+++KK + V +A R ++I HA E+D SG E E+ I ++++ EK
Sbjct: 164 WSEKKLEGISVYTINPEAGRIDSITHAVSAERDTSGWTLHEVERKTIL----ADSIQQEK 219
Query: 126 VAEDHIKQYMP 136
+A++H + +P
Sbjct: 220 LAQEHTDKLLP 230
>gi|168066545|ref|XP_001785196.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663197|gb|EDQ49975.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 402
Score = 37.0 bits (84), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 26/55 (47%)
Query: 47 ERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSG 101
E CS ED+ + ++ + +H ++ R HD A N I+HAHG G
Sbjct: 10 ESRCSALEDFTALGQIPCPFHGEYKHAILASPRTHDRASINPISHAHGGRTTTRG 64
>gi|346318143|gb|EGX87747.1| acetyl-CoA carboxylase [Cordyceps militaris CM01]
Length = 2271
Score = 35.0 bits (79), Expect = 9.4, Method: Composition-based stats.
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 1/74 (1%)
Query: 54 EDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIA 113
E+Y L +A N+K+ R H+ + + WEKD + E EK + +
Sbjct: 2160 EEYMLRHMATSTLNQKESTATTNTTREHNLKLLQSWSRIEKWEKDDQAVAEW-YEKERRS 2218
Query: 114 VSFECETLKAEKVA 127
+ + ETLKAEK++
Sbjct: 2219 IGEKVETLKAEKLS 2232
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.134 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,530,595,519
Number of Sequences: 23463169
Number of extensions: 97306229
Number of successful extensions: 244316
Number of sequences better than 100.0: 37
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 244278
Number of HSP's gapped (non-prelim): 38
length of query: 163
length of database: 8,064,228,071
effective HSP length: 124
effective length of query: 39
effective length of database: 9,449,762,411
effective search space: 368540734029
effective search space used: 368540734029
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)