BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 031254
         (163 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|118482173|gb|ABK93016.1| unknown [Populus trichocarpa]
          Length = 185

 Score =  268 bits (686), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 126/157 (80%), Positives = 141/157 (89%)

Query: 5   KEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAII 64
           K E VKEGVASIALLP+GSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL I+
Sbjct: 20  KVEVVKEGVASIALLPSGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLTIV 79

Query: 65  DYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAE 124
           D+N  KE  V+VEC+GHDAARF+N++ AHGWEKD+ GMVE++  K KI VSFECETLKA+
Sbjct: 80  DFNSGKEQAVVVECKGHDAARFHNVDQAHGWEKDIVGMVEEKHGKKKIHVSFECETLKAD 139

Query: 125 KVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           K AEDHIKQ+MPKLAGLDAVVNIG+M ISGLD EAE+
Sbjct: 140 KAAEDHIKQFMPKLAGLDAVVNIGRMSISGLDFEAED 176


>gi|224141463|ref|XP_002324091.1| predicted protein [Populus trichocarpa]
 gi|222867093|gb|EEF04224.1| predicted protein [Populus trichocarpa]
          Length = 185

 Score =  268 bits (686), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 126/157 (80%), Positives = 141/157 (89%)

Query: 5   KEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAII 64
           K E VKEGVASIALLP+GSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL I+
Sbjct: 20  KVEVVKEGVASIALLPSGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLTIV 79

Query: 65  DYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAE 124
           D+N  KE  V+VEC+GHDAARF+N++ AHGWEKD+ GMVE++  K KI VSFECETLKA+
Sbjct: 80  DFNSGKEQAVVVECKGHDAARFHNVDQAHGWEKDIVGMVEEKHGKKKIHVSFECETLKAD 139

Query: 125 KVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           K AEDHIKQ+MPKLAGLDAVVNIG+M ISGLD EAE+
Sbjct: 140 KAAEDHIKQFMPKLAGLDAVVNIGRMSISGLDFEAED 176


>gi|296082981|emb|CBI22282.3| unnamed protein product [Vitis vinifera]
          Length = 188

 Score =  264 bits (675), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 124/160 (77%), Positives = 141/160 (88%)

Query: 2   ERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL 61
           E  KE  V+EGVASIALLP GS+SGHFIQLPH+IC+GLHGTELACERECSRGEDYRLIKL
Sbjct: 21  EERKEGVVREGVASIALLPCGSLSGHFIQLPHAICFGLHGTELACERECSRGEDYRLIKL 80

Query: 62  AIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETL 121
            I D++ K+E  VIVECRGHDAARF++I+ AHGWE DV GMVEQ+  KNKI VSFECETL
Sbjct: 81  TITDFSSKREQTVIVECRGHDAARFHSIDRAHGWEMDVVGMVEQKHGKNKIVVSFECETL 140

Query: 122 KAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           KA+K AEDHI+Q+MPKLAGLDAVVNIG+M I+GLD+EA E
Sbjct: 141 KADKAAEDHIRQFMPKLAGLDAVVNIGRMSITGLDLEAAE 180


>gi|225452946|ref|XP_002279000.1| PREDICTED: uncharacterized protein LOC100243877 [Vitis vinifera]
 gi|147842425|emb|CAN69515.1| hypothetical protein VITISV_009952 [Vitis vinifera]
          Length = 172

 Score =  263 bits (673), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 124/160 (77%), Positives = 141/160 (88%)

Query: 2   ERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL 61
           E  KE  V+EGVASIALLP GS+SGHFIQLPH+IC+GLHGTELACERECSRGEDYRLIKL
Sbjct: 5   EERKEGVVREGVASIALLPCGSLSGHFIQLPHAICFGLHGTELACERECSRGEDYRLIKL 64

Query: 62  AIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETL 121
            I D++ K+E  VIVECRGHDAARF++I+ AHGWE DV GMVEQ+  KNKI VSFECETL
Sbjct: 65  TITDFSSKREQTVIVECRGHDAARFHSIDRAHGWEMDVVGMVEQKHGKNKIVVSFECETL 124

Query: 122 KAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           KA+K AEDHI+Q+MPKLAGLDAVVNIG+M I+GLD+EA E
Sbjct: 125 KADKAAEDHIRQFMPKLAGLDAVVNIGRMSITGLDLEAAE 164


>gi|255570226|ref|XP_002526073.1| conserved hypothetical protein [Ricinus communis]
 gi|223534570|gb|EEF36267.1| conserved hypothetical protein [Ricinus communis]
          Length = 175

 Score =  263 bits (671), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 125/154 (81%), Positives = 138/154 (89%)

Query: 8   EVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYN 67
           EVKEG+ASIALLP GSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKL IIDYN
Sbjct: 14  EVKEGMASIALLPCGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLTIIDYN 73

Query: 68  KKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVA 127
             KE  V+VECRGHDAARF+NI+HAHGWEKDV  M E++  K KI VSF CETLKA+K A
Sbjct: 74  SGKEQAVVVECRGHDAARFHNIDHAHGWEKDVVAMAEEKHGKKKIFVSFVCETLKADKAA 133

Query: 128 EDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           E+HI+++MPKLAGLDAVVNIGKMRISGLD EA++
Sbjct: 134 EEHIRKFMPKLAGLDAVVNIGKMRISGLDFEADD 167


>gi|145339062|ref|NP_189574.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332644041|gb|AEE77562.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 178

 Score =  248 bits (633), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 114/151 (75%), Positives = 134/151 (88%)

Query: 12  GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
           GVASIALLP G+ISGHFIQ+P SICYGLHGTELACE ECSRGEDYRLIKL IIDYN+KKE
Sbjct: 24  GVASIALLPCGTISGHFIQMPSSICYGLHGTELACETECSRGEDYRLIKLTIIDYNRKKE 83

Query: 72  HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
             V+VEC+GHDAAR N++ HAHGWE+DV G+VE++  K K++VSFECETLKA++ AEDHI
Sbjct: 84  QTVVVECKGHDAARINDVEHAHGWEEDVIGLVEEKHGKKKVSVSFECETLKADEAAEDHI 143

Query: 132 KQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
           +Q+MPKLAGLDAV+NIG M+ISGLD  A E+
Sbjct: 144 RQFMPKLAGLDAVINIGPMKISGLDFAAVEE 174


>gi|351726218|ref|NP_001236863.1| uncharacterized protein LOC100306056 [Glycine max]
 gi|255627405|gb|ACU14047.1| unknown [Glycine max]
          Length = 170

 Score =  247 bits (630), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 114/148 (77%), Positives = 131/148 (88%)

Query: 14  ASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKEHD 73
           ASIALLP GSISGHFIQLPHSICYGL GTELACERECSRGEDYRLIKL I D+N KKE  
Sbjct: 18  ASIALLPCGSISGHFIQLPHSICYGLCGTELACERECSRGEDYRLIKLTITDFNTKKEQA 77

Query: 74  VIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQ 133
            +VEC+GHDAARF++I+HAHGW+KD++GM+E +  K +I+VSFECETLKA K AEDHI++
Sbjct: 78  TVVECKGHDAARFHSIDHAHGWDKDITGMIEPKDGKKRISVSFECETLKAGKAAEDHIRK 137

Query: 134 YMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           +MPKLAGLDAVVNIGKM ISGLD  AE+
Sbjct: 138 FMPKLAGLDAVVNIGKMTISGLDFGAED 165


>gi|449464250|ref|XP_004149842.1| PREDICTED: uncharacterized protein LOC101206322 [Cucumis sativus]
          Length = 158

 Score =  246 bits (628), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 115/151 (76%), Positives = 130/151 (86%)

Query: 11  EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
           EGVASIALLP GS+SGHFIQLPHSICYGLHGTEL CERECSRGEDYRLIKL I +YN K+
Sbjct: 4   EGVASIALLPCGSVSGHFIQLPHSICYGLHGTELECERECSRGEDYRLIKLTITNYNNKQ 63

Query: 71  EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
           E  V+VECRGHDAARF++I HAHGWE+DV  MVE++  KN+I VSFEC+ LK+EK AEDH
Sbjct: 64  ERTVVVECRGHDAARFHSIVHAHGWEEDVVSMVEKKHGKNEIMVSFECQMLKSEKAAEDH 123

Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           I+Q+MPKL G DAVVN+G M I GLD EAEE
Sbjct: 124 IRQFMPKLVGQDAVVNVGPMSIGGLDFEAEE 154


>gi|356573522|ref|XP_003554907.1| PREDICTED: uncharacterized protein LOC100795132 [Glycine max]
          Length = 169

 Score =  245 bits (625), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 116/151 (76%), Positives = 132/151 (87%), Gaps = 1/151 (0%)

Query: 12  GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
           GVASIALLP GSISGHFIQLPH ICYGL GTELACERECSRGEDYRLIKL I D+N KKE
Sbjct: 20  GVASIALLPCGSISGHFIQLPHYICYGLCGTELACERECSRGEDYRLIKLTITDFNAKKE 79

Query: 72  HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
              +VEC+GHDAA F++I+HAHGW+KD++GM+E +  K +I+VSFECETLKAEK AEDHI
Sbjct: 80  QATVVECKGHDAAWFHSIDHAHGWDKDITGMIEPKDGKKRISVSFECETLKAEKAAEDHI 139

Query: 132 KQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
           +++MPKLAGLDAVVNIGKM ISGLD   EED
Sbjct: 140 RKFMPKLAGLDAVVNIGKMTISGLDF-GEED 169


>gi|297818568|ref|XP_002877167.1| hypothetical protein ARALYDRAFT_484699 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323005|gb|EFH53426.1| hypothetical protein ARALYDRAFT_484699 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 177

 Score =  244 bits (624), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 112/149 (75%), Positives = 131/149 (87%)

Query: 14  ASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKEHD 73
           ASIALLP G+ISGHFIQ+P SICYGLHGTELACE ECSRGEDYRLIKL IIDYN+KKE  
Sbjct: 25  ASIALLPCGTISGHFIQMPSSICYGLHGTELACETECSRGEDYRLIKLTIIDYNRKKEQT 84

Query: 74  VIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQ 133
           V+VEC+GHDAAR N++ HAHGWE+DV G+VE++  K K++VSFECETLKA+K AEDHI+Q
Sbjct: 85  VVVECKGHDAARINDVEHAHGWEEDVIGLVEEKHGKKKVSVSFECETLKADKAAEDHIRQ 144

Query: 134 YMPKLAGLDAVVNIGKMRISGLDIEAEED 162
           +MPKLAGLDAV+NIG M+ISGLD    E+
Sbjct: 145 FMPKLAGLDAVINIGPMKISGLDFAVVEE 173


>gi|449526016|ref|XP_004170011.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101206322
           [Cucumis sativus]
          Length = 158

 Score =  244 bits (623), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 114/151 (75%), Positives = 129/151 (85%)

Query: 11  EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
           EGVASIALLP GS+SGHFIQLPHSICYGLHGTEL CERECSRGEDYRLIKL I +YN K+
Sbjct: 4   EGVASIALLPCGSVSGHFIQLPHSICYGLHGTELECERECSRGEDYRLIKLTITNYNNKQ 63

Query: 71  EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
           E  V+VECRGHDAARF++I HAHGWE+DV  MV ++  KN+I VSFEC+ LK+EK AEDH
Sbjct: 64  ERTVVVECRGHDAARFHSIVHAHGWEEDVVSMVXKKHGKNEIMVSFECQMLKSEKAAEDH 123

Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           I+Q+MPKL G DAVVN+G M I GLD EAEE
Sbjct: 124 IRQFMPKLVGQDAVVNVGPMSIGGLDFEAEE 154


>gi|414865904|tpg|DAA44461.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
          Length = 253

 Score =  233 bits (593), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 105/148 (70%), Positives = 125/148 (84%)

Query: 11  EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
           EGVASIALLP G+ISGHFI+LP SICYGLHGT ++CERECSRGEDYRLIKL IID+  K+
Sbjct: 97  EGVASIALLPTGAISGHFIRLPDSICYGLHGTPISCERECSRGEDYRLIKLTIIDFKSKR 156

Query: 71  EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
           E  ++VECRGHDAAR  NI+H HGWE D+ G+VE+E    K+ +SFECETLKA+K AEDH
Sbjct: 157 EKVLVVECRGHDAARLQNIDHLHGWEDDIVGLVEKEHGNQKVLLSFECETLKADKDAEDH 216

Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIE 158
           I +YMP L GLDAVVNIGKM ISG++++
Sbjct: 217 IAKYMPNLCGLDAVVNIGKMSISGINLD 244


>gi|218192310|gb|EEC74737.1| hypothetical protein OsI_10475 [Oryza sativa Indica Group]
          Length = 199

 Score =  231 bits (590), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 105/157 (66%), Positives = 131/157 (83%)

Query: 6   EEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIID 65
           E E  EGVASIALLP+G+ISGHFI+LP S+CYGLHGT ++CERECSRGEDYRLIKL+IID
Sbjct: 38  ESETMEGVASIALLPSGAISGHFIRLPDSVCYGLHGTPISCERECSRGEDYRLIKLSIID 97

Query: 66  YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
           +  K E  V+VECRGHDAAR  NI+H HGWE D+ G+VE++    K ++SFECETLKA+K
Sbjct: 98  FKNKHEKVVVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSLSFECETLKADK 157

Query: 126 VAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
            AE+HI +YMP L GLDAVVNIGKM ISG++++ +++
Sbjct: 158 AAEEHISKYMPNLKGLDAVVNIGKMSISGINLDEDDE 194


>gi|223945361|gb|ACN26764.1| unknown [Zea mays]
 gi|414865902|tpg|DAA44459.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
          Length = 218

 Score =  231 bits (589), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 105/148 (70%), Positives = 125/148 (84%)

Query: 11  EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
           EGVASIALLP G+ISGHFI+LP SICYGLHGT ++CERECSRGEDYRLIKL IID+  K+
Sbjct: 62  EGVASIALLPTGAISGHFIRLPDSICYGLHGTPISCERECSRGEDYRLIKLTIIDFKSKR 121

Query: 71  EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
           E  ++VECRGHDAAR  NI+H HGWE D+ G+VE+E    K+ +SFECETLKA+K AEDH
Sbjct: 122 EKVLVVECRGHDAARLQNIDHLHGWEDDIVGLVEKEHGNQKVLLSFECETLKADKDAEDH 181

Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIE 158
           I +YMP L GLDAVVNIGKM ISG++++
Sbjct: 182 IAKYMPNLCGLDAVVNIGKMSISGINLD 209


>gi|108707258|gb|ABF95053.1| expressed protein [Oryza sativa Japonica Group]
 gi|222624590|gb|EEE58722.1| hypothetical protein OsJ_10184 [Oryza sativa Japonica Group]
          Length = 193

 Score =  230 bits (586), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 101/162 (62%), Positives = 133/162 (82%)

Query: 1   MERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIK 60
           +E     E  EGVASIALLP+G+ISGHFI+LP S+CYGLHGT ++CERECSRGEDYRLIK
Sbjct: 27  VEEADAGETMEGVASIALLPSGAISGHFIRLPDSVCYGLHGTPISCERECSRGEDYRLIK 86

Query: 61  LAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECET 120
           L+IID+  K+E   +VECRGHDAAR  NI+H HGWE D+ G+VE++    K ++SFECET
Sbjct: 87  LSIIDFKNKREKVAVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSISFECET 146

Query: 121 LKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
           LKA+K A++HI +YMP L+G+DA+VNIGKM ISG++++ +++
Sbjct: 147 LKADKAADEHISKYMPNLSGMDAIVNIGKMSISGINLDEDDE 188


>gi|125585350|gb|EAZ26014.1| hypothetical protein OsJ_09867 [Oryza sativa Japonica Group]
          Length = 177

 Score =  229 bits (585), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 104/157 (66%), Positives = 130/157 (82%)

Query: 6   EEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIID 65
           E E  EGVASIALLP+G+ISGHF +LP S+CYGLHGT ++CERECSRGEDYRLIKL+IID
Sbjct: 16  ESETMEGVASIALLPSGAISGHFTRLPDSVCYGLHGTPISCERECSRGEDYRLIKLSIID 75

Query: 66  YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
           +  K E  V+VECRGHDAAR  NI+H HGWE D+ G+VE++    K ++SFECETLKA+K
Sbjct: 76  FKNKHEKVVVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSLSFECETLKADK 135

Query: 126 VAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
            AE+HI +YMP L GLDAVVNIGKM ISG++++ +++
Sbjct: 136 AAEEHISKYMPNLKGLDAVVNIGKMSISGINLDEDDE 172


>gi|115452001|ref|NP_001049601.1| Os03g0257000 [Oryza sativa Japonica Group]
 gi|113548072|dbj|BAF11515.1| Os03g0257000 [Oryza sativa Japonica Group]
 gi|218192467|gb|EEC74894.1| hypothetical protein OsI_10809 [Oryza sativa Indica Group]
          Length = 174

 Score =  229 bits (584), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 101/162 (62%), Positives = 133/162 (82%)

Query: 1   MERVKEEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIK 60
           +E     E  EGVASIALLP+G+ISGHFI+LP S+CYGLHGT ++CERECSRGEDYRLIK
Sbjct: 8   VEEADAGETMEGVASIALLPSGAISGHFIRLPDSVCYGLHGTPISCERECSRGEDYRLIK 67

Query: 61  LAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECET 120
           L+IID+  K+E   +VECRGHDAAR  NI+H HGWE D+ G+VE++    K ++SFECET
Sbjct: 68  LSIIDFKNKREKVAVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSISFECET 127

Query: 121 LKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEED 162
           LKA+K A++HI +YMP L+G+DA+VNIGKM ISG++++ +++
Sbjct: 128 LKADKAADEHISKYMPNLSGMDAIVNIGKMSISGINLDEDDE 169


>gi|357113045|ref|XP_003558315.1| PREDICTED: uncharacterized protein LOC100829129 [Brachypodium
           distachyon]
          Length = 194

 Score =  223 bits (568), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 98/147 (66%), Positives = 124/147 (84%)

Query: 12  GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
           GVASIALLP+G+ISGHFI+LP S+CYGLHGT + CERECSRG+DYRLIKL++I++  K E
Sbjct: 39  GVASIALLPSGAISGHFIRLPDSVCYGLHGTPIPCERECSRGDDYRLIKLSVINFKTKTE 98

Query: 72  HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
             V+VECRGHDAAR  N++H HGWE D+ G+V+++    K+ VSFECETLKA+K AE+HI
Sbjct: 99  KVVVVECRGHDAARLQNVDHLHGWEDDIVGLVQKKHGNKKVFVSFECETLKADKAAEEHI 158

Query: 132 KQYMPKLAGLDAVVNIGKMRISGLDIE 158
           ++YMP L GLDAVVN GKM ISG+++E
Sbjct: 159 RKYMPNLCGLDAVVNTGKMSISGINLE 185


>gi|115451473|ref|NP_001049337.1| Os03g0209700 [Oryza sativa Japonica Group]
 gi|108706785|gb|ABF94580.1| hypothetical protein LOC_Os03g11130 [Oryza sativa Japonica Group]
 gi|113547808|dbj|BAF11251.1| Os03g0209700 [Oryza sativa Japonica Group]
          Length = 234

 Score =  211 bits (537), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 95/139 (68%), Positives = 114/139 (82%)

Query: 6   EEEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIID 65
           E E  EGVASIALLP+G+ISGHF +LP S+CYGLHGT ++CERECSRGEDYRLIKL+IID
Sbjct: 16  ESETMEGVASIALLPSGAISGHFTRLPDSVCYGLHGTPISCERECSRGEDYRLIKLSIID 75

Query: 66  YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
           +  K E  V+VECRGHDAAR  NI+H HGWE D+ G+VE++    K ++SFECETLKA+K
Sbjct: 76  FKNKHEKVVVVECRGHDAARLQNIDHLHGWEDDIVGLVEKKHGNRKFSLSFECETLKADK 135

Query: 126 VAEDHIKQYMPKLAGLDAV 144
            AE+HI +YMP L GLDAV
Sbjct: 136 AAEEHISKYMPNLKGLDAV 154


>gi|302794220|ref|XP_002978874.1| hypothetical protein SELMODRAFT_37399 [Selaginella moellendorffii]
 gi|302813579|ref|XP_002988475.1| hypothetical protein SELMODRAFT_27215 [Selaginella moellendorffii]
 gi|300143877|gb|EFJ10565.1| hypothetical protein SELMODRAFT_27215 [Selaginella moellendorffii]
 gi|300153192|gb|EFJ19831.1| hypothetical protein SELMODRAFT_37399 [Selaginella moellendorffii]
          Length = 152

 Score =  194 bits (492), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 87/148 (58%), Positives = 112/148 (75%)

Query: 11  EGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKK 70
           +G+ASIALLP+GSISGHF+  P S C GL+GTE+ C RECSR EDYRLI L ++DY  KK
Sbjct: 4   QGIASIALLPSGSISGHFVHPPSSSCLGLYGTEIPCARECSRAEDYRLINLTLLDYRTKK 63

Query: 71  EHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDH 130
           E  VIVECRG DA+RF+N++  HGWE+DV    +     N + +SFECETLKA++ AE H
Sbjct: 64  ETSVIVECRGQDASRFHNVDDVHGWEEDVIKAAKASDPNNHLQISFECETLKADEDAERH 123

Query: 131 IKQYMPKLAGLDAVVNIGKMRISGLDIE 158
           +++++P LA   AVVNIG M+I GLD+E
Sbjct: 124 LQRFLPALANKGAVVNIGTMKIKGLDLE 151


>gi|11994768|dbj|BAB03124.1| unnamed protein product [Arabidopsis thaliana]
          Length = 116

 Score =  192 bits (487), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 86/114 (75%), Positives = 102/114 (89%)

Query: 31  LPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNIN 90
           +P SICYGLHGTELACE ECSRGEDYRLIKL IIDYN+KKE  V+VEC+GHDAAR N++ 
Sbjct: 1   MPSSICYGLHGTELACETECSRGEDYRLIKLTIIDYNRKKEQTVVVECKGHDAARINDVE 60

Query: 91  HAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQYMPKLAGLDAV 144
           HAHGWE+DV G+VE++  K K++VSFECETLKA++ AEDHI+Q+MPKLAGLDAV
Sbjct: 61  HAHGWEEDVIGLVEEKHGKKKVSVSFECETLKADEAAEDHIRQFMPKLAGLDAV 114


>gi|357496671|ref|XP_003618624.1| HAT family dimerization domain containing protein [Medicago
           truncatula]
 gi|355493639|gb|AES74842.1| HAT family dimerization domain containing protein [Medicago
           truncatula]
          Length = 724

 Score =  186 bits (472), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 84/119 (70%), Positives = 102/119 (85%)

Query: 43  ELACERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGM 102
           EL CERECS+GEDYRLIKL I D+N KKE   +VEC+GHDAARF+ I+HAHGW+K+++GM
Sbjct: 601 ELECERECSKGEDYRLIKLTITDFNTKKEQATVVECKGHDAARFHCIDHAHGWDKEITGM 660

Query: 103 VEQEQEKNKIAVSFECETLKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISGLDIEAEE 161
           VEQ+  K +I VSFECETLKAE  AEDHI+++MPKLAG+DAVVN+GKM ISGLD  A+E
Sbjct: 661 VEQKDGKKRITVSFECETLKAEDAAEDHIRKFMPKLAGMDAVVNVGKMTISGLDFGADE 719


>gi|326530722|dbj|BAK01159.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 113

 Score =  148 bits (374), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 68/111 (61%), Positives = 86/111 (77%)

Query: 44  LACERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMV 103
           + CERECSRG+DYRLIKL+II++ +K E  V+VECRGHDAAR  NI+H HGWE DV G+V
Sbjct: 2   IPCERECSRGDDYRLIKLSIINFKRKTEKVVVVECRGHDAARLQNIDHLHGWEDDVVGLV 61

Query: 104 EQEQEKNKIAVSFECETLKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRISG 154
           +++    K+ VSFECETLKA+K AE+HI +YMP L GLDAV   G ++  G
Sbjct: 62  QKKHGNKKVLVSFECETLKADKAAEEHISKYMPNLCGLDAVGKHGILKFLG 112


>gi|168028007|ref|XP_001766520.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682165|gb|EDQ68585.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 154

 Score =  148 bits (374), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 74/149 (49%), Positives = 96/149 (64%), Gaps = 12/149 (8%)

Query: 7   EEVKEGVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDY 66
           EEV EG+ASIAL P+GS+SGHF+    S C GL GTE+ACERECSRGEDYRL+ L II++
Sbjct: 6   EEVLEGIASIALFPSGSLSGHFLHTASSTCLGLFGTEIACERECSRGEDYRLLNLTIINF 65

Query: 67  NKKKEHDVIVECRGHDAARFNNINHAHG----------WEKDVSGMVEQEQEKNKIA--V 114
             KKE  V+VE +G DAAR +     HG          W+++V             A  +
Sbjct: 66  KTKKETTVVVERKGEDAARLSTTCDLHGSFDESLKLFRWDEEVVQEANNRHRSGGTAPVI 125

Query: 115 SFECETLKAEKVAEDHIKQYMPKLAGLDA 143
           +FECETLKA + AE+H+K+++P L G DA
Sbjct: 126 TFECETLKAGREAEEHLKKFLPSLVGRDA 154


>gi|414865903|tpg|DAA44460.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
          Length = 124

 Score =  129 bits (323), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 58/90 (64%), Positives = 73/90 (81%)

Query: 69  KKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAE 128
           K+E  ++VECRGHDAAR  NI+H HGWE D+ G+VE+E    K+ +SFECETLKA+K AE
Sbjct: 26  KREKVLVVECRGHDAARLQNIDHLHGWEDDIVGLVEKEHGNQKVLLSFECETLKADKDAE 85

Query: 129 DHIKQYMPKLAGLDAVVNIGKMRISGLDIE 158
           DHI +YMP L GLDAVVNIGKM ISG++++
Sbjct: 86  DHIAKYMPNLCGLDAVVNIGKMSISGINLD 115


>gi|384246351|gb|EIE19841.1| hypothetical protein COCSUDRAFT_58078 [Coccomyxa subellipsoidea
           C-169]
          Length = 317

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 73/143 (51%), Gaps = 3/143 (2%)

Query: 12  GVASIALLPNGSISGHFIQLPHSICYGLHGTELACERECSRGEDYRLIKLAIIDYNKKKE 71
           G   +  LP+G +SGH I     +  G+ G E+  +REC+R  +++L+ + +++   + E
Sbjct: 155 GTGYLIWLPDGRLSGHLIS--SGVTLGIMGREILSQRECTRDAEWKLMHVTLLNLISEAE 212

Query: 72  HDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHI 131
              + E  G D+AR       HG E+ + G  +   +   + + F CET+ A+  AE ++
Sbjct: 213 VGCLAEKTGEDSARLRAPFDLHGSEEHLFGETDLPND-TTLQLEFVCETILADAAAEKYL 271

Query: 132 KQYMPKLAGLDAVVNIGKMRISG 154
            ++ P L G  A VN+G     G
Sbjct: 272 ARFSPSLRGRGAFVNVGHFTCGG 294


>gi|307106456|gb|EFN54702.1| hypothetical protein CHLNCDRAFT_135354 [Chlorella variabilis]
          Length = 409

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 74/161 (45%), Gaps = 19/161 (11%)

Query: 12  GVASIALLPNGSISGHFIQLPHSIC------YGLHGTELACERECSRGEDYRLIKLAIID 65
           G A +ALLPNGS++GH +  P +        +G+ G      REC+R  D+ ++++ ++ 
Sbjct: 182 GTAYLALLPNGSVTGHLVCAPSAFAPARRRSFGVMGEVAVAGRECTRDADWGVLRMTLVS 241

Query: 66  Y--------NKKKEHDVIVECRGHDAARFNNINHA--HGWEKDVSGMVEQE---QEKNKI 112
           +            E   + E    DA R   +      G E D    ++        + +
Sbjct: 242 FAGGGGGGAAGGGEVSFLAERTEGDAMRLRRVAPVPWQGIELDDLPALDAACTCAAASGL 301

Query: 113 AVSFECETLKAEKVAEDHIKQYMPKLAGLDAVVNIGKMRIS 153
           ++ F C+ L +   A+  ++   P LAG  A++N+G MR++
Sbjct: 302 SLRFHCQQLLSGAAADALLRSLQPSLAGRGAMLNVGLMRVA 342


>gi|212721684|ref|NP_001132488.1| uncharacterized protein LOC100193947 [Zea mays]
 gi|194694520|gb|ACF81344.1| unknown [Zea mays]
 gi|414865905|tpg|DAA44462.1| TPA: hypothetical protein ZEAMMB73_493817 [Zea mays]
          Length = 131

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 30/32 (93%)

Query: 11  EGVASIALLPNGSISGHFIQLPHSICYGLHGT 42
           EGVASIALLP G+ISGHFI+LP SICYGLHGT
Sbjct: 97  EGVASIALLPTGAISGHFIRLPDSICYGLHGT 128


>gi|414865901|tpg|DAA44458.1| TPA: hypothetical protein ZEAMMB73_493817, partial [Zea mays]
          Length = 117

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/34 (82%), Positives = 31/34 (91%)

Query: 11 EGVASIALLPNGSISGHFIQLPHSICYGLHGTEL 44
          EGVASIALLP G+ISGHFI+LP SICYGLHGT +
Sbjct: 62 EGVASIALLPTGAISGHFIRLPDSICYGLHGTRI 95


>gi|357496617|ref|XP_003618597.1| HAT family dimerization domain containing protein [Medicago
           truncatula]
 gi|355493612|gb|AES74815.1| HAT family dimerization domain containing protein [Medicago
           truncatula]
          Length = 819

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 27/51 (52%), Positives = 40/51 (78%), Gaps = 1/51 (1%)

Query: 96  EKDVSGMVEQEQEKNKIAVSFECETLKAEKVAEDHIKQ-YMPKLAGLDAVV 145
           +K ++GMVEQ+  K +  VS +CETLKAE  A+DHI+Q ++PKLAG+D ++
Sbjct: 14  DKLITGMVEQKHGKKRTTVSSKCETLKAENAADDHIRQLFLPKLAGIDMLI 64


>gi|357496619|ref|XP_003618598.1| hypothetical protein MTR_6g013550 [Medicago truncatula]
 gi|355493613|gb|AES74816.1| hypothetical protein MTR_6g013550 [Medicago truncatula]
          Length = 751

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/31 (64%), Positives = 25/31 (80%), Gaps = 1/31 (3%)

Query: 115 SFECETLKAEKVAEDHIKQ-YMPKLAGLDAV 144
           SF CETL AE  A++HI+Q +MPKLAG+D V
Sbjct: 3   SFRCETLTAENAADNHIRQLFMPKLAGIDGV 33


>gi|440791071|gb|ELR12325.1| hypothetical protein ACA1_373920 [Acanthamoeba castellanii str.
          Neff]
          Length = 248

 Score = 39.3 bits (90), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 9/76 (11%)

Query: 18 LLPNGSIS---GHFIQLPHSICYGLHGTEL---ACERE--CSRGEDY-RLIKLAIIDYNK 68
          L  +G  S   GHFI L    C+G+  TE    A E +  C +G+D     +LAIID+  
Sbjct: 21 LFQDGGTSLFQGHFIDLGSRQCFGVTATETEVPAAEAQCCCCQGDDAPTWYRLAIIDFKT 80

Query: 69 KKEHDVIVECRGHDAA 84
           +E   +V  RG D A
Sbjct: 81 GREFTEVVSKRGGDYA 96


>gi|386817151|ref|ZP_10104369.1| permease YjgP/YjgQ family protein [Thiothrix nivea DSM 5205]
 gi|386421727|gb|EIJ35562.1| permease YjgP/YjgQ family protein [Thiothrix nivea DSM 5205]
          Length = 355

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 66  YNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIAVSFECETLKAEK 125
           +++KK   + V     +A R ++I HA   E+D SG    E E+  I      ++++ EK
Sbjct: 164 WSEKKLEGISVYTINPEAGRIDSITHAVSAERDTSGWTLHEVERKTIL----ADSIQQEK 219

Query: 126 VAEDHIKQYMP 136
           +A++H  + +P
Sbjct: 220 LAQEHTDKLLP 230


>gi|168066545|ref|XP_001785196.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663197|gb|EDQ49975.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 402

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 26/55 (47%)

Query: 47  ERECSRGEDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSG 101
           E  CS  ED+  +      ++ + +H ++   R HD A  N I+HAHG      G
Sbjct: 10  ESRCSALEDFTALGQIPCPFHGEYKHAILASPRTHDRASINPISHAHGGRTTTRG 64


>gi|346318143|gb|EGX87747.1| acetyl-CoA carboxylase [Cordyceps militaris CM01]
          Length = 2271

 Score = 35.0 bits (79), Expect = 9.4,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 1/74 (1%)

Query: 54   EDYRLIKLAIIDYNKKKEHDVIVECRGHDAARFNNINHAHGWEKDVSGMVEQEQEKNKIA 113
            E+Y L  +A    N+K+        R H+     + +    WEKD   + E   EK + +
Sbjct: 2160 EEYMLRHMATSTLNQKESTATTNTTREHNLKLLQSWSRIEKWEKDDQAVAEW-YEKERRS 2218

Query: 114  VSFECETLKAEKVA 127
            +  + ETLKAEK++
Sbjct: 2219 IGEKVETLKAEKLS 2232


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.134    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,530,595,519
Number of Sequences: 23463169
Number of extensions: 97306229
Number of successful extensions: 244316
Number of sequences better than 100.0: 37
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 244278
Number of HSP's gapped (non-prelim): 38
length of query: 163
length of database: 8,064,228,071
effective HSP length: 124
effective length of query: 39
effective length of database: 9,449,762,411
effective search space: 368540734029
effective search space used: 368540734029
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)