BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 045165
         (203 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224081417|ref|XP_002306402.1| predicted protein [Populus trichocarpa]
 gi|222855851|gb|EEE93398.1| predicted protein [Populus trichocarpa]
          Length = 196

 Score =  335 bits (859), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 166/196 (84%), Positives = 179/196 (91%), Gaps = 4/196 (2%)

Query: 12  MEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVF 71
           MEPK  HPLHQIAETPTHKLLLKQWLKEEELI NRI+LKETQIDSV KEITQLY+FFF+F
Sbjct: 1   MEPKQQHPLHQIAETPTHKLLLKQWLKEEELIFNRISLKETQIDSVCKEITQLYIFFFLF 60

Query: 72  HSIALILLFNASSNDPV----EKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLL 127
           HSI+L+LLFNASS DP      ++C+RSWIPSLC+LLCSLG+IWAVRYK+DVE HLEKLL
Sbjct: 61  HSISLLLLFNASSKDPFGAKGSEACKRSWIPSLCSLLCSLGIIWAVRYKTDVECHLEKLL 120

Query: 128 EREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFF 187
           EREKEDG+LLAKCVEELKKKG EFDLLKEVDALRRAKSLRVE K V+KWS RDFVTLFFF
Sbjct: 121 EREKEDGQLLAKCVEELKKKGGEFDLLKEVDALRRAKSLRVETKVVRKWSTRDFVTLFFF 180

Query: 188 SVSCLVLALTRIILCN 203
           SVSCLVL LTRIILCN
Sbjct: 181 SVSCLVLGLTRIILCN 196


>gi|449493024|ref|XP_004159170.1| PREDICTED: uncharacterized protein LOC101230414 [Cucumis sativus]
          Length = 204

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 156/203 (76%), Positives = 180/203 (88%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           M D      S+MEPKP HPLHQIAETPTHKLLLKQWLKEEELI  RI+LKETQIDSVRKE
Sbjct: 1   MDDSNNKALSLMEPKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKE 60

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVE 120
           IT L++FFFVFHS A++LLFNAS+ D    +C+RSWIPSLC+LL SLG+IWAVRYK+DVE
Sbjct: 61  ITMLHIFFFVFHSTAILLLFNASTKDFHGVACKRSWIPSLCSLLFSLGIIWAVRYKTDVE 120

Query: 121 SHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARD 180
           +HLEKLLEREKED  LL+KCV+ELK+KG+EFDLLKEVDALRRAKSLRVEAKAV+KWS+RD
Sbjct: 121 AHLEKLLEREKEDRNLLSKCVDELKRKGIEFDLLKEVDALRRAKSLRVEAKAVRKWSSRD 180

Query: 181 FVTLFFFSVSCLVLALTRIILCN 203
           F+TLFFFSVSC+ L + R++LCN
Sbjct: 181 FITLFFFSVSCMFLGIIRVVLCN 203


>gi|449444130|ref|XP_004139828.1| PREDICTED: uncharacterized protein LOC101213346 [Cucumis sativus]
          Length = 193

 Score =  322 bits (824), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 153/192 (79%), Positives = 176/192 (91%)

Query: 12  MEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVF 71
           MEPKP HPLHQIAETPTHKLLLKQWLKEEELI  RI+LKETQIDSVRKEIT L++FFFVF
Sbjct: 1   MEPKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVF 60

Query: 72  HSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREK 131
           HS A++LLFNAS+ D    +C+RSWIPSLC+LL SLG+IWAVRYK+DVE+HLEKLLEREK
Sbjct: 61  HSTAILLLFNASTKDFHGVACKRSWIPSLCSLLFSLGIIWAVRYKTDVEAHLEKLLEREK 120

Query: 132 EDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFFSVSC 191
           ED  LL+KCV+ELK+KG+EFDLLKEVDALRRAKSLRVEAKAV+KWS+RDF+TLFFFSVSC
Sbjct: 121 EDRNLLSKCVDELKRKGIEFDLLKEVDALRRAKSLRVEAKAVRKWSSRDFITLFFFSVSC 180

Query: 192 LVLALTRIILCN 203
           + L + R++LCN
Sbjct: 181 MFLGIIRVVLCN 192


>gi|15230623|ref|NP_187893.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6997162|gb|AAF34826.1| hypothetical protein [Arabidopsis thaliana]
 gi|9279771|dbj|BAB01416.1| unnamed protein product [Arabidopsis thaliana]
 gi|149944345|gb|ABR46215.1| At3g12870 [Arabidopsis thaliana]
 gi|332641734|gb|AEE75255.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 206

 Score =  321 bits (823), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 164/206 (79%), Positives = 184/206 (89%), Gaps = 3/206 (1%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           MA DQ T P IME K  HPLHQIA+TPTHKLLLKQWLKEEELIL+R++ KE+QIDSVR+E
Sbjct: 1   MASDQSTNPPIMEAKTRHPLHQIADTPTHKLLLKQWLKEEELILSRVSHKESQIDSVRRE 60

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPVEKS---CRRSWIPSLCTLLCSLGLIWAVRYKS 117
           ITQLY+FFF+FHSI+L+LLF+ASS+     S   C+RSWIPSLC LL SLG+IWAVRYKS
Sbjct: 61  ITQLYIFFFLFHSISLLLLFHASSSSSSSASSSACKRSWIPSLCALLSSLGIIWAVRYKS 120

Query: 118 DVESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWS 177
           +VESHLEKLLEREKED KLL KCVEELKKKG+EFDLLKEVDALRRAKSLRVE+K VKKWS
Sbjct: 121 EVESHLEKLLEREKEDAKLLRKCVEELKKKGIEFDLLKEVDALRRAKSLRVESKPVKKWS 180

Query: 178 ARDFVTLFFFSVSCLVLALTRIILCN 203
           ARDFVTLFFFSVSCLVLA+ R+ILC+
Sbjct: 181 ARDFVTLFFFSVSCLVLAMIRLILCD 206


>gi|297834098|ref|XP_002884931.1| hypothetical protein ARALYDRAFT_897496 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330771|gb|EFH61190.1| hypothetical protein ARALYDRAFT_897496 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 208

 Score =  319 bits (818), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 162/208 (77%), Positives = 184/208 (88%), Gaps = 5/208 (2%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           MA DQ T P IME K  HPLHQIA+TPTHKLLLKQWLKEEELIL+R++ KE+QIDSVR+E
Sbjct: 1   MASDQSTNPPIMEAKTRHPLHQIADTPTHKLLLKQWLKEEELILSRVSHKESQIDSVRRE 60

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPVEKS-----CRRSWIPSLCTLLCSLGLIWAVRY 115
           ITQLY+FFF+FHSI+L+LLF+ASS+     +     C+RSWIPSLC LL SLG+IWAVRY
Sbjct: 61  ITQLYIFFFLFHSISLLLLFHASSSSSPSSASASSACKRSWIPSLCALLSSLGIIWAVRY 120

Query: 116 KSDVESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKK 175
           KS+VESHLEKLLEREKED KLL KCVEELKKKG+EFDLLKEVDALRRAKSLRVE+K V+K
Sbjct: 121 KSEVESHLEKLLEREKEDAKLLRKCVEELKKKGIEFDLLKEVDALRRAKSLRVESKPVRK 180

Query: 176 WSARDFVTLFFFSVSCLVLALTRIILCN 203
           WSARDFVTLFFFSVSCLVLA+ R+ILC+
Sbjct: 181 WSARDFVTLFFFSVSCLVLAMIRLILCD 208


>gi|356508062|ref|XP_003522780.1| PREDICTED: uncharacterized protein LOC100807016 [Glycine max]
          Length = 205

 Score =  300 bits (769), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 150/201 (74%), Positives = 171/201 (85%), Gaps = 4/201 (1%)

Query: 3   DDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEIT 62
            +  T+ SIME K  HPLHQIAETPTHKLLLKQWLKEEELI +RIALKETQIDS RKEIT
Sbjct: 9   QNNSTRSSIMEWKQMHPLHQIAETPTHKLLLKQWLKEEELINSRIALKETQIDSTRKEIT 68

Query: 63  QLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESH 122
            LY+FFF+FHS  L+LLFN+S       SC RSW+PSLC+LL SLGLIWA+RYKSDVE+H
Sbjct: 69  TLYIFFFLFHSTTLMLLFNSS----SPSSCHRSWVPSLCSLLFSLGLIWALRYKSDVEAH 124

Query: 123 LEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFV 182
           +EK+L+REKED  LL KCVEELKKKG+EFDLLKEVDALRRAKSLRVE K V+KWS+RDFV
Sbjct: 125 VEKMLQREKEDKGLLGKCVEELKKKGLEFDLLKEVDALRRAKSLRVENKEVRKWSSRDFV 184

Query: 183 TLFFFSVSCLVLALTRIILCN 203
           +LFFFS++C  LAL R+ILC 
Sbjct: 185 SLFFFSMACFSLALMRVILCT 205


>gi|388503696|gb|AFK39914.1| unknown [Lotus japonicus]
          Length = 208

 Score =  299 bits (766), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 151/205 (73%), Positives = 176/205 (85%), Gaps = 4/205 (1%)

Query: 3   DDQG---TKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRK 59
           DD G   T+ SIME K  HPLHQIAETPTHKLLLKQWLKEEELI  RIALKETQIDS+RK
Sbjct: 4   DDAGAMTTRTSIMEWKQMHPLHQIAETPTHKLLLKQWLKEEELIHGRIALKETQIDSIRK 63

Query: 60  EITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDV 119
           EIT LY++FF+FHS  L+LLFN SS+    ++C RSW+PSLC+LL SLG IWAVRYKSDV
Sbjct: 64  EITMLYIYFFLFHSTTLMLLFNNSSSSTTSRACHRSWVPSLCSLLFSLGSIWAVRYKSDV 123

Query: 120 ESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLR-VEAKAVKKWSA 178
           E+H+EK+L REKED  +L KCVEELKKKG+EFDLLKEVDALRRAKSL+ VE K V+KWS+
Sbjct: 124 EAHMEKMLRREKEDKGMLGKCVEELKKKGLEFDLLKEVDALRRAKSLKVVEVKEVRKWSS 183

Query: 179 RDFVTLFFFSVSCLVLALTRIILCN 203
           RDFV+LFFFS++CL LA+TR+ILC+
Sbjct: 184 RDFVSLFFFSMACLSLAVTRVILCS 208


>gi|351723681|ref|NP_001234984.1| uncharacterized protein LOC100527362 [Glycine max]
 gi|255632177|gb|ACU16448.1| unknown [Glycine max]
          Length = 205

 Score =  292 bits (748), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 147/199 (73%), Positives = 167/199 (83%), Gaps = 4/199 (2%)

Query: 5   QGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQL 64
              + SIME K  HPLHQIAETPTHKLLLKQWLKEEELI NRIALKETQIDS RKEIT  
Sbjct: 11  NSIRSSIMEWKQMHPLHQIAETPTHKLLLKQWLKEEELINNRIALKETQIDSTRKEITTP 70

Query: 65  YMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLE 124
           Y+FFF+FHS  L+LLFN+S       SC RSW+PSLC+LL SLGLIWA+RYKSDVE+H+E
Sbjct: 71  YIFFFLFHSTTLMLLFNSS----SSSSCHRSWVPSLCSLLFSLGLIWALRYKSDVEAHVE 126

Query: 125 KLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTL 184
           K+L+REKED  LL +CVEELKKKG+EFDLLKEVDALRRAKSLRVE K V+KWS+RD V+ 
Sbjct: 127 KMLQREKEDKGLLGRCVEELKKKGLEFDLLKEVDALRRAKSLRVENKEVRKWSSRDLVSS 186

Query: 185 FFFSVSCLVLALTRIILCN 203
           FFFS++C  LALTR+ILC 
Sbjct: 187 FFFSMACFSLALTRVILCT 205


>gi|225465054|ref|XP_002265299.1| PREDICTED: uncharacterized protein LOC100264983 [Vitis vinifera]
          Length = 202

 Score =  290 bits (743), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 167/200 (83%), Positives = 181/200 (90%)

Query: 4   DQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQ 63
           DQ T P IM+ K  HPLHQIAETPTH+LLLKQWLKEEELILNRI+LKETQID VR+EITQ
Sbjct: 3   DQSTNPLIMDAKSVHPLHQIAETPTHRLLLKQWLKEEELILNRISLKETQIDGVRREITQ 62

Query: 64  LYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHL 123
           LY FFF+FHS  LILLFN SS++P   +C RSWIPSLC+LLCSLGLIWAVRYK+DVESH+
Sbjct: 63  LYCFFFIFHSTLLILLFNNSSSNPNGSACHRSWIPSLCSLLCSLGLIWAVRYKTDVESHM 122

Query: 124 EKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVT 183
           EKLLEREKEDGKLL KCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV+KWSARDFVT
Sbjct: 123 EKLLEREKEDGKLLGKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVRKWSARDFVT 182

Query: 184 LFFFSVSCLVLALTRIILCN 203
            FFF+VSCLVL L R+ILCN
Sbjct: 183 FFFFTVSCLVLGLVRVILCN 202


>gi|219363009|ref|NP_001136939.1| uncharacterized protein LOC100217098 [Zea mays]
 gi|194697690|gb|ACF82929.1| unknown [Zea mays]
 gi|413932910|gb|AFW67461.1| hypothetical protein ZEAMMB73_710114 [Zea mays]
          Length = 204

 Score =  223 bits (568), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 116/206 (56%), Positives = 148/206 (71%), Gaps = 5/206 (2%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           MAD       + E +  HPL QIA + TH+LLLKQW+KEE+L+  R+AL+E ++D  RKE
Sbjct: 1   MADSGSI--VVTEARKQHPLSQIAASGTHRLLLKQWVKEEDLLARRVALREARLDGARKE 58

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVE 120
           I  LY  FF FH+ +++LLF +SS      +CRRSWIP L +LL SL ++WA+RYKSD E
Sbjct: 59  IAFLYCAFFAFHAASVLLLFLSSSTAGPASACRRSWIPCLVSLLSSLAMLWALRYKSDTE 118

Query: 121 SHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV---KKWS 177
           + LE+LL RE+ED  LLA+CV ELK+KG+ FDLLKEVDALRRAKSLRVEAK     ++W 
Sbjct: 119 AVLERLLAREREDALLLARCVSELKRKGLHFDLLKEVDALRRAKSLRVEAKGADRPRRWQ 178

Query: 178 ARDFVTLFFFSVSCLVLALTRIILCN 203
           ARD      F+V+C VL LTR +LCN
Sbjct: 179 ARDLAVFALFAVACGVLVLTRFLLCN 204


>gi|115455689|ref|NP_001051445.1| Os03g0778400 [Oryza sativa Japonica Group]
 gi|113549916|dbj|BAF13359.1| Os03g0778400 [Oryza sativa Japonica Group]
          Length = 202

 Score =  220 bits (560), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 111/193 (57%), Positives = 141/193 (73%), Gaps = 7/193 (3%)

Query: 13  EPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFH 72
           + K  HPL QIAE+ TH+LLLKQWLKEE+L+  R+AL+E ++D  RKEI  LY  FF FH
Sbjct: 11  QAKARHPLSQIAESGTHRLLLKQWLKEEDLLARRVALREARLDGARKEIAFLYCAFFAFH 70

Query: 73  SIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKE 132
           + +++LLF ++S      +CRRSWIP L +LL SL ++WA+RYK+D E+ LE+LL RE+E
Sbjct: 71  AASILLLFLSASA-STSAACRRSWIPCLVSLLSSLAMLWALRYKADTEAVLERLLARERE 129

Query: 133 DGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV------KKWSARDFVTLFF 186
           D  LL KCV ELK+KG+EFDLLKEVDALRRAKSLRVEAK        K+W+ARD      
Sbjct: 130 DALLLGKCVAELKRKGLEFDLLKEVDALRRAKSLRVEAKGGAGGERPKRWAARDLAVFLL 189

Query: 187 FSVSCLVLALTRI 199
           F  +C VL LTR+
Sbjct: 190 FGAACGVLVLTRL 202


>gi|12957701|gb|AAK09219.1|AC084320_6 hypothetical protein [Oryza sativa Japonica Group]
 gi|20303603|gb|AAM19030.1|AC084748_20 hypothetical protein [Oryza sativa Japonica Group]
 gi|108711364|gb|ABF99159.1| expressed protein [Oryza sativa Japonica Group]
 gi|125545905|gb|EAY92044.1| hypothetical protein OsI_13736 [Oryza sativa Indica Group]
 gi|125588111|gb|EAZ28775.1| hypothetical protein OsJ_12795 [Oryza sativa Japonica Group]
          Length = 241

 Score =  218 bits (556), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 111/192 (57%), Positives = 140/192 (72%), Gaps = 7/192 (3%)

Query: 13  EPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFH 72
           + K  HPL QIAE+ TH+LLLKQWLKEE+L+  R+AL+E ++D  RKEI  LY  FF FH
Sbjct: 11  QAKARHPLSQIAESGTHRLLLKQWLKEEDLLARRVALREARLDGARKEIAFLYCAFFAFH 70

Query: 73  SIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKE 132
           + +++LLF  S++     +CRRSWIP L +LL SL ++WA+RYK+D E+ LE+LL RE+E
Sbjct: 71  AASILLLF-LSASASTSAACRRSWIPCLVSLLSSLAMLWALRYKADTEAVLERLLARERE 129

Query: 133 DGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV------KKWSARDFVTLFF 186
           D  LL KCV ELK+KG+EFDLLKEVDALRRAKSLRVEAK        K+W+ARD      
Sbjct: 130 DALLLGKCVAELKRKGLEFDLLKEVDALRRAKSLRVEAKGGAGGERPKRWAARDLAVFLL 189

Query: 187 FSVSCLVLALTR 198
           F  +C VL LTR
Sbjct: 190 FGAACGVLVLTR 201


>gi|326517669|dbj|BAK03753.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 213

 Score =  216 bits (551), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 110/211 (52%), Positives = 146/211 (69%), Gaps = 11/211 (5%)

Query: 4   DQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQ 63
           D     ++  P+  HPL QIA + TH+LLLKQWLKEE+L+  R+AL+E ++D  RKEI  
Sbjct: 3   DASALVAVPPPQQRHPLSQIAASGTHRLLLKQWLKEEDLLARRVALREARLDGARKEIAF 62

Query: 64  LYMFFFVFHSIALILLF---NASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVE 120
           LY  FF FH+ +++LLF   +  S+ P   +CRRSWIP L +LL SL ++WA+RYK+D E
Sbjct: 63  LYCAFFAFHAASVLLLFLSASVPSSGPSSSTCRRSWIPCLVSLLSSLAMLWALRYKADTE 122

Query: 121 SHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV------- 173
           + L ++L RE+ED  LLA+CV ELK+KG++FDLLKEVDALRRAK+LRVEAKA        
Sbjct: 123 AVLGRVLAREQEDAALLARCVAELKRKGLDFDLLKEVDALRRAKALRVEAKAGSGAAADG 182

Query: 174 -KKWSARDFVTLFFFSVSCLVLALTRIILCN 203
             +W ARD      F  +C V+ LTR +LCN
Sbjct: 183 PSRWPARDLPVFALFGAACGVIVLTRFLLCN 213


>gi|357113286|ref|XP_003558435.1| PREDICTED: uncharacterized protein LOC100840254 [Brachypodium
           distachyon]
          Length = 211

 Score =  211 bits (538), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 112/198 (56%), Positives = 142/198 (71%), Gaps = 11/198 (5%)

Query: 17  PHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIAL 76
           PHPL QIA + TH+LLLKQWLKEE+L+  R+AL+E ++D  RKEI  LY  FF FH+ +L
Sbjct: 14  PHPLSQIAASGTHRLLLKQWLKEEDLLARRVALREARLDGARKEIAFLYCAFFAFHAASL 73

Query: 77  ILLFNASSNDPV----EKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKE 132
           +LLF +SS          +CRRSWIP L +LL SL ++WA+RYK+D E+ LE++L RE+E
Sbjct: 74  LLLFLSSSVPVSSGPSTAACRRSWIPCLVSLLSSLAMLWALRYKADTEAVLERVLAREQE 133

Query: 133 DGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV-------KKWSARDFVTLF 185
           D  LLA+CV ELK+KG+EFDLLKEVDALRRAK+LRVEAK         ++W ARD     
Sbjct: 134 DAALLARCVAELKRKGLEFDLLKEVDALRRAKTLRVEAKGCGIGAESPRRWPARDLPVFA 193

Query: 186 FFSVSCLVLALTRIILCN 203
            F  +C VL LTR +LCN
Sbjct: 194 LFGAACGVLVLTRFLLCN 211


>gi|242037929|ref|XP_002466359.1| hypothetical protein SORBIDRAFT_01g006360 [Sorghum bicolor]
 gi|241920213|gb|EER93357.1| hypothetical protein SORBIDRAFT_01g006360 [Sorghum bicolor]
          Length = 210

 Score =  210 bits (534), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 109/192 (56%), Positives = 142/192 (73%), Gaps = 6/192 (3%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL QIAE+ TH+LLLKQW+KEE+L+  R+AL+E ++D  RKEI  LY  FF FH+ +++
Sbjct: 19  HPLSQIAESGTHRLLLKQWVKEEDLLARRVALREARLDGARKEIAFLYCAFFAFHAASVL 78

Query: 78  LLF--NASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGK 135
           LLF  +AS+      +C+RSWIP L +LL SL ++WA+RYKSD E+ LE++L RE+ED  
Sbjct: 79  LLFLSSASTGSASAAACKRSWIPCLVSLLSSLAMLWALRYKSDTEAVLERMLAREREDAL 138

Query: 136 LLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV----KKWSARDFVTLFFFSVSC 191
           LLA+CV ELK+KG+ FDLLKEVDALRRAKSLRVEAK      ++W  RD      F+ +C
Sbjct: 139 LLARCVSELKRKGIHFDLLKEVDALRRAKSLRVEAKGGADKPRRWQTRDLAVFALFAAAC 198

Query: 192 LVLALTRIILCN 203
            VL LTR +LCN
Sbjct: 199 GVLVLTRFLLCN 210


>gi|414873194|tpg|DAA51751.1| TPA: hypothetical protein ZEAMMB73_846434 [Zea mays]
          Length = 210

 Score =  187 bits (476), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/202 (51%), Positives = 138/202 (68%), Gaps = 9/202 (4%)

Query: 11  IMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFV 70
           + E +  HPL QIA + TH+LLLKQW+KEE+L+  R+AL E  ++  RKEI  LY  FF 
Sbjct: 9   VTEAQQQHPLSQIAASETHRLLLKQWVKEEDLLARRVALWEACLNGARKEIAFLYCAFFA 68

Query: 71  FHSIALILLFNASSNDPV-----EKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEK 125
           FH+ +++LLF +SS+          +CRRSWIP L +LL SL ++WA+ YKSD E+ LE+
Sbjct: 69  FHAASVLLLFLSSSSSSAASAAATTACRRSWIPYLVSLLSSLTMLWALWYKSDTEAVLER 128

Query: 126 LLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV----KKWSARDF 181
           +L RE+ED   LA+CV ELK+KG+ FDLLKEVD LRRAK+LR EAK      ++W  RD 
Sbjct: 129 VLAREREDALYLARCVSELKRKGLRFDLLKEVDTLRRAKNLRAEAKGAADRPRRWQTRDL 188

Query: 182 VTLFFFSVSCLVLALTRIILCN 203
                F+ +C VL LTR +LCN
Sbjct: 189 AVFALFAAACGVLVLTRFLLCN 210


>gi|414873189|tpg|DAA51746.1| TPA: hypothetical protein ZEAMMB73_807670 [Zea mays]
          Length = 211

 Score =  181 bits (460), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 108/213 (50%), Positives = 141/213 (66%), Gaps = 12/213 (5%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           MAD       + E +  HPL QIA + TH+LLLKQW+KEE+L+  R+ L E  ++  RKE
Sbjct: 1   MADSGAL--VVTEARQQHPLSQIAASETHRLLLKQWVKEEDLLARRVVLWEACLNGARKE 58

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPV------EKSCRRSWIPSLCTLLCSLGLIWAVR 114
           I  LY  FF FHS +++LLF +SS+           +CRRSWI  L +LL SL +IWA+ 
Sbjct: 59  IAFLYCAFFAFHSASVLLLFLSSSSSSSAGLAAATTACRRSWILCLVSLLSSLAMIWALW 118

Query: 115 YKSDVESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV- 173
           YKSD E+ LE++L RE+ED   LA+CV ELK+KG+ FDLLKEVD +RRAK+LR EAK   
Sbjct: 119 YKSDTEAILERVLAREREDALHLARCVSELKRKGLRFDLLKEVDTVRRAKNLRAEAKGAA 178

Query: 174 ---KKWSARDFVTLFFFSVSCLVLALTRIILCN 203
              ++W  RD     FF+V+C VL LTR +LCN
Sbjct: 179 DRPRRWQTRDLAVFAFFAVACGVLMLTRFLLCN 211


>gi|194695944|gb|ACF82056.1| unknown [Zea mays]
 gi|413932911|gb|AFW67462.1| hypothetical protein ZEAMMB73_710114 [Zea mays]
          Length = 181

 Score =  173 bits (439), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 98/206 (47%), Positives = 127/206 (61%), Gaps = 28/206 (13%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           MAD       + E +  HPL QIA + TH+LLLKQW+KEE+L+  R+AL+E ++D  RKE
Sbjct: 1   MADSGSI--VVTEARKQHPLSQIAASGTHRLLLKQWVKEEDLLARRVALREARLDGARKE 58

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVE 120
           I  LY  FF FH+ +++LLF +SS      +CRRSWIP L +LL SL ++WA+RYKSD E
Sbjct: 59  IAFLYCAFFAFHAASVLLLFLSSSTAGPASACRRSWIPCLVSLLSSLAMLWALRYKSDTE 118

Query: 121 SHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV---KKWS 177
           + LE+LL RE+ED  LLA                       RAKSLRVEAK     ++W 
Sbjct: 119 AVLERLLAREREDALLLA-----------------------RAKSLRVEAKGADRPRRWQ 155

Query: 178 ARDFVTLFFFSVSCLVLALTRIILCN 203
           ARD      F+V+C VL LTR +LCN
Sbjct: 156 ARDLAVFALFAVACGVLVLTRFLLCN 181


>gi|226493802|ref|NP_001143323.1| uncharacterized protein LOC100275906 [Zea mays]
 gi|195617750|gb|ACG30705.1| hypothetical protein [Zea mays]
 gi|414872819|tpg|DAA51376.1| TPA: hypothetical protein ZEAMMB73_102701 [Zea mays]
          Length = 187

 Score =  152 bits (384), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 119/202 (58%), Gaps = 32/202 (15%)

Query: 11  IMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFV 70
           + E +  HPL QIA + TH+LLLKQW+KEE+L+  R+AL+E ++D  RKEI  LY  FF 
Sbjct: 9   VTEARQQHPLSQIAASGTHRLLLKQWVKEEDLLARRVALREARLDGARKEIAFLYCAFFA 68

Query: 71  FHS-----IALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEK 125
           FH+     + L    + +++     +CRRSWIP L +LL SL ++WA+RYKSD E+ LE+
Sbjct: 69  FHAASVLLLFLSSSSSVAASAAATTACRRSWIPCLVSLLSSLAMLWALRYKSDTEAVLER 128

Query: 126 LLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV----KKWSARDF 181
           +L RE+ED  LLA                       RAK+LR EAK      ++W ARD 
Sbjct: 129 VLAREREDALLLA-----------------------RAKNLRAEAKGAADRPRRWQARDL 165

Query: 182 VTLFFFSVSCLVLALTRIILCN 203
                F+ +C VL LTR +LCN
Sbjct: 166 AVFALFAAACGVLVLTRFLLCN 187


>gi|224122256|ref|XP_002330578.1| predicted protein [Populus trichocarpa]
 gi|222872136|gb|EEF09267.1| predicted protein [Populus trichocarpa]
          Length = 221

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/187 (42%), Positives = 111/187 (59%), Gaps = 2/187 (1%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+  P H LLLK W +EE L   RIALKE+++ S+++EI  L  FF VFH I L 
Sbjct: 35  HPLAEISNCPGHLLLLKLWQREENLFGRRIALKESRMVSIKREIFHLCCFFLVFHGIFLT 94

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLL 137
           +LF  SS D  E +C+  WIPSL ++  SL  +  V+ K      + + L+REK + + L
Sbjct: 95  ILF-TSSVDSKEHTCKNWWIPSLVSVCTSLVFVSLVQVKICSYWKVSRQLQREKNNDRAL 153

Query: 138 AKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVK-KWSARDFVTLFFFSVSCLVLAL 196
            KC++EL+ KG  FDL KE  + +R KS  VE K     W ++  +T+   S SCLV   
Sbjct: 154 TKCIQELRMKGASFDLSKEPVSGKRMKSSSVEIKWKPFTWCSQYLITICLVSFSCLVFPA 213

Query: 197 TRIILCN 203
           ++ ILC 
Sbjct: 214 SKFILCG 220


>gi|255584418|ref|XP_002532941.1| conserved hypothetical protein [Ricinus communis]
 gi|223527292|gb|EEF29445.1| conserved hypothetical protein [Ricinus communis]
          Length = 220

 Score =  136 bits (343), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/187 (39%), Positives = 113/187 (60%), Gaps = 1/187 (0%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+ +P H LLLK W +EE+L   +I +KET++DS+++EI QL  FF +FH I L 
Sbjct: 33  HPLTEISSSPGHLLLLKLWQREEDLFNRKIGVKETRMDSIKREIFQLCSFFLIFHGIFLT 92

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLL 137
           +LF +SS++  E +C++ WIPSL ++  SL  +   + K      + K L+RE+ D + L
Sbjct: 93  ILFTSSSDNKEEHTCKKWWIPSLVSISTSLVFVSLAQVKLFRYWKVWKQLQRERNDNRAL 152

Query: 138 AKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVK-KWSARDFVTLFFFSVSCLVLAL 196
            +C++EL+ KG  FDL KE    +R KS  VE K     W ++  VT+     S L+   
Sbjct: 153 TRCIQELRMKGSSFDLSKEPQCGKRMKSSSVEIKWKPITWCSQYLVTICLVCFSGLLFPA 212

Query: 197 TRIILCN 203
           ++ ILC 
Sbjct: 213 SKFILCG 219


>gi|225462043|ref|XP_002274782.1| PREDICTED: uncharacterized protein LOC100251512 [Vitis vinifera]
          Length = 221

 Score =  136 bits (342), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 77/187 (41%), Positives = 118/187 (63%), Gaps = 2/187 (1%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+++P H LLLK W +EE+L   RIAL+ET++DS+++EI QL  FFF+FH+  LI
Sbjct: 36  HPLMEISQSPGHLLLLKLWQREEDLSGRRIALRETRMDSIKREIFQLCCFFFMFHAFYLI 95

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLL 137
           LLF  SS D  + +CR+ WIP +  +  SL +++ V+ K      + + L+REK DG+ +
Sbjct: 96  LLFT-SSVDLEDHACRKWWIPVVLLVSTSLAIVFLVQVKLWRYWKVYRQLQREKSDGRAV 154

Query: 138 AKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVK-KWSARDFVTLFFFSVSCLVLAL 196
            + ++EL+ KG  FDL KE    RR KS  VE K     W +++ +T+     + LV+  
Sbjct: 155 TRFIQELRMKGASFDLSKEPQNSRRMKSSSVEIKWRPLVWCSQNLITICLVCFTGLVIPA 214

Query: 197 TRIILCN 203
            + ILC+
Sbjct: 215 CKFILCS 221


>gi|224056885|ref|XP_002299072.1| predicted protein [Populus trichocarpa]
 gi|222846330|gb|EEE83877.1| predicted protein [Populus trichocarpa]
          Length = 219

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/186 (40%), Positives = 113/186 (60%), Gaps = 2/186 (1%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+ +P H LLLK W +EE+L   RIALKE+++DS+++EI QL  FF VFH   L 
Sbjct: 35  HPLAEISNSPGHLLLLKLWQREEDLFGRRIALKESRMDSIKREIFQLCCFFLVFHGFFLT 94

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLL 137
           +LF  SS D  E SC+  WIPSL ++  SL  ++ V+ K      + + L+REK D + L
Sbjct: 95  ILFT-SSVDSKEHSCKNWWIPSLVSVCTSLVFVFLVQVKVCRYWKVWRQLQREKNDNRAL 153

Query: 138 AKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVK-KWSARDFVTLFFFSVSCLVLAL 196
            +C++EL+ KG  FDL KE  + ++ KS  VE K     W ++  +T+     + LV   
Sbjct: 154 TRCIQELRMKGASFDLSKEPLSGKKMKSSSVEIKWKPLTWCSQYLITICLVCFTGLVFPA 213

Query: 197 TRIILC 202
           ++ +LC
Sbjct: 214 SKFMLC 219


>gi|147789348|emb|CAN62350.1| hypothetical protein VITISV_004944 [Vitis vinifera]
          Length = 218

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/154 (45%), Positives = 102/154 (66%), Gaps = 1/154 (0%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+++P H LLLK W +EE+L   RIAL+ET++DS+++EI QL  FFF+FH+  LI
Sbjct: 36  HPLMEISQSPGHLLLLKLWQREEDLSGRRIALRETRMDSIKREIFQLCCFFFMFHAFYLI 95

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLL 137
           LLF  SS D  + +CR+ WIP +  +  SL +++ V+ K      + + L+REK DG+ +
Sbjct: 96  LLFT-SSVDLEDHACRKWWIPVVLLVSTSLAIVFLVQVKLWRYWKVYRQLQREKSDGRAV 154

Query: 138 AKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAK 171
            + ++EL+ KG  FDL KE    RR KS  VE K
Sbjct: 155 TRFIQELRMKGASFDLSKEPQNSRRMKSSSVEIK 188


>gi|356524423|ref|XP_003530828.1| PREDICTED: uncharacterized protein LOC100801421 [Glycine max]
          Length = 232

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 115/201 (57%), Gaps = 7/201 (3%)

Query: 10  SIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFF 69
           +++E    HPL +I+E+P H LLLK W +EEEL   RIA KET++D+++ E+ QL  FFF
Sbjct: 31  NMIEAIEDHPLTEISESPGHLLLLKLWQREEELFSKRIAHKETRMDTIKAELFQLSSFFF 90

Query: 70  VFHSIALILLFNA---SSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKL 126
           +FH+  L LLF +   +        C + W+PS+ +L  S   +  V+ K      +   
Sbjct: 91  IFHAFFLTLLFTSWAKAQQQAHHSVCHKWWLPSMVSLCTSFVFVVLVQMKVHRYWKVWGH 150

Query: 127 LEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVK----KWSARDFV 182
           L+RE+ DG+ + +C++EL+ KG  FDL KE ++    K ++  +  +K     W +++ +
Sbjct: 151 LQRERNDGRAVTRCIQELRMKGASFDLSKEPNSSSSGKRMKSSSVEIKWRPLTWCSKNLL 210

Query: 183 TLFFFSVSCLVLALTRIILCN 203
           T+     S LV   ++++LC 
Sbjct: 211 TISLVCFSGLVFPASKLVLCG 231


>gi|297793101|ref|XP_002864435.1| hypothetical protein ARALYDRAFT_495698 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310270|gb|EFH40694.1| hypothetical protein ARALYDRAFT_495698 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 235

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 74/192 (38%), Positives = 115/192 (59%), Gaps = 6/192 (3%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+E+P H LLLK W +EE+L   R+ LKE++++S+++EI QL  FF VFH     
Sbjct: 43  HPLSEISESPGHLLLLKLWQREEDLFCRRVLLKESRLESIKREIFQLCCFFLVFHGFFFT 102

Query: 78  LLFNA--SSNDPVEKS---CRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKE 132
           L++++  S +D V KS   C++ WIPS  +L  SL L++ V+ K  V   + + + RE+ 
Sbjct: 103 LVYSSSCSDDDEVVKSNAVCKKWWIPSAVSLATSLVLVFLVQAKLFVFWKVYRGVHRERN 162

Query: 133 DGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAK-AVKKWSARDFVTLFFFSVSC 191
           D + L +CV EL+ KG  FDL KE  + +R KS  VE K     W ++  +T+     + 
Sbjct: 163 DNRTLTRCVLELRMKGSSFDLSKEPMSGKRMKSSSVEIKWKPVTWFSQYLITIVLLCFAG 222

Query: 192 LVLALTRIILCN 203
           L   +++ ILC 
Sbjct: 223 LFFPISKFILCG 234


>gi|30696692|ref|NP_200423.2| uncharacterized protein [Arabidopsis thaliana]
 gi|38603818|gb|AAR24654.1| At5g56120 [Arabidopsis thaliana]
 gi|51971979|dbj|BAD44654.1| putative protein [Arabidopsis thaliana]
 gi|332009340|gb|AED96723.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 236

 Score =  109 bits (272), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 71/192 (36%), Positives = 113/192 (58%), Gaps = 6/192 (3%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+E+P H LLLK W +EE+L   R+ LKE++++S+++EI QL  FF VFH     
Sbjct: 44  HPLSEISESPGHLLLLKLWQREEDLFCRRVLLKESRLESIKREIFQLCCFFLVFHGFFFT 103

Query: 78  LLFNASSNDPVE-----KSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKE 132
           L++++S +D  +       C++ WIPS  +L  SL L++ V+ K  V   + + + RE+ 
Sbjct: 104 LVYSSSCSDDADVVKSNAVCKKWWIPSAVSLATSLVLVFLVQAKLFVFWKVYRGVNRERN 163

Query: 133 DGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAK-AVKKWSARDFVTLFFFSVSC 191
           D + L +CV EL+ KG  FDL KE  + +R KS  VE K     W ++  +T+     + 
Sbjct: 164 DNRTLTRCVLELRMKGSSFDLSKEPMSGKRMKSSSVEIKWKPVTWFSQYLITIVLLCSAG 223

Query: 192 LVLALTRIILCN 203
           L   +++ ILC 
Sbjct: 224 LFFPVSKFILCG 235


>gi|356566275|ref|XP_003551359.1| PREDICTED: uncharacterized protein LOC100813747 [Glycine max]
          Length = 236

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/203 (35%), Positives = 117/203 (57%), Gaps = 9/203 (4%)

Query: 10  SIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFF 69
           +++E    HPL +I+E+P H LLLK W +EEEL   RIA KET++D+++ E+ QL  FFF
Sbjct: 33  NMIEAIEDHPLTEISESPGHLLLLKLWQREEELFAKRIAHKETRMDTIKAELFQLSSFFF 92

Query: 70  VFHSIALILLFNA-----SSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLE 124
           +FH+  L LLF +         P    C + W+PS+ +L  SL  +  V+ K      + 
Sbjct: 93  IFHAFFLTLLFTSWAKAQQQAQPHHSVCHKWWLPSMVSLCTSLVFVVLVQVKVHRYWKVW 152

Query: 125 KLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVD---ALRRAKSLRVEAKAVK-KWSARD 180
             L+RE+ DG+ + +C++EL+ KG  FDL KE +   +++R KS  VE K     W +++
Sbjct: 153 GHLQRERNDGRAVTRCIQELRMKGASFDLSKEPNSSSSVKRMKSSSVEIKWRPLTWCSKN 212

Query: 181 FVTLFFFSVSCLVLALTRIILCN 203
            +T      + LV   ++++LC 
Sbjct: 213 LLTFSLVCFTGLVFPASKLVLCG 235


>gi|449463707|ref|XP_004149573.1| PREDICTED: uncharacterized protein LOC101219596 [Cucumis sativus]
 gi|449526355|ref|XP_004170179.1| PREDICTED: uncharacterized protein LOC101225490 [Cucumis sativus]
          Length = 217

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 111/187 (59%), Gaps = 5/187 (2%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL QI+E+  H LLLK W ++E L   RI  +ET+++S++++I QL  FFF+FH+++L 
Sbjct: 34  HPLRQISESSGHLLLLKLWQRDEHLFGLRIGRRETKMESLKQQIFQLCCFFFLFHALSLT 93

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLL 137
           LL+  +S+DP+   C++ W+P++     S   +  V+ K  +       L++EK + + L
Sbjct: 94  LLY--TSSDPI--VCKKWWVPAVVIGATSGVFVIVVQLKLWMYWKARGQLQKEKTENRAL 149

Query: 138 AKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAK-AVKKWSARDFVTLFFFSVSCLVLAL 196
            +CV+EL+ KG  F+L KE     R KS  VE K     W +R+F+T+     S ++   
Sbjct: 150 TRCVQELRMKGSCFNLSKEPQIGNRMKSSSVEIKWGPLTWFSRNFITISLLGFSAIIFVT 209

Query: 197 TRIILCN 203
           ++ ILC 
Sbjct: 210 SKFILCG 216


>gi|9758632|dbj|BAB09294.1| unnamed protein product [Arabidopsis thaliana]
          Length = 237

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/179 (38%), Positives = 106/179 (59%), Gaps = 9/179 (5%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +I+E+P H LLLK W +EE+L   R+ LKE++++S+++EI QL  FF VFH     
Sbjct: 44  HPLSEISESPGHLLLLKLWQREEDLFCRRVLLKESRLESIKREIFQLCCFFLVFHGFFFT 103

Query: 78  LLFNASSNDPVE-----KSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKE 132
           L++++S +D  +       C++ WIPS  +L  SL L++ V+ K  V   + + + RE+ 
Sbjct: 104 LVYSSSCSDDADVVKSNAVCKKWWIPSAVSLATSLVLVFLVQAKLFVFWKVYRGVNRERN 163

Query: 133 DGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFFSVSC 191
           D + L +CV EL+ KG  FDL KE  + +R KS  VE     KW    +    FF++ C
Sbjct: 164 DNRTLTRCVLELRMKGSSFDLSKEPMSGKRMKSSSVEI----KWKPVTWTLHSFFNLHC 218


>gi|357141759|ref|XP_003572337.1| PREDICTED: uncharacterized protein LOC100824811 [Brachypodium
           distachyon]
          Length = 269

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 97/194 (50%), Gaps = 9/194 (4%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +IA +  H LLLK W +EE  +  R    E ++D+ R++   L   F   H ++L 
Sbjct: 75  HPLGEIASSGGHLLLLKLWQREESRLGRRACALEARMDAARRDAFYLCAAFLALHGLSLA 134

Query: 78  LLFNASS--NDPVEK-SCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDG 134
           LLF ASS   D  E  +CRR W PS  +L  SL +  AV+ +        + L RE+ D 
Sbjct: 135 LLFAASSVSADAGEGGACRRWWAPSSLSLATSLAIATAVQLRVRAYWRAARRLRRERGDA 194

Query: 135 KLLAKCVEELKKKGVEFDLLKEVD---ALRRAKSLRVEAKAV---KKWSARDFVTLFFFS 188
           + LA+CV+EL+ KG  FDL KE       R      VE        +W  R+ VT    +
Sbjct: 195 RALARCVQELRTKGAAFDLSKEPQWGVGARANCCASVEGAGAWGPLRWCYRNIVTACLLA 254

Query: 189 VSCLVLALTRIILC 202
           V+   L   ++ILC
Sbjct: 255 VAAAALPSGKLILC 268


>gi|255544648|ref|XP_002513385.1| hypothetical protein RCOM_1575310 [Ricinus communis]
 gi|223547293|gb|EEF48788.1| hypothetical protein RCOM_1575310 [Ricinus communis]
          Length = 63

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/51 (90%), Positives = 48/51 (94%)

Query: 123 LEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAV 173
           +EKLLEREKEDGKLL KCVEELKKKGVEFDLLKEVDALRRAKSLRVE K +
Sbjct: 1   MEKLLEREKEDGKLLGKCVEELKKKGVEFDLLKEVDALRRAKSLRVETKCI 51


>gi|302800802|ref|XP_002982158.1| hypothetical protein SELMODRAFT_445080 [Selaginella moellendorffii]
 gi|300150174|gb|EFJ16826.1| hypothetical protein SELMODRAFT_445080 [Selaginella moellendorffii]
          Length = 385

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/188 (30%), Positives = 95/188 (50%), Gaps = 17/188 (9%)

Query: 19  PLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALIL 78
           P  QIA  P  KL L+ W KEEE++  RI  KE ++ + + EI QL   +FVF  + L  
Sbjct: 193 PYAQIAANPALKLRLELWKKEEEVVNQRIERKEQRLQNTKNEIYQLLGIYFVFQGVILTA 252

Query: 79  LFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLLA 138
           L  A++  P      R WIP + ++L SL  I A+ +K +  + L++ +  EK   + L 
Sbjct: 253 LAQATNLPP-----SRVWIPFVLSMLASLATILALIFKLEDCAELQRQIASEKLLSRALF 307

Query: 139 KCVEELKKKGVEF---DL-----LKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFFS-- 188
           +  + LK  G ++   DL     L+    + R++  R+    ++ + A   V LF FS  
Sbjct: 308 QASDTLKSMGNDYNPTDLHLLPHLQNPKTVDRSQLSRIWISHMEGYGAFLLVILFIFSAI 367

Query: 189 --VSCLVL 194
             +SC ++
Sbjct: 368 MAISCWII 375



 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/134 (37%), Positives = 70/134 (52%), Gaps = 5/134 (3%)

Query: 19  PLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALIL 78
           P  QIA     +L L  W KEEE++  RI  KE ++ S + EI QL   +FVF  + L  
Sbjct: 20  PFEQIAGNKALRLRLDFWKKEEEVVNQRIERKEQRLQSAKNEIYQLLGIYFVFQGVILTA 79

Query: 79  LFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLLA 138
           L  A +ND    S +  WIP + + L S+  I A+ YK    S LE+ ++ EK   K L 
Sbjct: 80  L--AQAND---LSGKWRWIPFVLSTLASMATILALVYKLRDCSELERQIKGEKLMSKALF 134

Query: 139 KCVEELKKKGVEFD 152
           +  ++LK  G  FD
Sbjct: 135 QASDDLKSMGSAFD 148


>gi|414869264|tpg|DAA47821.1| TPA: hypothetical protein ZEAMMB73_612474 [Zea mays]
          Length = 234

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 66/193 (34%), Positives = 98/193 (50%), Gaps = 8/193 (4%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +IA +  H LLLK W +EE+ +  R    E ++D+ R++   L   F  FH ++L 
Sbjct: 41  HPLGEIAASEGHLLLLKLWQREEDRLGRRACALEARMDAARRDAFYLCAAFLAFHGLSLA 100

Query: 78  LLF----NASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKED 133
           LLF     A++      +CRR W PS  +L  SL L  AV+ +          L R++ D
Sbjct: 101 LLFAASVVAAAATSPSAACRRWWAPSSLSLAASLALAAAVQLRVGAYWRAAARLRRDRGD 160

Query: 134 GKLLAKCVEELKKKGVEFDLLKEVD-ALRRAKSLRVEAKAVK---KWSARDFVTLFFFSV 189
            + LA+ V+EL+ KG  FDL KE    + RAK   VE   +    +W  ++ VT    +V
Sbjct: 161 ARALARAVQELRLKGAAFDLSKEPQYGVTRAKCASVEGAGLWAPLRWFRQNVVTACLLAV 220

Query: 190 SCLVLALTRIILC 202
           +   L   + ILC
Sbjct: 221 AAAALPSGKFILC 233


>gi|242080105|ref|XP_002444821.1| hypothetical protein SORBIDRAFT_07g028590 [Sorghum bicolor]
 gi|241941171|gb|EES14316.1| hypothetical protein SORBIDRAFT_07g028590 [Sorghum bicolor]
          Length = 237

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/197 (35%), Positives = 100/197 (50%), Gaps = 11/197 (5%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +IA +  H LLLK W +EE+ +  R    E ++D+ R++   L   F  FH ++L 
Sbjct: 41  HPLGEIAASAGHLLLLKLWQREEDRLGRRACALEARMDAARRDAFYLCAAFLAFHGLSLA 100

Query: 78  LLF-------NASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLERE 130
           LLF        AS +     +CRR W PS  +L  SL L  AV+ +          L R+
Sbjct: 101 LLFAASVAASAASPSPSPSPACRRWWAPSSLSLAASLALAAAVQLRVCAYWRAAARLRRD 160

Query: 131 KEDGKLLAKCVEELKKKGVEFDLLKEVD-ALRRAKSLRVE---AKAVKKWSARDFVTLFF 186
           + D + LA+ V+EL+ KG  FDL KE    + RAK   VE   A A  +W  ++ VTL  
Sbjct: 161 RGDARALARAVQELRLKGAAFDLSKEPQYGVTRAKCASVEGAGAWAPLRWCRQNVVTLCL 220

Query: 187 FSVSCLVLALTRIILCN 203
            +V+   L   + ILC 
Sbjct: 221 LAVAAAALPSGKFILCT 237


>gi|302765427|ref|XP_002966134.1| hypothetical protein SELMODRAFT_407423 [Selaginella moellendorffii]
 gi|300165554|gb|EFJ32161.1| hypothetical protein SELMODRAFT_407423 [Selaginella moellendorffii]
          Length = 212

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/134 (38%), Positives = 70/134 (52%), Gaps = 5/134 (3%)

Query: 19  PLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALIL 78
           P  QIA     +L L  W KEEE++  RI  KE ++ S + EI QL   +FVF  + L  
Sbjct: 24  PFEQIAGNKALRLRLDFWKKEEEVVNQRIERKEQRLQSAKNEIYQLLGIYFVFQGVILTA 83

Query: 79  LFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLLA 138
           L  A +ND  EK     WIP + + L S+  I A+ YK    S LE+ ++ EK   K L 
Sbjct: 84  L--AQANDLSEK---WRWIPFVLSTLASMATILALVYKLRDCSELERQIKGEKLMSKALF 138

Query: 139 KCVEELKKKGVEFD 152
           +  ++LK  G  FD
Sbjct: 139 QASDDLKSMGSAFD 152


>gi|42407300|dbj|BAD08703.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|42408871|dbj|BAD10130.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 244

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 68/208 (32%), Positives = 101/208 (48%), Gaps = 12/208 (5%)

Query: 8   KPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMF 67
           K +  E    HPL +IA +  H LLLK W +EE  +  R    E  +D+ R++   L   
Sbjct: 37  KAAEAESSGSHPLGEIAGSAGHLLLLKLWQREESRLGRRACALEALMDAARRDAFYLCAA 96

Query: 68  FFVFHSIALILLFNASSN--------DPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDV 119
           F  FH ++L LLF AS +        +     C R W+PS  +L+ SL L  AV+ +   
Sbjct: 97  FLAFHGLSLALLFAASVSASAVSPPAEQRAACCSRWWVPSSLSLVASLALAAAVQLRVCA 156

Query: 120 ESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVD-ALRRAKSLRVEAKAVK---K 175
                + L RE+ D + LA+CV+EL+ KG  FDL KE    + RAK   VE        +
Sbjct: 157 YWRASRRLRRERGDARALARCVQELRMKGAAFDLSKEPQYGVTRAKCASVEGAGAWGPLR 216

Query: 176 WSARDFVTLFFFSVSCLVLALTRIILCN 203
           W  ++ V     +V+   +   + ILC+
Sbjct: 217 WCYQNIVAACLLAVAAATMCSGKFILCS 244


>gi|125562033|gb|EAZ07481.1| hypothetical protein OsI_29740 [Oryza sativa Indica Group]
          Length = 388

 Score = 72.8 bits (177), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 68/208 (32%), Positives = 101/208 (48%), Gaps = 12/208 (5%)

Query: 8   KPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMF 67
           K +  E    HPL +IA +  H LLLK W +EE  +  R    E  +D+ R++   L   
Sbjct: 181 KAAEAESSGSHPLGEIAGSAGHLLLLKLWQREESRLGRRACALEALMDAARRDAFYLCAA 240

Query: 68  FFVFHSIALILLFNASSN--------DPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDV 119
           F  FH ++L LLF AS +        +     C R W+PS  +L+ SL L  AV+ +   
Sbjct: 241 FLAFHGLSLALLFAASVSASAVSPPAEQRAACCSRWWVPSSLSLVASLALAAAVQLRVCA 300

Query: 120 ESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVD-ALRRAKSLRVEAKAV---KK 175
                + L RE+ D + LA+CV+EL+ KG  FDL KE    + RAK   VE        +
Sbjct: 301 YWRASRRLRRERGDARALARCVQELRMKGAAFDLSKEPQYGVTRAKCASVEGAGAWGPLR 360

Query: 176 WSARDFVTLFFFSVSCLVLALTRIILCN 203
           W  ++ V     +V+   +   + ILC+
Sbjct: 361 WCYQNIVAACLLAVAAATMCSGKFILCS 388


>gi|226510179|ref|NP_001144828.1| uncharacterized protein LOC100277912 [Zea mays]
 gi|195647618|gb|ACG43277.1| hypothetical protein [Zea mays]
          Length = 234

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 96/193 (49%), Gaps = 8/193 (4%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +IA +    LLLK W +EE+ +  R    E ++D+ R++   L   F  FH ++L 
Sbjct: 41  HPLGEIAASEGXLLLLKLWQREEDRLGRRACALEARMDAARRDAFYLCAAFLAFHGLSLA 100

Query: 78  LLFNASSNDPVEKS----CRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKED 133
           LLF AS       S    CRR W PS  +L  SL L  AV+ +          L R++ D
Sbjct: 101 LLFAASVVAAAATSPSAACRRWWAPSSLSLAASLALAAAVQLRVGAYWRAAARLRRDRGD 160

Query: 134 GKLLAKCVEELKKKGVEFDLLKEVD-ALRRAKSLRVEAKAVK---KWSARDFVTLFFFSV 189
            + LA+ V+EL+ KG  FDL KE    + RAK   VE   +    +W  ++ VT    +V
Sbjct: 161 ARALARAVQELRLKGAAFDLSKEPQYGVTRAKCASVEGAGLWAPLRWFRQNVVTACLLAV 220

Query: 190 SCLVLALTRIILC 202
           +   L   + ILC
Sbjct: 221 AAAALPSGKFILC 233


>gi|168017445|ref|XP_001761258.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687598|gb|EDQ73980.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 5/148 (3%)

Query: 4   DQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQ 63
           D+G   +       + L +I  TP  + +LK+W +       +I  +E + +  R EI Q
Sbjct: 205 DKGITATPPTESEANRLAEIEATPRLRTILKEWEQNFNYSWGKIENEENKCNRHRSEIHQ 264

Query: 64  LYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHL 123
           L  +F VF  + L  + + SS      SC+  WIP   + L  L  +WAV  K      L
Sbjct: 265 LIGYFSVFQGVVLTAVAHTSS-----LSCQTWWIPFTLSSLACLATLWAVTQKLRSLFFL 319

Query: 124 EKLLEREKEDGKLLAKCVEELKKKGVEF 151
           + +LE++++  ++L   +  LK  G +F
Sbjct: 320 KCILEKDRQQSQVLHLQIVLLKSMGKDF 347


>gi|125562031|gb|EAZ07479.1| hypothetical protein OsI_29737 [Oryza sativa Indica Group]
 gi|125603876|gb|EAZ43201.1| hypothetical protein OsJ_27799 [Oryza sativa Japonica Group]
          Length = 250

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 59/146 (40%), Gaps = 8/146 (5%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +IA +  H LLLK W +EE  +  R    E  +D+ R++   L   F  FH ++L 
Sbjct: 56  HPLGEIAGSAGHLLLLKLWQREESRLGRRACALEALMDAARRDAFYLCAAFLAFHGLSLA 115

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDG--- 134
           LLF AS +     +             CSL      R                       
Sbjct: 116 LLFAASVSASASAAVVSPPAEQRAA-CCSLAGGGGCRRPCRWSRPSRSRRPCSSGSAPGV 174

Query: 135 ----KLLAKCVEELKKKGVEFDLLKE 156
               + LA+CV+EL+ KG  F L KE
Sbjct: 175 RTATRALARCVQELRMKGAAFGLSKE 200


>gi|115477078|ref|NP_001062135.1| Os08g0496400 [Oryza sativa Japonica Group]
 gi|42407299|dbj|BAD08702.1| unknown protein [Oryza sativa Japonica Group]
 gi|42408869|dbj|BAD10128.1| unknown protein [Oryza sativa Japonica Group]
 gi|113624104|dbj|BAF24049.1| Os08g0496400 [Oryza sativa Japonica Group]
 gi|215766869|dbj|BAG99097.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 59/146 (40%), Gaps = 8/146 (5%)

Query: 18  HPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALI 77
           HPL +IA +  H LLLK W +EE  +  R    E  +D+ R++   L   F  FH ++L 
Sbjct: 56  HPLGEIAGSAGHLLLLKLWQREESRLGRRACALEALMDAARRDAFYLCAAFLAFHGLSLA 115

Query: 78  LLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDG--- 134
           LLF AS +     +             CSL      R                       
Sbjct: 116 LLFAASVSASASAAVVSPPAEQR-AACCSLAGGGGCRRPCRWSRPSRSRRPCSSGSAPGV 174

Query: 135 ----KLLAKCVEELKKKGVEFDLLKE 156
               + LA+CV+EL+ KG  F L KE
Sbjct: 175 RTATRALARCVQELRMKGAAFGLSKE 200


>gi|168020769|ref|XP_001762915.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686024|gb|EDQ72416.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 233

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 76/184 (41%), Gaps = 9/184 (4%)

Query: 20  LHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALILL 79
           LH I  +  HK  L+ W  ++ L   R+  K  Q  + + E+ QL  F+ VF  + L  +
Sbjct: 39  LHWIENSERHKKALQFWETQDALCTRRVDSKRKQSAAAKNEVYQLIGFYSVFQGVLLTAV 98

Query: 80  FNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLLAK 139
             A SN      C   + P + + L S+  +  +  K      LEK ++ EK+  ++   
Sbjct: 99  --AQSN---MLHCNHWYFPFILSALASISALVGICLKFRTVRKLEKTIDSEKQSRQIFVN 153

Query: 140 CVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFFSVSCLVLALTRI 199
            +++L+  G +F   ++     +  S R     +   +    V L     S   L     
Sbjct: 154 RIQKLRWLGQDFKFSRDAQDGGKVNSRRNTILNISIVA----VMLLMVGFSGTFLGAIHR 209

Query: 200 ILCN 203
           ILCN
Sbjct: 210 ILCN 213


>gi|168036457|ref|XP_001770723.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677941|gb|EDQ64405.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 64/151 (42%), Gaps = 5/151 (3%)

Query: 1   MADDQGTKPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNRIALKETQIDSVRKE 60
           +AD    K  I        L  I  +  HK  L+ W  ++ L   R+  K  Q  +++ E
Sbjct: 176 VADSDTDKEDITARNQIKILRWIESSTRHKKALEFWEAQDNLCTRRVDNKRKQSLALKNE 235

Query: 61  ITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDVE 120
           + QL  F+ VF     +LL   S ++ +   C     P   +LL S   I  V  K    
Sbjct: 236 VYQLIGFYSVFQG---VLLTAVSQSNLLH--CNNWGFPFSLSLLASCSAIVGVYLKFAAV 290

Query: 121 SHLEKLLEREKEDGKLLAKCVEELKKKGVEF 151
             LEK +E EK   +L    +E L+ KG +F
Sbjct: 291 WELEKTIESEKMSRQLYVNRIERLRSKGKKF 321


>gi|222640800|gb|EEE68932.1| hypothetical protein OsJ_27802 [Oryza sativa Japonica Group]
          Length = 388

 Score = 44.3 bits (103), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 91/208 (43%), Gaps = 12/208 (5%)

Query: 8   KPSIMEPKPPHPLHQIAETPTHKLLLKQWLKEEELILNR-IALKETQIDSVRKEITQLYM 66
           K +  E    HPL +IA +  H LLLK W +EE  +  R  A +        + +  L  
Sbjct: 181 KAAEAESSGSHPLGEIAGSAGHLLLLKLWQREESRLGRRACAARGPHGRGAPRRVLPLRG 240

Query: 67  FFFVFHSIALILLFNASSN-------DPVEKSCRRSWIPSLCTLLCSLGLIWAVRYKSDV 119
              +   +   L   + S        +     C R W+PS  +L+ SL L  AV+ +   
Sbjct: 241 LPRLPRPLPRALFAASVSASAVSPPAEQRAACCSRWWVPSSLSLVASLALAAAVQLRVCA 300

Query: 120 ESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVD-ALRRAKSLRVEAKAV---KK 175
                + L RE+ D + LA+CV+EL+ KG  FDL KE    + RAK   VE        +
Sbjct: 301 YWRASRRLRRERGDARALARCVQELRMKGAAFDLSKEPQYGVTRAKCASVEGAGAWGPLR 360

Query: 176 WSARDFVTLFFFSVSCLVLALTRIILCN 203
           W  ++ V     +V+   +   + ILC+
Sbjct: 361 WCYQNIVAACLLAVAAATMCSGKFILCS 388


>gi|168060834|ref|XP_001782398.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666129|gb|EDQ52792.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 477

 Score = 42.7 bits (99), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 58/122 (47%), Gaps = 5/122 (4%)

Query: 30  KLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALILLFNASSNDPVE 89
           +L L +W    +  ++R+ +K  +  +V+ E+ QL  F+ V   + L     AS+     
Sbjct: 327 RLSLNEWENNFKKAVSRVEVKRERSLNVKNELYQLIEFYSVIQGVVLGAAAQAST----- 381

Query: 90  KSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKLLAKCVEELKKKGV 149
            +C  +W P+L + L S   + +V  K      L+  L++E++D  ++   +  LK  G 
Sbjct: 382 LTCNTAWGPALLSFLASCATVVSVHNKFKDYIILKHELKKEEDDANVVKGKIGLLKLMGS 441

Query: 150 EF 151
            F
Sbjct: 442 NF 443


>gi|168032433|ref|XP_001768723.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680015|gb|EDQ66455.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 220

 Score = 42.7 bits (99), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 5/107 (4%)

Query: 30  KLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALILLFNASSNDPVE 89
           KL L +W    +  L R+ +K  +  +V+ E+ QL  F+ VF  + L  +  AS+     
Sbjct: 30  KLSLDEWENNFKKALYRVEVKRERSLNVKNEVYQLIGFYSVFQGVVLGAVAQAST----- 84

Query: 90  KSCRRSWIPSLCTLLCSLGLIWAVRYKSDVESHLEKLLEREKEDGKL 136
            +C  +W P+L + L S+  + +V  K     + ++ L+ E+ D  +
Sbjct: 85  LTCNTAWGPALLSGLASIATVGSVHNKFKDYINTKQELKNEEVDANV 131


>gi|168058200|ref|XP_001781098.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667495|gb|EDQ54124.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 235

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 27  PTHKLLLKQWLKEEELILNRIALKETQIDSVRKEITQLYMFFFVFHSIALILLFNASSND 86
           P ++  L++W K  ++  + I  K  QI+S++ EI  L  F+ VF  +    +  A  N 
Sbjct: 70  PVYREALEEWEKRSKVFASSIERKTKQIESLKNEIYNLVGFYSVFQGVVFTAV--AQGN- 126

Query: 87  PVEK--SCRRSWIPSLCTLLCSL 107
             EK  +C +  IP   +LL SL
Sbjct: 127 --EKWLTCYKVAIPITLSLLASL 147


>gi|329767088|ref|ZP_08258616.1| hypothetical protein HMPREF0428_00313 [Gemella haemolysans M341]
 gi|328837813|gb|EGF87438.1| hypothetical protein HMPREF0428_00313 [Gemella haemolysans M341]
          Length = 1706

 Score = 38.5 bits (88), Expect = 1.7,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 31/62 (50%)

Query: 128 EREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFF 187
           EREK   +   + ++E+K KGV++ +L E D L    SL       KK  A DFV     
Sbjct: 83  EREKSTKESRERALKEIKDKGVDYKVLFEYDTLLNGFSLETSYADAKKIQAMDFVNSVEV 142

Query: 188 SV 189
           SV
Sbjct: 143 SV 144


>gi|241889590|ref|ZP_04776888.1| C5a peptidase [Gemella haemolysans ATCC 10379]
 gi|241863212|gb|EER67596.1| C5a peptidase [Gemella haemolysans ATCC 10379]
          Length = 1728

 Score = 38.1 bits (87), Expect = 1.9,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 31/62 (50%)

Query: 128 EREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKAVKKWSARDFVTLFFF 187
           EREK   +   + ++E+K KGV++ +L E D L    SL       KK  A DFV     
Sbjct: 83  EREKSTKEPRERALKEIKDKGVDYKVLFEYDTLLNGFSLETSYADAKKIQAMDFVNSVEV 142

Query: 188 SV 189
           SV
Sbjct: 143 SV 144


>gi|302788027|ref|XP_002975783.1| hypothetical protein SELMODRAFT_415821 [Selaginella moellendorffii]
 gi|300156784|gb|EFJ23412.1| hypothetical protein SELMODRAFT_415821 [Selaginella moellendorffii]
          Length = 391

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 46  RIALKETQIDSVRKEITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLC 105
           ++ L + ++D    EI Q+   +F+F  + L +L   SS +    SCR  WIP + +L+ 
Sbjct: 18  QLTLVQKKLDKRSGEIYQVLGMYFIFEGVVLNILAQLSSKN----SCRNWWIPVVVSLIT 73

Query: 106 S-LGLIWAVRYKSDVESHLEKLLERE 130
           S + L+  V  ++D    + KL +R+
Sbjct: 74  SAVTLVSLVVKQNDCIKLMRKLKKRQ 99


>gi|160894868|ref|ZP_02075642.1| hypothetical protein CLOL250_02418 [Clostridium sp. L2-50]
 gi|156863299|gb|EDO56730.1| SNF2 family N-terminal domain protein [Clostridium sp. L2-50]
          Length = 862

 Score = 36.6 bits (83), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 8/69 (11%)

Query: 33  LKQWLKEEELILNRIALKETQI---DSVRKEITQLYMFFFVFHSI--ALILLFNASSNDP 87
           LKQ +++  +  N   +K+  +   +S+R   TQ YMF  + H+     IL FN S+NDP
Sbjct: 391 LKQAVEKAFVFQNEAGIKQRIVVFTESIR---TQQYMFEELSHAGYEGQILKFNGSTNDP 447

Query: 88  VEKSCRRSW 96
           V K   ++W
Sbjct: 448 VTKQIYKAW 456


>gi|210610976|ref|ZP_03288685.1| hypothetical protein CLONEX_00875 [Clostridium nexile DSM 1787]
 gi|210152201|gb|EEA83208.1| hypothetical protein CLONEX_00875 [Clostridium nexile DSM 1787]
          Length = 862

 Score = 36.2 bits (82), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 8/69 (11%)

Query: 33  LKQWLKEEELILNRIALKETQI---DSVRKEITQLYMFFFVFHSI--ALILLFNASSNDP 87
           LKQ +++  +  N   +K+  +   +S+R   TQ YMF  + H+     IL FN S+NDP
Sbjct: 391 LKQAVEKAFVFQNEAGIKQRIVVFTESIR---TQQYMFEELSHAGYEGQILKFNGSTNDP 447

Query: 88  VEKSCRRSW 96
           V K   ++W
Sbjct: 448 VTKQIYKAW 456


>gi|357464519|ref|XP_003602541.1| hypothetical protein MTR_3g095480 [Medicago truncatula]
 gi|355491589|gb|AES72792.1| hypothetical protein MTR_3g095480 [Medicago truncatula]
          Length = 192

 Score = 36.2 bits (82), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 35/64 (54%), Gaps = 7/64 (10%)

Query: 50  KETQIDSVRKEITQLYMFFFVFHSIALILLFNASSNDPVEKSCRRSWIPSLCTLLCSLGL 109
           K+  ID  +KE+ +L+  FF+F    L L+F A+S  P    CR  WIP   TLL    L
Sbjct: 42  KKRAIDLQKKELHKLFQIFFIF----LSLVFMANSMSP-RLQCRHCWIP--ITLLSMAHL 94

Query: 110 IWAV 113
           I+ V
Sbjct: 95  IFYV 98


>gi|325180736|emb|CCA15143.1| exosome complex exonuclease RRP6like protein putativ [Albugo
           laibachii Nc14]
          Length = 401

 Score = 36.2 bits (82), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 36/63 (57%), Gaps = 7/63 (11%)

Query: 113 VRYKSDVESHLEKLLEREKEDGKLLAKCVEELKKKGVEFDLLKEVDALRRAKSLRVEAKA 172
            R + D+  HL  +LE+ + +GK+ AK VE   +K +       +DA+ R KSLR+ AK 
Sbjct: 339 ARGEQDLIEHLRDILEQYEAEGKVYAKQVELANRKAI-------LDAMPRKKSLRLRAKY 391

Query: 173 VKK 175
           + K
Sbjct: 392 LTK 394


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.136    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,183,234,133
Number of Sequences: 23463169
Number of extensions: 122120806
Number of successful extensions: 448220
Number of sequences better than 100.0: 68
Number of HSP's better than 100.0 without gapping: 45
Number of HSP's successfully gapped in prelim test: 23
Number of HSP's that attempted gapping in prelim test: 448104
Number of HSP's gapped (non-prelim): 74
length of query: 203
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 68
effective length of database: 9,191,667,552
effective search space: 625033393536
effective search space used: 625033393536
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)