BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024782
         (262 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225459888|ref|XP_002262923.1| PREDICTED: uncharacterized protein LOC100267182 [Vitis vinifera]
          Length = 259

 Score =  305 bits (782), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 153/210 (72%), Positives = 177/210 (84%), Gaps = 12/210 (5%)

Query: 53  VALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFS 112
           +  VGK +T+LRV+ +            ++E+ +PEDLE ++Q++RVLELLRKNRDM+FS
Sbjct: 62  LVFVGKEETELRVSSD------------QQEDPSPEDLEYVSQIQRVLELLRKNRDMIFS 109

Query: 113 EVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMV 172
           EVKLTIMIED REVERRRLLGIED +APTR+DL EALE+VNEGKVP+NR+ALRMLAEEM 
Sbjct: 110 EVKLTIMIEDQREVERRRLLGIEDPDAPTREDLVEALEEVNEGKVPENRIALRMLAEEMT 169

Query: 173 QWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSA 232
           QWPNLEVEA K+K SKSLYAK TDTG+DP  AAKRLN+DWDSAAEIED    D+ EVP A
Sbjct: 170 QWPNLEVEAPKKKVSKSLYAKATDTGVDPREAAKRLNMDWDSAAEIEDPDVSDETEVPPA 229

Query: 233 LGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           +GYGALYLVTAFPVIIGISVVLILFYNSLQ
Sbjct: 230 VGYGALYLVTAFPVIIGISVVLILFYNSLQ 259


>gi|297734719|emb|CBI16953.3| unnamed protein product [Vitis vinifera]
          Length = 254

 Score =  305 bits (781), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 153/210 (72%), Positives = 177/210 (84%), Gaps = 12/210 (5%)

Query: 53  VALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFS 112
           +  VGK +T+LRV+ +            ++E+ +PEDLE ++Q++RVLELLRKNRDM+FS
Sbjct: 57  LVFVGKEETELRVSSD------------QQEDPSPEDLEYVSQIQRVLELLRKNRDMIFS 104

Query: 113 EVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMV 172
           EVKLTIMIED REVERRRLLGIED +APTR+DL EALE+VNEGKVP+NR+ALRMLAEEM 
Sbjct: 105 EVKLTIMIEDQREVERRRLLGIEDPDAPTREDLVEALEEVNEGKVPENRIALRMLAEEMT 164

Query: 173 QWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSA 232
           QWPNLEVEA K+K SKSLYAK TDTG+DP  AAKRLN+DWDSAAEIED    D+ EVP A
Sbjct: 165 QWPNLEVEAPKKKVSKSLYAKATDTGVDPREAAKRLNMDWDSAAEIEDPDVSDETEVPPA 224

Query: 233 LGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           +GYGALYLVTAFPVIIGISVVLILFYNSLQ
Sbjct: 225 VGYGALYLVTAFPVIIGISVVLILFYNSLQ 254


>gi|147844656|emb|CAN82138.1| hypothetical protein VITISV_019177 [Vitis vinifera]
          Length = 259

 Score =  303 bits (776), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 152/209 (72%), Positives = 175/209 (83%), Gaps = 12/209 (5%)

Query: 54  ALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSE 113
             VGK +T+LRV+ +            ++E+ +PEDLE ++Q++RVLELLRKNRDM+FSE
Sbjct: 63  VFVGKEETELRVSSD------------QQEDPSPEDLEYVSQIQRVLELLRKNRDMIFSE 110

Query: 114 VKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQ 173
           VKLTIMIED REVERRRLLGIED +APTR+DL EALE+VNEGKVP+NR+ALRMLAEEM Q
Sbjct: 111 VKLTIMIEDQREVERRRLLGIEDPDAPTREDLVEALEEVNEGKVPENRIALRMLAEEMTQ 170

Query: 174 WPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSAL 233
           WPN EVEA K+K SKSLYAK TDTG+DP  AAKRLN+DWDSAAEIED    D+ EVP A+
Sbjct: 171 WPNXEVEAPKKKVSKSLYAKATDTGVDPREAAKRLNMDWDSAAEIEDPDVSDETEVPPAV 230

Query: 234 GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           GYGALYLVTAFPVIIGISVVLILFYNSLQ
Sbjct: 231 GYGALYLVTAFPVIIGISVVLILFYNSLQ 259


>gi|449433453|ref|XP_004134512.1| PREDICTED: uncharacterized protein LOC101220256 [Cucumis sativus]
 gi|449515591|ref|XP_004164832.1| PREDICTED: uncharacterized protein LOC101223339 [Cucumis sativus]
          Length = 254

 Score =  299 bits (766), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 163/263 (61%), Positives = 192/263 (73%), Gaps = 13/263 (4%)

Query: 1   MAATFPQMPLA-SPIISRHGDNQINQHHPLFAATSWRASKLCLCCSRRSSQRVVALVGKG 59
           MA      PL+ SP  S   +  I++        S   ++  L      ++R +A VGK 
Sbjct: 4   MALQLRHFPLSTSPSSSIEHNYDISRRFRYLPLPS-HGTRPHLPVPSSRNRRSLACVGKE 62

Query: 60  DTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIM 119
           DT LR         Q     +E+ EA  +DLE I Q++RVLELL+KNRDMLF+EVKLT+M
Sbjct: 63  DTQLR---------QPSSTTDEQPEA--QDLEYIRQIQRVLELLKKNRDMLFNEVKLTVM 111

Query: 120 IEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEV 179
           IEDPREVERRRLLGI++ +APTRDDLA  LE+VNEGK PKNRVAL+MLAEEM  WPNLE 
Sbjct: 112 IEDPREVERRRLLGIDEDDAPTRDDLAATLEEVNEGKFPKNRVALQMLAEEMTNWPNLEA 171

Query: 180 EATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALY 239
           EA K+K SKSLYAK TDTG++P  AAKRLNIDWD+AAEIEDA   DDPEVP+A+GYGALY
Sbjct: 172 EAPKKKRSKSLYAKATDTGVNPREAAKRLNIDWDTAAEIEDADLSDDPEVPAAVGYGALY 231

Query: 240 LVTAFPVIIGISVVLILFYNSLQ 262
           +VTAFPVIIGISVVLILFYNSLQ
Sbjct: 232 IVTAFPVIIGISVVLILFYNSLQ 254


>gi|359806797|ref|NP_001241562.1| uncharacterized protein LOC100816881 [Glycine max]
 gi|255636679|gb|ACU18676.1| unknown [Glycine max]
          Length = 261

 Score =  296 bits (759), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 154/220 (70%), Positives = 185/220 (84%), Gaps = 6/220 (2%)

Query: 46  RRSSQRV--VALVGKGDTDLRVAQENDLQDQD-QEDEEEEEEATPEDLENIAQVKRVLEL 102
           RR+S  +  V  VGK DTDLRV+    LQ+Q   ED++ ++E TP+D+E +AQ+KRVLEL
Sbjct: 45  RRNSNTLSHVPSVGKEDTDLRVSS---LQEQQLGEDDDGDDEPTPQDIEYVAQIKRVLEL 101

Query: 103 LRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRV 162
           LRKNRDMLF EVKLTIMIEDPREVERRRLLGIED + PTR+DL +AL+++NEGK+PK+R 
Sbjct: 102 LRKNRDMLFGEVKLTIMIEDPREVERRRLLGIEDLDGPTREDLVDALDEINEGKIPKDRA 161

Query: 163 ALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAG 222
           +L+MLAEE+  WPNLE    K+KPSKSLYAK TDTGIDPE+AAK+LNIDWDSAAEIE+  
Sbjct: 162 SLQMLAEELASWPNLEDAVPKKKPSKSLYAKATDTGIDPELAAKKLNIDWDSAAEIEETN 221

Query: 223 GGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
             D+ EVPS +GYGALYLV+AFP+IIGISVVLILFYNSLQ
Sbjct: 222 ADDETEVPSVVGYGALYLVSAFPIIIGISVVLILFYNSLQ 261


>gi|356526691|ref|XP_003531950.1| PREDICTED: uncharacterized protein LOC100779639 isoform 1 [Glycine
           max]
          Length = 257

 Score =  292 bits (748), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 156/240 (65%), Positives = 188/240 (78%), Gaps = 6/240 (2%)

Query: 25  QHHPLFAATSWRASKLCLCCSRRSSQRV--VALVGKGDTDLRVAQENDLQDQDQEDEEEE 82
            HHP    T    ++L     RR+S  +  V  VGK DTDLRV+   + Q  D + ++ +
Sbjct: 22  HHHPSVFCTGVAFTRLR---HRRTSNTIFLVPSVGKEDTDLRVSSLQEQQLDDGD-DDND 77

Query: 83  EEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTR 142
           EE TP+DLE +AQ+KRVLELLRKNRDM+F EVKLTIMIED REVERRRLLGIED + PTR
Sbjct: 78  EEPTPQDLEYVAQIKRVLELLRKNRDMIFGEVKLTIMIEDAREVERRRLLGIEDLDGPTR 137

Query: 143 DDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPE 202
           +DL +AL+++NEGK+PK+R AL+MLAEE+  WPNLE    K+KPSKSLYAK TDTGIDPE
Sbjct: 138 EDLVDALDEINEGKIPKDRAALQMLAEELASWPNLEDALPKKKPSKSLYAKATDTGIDPE 197

Query: 203 MAAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           +AAK+LNIDWDSAAEIE+    D+ EVP  +GYGALYLV+AFP+IIGISVVLILFYNSLQ
Sbjct: 198 LAAKKLNIDWDSAAEIEEQNADDETEVPPVVGYGALYLVSAFPIIIGISVVLILFYNSLQ 257


>gi|356526693|ref|XP_003531951.1| PREDICTED: uncharacterized protein LOC100779639 isoform 2 [Glycine
           max]
          Length = 258

 Score =  283 bits (725), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 155/241 (64%), Positives = 187/241 (77%), Gaps = 7/241 (2%)

Query: 25  QHHPLFAATSWRASKLCLCCSRRSSQRV--VALVGKGDTDLRVAQENDLQDQDQEDEEEE 82
            HHP    T    ++L     RR+S  +  V  VGK DTDLRV+   + Q  D + ++ +
Sbjct: 22  HHHPSVFCTGVAFTRLR---HRRTSNTIFLVPSVGKEDTDLRVSSLQEQQLDDGD-DDND 77

Query: 83  EEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTR 142
           EE TP+DLE +AQ+KRVLELLRKNRDM+F EVKLTIMIED REVERRRLLGIED + PTR
Sbjct: 78  EEPTPQDLEYVAQIKRVLELLRKNRDMIFGEVKLTIMIEDAREVERRRLLGIEDLDGPTR 137

Query: 143 DDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPE 202
           +DL +AL+++NEGK+PK+R AL+MLAEE+  WPNLE    K+KPSKSLYAK TDTGIDPE
Sbjct: 138 EDLVDALDEINEGKIPKDRAALQMLAEELASWPNLEDALPKKKPSKSLYAKATDTGIDPE 197

Query: 203 MAAKRLNIDWDSAAEIEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSL 261
           +AAK+LNIDWDSAAEIE+    D+ E+   L GYGALYLV+AFP+IIGISVVLILFYNSL
Sbjct: 198 LAAKKLNIDWDSAAEIEEQNADDETEMFFLLQGYGALYLVSAFPIIIGISVVLILFYNSL 257

Query: 262 Q 262
           Q
Sbjct: 258 Q 258


>gi|224053647|ref|XP_002297910.1| predicted protein [Populus trichocarpa]
 gi|222845168|gb|EEE82715.1| predicted protein [Populus trichocarpa]
          Length = 215

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 135/178 (75%), Positives = 158/178 (88%), Gaps = 1/178 (0%)

Query: 86  TPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDL 145
           +P+DL+ +  +KRVLELLRKNRDM+FSEVKLT+MIEDPREVERRRLLGI+D++ PTR+DL
Sbjct: 38  SPQDLQYVQDIKRVLELLRKNRDMIFSEVKLTVMIEDPREVERRRLLGIDDADTPTREDL 97

Query: 146 AEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAA 205
           AEALEQVNEGK+PK R AL+MLAEEM+ WPNLEVEATK KPSKSLYAK TDTGI+ + AA
Sbjct: 98  AEALEQVNEGKIPKKREALQMLAEEMINWPNLEVEATKTKPSKSLYAKATDTGINLKEAA 157

Query: 206 KRLNIDWDSAAEIEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           +RL IDWDSAAEIEDA   D+ +V + L GYGALYLVT FP+IIG++VVLILFYNSLQ
Sbjct: 158 RRLKIDWDSAAEIEDADMNDETDVVTLLQGYGALYLVTGFPIIIGVAVVLILFYNSLQ 215


>gi|357517361|ref|XP_003628969.1| hypothetical protein MTR_8g070680 [Medicago truncatula]
 gi|355522991|gb|AET03445.1| hypothetical protein MTR_8g070680 [Medicago truncatula]
 gi|388514717|gb|AFK45420.1| unknown [Medicago truncatula]
          Length = 267

 Score =  276 bits (706), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 159/239 (66%), Positives = 186/239 (77%), Gaps = 7/239 (2%)

Query: 27  HPLFA-ATSWRASKLCLCCSRRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEA 85
           H LF   TS+      L  + R     V   G  D DL V+    L D D  D++EEEE 
Sbjct: 33  HSLFTNGTSFTK----LSTTHRRILSFVPSAGNEDADLSVSSTQQLDDDDVVDDDEEEEE 88

Query: 86  -TPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDD 144
            TP+DLEN+A++KRVLELL++NRDMLF EVKLTIMIEDPR+VER+RLLGIED +APTR+D
Sbjct: 89  PTPQDLENVAEIKRVLELLKRNRDMLFGEVKLTIMIEDPRDVERKRLLGIEDLDAPTRED 148

Query: 145 LAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVE-ATKQKPSKSLYAKVTDTGIDPEM 203
           L  ALE+VNEGK+PK++  L+MLAEEM  WPNLE E + K+KP KSLYAK TDTGIDP++
Sbjct: 149 LVAALEEVNEGKIPKDKAVLQMLAEEMTAWPNLEAELSPKKKPKKSLYAKTTDTGIDPQV 208

Query: 204 AAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           AAKRLNIDWDSAAEIE+A   D+ EVPS LGYGALYLV+AFPVIIGISVVLILFYNSLQ
Sbjct: 209 AAKRLNIDWDSAAEIEEAKTDDETEVPSVLGYGALYLVSAFPVIIGISVVLILFYNSLQ 267


>gi|21554237|gb|AAM63312.1| unknown [Arabidopsis thaliana]
          Length = 280

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 151/286 (52%), Positives = 199/286 (69%), Gaps = 30/286 (10%)

Query: 1   MAATFPQMPLASPIISRHGDNQINQHHPLFAATSWRASKLCLCCS-----------RRSS 49
           M     Q+PL     S     Q N     + A+S   S + +C S           R+++
Sbjct: 1   MTTQIFQLPLKYCASSFSSTGQRN-----YGASS-SPSPIVICKSNGISDGLWVKRRKNN 54

Query: 50  QRVVALV---GKGD-TDLRVAQENDLQDQDQEDEEEEEEAT--------PEDLENIAQVK 97
           +R  +L+    KGD T++RV+    L+ Q++E +  ++E          PEDL+ + ++K
Sbjct: 55  RRFGSLIVKQEKGDVTEIRVSVPLTLEQQEKEKQNRDDEEDEIDEGDVDPEDLKYVNEIK 114

Query: 98  RVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKV 157
           RV+ELLR+NRDM+FSEVKLTIMIEDPRE+ERRRLLGIED++ P+RDDLAEALEQVN+GK+
Sbjct: 115 RVIELLRRNRDMIFSEVKLTIMIEDPRELERRRLLGIEDADTPSRDDLAEALEQVNDGKI 174

Query: 158 PKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAE 217
           PK+R  LRML EEM++WPNLEVE +K++  KS+YAK TDTGIDP+ AAKRLN++WDSAA 
Sbjct: 175 PKDRATLRMLHEEMIRWPNLEVEVSKKQRGKSMYAKSTDTGIDPKEAAKRLNVEWDSAAA 234

Query: 218 IEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           IE+    D+  V + + GYGALY V+A PVIIGISVVLILFYNSLQ
Sbjct: 235 IEEVDVDDEQGVVTKVAGYGALYFVSALPVIIGISVVLILFYNSLQ 280


>gi|18422466|ref|NP_568637.1| uncharacterized protein [Arabidopsis thaliana]
 gi|8953763|dbj|BAA98118.1| unnamed protein product [Arabidopsis thaliana]
 gi|110740965|dbj|BAE98577.1| hypothetical protein [Arabidopsis thaliana]
 gi|332007760|gb|AED95143.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 280

 Score =  267 bits (683), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 151/286 (52%), Positives = 198/286 (69%), Gaps = 30/286 (10%)

Query: 1   MAATFPQMPLASPIISRHGDNQINQHHPLFAATSWRASKLCLCCS-----------RRSS 49
           M     Q+PL     S     Q N     + A+S   S + +C S           R+++
Sbjct: 1   MTTQIFQLPLKYCASSFSSTGQRN-----YGASS-SPSPIVICKSNGISDGLWVKRRKNN 54

Query: 50  QRVVALV---GKGD-TDLRVAQENDLQDQDQEDEEEEEEAT--------PEDLENIAQVK 97
           +R  +L+    KGD T++RV     L+ Q++E +  ++E          PEDL+ + ++K
Sbjct: 55  RRFGSLIVKQEKGDVTEIRVPVPLTLEQQEKEKQNRDDEEDEIDEGDVDPEDLKYVNEIK 114

Query: 98  RVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKV 157
           RV+ELLR+NRDM+FSEVKLTIMIEDPRE+ERRRLLGIED++ P+RDDLAEALEQVN+GK+
Sbjct: 115 RVIELLRRNRDMIFSEVKLTIMIEDPRELERRRLLGIEDADTPSRDDLAEALEQVNDGKI 174

Query: 158 PKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAE 217
           PK+R  LRML EEM++WPNLEVE +K++  KS+YAK TDTGIDP+ AAKRLN++WDSAA 
Sbjct: 175 PKDRATLRMLHEEMIRWPNLEVEVSKKQRGKSMYAKSTDTGIDPKEAAKRLNVEWDSAAA 234

Query: 218 IEDAGGGDDPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           IE+    D+  V + + GYGALY V+A PVIIGISVVLILFYNSLQ
Sbjct: 235 IEEVDVDDEQGVVTKVAGYGALYFVSALPVIIGISVVLILFYNSLQ 280


>gi|255586744|ref|XP_002533992.1| conserved hypothetical protein [Ricinus communis]
 gi|223526012|gb|EEF28389.1| conserved hypothetical protein [Ricinus communis]
          Length = 269

 Score =  267 bits (682), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 158/264 (59%), Positives = 193/264 (73%), Gaps = 17/264 (6%)

Query: 1   MAATFPQMPLASPIISRHGDNQINQHHPLF----------AATSWRASKLCLC-----CS 45
           MA    Q+P+ASP+      +     H LF          + +  R +  CL       S
Sbjct: 1   MALQIYQLPVASPLSVSSSSSSSLHTHFLFLYHNHQQQFQSCSRIRLASFCLHQNTSNVS 60

Query: 46  RRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEA-TPEDLENIAQVKRVLELLR 104
            R+ +  + LVGK D ++RV  E + +++D+ED+EEE    +P+DL+ I ++K+VL+LL+
Sbjct: 61  WRNRKNGL-LVGKEDLEMRVQDEENEEEEDEEDDEEEAPPPSPQDLQYIQEIKKVLDLLK 119

Query: 105 KNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVAL 164
           +NRDMLF+EVKLTIMIEDPREVERRRLLGIED +APTR+DLAEALEQVNEGKVPKNRVAL
Sbjct: 120 RNRDMLFNEVKLTIMIEDPREVERRRLLGIEDLDAPTREDLAEALEQVNEGKVPKNRVAL 179

Query: 165 RMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGG 224
           ++LAEEM+ WPNLEVEATK  P KSLYAK TDTGIDP  AAKRL IDWDSAAEIED    
Sbjct: 180 QILAEEMMNWPNLEVEATKTAPKKSLYAKATDTGIDPREAAKRLKIDWDSAAEIEDVDSS 239

Query: 225 DDPEVPSALGYGALYLVTAFPVII 248
           D+ EVP A+GYGALYLVTAFPVII
Sbjct: 240 DETEVPPAVGYGALYLVTAFPVII 263


>gi|297794895|ref|XP_002865332.1| hypothetical protein ARALYDRAFT_494519 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311167|gb|EFH41591.1| hypothetical protein ARALYDRAFT_494519 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 278

 Score =  262 bits (669), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 150/278 (53%), Positives = 190/278 (68%), Gaps = 16/278 (5%)

Query: 1   MAATFPQMPLASPIISRHGDNQINQH-----HPLFAATSWRASKLCLCCSRRSSQRVVAL 55
           M     Q+PL   + S     Q N        P     S   S       RR +QR  ++
Sbjct: 1   MVTQIFQLPLQYCVSSFSSAGQRNYGVSSSPTPFVICKSNGISDDLWVKRRRKNQRFGSV 60

Query: 56  V---GKGD-TDLRVAQENDLQDQDQEDEEEEEEATPED------LENIAQVKRVLELLRK 105
           +    KGD T++RV     L+ Q++E +  ++E   E       L+ + ++KRVLELLR+
Sbjct: 61  IVKQEKGDVTEIRVPVPLTLEQQEKEKQNRDDEEDEEGEVDPEDLKYVNEIKRVLELLRR 120

Query: 106 NRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALR 165
           NRDM+FSEVKLTIMIEDPREVERRRLLGIED++ P+R+DLAEALEQVN+GK+PK+R  LR
Sbjct: 121 NRDMIFSEVKLTIMIEDPREVERRRLLGIEDADTPSREDLAEALEQVNDGKIPKDRATLR 180

Query: 166 MLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGD 225
           ML EEM++WPNLEVE + ++  KS+YAK TDTGIDP+ AAKRLNI+WDSAA IE+A   D
Sbjct: 181 MLHEEMIRWPNLEVEVSNKQRGKSMYAKSTDTGIDPKEAAKRLNIEWDSAAAIEEADVDD 240

Query: 226 DPEVPSAL-GYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           +  V + + GYGALY V+A PVIIGISVVLILFYNSLQ
Sbjct: 241 EQGVVTKVAGYGALYFVSALPVIIGISVVLILFYNSLQ 278


>gi|218189215|gb|EEC71642.1| hypothetical protein OsI_04075 [Oryza sativa Indica Group]
          Length = 292

 Score =  253 bits (646), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 143/217 (65%), Positives = 173/217 (79%), Gaps = 5/217 (2%)

Query: 49  SQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRD 108
           S   VA  G+GD++   +  +D    D     E+ EA+PEDLE+I QVKRVLELL+KNRD
Sbjct: 78  SDTAVAPEGEGDSEPPAS--SDGAAGDIASSAEQPEASPEDLEDIRQVKRVLELLQKNRD 135

Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
           M F EVKLTIMIEDPR++ER+RLLGIED +  TRDDLA+AL +VNEG++P+NRVAL++LA
Sbjct: 136 MTFGEVKLTIMIEDPRDIERKRLLGIEDPDEITRDDLADALVEVNEGRIPENRVALQLLA 195

Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
           +EM +WP+LE+EA   K KP KS+YAK TDTGIDPE AAKRLNIDWDSAA   ++    D
Sbjct: 196 KEMTEWPDLEMEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNIDWDSAADLDDEEEEDD 255

Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           + EVPSA+GY ALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 256 ETEVPSAVGYSALYLLTAFPVIIGISVVLILFYNSLQ 292


>gi|115440489|ref|NP_001044524.1| Os01g0798500 [Oryza sativa Japonica Group]
 gi|55297680|dbj|BAD68270.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534055|dbj|BAF06438.1| Os01g0798500 [Oryza sativa Japonica Group]
 gi|215686962|dbj|BAG90808.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222619394|gb|EEE55526.1| hypothetical protein OsJ_03754 [Oryza sativa Japonica Group]
          Length = 292

 Score =  253 bits (645), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 143/217 (65%), Positives = 173/217 (79%), Gaps = 5/217 (2%)

Query: 49  SQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRD 108
           S   VA  G+GD++   +  +D    D     E+ EA+PEDLE+I QVKRVLELL+KNRD
Sbjct: 78  SDTAVAPEGEGDSEPPAS--SDGAAGDIAASAEQPEASPEDLEDIRQVKRVLELLQKNRD 135

Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
           M F EVKLTIMIEDPR++ER+RLLGIED +  TRDDLA+AL +VNEG++P+NRVAL++LA
Sbjct: 136 MTFGEVKLTIMIEDPRDIERKRLLGIEDPDEITRDDLADALVEVNEGRIPENRVALQLLA 195

Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
           +EM +WP+LE+EA   K KP KS+YAK TDTGIDPE AAKRLNIDWDSAA   ++    D
Sbjct: 196 KEMTEWPDLEMEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNIDWDSAADLDDEEEEDD 255

Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           + EVPSA+GY ALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 256 ETEVPSAVGYSALYLLTAFPVIIGISVVLILFYNSLQ 292


>gi|414880128|tpg|DAA57259.1| TPA: hypothetical protein ZEAMMB73_612794 [Zea mays]
          Length = 294

 Score =  248 bits (634), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 132/182 (72%), Positives = 157/182 (86%), Gaps = 3/182 (1%)

Query: 84  EATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRD 143
           EA+PEDLENI ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED +  TRD
Sbjct: 113 EASPEDLENIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRD 172

Query: 144 DLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDP 201
           DLA+AL +VNEGK+P+NRVAL++LA+EM +WP++E+EA   K KP KS+YAK TDTGIDP
Sbjct: 173 DLADALAEVNEGKIPENRVALQLLAKEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDP 232

Query: 202 EMAAKRLNIDWDSAA-EIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNS 260
           E AAKRLN+DWDSAA    +    D+ EVPSA+GYGALYL+TAFPVIIGISVVLILFYNS
Sbjct: 233 ETAAKRLNLDWDSAADIDGEEEDDDETEVPSAVGYGALYLLTAFPVIIGISVVLILFYNS 292

Query: 261 LQ 262
           LQ
Sbjct: 293 LQ 294


>gi|226498212|ref|NP_001142698.1| uncharacterized protein LOC100275005 [Zea mays]
 gi|195608354|gb|ACG26007.1| hypothetical protein [Zea mays]
          Length = 291

 Score =  248 bits (634), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 132/182 (72%), Positives = 157/182 (86%), Gaps = 3/182 (1%)

Query: 84  EATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRD 143
           EA+PEDLENI ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED +  TRD
Sbjct: 110 EASPEDLENIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRD 169

Query: 144 DLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDP 201
           DLA+AL +VNEGK+P+NRVAL++LA+EM +WP++E+EA   K KP KS+YAK TDTGIDP
Sbjct: 170 DLADALAEVNEGKIPENRVALQLLAKEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDP 229

Query: 202 EMAAKRLNIDWDSAA-EIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNS 260
           E AAKRLN+DWDSAA    +    D+ EVPSA+GYGALYL+TAFPVIIGISVVLILFYNS
Sbjct: 230 ETAAKRLNLDWDSAADIDGEEEDDDETEVPSAVGYGALYLLTAFPVIIGISVVLILFYNS 289

Query: 261 LQ 262
           LQ
Sbjct: 290 LQ 291


>gi|357125434|ref|XP_003564399.1| PREDICTED: uncharacterized protein LOC100835197 [Brachypodium
           distachyon]
          Length = 290

 Score =  240 bits (613), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 128/185 (69%), Positives = 155/185 (83%), Gaps = 3/185 (1%)

Query: 81  EEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAP 140
           E  EA+PEDLE+I +VKRVLELL+KNRDM F EVKLTIMIEDPR+VER+RLLGIED +  
Sbjct: 106 EPAEASPEDLEDIREVKRVLELLQKNRDMTFGEVKLTIMIEDPRDVERKRLLGIEDPDEL 165

Query: 141 TRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKP--SKSLYAKVTDTG 198
           TRDDLA+AL +VNEG++P+NR ALR+LA+EM +WP++E+EA K K    KS+YAK TDTG
Sbjct: 166 TRDDLADALVEVNEGRIPENRDALRLLAKEMAEWPDVEIEAPKSKGMFGKSIYAKATDTG 225

Query: 199 IDPEMAAKRLNIDW-DSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILF 257
           IDP  AAKRLNIDW  +A   ++A   D+ EVPS +GYGALYL++AFPVIIG+SVVLILF
Sbjct: 226 IDPVAAAKRLNIDWDSAADIDDEADDDDEAEVPSVVGYGALYLLSAFPVIIGVSVVLILF 285

Query: 258 YNSLQ 262
           YNSLQ
Sbjct: 286 YNSLQ 290


>gi|255646421|gb|ACU23689.1| unknown [Glycine max]
          Length = 233

 Score =  226 bits (577), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 125/198 (63%), Positives = 154/198 (77%), Gaps = 6/198 (3%)

Query: 25  QHHPLFAATSWRASKLCLCCSRRSSQRV--VALVGKGDTDLRVAQENDLQDQDQEDEEEE 82
            HHP  + T    ++L     RR+S  +  V  VGK DTDLRV+   + Q  D + ++ +
Sbjct: 22  HHHPSVSCTGVAFTRLR---HRRTSNTIFLVPSVGKEDTDLRVSSLQEQQLDDGD-DDND 77

Query: 83  EEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTR 142
           EE TP+DLE +AQ+KRVLELLRKNRDM+F EVKLTIMIED REVERRRLLGIED + PTR
Sbjct: 78  EEPTPQDLEYVAQIKRVLELLRKNRDMIFGEVKLTIMIEDAREVERRRLLGIEDLDGPTR 137

Query: 143 DDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPE 202
           +DL +AL+++NEGK+PK+R AL+MLAEE+  WPNLE    K+KPSKSLYAK TDTGIDPE
Sbjct: 138 EDLVDALDEINEGKIPKDRAALQMLAEELTSWPNLEDALPKKKPSKSLYAKATDTGIDPE 197

Query: 203 MAAKRLNIDWDSAAEIED 220
           +AAK+LNIDWDSAAEIE+
Sbjct: 198 LAAKKLNIDWDSAAEIEE 215


>gi|414880129|tpg|DAA57260.1| TPA: hypothetical protein ZEAMMB73_612794 [Zea mays]
          Length = 530

 Score =  217 bits (553), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 115/165 (69%), Positives = 140/165 (84%), Gaps = 3/165 (1%)

Query: 84  EATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRD 143
           EA+PEDLENI ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED +  TRD
Sbjct: 113 EASPEDLENIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRD 172

Query: 144 DLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDP 201
           DLA+AL +VNEGK+P+NRVAL++LA+EM +WP++E+EA   K KP KS+YAK TDTGIDP
Sbjct: 173 DLADALAEVNEGKIPENRVALQLLAKEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDP 232

Query: 202 EMAAKRLNIDWDSAA-EIEDAGGGDDPEVPSALGYGALYLVTAFP 245
           E AAKRLN+DWDSAA    +    D+ EVPSA+GYGALYL+TAFP
Sbjct: 233 ETAAKRLNLDWDSAADIDGEEEDDDETEVPSAVGYGALYLLTAFP 277


>gi|55297679|dbj|BAD68269.1| unknown protein [Oryza sativa Japonica Group]
          Length = 270

 Score =  207 bits (527), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/217 (58%), Positives = 154/217 (70%), Gaps = 27/217 (12%)

Query: 49  SQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRD 108
           S   VA  G+GD++   +  +D    D     E+ EA+PEDLE                 
Sbjct: 78  SDTAVAPEGEGDSEPPAS--SDGAAGDIAASAEQPEASPEDLE----------------- 118

Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
                VKLTIMIEDPR++ER+RLLGIED +  TRDDLA+AL +VNEG++P+NRVAL++LA
Sbjct: 119 -----VKLTIMIEDPRDIERKRLLGIEDPDEITRDDLADALVEVNEGRIPENRVALQLLA 173

Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
           +EM +WP+LE+EA   K KP KS+YAK TDTGIDPE AAKRLNIDWDSAA   ++    D
Sbjct: 174 KEMTEWPDLEMEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNIDWDSAADLDDEEEEDD 233

Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           + EVPSA+GY ALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 234 ETEVPSAVGYSALYLLTAFPVIIGISVVLILFYNSLQ 270


>gi|414880127|tpg|DAA57258.1| TPA: hypothetical protein ZEAMMB73_612794 [Zea mays]
          Length = 157

 Score =  207 bits (526), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 112/157 (71%), Positives = 133/157 (84%), Gaps = 3/157 (1%)

Query: 109 MLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRMLA 168
           M F EVKLTIMIEDPR++ER+R LGIED +  TRDDLA+AL +VNEGK+P+NRVAL++LA
Sbjct: 1   MTFGEVKLTIMIEDPRDIERKRTLGIEDPDEITRDDLADALAEVNEGKIPENRVALQLLA 60

Query: 169 EEMVQWPNLEVEA--TKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAA-EIEDAGGGD 225
           +EM +WP++E+EA   K KP KS+YAK TDTGIDPE AAKRLN+DWDSAA    +    D
Sbjct: 61  KEMAEWPDIEIEAPKKKSKPGKSVYAKATDTGIDPETAAKRLNLDWDSAADIDGEEEDDD 120

Query: 226 DPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           + EVPSA+GYGALYL+TAFPVIIGISVVLILFYNSLQ
Sbjct: 121 ETEVPSAVGYGALYLLTAFPVIIGISVVLILFYNSLQ 157


>gi|242054671|ref|XP_002456481.1| hypothetical protein SORBIDRAFT_03g037100 [Sorghum bicolor]
 gi|241928456|gb|EES01601.1| hypothetical protein SORBIDRAFT_03g037100 [Sorghum bicolor]
          Length = 460

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/139 (69%), Positives = 121/139 (87%), Gaps = 2/139 (1%)

Query: 80  EEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNA 139
           +E  EA+PEDLE+I ++KRVLELL+KNRDM F EVKLTIMIEDPR++ER+R LGIED + 
Sbjct: 86  DEPAEASPEDLESIREIKRVLELLKKNRDMTFGEVKLTIMIEDPRDIERKRTLGIEDPDE 145

Query: 140 PTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEA--TKQKPSKSLYAKVTDT 197
            TRDDLA+AL +VNEG++P+NRVAL++LA+EM +WP++E+EA   K KP KS+YAK TDT
Sbjct: 146 ITRDDLADALAEVNEGRIPENRVALQLLAKEMSEWPDIEIEAPKKKSKPGKSVYAKATDT 205

Query: 198 GIDPEMAAKRLNIDWDSAA 216
           GIDPE AAKRLN+DWDSAA
Sbjct: 206 GIDPETAAKRLNLDWDSAA 224


>gi|168045661|ref|XP_001775295.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673376|gb|EDQ59900.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 172

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 95/174 (54%), Positives = 132/174 (75%), Gaps = 9/174 (5%)

Query: 95  QVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQV-N 153
           +V +VL+LL+K RDM F+EV+LTIMIEDPREVERRR LGIED    +++D+  AL++V  
Sbjct: 2   RVLQVLDLLKKKRDMTFNEVRLTIMIEDPREVERRRQLGIEDERGCSKEDMGIALQEVVY 61

Query: 154 EGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWD 213
           EG++P++R+ LR L +EM+ WPNLE E ++  P  S YAKVT TG+DP++AA+R  +DWD
Sbjct: 62  EGRLPEDRLVLRELTKEMLAWPNLEDEISEVNPLASPYAKVTPTGVDPKVAAQRAKVDWD 121

Query: 214 SAAEIEDAGGGDDPE-----VPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           +AAEI+    G++P+     VP  +G+  LYLV+  PVII ++VVLILF+NSLQ
Sbjct: 122 AAAEIQ---PGEEPKDLSDMVPPVVGFSFLYLVSFIPVIIVVAVVLILFFNSLQ 172


>gi|311223468|gb|ADP76805.1| Y3IP1 [Nicotiana tabacum]
          Length = 92

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 72/91 (79%), Positives = 81/91 (89%), Gaps = 1/91 (1%)

Query: 173 QWPNLEVEATKQ-KPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPS 231
            WPNLEVEATKQ KP +SLYAK TDTGIDP+ AAKRL IDWDSAAEI+++   D+P+VP 
Sbjct: 2   SWPNLEVEATKQNKPGRSLYAKATDTGIDPKEAAKRLKIDWDSAAEIDESAESDEPDVPP 61

Query: 232 ALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           ALGYGALYLV+AFP+IIGISVVLILFYNSLQ
Sbjct: 62  ALGYGALYLVSAFPIIIGISVVLILFYNSLQ 92


>gi|307111552|gb|EFN59786.1| hypothetical protein CHLNCDRAFT_133479 [Chlorella variabilis]
          Length = 231

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 77/193 (39%), Positives = 109/193 (56%), Gaps = 13/193 (6%)

Query: 78  DEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDS 137
           DE  EEEA         +V  +L  L++ RDM  +EV+LT+ IEDPR  ERR L+G+EDS
Sbjct: 44  DEVSEEEAK----RMCREVVGILRTLKEQRDMSLNEVRLTVAIEDPRARERR-LMGMEDS 98

Query: 138 NAPTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSLYAKVTDT 197
           +  +R+++A AL +V+EG++P +R+ALR L +EM QWP L+  A  +   K+  A     
Sbjct: 99  SGISREEMAAALMEVSEGRIPADRIALRELCKEMSQWPALDEPARVRPWLKTPAAAEGGA 158

Query: 198 GIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSAL--------GYGALYLVTAFPVIIG 249
           G     AA+     W    E+    G D  E P  +        GYGALYLV+  PV+I 
Sbjct: 159 GSALTPAAEDAAAAWRQQGEVRPPMGRDPTEQPRGMADQLPDWVGYGALYLVSIAPVLIA 218

Query: 250 ISVVLILFYNSLQ 262
           I  + +LF NSL+
Sbjct: 219 IGAITVLFLNSLR 231


>gi|159474404|ref|XP_001695315.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158275798|gb|EDP01573.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 268

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 88/261 (33%), Positives = 125/261 (47%), Gaps = 43/261 (16%)

Query: 34  SWRASKLCLCCSRRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENI 93
           S R  +     +RR   R V    + + DL V  E  ++ Q  E E       P+DL  I
Sbjct: 19  SARVCRRAFAGARRPLPRPVRATSE-EVDLEVQVEQFMKRQ-AEIESGAAFVRPKDLATI 76

Query: 94  ---------------AQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSN 138
                          A +  VL+ L++ RDM F+EVKL + IEDPR  ERR    IED  
Sbjct: 77  IGGDVVSEEMAQRYCADIFEVLKTLKRTRDMSFNEVKLIVSIEDPRARERR-AQDIEDER 135

Query: 139 APTRDDLAEALEQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSL-------- 190
             +RD++A+AL +V EG+VP +R+AL+ L +EM+ WP LEV        K          
Sbjct: 136 GVSRDEMAQALIEVGEGRVPNDRIALKCLHDEMLGWPFLEVATEAPAAPKGTVAAAPAAP 195

Query: 191 -------YAKVT--DTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSALGYGALYLV 241
                  YA +   DT + P +    L  + +    + D        +P  +GYG LY V
Sbjct: 196 AKPSASDYASLMGGDTVVKPYVMGDNLR-EGEKPQNLTDM-------LPGWVGYGVLYGV 247

Query: 242 TAFPVIIGISVVLILFYNSLQ 262
           +A PV++ I  +LILFYNSL+
Sbjct: 248 SAIPVLLVIGTILILFYNSLK 268


>gi|308802908|ref|XP_003078767.1| unnamed protein product [Ostreococcus tauri]
 gi|116057220|emb|CAL51647.1| unnamed protein product [Ostreococcus tauri]
          Length = 226

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 92/172 (53%), Gaps = 10/172 (5%)

Query: 93  IAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQV 152
           + ++K  L+LL   R+M F+EVKLTI IEDPR  E+R   GIED +  + ++  E L+ +
Sbjct: 63  VVEIKDWLKLLYVKREMSFNEVKLTIGIEDPRLAEQRERYGIEDESGVSAEEKMETLDMI 122

Query: 153 NEGKVPKNRVALRMLAEEMVQWPNL--EVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNI 210
             G+ P +  A+  + EE   WP L  E++ T+     S Y +++      + A  R   
Sbjct: 123 ERGERPTDARAVATVLEEFRNWPGLDAEIDTTQYDEGPSRYEEISMRAAGIKTANPRRVR 182

Query: 211 DWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
             +  AE +  G        +  G+  LYLV+A P+ I I+ V I+F+NSLQ
Sbjct: 183 AKEEPAEEQKEG--------TIFGFLPLYLVSAVPIFITIAAVSIMFFNSLQ 226


>gi|311223466|gb|ADP76804.1| Y3IP1 [Nicotiana tabacum]
          Length = 65

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 45/62 (72%), Positives = 53/62 (85%), Gaps = 1/62 (1%)

Query: 174 WPNLEVEATKQ-KPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSA 232
           WPNLEVEATKQ KP +SLYAK TDTGI+P+ AAKRL IDWDSAAEI+++   D+P+VP A
Sbjct: 3   WPNLEVEATKQNKPGRSLYAKATDTGINPKEAAKRLKIDWDSAAEIDESAESDEPDVPPA 62

Query: 233 LG 234
           LG
Sbjct: 63  LG 64


>gi|384251217|gb|EIE24695.1| hypothetical protein COCSUDRAFT_62118 [Coccomyxa subellipsoidea
           C-169]
          Length = 1671

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/142 (42%), Positives = 80/142 (56%), Gaps = 10/142 (7%)

Query: 95  QVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNE 154
           +V RVL+ L++ RDM  SEVKLT+ IED    E++  +GIED    +RD++A AL +V +
Sbjct: 66  EVVRVLKELKETRDMSVSEVKLTLAIEDASVREQKEYMGIEDERGASRDEMAAALVEVAD 125

Query: 155 GKVPKNRVALRMLAEEMVQWP---NLEVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNID 211
           G+VPK+R+ALR L +EM+QWP   N E  A     S S Y  +TDTG       K     
Sbjct: 126 GRVPKDRIALRELYKEMIQWPFVGNAEAAAPTAS-SSSPYEAITDTGT------KSAAAG 178

Query: 212 WDSAAEIEDAGGGDDPEVPSAL 233
           W  A       G D  E P +L
Sbjct: 179 WRPAQARPKMMGRDKSEAPQSL 200


>gi|412985941|emb|CCO17141.1| predicted protein [Bathycoccus prasinos]
          Length = 290

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 94/174 (54%), Gaps = 14/174 (8%)

Query: 95  QVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNE 154
           ++K  L+LL   R+M F+EVKL I IEDPR  +RR   GIED +  + ++  + LE V+E
Sbjct: 125 EIKDWLKLLYVKREMSFNEVKLIIGIEDPRLADRRERFGIEDESGCSAEEKIQVLEDVDE 184

Query: 155 GKVPKNRVALRMLAEEMVQWPNLEV-----EATKQKPSKSLYAKVTDTGIDPEM-AAKRL 208
           G+VP + VA   +  E+ +WP L+      E  +  PS+     +   GI       KR+
Sbjct: 185 GRVPTDIVACEFVLNELREWPGLDSDIELPERRETGPSRYEEIAMDAAGIKGRTRNPKRM 244

Query: 209 NIDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
             D+     +E+    D PE  + LG+  LY V++ P+ I I  + I+F NSLQ
Sbjct: 245 LDDY-----LEE---DDKPEEKNLLGFLPLYAVSSAPIFITIFALGIMFVNSLQ 290


>gi|145345491|ref|XP_001417242.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577469|gb|ABO95535.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 163

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 55/173 (31%), Positives = 87/173 (50%), Gaps = 14/173 (8%)

Query: 93  IAQVKRVLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQV 152
           + ++K  L+LL   R+M F+E+KLT+ IEDPR  E R   GIED +  + D+  E LE +
Sbjct: 2   VTEIKDWLKLLYVKREMSFNEIKLTVGIEDPRLAENRERYGIEDESGVSADEKVETLEMI 61

Query: 153 NEGKVPKNRVALRMLAEEMVQWPNLEV---EATKQKPSKSLYAKVTDTGIDPEMAAKRLN 209
             G+ P +  A+  L EE   WP L+    ++    PS+             E+A +   
Sbjct: 62  ERGETPTDARAVSTLLEEFRNWPGLDAVVDQSVDAGPSRY-----------EEIAMRSAG 110

Query: 210 IDWDSAAEIEDAGGGDDPEVPSALGYGALYLVTAFPVIIGISVVLILFYNSLQ 262
           I   +   +      ++ +  +  G+  LYLV+A P+ I +  V I+F NSLQ
Sbjct: 111 IKTSNPRRVRAKEAEEEKKEGTIFGFLPLYLVSAVPIFITVFAVGIMFVNSLQ 163


>gi|302854206|ref|XP_002958613.1| hypothetical protein VOLCADRAFT_121729 [Volvox carteri f.
           nagariensis]
 gi|300256074|gb|EFJ40350.1| hypothetical protein VOLCADRAFT_121729 [Volvox carteri f.
           nagariensis]
          Length = 196

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 57/81 (70%), Gaps = 1/81 (1%)

Query: 99  VLELLRKNRDMLFSEVKLTIMIEDPREVERRRLLGIEDSNAPTRDDLAEALEQVNEGKVP 158
           VL+ L++ RDM F+EVKL + IEDPR  ERR    +ED    +RD++A AL  + EG++P
Sbjct: 95  VLKTLKRTRDMSFNEVKLIVSIEDPRARERR-AADMEDERGVSRDEMAAALVDIGEGRIP 153

Query: 159 KNRVALRMLAEEMVQWPNLEV 179
            +R+AL+ L +EM+ WP LEV
Sbjct: 154 NDRIALKCLHDEMLNWPFLEV 174


>gi|224075389|ref|XP_002304613.1| predicted protein [Populus trichocarpa]
 gi|222842045|gb|EEE79592.1| predicted protein [Populus trichocarpa]
          Length = 84

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/57 (59%), Positives = 39/57 (68%)

Query: 178 EVEATKQKPSKSLYAKVTDTGIDPEMAAKRLNIDWDSAAEIEDAGGGDDPEVPSALG 234
           EVEA    PSK LYAK  DTG DP+ AAKRL I+WDSAAEIE  G  D+P +  +L 
Sbjct: 27  EVEAPVTNPSKFLYAKAIDTGKDPKKAAKRLKIEWDSAAEIEYTGLNDEPAMVCSLS 83


>gi|386812234|ref|ZP_10099459.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386404504|dbj|GAB62340.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 439

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 60/125 (48%), Gaps = 19/125 (15%)

Query: 59  GDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVKLTI 118
           G+T+L +A    +  +D  + E           N+ ++K  ++LL  N D +  +  L I
Sbjct: 257 GETNLLMAHSIIISKEDTAEPE-----------NVKRLKEQVKLL--NPDAIIVDAALPI 303

Query: 119 MIEDPREVERRRLLGIEDSNAPTRDDLAE-----ALEQVNEGKVPKNR-VALRMLAEEMV 172
            +E+P  ++ +R+L IED    T   +A      A E+   G++   R  A+R +AE   
Sbjct: 304 TVEEPELIKGKRVLVIEDGPTLTHGGMAFGAGILAAEKYGAGEIIDPRPFAIRSIAETFR 363

Query: 173 QWPNL 177
            +P++
Sbjct: 364 NYPHI 368


>gi|341820470|emb|CCC56750.1| stage 0 DNA-binding protein [Weissella thailandensis fsh4-2]
          Length = 301

 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 100/219 (45%), Gaps = 32/219 (14%)

Query: 1   MAATFPQMPLASPIISRHGDNQINQHHPLFAATSWRASKLCLCCSRRSSQRVVALVGKGD 60
           +A +  Q  +  P+I RH     +Q+  L     WRAS++    +      + A+V K D
Sbjct: 65  LAESIKQNGVLQPVIVRHNPKDSSQYEILAGERRWRASQIANIDT------IPAIVRKLD 118

Query: 61  TDLRVAQENDLQDQDQEDEEEEEEATPE----DLENIAQVKRVLELLRKNRDMLFSEVKL 116
            D ++ Q   L++  +ED    EEA       D  N+ Q + V + L K R ++ + ++L
Sbjct: 119 DD-QMLQAAILENLQREDLTPLEEAQAYRDMMDGLNLTQAQ-VAKRLGKARSVVANFLRL 176

Query: 117 TIMIEDPREV---------ERRRLLGIEDSNAPTRDDLAEALEQVNEGKVPKNRVALRML 167
             + ++ +E+         + R LLG+ +     R  +  A   V EG      + +R L
Sbjct: 177 LNLPDEVKELLNAGQLSMGQARTLLGLHNK----RRIVPVAKRAVAEG------MTVRQL 226

Query: 168 AEEMVQWPNLEVEATKQKPSKSLYAKVTDTGIDPEMAAK 206
            E++V   N +   TK++PS S Y K + T ++ +   K
Sbjct: 227 -EQLVNKLNEQSAPTKEQPSLSPYIKASTTALENKFGTK 264


>gi|397904251|ref|ZP_10505170.1| V-type ATP synthase subunit I [Caloramator australicus RC3]
 gi|343178998|emb|CCC58069.1| V-type ATP synthase subunit I [Caloramator australicus RC3]
          Length = 630

 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 75/162 (46%), Gaps = 24/162 (14%)

Query: 36  RASKLCLCCSRRSSQRVVALVGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQ 95
           +  KL L C++   ++++ L+ K    L V Q  DL+D+ ++ + E+E+ + E     A+
Sbjct: 5   KMKKLNLACAKEDKEKILKLLQK----LGVVQIVDLKDKLEDTDYEKEKPSNEVEYEFAK 60

Query: 96  VKRVLELLRKNRDMLFSEVKLTIMIEDPREV----ERRRLLGIEDSNAPTRDDLAEAL-- 149
           VK   E L++     +SE K  I    P+ V    E  RL    DS+    +    A   
Sbjct: 61  VKFTYEFLKQ-----YSEEKKGIF--TPKNVLSFEEFERL----DSHLDWEEYYKAAKFI 109

Query: 150 -EQVNEGKVPKNRVALRMLAEEMVQWPNLEVEATKQKPSKSL 190
            E +N  +  K +V    L ++   W NL+V   + K  K++
Sbjct: 110 EESMNTNRTKKGKVL--SLIDQYSNWVNLDVSEAELKSLKNV 149


>gi|402470870|gb|EJW04907.1| hypothetical protein EDEG_00127 [Edhazardia aedis USNM 41457]
          Length = 823

 Score = 37.0 bits (84), Expect = 8.5,   Method: Composition-based stats.
 Identities = 30/140 (21%), Positives = 72/140 (51%), Gaps = 11/140 (7%)

Query: 56  VGKGDTDLRVAQENDLQDQDQEDEEEEEEATPEDLENIAQVKRVLELLRKNRDMLFSEVK 115
           + K   +++  Q  DLQD+++  ++E EE   E L++  ++++  E L+K ++ +  +  
Sbjct: 223 IRKQREEIKKYQAKDLQDREKNFKDENEELIGEILKHTEEIRKQREELKKQQEEIRKQQ- 281

Query: 116 LTIMIEDPREVERRRLLGIE----DSNAPTRDDLAEALEQVNEGKVPKNRVALRMLAE-E 170
                E+ R+ + + L  IE    D N   + ++A+  E++ + +  K +   + L + E
Sbjct: 282 -----EEIRKQQEKDLQRIEKNFKDENEKLKAEIAKHTEEIRKQEEEKKKYQAKDLQDRE 336

Query: 171 MVQWPNLEVEATKQKPSKSL 190
            +++   E+E  ++K  + L
Sbjct: 337 KIKFLKNEIEKLEKKDIQGL 356


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.132    0.373 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,064,094,009
Number of Sequences: 23463169
Number of extensions: 168876449
Number of successful extensions: 1262354
Number of sequences better than 100.0: 622
Number of HSP's better than 100.0 without gapping: 75
Number of HSP's successfully gapped in prelim test: 547
Number of HSP's that attempted gapping in prelim test: 1255383
Number of HSP's gapped (non-prelim): 4540
length of query: 262
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 122
effective length of database: 9,074,351,707
effective search space: 1107070908254
effective search space used: 1107070908254
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 75 (33.5 bits)