BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 027133
         (227 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255556906|ref|XP_002519486.1| conserved hypothetical protein [Ricinus communis]
 gi|223541349|gb|EEF42900.1| conserved hypothetical protein [Ricinus communis]
          Length = 237

 Score =  334 bits (857), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 180/241 (74%), Positives = 196/241 (81%), Gaps = 19/241 (7%)

Query: 2   MTETCAFCFVSPYSSIPSVSRCLREPNSFLLTSVS-----YRRSSRFLRFTADAAS---- 52
           M ET A CF S +SSIPS+ R  +E NSFL T+ S       RSSR LR T  A+     
Sbjct: 1   MAETSALCF-SHFSSIPSICRS-KESNSFLSTTSSKFSNSNPRSSRSLRLTCKASDSSNF 58

Query: 53  ---DSFNFLPWAHR---IEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKS 106
              DSF+F PW+     IEW+QEE+VTLFTADGL+QIGGS+VPR VSSS  DKKQ +SK+
Sbjct: 59  LGDDSFDFFPWSDGDSVIEWIQEEKVTLFTADGLIQIGGSVVPRLVSSS--DKKQEKSKT 116

Query: 107 SQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVL 166
           SQRFQRFQESDYMDP Q LCLGALF+IAATNGLDMGRRLCI GFCRSIEMLSDVVEDTVL
Sbjct: 117 SQRFQRFQESDYMDPKQGLCLGALFNIAATNGLDMGRRLCIFGFCRSIEMLSDVVEDTVL 176

Query: 167 EHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDF 226
           EHGGEVVAAEKA KGGLHEKLTMTVAVP+LWGVPPASETL LAV+SGGGIVEKVYWQWDF
Sbjct: 177 EHGGEVVAAEKAIKGGLHEKLTMTVAVPYLWGVPPASETLRLAVRSGGGIVEKVYWQWDF 236

Query: 227 L 227
           L
Sbjct: 237 L 237


>gi|224140271|ref|XP_002323506.1| predicted protein [Populus trichocarpa]
 gi|222868136|gb|EEF05267.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score =  322 bits (825), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 173/243 (71%), Positives = 194/243 (79%), Gaps = 21/243 (8%)

Query: 2   MTETCAFCFVSPYSSIPSVSRCLREPNSFLLTSVSYRRSSRFLR------FTADAASDSF 55
           M ET A C +SP+SS+PS     REPNS+L  +  ++ S+   R      FT   ASDS 
Sbjct: 1   MAETSALC-LSPFSSLPSNIYRSREPNSYLSFASPFKSSASHSRSSRSIRFTC-KASDSG 58

Query: 56  NFL--------PWA---HRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRS 104
           NFL        PW+   + IEWV+EER+TLFT DGLVQIGGS+VPR V+SS  ++K+G+S
Sbjct: 59  NFLGDESLGFFPWSDGDNDIEWVREERITLFTTDGLVQIGGSVVPRLVASS--NRKRGKS 116

Query: 105 KSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDT 164
           K+SQRFQRFQESDYMDPNQ LCLGALFDIAATNGLD GRRLCI GFCRSIEMLSDVVEDT
Sbjct: 117 KTSQRFQRFQESDYMDPNQGLCLGALFDIAATNGLDTGRRLCIFGFCRSIEMLSDVVEDT 176

Query: 165 VLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQW 224
           VLEHGGEVV AEKA KGGLHEKLTMTVAVP+LWGVPPASETL LAV+SGGGIVEKVYWQW
Sbjct: 177 VLEHGGEVVTAEKAIKGGLHEKLTMTVAVPYLWGVPPASETLRLAVRSGGGIVEKVYWQW 236

Query: 225 DFL 227
           DFL
Sbjct: 237 DFL 239


>gi|449440135|ref|XP_004137840.1| PREDICTED: uncharacterized protein LOC101221206 [Cucumis sativus]
          Length = 243

 Score =  318 bits (815), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 161/215 (74%), Positives = 175/215 (81%), Gaps = 16/215 (7%)

Query: 27  PNSFLLT---SVSYRRSSRFLRFTADAA--------SDSFNFLPWAH---RIEWVQEERV 72
           P+ FL +   S+S+ R S  LRF+  ++         DSF   PWA     I W+ EERV
Sbjct: 31  PHPFLFSPKFSLSHHRPSPLLRFSLKSSYSGGFMGDDDSFGLFPWADGDSEIHWLPEERV 90

Query: 73  TLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFD 132
           TLFT DGLVQIGGS+VPRR+SSS  DKKQG+SK+SQRFQRFQESDYMDP Q +CLGALFD
Sbjct: 91  TLFTPDGLVQIGGSIVPRRISSS--DKKQGKSKTSQRFQRFQESDYMDPKQSICLGALFD 148

Query: 133 IAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVA 192
           IAATNGLDMGRRLCI GFCRS+EMLSDVVED VLE GGEVVAAEKASKGGL EKLTMTVA
Sbjct: 149 IAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLTMTVA 208

Query: 193 VPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 227
           VP LWGVPPASETLHLAVQSGGGIVEKVYWQWDFL
Sbjct: 209 VPLLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 243


>gi|356548270|ref|XP_003542526.1| PREDICTED: uncharacterized protein LOC100795364 [Glycine max]
          Length = 238

 Score =  315 bits (808), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 148/182 (81%), Positives = 165/182 (90%), Gaps = 6/182 (3%)

Query: 50  AASDSFNFLPWA----HRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSK 105
           +  DSF F PW+    + I+W+ EER+TLFTADGL+QIGGSMVPRRVSSS  DKK G+SK
Sbjct: 59  SGDDSFGFFPWSDSDNNEIQWIPEERITLFTADGLIQIGGSMVPRRVSSS--DKKHGKSK 116

Query: 106 SSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTV 165
           ++++FQRFQES+YM+PNQ LCLGALFDIAATNGLDMGRRLCI GFCRSIEMLSDVVEDTV
Sbjct: 117 TAKKFQRFQESNYMNPNQGLCLGALFDIAATNGLDMGRRLCIFGFCRSIEMLSDVVEDTV 176

Query: 166 LEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWD 225
           LEHGGE++AAEKASKG LHEKLTMTVAVP LWGVPPASETLHLAV+SGGGIVEKVYWQWD
Sbjct: 177 LEHGGEIIAAEKASKGDLHEKLTMTVAVPLLWGVPPASETLHLAVKSGGGIVEKVYWQWD 236

Query: 226 FL 227
           FL
Sbjct: 237 FL 238


>gi|356533185|ref|XP_003535148.1| PREDICTED: uncharacterized protein LOC100776845 [Glycine max]
          Length = 236

 Score =  315 bits (808), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 152/198 (76%), Positives = 169/198 (85%), Gaps = 12/198 (6%)

Query: 40  SSRFLRFTADAA-------SDSFNFLPWA---HRIEWVQEERVTLFTADGLVQIGGSMVP 89
           + R L F   A+        DSF F PW+   + I+W+ EER+TLFTADGL+QIGGSM+P
Sbjct: 41  APRLLHFAPKASPSGNISGDDSFGFFPWSDPDNEIQWIPEERITLFTADGLIQIGGSMIP 100

Query: 90  RRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILG 149
           RRVSSS  DKK  +SK++Q+FQRFQES+YMDPNQ LCLGALFDIAATNGLDMGRRLCI G
Sbjct: 101 RRVSSS--DKKLAKSKTAQKFQRFQESNYMDPNQGLCLGALFDIAATNGLDMGRRLCIFG 158

Query: 150 FCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLA 209
           FCRSIEMLSDVVEDTVLEHGGE++AAEKASKG LHEKLTMTVAVP LWGVPPASETLHLA
Sbjct: 159 FCRSIEMLSDVVEDTVLEHGGEIIAAEKASKGDLHEKLTMTVAVPLLWGVPPASETLHLA 218

Query: 210 VQSGGGIVEKVYWQWDFL 227
           V+SGGGIVEKVYWQWDFL
Sbjct: 219 VKSGGGIVEKVYWQWDFL 236


>gi|449501054|ref|XP_004161265.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101229752 [Cucumis sativus]
          Length = 244

 Score =  313 bits (801), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 159/215 (73%), Positives = 173/215 (80%), Gaps = 15/215 (6%)

Query: 27  PNSFLLT---SVSYRRSSRFLRFTADAA--------SDSFNFLPWAH---RIEWVQEERV 72
           P+ FL +   S+S+ R S  LRF+  ++         DSF   PWA     I W+ EERV
Sbjct: 31  PHPFLFSPKFSLSHHRPSPLLRFSLKSSYSGGFMGDDDSFGLFPWADGDSEIHWLPEERV 90

Query: 73  TLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFD 132
           TLFT DGLVQIGGS+VPRR+SSS    KQG+SK+SQRFQRFQESDYMDP Q +CLGALFD
Sbjct: 91  TLFTPDGLVQIGGSIVPRRISSSDV-XKQGKSKTSQRFQRFQESDYMDPKQSICLGALFD 149

Query: 133 IAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVA 192
           IAATNGLDMGRRLCI GFCRS+EMLSDVVED VLE GGEVVAAEKASKGGL EKLTMTVA
Sbjct: 150 IAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLTMTVA 209

Query: 193 VPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 227
           VP LWGVPPASETLHLAVQSGGGIVEKVYWQWDFL
Sbjct: 210 VPLLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 244


>gi|388492426|gb|AFK34279.1| unknown [Lotus japonicus]
          Length = 239

 Score =  312 bits (799), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 156/207 (75%), Positives = 171/207 (82%), Gaps = 9/207 (4%)

Query: 28  NSFLLTSVSYRRSSRFLRFTADA---ASDSFNFLPWAH----RIEWVQEERVTLFTADGL 80
           NSF L+        R LRF   A      SF F PW+      I+W+ E+R+TLFTA+GL
Sbjct: 35  NSFPLSPSHSITQPRLLRFAPKANHSGDHSFGFFPWSDDADDEIQWLPEDRITLFTAEGL 94

Query: 81  VQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLD 140
           VQIGGSMVPRRVSSS  DKKQG+ K+S++FQRFQE+DYMDPNQ LCLGALFDIAATNGLD
Sbjct: 95  VQIGGSMVPRRVSSS--DKKQGKFKTSKKFQRFQETDYMDPNQGLCLGALFDIAATNGLD 152

Query: 141 MGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVP 200
           M RRLCI GFCRSIEMLSDVVEDTVLEHGGEV+AAEKASKG LHEKLTMTVAVP LWGVP
Sbjct: 153 MDRRLCIFGFCRSIEMLSDVVEDTVLEHGGEVLAAEKASKGDLHEKLTMTVAVPLLWGVP 212

Query: 201 PASETLHLAVQSGGGIVEKVYWQWDFL 227
           PASETLHLAV+SGGGIVEKVYWQW+FL
Sbjct: 213 PASETLHLAVKSGGGIVEKVYWQWNFL 239


>gi|255646703|gb|ACU23825.1| unknown [Glycine max]
          Length = 236

 Score =  310 bits (793), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 150/198 (75%), Positives = 167/198 (84%), Gaps = 12/198 (6%)

Query: 40  SSRFLRFTADAA-------SDSFNFLPWA---HRIEWVQEERVTLFTADGLVQIGGSMVP 89
           + R L F   A+        DSF F PW+   + I+W+ EER+TLFTADGL+QIGGSM+P
Sbjct: 41  APRLLHFAPKASPSGNISGDDSFGFFPWSDPDNEIQWIPEERITLFTADGLIQIGGSMIP 100

Query: 90  RRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILG 149
           RRVSSS  DKK  +SK++Q+FQRFQES+YMDPNQ LCLGALFDIAATNGLDMGRRLCI  
Sbjct: 101 RRVSSS--DKKLAKSKTAQKFQRFQESNYMDPNQGLCLGALFDIAATNGLDMGRRLCIFR 158

Query: 150 FCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLA 209
           FCRSIEMLSDVVEDTVLEHGGE++AAEKASKG LHEKLTMTVAVP LWGVPPA ETLHLA
Sbjct: 159 FCRSIEMLSDVVEDTVLEHGGEIIAAEKASKGDLHEKLTMTVAVPLLWGVPPAFETLHLA 218

Query: 210 VQSGGGIVEKVYWQWDFL 227
           V+SGGGIVEKVYWQWDFL
Sbjct: 219 VKSGGGIVEKVYWQWDFL 236


>gi|225441201|ref|XP_002270394.1| PREDICTED: uncharacterized protein LOC100245118 [Vitis vinifera]
 gi|297739958|emb|CBI30140.3| unnamed protein product [Vitis vinifera]
          Length = 232

 Score =  308 bits (788), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 155/191 (81%), Positives = 168/191 (87%), Gaps = 7/191 (3%)

Query: 41  SRFLRFTADA-ASDSFNFLPWAHR--IEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTS 97
           SR+ RF+A A A D F F PW +   I+WV EERVTLFTADGL+QIGGS+VPR V SS  
Sbjct: 45  SRYPRFSAKASADDPFGFFPWDNENDIQWVPEERVTLFTADGLIQIGGSLVPRHVPSS-- 102

Query: 98  DKKQGRSKSSQRFQRFQESDYMDPNQR-LCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
           DKK  RS++ Q+FQRFQESDY+DP Q+ LCLGALFDIAATNGLDMGRRLCI GFCRSIEM
Sbjct: 103 DKKHRRSRT-QKFQRFQESDYLDPKQKGLCLGALFDIAATNGLDMGRRLCIFGFCRSIEM 161

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGI 216
           LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVP LWGVPPASETLHLAV+SGGGI
Sbjct: 162 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPLLWGVPPASETLHLAVRSGGGI 221

Query: 217 VEKVYWQWDFL 227
           VEK+YWQWDFL
Sbjct: 222 VEKIYWQWDFL 232


>gi|30686956|ref|NP_850272.1| uncharacterized protein [Arabidopsis thaliana]
 gi|22531199|gb|AAM97103.1| unknown protein [Arabidopsis thaliana]
 gi|23198030|gb|AAN15542.1| unknown protein [Arabidopsis thaliana]
 gi|110741110|dbj|BAE98649.1| hypothetical protein [Arabidopsis thaliana]
 gi|330254222|gb|AEC09316.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 232

 Score =  290 bits (741), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 152/242 (62%), Positives = 178/242 (73%), Gaps = 26/242 (10%)

Query: 2   MTETCAFCF--------VSPY-SSIPSVSRCLREPNSFLLTSVSYRR----SSRFLRFTA 48
           M ET    F        +SP+  S PS +R      S LL+ V   R    +S +  F+ 
Sbjct: 1   MAETSTLLFSTFSSHLTISPFRQSHPSAARF-----SSLLSRVRPSRFAVKASHYGNFSD 55

Query: 49  DAASDSFNFLPWA---HRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSK 105
           D   D+FNF PW+   + IEWV EER+TLFT+DGLVQIGG+MVPRR+ SS  +KK GRS+
Sbjct: 56  D---DAFNFFPWSDANNEIEWVPEERITLFTSDGLVQIGGNMVPRRIKSS--NKKHGRSR 110

Query: 106 SSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTV 165
           S ++ Q+F ES YMDP Q LCLGALFDIAATNGLDMGRRLCI GFCRS+EMLSDVVEDTV
Sbjct: 111 SLEKHQKFHESAYMDPAQGLCLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDTV 170

Query: 166 LEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWD 225
           LEHGGE+VA E  S  GL EKLTMTVAVP+LWGVPPA+E LHLAV++GGGIV+KVYWQW 
Sbjct: 171 LEHGGEIVATETESTSGLQEKLTMTVAVPYLWGVPPAAERLHLAVRTGGGIVDKVYWQWH 230

Query: 226 FL 227
           FL
Sbjct: 231 FL 232


>gi|145330677|ref|NP_001078014.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330254223|gb|AEC09317.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 231

 Score =  287 bits (734), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 151/242 (62%), Positives = 176/242 (72%), Gaps = 27/242 (11%)

Query: 2   MTETCAFCF--------VSPY-SSIPSVSRCLREPNSFLLTSVSYRR----SSRFLRFTA 48
           M ET    F        +SP+  S PS +R      S LL+ V   R    +S +  F+ 
Sbjct: 1   MAETSTLLFSTFSSHLTISPFRQSHPSAARF-----SSLLSRVRPSRFAVKASHYGNFSD 55

Query: 49  DAASDSFNFLPWA---HRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSK 105
           D   D+FNF PW+   + IEWV EER+TLFT+DGLVQIGG+MVPRR+ SS    K GRS+
Sbjct: 56  D---DAFNFFPWSDANNEIEWVPEERITLFTSDGLVQIGGNMVPRRIKSS---NKHGRSR 109

Query: 106 SSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTV 165
           S ++ Q+F ES YMDP Q LCLGALFDIAATNGLDMGRRLCI GFCRS+EMLSDVVEDTV
Sbjct: 110 SLEKHQKFHESAYMDPAQGLCLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDTV 169

Query: 166 LEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWD 225
           LEHGGE+VA E  S  GL EKLTMTVAVP+LWGVPPA+E LHLAV++GGGIV+KVYWQW 
Sbjct: 170 LEHGGEIVATETESTSGLQEKLTMTVAVPYLWGVPPAAERLHLAVRTGGGIVDKVYWQWH 229

Query: 226 FL 227
           FL
Sbjct: 230 FL 231


>gi|413957025|gb|AFW89674.1| hypothetical protein ZEAMMB73_059428 [Zea mays]
          Length = 238

 Score =  279 bits (714), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 135/190 (71%), Positives = 153/190 (80%), Gaps = 10/190 (5%)

Query: 45  RFTADAASDSFNFLPW--------AHRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSST 96
           R   D   D +   PW           I+WV E+RVTLFT+DGL+QIGGS+VPRRVS+S 
Sbjct: 50  RAGKDDPEDLYGPYPWDQPLDLTAGSDIQWVPEDRVTLFTSDGLIQIGGSLVPRRVSAS- 108

Query: 97  SDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
            +K+Q + KS Q+ +RFQES YMDPNQ LCLGALF+IAATNGLDMGRRLCI GFCRSIEM
Sbjct: 109 -EKRQRKLKSGQKLRRFQESSYMDPNQSLCLGALFNIAATNGLDMGRRLCIFGFCRSIEM 167

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGI 216
           LSDVVEDTVLEHGGEVV AEKAS  GL EKLTM+VAVP LWGVPPASETLH+AV+SGGGI
Sbjct: 168 LSDVVEDTVLEHGGEVVTAEKASSNGLQEKLTMSVAVPLLWGVPPASETLHVAVRSGGGI 227

Query: 217 VEKVYWQWDF 226
           V+K+YWQWD 
Sbjct: 228 VDKIYWQWDL 237


>gi|115450785|ref|NP_001048993.1| Os03g0152600 [Oryza sativa Japonica Group]
 gi|108706229|gb|ABF94024.1| expressed protein [Oryza sativa Japonica Group]
 gi|113547464|dbj|BAF10907.1| Os03g0152600 [Oryza sativa Japonica Group]
          Length = 239

 Score =  278 bits (711), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 136/190 (71%), Positives = 151/190 (79%), Gaps = 10/190 (5%)

Query: 45  RFTADAASDSFNFLPWAHR--------IEWVQEERVTLFTADGLVQIGGSMVPRRVSSST 96
           R   D   D +   PW           I+WV E+RVTLFT+DGLVQIGGS+VPRR++ S 
Sbjct: 51  RAGKDDPEDLYGPYPWDQSLDLTTGLDIQWVPEDRVTLFTSDGLVQIGGSLVPRRITPS- 109

Query: 97  SDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
            +K+Q + K  Q  QRFQES YMDPNQ LCLGALF+IAATNGLDMGRRLCI GFCRSIEM
Sbjct: 110 -EKRQRKVKGIQNIQRFQESSYMDPNQSLCLGALFNIAATNGLDMGRRLCIFGFCRSIEM 168

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGI 216
           LSDVVEDTVLEHGGEVV AEKASK GL EKLTM+VAVP LWGVPPASETLH+AV+SGGGI
Sbjct: 169 LSDVVEDTVLEHGGEVVTAEKASKDGLQEKLTMSVAVPLLWGVPPASETLHVAVRSGGGI 228

Query: 217 VEKVYWQWDF 226
           VEK+YWQWD 
Sbjct: 229 VEKIYWQWDL 238


>gi|242036889|ref|XP_002465839.1| hypothetical protein SORBIDRAFT_01g046780 [Sorghum bicolor]
 gi|241919693|gb|EER92837.1| hypothetical protein SORBIDRAFT_01g046780 [Sorghum bicolor]
          Length = 234

 Score =  278 bits (710), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 134/190 (70%), Positives = 153/190 (80%), Gaps = 10/190 (5%)

Query: 45  RFTADAASDSFNFLPWAH--------RIEWVQEERVTLFTADGLVQIGGSMVPRRVSSST 96
           R   D   D +   PW           I+WV E+RVTLFT+DGL+QIGGS+VPRRVS+S 
Sbjct: 46  RAGKDDPEDLYGPYPWDQPLDLTTGFDIQWVPEDRVTLFTSDGLIQIGGSLVPRRVSAS- 104

Query: 97  SDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
            +K+Q + KS+Q+ +RFQES YMDPNQ LCLGALF+IAATNGLDMGRRLCI GFCRSIEM
Sbjct: 105 -EKRQRKLKSAQKLRRFQESSYMDPNQSLCLGALFNIAATNGLDMGRRLCIFGFCRSIEM 163

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGI 216
           LSDVVEDTVLEHGGEVV AEKAS  GL EKL M+VAVP LWGVPPASETLH+AV+SGGGI
Sbjct: 164 LSDVVEDTVLEHGGEVVTAEKASNDGLQEKLIMSVAVPLLWGVPPASETLHVAVRSGGGI 223

Query: 217 VEKVYWQWDF 226
           V+K+YWQWD 
Sbjct: 224 VDKIYWQWDL 233


>gi|357114002|ref|XP_003558790.1| PREDICTED: uncharacterized protein LOC100846853 [Brachypodium
           distachyon]
          Length = 240

 Score =  277 bits (709), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 132/191 (69%), Positives = 155/191 (81%), Gaps = 10/191 (5%)

Query: 45  RFTADAASDSFNFLPWAH--------RIEWVQEERVTLFTADGLVQIGGSMVPRRVSSST 96
           R   D   D +   PW           IEWVQE+++TLFT+DGLVQIGGS+VPRR+S+S 
Sbjct: 52  RARKDDPEDPYGPYPWEQPLDLSSGFDIEWVQEDKITLFTSDGLVQIGGSLVPRRLSAS- 110

Query: 97  SDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
            +K++ ++K + +F+RFQES YMDPNQ LCLG+LF+IAATNGLDMGRRLCI GFCRSIEM
Sbjct: 111 -EKRKQKAKGTHKFRRFQESSYMDPNQSLCLGSLFNIAATNGLDMGRRLCIFGFCRSIEM 169

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGI 216
           LSDVVEDTVLEHGGEVV AEKAS  GL EKLTM+VAVP LWGVPPASETLH+AV+SGGGI
Sbjct: 170 LSDVVEDTVLEHGGEVVIAEKASNDGLQEKLTMSVAVPLLWGVPPASETLHVAVRSGGGI 229

Query: 217 VEKVYWQWDFL 227
           V+K+YWQWD  
Sbjct: 230 VDKIYWQWDLF 240


>gi|218192108|gb|EEC74535.1| hypothetical protein OsI_10055 [Oryza sativa Indica Group]
          Length = 239

 Score =  276 bits (706), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 135/191 (70%), Positives = 151/191 (79%), Gaps = 10/191 (5%)

Query: 45  RFTADAASDSFNFLPWAHR--------IEWVQEERVTLFTADGLVQIGGSMVPRRVSSST 96
           R   D   D +   PW           I+WV E+RVTLFT+DGLVQIGGS+VPRR++ S 
Sbjct: 51  RAGKDDPEDLYGPYPWDQSLDLTTGLDIQWVPEDRVTLFTSDGLVQIGGSLVPRRITPS- 109

Query: 97  SDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
            +K+Q + K  Q  +RFQES YMDPNQ LCLGALF+IAATNGLDMGRRLCI GFCRSIEM
Sbjct: 110 -EKRQRKVKGIQNIRRFQESSYMDPNQSLCLGALFNIAATNGLDMGRRLCIFGFCRSIEM 168

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGI 216
           LSDVVEDTVLEHGGEVV AEKASK GL EKLTM+VAVP LWGVPPASETLH+AV+SGGGI
Sbjct: 169 LSDVVEDTVLEHGGEVVTAEKASKDGLQEKLTMSVAVPLLWGVPPASETLHVAVRSGGGI 228

Query: 217 VEKVYWQWDFL 227
           VEK+YWQWD  
Sbjct: 229 VEKIYWQWDLF 239


>gi|222624205|gb|EEE58337.1| hypothetical protein OsJ_09445 [Oryza sativa Japonica Group]
          Length = 268

 Score =  275 bits (702), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 131/164 (79%), Positives = 145/164 (88%), Gaps = 2/164 (1%)

Query: 64  IEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQ 123
           I+WV E+RVTLFT+DGLVQIGGS+VPRR++ S  +K+Q + K  Q  QRFQES YMDPNQ
Sbjct: 107 IQWVPEDRVTLFTSDGLVQIGGSLVPRRITPS--EKRQRKVKGIQNIQRFQESSYMDPNQ 164

Query: 124 RLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGL 183
            LCLGALF+IAATNGLDMGRRLCI GFCRSIEMLSDVVEDTVLEHGGEVV AEKASK GL
Sbjct: 165 SLCLGALFNIAATNGLDMGRRLCIFGFCRSIEMLSDVVEDTVLEHGGEVVTAEKASKDGL 224

Query: 184 HEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 227
            EKLTM+VAVP LWGVPPASETLH+AV+SGGGIVEK+YWQWD  
Sbjct: 225 QEKLTMSVAVPLLWGVPPASETLHVAVRSGGGIVEKIYWQWDLF 268


>gi|294463983|gb|ADE77512.1| unknown [Picea sitchensis]
          Length = 291

 Score =  272 bits (695), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 129/181 (71%), Positives = 149/181 (82%), Gaps = 8/181 (4%)

Query: 53  DSFNFLPW------AHRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKS 106
           D F   PW         ++WVQE+ +TLFT+DGL++IGGS+VPRR+S   SDK QG   +
Sbjct: 113 DEFGLYPWDSSSSLPDDLQWVQEDTITLFTSDGLMRIGGSVVPRRIS--PSDKDQGILGA 170

Query: 107 SQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVL 166
           S R+QRFQE +YMDPNQ LCLGA+FDIAATNGLD+GRRLCI GFCRSIEMLSDVVEDTVL
Sbjct: 171 SHRYQRFQEENYMDPNQGLCLGAIFDIAATNGLDLGRRLCIFGFCRSIEMLSDVVEDTVL 230

Query: 167 EHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDF 226
           E GGEVVAAEK SKGGLHEKL MTVAVP LWG+PPA++ L  AV++GGGIVEKVYWQWDF
Sbjct: 231 EQGGEVVAAEKESKGGLHEKLRMTVAVPLLWGIPPAADRLRFAVRTGGGIVEKVYWQWDF 290

Query: 227 L 227
           +
Sbjct: 291 I 291


>gi|297827191|ref|XP_002881478.1| hypothetical protein ARALYDRAFT_321394 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327317|gb|EFH57737.1| hypothetical protein ARALYDRAFT_321394 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 202

 Score =  256 bits (653), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 142/235 (60%), Positives = 161/235 (68%), Gaps = 42/235 (17%)

Query: 2   MTETCAFCF--------VSPY-SSIPSVSRCLREPNSFLLTSVSYRRSSRFLRFTADAAS 52
           M ETC+  F        +SP+  S PS +R      S LL+ V   R SRF    A  AS
Sbjct: 1   MAETCSLLFSTFSSHLTISPFRHSHPSAARF-----SSLLSRV---RPSRF----AVKAS 48

Query: 53  DSFNFLPWAHRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQR 112
                    + IEWV EER+TLFT DGLVQIGG+M               +SKS ++ Q+
Sbjct: 49  H--------YEIEWVPEERITLFTTDGLVQIGGNM-------------SHKSKSPEKHQK 87

Query: 113 FQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEV 172
           FQES YMDP Q LCLGALFDIAATNGLDMGRRLCI GFCRS+EMLSDVVEDTVLEHGGE+
Sbjct: 88  FQESAYMDPAQGLCLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDTVLEHGGEI 147

Query: 173 VAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 227
           VA E  S  GL EKLTMTVAVP+LWGVPPA+E LHLAV++GGGIV+KVYWQW FL
Sbjct: 148 VATETESTSGLQEKLTMTVAVPYLWGVPPAAERLHLAVRTGGGIVDKVYWQWHFL 202


>gi|302774356|ref|XP_002970595.1| hypothetical protein SELMODRAFT_411301 [Selaginella moellendorffii]
 gi|300162111|gb|EFJ28725.1| hypothetical protein SELMODRAFT_411301 [Selaginella moellendorffii]
          Length = 236

 Score =  227 bits (579), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 145/210 (69%), Gaps = 17/210 (8%)

Query: 26  EPNSFLLTSVSYRRSSRFLRFTAD--AASDSFNFLPW------AHRIEWVQEERVTLFTA 77
           + ++ +L SVS RRS+R    + D  ++ DS+   PW      +  +EW+Q + +TLFT+
Sbjct: 36  DASNLMLGSVSGRRSARPCASSDDNLSSEDSYGPYPWDSSWQNSDAVEWMQGDTITLFTS 95

Query: 78  DGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATN 137
           +GLV+IGGS   R VSSS         +  QR +RF+E  YMDP Q LCLGA+FDIAATN
Sbjct: 96  EGLVRIGGSRFARPVSSS---------RKKQRLRRFREEQYMDPKQELCLGAVFDIAATN 146

Query: 138 GLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLW 197
           GLD+GRR C+ GFCRS+EMLSDVVEDTVLE GGEVV A K +   LHEKL MTVA+P LW
Sbjct: 147 GLDLGRRFCVFGFCRSVEMLSDVVEDTVLEQGGEVVIAAKLTSSDLHEKLKMTVAIPLLW 206

Query: 198 GVPPASETLHLAVQSGGGIVEKVYWQWDFL 227
           GVPP  E L+ A++ GGGIVEK Y+ W F 
Sbjct: 207 GVPPGLENLNQAIRCGGGIVEKAYYSWTFF 236


>gi|302770026|ref|XP_002968432.1| hypothetical protein SELMODRAFT_409238 [Selaginella moellendorffii]
 gi|300164076|gb|EFJ30686.1| hypothetical protein SELMODRAFT_409238 [Selaginella moellendorffii]
          Length = 236

 Score =  225 bits (573), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 114/208 (54%), Positives = 143/208 (68%), Gaps = 17/208 (8%)

Query: 28  NSFLLTSVSYRRSSRFLRFTAD--AASDSFNFLPW------AHRIEWVQEERVTLFTADG 79
           ++ +L S S RRS+R    + D  ++ DS+   PW      +  +EW+Q + +TLFT++G
Sbjct: 38  SNLMLGSASGRRSARPCASSDDNLSSEDSYGPYPWDSSWQNSDAVEWMQGDTITLFTSEG 97

Query: 80  LVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGL 139
           LV+IGGS   R VSSS         +  QR +RF+E  YMDP Q LCLGA+FDIAATNGL
Sbjct: 98  LVRIGGSRFARPVSSS---------RKKQRLRRFREEQYMDPKQELCLGAVFDIAATNGL 148

Query: 140 DMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHEKLTMTVAVPFLWGV 199
           D+GRR C+ GFCRS+EMLSDVVEDTVLE GGEVV A K +   LHEKL MTVA+P LWGV
Sbjct: 149 DLGRRFCVFGFCRSVEMLSDVVEDTVLEQGGEVVIAAKLTSSDLHEKLKMTVAIPLLWGV 208

Query: 200 PPASETLHLAVQSGGGIVEKVYWQWDFL 227
           PP  E L+ A++ GGGIVEK Y+ W F 
Sbjct: 209 PPGLENLNQAIRCGGGIVEKAYYSWTFF 236


>gi|168003826|ref|XP_001754613.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694234|gb|EDQ80583.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 158

 Score =  213 bits (542), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 104/164 (63%), Positives = 126/164 (76%), Gaps = 7/164 (4%)

Query: 64  IEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQ 123
           ++WV+E+  T FT DGLV+IGGS+ P       S     + K   R +R++E DYMDP Q
Sbjct: 2   VDWVREDTFTFFTTDGLVRIGGSIRP-------SFGIDWQKKHRLRRRRYKEEDYMDPKQ 54

Query: 124 RLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGL 183
            LCLGA+FDIAATNGLD GR+LC++GFCRSIEML+DVVEDTVL+ GGEVV AEKA+  GL
Sbjct: 55  GLCLGAIFDIAATNGLDRGRKLCVVGFCRSIEMLNDVVEDTVLDLGGEVVIAEKATTEGL 114

Query: 184 HEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDFL 227
           HEKLTM V +P LWGVPPA +TL+ A++SGGGIVEK Y QW FL
Sbjct: 115 HEKLTMMVVMPLLWGVPPAVDTLNYAIRSGGGIVEKTYKQWHFL 158


>gi|388507172|gb|AFK41652.1| unknown [Medicago truncatula]
          Length = 195

 Score =  206 bits (525), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 110/194 (56%), Positives = 137/194 (70%), Gaps = 20/194 (10%)

Query: 2   MTETCA---FCFVSPYSSIPSVSRCLREP----NSFLLTSVSYRRSSRFLRFTADAA--- 51
           M ET     F F SP  +   +SR  + P    NSF ++  S     R LRF+  A    
Sbjct: 1   MAETSTLFRFRFSSPPFNNSIISRSSKPPFSFNNSFHVSPSSSH--PRLLRFSPKATHSS 58

Query: 52  --SDSFNFLPWA---HRIEWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKS 106
             +D F F PW+   + I+WV +++ T FT DGL+QIGG+MVPRRV SS    KQG+S++
Sbjct: 59  SNNDFFEFFPWSDDDNEIQWVSQDKFTFFTTDGLIQIGGNMVPRRVKSS---DKQGKSQT 115

Query: 107 SQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVL 166
           ++++QR+QES+YMDPNQ LCLGALFDIAATNGLDM RRLCI GFCRSIEMLSDVVEDTVL
Sbjct: 116 AKKYQRYQESNYMDPNQGLCLGALFDIAATNGLDMSRRLCIFGFCRSIEMLSDVVEDTVL 175

Query: 167 EHGGEVVAAEKASK 180
           EHGGEV+AAEK ++
Sbjct: 176 EHGGEVIAAEKQAE 189


>gi|108706230|gb|ABF94025.1| expressed protein [Oryza sativa Japonica Group]
          Length = 211

 Score =  183 bits (465), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 94/147 (63%), Positives = 105/147 (71%), Gaps = 10/147 (6%)

Query: 45  RFTADAASDSFNFLPWAHR--------IEWVQEERVTLFTADGLVQIGGSMVPRRVSSST 96
           R   D   D +   PW           I+WV E+RVTLFT+DGLVQIGGS+VPRR++ S 
Sbjct: 51  RAGKDDPEDLYGPYPWDQSLDLTTGLDIQWVPEDRVTLFTSDGLVQIGGSLVPRRITPS- 109

Query: 97  SDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEM 156
            +K+Q + K  Q  QRFQES YMDPNQ LCLGALF+IAATNGLDMGRRLCI GFCRSIEM
Sbjct: 110 -EKRQRKVKGIQNIQRFQESSYMDPNQSLCLGALFNIAATNGLDMGRRLCIFGFCRSIEM 168

Query: 157 LSDVVEDTVLEHGGEVVAAEKASKGGL 183
           LSDVVEDTVLEHGGEV A     K  L
Sbjct: 169 LSDVVEDTVLEHGGEVCATVLHGKKNL 195


>gi|358347765|ref|XP_003637922.1| hypothetical protein MTR_113s0006 [Medicago truncatula]
 gi|355503857|gb|AES85060.1| hypothetical protein MTR_113s0006 [Medicago truncatula]
          Length = 296

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 72/97 (74%), Positives = 84/97 (86%), Gaps = 6/97 (6%)

Query: 84  GGSMVPRRVSSSTSD------KKQGRSKSSQRFQRFQESDYMDPNQRLCLGALFDIAATN 137
           GG+MVPRRV SS  +      +KQG+S++++++QR+QES+YMDPNQ LCLGALFDIAATN
Sbjct: 100 GGNMVPRRVKSSEQNTCSHVSQKQGKSQTAKKYQRYQESNYMDPNQGLCLGALFDIAATN 159

Query: 138 GLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVA 174
           GLDM RRLCI GFCRSIEMLSDVVEDTVLEHGGEV A
Sbjct: 160 GLDMSRRLCIFGFCRSIEMLSDVVEDTVLEHGGEVGA 196


>gi|384254097|gb|EIE27571.1| hypothetical protein COCSUDRAFT_52208 [Coccomyxa subellipsoidea
           C-169]
          Length = 194

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/161 (45%), Positives = 90/161 (55%), Gaps = 22/161 (13%)

Query: 69  EERVTLFTADGLVQIGGS---MVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRL 125
           EE +T  TA G VQ+       VPRR+ +                    E    +  Q L
Sbjct: 52  EEVITFLTATGFVQLPSRPSWHVPRRMLT-------------------MEPLPTEQEQGL 92

Query: 126 CLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEHGGEVVAAEKASKGGLHE 185
           C+GA F IAATNG D  RRL +LGFCRS+  LS+VVED VL  GG V+     +K G+HE
Sbjct: 93  CVGASFRIAATNGADPKRRLHLLGFCRSVNSLSEVVEDAVLRRGGSVMLHHVETKSGVHE 152

Query: 186 KLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQWDF 226
            L MTVA+P L+G+PP  E L  A+ SGGGIVEK+  QW  
Sbjct: 153 VLKMTVAIPLLYGIPPEYEYLRSAISSGGGIVEKMSRQWHM 193


>gi|159482994|ref|XP_001699550.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272817|gb|EDO98613.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 196

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 73/161 (45%), Positives = 101/161 (62%), Gaps = 17/161 (10%)

Query: 65  EWVQEERVTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQR 124
           E+V+E  +T FT++G+VQI G    RRV+   +  K    K       ++E       Q 
Sbjct: 47  EYVREV-ITFFTSEGVVQIPG----RRVARPATQAK----KLVAPMPTWEE-------QS 90

Query: 125 LCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTV-LEHGGEVVAAEKASKGGL 183
           LC+GA F +AATNG+D GRR+ I GFC+S++ L   V+D +  E GGEV+  E+  K GL
Sbjct: 91  LCIGATFSVAATNGMDGGRRVSIEGFCQSVDFLFASVQDALESELGGEVLMQERQLKSGL 150

Query: 184 HEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQW 224
           HE L MTVAVP L+GVPP  + L+ A++SGGGIV++V   W
Sbjct: 151 HEVLNMTVAVPLLFGVPPQLDVLNEAIRSGGGIVDRVRHVW 191


>gi|307107754|gb|EFN55996.1| hypothetical protein CHLNCDRAFT_52073 [Chlorella variabilis]
          Length = 299

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 57/116 (49%), Positives = 77/116 (66%), Gaps = 2/116 (1%)

Query: 111 QRFQESDYMD--PNQRLCLGALFDIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVLEH 168
           QR  E D +     Q LC+G++F I+ATNG++  RR+ + GFC S E L + VEDTVL  
Sbjct: 181 QRRSEGDTLPLPGEQGLCIGSVFRISATNGIEPSRRVNLAGFCFSTEQLYERVEDTVLAR 240

Query: 169 GGEVVAAEKASKGGLHEKLTMTVAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQW 224
           GGEV   ++  K G+HE L MTVA+P LWGVPP  E L   +++GGGI++K+  QW
Sbjct: 241 GGEVFETQRLLKSGVHEALVMTVAIPLLWGVPPEYERLKAGIKTGGGIIDKIERQW 296


>gi|302842347|ref|XP_002952717.1| hypothetical protein VOLCADRAFT_105632 [Volvox carteri f.
           nagariensis]
 gi|300262061|gb|EFJ46270.1| hypothetical protein VOLCADRAFT_105632 [Volvox carteri f.
           nagariensis]
          Length = 554

 Score =  116 bits (290), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 67/154 (43%), Positives = 91/154 (59%), Gaps = 16/154 (10%)

Query: 72  VTLFTADGLVQIGGSMVPRRVSSSTSDKKQGRSKSSQRFQRFQESDYMDPNQRLCLGALF 131
           +T FT+DG+V+I G  V R                +QR +           Q LC+GA F
Sbjct: 408 ITFFTSDGVVRIPGKRVLR---------------PAQRHKELVRPMPTWDQQSLCIGASF 452

Query: 132 DIAATNGLDMGRRLCILGFCRSIEMLSDVVEDTVL-EHGGEVVAAEKASKGGLHEKLTMT 190
            +AATNG D  RR+ I GFC+S++ L   V+D +  E GGEV+  E+  K GLHE L +T
Sbjct: 453 SVAATNGTDGTRRVSIEGFCQSVDYLFASVQDALESELGGEVLMQERQLKSGLHEVLKLT 512

Query: 191 VAVPFLWGVPPASETLHLAVQSGGGIVEKVYWQW 224
           VAVPFL+GVPP  E L+ A+++GGGIV++V   W
Sbjct: 513 VAVPFLFGVPPQLEVLNEAIRTGGGIVDRVRHVW 546


>gi|326319081|ref|YP_004236753.1| family 2 glycosyl transferase [Acidovorax avenae subsp. avenae ATCC
           19860]
 gi|323375917|gb|ADX48186.1| glycosyl transferase family 2 [Acidovorax avenae subsp. avenae ATCC
           19860]
          Length = 1464

 Score = 38.9 bits (89), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 42/72 (58%), Gaps = 3/72 (4%)

Query: 51  ASDSFNFLPWAHRIEWVQEERVTLFTADGLVQIGGS---MVPRRVSSSTSDKKQGRSKSS 107
           A DSF+ +  +H +E V ++++ +     ++  GG    MVP   +++T+D+    S+  
Sbjct: 128 ADDSFDLIICSHVLEHVNDDQLAMRELHRVLAPGGCAILMVPLLATAATTDEDPQESRVE 187

Query: 108 QRFQRFQESDYM 119
           +R++RF + D++
Sbjct: 188 ERWRRFGQDDHV 199


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.134    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,363,397,107
Number of Sequences: 23463169
Number of extensions: 122271812
Number of successful extensions: 305791
Number of sequences better than 100.0: 30
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 305721
Number of HSP's gapped (non-prelim): 30
length of query: 227
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 90
effective length of database: 9,144,741,214
effective search space: 823026709260
effective search space used: 823026709260
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 74 (33.1 bits)