BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019804
         (335 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297735250|emb|CBI17612.3| unnamed protein product [Vitis vinifera]
          Length = 346

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/225 (80%), Positives = 204/225 (90%), Gaps = 1/225 (0%)

Query: 61  PFTLRVTNDSTTST-ELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRD 119
           PF L+VTND    T E+SI++++SEADKIVDGMDFGELCNEFECISSPLVESTARQL RD
Sbjct: 89  PFPLKVTNDPNRRTAEVSIESERSEADKIVDGMDFGELCNEFECISSPLVESTARQLARD 148

Query: 120 ILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSV 179
           ILELREGNRALGTFAVSVKYKDP RSFTGREKYKRRLW T ALD+PSVTVQEMVMLSTSV
Sbjct: 149 ILELREGNRALGTFAVSVKYKDPVRSFTGREKYKRRLWVTDALDDPSVTVQEMVMLSTSV 208

Query: 180 LSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFW 239
           LSIKWT++GKPKS++A+IGGDLI++V S+FTLNQISGQVIEHEE WDLSASS +A+A+FW
Sbjct: 209 LSIKWTIKGKPKSLLASIGGDLIIRVNSQFTLNQISGQVIEHEEFWDLSASSAIAQAYFW 268

Query: 240 ASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           ASRRL+A TEAGKD  D + N  S++STE+ENL IYPDPSGDPTK
Sbjct: 269 ASRRLFAATEAGKDFADSVKNWGSRLSTEKENLEIYPDPSGDPTK 313


>gi|225430926|ref|XP_002277168.1| PREDICTED: uncharacterized protein LOC100263239 [Vitis vinifera]
          Length = 301

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/225 (80%), Positives = 204/225 (90%), Gaps = 1/225 (0%)

Query: 61  PFTLRVTNDSTTST-ELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRD 119
           PF L+VTND    T E+SI++++SEADKIVDGMDFGELCNEFECISSPLVESTARQL RD
Sbjct: 44  PFPLKVTNDPNRRTAEVSIESERSEADKIVDGMDFGELCNEFECISSPLVESTARQLARD 103

Query: 120 ILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSV 179
           ILELREGNRALGTFAVSVKYKDP RSFTGREKYKRRLW T ALD+PSVTVQEMVMLSTSV
Sbjct: 104 ILELREGNRALGTFAVSVKYKDPVRSFTGREKYKRRLWVTDALDDPSVTVQEMVMLSTSV 163

Query: 180 LSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFW 239
           LSIKWT++GKPKS++A+IGGDLI++V S+FTLNQISGQVIEHEE WDLSASS +A+A+FW
Sbjct: 164 LSIKWTIKGKPKSLLASIGGDLIIRVNSQFTLNQISGQVIEHEEFWDLSASSAIAQAYFW 223

Query: 240 ASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           ASRRL+A TEAGKD  D + N  S++STE+ENL IYPDPSGDPTK
Sbjct: 224 ASRRLFAATEAGKDFADSVKNWGSRLSTEKENLEIYPDPSGDPTK 268


>gi|255547550|ref|XP_002514832.1| conserved hypothetical protein [Ricinus communis]
 gi|223545883|gb|EEF47386.1| conserved hypothetical protein [Ricinus communis]
          Length = 312

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 191/268 (71%), Positives = 215/268 (80%), Gaps = 16/268 (5%)

Query: 19  PTNGGFLYRSHPNSKKCIFTFNPFANSSSSSHHHHHHHHHHSPFTLRVTNDSTTSTELS- 77
           P  GG   RS P+  KCIFT   F  S+SS  +         PF LRVTNDS   TELS 
Sbjct: 26  PATGGLHRRSFPS--KCIFT--SFRPSTSSPRY---------PFLLRVTNDSN-RTELSS 71

Query: 78  -IQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVS 136
            +   +SEADKIVDGMDFGELCNEFECISSPLVE+TARQL RDILELREGNRALGTFAVS
Sbjct: 72  DVAVGRSEADKIVDGMDFGELCNEFECISSPLVEATARQLARDILELREGNRALGTFAVS 131

Query: 137 VKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIAN 196
           V+YKDP RSFTGREKYKR LW T ALDNPSV VQEMVMLSTSVLSIKWT++GKPKS +A+
Sbjct: 132 VRYKDPVRSFTGREKYKRPLWVTGALDNPSVAVQEMVMLSTSVLSIKWTIKGKPKSFLAS 191

Query: 197 IGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFD 256
            GG+LI+KV S+FTLNQISGQVIEHEE+WDLSASSP+A+AFFW SRRL+AT E GKD  D
Sbjct: 192 AGGELIIKVNSQFTLNQISGQVIEHEEVWDLSASSPIAQAFFWTSRRLFATLETGKDLSD 251

Query: 257 LINNLKSKISTEQENLGIYPDPSGDPTK 284
            + NL +++  E+ENL I+PDPSGDPTK
Sbjct: 252 FVKNLTTRLPAERENLEIFPDPSGDPTK 279


>gi|147819671|emb|CAN76393.1| hypothetical protein VITISV_020863 [Vitis vinifera]
          Length = 298

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 178/219 (81%), Positives = 199/219 (90%), Gaps = 1/219 (0%)

Query: 67  TNDSTTST-ELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELRE 125
           TNDS   T E+SI++++SEADKIVDGMDFGELCNEFECISSPLVESTARQL RDILELRE
Sbjct: 47  TNDSNRRTAEVSIESERSEADKIVDGMDFGELCNEFECISSPLVESTARQLARDILELRE 106

Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWT 185
           GNRALGTFAVSVKYKDP RSFTGREKYKRRLW T ALD+PSVTVQEMVMLSTSVLSIKWT
Sbjct: 107 GNRALGTFAVSVKYKDPVRSFTGREKYKRRLWVTDALDDPSVTVQEMVMLSTSVLSIKWT 166

Query: 186 LRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLY 245
           ++GKPKS++A+IGGDLI++V S+FTLNQISGQVIEHEE WDLSASS +A+A+FWASRRL+
Sbjct: 167 IKGKPKSLLASIGGDLIIRVNSQFTLNQISGQVIEHEEFWDLSASSAIAQAYFWASRRLF 226

Query: 246 ATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           A TEAGKD  D + N  S++STE ENL IYPDPSGDPTK
Sbjct: 227 AATEAGKDFADSVKNWGSRLSTEXENLEIYPDPSGDPTK 265


>gi|224128790|ref|XP_002328967.1| predicted protein [Populus trichocarpa]
 gi|222839201|gb|EEE77552.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 191/266 (71%), Positives = 214/266 (80%), Gaps = 16/266 (6%)

Query: 22  GGFLYRSHPNSKKCIFTFNPFANSSSSSHHHHHHHHHHSPFTLRVTNDSTTSTELSIQ-- 79
           G F  +   +S + IFT  PF    +SS HH        PF LRVTNDS+  TELS    
Sbjct: 21  GSFHRQCCRSSNRFIFTCRPF----TSSLHH--------PFILRVTNDSS-RTELSPAEP 67

Query: 80  -NQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVK 138
            N+ SEAD+IV GMDFGELCNEFECISSPLVESTARQLVRDIL+LREGNRALGTFAVSV+
Sbjct: 68  ANESSEADRIVGGMDFGELCNEFECISSPLVESTARQLVRDILQLREGNRALGTFAVSVR 127

Query: 139 YKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIG 198
           YKDP RSFTGREKYKR LWAT ALDNPSVTVQEM+MLSTS+LSIKWT+ GKPKS IA +G
Sbjct: 128 YKDPVRSFTGREKYKRPLWATGALDNPSVTVQEMIMLSTSILSIKWTITGKPKSFIAGVG 187

Query: 199 GDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLI 258
           GDLIVKV+SKFTLNQISGQVIEHEE WDLSASS V +AFFW SRRL+AT E GKD  DL+
Sbjct: 188 GDLIVKVHSKFTLNQISGQVIEHEEFWDLSASSVVDQAFFWTSRRLFATIENGKDLSDLV 247

Query: 259 NNLKSKISTEQENLGIYPDPSGDPTK 284
            +L ++ S++ EN+ IYPDPSGDPTK
Sbjct: 248 KSLTNRPSSKNENMEIYPDPSGDPTK 273


>gi|356568505|ref|XP_003552451.1| PREDICTED: uncharacterized protein LOC100791782 [Glycine max]
          Length = 307

 Score =  353 bits (907), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 187/271 (69%), Positives = 209/271 (77%), Gaps = 10/271 (3%)

Query: 42  FANSSSSSHHHHHHHHHHSPFTLRVTND---STTSTELSIQNQKSEADKIVDGMDFGELC 98
           F +SS+      H HH+HS F LRV ND   S  S++ +I+   SEADK+VDGMDFGELC
Sbjct: 33  FHSSSTCMPSFLHDHHYHS-FLLRVANDADRSEVSSDTAIKTSYSEADKMVDGMDFGELC 91

Query: 99  NEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWA 158
           NEFECISSPLVESTARQL RDILELREGNRALG FAVSV YKDP RSFTGREKYKRRLWA
Sbjct: 92  NEFECISSPLVESTARQLARDILELREGNRALGIFAVSVAYKDPIRSFTGREKYKRRLWA 151

Query: 159 TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQV 218
           T ALDNPSVTVQEMVMLSTSVLSIKWT+RGKPKS+   +GGDLI++V SKFTLNQISGQV
Sbjct: 152 TGALDNPSVTVQEMVMLSTSVLSIKWTIRGKPKSV---LGGDLIIRVTSKFTLNQISGQV 208

Query: 219 IEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDP 278
           IEHEE WDLSASS  A+AFFW SR L+A  E+ KD  D   NL SK+ST++ENL +YPDP
Sbjct: 209 IEHEEFWDLSASSASAQAFFWTSRALFAAVESVKDLGDSAKNLSSKMSTKKENLEMYPDP 268

Query: 279 SGDPTKVSSDSRCF---CFCFCLTSNVIYIA 306
           SGDPTK       F    +   L   VIY+ 
Sbjct: 269 SGDPTKFFQRDDSFQQDAYQIALLLAVIYLV 299


>gi|255638033|gb|ACU19331.1| unknown [Glycine max]
          Length = 307

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 186/271 (68%), Positives = 208/271 (76%), Gaps = 10/271 (3%)

Query: 42  FANSSSSSHHHHHHHHHHSPFTLRVTND---STTSTELSIQNQKSEADKIVDGMDFGELC 98
           F +SS+      H HH+HS F LRV ND   S  S++ +I+   SEADK+VDGMDFGELC
Sbjct: 33  FHSSSTCMPSFLHDHHYHS-FLLRVANDADRSEVSSDTAIKTSYSEADKMVDGMDFGELC 91

Query: 99  NEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWA 158
           NEFECISSPLVESTARQL RDILELREGNRALG FAVSV YKDP RSFTGREKYKRRLWA
Sbjct: 92  NEFECISSPLVESTARQLARDILELREGNRALGIFAVSVAYKDPIRSFTGREKYKRRLWA 151

Query: 159 TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQV 218
           T ALDNPSVTVQEMVMLSTSVLSIKWT+RGKPKS+   +GGDLI++V SKFT NQISGQV
Sbjct: 152 TGALDNPSVTVQEMVMLSTSVLSIKWTIRGKPKSV---LGGDLIIRVTSKFTFNQISGQV 208

Query: 219 IEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDP 278
           IEHEE WDLSASS  A+AFFW SR L+A  E+ KD  D   NL SK+ST++ENL +YPDP
Sbjct: 209 IEHEEFWDLSASSASAQAFFWTSRALFAAVESVKDLGDSAKNLSSKMSTKKENLEMYPDP 268

Query: 279 SGDPTKVSSDSRCF---CFCFCLTSNVIYIA 306
           SGDPTK       F    +   L   VIY+ 
Sbjct: 269 SGDPTKFFQRDDSFQQDAYQIALLLAVIYLV 299


>gi|449442645|ref|XP_004139091.1| PREDICTED: uncharacterized protein LOC101206988 [Cucumis sativus]
 gi|449476192|ref|XP_004154667.1| PREDICTED: uncharacterized protein LOC101230689 [Cucumis sativus]
          Length = 305

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 177/310 (57%), Positives = 223/310 (71%), Gaps = 21/310 (6%)

Query: 1   MSACACASVASALATAPSPTNGGFLYRSHPNSKKCIFTFNPFANSSSSSHHHHHHHHHHS 60
           MS C    V+S     PSP     ++R     K       P A +   S   H       
Sbjct: 1   MSTC----VSSPSLAVPSPAAASTIFRR----KILRGATTPLAVTPPPSQSRHA------ 46

Query: 61  PFTLRVT-NDSTTSTELSIQN--QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLV 117
            F LR   NDS ++ +L+ ++  ++SEADKIVDGMDFGELCNEFECISSPLVESTARQL 
Sbjct: 47  -FPLRAAANDSNSTEQLTAESAVERSEADKIVDGMDFGELCNEFECISSPLVESTARQLA 105

Query: 118 RDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLST 177
           RDIL+LR+G+R+LG FAV VKYKDP R FTGREKYKR+LWAT+ALDNP+ +VQEMVM+ST
Sbjct: 106 RDILQLRQGDRSLGNFAVFVKYKDPIRKFTGREKYKRQLWATSALDNPTTSVQEMVMMST 165

Query: 178 SVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAF 237
           S+L IKWT++GKPKS++A IGGDLI+KV S+FTLNQISGQVIEHEE WD+S+SS +++AF
Sbjct: 166 SILKIKWTIKGKPKSLVAAIGGDLIIKVDSQFTLNQISGQVIEHEESWDVSSSSAISQAF 225

Query: 238 FWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTKVSSDSRCF---CF 294
           FWASR L+A+ EAGKD  D +++L  ++STE++NL ++PDPSGDPTK       F    +
Sbjct: 226 FWASRYLFASAEAGKDLGDSVSSLTGRVSTEKQNLEMFPDPSGDPTKFFQGEDNFQKDAY 285

Query: 295 CFCLTSNVIY 304
            F L   +IY
Sbjct: 286 QFALLLAIIY 295


>gi|363807240|ref|NP_001242357.1| uncharacterized protein LOC100780920 precursor [Glycine max]
 gi|255639967|gb|ACU20276.1| unknown [Glycine max]
          Length = 307

 Score =  346 bits (887), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 183/270 (67%), Positives = 206/270 (76%), Gaps = 9/270 (3%)

Query: 42  FANSSSSSHHHHHHHHHHSPFTLRVTND---STTSTELSIQNQKSEADKIVDGMDFGELC 98
           F +SSS+      H HH+  F LRV ND   S  S++ + +   SEADK+VDGMDFGELC
Sbjct: 32  FFHSSSTCMPSSLHDHHYHSFLLRVANDADRSEMSSDTATEASYSEADKMVDGMDFGELC 91

Query: 99  NEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWA 158
           NEFECISSPLVESTARQL RDILELRE NRALG FAVSV YKDP RSFTGREKYKRRLWA
Sbjct: 92  NEFECISSPLVESTARQLARDILELREENRALGIFAVSVSYKDPIRSFTGREKYKRRLWA 151

Query: 159 TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQV 218
           T+ALDNPSVTVQEMVMLSTSVLSI+WT+RGKPKS+   +GGDLI++V SKFTLNQ SGQV
Sbjct: 152 TSALDNPSVTVQEMVMLSTSVLSIRWTIRGKPKSV---LGGDLILRVTSKFTLNQTSGQV 208

Query: 219 IEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDP 278
           IEHEE WDLSASS  A+AFFW SR L+A  E+ KD  D   NL SKIST++ENL +YPDP
Sbjct: 209 IEHEEFWDLSASSASAQAFFWTSRALFAAVESVKDLGDNAKNLSSKISTKKENLEMYPDP 268

Query: 279 SGDPTKVSSDSRCF---CFCFCLTSNVIYI 305
           SGDPTK       F    +   L   VIY+
Sbjct: 269 SGDPTKFFQRDDSFQQDVYQIALLLAVIYL 298


>gi|388521943|gb|AFK49033.1| unknown [Lotus japonicus]
          Length = 309

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 169/233 (72%), Positives = 194/233 (83%), Gaps = 4/233 (1%)

Query: 55  HHHHHSPFTLRVTNDSTTS---TELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVES 111
           H HHHS F LRV ND+  +   ++ + +   S+ DKIVDG+DFGELCN+FECISSPLVES
Sbjct: 45  HQHHHS-FLLRVANDADRTELPSDAATETSYSDVDKIVDGLDFGELCNDFECISSPLVES 103

Query: 112 TARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQE 171
           TARQLVRDILELREGNRALGTFA SV+YKDP RSFTGR+KYKR LWAT+AL+NPSVTVQE
Sbjct: 104 TARQLVRDILELREGNRALGTFAFSVRYKDPIRSFTGRDKYKRSLWATSALENPSVTVQE 163

Query: 172 MVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASS 231
           MVMLSTSVLSIKWT++GKPKS IA +GGD+I+++ SKFTLNQISGQVIEHEE WDLSASS
Sbjct: 164 MVMLSTSVLSIKWTMKGKPKSFIARVGGDVILRITSKFTLNQISGQVIEHEEFWDLSASS 223

Query: 232 PVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
             A AFFW SR L+AT E+ KD  D   N+ S   +E+ENL IYPDPSGDPTK
Sbjct: 224 ASAHAFFWTSRALFATVESVKDLGDSAKNMSSNFESEKENLEIYPDPSGDPTK 276


>gi|357507799|ref|XP_003624188.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
 gi|355499203|gb|AES80406.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
          Length = 308

 Score =  320 bits (819), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 160/226 (70%), Positives = 184/226 (81%), Gaps = 5/226 (2%)

Query: 64  LRVTNDSTT---STELSIQNQKSEA-DKIVDGMDFGELCNEFECISSPLVESTARQLVRD 119
           LRV ND+     S++ + +N  S+A DK+VD MDFGELCNEFEC SSP VESTARQL RD
Sbjct: 50  LRVANDADRTELSSDTTTENSYSDAADKMVDSMDFGELCNEFECNSSPSVESTARQLARD 109

Query: 120 ILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSV 179
           I ELR GNRALGT+AVSV YKDP RSF+GREKYKR LWA TAL+NPSVTVQEM MLSTSV
Sbjct: 110 IFELRSGNRALGTYAVSVTYKDPIRSFSGREKYKRPLWAITALENPSVTVQEMTMLSTSV 169

Query: 180 LSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFW 239
           LSIKWT+RGKPKSI+A +GG LI++V SKFTLNQISGQVI+HEELWDLS+S   A+AFFW
Sbjct: 170 LSIKWTIRGKPKSILAGVGGGLILRVTSKFTLNQISGQVIKHEELWDLSSSPASAQAFFW 229

Query: 240 ASRRLYATTEAGKDSFDLINNLKSKIST-EQENLGIYPDPSGDPTK 284
           +SR L+AT E+ KD  D   NL +  ST ++EN  IYPDPSGDPTK
Sbjct: 230 SSRVLFATVESVKDLADSAKNLSASFSTKKEENSDIYPDPSGDPTK 275


>gi|22330431|ref|NP_176702.2| uncharacterized protein [Arabidopsis thaliana]
 gi|20466568|gb|AAM20601.1| unknown protein [Arabidopsis thaliana]
 gi|23198128|gb|AAN15591.1| unknown protein [Arabidopsis thaliana]
 gi|332196225|gb|AEE34346.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 286

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 154/228 (67%), Positives = 185/228 (81%), Gaps = 7/228 (3%)

Query: 57  HHHSPFTLRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQL 116
           HH  P   +V NDST  TE+SI   KSE DK+VD +DFGELCN+FEC SSP VESTARQL
Sbjct: 33  HHRLP---QVMNDST-RTEVSID--KSEVDKLVDKIDFGELCNDFECTSSPQVESTARQL 86

Query: 117 VRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLS 176
           VRDILE+REGNRA   +AVSVKYKDP RSFTGREKYKR +W T+ L+NP+VTVQEMVMLS
Sbjct: 87  VRDILEIREGNRAFACYAVSVKYKDPVRSFTGREKYKRPMWITSGLENPTVTVQEMVMLS 146

Query: 177 TSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARA 236
           TSVL IKWT++GKPKSI+A + GDLIVKV S+FTLNQISGQV EHEE WDLS+SSP+A+A
Sbjct: 147 TSVLRIKWTVKGKPKSILAAVSGDLIVKVKSEFTLNQISGQVFEHEESWDLSSSSPIAQA 206

Query: 237 FFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           +FW SRRL+A +E+ KD  D+  +L + ++T +E+  IY DP+ DP K
Sbjct: 207 YFWTSRRLFAASESAKDVADVTKDLTANLTTRKEDTDIYRDPT-DPNK 253


>gi|297837895|ref|XP_002886829.1| hypothetical protein ARALYDRAFT_475533 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332670|gb|EFH63088.1| hypothetical protein ARALYDRAFT_475533 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 286

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 155/228 (67%), Positives = 185/228 (81%), Gaps = 7/228 (3%)

Query: 57  HHHSPFTLRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQL 116
           HH  P   +V NDST  TE+SI   KSE DK+VD +DFGELCN+FEC SSP VESTARQL
Sbjct: 33  HHRLP---QVMNDST-RTEVSID--KSEVDKLVDRIDFGELCNDFECTSSPQVESTARQL 86

Query: 117 VRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLS 176
           VRDILELREGNRA   +AVSVKYKDP RSFTGREKYKR +W T+ L+NP+VTVQEMVMLS
Sbjct: 87  VRDILELREGNRAFACYAVSVKYKDPVRSFTGREKYKRPMWITSGLENPTVTVQEMVMLS 146

Query: 177 TSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARA 236
           TSVL IKWT++GKPKSI+A + GDLI+KV S+FTLNQISGQVIEHEE WDLS+SSP+A+A
Sbjct: 147 TSVLRIKWTVKGKPKSILAAVSGDLIIKVKSEFTLNQISGQVIEHEESWDLSSSSPIAQA 206

Query: 237 FFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           +FW SRRL+A +E+ KD  D+  +L + ++T  E+  IY DP+ DP K
Sbjct: 207 YFWTSRRLFAASESAKDLSDVTKDLTANLTTRTEDTDIYRDPT-DPNK 253


>gi|4646202|gb|AAD26875.1|AC007230_9 ESTs gb|H76289 and gb|H76537 come from this gene [Arabidopsis
           thaliana]
          Length = 218

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 152/219 (69%), Positives = 182/219 (83%), Gaps = 4/219 (1%)

Query: 68  NDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGN 127
           NDST  TE+SI   KSE DK+VD +DFGELCN+FEC SSP VESTARQLVRDILE+REGN
Sbjct: 2   NDST-RTEVSID--KSEVDKLVDKIDFGELCNDFECTSSPQVESTARQLVRDILEIREGN 58

Query: 128 RALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLR 187
           RA   +AVSVKYKDP RSFTGREKYKR +W T+ L+NP+VTVQEMVMLSTSVL IKWT++
Sbjct: 59  RAFACYAVSVKYKDPVRSFTGREKYKRPMWITSGLENPTVTVQEMVMLSTSVLRIKWTVK 118

Query: 188 GKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYAT 247
           GKPKSI+A + GDLIVKV S+FTLNQISGQV EHEE WDLS+SSP+A+A+FW SRRL+A 
Sbjct: 119 GKPKSILAAVSGDLIVKVKSEFTLNQISGQVFEHEESWDLSSSSPIAQAYFWTSRRLFAA 178

Query: 248 TEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTKVS 286
           +E+ KD  D+  +L + ++T +E+  IY DP+ DP KVS
Sbjct: 179 SESAKDVADVTKDLTANLTTRKEDTDIYRDPT-DPNKVS 216


>gi|357507801|ref|XP_003624189.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
 gi|355499204|gb|AES80407.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
          Length = 231

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 167/198 (84%), Gaps = 1/198 (0%)

Query: 88  IVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFT 147
           +VD MDFGELCNEFEC SSP VESTARQL RDI ELR GNRALGT+AVSV YKDP RSF+
Sbjct: 1   MVDSMDFGELCNEFECNSSPSVESTARQLARDIFELRSGNRALGTYAVSVTYKDPIRSFS 60

Query: 148 GREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYS 207
           GREKYKR LWA TAL+NPSVTVQEM MLSTSVLSIKWT+RGKPKSI+A +GG LI++V S
Sbjct: 61  GREKYKRPLWAITALENPSVTVQEMTMLSTSVLSIKWTIRGKPKSILAGVGGGLILRVTS 120

Query: 208 KFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKIST 267
           KFTLNQISGQVI+HEELWDLS+S   A+AFFW+SR L+AT E+ KD  D   NL +  ST
Sbjct: 121 KFTLNQISGQVIKHEELWDLSSSPASAQAFFWSSRVLFATVESVKDLADSAKNLSASFST 180

Query: 268 -EQENLGIYPDPSGDPTK 284
            ++EN  IYPDPSGDPTK
Sbjct: 181 KKEENSDIYPDPSGDPTK 198


>gi|357126446|ref|XP_003564898.1| PREDICTED: uncharacterized protein LOC100833780 [Brachypodium
           distachyon]
          Length = 302

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 140/207 (67%), Positives = 173/207 (83%), Gaps = 2/207 (0%)

Query: 80  NQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKY 139
           +++SEAD+++DGMDFGELC++FECISSP VESTARQ+VRDILE+REGNRAL  +AV+VKY
Sbjct: 63  DERSEADRLIDGMDFGELCDDFECISSPYVESTARQIVRDILEIREGNRALSCYAVAVKY 122

Query: 140 KDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKP-KSIIANIG 198
           KDP RSF GREKYKR LW T AL+NP+VTVQEM M STS L+IKWTLRGKP   I + I 
Sbjct: 123 KDPLRSFVGREKYKRPLWITEALENPTVTVQEMSMQSTSTLTIKWTLRGKPSNQIFSAIS 182

Query: 199 GDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLI 258
           G+LI++V S+F LNQISGQV+EH E WDLS SSP A+A+FW SRR+Y+T EAGKD+ +  
Sbjct: 183 GELIIRVDSQFVLNQISGQVLEHVESWDLSGSSPPAQAYFWLSRRVYSTVEAGKDTVEAA 242

Query: 259 NNLKSKIS-TEQENLGIYPDPSGDPTK 284
            +L S++S  + +NL +YPDPSGDPTK
Sbjct: 243 KDLASRLSENKDQNLEVYPDPSGDPTK 269


>gi|115441773|ref|NP_001045166.1| Os01g0912700 [Oryza sativa Japonica Group]
 gi|56784400|dbj|BAD82439.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534697|dbj|BAF07080.1| Os01g0912700 [Oryza sativa Japonica Group]
 gi|215687175|dbj|BAG90945.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218189587|gb|EEC72014.1| hypothetical protein OsI_04885 [Oryza sativa Indica Group]
 gi|222619738|gb|EEE55870.1| hypothetical protein OsJ_04510 [Oryza sativa Japonica Group]
          Length = 312

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 139/206 (67%), Positives = 168/206 (81%), Gaps = 2/206 (0%)

Query: 81  QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
           ++S+ DK+VDGMDFGELCN+FECISSP VE+TARQL RDIL+LR+ NRA   +AVSVKYK
Sbjct: 73  ERSQTDKLVDGMDFGELCNDFECISSPYVEATARQLARDILDLRDDNRAFTCYAVSVKYK 132

Query: 141 DPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKS-IIANIGG 199
           DP R+F GREKYKR LW T AL+NP+VTVQEM M STS L+IKWT RGKPK+ I A IGG
Sbjct: 133 DPVRTFVGREKYKRPLWITKALENPTVTVQEMSMQSTSNLTIKWTFRGKPKNPIFATIGG 192

Query: 200 DLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLIN 259
           DLIV V S+F LNQISGQV+E  + WDLSASSP A+A+FW SRR ++T EAGKD+ +   
Sbjct: 193 DLIVSVTSQFVLNQISGQVLEQVDSWDLSASSPPAQAYFWLSRRAFSTVEAGKDTIEAAK 252

Query: 260 NLKSKISTEQ-ENLGIYPDPSGDPTK 284
              S++S+++ ENL +YPDPSGDPTK
Sbjct: 253 GTASRLSSKKDENLEVYPDPSGDPTK 278


>gi|326515272|dbj|BAK03549.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 301

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 138/222 (62%), Positives = 176/222 (79%), Gaps = 7/222 (3%)

Query: 64  LRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILEL 123
           L V +DS       +Q  +SE D++VDG+DFGELCN+FECISSP VESTARQ+ RDILE+
Sbjct: 53  LHVVDDSK-----EVQTARSETDRLVDGLDFGELCNDFECISSPYVESTARQIARDILEI 107

Query: 124 REGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIK 183
           R+ NRAL  +AV+VKYKDP RSF GREKYKR LW T AL+ P+VTVQEM M STS+L+IK
Sbjct: 108 RQDNRALSCYAVAVKYKDPLRSFVGREKYKRPLWITEALEKPTVTVQEMSMQSTSMLTIK 167

Query: 184 WTLRGKPK-SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASR 242
           WTLRGKPK +  A  GG+L+++V S+F LNQISGQV+EH E WDLS+SSP+A+A+FW SR
Sbjct: 168 WTLRGKPKNAFFAAAGGELVLRVDSQFVLNQISGQVLEHAESWDLSSSSPLAQAYFWFSR 227

Query: 243 RLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           R+Y+T EAGKD+ +    + S+++ + +NL +YPDP GDPTK
Sbjct: 228 RVYSTVEAGKDTIEAAKGVASRLNKD-DNLEVYPDPLGDPTK 268


>gi|300681438|emb|CBH32530.1| conserved hypothetical protein, expressed [Triticum aestivum]
          Length = 296

 Score =  286 bits (732), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 133/204 (65%), Positives = 167/204 (81%), Gaps = 2/204 (0%)

Query: 82  KSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKD 141
           +SE D++VDG++FGELCN+FECISSP VESTARQ+ RDILE+R+ NRAL  +AV+VKYKD
Sbjct: 61  RSETDRLVDGLNFGELCNDFECISSPYVESTARQIARDILEIRQDNRALSCYAVAVKYKD 120

Query: 142 PTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPK-SIIANIGGD 200
           P RSF GREKYKR LW T AL+ P+VTVQEM M STS L+IKWTLRGKPK +  A  GG+
Sbjct: 121 PLRSFVGREKYKRPLWITEALEKPTVTVQEMSMQSTSTLTIKWTLRGKPKNAFFAAAGGE 180

Query: 201 LIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINN 260
           L+V+V S+F LNQISGQV+EH E WDLSASSP+A+ +FW SRR+Y+T EAGKD+ +    
Sbjct: 181 LVVRVDSQFVLNQISGQVLEHFESWDLSASSPLAQTYFWFSRRVYSTVEAGKDTIEAAKG 240

Query: 261 LKSKISTEQENLGIYPDPSGDPTK 284
           + S+++ + +NL +YPDP GDPTK
Sbjct: 241 VVSRLNKD-DNLEVYPDPLGDPTK 263


>gi|219363629|ref|NP_001136449.1| uncharacterized protein LOC100216557 [Zea mays]
 gi|194695746|gb|ACF81957.1| unknown [Zea mays]
 gi|413951566|gb|AFW84215.1| hypothetical protein ZEAMMB73_334715 [Zea mays]
          Length = 302

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 141/241 (58%), Positives = 174/241 (72%), Gaps = 10/241 (4%)

Query: 47  SSSHHHHHHHHHHSPFTLRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISS 106
           S +H         +P  L V +DS    E     ++SE DK+VDGMDFGELC++FECISS
Sbjct: 35  SGAHSPPRRRLRLAP--LHVVDDSKNPPE-----ERSETDKMVDGMDFGELCSDFECISS 87

Query: 107 PLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPS 166
           P VESTARQ+ RDILELRE NRA   +AVSVKYKDP R+F GREKY+R LW T AL+NP 
Sbjct: 88  PYVESTARQVARDILELREDNRAFNCYAVSVKYKDPLRTFVGREKYRRPLWITKALENPV 147

Query: 167 VTVQEMVMLSTSVLSIKWTLRGKPKS-IIANIGGDLIVKVYSKFTLNQISGQVIEHEELW 225
           VTVQEM M STS L+I+W LRGKPK+ + A++GGD+IV+V S+F LNQISGQV+E  E W
Sbjct: 148 VTVQEMSMQSTSNLTIRWALRGKPKNPLFASVGGDVIVRVESRFVLNQISGQVLEQAESW 207

Query: 226 DLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQ--ENLGIYPDPSGDPT 283
           DLSASS  A+A+FW SRR+Y+  E+GKD+ +   +  S   T +  +NL  YPDPS DPT
Sbjct: 208 DLSASSLPAQAYFWLSRRVYSAVESGKDAIEAAKSAASSGLTTKGDQNLEAYPDPSADPT 267

Query: 284 K 284
           K
Sbjct: 268 K 268


>gi|242055359|ref|XP_002456825.1| hypothetical protein SORBIDRAFT_03g043540 [Sorghum bicolor]
 gi|241928800|gb|EES01945.1| hypothetical protein SORBIDRAFT_03g043540 [Sorghum bicolor]
          Length = 302

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 134/220 (60%), Positives = 165/220 (75%), Gaps = 5/220 (2%)

Query: 66  VTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELRE 125
           V +DS+T     +  ++S  DK+VDGMDFGELCN+FECISSP VESTARQ+ RDILELR+
Sbjct: 53  VVDDSSTD----LPEERSRTDKMVDGMDFGELCNDFECISSPYVESTARQIARDILELRD 108

Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWT 185
            NRA   +AVSVKYKDP R+F GREKYKR LW T AL+ P VTVQEM M STS L+IKW 
Sbjct: 109 DNRAFNCYAVSVKYKDPLRTFVGREKYKRPLWITKALEKPVVTVQEMSMQSTSNLTIKWA 168

Query: 186 LRGKPKS-IIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRL 244
            RGKPK+   A +GGD+IV+V S+F LNQISGQV+E  + WDLSASS  A+A+FW SRR+
Sbjct: 169 FRGKPKNPFFATMGGDVIVRVVSQFVLNQISGQVLEQVDSWDLSASSFPAQAYFWLSRRV 228

Query: 245 YATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
           Y+T E+GKD+ +   +  S ++   +N   YPDPS DPTK
Sbjct: 229 YSTVESGKDTIEAAKSTTSGLTKGDQNFEAYPDPSSDPTK 268


>gi|302790600|ref|XP_002977067.1| hypothetical protein SELMODRAFT_106566 [Selaginella moellendorffii]
 gi|300155043|gb|EFJ21676.1| hypothetical protein SELMODRAFT_106566 [Selaginella moellendorffii]
          Length = 260

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 128/219 (58%), Positives = 163/219 (74%), Gaps = 8/219 (3%)

Query: 70  STTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRA 129
           STT +   ++ ++S ADK+VDGM FG+LC+EF+CISSP VEST RQL RDILE+R+GNRA
Sbjct: 13  STTKSPPPVELERSPADKLVDGMSFGQLCDEFQCISSPSVESTCRQLARDILEIRDGNRA 72

Query: 130 LGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGK 189
           LGT A  VKYKDP RSF GR KY R  W  TAL+NP+V V++M M+STSVL I+WT++G+
Sbjct: 73  LGTLAYFVKYKDPLRSFQGRLKYNRPSWTRTALENPTVAVRQMEMVSTSVLRIQWTIQGR 132

Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTE 249
           PK   A+  GD++V V S FTLNQISGQV+EH E WDLS +SP+A+A+FW SR  Y+T E
Sbjct: 133 PKLAAASFVGDVLVNVTSTFTLNQISGQVLEHREEWDLSGNSPLAQAYFWLSRIAYSTVE 192

Query: 250 AGKDSFDLINNLKSKISTEQENLG----IYPDPSGDPTK 284
           AGKD+ +L+  +   I    +NLG    IY DP+ DP K
Sbjct: 193 AGKDTSELVQGVSKVID---KNLGDDSSIYVDPT-DPKK 227


>gi|168049027|ref|XP_001776966.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671667|gb|EDQ58215.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 237

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 117/205 (57%), Positives = 156/205 (76%), Gaps = 1/205 (0%)

Query: 81  QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
           +K+E D++++G+ FG+LC+EFECISSP VE TARQLV+DI++LREG R+L +F V+VKYK
Sbjct: 1   EKTEVDRLMEGLSFGQLCDEFECISSPAVEKTARQLVKDIMDLREGRRSLSSFGVNVKYK 60

Query: 141 DPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIAN-IGG 199
           DP RSF GR+KY+   W +TAL+ PSV V+EM MLSTSVL+IKWTL G PK   A+ +GG
Sbjct: 61  DPLRSFKGRDKYRSANWISTALEKPSVAVREMKMLSTSVLNIKWTLTGTPKLPPASALGG 120

Query: 200 DLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLIN 259
            +++ V S FT+NQISGQV+ HE+ WDLSAS P A+A+FW +R  ++  E GK+  D + 
Sbjct: 121 KVVLAVNSTFTMNQISGQVVLHEDQWDLSASDPAAQAYFWTTRLAFSAVEGGKNLADGVQ 180

Query: 260 NLKSKISTEQENLGIYPDPSGDPTK 284
            +  ++   QEN  IYPDPSGDP K
Sbjct: 181 GMAKQLDKGQENNSIYPDPSGDPRK 205


>gi|302763149|ref|XP_002964996.1| hypothetical protein SELMODRAFT_83381 [Selaginella moellendorffii]
 gi|300167229|gb|EFJ33834.1| hypothetical protein SELMODRAFT_83381 [Selaginella moellendorffii]
          Length = 260

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 126/219 (57%), Positives = 161/219 (73%), Gaps = 8/219 (3%)

Query: 70  STTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRA 129
           STT +   ++ ++S ADK+VDGM FG+LC+EF+CISSP VE+T RQL RDILE+R+GNRA
Sbjct: 13  STTKSPPPVELERSPADKLVDGMSFGQLCDEFQCISSPSVEATCRQLARDILEIRDGNRA 72

Query: 130 LGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGK 189
           LGT A  VKYKDP RSF GR KY R  W  TAL+NP+V V++M M+STSVL I+ T++G+
Sbjct: 73  LGTLAYFVKYKDPLRSFQGRLKYNRPSWTRTALENPTVAVRQMEMVSTSVLRIQCTIQGR 132

Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTE 249
           PK   A+  GD+ V V S FTLNQISGQV+EH E WDLS +SP+A+A+FW SR  Y+T E
Sbjct: 133 PKLAAASFVGDVRVNVTSTFTLNQISGQVLEHREEWDLSGNSPLAQAYFWFSRIAYSTVE 192

Query: 250 AGKDSFDLINNLKSKISTEQENLG----IYPDPSGDPTK 284
           AGKD+ +L+  +   I    +NLG    IY DP+ DP K
Sbjct: 193 AGKDTSELVQGVSKVID---KNLGDDSNIYVDPT-DPKK 227


>gi|168031127|ref|XP_001768073.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680711|gb|EDQ67145.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 271

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 117/223 (52%), Positives = 157/223 (70%), Gaps = 3/223 (1%)

Query: 65  RVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELR 124
           R   D  T++ +    +K+E D+++DG+ FG+LC+EFECISSP VE TARQLV+DI++LR
Sbjct: 16  RGPEDGATTSVVDKTAEKTEVDRLMDGLSFGQLCDEFECISSPAVERTARQLVKDIIDLR 75

Query: 125 EGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKW 184
           EG R+L  F V+V+YKDP RSF GR+KYK   W  TAL+ P+V V+EM M+STSVL IKW
Sbjct: 76  EGERSLSNFGVNVEYKDPLRSFKGRDKYKGANWIKTALEKPTVAVREMKMISTSVLKIKW 135

Query: 185 TLRGKPKSIIAN-IGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRR 243
           T+ G PK   A+ +GG +++ V S FT+NQISGQV  HE+ WDLSAS P A+A+FW +R 
Sbjct: 136 TVTGTPKLPPASALGGRVVLAVSSTFTMNQISGQVTLHEDEWDLSASDPAAQAYFWTARL 195

Query: 244 LYATTEAGKDSFDLINNLKSKISTEQEN--LGIYPDPSGDPTK 284
            ++  E GK+    +  +  ++   QEN    IYPDPSGDP K
Sbjct: 196 AFSAVEGGKNFASGVQGIAKQLDKGQENNSNSIYPDPSGDPRK 238


>gi|255086449|ref|XP_002509191.1| predicted protein [Micromonas sp. RCC299]
 gi|226524469|gb|ACO70449.1| predicted protein [Micromonas sp. RCC299]
          Length = 214

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 68/192 (35%), Positives = 99/192 (51%), Gaps = 10/192 (5%)

Query: 96  ELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRR 155
           ++C++F C SSP VES+ RQ+  DI  +RE  R+L  FAV VKY D  R F GRE +   
Sbjct: 1   QICDDFVCKSSPAVESSLRQIATDICAIREDKRSLNPFAVDVKYDDGERKFEGREGFANH 60

Query: 156 LWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQIS 215
            +    +D+    V EM M      +I W L+G+      N GG + V+V +   LN I+
Sbjct: 61  TYIKDNVDDARAAVTEMRMGDLDAATIVWRLQGR------NSGGAIDVEVTTTLRLNLIT 114

Query: 216 GQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKD-SFDLINNLKSKI--STEQENL 272
           G+  E  E WD + S   A A   +SR+  A  +   D +  L  +++ K   S + E  
Sbjct: 115 GRATEVTERWDPAGSDAGAAAVLASSRKATALPKNIADAAARLGKSIEEKFGSSDDGEMK 174

Query: 273 GIYPDPSGDPTK 284
            +Y DP+ DP K
Sbjct: 175 DVYVDPN-DPMK 185


>gi|303284547|ref|XP_003061564.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456894|gb|EEH54194.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 313

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 98/199 (49%), Gaps = 17/199 (8%)

Query: 96  ELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRR 155
           ++C+EF C SSP VE+  RQ+  D+  LRE  R+L  FA  ++Y D  R F GRE +K  
Sbjct: 86  QICDEFVCKSSPAVEAWLRQIATDVCALRENARSLTPFANDIEYDDGARVFKGREGFKNH 145

Query: 156 LWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQIS 215
            +    + NP+  V+ M M +     I+W LRG+        GG L  K+ S  T+N I+
Sbjct: 146 AFIAENIGNPAAAVESMRMNAVDEAVIEWRLRGQ------TPGGALDAKITSTLTMNLIT 199

Query: 216 GQVIEHEELWDLS--ASSPVARAFFWASRRLYATTE--------AGKDSFDLINNLKSKI 265
           G+     E WD++   S P A AF  ++R+  A  +          K+  D + ++   +
Sbjct: 200 GRATRWTEAWDVAGGGSDPGAAAFVASTRKASALPKNAADAATAVAKELDDALGSVAGSL 259

Query: 266 STEQENLGIYPDPSGDPTK 284
               E   +  DP+ DP K
Sbjct: 260 GLGGEEREVQVDPN-DPMK 277


>gi|194695636|gb|ACF81902.1| unknown [Zea mays]
          Length = 90

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 40/63 (63%), Positives = 50/63 (79%)

Query: 81  QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
           ++S+ DK+VDGMDF ELC++FECISSP VES  RQ+ RDI ELRE NRA   +AV VKY+
Sbjct: 28  ERSQTDKMVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNRAFSCYAVPVKYE 87

Query: 141 DPT 143
            P+
Sbjct: 88  APS 90


>gi|194694442|gb|ACF81305.1| unknown [Zea mays]
          Length = 68

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 40/63 (63%), Positives = 50/63 (79%)

Query: 81  QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
           ++S+ DK+VDGMDF ELC++FECISSP VES  RQ+ RDI ELRE NRA   +AV VKY+
Sbjct: 6   ERSQTDKMVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNRAFSCYAVPVKYE 65

Query: 141 DPT 143
            P+
Sbjct: 66  APS 68


>gi|145352042|ref|XP_001420368.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580602|gb|ABO98661.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 73/232 (31%), Positives = 112/232 (48%), Gaps = 26/232 (11%)

Query: 89  VDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTG 148
           VDG+   E C++F C SSP VE+T + L++DI   R  NR+   ++  V Y D    F G
Sbjct: 26  VDGL--REPCDDFVCKSSPAVENTLKSLIKDINAARGTNRSAAPYSPDVTYDDGALRFAG 83

Query: 149 REKYKRRL-WATTALDNPSVTVQEMVML-STSVLSIKWTLRGKPKSIIANIGGDLIVKVY 206
            EKYKR   +    L   +  V E+ M  S  V ++KW L         N  G + V V 
Sbjct: 84  SEKYKRYCSYIENNLRQVTTRVTEISMEGSLDVATVKWELNA------VNDIGRVGVDVT 137

Query: 207 SKFTLNQISGQVIEHEELWDLSASSPVAR--AFFWASRRLYA--------TTEAGKDSFD 256
           + + +N I+G+V+EH E W ++ +   A+  A   ++R+ +A          EA K   +
Sbjct: 138 ATYKMNLITGRVLEHREQWVVNPARTDAQAGALLESTRKAHALPLNAMEVADEAKKKFRE 197

Query: 257 LINNLKSKISTEQENLGIYPDPSGDPTKV---SSDSRCFCFCFCLTSNVIYI 305
           +  +  S+   +QEN  IY DP+ DP K       S+       L + VIY+
Sbjct: 198 ISKSFASE--DDQENSNIYVDPN-DPMKFFQQDDTSKTDLLQLSLVAAVIYL 246


>gi|195604514|gb|ACG24087.1| hypothetical protein [Zea mays]
          Length = 89

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 41/70 (58%), Positives = 52/70 (74%), Gaps = 1/70 (1%)

Query: 70  STTSTELSIQN-QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNR 128
           S   TE+ + + ++S+ DK+VDGMDF ELC++FECISSP VES  RQ+ RDI ELRE NR
Sbjct: 13  SRAITEILVSSRERSQTDKMVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNR 72

Query: 129 ALGTFAVSVK 138
           A   +AV VK
Sbjct: 73  AFSCYAVPVK 82


>gi|307109060|gb|EFN57299.1| hypothetical protein CHLNCDRAFT_143902 [Chlorella variabilis]
          Length = 249

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 67/214 (31%), Positives = 107/214 (50%), Gaps = 10/214 (4%)

Query: 98  CNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW 157
           C+EF C SSP VE++ RQL +D+   R   R    +A +V+Y+D  R F G + Y+R  +
Sbjct: 28  CDEFVCTSSPSVEASVRQLAKDLQ--RANGRWAPIYASTVEYRDAYRRFKGPDGYQRLDF 85

Query: 158 ATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQ 217
            +T +   +V V  M ML  S   I+W L GK   +  ++ G       ++  +N ++G+
Sbjct: 86  VSTNVQQAAVAVTGMRMLDNSSAEIRWRLTGKLGVLPIDVAG------TTEIEMNLLTGR 139

Query: 218 VIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINN-LKSKISTEQENLGIYP 276
           +  H E WDL+A SP A A + ASR L+A  +A  D+       L +  S + E+    P
Sbjct: 140 IERHREKWDLAACSPPAAAAWNASRALWAAKQASGDAQRAAGRALDTLTSVDDEDQYSQP 199

Query: 277 DPSGDPTKVSSDSRCFCFCFCLTSNVIYIAFASV 310
           +P+ DP +       F     L   V+ + +  V
Sbjct: 200 NPN-DPGRFFQQGDTFKQDAALFVGVVMLFWGLV 232


>gi|308809583|ref|XP_003082101.1| unnamed protein product [Ostreococcus tauri]
 gi|116060568|emb|CAL55904.1| unnamed protein product [Ostreococcus tauri]
          Length = 295

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 126/283 (44%), Gaps = 29/283 (10%)

Query: 43  ANSSSSSHHHHHHHHHHSPFTLRVTNDSTTSTE--------LSIQNQKSEADKIVDGMDF 94
           A++  SS        H  P  L + ++S++             +Q    E    VDG  F
Sbjct: 13  ASTRRSSEEGARVDRHRQPRRLGIHHESSSIASPRRDGRRITRVQATDKEFRASVDG--F 70

Query: 95  GELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKR 154
            E C+EF C SSP VE+T + +V+DI   R   R+   +A  V+Y D    F G EKY +
Sbjct: 71  REPCDEFVCKSSPAVENTLKSVVKDINAARGTTRSSKPYAPDVEYDDGVLKFKGSEKYAK 130

Query: 155 RL-WATTALDNPSVTVQEMVMLST-SVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLN 212
              +    L   +  V E+ M  T  V +++W L G       N  G + V + + + +N
Sbjct: 131 YCSYVENNLRQVTTRVTEISMKDTLDVATVRWELNG------TNDIGRVGVDIEATYKMN 184

Query: 213 QISGQVIEHEELWDLSASSPVAR--AFFWASRR-----LYATTEAGKDSFDLINNLKSKI 265
            I+G+V+EH E W ++ S   A+  A   ++R+     L A   A + S  L    KS  
Sbjct: 185 LITGRVLEHRERWTVNPSRTEAQAGALLESTRKAHALPLNAMELADRASQGLKELTKSFS 244

Query: 266 STEQENLGIYPDPSGDPTKV---SSDSRCFCFCFCLTSNVIYI 305
           S ++    IY +P+ DP K       S+       L + VIY+
Sbjct: 245 SGDEGEKNIYVNPN-DPMKFFQQDDTSKTDLLQLSLVAAVIYL 286


>gi|159469448|ref|XP_001692875.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277677|gb|EDP03444.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 280

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 101/208 (48%), Gaps = 23/208 (11%)

Query: 83  SEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGT-FAVSVKYKD 141
           S A K ++ +D  + C++F C SSP VE T R L R   EL+ G  A  T +   V Y D
Sbjct: 52  SIATKTIEDLDT-DYCDDFVCTSSPAVEQTVRSLAR---ELQRGRYAAATLYQPGVTYSD 107

Query: 142 PTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDL 201
             R+FTG E YKR+ W    ++    ++  + ML      I W + G       ++GG  
Sbjct: 108 GFRTFTGVEGYKRQRWVIDNVEKFKTSILRLRMLDKGTSEISWQVDG-------SLGGQP 160

Query: 202 IVKVYSKFTL-NQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINN 260
           +  +++     N ++G++  H E W+LS  SP A A   A+R  ++  +A +D+ + +  
Sbjct: 161 VSLLFTTTCEHNLVTGRITNHRESWNLSRCSPPAAAMATANRYAWSAKQAVEDAKEGLTK 220

Query: 261 LKSKI----STEQENLGIYPDPSGDPTK 284
              K+      +  NL     PS DPT+
Sbjct: 221 AAEKLGGGGGADMNNL-----PS-DPTR 242


>gi|195623420|gb|ACG33540.1| hypothetical protein [Zea mays]
          Length = 56

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 37/56 (66%), Positives = 44/56 (78%)

Query: 88  IVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPT 143
           +VDGMDF ELC++FECISSP VES  RQ+ RDI ELRE NRA   +AV VKY+ P+
Sbjct: 1   MVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNRAFSCYAVPVKYEAPS 56


>gi|302850742|ref|XP_002956897.1| hypothetical protein VOLCADRAFT_119553 [Volvox carteri f.
           nagariensis]
 gi|300257778|gb|EFJ42022.1| hypothetical protein VOLCADRAFT_119553 [Volvox carteri f.
           nagariensis]
          Length = 802

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/169 (30%), Positives = 79/169 (46%), Gaps = 10/169 (5%)

Query: 98  CNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW 157
           C++F C SSP VE T R L R++   R    +L  +   V Y D  R+FTG E + R+ W
Sbjct: 589 CDDFLCTSSPAVEQTVRSLARELKRGRYTTTSL--YQPGVTYSDGFRTFTGPEGFTRQRW 646

Query: 158 ATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLI-VKVYSKFTLNQISG 216
               +  PS+T+ +M ML      I W L G+       +GG L+ V   +    N ++G
Sbjct: 647 VADNVQEPSLTIVKMRMLDKGTSEISWRLVGR-------LGGILLDVSFNTTCEHNLLTG 699

Query: 217 QVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKI 265
           ++    E WDLS   P A      +R  ++  +A  D+ D       K+
Sbjct: 700 RITSLRESWDLSRCPPPAALLATFNRYSWSAKQAVADARDGAGKALQKL 748


>gi|412987779|emb|CCO19175.1| predicted protein [Bathycoccus prasinos]
          Length = 299

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 32/152 (21%)

Query: 98  CNEFECI-SSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYK--- 153
           C+ FEC  +SP VES+ RQL RD+ + ++ NR++  +A  V++ D  R F+G EK+K   
Sbjct: 57  CDAFECKGTSPAVESSLRQLARDLRDGKDSNRSIFPYAKDVRFSDGARRFSGSEKFKNCN 116

Query: 154 ---RRLWATTAL---DNPSVTVQE--------------MVMLSTSVLSIKWTLRGKPKSI 193
                ++    +   DN      +               V+   SV  I W +  K    
Sbjct: 117 CELNSIFGGEKISSSDNSKTNNNDNIRNRKFTNWVDSIEVLPGGSVAKILWRIESKD--- 173

Query: 194 IANIGGDLIVKVYSKFTLNQISGQVIEHEELW 225
                G   VKV +   +N I+G+V+  ++ W
Sbjct: 174 -----GKAKVKVETTLEMNLITGKVLSQDDQW 200


>gi|302840933|ref|XP_002952012.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
           nagariensis]
 gi|300262598|gb|EFJ46803.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
           nagariensis]
          Length = 200

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 60/119 (50%), Gaps = 17/119 (14%)

Query: 118 RDILELR---EGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVM 174
           RD +E +    GN     FA    +KDPT    G E Y R   A  AL +PS +  +++ 
Sbjct: 62  RDFVERQYYVTGNLTPQLFAPDCVFKDPTVEVVGVEPYVR---ALQALFDPSTSRADLIS 118

Query: 175 L---STSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQIS--GQVIEHEELWDLS 228
           +   + S + ++W L G  K     + G L +K Y+  T+  +S  G+VI HEE WD+S
Sbjct: 119 IRATAPSTVVLRWRLEGSLK-----LAG-LKIKPYTGTTVYTLSDDGKVIRHEETWDIS 171


>gi|357111214|ref|XP_003557409.1| PREDICTED: uncharacterized protein LOC100837510 [Brachypodium
           distachyon]
          Length = 231

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 14/125 (11%)

Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRL-WATTALDNPSV---TVQEMVMLSTSVLS 181
           GN  LG +A    Y+DPT  F GR KY + L      LD+PS+   ++++ +   T  + 
Sbjct: 111 GNFTLGIYAEDCLYEDPTIKFRGRSKYAQNLDLLVPFLDSPSLELESIEKGLRAETKSIM 170

Query: 182 IKWTLRGK---PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFF 238
             WTLR     P   +  I G+    +  ++       +V+ H E WD+SA   +A+ F 
Sbjct: 171 ATWTLRTYLRLPWRPLIAIRGNTTYDLDEEY-------KVVRHAESWDVSALEAIAQLFV 223

Query: 239 WASRR 243
            A ++
Sbjct: 224 SAPKQ 228


>gi|428779817|ref|YP_007171603.1| hypothetical protein Dacsa_1567 [Dactylococcopsis salina PCC 8305]
 gi|428694096|gb|AFZ50246.1| hypothetical protein Dacsa_1567 [Dactylococcopsis salina PCC 8305]
          Length = 146

 Score = 44.7 bits (104), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 57/126 (45%), Gaps = 15/126 (11%)

Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW-----ATTALD 163
           VES   QL  D+  L + + +   ++  + +KDP   F  +  Y+   W     A     
Sbjct: 12  VESAIEQLSHDLPNLFQKDISYKIYSSDIFFKDPVNLFQSKFNYRIIFWTLRFHARLFFT 71

Query: 164 NPSVTVQEMVMLSTSVLSIKWTLRGKPK-SIIANI--GGDLIVKVYSKFTLNQISGQVIE 220
                V ++  + T+++ + WT+RGK +    ANI   GD      S +TLN   G + +
Sbjct: 72  EIYFDVHDIQQIETNIIKVWWTVRGKLRVPWEANIFFNGD------STYTLNS-EGLISD 124

Query: 221 HEELWD 226
           H + WD
Sbjct: 125 HRDSWD 130


>gi|428775767|ref|YP_007167554.1| hypothetical protein PCC7418_1134 [Halothece sp. PCC 7418]
 gi|428690046|gb|AFZ43340.1| Protein of unknown function DUF2358 [Halothece sp. PCC 7418]
          Length = 170

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/137 (26%), Positives = 61/137 (44%), Gaps = 16/137 (11%)

Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW-----ATTALD 163
           VES   QL +D+  L + + +   ++ ++ +KDP   F G+  Y+   W     A     
Sbjct: 35  VESAIAQLWQDLPTLFKKDISYQIYSSNIFFKDPVNLFQGKLNYRIIFWTLRFHARLFFT 94

Query: 164 NPSVTVQEMVMLSTSVLSIKWTLRGKPK-SIIANI--GGDLIVKVYSKFTLNQISGQVIE 220
                V ++     +V+ + WT+RGK +    ANI   GD      S + LN+  G +  
Sbjct: 95  EIYFDVHDIQQTENNVIKVWWTVRGKLRVPWQANIFFNGD------STYKLNE-DGLIYH 147

Query: 221 HEELWDLSASSPVARAF 237
           H + WD  A   + + F
Sbjct: 148 HRDNWD-RAPKTILKQF 163


>gi|356526417|ref|XP_003531814.1| PREDICTED: uncharacterized protein LOC100797159 [Glycine max]
          Length = 268

 Score = 42.4 bits (98), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 9/91 (9%)

Query: 137 VKYKDPTRSFTGREKYKRRLWA-----TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPK 191
           + +KDP  +FTG EKYK   WA            ++ V  +   S +V+ I+W LRG P+
Sbjct: 111 ITFKDPLNTFTGIEKYKLIFWALRFHGKILFREIALDVYRIWQPSENVILIRWNLRGVPR 170

Query: 192 SIIANIGGDLIVKVYSKFTLNQISGQVIEHE 222
            +     G+   +  S++ L++ +G++ EH+
Sbjct: 171 -VPWEAKGEF--QGTSRYKLDR-NGKIYEHK 197


>gi|242043308|ref|XP_002459525.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
 gi|241922902|gb|EER96046.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
          Length = 245

 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTAL-DNPSV---TVQEMVMLSTSVLS 181
           G+   G +     ++DPT  F G  +Y + L+      D+PS+   ++++ + + T  + 
Sbjct: 124 GDFTPGIYTADCLFEDPTIKFRGLSRYSQNLYLLVPFFDSPSLELESIEKGLRVETKFVK 183

Query: 182 IKWTLRGK---PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFF 238
             W LR     P   +  I G+      + + LN+   +VI H E WD+SA   +A+ F 
Sbjct: 184 ATWKLRTYLRLPWRPLIAIRGN------TTYDLNE-DFKVIRHSESWDVSAVEAIAQIFV 236

Query: 239 WASRRL 244
            A  +L
Sbjct: 237 SAPEQL 242


>gi|351720803|ref|NP_001235141.1| uncharacterized protein LOC100306700 [Glycine max]
 gi|255629317|gb|ACU15003.1| unknown [Glycine max]
          Length = 255

 Score = 41.2 bits (95), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 14/124 (11%)

Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRL-WATTALDNPSVTVQEM---VMLSTSVLS 181
           GN +   +A +  ++DPT  F GRE Y R L       D+ S+ +Q++   V   T+ + 
Sbjct: 112 GNFSSSIYAENCIFEDPTIKFRGRELYARNLKLLVPFFDSASIILQKIDKDVDSDTNFVL 171

Query: 182 IKWTLRGK---PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFF 238
             W LR     P   + +I G  +      + LN+   +++ H E W++SA   V + F 
Sbjct: 172 ASWKLRTNLKLPWRPLISIDGSTV------YELNE-DYKIVRHVESWNISAVEAVLQIFS 224

Query: 239 WASR 242
           + S+
Sbjct: 225 FKSK 228


>gi|302836648|ref|XP_002949884.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
           nagariensis]
 gi|300264793|gb|EFJ48987.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
           nagariensis]
          Length = 237

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 15/108 (13%)

Query: 139 YKDPTRSFTGREKYKRRLWATTA-LDNPSVTVQEMVMLS-----TSVLSIKWTLR---GK 189
           + DPT +F G + +KR L      LD P+V ++ +  L        V+  +W LR     
Sbjct: 128 FADPTVAFRGVDLWKRNLALLVPFLDQPAVQLKRVQRLGRDEQGAEVVRAEWRLRTFLRL 187

Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAF 237
           P   + +I G       +++TLN+ S +++ H E W +S +  + + F
Sbjct: 188 PWRPLIDIDG------ATEYTLNEESNRIVRHVESWGVSGTQALLQMF 229


>gi|222636639|gb|EEE66771.1| hypothetical protein OsJ_23496 [Oryza sativa Japonica Group]
          Length = 249

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 10/123 (8%)

Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRL-WATTALDNPSV---TVQEMVMLSTSVLS 181
           GN     +     ++DPT  F GR +Y + L       D+PS+    +++ + + T  + 
Sbjct: 128 GNFTPDIYTEDCLFEDPTIKFRGRSRYSQNLDLLVPFFDSPSLELENIEKGLRVETKFIM 187

Query: 182 IKWTLRGKPKSIIANIGGDLIVKVYSKFTLN-QISGQVIEHEELWDLSASSPVARAFFWA 240
             WTLR +    +     DL+V  +   +L  +   +V  H E WD+SA   + + F  A
Sbjct: 188 ATWTLRSRYSQNL-----DLLVPFFDSPSLELENIEKVTRHAESWDVSALEAIGQIFVPA 242

Query: 241 SRR 243
            ++
Sbjct: 243 PKQ 245


>gi|298204898|emb|CBI34205.3| unnamed protein product [Vitis vinifera]
          Length = 216

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 14/113 (12%)

Query: 139 YKDPTRSFTGREKYKRRL-WATTALDNPSVTVQEMVMLS---TSVLSIKWTLRGK---PK 191
           ++DPT  F G++ Y R L       D+PS+ +Q++   S   T  +   W LR     P 
Sbjct: 109 FEDPTIKFRGKDLYSRNLKLLVPFFDHPSIALQKIEKGSNSETKFVLASWKLRTYLKLPW 168

Query: 192 SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRL 244
             + +I G  +  +  +F       +++ H E W++SA   V + F  + +RL
Sbjct: 169 RPLISIAGSTVYDLNDEF-------KIVRHAESWNISALQAVGQIFTPSLQRL 214


>gi|159481098|ref|XP_001698619.1| SOUL3-like protein [Chlamydomonas reinhardtii]
 gi|158282359|gb|EDP08112.1| SOUL3-like protein [Chlamydomonas reinhardtii]
          Length = 232

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 15/108 (13%)

Query: 139 YKDPTRSFTGREKYKRRLWATTA-LDNPSVTVQEMVMLS-----TSVLSIKW---TLRGK 189
           + DPT +F G + +KR L   T  L+ P+V +  M  L        V+  +W   T+   
Sbjct: 123 FADPTVAFRGTDLWKRNLALLTPFLEAPNVQLYGMRQLGRDEDGAEVVRAEWRLTTILKL 182

Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAF 237
           P   + ++ G       +++TLN+ S +++ H E W +S +  V + F
Sbjct: 183 PWRPLIDLDG------ATEYTLNEESNRIVRHVEFWSISGTEAVLQMF 224


>gi|17228448|ref|NP_484996.1| hypothetical protein alr0953 [Nostoc sp. PCC 7120]
 gi|17130299|dbj|BAB72910.1| alr0953 [Nostoc sp. PCC 7120]
          Length = 139

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 29/134 (21%), Positives = 57/134 (42%), Gaps = 10/134 (7%)

Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWAT---TALDNP 165
           +E   + L +D+  L E + +   +   + ++DP   F G+  Y+   W       L  P
Sbjct: 7   IEQVIKTLQQDLPTLFEQDISYDIYTKDIYFQDPVNKFKGKFNYRIIFWTLRFHARLFFP 66

Query: 166 SV--TVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEE 223
            +   + E++ L    +  KWT+RG   ++       ++   YS + L Q +  + +H +
Sbjct: 67  EIYFDLHEVLQLDKDTILAKWTVRG---TLRVPWRSQMLFNGYSTYKLRQ-NNLIYQHID 122

Query: 224 LWDLSASSPVARAF 237
            WD      + R F
Sbjct: 123 TWD-RKPGEILRQF 135


>gi|225451265|ref|XP_002272524.1| PREDICTED: uncharacterized protein LOC100264176 [Vitis vinifera]
          Length = 242

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 14/113 (12%)

Query: 139 YKDPTRSFTGREKYKRRL-WATTALDNPSVTVQEMVMLS---TSVLSIKWTLRGK---PK 191
           ++DPT  F G++ Y R L       D+PS+ +Q++   S   T  +   W LR     P 
Sbjct: 135 FEDPTIKFRGKDLYSRNLKLLVPFFDHPSIALQKIEKGSNSETKFVLASWKLRTYLKLPW 194

Query: 192 SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRL 244
             + +I G  +  +  +F       +++ H E W++SA   V + F  + +RL
Sbjct: 195 RPLISIAGSTVYDLNDEF-------KIVRHAESWNISALQAVGQIFTPSLQRL 240


>gi|116792207|gb|ABK26275.1| unknown [Picea sitchensis]
          Length = 222

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 19/100 (19%)

Query: 138 KYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMV---MLSTSVLSIKWTLRGKPK--- 191
           +++DPT   TG EKY   + A   L NP+ + QE++   ++    +  KW L G  K   
Sbjct: 119 RFRDPTTDLTGLEKY---ISAVKFLFNPNTSKQELLSIAVVDPHTIEAKWRLEGYLKFPW 175

Query: 192 --SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSA 229
              I+   G        +++ L+   G ++ HEE WD+S 
Sbjct: 176 NPHILPYEGS-------TRYVLDD-RGLIMSHEETWDISV 207


>gi|395644905|ref|ZP_10432765.1| polymorphic outer membrane protein [Methanofollis liminatans DSM
           4140]
 gi|395441645|gb|EJG06402.1| polymorphic outer membrane protein [Methanofollis liminatans DSM
           4140]
          Length = 1009

 Score = 38.5 bits (88), Expect = 4.5,   Method: Composition-based stats.
 Identities = 21/76 (27%), Positives = 37/76 (48%)

Query: 119 DILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTS 178
           DI  LR  N ++G   V   +KD TR  T   +    L   T ++ P+V V +MV ++  
Sbjct: 441 DISNLRYSNLSVGNATVRASWKDQTRDITVSFRNYPYLSVKTRVEPPTVAVNDMVNVTVQ 500

Query: 179 VLSIKWTLRGKPKSII 194
           ++   + L  +P  ++
Sbjct: 501 LIGDGYALHSRPIDVV 516


>gi|75906754|ref|YP_321050.1| hypothetical protein Ava_0531 [Anabaena variabilis ATCC 29413]
 gi|75700479|gb|ABA20155.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 139

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 29/135 (21%), Positives = 54/135 (40%), Gaps = 10/135 (7%)

Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW-----ATTALD 163
           +E   + L +D+  L E + +   +   + ++DP   F G+  Y+   W     A     
Sbjct: 7   IEQVIKTLQQDLPTLFEQDISYDIYTPDIYFQDPVNKFKGKFNYRIIFWTLRFHAWLFFT 66

Query: 164 NPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEE 223
                + E+  L    +  KWT+RG+ +         +    YS + L Q +  + EH +
Sbjct: 67  EIYFDLHEVGQLDKDTILAKWTVRGRLR---VPWRSQIFFNGYSTYKLRQ-NNLIYEHID 122

Query: 224 LWDLSASSPVARAFF 238
            WD      + R F+
Sbjct: 123 TWD-RKPGEILRQFW 136


>gi|307108290|gb|EFN56530.1| hypothetical protein CHLNCDRAFT_144158 [Chlorella variabilis]
          Length = 172

 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 16/101 (15%)

Query: 139 YKDPTRSFTGREKYKRRLWATTA-LDNPSVTVQEMVML------STSVLSIKWTLR---G 188
           + DPT SF GR+ YKR L      L  P++ ++ +  L       ++ L  +W L     
Sbjct: 57  FADPTVSFRGRDLYKRNLALLVPFLWEPAIQLRSLRRLPAPAPGGSAQLFAEWRLSCWVR 116

Query: 189 KPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSA 229
            P +   +I G       + +TLN    Q++ H E WD+SA
Sbjct: 117 LPWAPYVDINGT------TTYTLNADGNQIVRHVEQWDVSA 151


>gi|431796645|ref|YP_007223549.1| acetate--CoA ligase [Echinicola vietnamensis DSM 17526]
 gi|430787410|gb|AGA77539.1| acetate--CoA ligase [Echinicola vietnamensis DSM 17526]
          Length = 632

 Score = 37.7 bits (86), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 11/107 (10%)

Query: 111 STARQLVRDIL-ELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTV 169
            T +  V+D++ E  E  + + T  V  + K   +   GR+     +W   A+D    T 
Sbjct: 178 GTKKIAVKDLVDEALEKTKTIETVIVYQRTKQEVKMVEGRD-----IWWHDAIDGQPDTN 232

Query: 170 QEMVMLSTSVLSIKWTL--RGKPKSIIANIGGDLIVKVYSKFTLNQI 214
           +  VM S  +L I +T    GKPK ++   GG +   VYSK+T   +
Sbjct: 233 EAEVMDSEDMLFILYTSGSTGKPKGVVHTTGGYM---VYSKYTFENV 276


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.130    0.387 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,996,878,416
Number of Sequences: 23463169
Number of extensions: 194916738
Number of successful extensions: 1155199
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2249
Number of HSP's successfully gapped in prelim test: 1068
Number of HSP's that attempted gapping in prelim test: 1108459
Number of HSP's gapped (non-prelim): 25139
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)