BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026283
         (240 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225454876|ref|XP_002278704.1| PREDICTED: uncharacterized protein LOC100247606 [Vitis vinifera]
          Length = 244

 Score =  335 bits (858), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 166/245 (67%), Positives = 190/245 (77%), Gaps = 6/245 (2%)

Query: 1   MATTLTWQPLQLQPKTTLH-----IGRRRAATVRAFRRSDFDRFARRMTSGEAWRDAWRT 55
           M T+L W PL      TL      +  R    VRAFRRSDFD FA+RM SG+AWRDAWR+
Sbjct: 1   MTTSLPWHPLFSSKPQTLRRFAAPVRHRLPMPVRAFRRSDFDGFAKRMASGDAWRDAWRS 60

Query: 56  ANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDF 115
           AN+GFE L+F+AKKTAERI+RQY+VSRR + A  +A   ARE+DREF I  RWR+  +DF
Sbjct: 61  ANDGFELLIFEAKKTAERINRQYAVSRRFSEAVGSAGDWAREVDREFEIGRRWRTVTLDF 120

Query: 116 SRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVA 175
            RNWPRYRKQLNDFL+TPLGRSFATIFFLWFALSGWLFR LI ATWVLP A PLLIGT A
Sbjct: 121 GRNWPRYRKQLNDFLDTPLGRSFATIFFLWFALSGWLFRFLIFATWVLPFAGPLLIGTFA 180

Query: 176 NNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDV 235
           NNFVIKG CPAC+R+FIG K+QI+RCAGCGNIVWQP+GD  SR   G   + S  +IIDV
Sbjct: 181 NNFVIKGNCPACRRQFIGYKNQIVRCAGCGNIVWQPKGD-VSRGSRGTPPSSSQSEIIDV 239

Query: 236 DFEEK 240
           +FEEK
Sbjct: 240 EFEEK 244


>gi|224078794|ref|XP_002305632.1| predicted protein [Populus trichocarpa]
 gi|222848596|gb|EEE86143.1| predicted protein [Populus trichocarpa]
          Length = 260

 Score =  325 bits (832), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 164/268 (61%), Positives = 196/268 (73%), Gaps = 37/268 (13%)

Query: 1   MATTLTWQPL-------QLQPKTTLHIGRRRAATVRAFR-------------RSDFDRFA 40
           MAT L+WQP           P + +  G   A   R  R             R DFDRFA
Sbjct: 1   MATALSWQPQLPTRIHKHSHPSSFIDAGTGYAFQCRIHRPISSIHSGKALAFRRDFDRFA 60

Query: 41  RRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDR 100
                    +DAWRTAN+GFEQ +F+AKKTAERIDR+YSVSRRL++ A++A+ RARE+DR
Sbjct: 61  ---------KDAWRTANDGFEQFLFEAKKTAERIDRRYSVSRRLSAVAQSASDRAREIDR 111

Query: 101 EFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILAT 160
           E+ IS+RWR+F MDF+RNWPRYRKQ+NDF NTP+GRSF TIFFLWFALSGW+FR+ I+AT
Sbjct: 112 EYDISLRWRTFSMDFNRNWPRYRKQINDFFNTPIGRSFVTIFFLWFALSGWMFRLFIIAT 171

Query: 161 WVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPE-------- 212
           WVLPIA PLLIGT+ANN VIKGACPACKR+F+G K+Q+IRC GCGNIVWQP+        
Sbjct: 172 WVLPIAGPLLIGTLANNLVIKGACPACKRQFVGYKNQVIRCGGCGNIVWQPKGGFPGRGP 231

Query: 213 GDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           GDFFS  G G +S+KSD +IIDVDFEEK
Sbjct: 232 GDFFSGGGRGTRSSKSDPEIIDVDFEEK 259


>gi|255557915|ref|XP_002519986.1| conserved hypothetical protein [Ricinus communis]
 gi|223540750|gb|EEF42310.1| conserved hypothetical protein [Ricinus communis]
          Length = 231

 Score =  324 bits (830), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 165/244 (67%), Positives = 193/244 (79%), Gaps = 17/244 (6%)

Query: 1   MATTLTWQ--PLQLQPKTTLHIGRRRAATVRAFRRSDFDRFARRMTSGEAWRDAWRTANN 58
           MATTL W+  P     K       R   TVRAF+R DFDR AR         +AWR+AN+
Sbjct: 1   MATTLPWRQPPFPTLLKKRCPAAFRHVGTVRAFQRGDFDRLAR---------NAWRSAND 51

Query: 59  GFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRN 118
           GFEQL+++A+K AERIDR+YSVSRR++  A++AA RARE+DRE  I VRWR+F +DFSRN
Sbjct: 52  GFEQLMYEARKAAERIDRRYSVSRRVSDVAQSAAERAREIDRELEIGVRWRTFTVDFSRN 111

Query: 119 WPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVANNF 178
           WPRYR+QLNDFL+TPLGR FATIFFLWFALSGWLFRILI ATWVLP AAPLLIGTVANN 
Sbjct: 112 WPRYRRQLNDFLDTPLGRGFATIFFLWFALSGWLFRILIFATWVLPFAAPLLIGTVANNL 171

Query: 179 VIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGK--KSTKSDDDIIDVD 236
           VIKGACPACKR+F+G KSQ+IRCAGCGNIVWQP+    SR+G G+   S+KSD +IIDV+
Sbjct: 172 VIKGACPACKRQFVGYKSQVIRCAGCGNIVWQPD----SRDGRGRGTSSSKSDTNIIDVE 227

Query: 237 FEEK 240
           FEEK
Sbjct: 228 FEEK 231


>gi|297737376|emb|CBI26577.3| unnamed protein product [Vitis vinifera]
          Length = 214

 Score =  323 bits (829), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 157/213 (73%), Positives = 179/213 (84%), Gaps = 1/213 (0%)

Query: 28  VRAFRRSDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSA 87
           VRAFRRSDFD FA+RM SG+AWRDAWR+AN+GFE L+F+AKKTAERI+RQY+VSRR + A
Sbjct: 3   VRAFRRSDFDGFAKRMASGDAWRDAWRSANDGFELLIFEAKKTAERINRQYAVSRRFSEA 62

Query: 88  ARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFA 147
             +A   ARE+DREF I  RWR+  +DF RNWPRYRKQLNDFL+TPLGRSFATIFFLWFA
Sbjct: 63  VGSAGDWAREVDREFEIGRRWRTVTLDFGRNWPRYRKQLNDFLDTPLGRSFATIFFLWFA 122

Query: 148 LSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNI 207
           LSGWLFR LI ATWVLP A PLLIGT ANNFVIKG CPAC+R+FIG K+QI+RCAGCGNI
Sbjct: 123 LSGWLFRFLIFATWVLPFAGPLLIGTFANNFVIKGNCPACRRQFIGYKNQIVRCAGCGNI 182

Query: 208 VWQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           VWQP+GD  SR   G   + S  +IIDV+FEEK
Sbjct: 183 VWQPKGD-VSRGSRGTPPSSSQSEIIDVEFEEK 214


>gi|297837839|ref|XP_002886801.1| hypothetical protein ARALYDRAFT_315473 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332642|gb|EFH63060.1| hypothetical protein ARALYDRAFT_315473 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 247

 Score =  314 bits (804), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/218 (72%), Positives = 179/218 (82%), Gaps = 5/218 (2%)

Query: 28  VRAFRRSDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSA 87
           V+AF+R DFDRFA  + SG+AWRDAWR+AN+GFEQ VF+AKKTAERIDRQY+VSRR +SA
Sbjct: 30  VKAFQRGDFDRFADNVKSGKAWRDAWRSANDGFEQFVFEAKKTAERIDRQYAVSRRFSSA 89

Query: 88  ARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFA 147
           A +A  RARE+DREF IS R RSF  D SRN+P+YRKQ + FLNTPLG SFATIFFLWFA
Sbjct: 90  ASSAGDRAREIDREFGISPRVRSFSADLSRNFPKYRKQFSAFLNTPLGGSFATIFFLWFA 149

Query: 148 LSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNI 207
           LSGWLFR++I+ATWVLPIA PLLIG VANNFVIKG CPACKR+FIG KSQIIRC GCGNI
Sbjct: 150 LSGWLFRVIIIATWVLPIAGPLLIGAVANNFVIKGECPACKRQFIGYKSQIIRCEGCGNI 209

Query: 208 VWQPEGDFFSRN-----GGGKKSTKSDDDIIDVDFEEK 240
           VWQP+GDFFS++       G         IIDVDFEEK
Sbjct: 210 VWQPQGDFFSKDGNNNNSKGNSKKPPKSQIIDVDFEEK 247


>gi|18406650|ref|NP_566028.1| uncharacterized protein [Arabidopsis thaliana]
 gi|13899075|gb|AAK48959.1|AF370532_1 Unknown protein [Arabidopsis thaliana]
 gi|18377520|gb|AAL66926.1| unknown protein [Arabidopsis thaliana]
 gi|20196922|gb|AAC31837.2| expressed protein [Arabidopsis thaliana]
 gi|330255384|gb|AEC10478.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 248

 Score =  293 bits (751), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 159/237 (67%), Positives = 186/237 (78%), Gaps = 12/237 (5%)

Query: 16  TTLHIGRRRAAT---VRAFRRSDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAE 72
           TT +   R A++   V+AF+R DFDR A  + SG+AWRDAWR+AN+GFEQ VF+AKKTAE
Sbjct: 12  TTPYPHPRAASSFLQVKAFQRGDFDRLADNVKSGKAWRDAWRSANDGFEQFVFEAKKTAE 71

Query: 73  RIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNT 132
           RIDRQY+VSRR +SAA +AA RARE+DREF I+ R R+   DFSRN+P+YRKQ + FLNT
Sbjct: 72  RIDRQYAVSRRFSSAASSAADRAREIDREFGITPRVRTVSADFSRNFPKYRKQFSAFLNT 131

Query: 133 PLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFI 192
           PLG SFATIFFLWFALSGWLFR++I+ATWVLPIA PLLIG VANNFVIKG CPACKR+FI
Sbjct: 132 PLGGSFATIFFLWFALSGWLFRVIIIATWVLPIAGPLLIGAVANNFVIKGECPACKRQFI 191

Query: 193 GSKSQIIRCAGCGNIVWQPEGDFFSRN---------GGGKKSTKSDDDIIDVDFEEK 240
           G K+QIIRC GCGNIVWQP+GDFFS++           G         IIDVDFEEK
Sbjct: 192 GYKNQIIRCEGCGNIVWQPQGDFFSKDGNNNNNNNNSKGNSKKPPKSQIIDVDFEEK 248


>gi|356531736|ref|XP_003534432.1| PREDICTED: uncharacterized protein LOC100785744 [Glycine max]
          Length = 240

 Score =  286 bits (732), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 153/247 (61%), Positives = 187/247 (75%), Gaps = 15/247 (6%)

Query: 1   MATTLTWQPLQLQPKTT-----LHIGRRRAATVRAFRRSDFDRFARRMTSGEAWRDAWRT 55
           MA T T  P++ QP  T          +R  TV+AF      R   R+     WR+AWR+
Sbjct: 2   MAATATTVPVRWQPPNTRFRIQPQPQMKRLMTVKAFGNGKGGRDMDRV-----WREAWRS 56

Query: 56  ANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDF 115
           AN+GFE+ VF+AKKTAERIDR+YS+SRRL++ AR AA RARE+DREF I  R+R+F MDF
Sbjct: 57  ANDGFERFVFEAKKTAERIDRRYSLSRRLSAVARAAADRAREIDREFEIGQRYRAFSMDF 116

Query: 116 SRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVA 175
            RNWP+YRKQLN+FL++P+G+SFAT+FF+WFALSGWLFRILI+ATWVLP A PLLIGT+A
Sbjct: 117 QRNWPKYRKQLNNFLDSPVGKSFATLFFIWFALSGWLFRILIIATWVLPFAGPLLIGTLA 176

Query: 176 NNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKS--TKSDDDII 233
           N  VIKG CPACK +F G K+QIIRC GCGNIVWQP+G     NGGG+ +  +KSD DII
Sbjct: 177 NTIVIKGTCPACKMQFGGYKNQIIRCTGCGNIVWQPKG---KGNGGGRTNSKSKSDPDII 233

Query: 234 DVDFEEK 240
           DVDFEE+
Sbjct: 234 DVDFEER 240


>gi|351727667|ref|NP_001235377.1| uncharacterized protein LOC100306460 [Glycine max]
 gi|255628613|gb|ACU14651.1| unknown [Glycine max]
          Length = 226

 Score =  278 bits (710), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 151/244 (61%), Positives = 179/244 (73%), Gaps = 23/244 (9%)

Query: 1   MATT--LTWQPLQLQPKTTLHIGRRRAATVRAFRRS--DFDRFARRMTSGEAWRDAWRTA 56
           MATT  + WQP    P     I  +RA  V+AF +   D DR          W +AWR+A
Sbjct: 2   MATTAPVRWQP----PNIRFRIEPKRAMRVKAFGKGGRDMDR---------VWSEAWRSA 48

Query: 57  NNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFS 116
           N+GFE+ VF+AKKTAERIDR+YS+SRRL+S AR AA RARE+DREF I  R+R+  +DF 
Sbjct: 49  NDGFERFVFEAKKTAERIDRRYSLSRRLSSVARAAADRAREIDREFEIGQRYRTLSIDFQ 108

Query: 117 RNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVAN 176
           RNWP+YRKQL DFL++PLG+SFAT+ F+W ALSGWLFRILI+ATWVLP A PLLIGT+AN
Sbjct: 109 RNWPKYRKQLKDFLDSPLGKSFATLLFIWLALSGWLFRILIIATWVLPFAGPLLIGTLAN 168

Query: 177 NFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDVD 236
             VIKG CPACK +F G K+QIIRC GCGNIVWQP+G   S      KS KSD DIIDVD
Sbjct: 169 TLVIKGTCPACKMQFGGYKNQIIRCTGCGNIVWQPKGRTNS------KSNKSDPDIIDVD 222

Query: 237 FEEK 240
           FEE+
Sbjct: 223 FEER 226


>gi|449451842|ref|XP_004143669.1| PREDICTED: uncharacterized protein LOC101219174 [Cucumis sativus]
          Length = 247

 Score =  271 bits (693), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 128/195 (65%), Positives = 158/195 (81%)

Query: 46  GEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAIS 105
           G+AW   WR AN+GFE+ VF+A+KTAER+DR+YSVSRR+ SAA++ A RARE+DREFAI 
Sbjct: 53  GDAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVGSAAQSVADRAREIDREFAIG 112

Query: 106 VRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPI 165
           +RWR+F +DFSRNWPRYR+QLN+F++TPLG+S  TIFFLWFALSGWLFR LI  TW+LP 
Sbjct: 113 MRWRNFTLDFSRNWPRYRRQLNEFIDTPLGKSVVTIFFLWFALSGWLFRFLIFGTWILPF 172

Query: 166 AAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKS 225
           A P+LIGT AN+ VIKG CPAC REF G K+QII CAGCGN+VWQP+      +  G   
Sbjct: 173 AGPILIGTFANSLVIKGNCPACNREFAGYKNQIISCAGCGNVVWQPKDHGEYNSRKGSSG 232

Query: 226 TKSDDDIIDVDFEEK 240
           +KS  ++IDV+FEEK
Sbjct: 233 SKSQPNVIDVEFEEK 247


>gi|414867827|tpg|DAA46384.1| TPA: hypothetical protein ZEAMMB73_929009 [Zea mays]
          Length = 256

 Score =  233 bits (594), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 135/248 (54%), Positives = 168/248 (67%), Gaps = 23/248 (9%)

Query: 1   MATTLTWQPLQLQPKTTLHIGRRR-----AATV---RAFRRSDFDRFARRMTSGEAWRDA 52
           MAT   W    LQP T   +  RR     AATV   RAF R+DFD F RR         A
Sbjct: 24  MATLARWH---LQPPTARPLAHRRRIPRRAATVVSPRAFSRADFDGFVRR---------A 71

Query: 53  WRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFR 112
           W+ AN G E+  F+A++ A+R+D ++S+SRRL  A+R A  RA E+D EF I  RWRSF 
Sbjct: 72  WQGANAGAERFAFEARQAAQRLDSRFSISRRLADASRAARARAVEIDAEFGIGRRWRSFS 131

Query: 113 MDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIG 172
           +DFSRNWPRYR+ +NDF+ TP+GR+ ATIFFLW ALSGWLFRI I  T+VLP AAPLL+G
Sbjct: 132 VDFSRNWPRYRRDINDFMGTPIGRALATIFFLWLALSGWLFRIFIFGTFVLPFAAPLLLG 191

Query: 173 TVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDI 232
           T AN   I+G CPACKR F+G ++Q+IRC  C NIVWQP     S +GG + S  S+ D+
Sbjct: 192 TFANRVAIEGTCPACKRRFVGYRNQVIRCMSCQNIVWQPNN---SPSGGARSSRSSEPDV 248

Query: 233 IDVDFEEK 240
           IDV++EEK
Sbjct: 249 IDVEYEEK 256


>gi|195640336|gb|ACG39636.1| hypothetical protein [Zea mays]
          Length = 232

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 132/247 (53%), Positives = 167/247 (67%), Gaps = 22/247 (8%)

Query: 1   MATTLTWQPLQLQPKTTLHIGRRR-------AATVRAFRRSDFDRFARRMTSGEAWRDAW 53
           MAT   W    LQP T   + RRR         + RAF R+DFD F RR         AW
Sbjct: 1   MATLARWH---LQPPTARPLARRRIPRRAATVVSPRAFSRADFDGFVRR---------AW 48

Query: 54  RTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRM 113
           + AN G E+  F+A++ A+R+D ++S+SRRL  A+R A  RA E+D EF I  RWRSF +
Sbjct: 49  QGANAGAERFAFEARQAAQRLDSRFSISRRLADASRAARARAVEIDAEFGIGRRWRSFSV 108

Query: 114 DFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGT 173
           DFSRNWPRYR+++NDF+ TP+GR+ ATIFFLW ALSGWLFRI I  T+VLP AAPLL+GT
Sbjct: 109 DFSRNWPRYRREINDFMETPIGRALATIFFLWLALSGWLFRIFIFGTFVLPFAAPLLLGT 168

Query: 174 VANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDII 233
            AN   I+G CPACKR F+G ++Q+IRC  C NIVWQP     S +GG + S  S+ D+I
Sbjct: 169 FANRVAIEGTCPACKRRFVGYRNQVIRCMSCQNIVWQPNN---SPSGGARSSRSSEPDVI 225

Query: 234 DVDFEEK 240
           DV++EEK
Sbjct: 226 DVEYEEK 232


>gi|19224987|gb|AAL86463.1|AC077693_2 unknown protein [Oryza sativa Japonica Group]
 gi|31433597|gb|AAP55089.1| expressed protein [Oryza sativa Japonica Group]
          Length = 237

 Score =  217 bits (552), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 125/222 (56%), Positives = 153/222 (68%), Gaps = 15/222 (6%)

Query: 22  RRRAATV--RAFRR-SDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQY 78
           R RAATV  RAF   +DFD F RR         AWR AN G E+L F+ ++TA+R+D ++
Sbjct: 28  RSRAATVSPRAFGGGADFDGFVRR---------AWRGANAGAERLAFEVRQTAQRLDGRF 78

Query: 79  SVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSF 138
           S+SRRL  AAR A  RA E+D E  I  RWRSF +DFSRNWPRYR++L DF+ TP+GR+F
Sbjct: 79  SISRRLAEAARAARARAVEIDAELGIGRRWRSFSVDFSRNWPRYRRELTDFMATPIGRAF 138

Query: 139 ATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQI 198
            TIFF+WFALSGWLFR+ I  T+VLP AAPLL+GT  N   I+G CPACKR F+G ++Q+
Sbjct: 139 TTIFFIWFALSGWLFRVFIFGTFVLPFAAPLLLGTFGNRVAIEGTCPACKRRFVGYRNQV 198

Query: 199 IRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           IRC  C NIVWQP      R      S  S  D+IDV+FEEK
Sbjct: 199 IRCMNCQNIVWQPNNRSSGRA---SSSRSSGPDVIDVEFEEK 237


>gi|357147497|ref|XP_003574366.1| PREDICTED: uncharacterized protein LOC100831966 [Brachypodium
           distachyon]
          Length = 246

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 151/210 (71%), Gaps = 12/210 (5%)

Query: 31  FRRSDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAART 90
           F R+D D   RR         AWR AN G E+  F+A++ A+R+D +YS+SRR+  AAR 
Sbjct: 49  FSRADLDGLLRR---------AWRGANAGAERFSFEARQAAQRLDGRYSISRRVAEAARA 99

Query: 91  AAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSG 150
           A  RA E+D E  +  RWRSF +DFSRNWPRYR++L DFL+TP+GR+ +T+FF+WFALSG
Sbjct: 100 ARERAAEIDAELGVGRRWRSFSVDFSRNWPRYRRELTDFLSTPIGRALSTLFFVWFALSG 159

Query: 151 WLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQ 210
           WLFR+ I +T+VLP AAPLL+GT AN   I+G CPACKR F+G ++Q+IRC  C NIVWQ
Sbjct: 160 WLFRVFIFSTFVLPFAAPLLLGTFANRVAIEGTCPACKRRFVGYRNQVIRCMNCQNIVWQ 219

Query: 211 PEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           P     S +GG  +S  S+ DIIDV+FEEK
Sbjct: 220 PSS---SSSGGAGRSRNSEPDIIDVEFEEK 246


>gi|242040137|ref|XP_002467463.1| hypothetical protein SORBIDRAFT_01g028530 [Sorghum bicolor]
 gi|241921317|gb|EER94461.1| hypothetical protein SORBIDRAFT_01g028530 [Sorghum bicolor]
          Length = 244

 Score =  213 bits (543), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 116/207 (56%), Positives = 149/207 (71%), Gaps = 12/207 (5%)

Query: 34  SDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAV 93
            DFD F RR         AWR+AN G E+L F+A++ A+R+D ++S+SRRL  A+R A  
Sbjct: 50  GDFDGFVRR---------AWRSANAGAERLAFEARQAAKRLDGRFSISRRLAEASRAARA 100

Query: 94  RARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLF 153
           RA E+D E  I  RWRSF +DFSRNWPRYR+++NDF+ TP+GR+ ATIFFLW ALSGWLF
Sbjct: 101 RAVEIDAELGIGRRWRSFSVDFSRNWPRYRREINDFMETPIGRALATIFFLWLALSGWLF 160

Query: 154 RILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEG 213
           RI I  T+VLP A PLL+G +AN   I+G CPACKR F+G ++Q+IRC  C NIVWQP  
Sbjct: 161 RIFIFGTFVLPFAVPLLLGPLANRVAIEGTCPACKRRFVGYRNQVIRCMNCQNIVWQPNN 220

Query: 214 DFFSRNGGGKKSTKSDDDIIDVDFEEK 240
              S +GG + S  S+ D+IDV++EEK
Sbjct: 221 ---SSSGGTRSSRSSEPDVIDVEYEEK 244


>gi|326487888|dbj|BAJ89783.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326494256|dbj|BAJ90397.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326505712|dbj|BAJ95527.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 249

 Score =  207 bits (528), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 116/221 (52%), Positives = 151/221 (68%), Gaps = 15/221 (6%)

Query: 23  RRAATVRA---FRRSDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYS 79
           RRA+TV     F R+D D   +R         AWR AN G E+L F+A++ A+R+D +YS
Sbjct: 41  RRASTVSPRAFFGRADLDGLLQR---------AWRGANAGAERLSFEARQAAQRLDGRYS 91

Query: 80  VSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFA 139
           +SRR+  AAR A  RA E+D E  +  RWRSF +DFSR+WPRYR++L+DFL TP+GR+ +
Sbjct: 92  ISRRVAEAARAARERAAEIDAELGVGRRWRSFSVDFSRSWPRYRRELSDFLATPIGRALS 151

Query: 140 TIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQII 199
           T+ F+W ALSGWLFRI I +T+VLP AAPLL+GT AN   I+G+CPACKR F+G ++Q+I
Sbjct: 152 TLTFVWLALSGWLFRIFIFSTFVLPFAAPLLLGTFANRVAIEGSCPACKRRFVGYRNQVI 211

Query: 200 RCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           RC  C NIVWQP             S  S  D+IDV+FEEK
Sbjct: 212 RCMNCQNIVWQPNSRSSGGG---GGSRSSASDVIDVEFEEK 249


>gi|302810020|ref|XP_002986702.1| hypothetical protein SELMODRAFT_26406 [Selaginella moellendorffii]
 gi|300145590|gb|EFJ12265.1| hypothetical protein SELMODRAFT_26406 [Selaginella moellendorffii]
          Length = 184

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 81/185 (43%), Positives = 113/185 (61%), Gaps = 6/185 (3%)

Query: 54  RTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRM 113
           R+AN   E + F+ ++ A   +R+Y  + R          +    D    I  R R F +
Sbjct: 6   RSANQRMEDMGFEVRRNANEWNRKYRFTHRGAEFVDYVKDQLHAADSRLRIRQRARMFGV 65

Query: 114 DFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGT 173
           D    WP YR++++ FL+TPLG+  +TI F W  LSGW+F++ +++ WVLPIAAPLLIGT
Sbjct: 66  DARMKWPTYRRRVSQFLDTPLGKLVSTIMFAWLVLSGWIFKVFVISMWVLPIAAPLLIGT 125

Query: 174 VANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDII 233
           +A N +++GACP CK +FIGS+ Q+I C GC N+VWQP  DF S  G G      D DII
Sbjct: 126 IARNTIVEGACPNCKTQFIGSRGQVIVCRGCRNVVWQPRDDFSSSKGTG------DPDII 179

Query: 234 DVDFE 238
           D+D E
Sbjct: 180 DIDVE 184


>gi|148907934|gb|ABR17087.1| unknown [Picea sitchensis]
          Length = 120

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 81/128 (63%), Positives = 94/128 (73%), Gaps = 8/128 (6%)

Query: 113 MDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIG 172
           MDF  NWPRY+++LN FLNTPLGRS AT+ FLWFALSGWLFRI+I  TWVLP AAP LIG
Sbjct: 1   MDFRMNWPRYKRELNQFLNTPLGRSVATLVFLWFALSGWLFRIIIFGTWVLPFAAPFLIG 60

Query: 173 TVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDI 232
            VA N VI+GACPACKR FIG ++QII+C  C  +VWQP  DF            ++ DI
Sbjct: 61  AVARNTVIEGACPACKRRFIGKRNQIIQCMNCRAVVWQPREDF--------SKGSANPDI 112

Query: 233 IDVDFEEK 240
           ID+D EEK
Sbjct: 113 IDIDIEEK 120


>gi|414867828|tpg|DAA46385.1| TPA: hypothetical protein ZEAMMB73_929009 [Zea mays]
          Length = 206

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/189 (54%), Positives = 126/189 (66%), Gaps = 20/189 (10%)

Query: 1   MATTLTWQPLQLQPKTTLHIGRRR-----AATV---RAFRRSDFDRFARRMTSGEAWRDA 52
           MAT   W    LQP T   +  RR     AATV   RAF R+DFD F RR         A
Sbjct: 24  MATLARWH---LQPPTARPLAHRRRIPRRAATVVSPRAFSRADFDGFVRR---------A 71

Query: 53  WRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFR 112
           W+ AN G E+  F+A++ A+R+D ++S+SRRL  A+R A  RA E+D EF I  RWRSF 
Sbjct: 72  WQGANAGAERFAFEARQAAQRLDSRFSISRRLADASRAARARAVEIDAEFGIGRRWRSFS 131

Query: 113 MDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIG 172
           +DFSRNWPRYR+ +NDF+ TP+GR+ ATIFFLW ALSGWLFRI I  T+VLP AAPLL+G
Sbjct: 132 VDFSRNWPRYRRDINDFMGTPIGRALATIFFLWLALSGWLFRIFIFGTFVLPFAAPLLLG 191

Query: 173 TVANNFVIK 181
           T AN   I+
Sbjct: 192 TFANRVAIE 200


>gi|212723572|ref|NP_001132702.1| uncharacterized protein LOC100194183 [Zea mays]
 gi|194695140|gb|ACF81654.1| unknown [Zea mays]
          Length = 183

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 103/189 (54%), Positives = 126/189 (66%), Gaps = 20/189 (10%)

Query: 1   MATTLTWQPLQLQPKTTLHIGRRR-----AATV---RAFRRSDFDRFARRMTSGEAWRDA 52
           MAT   W    LQP T   +  RR     AATV   RAF R+DFD F RR         A
Sbjct: 1   MATLARWH---LQPPTARPLAHRRRIPRRAATVVSPRAFSRADFDGFVRR---------A 48

Query: 53  WRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFR 112
           W+ AN G E+  F+A++ A+R+D ++S+SRRL  A+R A  RA E+D EF I  RWRSF 
Sbjct: 49  WQGANAGAERFAFEARQAAQRLDSRFSISRRLADASRAARARAVEIDAEFGIGRRWRSFS 108

Query: 113 MDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIG 172
           +DFSRNWPRYR+ +NDF+ TP+GR+ ATIFFLW ALSGWLFRI I  T+VLP AAPLL+G
Sbjct: 109 VDFSRNWPRYRRDINDFMGTPIGRALATIFFLWLALSGWLFRIFIFGTFVLPFAAPLLLG 168

Query: 173 TVANNFVIK 181
           T AN   I+
Sbjct: 169 TFANRVAIE 177


>gi|168012725|ref|XP_001759052.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689751|gb|EDQ76121.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 189

 Score =  163 bits (413), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 77/189 (40%), Positives = 111/189 (58%), Gaps = 2/189 (1%)

Query: 50  RDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWR 109
           R A +T N  FE+  +D  K   + DRQY V+ +       A  +A+ +DR+F I  + R
Sbjct: 1   RRAVKTVNLKFEEFAYDFNKQLGKWDRQYRVTEKAQQVKEYATEQAQNVDRQFGIRQKAR 60

Query: 110 SFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPL 169
           +   DF   +P YR+Q   FL TP G++  ++F +WF +SGW FRI   + W LP A P 
Sbjct: 61  NATTDFRLKFPTYRRQFTTFLETPFGKTVVSLFLVWFIVSGWAFRIFFFSLWFLPFAGPF 120

Query: 170 LIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSD 229
           L+GT++   V++GACP C   ++G +SQ++ C  C  +VWQP  D FSR G  K+  KSD
Sbjct: 121 LLGTISKAAVVEGACPNCGARYVGGRSQVVTCQRCRGVVWQPRQD-FSREGFSKED-KSD 178

Query: 230 DDIIDVDFE 238
             IIDVD +
Sbjct: 179 PTIIDVDID 187


>gi|222613307|gb|EEE51439.1| hypothetical protein OsJ_32525 [Oryza sativa Japonica Group]
          Length = 293

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 122/280 (43%), Positives = 147/280 (52%), Gaps = 75/280 (26%)

Query: 22  RRRAATV--RAFRR-SDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQY 78
           R RAATV  RAF   +DFD F RR         AWR AN G E+L F+ ++TA+R+D ++
Sbjct: 28  RSRAATVSPRAFGGGADFDGFVRR---------AWRGANAGAERLAFEVRQTAQRLDGRF 78

Query: 79  SVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSR----------------NWPR- 121
           S+SRRL  AAR A  RA E+D E  I  RWRSF +DFSR                NW   
Sbjct: 79  SISRRLAEAARAARARAVEIDAELGIGRRWRSFSVDFSRNWPRYRRELTDFMATPNWESF 138

Query: 122 -----------------YRKQLN-----------------------DF-LNTPLGRSFAT 140
                            Y   LN                       DF +NT LG+   T
Sbjct: 139 HFCVREKKYMSCLSEKIYYMPLNSQALSTAMPRTVNWHLHLTGGCHDFGVNTLLGK--MT 196

Query: 141 IFFLWFALSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIR 200
           IFF+WFALSGWLFR+ I  T+VLP AAPLL+GT  N   I+G CPACKR F+G ++Q+IR
Sbjct: 197 IFFIWFALSGWLFRVFIFGTFVLPFAAPLLLGTFGNRVAIEGTCPACKRRFVGYRNQVIR 256

Query: 201 CAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           C  C NIVWQP      R      S  S  D+IDV+FEEK
Sbjct: 257 CMNCQNIVWQPNNRSSGRA---SSSRSSGPDVIDVEFEEK 293


>gi|125533014|gb|EAY79579.1| hypothetical protein OsI_34714 [Oryza sativa Indica Group]
          Length = 243

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/226 (46%), Positives = 136/226 (60%), Gaps = 20/226 (8%)

Query: 22  RRRAATV--RAFRR-SDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQY 78
           R RAATV  RAF   +DFD F RR         AWR AN G E+L F+ ++TA+R+D ++
Sbjct: 31  RSRAATVSPRAFGGGADFDGFVRR---------AWRGANAGAERLAFEVRQTAQRLDGRF 81

Query: 79  SVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSF 138
           S+SRRL  AAR A  RA E+D E  I  RWRSF +DFSRNWPRYR++L DF+ TP+GR+F
Sbjct: 82  SISRRLAEAARAARARAVEIDAELGIGRRWRSFSVDFSRNWPRYRRELTDFMATPIGRAF 141

Query: 139 ATIFFLWFALSGWLFRILILAT---WVLP-IAAPLLIGTVANNFVIKGACPACKREFIGS 194
               +  F L   + R+ I +    ++ P        GT  N   I+G CPACKR F+G 
Sbjct: 142 TVSAYDLFHLVCTV-RMAIQSFHFWYICPAFCCATSPGTFGNRVAIEGTCPACKRRFVGY 200

Query: 195 KSQIIRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           ++Q+IRC  C NIVWQP      R      S  S  D+IDV+FEEK
Sbjct: 201 RNQVIRCMNCQNIVWQPNNRSSGRA---SSSRSSGPDVIDVEFEEK 243


>gi|147844968|emb|CAN83327.1| hypothetical protein VITISV_034014 [Vitis vinifera]
          Length = 139

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 74/126 (58%), Positives = 89/126 (70%), Gaps = 5/126 (3%)

Query: 1   MATTLTWQPLQLQPKTTLH-----IGRRRAATVRAFRRSDFDRFARRMTSGEAWRDAWRT 55
           M T+L W PL      TL      +  R    VRAFRRSDFD FA+RM SG+AWRDAWR+
Sbjct: 1   MXTSLPWHPLFSSKPQTLRXFAAPVRHRLPMPVRAFRRSDFDGFAKRMASGDAWRDAWRS 60

Query: 56  ANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDF 115
           AN+GFE L+F+AKKTAERIBRQY+VSRR + A  +A   ARE+DREF I  RWR+  +DF
Sbjct: 61  ANDGFELLIFEAKKTAERIBRQYAVSRRFSEAVGSAGDWAREVDREFEIGRRWRTVXLDF 120

Query: 116 SRNWPR 121
            RNWPR
Sbjct: 121 XRNWPR 126


>gi|449532547|ref|XP_004173242.1| PREDICTED: uncharacterized protein LOC101229502, partial [Cucumis
           sativus]
          Length = 146

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 63/92 (68%), Positives = 82/92 (89%)

Query: 46  GEAWRDAWRTANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAIS 105
           G+AW   WR AN+GFE+ VF+A+KTAER+DR+YSVSRR+ SAA++ A RARE+DREFAI 
Sbjct: 53  GDAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVGSAAQSVADRAREIDREFAIG 112

Query: 106 VRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRS 137
           +RWR+F +DFSRNWPRYR+QLN+F++TPLG+S
Sbjct: 113 MRWRNFTLDFSRNWPRYRRQLNEFIDTPLGKS 144


>gi|115483548|ref|NP_001065444.1| Os10g0569000 [Oryza sativa Japonica Group]
 gi|113639976|dbj|BAF27281.1| Os10g0569000 [Oryza sativa Japonica Group]
          Length = 195

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/222 (43%), Positives = 120/222 (54%), Gaps = 57/222 (25%)

Query: 22  RRRAATV--RAFRR-SDFDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERIDRQY 78
           R RAATV  RAF   +DFD F RR         AWR AN G E+L F+ ++TA+R+D ++
Sbjct: 28  RSRAATVSPRAFGGGADFDGFVRR---------AWRGANAGAERLAFEVRQTAQRLDGRF 78

Query: 79  SVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSF 138
           S+SRRL  AAR A  RA E+D E  I  RWRSF +DFSRNWPRYR++L DF+ TP+GR+F
Sbjct: 79  SISRRLAEAARAARARAVEIDAELGIGRRWRSFSVDFSRNWPRYRRELTDFMATPIGRAF 138

Query: 139 ATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQI 198
                                                      G CPACKR F+G ++Q+
Sbjct: 139 T------------------------------------------GTCPACKRRFVGYRNQV 156

Query: 199 IRCAGCGNIVWQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           IRC  C NIVWQP      R      S  S  D+IDV+FEEK
Sbjct: 157 IRCMNCQNIVWQPNNRSSGRA---SSSRSSGPDVIDVEFEEK 195


>gi|125533019|gb|EAY79584.1| hypothetical protein OsI_34720 [Oryza sativa Indica Group]
          Length = 92

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 50/92 (54%), Positives = 60/92 (65%), Gaps = 3/92 (3%)

Query: 149 SGWLFRILILATWVLPIAAPLLIGTVANNFVIKGACPACKREFIGSKSQIIRCAGCGNIV 208
            GWLFR+ I  T+VLP AAPLL+GT  N   I+G CPACKR F+G ++Q+IRC  C NIV
Sbjct: 4   EGWLFRVFIFGTFVLPFAAPLLLGTFGNRVAIEGTCPACKRRFVGYRNQVIRCMNCQNIV 63

Query: 209 WQPEGDFFSRNGGGKKSTKSDDDIIDVDFEEK 240
           WQP      R      S  S  D+IDV+FEEK
Sbjct: 64  WQPNNRSSGRA---SSSRSSGPDVIDVEFEEK 92


>gi|159471968|ref|XP_001694128.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277295|gb|EDP03064.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 181

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 72/141 (51%), Gaps = 10/141 (7%)

Query: 6   TWQPLQLQPKT--TLHIGRRRAATVRAF--------RRSDFDRFARRMTSGEAWRDAWRT 55
           T+ PL  +P++      GR   +T  +F         +S ++ FAR     +    A + 
Sbjct: 24  TFHPLLAKPRSIACASSGRPSTSTSGSFDFDEAARTAKSKWESFAREQELEKKANQAMKK 83

Query: 56  ANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDF 115
               FEQ    A++   R+D +Y +S +   AA+ A   AR++D+++ +  R+R+ R D 
Sbjct: 84  GGEVFEQAKDKARRVYVRLDSEYDISSKTAKAAKKAGEAARDIDQQYGLRRRFRAAREDL 143

Query: 116 SRNWPRYRKQLNDFLNTPLGR 136
           +R WP ++KQL++F  T  G+
Sbjct: 144 ARKWPGWQKQLDEFTQTTPGK 164


>gi|307106743|gb|EFN54988.1| expressed protein [Chlorella variabilis]
          Length = 274

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 78/175 (44%), Gaps = 17/175 (9%)

Query: 73  RIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNT 132
           ++D +Y V  +   AART    A ELD +F    R R F  D  R  P   ++L+DF  T
Sbjct: 104 KVDEEYDVLGKAAGAARTVNDTAEELDSKFQFRRRLRIFLTDARRMLPLLGRRLHDFSGT 163

Query: 133 PLGRSFATIFFLWFALSGW------LFRILILATWVLPIAAPLLIGTVANNFVIKGACPA 186
           P+G   A  FF  F +S +      + R+L    W+  + +P+LI  +  N  +      
Sbjct: 164 PMG---AITFFFAFLVSVFTGAFWVILRLLFSLMWIGILLSPMLINYI--NARMMAEQQE 218

Query: 187 CKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRN--GGGKKSTKSDDDIIDVDFEE 239
             +E++  + +       G I+     + F+R    G +K+     D+IDV  E 
Sbjct: 219 RYKEYVEEQRRREANPFRGTIL----EEVFARGPRSGRRKAQFDSQDVIDVSAEH 269


>gi|302849668|ref|XP_002956363.1| hypothetical protein VOLCADRAFT_97368 [Volvox carteri f.
           nagariensis]
 gi|300258269|gb|EFJ42507.1| hypothetical protein VOLCADRAFT_97368 [Volvox carteri f.
           nagariensis]
          Length = 300

 Score = 46.2 bits (108), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 72/137 (52%), Gaps = 9/137 (6%)

Query: 24  RAATVRAFRRSD--FDRFARRMTSGEAWRDAWRTANNGFEQLVFDAKKTAERI----DRQ 77
           + A  RAF+++   +D FA+     +   +A+  A+    +    AK+ A R+    D +
Sbjct: 62  KEAASRAFKKAKGRWDAFAKEQRLEQRANEAFNKASESAREATDKAKEKARRVFVELDSE 121

Query: 78  YSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRS 137
           Y ++ +   A R A   AR++D+ + +  R R+     +R+WP +++QL++F +T  G++
Sbjct: 122 YELTSKSARAVRRAEEMARDIDQTYGVRRRIRAASDYIARSWPTWQRQLDEFSSTWYGKT 181

Query: 138 FATIFF-LWFALSGWLF 153
             T+F  L   LS  LF
Sbjct: 182 --TVFVALCLLLSTPLF 196


>gi|308799291|ref|XP_003074426.1| unnamed protein product [Ostreococcus tauri]
 gi|116000597|emb|CAL50277.1| unnamed protein product [Ostreococcus tauri]
          Length = 272

 Score = 43.1 bits (100), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 33/148 (22%), Positives = 63/148 (42%), Gaps = 10/148 (6%)

Query: 67  AKKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQL 126
           A++TA R++ +Y    +  S    A    +  D +   + +W      F +N P+   + 
Sbjct: 91  ARETANRLNAKYGAGDKAKSVLDDARRMVKNADTKIG-ATKW------FKKNVPKVMDKY 143

Query: 127 NDFLNTPLGRSFATIFFLWFALSG--W-LFRILILATWVLPIAAPLLIGTVANNFVIKGA 183
            +   TP+G+    +F++W   SG  W LF   ++AT+++ +  P LI     N   +  
Sbjct: 144 AEVRATPVGKFANFMFYIWLFTSGVFWNLFYFGLIATFLVNLLYPSLITDQLENMQRRAQ 203

Query: 184 CPACKREFIGSKSQIIRCAGCGNIVWQP 211
                   +G      R +G G + + P
Sbjct: 204 ERVNAMGGMGGTGYDPRMSGMGGMGYDP 231


>gi|170077444|ref|YP_001734082.1| hypothetical protein SYNPCC7002_A0822 [Synechococcus sp. PCC 7002]
 gi|169885113|gb|ACA98826.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
          Length = 143

 Score = 41.2 bits (95), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 12/106 (11%)

Query: 115 FSRNWPRYRKQLNDFLNTPLGRSFATIFFLWFALSGWLFRILILATWVLPIAAPLLIGTV 174
           FSRN       L  FL   L  +  T   L + ++G++  IL++     P+A  L  G V
Sbjct: 5   FSRNIQFSSSGLGCFLTFILIAALLTSVGLGWVVNGFIILILLI-----PVAIAL--GVV 57

Query: 175 ANNF-----VIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDF 215
              +     +++  CP C  EF G  + + RC  CG +V   E  F
Sbjct: 58  GFQWWLKRNLVEADCPVCGYEFTGLNNSMARCPSCGEVVQITEKKF 103


>gi|428211932|ref|YP_007085076.1| hypothetical protein Oscil6304_1451 [Oscillatoria acuminata PCC
           6304]
 gi|428000313|gb|AFY81156.1| hypothetical protein Oscil6304_1451 [Oscillatoria acuminata PCC
           6304]
          Length = 128

 Score = 38.5 bits (88), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 13/92 (14%)

Query: 136 RSFATIFFLWFALS---GWLFR---ILILATWVLPIAAPLLIGT---VANNFVIKGACPA 186
           R    +F +W  +S   GW+ +   ILI+A  +LP+ A    G    ++ N ++   CP 
Sbjct: 18  RWLMPVFIVWVLISIGLGWVVKSILILIVAITLLPVVA--FFGLRWWLSRNLIVD-HCPV 74

Query: 187 CKREFIGSKSQIIRCAGCGNIVWQPEGDFFSR 218
           C  E  G      +C  CG  +   EGD F R
Sbjct: 75  CSYELTGLNQTQFQCPSCGEPL-MGEGDHFVR 105


>gi|384245199|gb|EIE18694.1| hypothetical protein COCSUDRAFT_60004 [Coccomyxa subellipsoidea
           C-169]
          Length = 210

 Score = 37.7 bits (86), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 48/97 (49%), Gaps = 1/97 (1%)

Query: 87  AARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLNDFLNTPLGRSFATIFFLWF 146
           A++T   +A  +D++F +  + R+   D  R  P +++Q   F  T LG+      F + 
Sbjct: 56  ASKTVKEQAGAVDQKFGVKRKLRNLADDAIRKLPMWKRQYAKFSKTTLGQVVFFAAFFFL 115

Query: 147 ALSGWLFRILILATWVLPIAAPLLIGTVANNFVIKGA 183
             SG ++R+L L  ++L   APL +    N+   K A
Sbjct: 116 CYSGLVWRLLNL-VFILWWVAPLFVLPAINHINKKAA 151


>gi|115361107|ref|YP_778244.1| transmembrane protein [Burkholderia ambifaria AMMD]
 gi|115286435|gb|ABI91910.1| putative transmembrane protein [Burkholderia ambifaria AMMD]
          Length = 739

 Score = 37.7 bits (86), Expect = 4.2,   Method: Composition-based stats.
 Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 10/79 (12%)

Query: 33  RSDFDRFARRM-TSGEAWRDAWRT-ANNGFEQLVFDAKKTAERIDRQYSVSRRLNSAART 90
           ++DFDR A+++ T+   WRD W T   N   QL +  +  A        V  R  + A+ 
Sbjct: 80  QADFDRIAQKVATAASPWRDGWNTLIANSHAQLTWSPRPQA--------VVTRGGTGAQN 131

Query: 91  AAVRARELDREFAISVRWR 109
             V  R++   +A ++RW+
Sbjct: 132 YPVLYRDIAAAYACALRWK 150


>gi|145341602|ref|XP_001415895.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576118|gb|ABO94187.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 274

 Score = 37.4 bits (85), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 26/114 (22%), Positives = 51/114 (44%), Gaps = 10/114 (8%)

Query: 68  KKTAERIDRQYSVSRRLNSAARTAAVRARELDREFAISVRWRSFRMDFSRNWPRYRKQLN 127
           ++ A R++ +Y    +  S    A  + +  D +  ++ +W      F  N P+   Q  
Sbjct: 98  REKANRVNAKYGTGNKAKSMLEDARRKVKSADDKMGVT-KW------FKTNVPKAMDQYA 150

Query: 128 DFLNTPLGRSFATIFFLWFALSGWLFRIL---ILATWVLPIAAPLLIGTVANNF 178
              NTP+G+    +F++W   SG  + +L   + AT+++ +  P LI     N 
Sbjct: 151 KVRNTPVGQFANFMFWIWLFTSGIFWNLLYFGLTATFLVNMFMPSLITDQVQNM 204


>gi|452821550|gb|EME28579.1| hypothetical protein Gasu_39560 [Galdieria sulphuraria]
          Length = 201

 Score = 36.6 bits (83), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 29/126 (23%), Positives = 51/126 (40%), Gaps = 10/126 (7%)

Query: 124 KQLNDFLNTPLGRSFATIFFLWFALSG---WLF---RILILATWVLPIAAPLLIGTVANN 177
           K+   F  +  G+        +  LSG   W+F    I      ++PI   L+       
Sbjct: 72  KKTVQFFQSENGQFILFGLVCYLVLSGKISWIFDSFLIFFSVISIIPIIGLLVFRWWVTQ 131

Query: 178 FVIKGACPACKREFIGSKSQIIRCAGCGNIVWQPEGDFFSRNGG----GKKSTKSDDDII 233
            V++  CP C    +G K + + C+ CG ++   +    +++      G K  +S   +I
Sbjct: 132 NVVQVFCPNCSSPVVGIKGREVPCSNCGYLLTVDDQVIIAKDSSNRKDGNKKKQSTISVI 191

Query: 234 DVDFEE 239
           DVD  E
Sbjct: 192 DVDARE 197


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.326    0.137    0.439 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,706,098,518
Number of Sequences: 23463169
Number of extensions: 144440564
Number of successful extensions: 499552
Number of sequences better than 100.0: 47
Number of HSP's better than 100.0 without gapping: 31
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 499486
Number of HSP's gapped (non-prelim): 51
length of query: 240
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 102
effective length of database: 9,121,278,045
effective search space: 930370360590
effective search space used: 930370360590
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 75 (33.5 bits)