BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028089
         (214 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224130118|ref|XP_002328658.1| predicted protein [Populus trichocarpa]
 gi|222838834|gb|EEE77185.1| predicted protein [Populus trichocarpa]
          Length = 206

 Score =  237 bits (604), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 132/221 (59%), Positives = 156/221 (70%), Gaps = 22/221 (9%)

Query: 1   MSLTIANTAGSGSGSRILYTNPTTDPQHRRHGITFPTSTCSSR----ITRAPFRTLHVVS 56
           MSL IAN  G+G G RILYTNPT     R H  T   +TC  +    + R+P   LH+VS
Sbjct: 1   MSLFIAN--GNGGGGRILYTNPT-----RHHPYTH--TTCRPQFLFPLRRSPQDHLHIVS 51

Query: 57  AA-KKLSS--RTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFENAG 113
           A+ KKLSS  R G+FDSKNRR +T +    EE+K+D       +  +  N++ +  EN  
Sbjct: 52  ASNKKLSSVSRIGKFDSKNRRSSTTSTKDQEENKRDGSGGDENVNGEVKNVSTSVGEN-- 109

Query: 114 VIVGVDGSPMPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGAT 173
                DG  +PELPG EPDFWEGPQW  FGFF+QY+WAFGIVFALVACGIAVATYN GA 
Sbjct: 110 ----YDGYFLPELPGDEPDFWEGPQWDGFGFFVQYMWAFGIVFALVACGIAVATYNGGAA 165

Query: 174 DFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           DFKETPAYKES+QSRDLLE P+ASNSDVF+SNPTEVAPSL+
Sbjct: 166 DFKETPAYKESIQSRDLLEEPEASNSDVFDSNPTEVAPSLE 206


>gi|224113385|ref|XP_002332599.1| predicted protein [Populus trichocarpa]
 gi|222834250|gb|EEE72727.1| predicted protein [Populus trichocarpa]
          Length = 206

 Score =  234 bits (598), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 131/221 (59%), Positives = 155/221 (70%), Gaps = 22/221 (9%)

Query: 1   MSLTIANTAGSGSGSRILYTNPTTDPQHRRHGITFPTSTCSSR----ITRAPFRTLHVVS 56
           MSL IAN  G+G G RILYTNPT     R H  T   + C  +    + R+P   LH+VS
Sbjct: 1   MSLFIAN--GNGGGGRILYTNPT-----RHHPYTH--TPCRPQFLFPLRRSPQDHLHIVS 51

Query: 57  AA-KKLSS--RTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFENAG 113
           A+ KKLSS  R G+FDSKNRR +T +    EE+K+D       +  +  N++ +  EN  
Sbjct: 52  ASNKKLSSVSRIGKFDSKNRRSSTTSTKDQEENKRDGSGGDENVNGEVKNVSTSVGEN-- 109

Query: 114 VIVGVDGSPMPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGAT 173
                DG  +PELPG EPDFWEGPQW  FGFF+QY+WAFGIVFALVACGIAVATYN GA 
Sbjct: 110 ----YDGYFLPELPGDEPDFWEGPQWDGFGFFVQYMWAFGIVFALVACGIAVATYNGGAA 165

Query: 174 DFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           DFKETPAYKES+QSRDLLE P+ASNSDVF+SNPTEVAPSL+
Sbjct: 166 DFKETPAYKESIQSRDLLEEPEASNSDVFDSNPTEVAPSLE 206


>gi|255583792|ref|XP_002532648.1| conserved hypothetical protein [Ricinus communis]
 gi|223527608|gb|EEF29721.1| conserved hypothetical protein [Ricinus communis]
          Length = 198

 Score =  216 bits (550), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 122/221 (55%), Positives = 148/221 (66%), Gaps = 30/221 (13%)

Query: 1   MSLTIANTAGSGSGSRILYTNPTTDP--QHRRHGITFPTSTCSSRITRAPFRTLHVVSAA 58
           M+L IA   G  SG RI+ TNPT      H R  I FP          AP R  H++ +A
Sbjct: 1   MALFIATGHGGCSGGRIICTNPTKHHPYTHHRPQIVFPL---------APRRAHHIIVSA 51

Query: 59  KKL--SSRTGRFDSKNRRGNTGAITTTEEDKK--DQQQRTAEIGQDS-TNINNNKFENAG 113
           KKL  S+RTGRFDSKN+R    +ITT ++D    D ++  A+I   S  ++ +N ++   
Sbjct: 52  KKLPSSARTGRFDSKNKRS---SITTKDQDDNTDDNEEVNADISSSSSVDVEDNYYD--- 105

Query: 114 VIVGVDGSPMPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGAT 173
                    + +LP  E +FWEGPQW A GFFLQYLWAFGIVFAL+ACGIAVATYNEGAT
Sbjct: 106 --------ILSKLPKDETEFWEGPQWDALGFFLQYLWAFGIVFALIACGIAVATYNEGAT 157

Query: 174 DFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           DFK+TPAYKESVQS++LLE PDAS SDVFESNPTE+APSLD
Sbjct: 158 DFKDTPAYKESVQSQELLEEPDASTSDVFESNPTEIAPSLD 198


>gi|297734196|emb|CBI15443.3| unnamed protein product [Vitis vinifera]
          Length = 248

 Score =  215 bits (548), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 129/219 (58%), Positives = 153/219 (69%), Gaps = 18/219 (8%)

Query: 1   MSLTIANTAGSGSGSRILYTNPTTDPQHRRHGITFPTS--TCSSRITRAPFRTLHVVSAA 58
           MSLTIA TA +  G  +  T+P+T    RRH +  P+   T  +  T    RT H+V  A
Sbjct: 43  MSLTIAGTAVARGGVGLTTTSPSTHLPIRRHFL-LPSLPFTLITTTTTTRSRTSHIV-FA 100

Query: 59  KKLSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFENAGVIVGV 118
           KKLSSRTGRFDSKNRR  T    TT+E +++Q++  AEI +  +       EN GV  G 
Sbjct: 101 KKLSSRTGRFDSKNRRSGT----TTKEGQQEQREE-AEIKEIGST------ENVGVADGD 149

Query: 119 DGSP---MPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDF 175
           D      +PELPG +PDFWEGPQW A GFF+QYLWAFGIVFAL+ACGIAV TYNEGATDF
Sbjct: 150 DDDDGYFLPELPGDKPDFWEGPQWDALGFFVQYLWAFGIVFALIACGIAVVTYNEGATDF 209

Query: 176 KETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           K+TP Y+ESVQSRDLLE P+ SNSDVFESNPTE APSL+
Sbjct: 210 KKTPVYQESVQSRDLLEEPETSNSDVFESNPTEEAPSLE 248


>gi|449440937|ref|XP_004138240.1| PREDICTED: uncharacterized protein LOC101218989 [Cucumis sativus]
 gi|449528110|ref|XP_004171049.1| PREDICTED: uncharacterized protein LOC101223826 [Cucumis sativus]
          Length = 199

 Score =  215 bits (548), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 124/219 (56%), Positives = 155/219 (70%), Gaps = 25/219 (11%)

Query: 1   MSLTIANTAGS-GSGSRILYTNPTTDPQH----RRHGITFPTSTCSSRITRAPFRTLHVV 55
           M LTI N   + G+   ILY   T  P +    RR+ +  P+            RTLH+V
Sbjct: 1   MPLTITNGGATVGTSGGILYRTFTRTPSNYFLFRRNALFHPSKEA---------RTLHLV 51

Query: 56  SAAKKLSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFENAGVI 115
            A KK S RTGRFDSKNR+ +T   T  E+++++++ RTA +   S  +     ENAGV+
Sbjct: 52  QA-KKSSFRTGRFDSKNRKSST---TIKEQEEEEERNRTAGVEMGSPVV-----ENAGVV 102

Query: 116 VGVDGSPMPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDF 175
             VD + +P+LPGL+PDFWEGPQW AFGFFL+YLWAFGIVFA++ACGIAV TYNEGATDF
Sbjct: 103 FDVDEN-LPQLPGLQPDFWEGPQWDAFGFFLEYLWAFGIVFAIIACGIAVTTYNEGATDF 161

Query: 176 KETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           KETPAYKESVQ+R++LE P+ASN DVFESNPTEVAPSL+
Sbjct: 162 KETPAYKESVQTREILE-PEASNPDVFESNPTEVAPSLE 199


>gi|225455912|ref|XP_002276428.1| PREDICTED: uncharacterized protein LOC100261000 [Vitis vinifera]
          Length = 206

 Score =  214 bits (545), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 129/219 (58%), Positives = 153/219 (69%), Gaps = 18/219 (8%)

Query: 1   MSLTIANTAGSGSGSRILYTNPTTDPQHRRHGITFPTS--TCSSRITRAPFRTLHVVSAA 58
           MSLTIA TA +  G  +  T+P+T    RRH +  P+   T  +  T    RT H+V  A
Sbjct: 1   MSLTIAGTAVARGGVGLTTTSPSTHLPIRRHFL-LPSLPFTLITTTTTTRSRTSHIV-FA 58

Query: 59  KKLSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFENAGVIVGV 118
           KKLSSRTGRFDSKNRR  T    TT+E +++Q++  AEI +  +       EN GV  G 
Sbjct: 59  KKLSSRTGRFDSKNRRSGT----TTKEGQQEQREE-AEIKEIGST------ENVGVADGD 107

Query: 119 DGSP---MPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDF 175
           D      +PELPG +PDFWEGPQW A GFF+QYLWAFGIVFAL+ACGIAV TYNEGATDF
Sbjct: 108 DDDDGYFLPELPGDKPDFWEGPQWDALGFFVQYLWAFGIVFALIACGIAVVTYNEGATDF 167

Query: 176 KETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           K+TP Y+ESVQSRDLLE P+ SNSDVFESNPTE APSL+
Sbjct: 168 KKTPVYQESVQSRDLLEEPETSNSDVFESNPTEEAPSLE 206


>gi|22655144|gb|AAM98162.1| unknown protein [Arabidopsis thaliana]
 gi|48310646|gb|AAT41859.1| At1g73885 [Arabidopsis thaliana]
          Length = 212

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/232 (48%), Positives = 142/232 (61%), Gaps = 38/232 (16%)

Query: 1   MSLTIANTAGSGSGSRIL---YTNPTTDPQHRRHGITFPTSTCSSRITRAPFRT-LHVVS 56
           M+LT+++    G G+RI     T+ + +P      + FP           P RT LHVVS
Sbjct: 1   MALTVSS---GGGGARIRCNDLTDSSRNPFSTFRLVNFPN----------PSRTGLHVVS 47

Query: 57  AAKKLSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTN------INNNKFE 110
           AAKK S++TGRFDSK RR  T   TTT+E     QQ     G D  N      IN+   +
Sbjct: 48  AAKKPSTQTGRFDSKKRR--TLVPTTTKE-----QQEEGNEGFDGENPPSQIVINDEDGD 100

Query: 111 NAGVIVGVDGSP-------MPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGI 163
              V     G P       + +LPGLEPD +EG +W   GFF+QYLWAFGI+FAL++ G+
Sbjct: 101 RTVVNTRFRGDPKDAPKFAIKDLPGLEPDPFEGEKWDGLGFFVQYLWAFGILFALISGGL 160

Query: 164 AVATYNEGATDFKETPAYKESVQSRDLLEGPDASNS-DVFESNPTEVAPSLD 214
           A  TYNEGATDFKETP YKE++QSRDLL+  ++SNS DVFESNPTEVAPSL+
Sbjct: 161 AAGTYNEGATDFKETPVYKEAIQSRDLLDEAESSNSEDVFESNPTEVAPSLE 212


>gi|297839265|ref|XP_002887514.1| hypothetical protein ARALYDRAFT_339588 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333355|gb|EFH63773.1| hypothetical protein ARALYDRAFT_339588 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 212

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 112/232 (48%), Positives = 142/232 (61%), Gaps = 38/232 (16%)

Query: 1   MSLTIANTAGSGSGSRIL---YTNPTTDPQHRRHGITFPTSTCSSRITRAPFRT-LHVVS 56
           M+LT+++    G G+RI     T+ + +P      + FP+          P RT LHVVS
Sbjct: 1   MALTVSS---GGGGARIRCNDLTDSSRNPFSTFRLVNFPS----------PSRTCLHVVS 47

Query: 57  AAKKLSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTN------INNNKFE 110
           AAKK S++TGRFDSK RR  T   TTT+E     Q      G D  N      IN+   +
Sbjct: 48  AAKKPSTQTGRFDSKKRR--TLVPTTTKE-----QSEEGNGGYDGENPPSEIVINDEDGD 100

Query: 111 NAGVIVGVDGSP-------MPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGI 163
              V     G P       + +LPGLEPD +EG +W   GFF+QYLWAFGI+FAL++ G+
Sbjct: 101 RTVVNTRFRGDPKDAPKFALKDLPGLEPDPFEGEKWDGLGFFVQYLWAFGILFALISGGL 160

Query: 164 AVATYNEGATDFKETPAYKESVQSRDLLEGPDASNS-DVFESNPTEVAPSLD 214
           A  TYNEGATDFKETP YKE++QSRDLL+  ++SNS DVFESNPTEVAPSL+
Sbjct: 161 AAGTYNEGATDFKETPVYKEAIQSRDLLDEAESSNSEDVFESNPTEVAPSLE 212


>gi|115454897|ref|NP_001051049.1| Os03g0710600 [Oryza sativa Japonica Group]
 gi|62733541|gb|AAX95658.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108710708|gb|ABF98503.1| expressed protein [Oryza sativa Japonica Group]
 gi|113549520|dbj|BAF12963.1| Os03g0710600 [Oryza sativa Japonica Group]
 gi|125545462|gb|EAY91601.1| hypothetical protein OsI_13236 [Oryza sativa Indica Group]
 gi|215697069|dbj|BAG91063.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 199

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 86/168 (51%), Positives = 108/168 (64%), Gaps = 14/168 (8%)

Query: 52  LHVVSAAKK-LSSRTGRF---DSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNN 107
           LHV  A +  +SSRT R      KN+R   G++T       D+         D+      
Sbjct: 41  LHVARARRGGVSSRTQRRLEERGKNKRRGGGSVTAPAPPDMDE---------DAAAGEGV 91

Query: 108 KFENAGVIVGVDGSPMPELPGLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVA 166
            +E   +   V  +PMPELP  E PDFWEGPQW A GFF+QY+WAFG+ F LVACG AVA
Sbjct: 92  DWEGEPLGFEVSTTPMPELPDPEKPDFWEGPQWDALGFFVQYMWAFGVFFGLVACGFAVA 151

Query: 167 TYNEGATDFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           TYNEGATDF+ETP+YKESVQ+++  E  ++S SDVFE NPTEVAP+L+
Sbjct: 152 TYNEGATDFRETPSYKESVQTQEFPEESESSGSDVFEGNPTEVAPALE 199


>gi|226495819|ref|NP_001141084.1| uncharacterized protein LOC100273166 [Zea mays]
 gi|194702566|gb|ACF85367.1| unknown [Zea mays]
 gi|413933300|gb|AFW67851.1| hypothetical protein ZEAMMB73_287926 [Zea mays]
          Length = 186

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 86/170 (50%), Positives = 109/170 (64%), Gaps = 21/170 (12%)

Query: 51  TLHVVSAAKK-LSSRTGRF----DSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNIN 105
            LHV  A +  +SSRT R       K RRG   A         D  +  AE G  +    
Sbjct: 32  ALHVALARRGGVSSRTQRRLEERGGKKRRGGVAA--------PDVDEDAAEAGTAA---- 79

Query: 106 NNKFENAGVIVGVDGSPMPELPGLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIA 164
             ++E   +   V   PMPELP  E PDFWEGPQW A GFF+QY+WAFG+VF L+ACG+A
Sbjct: 80  --EWEGEPLGFEVSTEPMPELPNPEKPDFWEGPQWEALGFFVQYMWAFGVVFGLIACGVA 137

Query: 165 VATYNEGATDFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           VATYN+GATDF++TPAYKES QS++  E  ++S++DVFE NPTEVAP+L+
Sbjct: 138 VATYNDGATDFRDTPAYKES-QSQEFPEESESSSADVFEGNPTEVAPALE 186


>gi|242033219|ref|XP_002464004.1| hypothetical protein SORBIDRAFT_01g010450 [Sorghum bicolor]
 gi|241917858|gb|EER91002.1| hypothetical protein SORBIDRAFT_01g010450 [Sorghum bicolor]
          Length = 185

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 66/98 (67%), Positives = 83/98 (84%), Gaps = 2/98 (2%)

Query: 118 VDGSPMPELPGLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFK 176
           V   PMP+LP  E PDFWEGPQW A GFF+QY+WAFG+VF L+ACG+AVATYN+GATDF+
Sbjct: 89  VSTEPMPQLPDPETPDFWEGPQWEALGFFVQYMWAFGVVFGLIACGVAVATYNDGATDFR 148

Query: 177 ETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           +TPAYKES QS++  E  ++S++DVFE NPTEVAP+L+
Sbjct: 149 DTPAYKES-QSQEFPEEAESSSADVFEGNPTEVAPALE 185


>gi|212723678|ref|NP_001132373.1| uncharacterized protein LOC100193818 [Zea mays]
 gi|194694208|gb|ACF81188.1| unknown [Zea mays]
 gi|195623876|gb|ACG33768.1| hypothetical protein [Zea mays]
 gi|414872406|tpg|DAA50963.1| TPA: hypothetical protein ZEAMMB73_905777 [Zea mays]
          Length = 185

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 74/148 (50%), Positives = 102/148 (68%), Gaps = 7/148 (4%)

Query: 73  RRGNTGAITTTEEDKKDQQQR-----TAEIGQDSTNINNNKFENAGVIVGVDGSPMPELP 127
           RRG   + T    +++  ++R       ++ +D+      ++E   +   V   PMP LP
Sbjct: 39  RRGGVSSRTQRRLEERGGKKRRGGVVAPDVDEDAAEAGVAEWEGEPLGFEVSTEPMPRLP 98

Query: 128 GLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYKESVQ 186
             E PDFWEGPQW A GFF+QY+WAFG+VF L+ACG+AVATYN+GATDFK+TPAYKES Q
Sbjct: 99  DPETPDFWEGPQWEALGFFVQYMWAFGVVFGLIACGVAVATYNDGATDFKDTPAYKES-Q 157

Query: 187 SRDLLEGPDASNSDVFESNPTEVAPSLD 214
           S++  E  ++S++DVFE NPTEVAP+L+
Sbjct: 158 SQEFPEESESSSADVFEGNPTEVAPALE 185


>gi|226504622|ref|NP_001145125.1| uncharacterized protein LOC100278350 [Zea mays]
 gi|195651671|gb|ACG45303.1| hypothetical protein [Zea mays]
          Length = 186

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 86/170 (50%), Positives = 108/170 (63%), Gaps = 21/170 (12%)

Query: 51  TLHVVSAAKK-LSSRTGRF----DSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNIN 105
            LHV  A +  +SSRT R       K RRG   A         D  +  AE G  +    
Sbjct: 32  ALHVALARRGGVSSRTQRRLEERGGKKRRGGVAA--------PDVDEDAAEAGTAA---- 79

Query: 106 NNKFENAGVIVGVDGSPMPELPGLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIA 164
             ++E   +   V   PMPELP  E PDFWEGPQW A GFF+QY+WAFG+VF L+ACG+A
Sbjct: 80  --EWEGEPLGFEVSTEPMPELPNPEKPDFWEGPQWEALGFFVQYMWAFGVVFGLIACGVA 137

Query: 165 VATYNEGATDFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           VATYN+GATDF +TPAYKES QS++  E  ++S++DVFE NPTEVAP+L+
Sbjct: 138 VATYNDGATDFXDTPAYKES-QSQEFPEESESSSADVFEGNPTEVAPALE 186


>gi|116791040|gb|ABK25835.1| unknown [Picea sitchensis]
          Length = 200

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 63/92 (68%), Positives = 77/92 (83%), Gaps = 1/92 (1%)

Query: 123 MPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYK 182
           +P+LPG EPDFWEGP+W  FGF ++YLWAFG VFALVACG+AV TYN GATDFKETP YK
Sbjct: 107 VPKLPGEEPDFWEGPKWDTFGFIIRYLWAFGFVFALVACGVAVRTYNFGATDFKETPVYK 166

Query: 183 ESVQSRDLL-EGPDASNSDVFESNPTEVAPSL 213
           E+++S+ L  E P+ SNS++FE+NPTE AP L
Sbjct: 167 EAIESQGLFDEAPETSNSEIFENNPTEDAPLL 198


>gi|22330614|ref|NP_683492.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332197400|gb|AEE35521.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 192

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 63/92 (68%), Positives = 79/92 (85%), Gaps = 1/92 (1%)

Query: 123 MPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYK 182
           + +LPGLEPD +EG +W   GFF+QYLWAFGI+FAL++ G+A  TYNEGATDFKETP YK
Sbjct: 88  IKDLPGLEPDPFEGEKWDGLGFFVQYLWAFGILFALISGGLAAGTYNEGATDFKETPVYK 147

Query: 183 ESVQSRDLLEGPDASNS-DVFESNPTEVAPSL 213
           E++QSRDLL+  ++SNS DVFESNPTEVAP++
Sbjct: 148 EAIQSRDLLDEAESSNSEDVFESNPTEVAPTI 179


>gi|357118118|ref|XP_003560805.1| PREDICTED: uncharacterized protein LOC100828553 [Brachypodium
           distachyon]
          Length = 191

 Score =  136 bits (342), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 78/169 (46%), Positives = 104/169 (61%), Gaps = 18/169 (10%)

Query: 52  LHVVSAAKK-LSSRTGRF----DSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINN 106
           LHV  A +  +SSRT R       K RRG   A    + D+   +               
Sbjct: 35  LHVALARRGGVSSRTQRRLEERGGKKRRGGVDAPAPPDADEDYAE-----------AWEG 83

Query: 107 NKFENAGVIVGVDGSPMPELPGLEP-DFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAV 165
            ++E   +   V  +PMP LP  E  DFWEGPQW   GFF+QY+WAFG+ F+L+ACG+AV
Sbjct: 84  VEWEGEPLGFEVSTTPMPYLPDPEQQDFWEGPQWDGLGFFVQYMWAFGVFFSLIACGVAV 143

Query: 166 ATYNEGATDFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           ATYN+GATDF++TPA+KES Q+++  E  ++S SDVFE NPTEVAP+L+
Sbjct: 144 ATYNDGATDFRDTPAFKES-QTQEFPEESESSGSDVFEGNPTEVAPALE 191


>gi|310656735|gb|ADP02174.1| unknown [Triticum aestivum]
          Length = 183

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 73/166 (43%), Positives = 101/166 (60%), Gaps = 14/166 (8%)

Query: 52  LHVVSAAKK-LSSRTGR-FDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKF 109
           LHV  A +  +SSRT R  + +  +   G + T + D+                    ++
Sbjct: 29  LHVALARRAGVSSRTQRRLEERGGKKRRGGVETPDADE-----------DTGPAWEGVEW 77

Query: 110 ENAGVIVGVDGSPMPELPGLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATY 168
           E   +   V   PMP LP  E PDFWEGP     GFF+QY+WAFG+ F+LVACG+AVATY
Sbjct: 78  EGEPLGFEVSTEPMPYLPDPEKPDFWEGPNMEVLGFFVQYMWAFGVFFSLVACGVAVATY 137

Query: 169 NEGATDFKETPAYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           N+GA+DF++TPA+KES Q ++  E  ++S +DVFE NPTEVAP+L+
Sbjct: 138 NDGASDFRDTPAFKESTQVQEFPEESESSGADVFEGNPTEVAPALE 183


>gi|388519053|gb|AFK47588.1| unknown [Medicago truncatula]
          Length = 171

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 111/214 (51%), Gaps = 43/214 (20%)

Query: 1   MSLTIANTAGSGSGSRILYTNPTTDPQHRRHGITFPTSTCSSRITRAPFRTLHVVSAAKK 60
           M+ T+A   G+G    ILY   TT   +            SSR      RTLH+VSA KK
Sbjct: 1   MAFTVATYGGAG----ILYNKTTTTTPYT-------LLPSSSR------RTLHIVSA-KK 42

Query: 61  LSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFENAGVIVGVDG 120
           +SSR+       RR       T   D+K + +   EI   S      ++   G       
Sbjct: 43  ISSRS-------RRNQP----TETVDEKIEPRNEVEIPFYSDEDWPFEWRPPGTYK---- 87

Query: 121 SPMPELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPA 180
                    +PDF+EG QW   GF  ++LW  G++FA+   G A   YN GA+DFKETPA
Sbjct: 88  ---------DPDFFEGSQWNTVGFVGEWLWILGVLFAVFGGGCAAINYNLGASDFKETPA 138

Query: 181 YKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
           YKES QS++LLE P+ S SD+F+SNPTEVAPSL+
Sbjct: 139 YKES-QSQELLEQPETSESDIFDSNPTEVAPSLN 171


>gi|222625659|gb|EEE59791.1| hypothetical protein OsJ_12309 [Oryza sativa Japonica Group]
          Length = 66

 Score =  103 bits (256), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 46/66 (69%), Positives = 57/66 (86%)

Query: 149 LWAFGIVFALVACGIAVATYNEGATDFKETPAYKESVQSRDLLEGPDASNSDVFESNPTE 208
           +WAFG+ F LVACG AVATYNEGATDF+ETP+YKESVQ+++  E  ++S SDVFE NPTE
Sbjct: 1   MWAFGVFFGLVACGFAVATYNEGATDFRETPSYKESVQTQEFPEESESSGSDVFEGNPTE 60

Query: 209 VAPSLD 214
           VAP+L+
Sbjct: 61  VAPALE 66


>gi|168061524|ref|XP_001782738.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665771|gb|EDQ52444.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 257

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 61/88 (69%), Gaps = 3/88 (3%)

Query: 125 ELPGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYKES 184
           +L G EPDFWEG  W + GF +QYLW FG+V ++VAC  AV TYN GATDFK T  +KE+
Sbjct: 103 KLEGQEPDFWEGSSWNSLGFAMQYLWVFGVVVSIVACITAVRTYNLGATDFKNTEVFKEA 162

Query: 185 VQSRDLLEGPDASNSDVFESNPTEVAPS 212
           ++ +  ++G + + S VF+ + T+ AP 
Sbjct: 163 IEYQ--VDGSEDTGSSVFD-DATKEAPC 187


>gi|302818540|ref|XP_002990943.1| hypothetical protein SELMODRAFT_448213 [Selaginella moellendorffii]
 gi|300141274|gb|EFJ07987.1| hypothetical protein SELMODRAFT_448213 [Selaginella moellendorffii]
          Length = 230

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 34/70 (48%), Positives = 42/70 (60%), Gaps = 2/70 (2%)

Query: 127 PGLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATD--FKETPAYKES 184
           PG+ PDFWEGPQW   GF LQYLWA GIV A++ CGI   TY     D  F+E P+ ++ 
Sbjct: 115 PGMVPDFWEGPQWNWLGFILQYLWAIGIVVAVITCGILFVTYKPREADYRFREPPSLEDV 174

Query: 185 VQSRDLLEGP 194
            +  D    P
Sbjct: 175 KKILDERRSP 184


>gi|302802177|ref|XP_002982844.1| hypothetical protein SELMODRAFT_445280 [Selaginella moellendorffii]
 gi|300149434|gb|EFJ16089.1| hypothetical protein SELMODRAFT_445280 [Selaginella moellendorffii]
          Length = 171

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/64 (50%), Positives = 40/64 (62%), Gaps = 2/64 (3%)

Query: 128 GLEPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATD--FKETPAYKESV 185
           G+ PDFWEGPQW   GF LQYLWA GIV A++ CGI   TY     D  F+E P+ ++  
Sbjct: 57  GMVPDFWEGPQWNWLGFILQYLWAIGIVVAVITCGILFVTYKPREADYRFREPPSLEDVK 116

Query: 186 QSRD 189
           +  D
Sbjct: 117 KILD 120


>gi|414872405|tpg|DAA50962.1| TPA: hypothetical protein ZEAMMB73_905777 [Zea mays]
          Length = 174

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 53/94 (56%), Gaps = 6/94 (6%)

Query: 73  RRGNTGAITTTEEDKKDQQQR-----TAEIGQDSTNINNNKFENAGVIVGVDGSPMPELP 127
           RRG   + T    +++  ++R       ++ +D+      ++E   +   V   PMP LP
Sbjct: 39  RRGGVSSRTQRRLEERGGKKRRGGVVAPDVDEDAAEAGVAEWEGEPLGFEVSTEPMPRLP 98

Query: 128 GLE-PDFWEGPQWGAFGFFLQYLWAFGIVFALVA 160
             E PDFWEGPQW A GFF+QY+WAFG+VF + A
Sbjct: 99  DPETPDFWEGPQWEALGFFVQYMWAFGVVFGVSA 132


>gi|356520589|ref|XP_003528944.1| PREDICTED: uncharacterized protein LOC100800499 [Glycine max]
          Length = 195

 Score = 62.4 bits (150), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 20/104 (19%)

Query: 131 PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETP----------- 179
           PDFWEG QW   G F+++    G+  A         T++E   + +E             
Sbjct: 92  PDFWEGSQWDWLGIFVKFSPIIGVAVAACLAIYGCFTFHEPPKEMREAAERLYSVQSGVE 151

Query: 180 ---------AYKESVQSRDLLEGPDASNSDVFESNPTEVAPSLD 214
                    +   SV+S  ++E PD  +SDVF+SNPTEVAPSL+
Sbjct: 152 SSAESGLESSADSSVESGKVIEEPDTYDSDVFDSNPTEVAPSLE 195


>gi|351722871|ref|NP_001236747.1| uncharacterized protein LOC100500608 [Glycine max]
 gi|255630746|gb|ACU15734.1| unknown [Glycine max]
          Length = 196

 Score = 60.8 bits (146), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 51/111 (45%), Gaps = 26/111 (23%)

Query: 130 EPDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYKESVQS-- 187
           +PD WEGPQW   G F+++    GI  A         T++E   + +E      SVQS  
Sbjct: 86  QPDLWEGPQWDWLGIFIKFSPIIGIAVAASLAIYGCFTFHEPPKNVREAAERLYSVQSGV 145

Query: 188 ------------------------RDLLEGPDASNSDVFESNPTEVAPSLD 214
                                    ++ + PDA +SDVF+SNPTEVAPSL+
Sbjct: 146 ESNAESDDVESSAETDVESSAESSSEVFKEPDAYDSDVFDSNPTEVAPSLE 196


>gi|12325153|gb|AAG52529.1|AC016662_23 putative glucosyltransferase; 88035-86003 [Arabidopsis thaliana]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 59/114 (51%), Gaps = 19/114 (16%)

Query: 1   MSLTIANTAGSGSGSRIL---YTNPTTDPQHRRHGITFPTSTCSSRITRAPFRT-LHVVS 56
           M+LT+++    G G+RI     T+ + +P      + FP           P RT LHVVS
Sbjct: 468 MALTVSS---GGGGARIRCNDLTDSSRNPFSTFRLVNFPN----------PSRTGLHVVS 514

Query: 57  AAKKLSSRTGRFDSKNRRGNTGAITTTEEDKKDQQQRTAEIGQDSTNINNNKFE 110
           AAKK S++TGRFDSK RR  T   TTT+  +K + +      +   +++  K E
Sbjct: 515 AAKKPSTQTGRFDSKKRR--TLVPTTTKNSRKRETKVLTARTRRLRSLSTTKME 566


>gi|332669836|ref|YP_004452844.1| integral membrane sensor signal transduction histidine kinase
           [Cellulomonas fimi ATCC 484]
 gi|332338874|gb|AEE45457.1| integral membrane sensor signal transduction histidine kinase
           [Cellulomonas fimi ATCC 484]
          Length = 418

 Score = 37.7 bits (86), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 31/62 (50%)

Query: 131 PDFWEGPQWGAFGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYKESVQSRDL 190
           PD W  P+W  +    +YL A G+ F+L    +A  T  E A +   TP  +E V +  L
Sbjct: 11  PDVWLHPRWRGWSRVWRYLLAAGLGFSLWTAQVAALTVPEIAGEAAVTPPSEEIVGAAIL 70

Query: 191 LE 192
           L+
Sbjct: 71  LD 72


>gi|440296522|gb|ELP89326.1| pH-response regulator protein palA/RIM20, putative [Entamoeba
           invadens IP1]
          Length = 941

 Score = 36.2 bits (82), Expect = 8.9,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 142 FGFFLQYLWAFGIVFALVACGIAVATYNEGATDFKETPAYKESV--QSRDLLEGPDASNS 199
           + +FL  + A  +    V  G+A+A Y+EG  + KET    E+   + +D L      N 
Sbjct: 244 YAYFLARVAAGMVAEEDVEYGVAIAQYSEGVKEIKETSLANETYVKEVKDKLNSLKKDNE 303

Query: 200 DVFESNPTEVAPSL 213
           +++ S   EV P+L
Sbjct: 304 EIYMSKIPEVVPAL 317


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.130    0.387 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,584,794,116
Number of Sequences: 23463169
Number of extensions: 149343748
Number of successful extensions: 307339
Number of sequences better than 100.0: 49
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 307270
Number of HSP's gapped (non-prelim): 51
length of query: 214
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 78
effective length of database: 9,168,204,383
effective search space: 715119941874
effective search space used: 715119941874
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 74 (33.1 bits)