BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029325
         (195 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|356497816|ref|XP_003517753.1| PREDICTED: uncharacterized protein LOC100785194 [Glycine max]
          Length = 195

 Score =  178 bits (452), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 111/182 (60%), Positives = 135/182 (74%), Gaps = 8/182 (4%)

Query: 16  SSAKLDHSSPS--PTANLSFHHNRHTNHNKTLSELSICHAQLNYGSAGEDPDEIGEAFFD 73
           S  KL H S +  P +   FH   + NH +    ++ CH +LN  S G +   + EA FD
Sbjct: 19  SHTKLCHFSANNVPFSIQPFHSRFYHNH-RLKPIITNCHGKLN-SSGGGEAYAMDEAGFD 76

Query: 74  EGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSF 133
           E  DG   +++ SDDEDD +SSVDLLIRFLQ+ FKKVSKRAKKASRS+LP  ISP+LVSF
Sbjct: 77  E-YDG---VEEESDDEDDAESSVDLLIRFLQSMFKKVSKRAKKASRSVLPTVISPQLVSF 132

Query: 134 AVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGNSFNQGGSSFA 193
           AVDG LLLASLS++KALLEV+C++GGTVF  IL+LR IWAA+SYFQSSGNSFNQGG+SF 
Sbjct: 133 AVDGTLLLASLSVVKALLEVICTLGGTVFAAILVLRVIWAAVSYFQSSGNSFNQGGNSFG 192

Query: 194 AT 195
           A 
Sbjct: 193 AV 194


>gi|356500831|ref|XP_003519234.1| PREDICTED: uncharacterized protein LOC100784688 [Glycine max]
          Length = 193

 Score =  178 bits (451), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 114/191 (59%), Positives = 140/191 (73%), Gaps = 16/191 (8%)

Query: 12  VINLS----SAKLDH--SSPSPTANLSFHHNRHTNHNKTLSEL-SICHAQLNYGSAGEDP 64
           +INL+    S+K  H  S+ +P +   FH   +  HN+ L  + + CH +LN    G   
Sbjct: 11  LINLTRPSHSSKHRHFSSNNAPFSIQPFHSGFY--HNRRLKPIITNCHGKLNSSGGGYAM 68

Query: 65  DEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILPA 124
           DE G   FDE  DG   ++D SDDEDD +SSVDLLIRFLQ+ FKKVSKRAKKASRS+LP 
Sbjct: 69  DEAG---FDE-YDG---VEDESDDEDDAESSVDLLIRFLQSMFKKVSKRAKKASRSVLPT 121

Query: 125 AISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
            ISP+LVSFAVDG LLLASLS++KALLE++C++GGTVF  IL+LR IWAA+SYFQSSGNS
Sbjct: 122 VISPQLVSFAVDGTLLLASLSVVKALLEIICTLGGTVFAAILVLRVIWAAVSYFQSSGNS 181

Query: 185 FNQGGSSFAAT 195
           FNQGG+SF A 
Sbjct: 182 FNQGGNSFGAV 192


>gi|255558302|ref|XP_002520178.1| conserved hypothetical protein [Ricinus communis]
 gi|223540670|gb|EEF42233.1| conserved hypothetical protein [Ricinus communis]
          Length = 254

 Score =  176 bits (446), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 105/177 (59%), Positives = 132/177 (74%), Gaps = 6/177 (3%)

Query: 22  HSSPSPTANLSFHHNRHT----NHNKTLSELSICHAQLNYGSAGEDPDEIGEAFFDEGVD 77
           H S SP  NL FHH          +  +  + ICHA+L+  SAG+  +++GE FFD+  D
Sbjct: 75  HHSLSPPQNLPFHHQSSHFHHLTQSYKIPNIIICHAKLD-ASAGDFHEDVGEMFFDDNND 133

Query: 78  GGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDG 137
                 +  +D+DD +SS+DLL +FLQ+ FKK+SKRAKKASRSILP+ ISP+LVSFAVDG
Sbjct: 134 L-VEDYESDEDDDDTESSIDLLFKFLQSMFKKISKRAKKASRSILPSVISPQLVSFAVDG 192

Query: 138 ILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGNSFNQGGSSFAA 194
           ILLLA+LSI+KALLEVVC++G TVF +ILLLR +WAAISYFQSSGN+F QGGSSF  
Sbjct: 193 ILLLAALSIVKALLEVVCTLGSTVFTVILLLRVVWAAISYFQSSGNTFYQGGSSFGG 249


>gi|224102473|ref|XP_002312691.1| predicted protein [Populus trichocarpa]
 gi|222852511|gb|EEE90058.1| predicted protein [Populus trichocarpa]
          Length = 203

 Score =  176 bits (446), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 93/146 (63%), Positives = 118/146 (80%), Gaps = 5/146 (3%)

Query: 50  ICHAQLNYGSAGEDPDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKK 109
           ICH +L+     + P+EI E FFD+  D   + ++   DED+ +SS+DLLIRFLQ+ FKK
Sbjct: 59  ICHGKLD----SDVPEEINEVFFDDNYDM-MVDEEEISDEDETESSIDLLIRFLQSMFKK 113

Query: 110 VSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLR 169
           +SKRAKKASRS+LPA ISP+LVSFAVDGILLLA+LSI+KALLEVVC++G TVFV+ILLLR
Sbjct: 114 LSKRAKKASRSMLPAVISPQLVSFAVDGILLLAALSIVKALLEVVCTLGSTVFVVILLLR 173

Query: 170 FIWAAISYFQSSGNSFNQGGSSFAAT 195
            +W A+SYFQSS N+ ++GGSSF  T
Sbjct: 174 VVWTAVSYFQSSENTSSKGGSSFGTT 199


>gi|225424833|ref|XP_002272743.1| PREDICTED: uncharacterized protein LOC100243345 [Vitis vinifera]
          Length = 198

 Score =  159 bits (403), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 109/194 (56%), Positives = 143/194 (73%), Gaps = 19/194 (9%)

Query: 11  IVINL---SSAKLD-HSSPSPT---ANLSFHHNRHTNHNKTLSELSICHAQLNYGSAGED 63
           ++INL   ++ KL+ + +PSP       S H    T+     + L+  HA+L+ GSAGE 
Sbjct: 11  LIINLNPTATTKLNLNLTPSPNFACPPQSLHLRHRTSPTGNPTTLT-SHARLS-GSAGEA 68

Query: 64  PDEIGEAFFDEGVDGGAIMDDI--SDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSI 121
           P+EI E+FF++        +D+  SD+ED+ +SS DLLI FLQ+ FKK SK AKKASRS+
Sbjct: 69  PEEIDESFFED--------EDLIESDEEDETESSADLLIEFLQSMFKKASKHAKKASRSV 120

Query: 122 LPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSS 181
           LPAAISP+LVSFAVDG+L+LASLSI+KALLEV C++GGTVFV+ILLLR IWA +SYFQ+S
Sbjct: 121 LPAAISPQLVSFAVDGVLILASLSIIKALLEVFCTLGGTVFVVILLLRVIWATVSYFQTS 180

Query: 182 GNSFNQGGSSFAAT 195
           GN F+QGG+SF  T
Sbjct: 181 GNGFSQGGTSFGTT 194


>gi|296086468|emb|CBI32057.3| unnamed protein product [Vitis vinifera]
          Length = 210

 Score =  157 bits (396), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 108/192 (56%), Positives = 142/192 (73%), Gaps = 19/192 (9%)

Query: 11  IVINL---SSAKLD-HSSPSP---TANLSFHHNRHTNHNKTLSELSICHAQLNYGSAGED 63
           ++INL   ++ KL+ + +PSP       S H    T+     + L+  HA+L+ GSAGE 
Sbjct: 10  LIINLNPTATTKLNLNLTPSPNFACPPQSLHLRHRTSPTGNPTTLT-SHARLS-GSAGEA 67

Query: 64  PDEIGEAFFDEGVDGGAIMDDI--SDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSI 121
           P+EI E+FF++        +D+  SD+ED+ +SS DLLI FLQ+ FKK SK AKKASRS+
Sbjct: 68  PEEIDESFFED--------EDLIESDEEDETESSADLLIEFLQSMFKKASKHAKKASRSV 119

Query: 122 LPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSS 181
           LPAAISP+LVSFAVDG+L+LASLSI+KALLEV C++GGTVFV+ILLLR IWA +SYFQ+S
Sbjct: 120 LPAAISPQLVSFAVDGVLILASLSIIKALLEVFCTLGGTVFVVILLLRVIWATVSYFQTS 179

Query: 182 GNSFNQGGSSFA 193
           GN F+QGG+SF 
Sbjct: 180 GNGFSQGGTSFV 191


>gi|357486397|ref|XP_003613486.1| hypothetical protein MTR_5g037230 [Medicago truncatula]
 gi|355514821|gb|AES96444.1| hypothetical protein MTR_5g037230 [Medicago truncatula]
 gi|388498656|gb|AFK37394.1| unknown [Medicago truncatula]
          Length = 192

 Score =  152 bits (384), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/191 (55%), Positives = 134/191 (70%), Gaps = 17/191 (8%)

Query: 12  VINLSSA---KLDHSSPSPTANLSFHHNR-----HTNHNKTLSELSICHAQLNYGSAGED 63
           +INL+S+    L H SP+    L  HH       + NH   L  ++ C+++LN  S GE 
Sbjct: 9   LINLTSSPRSNLHHFSPNNVHIL--HHQPFPSKLYYNHRSNLKTIN-CNSKLNNSSGGEA 65

Query: 64  PDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILP 123
            +       DEGV GG    +   DEDD +SSVDLLI+FLQ+ FKK++KRAKKASRS+LP
Sbjct: 66  YE------IDEGVFGGYDGIEEESDEDDAESSVDLLIKFLQSMFKKMAKRAKKASRSVLP 119

Query: 124 AAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGN 183
             ISP+LVSFAVDG LLLA+LS++KALLEV+C+IGGTVF  IL LR IWAA+S+FQS+GN
Sbjct: 120 TVISPQLVSFAVDGTLLLATLSVVKALLEVICNIGGTVFAAILALRVIWAAVSHFQSNGN 179

Query: 184 SFNQGGSSFAA 194
           SFNQG +SF A
Sbjct: 180 SFNQGNNSFGA 190


>gi|297841727|ref|XP_002888745.1| hypothetical protein ARALYDRAFT_894783 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334586|gb|EFH65004.1| hypothetical protein ARALYDRAFT_894783 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 191

 Score =  142 bits (357), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 68/90 (75%), Positives = 80/90 (88%)

Query: 96  VDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVC 155
           VDLLIRFL++ FKKVSKRAKKASR ILPAA+SP+LVSFAVDGILLL SLSI +A LEV+C
Sbjct: 96  VDLLIRFLRSMFKKVSKRAKKASRRILPAAMSPRLVSFAVDGILLLGSLSITRAFLEVIC 155

Query: 156 SIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
           ++GGTVF +ILL+R  WAA S+FQ+ GNSF
Sbjct: 156 NLGGTVFTVILLIRLFWAAASFFQTYGNSF 185


>gi|21536619|gb|AAM60951.1| unknown [Arabidopsis thaliana]
 gi|159025286|emb|CAL91513.1| SHW1 protein [Arabidopsis thaliana]
          Length = 192

 Score =  141 bits (355), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 68/90 (75%), Positives = 80/90 (88%)

Query: 96  VDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVC 155
           VDLLIRFL++ FKKVSKRAKKASR ILPAA+SP+LVSFAVDGILLL SLSI +A LEV+C
Sbjct: 97  VDLLIRFLRSMFKKVSKRAKKASRRILPAAMSPRLVSFAVDGILLLGSLSITRAFLEVIC 156

Query: 156 SIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
           ++GGTVF +ILL+R  WAA S+FQ+ GNSF
Sbjct: 157 NLGGTVFTVILLIRLFWAAASFFQTYGNSF 186


>gi|18409408|ref|NP_564981.1| short hypocotyl in white light1 protein [Arabidopsis thaliana]
 gi|332196879|gb|AEE35000.1| short hypocotyl in white light1 protein [Arabidopsis thaliana]
          Length = 192

 Score =  139 bits (351), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 67/90 (74%), Positives = 79/90 (87%)

Query: 96  VDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVC 155
           VDLLIRFL++ FKKVSKR KKASR ILPAA+SP+LVSFAVDGILLL SLSI +A LEV+C
Sbjct: 97  VDLLIRFLRSMFKKVSKRTKKASRRILPAAMSPRLVSFAVDGILLLGSLSITRAFLEVIC 156

Query: 156 SIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
           ++GGTVF +ILL+R  WAA S+FQ+ GNSF
Sbjct: 157 NLGGTVFTVILLIRLFWAAASFFQTYGNSF 186


>gi|116791365|gb|ABK25954.1| unknown [Picea sitchensis]
          Length = 214

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 79/105 (75%), Gaps = 4/105 (3%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL RF+   FKK+S++A+KA+R+ILP+AISP+LV F+V+G+++L  LSILKA L+VV
Sbjct: 106 SLDLLARFIHHMFKKISRKARKAARTILPSAISPQLVGFSVNGVIILTFLSILKAFLQVV 165

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSF----NQGGSSFAAT 195
           C++G  VFV ILLLR IW+A+ Y Q  G +       G SS+ A+
Sbjct: 166 CTLGSVVFVTILLLRLIWSAVGYMQKGGKTIEFHGENGYSSYNAS 210


>gi|255562194|ref|XP_002522105.1| conserved hypothetical protein [Ricinus communis]
 gi|223538704|gb|EEF40305.1| conserved hypothetical protein [Ricinus communis]
          Length = 200

 Score =  115 bits (289), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 54/87 (62%), Positives = 71/87 (81%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DL +RF+Q  FKKVSKRA+KA+RS+LP +IS KLV F+V+G+L+LA L +LKA LEVV
Sbjct: 92  SLDLFVRFIQNVFKKVSKRARKAARSVLPLSISTKLVGFSVNGVLILAFLWVLKAFLEVV 151

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSS 181
           C++G  VFV ILL+R IW+ I+Y Q S
Sbjct: 152 CTLGSVVFVSILLIRGIWSGITYLQES 178


>gi|224056286|ref|XP_002298793.1| predicted protein [Populus trichocarpa]
 gi|222846051|gb|EEE83598.1| predicted protein [Populus trichocarpa]
          Length = 141

 Score =  112 bits (281), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 54/89 (60%), Positives = 69/89 (77%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL+RF+   FKKVSKRA+KA RS+LP  IS KLV F+V+G+LLLA L +LKA LEVV
Sbjct: 33  SLDLLVRFISNVFKKVSKRARKAVRSVLPVPISTKLVEFSVNGVLLLAFLWVLKAFLEVV 92

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGN 183
           C++G  VFV ILL+R IW+ ++Y Q   N
Sbjct: 93  CTLGSMVFVSILLIRGIWSGVTYLQERRN 121


>gi|225429191|ref|XP_002276036.1| PREDICTED: uncharacterized protein LOC100251465 [Vitis vinifera]
 gi|297736403|emb|CBI25126.3| unnamed protein product [Vitis vinifera]
          Length = 203

 Score =  112 bits (281), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 51/89 (57%), Positives = 68/89 (76%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL+RF+Q  FKK+SKR +KA RS+LP  IS KLV F+VDG+L+L  L +LKA LEV+
Sbjct: 95  SLDLLVRFVQNVFKKISKRTRKAVRSVLPVPISTKLVGFSVDGVLMLTFLWVLKAFLEVI 154

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGN 183
           C++G  VFV IL++R IW  +SY Q + N
Sbjct: 155 CTLGSVVFVSILVIRGIWTGVSYVQENRN 183


>gi|297798548|ref|XP_002867158.1| hypothetical protein ARALYDRAFT_491300 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312994|gb|EFH43417.1| hypothetical protein ARALYDRAFT_491300 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 208

 Score =  109 bits (273), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 52/90 (57%), Positives = 71/90 (78%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL+RF++  F+K+SKRA+KA RSILP +IS KLV F+V+G+L+LA L ILKA LEV 
Sbjct: 100 SLDLLLRFVENVFRKISKRARKAVRSILPVSISTKLVGFSVNGVLILAFLWILKAFLEVA 159

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
           C++G  VF  ILL+R +WA ++Y Q S N+
Sbjct: 160 CTLGTIVFTSILLIRGLWAGVAYMQESRNN 189


>gi|30689813|ref|NP_567935.3| uncharacterized protein [Arabidopsis thaliana]
 gi|21554762|gb|AAM63683.1| unknown [Arabidopsis thaliana]
 gi|25054911|gb|AAN71935.1| unknown protein [Arabidopsis thaliana]
 gi|332660877|gb|AEE86277.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 203

 Score =  109 bits (272), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 52/90 (57%), Positives = 71/90 (78%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL+RF++  F+K+SKRA+KA RSILP +IS KLV F+V+G+L+LA L ILKA LEV 
Sbjct: 95  SLDLLLRFVENVFRKISKRARKAVRSILPVSISTKLVGFSVNGVLILAFLWILKAFLEVA 154

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
           C++G  VF  ILL+R +WA ++Y Q S N+
Sbjct: 155 CTLGTIVFTSILLIRGLWAGVAYMQESRNN 184


>gi|356497490|ref|XP_003517593.1| PREDICTED: uncharacterized protein LOC100817442 [Glycine max]
          Length = 186

 Score =  105 bits (263), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 51/84 (60%), Positives = 65/84 (77%)

Query: 100 IRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGG 159
           +RF Q  FKKVSKRA+KA RS+LP  IS  LV F+V+GILLLA L ILKA L+V+C++G 
Sbjct: 84  VRFFQNVFKKVSKRARKAVRSVLPLPISSHLVGFSVNGILLLAFLWILKAFLQVLCTLGS 143

Query: 160 TVFVLILLLRFIWAAISYFQSSGN 183
            VFV ILL+R IW+ +S+ Q SG+
Sbjct: 144 VVFVSILLIRGIWSGVSFLQESGH 167


>gi|449438309|ref|XP_004136931.1| PREDICTED: uncharacterized protein LOC101213497 [Cucumis sativus]
          Length = 200

 Score =  103 bits (256), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 68/181 (37%), Positives = 110/181 (60%), Gaps = 4/181 (2%)

Query: 1   MTSAATLSPFIVINLSSAKLDHSSPSPTANLSFHHNRHTNHNKTLSELSICHAQLNYGSA 60
           M+  A LSP  + ++  ++L  S+ SPT+ LS  H  H ++    +     +      + 
Sbjct: 1   MSLPAVLSPPSLFSIPQSQLPFST-SPTSLLS--HPIHISYTLLRATRRTSNFSQGVDNF 57

Query: 61  GEDPDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRS 120
            +D      + FD  + G    ++  D+ED+ D S+DLL+RF++  F+K S+RA+KA RS
Sbjct: 58  VDDRRNWNRSDFDL-IGGEEEEEEDEDEEDEEDRSLDLLVRFVENIFRKSSRRARKAVRS 116

Query: 121 ILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQS 180
           +LP +I  KLV+F+V+G+L+LA L +LKA LEV+C++G  VFV IL++R +W  I Y Q 
Sbjct: 117 VLPPSIPTKLVAFSVNGVLMLAFLWVLKAFLEVICTLGTAVFVSILIIRGVWIGILYLQD 176

Query: 181 S 181
           +
Sbjct: 177 T 177


>gi|449495707|ref|XP_004159921.1| PREDICTED: uncharacterized LOC101213497 [Cucumis sativus]
          Length = 202

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/87 (52%), Positives = 68/87 (78%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL+RF++  F+K S+RA+KA RS+LP +I  KLV+F+V+G+L+LA L +LKA LEV+
Sbjct: 93  SLDLLVRFVENIFRKSSRRARKAVRSVLPPSIPTKLVAFSVNGVLMLAFLWVLKAFLEVI 152

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSS 181
           C++G  VFV IL++R +W  I Y Q +
Sbjct: 153 CTLGTAVFVSILIIRGVWIGILYLQDT 179


>gi|116781450|gb|ABK22104.1| unknown [Picea sitchensis]
          Length = 190

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 68/85 (80%)

Query: 100 IRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGG 159
           ++F+Q   KK+S+RA+KA+RSILP+AISP+LV F+V+G+  L  L ++KA L+VVC++G 
Sbjct: 94  VQFVQTMLKKISRRARKAARSILPSAISPQLVGFSVNGLTTLTFLWMVKAFLQVVCTLGS 153

Query: 160 TVFVLILLLRFIWAAISYFQSSGNS 184
            VFV ILLLR +W+AI+Y + +G +
Sbjct: 154 IVFVAILLLRGVWSAIAYIKINGKT 178


>gi|147774848|emb|CAN69077.1| hypothetical protein VITISV_004762 [Vitis vinifera]
          Length = 214

 Score = 95.9 bits (237), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 85/177 (48%), Positives = 117/177 (66%), Gaps = 32/177 (18%)

Query: 11  IVINL---SSAKLD-HSSPSPTANL---SFHHNRHTNHNKTLSELSICHAQLNYGSAGED 63
           ++INL   ++ KL+ + +PSP       SFH +  T+     + L+  HA+L+ GSAGE 
Sbjct: 10  LIINLNPTATTKLNLNFTPSPNFTCPPQSFHLSHRTSPTGNPTTLT-SHARLS-GSAGEA 67

Query: 64  PDEIGEAFFDEGVDGGAIMDDI--SDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSI 121
           P+EI E+FF++        +D+  SD+ED+ +SS DLLI+FLQ+ FKK SK AKKASRS+
Sbjct: 68  PEEIDESFFED--------EDLIESDEEDETESSADLLIKFLQSMFKKASKHAKKASRSV 119

Query: 122 LPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYF 178
           LPAAISP+LVSFAVDG+L+LASLSI+KALLE              L+  IW+ + YF
Sbjct: 120 LPAAISPQLVSFAVDGVLILASLSIIKALLE-------------FLVNTIWSILVYF 163


>gi|302806786|ref|XP_002985124.1| hypothetical protein SELMODRAFT_424220 [Selaginella moellendorffii]
 gi|300146952|gb|EFJ13618.1| hypothetical protein SELMODRAFT_424220 [Selaginella moellendorffii]
          Length = 231

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/109 (42%), Positives = 69/109 (63%), Gaps = 7/109 (6%)

Query: 87  DDEDDMDSSVD-----LLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLL 141
           DD+ D D S+D     LL++ ++   +K+S RA KA+R +LP   +  L+SF V+G +LL
Sbjct: 92  DDDYDRDFSMDGDSMVLLVQLVRTILRKLSSRATKAARGMLPPIFTSDLISFFVNGAVLL 151

Query: 142 ASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSS--GNSFNQG 188
               + KALLEVVCS+G  +F+ +LL+R +W+ +SY QS     SF  G
Sbjct: 152 TVFWVAKALLEVVCSMGSMMFIGLLLVRSVWSILSYLQSQRYSGSFRNG 200


>gi|357113288|ref|XP_003558436.1| PREDICTED: uncharacterized protein LOC100840559 [Brachypodium
           distachyon]
          Length = 203

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 71/93 (76%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL RFLQ+ F+K S+RA++A+RS+LP ++  +LV F+V+G+L+L  L ILK LLEVV
Sbjct: 95  SLDLLARFLQSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWILKGLLEVV 154

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSFNQ 187
           C+ G  VF+ ILL+R IW+ ++Y + +  S+ Q
Sbjct: 155 CTFGSMVFISILLVRGIWSGVTYIRENRYSYIQ 187


>gi|242041657|ref|XP_002468223.1| hypothetical protein SORBIDRAFT_01g042060 [Sorghum bicolor]
 gi|241922077|gb|EER95221.1| hypothetical protein SORBIDRAFT_01g042060 [Sorghum bicolor]
          Length = 202

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 69/91 (75%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL RFL + F+K S+RA++A+RS+LP ++  +LV F+V+G+L+L SL ILK LLEVV
Sbjct: 94  SLDLLARFLHSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTSLWILKGLLEVV 153

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
           C+ G  VF  ILL+R IW+ ++Y + +  S+
Sbjct: 154 CTFGSMVFASILLVRGIWSGVAYIRENRYSY 184


>gi|108707002|gb|ABF94797.1| expressed protein [Oryza sativa Japonica Group]
          Length = 203

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 66/86 (76%)

Query: 100 IRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGG 159
           +RFL + F+KVS+RA++A+RS+LP ++  +LV F+V+G+L+L  L +LK LLEVVC+ G 
Sbjct: 100 VRFLHSVFRKVSRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWVLKGLLEVVCTFGS 159

Query: 160 TVFVLILLLRFIWAAISYFQSSGNSF 185
            VFV ILL+R IW+ ++Y + +  S+
Sbjct: 160 MVFVTILLVRGIWSGVTYIRENRYSY 185


>gi|414865641|tpg|DAA44198.1| TPA: hypothetical protein ZEAMMB73_391294 [Zea mays]
          Length = 201

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 68/91 (74%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL RFL + F+K S+RA++A+RS+LP ++  +LV F+V+G+L+L  L ILK LLEVV
Sbjct: 93  SLDLLARFLHSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWILKGLLEVV 152

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
           C+ G  VF  ILL+R IW+ ++Y + +  S+
Sbjct: 153 CTFGSMVFASILLVRGIWSGVTYIRENRYSY 183


>gi|224103787|ref|XP_002313193.1| predicted protein [Populus trichocarpa]
 gi|222849601|gb|EEE87148.1| predicted protein [Populus trichocarpa]
          Length = 143

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 61/98 (62%), Gaps = 15/98 (15%)

Query: 93  DSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLE 152
           D S+DLL++F+   FKKVSKRA KA RS+L   +S KLV F+V+G+LLLA L +LKA LE
Sbjct: 19  DRSLDLLLKFVSNVFKKVSKRATKAVRSVLAVPLSTKLVEFSVNGVLLLAFLWVLKAFLE 78

Query: 153 VV---------------CSIGGTVFVLILLLRFIWAAI 175
            +               C++G  V   ILL+R +W+ +
Sbjct: 79  AILEEGISLTLWHWMWFCTLGSVVLASILLIRGMWSEV 116


>gi|326500348|dbj|BAK06263.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529047|dbj|BAK00917.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 203

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/87 (48%), Positives = 65/87 (74%)

Query: 101 RFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGT 160
           RFL + F+K S+RA++A+RS+LP ++  +LV F+V+G+L+L  L ILK LLEVVC+ G  
Sbjct: 101 RFLHSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWILKGLLEVVCTFGSM 160

Query: 161 VFVLILLLRFIWAAISYFQSSGNSFNQ 187
           VF+ ILL+R IW+ ++Y + +  S+ Q
Sbjct: 161 VFISILLVRGIWSGVTYIRENRYSYIQ 187


>gi|3549680|emb|CAA20591.1| putative protein [Arabidopsis thaliana]
 gi|7270327|emb|CAB80095.1| putative protein [Arabidopsis thaliana]
          Length = 181

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/90 (42%), Positives = 54/90 (60%), Gaps = 22/90 (24%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
           S+DLL+RF++  F+K+SKRA+KA RSILP +IS KLV+                      
Sbjct: 95  SLDLLLRFVENVFRKISKRARKAVRSILPVSISTKLVA---------------------- 132

Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
           C++G  VF  ILL+R +WA ++Y Q S N+
Sbjct: 133 CTLGTIVFTSILLIRGLWAGVAYMQESRNN 162


>gi|222624515|gb|EEE58647.1| hypothetical protein OsJ_10028 [Oryza sativa Japonica Group]
          Length = 243

 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 67/127 (52%), Gaps = 40/127 (31%)

Query: 99  LIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLE------ 152
           L+RFL + F+KVS+RA++A+RS+LP ++  +LV F+V+G+L+L  L +LK LLE      
Sbjct: 99  LVRFLHSVFRKVSRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWVLKGLLESLETSH 158

Query: 153 ----------------------------------VVCSIGGTVFVLILLLRFIWAAISYF 178
                                             VVC+ G  VFV ILL+R IW+ ++Y 
Sbjct: 159 SCWGLAPPLILTVTYNGITLYVQRTNIVHASEFHVVCTFGSMVFVTILLVRGIWSGVTYI 218

Query: 179 QSSGNSF 185
           + +  S+
Sbjct: 219 RENRYSY 225


>gi|218192386|gb|EEC74813.1| hypothetical protein OsI_10633 [Oryza sativa Indica Group]
          Length = 243

 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 67/127 (52%), Gaps = 40/127 (31%)

Query: 99  LIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLE------ 152
           L+RFL + F+KVS+RA++A+RS+LP ++  +LV F+V+G+L+L  L +LK LLE      
Sbjct: 99  LVRFLHSVFRKVSRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWVLKGLLESLETSH 158

Query: 153 ----------------------------------VVCSIGGTVFVLILLLRFIWAAISYF 178
                                             VVC+ G  VFV ILL+R IW+ ++Y 
Sbjct: 159 GCWGLAPPLILTVTYNGITLYVQRTNIVHASEFHVVCTFGSMVFVTILLVRGIWSGVTYI 218

Query: 179 QSSGNSF 185
           + +  S+
Sbjct: 219 RENRYSY 225


>gi|218190898|gb|EEC73325.1| hypothetical protein OsI_07518 [Oryza sativa Indica Group]
          Length = 106

 Score = 38.9 bits (89), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 43/65 (66%), Gaps = 9/65 (13%)

Query: 87  DDEDD---------MDSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDG 137
           +DED+          D S+DLL+RFL + F+KVS+RA++A+RS+LP ++  + V F+V+ 
Sbjct: 37  EDEDNTGEEDDDGDEDRSLDLLVRFLHSVFRKVSRRARRAARSMLPLSVPAERVKFSVNA 96

Query: 138 ILLLA 142
             L +
Sbjct: 97  AFLFS 101


>gi|15292719|gb|AAK92728.1| unknown protein [Arabidopsis thaliana]
          Length = 51

 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 15/32 (46%), Positives = 22/32 (68%)

Query: 153 VVCSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
           V C++G  VF  ILL+R +WA ++Y Q S N+
Sbjct: 1   VACTLGTIVFTSILLIRGLWAGVAYMQESRNN 32


>gi|222622990|gb|EEE57122.1| hypothetical protein OsJ_07002 [Oryza sativa Japonica Group]
          Length = 107

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 37/48 (77%)

Query: 95  SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLA 142
           S+DLL+RFL + F+KVS+RA++A+RS+LP ++  + V F+V+   L +
Sbjct: 55  SLDLLVRFLHSVFRKVSRRARRAARSMLPLSVPAERVKFSVNAAFLFS 102


>gi|298705754|emb|CBJ49062.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2779

 Score = 35.8 bits (81), Expect = 8.1,   Method: Composition-based stats.
 Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 24/123 (19%)

Query: 58   GSAGEDPDEIGEAFFD-------EGVDGGAIMDDISDDEDDMDS----------SVDLLI 100
            G AGED D+IGE   +       EG +GG    D   DED M            S+D+L 
Sbjct: 2138 GRAGEDDDDIGEGIEEVMGPSSPEGGEGGPDAMDQDRDEDGMKESAVNRQTALLSLDVLA 2197

Query: 101  RFL----QATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCS 156
            R L    Q  F+ V     +    + P A+ P +     DGIL L + + L  L+  +C+
Sbjct: 2198 RVLGRRHQGAFEGVLGDVTEMVAGVGPGAL-PAVGQPGSDGILPLRASAFL--LVATLCA 2254

Query: 157  IGG 159
            + G
Sbjct: 2255 VLG 2257


>gi|389656419|ref|YP_006393297.1| replication protein [Human papillomavirus type 137]
 gi|343411532|gb|AEM24609.1| replication protein [Human papillomavirus type 137]
          Length = 610

 Score = 35.8 bits (81), Expect = 8.1,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 6/75 (8%)

Query: 62  EDPDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQAT------FKKVSKRAK 115
           E+  +  E  FDE  DG  I D I DD D++D  + L +   Q T         + ++  
Sbjct: 27  ENEHDTLEELFDESTDGSNISDLIDDDVDEVDQGISLALYNSQLTEDCEIAISNLKRKYV 86

Query: 116 KASRSILPAAISPKL 130
           K+ +    A +SP+L
Sbjct: 87  KSPQQAAIAELSPRL 101


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.132    0.372 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,828,701,438
Number of Sequences: 23463169
Number of extensions: 107106587
Number of successful extensions: 494718
Number of sequences better than 100.0: 78
Number of HSP's better than 100.0 without gapping: 37
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 494590
Number of HSP's gapped (non-prelim): 134
length of query: 195
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 60
effective length of database: 9,191,667,552
effective search space: 551500053120
effective search space used: 551500053120
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)