BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029325
(195 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356497816|ref|XP_003517753.1| PREDICTED: uncharacterized protein LOC100785194 [Glycine max]
Length = 195
Score = 178 bits (452), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 111/182 (60%), Positives = 135/182 (74%), Gaps = 8/182 (4%)
Query: 16 SSAKLDHSSPS--PTANLSFHHNRHTNHNKTLSELSICHAQLNYGSAGEDPDEIGEAFFD 73
S KL H S + P + FH + NH + ++ CH +LN S G + + EA FD
Sbjct: 19 SHTKLCHFSANNVPFSIQPFHSRFYHNH-RLKPIITNCHGKLN-SSGGGEAYAMDEAGFD 76
Query: 74 EGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSF 133
E DG +++ SDDEDD +SSVDLLIRFLQ+ FKKVSKRAKKASRS+LP ISP+LVSF
Sbjct: 77 E-YDG---VEEESDDEDDAESSVDLLIRFLQSMFKKVSKRAKKASRSVLPTVISPQLVSF 132
Query: 134 AVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGNSFNQGGSSFA 193
AVDG LLLASLS++KALLEV+C++GGTVF IL+LR IWAA+SYFQSSGNSFNQGG+SF
Sbjct: 133 AVDGTLLLASLSVVKALLEVICTLGGTVFAAILVLRVIWAAVSYFQSSGNSFNQGGNSFG 192
Query: 194 AT 195
A
Sbjct: 193 AV 194
>gi|356500831|ref|XP_003519234.1| PREDICTED: uncharacterized protein LOC100784688 [Glycine max]
Length = 193
Score = 178 bits (451), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 114/191 (59%), Positives = 140/191 (73%), Gaps = 16/191 (8%)
Query: 12 VINLS----SAKLDH--SSPSPTANLSFHHNRHTNHNKTLSEL-SICHAQLNYGSAGEDP 64
+INL+ S+K H S+ +P + FH + HN+ L + + CH +LN G
Sbjct: 11 LINLTRPSHSSKHRHFSSNNAPFSIQPFHSGFY--HNRRLKPIITNCHGKLNSSGGGYAM 68
Query: 65 DEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILPA 124
DE G FDE DG ++D SDDEDD +SSVDLLIRFLQ+ FKKVSKRAKKASRS+LP
Sbjct: 69 DEAG---FDE-YDG---VEDESDDEDDAESSVDLLIRFLQSMFKKVSKRAKKASRSVLPT 121
Query: 125 AISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
ISP+LVSFAVDG LLLASLS++KALLE++C++GGTVF IL+LR IWAA+SYFQSSGNS
Sbjct: 122 VISPQLVSFAVDGTLLLASLSVVKALLEIICTLGGTVFAAILVLRVIWAAVSYFQSSGNS 181
Query: 185 FNQGGSSFAAT 195
FNQGG+SF A
Sbjct: 182 FNQGGNSFGAV 192
>gi|255558302|ref|XP_002520178.1| conserved hypothetical protein [Ricinus communis]
gi|223540670|gb|EEF42233.1| conserved hypothetical protein [Ricinus communis]
Length = 254
Score = 176 bits (446), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 105/177 (59%), Positives = 132/177 (74%), Gaps = 6/177 (3%)
Query: 22 HSSPSPTANLSFHHNRHT----NHNKTLSELSICHAQLNYGSAGEDPDEIGEAFFDEGVD 77
H S SP NL FHH + + + ICHA+L+ SAG+ +++GE FFD+ D
Sbjct: 75 HHSLSPPQNLPFHHQSSHFHHLTQSYKIPNIIICHAKLD-ASAGDFHEDVGEMFFDDNND 133
Query: 78 GGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDG 137
+ +D+DD +SS+DLL +FLQ+ FKK+SKRAKKASRSILP+ ISP+LVSFAVDG
Sbjct: 134 L-VEDYESDEDDDDTESSIDLLFKFLQSMFKKISKRAKKASRSILPSVISPQLVSFAVDG 192
Query: 138 ILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGNSFNQGGSSFAA 194
ILLLA+LSI+KALLEVVC++G TVF +ILLLR +WAAISYFQSSGN+F QGGSSF
Sbjct: 193 ILLLAALSIVKALLEVVCTLGSTVFTVILLLRVVWAAISYFQSSGNTFYQGGSSFGG 249
>gi|224102473|ref|XP_002312691.1| predicted protein [Populus trichocarpa]
gi|222852511|gb|EEE90058.1| predicted protein [Populus trichocarpa]
Length = 203
Score = 176 bits (446), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 93/146 (63%), Positives = 118/146 (80%), Gaps = 5/146 (3%)
Query: 50 ICHAQLNYGSAGEDPDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKK 109
ICH +L+ + P+EI E FFD+ D + ++ DED+ +SS+DLLIRFLQ+ FKK
Sbjct: 59 ICHGKLD----SDVPEEINEVFFDDNYDM-MVDEEEISDEDETESSIDLLIRFLQSMFKK 113
Query: 110 VSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLR 169
+SKRAKKASRS+LPA ISP+LVSFAVDGILLLA+LSI+KALLEVVC++G TVFV+ILLLR
Sbjct: 114 LSKRAKKASRSMLPAVISPQLVSFAVDGILLLAALSIVKALLEVVCTLGSTVFVVILLLR 173
Query: 170 FIWAAISYFQSSGNSFNQGGSSFAAT 195
+W A+SYFQSS N+ ++GGSSF T
Sbjct: 174 VVWTAVSYFQSSENTSSKGGSSFGTT 199
>gi|225424833|ref|XP_002272743.1| PREDICTED: uncharacterized protein LOC100243345 [Vitis vinifera]
Length = 198
Score = 159 bits (403), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 109/194 (56%), Positives = 143/194 (73%), Gaps = 19/194 (9%)
Query: 11 IVINL---SSAKLD-HSSPSPT---ANLSFHHNRHTNHNKTLSELSICHAQLNYGSAGED 63
++INL ++ KL+ + +PSP S H T+ + L+ HA+L+ GSAGE
Sbjct: 11 LIINLNPTATTKLNLNLTPSPNFACPPQSLHLRHRTSPTGNPTTLT-SHARLS-GSAGEA 68
Query: 64 PDEIGEAFFDEGVDGGAIMDDI--SDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSI 121
P+EI E+FF++ +D+ SD+ED+ +SS DLLI FLQ+ FKK SK AKKASRS+
Sbjct: 69 PEEIDESFFED--------EDLIESDEEDETESSADLLIEFLQSMFKKASKHAKKASRSV 120
Query: 122 LPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSS 181
LPAAISP+LVSFAVDG+L+LASLSI+KALLEV C++GGTVFV+ILLLR IWA +SYFQ+S
Sbjct: 121 LPAAISPQLVSFAVDGVLILASLSIIKALLEVFCTLGGTVFVVILLLRVIWATVSYFQTS 180
Query: 182 GNSFNQGGSSFAAT 195
GN F+QGG+SF T
Sbjct: 181 GNGFSQGGTSFGTT 194
>gi|296086468|emb|CBI32057.3| unnamed protein product [Vitis vinifera]
Length = 210
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/192 (56%), Positives = 142/192 (73%), Gaps = 19/192 (9%)
Query: 11 IVINL---SSAKLD-HSSPSP---TANLSFHHNRHTNHNKTLSELSICHAQLNYGSAGED 63
++INL ++ KL+ + +PSP S H T+ + L+ HA+L+ GSAGE
Sbjct: 10 LIINLNPTATTKLNLNLTPSPNFACPPQSLHLRHRTSPTGNPTTLT-SHARLS-GSAGEA 67
Query: 64 PDEIGEAFFDEGVDGGAIMDDI--SDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSI 121
P+EI E+FF++ +D+ SD+ED+ +SS DLLI FLQ+ FKK SK AKKASRS+
Sbjct: 68 PEEIDESFFED--------EDLIESDEEDETESSADLLIEFLQSMFKKASKHAKKASRSV 119
Query: 122 LPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSS 181
LPAAISP+LVSFAVDG+L+LASLSI+KALLEV C++GGTVFV+ILLLR IWA +SYFQ+S
Sbjct: 120 LPAAISPQLVSFAVDGVLILASLSIIKALLEVFCTLGGTVFVVILLLRVIWATVSYFQTS 179
Query: 182 GNSFNQGGSSFA 193
GN F+QGG+SF
Sbjct: 180 GNGFSQGGTSFV 191
>gi|357486397|ref|XP_003613486.1| hypothetical protein MTR_5g037230 [Medicago truncatula]
gi|355514821|gb|AES96444.1| hypothetical protein MTR_5g037230 [Medicago truncatula]
gi|388498656|gb|AFK37394.1| unknown [Medicago truncatula]
Length = 192
Score = 152 bits (384), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/191 (55%), Positives = 134/191 (70%), Gaps = 17/191 (8%)
Query: 12 VINLSSA---KLDHSSPSPTANLSFHHNR-----HTNHNKTLSELSICHAQLNYGSAGED 63
+INL+S+ L H SP+ L HH + NH L ++ C+++LN S GE
Sbjct: 9 LINLTSSPRSNLHHFSPNNVHIL--HHQPFPSKLYYNHRSNLKTIN-CNSKLNNSSGGEA 65
Query: 64 PDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSILP 123
+ DEGV GG + DEDD +SSVDLLI+FLQ+ FKK++KRAKKASRS+LP
Sbjct: 66 YE------IDEGVFGGYDGIEEESDEDDAESSVDLLIKFLQSMFKKMAKRAKKASRSVLP 119
Query: 124 AAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSSGN 183
ISP+LVSFAVDG LLLA+LS++KALLEV+C+IGGTVF IL LR IWAA+S+FQS+GN
Sbjct: 120 TVISPQLVSFAVDGTLLLATLSVVKALLEVICNIGGTVFAAILALRVIWAAVSHFQSNGN 179
Query: 184 SFNQGGSSFAA 194
SFNQG +SF A
Sbjct: 180 SFNQGNNSFGA 190
>gi|297841727|ref|XP_002888745.1| hypothetical protein ARALYDRAFT_894783 [Arabidopsis lyrata subsp.
lyrata]
gi|297334586|gb|EFH65004.1| hypothetical protein ARALYDRAFT_894783 [Arabidopsis lyrata subsp.
lyrata]
Length = 191
Score = 142 bits (357), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 68/90 (75%), Positives = 80/90 (88%)
Query: 96 VDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVC 155
VDLLIRFL++ FKKVSKRAKKASR ILPAA+SP+LVSFAVDGILLL SLSI +A LEV+C
Sbjct: 96 VDLLIRFLRSMFKKVSKRAKKASRRILPAAMSPRLVSFAVDGILLLGSLSITRAFLEVIC 155
Query: 156 SIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
++GGTVF +ILL+R WAA S+FQ+ GNSF
Sbjct: 156 NLGGTVFTVILLIRLFWAAASFFQTYGNSF 185
>gi|21536619|gb|AAM60951.1| unknown [Arabidopsis thaliana]
gi|159025286|emb|CAL91513.1| SHW1 protein [Arabidopsis thaliana]
Length = 192
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/90 (75%), Positives = 80/90 (88%)
Query: 96 VDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVC 155
VDLLIRFL++ FKKVSKRAKKASR ILPAA+SP+LVSFAVDGILLL SLSI +A LEV+C
Sbjct: 97 VDLLIRFLRSMFKKVSKRAKKASRRILPAAMSPRLVSFAVDGILLLGSLSITRAFLEVIC 156
Query: 156 SIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
++GGTVF +ILL+R WAA S+FQ+ GNSF
Sbjct: 157 NLGGTVFTVILLIRLFWAAASFFQTYGNSF 186
>gi|18409408|ref|NP_564981.1| short hypocotyl in white light1 protein [Arabidopsis thaliana]
gi|332196879|gb|AEE35000.1| short hypocotyl in white light1 protein [Arabidopsis thaliana]
Length = 192
Score = 139 bits (351), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 67/90 (74%), Positives = 79/90 (87%)
Query: 96 VDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVC 155
VDLLIRFL++ FKKVSKR KKASR ILPAA+SP+LVSFAVDGILLL SLSI +A LEV+C
Sbjct: 97 VDLLIRFLRSMFKKVSKRTKKASRRILPAAMSPRLVSFAVDGILLLGSLSITRAFLEVIC 156
Query: 156 SIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
++GGTVF +ILL+R WAA S+FQ+ GNSF
Sbjct: 157 NLGGTVFTVILLIRLFWAAASFFQTYGNSF 186
>gi|116791365|gb|ABK25954.1| unknown [Picea sitchensis]
Length = 214
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 79/105 (75%), Gaps = 4/105 (3%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL RF+ FKK+S++A+KA+R+ILP+AISP+LV F+V+G+++L LSILKA L+VV
Sbjct: 106 SLDLLARFIHHMFKKISRKARKAARTILPSAISPQLVGFSVNGVIILTFLSILKAFLQVV 165
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSF----NQGGSSFAAT 195
C++G VFV ILLLR IW+A+ Y Q G + G SS+ A+
Sbjct: 166 CTLGSVVFVTILLLRLIWSAVGYMQKGGKTIEFHGENGYSSYNAS 210
>gi|255562194|ref|XP_002522105.1| conserved hypothetical protein [Ricinus communis]
gi|223538704|gb|EEF40305.1| conserved hypothetical protein [Ricinus communis]
Length = 200
Score = 115 bits (289), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 54/87 (62%), Positives = 71/87 (81%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DL +RF+Q FKKVSKRA+KA+RS+LP +IS KLV F+V+G+L+LA L +LKA LEVV
Sbjct: 92 SLDLFVRFIQNVFKKVSKRARKAARSVLPLSISTKLVGFSVNGVLILAFLWVLKAFLEVV 151
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSS 181
C++G VFV ILL+R IW+ I+Y Q S
Sbjct: 152 CTLGSVVFVSILLIRGIWSGITYLQES 178
>gi|224056286|ref|XP_002298793.1| predicted protein [Populus trichocarpa]
gi|222846051|gb|EEE83598.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 54/89 (60%), Positives = 69/89 (77%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL+RF+ FKKVSKRA+KA RS+LP IS KLV F+V+G+LLLA L +LKA LEVV
Sbjct: 33 SLDLLVRFISNVFKKVSKRARKAVRSVLPVPISTKLVEFSVNGVLLLAFLWVLKAFLEVV 92
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGN 183
C++G VFV ILL+R IW+ ++Y Q N
Sbjct: 93 CTLGSMVFVSILLIRGIWSGVTYLQERRN 121
>gi|225429191|ref|XP_002276036.1| PREDICTED: uncharacterized protein LOC100251465 [Vitis vinifera]
gi|297736403|emb|CBI25126.3| unnamed protein product [Vitis vinifera]
Length = 203
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 51/89 (57%), Positives = 68/89 (76%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL+RF+Q FKK+SKR +KA RS+LP IS KLV F+VDG+L+L L +LKA LEV+
Sbjct: 95 SLDLLVRFVQNVFKKISKRTRKAVRSVLPVPISTKLVGFSVDGVLMLTFLWVLKAFLEVI 154
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGN 183
C++G VFV IL++R IW +SY Q + N
Sbjct: 155 CTLGSVVFVSILVIRGIWTGVSYVQENRN 183
>gi|297798548|ref|XP_002867158.1| hypothetical protein ARALYDRAFT_491300 [Arabidopsis lyrata subsp.
lyrata]
gi|297312994|gb|EFH43417.1| hypothetical protein ARALYDRAFT_491300 [Arabidopsis lyrata subsp.
lyrata]
Length = 208
Score = 109 bits (273), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 52/90 (57%), Positives = 71/90 (78%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL+RF++ F+K+SKRA+KA RSILP +IS KLV F+V+G+L+LA L ILKA LEV
Sbjct: 100 SLDLLLRFVENVFRKISKRARKAVRSILPVSISTKLVGFSVNGVLILAFLWILKAFLEVA 159
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
C++G VF ILL+R +WA ++Y Q S N+
Sbjct: 160 CTLGTIVFTSILLIRGLWAGVAYMQESRNN 189
>gi|30689813|ref|NP_567935.3| uncharacterized protein [Arabidopsis thaliana]
gi|21554762|gb|AAM63683.1| unknown [Arabidopsis thaliana]
gi|25054911|gb|AAN71935.1| unknown protein [Arabidopsis thaliana]
gi|332660877|gb|AEE86277.1| uncharacterized protein [Arabidopsis thaliana]
Length = 203
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 52/90 (57%), Positives = 71/90 (78%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL+RF++ F+K+SKRA+KA RSILP +IS KLV F+V+G+L+LA L ILKA LEV
Sbjct: 95 SLDLLLRFVENVFRKISKRARKAVRSILPVSISTKLVGFSVNGVLILAFLWILKAFLEVA 154
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
C++G VF ILL+R +WA ++Y Q S N+
Sbjct: 155 CTLGTIVFTSILLIRGLWAGVAYMQESRNN 184
>gi|356497490|ref|XP_003517593.1| PREDICTED: uncharacterized protein LOC100817442 [Glycine max]
Length = 186
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 51/84 (60%), Positives = 65/84 (77%)
Query: 100 IRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGG 159
+RF Q FKKVSKRA+KA RS+LP IS LV F+V+GILLLA L ILKA L+V+C++G
Sbjct: 84 VRFFQNVFKKVSKRARKAVRSVLPLPISSHLVGFSVNGILLLAFLWILKAFLQVLCTLGS 143
Query: 160 TVFVLILLLRFIWAAISYFQSSGN 183
VFV ILL+R IW+ +S+ Q SG+
Sbjct: 144 VVFVSILLIRGIWSGVSFLQESGH 167
>gi|449438309|ref|XP_004136931.1| PREDICTED: uncharacterized protein LOC101213497 [Cucumis sativus]
Length = 200
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/181 (37%), Positives = 110/181 (60%), Gaps = 4/181 (2%)
Query: 1 MTSAATLSPFIVINLSSAKLDHSSPSPTANLSFHHNRHTNHNKTLSELSICHAQLNYGSA 60
M+ A LSP + ++ ++L S+ SPT+ LS H H ++ + + +
Sbjct: 1 MSLPAVLSPPSLFSIPQSQLPFST-SPTSLLS--HPIHISYTLLRATRRTSNFSQGVDNF 57
Query: 61 GEDPDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRS 120
+D + FD + G ++ D+ED+ D S+DLL+RF++ F+K S+RA+KA RS
Sbjct: 58 VDDRRNWNRSDFDL-IGGEEEEEEDEDEEDEEDRSLDLLVRFVENIFRKSSRRARKAVRS 116
Query: 121 ILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQS 180
+LP +I KLV+F+V+G+L+LA L +LKA LEV+C++G VFV IL++R +W I Y Q
Sbjct: 117 VLPPSIPTKLVAFSVNGVLMLAFLWVLKAFLEVICTLGTAVFVSILIIRGVWIGILYLQD 176
Query: 181 S 181
+
Sbjct: 177 T 177
>gi|449495707|ref|XP_004159921.1| PREDICTED: uncharacterized LOC101213497 [Cucumis sativus]
Length = 202
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/87 (52%), Positives = 68/87 (78%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL+RF++ F+K S+RA+KA RS+LP +I KLV+F+V+G+L+LA L +LKA LEV+
Sbjct: 93 SLDLLVRFVENIFRKSSRRARKAVRSVLPPSIPTKLVAFSVNGVLMLAFLWVLKAFLEVI 152
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSS 181
C++G VFV IL++R +W I Y Q +
Sbjct: 153 CTLGTAVFVSILIIRGVWIGILYLQDT 179
>gi|116781450|gb|ABK22104.1| unknown [Picea sitchensis]
Length = 190
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 68/85 (80%)
Query: 100 IRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGG 159
++F+Q KK+S+RA+KA+RSILP+AISP+LV F+V+G+ L L ++KA L+VVC++G
Sbjct: 94 VQFVQTMLKKISRRARKAARSILPSAISPQLVGFSVNGLTTLTFLWMVKAFLQVVCTLGS 153
Query: 160 TVFVLILLLRFIWAAISYFQSSGNS 184
VFV ILLLR +W+AI+Y + +G +
Sbjct: 154 IVFVAILLLRGVWSAIAYIKINGKT 178
>gi|147774848|emb|CAN69077.1| hypothetical protein VITISV_004762 [Vitis vinifera]
Length = 214
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 85/177 (48%), Positives = 117/177 (66%), Gaps = 32/177 (18%)
Query: 11 IVINL---SSAKLD-HSSPSPTANL---SFHHNRHTNHNKTLSELSICHAQLNYGSAGED 63
++INL ++ KL+ + +PSP SFH + T+ + L+ HA+L+ GSAGE
Sbjct: 10 LIINLNPTATTKLNLNFTPSPNFTCPPQSFHLSHRTSPTGNPTTLT-SHARLS-GSAGEA 67
Query: 64 PDEIGEAFFDEGVDGGAIMDDI--SDDEDDMDSSVDLLIRFLQATFKKVSKRAKKASRSI 121
P+EI E+FF++ +D+ SD+ED+ +SS DLLI+FLQ+ FKK SK AKKASRS+
Sbjct: 68 PEEIDESFFED--------EDLIESDEEDETESSADLLIKFLQSMFKKASKHAKKASRSV 119
Query: 122 LPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYF 178
LPAAISP+LVSFAVDG+L+LASLSI+KALLE L+ IW+ + YF
Sbjct: 120 LPAAISPQLVSFAVDGVLILASLSIIKALLE-------------FLVNTIWSILVYF 163
>gi|302806786|ref|XP_002985124.1| hypothetical protein SELMODRAFT_424220 [Selaginella moellendorffii]
gi|300146952|gb|EFJ13618.1| hypothetical protein SELMODRAFT_424220 [Selaginella moellendorffii]
Length = 231
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 69/109 (63%), Gaps = 7/109 (6%)
Query: 87 DDEDDMDSSVD-----LLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLL 141
DD+ D D S+D LL++ ++ +K+S RA KA+R +LP + L+SF V+G +LL
Sbjct: 92 DDDYDRDFSMDGDSMVLLVQLVRTILRKLSSRATKAARGMLPPIFTSDLISFFVNGAVLL 151
Query: 142 ASLSILKALLEVVCSIGGTVFVLILLLRFIWAAISYFQSS--GNSFNQG 188
+ KALLEVVCS+G +F+ +LL+R +W+ +SY QS SF G
Sbjct: 152 TVFWVAKALLEVVCSMGSMMFIGLLLVRSVWSILSYLQSQRYSGSFRNG 200
>gi|357113288|ref|XP_003558436.1| PREDICTED: uncharacterized protein LOC100840559 [Brachypodium
distachyon]
Length = 203
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 71/93 (76%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL RFLQ+ F+K S+RA++A+RS+LP ++ +LV F+V+G+L+L L ILK LLEVV
Sbjct: 95 SLDLLARFLQSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWILKGLLEVV 154
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSFNQ 187
C+ G VF+ ILL+R IW+ ++Y + + S+ Q
Sbjct: 155 CTFGSMVFISILLVRGIWSGVTYIRENRYSYIQ 187
>gi|242041657|ref|XP_002468223.1| hypothetical protein SORBIDRAFT_01g042060 [Sorghum bicolor]
gi|241922077|gb|EER95221.1| hypothetical protein SORBIDRAFT_01g042060 [Sorghum bicolor]
Length = 202
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 69/91 (75%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL RFL + F+K S+RA++A+RS+LP ++ +LV F+V+G+L+L SL ILK LLEVV
Sbjct: 94 SLDLLARFLHSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTSLWILKGLLEVV 153
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
C+ G VF ILL+R IW+ ++Y + + S+
Sbjct: 154 CTFGSMVFASILLVRGIWSGVAYIRENRYSY 184
>gi|108707002|gb|ABF94797.1| expressed protein [Oryza sativa Japonica Group]
Length = 203
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 66/86 (76%)
Query: 100 IRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGG 159
+RFL + F+KVS+RA++A+RS+LP ++ +LV F+V+G+L+L L +LK LLEVVC+ G
Sbjct: 100 VRFLHSVFRKVSRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWVLKGLLEVVCTFGS 159
Query: 160 TVFVLILLLRFIWAAISYFQSSGNSF 185
VFV ILL+R IW+ ++Y + + S+
Sbjct: 160 MVFVTILLVRGIWSGVTYIRENRYSY 185
>gi|414865641|tpg|DAA44198.1| TPA: hypothetical protein ZEAMMB73_391294 [Zea mays]
Length = 201
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 68/91 (74%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL RFL + F+K S+RA++A+RS+LP ++ +LV F+V+G+L+L L ILK LLEVV
Sbjct: 93 SLDLLARFLHSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWILKGLLEVV 152
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNSF 185
C+ G VF ILL+R IW+ ++Y + + S+
Sbjct: 153 CTFGSMVFASILLVRGIWSGVTYIRENRYSY 183
>gi|224103787|ref|XP_002313193.1| predicted protein [Populus trichocarpa]
gi|222849601|gb|EEE87148.1| predicted protein [Populus trichocarpa]
Length = 143
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 61/98 (62%), Gaps = 15/98 (15%)
Query: 93 DSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLE 152
D S+DLL++F+ FKKVSKRA KA RS+L +S KLV F+V+G+LLLA L +LKA LE
Sbjct: 19 DRSLDLLLKFVSNVFKKVSKRATKAVRSVLAVPLSTKLVEFSVNGVLLLAFLWVLKAFLE 78
Query: 153 VV---------------CSIGGTVFVLILLLRFIWAAI 175
+ C++G V ILL+R +W+ +
Sbjct: 79 AILEEGISLTLWHWMWFCTLGSVVLASILLIRGMWSEV 116
>gi|326500348|dbj|BAK06263.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529047|dbj|BAK00917.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 203
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 65/87 (74%)
Query: 101 RFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCSIGGT 160
RFL + F+K S+RA++A+RS+LP ++ +LV F+V+G+L+L L ILK LLEVVC+ G
Sbjct: 101 RFLHSVFRKASRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWILKGLLEVVCTFGSM 160
Query: 161 VFVLILLLRFIWAAISYFQSSGNSFNQ 187
VF+ ILL+R IW+ ++Y + + S+ Q
Sbjct: 161 VFISILLVRGIWSGVTYIRENRYSYIQ 187
>gi|3549680|emb|CAA20591.1| putative protein [Arabidopsis thaliana]
gi|7270327|emb|CAB80095.1| putative protein [Arabidopsis thaliana]
Length = 181
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 54/90 (60%), Gaps = 22/90 (24%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVV 154
S+DLL+RF++ F+K+SKRA+KA RSILP +IS KLV+
Sbjct: 95 SLDLLLRFVENVFRKISKRARKAVRSILPVSISTKLVA---------------------- 132
Query: 155 CSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
C++G VF ILL+R +WA ++Y Q S N+
Sbjct: 133 CTLGTIVFTSILLIRGLWAGVAYMQESRNN 162
>gi|222624515|gb|EEE58647.1| hypothetical protein OsJ_10028 [Oryza sativa Japonica Group]
Length = 243
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 67/127 (52%), Gaps = 40/127 (31%)
Query: 99 LIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLE------ 152
L+RFL + F+KVS+RA++A+RS+LP ++ +LV F+V+G+L+L L +LK LLE
Sbjct: 99 LVRFLHSVFRKVSRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWVLKGLLESLETSH 158
Query: 153 ----------------------------------VVCSIGGTVFVLILLLRFIWAAISYF 178
VVC+ G VFV ILL+R IW+ ++Y
Sbjct: 159 SCWGLAPPLILTVTYNGITLYVQRTNIVHASEFHVVCTFGSMVFVTILLVRGIWSGVTYI 218
Query: 179 QSSGNSF 185
+ + S+
Sbjct: 219 RENRYSY 225
>gi|218192386|gb|EEC74813.1| hypothetical protein OsI_10633 [Oryza sativa Indica Group]
Length = 243
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 67/127 (52%), Gaps = 40/127 (31%)
Query: 99 LIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLE------ 152
L+RFL + F+KVS+RA++A+RS+LP ++ +LV F+V+G+L+L L +LK LLE
Sbjct: 99 LVRFLHSVFRKVSRRARRAARSVLPPSVPAELVKFSVNGVLVLTFLWVLKGLLESLETSH 158
Query: 153 ----------------------------------VVCSIGGTVFVLILLLRFIWAAISYF 178
VVC+ G VFV ILL+R IW+ ++Y
Sbjct: 159 GCWGLAPPLILTVTYNGITLYVQRTNIVHASEFHVVCTFGSMVFVTILLVRGIWSGVTYI 218
Query: 179 QSSGNSF 185
+ + S+
Sbjct: 219 RENRYSY 225
>gi|218190898|gb|EEC73325.1| hypothetical protein OsI_07518 [Oryza sativa Indica Group]
Length = 106
Score = 38.9 bits (89), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 43/65 (66%), Gaps = 9/65 (13%)
Query: 87 DDEDD---------MDSSVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDG 137
+DED+ D S+DLL+RFL + F+KVS+RA++A+RS+LP ++ + V F+V+
Sbjct: 37 EDEDNTGEEDDDGDEDRSLDLLVRFLHSVFRKVSRRARRAARSMLPLSVPAERVKFSVNA 96
Query: 138 ILLLA 142
L +
Sbjct: 97 AFLFS 101
>gi|15292719|gb|AAK92728.1| unknown protein [Arabidopsis thaliana]
Length = 51
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 15/32 (46%), Positives = 22/32 (68%)
Query: 153 VVCSIGGTVFVLILLLRFIWAAISYFQSSGNS 184
V C++G VF ILL+R +WA ++Y Q S N+
Sbjct: 1 VACTLGTIVFTSILLIRGLWAGVAYMQESRNN 32
>gi|222622990|gb|EEE57122.1| hypothetical protein OsJ_07002 [Oryza sativa Japonica Group]
Length = 107
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 37/48 (77%)
Query: 95 SVDLLIRFLQATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLA 142
S+DLL+RFL + F+KVS+RA++A+RS+LP ++ + V F+V+ L +
Sbjct: 55 SLDLLVRFLHSVFRKVSRRARRAARSMLPLSVPAERVKFSVNAAFLFS 102
>gi|298705754|emb|CBJ49062.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2779
Score = 35.8 bits (81), Expect = 8.1, Method: Composition-based stats.
Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 24/123 (19%)
Query: 58 GSAGEDPDEIGEAFFD-------EGVDGGAIMDDISDDEDDMDS----------SVDLLI 100
G AGED D+IGE + EG +GG D DED M S+D+L
Sbjct: 2138 GRAGEDDDDIGEGIEEVMGPSSPEGGEGGPDAMDQDRDEDGMKESAVNRQTALLSLDVLA 2197
Query: 101 RFL----QATFKKVSKRAKKASRSILPAAISPKLVSFAVDGILLLASLSILKALLEVVCS 156
R L Q F+ V + + P A+ P + DGIL L + + L L+ +C+
Sbjct: 2198 RVLGRRHQGAFEGVLGDVTEMVAGVGPGAL-PAVGQPGSDGILPLRASAFL--LVATLCA 2254
Query: 157 IGG 159
+ G
Sbjct: 2255 VLG 2257
>gi|389656419|ref|YP_006393297.1| replication protein [Human papillomavirus type 137]
gi|343411532|gb|AEM24609.1| replication protein [Human papillomavirus type 137]
Length = 610
Score = 35.8 bits (81), Expect = 8.1, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 6/75 (8%)
Query: 62 EDPDEIGEAFFDEGVDGGAIMDDISDDEDDMDSSVDLLIRFLQAT------FKKVSKRAK 115
E+ + E FDE DG I D I DD D++D + L + Q T + ++
Sbjct: 27 ENEHDTLEELFDESTDGSNISDLIDDDVDEVDQGISLALYNSQLTEDCEIAISNLKRKYV 86
Query: 116 KASRSILPAAISPKL 130
K+ + A +SP+L
Sbjct: 87 KSPQQAAIAELSPRL 101
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.132 0.372
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,828,701,438
Number of Sequences: 23463169
Number of extensions: 107106587
Number of successful extensions: 494718
Number of sequences better than 100.0: 78
Number of HSP's better than 100.0 without gapping: 37
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 494590
Number of HSP's gapped (non-prelim): 134
length of query: 195
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 60
effective length of database: 9,191,667,552
effective search space: 551500053120
effective search space used: 551500053120
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)