BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 030213
(181 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224137270|ref|XP_002327084.1| predicted protein [Populus trichocarpa]
gi|222835399|gb|EEE73834.1| predicted protein [Populus trichocarpa]
Length = 322
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 113/182 (62%), Gaps = 7/182 (3%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
ME+QSVC G+ PK + S LS++ +S AS G R I N K P K+
Sbjct: 1 MEMQSVCTGITAPKLRLSTNRLLSNEIIS--ASHVGLRCPRILLKNPSKLCVK--PGFKN 56
Query: 63 KIEGYSAWSSKRGAILCASNSGSDTK---LGFPSGENSSLPVAIFNGPEPFRGKSGSVSF 119
+G S S RG ++C S+S + L ++S +P + NG EPF GKSGSVSF
Sbjct: 57 IFKGISWSSLNRGVVICGSSSSNSNADADLKISRRKSSGVPASSSNGVEPFNGKSGSVSF 116
Query: 120 CGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMG 179
GLTHQ VEEG+L+SAPF E+KGSF+W+L PV L+ SLI+PQ FLG+ IE F+KD +L+
Sbjct: 117 HGLTHQSVEEGRLVSAPFNEEKGSFVWLLGPVALVLSLIVPQFFLGSAIEAFLKDEVLVE 176
Query: 180 IS 181
I+
Sbjct: 177 IA 178
>gi|225458432|ref|XP_002283812.1| PREDICTED: uncharacterized protein LOC100245429 [Vitis vinifera]
gi|302142414|emb|CBI19617.3| unnamed protein product [Vitis vinifera]
Length = 355
Score = 137 bits (346), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/178 (48%), Positives = 107/178 (60%), Gaps = 7/178 (3%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
ME+QS+ L P + LSH+ VS ++ + Q RF K + P LK
Sbjct: 1 MELQSISCWLPSPDMRLCIHRPLSHRIVSTGHVRW--KDQRFRFEKVLK--SSLMPGLKK 56
Query: 63 KIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCGL 122
+ S S RGA LC+ NS SD KL + E +S+ F+G EPFRGKSGSVSF GL
Sbjct: 57 EC---SLSSFGRGATLCSLNSNSDAKLKYSGREITSVSGNGFDGVEPFRGKSGSVSFHGL 113
Query: 123 THQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
THQ VEE KL S+PF+E GSFLW+LAPV LISSL+LPQ FL + IE +KD +L I
Sbjct: 114 THQRVEESKLASSPFKEGTGSFLWVLAPVALISSLVLPQFFLDSAIEAILKDEVLAEI 171
>gi|449476766|ref|XP_004154828.1| PREDICTED: uncharacterized LOC101207359 [Cucumis sativus]
Length = 345
Score = 129 bits (323), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 63/112 (56%), Positives = 79/112 (70%)
Query: 67 YSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQL 126
++ KR A +C ++S +L EN +L + NG EPF GK GSVSF GLTHQL
Sbjct: 48 WAVLKRKRWAFMCVADSNKSPQLELSGEENHALYASRLNGVEPFHGKCGSVSFHGLTHQL 107
Query: 127 VEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLM 178
VEE KLMSAPF+E+KGS LW+LAPV ISSLILPQ+FLG +IE F K+ +L+
Sbjct: 108 VEESKLMSAPFREEKGSILWVLAPVAFISSLILPQVFLGGLIEAFFKNRILV 159
>gi|449460131|ref|XP_004147799.1| PREDICTED: uncharacterized protein LOC101207359 [Cucumis sativus]
Length = 345
Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/112 (55%), Positives = 79/112 (70%)
Query: 67 YSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQL 126
++ KR A +C ++S +L EN ++ + NG EPF GK GSVSF GLTHQL
Sbjct: 48 WAVLKRKRWAFMCVADSNKSPQLELSGEENHAMYASRLNGVEPFHGKCGSVSFHGLTHQL 107
Query: 127 VEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLM 178
VEE KLMSAPF+E+KGS LW+LAPV ISSLILPQ+FLG +IE F K+ +L+
Sbjct: 108 VEESKLMSAPFREEKGSILWVLAPVAFISSLILPQVFLGGLIEAFFKNRILV 159
>gi|356510304|ref|XP_003523879.1| PREDICTED: uncharacterized protein LOC100805065 [Glycine max]
Length = 356
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/181 (44%), Positives = 106/181 (58%), Gaps = 12/181 (6%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPS---ASQFGSRSQSIRFGNAYKFLGNTFPN 59
ME+Q++C L SSN LS SP+ + F ++ Q FGNA K L
Sbjct: 1 MELQTLCCALPSQPLHSSNHRILS----SPAFGIRAPFTNQRQRFHFGNASKCLKWVI-- 54
Query: 60 LKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSF 119
K S + ++ +S+S + L ++SS+PV FNG EPF GKSGSVSF
Sbjct: 55 ---KEPSLLGLSKFQRLLVHSSDSNLNGGLEVHPKQSSSVPVTTFNGAEPFHGKSGSVSF 111
Query: 120 CGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMG 179
CGLTHQ VEEGKL S+PF+ GSFLW+LAP I+SLILPQ F+ NV+E F+ + +L+
Sbjct: 112 CGLTHQSVEEGKLESSPFEGKGGSFLWVLAPAAFIASLILPQFFVDNVVEAFLNNVILID 171
Query: 180 I 180
I
Sbjct: 172 I 172
>gi|255647675|gb|ACU24299.1| unknown [Glycine max]
Length = 356
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 10/180 (5%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
ME+Q++C L S F +H+ +S A FG+ + + ++F GN LK
Sbjct: 1 MELQTLCCAL-----PSQPLHFSNHRILSSPA--FGTHAPLTNQIHRFRF-GNPSKCLKW 52
Query: 63 KIEGYS--AWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFC 120
I+G S S + ++ AS+S + L ++SS+PV FNG EPF GKSGSVSFC
Sbjct: 53 VIKGTSLLGLSESKKLLVHASDSNLNGGLEVHPKQSSSVPVTTFNGVEPFHGKSGSVSFC 112
Query: 121 GLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
GLTHQ VEEGKL S+PF GSFLW+LAP I+SL LPQ F+ NV+E F+ + +L+ I
Sbjct: 113 GLTHQSVEEGKLESSPFGVKGGSFLWVLAPAAFIASLTLPQFFVDNVVEAFLNNVMLIDI 172
>gi|356519084|ref|XP_003528204.1| PREDICTED: uncharacterized protein LOC100780682 [Glycine max]
Length = 356
Score = 122 bits (306), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/180 (43%), Positives = 107/180 (59%), Gaps = 10/180 (5%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
ME+Q++C L S F +H+ +S A FG+ + + ++F GN LK
Sbjct: 1 MELQTLCCAL-----PSQPLHFSNHRILSSPA--FGTHAPLTNQIHRFRF-GNPSKCLKW 52
Query: 63 KIEGYS--AWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFC 120
I+ S S + ++ AS+S + L ++SS+PV FNG EPF GKSGSVSFC
Sbjct: 53 VIKETSLLGLSESKKLLVHASDSNLNGGLEVHPKQSSSVPVTTFNGVEPFHGKSGSVSFC 112
Query: 121 GLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
GLTHQ VEEGKL S+PF GSFLW+LAP I+SL LPQ F+ NV+E F+ + +L+ I
Sbjct: 113 GLTHQSVEEGKLESSPFGVKGGSFLWVLAPAAFIASLTLPQFFVDNVVEAFLNNVMLIDI 172
>gi|255538514|ref|XP_002510322.1| conserved hypothetical protein [Ricinus communis]
gi|223551023|gb|EEF52509.1| conserved hypothetical protein [Ricinus communis]
Length = 339
Score = 122 bits (305), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 81/180 (45%), Positives = 104/180 (57%), Gaps = 27/180 (15%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
ME+QSVC GL PK + L+H+ S ++ S++ G A K
Sbjct: 1 MELQSVCHGLTAPKLR------LTHRL-----SSHQHQTVSLKIGIASK----------- 38
Query: 63 KIEGYSAWSSKRGAILCASNSGSDTK-LGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCG 121
Y + S+KR I +S++ S L E + +P A FN EPFRGKSGS+SF G
Sbjct: 39 ---SYLSSSNKRVTIRASSSNSSRNTNLELSDREGTGVPDASFNELEPFRGKSGSISFYG 95
Query: 122 LTHQLVEEGKLMSAPFQ-EDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
LTHQ VE GKL+SAPF ED GSFLWIL P LI+SL +PQ F+ NVIE F+KD +L+ I
Sbjct: 96 LTHQSVEAGKLVSAPFNGEDNGSFLWILGPAALIASLFIPQFFISNVIEAFLKDEILVEI 155
>gi|8778686|gb|AAF79694.1|AC020889_2 T1N15.7 [Arabidopsis thaliana]
Length = 343
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/126 (51%), Positives = 83/126 (65%), Gaps = 11/126 (8%)
Query: 61 KHKIEGYSA--WSS---KRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSG 115
+HK++ W+ R + C S+S SD++ + S+ + E FRGKSG
Sbjct: 38 RHKLDSRQTLLWNKPQLSRVRVAC-SSSQSDSRPEKKQSDKSN-----YARAELFRGKSG 91
Query: 116 SVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDN 175
SVSF GLTHQLVEE KL+SAPFQE+KGSFLW+LAPVVLISSLILPQ FL +IE K++
Sbjct: 92 SVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKND 151
Query: 176 LLMGIS 181
+ G S
Sbjct: 152 TVAGRS 157
>gi|15221197|ref|NP_175279.1| uncharacterized protein [Arabidopsis thaliana]
gi|56381889|gb|AAV85663.1| At1g48460 [Arabidopsis thaliana]
gi|57222220|gb|AAW39017.1| At1g48460 [Arabidopsis thaliana]
gi|332194175|gb|AEE32296.1| uncharacterized protein [Arabidopsis thaliana]
Length = 340
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/125 (51%), Positives = 82/125 (65%), Gaps = 11/125 (8%)
Query: 61 KHKIEGYSA--WSS---KRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSG 115
+HK++ W+ R + C S+S SD++ + S+ + E FRGKSG
Sbjct: 38 RHKLDSRQTLLWNKPQLSRVRVAC-SSSQSDSRPEKKQSDKSN-----YARAELFRGKSG 91
Query: 116 SVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDN 175
SVSF GLTHQLVEE KL+SAPFQE+KGSFLW+LAPVVLISSLILPQ FL +IE K++
Sbjct: 92 SVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKND 151
Query: 176 LLMGI 180
+ I
Sbjct: 152 TVAEI 156
>gi|297847108|ref|XP_002891435.1| hypothetical protein ARALYDRAFT_473986 [Arabidopsis lyrata subsp.
lyrata]
gi|297337277|gb|EFH67694.1| hypothetical protein ARALYDRAFT_473986 [Arabidopsis lyrata subsp.
lyrata]
Length = 331
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 97/183 (53%), Gaps = 41/183 (22%)
Query: 3 MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRS-----QSIRFGNAYKFLGNTF 57
ME +++C G P+ S+ LS +S F +R Q++R+
Sbjct: 1 MESKAICLGFLPPRLGFSSPHLLSRSPFLQGSSTFATRQKLDSRQTLRWNK--------- 51
Query: 58 PNLKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSV 117
P L RG + C+S SG+++ + E FRGKSGSV
Sbjct: 52 PQLS------------RGHVACSSKQ---------SGKSN------YARAELFRGKSGSV 84
Query: 118 SFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLL 177
SF GLTHQLVEE KL+SAPFQE+KGSFLW+LAPVVLISSLILPQ FL IE K++ +
Sbjct: 85 SFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGAIEASFKNDTV 144
Query: 178 MGI 180
I
Sbjct: 145 AEI 147
>gi|148909474|gb|ABR17835.1| unknown [Picea sitchensis]
Length = 284
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%)
Query: 110 FRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIE 169
F GK G V+F G HQ ++EG+ S +E S +W+L P+ L++S++LP +L E
Sbjct: 132 FDGKPGFVAFGGGNHQHLKEGEFSSTSSEEGNRSLIWLLGPIALVASVVLPPFYLRKSFE 191
Query: 170 DFIKDNL 176
D+L
Sbjct: 192 AVFGDSL 198
>gi|224119486|ref|XP_002318085.1| predicted protein [Populus trichocarpa]
gi|222858758|gb|EEE96305.1| predicted protein [Populus trichocarpa]
Length = 329
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 5/136 (3%)
Query: 42 QSIRFGNAYKFLGNTFPNLKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPV 101
S F + L FP +H I S S+ I ++ D + +S +
Sbjct: 12 HSCHFIDDLPKLKGKFPRRQHIIFAASQDQSQYSEIKPDASEEEDDHPTAEALPTTSSSI 71
Query: 102 AIFNGPEPFRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQ 161
F G + GK G +SF +++V+E ++ + Q + S LW + P VL++S I P
Sbjct: 72 NHFPGTD---GKPGLISFYNRPYKMVDE--VIISNVQRNPSSLLWFIGPAVLVASFIFPS 126
Query: 162 MFLGNVIEDFIKDNLL 177
++L ++ +D+LL
Sbjct: 127 LYLRKILSAVFEDSLL 142
>gi|255540607|ref|XP_002511368.1| conserved hypothetical protein [Ricinus communis]
gi|223550483|gb|EEF51970.1| conserved hypothetical protein [Ricinus communis]
Length = 367
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 93 SGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVV 152
+ EN + + FN + GK G VSF +++ E + Q+++ S LW P V
Sbjct: 99 TAENLAPTSSFFNHLQGTDGKPGLVSFYNRPYKM--EDDVSKTNVQKNQSSLLWFAGPSV 156
Query: 153 LISSLILPQMFLGNVIEDFIKDNLL 177
L++S I P ++L ++ +D+LL
Sbjct: 157 LVASFIFPSLYLRRILSTVFEDSLL 181
>gi|449477656|ref|XP_004155083.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101208061
[Cucumis sativus]
Length = 366
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 92 PSGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQLVEEGKLMS-APFQEDKGSFLWILAP 150
P+ E+ SL F E GK G +SF + +EGK + + Q ++ FLW + P
Sbjct: 97 PTDEDISLENNSFLHYEGTGGKPGFISF----YNHSKEGKRIPLSSVQSNQYKFLWFVGP 152
Query: 151 VVLISSLILPQMFLGNVIEDFIKDNLL 177
VL++S I P ++L ++ + +D+LL
Sbjct: 153 AVLVASFIFPSLYLRKLLSNIFEDSLL 179
>gi|359807649|ref|NP_001241168.1| uncharacterized protein LOC100803499 [Glycine max]
gi|255634981|gb|ACU17849.1| unknown [Glycine max]
Length = 371
Score = 43.9 bits (102), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF H+ + K++ + + S LW L P VL++S I P ++L V+
Sbjct: 121 GKPGLISFYSRPHR--RDSKILLPNSERSQNSILWFLGPAVLVASFIFPSLYLRKVLSII 178
Query: 172 IKDNLL 177
+D+LL
Sbjct: 179 FEDSLL 184
>gi|21592953|gb|AAM64903.1| unknown [Arabidopsis thaliana]
Length = 367
Score = 42.7 bits (99), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ E ++ Q G LW++ P VL+SS ILP ++L ++
Sbjct: 121 GKPGFISFYNPRNK--TEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAV 178
Query: 172 IKDNLL 177
+D+LL
Sbjct: 179 FEDSLL 184
>gi|18424658|ref|NP_568963.1| uncharacterized protein [Arabidopsis thaliana]
gi|89000983|gb|ABD59081.1| At5g63040 [Arabidopsis thaliana]
gi|332010306|gb|AED97689.1| uncharacterized protein [Arabidopsis thaliana]
Length = 366
Score = 42.4 bits (98), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ E ++ Q G LW++ P VL+SS ILP ++L ++
Sbjct: 120 GKPGFISFYNPRNK--TEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAV 177
Query: 172 IKDNLL 177
+D+LL
Sbjct: 178 FEDSLL 183
>gi|297832432|ref|XP_002884098.1| hypothetical protein ARALYDRAFT_343424 [Arabidopsis lyrata subsp.
lyrata]
gi|297329938|gb|EFH60357.1| hypothetical protein ARALYDRAFT_343424 [Arabidopsis lyrata subsp.
lyrata]
Length = 407
Score = 42.4 bits (98), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 5/74 (6%)
Query: 104 FNGPEPFRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMF 163
+NG + GK G +SF ++ E ++ Q G LW++ P VL+SS ILP ++
Sbjct: 156 YNGND---GKPGFISFYNPRNK--TEDIIVPPESQSTWGRLLWLIGPAVLVSSFILPPIY 210
Query: 164 LGNVIEDFIKDNLL 177
L ++ +D+LL
Sbjct: 211 LRRIVSAVFEDSLL 224
>gi|42573776|ref|NP_974984.1| uncharacterized protein [Arabidopsis thaliana]
gi|9758304|dbj|BAB08847.1| unnamed protein product [Arabidopsis thaliana]
gi|332010307|gb|AED97690.1| uncharacterized protein [Arabidopsis thaliana]
Length = 366
Score = 42.4 bits (98), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ E ++ Q G LW++ P VL+SS ILP ++L ++
Sbjct: 120 GKPGFISFYNPRNK--TEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAV 177
Query: 172 IKDNLL 177
+D+LL
Sbjct: 178 FEDSLL 183
>gi|225456967|ref|XP_002278631.1| PREDICTED: uncharacterized protein LOC100251871 [Vitis vinifera]
Length = 319
Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ +E L+S P ++ + LW + P +L++S I P ++L ++
Sbjct: 69 GKPGLISFYNRPYKRADE-DLISGP-PRNQNNLLWFIGPSILVASFIFPSLYLRRILSTV 126
Query: 172 IKDNLL 177
+D+LL
Sbjct: 127 FEDSLL 132
>gi|297733750|emb|CBI14997.3| unnamed protein product [Vitis vinifera]
Length = 356
Score = 41.2 bits (95), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ +E L+S P ++ + LW + P +L++S I P ++L ++
Sbjct: 106 GKPGLISFYNRPYKRADE-DLISGP-PRNQNNLLWFIGPSILVASFIFPSLYLRRILSTV 163
Query: 172 IKDNLL 177
+D+LL
Sbjct: 164 FEDSLL 169
>gi|356508691|ref|XP_003523088.1| PREDICTED: uncharacterized protein LOC100796720 [Glycine max]
Length = 372
Score = 41.2 bits (95), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ + K++ + + S LW L P VL++S I P ++L V+
Sbjct: 122 GKPGLLSFYNRPYR--RDSKILLPNSERSQNSILWFLGPAVLVASFIFPSLYLRKVLSII 179
Query: 172 IKDNLL 177
+D+LL
Sbjct: 180 FEDSLL 185
>gi|222619131|gb|EEE55263.1| hypothetical protein OsJ_03171 [Oryza sativa Japonica Group]
Length = 372
Score = 41.2 bits (95), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
GKSG +SF G ++Q+ + S P + S +W + P +L++ L+LP ++L V+
Sbjct: 124 GKSGFISFQGSSYQM---KSVESVPHPGKEASRLVWFVGPTILVAFLVLPSLYLRKVLSA 180
Query: 171 FIKDNLL 177
+D+LL
Sbjct: 181 VFEDSLL 187
>gi|168009397|ref|XP_001757392.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691515|gb|EDQ77877.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 252
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 11/72 (15%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLM--SAPFQE----DKGSFLWILAPVVLISSLILPQMFLG 165
GK G VSFC GK S +QE +GS LW+ P+ L+ S++ P ++L
Sbjct: 1 GKPGFVSFCS-----AGSGKEQPESTEYQELKPSQRGSLLWLFGPLALVFSVVGPPLYLR 55
Query: 166 NVIEDFIKDNLL 177
V E ++D+LL
Sbjct: 56 RVFESILEDSLL 67
>gi|218188924|gb|EEC71351.1| hypothetical protein OsI_03428 [Oryza sativa Indica Group]
Length = 372
Score = 39.7 bits (91), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
GKSG +SF G ++++ + S P + S +W + P +L++ L+LP ++L V+
Sbjct: 124 GKSGFISFQGSSYRM---KSVESVPHPGKEASRLVWFVGPTILVAFLVLPSLYLRKVLSA 180
Query: 171 FIKDNLL 177
+D+LL
Sbjct: 181 VFEDSLL 187
>gi|357461909|ref|XP_003601236.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
gi|355490284|gb|AES71487.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
Length = 377
Score = 38.9 bits (89), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ E L+++ + + + LW + P VL++S I P ++L ++
Sbjct: 127 GKPGFISFYNRPYRRDSEILLLNS--ERSQNNILWFVGPAVLVASFIFPSLYLRKLLSII 184
Query: 172 IKDNLL 177
+D+LL
Sbjct: 185 FEDSLL 190
>gi|388517087|gb|AFK46605.1| unknown [Medicago truncatula]
Length = 376
Score = 38.5 bits (88), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ E L+++ + + + LW + P VL++S I P ++L ++
Sbjct: 126 GKPGFISFYNRPYRRDSEILLLNS--ERSQNNILWFVGPAVLVASFIFPSLYLRKLLSII 183
Query: 172 IKDNLL 177
+D+LL
Sbjct: 184 FEDSLL 189
>gi|357461911|ref|XP_003601237.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
gi|355490285|gb|AES71488.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
Length = 330
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
GK G +SF ++ E L+++ + + + LW + P VL++S I P ++L ++
Sbjct: 80 GKPGFISFYNRPYRRDSEILLLNS--ERSQNNILWFVGPAVLVASFIFPSLYLRKLLSII 137
Query: 172 IKDNLL 177
+D+LL
Sbjct: 138 FEDSLL 143
>gi|326496917|dbj|BAJ98485.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 409
Score = 37.4 bits (85), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 37/67 (55%), Gaps = 4/67 (5%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
GK G +SF G + Q + + S P + S +W + P +L++ L+LP ++L V+
Sbjct: 161 GKPGFISFHGGSSQTIS---VESVPHPGKEASRLVWFVGPTILVAFLVLPSLYLRKVLSA 217
Query: 171 FIKDNLL 177
+D+LL
Sbjct: 218 VFEDSLL 224
>gi|441671766|ref|XP_003281660.2| PREDICTED: LOW QUALITY PROTEIN: nephrocystin-4 [Nomascus
leucogenys]
Length = 1583
Score = 37.0 bits (84), Expect = 3.5, Method: Composition-based stats.
Identities = 27/93 (29%), Positives = 42/93 (45%), Gaps = 4/93 (4%)
Query: 9 CRGLQLPKFKSSNCGFLSHQFV--SPSASQFGSRSQSIRFGNAYKFLGNTFPNLKHKIEG 66
CRGL P +S+CGF P+A+ G RS+S+ +L + P+ +
Sbjct: 31 CRGLPAPPNPASSCGFRQQHLHLDGPAAAPIGWRSRSVALDLGSDWLRS--PDGESDARR 88
Query: 67 YSAWSSKRGAILCASNSGSDTKLGFPSGENSSL 99
S +RGA L + + G P G++S L
Sbjct: 89 LPGLSVRRGAPLSEAPPAGGSGRGAPFGQSSVL 121
>gi|414880826|tpg|DAA57957.1| TPA: hypothetical protein ZEAMMB73_152959 [Zea mays]
Length = 384
Score = 36.2 bits (82), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
GK G +SF T + + + S P + S +W + P +L+S L+LP ++L V+
Sbjct: 136 GKPGFISFQESTFR---KSTVESVPHPGKEASRLVWFIGPTILVSFLVLPSLYLRKVLSA 192
Query: 171 FIKDNLL 177
+D+LL
Sbjct: 193 VFEDSLL 199
>gi|195646440|gb|ACG42688.1| hypothetical protein [Zea mays]
Length = 375
Score = 36.2 bits (82), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
GK G +SF T + + + S P + S +W + P +L+S L+LP ++L V+
Sbjct: 127 GKPGFISFQESTFR---KSTVESVPHPGKEASRLVWFIGPTILVSFLVLPSLYLRKVLSA 183
Query: 171 FIKDNLL 177
+D+LL
Sbjct: 184 VFEDSLL 190
>gi|219363149|ref|NP_001136799.1| uncharacterized protein LOC100216945 [Zea mays]
gi|194697152|gb|ACF82660.1| unknown [Zea mays]
gi|414880825|tpg|DAA57956.1| TPA: hypothetical protein ZEAMMB73_152959 [Zea mays]
Length = 375
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
GK G +SF T + + + S P + S +W + P +L+S L+LP ++L V+
Sbjct: 127 GKPGFISFQESTFR---KSTVESVPHPGKEASRLVWFIGPTILVSFLVLPSLYLRKVLSA 183
Query: 171 FIKDNLL 177
+D+LL
Sbjct: 184 VFEDSLL 190
>gi|310830389|ref|YP_003965490.1| Minor extracellular protease epr [Paenibacillus polymyxa SC2]
gi|309249856|gb|ADO59422.1| Minor extracellular protease epr [Paenibacillus polymyxa SC2]
Length = 1035
Score = 35.4 bits (80), Expect = 9.5, Method: Composition-based stats.
Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 5/103 (4%)
Query: 48 NAYKFLGNTF--PNLKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFN 105
N Y ++ + N+++K+EG +AW G+IL +N+ + G S + + N
Sbjct: 310 NIYAYINASLDSTNIQYKLEGDTAWKPYNGSILLTNNTTIYAQTSNSGGYQSEVVSLVVN 369
Query: 106 GPEPFRGKSGSVSFCGLTHQLV---EEGKLMSAPFQEDKGSFL 145
+ S +VS G ++ + + S +Q D GS+L
Sbjct: 370 NIDKTAPTSPTVSIVGNKLTIIPGSDPSGISSTKYQIDNGSWL 412
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,956,907,940
Number of Sequences: 23463169
Number of extensions: 124727463
Number of successful extensions: 235997
Number of sequences better than 100.0: 38
Number of HSP's better than 100.0 without gapping: 22
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 235969
Number of HSP's gapped (non-prelim): 38
length of query: 181
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 48
effective length of database: 9,238,593,890
effective search space: 443452506720
effective search space used: 443452506720
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)