BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033102
(127 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224142377|ref|XP_002324535.1| predicted protein [Populus trichocarpa]
gi|222865969|gb|EEF03100.1| predicted protein [Populus trichocarpa]
Length = 196
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 92/131 (70%), Positives = 110/131 (83%), Gaps = 4/131 (3%)
Query: 1 MASTSAFTAQISSIH----WKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
MAST+A TAQ+++++ WK+P +S NLNLN+N+ GK++LS+S +QKS +RS
Sbjct: 1 MASTTAVTAQVTNLNGLGDWKRPFPASLPLGNLNLNANVVGKQLLSWSGKQKSERNYRSF 60
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
RV GLFGGKK+N+EK DDAPSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELA+AEFD
Sbjct: 61 RVIGLFGGKKENSEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELASAEFD 120
Query: 117 GYCEGELIKVC 127
GYCEGELIKV
Sbjct: 121 GYCEGELIKVT 131
>gi|255577037|ref|XP_002529403.1| conserved hypothetical protein [Ricinus communis]
gi|223531151|gb|EEF32999.1| conserved hypothetical protein [Ricinus communis]
Length = 195
Score = 175 bits (443), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 91/130 (70%), Positives = 106/130 (81%), Gaps = 5/130 (3%)
Query: 1 MASTSAFTAQISSIH----WKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
MASTSA A++S+++ WKKP SS NL+LN+NL G++++S+ QKS RSL
Sbjct: 1 MASTSAVAARVSNLNGLADWKKPNHSSFPIGNLSLNANLVGQQLVSWPGHQKSRRCSRSL 60
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
RV+GLFGGKKDNNEK D+ SKAGILGNMQNLYETVKKAQMVVQVEAV+VQKELA AEFD
Sbjct: 61 RVHGLFGGKKDNNEKSDEQ-SKAGILGNMQNLYETVKKAQMVVQVEAVKVQKELALAEFD 119
Query: 117 GYCEGELIKV 126
GYCEGELIKV
Sbjct: 120 GYCEGELIKV 129
>gi|225445842|ref|XP_002277907.1| PREDICTED: UPF0133 protein Synpcc7942_0464 [Vitis vinifera]
gi|297743656|emb|CBI36539.3| unnamed protein product [Vitis vinifera]
Length = 196
Score = 164 bits (416), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 87/130 (66%), Positives = 98/130 (75%), Gaps = 4/130 (3%)
Query: 1 MASTSAFTAQISSI----HWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
MASTSA T QIS++ K TSSS F L+ N+NL M + + G+ RS+
Sbjct: 1 MASTSALTTQISNLCGFTDCKNTTSSSLFFCKLSPNTNLVSTRMFCQYADSRMGNNRRSV 60
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
RVYGLFGG K+N+EKGDDA SKAG LGNMQNLYETVKKAQMVVQVEAV+VQKELA AEFD
Sbjct: 61 RVYGLFGGNKENSEKGDDARSKAGFLGNMQNLYETVKKAQMVVQVEAVKVQKELAEAEFD 120
Query: 117 GYCEGELIKV 126
GYCEGELIKV
Sbjct: 121 GYCEGELIKV 130
>gi|356543329|ref|XP_003540114.1| PREDICTED: UPF0133 protein Synpcc7942_0464-like [Glycine max]
Length = 196
Score = 157 bits (398), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 82/130 (63%), Positives = 93/130 (71%), Gaps = 4/130 (3%)
Query: 1 MASTSAFTAQISSI----HWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
M +T T +S+ WKK S L+ NSN+ ++LS QK G+ R
Sbjct: 1 MTTTPCVTGALSNAVGFRDWKKHVPLSASLCKLSFNSNIVDMKILSRCGCQKVGNDQRCF 60
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
R+Y LFGGKKDN+ K DDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD
Sbjct: 61 RLYALFGGKKDNSGKSDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 120
Query: 117 GYCEGELIKV 126
GYCEGEL+KV
Sbjct: 121 GYCEGELVKV 130
>gi|351724105|ref|NP_001236790.1| uncharacterized protein LOC100305795 [Glycine max]
gi|255626621|gb|ACU13655.1| unknown [Glycine max]
Length = 195
Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/130 (60%), Positives = 90/130 (69%), Gaps = 5/130 (3%)
Query: 1 MASTSAFTAQISSI----HWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
M + + T +S+ WKK S L+ +SN+ + +S +K GH R
Sbjct: 1 MTTAPSLTGPLSNAVGFSDWKKHAPLSASLCKLSSSSNIVDMK-ISRCGCRKVGHGQRCF 59
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
R+Y LFGGKKDNN K DD PSKAGILGNMQNLYETVKKAQMVVQVEA RVQKELAAAEFD
Sbjct: 60 RLYALFGGKKDNNGKSDDVPSKAGILGNMQNLYETVKKAQMVVQVEAARVQKELAAAEFD 119
Query: 117 GYCEGELIKV 126
GYCEGEL+KV
Sbjct: 120 GYCEGELVKV 129
>gi|388500608|gb|AFK38370.1| unknown [Lotus japonicus]
Length = 188
Score = 145 bits (366), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 78/126 (61%), Positives = 90/126 (71%), Gaps = 4/126 (3%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MAST + I T SS LN +S++ ++L S +K+GH R LR+YG
Sbjct: 1 MASTVSLPNAIGLCPRNNHTLSSTSHCKLNSSSSIVDMKILPQSGSRKAGHGQRHLRLYG 60
Query: 61 LFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE 120
LFGGKKD GDD+ +K GILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE
Sbjct: 61 LFGGKKD----GDDSSNKGGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE 116
Query: 121 GELIKV 126
GEL+KV
Sbjct: 117 GELVKV 122
>gi|223948417|gb|ACN28292.1| unknown [Zea mays]
gi|413926357|gb|AFW66289.1| hypothetical protein ZEAMMB73_706905 [Zea mays]
Length = 186
Score = 144 bits (363), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/121 (60%), Positives = 89/121 (73%), Gaps = 2/121 (1%)
Query: 6 AFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGK 65
A +A ++ + ++ S+ SN N+ G L S+++++ +RS RVY LFGGK
Sbjct: 2 APSAALAPVAFRSAFSTPLASNPTRNRINIEGTFCLPCSARKRAS--YRSFRVYSLFGGK 59
Query: 66 KDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIK 125
KD +E GD+APSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCEGELIK
Sbjct: 60 KDKDENGDEAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCEGELIK 119
Query: 126 V 126
V
Sbjct: 120 V 120
>gi|388507246|gb|AFK41689.1| unknown [Lotus japonicus]
Length = 188
Score = 143 bits (361), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/126 (61%), Positives = 88/126 (69%), Gaps = 4/126 (3%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MAST + I T SS LN +S++ ++L S +K GH R LR+YG
Sbjct: 1 MASTVSLPNAIGLCPRNNHTLSSTSHCKLNSSSSIVDMKILPQSGSRKVGHGQRHLRLYG 60
Query: 61 LFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE 120
LFGGKKD GDD+P+K ILGNMQNLYETVKKAQMVVQVEAVRVQKELAA EFDGYCE
Sbjct: 61 LFGGKKD----GDDSPNKGRILGNMQNLYETVKKAQMVVQVEAVRVQKELAATEFDGYCE 116
Query: 121 GELIKV 126
GEL+KV
Sbjct: 117 GELVKV 122
>gi|413935880|gb|AFW70431.1| hypothetical protein ZEAMMB73_296576 [Zea mays]
Length = 265
Score = 143 bits (360), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/126 (57%), Positives = 87/126 (69%), Gaps = 6/126 (4%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MA ++A + P +S N +N+ G L S+++++ +R RVY
Sbjct: 80 MAPSTALAPVAFRSAFSTPLASKPTRNRMNIE----GAFCLPCSTRKRA--SYRPFRVYS 133
Query: 61 LFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE 120
LFGGKKD +E GD+APSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCE
Sbjct: 134 LFGGKKDKDENGDEAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCE 193
Query: 121 GELIKV 126
GELIKV
Sbjct: 194 GELIKV 199
>gi|226501168|ref|NP_001141299.1| uncharacterized protein LOC100273390 [Zea mays]
gi|194703866|gb|ACF86017.1| unknown [Zea mays]
Length = 186
Score = 142 bits (358), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/126 (57%), Positives = 87/126 (69%), Gaps = 6/126 (4%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MA ++A + P +S N +N+ G L S+++++ +R RVY
Sbjct: 1 MAPSTALAPVAFRSAFSTPLASKPTRNRMNIE----GAFCLPCSTRKRA--SYRPFRVYS 54
Query: 61 LFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE 120
LFGGKKD +E GD+APSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCE
Sbjct: 55 LFGGKKDKDENGDEAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCE 114
Query: 121 GELIKV 126
GELIKV
Sbjct: 115 GELIKV 120
>gi|449457887|ref|XP_004146679.1| PREDICTED: nucleoid-associated protein Synpcc7942_0464-like
[Cucumis sativus]
gi|449503169|ref|XP_004161868.1| PREDICTED: nucleoid-associated protein Synpcc7942_0464-like
[Cucumis sativus]
Length = 187
Score = 139 bits (349), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 81/128 (63%), Positives = 89/128 (69%), Gaps = 9/128 (7%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNS--NLAGKEMLSFSSQQKSGHQFRSLRV 58
MAST + +AQ+ ++ S+F NLNS N+ G LS K RSL V
Sbjct: 1 MASTISLSAQLPNLR-----GISDFKKRSNLNSMSNIVGTR-LSPYGPWKVEKNNRSLCV 54
Query: 59 YGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGY 118
YGLFGGKKD EK DDAPSKAGI GNMQ LYETV+ AQ VVQVEAVRVQKELAAAEFDGY
Sbjct: 55 YGLFGGKKDE-EKSDDAPSKAGIFGNMQKLYETVRTAQKVVQVEAVRVQKELAAAEFDGY 113
Query: 119 CEGELIKV 126
CEGELIKV
Sbjct: 114 CEGELIKV 121
>gi|388505390|gb|AFK40761.1| unknown [Lotus japonicus]
Length = 191
Score = 137 bits (346), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/130 (60%), Positives = 89/130 (68%), Gaps = 9/130 (6%)
Query: 1 MASTSAFTAQISSIH----WKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
MAST + T Q+S+IH KK + S LN ++N+ LS +K G R
Sbjct: 1 MASTVSLTGQLSNIHGASERKKLSLLSTSLCKLNSSANVVDMRPLSQRGHKKVGRD-RCF 59
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
RVY LFGGKK+NN D SK G+LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD
Sbjct: 60 RVYALFGGKKENN----DGDSKTGMLGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 115
Query: 117 GYCEGELIKV 126
GYCE ELIKV
Sbjct: 116 GYCEDELIKV 125
>gi|115444621|ref|NP_001046090.1| Os02g0180200 [Oryza sativa Japonica Group]
gi|50252089|dbj|BAD28019.1| unknown protein [Oryza sativa Japonica Group]
gi|113535621|dbj|BAF08004.1| Os02g0180200 [Oryza sativa Japonica Group]
gi|125538336|gb|EAY84731.1| hypothetical protein OsI_06100 [Oryza sativa Indica Group]
gi|125581039|gb|EAZ21970.1| hypothetical protein OsJ_05624 [Oryza sativa Japonica Group]
Length = 185
Score = 135 bits (339), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/126 (59%), Positives = 89/126 (70%), Gaps = 7/126 (5%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MA ++A +S + +K S F N N+ G L +++K+ + RS RVY
Sbjct: 1 MAPSTA----LSPVAFKSSFSPLLF-NPTRSKINVEGAFCLPCYNRKKASN--RSFRVYS 53
Query: 61 LFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCE 120
LFGGKKD +E G++APSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCE
Sbjct: 54 LFGGKKDKDENGEEAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCE 113
Query: 121 GELIKV 126
GELIKV
Sbjct: 114 GELIKV 119
>gi|297825313|ref|XP_002880539.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326378|gb|EFH56798.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 182
Score = 134 bits (336), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/106 (67%), Positives = 83/106 (78%), Gaps = 8/106 (7%)
Query: 23 SNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGG-KKDNNEKGDDAPSKAGI 81
S+ S N +LNS + +++ QQKS + +RSLRV GLFGG KDNN +D SKAGI
Sbjct: 19 SHVSGNASLNS-----QRVTWPKQQKSKNGYRSLRVNGLFGGGNKDNNS--EDGQSKAGI 71
Query: 82 LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVC 127
GNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGEL+KV
Sbjct: 72 FGNMQNMYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELVKVT 117
>gi|242060746|ref|XP_002451662.1| hypothetical protein SORBIDRAFT_04g005460 [Sorghum bicolor]
gi|241931493|gb|EES04638.1| hypothetical protein SORBIDRAFT_04g005460 [Sorghum bicolor]
Length = 185
Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 82/111 (73%), Gaps = 7/111 (6%)
Query: 16 WKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGKKDNNEKGDDA 75
+ P +S+ N +N+ G L S+++++ +R RVY LFGGKKD +E GD+
Sbjct: 16 FSTPLASNPTRNRINIE----GAFCLPCSTRKRAS--YRPFRVYSLFGGKKDKDENGDE- 68
Query: 76 PSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
PSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCEGELIKV
Sbjct: 69 PSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCEGELIKV 119
>gi|18400436|ref|NP_565561.1| uncharacterized protein [Arabidopsis thaliana]
gi|15081684|gb|AAK82497.1| At2g24020/T29E15.22 [Arabidopsis thaliana]
gi|20147169|gb|AAM10301.1| At2g24020/T29E15.22 [Arabidopsis thaliana]
gi|20197299|gb|AAC63670.2| expressed protein [Arabidopsis thaliana]
gi|330252424|gb|AEC07518.1| uncharacterized protein [Arabidopsis thaliana]
Length = 182
Score = 127 bits (320), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/106 (66%), Positives = 80/106 (75%), Gaps = 8/106 (7%)
Query: 23 SNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGG-KKDNNEKGDDAPSKAGI 81
S+ S N +LNS + ++ Q KS + +RSLRV GLFGG KDNN +D SKAGI
Sbjct: 19 SHVSGNASLNS-----QRRTWPKQYKSKNGYRSLRVNGLFGGGNKDNNS--EDGQSKAGI 71
Query: 82 LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVC 127
GNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KV
Sbjct: 72 FGNMQNMYETVKKAQMVVQVEAVRVQKELAAAEFDGYCAGELVKVT 117
>gi|255548728|ref|XP_002515420.1| conserved hypothetical protein [Ricinus communis]
gi|223545364|gb|EEF46869.1| conserved hypothetical protein [Ricinus communis]
Length = 196
Score = 127 bits (320), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/124 (58%), Positives = 84/124 (67%), Gaps = 2/124 (1%)
Query: 2 ASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGL 61
AST S K PT SS +N +N G L S ++G SLR+ GL
Sbjct: 8 ASTPKLGNCCSLTFHKNPTFSSISFCKINSFANHIGP--LIVSRPDRTGKVSLSLRLTGL 65
Query: 62 FGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEG 121
FGGKK+NN+K ++ SKAG++GNMQNLYETVKKAQMVVQVEAVRVQKELA AEFDGYC+
Sbjct: 66 FGGKKENNDKSEEKQSKAGLMGNMQNLYETVKKAQMVVQVEAVRVQKELAVAEFDGYCQD 125
Query: 122 ELIK 125
ELIK
Sbjct: 126 ELIK 129
>gi|297798904|ref|XP_002867336.1| hypothetical protein ARALYDRAFT_913413 [Arabidopsis lyrata subsp.
lyrata]
gi|297313172|gb|EFH43595.1| hypothetical protein ARALYDRAFT_913413 [Arabidopsis lyrata subsp.
lyrata]
Length = 181
Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/128 (60%), Positives = 88/128 (68%), Gaps = 12/128 (9%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MAST+ T + KP S FSN N ++ + ++ + + +Q RSLRV G
Sbjct: 1 MASTATTT------DFFKPLLSP-FSNG-NAAQRISRQNVVWLNRKNSGNNQSRSLRVNG 52
Query: 61 LFGG-KKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYC 119
LFGG KKDN E D SKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELA AEFDGYC
Sbjct: 53 LFGGGKKDNKE---DGQSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAVAEFDGYC 109
Query: 120 EGELIKVC 127
EGEL+KV
Sbjct: 110 EGELVKVT 117
>gi|326518424|dbj|BAJ88241.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 210
Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/129 (56%), Positives = 89/129 (68%), Gaps = 10/129 (7%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MAS++A +S + +K S +F N G L S+ ++S + RS RVY
Sbjct: 23 MASSTA----LSPVAFKSSFSPLSF-RPARTRINAQGAFCLPCSTWKRSSN--RSFRVYS 75
Query: 61 LFGGKKDNNEKGDDAPSKAGI---LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDG 117
LFGGKK+ +E GD+AP+K GI +GNMQNLYETVKKAQMVVQVEAVRVQKELAA E +G
Sbjct: 76 LFGGKKEKDENGDEAPAKGGIFGNIGNMQNLYETVKKAQMVVQVEAVRVQKELAATEIEG 135
Query: 118 YCEGELIKV 126
YCEGELIKV
Sbjct: 136 YCEGELIKV 144
>gi|15234797|ref|NP_194791.1| uncharacterized protein [Arabidopsis thaliana]
gi|7269963|emb|CAB79780.1| putative protein [Arabidopsis thaliana]
gi|18650641|gb|AAL75890.1| AT4g30620/F17I23_40 [Arabidopsis thaliana]
gi|56236132|gb|AAV84522.1| At4g30620 [Arabidopsis thaliana]
gi|98960937|gb|ABF58952.1| At4g30620 [Arabidopsis thaliana]
gi|332660387|gb|AEE85787.1| uncharacterized protein [Arabidopsis thaliana]
Length = 180
Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/83 (78%), Positives = 71/83 (85%), Gaps = 4/83 (4%)
Query: 46 QQKSGHQFRSLRVYGLFGG-KKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAV 104
+++SG+ RSLRV GLFGG KKDN E D SKAGILGNMQNLYETVKKAQMVVQVEAV
Sbjct: 37 RKQSGNNNRSLRVNGLFGGGKKDNKE---DGQSKAGILGNMQNLYETVKKAQMVVQVEAV 93
Query: 105 RVQKELAAAEFDGYCEGELIKVC 127
RVQKELA AEFDGYC+GEL+KV
Sbjct: 94 RVQKELAVAEFDGYCQGELVKVT 116
>gi|116791488|gb|ABK26000.1| unknown [Picea sitchensis]
Length = 193
Score = 126 bits (317), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/128 (54%), Positives = 87/128 (67%), Gaps = 3/128 (2%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQ--QKSGHQFRSLRV 58
MAS+++ + + + P++ S FS N S A ++ F S+ Q+ + +L V
Sbjct: 1 MASSASCMSACTGTRYYLPSADS-FSFRRNCVSPHANRQNTHFVSRGGQRGRYHRGALCV 59
Query: 59 YGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGY 118
LFGGKK+NNE D+ KAG+ GNMQ LYETVKKAQMVVQVEAVRVQKELAA EFDGY
Sbjct: 60 KSLFGGKKENNENTDETAKKAGLFGNMQGLYETVKKAQMVVQVEAVRVQKELAATEFDGY 119
Query: 119 CEGELIKV 126
CEGELIKV
Sbjct: 120 CEGELIKV 127
>gi|334184400|ref|NP_001189586.1| uncharacterized protein [Arabidopsis thaliana]
gi|330252425|gb|AEC07519.1| uncharacterized protein [Arabidopsis thaliana]
Length = 180
Score = 125 bits (314), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 77/128 (60%), Positives = 88/128 (68%), Gaps = 14/128 (10%)
Query: 1 MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYG 60
MAS +A T S+ + P S N S LNS + ++ Q KS + +RSLRV G
Sbjct: 1 MASMAATTNFTKSMLF--PFSHGNAS----LNS-----QRRTWPKQYKSKNGYRSLRVNG 49
Query: 61 LFGG-KKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYC 119
LFGG KDNN +D SKAGI GNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC
Sbjct: 50 LFGGGNKDNNS--EDGQSKAGIFGNMQNMYETVKKAQMVVQVEAVRVQKELAAAEFDGYC 107
Query: 120 EGELIKVC 127
GEL+KV
Sbjct: 108 AGELVKVT 115
>gi|21553866|gb|AAM62959.1| unknown [Arabidopsis thaliana]
gi|32892213|gb|AAP89016.1| complex interacting protein 9 [Arabidopsis thaliana]
Length = 180
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/83 (77%), Positives = 70/83 (84%), Gaps = 4/83 (4%)
Query: 46 QQKSGHQFRSLRVYGLFGG-KKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAV 104
+++SG+ RSLRV LFGG KKDN E D SKAGILGNMQNLYETVKKAQMVVQVEAV
Sbjct: 37 RKQSGNNNRSLRVNVLFGGGKKDNKE---DGQSKAGILGNMQNLYETVKKAQMVVQVEAV 93
Query: 105 RVQKELAAAEFDGYCEGELIKVC 127
RVQKELA AEFDGYC+GEL+KV
Sbjct: 94 RVQKELAVAEFDGYCQGELVKVT 116
>gi|413926359|gb|AFW66291.1| hypothetical protein ZEAMMB73_706905 [Zea mays]
Length = 121
Score = 119 bits (297), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 62/110 (56%), Positives = 78/110 (70%), Gaps = 2/110 (1%)
Query: 6 AFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGK 65
A +A ++ + ++ S+ SN N+ G L S+++++ + RS RVY LFGGK
Sbjct: 2 APSAALAPVAFRSAFSTPLASNPTRNRINIEGTFCLPCSARKRASY--RSFRVYSLFGGK 59
Query: 66 KDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEF 115
KD +E GD+APSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA F
Sbjct: 60 KDKDENGDEAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAYAF 109
>gi|357480043|ref|XP_003610307.1| Complex interacting protein [Medicago truncatula]
gi|355511362|gb|AES92504.1| Complex interacting protein [Medicago truncatula]
Length = 191
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/94 (63%), Positives = 70/94 (74%), Gaps = 1/94 (1%)
Query: 33 SNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETV 92
+N+ ++S QK G R+ RV+ LFGGKKDN E GD+ G LGNMQNL+ETV
Sbjct: 34 ANVVDMRLVSPRGLQKVGSNHRNFRVHALFGGKKDN-ESGDNKGGMLGQLGNMQNLFETV 92
Query: 93 KKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
KKAQ VVQVE+ R+QKELAAAEFDGYCEGELIKV
Sbjct: 93 KKAQAVVQVESSRIQKELAAAEFDGYCEGELIKV 126
>gi|194697066|gb|ACF82617.1| unknown [Zea mays]
gi|413926358|gb|AFW66290.1| hypothetical protein ZEAMMB73_706905 [Zea mays]
Length = 150
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/110 (56%), Positives = 78/110 (70%), Gaps = 2/110 (1%)
Query: 6 AFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGK 65
A +A ++ + ++ S+ SN N+ G L S+++++ +RS RVY LFGGK
Sbjct: 2 APSAALAPVAFRSAFSTPLASNPTRNRINIEGTFCLPCSARKRAS--YRSFRVYSLFGGK 59
Query: 66 KDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEF 115
KD +E GD+APSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELAA F
Sbjct: 60 KDKDENGDEAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAALHF 109
>gi|297850530|ref|XP_002893146.1| hypothetical protein ARALYDRAFT_335341 [Arabidopsis lyrata subsp.
lyrata]
gi|297338988|gb|EFH69405.1| hypothetical protein ARALYDRAFT_335341 [Arabidopsis lyrata subsp.
lyrata]
Length = 148
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 57/75 (76%), Positives = 61/75 (81%), Gaps = 4/75 (5%)
Query: 53 FRSLRVY-GLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELA 111
RSLR + G GK+ E DD +KAGILGNMQNLYETVKKAQMVVQVEAVRVQKELA
Sbjct: 14 LRSLRFFSGFISGKR---EPEDDVDTKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELA 70
Query: 112 AAEFDGYCEGELIKV 126
AEFDGYCEGEL+KV
Sbjct: 71 VAEFDGYCEGELVKV 85
>gi|168025964|ref|XP_001765503.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683353|gb|EDQ69764.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 190
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/76 (68%), Positives = 60/76 (78%), Gaps = 5/76 (6%)
Query: 54 RSLRVYGLFGGKKDNNEKGDDAPSKAGI---LGNMQNLYETVKKAQMVVQVEAVRVQKEL 110
RSL V L+GGKK+ N DDA K G+ LGNMQ LY+TV+KAQ VVQVEAVRVQKEL
Sbjct: 50 RSLNVTSLWGGKKEENP--DDAAKKQGLGNMLGNMQGLYDTVRKAQQVVQVEAVRVQKEL 107
Query: 111 AAAEFDGYCEGELIKV 126
AAAEFDGYC+ EL++V
Sbjct: 108 AAAEFDGYCDDELVRV 123
>gi|147843449|emb|CAN79971.1| hypothetical protein VITISV_010071 [Vitis vinifera]
Length = 118
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/49 (93%), Positives = 47/49 (95%)
Query: 78 KAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
KAG LGNMQNLYETVKKAQMVVQVEAV+VQKELA AEFDGYCEGELIKV
Sbjct: 4 KAGFLGNMQNLYETVKKAQMVVQVEAVKVQKELAEAEFDGYCEGELIKV 52
>gi|168047282|ref|XP_001776100.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672610|gb|EDQ59145.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 287
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 51/76 (67%), Positives = 59/76 (77%), Gaps = 5/76 (6%)
Query: 54 RSLRVYGLFGGKKDNNEKGDDAPSKAGI---LGNMQNLYETVKKAQMVVQVEAVRVQKEL 110
RSL V L+GGKK+ N D+A K G+ LGNMQ LY+TV+KAQ VVQVEAVRVQKEL
Sbjct: 147 RSLSVTSLWGGKKEENS--DEAAKKQGLGNMLGNMQGLYDTVRKAQQVVQVEAVRVQKEL 204
Query: 111 AAAEFDGYCEGELIKV 126
AAAEFDGY + EL+KV
Sbjct: 205 AAAEFDGYSDDELVKV 220
>gi|168014741|ref|XP_001759910.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689040|gb|EDQ75414.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 363
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 48/78 (61%), Positives = 55/78 (70%), Gaps = 8/78 (10%)
Query: 54 RSLRVYGLFGGKKDNNEKGDDAPSKAGI------LGNMQNLYETVKKAQMVVQVEAVRVQ 107
R L + L+GG KD N D K G+ LGNMQNLY+TVKKAQ VVQVEA +VQ
Sbjct: 96 RPLSITSLWGGNKDQNPA--DTNKKGGMGNMLGALGNMQNLYDTVKKAQQVVQVEAAKVQ 153
Query: 108 KELAAAEFDGYCEGELIK 125
KELAAAEFDGYC+ EL+K
Sbjct: 154 KELAAAEFDGYCDEELVK 171
>gi|226503713|ref|NP_001140993.1| uncharacterized protein LOC100273072 [Zea mays]
gi|194702084|gb|ACF85126.1| unknown [Zea mays]
gi|413926361|gb|AFW66293.1| hypothetical protein ZEAMMB73_706905 [Zea mays]
Length = 108
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/42 (95%), Positives = 40/42 (95%)
Query: 85 MQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
MQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCEGELIKV
Sbjct: 1 MQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCEGELIKV 42
>gi|413935881|gb|AFW70432.1| hypothetical protein ZEAMMB73_296576 [Zea mays]
Length = 108
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/42 (95%), Positives = 40/42 (95%)
Query: 85 MQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
MQNLYETVKKAQMVVQVEAVRVQKELAA E DGYCEGELIKV
Sbjct: 1 MQNLYETVKKAQMVVQVEAVRVQKELAATEIDGYCEGELIKV 42
>gi|302756569|ref|XP_002961708.1| hypothetical protein SELMODRAFT_76950 [Selaginella moellendorffii]
gi|302762711|ref|XP_002964777.1| hypothetical protein SELMODRAFT_82830 [Selaginella moellendorffii]
gi|300167010|gb|EFJ33615.1| hypothetical protein SELMODRAFT_82830 [Selaginella moellendorffii]
gi|300170367|gb|EFJ36968.1| hypothetical protein SELMODRAFT_76950 [Selaginella moellendorffii]
Length = 111
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/46 (82%), Positives = 42/46 (91%)
Query: 80 GILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIK 125
GI GNMQ LY+TV+KAQ VVQVEAV+VQKELAA EFDGYCEGEL+K
Sbjct: 1 GIFGNMQELYQTVRKAQQVVQVEAVKVQKELAATEFDGYCEGELVK 46
>gi|159480638|ref|XP_001698389.1| predicted protein [Chlamydomonas reinhardtii]
gi|28207763|gb|AAO32624.1| CR084 protein [Chlamydomonas reinhardtii]
gi|158282129|gb|EDP07882.1| predicted protein [Chlamydomonas reinhardtii]
Length = 157
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 33/43 (76%)
Query: 84 NMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
+M+NL E+VKKAQ +VQ E RVQ ELAA EF+GY E E ++V
Sbjct: 50 DMKNLMESVKKAQQLVQTETARVQAELAATEFEGYDEEETVRV 92
>gi|384247335|gb|EIE20822.1| DUF149-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 111
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 32/43 (74%)
Query: 85 MQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVC 127
M N+ E VKKAQ +VQVEA +VQ+EL+ AEFDGY E I+V
Sbjct: 1 MANMMENVKKAQQLVQVEAAKVQEELSKAEFDGYSSDETIRVV 43
>gi|307103022|gb|EFN51287.1| hypothetical protein CHLNCDRAFT_28254 [Chlorella variabilis]
Length = 110
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 33/46 (71%)
Query: 82 LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVC 127
GNM NL E +KKAQ +VQVEA +VQ+ELA +FDGY E ++V
Sbjct: 4 FGNMGNLMENLKKAQAMVQVEAAKVQEELANTDFDGYSADETVRVV 49
>gi|302831828|ref|XP_002947479.1| hypothetical protein VOLCADRAFT_73224 [Volvox carteri f.
nagariensis]
gi|300267343|gb|EFJ51527.1| hypothetical protein VOLCADRAFT_73224 [Volvox carteri f.
nagariensis]
Length = 159
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 32/43 (74%)
Query: 84 NMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKV 126
+M+NL E+VKKAQ +VQ E RVQ ELA EF+GY E E ++V
Sbjct: 52 DMKNLMESVKKAQQLVQTETARVQSELANTEFEGYDEEETVRV 94
>gi|413926360|gb|AFW66292.1| hypothetical protein ZEAMMB73_706905 [Zea mays]
Length = 115
Score = 52.0 bits (123), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
Query: 6 AFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGK 65
A +A ++ + ++ S+ SN N+ G L S+++++ +RS RVY LFGGK
Sbjct: 2 APSAALAPVAFRSAFSTPLASNPTRNRINIEGTFCLPCSARKRAS--YRSFRVYSLFGGK 59
Query: 66 KDNNEKGDDAPSK 78
KD +E GD+APSK
Sbjct: 60 KDKDENGDEAPSK 72
>gi|254422595|ref|ZP_05036313.1| conserved hypothetical protein TIGR00103 [Synechococcus sp. PCC
7335]
gi|196190084|gb|EDX85048.1| conserved hypothetical protein TIGR00103 [Synechococcus sp. PCC
7335]
Length = 114
Score = 35.8 bits (81), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 82 LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVC 127
LG M+ L E KKAQ V Q A R+Q+EL E +G G L+KV
Sbjct: 11 LGKMKELSEAFKKAQQV-QEGAKRLQEELEQMEIEGTAGGGLVKVI 55
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.126 0.348
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,788,836,381
Number of Sequences: 23463169
Number of extensions: 63835410
Number of successful extensions: 142812
Number of sequences better than 100.0: 53
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 142738
Number of HSP's gapped (non-prelim): 53
length of query: 127
length of database: 8,064,228,071
effective HSP length: 93
effective length of query: 34
effective length of database: 10,177,120,650
effective search space: 346022102100
effective search space used: 346022102100
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 71 (32.0 bits)