BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028862
(202 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255559869|ref|XP_002520953.1| conserved hypothetical protein [Ricinus communis]
gi|223539790|gb|EEF41370.1| conserved hypothetical protein [Ricinus communis]
Length = 202
Score = 212 bits (539), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 110/197 (55%), Positives = 144/197 (73%), Gaps = 9/197 (4%)
Query: 9 NMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
+++EYSA++T I+FD IPLLRGP+P DDPSSG Y LAFR+ QSWA+AY+ ESK++
Sbjct: 8 DLDEYSASATTIEFDFQIPLLRGPIPAAESDDPSSGPYFLAFRNPQSWATAYKLSESKII 67
Query: 66 SQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGF 125
SQCE GARIGC+++AS+KCKPPWWRNLIG + AD KE+E+CE REME CL+ AK+KC GF
Sbjct: 68 SQCEGGARIGCAISASQKCKPPWWRNLIGWKVADLKEKENCEEREMEGCLVAAKEKCVGF 127
Query: 126 AKDRLLRPFQDARIAGRVGPKEVEK-LVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
AK++ RPF +ARIA VG +EV K L+C S+PE W L+G + + F + TN
Sbjct: 128 AKEKCFRPFSEARIA--VGKEEVVKRLMCLVSVPEKTGIKWFRLMGID--DKCRFGI-TN 182
Query: 185 YRAADLLGSDNNYKRFL 201
+A++LL D Y F
Sbjct: 183 QKASELLAFDPKYTWFF 199
>gi|224062315|ref|XP_002300815.1| predicted protein [Populus trichocarpa]
gi|222842541|gb|EEE80088.1| predicted protein [Populus trichocarpa]
Length = 201
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/197 (56%), Positives = 140/197 (71%), Gaps = 10/197 (5%)
Query: 9 NMEEYSAASTVIKFDKPIPLLRGPVPH---DDPSSGSYLLAFRSHQSWASAYESCESKVL 65
N++EYSA++T IKFD+PIPLLRGP+P DDPS ++LAFRS QSWA AY++CESK++
Sbjct: 8 NLDEYSASATTIKFDRPIPLLRGPIPAGPPDDPSISPHVLAFRSPQSWAVAYKTCESKII 67
Query: 66 SQCEHGARIGCSVNASKKCKPPWWRNLI-GQRPADFKEREDCEVREMEACLIVAKDKCAG 124
SQCE GAR GC++ AS KCKPPWWR I G R D KERE CE RE+E CL+ AKDKC G
Sbjct: 68 SQCEEGARFGCAITASNKCKPPWWRGFIGGARLEDLKERERCEDRELEGCLVAAKDKCVG 127
Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
FAK++ +PF+DARIA R + V LV S+P+ S W LIG G+ F V TN
Sbjct: 128 FAKEKCWKPFRDARIAVR-EEEVVRNLVSLVSVPQ-RSSKWISLIG---FGKFNFGV-TN 181
Query: 185 YRAADLLGSDNNYKRFL 201
+RA++ LGS+ N+ F
Sbjct: 182 FRASEFLGSNKNHAWFF 198
>gi|356570604|ref|XP_003553475.1| PREDICTED: uncharacterized protein LOC100790558 [Glycine max]
Length = 156
Score = 179 bits (453), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 84/144 (58%), Positives = 113/144 (78%), Gaps = 4/144 (2%)
Query: 1 MDSTGTNSNMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAY 57
MD++ + N+++YS +STVIKFD+P+PLLRGP+P DDPS+G Y+LAFR ++WAS++
Sbjct: 1 MDASEPD-NVDDYSPSSTVIKFDRPVPLLRGPLPAGPSDDPSAGPYVLAFRDSRAWASSF 59
Query: 58 ESCESKVLSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIV 117
+CE K++ QCE GARIGC+V+AS+KCK PWW++L G +D KERE CEVREM C
Sbjct: 60 AACERKIVEQCEEGARIGCAVSASRKCKLPWWKSLAGPTLSDLKEREQCEVREMADCEAA 119
Query: 118 AKDKCAGFAKDRLLRPFQDARIAG 141
AK+KC GFA+D+ L PF+DARI G
Sbjct: 120 AKEKCVGFARDKCLVPFRDARIRG 143
>gi|449455252|ref|XP_004145367.1| PREDICTED: uncharacterized protein LOC101218153 [Cucumis sativus]
gi|449472029|ref|XP_004153474.1| PREDICTED: uncharacterized protein LOC101205543 [Cucumis sativus]
Length = 162
Score = 173 bits (438), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/146 (58%), Positives = 107/146 (73%), Gaps = 6/146 (4%)
Query: 9 NMEEYSAASTVIKFDKPIPLLRGPV---PHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
N EYS++ST+I F +PIPLLRGPV ++PS+G YLLAFR Q+W SA+ +CESK++
Sbjct: 19 NPHEYSSSSTIITFQRPIPLLRGPVRASQSENPSAGPYLLAFRDRQAWESAFRACESKII 78
Query: 66 SQCEHGARIGCSVNASKKCKPPWWRNLI-GQRPADFKEREDCEVREMEACLIVAKDKCAG 124
QCE GARIGCS+ AS KCKPPWW L+ ++ D KERE CE EMEACL VAK+KC G
Sbjct: 79 EQCEVGARIGCSITASNKCKPPWWGFLLRSKKGLDLKEREQCEELEMEACLAVAKEKCVG 138
Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEK 150
FAK++ +PF +AR+ G G K EK
Sbjct: 139 FAKEKCYKPFMEARVVG--GRKLTEK 162
>gi|225437166|ref|XP_002274823.1| PREDICTED: uncharacterized protein LOC100254442 [Vitis vinifera]
gi|296084494|emb|CBI25053.3| unnamed protein product [Vitis vinifera]
Length = 208
Score = 172 bits (436), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/199 (48%), Positives = 134/199 (67%), Gaps = 10/199 (5%)
Query: 6 TNSNMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCES 62
T +++ +YS ++T+ FD P+PLLRGPVP DDPS+G ++LAFR ++W SA+++CES
Sbjct: 3 TPTDLSDYSPSATITPFDCPVPLLRGPVPASPSDDPSAGPFVLAFRDSRAWRSAFKACES 62
Query: 63 KVLSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKC 122
K++ QCE GARIGC+V+AS KC PPWW LIG+ P ERE+CE REM AC+ +K+KC
Sbjct: 63 KIIEQCEAGARIGCAVSASSKCGPPWWWALIGRSPPSLAEREECEEREMAACVAASKEKC 122
Query: 123 AGFAKDRLLRPFQDARIA------GRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGE 176
FAKD+ L PF+DARIA R +E +L+ WAS+ S+ + +GS W+
Sbjct: 123 VKFAKDKCLGPFRDARIAVNGRDLTRKKKREAVELIFWASMGGRRNSIGLENLGS-WVLS 181
Query: 177 REFRVRTNYRAADLLGSDN 195
+ TNYR +DLL + N
Sbjct: 182 KCKLGETNYRGSDLLDTQN 200
>gi|297821264|ref|XP_002878515.1| hypothetical protein ARALYDRAFT_324763 [Arabidopsis lyrata subsp.
lyrata]
gi|297324353|gb|EFH54774.1| hypothetical protein ARALYDRAFT_324763 [Arabidopsis lyrata subsp.
lyrata]
Length = 173
Score = 172 bits (436), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 91/181 (50%), Positives = 118/181 (65%), Gaps = 19/181 (10%)
Query: 11 EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
++YSAA+T + F++PIPLLRGPVP S GSY+LAFRS SW++A++ CE+ + QC+
Sbjct: 5 DDYSAAATTVVFERPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCQE 60
Query: 71 GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
GARIGC+V+AS CKPPWWR D +ER+ CE RE +AC+ +K KCA FAKD+
Sbjct: 61 GARIGCAVSASNNCKPPWWRG-----SGDMRERDKCEEREFQACVAASKGKCAAFAKDKC 115
Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAADL 190
F DARI+ KEVE +VC AS+PE R WRDL+G +G A DL
Sbjct: 116 SGAFLDARIS-----KEVEGMVCLASMPEHSR--WRDLMG---IGSLHLHTNNCCTARDL 165
Query: 191 L 191
L
Sbjct: 166 L 166
>gi|449530293|ref|XP_004172130.1| PREDICTED: uncharacterized LOC101205543 [Cucumis sativus]
Length = 151
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 81/137 (59%), Positives = 103/137 (75%), Gaps = 4/137 (2%)
Query: 9 NMEEYSAASTVIKFDKPIPLLRGPV---PHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
N EYS++ST+I F +PIPLLRGPV ++PS+G YLLAFR Q+W SA+ +CESK++
Sbjct: 8 NPHEYSSSSTIITFQRPIPLLRGPVRASQSENPSAGPYLLAFRDRQAWESAFRACESKII 67
Query: 66 SQCEHGARIGCSVNASKKCKPPWWRNLI-GQRPADFKEREDCEVREMEACLIVAKDKCAG 124
QCE GARIGCS+ AS KCKPPWW L+ ++ D KERE CE EMEACL VAK+KC G
Sbjct: 68 EQCEVGARIGCSITASNKCKPPWWGFLLRSKKGLDLKEREQCEELEMEACLAVAKEKCVG 127
Query: 125 FAKDRLLRPFQDARIAG 141
FAK++ +PF +AR+ G
Sbjct: 128 FAKEKCYKPFMEARVVG 144
>gi|18412754|ref|NP_567146.1| uncharacterized protein [Arabidopsis thaliana]
gi|98960925|gb|ABF58946.1| At3g63390 [Arabidopsis thaliana]
gi|332646953|gb|AEE80474.1| uncharacterized protein [Arabidopsis thaliana]
Length = 175
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/174 (54%), Positives = 117/174 (67%), Gaps = 17/174 (9%)
Query: 11 EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
+++SAA+T + FD+PIPLLRGPVP S GSY+LAFRS SW++A++ CE+ + QCE
Sbjct: 6 DDFSAAATTVVFDRPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCEE 61
Query: 71 GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
GA+IGC+V+AS CKPPWWR G D +ERE CE RE E C+ AK KCA FAKD+
Sbjct: 62 GAKIGCAVSASNNCKPPWWRGSGG----DMREREKCEEREFEGCVAAAKGKCAAFAKDKC 117
Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
F DARIA KEVE LV AS+PE R WRDL+G LG R+ TN
Sbjct: 118 SGAFLDARIAKD---KEVEGLVWLASMPEESR--WRDLMG---LGG-SLRLHTN 162
>gi|7573322|emb|CAB87792.1| putative protein [Arabidopsis thaliana]
gi|21554112|gb|AAM63192.1| unknown [Arabidopsis thaliana]
Length = 174
Score = 150 bits (378), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/174 (54%), Positives = 117/174 (67%), Gaps = 17/174 (9%)
Query: 11 EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
+++SAA+T + FD+PIPLLRGPVP S GSY+LAFRS SW++A++ CE+ + QCE
Sbjct: 5 DDFSAAATTVVFDRPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCEE 60
Query: 71 GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
GA+IGC+V+AS CKPPWWR G D +ERE CE RE E C+ AK KCA FAKD+
Sbjct: 61 GAKIGCAVSASNNCKPPWWRGSGG----DMREREKCEEREFEGCVAAAKGKCAAFAKDKC 116
Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
F DARIA KEVE LV AS+PE R WRDL+G LG R+ TN
Sbjct: 117 SGAFLDARIAKD---KEVEGLVWLASMPEESR--WRDLMG---LGG-SLRLHTN 161
>gi|110736892|dbj|BAF00403.1| hypothetical protein [Arabidopsis thaliana]
Length = 167
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 95/171 (55%), Positives = 114/171 (66%), Gaps = 17/171 (9%)
Query: 14 SAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGAR 73
SAA+T + FD+PIPLLRGPVP S GSY+LAFRS SW++A++ CE+ + QCE GA+
Sbjct: 1 SAAATTVVFDRPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCEEGAK 56
Query: 74 IGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRLLRP 133
IGC+V+AS CKPPWWR G D +ERE CE RE E C+ AK KCA FAKD+
Sbjct: 57 IGCAVSASNNCKPPWWRGSGG----DMREREKCEEREFEGCVAAAKGKCAAFAKDKCSGA 112
Query: 134 FQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
F DARIA KEVE LV AS+PE R WRDL+G LG R+ TN
Sbjct: 113 FLDARIAKD---KEVEGLVWLASMPEESR--WRDLMG---LGG-SLRLHTN 154
>gi|125535103|gb|EAY81651.1| hypothetical protein OsI_36821 [Oryza sativa Indica Group]
Length = 206
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 123/201 (61%), Gaps = 12/201 (5%)
Query: 5 GTNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKV 64
G+ E+YS A+TV++FD P+PLLR PVP S +LAFR SW +A+E+ E+ +
Sbjct: 8 GSVDQSEDYSPAATVVRFDPPLPLLRAPVPSSAASGEPPVLAFRDAASWRAAWEAAEASL 67
Query: 65 LSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAG 124
LSQCE GAR GCS+ AS+KCKPPWW+ L+G P D++ERE CE REM ACL A++ C
Sbjct: 68 LSQCEAGARSGCSITASRKCKPPWWKGLLGGAPTDYQERERCEEREMAACLEAAREACVK 127
Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVR-- 182
FAK++ + PF+DARIA G E K W + S + S++ F R
Sbjct: 128 FAKEKCIEPFRDARIASE-GLLENTKFAIWGAGSNRTSSASLCITNSQY----PFNPRPG 182
Query: 183 -TNYRAADLLGS----DNNYK 198
TNY+ +DLL S DNN K
Sbjct: 183 STNYKGSDLLDSFPSEDNNIK 203
>gi|115486277|ref|NP_001068282.1| Os11g0616200 [Oryza sativa Japonica Group]
gi|77551945|gb|ABA94742.1| expressed protein [Oryza sativa Japonica Group]
gi|113645504|dbj|BAF28645.1| Os11g0616200 [Oryza sativa Japonica Group]
gi|125577881|gb|EAZ19103.1| hypothetical protein OsJ_34635 [Oryza sativa Japonica Group]
gi|215697312|dbj|BAG91306.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 206
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 123/201 (61%), Gaps = 12/201 (5%)
Query: 5 GTNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKV 64
G+ E+YS A+TV++FD P+PLLR PVP S +LAFR SW +A+E+ E+ +
Sbjct: 8 GSVDQSEDYSPAATVVRFDPPLPLLRAPVPSSAASGEPPVLAFRDAASWRAAWEAAEASL 67
Query: 65 LSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAG 124
LSQCE GAR GCS+ AS+KCKPPWW+ L+G P D++ERE CE REM ACL A++ C
Sbjct: 68 LSQCEAGARSGCSITASRKCKPPWWKGLLGGAPTDYQERERCEEREMAACLEAAREACVK 127
Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVR-- 182
FAK++ + PF+DARIA G E K W + S + S++ F R
Sbjct: 128 FAKEKCIEPFRDARIASE-GLLENTKFAIWGAGSNRTSSASLCIANSQY----PFNPRPG 182
Query: 183 -TNYRAADLLGS----DNNYK 198
TNY+ +DLL S DNN K
Sbjct: 183 STNYKGSDLLDSFPSEDNNIK 203
>gi|357156167|ref|XP_003577364.1| PREDICTED: uncharacterized protein LOC100836805 [Brachypodium
distachyon]
Length = 202
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/183 (43%), Positives = 112/183 (61%)
Query: 11 EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
E+YS A+TV++FD P+PLLR PVP +LAFR SW +A+E+ E+ + SQCE
Sbjct: 14 EDYSPAATVVRFDSPLPLLRAPVPSSAAGDEPPVLAFRDAASWRAAWEAAEASLFSQCEA 73
Query: 71 GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
GAR GCS+ A++KCKPPWW+ L G P D++ERE CE REM +CL AK+ C F+K++
Sbjct: 74 GARSGCSIMATRKCKPPWWKGLFGGAPTDYQEREQCEEREMASCLESAKEACIKFSKEKC 133
Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAADL 190
PF+DARIA ++ + +V A+ + S + + TNY+ +DL
Sbjct: 134 SAPFRDARIASEGLLEDTDFVVWGAAAGNNKTSSASFCVANRQHTPNPGAAATNYKGSDL 193
Query: 191 LGS 193
L S
Sbjct: 194 LDS 196
>gi|242069173|ref|XP_002449863.1| hypothetical protein SORBIDRAFT_05g024570 [Sorghum bicolor]
gi|241935706|gb|EES08851.1| hypothetical protein SORBIDRAFT_05g024570 [Sorghum bicolor]
Length = 202
Score = 126 bits (317), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 115/188 (61%), Gaps = 2/188 (1%)
Query: 6 TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
+ + E+YSA +T+++FD P+PLLR PV PS +LAFR SW +A+E+ E+ ++
Sbjct: 7 ADGSAEDYSAEATLVRFDPPLPLLRAPVRSPAPSGEGPVLAFRDAASWRAAWEAAEANLV 66
Query: 66 SQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGF 125
SQCE GAR GCS++AS+KCKPPWW+ L P D++ERE CE REM ACL AK+ C F
Sbjct: 67 SQCEAGARSGCSISASRKCKPPWWKGLFVAAPTDYEERERCEEREMAACLEAAKEACIKF 126
Query: 126 AKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNY 185
AK + + PF+DARIA G E W + + S + ++ + TNY
Sbjct: 127 AKAKCIGPFRDARIASE-GLVENTDFHVWGAAADKSSSTSACALNNQQSFSPDPGA-TNY 184
Query: 186 RAADLLGS 193
R +D+L S
Sbjct: 185 RGSDVLDS 192
>gi|226530317|ref|NP_001143017.1| uncharacterized protein LOC100275481 [Zea mays]
gi|195612990|gb|ACG28325.1| hypothetical protein [Zea mays]
Length = 180
Score = 122 bits (306), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 96/139 (69%), Gaps = 4/139 (2%)
Query: 6 TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
+ + E+YSA +TV++FD P+PLLR PVP S +LAFR SW +A+E+ E+ ++
Sbjct: 7 ADDSAEDYSAEATVVRFDPPLPLLRAPVPSPASSGEGPVLAFRDVASWRAAWEAAEASLV 66
Query: 66 SQCEHGARIGCSVNASKKCKPPWWRNLI----GQRPADFKEREDCEVREMEACLIVAKDK 121
SQCE GAR GCSV+AS+KCKPPWW+ L+ P D++ER CE REM ACL A++
Sbjct: 67 SQCEAGARSGCSVSASRKCKPPWWKRLLVVVAAAPPTDYEERSRCEEREMAACLEAAREA 126
Query: 122 CAGFAKDRLLRPFQDARIA 140
CA FAK + PFQDARIA
Sbjct: 127 CAKFAKAKCAGPFQDARIA 145
>gi|242034065|ref|XP_002464427.1| hypothetical protein SORBIDRAFT_01g018080 [Sorghum bicolor]
gi|241918281|gb|EER91425.1| hypothetical protein SORBIDRAFT_01g018080 [Sorghum bicolor]
Length = 202
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 110/187 (58%), Gaps = 2/187 (1%)
Query: 7 NSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLS 66
+ + ++YSA +T + FD P+PLLR PV PS +LAFR SW +A+E+ E+ ++S
Sbjct: 8 DDSAQDYSAKATAVPFDPPLPLLRAPVRFPAPSGEDPVLAFRDAASWRAAWEAAEASLVS 67
Query: 67 QCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFA 126
QCE GAR GCS+ AS+KCKPPWW+ L P D++ERE CE REM ACL AK+ C FA
Sbjct: 68 QCEVGARSGCSIGASRKCKPPWWKGLFVAAPTDYEERERCEEREMAACLEAAKEACIKFA 127
Query: 127 KDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYR 186
K + PF DARIA G E W + + S + + + V TNYR
Sbjct: 128 KGKCTGPFGDARIASE-GLLENTDFDVWGAAVDKSSSTSGCALNNRQSFSPDPGV-TNYR 185
Query: 187 AADLLGS 193
+D+L S
Sbjct: 186 GSDVLDS 192
>gi|23928446|gb|AAN40032.1| putative protein [Zea mays]
Length = 165
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/142 (50%), Positives = 95/142 (66%), Gaps = 7/142 (4%)
Query: 6 TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
+ + E+YSA +TV++FD P+PLLR PVP S +LAFR SW +A+E+ E+ ++
Sbjct: 7 ADDSAEDYSAEATVVRFDPPLPLLRAPVPSPASSGEGPVLAFRDAASWRAAWEAAEASLV 66
Query: 66 SQC---EHGARIGCSVNASKKCKPPWWRNLI----GQRPADFKEREDCEVREMEACLIVA 118
SQC E GAR GCSV+AS+KCKPPWW+ L+ P D++ER CE REM ACL A
Sbjct: 67 SQCELAEAGARSGCSVSASRKCKPPWWKRLLVVVAAAPPTDYEERSRCEEREMAACLEAA 126
Query: 119 KDKCAGFAKDRLLRPFQDARIA 140
++ C FAK + PFQDARIA
Sbjct: 127 REACVKFAKAKCAGPFQDARIA 148
>gi|168035348|ref|XP_001770172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678549|gb|EDQ65006.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 141
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/133 (40%), Positives = 85/133 (63%), Gaps = 6/133 (4%)
Query: 14 SAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
S + VI FD +PL+R PVP +D S G Y+LAFR+ ++W A+++C+ K+ +QCE
Sbjct: 6 SPSVLVIAFDPFLPLVRIPVPVGRDEDSSKGPYVLAFRNEETWRQAWKTCKEKIATQCEA 65
Query: 71 GARIGCSVNASKKCKPPWWRNLI---GQRPADFKEREDCEVREMEACLIVAKDKCAGFAK 127
GA++GCSV+A+ C+ PWW++ + G D R+DCE REM CL+ + ++C FA+
Sbjct: 66 GAKVGCSVSAAMACRQPWWKSYLPFSGSGSDDQFRRQDCEGREMRGCLLSSHERCFTFAR 125
Query: 128 DRLLRPFQDARIA 140
+ + + RIA
Sbjct: 126 EICDPVYANMRIA 138
>gi|302784989|ref|XP_002974266.1| hypothetical protein SELMODRAFT_101436 [Selaginella moellendorffii]
gi|300157864|gb|EFJ24488.1| hypothetical protein SELMODRAFT_101436 [Selaginella moellendorffii]
Length = 238
Score = 109 bits (273), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 101/194 (52%), Gaps = 9/194 (4%)
Query: 12 EYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQC 68
+ S ++T I F+ +P+LR PVP DDPS G ++LAF+ SW+ A+++CE ++ SQC
Sbjct: 47 DLSPSATTIAFEPVLPVLRVPVPAGSDDDPSKGPFVLAFKDEASWSRAWQNCEKQITSQC 106
Query: 69 EHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKD 128
E GCS+ ASK C+ PWW+ + + RE CE + M ACL ++ FA+
Sbjct: 107 EAATMKGCSITASKACRSPWWKAFVPNFNTSTEGREACEEKAMMACLAASRRDGIKFARG 166
Query: 129 RLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAA 188
F DARIA ++ + ++ + + + LI E G FR R+ A
Sbjct: 167 VCEEVFGDARIANKLQFANPQGHTNRSTENASTSNHNQVLI--ENPGSTNFRGRSLTDGA 224
Query: 189 DLLGSDNNYKRFLG 202
D D +Y+ LG
Sbjct: 225 D----DKHYQSSLG 234
>gi|302807885|ref|XP_002985636.1| hypothetical protein SELMODRAFT_122681 [Selaginella moellendorffii]
gi|300146545|gb|EFJ13214.1| hypothetical protein SELMODRAFT_122681 [Selaginella moellendorffii]
Length = 284
Score = 109 bits (272), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 102/194 (52%), Gaps = 9/194 (4%)
Query: 12 EYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQC 68
+ S ++ I F+ +P+LR PVP DDPS G ++LAF+ SW+ A+++CE ++ SQC
Sbjct: 47 DLSPSAITIAFEPVLPVLRVPVPAGSDDDPSKGPFVLAFKDEASWSRAWQNCEKQITSQC 106
Query: 69 EHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKD 128
E GCS+ ASK C+ PWW+ + ++RE CE + M ACL ++ FA+
Sbjct: 107 EAATMKGCSITASKACRSPWWKAFVPNFNTSTEDREACEEKAMMACLAASRRDGIKFARG 166
Query: 129 RLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAA 188
F DARIA ++ + + ++ + + + LI E G FR R+ A
Sbjct: 167 VCEEVFGDARIANKLQFADPQGHTNRSTENASTSNHNQVLI--ENPGSTNFRGRSLTDGA 224
Query: 189 DLLGSDNNYKRFLG 202
D D +Y+ LG
Sbjct: 225 D----DKHYQSSLG 234
>gi|168044182|ref|XP_001774561.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674116|gb|EDQ60629.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 293
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 89/139 (64%), Gaps = 6/139 (4%)
Query: 8 SNMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKV 64
S+ S TV+ FD IPL R PVP +DPS G Y+LAFR+ ++W ++ +C+ V
Sbjct: 15 SSQVNLSPYVTVVAFDPLIPLARIPVPAGQEEDPSKGPYVLAFRNDETWRQSWRTCKENV 74
Query: 65 LSQCEHGARIGCSVNASKKCKPPWWRN---LIGQRPADFKEREDCEVREMEACLIVAKDK 121
++QCE GA++GCS++A+K C+ PWW+ +G D +RE+CE REM +CL+ + ++
Sbjct: 75 VTQCEAGAKVGCSISAAKACRQPWWKAYLPFLGSGSDDPSKREECERREMRSCLLSSHER 134
Query: 122 CAGFAKDRLLRPFQDARIA 140
C +A++ + + + RIA
Sbjct: 135 CFTYARETCVPVYSNMRIA 153
>gi|302795037|ref|XP_002979282.1| hypothetical protein SELMODRAFT_110619 [Selaginella
moellendorffii]
gi|300153050|gb|EFJ19690.1| hypothetical protein SELMODRAFT_110619 [Selaginella
moellendorffii]
Length = 60
Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 39/60 (65%), Gaps = 3/60 (5%)
Query: 32 PVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASKKCKPPW 88
PVP +DDPS G ++LAF+ SW+ A++ CE ++ S CE GCS+ ASK C+ PW
Sbjct: 1 PVPAGSNDDPSKGPFILAFKDKASWSHAWQYCEKQITSYCEAATMKGCSITASKACRSPW 60
>gi|302813810|ref|XP_002988590.1| hypothetical protein SELMODRAFT_128383 [Selaginella
moellendorffii]
gi|300143697|gb|EFJ10386.1| hypothetical protein SELMODRAFT_128383 [Selaginella
moellendorffii]
Length = 60
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 38/60 (63%), Gaps = 3/60 (5%)
Query: 32 PVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASKKCKPPW 88
PVP +DDPS G ++LAF+ SW+ A++ CE ++ S CE GCS+ ASK C PW
Sbjct: 1 PVPTGSNDDPSKGPFILAFKDKASWSRAWQYCEKQITSYCEAATMKGCSITASKACCSPW 60
>gi|302795041|ref|XP_002979284.1| hypothetical protein SELMODRAFT_59163 [Selaginella
moellendorffii]
gi|300153052|gb|EFJ19692.1| hypothetical protein SELMODRAFT_59163 [Selaginella
moellendorffii]
Length = 61
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 40/57 (70%), Gaps = 3/57 (5%)
Query: 16 ASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCE 69
++T+I F+ +P+LR P+P DDPS G ++LAF+ SW+ A++ CE ++ SQC+
Sbjct: 5 SATMIAFEPVLPVLRVPIPTGSDDDPSKGPFILAFKDEASWSHAWQYCEKQITSQCK 61
>gi|307102844|gb|EFN51111.1| hypothetical protein CHLNCDRAFT_141467 [Chlorella variabilis]
Length = 112
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 53/107 (49%), Gaps = 2/107 (1%)
Query: 45 LAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASKKCKPPWWRNLIGQRP-ADFKER 103
LAF S S + CE KV+ C G R GCS+ A+++C P L+G P ++ER
Sbjct: 4 LAFASEASREQQHRGCEGKVIEVCTEGLRAGCSLQAARQCASPPLLRLLGLVPEQSWEER 63
Query: 104 EDCEVREMEACLIVAKDKCAGFAKDRLLRPFQDARIAGRVGPKEVEK 150
E C+ R M+ACL C A+ F +AG+ +E EK
Sbjct: 64 EACQQRCMDACLAQQTPACQQHAQHFCADVF-AGPVAGQQQGREQEK 109
>gi|413920417|gb|AFW60349.1| hypothetical protein ZEAMMB73_302102 [Zea mays]
Length = 89
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 44/64 (68%)
Query: 6 TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
+ + E+YSA +TV++FD P+PLLR PVP S +LAFR SW +A+E+ E+ ++
Sbjct: 7 ADDSAEDYSAEATVVRFDPPLPLLRAPVPSPASSGEGPVLAFRDAASWRAAWEAAEASLV 66
Query: 66 SQCE 69
SQCE
Sbjct: 67 SQCE 70
>gi|406920230|gb|EKD58332.1| hypothetical protein ACD_56C00148G0013 [uncultured bacterium]
Length = 327
Score = 39.3 bits (90), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 98 ADFKEREDCEV---REMEACLIVAKDKCAGFAKDRLLRPFQDARIAGRVGPKEVEKLVCW 154
ADF+ +E E RE E ++KDK F K++L P+ D RI G K+V+K++
Sbjct: 17 ADFRSKEKIEKMLKREKEKYDGLSKDKKELFDKEKLTNPYSDTRIHFDGGAKQVKKIMAG 76
Query: 155 ASLPETERSLWRDL 168
+ +E + R L
Sbjct: 77 IDIDSSELMVARYL 90
>gi|413920416|gb|AFW60348.1| hypothetical protein ZEAMMB73_751749 [Zea mays]
Length = 68
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 111 MEACLIVAKDKCAGFAKDRLLRPFQDARIA 140
M ACL A++ CA FAK + PFQDARIA
Sbjct: 1 MAACLEAAREACAKFAKAKCAGPFQDARIA 30
>gi|254385548|ref|ZP_05000874.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194344419|gb|EDX25385.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length = 415
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 47/104 (45%), Gaps = 12/104 (11%)
Query: 88 WWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRLLRPFQDARIAGRVGPKE 147
WWR G+ F+ + ++ A A + AG R + DA++ +
Sbjct: 196 WWRLATGEEEFSFRPYKP----KLHAVYRTAAGEVAGLVSYRSDDHWTDAKVP--LNTLR 249
Query: 148 VEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAADLL 191
VE+L+ A+ PE ER+LW L +W R+RT RA D L
Sbjct: 250 VEELL--AATPEAERALWHFLCSVDW----ALRIRTGLRATDDL 287
>gi|260655401|ref|ZP_05860889.1| lipopolysaccharide O-Ag biosynthesis protein FlmB [Jonquetella
anthropi E3_33 E1]
gi|424845213|ref|ZP_18269824.1| putative PLP-dependent enzyme possibly involved in cell wall
biogenesis [Jonquetella anthropi DSM 22815]
gi|260629849|gb|EEX48043.1| lipopolysaccharide O-Ag biosynthesis protein FlmB [Jonquetella
anthropi E3_33 E1]
gi|363986651|gb|EHM13481.1| putative PLP-dependent enzyme possibly involved in cell wall
biogenesis [Jonquetella anthropi DSM 22815]
Length = 386
Score = 36.2 bits (82), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 8/111 (7%)
Query: 23 DKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASK 82
D P+ L GP P + S ++L C + + + A+IGCSV+
Sbjct: 279 DLPVQLPMGPTPEREHSWHLFMLRLSDDAPI------CRDNFIQEMSN-AKIGCSVHYIP 331
Query: 83 KCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVA-KDKCAGFAKDRLLR 132
P+W+ +P DF ED RE+ + A KD + +++ R
Sbjct: 332 LHAQPYWKQRYNLQPQDFPVAEDAWKREVSLPIFTAMKDSDVDYVIEQVRR 382
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,378,818,237
Number of Sequences: 23463169
Number of extensions: 133925712
Number of successful extensions: 238602
Number of sequences better than 100.0: 32
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 238501
Number of HSP's gapped (non-prelim): 33
length of query: 202
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 67
effective length of database: 9,191,667,552
effective search space: 615841725984
effective search space used: 615841725984
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)