BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021987
(304 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q99089|CPRF1_PETCR Common plant regulatory factor 1 OS=Petroselinum crispum GN=CPRF1
PE=2 SV=1
Length = 411
Score = 296 bits (758), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 206/308 (66%), Gaps = 19/308 (6%)
Query: 1 MGNNEDGKSFKSEK---PSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGH 57
MGN +D K+ K EK P P DQ N H+Y DWAAMQAYYGPRVA+PPY+N +ASG
Sbjct: 1 MGNTDDVKAVKPEKLSSPPPPAAPDQSNSHVYPDWAAMQAYYGPRVALPPYFNPAVASGQ 60
Query: 58 APQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTE 117
+P PYMWGP QP+MPPYG PYAA+Y+ GGVYAHP VPL A +P++ +
Sbjct: 61 SPHPYMWGPPQPVMPPYGVPYAALYAHGGVYAHPGVPL--------------AASPMSMD 106
Query: 118 APTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQS-EADGSTDGSDGNT 176
KSSG + GL KKLKG D LAMSIGN A+S+EG E+ SQS E +GS+DGS+ N+
Sbjct: 107 THAKSSGTNEHGLIKKLKGHDDLAMSIGNGKADSSEGEMERTLSQSKETEGSSDGSNENS 166
Query: 177 VRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPV 236
RA + +KR R+ P G+ K + QS+ +P A +K+L VA V+GK VG V
Sbjct: 167 KRAAVNGRKRGRDEAPNMIGEVKIETQSSVIP-SPRAKSEKLLGITVATPMVAGKVVGTV 225
Query: 237 LSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARR 296
+SP M + LEL+++P + SP QP ++P ++W+ N+R+LKRERRKQSNRESARR
Sbjct: 226 VSPSMTSSLELKDSPKEHAVNSPAGGQQPSTMMPNDSWLHNDRDLKRERRKQSNRESARR 285
Query: 297 SRLRKQVK 304
SRLRKQ +
Sbjct: 286 SRLRKQAE 293
>sp|P42776|GBF3_ARATH G-box-binding factor 3 OS=Arabidopsis thaliana GN=GBF3 PE=1 SV=2
Length = 382
Score = 270 bits (690), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 160/296 (54%), Positives = 194/296 (65%), Gaps = 27/296 (9%)
Query: 11 KSEKPSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIA-SGHAPQPYMWGPAQP 69
KS+KPSSPP DQ N+H+Y DWAAMQAYYGPRVA+PPYYNS +A SGH P PYMW P Q
Sbjct: 13 KSDKPSSPP-VDQTNVHVYPDWAAMQAYYGPRVAMPPYYNSAMAASGHPPPPYMWNP-QH 70
Query: 70 MMPPYGAPYAAIY-STGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTEAPTKSSGNADR 128
MM PYGAPYAA+Y GGVYAHP +P+GS P T L+ + PTKS+GN D
Sbjct: 71 MMSPYGAPYAAVYPHGGGVYAHPGIPMGSLPQGQKDPPLTTPGTLLSIDTPTKSTGNTDN 130
Query: 129 GLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQSEADGSTDGSDGNTVRAGQSRKKRSR 188
GL KKLK DGLAMS+GN + E+ ++ + SE DGSTDGSDGNT A + + KRSR
Sbjct: 131 GLMKKLKEFDGLAMSLGNGNPENGADEHKRSRNSSETDGSTDGSDGNTTGADEPKLKRSR 190
Query: 189 EGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELR 248
EGTP DGK +Q++ ++ D T V SG +LS
Sbjct: 191 EGTPTK--DGKQLVQASSFHSVSPSSGD----TGVKLIQGSGA----ILS---------- 230
Query: 249 NAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARRSRLRKQVK 304
PG++ ++P + Q A++PPETW+QNERELKRERRKQSNRESARRSRLRKQ +
Sbjct: 231 --PGVSANSNPF-MSQSLAMVPPETWLQNERELKRERRKQSNRESARRSRLRKQAE 283
>sp|P42775|GBF2_ARATH G-box-binding factor 2 OS=Arabidopsis thaliana GN=GBF2 PE=1 SV=1
Length = 360
Score = 218 bits (554), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 177/308 (57%), Gaps = 39/308 (12%)
Query: 1 MGNNEDGK-SFKSEKPSSPPPSDQGNIHMYT-DWAAMQAYYGPRVAIPPYYNSPIASGHA 58
MG+NE+G + S+KPS +Q N+H+Y DWAAMQAYYGPRV IP YYNS +A GHA
Sbjct: 1 MGSNEEGNPTNNSDKPSQAAAPEQSNVHVYHHDWAAMQAYYGPRVGIPQYYNSNLAPGHA 60
Query: 59 PQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTEA 118
P PYMW PMM PYGAPY GGVYAHP V +GS ++ TPL +A
Sbjct: 61 PPPYMWASPSPMMAPYGAPYPPFCPPGGVYAHPGVQMGSQPQGPVSQSASGVTTPLTIDA 120
Query: 119 PTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQSEA--DGSTDGSDGNT 176
P S+GN+D G KKLK DGLAMSI N SAE + + S + DGS++GSDGNT
Sbjct: 121 PANSAGNSDHGFMKKLKEFDGLAMSISNNKVGSAEHSSSEHRSSQSSENDGSSNGSDGNT 180
Query: 177 VRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPV 236
G+ +++ R+ + G+ + S P+ G N PD + T
Sbjct: 181 T-GGEQSRRKRRQQRSPSTGERPSSQNSLPL-RGENEKPDVTMGT--------------- 223
Query: 237 LSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARR 296
P MPT + +N+ GMN VPQP W NE+E+KRE+RKQSNRESARR
Sbjct: 224 --PVMPTAMSFQNSAGMN------GVPQP--------W--NEKEVKREKRKQSNRESARR 265
Query: 297 SRLRKQVK 304
SRLRKQ +
Sbjct: 266 SRLRKQAE 273
>sp|Q99142|TAF1_TOBAC Transcriptional activator TAF-1 (Fragment) OS=Nicotiana tabacum
GN=TAF1 PE=2 SV=1
Length = 265
Score = 193 bits (491), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/226 (51%), Positives = 142/226 (62%), Gaps = 16/226 (7%)
Query: 85 GGVYAHPAVPLGSHAHNHGVPTSPAAV-----TPLNTEAPTKSSGNADRGLAKKLKGLDG 139
GGVYAHP VP+GSH HG+ TSPA L+ +A KSS N+DRGL
Sbjct: 3 GGVYAHPGVPIGSHPPGHGMATSPAVSQAMDGASLSLDASAKSSENSDRGL--------- 53
Query: 140 LAMSIGNASAESAEGGAEQRPSQS-EADGSTDGSDGNTVRAGQSRKKRSREGTPIAGGDG 198
LAMS+GN SA++ EGGA+ SQS + + STDGSD N + KKRSRE TP GD
Sbjct: 54 LAMSLGNGSADNIEGGADHGNSQSGDTEDSTDGSDTNGAGVSERSKKRSRETTPDNSGDS 113
Query: 199 KTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELRNAPGMNVKAS 258
K+ ++ +N +K + AV P V K +G VLSP M T LE+RN ++KAS
Sbjct: 114 KSHLRRCQPTGEINDDSEKAI-VAVRPGKVGEKVMGTVLSPSMTTTLEMRNPASTHLKAS 172
Query: 259 PTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARRSRLRKQVK 304
PT+V Q LP E W+QNERELKRE+RKQSNRESARRSRLRKQ +
Sbjct: 173 PTNVSQLSPALPNEAWLQNERELKREKRKQSNRESARRSRLRKQAE 218
>sp|P42774|GBF1_ARATH G-box-binding factor 1 OS=Arabidopsis thaliana GN=GBF1 PE=2 SV=2
Length = 315
Score = 108 bits (270), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 103/311 (33%), Positives = 141/311 (45%), Gaps = 72/311 (23%)
Query: 1 MGNNEDGKSFKSEKPSSP----PPSDQGNIHMYTDWA-AMQAYYGPRVAIPPYYNSPIAS 55
MG +ED FK+ KP+S PP+ Y DW +MQAYYG P++ SP+ S
Sbjct: 1 MGTSEDKMPFKTTKPTSSAQEVPPTP------YPDWQNSMQAYYGGGGTPNPFFPSPVGS 54
Query: 56 GHAPQPYMWGPAQPMMPPYG--APYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTP 113
+P PYMWG MMPPYG PY A+Y G VYAHP++P+ P P
Sbjct: 55 -PSPHPYMWGAQHHMMPPYGTPVPYPAMYPPGAVYAHPSMPM------------PPNSGP 101
Query: 114 LNTEAPTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQSEADGSTDGSD 173
N E P K + + K +G ++ + + A E S GS+D +D
Sbjct: 102 TNKE-PAKDQASGKKSKGNSKKKAEGGDKALSGSGNDGASHSDE-----SVTAGSSDEND 155
Query: 174 GNTVRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPV 233
N + Q ++ G +A ++ T SV KPV
Sbjct: 156 ENANQQEQGSIRKPSFGQMLADASSQS-------------------TTGEIQGSVPMKPV 196
Query: 234 GPVLSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRES 293
P T L + GM++ +S VP +++ERELKR++RKQSNRES
Sbjct: 197 APG------TNLNI----GMDLWSSQAGVP-----------VKDERELKRQKRKQSNRES 235
Query: 294 ARRSRLRKQVK 304
ARRSRLRKQ +
Sbjct: 236 ARRSRLRKQAE 246
>sp|Q99091|CPRF3_PETCR Light-inducible protein CPRF3 OS=Petroselinum crispum GN=CPRF3 PE=2
SV=1
Length = 296
Score = 75.9 bits (185), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 104/259 (40%), Gaps = 88/259 (33%)
Query: 48 YYNSPIASGHAPQPYMWGPAQPMMPPYGAP--YAAIYSTGGVYAHPAVPLGSHAHNHGVP 105
+Y S + S +P PYMW + PYG P Y A++ GG++ HP VP
Sbjct: 48 FYASTVGSP-SPHPYMWRNQHRFILPYGIPMQYPALFLPGGIFTHPI-----------VP 95
Query: 106 TSPAAVTPLNTEAPTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQSEA 165
T P + P + E K S R AKK G+ G +++ + + GAE + + S +
Sbjct: 96 TDP-NLAPTSGEVGRKISDEKGRTSAKKSIGVSG-------STSFAVDKGAENQKAASSS 147
Query: 166 DGSTDGSDGNTVRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAP 225
D G DG +++S P+ V AP
Sbjct: 148 DNDCPSLSSEN------------------GVDGSLEVRSNPLDVA-------------AP 176
Query: 226 TSVSGKPVGPVLSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRER 285
++ V+ GM P+ + +ERELKR+R
Sbjct: 177 GAI-------VVHDGM----------------------------LPDQRVNDERELKRQR 201
Query: 286 RKQSNRESARRSRLRKQVK 304
RKQSNRESARRSRLRKQ K
Sbjct: 202 RKQSNRESARRSRLRKQAK 220
>sp|P25032|EMBP1_WHEAT DNA-binding protein EMBP-1 OS=Triticum aestivum PE=1 SV=1
Length = 354
Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/26 (96%), Positives = 26/26 (100%)
Query: 277 NERELKRERRKQSNRESARRSRLRKQ 302
+ERELKRERRKQSNRESARRSRLRKQ
Sbjct: 247 DERELKRERRKQSNRESARRSRLRKQ 272
>sp|P23922|HBP1A_WHEAT Transcription factor HBP-1a OS=Triticum aestivum PE=2 SV=1
Length = 349
Score = 48.5 bits (114), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 28/33 (84%), Gaps = 2/33 (6%)
Query: 272 ETWIQNERELKRERRKQSNRESARRSRLRKQVK 304
E W +ERELK+++RK SNRESARRSRLRKQ +
Sbjct: 246 EQW--DERELKKQKRKLSNRESARRSRLRKQAE 276
>sp|Q54Y73|BZPD_DICDI Probable basic-leucine zipper transcription factor D
OS=Dictyostelium discoideum GN=bzpD PE=3 SV=1
Length = 834
Score = 38.1 bits (87), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 264 QPCAVLPPETWIQ-NERELKRERRKQSNRESARRSRLRKQV 303
+P V+ P T Q +ER +K++RR NRESA+ SR+RK++
Sbjct: 374 EPVQVVDPPTHNQEDERNVKKQRRLIKNRESAQLSRMRKKI 414
>sp|P24068|OCS1_MAIZE Ocs element-binding factor 1 OS=Zea mays GN=OBF1 PE=2 SV=2
Length = 151
Score = 37.7 bits (86), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 17/21 (80%), Positives = 20/21 (95%)
Query: 282 KRERRKQSNRESARRSRLRKQ 302
+RE+R+ SNRESARRSRLRKQ
Sbjct: 26 RREKRRLSNRESARRSRLRKQ 46
>sp|P08678|CYAA_YEAST Adenylate cyclase OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=CYR1 PE=1 SV=2
Length = 2026
Score = 35.8 bits (81), Expect = 0.44, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 40/100 (40%), Gaps = 6/100 (6%)
Query: 200 TDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELRNAPGMNVKASP 259
TDI T P VN+T + T A P GP S M AP ++
Sbjct: 395 TDITKTVPPTAVNSTLNSTHGTETASPKTVIMPEGPRKSVSMADLSVAAAAPNGEFTSTS 454
Query: 260 TSVPQPCAVLPPETWIQNERELKRERRKQSNRESARRSRL 299
Q A P++W + E KR++ K R +RRS +
Sbjct: 455 NDRSQWVA---PQSW---DVETKRKKTKPKGRSKSRRSSI 488
>sp|Q9FGX2|BZIP1_ARATH Basic leucine zipper 1 OS=Arabidopsis thaliana GN=BZIP1 PE=1 SV=1
Length = 145
Score = 35.0 bits (79), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 16/21 (76%), Positives = 19/21 (90%)
Query: 282 KRERRKQSNRESARRSRLRKQ 302
K+ +RK SNRESARRSRL+KQ
Sbjct: 16 KKRKRKLSNRESARRSRLKKQ 36
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.308 0.127 0.376
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 131,697,791
Number of Sequences: 539616
Number of extensions: 6335166
Number of successful extensions: 20544
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 533
Number of HSP's that attempted gapping in prelim test: 19738
Number of HSP's gapped (non-prelim): 1305
length of query: 304
length of database: 191,569,459
effective HSP length: 117
effective length of query: 187
effective length of database: 128,434,387
effective search space: 24017230369
effective search space used: 24017230369
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 61 (28.1 bits)