BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022223
(300 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q2V9B0|MY1R1_SOLTU Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1
Length = 297
Score = 162 bits (411), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 135/300 (45%), Positives = 166/300 (55%), Gaps = 41/300 (13%)
Query: 23 GGGSSGAGSEENGIMLFGVRVTEGSSSFRKSISMDNLSQFDQPQDSNA------------ 70
GGG G IMLFGVRV RKS+S+++LSQ++ P +N
Sbjct: 17 GGGFGGE------IMLFGVRVK--VDPMRKSVSLNDLSQYEHPNANNNNNGGDNNESSKV 68
Query: 71 --DAGYASDD--IVHASGRSRERKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTR 126
D GYAS D + H S RERKRGVPWTE+EH+LFLLGLQKVGKGDWRGISRNFVKTR
Sbjct: 69 AQDEGYASADDAVQHQSNSGRERKRGVPWTEEEHKLFLLGLQKVGKGDWRGISRNFVKTR 128
Query: 127 TPTQVASHAQKYFLRRFNQNKRRRRSSLFDITASST-----DTARHQETITVPLPQPQLN 181
TPTQVASHAQKYFLRR N N+RRRRSSLFDIT S + +++ I V P
Sbjct: 129 TPTQVASHAQKYFLRRSNLNRRRRRSSLFDITTDSVSVMPIEEVENKQEIPVVAPATLPT 188
Query: 182 RHPGGFPVSAYPVKHSAVVLPFTGEKSMGNLTIGPSNKPRTSPKLIRPIPIHPVPPSSKL 241
FPV+ ++ P +KS T+ + +S L+ P+P+ +P S
Sbjct: 189 TKTNAFPVAP---TVGPIIFPVQIDKSREYPTLLRHDHGNSS-MLVGPVPMFSMPNPSTA 244
Query: 242 GGLNLNQKATHVDVDTLPLSLKLSTPSSSSDEQSAQTPHSSAAFQAMSS-GDSNSIISVA 300
LN N +T P SL L S Q++ T HS A+ MSS + SII VA
Sbjct: 245 IDLNANHNST-----IEPSSLSLRLSLSLDQGQASSTRHS--AYNVMSSFSNGESIIRVA 297
>sp|Q8S9H7|DIV_ANTMA Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA
PE=2 SV=1
Length = 307
Score = 120 bits (302), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 55/79 (69%), Positives = 68/79 (86%), Gaps = 2/79 (2%)
Query: 82 ASGR--SRERKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
+SGR +ERK+GVPWTE+EH+LFL+GL+K GKGDWR ISRNFV TRTPTQVASHAQKYF
Sbjct: 119 SSGRPSEQERKKGVPWTEEEHKLFLMGLKKYGKGDWRNISRNFVITRTPTQVASHAQKYF 178
Query: 140 LRRFNQNKRRRRSSLFDIT 158
+R+ + K +RR+S+ DIT
Sbjct: 179 IRQLSGGKDKRRASIHDIT 197
>sp|Q869R9|MYBJ_DICDI Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1
Length = 734
Score = 73.2 bits (178), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 3/68 (4%)
Query: 83 SGRSRERKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLRR 142
+G+ K+G WT++EH FL G+Q GKG W+ I++ FV TRTPTQ+ SHAQKY+LR+
Sbjct: 369 TGKKTSLKQG--WTKEEHIRFLNGIQIHGKGAWKEIAQ-FVGTRTPTQIQSHAQKYYLRQ 425
Query: 143 FNQNKRRR 150
+ K +R
Sbjct: 426 KQETKNKR 433
>sp|Q54HX6|MYBI_DICDI Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1
Length = 977
Score = 58.9 bits (141), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 38/57 (66%), Gaps = 1/57 (1%)
Query: 85 RSRERKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLR 141
+S ++K+ WT +EH F+ L K G D + IS+ +V TR PTQV +HAQKYFLR
Sbjct: 165 QSEKKKQSRYWTPEEHSRFIEALSKYGHKDVKSISQ-YVSTRNPTQVRTHAQKYFLR 220
>sp|Q54Z40|MYBH_DICDI Myb-like protein H OS=Dictyostelium discoideum GN=mybH PE=3 SV=1
Length = 1217
Score = 55.1 bits (131), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 92 GVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLRRFNQN 146
PW+ +EH LFL ++K G+G+W+ IS +K+R Q+ +HA+ YF + QN
Sbjct: 229 NTPWSNEEHELFLKAIEKYGRGNWKLIS-TLIKSRNTLQIKNHARIYFDKISQQN 282
Score = 38.9 bits (89), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLR 141
WT++E RLF+ K+ D + I + VKT+T QV SHAQK+ L+
Sbjct: 153 WTKEEERLFVEAY-KLYDKDNKKIQEH-VKTKTILQVRSHAQKFALK 197
>sp|Q6R0H0|ASG4_ARATH Transcription factor ASG4 OS=Arabidopsis thaliana GN=ASG4 PE=2 SV=1
Length = 287
Score = 48.1 bits (113), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLR 141
WTE EH FL L + DW+ I + FV ++T Q+ SHAQKYFL+
Sbjct: 64 WTEQEHDKFLEALHLFDR-DWKKI-KAFVGSKTVIQIRSHAQKYFLK 108
>sp|Q54IF9|MYBG_DICDI Myb-like protein G OS=Dictyostelium discoideum GN=mybG PE=3 SV=1
Length = 423
Score = 47.4 bits (111), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFLR 141
WT++EH+ FL L + DW+ I +FV ++T Q+ SHAQKYF++
Sbjct: 44 WTDEEHQKFLEALTLFDR-DWKKI-ESFVGSKTVIQIRSHAQKYFIK 88
>sp|P92973|CCA1_ARATH Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1
Length = 608
Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 5/54 (9%)
Query: 86 SRERKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
+++R+R WTE+EH F+ L+ G+ W+ I + V T+T Q+ SHAQK+F
Sbjct: 21 TKQRER---WTEEEHNRFIEALRLYGRA-WQKIEEH-VATKTAVQIRSHAQKFF 69
>sp|Q86HX9|MYBK_DICDI Myb-like protein K OS=Dictyostelium discoideum GN=mybK PE=3 SV=1
Length = 894
Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
WT +EH FL +Q+ G D+ I++ FV+TR QV +H Y
Sbjct: 657 WTSEEHNKFLEAVQQFGIKDYHAIAK-FVQTRNHHQVRTHVNTYL 700
>sp|Q69Z66|MYSM1_MOUSE Histone H2A deubiquitinase MYSM1 OS=Mus musculus GN=Mysm1 PE=2 SV=2
Length = 819
Score = 40.0 bits (92), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 93 VPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
V WT +E LF GL K G+ W I+ +K+RT QV S+A++YF
Sbjct: 116 VKWTVEEKELFEQGLAKFGRR-WTKIA-TLLKSRTVLQVKSYARQYF 160
>sp|Q5VVJ2|MYSM1_HUMAN Histone H2A deubiquitinase MYSM1 OS=Homo sapiens GN=MYSM1 PE=1 SV=1
Length = 828
Score = 39.7 bits (91), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 93 VPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
V WT +E LF GL K G+ W IS+ + +RT QV S+A++YF
Sbjct: 119 VKWTIEEKELFEQGLAKFGRR-WTKISK-LIGSRTVLQVKSYARQYF 163
>sp|Q5RGA4|MYSM1_DANRE Histone H2A deubiquitinase MYSM1 OS=Danio rerio GN=mysm1 PE=2 SV=2
Length = 822
Score = 38.9 bits (89), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
W E+E LF GL + G+ W I++ + TRT QV S+A++YF
Sbjct: 102 WAEEEKELFEKGLAQFGRR-WTKIAK-LIGTRTVLQVKSYAKQYF 144
>sp|Q6R0H1|LHY_ARATH Protein LHY OS=Arabidopsis thaliana GN=LHY PE=1 SV=2
Length = 645
Score = 38.9 bits (89), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 72/190 (37%), Gaps = 52/190 (27%)
Query: 86 SRERKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF--LRRF 143
+++R+R WTEDEH FL L+ G+ W+ I + + T+T Q+ SHAQK+F L +
Sbjct: 21 TKQRER---WTEDEHERFLEALRLYGRA-WQRIEEH-IGTKTAVQIRSHAQKFFTKLEKE 75
Query: 144 NQNKRRRRSSLFDITASSTDTARHQETITVPLPQPQLNRHPGGFPVSAYPVKHSAVVLPF 203
+ K DI R T P P R PG S+ V +
Sbjct: 76 AEVKGIPVCQALDIEIPPPRPKRKPNT---PYP-----RKPGNNGTSSSQVSSAK----- 122
Query: 204 TGEKSMGNLTIGPSNKPRTSPKLIRPIPIHPVPPSSKLGGLNLNQKATHVDVDTLPLSLK 263
KL+ S LNQ +D++ +P S K
Sbjct: 123 -------------------DAKLV-----------SSASSSQLNQ--AFLDLEKMPFSEK 150
Query: 264 LSTPSSSSDE 273
ST + DE
Sbjct: 151 TSTGKENQDE 160
>sp|Q5F3F2|MYSM1_CHICK Histone H2A deubiquitinase MYSM1 OS=Gallus gallus GN=MYSM1 PE=2
SV=1
Length = 832
Score = 37.7 bits (86), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 93 VPWTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYF 139
+ WT +E LF GL K G+ W I++ + +RT QV S+A++YF
Sbjct: 120 LKWTSEEKELFEQGLVKYGRR-WTKIAK-LIGSRTVLQVKSYARQYF 164
>sp|Q6NRB5|TAD2B_XENLA Transcriptional adapter 2-beta OS=Xenopus laevis GN=tada2b PE=2
SV=1
Length = 420
Score = 36.6 bits (83), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 14/46 (30%), Positives = 26/46 (56%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFL 140
WT E +L L +++ G G+W ++ + +RTPT+V H ++
Sbjct: 70 WTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPTEVMEHYVTMYI 115
>sp|Q9LYP5|ARR21_ARATH Putative two-component response regulator ARR21 OS=Arabidopsis
thaliana GN=ARR21 PE=2 SV=3
Length = 613
Score = 35.4 bits (80), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 31/61 (50%), Gaps = 9/61 (14%)
Query: 89 RKRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNF-----VKTRTPTQVASHAQKY--FLR 141
+K+ + WT+ H LFL ++ +G + + + V T VASH QKY FLR
Sbjct: 221 KKKKIQWTDSLHDLFLQAIRHIGLD--KAVPKKILAFMSVPYLTRENVASHLQKYRIFLR 278
Query: 142 R 142
R
Sbjct: 279 R 279
>sp|Q5RBN9|TAD2B_PONAB Transcriptional adapter 2-beta OS=Pongo abelii GN=TADA2B PE=2 SV=1
Length = 420
Score = 34.7 bits (78), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFL 140
WT E +L L +++ G G+W ++ + +RTP +V H ++
Sbjct: 70 WTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPQEVMEHYVSMYI 115
>sp|Q86TJ2|TAD2B_HUMAN Transcriptional adapter 2-beta OS=Homo sapiens GN=TADA2B PE=1 SV=2
Length = 420
Score = 34.7 bits (78), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFL 140
WT E +L L +++ G G+W ++ + +RTP +V H ++
Sbjct: 70 WTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPQEVMEHYVSMYI 115
>sp|Q15554|TERF2_HUMAN Telomeric repeat-binding factor 2 OS=Homo sapiens GN=TERF2 PE=1
SV=2
Length = 500
Score = 33.9 bits (76), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNF 122
WT +E G+QK G+G+W IS+N+
Sbjct: 450 WTVEESEWVKAGVQKYGEGNWAAISKNY 477
>sp|Q02336|ADA2_YEAST Transcriptional adapter 2 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=ADA2 PE=1 SV=1
Length = 434
Score = 33.9 bits (76), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFL 140
W DE + G Q +G G+W+ I+ + + +R +V H KY+L
Sbjct: 65 WGADEELQLIKGAQTLGLGNWQDIA-DHIGSRGKEEVKEHYLKYYL 109
>sp|Q7Y0W3|EHD1_ORYSI Two-component response regulator EHD1 OS=Oryza sativa subsp. indica
PE=2 SV=1
Length = 341
Score = 31.6 bits (70), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 11/71 (15%)
Query: 77 DDIVHASGRSRER----KRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNF-----VKTRT 127
DD+V+ +G + K + WT HR F+ + +G+ + + + VK T
Sbjct: 181 DDMVNYNGEITDIRDLGKSRLTWTTQLHRQFIAAVNHLGED--KAVPKKILGIMKVKHLT 238
Query: 128 PTQVASHAQKY 138
QVASH QKY
Sbjct: 239 REQVASHLQKY 249
>sp|Q7Y0W5|EHD1_ORYSJ Two-component response regulator EHD1 OS=Oryza sativa subsp.
japonica GN=EHD1 PE=1 SV=1
Length = 341
Score = 31.6 bits (70), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 11/71 (15%)
Query: 77 DDIVHASGRSRER----KRGVPWTEDEHRLFLLGLQKVGKGDWRGISRNF-----VKTRT 127
DD+V+ +G + K + WT HR F+ + +G+ + + + VK T
Sbjct: 181 DDMVNYNGEITDIRDLGKSRLTWTTQLHRQFIAAVNHLGED--KAVPKKILGIMKVKHLT 238
Query: 128 PTQVASHAQKY 138
QVASH QKY
Sbjct: 239 REQVASHLQKY 249
>sp|Q503N9|TAD2B_DANRE Transcriptional adapter 2-beta OS=Danio rerio GN=tada2b PE=2 SV=1
Length = 486
Score = 31.6 bits (70), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 95 WTEDEHRLFLLGLQKVGKGDWRGISRNFVKTRTPTQVASHAQKYFL 140
WT E + L +++ G G+W ++ + +RTP +V H ++
Sbjct: 70 WTSREEQSLLDAIEQYGFGNWEDMAAHVGASRTPQEVMDHYVSMYI 115
>sp|A1K3G6|RL25_AZOSB 50S ribosomal protein L25 OS=Azoarcus sp. (strain BH72) GN=rplY
PE=3 SV=1
Length = 200
Score = 31.6 bits (70), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 41/87 (47%), Gaps = 9/87 (10%)
Query: 223 SPKLIRPIPIHPV----PPSSKLGGLNLNQKATHVDVDTLPLSLKLSTPSSSSDEQSAQT 278
S KL +P+H V P+ KLGG + T +DV LP SL + ++ Q+
Sbjct: 98 SQKLHLKVPLHFVNGDNAPAVKLGGNIIAHVMTELDVQCLPSSLPEFIEVDLAALEAGQS 157
Query: 279 PHSS-----AAFQAMSSGDSNSIISVA 300
H S A +A+ G+ + +++ A
Sbjct: 158 IHVSQLKLPAGVEAVHHGEGDPVVASA 184
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.128 0.372
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 114,826,760
Number of Sequences: 539616
Number of extensions: 5064537
Number of successful extensions: 14887
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 52
Number of HSP's that attempted gapping in prelim test: 14746
Number of HSP's gapped (non-prelim): 164
length of query: 300
length of database: 191,569,459
effective HSP length: 117
effective length of query: 183
effective length of database: 128,434,387
effective search space: 23503492821
effective search space used: 23503492821
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (28.1 bits)