BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 024485
(267 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6R0H0|ASG4_ARATH Transcription factor ASG4 OS=Arabidopsis thaliana GN=ASG4 PE=2 SV=1
Length = 287
Score = 212 bits (540), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/230 (51%), Positives = 153/230 (66%), Gaps = 25/230 (10%)
Query: 21 LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAAHPYPQKA 80
L FDRDWKKI+AF+GSKTVIQIRSHAQKYFLKVQKNGT EHLPPPRPKRKA HPYPQKA
Sbjct: 76 LHLFDRDWKKIKAFVGSKTVIQIRSHAQKYFLKVQKNGTKEHLPPPRPKRKANHPYPQKA 135
Query: 81 SKNAPMLSQVSGSFQSSSAQLEPGHFLRPDSSSMLMIPMASAATSWTNNVQTVSLS-PAS 139
K + SS+A + + +S ++ +T+ + + +S P+S
Sbjct: 136 PK---------FTLSSSNALFQHDYLYNTNSHPVI-------STTRKHGLVHCDVSIPSS 179
Query: 140 KGPE---VANNRSNSTDSTPKARVSGELTDQGGELTDQGNNSHPLRVLPDFAQVYTFIGS 196
E V+ N +++ S K R T E DQ + P RV P+FA+VY FIGS
Sbjct: 180 VIKEEFGVSENCCSTSSSRDKQR-----TRIVTETNDQESCGKPHRVAPNFAEVYNFIGS 234
Query: 197 VFDPNASDHVQKLKKMDPIDVETVLLLMRNLSINLTSPDFEDHRRLLSSY 246
VFDP + HV++LK+MDPI++ETVLLLM+NLS+NLTSP+F++ R+L+SSY
Sbjct: 235 VFDPKTTGHVKRLKEMDPINLETVLLLMKNLSVNLTSPEFDEQRKLISSY 284
>sp|Q54IF9|MYBG_DICDI Myb-like protein G OS=Dictyostelium discoideum GN=mybG PE=3 SV=1
Length = 423
Score = 140 bits (352), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 118/221 (53%), Gaps = 13/221 (5%)
Query: 24 FDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAAHPYPQKASKN 83
FDRDWKKIE+F+GSKTVIQIRSHAQKYF+KVQKN T E +PPPRPKRK+ PYPQK +
Sbjct: 59 FDRDWKKIESFVGSKTVIQIRSHAQKYFIKVQKNNTGERIPPPRPKRKSIQPYPQKQKHD 118
Query: 84 APMLSQVSGSFQSSSAQLEPGHFLRPDSSSMLMIPMASAATSWTNNVQTVSLSPASKGPE 143
G+F S L HF+ S + M N+ ++P+ +
Sbjct: 119 G------MGAFIPDS--LSGNHFISSSSFATWMTYRGLMP-----NISESQINPSDLQKQ 165
Query: 144 VANNRSNSTDSTPKARVSGELTDQGGELTDQGNNSHPLRVLPDFAQVYTFIGSVFDPNAS 203
+ + + GG + +N+ + P+F ++Y F+ ++F+ N +
Sbjct: 166 LEQLQQAQQYIQQAVTTAQSSQRNGGLPPNPSSNNGGTTLTPNFPKIYAFLSNLFESNGT 225
Query: 204 DHVQKLKKMDPIDVETVLLLMRNLSINLTSPDFEDHRRLLS 244
+ L + ID ET+ +LM NL+INL + + D+ + LS
Sbjct: 226 SFTEALSDLSMIDRETMQILMHNLAINLANQQYRDNHQTLS 266
>sp|Q6R0H1|LHY_ARATH Protein LHY OS=Arabidopsis thaliana GN=LHY PE=1 SV=2
Length = 645
Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 49/96 (51%), Gaps = 15/96 (15%)
Query: 13 RW-----PYIIMILFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPR 67
RW + L + R W++IE IG+KT +QIRSHAQK+F K++K + +P +
Sbjct: 26 RWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFFTKLEKEAEVKGIPVCQ 85
Query: 68 P----------KRKAAHPYPQKASKNAPMLSQVSGS 93
KRK PYP+K N SQVS +
Sbjct: 86 ALDIEIPPPRPKRKPNTPYPRKPGNNGTSSSQVSSA 121
>sp|P92973|CCA1_ARATH Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1
Length = 608
Score = 52.8 bits (125), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 30/45 (66%)
Query: 17 IIMILFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAE 61
I L + R W+KIE + +KT +QIRSHAQK+F KV+K A+
Sbjct: 35 FIEALRLYGRAWQKIEEHVATKTAVQIRSHAQKFFSKVEKEAEAK 79
>sp|Q54Z40|MYBH_DICDI Myb-like protein H OS=Dictyostelium discoideum GN=mybH PE=3 SV=1
Length = 1217
Score = 49.7 bits (117), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 30/35 (85%)
Query: 24 FDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNG 58
+D+D KKI+ + +KT++Q+RSHAQK+ LK++KNG
Sbjct: 168 YDKDNKKIQEHVKTKTILQVRSHAQKFALKLEKNG 202
Score = 34.7 bits (78), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 27 DWKKIEAFIGSKTVIQIRSHAQKYFLKV-QKNG 58
+WK I I S+ +QI++HA+ YF K+ Q+NG
Sbjct: 251 NWKLISTLIKSRNTLQIKNHARIYFDKISQQNG 283
>sp|Q5VVJ2|MYSM1_HUMAN Histone H2A deubiquitinase MYSM1 OS=Homo sapiens GN=MYSM1 PE=1 SV=1
Length = 828
Score = 45.8 bits (107), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 21 LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAAH 74
L +F R W KI IGS+TV+Q++S+A++YF KN L P +K H
Sbjct: 133 LAKFGRRWTKISKLIGSRTVLQVKSYARQYF----KNKVKCGLDKETPNQKTGH 182
>sp|Q5F3F2|MYSM1_CHICK Histone H2A deubiquitinase MYSM1 OS=Gallus gallus GN=MYSM1 PE=2
SV=1
Length = 832
Score = 44.3 bits (103), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 28/41 (68%)
Query: 21 LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAE 61
L ++ R W KI IGS+TV+Q++S+A++YF KN +E
Sbjct: 134 LVKYGRRWTKIAKLIGSRTVLQVKSYARQYFKNKAKNDGSE 174
>sp|Q869R9|MYBJ_DICDI Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1
Length = 734
Score = 42.4 bits (98), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 24/29 (82%)
Query: 28 WKKIEAFIGSKTVIQIRSHAQKYFLKVQK 56
WK+I F+G++T QI+SHAQKY+L+ ++
Sbjct: 399 WKEIAQFVGTRTPTQIQSHAQKYYLRQKQ 427
>sp|Q5RGA4|MYSM1_DANRE Histone H2A deubiquitinase MYSM1 OS=Danio rerio GN=mysm1 PE=2 SV=2
Length = 822
Score = 41.2 bits (95), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 24/31 (77%)
Query: 21 LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYF 51
L +F R W KI IG++TV+Q++S+A++YF
Sbjct: 114 LAQFGRRWTKIAKLIGTRTVLQVKSYAKQYF 144
>sp|Q6KAQ7|ZZZ3_MOUSE ZZ-type zinc finger-containing protein 3 OS=Mus musculus GN=Zzz3
PE=2 SV=2
Length = 910
Score = 37.7 bits (86), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 26 RDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNG 58
R W+KI +G++T Q+ S QKYF+K+ K G
Sbjct: 685 RRWQKIADELGNRTAKQVASRVQKYFIKLTKAG 717
>sp|Q8IYH5|ZZZ3_HUMAN ZZ-type zinc finger-containing protein 3 OS=Homo sapiens GN=ZZZ3
PE=1 SV=1
Length = 903
Score = 37.7 bits (86), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 26 RDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNG 58
R W+KI +G++T Q+ S QKYF+K+ K G
Sbjct: 678 RRWQKIADELGNRTAKQVASRVQKYFIKLTKAG 710
>sp|Q69Z66|MYSM1_MOUSE Histone H2A deubiquitinase MYSM1 OS=Mus musculus GN=Mysm1 PE=2 SV=2
Length = 819
Score = 37.0 bits (84), Expect = 0.14, Method: Composition-based stats.
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 4/53 (7%)
Query: 21 LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAA 73
L +F R W KI + S+TV+Q++S+A++YF KN + P +K++
Sbjct: 130 LAKFGRRWTKIATLLKSRTVLQVKSYARQYF----KNKVKWDVEKETPTQKSS 178
>sp|Q54HX6|MYBI_DICDI Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1
Length = 977
Score = 36.6 bits (83), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 26 RDWKKIEAFIGSKTVIQIRSHAQKYFLKVQK 56
+D K I ++ ++ Q+R+HAQKYFL++ +
Sbjct: 193 KDVKSISQYVSTRNPTQVRTHAQKYFLRIDR 223
>sp|Q811B3|ATS12_MOUSE A disintegrin and metalloproteinase with thrombospondin motifs 12
OS=Mus musculus GN=Adamts12 PE=1 SV=2
Length = 1600
Score = 34.7 bits (78), Expect = 0.69, Method: Composition-based stats.
Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 29/129 (22%)
Query: 55 QKNGTAEH-----LPPP--RPKRKAAHPYPQKASKNAPMLSQVSGS-----------FQS 96
+ TAEH +P P RP + P+ S + P ++ + + +Q+
Sbjct: 1018 KNQSTAEHDPFKPIPAPTSRPTPLSTPTVPESMSTSTPTINSLGSTIASQEDANGMGWQN 1077
Query: 97 SSAQLEPGHFLRPDSSSMLMIPMASAATSWT-------NNVQTVSLSPASKGPEVANNRS 149
+S Q E G S S +P+ TSW+ NV + ++ P S+G A S
Sbjct: 1078 NSTQAEEGSHFPTSSGSTSQVPV----TSWSLSIQPDDENVSSSAIGPTSEGDFWATTTS 1133
Query: 150 NSTDSTPKA 158
+S S+ A
Sbjct: 1134 DSGLSSSDA 1142
>sp|A0JMR6|MYSM1_XENLA Histone H2A deubiquitinase MYSM1 OS=Xenopus laevis GN=mysm1 PE=2
SV=1
Length = 818
Score = 33.9 bits (76), Expect = 1.3, Method: Composition-based stats.
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 21 LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYF 51
L F R W I IGS++V+Q++++A+ YF
Sbjct: 124 LATFGRRWTSIARLIGSRSVLQVKNYARHYF 154
>sp|O24731|THSA_THEK8 Thermosome subunit alpha OS=Thermococcus sp. (strain KS-8) GN=thsA
PE=3 SV=1
Length = 549
Score = 33.5 bits (75), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 54/116 (46%), Gaps = 15/116 (12%)
Query: 137 PASKG-PEVANNR-SNSTDSTPKARV-SGELTDQGGELTDQGNNSHPLRVLPDFAQVYTF 193
PA+K EVA + + D T A V +GEL + EL DQ N HP ++ +A
Sbjct: 77 PAAKMMVEVAKTQDKEAGDGTTTAVVIAGELLRKAEELLDQ--NIHPSIIIKGYALAAEK 134
Query: 194 IGSVFDPNASDHVQKLKKMDPIDVETVLLLMRNLSINLTSPDFEDHRRLLSSYEID 249
+ + A K++P D ET L+R ++T + E H+ LL+ +D
Sbjct: 135 AQEILEEIAI-------KVNPDDEET---LLRIAMTSITGKNAESHKELLAKLAVD 180
>sp|Q8S9H7|DIV_ANTMA Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA
PE=2 SV=1
Length = 307
Score = 33.1 bits (74), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 17 IIMILFRFDR-DWKKI-EAFIGSKTVIQIRSHAQKYFLKVQKNG 58
+M L ++ + DW+ I F+ ++T Q+ SHAQKYF++ G
Sbjct: 142 FLMGLKKYGKGDWRNISRNFVITRTPTQVASHAQKYFIRQLSGG 185
>sp|Q2V9B0|MY1R1_SOLTU Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1
Length = 297
Score = 31.6 bits (70), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 13/26 (50%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 27 DWKKI-EAFIGSKTVIQIRSHAQKYF 51
DW+ I F+ ++T Q+ SHAQKYF
Sbjct: 116 DWRGISRNFVKTRTPTQVASHAQKYF 141
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.131 0.382
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,406,239
Number of Sequences: 539616
Number of extensions: 4013078
Number of successful extensions: 13054
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 24
Number of HSP's successfully gapped in prelim test: 50
Number of HSP's that attempted gapping in prelim test: 12934
Number of HSP's gapped (non-prelim): 155
length of query: 267
length of database: 191,569,459
effective HSP length: 115
effective length of query: 152
effective length of database: 129,513,619
effective search space: 19686070088
effective search space used: 19686070088
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.7 bits)