BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024485
         (267 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q6R0H0|ASG4_ARATH Transcription factor ASG4 OS=Arabidopsis thaliana GN=ASG4 PE=2 SV=1
          Length = 287

 Score =  212 bits (540), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 118/230 (51%), Positives = 153/230 (66%), Gaps = 25/230 (10%)

Query: 21  LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAAHPYPQKA 80
           L  FDRDWKKI+AF+GSKTVIQIRSHAQKYFLKVQKNGT EHLPPPRPKRKA HPYPQKA
Sbjct: 76  LHLFDRDWKKIKAFVGSKTVIQIRSHAQKYFLKVQKNGTKEHLPPPRPKRKANHPYPQKA 135

Query: 81  SKNAPMLSQVSGSFQSSSAQLEPGHFLRPDSSSMLMIPMASAATSWTNNVQTVSLS-PAS 139
            K          +  SS+A  +  +    +S  ++       +T+  + +    +S P+S
Sbjct: 136 PK---------FTLSSSNALFQHDYLYNTNSHPVI-------STTRKHGLVHCDVSIPSS 179

Query: 140 KGPE---VANNRSNSTDSTPKARVSGELTDQGGELTDQGNNSHPLRVLPDFAQVYTFIGS 196
              E   V+ N  +++ S  K R     T    E  DQ +   P RV P+FA+VY FIGS
Sbjct: 180 VIKEEFGVSENCCSTSSSRDKQR-----TRIVTETNDQESCGKPHRVAPNFAEVYNFIGS 234

Query: 197 VFDPNASDHVQKLKKMDPIDVETVLLLMRNLSINLTSPDFEDHRRLLSSY 246
           VFDP  + HV++LK+MDPI++ETVLLLM+NLS+NLTSP+F++ R+L+SSY
Sbjct: 235 VFDPKTTGHVKRLKEMDPINLETVLLLMKNLSVNLTSPEFDEQRKLISSY 284


>sp|Q54IF9|MYBG_DICDI Myb-like protein G OS=Dictyostelium discoideum GN=mybG PE=3 SV=1
          Length = 423

 Score =  140 bits (352), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 80/221 (36%), Positives = 118/221 (53%), Gaps = 13/221 (5%)

Query: 24  FDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAAHPYPQKASKN 83
           FDRDWKKIE+F+GSKTVIQIRSHAQKYF+KVQKN T E +PPPRPKRK+  PYPQK   +
Sbjct: 59  FDRDWKKIESFVGSKTVIQIRSHAQKYFIKVQKNNTGERIPPPRPKRKSIQPYPQKQKHD 118

Query: 84  APMLSQVSGSFQSSSAQLEPGHFLRPDSSSMLMIPMASAATSWTNNVQTVSLSPASKGPE 143
                   G+F   S  L   HF+   S +  M            N+    ++P+    +
Sbjct: 119 G------MGAFIPDS--LSGNHFISSSSFATWMTYRGLMP-----NISESQINPSDLQKQ 165

Query: 144 VANNRSNSTDSTPKARVSGELTDQGGELTDQGNNSHPLRVLPDFAQVYTFIGSVFDPNAS 203
           +   +            +      GG   +  +N+    + P+F ++Y F+ ++F+ N +
Sbjct: 166 LEQLQQAQQYIQQAVTTAQSSQRNGGLPPNPSSNNGGTTLTPNFPKIYAFLSNLFESNGT 225

Query: 204 DHVQKLKKMDPIDVETVLLLMRNLSINLTSPDFEDHRRLLS 244
              + L  +  ID ET+ +LM NL+INL +  + D+ + LS
Sbjct: 226 SFTEALSDLSMIDRETMQILMHNLAINLANQQYRDNHQTLS 266


>sp|Q6R0H1|LHY_ARATH Protein LHY OS=Arabidopsis thaliana GN=LHY PE=1 SV=2
          Length = 645

 Score = 57.0 bits (136), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 49/96 (51%), Gaps = 15/96 (15%)

Query: 13  RW-----PYIIMILFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPR 67
           RW        +  L  + R W++IE  IG+KT +QIRSHAQK+F K++K    + +P  +
Sbjct: 26  RWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFFTKLEKEAEVKGIPVCQ 85

Query: 68  P----------KRKAAHPYPQKASKNAPMLSQVSGS 93
                      KRK   PYP+K   N    SQVS +
Sbjct: 86  ALDIEIPPPRPKRKPNTPYPRKPGNNGTSSSQVSSA 121


>sp|P92973|CCA1_ARATH Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1
          Length = 608

 Score = 52.8 bits (125), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 30/45 (66%)

Query: 17 IIMILFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAE 61
           I  L  + R W+KIE  + +KT +QIRSHAQK+F KV+K   A+
Sbjct: 35 FIEALRLYGRAWQKIEEHVATKTAVQIRSHAQKFFSKVEKEAEAK 79


>sp|Q54Z40|MYBH_DICDI Myb-like protein H OS=Dictyostelium discoideum GN=mybH PE=3 SV=1
          Length = 1217

 Score = 49.7 bits (117), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 30/35 (85%)

Query: 24  FDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNG 58
           +D+D KKI+  + +KT++Q+RSHAQK+ LK++KNG
Sbjct: 168 YDKDNKKIQEHVKTKTILQVRSHAQKFALKLEKNG 202



 Score = 34.7 bits (78), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 23/33 (69%), Gaps = 1/33 (3%)

Query: 27  DWKKIEAFIGSKTVIQIRSHAQKYFLKV-QKNG 58
           +WK I   I S+  +QI++HA+ YF K+ Q+NG
Sbjct: 251 NWKLISTLIKSRNTLQIKNHARIYFDKISQQNG 283


>sp|Q5VVJ2|MYSM1_HUMAN Histone H2A deubiquitinase MYSM1 OS=Homo sapiens GN=MYSM1 PE=1 SV=1
          Length = 828

 Score = 45.8 bits (107), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 21  LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAAH 74
           L +F R W KI   IGS+TV+Q++S+A++YF    KN     L    P +K  H
Sbjct: 133 LAKFGRRWTKISKLIGSRTVLQVKSYARQYF----KNKVKCGLDKETPNQKTGH 182


>sp|Q5F3F2|MYSM1_CHICK Histone H2A deubiquitinase MYSM1 OS=Gallus gallus GN=MYSM1 PE=2
           SV=1
          Length = 832

 Score = 44.3 bits (103), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 28/41 (68%)

Query: 21  LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAE 61
           L ++ R W KI   IGS+TV+Q++S+A++YF    KN  +E
Sbjct: 134 LVKYGRRWTKIAKLIGSRTVLQVKSYARQYFKNKAKNDGSE 174


>sp|Q869R9|MYBJ_DICDI Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1
          Length = 734

 Score = 42.4 bits (98), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 24/29 (82%)

Query: 28  WKKIEAFIGSKTVIQIRSHAQKYFLKVQK 56
           WK+I  F+G++T  QI+SHAQKY+L+ ++
Sbjct: 399 WKEIAQFVGTRTPTQIQSHAQKYYLRQKQ 427


>sp|Q5RGA4|MYSM1_DANRE Histone H2A deubiquitinase MYSM1 OS=Danio rerio GN=mysm1 PE=2 SV=2
          Length = 822

 Score = 41.2 bits (95), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 24/31 (77%)

Query: 21  LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYF 51
           L +F R W KI   IG++TV+Q++S+A++YF
Sbjct: 114 LAQFGRRWTKIAKLIGTRTVLQVKSYAKQYF 144


>sp|Q6KAQ7|ZZZ3_MOUSE ZZ-type zinc finger-containing protein 3 OS=Mus musculus GN=Zzz3
           PE=2 SV=2
          Length = 910

 Score = 37.7 bits (86), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 26  RDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNG 58
           R W+KI   +G++T  Q+ S  QKYF+K+ K G
Sbjct: 685 RRWQKIADELGNRTAKQVASRVQKYFIKLTKAG 717


>sp|Q8IYH5|ZZZ3_HUMAN ZZ-type zinc finger-containing protein 3 OS=Homo sapiens GN=ZZZ3
           PE=1 SV=1
          Length = 903

 Score = 37.7 bits (86), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 26  RDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNG 58
           R W+KI   +G++T  Q+ S  QKYF+K+ K G
Sbjct: 678 RRWQKIADELGNRTAKQVASRVQKYFIKLTKAG 710


>sp|Q69Z66|MYSM1_MOUSE Histone H2A deubiquitinase MYSM1 OS=Mus musculus GN=Mysm1 PE=2 SV=2
          Length = 819

 Score = 37.0 bits (84), Expect = 0.14,   Method: Composition-based stats.
 Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 21  LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYFLKVQKNGTAEHLPPPRPKRKAA 73
           L +F R W KI   + S+TV+Q++S+A++YF    KN     +    P +K++
Sbjct: 130 LAKFGRRWTKIATLLKSRTVLQVKSYARQYF----KNKVKWDVEKETPTQKSS 178


>sp|Q54HX6|MYBI_DICDI Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1
          Length = 977

 Score = 36.6 bits (83), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 26  RDWKKIEAFIGSKTVIQIRSHAQKYFLKVQK 56
           +D K I  ++ ++   Q+R+HAQKYFL++ +
Sbjct: 193 KDVKSISQYVSTRNPTQVRTHAQKYFLRIDR 223


>sp|Q811B3|ATS12_MOUSE A disintegrin and metalloproteinase with thrombospondin motifs 12
            OS=Mus musculus GN=Adamts12 PE=1 SV=2
          Length = 1600

 Score = 34.7 bits (78), Expect = 0.69,   Method: Composition-based stats.
 Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 29/129 (22%)

Query: 55   QKNGTAEH-----LPPP--RPKRKAAHPYPQKASKNAPMLSQVSGS-----------FQS 96
            +   TAEH     +P P  RP   +    P+  S + P ++ +  +           +Q+
Sbjct: 1018 KNQSTAEHDPFKPIPAPTSRPTPLSTPTVPESMSTSTPTINSLGSTIASQEDANGMGWQN 1077

Query: 97   SSAQLEPGHFLRPDSSSMLMIPMASAATSWT-------NNVQTVSLSPASKGPEVANNRS 149
            +S Q E G      S S   +P+    TSW+        NV + ++ P S+G   A   S
Sbjct: 1078 NSTQAEEGSHFPTSSGSTSQVPV----TSWSLSIQPDDENVSSSAIGPTSEGDFWATTTS 1133

Query: 150  NSTDSTPKA 158
            +S  S+  A
Sbjct: 1134 DSGLSSSDA 1142


>sp|A0JMR6|MYSM1_XENLA Histone H2A deubiquitinase MYSM1 OS=Xenopus laevis GN=mysm1 PE=2
           SV=1
          Length = 818

 Score = 33.9 bits (76), Expect = 1.3,   Method: Composition-based stats.
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 21  LFRFDRDWKKIEAFIGSKTVIQIRSHAQKYF 51
           L  F R W  I   IGS++V+Q++++A+ YF
Sbjct: 124 LATFGRRWTSIARLIGSRSVLQVKNYARHYF 154


>sp|O24731|THSA_THEK8 Thermosome subunit alpha OS=Thermococcus sp. (strain KS-8) GN=thsA
           PE=3 SV=1
          Length = 549

 Score = 33.5 bits (75), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 54/116 (46%), Gaps = 15/116 (12%)

Query: 137 PASKG-PEVANNR-SNSTDSTPKARV-SGELTDQGGELTDQGNNSHPLRVLPDFAQVYTF 193
           PA+K   EVA  +   + D T  A V +GEL  +  EL DQ  N HP  ++  +A     
Sbjct: 77  PAAKMMVEVAKTQDKEAGDGTTTAVVIAGELLRKAEELLDQ--NIHPSIIIKGYALAAEK 134

Query: 194 IGSVFDPNASDHVQKLKKMDPIDVETVLLLMRNLSINLTSPDFEDHRRLLSSYEID 249
              + +  A        K++P D ET   L+R    ++T  + E H+ LL+   +D
Sbjct: 135 AQEILEEIAI-------KVNPDDEET---LLRIAMTSITGKNAESHKELLAKLAVD 180


>sp|Q8S9H7|DIV_ANTMA Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA
           PE=2 SV=1
          Length = 307

 Score = 33.1 bits (74), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 17  IIMILFRFDR-DWKKI-EAFIGSKTVIQIRSHAQKYFLKVQKNG 58
            +M L ++ + DW+ I   F+ ++T  Q+ SHAQKYF++    G
Sbjct: 142 FLMGLKKYGKGDWRNISRNFVITRTPTQVASHAQKYFIRQLSGG 185


>sp|Q2V9B0|MY1R1_SOLTU Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1
          Length = 297

 Score = 31.6 bits (70), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 13/26 (50%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 27  DWKKI-EAFIGSKTVIQIRSHAQKYF 51
           DW+ I   F+ ++T  Q+ SHAQKYF
Sbjct: 116 DWRGISRNFVKTRTPTQVASHAQKYF 141


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.315    0.131    0.382 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,406,239
Number of Sequences: 539616
Number of extensions: 4013078
Number of successful extensions: 13054
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 24
Number of HSP's successfully gapped in prelim test: 50
Number of HSP's that attempted gapping in prelim test: 12934
Number of HSP's gapped (non-prelim): 155
length of query: 267
length of database: 191,569,459
effective HSP length: 115
effective length of query: 152
effective length of database: 129,513,619
effective search space: 19686070088
effective search space used: 19686070088
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.7 bits)