BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004260
(765 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6R0H1|LHY_ARATH Protein LHY OS=Arabidopsis thaliana GN=LHY PE=1 SV=2
Length = 645
Score = 438 bits (1126), Expect = e-122, Method: Compositional matrix adjust.
Identities = 359/779 (46%), Positives = 447/779 (57%), Gaps = 155/779 (19%)
Query: 1 MDTYSSGEDLVLKTRKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQ 60
MDT +SGE+L+ K RKPYTITKQRERWTE+EH RFLEAL+LYGRAWQRIEEHIGTK AVQ
Sbjct: 1 MDTNTSGEELLAKARKPYTITKQRERWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQ 60
Query: 61 IRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRKTCTNA-PMSQIG- 118
IRSHAQKFF+KLEKEA KG+P+ QA+DI+IPPPRPKRKP PYPRK N SQ+
Sbjct: 61 IRSHAQKFFTKLEKEAEVKGIPVCQALDIEIPPPRPKRKPNTPYPRKPGNNGTSSSQVSS 120
Query: 119 AKDGKLRSSVSSLRCNQV-LDLEKEPICDRPNGDEKPTYTIESQKDNCSEVFILHQEAHC 177
AKD KL SS SS + NQ LDLEK P EK + E+Q +NCS
Sbjct: 121 AKDAKLVSSASSSQLNQAFLDLEKMPFS------EKTSTGKENQDENCS----------- 163
Query: 178 SSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGKPDAK 237
VS+VNK +PT K+V ET
Sbjct: 164 -GVSTVNKYPLPT-----------------KQVSGDIET--------------------- 184
Query: 238 LALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEVQATQNYPRHVNVHILDGS 297
SK ++NA + V + +D D V + QNYP H + I++G+
Sbjct: 185 --------SKTSTVDNAV----QDVPKKNKD--KDGNDGTTVHSMQNYPWHFHADIVNGN 230
Query: 298 LGTGTQS-PSDMPMQESIFHPIGE-VHGNPNLFTNPAASATTENESNVPKSTHQSFPTFH 355
+ Q+ PS M Q+ +FHP+ E HG+ NL A++ TT ++HQ+FP H
Sbjct: 231 IAKCPQNHPSGMVSQDFMFHPMREETHGHANLQATTASATTT--------ASHQAFPACH 282
Query: 356 PPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNMETSADSP 415
QDDYRSFL ISS FS+LI+STLLQNPAAHAAA+FAA+ WPY ++ S DS
Sbjct: 283 --------SQDDYRSFLQISSTFSNLIMSTLLQNPAAHAAATFAASVWPYASVGNSGDS- 333
Query: 416 TCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAVS 475
+ SSPPS+ AIAAATVAAATAWWA+HGLLP+CAP AP C P AV
Sbjct: 334 -------STPMSSSPPSITAIAAATVAAATAWWASHGLLPVCAP--APITCVPFSTVAVP 384
Query: 476 SMDTAQVPA-AKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESAS 534
+ PA + D +NT QP + + ALQ QN ASKS + DS+E+
Sbjct: 385 T------PAMTEMDTVENT---------QPFEKQNTALQDQNLASKSPASSSDDSDETGV 429
Query: 535 AKLKMELKATDH--EKNSEENELHDSNKTKNRKQVDRSSCGSNTPSSSEVETDALEKQEK 592
KL + K D E+ +HDSN + + VDRSSCGSNTPS S+ ETDAL+K EK
Sbjct: 430 TKLNADSKTNDDKIEEVVVTAAVHDSNTAQKKNLVDRSSCGSNTPSGSDAETDALDKMEK 489
Query: 593 GKEESKEIDPSLP-ATDPGNRRSRSSSN------ITDSWKEVSEVGRLAFQALFSREVLP 645
KE+ KE D + P + NR+ + N TDSWKEVSE GR+AFQALF+RE LP
Sbjct: 490 DKEDVKETDENQPDVIELNNRKIKMRDNNSNNNATTDSWKEVSEEGRIAFQALFARERLP 549
Query: 646 QSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSL-FDLNSKTWGSCFGHQEVEKSTVSGVE 704
QSFSPP + +N N+K D S+ N K+ SC Q
Sbjct: 550 QSFSPP------------QVAENVNRKQSDTSMPLAPNFKSQDSCAADQ----------- 586
Query: 705 NNGGEGLLTIGLGHGK-LKARRTGFKPYKRCSVEAKENRILNTGNQAEEKCPKRIRVEG 762
EG++ IG+G K LK R+TGFKPYKRCS+E KE+++ N NQ++EK KR+R+EG
Sbjct: 587 ----EGVVMIGVGTCKSLKTRQTGFKPYKRCSMEVKESQVGNINNQSDEKVCKRLRLEG 641
>sp|P92973|CCA1_ARATH Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1
Length = 608
Score = 356 bits (914), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 314/781 (40%), Positives = 407/781 (52%), Gaps = 198/781 (25%)
Query: 1 MDTYSSGEDLVLKTRKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQ 60
M+T SSGEDLV+KTRKPYTITKQRERWTEEEHNRF+EAL+LYGRAWQ+IEEH+ TK AVQ
Sbjct: 1 METNSSGEDLVIKTRKPYTITKQRERWTEEEHNRFIEALRLYGRAWQKIEEHVATKTAVQ 60
Query: 61 IRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRKTCTNAP-MSQIGA 119
IRSHAQKFFSK+EKEA +KGV +GQA+DI IPPPRPKRKP NPYPRKT + MS+ G
Sbjct: 61 IRSHAQKFFSKVEKEAEAKGVAMGQALDIAIPPPRPKRKPNNPYPRKTGSGTILMSKTGV 120
Query: 120 KDGKLRSSVSSLRCNQVLDLEKEPICDRPNGDEKPTYTIES--QKDNCSEVFILHQEAHC 177
DGK + L EK + N D + + E Q+DNCS+ F HQ +
Sbjct: 121 NDGK-----------ESLGSEKVSHPEMANEDRQQSKPEEKTLQEDNCSDCFT-HQ--YL 166
Query: 178 SSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGKPDAK 237
S+ SS+NK+ + T ++ REF+PS +E + R+
Sbjct: 167 SAASSMNKSCIET----SNASTFREFLPSREEGSQNNRVRKE------------------ 204
Query: 238 LALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEVQATQNYPRHVNVHILDGS 297
S+ LN A +L Q Q YP H+ V + GS
Sbjct: 205 -------SNSDLN--------------------AKSLENGNEQGPQTYPMHIPVLVPLGS 237
Query: 298 LGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKSTHQSFPTFHPP 357
+ S+ HP E +P+ +QSFP
Sbjct: 238 -----------SITSSLSHPPSEPDSHPHTVAG----------------DYQSFPNH--- 267
Query: 358 FTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNMETSADSPTC 417
I+STLLQ PA + AA+FA++FWP ++S SP
Sbjct: 268 ------------------------IMSTLLQTPALYTAATFASSFWP---PDSSGGSPV- 299
Query: 418 PQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPF-------HAPFICPPAC 470
+SPP++AA+AAATVAAA+AWWAA+GLLPLCAP H P P+C
Sbjct: 300 --------PGNSPPNLAAMAAATVAAASAWWAANGLLPLCAPLSSGGFTSHPPSTFGPSC 351
Query: 471 ATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSE 530
+ T Q + ++ + E+SEA +A++S DSE
Sbjct: 352 DVEYTKASTLQHGSVQSREQ----------------EHSEASKARSSL---------DSE 386
Query: 531 ESASAKLKMELKATDHEKNSE--ENELHDSNKTKNRKQVDRSSCGSNTPSSSE-VETDAL 587
+ ++ + K HE+ S E++ S+ +RKQVDRSSCGSNTPSSS+ VE DA
Sbjct: 387 D-----VENKSKPVCHEQPSATPESDAKGSDGAGDRKQVDRSSCGSNTPSSSDDVEADAS 441
Query: 588 EKQEKGKE-ESKEI--DPSLPATDPGN-RRSRSSSNITDSWKEVSEVGRLAFQALFSREV 643
E+QE G E KE D + P T N RRSR SSNITD WK VS+ GR+AFQALFSREV
Sbjct: 442 ERQEDGTNGEVKETNEDTNKPQTSESNARRSRISSNITDPWKSVSDEGRIAFQALFSREV 501
Query: 644 LPQSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSLFDLNSKTWGSCFGHQEVEKSTVSGV 703
LPQSF+ ++ +++ ++ DLN + QE +++T
Sbjct: 502 LPQSFT---------YREEHREEEQQQQEQRYPMALDLNFTAQLTPVDDQEEKRNT---- 548
Query: 704 ENNGGEGLLTIGLGHGKLKAR-RTGFKPYKRCSVEAKENRILNTG--NQAEEKCPKRIRV 760
G L IGL KL +R RTGFKPYKRCS+EAKE+RILN E+K PKR+R+
Sbjct: 549 ------GFLGIGLDASKLMSRGRTGFKPYKRCSMEAKESRILNNNPIIHVEQKDPKRMRL 602
Query: 761 E 761
E
Sbjct: 603 E 603
>sp|Q54IF9|MYBG_DICDI Myb-like protein G OS=Dictyostelium discoideum GN=mybG PE=3 SV=1
Length = 423
Score = 118 bits (295), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 53/96 (55%), Positives = 68/96 (70%), Gaps = 10/96 (10%)
Query: 12 LKTRKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFFSK 71
LK+RKPYTI+KQRE WT+EEH +FLEAL L+ R W++IE +G+K +QIRSHAQK+F K
Sbjct: 29 LKSRKPYTISKQRENWTDEEHQKFLEALTLFDRDWKKIESFVGSKTVIQIRSHAQKYFIK 88
Query: 72 LEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRK 107
++K + IPPPRPKRK PYP+K
Sbjct: 89 VQKNNTGE----------RIPPPRPKRKSIQPYPQK 114
>sp|Q6R0H0|ASG4_ARATH Transcription factor ASG4 OS=Arabidopsis thaliana GN=ASG4 PE=2 SV=1
Length = 287
Score = 114 bits (286), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 71/105 (67%), Gaps = 10/105 (9%)
Query: 3 TYSSGEDLVLKTRKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIR 62
T S ED K RKPYTITK RE WTE+EH++FLEAL L+ R W++I+ +G+K +QIR
Sbjct: 40 TMSFFEDPTKKVRKPYTITKSRENWTEQEHDKFLEALHLFDRDWKKIKAFVGSKTVIQIR 99
Query: 63 SHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRK 107
SHAQK+F K++K + +PPPRPKRK +PYP+K
Sbjct: 100 SHAQKYFLKVQKNGTKE----------HLPPPRPKRKANHPYPQK 134
>sp|Q54Z40|MYBH_DICDI Myb-like protein H OS=Dictyostelium discoideum GN=mybH PE=3 SV=1
Length = 1217
Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 38/54 (70%)
Query: 24 RERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEAL 77
R WT+EE F+EA KLY + ++I+EH+ TK +Q+RSHAQKF KLEK +
Sbjct: 150 RSSWTKEEERLFVEAYKLYDKDNKKIQEHVKTKTILQVRSHAQKFALKLEKNGI 203
Score = 49.7 bits (117), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 27 WTEEEHNRFLEALKLYGRA-WQRIEEHIGTKKAVQIRSHAQKFFSKLEKE 75
W+ EEH FL+A++ YGR W+ I I ++ +QI++HA+ +F K+ ++
Sbjct: 232 WSNEEHELFLKAIEKYGRGNWKLISTLIKSRNTLQIKNHARIYFDKISQQ 281
>sp|Q869R9|MYBJ_DICDI Myb-like protein J OS=Dictyostelium discoideum GN=mybJ PE=3 SV=1
Length = 734
Score = 62.0 bits (149), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 43/60 (71%), Gaps = 1/60 (1%)
Query: 21 TKQRERWTEEEHNRFLEALKLYGR-AWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEALSK 79
T ++ WT+EEH RFL ++++G+ AW+ I + +GT+ QI+SHAQK++ + ++E +K
Sbjct: 373 TSLKQGWTKEEHIRFLNGIQIHGKGAWKEIAQFVGTRTPTQIQSHAQKYYLRQKQETKNK 432
>sp|Q54HX6|MYBI_DICDI Myb-like protein I OS=Dictyostelium discoideum GN=mybI PE=3 SV=1
Length = 977
Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 39/55 (70%), Gaps = 1/55 (1%)
Query: 22 KQRERWTEEEHNRFLEALKLYG-RAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKE 75
KQ WT EEH+RF+EAL YG + + I +++ T+ Q+R+HAQK+F ++++E
Sbjct: 170 KQSRYWTPEEHSRFIEALSKYGHKDVKSISQYVSTRNPTQVRTHAQKYFLRIDRE 224
>sp|Q5RGA4|MYSM1_DANRE Histone H2A deubiquitinase MYSM1 OS=Danio rerio GN=mysm1 PE=2 SV=2
Length = 822
Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 13 KTRKPYTITKQRE-RWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFF 69
K +KP + + RW EEE F + L +GR W +I + IGT+ +Q++S+A+++F
Sbjct: 87 KVKKPLVKSSAAQTRWAEEEKELFEKGLAQFGRRWTKIAKLIGTRTVLQVKSYAKQYF 144
>sp|Q5VVJ2|MYSM1_HUMAN Histone H2A deubiquitinase MYSM1 OS=Homo sapiens GN=MYSM1 PE=1 SV=1
Length = 828
Score = 48.1 bits (113), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 34/57 (59%)
Query: 26 RWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVP 82
+WT EE F + L +GR W +I + IG++ +Q++S+A+++F K L K P
Sbjct: 120 KWTIEEKELFEQGLAKFGRRWTKISKLIGSRTVLQVKSYARQYFKNKVKCGLDKETP 176
>sp|Q5F3F2|MYSM1_CHICK Histone H2A deubiquitinase MYSM1 OS=Gallus gallus GN=MYSM1 PE=2
SV=1
Length = 832
Score = 47.0 bits (110), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 30/44 (68%)
Query: 26 RWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFF 69
+WT EE F + L YGR W +I + IG++ +Q++S+A+++F
Sbjct: 121 KWTSEEKELFEQGLVKYGRRWTKIAKLIGSRTVLQVKSYARQYF 164
>sp|A0JMR6|MYSM1_XENLA Histone H2A deubiquitinase MYSM1 OS=Xenopus laevis GN=mysm1 PE=2
SV=1
Length = 818
Score = 46.6 bits (109), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 34/59 (57%)
Query: 11 VLKTRKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFF 69
+ + R P + +WT+EE N F + L +GR W I IG++ +Q++++A+ +F
Sbjct: 96 IKRVRSPAKASSSPVKWTKEEKNLFEQGLATFGRRWTSIARLIGSRSVLQVKNYARHYF 154
>sp|Q8S9H7|DIV_ANTMA Transcription factor DIVARICATA OS=Antirrhinum majus GN=DIVARICATA
PE=2 SV=1
Length = 307
Score = 46.6 bits (109), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 8/85 (9%)
Query: 27 WTEEEHNRFLEALKLYGRA-WQRIEEH-IGTKKAVQIRSHAQKFF------SKLEKEALS 78
WTEEEH FL LK YG+ W+ I + + T+ Q+ SHAQK+F K ++ A
Sbjct: 134 WTEEEHKLFLMGLKKYGKGDWRNISRNFVITRTPTQVASHAQKYFIRQLSGGKDKRRASI 193
Query: 79 KGVPIGQAIDIDIPPPRPKRKPRNP 103
+ D P P K+ P +P
Sbjct: 194 HDITTVNLSDNQTPSPDNKKPPSSP 218
>sp|Q2V9B0|MY1R1_SOLTU Transcription factor MYB1R1 OS=Solanum tuberosum PE=2 SV=1
Length = 297
Score = 43.9 bits (102), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 4 YSSGEDLVLKTRKPYTITKQRERWTEEEHNRFLEALKLYGRA-WQRIEEH-IGTKKAVQI 61
Y+S +D V K+ WTEEEH FL L+ G+ W+ I + + T+ Q+
Sbjct: 74 YASADDAVQHQSNSGRERKRGVPWTEEEHKLFLLGLQKVGKGDWRGISRNFVKTRTPTQV 133
Query: 62 RSHAQKFF 69
SHAQK+F
Sbjct: 134 ASHAQKYF 141
>sp|Q8IYH5|ZZZ3_HUMAN ZZ-type zinc finger-containing protein 3 OS=Homo sapiens GN=ZZZ3
PE=1 SV=1
Length = 903
Score = 43.9 bits (102), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 10/63 (15%)
Query: 27 WTEEEHNRFLEALKLY------GRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEALSKG 80
WT EE + + L Y R WQ+I + +G + A Q+ S QK+F KL K G
Sbjct: 655 WTVEEQKKLEQLLIKYPPEEVESRRWQKIADELGNRTAKQVASRVQKYFIKLTK----AG 710
Query: 81 VPI 83
+P+
Sbjct: 711 IPV 713
>sp|Q6KAQ7|ZZZ3_MOUSE ZZ-type zinc finger-containing protein 3 OS=Mus musculus GN=Zzz3
PE=2 SV=2
Length = 910
Score = 43.5 bits (101), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 10/63 (15%)
Query: 27 WTEEEHNRFLEALKLY------GRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEALSKG 80
WT EE + + L Y R WQ+I + +G + A Q+ S QK+F KL K G
Sbjct: 662 WTVEEQKKLEQLLLKYPPEEVESRRWQKIADELGNRTAKQVASRVQKYFIKLTK----AG 717
Query: 81 VPI 83
+P+
Sbjct: 718 IPV 720
>sp|Q86HX9|MYBK_DICDI Myb-like protein K OS=Dictyostelium discoideum GN=mybK PE=3 SV=1
Length = 894
Score = 43.5 bits (101), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 26 RWTEEEHNRFLEALKLYG-RAWQRIEEHIGTKKAVQIRSHAQKFFSKLEK 74
WT EEHN+FLEA++ +G + + I + + T+ Q+R+H + +K
Sbjct: 656 HWTSEEHNKFLEAVQQFGIKDYHAIAKFVQTRNHHQVRTHVNTYLKNQKK 705
>sp|Q9UKL0|RCOR1_HUMAN REST corepressor 1 OS=Homo sapiens GN=RCOR1 PE=1 SV=1
Length = 482
Score = 41.6 bits (96), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%)
Query: 15 RKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRS 63
R P I K RWT EE ++A++ YGR +Q I + IG K VQ+++
Sbjct: 371 RLPEVIQKCNARWTTEEQLLAVQAIRKYGRDFQAISDVIGNKSVVQVKN 419
>sp|Q8CFE3|RCOR1_MOUSE REST corepressor 1 OS=Mus musculus GN=Rcor1 PE=1 SV=2
Length = 477
Score = 41.6 bits (96), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%)
Query: 15 RKPYTITKQRERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRS 63
R P I K RWT EE ++A++ YGR +Q I + IG K VQ+++
Sbjct: 365 RLPEVIQKCNARWTTEEQLLAVQAIRKYGRDFQAISDVIGNKSVVQVKN 413
>sp|Q69Z66|MYSM1_MOUSE Histone H2A deubiquitinase MYSM1 OS=Mus musculus GN=Mysm1 PE=2 SV=2
Length = 819
Score = 41.6 bits (96), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 26 RWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVP 82
+WT EE F + L +GR W +I + ++ +Q++S+A+++F K + K P
Sbjct: 117 KWTVEEKELFEQGLAKFGRRWTKIATLLKSRTVLQVKSYARQYFKNKVKWDVEKETP 173
>sp|Q9XI07|SWI3C_ARATH SWI/SNF complex subunit SWI3C OS=Arabidopsis thaliana GN=SWI3C PE=1
SV=1
Length = 807
Score = 39.7 bits (91), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 15/44 (34%), Positives = 25/44 (56%)
Query: 25 ERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKF 68
+ WT++E LEA++LY W +I +H+G+K Q H +
Sbjct: 401 DNWTDQETLLLLEAVELYNENWVQIADHVGSKSKAQCILHFLRL 444
>sp|Q90WN5|RCOR1_XENLA REST corepressor 1 OS=Xenopus laevis GN=rcor1 PE=2 SV=1
Length = 431
Score = 39.3 bits (90), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 26 RWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRS 63
RWT EE ++A+++YGR +Q I + IG K VQ+++
Sbjct: 331 RWTTEEQLLAVQAIRMYGRDFQAISDVIGNKSVVQVKN 368
>sp|Q54YB7|MYBF_DICDI Myb-like protein F OS=Dictyostelium discoideum GN=mybF PE=3 SV=1
Length = 564
Score = 38.9 bits (89), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 24 RERWTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEK 74
R WT EE + E YG+ W++I+ H K Q+ SH Q + K+ K
Sbjct: 414 RTVWTLEEEELYKEVFNHYGKNWKKIKTHFPDKSKSQVTSHGQ-YLIKINK 463
>sp|Q54K19|MYBD_DICDI Myb-like protein D OS=Dictyostelium discoideum GN=mybD PE=3 SV=1
Length = 595
Score = 38.5 bits (88), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 33/75 (44%), Gaps = 12/75 (16%)
Query: 27 WTEEEHNRFLEALKLYGRAWQRIEEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQA 86
WT+EE + + YG++W+ I H K Q++SH Q + IG+
Sbjct: 443 WTQEEDEKMAQLYNKYGKSWKAIHSHFDDKTREQVQSHGQYL------------IRIGKL 490
Query: 87 IDIDIPPPRPKRKPR 101
DI + +RK +
Sbjct: 491 EDIHRDGRKERRKGK 505
>sp|Q62901|RERE_RAT Arginine-glutamic acid dipeptide repeats protein OS=Rattus
norvegicus GN=Rere PE=2 SV=2
Length = 1559
Score = 36.2 bits (82), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 3 TYSSGEDLVLKTRKPYTITKQRER-WTEEEHNRFLEALKLYGRAWQRIEEHI 53
+Y +G+ L +KP + K E+ WTE+E RF++ L+ YG+ + RI + +
Sbjct: 372 SYDAGKALQRLVKKP--VPKLIEKCWTEDEVKRFVKGLRQYGKNFFRIRKEL 421
>sp|Q80TZ9|RERE_MOUSE Arginine-glutamic acid dipeptide repeats protein OS=Mus musculus
GN=Rere PE=1 SV=3
Length = 1558
Score = 35.8 bits (81), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 3 TYSSGEDLVLKTRKPYTITKQRER-WTEEEHNRFLEALKLYGRAWQRIEEHI 53
+Y +G+ L +KP + K E+ WTE+E RF++ L+ YG+ + RI + +
Sbjct: 373 SYDAGKALQRLVKKP--VPKLIEKCWTEDEVKRFVKGLRQYGKNFFRIRKEL 422
>sp|Q9P2R6|RERE_HUMAN Arginine-glutamic acid dipeptide repeats protein OS=Homo sapiens
GN=RERE PE=1 SV=2
Length = 1566
Score = 35.4 bits (80), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 4 YSSGEDLVLKTRKPYTITKQRER-WTEEEHNRFLEALKLYGRAWQRIEEHI 53
Y +G+ L +KP + K E+ WTE+E RF++ L+ YG+ + RI + +
Sbjct: 374 YDAGKALQRLVKKP--VPKLIEKCWTEDEVKRFVKGLRQYGKNFFRIRKEL 422
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.310 0.126 0.366
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 293,948,271
Number of Sequences: 539616
Number of extensions: 12809683
Number of successful extensions: 39760
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 93
Number of HSP's successfully gapped in prelim test: 502
Number of HSP's that attempted gapping in prelim test: 37582
Number of HSP's gapped (non-prelim): 1879
length of query: 765
length of database: 191,569,459
effective HSP length: 125
effective length of query: 640
effective length of database: 124,117,459
effective search space: 79435173760
effective search space used: 79435173760
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 65 (29.6 bits)