BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006856
(628 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P48785|PRH_ARATH Pathogenesis-related homeodomain protein OS=Arabidopsis thaliana
GN=PRH PE=2 SV=1
Length = 796
Score = 486 bits (1251), Expect = e-136, Method: Compositional matrix adjust.
Identities = 258/443 (58%), Positives = 328/443 (74%), Gaps = 18/443 (4%)
Query: 139 KRRKEKVELDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRPEKELQRAKK 198
+++ KVE+D++ RLQRRTRYLLIKMK++QNLIDAY+ EGWKG SREKIRP+KEL+RA+K
Sbjct: 91 QQKDNKVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARK 150
Query: 199 QILKCKIGIRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHIICAKCKLREAFPDNDIVLC 258
+IL CK+G+RDAIRQLD LSSVG +E VIA+DGS+HH+HI CA+C REAFPDNDI+LC
Sbjct: 151 EILNCKLGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILC 210
Query: 259 DGTCNCAFHQKCLDPPLDTES----NQGWFCKFCECKMEIIESMNAHIGTSFSVNSNWQD 314
DGTCN AFHQKCLDPPL+TES +QGWFCKFC+CK+EII++MNA IGT F V+SNWQD
Sbjct: 211 DGTCNRAFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQD 270
Query: 315 IFKEEAAFPDGCSALLNQEEEWPSDDSEDDDYNPERRE----NSCSISRAGTDDDPSSST 370
IF EEA+ P G A +N E +WPSDDS+DDDY+PE RE NS ++S G D+ S
Sbjct: 271 IFNEEASLPIGSEATVNNEADWPSDDSKDDDYDPEMRENGGGNSSNVSGDGGGDNDEESI 330
Query: 371 SLSWFSDSETFSESMRWEMESNGYKNYSVDSSIGSDETSDGEIICGRRQRRTVDYKKLYD 430
S S S+ + + + ++ + + + + ETS+ E +CG RQRRTVDY +LY
Sbjct: 331 STSLSLSSDGVA------LSTGSWEGHRLSNMVEQCETSNEETVCGPRQRRTVDYTQLYY 384
Query: 431 EMFGKDASAFEQLSEDEDWGPAKRRRKEKESDAVNSLMTLYGSEEKYSKV----KTAEVK 486
EMFGKDA EQ SEDEDWGP RR++++ESDA ++L+T+ S +K V + +E
Sbjct: 385 EMFGKDAVLQEQGSEDEDWGPNDRRKRKRESDAGSTLVTMCESSKKDQDVVETLEQSERD 444
Query: 487 KKLPSNAKIRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNA 546
N RR R+P NAVEKLRQVFAE ELPS+ V++ L+KELSL+PEKVNKWFKN
Sbjct: 445 SVSVENKGGRRRMFRLPRNAVEKLRQVFAETELPSKAVRDRLAKELSLDPEKVNKWFKNT 504
Query: 547 RYLALKARKVESARQVSGSPRIS 569
RY+AL+ RK ES +Q S +S
Sbjct: 505 RYMALRNRKTESVKQPGDSKTVS 527
>sp|Q04996|HAT31_ARATH Homeobox protein HAT3.1 OS=Arabidopsis thaliana GN=HAT3.1 PE=2 SV=3
Length = 723
Score = 193 bits (491), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 94/196 (47%), Positives = 130/196 (66%), Gaps = 9/196 (4%)
Query: 135 GRSKKRRK-----EKVELDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRP 189
GR KK+ K + E DE +R++++ RY L ++ EQ+LIDAYS EGWKG S EKIRP
Sbjct: 157 GRPKKKNKTMNKGQVREDDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSSLEKIRP 216
Query: 190 EKELQRAKKQILKCKIGIRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHIICAKCKLREA 249
EKEL+RA K+IL+ K+ IRD + LD+L + G + S+ TDG + E I CAKC ++
Sbjct: 217 EKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAKCGSKDL 276
Query: 250 FPDNDIVLCDGTCNCAFHQKCLDPPLDTE----SNQGWFCKFCECKMEIIESMNAHIGTS 305
DNDI+LCDG C+ FHQ CL+PPL E ++GW C C+CK + ++ +N +GT
Sbjct: 277 SVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLNDSLGTK 336
Query: 306 FSVNSNWQDIFKEEAA 321
FSV+ +W+ IF E AA
Sbjct: 337 FSVSDSWEKIFPEAAA 352
Score = 49.7 bits (117), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 78/179 (43%), Gaps = 41/179 (22%)
Query: 399 VDSSIGSDETSDGEIICGRRQRRTVDYKKLYDEMF-------------------GK---- 435
++S +G D+ G + RR +DYKKLYDE + GK
Sbjct: 501 LESDVGLDDGPAG--VSRRRNVERLDYKKLYDEEYDNVPTSSSDDDDWDKTARMGKEDSE 558
Query: 436 -----DASAFEQLSEDEDWGPAKRRRKEKESDAVNSL-MTLYGSEEKYSKVKTAEVKKKL 489
D +Q S ED K RK K +D ++L M G E + E++K
Sbjct: 559 SEDEGDTVPLKQSSNAEDHTSKKLIRKSKRADKKDTLEMPQEGPGENGG---SGEIEKSS 615
Query: 490 PSNAKIRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNARY 548
S K + P ++L F EN+ P + KE+L+KEL + ++VN WFK+ R+
Sbjct: 616 SSACK------QTDP-KTQRLYISFQENQYPDKATKESLAKELQMTVKQVNNWFKHRRW 667
>sp|P46605|HOX1A_MAIZE Homeobox protein HOX1A OS=Zea mays GN=HOX1A PE=2 SV=1
Length = 719
Score = 186 bits (473), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 85/193 (44%), Positives = 126/193 (65%), Gaps = 4/193 (2%)
Query: 134 KGRSKKRRKEKVELDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRPEKEL 193
K R R K DE S++++R RY+L +M EQ+LI+AY+ EGWK S +KIRPEKEL
Sbjct: 105 KRRKMSRASNKSSTDEFSQIRKRVRYILNRMNYEQSLIEAYASEGWKNQSLDKIRPEKEL 164
Query: 194 QRAKKQILKCKIGIRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHIICAKCKLREAFPDN 253
+RAK +IL+CK+ IR+ R +DSL S G I+ ++ ++G + E I C+ C +A N
Sbjct: 165 ERAKSEILRCKLRIREVFRNIDSLLSKGKIDETLFDSEGEISCEDIFCSTCGSNDATLGN 224
Query: 254 DIVLCDGTCNCAFHQKCLDPPLDTE----SNQGWFCKFCECKMEIIESMNAHIGTSFSVN 309
DI+LCDG C+ FHQ CL+PPL TE ++GW C C+CK++ I+ +N G++ S+
Sbjct: 225 DIILCDGACDRGFHQNCLNPPLRTEDIPMGDEGWLCPACDCKIDCIDLINELHGSNISIE 284
Query: 310 SNWQDIFKEEAAF 322
+W+ +F + AA
Sbjct: 285 DSWEKVFPDAAAM 297
>sp|P48786|PRH_PETCR Pathogenesis-related homeodomain protein OS=Petroselinum crispum
GN=PRH PE=2 SV=1
Length = 1088
Score = 181 bits (460), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 150/502 (29%), Positives = 222/502 (44%), Gaps = 100/502 (19%)
Query: 147 LDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRPEKELQRAKKQILKCKIG 206
+DE R++ RYLL ++K E+N +DAYSGEGWKG S +KI+PEKEL+RAK +I K+
Sbjct: 487 VDEFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLK 546
Query: 207 IRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAF 266
IRD ++LD S G + + + G + E I CAKC ++ NDI+LCDG C+ F
Sbjct: 547 IRDLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGF 606
Query: 267 HQKCLDPPLDTE----SNQGWFCKFCECKMEIIESMNAHIGTSFSVNSNWQDIFK----- 317
HQ CLDPPL E ++GW C CECK++ I+ +N T+ + +W+ +F
Sbjct: 607 HQFCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSWEKVFAEEAAA 666
Query: 318 --------EEAAFPDGCSALLNQ-------EEEWPSDDSEDD--DYNPE-------RREN 353
+ + P S + +E+ DDS D DY E R++N
Sbjct: 667 AASGKNLDDNSGLPSDDSEDDDYDPGGPDLDEKVQGDDSSTDESDYQSESDDMQVIRQKN 726
Query: 354 SCSI------------SRAGTDDDPSSSTSLSWFSDSETFSESM---------RWEMESN 392
S + S TD S+ + SDSE F+ + + S
Sbjct: 727 SRGLPSDDSEDDEYDPSGLVTDQMYKDSSCSDFTSDSEDFTGVFDDYKDTGKAQGPLAST 786
Query: 393 GYKNYSVDSSIGSDETSDGEIICGRRQRRTVDYKKLYDEMFGKDASAFEQLSEDEDWGPA 452
+ + G E D + RRQ ++DYKKL D F K + LS D
Sbjct: 787 PDHVRNNEEGCGHPEQGDTAPLYPRRQVESLDYKKLNDIEFSKMCDILDILSSQLDVIIC 846
Query: 453 KRRRKE---KESDAVNSLMTLYGSEEKYSKVKTAEVKKK------LPSNAKIRRSFHR-- 501
++E SD+ + + S +K + K A ++ L + K R S H
Sbjct: 847 TGNQEEYGNTSSDSSDEDYMVTSSPDKNNSDKEATAMERGRESGDLELDQKARESTHNRR 906
Query: 502 ---------------------MPP--------------NAVEKLRQVFAENELPSRIVKE 526
P +A ++L Q F EN+ P R VKE
Sbjct: 907 YIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSKTLHGEHATQRLLQSFKENQYPQRAVKE 966
Query: 527 NLSKELSLEPEKVNKWFKNARY 548
+L+ EL+L +V+ WF N R+
Sbjct: 967 SLAAELALSVRQVSNWFNNRRW 988
>sp|Q96BD5|PF21A_HUMAN PHD finger protein 21A OS=Homo sapiens GN=PHF21A PE=1 SV=1
Length = 680
Score = 49.3 bits (116), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 8/67 (11%)
Query: 226 SVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFC 285
S +TDG +H + C+ C+ +++CD TC+ +H CLDPPL T W C
Sbjct: 478 SPTSTDGDIHED--FCSVCRK-----SGQLLMCD-TCSRVYHLDCLDPPLKTIPKGMWIC 529
Query: 286 KFCECKM 292
C+ +M
Sbjct: 530 PRCQDQM 536
>sp|P87233|PHF1_SCHPO SWM histone demethylase complex subunit phf1 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=phf1 PE=1 SV=1
Length = 461
Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 15/120 (12%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQG----WFCKFCE---CKM 292
+C+ C+ + N IV CDG CN +HQ C PP+D + Q WFC C+ K
Sbjct: 192 LCSVCQRGHSPLSNRIVFCDG-CNSPYHQLCHHPPIDDATVQDVDAEWFCMKCQYRRAKQ 250
Query: 293 EIIESMNAHIGTSFSVNSNWQDIFKEEAAFPDGCSALLNQEEEWPSDDSEDDDYNPERRE 352
+ M A ++ + + ++ P +L E+ +PS YNP RE
Sbjct: 251 PLETGMTAQ---DLGLSESDKKMYLSSLPTPHLADLILFCEKSYPSLPI----YNPRTRE 303
>sp|Q09908|PHF2_SCHPO SWM histone demethylase complex subunit phf2 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=phf2 PE=1 SV=1
Length = 538
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 5/52 (9%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTE----SNQGWFCKFC 288
C+ C+ ++ P N IV CDG CN FHQ C +P + E N WFC C
Sbjct: 235 CSVCQRLQSPPKNRIVFCDG-CNTPFHQLCHEPYISDELLDSPNGEWFCDDC 285
>sp|Q8NEZ4|MLL3_HUMAN Histone-lysine N-methyltransferase MLL3 OS=Homo sapiens GN=MLL3 PE=1
SV=3
Length = 4911
Score = 47.4 bits (111), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 18/95 (18%)
Query: 237 EHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCE-CKMEII 295
E +C C +A ++LCD C+ ++H CLDPPL T GW CK+C C+
Sbjct: 1006 ECTVCEACG--KATDPGRLLLCDD-CDISYHTYCLDPPLQTVPKGGWKCKWCVWCR---- 1058
Query: 296 ESMNAHIG-TSFSVNSNWQDIFKEEAAFPDGCSAL 329
H G TS + WQ+ + + A C++L
Sbjct: 1059 -----HCGATSAGLRCEWQNNYTQCAP----CASL 1084
Score = 39.3 bits (90), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKME 293
+C CK ++ D+ +++CD TC+ +H CL P + + GW CK C +E
Sbjct: 390 VCQNCK--QSGEDSKMLVCD-TCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIE 440
>sp|Q6ZPK0|PF21A_MOUSE PHD finger protein 21A OS=Mus musculus GN=Phf21a PE=1 SV=2
Length = 659
Score = 47.4 bits (111), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 8/67 (11%)
Query: 226 SVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFC 285
S +TDG +H + C+ C+ +++CD TC+ +H CL+PPL T W C
Sbjct: 458 SPTSTDGDIHED--FCSVCRK-----SGQLLMCD-TCSRVYHLDCLEPPLKTIPKGMWIC 509
Query: 286 KFCECKM 292
C+ +M
Sbjct: 510 PRCQDQM 516
>sp|Q8BRH4|MLL3_MOUSE Histone-lysine N-methyltransferase MLL3 OS=Mus musculus GN=Mll3 PE=2
SV=2
Length = 4903
Score = 47.0 bits (110), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 18/95 (18%)
Query: 237 EHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCE-CKMEII 295
E +C C +A ++LCD C+ ++H CLDPPL T GW CK+C C+
Sbjct: 999 ECTVCEACG--KATDPGRLLLCDD-CDISYHTYCLDPPLQTVPKGGWKCKWCVWCR---- 1051
Query: 296 ESMNAHIG-TSFSVNSNWQDIFKEEAAFPDGCSAL 329
H G TS + WQ+ + + A C++L
Sbjct: 1052 -----HCGATSAGLRCEWQNNYTQCAP----CASL 1077
Score = 38.9 bits (89), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKME 293
+C CK ++ D+ +++CD TC+ +H CL P + + GW CK C +E
Sbjct: 389 VCQNCK--QSGEDSKMLVCD-TCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIE 439
>sp|Q9P1Y6|PHRF1_HUMAN PHD and RING finger domain-containing protein 1 OS=Homo sapiens
GN=PHRF1 PE=1 SV=3
Length = 1649
Score = 46.6 bits (109), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 252 DNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKMEII 295
++ ++LCDG C+ +H +CLDPPL WFC C ++
Sbjct: 195 EDRLLLCDG-CDAGYHMECLDPPLQEVPVDEWFCPECAAPGVVL 237
>sp|A6H619|PHRF1_MOUSE PHD and RING finger domain-containing protein 1 OS=Mus musculus
GN=Phrf1 PE=1 SV=2
Length = 1682
Score = 46.2 bits (108), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 255 IVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
++LCDG C+ +H +CLDPPL WFC C
Sbjct: 200 LLLCDG-CDAGYHMECLDPPLQEVPVDEWFCPEC 232
>sp|Q9HDV4|LID2_SCHPO Lid2 complex component lid2 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=lid2 PE=1 SV=1
Length = 1513
Score = 46.2 bits (108), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 255 IVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
I+LCDG C A+H CLDPPL + + W+C C
Sbjct: 283 ILLCDG-CEAAYHTSCLDPPLTSIPKEDWYCDAC 315
>sp|O97159|CHDM_DROME Chromodomain-helicase-DNA-binding protein Mi-2 homolog
OS=Drosophila melanogaster GN=Mi-2 PE=1 SV=2
Length = 1982
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 6/51 (11%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCEC 290
C CK D +LC +C A+H CL+PPLDT + W C C C
Sbjct: 439 FCRVCK------DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 483
Score = 40.0 bits (92), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 26/54 (48%), Gaps = 7/54 (12%)
Query: 236 HEHI-ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
HEH C C+ +I+LCD TC A+H CL+P LD W C C
Sbjct: 374 HEHQDYCEVCQ-----QGGEIILCD-TCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>sp|Q63625|PHRF1_RAT PHD and RING finger domain-containing protein 1 OS=Rattus
norvegicus GN=Phrf1 PE=1 SV=2
Length = 1685
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 255 IVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
++LCDG C+ +H +CLDPPL WFC C
Sbjct: 203 LLLCDG-CDAGYHMECLDPPLQEVPVDEWFCPEC 235
>sp|Q6PDK2|MLL2_MOUSE Histone-lysine N-methyltransferase MLL2 OS=Mus musculus GN=Mll2 PE=1
SV=2
Length = 5588
Score = 45.4 bits (106), Expect = 0.001, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 237 EHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKME 293
E I+C C +A + ++LCD C+ ++H CLDPPL T GW CK+C M+
Sbjct: 1382 ECIVCEVCG--QASDPSRLLLCD-DCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQ 1435
Score = 38.5 bits (88), Expect = 0.18, Method: Composition-based stats.
Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 5/52 (9%)
Query: 241 CAKCKL----REAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
C +CK+ R+ D+ +++C+ TC+ +H CL PP++ W CK C
Sbjct: 270 CPECKVCQSCRKPGNDSKMLVCE-TCDKGYHTFCLKPPMEDLPAHSWKCKTC 320
Score = 32.7 bits (73), Expect = 8.1, Method: Composition-based stats.
Identities = 18/60 (30%), Positives = 26/60 (43%), Gaps = 3/60 (5%)
Query: 230 TDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCE 289
+DG+ H E CA C+ D++ C +C +H CLD L W C C+
Sbjct: 218 SDGAAHLEEARCAVCEGPGQL--CDLLFCT-SCGHHYHGACLDTALTARKRASWQCPECK 274
>sp|O14686|MLL2_HUMAN Histone-lysine N-methyltransferase MLL2 OS=Homo sapiens GN=MLL2 PE=1
SV=2
Length = 5537
Score = 45.4 bits (106), Expect = 0.001, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 237 EHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKME 293
E I+C C +A + ++LCD C+ ++H CLDPPL T GW CK+C M+
Sbjct: 1426 ECIVCEVCG--QASDPSRLLLCD-DCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQ 1479
Score = 38.5 bits (88), Expect = 0.15, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 241 CAKCKLREAF--PDND--IVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
C +CK+ +A P ND +++C+ TC+ +H CL PP++ W CK C
Sbjct: 270 CPECKVCQACRKPGNDSKMLVCE-TCDKGYHTFCLKPPMEELPAHSWKCKAC 320
>sp|Q8C966|PF21B_MOUSE PHD finger protein 21B OS=Mus musculus GN=Phf21b PE=2 SV=1
Length = 487
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 232 GSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCE 289
G + H+ CA CK R A + C GTC+ A+H CLDPPL T W C C+
Sbjct: 302 GEITHDEF-CAACK-RGA----SLQPC-GTCSGAYHLSCLDPPLKTPPKGLWVCPKCQ 352
>sp|Q14839|CHD4_HUMAN Chromodomain-helicase-DNA-binding protein 4 OS=Homo sapiens GN=CHD4
PE=1 SV=2
Length = 1912
Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 24/55 (43%), Gaps = 6/55 (10%)
Query: 236 HEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCEC 290
H C CK D +LC TC ++H CL+PPL N W C C C
Sbjct: 447 HHMEFCRVCK------DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 29/66 (43%), Gaps = 6/66 (9%)
Query: 224 EGSVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGW 283
E V A DG C C+ +I+LCD TC A+H CLDP ++ W
Sbjct: 356 EEEVTAVDGYETDHQDYCEVCQQ-----GGEIILCD-TCPRAYHMVCLDPDMEKAPEGKW 409
Query: 284 FCKFCE 289
C CE
Sbjct: 410 SCPHCE 415
>sp|Q6PDQ2|CHD4_MOUSE Chromodomain-helicase-DNA-binding protein 4 OS=Mus musculus GN=Chd4
PE=1 SV=1
Length = 1915
Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 24/55 (43%), Gaps = 6/55 (10%)
Query: 236 HEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCEC 290
H C CK D +LC TC ++H CL+PPL N W C C C
Sbjct: 440 HHMEFCRVCK------DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 29/66 (43%), Gaps = 6/66 (9%)
Query: 224 EGSVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGW 283
E V A DG C C+ +I+LCD TC A+H CLDP ++ W
Sbjct: 349 EEEVTAVDGYETDHQDYCEVCQQ-----GGEIILCD-TCPRAYHMVCLDPDMEKAPEGKW 402
Query: 284 FCKFCE 289
C CE
Sbjct: 403 SCPHCE 408
>sp|Q96EK2|PF21B_HUMAN PHD finger protein 21B OS=Homo sapiens GN=PHF21B PE=2 SV=1
Length = 531
Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 30/58 (51%), Gaps = 8/58 (13%)
Query: 234 VHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECK 291
H EH CA CK R A ++ C GTC A+H CL+PPL T W C C+ K
Sbjct: 350 THDEH--CAACK-RGA----NLQPC-GTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQK 399
>sp|G5EBZ4|LE418_CAEEL Protein let-418 OS=Caenorhabditis elegans GN=let-418 PE=1 SV=1
Length = 1829
Score = 44.7 bits (104), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 8/74 (10%)
Query: 218 SSVGCIE-GSVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPL- 275
S CIE G + + C CK E +++LCD +C C+FH C+DPPL
Sbjct: 296 SCAHCIEHGPEVVKEEPAKQNDEFCKICKETE-----NLLLCD-SCVCSFHAYCIDPPLT 349
Query: 276 DTESNQGWFCKFCE 289
+ + W C CE
Sbjct: 350 EVPKEETWSCPRCE 363
Score = 35.4 bits (80), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 6/48 (12%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
C +CK D +++LCD TC A+H C+D ++ W C C
Sbjct: 259 CEECK-----QDGELLLCD-TCPRAYHTVCIDENMEEPPEGDWSCAHC 300
>sp|Q96T23|RSF1_HUMAN Remodeling and spacing factor 1 OS=Homo sapiens GN=RSF1 PE=1 SV=2
Length = 1441
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 7/54 (12%)
Query: 241 CAKCKLREAFPDND--IVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKM 292
C KC L P++ I+LCD +C+ +H CL PPL + WFC C+ K+
Sbjct: 894 CKKCGL----PNHPELILLCD-SCDSGYHTACLRPPLMIIPDGEWFCPPCQHKL 942
>sp|Q69T58|ROC8_ORYSJ Homeobox-leucine zipper protein ROC8 OS=Oryza sativa subsp.
japonica GN=ROC8 PE=2 SV=2
Length = 710
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 30/52 (57%)
Query: 496 RRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNAR 547
R+ +HR P +++L +F E P + LS+EL LEP ++ WF+N R
Sbjct: 16 RKRYHRHTPRQIQQLEAMFKECPHPDENQRAQLSRELGLEPRQIKFWFQNRR 67
>sp|Q9W0T1|NU301_DROME Nucleosome-remodeling factor subunit NURF301 OS=Drosophila
melanogaster GN=E(bx) PE=1 SV=2
Length = 2669
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 52/126 (41%), Gaps = 17/126 (13%)
Query: 170 LIDAYSGEGWKGHSREKIRPEKELQRAKKQILKCK----IGIRDAIRQLDSLSSVGCIEG 225
LID+ + W R + +K R IL GI + + L LS
Sbjct: 268 LIDSIT---WPEVLRSYVESDKTFDRNVFHILSHTEYPYTGIDNRLEVLQFLSDQFLTSN 324
Query: 226 S---VIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQG 282
S V+ +G +H++ C++ D +LC TC +H +C+DPP++ +
Sbjct: 325 SIRDVMLQEGPIHYDD----HCRVCHRLGD---LLCCETCPAVYHLECVDPPMNDVPTED 377
Query: 283 WFCKFC 288
W C C
Sbjct: 378 WQCGLC 383
>sp|A8DZJ1|BAZ1B_XENLA Tyrosine-protein kinase BAZ1B OS=Xenopus laevis GN=baz1b PE=2 SV=2
Length = 1441
Score = 43.5 bits (101), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 43/91 (47%), Gaps = 8/91 (8%)
Query: 200 ILKCKIGIRDAIRQLDSLSSVGCIEGSVIA-TDGSVHHEHIICAKCKLREAFPDNDIVLC 258
+ K K+ IRDA + S + + G + A + E+ C C R+ D+ ++LC
Sbjct: 1116 VEKWKVAIRDA----QTFSRMHVLLGMLDACIKWDMSSENARCKVC--RKKGEDDKLILC 1169
Query: 259 DGTCNCAFHQKCLDPPLDTESNQGWFCKFCE 289
D CN AFH CL P L + W C C+
Sbjct: 1170 D-ECNKAFHLFCLRPVLFNIPDGEWLCPACQ 1199
>sp|Q8BRB7|KAT6B_MOUSE Histone acetyltransferase KAT6B OS=Mus musculus GN=Kat6b PE=2 SV=3
Length = 1872
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECK 291
C+ C+++ DN ++ CD +C+ FH +C DPPL W C+ C K
Sbjct: 273 CSACRVQGKNADN-MLFCD-SCDRGFHMECCDPPLSRMPKGMWICQVCRPK 321
>sp|Q8WYB5|KAT6B_HUMAN Histone acetyltransferase KAT6B OS=Homo sapiens GN=KAT6B PE=1 SV=3
Length = 2073
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECK 291
C+ C+++ DN ++ CD +C+ FH +C DPPL W C+ C K
Sbjct: 272 CSACRVQGRNADN-MLFCD-SCDRGFHMECCDPPLSRMPKGMWICQVCRPK 320
>sp|Q9SGH2|MBD9_ARATH Methyl-CpG-binding domain-containing protein 9 OS=Arabidopsis
thaliana GN=MBD9 PE=2 SV=1
Length = 2176
Score = 43.1 bits (100), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
+C C + + D+ ++LCD TC+ +H CL+PPL + W+C C
Sbjct: 1289 VCKVCGVDKD--DDSVLLCD-TCDAEYHTYCLNPPLIRIPDGNWYCPSC 1334
>sp|Q9LMT8|HDG12_ARATH Homeobox-leucine zipper protein HDG12 OS=Arabidopsis thaliana
GN=HDG12 PE=2 SV=1
Length = 687
Score = 42.7 bits (99), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 28/49 (57%)
Query: 499 FHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNAR 547
FHR P+ +++L F E + P + LS+EL L P ++ WF+N R
Sbjct: 25 FHRHTPHQIQRLESTFNECQHPDEKQRNQLSRELGLAPRQIKFWFQNRR 73
>sp|Q8WML3|KAT6B_MACFA Histone acetyltransferase KAT6B OS=Macaca fascicularis GN=KAT6B
PE=2 SV=1
Length = 1784
Score = 42.7 bits (99), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECK 291
C+ C+++ DN ++ CD +C+ FH +C DPPL W C+ C K
Sbjct: 272 CSACRVQGRNADN-MLFCD-SCDRGFHMECCDPPLSRMPKGMWICQVCRPK 320
>sp|Q12873|CHD3_HUMAN Chromodomain-helicase-DNA-binding protein 3 OS=Homo sapiens GN=CHD3
PE=1 SV=3
Length = 2000
Score = 42.4 bits (98), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 253 NDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCE 289
+I+LCD TC A+H CLDP LD W C CE
Sbjct: 389 GEIILCD-TCPRAYHLVCLDPELDRAPEGKWSCPHCE 424
Score = 42.0 bits (97), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 8/58 (13%)
Query: 235 HHEHIICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCECKM 292
H E+ C CK D +LC C ++H CL+PPL N W C C C +
Sbjct: 455 HMEY--CRVCK------DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPV 504
>sp|Q8TDI0|CHD5_HUMAN Chromodomain-helicase-DNA-binding protein 5 OS=Homo sapiens GN=CHD5
PE=2 SV=1
Length = 1954
Score = 42.4 bits (98), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 22/51 (43%), Gaps = 6/51 (11%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCEC 290
C CK D +LC C ++H CL+PPL N W C C C
Sbjct: 418 FCRVCK------DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 462
Score = 40.0 bits (92), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 254 DIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFCE 289
+I+LCD TC A+H CLDP L+ W C CE
Sbjct: 354 EIILCD-TCPRAYHLVCLDPELEKAPEGKWSCPHCE 388
>sp|A2BIL7|BAZ1B_DANRE Tyrosine-protein kinase BAZ1B OS=Danio rerio GN=baz1b PE=1 SV=2
Length = 1536
Score = 42.0 bits (97), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 10/86 (11%)
Query: 208 RDAIRQLDSLSSVGCIEGSV---IATDGSVHHEHIICAKCKL-REAFPDNDIVLCDGTCN 263
+ A+R+ + S + + G + I D S + A+CK+ R D+ ++LCD CN
Sbjct: 1171 KTAVREAQTFSRMHVLLGMLDACIKWDMSAEN-----ARCKVCRRKGEDDKLILCD-ECN 1224
Query: 264 CAFHQKCLDPPLDTESNQGWFCKFCE 289
AFH CL P L W C C+
Sbjct: 1225 KAFHLFCLRPALYRIPAGEWLCPACQ 1250
>sp|Q09819|YAC5_SCHPO Uncharacterized protein C16C9.05 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPAC16C9.05 PE=1 SV=1
Length = 404
Score = 42.0 bits (97), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 8/53 (15%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTES--NQGWFCKFCECK 291
C+ C R F + C+G C C+FH CL+PPL E+ WFC C K
Sbjct: 120 CSACGGRGLF-----ICCEG-CPCSFHLSCLEPPLTPENIPEGSWFCVTCSIK 166
>sp|Q9M9P4|HDG8_ARATH Homeobox-leucine zipper protein HDG8 OS=Arabidopsis thaliana
GN=HDG8 PE=2 SV=2
Length = 699
Score = 42.0 bits (97), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 29/52 (55%)
Query: 496 RRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNAR 547
+R+ HR P +++L F E P + L +EL LEP+++ WF+N R
Sbjct: 24 KRTCHRHTPQQIQRLEAYFKECPHPDERQRNQLCRELKLEPDQIKFWFQNKR 75
>sp|Q9UIG0|BAZ1B_HUMAN Tyrosine-protein kinase BAZ1B OS=Homo sapiens GN=BAZ1B PE=1 SV=2
Length = 1483
Score = 42.0 bits (97), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 10/86 (11%)
Query: 208 RDAIRQLDSLSSVGCIEGSV---IATDGSVHHEHIICAKCKL-REAFPDNDIVLCDGTCN 263
+ AIR+ + S + + G + I D S + A+CK+ R+ D+ ++LCD CN
Sbjct: 1153 KTAIREAQTFSRMHVLLGMLDACIKWDMSAEN-----ARCKVCRKKGEDDKLILCD-ECN 1206
Query: 264 CAFHQKCLDPPLDTESNQGWFCKFCE 289
AFH CL P L + W C C+
Sbjct: 1207 KAFHLFCLRPALYEVPDGEWQCPACQ 1232
>sp|Q7Y0V9|ROC4_ORYSJ Homeobox-leucine zipper protein ROC4 OS=Oryza sativa subsp.
japonica GN=ROC4 PE=2 SV=2
Length = 813
Score = 41.6 bits (96), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 29/52 (55%)
Query: 496 RRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNAR 547
++ +HR P +++L +F E P + LSK L LEP +V WF+N R
Sbjct: 105 KKRYHRHTPQQIQELEAMFKECPHPDEKQRAELSKRLGLEPRQVKFWFQNRR 156
>sp|Q9Z277|BAZ1B_MOUSE Tyrosine-protein kinase BAZ1B OS=Mus musculus GN=Baz1b PE=1 SV=2
Length = 1479
Score = 41.6 bits (96), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 10/86 (11%)
Query: 208 RDAIRQLDSLSSVGCIEGSV---IATDGSVHHEHIICAKCKL-REAFPDNDIVLCDGTCN 263
+ AIR+ + S + + G + I D S + A+CK+ R+ D+ ++LCD CN
Sbjct: 1153 KTAIREAQTFSRMHVLLGMLDACIKWDMSAEN-----ARCKVCRKKGEDDKLILCD-ECN 1206
Query: 264 CAFHQKCLDPPLDTESNQGWFCKFCE 289
AFH CL P L + W C C+
Sbjct: 1207 KAFHLFCLRPALYEVPDGEWQCPACQ 1232
>sp|P46607|HGL2_ARATH Homeobox-leucine zipper protein GLABRA 2 OS=Arabidopsis thaliana
GN=GL2 PE=2 SV=3
Length = 747
Score = 41.6 bits (96), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 31/57 (54%)
Query: 491 SNAKIRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNAR 547
+N + R+ +HR + + + +F E P ++ LSK+L L P +V WF+N R
Sbjct: 97 TNKRKRKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRR 153
>sp|Q04779|RCO1_YEAST Transcriptional regulatory protein RCO1 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=RCO1 PE=1 SV=1
Length = 684
Score = 41.2 bits (95), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 29/61 (47%), Gaps = 8/61 (13%)
Query: 240 ICAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTES--NQGWFCKFCECKMEIIES 297
C+ C +F LC TC +FH CLDPP+D + W C C+ K+ I S
Sbjct: 262 FCSACNQSGSF------LCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNS 315
Query: 298 M 298
M
Sbjct: 316 M 316
>sp|Q92794|KAT6A_HUMAN Histone acetyltransferase KAT6A OS=Homo sapiens GN=KAT6A PE=1 SV=2
Length = 2004
Score = 41.2 bits (95), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
C+ C+ + DN ++ CD +C+ FH +C DPPL W C+ C
Sbjct: 265 CSSCRDQGKNADN-MLFCD-SCDRGFHMECCDPPLTRMPKGMWICQIC 310
>sp|Q5TKR9|KAT6A_RAT Histone acetyltransferase KAT6A OS=Rattus norvegicus GN=Kat6a PE=2
SV=2
Length = 1998
Score = 40.8 bits (94), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
C+ C+ + DN ++ CD +C+ FH +C DPPL W C+ C
Sbjct: 265 CSSCRDQGKNADN-MLFCD-SCDRGFHMECCDPPLTRMPKGMWICQIC 310
>sp|Q9M2E8|HDG1_ARATH Homeobox-leucine zipper protein HDG1 OS=Arabidopsis thaliana
GN=HDG1 PE=2 SV=1
Length = 808
Score = 40.8 bits (94), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%)
Query: 494 KIRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNAR 547
K ++ +HR P ++ L VF E P + +LS+ L+L+P +V WF+N R
Sbjct: 109 KKKKRYHRHTPKQIQDLESVFKECAHPDEKQRLDLSRRLNLDPRQVKFWFQNRR 162
>sp|E7EZF3|UHRF1_DANRE E3 ubiquitin-protein ligase UHRF1 OS=Danio rerio GN=uhrf1 PE=1 SV=1
Length = 776
Score = 40.8 bits (94), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDT-ESNQGWFCKFC 288
C C +++ PD + LCD C+ AFH CL+PPL T ++ W+C C
Sbjct: 316 CHVCGIKQD-PDKQL-LCD-ECDMAFHTYCLNPPLTTIPDDEDWYCPDC 361
>sp|Q8BZ21|KAT6A_MOUSE Histone acetyltransferase KAT6A OS=Mus musculus GN=Kat6a PE=1 SV=2
Length = 2003
Score = 40.8 bits (94), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 241 CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
C+ C+ + DN ++ CD +C+ FH +C DPPL W C+ C
Sbjct: 265 CSSCRDQGKNADN-MLFCD-SCDRGFHMECCDPPLTRMPKGMWICQIC 310
>sp|Q6EPF0|ROC5_ORYSJ Homeobox-leucine zipper protein ROC5 OS=Oryza sativa subsp.
japonica GN=ROC5 PE=2 SV=1
Length = 804
Score = 40.8 bits (94), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 484 EVKKKLPSNAKIRRS-FHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKW 542
+V+ PSN++ R+ +HR P +++L +F E P + LS+ LSL+ +V W
Sbjct: 86 DVEDAEPSNSRKRKKRYHRHTPQQIQELEALFKECPHPDEKQRAELSRRLSLDARQVKFW 145
Query: 543 FKNAR 547
F+N R
Sbjct: 146 FQNRR 150
>sp|P56916|GSC2_MOUSE Homeobox protein goosecoid-2 OS=Mus musculus GN=Gsc2 PE=2 SV=1
Length = 198
Score = 40.8 bits (94), Expect = 0.035, Method: Composition-based stats.
Identities = 22/79 (27%), Positives = 38/79 (48%)
Query: 494 KIRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNARYLALKA 553
+ RR ++ L +F +N+ P +E L+ + L E+V WFKN R
Sbjct: 119 RTRRHRTIFSEEQLQALEALFVQNQYPDVGTRERLAVRIRLREERVEVWFKNRRAKWRHQ 178
Query: 554 RKVESARQVSGSPRISKES 572
++ S+R + G+ + KES
Sbjct: 179 KRASSSRLLPGTKKTPKES 197
>sp|Q8SVD3|HD3_ENCCU Homeobox protein HD-3 OS=Encephalitozoon cuniculi (strain GB-M1)
GN=HD-3 PE=4 SV=1
Length = 107
Score = 40.4 bits (93), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 30/69 (43%), Gaps = 4/69 (5%)
Query: 501 RMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNARYLALKARKV---- 556
RM L F +N PS +E LSK L + P V WF+N R A KV
Sbjct: 12 RMTAGQTRVLMSFFKDNPFPSTTAREKLSKVLGVGPRTVQIWFQNQRQKARGQAKVSDRE 71
Query: 557 ESARQVSGS 565
E R +GS
Sbjct: 72 EGPRACTGS 80
>sp|Q9S775|PKL_ARATH CHD3-type chromatin-remodeling factor PICKLE OS=Arabidopsis
thaliana GN=PKL PE=1 SV=1
Length = 1384
Score = 40.0 bits (92), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 254 DIVLCDGTCNCAFHQKCLDPPLDTESNQGWFCKFC 288
++V C+ TC AFH KCL PPL S + W C C
Sbjct: 60 NLVSCN-TCTYAFHAKCLVPPLKDASVENWRCPEC 93
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.311 0.127 0.362
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 230,696,796
Number of Sequences: 539616
Number of extensions: 9869919
Number of successful extensions: 36842
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 237
Number of HSP's successfully gapped in prelim test: 452
Number of HSP's that attempted gapping in prelim test: 30875
Number of HSP's gapped (non-prelim): 2652
length of query: 628
length of database: 191,569,459
effective HSP length: 124
effective length of query: 504
effective length of database: 124,657,075
effective search space: 62827165800
effective search space used: 62827165800
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (29.3 bits)