BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000067
(2445 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O23372|ATXR3_ARATH Probable histone-lysine N-methyltransferase ATXR3 OS=Arabidopsis
thaliana GN=ATXR3 PE=1 SV=2
Length = 2335
Score = 2681 bits (6950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1432/2524 (56%), Positives = 1777/2524 (70%), Gaps = 270/2524 (10%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
M DGGVACMPL +IME+ PI +KTT+C GN S KT
Sbjct: 1 MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
+ N + S ++K E+ +N K + S +K+IVK I+KV+ + K+ QK +
Sbjct: 38 TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96
Query: 116 ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
KS G K VENGG G +EVEEG
Sbjct: 97 PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146
Query: 155 ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV------------------ 192
ELGTLK ENGE + P KS Q +IEKGEI+
Sbjct: 147 ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSHLKY 198
Query: 193 ---------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKS 241
FS+ K +G E+ E WR D+IEKGEFIPDRW K + KD++ Y +S
Sbjct: 199 HKGYVERRDFSADKNWKGGKEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRS 258
Query: 242 RR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVR 291
RR Y+Y+ ERTPP G++ ED+Y ++EF SG +R R
Sbjct: 259 RRNGVDREKTWKYEYEYERTPPGGRFVNEDIYHQREF--------------RSGLDRTTR 304
Query: 292 ISSKIVDDEGLYKGEHNNGKNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKS 349
ISSKIV +E L+K E+NN N +EY GNR KRHG + DS +RK+ Y DYGD+ K
Sbjct: 305 ISSKIVIEENLHKNEYNNSSNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKC 364
Query: 350 RRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGR 409
R+LSDD SRS+HS+HYS+HS E+ +R+S S+ SSL+KY +H + S ++ D+HG
Sbjct: 365 RKLSDDC-SRSLHSDHYSQHSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGH 423
Query: 410 SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
SPS SD SPHDR RY+++R DRSPY+RERSPY ++S +A
Sbjct: 424 SPSRSDWSPHDRSRYHENR---------------------DRSPYARERSPYIFEKSSHA 462
Query: 470 REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREAS 529
R++SP DR H RSP +E SP DR+R DR D PN++E + R+R N HRE S
Sbjct: 463 RKRSPRDRRHH--DYRRSPSYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHREIS 520
Query: 530 SKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCES 589
K+G E+R+ + ++ E K K+SN + S SS+KE Q K+ + + ++ EK + C+S
Sbjct: 521 RKSGVRERRDCQTGTE-LEIKHKYKESNGKESTSSSKELQGKNILYNNSLLVEKNSVCDS 579
Query: 590 HKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYLDHC 649
K P ++ KEP QV P EEL SME DMDICDTPPH P +DSS+GKWFYLD+
Sbjct: 580 SKIPVPCATG---KEPVQVGEAPTEELPSMEVDMDICDTPPHEPMASDSSLGKWFYLDYY 636
Query: 650 GMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQ 709
G E GP+RL DLK L+E+G+L SDH IKH D+NRW
Sbjct: 637 GTEHGPARLSDLKALMEQGILFSDHMIKHSDNNRW------------------------- 671
Query: 710 LVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVR 763
LV+PPEA GNLL D DT Q G+ P + + PDG E+ ED ID+R
Sbjct: 672 LVNPPEAPGNLLEDIADTTEAVCIEQGAGDSLPELVSVRTLPDGKEIFVENREDFQIDMR 731
Query: 764 VGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYI 823
V LLDG T+ PG+E ETLGE L+ V+++ VG +P + ++E
Sbjct: 732 VENLLDGRTITPGREFETLGEALKVN---VEFEETRRCVTSEGVVGMFRPMKRAIEEFKS 788
Query: 824 SDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLND 883
D E+ E+ S WFSGRWSCKGGDW R DEA+QDR +KK VLND
Sbjct: 789 DDAYGSESDEIGS--------------WFSGRWSCKGGDWIRQDEASQDRYYKKKIVLND 834
Query: 884 GFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLA 943
GFPLC M KSG+EDPRW+ KDDLYYP S RL+LP WA++ DERN
Sbjct: 835 GFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVVDERNQ------------- 881
Query: 944 AVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAE 1003
RGVK ++L VVR+N+ VVND + +PR+KVR+KER SR AR +++D +R S E
Sbjct: 882 -TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPSRPARPSPASSDSKRESVE 940
Query: 1004 SDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSE 1063
S S S A QDSQG WK+ +NTP+DRLCTVDDLQL +G+W+Y DGAG E+GP SFSE
Sbjct: 941 SHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGDWFYTDGAGQEQGPLSFSE 1000
Query: 1064 LQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQ 1123
LQ LV++G I+ H+SVFRK DK+WVP+T T++ + G+ GL +++Q
Sbjct: 1001 LQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRGKTPALPSACQGLVVSETQ 1060
Query: 1124 DAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
D E + ++NS FH +HPQF+GY RGKLH+LVMK++K+R+F+AAIN+V+D WI+A+Q
Sbjct: 1061 DFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSRDFSAAINDVVDSWIHARQ 1118
Query: 1184 PKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPG 1242
PKKE+E ++Y+ SE ++ KRARL+ ES D E E+ +DE TFEDLCGD +F
Sbjct: 1119 PKKESEKYMYQSSELNSCYTKRARLMAGESGEDSEMEDTQMFQKDELTFEDLCGDLTFNI 1178
Query: 1243 EESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVD 1302
E + S+ WGLLDGH LA VFH LR D+KSLAFAS+TCRHW+A + YK ISRQVD
Sbjct: 1179 EGNRSAGTVGIYWGLLDGHALARVFHMLRYDVKSLAFASMTCRHWKATINSYKDISRQVD 1238
Query: 1303 LSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCG 1362
LSS+GP+CTDS +R +N ++KEK++SI+LVGCTN+T+ MLEEIL+ P +SS+DI GC
Sbjct: 1239 LSSLGPSCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASMLEEILRLHPRISSVDITGCS 1298
Query: 1363 QFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGD 1422
QFG+L + + N++W++ Q +R + + S+IRSLKQ T+ KSKGLG D DDFG+
Sbjct: 1299 QFGDLTVNYKNVSWLRCQNTRSGELH---SRIRSLKQTTD----VAKSKGLGGDTDDFGN 1351
Query: 1423 LKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRME 1482
LKDYF+ V+KRDSANQ FRRSLY+RSK++DAR+SS+ILSRDAR+RRW+IKKSE+GYKR+E
Sbjct: 1352 LKDYFDRVEKRDSANQLFRRSLYKRSKLYDARRSSAILSRDARIRRWAIKKSEHGYKRVE 1411
Query: 1483 EFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNR 1542
EFLASSL+ IM+ NTF+FF KV++IE +MK GYY+SHGL SVK+DISRMCR+AIK
Sbjct: 1412 EFLASSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGLRSVKEDISRMCREAIK---- 1467
Query: 1543 GSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKM 1602
+E+MKSW+D S GL SAT KY KKLSK
Sbjct: 1468 -------------------------------DELMKSWQDGS--GLSSAT-KYNKKLSKT 1493
Query: 1603 VSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDG 1662
V+E+KYM+R++ T NG DYGEYASDREI++RLSKLNRKS S S+TS + S++G
Sbjct: 1494 VAEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKSFSSESDTSSE---LSDNG 1550
Query: 1663 KSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVPPV 1721
KSD+ S+ S ++S+ D RS+GR+++ R FT D+ D +++REWGARMTKASLVPPV
Sbjct: 1551 KSDNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVTEEREWGARMTKASLVPPV 1610
Query: 1722 TRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQ 1781
TRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG EELDMELPEVK+YKPRK
Sbjct: 1611 TRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGIEELDMELPEVKEYKPRKL 1670
Query: 1782 LGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGN 1841
LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FIEDV+LRTLN+QVR FTG+G+
Sbjct: 1671 LGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFIEDVVLRTLNRQVRLFTGSGS 1730
Query: 1842 TPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEG 1901
TPM++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR DDKYV+YRKGLGVVCNKEG
Sbjct: 1731 TPMVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSDDKYVSYRKGLGVVCNKEG 1790
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
GFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N DPAPEFYNIYLERPKGDADGYDLV
Sbjct: 1791 GFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEFYNIYLERPKGDADGYDLV 1850
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I YGEEITFDYNSVTESK
Sbjct: 1851 VVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRAIEYGEEITFDYNSVTESK 1910
Query: 2022 EEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYL 2081
EEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+RH+LMLEAC LNSVSEEDYL
Sbjct: 1911 EEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLERHRLMLEACVLNSVSEEDYL 1970
Query: 2082 ELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEV 2141
ELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL+HNLEEKRKYFSDI L+V
Sbjct: 1971 ELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEILKHNLEEKRKYFSDIHLDV 2030
Query: 2142 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKG 2201
EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK APPP+ERL+PEETVSF+W G
Sbjct: 2031 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAPPPLERLTPEETVSFVWNG 2090
Query: 2202 EGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCT 2261
+GSLV+EL+Q ++PH+EE LN+L+SKI HDPSGS D+ +EL++SLLWLRDE+R+LPCT
Sbjct: 2091 DGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKELQRSLLWLRDEIRDLPCT 2150
Query: 2262 YKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKT 2321
YKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPLDLG KYADKLG ++ YRKT
Sbjct: 2151 YKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPLDLGAKYADKLGESIKEYRKT 2210
Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVR 2381
YGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFYAK QKPS+HRVYGPKTV+
Sbjct: 2211 YGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFYAKAQKPSKHRVYGPKTVK 2270
Query: 2382 FMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL-TGCPLDREMVHWLKHRPAIFQ 2440
M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L LDRE++ WL++R +FQ
Sbjct: 2271 TMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNNSSSLDRELLQWLRNRRHVFQ 2330
Query: 2441 AMWD 2444
A WD
Sbjct: 2331 ATWD 2334
>sp|Q9Y7R4|SET1_SCHPO Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set1 PE=1 SV=1
Length = 920
Score = 82.0 bits (201), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 26/141 (18%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
++D V+E++GE+ IR +N + Y+ GD+ + + V
Sbjct: 803 KNDMVIEYIGEI------------IRQRVADNREKN------YVREGIGDSYLFRIDEDV 844
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
+VDA K N A I HSC PNC A++ V+G +I IY R I +GEE+T+DY +
Sbjct: 845 IVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDY----KFP 900
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
EE + CLCG+ CRG YLN
Sbjct: 901 EEADKIPCLCGAPTCRG-YLN 920
>sp|Q18221|SET2_CAEEL Probable histone-lysine N-methyltransferase set-2 OS=Caenorhabditis
elegans GN=set-2 PE=2 SV=2
Length = 1507
Score = 79.3 bits (194), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
D+ +VE++G+ IRSL + A E I YL R
Sbjct: 1387 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1427
Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
DL V+DA + N+A I HSC+PNC AKV ++G +I IY+ I GEEIT+DY
Sbjct: 1428 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1487
Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG++ CRG YLN
Sbjct: 1488 IED----DKIDCLCGAKTCRG-YLN 1507
>sp|Q4PB36|SET1_USTMA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ustilago
maydis (strain 521 / FGSC 9021) GN=SET1 PE=3 SV=1
Length = 1468
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 28/137 (20%)
Query: 1907 DFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
D V+E++GEV K +E+Q ++ YL R D
Sbjct: 1351 DMVIEYVGEVVRQQVADEREKQYERQGN---------------FSTYLFRVDDD------ 1389
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+VVDA HK N A + H C PNC AK+ ++G +I ++ I GEE+T+DY + +
Sbjct: 1390 LVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSA 1449
Query: 2021 KEEYEASVCLCGSQVCR 2037
+E +A CLCGS CR
Sbjct: 1450 DDE-DAIPCLCGSPGCR 1465
>sp|Q99MY8|ASH1L_MOUSE Histone-lysine N-methyltransferase ASH1L OS=Mus musculus GN=Ash1l
PE=1 SV=3
Length = 2958
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2138 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2174
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2175 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2234
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2235 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2272
>sp|Q9NR48|ASH1L_HUMAN Histone-lysine N-methyltransferase ASH1L OS=Homo sapiens GN=ASH1L
PE=1 SV=2
Length = 2969
Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282
>sp|Q9VYD1|C1716_DROME Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila
melanogaster GN=Set2 PE=1 SV=2
Length = 2313
Score = 75.9 bits (185), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1371 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1421
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1422 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1473
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + + +A C C + CRG
Sbjct: 1474 ITFDYQYLRYGR---DAQRCYCEAANCRG 1499
>sp|Q4I5R3|SET1_GIBZE Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
9075 / NRRL 31084) GN=SET1 PE=3 SV=2
Length = 1263
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE + + I +++N YL+ G + + D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263
>sp|Q2GWF3|SET1_CHAGB Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Chaetomium globosum (strain ATCC 6205 / CBS 148.51 /
DSM 1962 / NBRC 6347 / NRRL 1970) GN=SET1 PE=3 SV=1
Length = 1076
Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 23/141 (16%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1961
+DD ++E++GE E + I L++N YL+ G + + D
Sbjct: 956 KDDMIIEYVGE--------EVRQQIAELRENR----------YLKSGIGSSYLFRIDDNT 997
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 998 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1057
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1058 ST-DRIPCLCGTAACKG-FLN 1076
>sp|O14026|SET2_SCHPO Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set2 PE=1 SV=1
Length = 798
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
D ++ +KG G+ + +D FV E++GEV P K+ ++ +++ + + F
Sbjct: 183 DVFLTEKKGFGL--RADANLPKDTFVYEYIGEVIPEQKFRKR------MRQYDSEGIKHF 234
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L+ KG+ +DA + + A HSCRPNC V ++GI+ R
Sbjct: 235 YFMMLQ--KGE-------YIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRD 285
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
I GEE+TFDYN + +A C CG C G
Sbjct: 286 IIRGEELTFDYNV---DRYGAQAQPCYCGEPCCVG 317
>sp|Q1LY77|SE1BA_DANRE Histone-lysine N-methyltransferase SETD1B-A OS=Danio rerio GN=setd1ba
PE=1 SV=2
Length = 1844
Score = 72.4 bits (176), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1768 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1827
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG++ CRG+
Sbjct: 1828 ----EKIPCLCGAENCRGT 1842
>sp|Q5ABG1|SET1_CANAL Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
albicans (strain SC5314 / ATCC MYA-2876) GN=SET1 PE=3
SV=1
Length = 1040
Score = 72.4 bits (176), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY
Sbjct: 959 DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E+ +E E CLCG+ C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040
>sp|Q03164|MLL1_HUMAN Histone-lysine N-methyltransferase MLL OS=Homo sapiens GN=MLL PE=1
SV=5
Length = 3969
Score = 71.6 bits (174), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969
>sp|P55200|MLL1_MOUSE Histone-lysine N-methyltransferase MLL OS=Mus musculus GN=Mll PE=1
SV=3
Length = 3966
Score = 71.6 bits (174), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3837 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3879
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3880 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3933
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3934 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966
>sp|Q9BYW2|SETD2_HUMAN Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens GN=SETD2
PE=1 SV=3
Length = 2564
Score = 71.2 bits (173), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689
>sp|Q9UPS6|SET1B_HUMAN Histone-lysine N-methyltransferase SETD1B OS=Homo sapiens GN=SETD1B
PE=1 SV=2
Length = 1923
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1847 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1906
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1907 VK----IPCLCGSENCRGT 1921
>sp|Q8X0S9|SET1_NEUCR Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
CBS 708.71 / DSM 1257 / FGSC 987) GN=set-1 PE=3 SV=1
Length = 1313
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1293 IGST-DRIPCLCGTAACKG-FLN 1313
>sp|Q8CFT2|SET1B_MOUSE Histone-lysine N-methyltransferase SETD1B OS=Mus musculus GN=Setd1b
PE=2 SV=2
Length = 1985
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1909 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1968
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1969 VK----IPCLCGSENCRGT 1983
>sp|Q5F3P8|SET1B_CHICK Histone-lysine N-methyltransferase SETD1B OS=Gallus gallus GN=SETD1B
PE=2 SV=1
Length = 2008
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1932 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1991
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1992 VK----IPCLCGSENCRGT 2006
>sp|O96028|NSD2_HUMAN Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens GN=WHSC1 PE=1
SV=1
Length = 1365
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>sp|Q24742|TRX_DROVI Histone-lysine N-methyltransferase trithorax OS=Drosophila virilis
GN=trx PE=3 SV=1
Length = 3828
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3701 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3746
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3747 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3802
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3803 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3828
>sp|Q6BKL7|SET1_DEBHA Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
JCM 1990 / NBRC 0083 / IGC 2968) GN=SET1 PE=3 SV=2
Length = 1088
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
VVDA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 1009 TVVDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKET 1068
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ E CLCG+ C+G YLN
Sbjct: 1069 NDA-ERIRCLCGAPGCKG-YLN 1088
>sp|Q2UMH3|SET1_ASPOR Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
GN=set1 PE=3 SV=1
Length = 1229
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1150 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1209
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1210 DSD-DRIPCLCGSTGCKG-FLN 1229
>sp|Q8BVE8|NSD2_MOUSE Histone-lysine N-methyltransferase NSD2 OS=Mus musculus GN=Whsc1 PE=1
SV=2
Length = 1365
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>sp|Q66J90|SET1B_XENLA Histone-lysine N-methyltransferase SETD1B OS=Xenopus laevis GN=setd1b
PE=2 SV=1
Length = 1938
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1862 TIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIED 1921
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG++ CRG+
Sbjct: 1922 VK----IPCLCGAENCRGT 1936
>sp|Q5B0Y5|SET1_EMENI Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS
112.46 / NRRL 194 / M139) GN=set1 PE=3 SV=1
Length = 1220
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1141 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1200
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1201 DSD-DRIPCLCGSAGCKG-FLN 1220
>sp|Q945S8|ASHH3_ARATH Histone-lysine N-methyltransferase ASHH3 OS=Arabidopsis thaliana
GN=ASHH3 PE=2 SV=2
Length = 363
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 36/187 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +E GE F++E++GEV + + L K FY + R
Sbjct: 127 GSGIVAEEEIEAGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA HK N + I HSC PN + + +DG +IGI+ RGI GE +T
Sbjct: 179 ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR------GSYLNLTGEGAFEKVLKEL--------- 2056
+DY V ++ C CG+ CR S + + AF V EL
Sbjct: 230 YDYQFVQFGADQD----CHCGAVGCRRKLGVKPSKPKIASDEAFNLVAHELAQTLPKVHQ 285
Query: 2057 HGLLDRH 2063
+GL++RH
Sbjct: 286 NGLVNRH 292
>sp|Q1DR06|SET1_COCIM Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Coccidioides immitis (strain RS) GN=SET1 PE=3 SV=1
Length = 1271
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271
>sp|Q6CIT4|SET1_KLULA Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=SET1 PE=3
SV=1
Length = 1000
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 921 TVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERET 980
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 981 -DEGERLPCLCGAPSCKG-FLN 1000
>sp|Q08D57|SET1B_XENTR Histone-lysine N-methyltransferase SETD1B OS=Xenopus tropicalis
GN=setd1b PE=2 SV=1
Length = 1956
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1880 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIED 1939
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG++ CRG+
Sbjct: 1940 VK----IPCLCGAENCRGT 1954
>sp|Q4WNH8|SET1_ASPFU Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
CBS 101355 / FGSC A1100) GN=set1 PE=3 SV=1
Length = 1241
Score = 68.9 bits (167), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241
>sp|P20659|TRX_DROME Histone-lysine N-methyltransferase trithorax OS=Drosophila
melanogaster GN=trx PE=1 SV=4
Length = 3726
Score = 68.9 bits (167), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3599 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3644
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3645 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3700
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3701 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3726
>sp|P38827|SET1_YEAST Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=SET1 PE=1 SV=1
Length = 1080
Score = 68.9 bits (167), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>sp|Q9M1X9|ASHH4_ARATH Putative histone-lysine N-methyltransferase ASHH4 OS=Arabidopsis
thaliana GN=ASHH4 PE=3 SV=1
Length = 352
Score = 68.2 bits (165), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE F++E++GEV + + L K N FY +
Sbjct: 122 GYGIVADEDINSGE--FIIEYVGEV------IDDKICEERLWKLNHKVETNFYLCQINWN 173
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA HK N + I HSC PN E + +DG +IGI+ R I+ GE++T
Sbjct: 174 ---------MVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLT 224
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 225 YDYQFVQFGADQD----CYCGAVCCR 246
>sp|Q6FKB1|SET1_CANGA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC
0622 / NRRL Y-65) GN=SET1 PE=3 SV=1
Length = 1111
Score = 68.2 bits (165), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 1032 TVIDATKKGGIARFINHCCEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERET 1091
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E E CLCG+ C+G +LN
Sbjct: 1092 DAE-ERLPCLCGAPSCKG-FLN 1111
>sp|Q2LAE1|ASHH2_ARATH Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana
GN=ASHH2 PE=1 SV=1
Length = 1759
Score = 68.2 bits (165), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E F++E++GEV + + +Q + + FY + L +G + V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------FYFMTL-------NGNE--VID 1092
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K N I HSC PNC + V+G +GI++++ + G+E+TFDYN V
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150
Query: 2025 EASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164
>sp|Q75D88|SET1_ASHGO Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ashbya
gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 /
NRRL Y-1056) GN=SET1 PE=3 SV=2
Length = 975
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 896 TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 956 DDE-ERLPCLCGAPNCKG-FLN 975
>sp|Q8MT36|MES4_DROME Probable histone-lysine N-methyltransferase Mes-4 OS=Drosophila
melanogaster GN=Mes-4 PE=1 SV=2
Length = 1427
Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V + G DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 1244 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1295
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1346
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG++ C G
Sbjct: 1347 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1372
>sp|Q96L73|NSD1_HUMAN Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific OS=Homo sapiens GN=NSD1 PE=1 SV=1
Length = 2696
Score = 65.5 bits (158), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2068 TVCKCGAPNCSG 2079
>sp|O88491|NSD1_MOUSE Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific OS=Mus musculus GN=Nsd1 PE=1 SV=1
Length = 2588
Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1966 TVCKCGAPNCSG 1977
>sp|Q6P2L6|NSD3_MOUSE Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1
PE=1 SV=2
Length = 1439
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1148 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1199
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1200 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1250
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN +VC CG+ C G +L + + A + E
Sbjct: 1251 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1301
>sp|C6KTD2|HKNMT_PLAF7 Putative histone-lysine N-methyltransferase PFF1440w OS=Plasmodium
falciparum (isolate 3D7) GN=PFF1440w PE=3 SV=1
Length = 6753
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 70/143 (48%), Gaps = 21/143 (14%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----IYLERPKGDAD 1956
G+G + EF+ E PV ++ + IR++ D ++Y+ Y+ R +
Sbjct: 6623 GYGL--YTCEFINEGEPVIEYI--GEYIRNII---SDKREKYYDKIESSCYMFRLNEN-- 6673
Query: 1957 GYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQ-IGIYTVRGIHYGEEITFDYN 2015
+++DA N + I HSC PNC K+ + D + + I I+ R I EEIT+DY
Sbjct: 6674 ----IIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQ 6729
Query: 2016 SVTESKEEYEASVCLCGSQVCRG 2038
ES E + +CLCGS C G
Sbjct: 6730 FGVES--EGKKLICLCGSSTCLG 6750
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.131 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 965,474,865
Number of Sequences: 539616
Number of extensions: 44537966
Number of successful extensions: 298373
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1297
Number of HSP's successfully gapped in prelim test: 1195
Number of HSP's that attempted gapping in prelim test: 127499
Number of HSP's gapped (non-prelim): 54081
length of query: 2445
length of database: 191,569,459
effective HSP length: 134
effective length of query: 2311
effective length of database: 119,260,915
effective search space: 275611974565
effective search space used: 275611974565
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 70 (31.6 bits)