BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000255
(1778 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O23372|ATXR3_ARATH Probable histone-lysine N-methyltransferase ATXR3 OS=Arabidopsis
thaliana GN=ATXR3 PE=1 SV=2
Length = 2335
Score = 2317 bits (6005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1161/1822 (63%), Positives = 1404/1822 (77%), Gaps = 138/1822 (7%)
Query: 1 MEEDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKH 60
ME DMDICDTPPH P +DSS+GKWFYLD+ G E GP+RL DLK L+E+G+L SDH IKH
Sbjct: 606 MEVDMDICDTPPHEPMASDSSLGKWFYLDYYGTEHGPARLSDLKALMEQGILFSDHMIKH 665
Query: 61 LDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTA------QSTG 114
D+NRW LV+PPEA GNLL D DT Q G
Sbjct: 666 SDNNRW-------------------------LVNPPEAPGNLLEDIADTTEAVCIEQGAG 700
Query: 115 EEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGE-------- 166
+ P + + PDG E+ ED ID+RV LLDG T+ PG+E ETLGE
Sbjct: 701 DSLPELVSVRTLPDGKEIFVENREDFQIDMRVENLLDGRTITPGREFETLGEALKVNVEF 760
Query: 167 -------------------------LKSGDKDHWVVCFDSDE---WFSGRWSCKGGDWKR 198
KS D +SDE WFSGRWSCKGGDW R
Sbjct: 761 EETRRCVTSEGVVGMFRPMKRAIEEFKSDD----AYGSESDEIGSWFSGRWSCKGGDWIR 816
Query: 199 NDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACP 258
DEA+QDR +KK VLNDGFPLC M KSG+EDPRW+ KDDLYYP S RL+LP WA++
Sbjct: 817 QDEASQDRYYKKKIVLNDGFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVV 876
Query: 259 DERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSS 318
DERN RGVK ++L VVR+N+ VVND + +PR+KVR+KER S
Sbjct: 877 DERNQ--------------TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPS 922
Query: 319 RSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGE 378
R AR +++D +R S ES S S A QDSQG WK+ +NTP+DRLCTVDDLQL +G+
Sbjct: 923 RPARPSPASSDSKRESVESHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGD 982
Query: 379 WYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHG 438
W+Y DGAG E+GP SFSELQ LV++G I+ H+SVFRK DK+WVP+T T++ + G
Sbjct: 983 WFYTDGAGQEQGPLSFSELQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRG 1042
Query: 439 EKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNR 498
+ GL +++QD E + ++NS FH +HPQF+GY RGKLH+LVMK++K+R
Sbjct: 1043 KTPALPSACQGLVVSETQDFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSR 1100
Query: 499 EFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQT 557
+F+AAIN+V+D WI+A+QPKKE+E ++Y+ SE ++ KRARL+ ES D E E+
Sbjct: 1101 DFSAAINDVVDSWIHARQPKKESEKYMYQSSELNSCYTKRARLMAGESGEDSEMEDTQMF 1160
Query: 558 IQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTC 617
+DE TFEDLCGD +F E + S+ WGLLDGH LA VFH LR D+KSLAFAS+TC
Sbjct: 1161 QKDELTFEDLCGDLTFNIEGNRSAGTVGIYWGLLDGHALARVFHMLRYDVKSLAFASMTC 1220
Query: 618 RHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLE 677
RHW+A + YK ISRQVDLSS+GP+CTDS +R +N ++KEK++SI+LVGCTN+T+ MLE
Sbjct: 1221 RHWKATINSYKDISRQVDLSSLGPSCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASMLE 1280
Query: 678 EILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKS 737
EIL+ P +SS+DI GC QFG+L + + N++W++ Q +R + + S+IRSLKQ T+
Sbjct: 1281 EILRLHPRISSVDITGCSQFGDLTVNYKNVSWLRCQNTRSGELH---SRIRSLKQTTD-- 1335
Query: 738 SSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDA 797
KSKGLG D DDFG+LKDYF+ V+KRDSANQ FRRSLY+RSK++DAR+SS+ILSRDA
Sbjct: 1336 --VAKSKGLGGDTDDFGNLKDYFDRVEKRDSANQLFRRSLYKRSKLYDARRSSAILSRDA 1393
Query: 798 RMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGS 857
R+RRW+IKKSE+GYKR+EEFLASSL+ IM+ NTF+FF KV++IE +MK GYY+SHGL S
Sbjct: 1394 RIRRWAIKKSEHGYKRVEEFLASSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGLRS 1453
Query: 858 VKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDES 917
VK+DISRMCR+AIK +E+MKSW+D S
Sbjct: 1454 VKEDISRMCREAIK-----------------------------------DELMKSWQDGS 1478
Query: 918 PAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKS 977
GL SAT KY KKLSK V+E+KYM+R++ T NG DYGEYASDREI++RLSKLNRKS
Sbjct: 1479 --GLSSAT-KYNKKLSKTVAEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKS 1535
Query: 978 LDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FS 1036
S S+TS + S++GKSD+ S+ S ++S+ D RS+GR+++ R FT D+ D +
Sbjct: 1536 FSSESDTSSE---LSDNGKSDNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVT 1592
Query: 1037 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGS 1096
++REWGARMTKASLVPPVTRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG
Sbjct: 1593 EEREWGARMTKASLVPPVTRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGI 1652
Query: 1097 EELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIED 1156
EELDMELPEVK+YKPRK LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FIED
Sbjct: 1653 EELDMELPEVKEYKPRKLLGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFIED 1712
Query: 1157 VLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPD 1216
V+LRTLN+QVR FTG+G+TPM++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR D
Sbjct: 1713 VVLRTLNRQVRLFTGSGSTPMVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSD 1772
Query: 1217 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
DKYV+YRKGLGVVCNKEGGFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N DPAPEF
Sbjct: 1773 DKYVSYRKGLGVVCNKEGGFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEF 1832
Query: 1277 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 1336
YNIYLERPKGDADGYDLVVVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR
Sbjct: 1833 YNIYLERPKGDADGYDLVVVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRA 1892
Query: 1337 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRH 1396
I YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+RH
Sbjct: 1893 IEYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLERH 1952
Query: 1397 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL 1456
+LMLEAC LNSVSEEDYLELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL
Sbjct: 1953 RLMLEACVLNSVSEEDYLELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEIL 2012
Query: 1457 RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAP 1516
+HNLEEKRKYFSDI L+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK AP
Sbjct: 2013 KHNLEEKRKYFSDIHLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAP 2072
Query: 1517 PPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRE 1576
PP+ERL+PEETVSF+W G+GSLV+EL+Q ++PH+EE LN+L+SKI HDPSGS D+ +E
Sbjct: 2073 PPLERLTPEETVSFVWNGDGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKE 2132
Query: 1577 LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDL 1636
L++SLLWLRDE+R+LPCTYKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPLDL
Sbjct: 2133 LQRSLLWLRDEIRDLPCTYKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPLDL 2192
Query: 1637 GPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFY 1696
G KYADKLG ++ YRKTYGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFY
Sbjct: 2193 GAKYADKLGESIKEYRKTYGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFY 2252
Query: 1697 AKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTG-C 1755
AK QKPS+HRVYGPKTV+ M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L
Sbjct: 2253 AKAQKPSKHRVYGPKTVKTMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNNSS 2312
Query: 1756 PLDREMVHWLKHRPAIFQAMWD 1777
LDRE++ WL++R +FQA WD
Sbjct: 2313 SLDRELLQWLRNRRHVFQATWD 2334
>sp|Q9Y7R4|SET1_SCHPO Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set1 PE=1 SV=1
Length = 920
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 26/141 (18%)
Query: 1238 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1294
++D V+E++GE+ IR +N + Y+ GD+ + + V
Sbjct: 803 KNDMVIEYIGEI------------IRQRVADNREKN------YVREGIGDSYLFRIDEDV 844
Query: 1295 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 1354
+VDA K N A I HSC PNC A++ V+G +I IY R I +GEE+T+DY +
Sbjct: 845 IVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDY----KFP 900
Query: 1355 EEYEASVCLCGSQVCRGSYLN 1375
EE + CLCG+ CRG YLN
Sbjct: 901 EEADKIPCLCGAPTCRG-YLN 920
>sp|Q18221|SET2_CAEEL Probable histone-lysine N-methyltransferase set-2 OS=Caenorhabditis
elegans GN=set-2 PE=2 SV=2
Length = 1507
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)
Query: 1235 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1291
D+ +VE++G+ IRSL + A E I YL R
Sbjct: 1387 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1427
Query: 1292 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 1350
DL V+DA + N+A I HSC+PNC AKV ++G +I IY+ I GEEIT+DY
Sbjct: 1428 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1487
Query: 1351 TESKEEYEASVCLCGSQVCRGSYLN 1375
E + CLCG++ CRG YLN
Sbjct: 1488 IED----DKIDCLCGAKTCRG-YLN 1507
>sp|Q99MY8|ASH1L_MOUSE Histone-lysine N-methyltransferase ASH1L OS=Mus musculus GN=Ash1l
PE=1 SV=3
Length = 2958
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1217 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2138 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2174
Query: 1277 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1331
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2175 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2234
Query: 1332 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 1371
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2235 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2272
>sp|Q9NR48|ASH1L_HUMAN Histone-lysine N-methyltransferase ASH1L OS=Homo sapiens GN=ASH1L
PE=1 SV=2
Length = 2969
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1217 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1277 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1331
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244
Query: 1332 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 1371
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282
>sp|Q9VYD1|C1716_DROME Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila
melanogaster GN=Set2 PE=1 SV=2
Length = 2313
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 78/153 (50%), Gaps = 20/153 (13%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
+ +KG G+ GE F++E++GEV + FE++ + S +N +Y
Sbjct: 1367 FRTEKKGCGITAELLIPPGE--FIMEYVGEVI-DSEEFERRQHLYSKDRNRH-----YYF 1418
Query: 1279 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 1338
+ L +G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I
Sbjct: 1419 MAL---RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQ 1469
Query: 1339 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 1371
GEEITFDY + + +A C C + CRG
Sbjct: 1470 PGEEITFDYQYLRYGR---DAQRCYCEAANCRG 1499
>sp|Q4PB36|SET1_USTMA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ustilago
maydis (strain 521 / FGSC 9021) GN=SET1 PE=3 SV=1
Length = 1468
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 28/137 (20%)
Query: 1240 DFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1293
D V+E++GEV K +E+Q ++ YL R D
Sbjct: 1351 DMVIEYVGEVVRQQVADEREKQYERQGN---------------FSTYLFRVDDD------ 1389
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
+VVDA HK N A + H C PNC AK+ ++G +I ++ I GEE+T+DY + +
Sbjct: 1390 LVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSA 1449
Query: 1354 KEEYEASVCLCGSQVCR 1370
+E +A CLCGS CR
Sbjct: 1450 DDE-DAIPCLCGSPGCR 1465
>sp|Q4I5R3|SET1_GIBZE Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
9075 / NRRL 31084) GN=SET1 PE=3 SV=2
Length = 1263
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1236 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1292
+DD ++E++GE + + I +++N YL+ G + + D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182
Query: 1293 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 1352
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242
Query: 1353 SKEEYEASVCLCGSQVCRGSYLN 1375
+ CLCG+ C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263
>sp|Q1LY77|SE1BA_DANRE Histone-lysine N-methyltransferase SETD1B-A OS=Danio rerio GN=setd1ba
PE=1 SV=2
Length = 1844
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1768 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1827
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
E CLCG++ CRG+
Sbjct: 1828 ----EKIPCLCGAENCRGT 1842
>sp|Q2GWF3|SET1_CHAGB Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Chaetomium globosum (strain ATCC 6205 / CBS 148.51 /
DSM 1962 / NBRC 6347 / NRRL 1970) GN=SET1 PE=3 SV=1
Length = 1076
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 23/141 (16%)
Query: 1238 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1294
+DD ++E++GE E + I L++N YL+ G + + D
Sbjct: 956 KDDMIIEYVGE--------EVRQQIAELRENR----------YLKSGIGSSYLFRIDDNT 997
Query: 1295 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 1354
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 998 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1057
Query: 1355 EEYEASVCLCGSQVCRGSYLN 1375
+ CLCG+ C+G +LN
Sbjct: 1058 ST-DRIPCLCGTAACKG-FLN 1076
>sp|Q5ABG1|SET1_CANAL Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
albicans (strain SC5314 / ATCC MYA-2876) GN=SET1 PE=3
SV=1
Length = 1040
Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 1292 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 1351
D V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY
Sbjct: 959 DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018
Query: 1352 ESKEEYEASVCLCGSQVCRGSYLN 1375
E+ +E E CLCG+ C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040
>sp|Q03164|MLL1_HUMAN Histone-lysine N-methyltransferase MLL OS=Homo sapiens GN=MLL PE=1
SV=5
Length = 3969
Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 37/165 (22%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
Y + G G+ C + GE V+E+ G V IRS+Q + + ++Y
Sbjct: 3834 YRSPIHGRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYY- 3875
Query: 1279 IYLERPKGDADGY--------DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIG 1330
D+ G D VVDA N A I HSC PNC ++V +DG I
Sbjct: 3876 --------DSKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIV 3927
Query: 1331 IYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 1375
I+ +R I+ GEE+T+DY E + C CG++ CR +LN
Sbjct: 3928 IFAMRKIYRGEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969
>sp|O14026|SET2_SCHPO Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set2 PE=1 SV=1
Length = 798
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 20/155 (12%)
Query: 1217 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
D ++ +KG G+ + +D FV E++GEV P K+ ++ +++ + + F
Sbjct: 183 DVFLTEKKGFGL--RADANLPKDTFVYEYIGEVIPEQKFRKR------MRQYDSEGIKHF 234
Query: 1277 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 1336
Y + L+ KG+ +DA + + A HSCRPNC V ++GI+ R
Sbjct: 235 YFMMLQ--KGE-------YIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRD 285
Query: 1337 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 1371
I GEE+TFDYN + +A C CG C G
Sbjct: 286 IIRGEELTFDYNV---DRYGAQAQPCYCGEPCCVG 317
>sp|P55200|MLL1_MOUSE Histone-lysine N-methyltransferase MLL OS=Mus musculus GN=Mll PE=1
SV=3
Length = 3966
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 37/165 (22%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
Y + G G+ C + GE V+E+ G V IRS+Q + + ++Y
Sbjct: 3831 YRSPIHGRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYY- 3872
Query: 1279 IYLERPKGDADGY--------DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIG 1330
D+ G D VVDA N A I HSC PNC ++V +DG I
Sbjct: 3873 --------DSKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIV 3924
Query: 1331 IYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 1375
I+ +R I+ GEE+T+DY E + C CG++ CR +LN
Sbjct: 3925 IFAMRKIYRGEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966
>sp|Q9UPS6|SET1B_HUMAN Histone-lysine N-methyltransferase SETD1B OS=Homo sapiens GN=SETD1B
PE=1 SV=2
Length = 1923
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1847 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1906
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
+ CLCGS+ CRG+
Sbjct: 1907 VK----IPCLCGSENCRGT 1921
>sp|Q9BYW2|SETD2_HUMAN Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens GN=SETD2
PE=1 SV=3
Length = 2564
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1223 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1282
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607
Query: 1283 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 1342
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661
Query: 1343 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 1374
+TFDY K EA C CGS CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689
>sp|Q5F3P8|SET1B_CHICK Histone-lysine N-methyltransferase SETD1B OS=Gallus gallus GN=SETD1B
PE=2 SV=1
Length = 2008
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1932 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1991
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
+ CLCGS+ CRG+
Sbjct: 1992 VK----IPCLCGSENCRGT 2006
>sp|Q8CFT2|SET1B_MOUSE Histone-lysine N-methyltransferase SETD1B OS=Mus musculus GN=Setd1b
PE=2 SV=2
Length = 1985
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1909 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1968
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
+ CLCGS+ CRG+
Sbjct: 1969 VK----IPCLCGSENCRGT 1983
>sp|Q945S8|ASHH3_ARATH Histone-lysine N-methyltransferase ASHH3 OS=Arabidopsis thaliana
GN=ASHH3 PE=2 SV=2
Length = 363
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 36/187 (19%)
Query: 1225 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1284
G G+V +E GE F++E++GEV + + L K FY + R
Sbjct: 127 GSGIVAEEEIEAGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178
Query: 1285 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 1344
+V+DA HK N + I HSC PN + + +DG +IGI+ RGI GE +T
Sbjct: 179 ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229
Query: 1345 FDYNSVTESKEEYEASVCLCGSQVCRG------SYLNLTGEGAFEKVLKEL--------- 1389
+DY V ++ C CG+ CR S + + AF V EL
Sbjct: 230 YDYQFVQFGADQD----CHCGAVGCRRKLGVKPSKPKIASDEAFNLVAHELAQTLPKVHQ 285
Query: 1390 HGLLDRH 1396
+GL++RH
Sbjct: 286 NGLVNRH 292
>sp|Q24742|TRX_DROVI Histone-lysine N-methyltransferase trithorax OS=Drosophila virilis
GN=trx PE=3 SV=1
Length = 3828
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 75/157 (47%), Gaps = 23/157 (14%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
+ ++ G G+ C K+ GE V+E+ GE+ IRS + + +
Sbjct: 3695 FRSHIHGRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 3740
Query: 1279 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 1338
I K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I
Sbjct: 3741 IGCYMFKID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 3796
Query: 1339 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 1375
GEE+T+DY E E C CGS+ CR YLN
Sbjct: 3797 QGEELTYDYKFPFED----EKIPCSCGSKRCR-KYLN 3828
>sp|O96028|NSD2_HUMAN Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens GN=WHSC1 PE=1
SV=1
Length = 1365
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1224 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1283
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1284 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 1343
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 1344 TFDYNSVTESKEEYEASVCLCGSQVCRG 1371
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>sp|Q8BVE8|NSD2_MOUSE Histone-lysine N-methyltransferase NSD2 OS=Mus musculus GN=Whsc1 PE=1
SV=2
Length = 1365
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1224 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1283
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124
Query: 1284 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 1343
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 1344 TFDYNSVTESKEEYEASVCLCGSQVCRG 1371
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>sp|Q6BKL7|SET1_DEBHA Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
JCM 1990 / NBRC 0083 / IGC 2968) GN=SET1 PE=3 SV=2
Length = 1088
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
VVDA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 1009 TVVDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKET 1068
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+ E CLCG+ C+G YLN
Sbjct: 1069 NDA-ERIRCLCGAPGCKG-YLN 1088
>sp|Q2UMH3|SET1_ASPOR Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
GN=set1 PE=3 SV=1
Length = 1229
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1150 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1209
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+ + CLCGS C+G +LN
Sbjct: 1210 DSD-DRIPCLCGSTGCKG-FLN 1229
>sp|Q66J90|SET1B_XENLA Histone-lysine N-methyltransferase SETD1B OS=Xenopus laevis GN=setd1b
PE=2 SV=1
Length = 1938
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1862 TIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIED 1921
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
+ CLCG++ CRG+
Sbjct: 1922 VK----IPCLCGAENCRGT 1936
>sp|Q8X0S9|SET1_NEUCR Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
CBS 708.71 / DSM 1257 / FGSC 987) GN=set-1 PE=3 SV=1
Length = 1313
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1236 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1292
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232
Query: 1293 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 1352
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292
Query: 1353 SKEEYEASVCLCGSQVCRGSYLN 1375
+ CLCG+ C+G +LN
Sbjct: 1293 IG-STDRIPCLCGTAACKG-FLN 1313
>sp|Q1DR06|SET1_COCIM Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Coccidioides immitis (strain RS) GN=SET1 PE=3 SV=1
Length = 1271
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+ + CLCGS C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271
>sp|Q5B0Y5|SET1_EMENI Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS
112.46 / NRRL 194 / M139) GN=set1 PE=3 SV=1
Length = 1220
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1141 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1200
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+ + CLCGS C+G +LN
Sbjct: 1201 DSD-DRIPCLCGSAGCKG-FLN 1220
>sp|Q08D57|SET1B_XENTR Histone-lysine N-methyltransferase SETD1B OS=Xenopus tropicalis
GN=setd1b PE=2 SV=1
Length = 1956
Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1880 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIED 1939
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
+ CLCG++ CRG+
Sbjct: 1940 VK----IPCLCGAENCRGT 1954
>sp|Q6CIT4|SET1_KLULA Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=SET1 PE=3
SV=1
Length = 1000
Score = 68.9 bits (167), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA + A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 921 TVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERET 980
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+E E CLCG+ C+G +LN
Sbjct: 981 -DEGERLPCLCGAPSCKG-FLN 1000
>sp|P20659|TRX_DROME Histone-lysine N-methyltransferase trithorax OS=Drosophila
melanogaster GN=trx PE=1 SV=4
Length = 3726
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 74/157 (47%), Gaps = 23/157 (14%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
+ ++ G G+ C K+ GE V+E+ GE+ IRS + + +
Sbjct: 3593 FRSHIHGRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 3638
Query: 1279 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 1338
I K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I
Sbjct: 3639 IGCYMFKID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIV 3694
Query: 1339 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 1375
GEE+T+DY E E C CGS+ CR YLN
Sbjct: 3695 QGEELTYDYKFPFED----EKIPCSCGSKRCR-KYLN 3726
>sp|Q9M1X9|ASHH4_ARATH Putative histone-lysine N-methyltransferase ASHH4 OS=Arabidopsis
thaliana GN=ASHH4 PE=3 SV=1
Length = 352
Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)
Query: 1225 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1284
G G+V +++ GE F++E++GEV + + L K N FY +
Sbjct: 122 GYGIVADEDINSGE--FIIEYVGEV------IDDKICEERLWKLNHKVETNFYLCQINWN 173
Query: 1285 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 1344
+V+DA HK N + I HSC PN E + +DG +IGI+ R I+ GE++T
Sbjct: 174 ---------MVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLT 224
Query: 1345 FDYNSVTESKEEYEASVCLCGSQVCR 1370
+DY V ++ C CG+ CR
Sbjct: 225 YDYQFVQFGADQD----CYCGAVCCR 246
>sp|Q4WNH8|SET1_ASPFU Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
CBS 101355 / FGSC A1100) GN=set1 PE=3 SV=1
Length = 1241
Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+ + CLCGS C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241
>sp|P38827|SET1_YEAST Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=SET1 PE=1 SV=1
Length = 1080
Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFERE- 1059
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
K++ E CLCG+ C+G +LN
Sbjct: 1060 KDDEERLPCLCGAPNCKG-FLN 1080
>sp|Q6CEK8|SET1_YARLI Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Yarrowia
lipolytica (strain CLIB 122 / E 150) GN=SET1 PE=3 SV=1
Length = 1170
Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 45/82 (54%), Gaps = 3/82 (3%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
VVDA + A I H C P+C AK+ V+G +I IY R I EE+T+DY E
Sbjct: 1092 TVVDATKRGGIARFINHCCTPSCTAKIIKVEGQKRIVIYASRDIAANEELTYDYKFEKEI 1151
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
E E CLCG+ C+G YLN
Sbjct: 1152 GE--ERIPCLCGAPGCKG-YLN 1170
>sp|Q2LAE1|ASHH2_ARATH Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana
GN=ASHH2 PE=1 SV=1
Length = 1759
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 77/155 (49%), Gaps = 19/155 (12%)
Query: 1217 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
+++ + +KG G+ ++ E F++E++GEV + + +Q + + F
Sbjct: 1029 ERFQSGKKGYGLRLLED--VREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------F 1080
Query: 1277 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 1336
Y + L +G + V+DA K N I HSC PNC + V+G +GI++++
Sbjct: 1081 YFMTL-------NGNE--VIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQD 1131
Query: 1337 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 1371
+ G+E+TFDYN V A C CGS CRG
Sbjct: 1132 LKKGQELTFDYNYVRVFG--AAAKKCYCGSSHCRG 1164
>sp|P0CO26|SET1_CRYNJ Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Cryptococcus neoformans var. neoformans serotype D
(strain JEC21 / ATCC MYA-565) GN=SET1 PE=3 SV=1
Length = 1469
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
+V DA K + + I HSC P+ AK+ V+G +I IY R ++ GEEI +DY ES
Sbjct: 1390 IVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTLYPGEEILYDYKFPLES 1449
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
CLCG+ CRG +LN
Sbjct: 1450 DPALRVP-CLCGAATCRG-WLN 1469
>sp|P0CO27|SET1_CRYNB Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Cryptococcus neoformans var. neoformans serotype D
(strain B-3501A) GN=SET1 PE=3 SV=1
Length = 1469
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
+V DA K + + I HSC P+ AK+ V+G +I IY R ++ GEEI +DY ES
Sbjct: 1390 IVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTLYPGEEILYDYKFPLES 1449
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
CLCG+ CRG +LN
Sbjct: 1450 DPALRVP-CLCGAATCRG-WLN 1469
>sp|Q4PBL3|SET2_USTMA Histone-lysine N-methyltransferase, H3 lysine-36 specific OS=Ustilago
maydis (strain 521 / FGSC 9021) GN=SET2 PE=3 SV=1
Length = 972
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 108/259 (41%), Gaps = 40/259 (15%)
Query: 1228 VVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1279
+V ++ GFG ++ F+ E++GEV + ++ +Q+ + FY +
Sbjct: 250 IVQTEKKGFGLRACQDIPKETFIYEYVGEVMNQTTFLQR------MQQYRIEGIRHFYFM 303
Query: 1280 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 1339
L+ + Y +DA K I HSC PNC V H ++GI+ R I
Sbjct: 304 MLQ-----PNEY----LDATKKGGKGRFINHSCNPNCAVSKWQVGKHLRMGIFAKRNIQK 354
Query: 1340 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLM 1399
GEE+TF+YN + +A C CG C G+ L G K +L G+ D L
Sbjct: 355 GEELTFNYNV---DRYGNDAQECFCGEPNCVGT---LGG-----KTQTDLSGMDD---LF 400
Query: 1400 LEACELNSVSEEDYLELGRAGLGSCL-LGGLPNWVVAYSARLVRFINLERTKLPEEILRH 1458
L+A ++ E+ + R G L L +P R + R P+ +
Sbjct: 401 LDALGISDEVEQTEAKGSRRKRGKRLDLDFIPQMRAIQEHEATRVMTAARQAGPKREILE 460
Query: 1459 NLEEKRKYFSDICLEVEKS 1477
L + + +D+ V+KS
Sbjct: 461 KLLRRMEMTTDV--NVQKS 477
>sp|Q6FKB1|SET1_CANGA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC
0622 / NRRL Y-65) GN=SET1 PE=3 SV=1
Length = 1111
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 1032 TVIDATKKGGIARFINHCCEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERET 1091
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
E E CLCG+ C+G +LN
Sbjct: 1092 DAE-ERLPCLCGAPSCKG-FLN 1111
>sp|Q8MT36|MES4_DROME Probable histone-lysine N-methyltransferase Mes-4 OS=Drosophila
melanogaster GN=Mes-4 PE=1 SV=2
Length = 1427
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)
Query: 1224 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1283
+G G+V + G DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 1244 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1295
Query: 1284 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 1343
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1346
Query: 1344 TFDY---NSVTESKEEYEASVCLCGSQVCRG 1371
TF+Y + + SK+ C CG++ C G
Sbjct: 1347 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1372
>sp|Q4IB50|SET2_GIBZE Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
9075 / NRRL 31084) GN=SET2 PE=3 SV=2
Length = 911
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 69/153 (45%), Gaps = 28/153 (18%)
Query: 1223 RKGLGVVCNKEGGFGEDDFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIY 1280
+KG G+ + + +DFV E++GEV P ++ R + + +E+ FY +
Sbjct: 175 KKGFGLRADSD--LQPNDFVFEYIGEVINEPTFR--------RRMIQYDEEGIKHFYFMS 224
Query: 1281 LERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYG 1340
L + + VDA K NY HSC PNC V ++GI+T R I G
Sbjct: 225 LNKSE---------FVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSG 275
Query: 1341 EEITFDYNSVTESKEEYEA--SVCLCGSQVCRG 1371
EE+ F+YN + Y A C CG C G
Sbjct: 276 EELVFNYNV-----DRYGADPQPCYCGEPNCVG 303
>sp|Q75D88|SET1_ASHGO Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ashbya
gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 /
NRRL Y-1056) GN=SET1 PE=3 SV=2
Length = 975
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 896 TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955
Query: 1354 KEEYEASVCLCGSQVCRGSYLN 1375
+E E CLCG+ C+G +LN
Sbjct: 956 DDE-ERLPCLCGAPNCKG-FLN 975
>sp|O08550|MLL4_MOUSE Histone-lysine N-methyltransferase MLL4 OS=Mus musculus GN=Wbp7 PE=1
SV=3
Length = 2713
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 76/162 (46%), Gaps = 31/162 (19%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
Y + G G+ C + GE V+E+ G V IRS+ + + +FY+
Sbjct: 2578 YRSAIHGRGLFCKRNIDAGE--MVIEYSGIV------------IRSVLTDKRE---KFYD 2620
Query: 1279 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 1333
Y+ R D +D VVDA N A I HSC PNC ++V V+G I I+
Sbjct: 2621 GKGIGCYMFR----MDDFD--VVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFA 2674
Query: 1334 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 1375
+R I GEE+T+DY E + C CG++ CR +LN
Sbjct: 2675 LRRILRGEELTYDYKFPIE--DASNKLPCNCGAKRCR-RFLN 2713
>sp|O15047|SET1A_HUMAN Histone-lysine N-methyltransferase SETD1A OS=Homo sapiens GN=SETD1A
PE=1 SV=3
Length = 1707
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 4/79 (5%)
Query: 1294 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 1353
++DA N A I H C PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1631 TIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLED 1690
Query: 1354 KEEYEASVCLCGSQVCRGS 1372
+ CLCG++ CRGS
Sbjct: 1691 NK----IPCLCGTESCRGS 1705
>sp|Q54HS3|SET1_DICDI Histone-lysine N-methyltransferase set1 OS=Dictyostelium discoideum
GN=set1 PE=1 SV=1
Length = 1486
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 1292 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 1351
D ++DA K N A I H C PNC AKV + +I IY R I+ GEEIT+DY
Sbjct: 1408 DDTIIDATFKGNLARFINHCCDPNCIAKVLTIGNQKKIIIYAKRDINIGEEITYDYKFPI 1467
Query: 1352 ESKEEYEASVCLCGSQVCRGS 1372
E + CLC S CR +
Sbjct: 1468 EDVK----IPCLCKSPKCRQT 1484
>sp|P0CO28|SET2_CRYNJ Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Cryptococcus neoformans var. neoformans serotype D
(strain JEC21 / ATCC MYA-565) GN=SET2 PE=3 SV=1
Length = 834
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 72/156 (46%), Gaps = 26/156 (16%)
Query: 1225 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
+ VV ++ G+G + + E++GEV K F K+ +Q+ ++ F
Sbjct: 182 NVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV-AEKTFRKR-----MQQYADEGIRHF 235
Query: 1277 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 1336
Y + L++ + +DA K HSC PNCE + V ++GI+T R
Sbjct: 236 YFMMLQKEE---------YIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286
Query: 1337 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGS 1372
+ GEEITF+YN + ++A C CG C G+
Sbjct: 287 VIKGEEITFNYNV---DRYGHDAQTCYCGEPNCVGT 319
>sp|P0CO29|SET2_CRYNB Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Cryptococcus neoformans var. neoformans serotype D
(strain B-3501A) GN=SET2 PE=3 SV=1
Length = 834
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 72/156 (46%), Gaps = 26/156 (16%)
Query: 1225 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1276
+ VV ++ G+G + + E++GEV K F K+ +Q+ ++ F
Sbjct: 182 NVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV-AEKTFRKR-----MQQYADEGIRHF 235
Query: 1277 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 1336
Y + L++ + +DA K HSC PNCE + V ++GI+T R
Sbjct: 236 YFMMLQKEE---------YIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286
Query: 1337 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGS 1372
+ GEEITF+YN + ++A C CG C G+
Sbjct: 287 VIKGEEITFNYNV---DRYGHDAQTCYCGEPNCVGT 319
>sp|Q9UMN6|MLL4_HUMAN Histone-lysine N-methyltransferase MLL4 OS=Homo sapiens GN=WBP7 PE=1
SV=1
Length = 2715
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 76/162 (46%), Gaps = 31/162 (19%)
Query: 1219 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1278
Y + G G+ C + GE V+E+ G V IRS+ + + +FY+
Sbjct: 2580 YRSAIHGRGLFCKRNIDAGE--MVIEYSGIV------------IRSVLTDKRE---KFYD 2622
Query: 1279 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 1333
Y+ R D +D VVDA N A I HSC PNC ++V V+G I I+
Sbjct: 2623 GKGIGCYMFR----MDDFD--VVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFA 2676
Query: 1334 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 1375
+R I GEE+T+DY E + C CG++ CR +LN
Sbjct: 2677 LRRILRGEELTYDYKFPIE--DASNKLPCNCGAKRCR-RFLN 2715
>sp|Q6C5G5|SET2_YARLI Histone-lysine N-methyltransferase, H3 lysine-36 specific OS=Yarrowia
lipolytica (strain CLIB 122 / E 150) GN=set-2 PE=3 SV=1
Length = 768
Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 75/173 (43%), Gaps = 24/173 (13%)
Query: 1223 RKGLGVVCNKEGGFGEDDFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIY 1280
+KG G+ K+ GE FV E++GEV P F+++ I + Q FY +
Sbjct: 101 KKGFGLRATKDIAAGE--FVYEYVGEVIDEPT---FKERTAIYTTQG-----VKHFYFMM 150
Query: 1281 LERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYG 1340
L+ KG+ +DA K HSC PN + V ++GI+ R I G
Sbjct: 151 LQ--KGE-------FIDATAKGGLGRFCNHSCAPNGHVEKWVVGKRLRMGIFASRHIQRG 201
Query: 1341 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 1393
EE+TFDYN + EA C CG + C G T + KV L L
Sbjct: 202 EEVTFDYNV---DRYGAEAQACYCGEKNCVGFLGGKTQTESASKVSGTLTAAL 251
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.133 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 688,810,665
Number of Sequences: 539616
Number of extensions: 30807251
Number of successful extensions: 96599
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 123
Number of HSP's successfully gapped in prelim test: 260
Number of HSP's that attempted gapping in prelim test: 89797
Number of HSP's gapped (non-prelim): 2565
length of query: 1778
length of database: 191,569,459
effective HSP length: 132
effective length of query: 1646
effective length of database: 120,340,147
effective search space: 198079881962
effective search space used: 198079881962
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)