BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038692
(443 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q949T8|ASHR3_ARATH Histone-lysine N-methyltransferase ASHR3 OS=Arabidopsis thaliana
GN=ASHR3 PE=1 SV=1
Length = 497
Score = 524 bits (1350), Expect = e-148, Method: Compositional matrix adjust.
Identities = 276/488 (56%), Positives = 326/488 (66%), Gaps = 60/488 (12%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPHSA--TESPGSDSAVV--KTLALTGEEEN 56
M DL N+++S+S + LT C S L + P A SP + + K L + EE
Sbjct: 1 MLDLGNMSMSAS---VALTCCPSFLPAASGPELAKSINSPENLAGECNGKHLPMIPPEEE 57
Query: 57 V----CANGNGHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGAKK 112
V ANG R K L+DHV WVK+++ GV +S C LPFLVGAKK
Sbjct: 58 VKDIKIANGVTAFTRKQNPSDRVKKGFVLDDHVKDWVKRRVASGVSESTCFLPFLVGAKK 117
Query: 113 MIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHACFICRQRLQ 172
M++C CH+ +Y GE++ CSVRGC G YH +C KE LG S FKCPQH CF+C+QR Q
Sbjct: 118 MVDCLVCHKPVYPGEDLSCSVRGCQGAYHSLCAKESLGFSKSSKFKCPQHECFVCKQRTQ 177
Query: 173 WRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK---------QEVFCRL 223
WRCV+C +A+HDK +PW ++HLKDQPGRAVCWRHP W LD +EVFC+L
Sbjct: 178 WRCVKCPMAAHDKHSPWSKEILHLKDQPGRAVCWRHPTDWRLDTKHAVAQSEIEEVFCQL 237
Query: 224 PLPYADEEFKIDLTWKDLMENKVGPPP--------------------------------- 250
PLPY +EEFKIDL WKD + K PP
Sbjct: 238 PLPYVEEEFKIDLAWKDSVV-KEDPPSYVHIRRNIYLVKKKRDNANDGVGCTNCGPNCDR 296
Query: 251 -----VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
VQ ISCSK C C E+C NRPFRKEKKIKIVKTE CGWGVEAAE INK +FI+EYI
Sbjct: 297 SCVCRVQCISCSKGCSCPESCGNRPFRKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYI 356
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GEVI DA CEQRLWDMK++G+++FYMCEI+KDFTIDATFKGN SRFLNHSC+PNC+LEKW
Sbjct: 357 GEVISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKW 416
Query: 366 QVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGK-LEL 424
QVEGETRVGVFAAR I+AGEPLTYDYRFVQFGPEVKC CG+ +CQGYLGTKRK L +
Sbjct: 417 QVEGETRVGVFAARQIEAGEPLTYDYRFVQFGPEVKCNCGSENCQGYLGTKRKEPNCLVV 476
Query: 425 CWGSKRKR 432
WG+KR+R
Sbjct: 477 SWGAKRRR 484
>sp|Q945S8|ASHH3_ARATH Histone-lysine N-methyltransferase ASHH3 OS=Arabidopsis thaliana
GN=ASHH3 PE=2 SV=2
Length = 363
Score = 202 bits (515), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 97/176 (55%), Positives = 121/176 (68%), Gaps = 15/176 (8%)
Query: 256 CSKACHCS-------------ETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEF 300
C CHC CNN+PF R KK+K+++TE CG G+ A E I GEF
Sbjct: 82 CGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKLIQTEKCGSGIVAEEEIEAGEF 141
Query: 301 IIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNC 360
IIEY+GEVIDD CE+RLW MK+RG NFY+CEI +D IDAT KGN SR++NHSC+PN
Sbjct: 142 IIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMVIDATHKGNKSRYINHSCNPNT 201
Query: 361 ILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
++KW ++GETR+G+FA R IK GE LTYDY+FVQFG + C+CGA C+ LG K
Sbjct: 202 QMQKWIIDGETRIGIFATRGIKKGEHLTYDYQFVQFGADQDCHCGAVGCRRKLGVK 257
>sp|Q9M1X9|ASHH4_ARATH Putative histone-lysine N-methyltransferase ASHH4 OS=Arabidopsis
thaliana GN=ASHH4 PE=3 SV=1
Length = 352
Score = 192 bits (488), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 114/151 (75%), Gaps = 2/151 (1%)
Query: 268 NRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N+PF++ KK+K+V+TE CG+G+ A E IN GEFIIEY+GEVIDD +CE+RLW + ++
Sbjct: 102 NKPFQQRHIKKMKLVQTEKCGYGIVADEDINSGEFIIEYVGEVIDDKICEERLWKLNHKV 161
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
NFY+C+I + IDAT KGN SR++NHSC PN ++KW ++GETR+G+FA R I GE
Sbjct: 162 ETNFYLCQINWNMVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGE 221
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
LTYDY+FVQFG + CYCGA C+ LG K
Sbjct: 222 QLTYDYQFVQFGADQDCYCGAVCCRKKLGAK 252
>sp|O96028|NSD2_HUMAN Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens GN=WHSC1 PE=1
SV=1
Length = 1365
Score = 147 bits (371), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>sp|Q9BYW2|SETD2_HUMAN Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens GN=SETD2
PE=1 SV=3
Length = 2564
Score = 147 bits (371), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1527 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1586
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1587 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1646
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1647 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1706
Query: 431 KRS 433
+RS
Sbjct: 1707 ERS 1709
>sp|Q9BZ95|NSD3_HUMAN Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens GN=WHSC1L1
PE=1 SV=1
Length = 1437
Score = 147 bits (371), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.4 bits (98), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>sp|Q8BVE8|NSD2_MOUSE Histone-lysine N-methyltransferase NSD2 OS=Mus musculus GN=Whsc1 PE=1
SV=2
Length = 1365
Score = 147 bits (370), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>sp|Q6P2L6|NSD3_MOUSE Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1
PE=1 SV=2
Length = 1439
Score = 145 bits (365), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1123 QYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELI 1182
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1183 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1242
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 1243 DVRVGLFALCDIPAGMELTFNYNLDCLGNGRTVCHCGADNCSGFLGVRPK 1292
Score = 40.8 bits (94), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>sp|Q59XV0|SET2_CANAL Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
GN=SET2 PE=3 SV=1
Length = 844
Score = 144 bits (363), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ C C + C N+ F R+ K+K+++TE G+G+ A + I + +FI EYIGEVID+
Sbjct: 121 VNRNCLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEIS 180
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
QR+ + R +++FY + D IDAT KG+ RF+NHSC+PN ++KW V R+
Sbjct: 181 FRQRMIEYDLRHLKHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRM 240
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 241 GIFAKRKISRGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 284
>sp|Q2LAE1|ASHH2_ARATH Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana
GN=ASHH2 PE=1 SV=1
Length = 1759
Score = 142 bits (358), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 102/160 (63%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + C+N+ F+K K +K + + G+G+ E + +G+F+IEY+GEV+D E
Sbjct: 1008 TCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYET 1067
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + ++G ++FY + + IDA KGN RF+NHSC+PNC EKW V GE VG+F
Sbjct: 1068 RQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIF 1127
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
+ + +K G+ LT+DY +V+ FG KCYCG+S C+GY+G
Sbjct: 1128 SMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIG 1167
>sp|Q6BM04|SET2_DEBHA Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
JCM 1990 / NBRC 0083 / IGC 2968) GN=SET2 PE=3 SV=2
Length = 731
Score = 142 bits (357), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 109/170 (64%), Gaps = 4/170 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V + CS K C C C N+ F+K++ + +++TE G+G+ A E I++ FI EYIGE
Sbjct: 83 VTSVECSNKFCTCGNDCQNQRFQKKQYANVTVIQTELKGYGLRANEDISESSFIYEYIGE 142
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+ +R+ D + + +FY ++KD IDAT KG+ +RF NHSC+PN ++KW V
Sbjct: 143 VIDEESFRKRMIDYDTKKLIHFYFMMLKKDSFIDATMKGSLARFCNHSCNPNAYVDKWVV 202
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
+ R+G+F+ R+I+ GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 203 GEKLRMGIFSKRNIQKGEEITFDYNVDRYGAQSQPCYCGEPNCIKWMGGK 252
>sp|Q7RZU4|SET2_NEUCR Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
CBS 708.71 / DSM 1257 / FGSC 987) GN=set-2 PE=3 SV=1
Length = 954
Score = 142 bits (357), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ + ++KTE G+G+ A + +FI EYIGEVI++ R
Sbjct: 165 CNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDFIFEYIGEVINEPTFRSR 224
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSCDPNC ++KW V + R+G+FA
Sbjct: 225 MVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNCYVDKWVVGDKLRMGIFA 284
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IKAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 285 GRAIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIGGK 324
>sp|Q96L73|NSD1_HUMAN Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific OS=Homo sapiens GN=NSD1 PE=1 SV=1
Length = 2696
Score = 142 bits (357), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1925 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1984
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1985 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2044
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2045 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2086
Score = 33.9 bits (76), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 75/181 (41%), Gaps = 32/181 (17%)
Query: 33 SATESPGSDSAVVKTLALTGEEENVCANGNGHS--VRVMKRCRGAKNI-SGLEDHVAAWV 89
S+ E PGS+ ++ T +E V G H + K+ +G + + L+++V
Sbjct: 1491 SSKEIPGSEGELMPHRTATSPKETV-EEGVEHDPGMPASKKMQGERGGGAALKENVCQNC 1549
Query: 90 KKKMELGVPQSNCSLPFLVGAKKMIE-------CRACHRFIY-------HGEEV-FCSVR 134
+K EL + ++ C F + + E C C I+ GE+V C +
Sbjct: 1550 EKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLP 1609
Query: 135 GCGGVYHFICV-KERLGISNPRNFKCPQHACFIC-----------RQRLQWRCVRCTIAS 182
CG YH CV K + + F+C H C C + RL RCVRC +A
Sbjct: 1610 LCGKFYHEECVQKYPPTVMQNKGFRCSLHICITCHAANPANVSASKGRL-MRCVRCPVAY 1668
Query: 183 H 183
H
Sbjct: 1669 H 1669
>sp|O88491|NSD1_MOUSE Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific OS=Mus musculus GN=Nsd1 PE=1 SV=1
Length = 2588
Score = 141 bits (356), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 96/164 (58%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
C C C ++ R+ ++I +T GWG+ I KGEF+ EY+GE+ID+ C
Sbjct: 1823 CPAGVRCQNQCFSK--RQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECR 1880
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+
Sbjct: 1881 ARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGL 1940
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1941 FALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1984
Score = 40.4 bits (93), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 21/88 (23%)
Query: 116 CRACHRFIY-------HGEEV-FCSVRGCGGVYHFICV-KERLGISNPRNFKCPQHACFI 166
C CH I+ GE+V C + CG YH CV K ++ + F+CP H C
Sbjct: 1481 CNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVTQNKGFRCPLHICIT 1540
Query: 167 C-----------RQRLQWRCVRCTIASH 183
C + RL RCVRC +A H
Sbjct: 1541 CHAANPANVSASKGRL-MRCVRCPVAYH 1567
>sp|Q1DU03|SET2_COCIM Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Coccidioides immitis (strain RS) GN=SET2 PE=3 SV=2
Length = 1011
Score = 140 bits (354), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C ++C N+ F++ + K+ ++KTE G+G+ A + EFI EYIGEVI++
Sbjct: 180 MECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFIFEYIGEVINE 239
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 240 PQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 299
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 300 RMGIFAERYIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 354
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 355 SHATIEALGI 364
>sp|Q5ASA5|SET2_EMENI Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
CBS 112.46 / NRRL 194 / M139) GN=set2 PE=3 SV=1
Length = 980
Score = 140 bits (353), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 110/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A E + +FI EY+GEVI++
Sbjct: 218 IECMGDCGCGPDCQNQRFQRREYANVAVIKTEKKGYGLRAEEDLRPHQFIFEYVGEVINE 277
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 278 GPFHRRMRQYDAEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 337
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 338 RMGIFAERHIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 392
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 393 SNATIEALGI 402
>sp|Q4WTT2|SET2_ASPFU Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
CBS 101355 / FGSC A1100) GN=set2 PE=3 SV=1
Length = 966
Score = 140 bits (352), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 191 IECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFIFEYVGEVINE 250
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + + +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 251 AQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 310
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 311 RMGIFAERAIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTDRATKL 365
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 366 SNATIEALGI 375
>sp|Q2H988|SET2_CHAGB Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Chaetomium globosum (strain ATCC 6205 / CBS 148.51 /
DSM 1962 / NBRC 6347 / NRRL 1970) GN=SET2 PE=3 SV=1
Length = 894
Score = 139 bits (349), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 104/167 (62%), Gaps = 4/167 (2%)
Query: 254 ISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C S C+C E C N+ F++++ + ++KTE G+G+ + +F+ EY+GEVI+
Sbjct: 104 IECVSGDCNCGEGCENQRFQRKQYANVSVIKTEKKGFGLRTDADLQANDFVFEYVGEVIN 163
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ R G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 164 EPTFRNRTVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDK 223
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+AGE L ++Y ++G + + CYCG S+C G++G K
Sbjct: 224 LRMGIFATRAIRAGEELVFNYNVDRYGADPQPCYCGESNCVGFIGGK 270
>sp|Q6C5G5|SET2_YARLI Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Yarrowia lipolytica (strain CLIB 122 / E 150)
GN=set-2 PE=3 SV=1
Length = 768
Score = 137 bits (345), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 109/186 (58%), Gaps = 8/186 (4%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C + C N+ F+ +K + ++ TE G+G+ A + I GEF+ EY+GEVID+ ++R
Sbjct: 75 VRCCKGCQNKRFQGKKYASVDVISTEKKGFGLRATKDIAAGEFVYEYVGEVIDEPTFKER 134
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+GV++FY ++K IDAT KG RF NHSC PN +EKW V R+G+FA
Sbjct: 135 TAIYTTQGVKHFYFMMLQKGEFIDATAKGGLGRFCNHSCAPNGHVEKWVVGKRLRMGIFA 194
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSSTA 436
+R I+ GE +T+DY ++G E + CYCG +C G+LG GK + SK + TA
Sbjct: 195 SRHIQRGEEVTFDYNVDRYGAEAQACYCGEKNCVGFLG-----GKTQTESASKVSGTLTA 249
Query: 437 CLAIIT 442
L + +
Sbjct: 250 ALGLTS 255
>sp|Q9VYD1|C1716_DROME Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila
melanogaster GN=Set2 PE=1 SV=2
Length = 2313
Score = 136 bits (343), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 102/164 (62%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1339 IECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDS 1398
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1399 EEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1458
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY+++++G + +CYC A++C+G++G
Sbjct: 1459 RIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIG 1502
>sp|Q2UTN6|SET2_ASPOR Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
GN=set2 PE=3 SV=1
Length = 965
Score = 136 bits (342), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ ++ ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 202 IECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFIYEYVGEVINE 261
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 262 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGEKL 321
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 322 RMGIFAERDIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 376
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 377 SNATIEALGI 386
>sp|Q84WW6|ASHH1_ARATH Histone-lysine N-methyltransferase ASHH1 OS=Arabidopsis thaliana
GN=ASHH1 PE=2 SV=1
Length = 492
Score = 135 bits (341), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K++K E GWG+ A E I G+FI+EY GEVI ++R
Sbjct: 72 CPCGVYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKR 131
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ GV++ Y+ + IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 132 AQTYETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFA 191
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
SI L YDY F + G +V+C CGA +C G+LG K +
Sbjct: 192 KESISPRTELAYDYNFEWYGGAKVRCLCGAVACSGFLGAKSR 233
>sp|Q4IB50|SET2_GIBZE Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
9075 / NRRL 31084) GN=SET2 PE=3 SV=2
Length = 911
Score = 134 bits (338), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 106/167 (63%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS + +C+ C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI+
Sbjct: 142 MECSAEGGNCAGGCQNQRFQRKQYANVSVIKTEKKGFGLRADSDLQPNDFVFEYIGEVIN 201
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K +DAT KGN+ RF NHSC+PNC ++KW V +
Sbjct: 202 EPTFRRRMIQYDEEGIKHFYFMSLNKSEFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDK 261
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F +R I++GE L ++Y ++G + + CYCG +C G++G K
Sbjct: 262 LRMGIFTSRKIQSGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 308
>sp|Q4PBL3|SET2_USTMA Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Ustilago maydis (strain 521 / FGSC 9021) GN=SET2 PE=3
SV=1
Length = 972
Score = 134 bits (336), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 103/167 (61%), Gaps = 4/167 (2%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I CS + C + C N+ F + + + IV+TE G+G+ A + I K FI EY+GEV++
Sbjct: 222 IECSASKCRWGKQCRNQRFHRRQYVDVDIVQTEKKGFGLRACQDIPKETFIYEYVGEVMN 281
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
QR+ + G+++FY ++ + +DAT KG RF+NHSC+PNC + KWQV
Sbjct: 282 QTTFLQRMQQYRIEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCAVSKWQVGKH 341
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+ GE LT++Y ++G + + C+CG +C G LG K
Sbjct: 342 LRMGIFAKRNIQKGEELTFNYNVDRYGNDAQECFCGEPNCVGTLGGK 388
>sp|Q757Y8|SET2_ASHGO Histone-lysine N-methyltransferase, H3 lysine-36 specific OS=Ashbya
gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 /
NRRL Y-1056) GN=SET2 PE=3 SV=2
Length = 684
Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F+K++ I + +TE G+GV A I EFI EYIGEVI +A R+
Sbjct: 95 CGDDCQNQRFQKKEYADIAVFQTEKKGYGVRAERDIEANEFIYEYIGEVISEADFRDRMV 154
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D RG ++FY ++ IDAT +G +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 155 DYDMRGFKHFYFMMLQAGEFIDATERGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFAHR 214
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYC +C G+LG K
Sbjct: 215 KILKGEEITFDYNVDRYGATAQPCYCDEPNCIGFLGGK 252
>sp|Q18221|SET2_CAEEL Probable histone-lysine N-methyltransferase set-2 OS=Caenorhabditis
elegans GN=set-2 PE=2 SV=2
Length = 1507
Score = 130 bits (327), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 89/149 (59%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + E+R + RG
Sbjct: 1360 INQLKFRK-KMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEEREKAYERRG 1418
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I IDAT +GNF+RF+NHSC PNC + +EGE R+ +++ IK GE
Sbjct: 1419 IGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGE 1478
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA +C+GYL
Sbjct: 1479 EITYDYKFPIEDDKIDCLCGAKTCRGYLN 1507
>sp|P46995|SET2_YEAST Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=SET2 PE=1 SV=2
Length = 733
Score = 130 bits (326), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>sp|Q8MT36|MES4_DROME Probable histone-lysine N-methyltransferase Mes-4 OS=Drosophila
melanogaster GN=Mes-4 PE=1 SV=2
Length = 1427
Score = 130 bits (326), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 102/186 (54%), Gaps = 4/186 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR F RK ++++V G+G+ EPI G+F+IEY+GEVI+ A ++R
Sbjct: 1217 CKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRR 1276
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1277 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFA 1336
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA C G +G K K ++ K+ R +
Sbjct: 1337 IKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKDDAVKAHAKLKQMRRAK 1396
Query: 436 ACLAII 441
A I
Sbjct: 1397 ASAVRI 1402
>sp|Q6FX50|SET2_CANGA Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM
3761 / NBRC 0622 / NRRL Y-65) GN=SET2 PE=3 SV=1
Length = 716
Score = 127 bits (319), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 93/158 (58%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I + KT+ G+GV A I +FI EY GEVI++ RL
Sbjct: 123 CGNDCANQRFQKKEYANIAVFKTKMKGYGVRAESDIEINDFIYEYKGEVIEEEEFRDRLV 182
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D + ++FY ++ IDAT KG+ +RF NHSC+PN + KW V G+ R+G+FA R
Sbjct: 183 DYDQKKFRHFYFMMLQSGEFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLRMGIFAKR 242
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 243 KILKGEEITFDYNVDRYGAAAQKCYCEEPNCIGFLGGK 280
>sp|O14026|SET2_SCHPO Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set2 PE=1 SV=1
Length = 798
Score = 125 bits (314), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C +C N+ F++ + K+ + TE G+G+ A + K F+ EYIGEVI + +R+
Sbjct: 165 CGPSCQNQRFQRHEFAKVDVFLTEKKGFGLRADANLPKDTFVYEYIGEVIPEQKFRKRMR 224
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
G+++FY ++K IDAT +G+ +RF NHSC PNC ++KW V + R+G+F R
Sbjct: 225 QYDSEGIKHFYFMMLQKGEYIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKR 284
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE LT+DY ++G + + CYCG C GY+G K
Sbjct: 285 DIIRGEELTFDYNVDRYGAQAQPCYCGEPCCVGYIGGK 322
>sp|P0CO29|SET2_CRYNB Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Cryptococcus neoformans var. neoformans serotype D
(strain B-3501A) GN=SET2 PE=3 SV=1
Length = 834
Score = 125 bits (314), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 101/176 (57%), Gaps = 6/176 (3%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C + C + C+N+ F K + + +V TE G+G+ A+ I I EYIGEV+
Sbjct: 156 YIECIAGECRAGKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV 215
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 216 AEKTFRKRMQQYADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGR 275
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKL 422
R+G+F R + GE +T++Y ++G + + CYCG +C G +G T+ IG +
Sbjct: 276 RLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYCGEPNCVGTIGGKTQTDIGTM 331
>sp|P0CO28|SET2_CRYNJ Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Cryptococcus neoformans var. neoformans serotype D
(strain JEC21 / ATCC MYA-565) GN=SET2 PE=3 SV=1
Length = 834
Score = 125 bits (314), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 101/176 (57%), Gaps = 6/176 (3%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C + C + C+N+ F K + + +V TE G+G+ A+ I I EYIGEV+
Sbjct: 156 YIECIAGECRAGKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV 215
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 216 AEKTFRKRMQQYADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGR 275
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKL 422
R+G+F R + GE +T++Y ++G + + CYCG +C G +G T+ IG +
Sbjct: 276 RLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYCGEPNCVGTIGGKTQTDIGTM 331
>sp|Q99MY8|ASH1L_MOUSE Histone-lysine N-methyltransferase ASH1L OS=Mus musculus GN=Ash1l
PE=1 SV=3
Length = 2958
Score = 125 bits (313), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2109 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2168
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2169 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2227
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2228 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2277
>sp|Q9NR48|ASH1L_HUMAN Histone-lysine N-methyltransferase ASH1L OS=Homo sapiens GN=ASH1L
PE=1 SV=2
Length = 2969
Score = 125 bits (313), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2119 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2179 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2238 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2287
>sp|Q9Y7R4|SET1_SCHPO Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set1 PE=1 SV=1
Length = 920
Score = 122 bits (307), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 88/142 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++ + G+ A E I+K + +IEYIGE+I + + R + G+ + Y+
Sbjct: 779 RKKQLHFGPSRIHTLGLFAMENIDKNDMVIEYIGEIIRQRVADNREKNYVREGIGDSYLF 838
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I +D +DAT KGN +RF+NHSC PNCI +VEG+ ++ ++A R I GE LTYDY+
Sbjct: 839 RIDEDVIVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDYK 898
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F + ++ C CGA +C+GYL
Sbjct: 899 FPEEADKIPCLCGAPTCRGYLN 920
>sp|Q6CXP5|SET2_KLULA Histone-lysine N-methyltransferase, H3 lysine-36 specific
OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=SET2 PE=3
SV=1
Length = 702
Score = 122 bits (306), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/158 (40%), Positives = 89/158 (56%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K+ I + KTE G+GV A I FI EYIGEVI + R+
Sbjct: 92 CGTDCQNQRFQKKAYADISVFKTERKGFGVRANSDIEPHNFIYEYIGEVIQEEEFRNRMV 151
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
G ++FY ++ IDAT KG +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 152 KYDQMGFKHFYFMMLQTGQFIDATLKGCIARFCNHSCNPNAYVNKWVVNGKLKMGIFANR 211
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYC +C G+LG K
Sbjct: 212 HISKGEEVTFDYNVDRYGANAQPCYCEEPNCIGFLGGK 249
>sp|Q6CEK8|SET1_YARLI Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Yarrowia
lipolytica (strain CLIB 122 / E 150) GN=SET1 PE=3 SV=1
Length = 1170
Score = 118 bits (296), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 95/169 (56%), Gaps = 5/169 (2%)
Query: 251 VQYISCSKACHCSET--CNNRPFRKEKK-IKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V I+ K +ET N RK KK +K ++ WG+ A EPI E IIEY+GE
Sbjct: 1002 VADINMQKQLLSTETDVLNFNQLRKRKKPVKFARSAIHNWGLYAIEPIAANEMIIEYVGE 1061
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
V+ + + R G+ + Y+ + + +DAT +G +RF+NH C P+C + +V
Sbjct: 1062 VVRQEIADLREARYMRSGIGSSYLFRVDESTVVDATKRGGIARFINHCCTPSCTAKIIKV 1121
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRF-VQFGPE-VKCYCGASSCQGYLG 414
EG+ R+ ++A+R I A E LTYDY+F + G E + C CGA C+GYL
Sbjct: 1122 EGQKRIVIYASRDIAANEELTYDYKFEKEIGEERIPCLCGAPGCKGYLN 1170
>sp|Q6BKL7|SET1_DEBHA Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
JCM 1990 / NBRC 0083 / IGC 2968) GN=SET1 PE=3 SV=2
Length = 1088
Score = 117 bits (294), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 91/167 (54%), Gaps = 6/167 (3%)
Query: 254 ISCSKACHCSET--CNNRPFRKEKK-IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
IS K SET N K KK + ++ WG+ A EPI E IIEY+GE I
Sbjct: 922 ISAQKQMLGSETDILNLNALTKRKKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIR 981
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ E R G+ + Y+ I ++ +DAT KG +RF+NH C+P+C + +VEG+
Sbjct: 982 QQVAEHRERSYLKTGIGSSYLFRIDENTVVDATKKGGIARFINHCCNPSCTAKIIKVEGK 1041
Query: 371 TRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
R+ ++A R I+A E LTYDY+F ++C CGA C+GYL
Sbjct: 1042 KRIVIYALRDIEANEELTYDYKFEKETNDAERIRCLCGAPGCKGYLN 1088
>sp|Q9VW15|ASH1_DROME Histone-lysine N-methyltransferase ash1 OS=Drosophila melanogaster
GN=ash1 PE=1 SV=3
Length = 2226
Score = 117 bits (293), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C E C N+ ++ V+ T GWGV PI KG +I+EY+GEV
Sbjct: 1364 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1423
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1424 VTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1482
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1483 GLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGK 1532
>sp|Q54HS3|SET1_DICDI Histone-lysine N-methyltransferase set1 OS=Dictyostelium discoideum
GN=set1 PE=1 SV=1
Length = 1486
Score = 117 bits (292), Expect = 2e-25, Method: Composition-based stats.
Identities = 52/141 (36%), Positives = 84/141 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
+ K+IK +++ WG+ A E I+ + +IEYIGEVI + ++R +G+ + Y+
Sbjct: 1345 RRKRIKFERSDIHDWGLFAMETISAKDMVIEYIGEVIRQKVADEREKRYVKKGIGSSYLF 1404
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D IDATFKGN +RF+NH CDPNCI + + + ++ ++A R I GE +TYDY+
Sbjct: 1405 RVDDDTIIDATFKGNLARFINHCCDPNCIAKVLTIGNQKKIIIYAKRDINIGEEITYDYK 1464
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C C + C+ L
Sbjct: 1465 FPIEDVKIPCLCKSPKCRQTL 1485
>sp|Q66J90|SET1B_XENLA Histone-lysine N-methyltransferase SETD1B OS=Xenopus laevis GN=setd1b
PE=2 SV=1
Length = 1938
Score = 117 bits (292), Expect = 2e-25, Method: Composition-based stats.
Identities = 57/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1792 NQLKFRK-KKIRFCKSHIHDWGLFAMEPIVADEMVIEYVGQNIRQVIADMREKRYEDEGI 1850
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1851 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEE 1910
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1911 ITYDYKFPIEDVKIPCLCGAENCRGTL 1937
>sp|Q8X0S9|SET1_NEUCR Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
CBS 708.71 / DSM 1257 / FGSC 987) GN=set-1 PE=3 SV=1
Length = 1313
Score = 116 bits (290), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1168 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYL 1227
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1228 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1287
Query: 392 RF---VQFGPEVKCYCGASSCQGYL 413
+F + + C CG ++C+G+L
Sbjct: 1288 KFEREIGSTDRIPCLCGTAACKGFL 1312
>sp|Q1LY77|SE1BA_DANRE Histone-lysine N-methyltransferase SETD1B-A OS=Danio rerio GN=setd1ba
PE=1 SV=2
Length = 1844
Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats.
Identities = 56/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ ++ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1698 NQLKFRK-KKIRFCRSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGI 1756
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1757 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEE 1816
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1817 ITYDYKFPIEDEKIPCLCGAENCRGTL 1843
>sp|Q08D57|SET1B_XENTR Histone-lysine N-methyltransferase SETD1B OS=Xenopus tropicalis
GN=setd1b PE=2 SV=1
Length = 1956
Score = 115 bits (289), Expect = 6e-25, Method: Composition-based stats.
Identities = 56/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KK++ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1810 NQLKFRK-KKLRFCKSHIHDWGLFAMEPIIADEMVIEYVGQNIRQVIADMREKRYEDEGI 1868
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1869 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEE 1928
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1929 ITYDYKFPIEDVKIPCLCGAENCRGTL 1955
>sp|Q5ABG1|SET1_CANAL Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
albicans (strain SC5314 / ATCC MYA-2876) GN=SET1 PE=3
SV=1
Length = 1040
Score = 115 bits (288), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 895 KRKKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGSSYL 954
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NH C P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 955 FRIDDNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1014
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1015 KFERETNDEERIRCLCGAPGCKGYLN 1040
>sp|Q6FKB1|SET1_CANGA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC
0622 / NRRL Y-65) GN=SET1 PE=3 SV=1
Length = 1111
Score = 112 bits (280), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 6/169 (3%)
Query: 252 QYISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Q I KA +E+ N+ +++K + ++ WG+ A EPIN E +IEY+GE
Sbjct: 943 QNIEAQKAATGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPINAKEMVIEYVGER 1002
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I + E R G+ + Y+ I + IDAT KG +RF+NH C+P+C + +V
Sbjct: 1003 IRQPVAEMRERRYIKNGIGSSYLFRIDEHTVIDATKKGGIARFINHCCEPSCTAKIIKVG 1062
Query: 369 GETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
G+ R+ ++A R I A E LTYDY+F + C CGA SC+G+L
Sbjct: 1063 GKRRIVIYALRDIAANEELTYDYKFERETDAEERLPCLCGAPSCKGFLN 1111
>sp|Q4PB36|SET1_USTMA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ustilago
maydis (strain 521 / FGSC 9021) GN=SET1 PE=3 SV=1
Length = 1468
Score = 112 bits (279), Expect = 7e-24, Method: Composition-based stats.
Identities = 52/144 (36%), Positives = 85/144 (59%), Gaps = 3/144 (2%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K K+ WG+ A E I G+ +IEY+GEV+ + ++R + +G + Y+
Sbjct: 1325 RKKQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLF 1384
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D +DAT KGN +R +NH C PNC + + GE R+ +FA +I+AGE LTYDY+
Sbjct: 1385 RVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYK 1444
Query: 393 FVQFGPE---VKCYCGASSCQGYL 413
F + + C CG+ C+ +L
Sbjct: 1445 FQSSADDEDAIPCLCGSPGCRRFL 1468
>sp|Q4I5R3|SET1_GIBZE Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
9075 / NRRL 31084) GN=SET1 PE=3 SV=2
Length = 1263
Score = 111 bits (278), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 83/146 (56%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E I K + IIEY+GE + + E R G+ + Y+
Sbjct: 1118 KRKKPVKFARSAIHNWGLYAMENIAKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYL 1177
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1178 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDY 1237
Query: 392 RF---VQFGPEVKCYCGASSCQGYLG 414
+F + + C CG ++C+G+L
Sbjct: 1238 KFEREIGSTDRIPCLCGTAACKGFLN 1263
>sp|Q2GWF3|SET1_CHAGB Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Chaetomium globosum (strain ATCC 6205 / CBS 148.51 /
DSM 1962 / NBRC 6347 / NRRL 1970) GN=SET1 PE=3 SV=1
Length = 1076
Score = 110 bits (276), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E I K + IIEY+GE + + E R G+ + Y+
Sbjct: 931 KRKKPVKFARSAIHNWGLYAMENIPKDDMIIEYVGEEVRQQIAELRENRYLKSGIGSSYL 990
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 991 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1050
Query: 392 RFV-QFGP--EVKCYCGASSCQGYL 413
+F + G + C CG ++C+G+L
Sbjct: 1051 KFERELGSTDRIPCLCGTAACKGFL 1075
>sp|Q6CIT4|SET1_KLULA Histone-lysine N-methyltransferase, H3 lysine-4 specific
OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=SET1 PE=3
SV=1
Length = 1000
Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 85/150 (56%), Gaps = 3/150 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K + ++ WG+ A EPI E IIEY+GE I + E R G+
Sbjct: 851 NQLTKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQPVAEMREKRYIKSGIG 910
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I ++ IDAT +G +RF+NH C+P+C + +V+G R+ ++A R I E L
Sbjct: 911 SSYLFRIDENTVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEEL 970
Query: 388 TYDYRF---VQFGPEVKCYCGASSCQGYLG 414
TYDY+F G + C CGA SC+G+L
Sbjct: 971 TYDYKFERETDEGERLPCLCGAPSCKGFLN 1000
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.137 0.456
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 175,477,153
Number of Sequences: 539616
Number of extensions: 7666765
Number of successful extensions: 14558
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 169
Number of HSP's successfully gapped in prelim test: 51
Number of HSP's that attempted gapping in prelim test: 13981
Number of HSP's gapped (non-prelim): 323
length of query: 443
length of database: 191,569,459
effective HSP length: 121
effective length of query: 322
effective length of database: 126,275,923
effective search space: 40660847206
effective search space used: 40660847206
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 63 (28.9 bits)